收录:
摘要:
Learning is the main aim of robotics. In this paper we present a new stochastic learning automaton called a Skinner automaton as a psychological model for formalizing the theory of operant conditioning. We identify animal operant learning with a thermodynamic process, and derive a so-called Skinner algorithm from Monte Carlo method and Metropolis algorithm and simulated annealing. The Skinner automaton is implemented on a two-wheeled robot with a flexible lumbar in a simulation experiment and it learns to keep balance successfully.
关键词:
通讯作者信息:
来源 :
PROCEEDINGS OF THE 3RD INTERNATIONAL CONFERENCE ON ELECTRIC AND ELECTRONICS
年份: 2013
页码: 253-256
语种: 英文
归属院系: