Abstract:
This paper constructs a learning probabilistic automaton (PA) model that exhibits operant conditioning (OC) behavior and uses it to simulate the Skinner pigeon experiment. The PA model with OC captures a form of animal learning in which an agent adapts its actions to gain the maximum benefit from the environment while being rewarded only for correct performance. Learning is achieved through an action-selection probability that is updated using reward and punishment information from the environment; the agent then selects an action at random according to this probability. We apply the model to the Skinner pigeon experiment, specifically the button-pecking task. The pigeon learns this task in stages, and in simulation our model acquires the task in a similar manner.
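The mechanism described in the abstract (an action-selection probability adjusted by reward and punishment, followed by random selection according to that probability) can be illustrated with a minimal sketch. The abstract does not give the update rule, so the code below assumes a standard linear reward-penalty scheme from learning-automata theory; the action names, the step sizes `alpha` and `beta`, and the functions `update_probs` and `simulate` are all hypothetical and are not taken from the paper.

```python
import random


def update_probs(probs, action, rewarded, alpha=0.1, beta=0.05):
    """Assumed linear reward-penalty update (not the paper's stated rule).

    On reward, probability mass moves toward the chosen action.
    On punishment, mass moves away from it and is shared among the others.
    The probabilities always sum to 1.
    """
    n = len(probs)
    new_probs = []
    for i, p in enumerate(probs):
        if rewarded:
            if i == action:
                new_probs.append(p + alpha * (1.0 - p))
            else:
                new_probs.append((1.0 - alpha) * p)
        else:
            if i == action:
                new_probs.append((1.0 - beta) * p)
            else:
                new_probs.append(beta / (n - 1) + (1.0 - beta) * p)
    return new_probs


def simulate(trials=500, seed=0):
    """Simulate a two-action task where only 'peck_button' is rewarded."""
    rng = random.Random(seed)
    actions = ["peck_button", "other"]  # hypothetical action set
    probs = [0.5, 0.5]                  # no initial preference
    for _ in range(trials):
        # Select an action at random according to the current probabilities.
        chosen = rng.choices(range(len(actions)), weights=probs)[0]
        # Hypothetical environment: reward the button peck, punish anything else.
        rewarded = chosen == 0
        probs = update_probs(probs, chosen, rewarded)
    return dict(zip(actions, probs))


if __name__ == "__main__":
    print(simulate())
```

Under these assumptions, the probability of the rewarded action rises on every trial, so the simulated agent gradually comes to prefer pecking the button. The staged learning reported for the pigeon would need a richer state and action structure than this two-action sketch provides.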
Source:
PROCEEDINGS OF THE 2009 WRI GLOBAL CONGRESS ON INTELLIGENT SYSTEMS, VOL III
Year: 2009
Page: 578-581
Language: English
Cited Count:
WoS CC Cited Count: 2
SCOPUS Cited Count: 5
ESI Highly Cited Papers on the List: 0