Online Sensor Selection in Reinforcement Learning Environment
Kôichirô Ishikawa, Akito Sakurai, Tsutomu Fujinami, Susumu Kunifuji
- Year
- 2005
- Citations
- 2
- Access
- Open access
Abstract
More sensors do not necessarily result in more appropriate state descriptions, so that a mobile robot has to select an appropriate set of sensors besides learning a state-action function in a reinforcement learning environment. We present a multi-armed bandit formulation of the problem and apply it to mobile robot navigation task. We modified the reinforcement comparison method to suit our problem and build a system where the selection of optimal set of sensors and the learning of state-action functions are done simultaneously. Our approach is evaluated on a Khepera robot simulator and the results reveal that our approach works well as an integrated learning system to identify the best set of sensors and reduce learning time.
Keywords
Related papers
Statistical Learning Theory
Yuhai Wu, Vladimir Vapnik
1999
Artificial intelligence: a modern approach
1995
Applied Nonlinear Control
Jean-Jacques Slotine, Weiping Li
1991
A new optimizer using particle swarm theory
R.C. Eberhart, James Kennedy
2002