LEARNING

Improving Learning for Embodied Agents in Dynamic Environments by State Factorisation

Daniel Jacob, Daniel Polani, Chrystopher L. Nehaniv

Publication year
2004
Citations
3
Access
Open access

Abstract

A new reinforcement learning algorithm designed specifically for robots and embodied systems is described. Conventional reinforcement learning methods intended for learning general tasks suffer from a number of disadvantages in this domain, including slow learning speed, an inability to generalise between states, reduced performance in dynamic environments, and a lack of scalability. Factor-Q, the new algorithm, uses factorised state and action, coupled with multiple structured rewards, to address these issues. Initial experimental results demonstrate that Factor-Q is able to learn as efficiently in dynamic as in static environments, unlike conventional methods. Further, in the specimen task, obstacle avoidance is improved by over two orders of magnitude compared with standard Q-learning.

Keywords

Reinforcement learning, Computer science, Embodied cognition, Scalability, Obstacle, Task (project management), Artificial intelligence, Factor (programming language), Obstacle avoidance, State (computer science)
