首页 /研究 /Improving Learning for Embodied Agents in Dynamic--Environments by State Factorisation

LEARNING

Improving Learning for Embodied Agents in Dynamic--Environments by State Factorisation

Daniel Jacob, Daniel Polani, Chrystopher L. Nehaniv

发表年份: 2004
引用次数: 3
访问权限: 开放获取

摘要

A new reinforcement learning algorithm de-signed specifically for robots and embodied sys-tems is described. Conventional reinforcement learning methods intended for learning general tasks suffer from a number of disadvantages in this domain including slow learning speed, an in-ability to generalise between states, reduced per-formance in dynamic environments, and a lack of scalability. Factor-Q, the new algorithm, uses factorised state and action, coupled with mul-tiple structured rewards, to address these is-sues. Initial experimental results demonstrate that Factor-Q is able to learn as efficiently in dy-namic as in static environments, unlike conven-tional methods. Further, in the specimen task, obstacle avoidance is improved by over two or-ders of magnitude compared with standard Q-learning. 1.

关键词

Reinforcement learningComputer scienceEmbodied cognitionScalabilityObstacleTask (project management)Artificial intelligenceFactor (programming language)Obstacle avoidanceState (computer science)

Improving Learning for Embodied Agents in Dynamic--Environments by State Factorisation

摘要

关键词

相关论文

Statistical Learning Theory

Artificial intelligence: a modern approach

Applied Nonlinear Control

A new optimizer using particle swarm theory