Home /Research /<title>SAN-RL: combining spreading activation networks and reinforcement learning to learn configurable behaviors</title>
LEARNING

<title>SAN-RL: combining spreading activation networks and reinforcement learning to learn configurable behaviors</title>

Daniel Gaines, D.M. Wilkes, Kanok Kusumalnukool, Siripun Thongchai, K. Kawamura, John H. White

Year
2002
Citations
2

Abstract

Reinforcement learning techniques have been successful in allowing an agent to learn a policy for achieving tasks. The overall behavior of the agent can be controlled with an appropriate reward function. However, the policy that is learned will be fixed to this reward function. If the user wishes to change his or her preference about how the task is achieved the agent must be retrained with this new reward function. We address this challenge by combining Spreading Activation Networks and Reinforcement Learning in an approach we call SAN-RL. This approach provides the agent with a causal structure, the spreading activation network, relating goals to the actions that can achieve those goals. This enables the agent to select actions relative to the goal priorities. We combine this with reinforcement learning to enable the agent to learn a policy. Together, these approaches enable the learning of a configurable behaviors, a policy that can be adapted to meet the current preferences. We compare the approach with Q-learning on a robot navigation task. We demonstrate that SAN-RL exhibits goal-directed behavior before learning, exploits the causal structure of the network to focus its search during learning and results in configurable behaviors after learning.

Keywords

Reinforcement learningComputer scienceTask (project management)Function (biology)ExploitPreferenceArtificial intelligenceReinforcementPolicy learningFocus (optics)

Related papers

Browse all LEARNING papers