首页 /研究 /Coevolution of a Backgammon Player
LEARNING

Coevolution of a Backgammon Player

Jordan Pollack, Alan Blair

发表年份
1996
引用次数
91

摘要

One of the persistent themes in Artificial Life research is the use of co-evolutionary arms races in the development of specific and complex behaviors. However, other than Sims's work on artificial robots, most of the work has attacked very simple games of prisoners dilemma or predator and prey. Following Tesauro's work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of the dice, application of the network to all legal moves, and choosing the move with the highest evaluation. However, no back-propagation, reinforcement or temporal difference learning methods were employed. Instead we apply simple hillclimbing in a relative fitness environment. We start with an initial champion of all zero weights and proceed simply by playing the current champion network against a slightly mutated challenger, changing weights when the challenger wins. Our results show co-evolution to be a powerful machin...

关键词

ChampionComputer scienceArtificial intelligenceSophisticationArtificial neural networkSuperrationalitySimple (philosophy)Reinforcement learningTask (project management)Dilemma

相关论文

查看 LEARNING 分类全部论文