Coevolution of a Backgammon Player

Jordan Pollack, Alan Blair

Year of publication: 1996
Citations: 91

Abstract

One of the persistent themes in Artificial Life research is the use of co-evolutionary arms races in the development of specific and complex behaviors. However, other than Sims's work on artificial robots, most of the work has attacked very simple games such as the Prisoner's Dilemma or predator and prey. Following Tesauro's work on TD-Gammon, we used a 4000-parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by rolling the dice, applying the network to all legal moves, and choosing the move with the highest evaluation. However, no back-propagation, reinforcement, or temporal difference learning methods were employed. Instead, we apply simple hillclimbing in a relative fitness environment. We start with an initial champion of all-zero weights and proceed simply by playing the current champion network against a slightly mutated challenger, changing weights when the challenger wins. Our results show co-evolution to be a powerful machin...
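The champion-versus-challenger loop described in the abstract can be sketched roughly as follows. This is only an illustrative sketch under stated assumptions, not the authors' implementation: the game is a toy stand-in (not backgammon), the evaluator is a linear function rather than a 4000-parameter feed-forward network, the mutation is Gaussian noise with an arbitrary `sigma`, and "changing weights" is interpreted as replacing the champion with the winning challenger. All function names and parameters here are hypothetical.

```python
import random


def evaluate(weights, features):
    """Linear stand-in for the network: dot product of weights and features."""
    return sum(w * f for w, f in zip(weights, features))


def choose_move(weights, moves):
    """Score every legal move and pick the one with the highest evaluation."""
    return max(moves, key=lambda m: evaluate(weights, m))


def play_game(champion, challenger, rng, turns=20, n_moves=4):
    """Toy game (ASSUMPTION, not backgammon).

    Each turn a random draw (the "dice roll") produces candidate moves,
    each a feature vector whose hidden payoff is its first feature.
    Returns True if the challenger collects strictly more payoff.
    """
    score = {"champion": 0.0, "challenger": 0.0}
    players = (("champion", champion), ("challenger", challenger))
    for _ in range(turns):
        for name, weights in players:
            moves = [
                [rng.uniform(-1.0, 1.0) for _ in range(len(weights))]
                for _ in range(n_moves)
            ]
            score[name] += choose_move(weights, moves)[0]
    return score["challenger"] > score["champion"]


def mutate(weights, rng, sigma=0.05):
    """Return a slightly perturbed copy (Gaussian noise is an assumption)."""
    return [w + rng.gauss(0.0, sigma) for w in weights]


def hillclimb(n_weights, generations, rng):
    """Hillclimbing in a relative fitness environment.

    Start from an all-zero champion; each generation, play it against a
    mutated challenger and adopt the challenger's weights if it wins.
    """
    champion = [0.0] * n_weights
    wins = 0
    for _ in range(generations):
        challenger = mutate(champion, rng)
        if play_game(champion, challenger, rng):
            champion = challenger  # assumed update rule: full replacement
            wins += 1
    return champion, wins


if __name__ == "__main__":
    final_weights, challenger_wins = hillclimb(8, 200, random.Random(0))
    print("challenger wins:", challenger_wins)
    print("final weights:", [round(w, 3) for w in final_weights])
```

Because fitness is measured only relative to the current champion, a single noisy game decides each update; a real experiment would likely need a more careful acceptance rule, which the truncated abstract does not specify.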

Keywords

Champion, Computer science, Artificial intelligence, Sophistication, Artificial neural network, Superrationality, Simple (philosophy), Reinforcement learning, Task (project management), Dilemma
