Home /Research /Coevolution of a Backgammon Player
LEARNING

Coevolution of a Backgammon Player

Jordan Pollack, Alan Blair

Year
1996
Citations
91

Abstract

One of the persistent themes in Artificial Life research is the use of co-evolutionary arms races in the development of specific and complex behaviors. However, other than Sims's work on artificial robots, most of the work has attacked very simple games of prisoners dilemma or predator and prey. Following Tesauro's work on TD-Gammon, we used a 4000 parameter feed-forward neural network to develop a competitive backgammon evaluation function. Play proceeds by a roll of the dice, application of the network to all legal moves, and choosing the move with the highest evaluation. However, no back-propagation, reinforcement or temporal difference learning methods were employed. Instead we apply simple hillclimbing in a relative fitness environment. We start with an initial champion of all zero weights and proceed simply by playing the current champion network against a slightly mutated challenger, changing weights when the challenger wins. Our results show co-evolution to be a powerful machin...

Keywords

ChampionComputer scienceArtificial intelligenceSophisticationArtificial neural networkSuperrationalitySimple (philosophy)Reinforcement learningTask (project management)Dilemma

Related papers

Browse all LEARNING papers