On scalability, generalization, and hybridization of coevolutionary learning : a case study for Othello

Marcin Szubert; Wojciech Jaśkowski; Krzysztof Krawiec

doi:10.1109/TCIAIG.2013.2258919

Scientific Information System of the Poznań University of Technology

PL EN

Main page / Publications / On scalability, generalization, and hybridization of coevolutionary learning : a case study for Othello

Submit a comment

Article

Download BibTeX

Title

On scalability, generalization, and hybridization of coevolutionary learning : a case study for Othello

Authors

Marcin Szubert (WI) ^{[ 1 ][ P ]}
Wojciech Jaśkowski (WI) ^{[ 1 ][ P ]}
Krzysztof Krawiec (WI) ^{[ 1 ][ P ]}

^{[ 1 ]} Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | ^{[ P ]} employee

Year of publication

2013

Published in

IEEE Transactions on Computational Intelligence and AI in Games

Journal year: 2013 | Journal volume: vol. 5 | Journal number: iss. 3

Article type

scientific article

Publication language

english

Keywords

EN

coevolution
n-tuple systems
Othello
temporal difference learning (TDL)

Abstract

EN This study investigates different methods of learning to play the game of Othello. The main questions posed concern scalability of algorithms with respect to the search space size and their capability to generalize and produce players that fare well against various opponents. The considered algorithms represent strategies as n-tuple networks, and employ self-play temporal difference learning (TDL), evolutionary learning (EL) and coevolutionary learning (CEL), and hybrids thereof. To assess the performance, three different measures are used: score against an a priori given opponent (a fixed heuristic strategy), against opponents trained by other methods (round-robin tournament), and against the top-ranked players from the online Othello League. We demonstrate that although evolutionary-based methods yield players that fare best against a fixed heuristic player, it is the coevolutionary temporal difference learning (CTDL), a hybrid of coevolution and TDL, that generalizes better and proves superior when confronted with a pool of previously unseen opponents. Moreover, CTDL scales well with the size of representation, attaining better results for larger n-tuple networks. By showing that a strategy learned in this way wins against the top entries from the Othello League, we conclude that it is one of the best 1-ply Othello players obtained to date without explicit use of human knowledge.

Pages (from - to)

214 - 226

DOI

10.1109/TCIAIG.2013.2258919

URL

https://ieeexplore.ieee.org/document/6504736

Ministry points / journal

30

Impact Factor

1,167

System created by Poznań University of Technology and Poznan Supercomputing and Networking Center

Log in through eKonto to add to SIS