Multi-Criteria Comparison of Coevolution and Temporal Difference Learning on Othello

Wojciech Jaśkowski; Marcin Szubert; Paweł Liskowski

doi:10.1007/978-3-662-45523-4_25

System Informacji Naukowej Politechniki Poznańskiej

PL EN

Strona główna / Publikacje / Multi-Criteria Comparison of Coevolution and Temporal Difference Learning on Othello

Zgłoś uwagę

Rozdział

Pobierz BibTeX

Tytuł

Multi-Criteria Comparison of Coevolution and Temporal Difference Learning on Othello

Autorzy

Wojciech Jaśkowski (WI) ^{[ 1 ][ P ]}
Marcin Szubert (WI) ^{[ 1 ][ P ]}
Paweł Liskowski (WI) ^{[ 1 ][ P ]}

^{[ 1 ]} Instytut Informatyki, Wydział Informatyki, Politechnika Poznańska | ^{[ P ]} pracownik

Rok publikacji

2014

Typ rozdziału

referat

Język publikacji

angielski

Słowa kluczowe

EN

reinforcement learning
coevolutionary algorithm
Reversi
Othello
board evaluation function
weighted piece counter
interactive domain

Streszczenie

EN We compare Temporal Difference Learning (TDL) with Coevolutionary Learning (CEL) on Othello. Apart from using three popular single-criteria performance measures: (i) generalization performance or expected utility, (ii) average results against a hand-crafted heuristic and (iii) result in a head to head match, we compare the algorithms using performance profiles. This multi-criteria performance measure characterizes player’s performance in the context of opponents of various strength. The multi-criteria analysis reveals that although the generalization performance of players produced by the two algorithms is similar, TDL is much better at playing against strong opponents, while CEL copes better against weak ones. We also find out that the TDL produces less diverse strategies than CEL. Our results confirms the usefulness of performance profiles as a tool for comparison of learning algorithms for games.

Strony (od-do)

301 - 312

DOI

10.1007/978-3-662-45523-4_25

URL

https://link.springer.com/chapter/10.1007/978-3-662-45523-4_25

Książka

Applications of Evolutionary Computation : 17th European Conference, EvoApplications 2014, Granada, Spain, April 23-25, 2014

Zaprezentowany na

17th European Conference on the Applications of Evolutionary Computation, EvoApplications 2014, 23-25.04.2014, Granada, Spain

Publikacja indeksowana w

WoS (15)

System tworzony przez Politechnikę Poznańską oraz Poznańskie Centrum Superkomputerowo-Sieciowe

Zaloguj się przez eKonto, aby dodać do SIN