Persistent URL of this record https://hdl.handle.net/1887/3209232
Documents
-
- Download
- Title pages_Contents
- open access
-
- Download
- Chapter 2
- open access
- Full text at publishers site
-
- Download
- Chapter 4
- open access
- Full text at publishers site
-
- Download
- Chapter 5
- open access
- Full text at publishers site
-
- Download
- Chapter 7
- open access
- Full text at publishers site
-
- Download
- Appendices_Bibliography
- open access
-
- Download
- Summary in English
- open access
-
- Download
- Summary in Dutch
- open access
-
- Download
- Acknowledgements_Curriculum Vitae
- open access
-
- Download
- Propositions
- open access
In Collections
This item can be found in the following collections:
Searching by learning: Exploring artificial general intelligence on small board games by deep reinforcement learning
We study table based classic Q-learning on the General Game Playing (GGP) system, showing that classic Q-learning works on GGP, although convergence is slow, and it is computationally expensive to learn complex games.
This dissertation uses an AlphaZero-like self-play framework to explore AGI on small games. By tuning different hyper-parameters, the role, effects and contributions of searching and learning are studied. A further experiment shows that search techniques can contribute as experts to generate better training examples to speed up the start phase of training.
In order to extend the AlphaZero-likeself-play approach to single player complex games, the Morpion Solitaire game is implemented by combining...Show moreIn deep reinforcement learning, searching and learning techniques are two important components. They can be used independently and in combination to deal with different problems in AI. These results have inspired research into artificial general intelligence (AGI).
We study table based classic Q-learning on the General Game Playing (GGP) system, showing that classic Q-learning works on GGP, although convergence is slow, and it is computationally expensive to learn complex games.
This dissertation uses an AlphaZero-like self-play framework to explore AGI on small games. By tuning different hyper-parameters, the role, effects and contributions of searching and learning are studied. A further experiment shows that search techniques can contribute as experts to generate better training examples to speed up the start phase of training.
In order to extend the AlphaZero-likeself-play approach to single player complex games, the Morpion Solitaire game is implemented by combining Ranked Reward method. Our first AlphaZero-based approach is able to achieve a near human best record.
Overall, in this thesis, both searching and learning techniques are studied (by themselves and in combination) in GGP and AlphaZero-like self-play systems. We do so for the purpose of making steps towards artificial general intelligence, towards systems that exhibit intelligent behavior in more than one domain.
Show less
- All authors
- Wang, H.
- Supervisor
- Emmerich, M.; Plaat, A.
- Co-supervisor
- Preuss, M.
- Committee
- Bäck, T.; Bonsangue, M.; Batenburg, J.; Winands, M.; Baratchi, M.; Moerland, T.; Schwab, I.
- Qualification
- Doctor (dr.)
- Awarding Institution
- Leiden Institute of Advanced Computer Science (LIACS), Faculty of Science, Leiden University
- Date
- 2021-09-07
- ISBN (print)
- 9789464192537
Funding
- Sponsorship
- China Scholarship Council