![]() Different playing styles for different seeds This means you can get as many different games as you want, in fact at the bottom of the article there are a couple more with different opening moves. So the first 4 moves of the game where set by humans, only afterwards AlphaZero started to play by itself. The video you linked links to a article, with some more details: This is mostly to ensure game diversity, since many engines are (almost) deterministic, so you'd just end up with a single game. Something you seem to be missing is that typically when engine vs engine matches are played an opening book is used. I talk about this is a bit more in this answer and its comments. There is probably a bit of inherent randomness from multithreading, but no seed as such. MCTS as implemented by AlphaZero does not intentionally use any randomness during tournament play. ![]() They only trained AlphaZero once, and then let it play against itself. The last question should be answerable also by those of you who don't know the video and how the premises have been. In other words: Can AlphaZero develop significantly different playing styles when starting tabula rasa? In this case, Sadler/Regan's book would describe just one instance of AlphaZero. In the latter case, I wonder if AlphaZero's playing style (which is for example analyzed in the book Game Changer by Matthew Sadler and Natasha Regan) may depend on the seeds used for generating random games (assuming that the same number of test games is played in the learning phase). different seeds when learning (the seeds when playing don't matter then).same seeds when learning, different seeds when playing.same seeds when learning, same seeds when playing.In any case there are three possible games all of which may be called "AlphaZero vs. ![]() when generating random games? So were they perfect copies of each other at the beginning of the game?ĭo the two copies of AlphaZero calculate their moves with the same random seeds when doing Monte Carlo tree search? It leaves some questions open and I'd like to ask them here:ĭid the two copies of AlphaZero use the same random seeds in the learning phase, esp. There is a quite popular video analysing a chess game AlphaZero vs.
0 Comments
Leave a Reply. |
AuthorWrite something about yourself. No need to be fancy, just an overview. ArchivesCategories |