alphazero-quoridor / README.md
mishasamin's picture
Update README.md
8da849c
metadata
pipeline_tag: reinforcement-learning

AlphaZero, Quoridor Version!

Based on the framework provided here.

To start training, modify parameters in main.py and then start using

python main.py

Playing against it

quoridor

Once you're done training, you need to modify pit.py to create one NN player, pointing it to your best.pth.tar and a human player.

During the game, you have a choice of ten actions:

  • u (up)
  • d (down)
  • r (right)
  • l (left)
  • plus four diagonal move ur, ul, dr, dl

In order to place walls, you type h (for horizontal wall) or v (for vertical wall), press enter followed by x y of where you want the wall to be placed.