Lections/gym-tic-tac-toe

Go to file

Nicolai Czempin 32e17f864e Play around with SBRT for a bit, I'm probably doing it wrong

2018-12-16 22:58:53 +01:00

gym_tic_tac_toe

Make the board prettier, provide some additional stats

2018-12-16 21:30:13 +01:00

.gitignore

Add IntelliJ files and directories to .gitignore

2018-12-16 17:29:02 +01:00

LICENSE

Initial commit

2017-03-31 00:36:55 +02:00

random_tic_tac_toe.py

Play around with SBRT for a bit, I'm probably doing it wrong

2018-12-16 22:58:53 +01:00

README.md

Update README.md

2017-04-03 21:55:40 +02:00

setup.py

Make the board prettier, provide some additional stats

2018-12-16 21:30:13 +01:00

README.md

gym-tic-tac-toe

An example of a custom environment for https://github.com/openai/gym.

I want to try out self-play in a Reinforcement Learning context. Rather than the board game environments on openai/gym right now, which are "single-player" by providing a built-in opponent, I want to create an agent that learns a strategy by playing against itself, so it will try to maximize the reward for "player 1" and minimize it for "player 2".

The canonical example of a simple two player game is Tic Tac Toe, also known as Noughts and Crosses.