gym-tic-tac-toe/README.md

# gym-tic-tac-toe

An example of a custom environment for https://github.com/openai/gym.

I want to try out self-play in a Reinforcement Learning context. Rather than the board game environments on openai/gym right now, which are "single-player" by providing a built-in opponent, I want to create an agent that learns a strategy by playing against itself, so it will try to maximize the reward for "player 1" and minimize it for "player 2".

The canonical example of a simple two player game is Tic Tac Toe, also known as Noughts and Crosses.
Update README.md 2017-04-03 23:55:40 +04:00			`# gym-tic-tac-toe`
Create README.md 2017-04-01 13:42:58 +04:00
Update README.md 2017-04-03 23:55:40 +04:00			`An example of a custom environment for https://github.com/openai/gym.`
Update README.md 2017-04-03 23:47:10 +04:00
Update README.md 2017-04-03 23:55:40 +04:00			`I want to try out self-play in a Reinforcement Learning context. Rather than the board game environments on openai/gym right now, which are "single-player" by providing a built-in opponent, I want to create an agent that learns a strategy by playing against itself, so it will try to maximize the reward for "player 1" and minimize it for "player 2".`
Update README.md 2017-04-03 23:47:10 +04:00
Update README.md 2017-04-03 23:55:40 +04:00			`The canonical example of a simple two player game is Tic Tac Toe, also known as Noughts and Crosses.`