A simple two-player environment for openai/gym
gym_tic_tac_toe | ||
.gitignore | ||
LICENSE | ||
random_tic_tac_toe.py | ||
README.md | ||
setup.py |
gym-tic-tac-toe
An example of a custom environment for https://github.com/openai/gym.
I want to try out self-play in a Reinforcement Learning context. Rather than the board game environments on openai/gym right now, which are "single-player" by providing a built-in opponent, I want to create an agent that learns a strategy by playing against itself, so it will try to maximize the reward for "player 1" and minimize it for "player 2".
The canonical example of a simple two player game is Tic Tac Toe, also known as Noughts and Crosses.