gym-tic-tac-toe/README.md

# gym-random-walk

A minimal example of a custom environment for https://github.com/openai/gym.

What the environment provides is not that important; this is meant to show how what you need to do to create your own environments for openai/gym.

For concreteness I used an example in the recordings of David Silver's lectures on Reinforcement Learning at UCL.

(0) - A - B - C - D - E - (+1)

You start off at one of the positions A to E, you can move right or left, reaching the "+1" terminal state gives you a reward of +1, and going all the way "to the left" will give you a terminal reward of 0.

Instead of calling them the above, I just made them the states 0, 1, ...6.
Create README.md 2017-04-01 13:42:58 +04:00			`# gym-random-walk`

Update README.md 2017-04-03 23:47:10 +04:00			`A minimal example of a custom environment for https://github.com/openai/gym.`

			`What the environment provides is not that important; this is meant to show how what you need to do to create your own environments for openai/gym.`

			`For concreteness I used an example in the recordings of David Silver's lectures on Reinforcement Learning at UCL.`
Create README.md 2017-04-01 13:42:58 +04:00
			`(0) - A - B - C - D - E - (+1)`

			`You start off at one of the positions A to E, you can move right or left, reaching the "+1" terminal state gives you a reward of +1, and going all the way "to the left" will give you a terminal reward of 0.`

			`Instead of calling them the above, I just made them the states 0, 1, ...6.`