From 6a31e4da02fac0b49823b94884a122eabb375aeb Mon Sep 17 00:00:00 2001
From: Nicolai Czempin
Date: Sat, 1 Apr 2017 11:42:58 +0200
Subject: [PATCH] Create README.md

---
 README.md | 9 +++++++++
 1 file changed, 9 insertions(+)
 create mode 100644 README.md

diff --git a/README.md b/README.md
new file mode 100644
index 0000000..86c5c23
--- /dev/null
+++ b/README.md
@@ -0,0 +1,9 @@
+# gym-random-walk
+
+A minimal example of a custom environment for https://github.com/openai/gym, based on an example from the recordings of David Silver's lectures on Reinforcement Learning at UCL.
+
+(0) - A - B - C - D - E - (+1)
+
+You start off at one of the positions A to E and can move left or right. Reaching the "+1" terminal state gives a reward of +1; going all the way to the left ends the episode with a terminal reward of 0.
+
+Instead of using the labels above, I simply numbered the seven states 0, 1, ..., 6.
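
Note on the README text above: the dynamics it describes can be sketched in a few lines of plain Python. This is only an illustration under stated assumptions, not the code in this repository. The class name `RandomWalk`, the constants `LEFT` and `RIGHT`, the action encoding (0 = left, 1 = right), and the left-to-right mapping of the diagram onto states 0 to 6 are all hypothetical. The sketch does not import gym; its `reset()` and `step()` methods only loosely mirror the shape of a gym environment.

```python
import random

N_STATES = 7          # states 0..6; 0 and 6 are assumed to be the terminal states
LEFT, RIGHT = 0, 1    # hypothetical action encoding, not taken from the repository


class RandomWalk:
    """Plain-Python sketch of the walk (0) - A - B - C - D - E - (+1)."""

    def __init__(self, seed=None):
        self.rng = random.Random(seed)
        self.state = None

    def reset(self):
        # Start at one of the non-terminal positions A..E, i.e. states 1..5.
        self.state = self.rng.randint(1, N_STATES - 2)
        return self.state

    def step(self, action):
        if self.state is None or self.state in (0, N_STATES - 1):
            raise RuntimeError("call reset() before step()")
        if action not in (LEFT, RIGHT):
            raise ValueError("action must be LEFT (0) or RIGHT (1)")

        # Move one position to the right or to the left.
        self.state += 1 if action == RIGHT else -1

        # The episode ends at either end of the chain.
        done = self.state in (0, N_STATES - 1)
        # Only the right-hand terminal state pays a reward of +1.
        reward = 1.0 if self.state == N_STATES - 1 else 0.0
        return self.state, reward, done, {}
```

To use it, call `reset()` once and then `step(LEFT)` or `step(RIGHT)` until `done` is true. Under these assumptions an episode's return is 1.0 if it ends in state 6 and 0.0 if it ends in state 0.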