GitHub

This is my reinforcement learning code. I went through four different algorithms to solve an example of navigating a maze.

The first three examples show an agent navigating a grid maze. It includes Value Iteration, Policy Iteration, and Q-Learning Images are provided to show the results.

The last example shows an agent moving towards a target. This includes Deep Q Learning. This is stored in the dqn folder.

I hope you find this code useful in learning how these algorithms work. I made many comments to demonstrate the different steps of the algorithm.

Value Iteration Result

Policy Iteration Result

Q Learning Values

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
dqn		dqn
maze.py		maze.py
policy_iter_ex.jpg		policy_iter_ex.jpg
q-maze.py		q-maze.py
q_learning_values.jpeg		q_learning_values.jpeg
readme.md		readme.md
value_iter_ex.jpg		value_iter_ex.jpg

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

orgulous/maze

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages