This is my reinforcement learning code. I went through four different algorithms to solve an example of navigating a maze.

The first three examples show an agent navigating a grid maze. It includes Value Iteration, Policy Iteration, and Q-Learning Images are provided to show the results.

The last example shows an agent moving towards a target. This includes Deep Q Learning. This is stored in the dqn folder.

I hope you find this code useful in learning how these algorithms work. I made many comments to demonstrate the different steps of the algorithm.

Value Iteration Result

Policy Iteration Result

Q Learning Values

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

readme.md

readme.md

Files

readme.md

Latest commit

History

readme.md

File metadata and controls