This is merely a rehash of pointers. (For rendering LaTeX equations, follow this)
See also:
- https://github.com/tttor/ml-foundation
- https://github.com/tttor/math-foundation
- https://github.com/tttor/robot-foundation
- https://github.com/aikorea/awesome-rl
- https://github.com/vmayoral/basic_reinforcement_learning
- http://www-anw.cs.umass.edu/rlr/
- https://deeplearning4j.org/deepreinforcementlearning
- https://github.com/dennybritz/reinforcement-learning
- https://github.com/brylevkirill/notes/blob/master/Reinforcement%20Learning.md
- https://github.com/mpatacchiola/dissecting-reinforcement-learning
- https://nanjiang.cs.illinois.edu/cs598project/
- https://masterscrat.github.io/rl-insights/
- Reinforcing behaviour in birds: https://twitter.com/kareem_carr/status/1180880588376596481
- https://github.com/Machine-Learning-Tokyo/Deep_Reinforcement_Learning
- https://github.com/andyljones/reinforcement-learning-discord-wiki