Local-Options

Methods for learning local options, Q-learning given local options and experiments

agents.py contains agent classes that implement a policy defined by a dictionary with states as keys and arrays of action probabilities as entries.

algorithms.py contains implementations of Q-learning, SARSA, Td(0) that work with vectorized rewards (action selection is handled by scalarization in Q-learning). It also contains algorithms to learn option models, either by solving a local MDP for different rewards and solving the resulting system of equations, or by an approach based on learning the transition model.

environments.py contains a generic MDP class, that implements a transition and reward model. There are also functions to randomly generate an MDP and the robot MDP:

wrapper.py contains wrappers that use a base MDP and turn it into a local MDP on parts of the state space.

Test.py contains

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
images		images
.gitattributes		.gitattributes
.gitignore		.gitignore
Experiment1.py		Experiment1.py
Experiment2.py		Experiment2.py
Ma.pdf		Ma.pdf
README.md		README.md
SARSA_intro.png		SARSA_intro.png
agents.py		agents.py
algorithms.py		algorithms.py
environments.py		environments.py
functions.py		functions.py
misc.py		misc.py
wrapper.py		wrapper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Local-Options

About

Releases

Packages

Languages

flodorner/Local-Options

Folders and files

Latest commit

History

Repository files navigation

Local-Options

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages