mdp-value-iteration

A basic implementation of a Markov Decision Process (MDP) solved with the Value Iteration algorithm, with a graphical representation in Unity.

Basic usage

Open the Unity project and find the MDP gameObject in the hierarchy. The MDP script is attached to this gameObject and exposes the following values:

Penalty - reward/penalty for each step
Probability - probability of taking the chosen (best) action
Discount factor - gamma parameter in Bellman's equation
Rows/Columns - size of the environment table
Good/Bad Rewards - value of a preferable/non-preferable state
Epsilon - convergence threshold; the algorithm stops when the absolute value of the changes is less than this value
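
To show how these parameters fit together, here is a minimal Python sketch of value iteration on a grid. It is not the Unity script from this project; the function name, the argument names, and the slip model are assumptions made for illustration. In particular, it assumes that with probability `1 - probability` the agent slips to one of the two perpendicular directions, and that bumping into a wall or the grid edge leaves it in place.

```python
from typing import Dict, Set, Tuple

Cell = Tuple[int, int]

# Actions as (row delta, column delta), in clockwise order: up, right, down, left.
ACTIONS = [(-1, 0), (0, 1), (1, 0), (0, -1)]


def value_iteration(
    rows: int,
    cols: int,
    terminals: Dict[Cell, float],  # good/bad states mapped to their rewards
    walls: Set[Cell],
    penalty: float,                # reward/penalty for each step
    probability: float,            # chance that the intended move happens
    gamma: float,                  # discount factor
    epsilon: float,                # convergence threshold
) -> Dict[Cell, float]:
    # Every non-wall cell starts at 0; terminal cells hold their fixed reward.
    values = {(r, c): 0.0 for r in range(rows) for c in range(cols)
              if (r, c) not in walls}
    values.update(terminals)

    def move(cell: Cell, action: Cell) -> Cell:
        # Assumption: hitting a wall or the edge keeps the agent where it is.
        nxt = (cell[0] + action[0], cell[1] + action[1])
        return nxt if nxt in values else cell

    def expected_value(cell: Cell, i: int) -> float:
        # Assumption: the leftover probability is split between the two
        # directions perpendicular to the intended one.
        slip = (1.0 - probability) / 2.0
        outcomes = [(probability, ACTIONS[i]),
                    (slip, ACTIONS[(i - 1) % 4]),
                    (slip, ACTIONS[(i + 1) % 4])]
        return sum(p * values[move(cell, a)] for p, a in outcomes)

    while True:
        delta = 0.0
        new_values = dict(values)
        for cell in values:
            if cell in terminals:
                continue
            # Bellman update: step reward plus discounted best expected value.
            best = max(expected_value(cell, i) for i in range(4))
            new_values[cell] = penalty + gamma * best
            delta = max(delta, abs(new_values[cell] - values[cell]))
        values = new_values
        # Stop once the largest change in this sweep is below epsilon.
        if delta < epsilon:
            return values


if __name__ == "__main__":
    # Hypothetical 3x4 grid with one good state, one bad state, and one wall.
    result = value_iteration(
        rows=3, cols=4,
        terminals={(0, 3): 1.0, (1, 3): -1.0},
        walls={(1, 1)},
        penalty=-0.04, probability=0.8, gamma=0.9, epsilon=1e-4,
    )
    for r in range(3):
        print([round(result.get((r, c), 0.0), 2) for c in range(4)])
```

The sketch updates all cells from the previous sweep's values (a synchronous update); updating in place would also converge, usually in fewer sweeps.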

Use the left mouse button to create preferable/non-preferable states by clicking on cells. Use the right mouse button to create walls. Press the spacebar to start the algorithm. After the mouse wheel is pressed, arrows showing the directions (policy) are displayed.
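
The policy arrows can be thought of as a greedy read-out of the computed values. The Python sketch below is a simplified illustration, not the project's code: `greedy_arrows` is a hypothetical name, and it simply points each cell at the neighbouring cell with the highest value, ignoring the move probability. Cells missing from `values` (walls or positions outside the grid) are treated as "stay in place".

```python
from typing import Dict, Set, Tuple

Cell = Tuple[int, int]

# Each action as (row delta, column delta) mapped to the arrow drawn for it.
ARROWS = {(-1, 0): "↑", (0, 1): "→", (1, 0): "↓", (0, -1): "←"}


def greedy_arrows(values: Dict[Cell, float], terminals: Set[Cell]) -> Dict[Cell, str]:
    policy = {}
    for cell in values:
        if cell in terminals:
            continue  # no arrow is needed on a good/bad state

        def neighbour_value(action: Cell) -> float:
            nxt = (cell[0] + action[0], cell[1] + action[1])
            # A blocked move is scored with the current cell's own value.
            return values.get(nxt, values[cell])

        # Pick the direction whose neighbouring cell has the highest value.
        policy[cell] = ARROWS[max(ARROWS, key=neighbour_value)]
    return policy
```

When moves are stochastic, a more faithful version would compare the expected value of each action rather than the raw neighbour values.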
