Skip to content

Code for Pommerman experiments in the paper "Learning to Play Imperfect-Information Games by Imitating an Oracle Planner"

License

Notifications You must be signed in to change notification settings

rinuboney/l2p-pommerman

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Learning to Play Imperfect-Information Games by Imitating an Oracle Planner

This repository contains source code for Pommerman experiments in the paper titled "Learning to Play Imperfect-Information Games by Imitating an Oracle Planner" by Rinu Boney, Alexander Ilin, Juho Kannala and Jarno Seppänen.

Dependencies:

Oracle Planner with Full Observability

The planning results reported in the paper can be reproduced by running:

python plan.py

Optional arguments:

Parameter Default Description
--planner fdts 'fdts' or 'mcts' or 'mcs'
--mab ts 'ts' or 'ucb'
--n_simulations 100 number of planning rollouts at each time-step
--horizon 20 depth of planning rollouts in FDTS and MCS
--n_threads 5 number of games to play in parallel
--n_episodes 100 total number of games to play

Training Follower Policy with Partial Observability

The follower results reported in the paper can be reproduced by running:

python follow.py

License

This project is licensed under the terms of the GPL-3.0 License. See LICENSE file for details.

About

Code for Pommerman experiments in the paper "Learning to Play Imperfect-Information Games by Imitating an Oracle Planner"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages