inf581-project

This repo corresponds to a group project in a RL class at Ecole Polytechnique. It is depreciated.

The environment developed during this project gave birth to poke-env, an Open Source environment for RL Pokemons bots, which is currently being developed.

inf581-project

The goal of this project is to implement a pokemon battling bot powered by reinforcement learning.

Installation

Ubuntu

Run

sh scripts/ubuntu-setup.sh

Mac OS

Run

sh scripts/macos-setup.sh

Windows

We recommend using a Windows Linux Subsystem.

How to run

You need to have a Pokemon Showdown server running on localhost (node pokemon-showdown in the Pokemon-Showdown folder). We recommend modifying it at bit to run things more quickly (try running sh scripts/update-showdown.sh ;) - this is actually automatically done during installation if you use our installation scripts).

To launch the current project, run python3 src/main.py. It will launch an agent using a pre-trained model. If you want to train your own agent, use python3 src/train_policy.py.

What is implemented

Base player classes

Base PlayerNetwork class. Responsible for managing player network interaction (eg. send and receive messages to the server) with as many utilities as deemed useful
Base Player class. Responsible for common player mecanisms. In particular, it can challenge and receive challenges.
Base ModelManager class. Responsible for managing an agent using a keras neural network.
Base ModelManagerTF class. Responsible for managing an agent using a tensorflow neural network.

Other base classes exist, such as _MLRandomBattlePlayer, but they are not supposed to be used directly.

Environment

Battle class. Stores information on a battle as it goes on.
Pokemon class. Stores information on pokemons during the battle.
Move class. Stores information on moves.

This work is considered as good enough ; there is a lot of things to be done and extended, but the current focus of the project is on implementing a first working battling AI based on the current environment. In particular, please do not change the API or the dict returned by dic_state.

Players

RandomRandomBattlePlayer. A player playing random battles in a random fashion.
PolicyNetwork. An agent based on deep policy reinforcement learning in a pruned environment. It beats the random agent approximately on 90% of the battles.
FullyConnectedRandomModel. An agent based on deep policy reinforcement learning in a full environment. It does not converge at the time.

Acknowledgements

We use Pokemon Showdown and our code was parially inspired by the showdown-battle-bot project.

Name		Name	Last commit message	Last commit date
Latest commit History 80 Commits
Pokemon-Showdown		Pokemon-Showdown
data		data
models/PolicyNetwork		models/PolicyNetwork
scripts		scripts
showdown-improvements		showdown-improvements
src		src
.gitignore		.gitignore
01_reward.png		01_reward.png
0k_reward.png		0k_reward.png
README.md		README.md
config.json		config.json
hp_reward.png		hp_reward.png
length_reward.png		length_reward.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

inf581-project

Installation

Ubuntu

Mac OS

Windows

How to run

What is implemented

Base player classes

Environment

Players

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

hsahovic/reinforcement-learning-pokemon-bot

Folders and files

Latest commit

History

Repository files navigation

inf581-project

Installation

Ubuntu

Mac OS

Windows

How to run

What is implemented

Base player classes

Environment

Players

Acknowledgements

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages