hsahovic/reinforcement-learning-pokemon-bot

This repo corresponds to a group project for an RL class at Ecole Polytechnique. It is deprecated.

The environment developed during this project gave rise to poke-env, an open-source environment for RL Pokemon bots, which is currently under development.


inf581-project

The goal of this project is to implement a Pokemon battling bot powered by reinforcement learning.

Installation

Ubuntu

Run

sh scripts/ubuntu-setup.sh

Mac OS

Run

sh scripts/macos-setup.sh

Windows

We recommend using the Windows Subsystem for Linux (WSL).

How to run

You need to have a Pokemon Showdown server running on localhost (run node pokemon-showdown in the Pokemon-Showdown folder). We recommend modifying it a bit so that things run more quickly: try running sh scripts/update-showdown.sh. This is done automatically during installation if you use our installation scripts.
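Before launching an agent, it can help to confirm that the local server is actually accepting connections. The snippet below is a minimal sketch and not part of this repository: the helper name showdown_is_reachable is hypothetical, and port 8000 is assumed to be the port your Pokemon Showdown server listens on, so adjust it if your configuration differs.

```python
import socket


def showdown_is_reachable(host: str = "localhost", port: int = 8000,
                          timeout: float = 1.0) -> bool:
    """Return True if a TCP connection to host:port can be opened.

    Hypothetical helper: the default port is an assumption about the
    local Pokemon Showdown configuration, not something this repo sets.
    """
    try:
        # create_connection raises OSError (e.g. connection refused, timeout)
        # when nothing is listening on the target address.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("Showdown server reachable:", showdown_is_reachable())
```

This only checks that something is listening on the port; it does not verify that the listener is a Pokemon Showdown server.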

To launch the current project, run python3 src/main.py. It will launch an agent using a pre-trained model. If you want to train your own agent, use python3 src/train_policy.py.

What is implemented

Base player classes

  • Base PlayerNetwork class. Responsible for managing player network interaction (e.g. sending messages to and receiving messages from the server), with as many utilities as deemed useful.
  • Base Player class. Responsible for common player mechanisms. In particular, it can issue and receive challenges.
  • Base ModelManager class. Responsible for managing an agent using a Keras neural network.
  • Base ModelManagerTF class. Responsible for managing an agent using a TensorFlow neural network.

Other base classes exist, such as _MLRandomBattlePlayer, but they are not supposed to be used directly.
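To illustrate the split between a network layer and a player layer described above, here is a minimal sketch. It is not the repository's code: the class names SketchPlayerNetwork and SketchPlayer, the inheritance relationship, the injected send callable and the gen7randombattle default are all assumptions made for illustration. The only external convention relied on is that Showdown client messages have the form ROOMID|TEXT, with /challenge and /accept as commands.

```python
from typing import Callable, List


class SketchPlayerNetwork:
    """Hypothetical network layer: it only knows how to send text."""

    def __init__(self, send: Callable[[str], None]) -> None:
        # The transport is injected (assumption), so the sketch runs
        # without a real websocket connection.
        self._send = send

    def send_message(self, message: str, room: str = "") -> None:
        # Client-to-server messages take the form "ROOMID|TEXT";
        # an empty room id targets the global context.
        self._send(f"{room}|{message}")


class SketchPlayer(SketchPlayerNetwork):
    """Hypothetical player layer: builds challenge commands on top."""

    def __init__(self, send: Callable[[str], None],
                 battle_format: str = "gen7randombattle") -> None:
        super().__init__(send)
        self.battle_format = battle_format

    def challenge(self, opponent: str) -> None:
        self.send_message(f"/challenge {opponent}, {self.battle_format}")

    def accept_challenge(self, opponent: str) -> None:
        self.send_message(f"/accept {opponent}")


# Usage: collect outgoing messages in a list instead of sending them.
sent: List[str] = []
player = SketchPlayer(sent.append)
player.challenge("some_opponent")
player.accept_challenge("some_opponent")
print(sent)
```

Injecting the transport keeps the player logic testable without a running server; the actual classes may be organised differently.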

Environment

  • Battle class. Stores information on a battle as it goes on.
  • Pokemon class. Stores information on Pokemon during the battle.
  • Move class. Stores information on moves.

This work is considered good enough for now. There are a lot of things left to do and extend, but the current focus of the project is on implementing a first working battling AI based on the current environment. In particular, please do not change the API or the dict returned by dic_state.
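As a rough illustration of how battle, Pokemon and move objects can be flattened into a plain dict for a model to consume, consider the sketch below. It does not reproduce the actual Battle, Pokemon or Move classes, nor the real keys returned by dic_state: every class name, field and the to_state_dict method here is hypothetical.

```python
from dataclasses import asdict, dataclass, field
from typing import Dict, List, Optional


@dataclass
class SketchMove:
    """Hypothetical move record."""
    name: str
    base_power: int = 0


@dataclass
class SketchPokemon:
    """Hypothetical Pokemon record; HP is stored as a percentage."""
    species: str
    current_hp: int = 100
    moves: List[SketchMove] = field(default_factory=list)


@dataclass
class SketchBattle:
    """Hypothetical battle record, updated as the battle goes on."""
    battle_tag: str
    turn: int = 0
    active: Optional[SketchPokemon] = None

    def new_turn(self) -> None:
        self.turn += 1

    def to_state_dict(self) -> Dict[str, object]:
        # Flatten nested dataclasses into plain dicts and lists so the
        # result can be serialised or turned into model features.
        return {
            "battle_tag": self.battle_tag,
            "turn": self.turn,
            "active": asdict(self.active) if self.active is not None else None,
        }


battle = SketchBattle(
    "battle-1",
    active=SketchPokemon("pikachu", moves=[SketchMove("thunderbolt", 90)]),
)
battle.new_turn()
print(battle.to_state_dict())
```

Keeping the dict layout stable matters because agents trained on one layout will generally not work with another, which is why the request above asks that the dic_state output not be changed.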

Players

  • RandomRandomBattlePlayer. A player playing random battles in a random fashion.
  • PolicyNetwork. An agent based on deep policy reinforcement learning in a pruned environment. It beats the random agent in approximately 90% of battles.
  • FullyConnectedRandomModel. An agent based on deep policy reinforcement learning in a full environment. It does not currently converge.
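The difference between a random player and a policy-based player can be sketched in a few lines. This is an illustration only, not the repository's agents: the function names are hypothetical, and the per-action scores stand in for whatever a trained network would output. The random player picks uniformly among the available actions, while the policy player samples from a softmax over the scores.

```python
import math
import random
from typing import List, Sequence


def choose_random_action(available: Sequence[str], rng: random.Random) -> str:
    """Uniformly random choice among the currently available actions."""
    return rng.choice(list(available))


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax: subtract the max before exponentiating."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sample_policy_action(available: Sequence[str], scores: Sequence[float],
                         rng: random.Random) -> str:
    """Sample an action with probability given by softmax(scores).

    `scores` holds one value per available action (hypothetical network
    output); higher scores make an action more likely to be sampled.
    """
    if len(available) != len(scores):
        raise ValueError("need exactly one score per available action")
    probabilities = softmax(scores)
    return rng.choices(list(available), weights=probabilities, k=1)[0]


rng = random.Random(0)
actions = ["move 1", "move 2", "switch 3"]
print(choose_random_action(actions, rng))
print(sample_policy_action(actions, [2.0, 0.5, -1.0], rng))
```

Sampling rather than always taking the highest-scoring action keeps some exploration during training, which policy-gradient methods typically rely on.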

Acknowledgements

We use Pokemon Showdown, and our code was partially inspired by the showdown-battle-bot project.