BASALT Competition

This approach won second place in the 2021 NeurIPS MineRL BASALT competition:

The MineRL Benchmark for Agents that Solve Almost-Lifelike Tasks (MineRL BASALT) competition aims to promote research in the area of learning from human feedback, in order to enable agents that can pursue tasks that do not have crisp, easily defined reward functions.

This developed in collaboration with Divyansh Garg. Our general approach was to use IQ-Learn for online imitation learning. It is implemented using PyTorch.

Citation

If you use this repo in your research, please consider citing the IQ-Learn paper as follows:

@article{
    title={IQ-Learn: Inverse soft-Q Learning for Imitation},
    author={Divyansh Garg, Shuvam Chakraborty, Chris Cundy, Jiaming Song, Stefano Ermon},
    year={2021},
    eprint={2106.12142},
    archivePrefix={arXiv},
    primaryClass={cs.LG}
}

Setup

Options

Locally: Package dependencies can be found in environment.yml, which can be set up with conda.
Docker: you can generate a Docker image with utility/docker_build.sh
Google Colab: You can run training via the utility/colab.ipynb file. It expects that you have this repository in a google drive folder

Other Requirements

MineRL: To set up MineRL, follow the setup instructions here.
WandB: We use Weights & Biases to track training metrics. You'll need to set up an account and log in when running training.

Datasets

Follow the instructions here to download and set up the BASALT competition datasets.

Training

To train locally, run python train_submission_code.py. Helpful flags include --virtual-display-false, --debug-env, and --wandb-false.
Explore the config files in conf/ to see the parameters available for modification and their default values. These can be overridden with additional arguments, e.g. env=waterfall method.training_steps=100000

Evaluation

Use generate_trajectory.py to download a model from wandb and generate trajectories.

Testing

Tests are implemented with pytest.

License

Please see the LICENSE for the licensing terms for this code.

Name		Name	Last commit message	Last commit date
Latest commit History 255 Commits
agents		agents
algorithms		algorithms
conf		conf
contexts/minerl		contexts/minerl
core		core
modules		modules
networks		networks
scripts		scripts
sweeps		sweeps
tests		tests
utility		utility
.gitignore		.gitignore
LICENSE.pdf		LICENSE.pdf
README.md		README.md
aicrowd.json		aicrowd.json
aicrowd_helper.py		aicrowd_helper.py
apt.txt		apt.txt
environment.yml		environment.yml
generate_trajectory.py		generate_trajectory.py
run.py		run.py
test_framework.py		test_framework.py
test_submission_code.py		test_submission_code.py
train_submission_code.py		train_submission_code.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BASALT Competition

Citation

Contents

Environments

Algorithms

Other Elements

Setup

Options

Other Requirements

Datasets

Training

Evaluation

Testing

License

About

Releases

Packages

Contributors 2

Languages

edmundmills/basalt-competition

Folders and files

Latest commit

History

Repository files navigation

BASALT Competition

Citation

Contents

Environments

Algorithms

Other Elements

Setup

Options

Other Requirements

Datasets

Training

Evaluation

Testing

License

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages