MEow in SKRL Implementation


This folder contains the implementation of MEow based on SKRL. SKRL is a library compatible with the NVIDIA Omniverse Isaac Gym environments. NVIDIA Omniverse Isaac Gym enables the simultaneous training of agents on several robotics tasks simulated based on real-world applications. More details about SKRL and NVIDIA Omniverse Isaac Gym are provided in their respective documentation (i.e., SKRL and Isaac Sim).



Install Dependencies

Prerequisites

  • (1) Ensure you have NVIDIA RTX GPUs; the following installation commands assume this hardware setup.

  • (2) Clone both meow and OmniIsaacGymEnvs, and arrange them as follows:

meow/
  ├── ...
  ├── skrl/
  └── OmniIsaacGymEnvs/

NOTE: The following installation instructions are based on the release/2023.1.1 version of OmniIsaacGymEnvs. Revert to this version using git reset --hard release/2023.1.1.

  • (3) Download the docker image using the following commands:
# login to NGC
docker login nvcr.io

Enter $oauthtoken as the username and your NGC API key as the password. For more details on the login procedure, please refer to Steps 4 and 5 of NVIDIA's documentation.

Username: $oauthtoken
Password: ${api_key}

After logging in, pull the Docker image using the following command.

docker pull nvcr.io/nvidia/isaac-sim:2023.1.1

NOTE: The above steps (i.e., (1)~(3)) only need to be performed the first time.

Docker and Installation

  • (1) Launch a Docker container with the following command, executed from ./meow:
docker run --name isaac-sim --entrypoint bash -it --gpus all -e "ACCEPT_EULA=Y" --rm --network=host \
    -e "PRIVACY_CONSENT=Y" \
    -v ~/docker/isaac-sim/cache/kit:/isaac-sim/kit/cache:rw \
    -v ~/docker/isaac-sim/cache/ov:/root/.cache/ov:rw \
    -v ~/docker/isaac-sim/cache/pip:/root/.cache/pip:rw \
    -v ~/docker/isaac-sim/cache/glcache:/root/.cache/nvidia/GLCache:rw \
    -v ~/docker/isaac-sim/cache/computecache:/root/.nv/ComputeCache:rw \
    -v ~/docker/isaac-sim/logs:/root/.nvidia-omniverse/logs:rw \
    -v ~/docker/isaac-sim/data:/root/.local/share/ov/data:rw \
    -v ~/docker/isaac-sim/documents:/root/Documents:rw \
    -v $(pwd):/workspace \
    -v /tmp/.X11-unix:/tmp/.X11-unix \
    -e DISPLAY=$DISPLAY \
    nvcr.io/nvidia/isaac-sim:2023.1.1
  • (2) Install the dependencies using the following commands:
# re-path the python command
alias PYTHON_PATH=/isaac-sim/python.sh
# install the default dependencies of OmniIsaacGymEnvs
cd /workspace/OmniIsaacGymEnvs/
PYTHON_PATH -m pip install -e .

# install the default dependencies of meow/skrl
cd /workspace/skrl
PYTHON_PATH -m pip install --upgrade pip
PYTHON_PATH -m pip install -e .["torch"]
apt-get update && apt-get install -y libglib2.0-0 libsm6 libxrender1 libxext6

# install the ray tune package for parallelizable training / tuning
PYTHON_PATH -m pip install ray[tune]
  • (3) Replace the vec_env_base.py file in isaac-sim:
cp /workspace/skrl/thirdparty/vec_env_base.py /isaac-sim/exts/omni.isaac.gym/omni/isaac/gym/vec_env/vec_env_base.py

NOTE: Ensure that the /workspace/skrl/runs directory does not exist during installation. If necessary, remove it using rm -rf /workspace/skrl/runs.


Commands for Training

Use the following commands to reproduce the results of SAC and MEow.

SAC

  • Train SAC on the AllegroHand environment using the default configuration with the following command:
PYTHON_PATH trainer_sac.py

NOTE: You may adjust the training configuration (e.g., num_envs, memory_size, and directory) in the main() function in trainer_sac.py.
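The adjustable settings can be gathered in a single configuration mapping that main() consumes. The sketch below is purely illustrative, assuming a plain dict with the key names mentioned above; check trainer_sac.py for the actual structure and key names.

```python
# Hypothetical configuration override, assuming main() reads its settings
# from a dict; the real structure lives in trainer_sac.py.
DEFAULT_CFG = {
    "num_envs": 64,           # number of parallel simulated environments
    "memory_size": 10000,     # replay-buffer capacity
    "directory": "runs/sac",  # output directory for logs and checkpoints
}

def make_cfg(**overrides):
    """Return a copy of the default configuration with selected overrides."""
    cfg = dict(DEFAULT_CFG)
    unknown = set(overrides) - set(cfg)
    if unknown:
        raise KeyError(f"unknown configuration keys: {sorted(unknown)}")
    cfg.update(overrides)
    return cfg

cfg = make_cfg(num_envs=512, directory="runs/sac_allegro")
print(cfg["num_envs"], cfg["directory"])  # → 512 runs/sac_allegro
```

Rejecting unknown keys up front avoids silently ignoring a misspelled setting.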

NOTE: To terminate the job, please use kill -9 ${PID}.
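The PID for kill -9 can be looked up at runtime. The snippet below demonstrates the pattern with a placeholder background job (sleep stands in for the training script):

```shell
# Start a placeholder long-running job in the background, record its PID,
# and force-terminate it -- the same pattern used for the training scripts.
sleep 300 &
PID=$!
echo "running as PID ${PID}"
kill -9 "${PID}"
wait "${PID}" 2>/dev/null || true   # reap the terminated job
echo "terminated"
# In practice, find the training job's PID with: pgrep -f trainer_sac.py
```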

  • Parallelizable training using Ray Tune:
# AllegroHand
PYTHON_PATH tuner_sac_allegro.py
# Ant
PYTHON_PATH tuner_sac_ant.py
# ANYmal
PYTHON_PATH tuner_sac_anymal.py
# FrankaCabinet
PYTHON_PATH tuner_sac_franka.py
# Humanoid
PYTHON_PATH tuner_sac_humanoid.py
# Ingenuity
PYTHON_PATH tuner_sac_ingenuity.py

NOTE: If you encounter the error message Exception: Failed to create simulation view backend, please clean up the cache by running sh clean_cache.sh.
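The tuner scripts parallelize training over a hyperparameter search space with Ray Tune. As a rough, dependency-free illustration of what a grid search enumerates (the real scripts use Ray Tune's API, and the parameter names below are made up for the example):

```python
import itertools

# Hypothetical search space; the actual spaces are defined in the tuner scripts.
search_space = {
    "learning_rate": [1e-4, 3e-4],
    "num_envs": [128, 512],
}

# A grid search runs one trial per combination of values.
trials = [dict(zip(search_space, combo))
          for combo in itertools.product(*search_space.values())]

for trial in trials:
    print(trial)
```

With two values per parameter this yields four trials; Ray Tune schedules such trials concurrently across the available GPUs/CPUs instead of looping sequentially.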

MEow

  • Train MEow on the AllegroHand environment using the default configuration with the following command:
PYTHON_PATH trainer_meow.py

NOTE: You may adjust the training configuration (e.g., num_envs, memory_size, and directory) in the main() function in trainer_meow.py.

NOTE: To terminate the job, please use kill -9 ${PID}.

  • Parallelizable training using Ray Tune:
# AllegroHand
PYTHON_PATH tuner_meow_allegro.py
# Ant
PYTHON_PATH tuner_meow_ant.py
# ANYmal
PYTHON_PATH tuner_meow_anymal.py
# FrankaCabinet
PYTHON_PATH tuner_meow_franka.py
# Humanoid
PYTHON_PATH tuner_meow_humanoid.py
# Ingenuity
PYTHON_PATH tuner_meow_ingenuity.py

NOTE: If you encounter the error message Exception: Failed to create simulation view backend, please clean up the cache by running sh clean_cache.sh.


Demonstration

The following demonstration videos show MEow during inference in six environments:

  • Ant
  • Humanoid
  • Ingenuity
  • ANYmal
  • AllegroHand
  • FrankaCabinet

Evaluation Results

Most of the evaluation results presented in the paper can be found here. See our instructions for reproducing the plots presented in the paper.


References

This code implementation is developed based on the SKRL and OmniIsaacGymEnvs repositories.


Cite this Repository

If you find this repository useful, please consider citing our paper:

@inproceedings{chao2024maximum,
    title={Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow},
    author={Chao, Chen-Hao and Feng, Chien and Sun, Wei-Fang and Lee, Cheng-Kuang and See, Simon and Lee, Chun-Yi},
    booktitle={Proceedings of the International Conference on Neural Information Processing Systems (NeurIPS)},
    year={2024}
}

Contributors of the Code Implementation

Contributor profile images are shown in the repository; click them to visit our GitHub pages.