SICMDP

Liangyu Zhang, Yang Peng, Wenhao Yang, Zhihua Zhang. Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning

This repository contains the implementation of the environments and algorithms (SI-CPO and SI-CPPO) from the paper. For the implementation of the SI-CRL algorithm, see https://github.com/pengyang7881187/SICMDP.

The tabular sewage-discharge environment and the SI-CPO algorithm are in tabular_envs; the other directories contain the code for the ship route planning environment and the RLlib-based SI-CPPO algorithm.

Requirements

Gurobipy
Gymnasium: pip install gymnasium
PyTorch: pip install torch==1.13.1+cu117 --extra-index-url https://download.pytorch.org/whl/cu117
RLlib: pip install "ray[rllib]"
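
Before running the training scripts, it can help to confirm that the packages listed above are importable. The following is a minimal sketch, not part of the repository: the helper name `missing_requirements` is hypothetical, and the import names (`gurobipy`, `gymnasium`, `torch`, `ray`) are assumed to correspond to the listed requirements. It only checks that each module can be located; it does not verify versions or a Gurobi license.

```python
from importlib.util import find_spec

# Assumed import names for the packages listed under Requirements.
REQUIRED_MODULES = {
    "gurobipy": "Gurobipy",
    "gymnasium": "Gymnasium",
    "torch": "PyTorch",
    "ray": "RLlib (installed via ray[rllib])",
}


def missing_requirements(modules=REQUIRED_MODULES):
    """Return the display names of modules that cannot be located."""
    return [label for name, label in modules.items() if find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_requirements()
    if missing:
        print("Missing:", ", ".join(missing))
    else:
        print("All listed requirements are importable.")
```

Running the file directly prints either the missing packages or a confirmation line.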

Folders and files

algorithms
configs
envs
eval
models
tabular_envs
utils
README.md
find_feasible_pollution_env.py
train_SICPO_and_baseline.py
train_SICPPO_and_baseline.py
