A List of Recent Safe Reinforcement Learning Papers

This repo lists most recent papers with their code in safe RL, some papers without available code are not included. Welcome to change this list if additional documents found.

Algorithms

Safe Reinforcement Learning with Stability Guarantees

Human in the loop

Trial without Error: Towards Safe Reinforcement Learning via Human Intervention

Applications

Combine with other methods:

Provably Safe Model-Based Meta Reinforcement Learning: An Abstraction-Based Approach
Context-Aware Safe Reinforcement Learning for Non-Stationary Environments, 2021 arxiv, no code, meta-learning
Learning to be Safe: Deep RL with a Safety Critic, 2020 arxiv, no code, transfer learning
Safe exploration of nonlinear dynamical systems: A predictive safety filter for reinforcement learning, no code, arxiv
REINFORCEMENT LEARNING WITH SAFE EXPLORATION FOR NETWORK SECURITY, no code
Continuous Safe Learning Based on First Principles and Constraints for Autonomous Driving, no code
Blind Spot Detection for Safe Sim-to-Real Transfer, code
UAV-aided cellular communications with deep reinforcement learning against jamming, no code
Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences, ICML 2020, code
Safe policy improvement with baseline bootstrapping, ICML 2019
Safe policy improvement with baseline bootstrapping in factored environments, aaai 2020

Surveys

Benchmarks

Benchmarking Safe Exploration in Deep Reinforcement Learning

Thesis

SAFE REINFORCEMENT LEARNING

Lectures

Safe Reinforcement Learning

Classical paper

Sui, Y., Gotovos, A., Burdick, J. W., and Krause, A. Safe exploration for optimization with Gaussian processes. In International Conference on Machine Learning (ICML), 2015.
Turchetta, M., Berkenkamp, F., and Krause, A. Safe exploration in finite Markov decision processes with Gaussian processes. In Neural Information Processing Systems (NeurIPS), 2016. code
Wachi et al. "Safe Exploration and Optimization of Constrained MDPs using Gaussian Processes." AAAI 2018. no code

Theory

Robust control

S. Bansal, M. Chen, S. Herbert, and C. J. Tomlin, “Hamilton-jacobi reachability: A brief overview and recent advances”, in Conference on Decision and Control (CDC), 2017.
S. Li and O. Bastani, “Robust model predictive shielding for safe reinforcement learning with stochastic dynamics”, in Proc. IEEE Int. Conf. Robotics and Automation (ICRA), 2020.
J. F. Fisac, A. K. Akametalu, M. N. Zeilinger, S. Kaynama, J. Gillula, and C. J. Tomlin, “A general safety framework for learningbased control in uncertain robotic systems”, in IEEE Transactions on Automatic Control, 2018.
J. H. Gillula and C. J. Tomlin, “Guaranteed safe online learning via reachability: Tracking a ground target using a quadrotor”, in Proc. IEEE Int. Conf. Robotics and Automation (ICRA), 2012.
E. Altman, Constrained Markov Decision Processes.1999, p. 260.

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A List of Recent Safe Reinforcement Learning Papers

Algorithms

Safe Exploration

Safe Planning

Policy Learning

Safe Reinforcement Learning with Stability Guarantees

Human in the loop

Applications

Combine with other methods:

Surveys

Benchmarks

Thesis

Lectures

Classical paper

Theory

Robust control

About

Releases

Packages

hlhang9527/Recent-Safe-RL-Papers-with-Code

Folders and files

Latest commit

History

Repository files navigation

A List of Recent Safe Reinforcement Learning Papers

Algorithms

Safe Exploration

Safe Planning

Policy Learning

Safe Reinforcement Learning with Stability Guarantees

Human in the loop

Applications

Combine with other methods:

Surveys

Benchmarks

Thesis

Lectures

Classical paper

Theory

Robust control

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages