model.pretrain() using DDPG+HER #249

Cladett · 2020-12-01T18:29:13Z

Hello,

I have tried to use HER+DDPG to pretrain an agent based on some recorded demonstrations.
From the error I obtained i believe right now the library does not offer this feature, correct?
When i pretrain only using DDPG everything run smootly but with her i encounter problems.

Training with HER

  model_class = DDPG  # works also with SAC, DDPG and TD3                                                                                                        
 goal_selection_strategy = 'future' # equivalent to GoalSelectionStrategy.FUTURE                                                                                
 print('Ready to test HER')                                                                                                                                     
 model = HER('MlpPolicy', wrapped_env, model_class, n_sampled_goal=4, goal_selection_strategy=goal_selection_strategy,                                           tensorboard_log="./her_dvrl_tensorboard/" ,verbose=1 )                                                                                                                                                                 
                                                                                                                                                                
 # Pretrain the model using the demonstration                                                                                                                   
  model.pretrain(dataset, n_epochs=1000)                                                                                                                         
  model.save("./models/pretrain_1.12.20", cloudpickle = True)

The text was updated successfully, but these errors were encountered:

Miffyli · 2020-12-01T21:26:21Z

Yes, there is no pretrain in stable-baselines3 (there is some crude support in stable-baselines). See e.g. d3rlpy for offline RL algorithms.

Cladett · 2020-12-02T08:14:37Z

Thank you for your answer. How about the imitation library in stable-baselines3?
I thought that was the equivalent of model.pretrain() from stable-baselines.

araffin · 2020-12-02T09:18:17Z

Hello,
You have different solution:

if you just want to pretrain using behavior cloning, best is probably to write some custom code, we give some examples (see Add support for pretraining [feature request] #27 )
for behavior cloning, you can also take a look at the imitation repo (cf doc), but I'm not sure if HER is supported
if you want to use offline learning, then you can take a look at the offline learning repo that @Miffyli mentioned (but I'm also not sure that HER is supported there)

EDIT: note for myself, we need to add this to the migration guide, it is missing currently

OscarGarciaF · 2020-12-07T22:33:46Z

So there won't be any model.pretrain() in the future?

Why would a feature that was so simple to do on previous versions be missing: https://stable-baselines.readthedocs.io/en/master/guide/pretrain.html

Miffyli · 2020-12-07T22:35:28Z

So there won't be any model.pretrain() in the future?

Not planned anytime soon as other libraries fill this part. It might be considered later on, however to correctly (and cleanly) support these would require extensive additions and careful implementing, hence we refer to other libraries focusing on those algorithms for the time being.

Cladett added custom gym env Issue related to Custom Gym Env question Further information is requested labels Dec 1, 2020

araffin added the duplicate This issue or pull request already exists label Dec 2, 2020

araffin added the documentation Improvements or additions to documentation label Dec 7, 2020

araffin mentioned this issue Feb 1, 2021

Fix numpy warning and update migration guide #307

Merged

14 tasks

araffin closed this as completed in #307 Feb 1, 2021

araffin mentioned this issue Dec 13, 2021

[Question] I have a data set of 'State - action - reward - next state', can I pretrain my model? #690

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

model.pretrain() using DDPG+HER #249

model.pretrain() using DDPG+HER #249

Cladett commented Dec 1, 2020

Miffyli commented Dec 1, 2020 •

edited

Loading

Cladett commented Dec 2, 2020

araffin commented Dec 2, 2020 •

edited

Loading

OscarGarciaF commented Dec 7, 2020

Miffyli commented Dec 7, 2020

model.pretrain() using DDPG+HER #249

model.pretrain() using DDPG+HER #249

Comments

Cladett commented Dec 1, 2020

Training with HER

Miffyli commented Dec 1, 2020 • edited Loading

Cladett commented Dec 2, 2020

araffin commented Dec 2, 2020 • edited Loading

OscarGarciaF commented Dec 7, 2020

Miffyli commented Dec 7, 2020

Miffyli commented Dec 1, 2020 •

edited

Loading

araffin commented Dec 2, 2020 •

edited

Loading