-
Notifications
You must be signed in to change notification settings - Fork 34
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
* Decoupled RSSM for DV3 agent * Initialize posterior with prior if is_first is True * Fix PlayerDV3 creation in evaluation * Fix representation_model * Fix compute first prior state with a zero posterior * DV3 replay ratio conversion * Removed expl parameters dependent on old per_Rank_gradient_steps * feat: update repeats computation * feat: update learning starts in config * fix: remove files * feat: update repeats * Let Dv3 compute bootstrap correctly * feat: added replay ratio and update exploration * Fix exploration actions computation on DV1 * Fix naming * Add replay-ratio to SAC * feat: added replay ratio to p2e algos * feat: update configs and utils of p2e algos * Add replay-ratio to SAC-AE * Add DrOQ replay ratio * Fix tests * Fix mispelled * Fix wrong attribute accesing * FIx naming and configs * feat: add terminated and truncated to dreamer, p2e and ppo algos * fix: dmc wrapper * feat: update algos to split terminated from truncated * fix: crafter and diambra wrappers * feat: replace done with truncated key in when the buffer is added to the checkpoint * Set Distribution.validate_args once at the beginning * Move validate_args in run method * Defined PPOPlayer * Defined RecurrentPPOPlayer * feat: added truncated/terminated to minedojo environment * Add SACPlayer * FIx evaluate.py for PPO and RecurrentPPO * FIx DrOQ build_agent * Delete unused training agent during evaluation * Fix SACPlayer * Fix DrOQ build_agent * Fix PPO decoupled creating single-device fabric * Adapt SAC decoupled to new build_agent * Fix typings + add get_actions method * Add SACAEPlayer * Add A2CPlayer from PPOPlayer * Fix get_single_device_fabric * Fix calling get_values on player instead of agent * Setup PLayerDV1 in build_agent * Fix typings * DV2 player * Remove one weight tie * Fix DV3 player in build_agent * Fix return player from build_agent * Set actor_type during evaluation * Build player in build_agent for P2E-DV3 * Update comments * Update dreamer-v3 cfg * Learnable initial recurrent state choice * Fix DecoupledRSSM to accept the learnable_initial_recurrent_state flag * Preserve input dtype after LayerNorm (pytorch/pytorch#66707 (comment)) * Fix imports * Move hyperparams to rightful key inside world_model * sample_actions to greedy * From sample_actions to greedy * From sample_actions to greedy * unwrap_fabric before test in p2e * Fix player in notebook * Update how-tos --------- Co-authored-by: Michele Milesi <[email protected]>
- Loading branch information
1 parent
5e75246
commit 32a3736
Showing
53 changed files
with
1,574 additions
and
623 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.