Models used in paper #2

emlynw · 2024-09-17T12:46:22Z

Were the "dino_ensemble", "MaskVisionTransformerEnc", and "KeypointsVisionTransformerEnc" methods in obs_wrapper.py used for the results presented in the paper, or was the CLS token used to represent the state for all ViT-based models?

Thanks!
