Were the "dino_ensemble", "MaskVisionTransformerEnc", and "KeypointsVisionTransformerEnc" methods in obs_wrapper.py used for the results presented in the paper, or was the CLS token used to represent the state for all ViT-based models?
Thanks!