Hi, I'm trying to reproduce the OpenVLA-SFT results on Simpler-Env subject generalization tasks. I have some questions about the data collection process:
1. For in-domain tasks like PutCarrotOnPlate, how many episodes did you collect per task? The paper mentions 100 trajectories total but doesn't specify the per-task breakdown.
2. For SFT training, how many successful episodes did you use per task? Since OpenVLA-SFT uses only successful trajectories, it would be helpful to know the target number.
3. For challenging tasks where episodes often fail (e.g., PutSpoonOnTableClothInScene, where all 20 episodes failed in my attempts), what was your approach? Did you:
   - Extend the episode range until you had enough successes?
   - Make environmental adjustments?
   - Require a minimum success rate?
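To make the first option concrete, here is a minimal sketch of what I mean by extending the episode range. Everything in it is an assumption for illustration: `run_episode`, `target_successes`, and `max_episodes` are hypothetical names, not taken from the paper, the repo, or the Simpler-Env API, and the toy rollout at the bottom stands in for a real simulator episode.

```python
from typing import Callable, List, Tuple


def collect_successes(
    run_episode: Callable[[int], Tuple[bool, list]],
    target_successes: int,
    max_episodes: int,
) -> Tuple[List[list], int]:
    """Roll out episode ids 0, 1, 2, ... and keep only successful trajectories.

    Stops as soon as `target_successes` trajectories are kept, or after
    `max_episodes` attempts, whichever comes first.
    `run_episode` is a hypothetical callable returning (success, trajectory).
    """
    kept: List[list] = []
    attempted = 0
    for episode_id in range(max_episodes):
        if len(kept) >= target_successes:
            break
        success, trajectory = run_episode(episode_id)
        attempted += 1
        if success:
            kept.append(trajectory)
    return kept, attempted


# Toy stand-in for a simulator rollout: every third episode id succeeds.
def fake_episode(episode_id: int) -> Tuple[bool, list]:
    return episode_id % 3 == 0, [f"step-of-episode-{episode_id}"]


trajectories, attempted = collect_successes(
    fake_episode, target_successes=5, max_episodes=100
)
print(len(trajectories), attempted)
```

With a cap like `max_episodes`, a task that never succeeds (as PutSpoonOnTableClothInScene did for me) ends with fewer trajectories than the target instead of looping forever, which is why I'm asking whether you used a cap or a minimum success rate.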
This information would help ensure proper replication of the training setup. Thanks!