[Eval] How to choose the best checkpoint in the paper? #6
Comments
By the way, the validation sets cannot be accessed from the FTP server. Could you please upload the relevant datasets?
Let me know if that helps.
Thank you for your reply. The data download is now very convenient! However, I still have some doubts about the time required for training and evaluation.
The paper mentions that using the bimanual setting would result in a total training time of about 54 hours. However, my single-task training takes 15 hours, so training all 13 tasks would take 15 * 13 = 195 hours, which far exceeds the time reported in the paper. Is there anything I should improve? Evaluation also takes a long time. What can I do to reduce the cost?
That's great that you're able to train the network! Just to clarify, the paper doesn't mention the total training cost; instead, Table 4 reports the average training time. To estimate your total training time, you'd multiply this average by the number of tasks you’re running. Given that your setup may differ in hardware or other configurations, it's also expected that the actual time might vary. Regarding the evaluation, I assume this is due to the headless mode, but I need more information about this. Is this some kind of HPC system? Happy to chat to speed things up.
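The estimate described in the reply above (average per-task time multiplied by the number of tasks) can be sketched as follows. This is only an illustration: the function name is hypothetical, the 15 hours and 13 tasks are the figures quoted in the question, and it assumes tasks are trained one after another on a single machine.

```python
def estimate_total_training_hours(avg_hours_per_task: float, num_tasks: int) -> float:
    """Estimate total wall-clock hours when tasks are trained sequentially.

    Hypothetical helper: it only multiplies the average per-task time by the
    number of tasks, as suggested in the reply above.
    """
    if avg_hours_per_task < 0 or num_tasks < 0:
        raise ValueError("avg_hours_per_task and num_tasks must be non-negative")
    return avg_hours_per_task * num_tasks


# Figures quoted in the question: 15 hours per task, 13 tasks.
print(estimate_total_training_hours(15, 13))  # 195
```

Actual times will still vary with hardware and configuration, so the result is a rough planning figure rather than a prediction.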
I apologize for mistakenly considering the average task training time in the paper as the total training time. So far, I have only completed the training for the coordinated_lift_ball task and have not yet conducted a full test to verify the effectiveness of the training. Additionally, I am not very familiar with HPC. I am using a regular GPU computing server without any special modifications. By the way, could you please provide the specific configurations for training and validation? This would help us troubleshoot in case any issues arise.
Hi, could you release the model checkpoints (including ACT/RVT-LF/Peract-LF/Peract^2) for reproducing the results reported in the paper?
Hi aopolin-lv, I have the first results for multi-task training! I will update the webpage with the results soon, but I would like to finish the documentation first. I can also share my checkpoints then. Let me know if you have any further questions. Kind regards,
Hello, I have a question. How do I get the initial positions and rotation matrices of the two robotic arms in the scene? With the initial position and rotation matrix, I could compute the end-effector position.
Hi aopolin-lv, The end-effector pose is in the dataset. Unfortunately, the pose of the robot arms is not stored. But this one is static, so I can get this for you. I am writing a small tool to display the trajectory of the end-effectors and a point cloud. I hope I will be done with the documentation soon. Kind regards, Markus PS: I am still working on the multi-task training and I hope to have some results soon.
Hi, Robot poses: What would be your reference coordinate system? Do you need something like this? Scene visualization: I have added a tool to visualize a scene including the trajectory of the end-effectors. You can find it here https://github.com/markusgrotz/RLBench/blob/main/tools/visualize_dataset.py Does this help you?
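To illustrate why the reference coordinate system matters here, below is a minimal sketch of expressing a world-frame end-effector position in a robot's base frame. It assumes the base pose is given as a position `t` and a 3x3 rotation matrix `R` that maps base-frame vectors into the world frame, so that `p_world = R p_base + t`. The function name and all numbers are hypothetical; the thread does not specify how the dataset stores poses or what the actual base poses are.

```python
from typing import List, Sequence


def world_to_base(
    point_world: Sequence[float],
    base_position: Sequence[float],
    base_rotation: Sequence[Sequence[float]],
) -> List[float]:
    """Express a world-frame point in the robot base frame.

    Assumes p_world = R @ p_base + t, hence p_base = R^T @ (p_world - t).
    """
    diff = [point_world[i] - base_position[i] for i in range(3)]
    # Row j of R^T is column j of R, so sum over the row index i.
    return [sum(base_rotation[i][j] * diff[i] for i in range(3)) for j in range(3)]


# Illustrative values only: base at (1, 0, 0), rotated 90 degrees about z.
R = [[0, -1, 0],
     [1, 0, 0],
     [0, 0, 1]]
print(world_to_base([1, 2, 0], [1, 0, 0], R))  # [2, 0, 0]
```

With two arms, the same function would be called once per arm with that arm's own base pose.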
Hello, after completing the training of the model, I don't know how to choose the right checkpoint. I would appreciate it if you could answer the following questions.
When evaluating and testing, do you run the eval.py script on the checkpoints saved every 10k steps after training has completed, and select the one with the highest score? Specifically, given 40k training steps, the checkpoints at 10k, 20k, 30k, and 40k would be evaluated one by one, and the checkpoint with the highest score would be selected for the final test on the test set.
How can I improve the speed of testing? Specifically, when I run the eval.py script, it takes 1 hour to complete 25 episodes of a single task. The hardware I'm using includes an Intel-8352V CPU with 72 cores and an A800-80G GPU with performance similar to the A100-80G. May I ask what your typical efficiency is when running eval.py?
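The selection procedure described in the question can be sketched as below. Note that this is the questioner's proposed procedure, not one confirmed by the maintainers in this thread. The function names are hypothetical, the scores are made-up placeholders rather than measured results, and the cost estimate simply reuses the figures quoted in the thread (1 hour per task evaluation, 13 tasks, 4 checkpoints).

```python
from typing import Dict


def select_best_checkpoint(val_scores: Dict[int, float]) -> int:
    """Return the training step with the highest validation score.

    Ties are resolved in favor of the earlier step.
    """
    if not val_scores:
        raise ValueError("val_scores must not be empty")
    # max() keeps the first maximum it sees, so iterate steps in ascending order.
    return max(sorted(val_scores), key=lambda step: val_scores[step])


def estimate_eval_hours(num_checkpoints: int, num_tasks: int, hours_per_task: float) -> float:
    """Rough sequential cost of evaluating every checkpoint on every task."""
    return num_checkpoints * num_tasks * hours_per_task


# Placeholder scores for illustration only (not results from the paper or the repo).
scores = {10_000: 0.1, 20_000: 0.3, 30_000: 0.2, 40_000: 0.3}
print(select_best_checkpoint(scores))  # 20000
print(estimate_eval_hours(4, 13, 1.0))  # 52.0
```

The second function makes the cost of this procedure visible: at the evaluation speed reported in the question, scoring four checkpoints on all tasks sequentially would take roughly 52 hours before the final test run.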