bad performance on the same wild video #6
Comments
In terms of the output resolution, you set that with I'm not sure how much difference it'll make, but consider changing center as well, since all 3 are used to renormalize the camera. How did you build your dataset file?
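To illustrate why the frame width and height matter for the 2D input, here is a minimal sketch of screen-coordinate normalization. It assumes pixel keypoints are rescaled so that x spans [-1, 1] while the aspect ratio is preserved. The function name `normalize_keypoints` and the 1920x1080 frame size are illustrative assumptions, not necessarily the repository's exact code.

```python
import numpy as np

def normalize_keypoints(points_px, width, height):
    """Map pixel coordinates so x in [0, width] becomes [-1, 1], keeping the aspect ratio."""
    points_px = np.asarray(points_px, dtype=np.float64)
    assert points_px.shape[-1] == 2, "expected (..., 2) pixel coordinates"
    return points_px / width * 2.0 - np.array([1.0, height / width])

# Top-left corner, image center, and bottom-right corner of an assumed 1080p frame.
corners = np.array([[0.0, 0.0], [960.0, 540.0], [1920.0, 1080.0]])
normalized = normalize_keypoints(corners, width=1920, height=1080)
print(normalized)
```

If the width and height passed here do not match the video the keypoints were extracted from, the normalized poses are shifted and scaled incorrectly before they ever reach the 3D model.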
Did you follow the instructions mentioned in my last post here? Also, in this comment I mentioned that we used CPN to extract the 2D keypoints for the videos in the wild, which produces slightly better results. Anyway, if you followed the steps correctly, Detectron poses should be very similar. We took the video from YouTube as well, in 1080p resolution.
@bucktoothsir Regarding visualizations of in-the-wild videos: in the second step, where you converted the input video to individual frames, how did you preprocess each frame (scale, crop, center, etc.) before passing it to Detectron?
Your advice works, thanks. Now I get a high-resolution output, but the performance remains bad. I built a dataset file with the same structure as the original dataset file. Specifically, I built a fake 3D dataset file and a 2D dataset file. The structure is 'S0/skating'; you can rename the subjects and actions, then change the corresponding names in your test scripts.
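A minimal sketch of what such a 2D dataset file could look like, assuming a nested subject/action layout stored in a compressed `.npz` archive. The key names `positions_2d` and `metadata`, the file name, the 17-joint count, and the placeholder keypoints are all assumptions for illustration; check them against the loader in your copy of the code.

```python
import os
import tempfile
import numpy as np

num_frames, num_joints = 100, 17  # assumed sizes, for illustration only
# Placeholder keypoints; in practice these come from the 2D detector, one (x, y) per joint.
keypoints = np.zeros((num_frames, num_joints, 2), dtype=np.float32)

# Subject 'S0', action 'skating', one camera view (a list with a single array).
positions_2d = {"S0": {"skating": [keypoints]}}
metadata = {"num_joints": num_joints}  # hypothetical metadata content

path = os.path.join(tempfile.mkdtemp(), "data_2d_custom.npz")  # hypothetical file name
np.savez_compressed(path, positions_2d=positions_2d, metadata=metadata)

# Dictionaries are stored as pickled 0-d object arrays, so loading needs allow_pickle=True.
loaded = np.load(path, allow_pickle=True)["positions_2d"].item()
print(list(loaded.keys()), loaded["S0"]["skating"][0].shape)
```

Renaming the subject or action only requires changing the dictionary keys, as long as the test script looks up the same names.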
I didn't take any preprocessing steps.
I also wrote the dataset file myself.
@bucktoothsir Thanks for the response. Also, I'm trying to get keypoints on my images using the Detectron model (the R-50-FPN End-to-End Keypoint-Only Mask R-CNN Baseline model, in this page), using the command:
python Detectron.pytorch/tools/infer_simple.py --dataset coco --cfg Detectron.pytorch/configs/baselines/e2e_keypoint_rcnn_R-50-FPN_1x.yaml --load_detectron Detectron.pytorch/data/pretrained_model/e2e_keypoint_rcnn_R-50-FPN_1x.pkl --image_dir videoframes --output_dir Detectron.pytorch/keypoints
but I'm getting this error:
RuntimeError: The expanded size of the tensor (81) must match the existing size (2) at non-singleton dimension 0
It would be great if you (or anyone else reading this) could provide any hints on how you're obtaining keypoints through this process. Thanks!
@wishvivek Which version of Python do you use?
@dariopavllo I have the 3D predictions from the model for my in-the-wild video, but they're all normalized (i.e., [-1,1]). So,
Any help will be great, thanks!
How do I get the keypoints and bounding boxes for 12_2017_baselines/e2e_keypoint_rcnn_R-101-FPN_s1x.yaml? Or do I need to train on Detectron to get the model? Can anyone help me? Thanks a lot.
I used Detectron, as the author advised.
Thanks @bucktoothsir for pointing me to this issue! As I already mentioned in #2, I was also able to run the code on an in-the-wild example with my own fork of this repository. I also have some notes on Detectron in there for people having difficulties. My 3D results are also way worse than the results created by @dariopavllo. I think my 2D poses are not accurate enough; thanks also to @lxy5513, who suggested that. So my next step would be to actually run the Detectron poses through CPN to get better 2D results. If someone has another opinion, please share. Maybe I did something wrong in my code?
My output / Author's output
@Godatplay @tobiascz I used the inference code to run my own video, taking Detectron's 2D keypoints as input. The buttocks in my output seem fixed in place, while I think they should move. Have you met a similar problem? Is there any potential solution I can try to improve the result?
Hey @YCyuchen, the reason is that the 3D skeleton is always visualized relative to the center hip joint (you called it the buttocks). To avoid this, you could use the ankles as the relative center of the visualization. @dariopavllo already discussed this in this issue.
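As a rough sketch of the ankle-centering idea, the snippet below re-roots each frame at the midpoint of the two ankles instead of the hip. The joint indices (hip 0, right ankle 3, left ankle 6) assume a 17-joint Human3.6M-style skeleton, and the function name `reroot_to_ankles` is hypothetical; whether this matches the approach discussed in the linked issue is also an assumption.

```python
import numpy as np

HIP, RIGHT_ANKLE, LEFT_ANKLE = 0, 3, 6  # assumed joint indices, verify for your skeleton

def reroot_to_ankles(poses_3d):
    """Shift each frame so the midpoint between the ankles sits at the origin."""
    poses_3d = np.asarray(poses_3d, dtype=np.float64)  # shape (frames, joints, 3)
    ankle_mid = poses_3d[:, [RIGHT_ANKLE, LEFT_ANKLE]].mean(axis=1, keepdims=True)
    return poses_3d - ankle_mid

# Synthetic hip-relative poses: the hip is at the origin in every frame.
rng = np.random.default_rng(0)
poses = rng.normal(size=(5, 17, 3))
poses -= poses[:, HIP:HIP + 1]

rerooted = reroot_to_ankles(poses)
print(rerooted[:, HIP])  # the hip now moves from frame to frame
```

This only changes the reference point used for display. It does not recover the global trajectory of the person in the scene.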
Hello,
Obviously it is worse than your result.
I noticed that your video has a high resolution and much more accurate 2D joints. Could you please release the original skating video and the in-the-wild test code?