-
Notifications
You must be signed in to change notification settings - Fork 216
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MaskDino Fails to learn Precise Bounding Boxes on custom dataset but Dino does #242
Comments
To reproduce the results, I have used a public dataset with similar characteristics. In The COB-3D dataset, see: https://arxiv.org/abs/2210.07424 . I have extracted rgb images, bounding boxes, instance mask in the coco format. The dataset is a bit small (~6k images) and can be downloaded here. The original data is here. Please note that the data is published under the CC, non-comercial see https://github.com/wyndwarrior/autoregressive-bbox/blob/main/LICENSE . An image of the predictions with maskdino and the gt are: The logs for dino and mask dino are uploaded below. Interestingly:
|
Hello, I notice that the boxes by maskdino are all shifted upper right a little bit. I guess there may be some bugs in the postprocessing code. |
@FabianSchuetze When you have relatively small datasets, Mask-RCNN usually can do good enough. MaskDINO and DINO are suitable for relatively large datasets such as COCO. |
@FabianSchuetze We fixed a bug in #249. Maybe you can run again to see if this solved your problem. Please also refer to the discussions in #247 . |
Thank you so much, @HaoZhang534 ! I will train the model again tomorrow and report back. |
@HaoZhang534 . I have worked wit the new commits but the bounding boxes are still shifted. I have commented again in #247 . Furthermore, I am still not getting very good results. Maybe the training process is not really possible with just a batch size of 4? I will try to train on MS CoCo and see whether I can reproduce the original results. Can you maybe attach a log of the original training process? That would be wonderful & would make a comparison easier. |
Thanks for the wonderful repo. It's a pleasure to work with it and to read the code.
When training MaskDino on a custom dataset, the bounding box predictions are not very good. Interestingly:
Does anybody have an idea what I could tune to generate good bb results?
Training Details:
I have slightly modified the training process (see this branch https://github.com/FabianSchuetze/detrex/tree/my_changes). I added amp training and have included some gradient checkpointing. I train with one GPU and a batch size of four (for MaskDino, Dino works with a batch size of 8). The learning rate is lowered linearly.
Data:
The instances are very dense, similar to the "is-crowded" scenes of COCO. There is only one class. I have adjusted the num_objects in the config files.
Logs:
Logs of the training runs are attached below. There are three logs:
Hyparameters:
Comparing the parameters, the following aspects seem notable:
maskdino_0.4_noise_scale.txt
maskdino_1.0_noise_scale.txt
dino_log.txt
Does anybody have an idea how to debug the problem?
The text was updated successfully, but these errors were encountered: