finetune image size #82

Open
hardlipay opened this issue May 30, 2023 · 5 comments

Comments

@hardlipay

The official description mentions that BLIP uses a resolution of 224, which may be too low for understanding fine image details. Can the image size be changed during fine-tuning? If not, and training from scratch is required, is it possible to train only the visual model, or is a complete alignment training needed?

@freelancerllm

I verified that the image size can be changed. I currently have it set to 540, but I have not evaluated the results yet.

@hardlipay
Author

> I verified that the image size can be changed. I currently have it set to 540, but I have not evaluated the results yet.

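I tried fine-tuning with the size changed directly in the source code. The tensor sizes do not match and the data cannot be passed through, so it probably does not work. Changing the size may require redoing the alignment training from scratch.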
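One likely source of such a mismatch, sketched under assumptions: in a ViT-style image encoder the learned position embedding has one row per patch plus one for a class token, so its shape is fixed by the pretraining resolution. The patch size of 14 and the presence of a class token are assumptions here, not details confirmed in this thread, and `num_position_tokens` is a hypothetical helper rather than a function from the repository.

```python
def num_position_tokens(image_size: int, patch_size: int, has_cls_token: bool = True) -> int:
    """Number of position-embedding rows a ViT-style encoder needs for a square image.

    Floor division mirrors an unpadded strided patch projection, which ignores
    pixels that do not fill a whole patch.
    """
    grid = image_size // patch_size
    return grid * grid + (1 if has_cls_token else 0)


PATCH_SIZE = 14  # assumption: not stated anywhere in this thread

pretrained_tokens = num_position_tokens(224, PATCH_SIZE)  # 16 * 16 + 1
for size in (224, 384, 540):
    tokens = num_position_tokens(size, PATCH_SIZE)
    status = "matches" if tokens == pretrained_tokens else "does not match"
    print(f"image size {size}: {tokens} tokens, {status} the pretrained {pretrained_tokens}")
```

Under these assumptions, a checkpoint trained at 224 stores 257 position rows, while 384 and 540 inputs produce 730 and 1445 tokens, so adding the stored embedding to the new token sequence fails on shape alone.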

@zgjiangtoby

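> > I verified that the image size can be changed. I currently have it set to 540, but I have not evaluated the results yet.
>
> I tried fine-tuning with the size changed directly in the source code. The tensor sizes do not match and the data cannot be passed through, so it probably does not work. Changing the size may require redoing the alignment training from scratch.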

So do we need to wait for the pre-training code to be released?

@cdqncn

cdqncn commented Oct 31, 2023

How can the resolution be increased to 384?

@1049451037
Member

You can refer to this: #296
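For context, and without claiming this is what #296 describes: a common general technique for running a ViT-style encoder at a higher resolution is to resize the pretrained grid of position embeddings to the new grid size by interpolation, keeping any class-token row aside. The sketch below shows the idea in plain Python with bilinear interpolation; `resize_pos_grid` is a hypothetical helper, and real code would more likely use `torch.nn.functional.interpolate` on a tensor.

```python
def resize_pos_grid(grid, new_size):
    """Bilinearly resize a square grid of embedding vectors.

    `grid` is a list of rows, each row a list of vectors (lists of floats).
    Corner vectors are preserved (an align-corners style mapping).
    """
    old_size = len(grid)
    dim = len(grid[0][0])

    def src_coord(i):
        # Map an output index to a fractional position in the source grid.
        return i * (old_size - 1) / (new_size - 1) if new_size > 1 else 0.0

    out = []
    for i in range(new_size):
        y = src_coord(i)
        y0 = int(y)
        y1 = min(y0 + 1, old_size - 1)
        wy = y - y0
        row = []
        for j in range(new_size):
            x = src_coord(j)
            x0 = int(x)
            x1 = min(x0 + 1, old_size - 1)
            wx = x - x0
            # Blend the four neighbouring source vectors, one dimension at a time.
            vec = [
                (1 - wy) * ((1 - wx) * grid[y0][x0][d] + wx * grid[y0][x1][d])
                + wy * ((1 - wx) * grid[y1][x0][d] + wx * grid[y1][x1][d])
                for d in range(dim)
            ]
            row.append(vec)
        out.append(row)
    return out


# Toy example: a 2x2 grid of 1-dimensional "embeddings" resized to 3x3.
small = [[[0.0], [1.0]], [[2.0], [3.0]]]
large = resize_pos_grid(small, 3)
print(len(large), len(large[0]), large[1][1])
```

Interpolated embeddings are only a starting point; the model usually still needs fine-tuning at the new resolution before quality can be judged.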
