Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training data and inclusion #13

Open
mashdragon opened this issue Dec 31, 2024 · 2 comments
Open

Training data and inclusion #13

mashdragon opened this issue Dec 31, 2024 · 2 comments

Comments

@mashdragon
Copy link

Hi there,

I have noticed that JoyCaption sometimes skips image features I want the captions to talk about. I know I can fine tune it, but I am curious about where the training data came from and reviewing the training data might help explain to me why the model behaves this way.

Will you publish the training data set or will that always remain hidden?

@fpgaminer
Copy link
Owner

All of the training data will be published (with images as hashes and urls where possible). I'm working on getting the dataset organized (it's a complete mess at the moment), so expect it to be uploaded closer to a version 1.0 release.

Yeah, the model will miss features for a variety of reasons. Finetuning will always be the best for improving that, but I'm working to get better instruction following into JoyCaption so that it can be guided by a prompt to focus on whatever specifically you want it to describe.

@mashdragon
Copy link
Author

Thank you for responding, I'm looking forward to seeing the dataset. From a user perspective, using a torrent would be most convenient as it has hash checking included, but I understand if it's too large for that to be practical.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants