Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix(vlm): handle legacy conversation data format and check image in data #2018

Merged
merged 6 commits into from
Dec 3, 2024

Conversation

NanoCode012
Copy link
Collaborator

Description

The current vision data processing expects in the OAI format and with images. This PR allows passing text-only dataset to training vision models and converts the old sharegpt-like format datasets.

The caveat is that, it still does not allow mixing text-only and text+image data points per batch yet. We would need to patch upstream for this.

Motivation and Context

Could not pass normal sharegpt dataset.

How has this been tested?

Not yet after refactor.

Screenshots (if appropriate)

Types of changes

Social Handles (Optional)

@hjc-puro

@NanoCode012 NanoCode012 marked this pull request as ready for review November 11, 2024 14:53
@winglian
Copy link
Collaborator

would love to have a unit test for this in place.

@NanoCode012 NanoCode012 changed the title fix: handle legacy conversation data format and check image in data fix(vlm): handle legacy conversation data format and check image in data Nov 23, 2024
@winglian winglian merged commit 822c904 into main Dec 3, 2024
14 checks passed
@winglian winglian deleted the fix/mm_chat_template branch December 3, 2024 05:01
bursteratom pushed a commit that referenced this pull request Dec 4, 2024
…ata (#2018) [skip ci]

* fix: handle legacy conversation data format and check image in data

* feat: add test for llama vision

* feat: add max_steps to test

* fix: incorrect indent and return preprocess

* feat: use smaller model and dataset

* chore: add extra config for sharegpt dataset
bursteratom pushed a commit that referenced this pull request Dec 4, 2024
…ata (#2018) [skip ci]

* fix: handle legacy conversation data format and check image in data

* feat: add test for llama vision

* feat: add max_steps to test

* fix: incorrect indent and return preprocess

* feat: use smaller model and dataset

* chore: add extra config for sharegpt dataset
djsaunde pushed a commit that referenced this pull request Dec 16, 2024
…ata (#2018) [skip ci]

* fix: handle legacy conversation data format and check image in data

* feat: add test for llama vision

* feat: add max_steps to test

* fix: incorrect indent and return preprocess

* feat: use smaller model and dataset

* chore: add extra config for sharegpt dataset
djsaunde pushed a commit that referenced this pull request Dec 17, 2024
…ata (#2018) [skip ci]

* fix: handle legacy conversation data format and check image in data

* feat: add test for llama vision

* feat: add max_steps to test

* fix: incorrect indent and return preprocess

* feat: use smaller model and dataset

* chore: add extra config for sharegpt dataset
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants