
LLaVA OV: fix unpadding precision #34779

Merged: 3 commits into huggingface:main on Nov 20, 2024

Conversation

@zucchini-nlp (Member) commented on Nov 18, 2024

What does this PR do?

Fixes #34625. There was a small precision error in unpadding because the modeling code casts the size to a list, while the processing code works with tensors. This PR casts everything to a list so both paths run the same calculation.
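To make the failure mode concrete, here is a minimal sketch (hypothetical sizes and helper names, not the actual transformers code) of how the same unpadding arithmetic can round differently depending on whether it runs on float32 tensors or on Python floats:

```python
import torch

def unpadded_height_from_tensor(original_size: torch.Tensor, current_width: int) -> int:
    # tensor path: the aspect-ratio division runs in float32
    ratio = original_size[0].float() / original_size[1].float()  # height / width
    return int(current_width * ratio)

def unpadded_height_from_list(original_size: list, current_width: int) -> int:
    # list path (what this PR standardizes on): plain Python float64 arithmetic
    height, width = original_size
    return int(current_width * (height / width))

size = torch.tensor([363, 500])  # hypothetical (height, width) of the raw image
# For some sizes the float32 and float64 quotients land on opposite sides of an
# integer boundary, so the two paths disagree by one row of features.
print(unpadded_height_from_tensor(size, 1152))
print(unpadded_height_from_list(size.tolist(), 1152))
```

When the two paths disagree by even one row or column, the number of image features no longer matches the number of image placeholder tokens, which is the mismatch reported in #34625.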

@zucchini-nlp requested a review from @qubvel on November 18, 2024, 11:20
@HuggingFaceDocBuilderDev commented

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@qubvel (Member) left a comment


Thanks for the fix! Just a question re types

@zucchini-nlp (Member, Author) commented

Sorry @qubvel, wrong tag for the review.

@qubvel removed their request for review on November 18, 2024, 14:40
@ArthurZucker (Collaborator) left a comment


Thanks, does it fix #34625 entirely? (let's close it if so!)

@zucchini-nlp merged commit 145fbd4 into huggingface:main on Nov 20, 2024
10 checks passed
@Scyther-07 commented
Hey @zucchini-nlp, I am running the following code but it is throwing a TypeError:

```python
from transformers import AutoProcessor, LlavaForConditionalGeneration, BitsAndBytesConfig, LlavaNextProcessor

model_id = "llava-hf/llava-v1.6-mistral-7b-hf"
# AutoProcessor resolves this checkpoint to LlavaNextProcessor under the hood
processor = AutoProcessor.from_pretrained(model_id)
```

Error:
TypeError: LlavaNextProcessor.__init__() got an unexpected keyword argument 'image_token'

I looked at the commits in this PR and they look good to me. The issue might be with your earlier PR #33424. Kindly look into this.

@zucchini-nlp (Member, Author) commented

@Scyther-07 hey, which transformers version are you using?

@Scyther-07 commented

> @Scyther-07 hey, which transformers version are you using?

It's 4.39.3. I just noticed that the above code works fine on Google Colab but throws an error in a Kaggle notebook. I don't know what to make of it; I think I should switch to Colab.

@zucchini-nlp (Member, Author) commented

@Scyther-07 hmm, 4.39.3 would indeed throw that error; you need at least v4.43 to avoid it. In fact, we are currently changing the way inputs for VLMs are processed, so I'd recommend using the latest transformers once it's released. That will be v4.47, due in the next 1-2 weeks (not released yet) :)
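For anyone hitting the same thing, a quick way to confirm what a notebook environment is actually running (the v4.43 floor is taken from the comment above):

```python
import transformers

print(transformers.__version__)  # per the comment above, needs >= 4.43

# In a Colab/Kaggle notebook, upgrade with:
#   !pip install -U transformers
# then restart the kernel so the new version is picked up.
```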

@Scyther-07 commented

Yeah, it worked. Silly of me. Thanks for the help!

BernardZach pushed a commit to BernardZach/transformers that referenced this pull request Dec 5, 2024
BernardZach pushed a commit to innovationcore/transformers that referenced this pull request Dec 6, 2024
Successfully merging this pull request may close issue #34625: LLaVA-OneVision mismatch between image features and image tokens.