Llava Onevision: add model #32673

Merged
merged 34 commits into from
Sep 5, 2024
04ea11d
working version
zucchini-nlp Aug 15, 2024
380e99a
fix copies
zucchini-nlp Aug 15, 2024
2c82735
update
zucchini-nlp Aug 15, 2024
1f854cc
tests
zucchini-nlp Aug 15, 2024
4c41164
update docs
zucchini-nlp Aug 15, 2024
7926afd
codestyle
zucchini-nlp Aug 15, 2024
ed4c3ad
Merge branch 'huggingface:main' into llava-onevision
zucchini-nlp Aug 15, 2024
e6ca32c
add more tests
zucchini-nlp Aug 15, 2024
b734893
add returns for docs
zucchini-nlp Aug 15, 2024
c660105
clean up
zucchini-nlp Aug 16, 2024
f85aecf
Update src/transformers/models/llava_onevision/processing_llava_onevi…
zucchini-nlp Aug 19, 2024
4829831
updates
zucchini-nlp Aug 19, 2024
1254c13
codestyle
zucchini-nlp Aug 19, 2024
fdbd460
Merge branch 'main' into llava-onevision
zucchini-nlp Aug 19, 2024
3ecaa0d
style
zucchini-nlp Aug 19, 2024
6ac443e
shouldn't be reversed
zucchini-nlp Aug 21, 2024
6025390
[run-slow] llava_onevision
zucchini-nlp Aug 21, 2024
d7789f1
Merge remote-tracking branch 'upstream/main' into llava-onevision
zucchini-nlp Aug 21, 2024
9c44c23
[run-slow] llava_onevision
zucchini-nlp Aug 21, 2024
3dc34bf
Merge branch 'huggingface:main' into llava-onevision
zucchini-nlp Aug 22, 2024
90ff94d
add pooling in videos
zucchini-nlp Aug 30, 2024
1b99e48
Merge remote-tracking branch 'upstream/main' into llava-onevision
zucchini-nlp Aug 30, 2024
c5ccad1
[run-slow] llava_onevision
zucchini-nlp Aug 30, 2024
73f100e
num-logits-to-keep
zucchini-nlp Aug 30, 2024
44352f9
[run-slow] llava_onevision
zucchini-nlp Aug 30, 2024
ae18fc8
[run-slow] llava_onevision
zucchini-nlp Aug 30, 2024
2a69a9a
Update tests/test_modeling_common.py
zucchini-nlp Aug 30, 2024
5322d6f
video matched orig impl
zucchini-nlp Sep 2, 2024
b27c4f3
Merge remote-tracking branch 'upstream/main' into llava-onevision
zucchini-nlp Sep 2, 2024
ecd6743
fix tests
zucchini-nlp Sep 2, 2024
278eb86
chat template was modified
zucchini-nlp Sep 2, 2024
4ce02ed
Update docs/source/en/model_doc/llava_onevision.md
zucchini-nlp Sep 4, 2024
7c5ae0d
add morer info in the doc page
zucchini-nlp Sep 4, 2024
1fcb179
Merge branch 'main' into llava-onevision
zucchini-nlp Sep 4, 2024
codestyle
zucchini-nlp committed Aug 19, 2024
commit 1254c135950a35d528758e1c5fd0bd11d82db04f
@@ -55,12 +55,12 @@ class LlavaOnevisionProcessor(ProcessorMixin):
 [`~LlavaOnevisionVideoProcessor.__call__`], [`~LlavaNextProcessor.__call__`] and [`~LlavaNextProcessor.decode`] for more information.

 Args:
-video_processor ([`LlavaOnevisionVideoProcessor`], *optional*):
-The video processor is a required input.
 image_processor ([`LlavaNextImageProcessor`], *optional*):
 The image processor is a required input.
 tokenizer ([`LlamaTokenizerFast`], *optional*):
 The tokenizer is a required input.
+video_processor ([`LlavaOnevisionVideoProcessor`], *optional*):
+The video processor is a required input.
 num_image_tokens (`int`, *optional*):
 Number of image tokens for one imagethat will be returned by vision tower.
 vision_feature_select_strategy (`str`, *optional*):