add initial design for uniform processors + align model #31197
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
cc @amyeroberts for a review with a narrower scope than the parent PR 😁
cc @amyeroberts, sorry to divide my pings 😅 The PR is big and I wanted to split it up; this one should be mergeable and serve as the basis for the rest, and I'll rebase afterwards (@qubvel welcome if you want to take a look!). It includes the kwargs merging just mentioned in the other PR, moved to processing common!
Thanks!
Same comment as for the other PR - it would be good to move the kwarg prep logic out of the config and tests, to make sure we can properly control tokenizer kwargs with tokenizer.init_kwargs and the input kwargs.
cc @qubvel
Nice to see _merge_kwargs as a separate method, this is exactly what came to my mind while reviewing the first PR 🙂
a lot cleaner!
(The review thread below is on the _defaults = { line of the processor kwargs class.)
```
_defaults = {
    "padding": "max_length",
    "max_length": 64,
}
```
should work, no? Or does it not update the default for type hints?
yes it works for sure, this was to have a structured dict for defaults. Can change :)
ah, now I remember, it actually can't work like that since TypedDicts don't support default values; they are made to hold the layout. They can have arbitrary attributes, but those won't be passed on as default values the way a dataclass would (and with a dataclass we'd lose the typing), hence the manual operation
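For illustration, a minimal sketch of that point, with hypothetical class names (TextKwargs and MyProcessorKwargs are not the exact classes from this PR):

```python
from typing import TypedDict

# A TypedDict only declares the layout of the dict; instances do not pick
# up class-level attributes as default values.
class TextKwargs(TypedDict, total=False):
    padding: str
    max_length: int

kwargs: TextKwargs = {}
print(kwargs)  # {} -- no defaults were applied for us

# Hence defaults live in a plain class-level dict and are merged manually,
# which keeps the typing of TextKwargs intact.
class MyProcessorKwargs:
    _defaults = {
        "text_kwargs": {"padding": "max_length", "max_length": 64},
    }

print(MyProcessorKwargs._defaults["text_kwargs"])
# {'padding': 'max_length', 'max_length': 64}
```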
ok got it, thanks! Let's maybe add a comment about this!
Do we have a comment for future code inspectors? I'm assuming here isn't the best place (we don't want it for all models), but I didn't find a corresponding one elsewhere on a quick skim.
On that: there's documentation in processing_utils.ProcessingKwargs; I added a comment nudging users to check there!
I have updated the PR description to be more self-contained.
Looks great!
Just a few small comments
@amyeroberts FYI: kept digging into the kwargs merging logic and found an edge case that was giving unreliable results in the tokenizer. Refactored and doubled the number of tests to avoid further trickery (including an edge case found earlier by @qubvel); logic should be easier to read now. Nothing else changed, and tests should pass reliably.
add initial design for uniform processors + align model (#31197)

* add initial design for uniform processors + align model
* fix mutable default 👀
* add configuration test
* handle structured kwargs w defaults + add test
* protect torch-specific test
* fix style
* fix
* fix assertEqual
* move kwargs merging to processing common
* rework kwargs for type hinting
* just get Unpack from extensions
* run-slow[align]
* handle kwargs passed as nested dict
* add from_pretrained test for nested kwargs handling
* [run-slow]align
* update documentation + imports
* update audio inputs
* protect audio types, silly
* try removing imports
* make things simpler
* simplerer
* move out kwargs test to common mixin
* [run-slow]align
* skip tests for old processors
* [run-slow]align, clip
* !$#@!! protect imports, darn it
* [run-slow]align, clip
* [run-slow]align, clip
* update doc
* improve documentation for default values
* add model_max_length testing. This parameter depends on tokenizers received.
* Raise if kwargs are specified in two places
* fix
* expand VideoInput
* fix
* fix style
* remove defaults values
* add comment to indicate documentation on adding kwargs
* protect imports
* [run-slow]align
* fix
* remove set() that breaks ordering
* test more
* removed unused func
* [run-slow]align
What does this PR do?
Adds a uniform signature for processors. This PR adds the initial design + one model for the larger #30511.
Usage
As before, kwargs that are passed to processors at __call__ time take priority. However, per-modality processors can be instantiated with their own kwargs, and if those are not overridden at call time, they will serve as defaults. Type hinting of kwargs is preserved if they are passed as structured dictionary entries, for example:
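A hedged sketch of the structured call (the checkpoint name and the exact nested-kwargs argument names are assumptions for illustration, not verbatim from this PR):

```python
import numpy as np
from PIL import Image
from transformers import AlignProcessor

processor = AlignProcessor.from_pretrained("kakaobrain/align-base")
image = Image.fromarray(np.zeros((224, 224, 3), dtype=np.uint8))

# Structured (nested) kwargs: each modality gets its own typed bucket, so
# the type hints on padding / max_length are preserved.
outputs = processor(
    text="a photo of a cat",
    images=image,
    text_kwargs={"padding": "max_length", "max_length": 64},
    images_kwargs={"do_rescale": False},
)
```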
It also works with kwargs passed without nesting:
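A sketch under the same assumptions as above; the merging logic routes each flat kwarg to the right modality bucket:

```python
# Flat kwargs: the same options without nesting.
outputs = processor(
    text="a photo of a cat",
    images=image,
    padding="max_length",
    max_length=64,
    do_rescale=False,
)
```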
Merging of kwargs and handling of the priority order is done in processing_utils through a dedicated method. The order of operations is as follows: kwargs passed at __call__ time have the highest priority, then kwargs set when the per-modality processors were instantiated (e.g. tokenizer.init_kwargs), and finally the _defaults declared on the processor's kwargs class, which have the lowest priority. A sketch of this merging is shown below.
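A minimal sketch of such a merge, with a hypothetical function name (merge_kwargs is illustrative, not the real method; the priority order is the one inferred from the description above):

```python
def merge_kwargs(defaults, init_kwargs, call_kwargs):
    """Merge per-modality kwargs; later sources override earlier ones."""
    merged = {}
    for source in (defaults, init_kwargs, call_kwargs):  # lowest -> highest priority
        for modality, kwargs in source.items():
            merged.setdefault(modality, {}).update(kwargs)
    return merged

merged = merge_kwargs(
    defaults={"text_kwargs": {"padding": "max_length", "max_length": 64}},
    init_kwargs={"text_kwargs": {"max_length": 128}},     # set at init time
    call_kwargs={"text_kwargs": {"padding": "longest"}},  # passed at __call__ time
)
print(merged)  # {'text_kwargs': {'padding': 'longest', 'max_length': 128}}
```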
Missing: