
T5 compile compatibility #34089

Merged · 34 commits merged into huggingface:main on Oct 22, 2024
Conversation

zucchini-nlp (Member)

What does this PR do?

Same as #33754; I accidentally force-pushed to the wrong branch and the PR got closed 🙃
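For context, a minimal sketch of the usage this PR unlocks; the checkpoint, the compile settings, and the `cache_implementation="static"` entry point are my assumptions, not taken from the diff:

```python
# Sketch only: compile-compatible generation with a T5 model.
import torch
from transformers import AutoTokenizer, T5ForConditionalGeneration

tokenizer = AutoTokenizer.from_pretrained("google-t5/t5-small")
model = T5ForConditionalGeneration.from_pretrained("google-t5/t5-small")

inputs = tokenizer("translate English to German: The house is wonderful.", return_tensors="pt")

# A static cache keeps tensor shapes fixed across decoding steps, which is what
# lets torch.compile capture the decoder without recompiling at every step.
model.forward = torch.compile(model.forward, mode="reduce-overhead", fullgraph=True)
out = model.generate(**inputs, cache_implementation="static", max_new_tokens=20)
print(tokenizer.decode(out[0], skip_special_tokens=True))
```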

@gante (Member) left a comment:

Thank you for this refactor 🙏

I've left a question for us to double-check, but approving since it looks good to me.


⚠️ Before merging, make sure to run the following slow tests to confirm there are no regressions vs main: {all touched models} + {llama, whisper, caches}
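A hypothetical way to kick those off from Python; the `RUN_SLOW` environment variable is how transformers gates its slow tests, but the exact test file paths below are assumptions about the repo layout:

```python
# Hypothetical invocation of the requested slow suites.
import os

os.environ["RUN_SLOW"] = "yes"  # must be set before the tests import transformers.testing_utils

import pytest

pytest.main([
    "tests/models/t5/test_modeling_t5.py",          # touched model (same idea for longt5, mt5, ...)
    "tests/models/llama/test_modeling_llama.py",
    "tests/models/whisper/test_modeling_whisper.py",
    "tests/utils/test_cache_utils.py",              # caches
])
```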

Comment on lines +1644 to +1645
"Liana Barrientos has been married 10 times, nine of them in the Bronx . Her husbands filed for "
"permanent residence after the marriages, prosecutors say ."

Is this a new failure?

If yes: can you a) confirm that the generated tokens are different, and b) compare the logprobs of the differing tokens before/after the change, to see whether it can be explained by tiny fluctuations?

If not: can you add a comment with the PR that causes the difference? 🙏
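A sketch of the logprob check being requested, assuming `model` and `inputs` are set up as in the integration test:

```python
# Inspect per-step logprobs of the generated tokens, so a before/after diff can
# be attributed (or not) to tiny numerical fluctuations.
import torch

out = model.generate(
    **inputs,
    max_new_tokens=50,
    return_dict_in_generate=True,
    output_scores=True,
)
# out.scores holds one (batch, vocab) logits tensor per generated step; for an
# encoder-decoder model, out.sequences starts with the decoder start token.
gen_tokens = out.sequences[0, -len(out.scores):]
for step, (scores, tok) in enumerate(zip(out.scores, gen_tokens)):
    logprobs = torch.log_softmax(scores, dim=-1)
    print(step, tok.item(), logprobs[0, tok].item())
```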

@zucchini-nlp (Member, Author) replied on Oct 18, 2024:

Added a comment. The regression was caused by changes in tokenization, so nothing breaks in terms of modeling or generation. I'm not sure whether the tokenization fix was intended to change T5's output (i.e. it actually fixes T5) or whether we need another PR to fix the tokenization back, so I'll leave a comment for future us.
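As an illustration of the kind of decode-time difference being described (whether this particular cleanup step is the culprit is an assumption on my part; the comment above only attributes the diff to tokenization changes):

```python
# One tokenization-level knob that changes expected strings like the ones
# quoted above: the decode-time cleanup that collapses spaces before periods.
from transformers import AutoTokenizer

tok = AutoTokenizer.from_pretrained("google-t5/t5-small")
raw = "nine of them in the Bronx . Her husbands filed for permanent residence ."
print(tok.clean_up_tokenization(raw))
# -> "nine of them in the Bronx. Her husbands filed for permanent residence."
```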

@ArthurZucker (Collaborator) left a comment:

NICE! 🔥 thanks for adding the compile integration tests 🤗

@zucchini-nlp (Member, Author):

Will merge this later today if we're all okay with it. The tokenization comment is added in the code with a TODO for us; in case it's not a bug, we can remove the comment later.

@zucchini-nlp zucchini-nlp merged commit 73d65e6 into huggingface:main Oct 22, 2024
24 of 26 checks passed
@IlyasMoutawwakil mentioned this pull request on Oct 24, 2024
BernardZach pushed a commit to BernardZach/transformers that referenced this pull request on Dec 5, 2024:

* this worked in normal generation, needs more tests
* fix almost all tests in t5
* nit
* longt5, umt5, mt5
* style
* udop, pix2struct
* more models
* fix some tests
* fix onnx tests
* tracing tests fixed
* compile enabled and tested for t5 models
* fix small bug in slow tests
* [run-slow] t5
* uncomment
* style
* update with new generation refactoring
* nit
* fix copies
* this is the fix, had to change t5 to fix copies
* update
* [run-slow] t5
* [run-slow] t5
* update
* add test for encoder only T5
* clean up after rebase
* fix pop2piano
* add comment
* style
* fix copies after rebase
* fix copies, missed this one