Deprecate legacy cache + use cache position #31491
Conversation
🔥 super nice! Thanks for porting this and adding the deprecation
I think in the future we will want to use generation_config.cache_config.max_length instead of calling get_max_length(), but anyway, super good porting!
Co-authored-by: Arthur <[email protected]>
Yes, sounds good if we can start adopting cache config for all cache related arguments
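A minimal sketch of the idea discussed above: prefer an explicit value on the generation config's cache config and fall back to the cache's own getter. The class and function names here (CacheConfig, GenerationConfig, resolve_max_cache_length) are illustrative stand-ins, not the real transformers API.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CacheConfig:
    # Hypothetical stand-in for a cache config carrying a max length.
    max_length: Optional[int] = None


@dataclass
class GenerationConfig:
    # Hypothetical stand-in; real GenerationConfig has many more fields.
    cache_config: Optional[CacheConfig] = None


def resolve_max_cache_length(generation_config, cache):
    """Prefer the explicit config value; fall back to cache.get_max_length()."""
    cfg = generation_config.cache_config
    if cfg is not None and cfg.max_length is not None:
        return cfg.max_length
    return cache.get_max_length()
```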
💛
* tmp
* update models
* revert utils
* delete
* Update src/transformers/models/dbrx/modeling_dbrx.py
Co-authored-by: Arthur <[email protected]>
* modify warning msg
---------
Co-authored-by: Arthur <[email protected]>
What does this PR do?
This PR deprecates the legacy cache format in all models that currently support the Cache class. These models now also rely on cache positions and _update_causal_mask to build the 4D attention mask. Tests are passing on my end; I will trigger the slow tests with a commit message on this PR later.