Fix `attn_implementation` documentation #29295

fxmarty · 2024-02-26T11:37:06Z

As reported in #26572 (comment), attn_implementation is wrongfully documented under PretrainedConfig, and is not under AutoModel.from_config & PreTrainedModel.from_pretrained as it should be.

HuggingFaceDocBuilderDev · 2024-02-26T11:59:49Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

amyeroberts · 2024-02-26T12:25:02Z

@fxmarty I don't think this is quite right - if someone can pass in

model = AutoModel.from_config(config, attn_implementation="foo")

they should also be able to pass it in for the config constructor and then pass that to the model

config = ModelConfig(attn_implementation="foo")
model = AutoModel.from_config(config)

fxmarty · 2024-02-26T12:41:30Z

@amyeroberts This is not what was suggested in #26572 (comment) by @patrickvonplaten.

Happy to change that, but I think it should be in an other PR. This PR simply reflects in the documentation the current expected usage by users:

model = AutoModel.from_config(cfg, attn_implementation="eager")
model = LlamaModel.from_pretrained("xxx", attn_implementation="eager")

while the following is illegal/not supported:

from transformers import AutoModelForCausalLM, AutoConfig, LlamaForCausalLM

cfg = AutoConfig.from_pretrained("fxmarty/tiny-llama-fast-tokenizer")
cfg.attn_implementation = "eager"

model = LlamaForCausalLM(cfg)

they should also be able to pass it in for the config constructor

Ultimately I agree with you, PreTrainedModel.__init__ should in the future have a way to obey to a specified attn_implementation (currently not exposed to users, only config._attn_implementation = "eager" works).

amyeroberts · 2024-02-26T18:44:52Z

@fxmarty OK, thanks for explaining and linking to the relevant comment! If it's something that causes a lot of confusion we can circle back on enabling it through the config creation

amyeroberts

Thanks for following up on this and making the docstrings consistent!

fix

fix

1744742

fxmarty requested review from amyeroberts and ArthurZucker February 26, 2024 11:37

amyeroberts approved these changes Feb 26, 2024

View reviewed changes

ArthurZucker approved these changes Feb 27, 2024

View reviewed changes

fxmarty merged commit 6d3b643 into huggingface:main Feb 27, 2024
8 checks passed

itazap pushed a commit that referenced this pull request May 14, 2024

Fix attn_implementation documentation (#29295)

12b25a0

fix

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `attn_implementation` documentation #29295

Fix `attn_implementation` documentation #29295

fxmarty commented Feb 26, 2024

HuggingFaceDocBuilderDev commented Feb 26, 2024

amyeroberts commented Feb 26, 2024 •

edited

Loading

fxmarty commented Feb 26, 2024 •

edited

Loading

amyeroberts commented Feb 26, 2024

amyeroberts left a comment

Fix attn_implementation documentation #29295

Fix attn_implementation documentation #29295

Conversation

fxmarty commented Feb 26, 2024

HuggingFaceDocBuilderDev commented Feb 26, 2024

amyeroberts commented Feb 26, 2024 • edited Loading

fxmarty commented Feb 26, 2024 • edited Loading

amyeroberts commented Feb 26, 2024

amyeroberts left a comment

Choose a reason for hiding this comment

Fix `attn_implementation` documentation #29295

Fix `attn_implementation` documentation #29295

amyeroberts commented Feb 26, 2024 •

edited

Loading

fxmarty commented Feb 26, 2024 •

edited

Loading