Use SFTConfig instead of SFTTrainer keyword args #2150

Merged: 5 commits from update_trl into main on Oct 15, 2024

Conversation

qgallouedec (Member)

SFTTrainer's keyword args like packing, dataset_kwargs, dataset_text_field, and max_seq_length have been deprecated and will soon be removed. Instead, we use SFTConfig (a subclass of TrainingArguments).

This PR updates the code related to SFTTrainer accordingly.
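For illustration, a minimal before/after sketch of the migration (the model and dataset names are placeholders, not taken from this PR):

from datasets import load_dataset
from trl import SFTConfig, SFTTrainer

train_dataset = load_dataset("stanfordnlp/imdb", split="train")  # placeholder dataset

# Deprecated: training options passed directly to SFTTrainer
# trainer = SFTTrainer(
#     model="facebook/opt-350m",
#     train_dataset=train_dataset,
#     packing=True,
#     dataset_text_field="text",
#     max_seq_length=512,
# )

# Now: the same options live on SFTConfig, which SFTTrainer receives via `args`
training_args = SFTConfig(
    output_dir="tmp",
    packing=True,
    dataset_text_field="text",
    max_seq_length=512,
)
trainer = SFTTrainer(
    model="facebook/opt-350m",
    args=training_args,
    train_dataset=train_dataset,
)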

@BenjaminBossan (Member)

Thanks for addressing these trl deprecations. For my understanding, some arguments have been dropped without adding them to SFTConfig. Is that because for those, the default values were used?

@qgallouedec (Member, Author)

Which ones?

Comment on lines -122 to -128
packing=data_args.packing,
dataset_kwargs={
"append_concat_token": data_args.append_concat_token,
"add_special_tokens": data_args.add_special_tokens,
},
dataset_text_field=data_args.dataset_text_field,
max_seq_length=data_args.max_seq_length,
@BenjaminBossan (Member)

Here would be an example where arguments are removed from SFTTrainer but no equivalent arguments were added to training_args.

@qgallouedec (Member, Author)

Ok, I see what you mean.

When you run the script with, for example, --max_seq_length 123, the value is no longer parsed into data_args but into training_args. Everything happens behind the scenes when the arguments are parsed.

python example.py --output_dir tmp --max_seq_length 123

Before:

from dataclasses import dataclass
from typing import Optional
from transformers import HfArgumentParser, TrainingArguments

@dataclass
class DataTrainingArguments:
    dataset_name: Optional[str] = None
    max_seq_length: int = 512

if __name__ == "__main__":
    parser = HfArgumentParser((DataTrainingArguments, TrainingArguments))
    data_args, training_args = parser.parse_args_into_dataclasses()
    print(data_args.max_seq_length)  # 123

After:

from dataclasses import dataclass
from typing import Optional
from transformers import HfArgumentParser
from trl import SFTConfig

@dataclass
class DataTrainingArguments:
    dataset_name: Optional[str] = None

if __name__ == "__main__":
    parser = HfArgumentParser((DataTrainingArguments, SFTConfig))
    data_args, training_args = parser.parse_args_into_dataclasses()
    print(training_args.max_seq_length)  # 123
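The other removed keyword args follow the same path. As a sketch, assuming the SFTConfig fields keep the same names as the old keyword args (the values below are illustrative, not the script's defaults):

from trl import SFTConfig

training_args = SFTConfig(
    output_dir="tmp",
    packing=True,
    dataset_text_field="text",
    max_seq_length=123,
    dataset_kwargs={
        "append_concat_token": False,  # illustrative value
        "add_special_tokens": False,   # illustrative value
    },
)

The trainer then only needs args=training_args and no longer takes these keyword args directly.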

@BenjaminBossan (Member) left a comment

Thanks for the PR and explaining the change. I tried the updated script and it worked. LGTM.

BenjaminBossan merged commit 93ddb10 into main on Oct 15, 2024; 15 checks passed.
qgallouedec deleted the update_trl branch on October 15, 2024 at 09:28.
yaswanth19 pushed a commit to yaswanth19/peft that referenced this pull request Oct 20, 2024
…#2150)

Update training script using trl to fix deprecations in argument usage.
BenjaminBossan pushed a commit to BenjaminBossan/peft that referenced this pull request Oct 22, 2024
…#2150)

Update training script using trl to fix deprecations in argument usage.