
Incorrect padding_side Setting as 'left' in Llama Family Model #25022

Closed

voidful opened this issue Jul 23, 2023 · 5 comments

Comments

@voidful (Contributor) commented Jul 23, 2023

System Info

  • transformers version: 4.30.2
  • Platform: Linux-5.15.0-1041-azure-x86_64-with-glibc2.29
  • Python version: 3.8.10
  • Huggingface_hub version: 0.16.2
  • Safetensors version: 0.3.1

Who can help?

text models: @ArthurZucker and @younesbelkada
generate: @gante

Information

  • The official example scripts
  • My own modified scripts

Tasks

  • An officially supported task in the examples folder (such as GLUE/SQuAD, ...)
  • My own task or dataset (give details below)

Reproduction

When using a Llama-family model for batch generation, an issue arises from the lack of a padding token: the original model uses pad_id = -1, implying that no padding token exists, which does not work for batch generation in transformers.

Here is our proposed solution:

Firstly, a padding token should be added with tokenizer.add_special_tokens({"pad_token": "<pad>"}), after which the token embeddings must be resized accordingly. It is also essential to set model.config.pad_token_id. The model's embed_tokens layer is initialized with self.embed_tokens = nn.Embedding(config.vocab_size, config.hidden_size, self.config.padding_idx), which ensures that encoding the padding token outputs zeros; therefore, passing the padding index during initialization is recommended.
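
A minimal sketch of these steps, assuming a Llama-2 checkpoint and a `<pad>` token string (both illustrative choices, not prescribed by this issue):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-2-7b-hf"  # illustrative Llama-family checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# 1. Add a dedicated padding token (the string "<pad>" is an assumption here).
tokenizer.add_special_tokens({"pad_token": "<pad>"})

# 2. Resize the token embeddings so the new token gets a row in embed_tokens.
model.resize_token_embeddings(len(tokenizer))

# 3. Tell the model config which id is the padding token.
model.config.pad_token_id = tokenizer.pad_token_id
```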

Expected behavior

Another important aspect is setting padding_side to 'right', so that padding is applied in the correct direction.
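
Continuing the sketch above, the requested padding side can be set directly on the tokenizer before batch-encoding (the later comments in this thread discuss why generation itself expects left padding):

```python
# Pad on the right so padding tokens come after the real input tokens.
tokenizer.padding_side = "right"

batch = tokenizer(
    ["Hello world", "A somewhat longer example sentence"],
    padding=True,
    return_tensors="pt",
)
print(batch["attention_mask"])  # trailing zeros mark the right-side padding
```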

@ArthurZucker (Collaborator)

Hey! Indeed, as written in the documentation, a padding token is required. It seems that by default the padding side is set to left. We cannot update the tokenization file (for backward-compatibility reasons), but we can update the tokenizers online to make sure they use padding_side = right by default.

@voidful (Contributor, Author) commented Jul 24, 2023

> Hey! Indeed, as written in the documentation, a padding token is required. It seems that by default the padding side is set to left. We cannot update the tokenization file (for backward-compatibility reasons), but we can update the tokenizers online to make sure they use padding_side = right by default.

Great, it would be nice to update the default padding_side of those models.

@anmolagarwal999

There does not seem to be any documentation regarding what the correct padding_side should be for the CodeLlama family. Is there a way to find this out? @ArthurZucker I also opened a related issue here.

@ArthurZucker (Collaborator) commented Sep 21, 2023

CodeLlama is part of the Llama family, so the same padding side applies. I answered on your issue 🤗

@ScottLiao920 commented Nov 21, 2024

Hi there, just curious about the default setting of padding_side. If I understand this correctly, tokenizers for decoder-only LLMs should normally have padding_side='right', meaning the padding tokens appear after the actual input text tokens. However, I recently got this warning:
A decoder-only architecture is being used, but right-padding was detected! For correct generation results, please set `padding_side='left'` when initializing the tokenizer.
I am running transformers version 4.46.2; here's a test example using llama-3.1-8B-instruct. It seems left is the "right" side to go.
[Screenshot of the test example, 2024-11-21]
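
A rough reconstruction of that kind of test (the exact checkpoint name and the choice of reusing EOS as the pad token are assumptions, not taken from the screenshot):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "meta-llama/Llama-3.1-8B-Instruct"  # assumed checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_name, padding_side="left")
tokenizer.pad_token = tokenizer.eos_token  # reuse EOS as pad if none is defined
model = AutoModelForCausalLM.from_pretrained(
    model_name, torch_dtype=torch.bfloat16, device_map="auto"
)

prompts = ["The capital of France is", "Write a haiku about padding:"]
inputs = tokenizer(prompts, padding=True, return_tensors="pt").to(model.device)

# With left padding, every prompt ends at the same position, so generation
# continues from the last real token of each prompt rather than from padding.
outputs = model.generate(**inputs, max_new_tokens=20, pad_token_id=tokenizer.pad_token_id)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```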
