Hi, we're trying out building and running inference for BERT models (specifically, `BertForSequenceClassification`). Our request batches contain samples whose `seq_len` can vary widely, and according to the documentation, the `--remove_input_padding` option, which packs samples into a 1D tensor without padding, should be beneficial for performance.
However, we couldn't find this parameter in `examples/bert/build.py`, and the engine built with that script seems to have `"remove_input_padding": False` in its `config.json`. We also didn't see any implementation of it in `tensorrt_llm/models/bert/model.py`, whereas the enc-dec model does have one. Is there a plan to support this feature for BERT models, or are we missing something?
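For context, here is a minimal plain-PyTorch sketch (not TensorRT-LLM code; the tensor shapes are hypothetical) of what "packing samples into a 1D tensor without padding" means:

```python
import torch

# Three samples of very different lengths.
seqs = [torch.arange(2), torch.arange(7), torch.arange(3)]

# Padded layout: [batch, max_seq_len] -- 21 slots for only 12 real tokens,
# so compute on the 9 pad positions is wasted.
padded = torch.nn.utils.rnn.pad_sequence(seqs, batch_first=True)  # shape [3, 7]

# Packed layout (the idea behind remove_input_padding): one 1D tensor of the
# real tokens, plus a per-sample length tensor to recover the boundaries.
packed = torch.cat(seqs)                              # shape [12]
input_lengths = torch.tensor([len(s) for s in seqs])  # shape [3]
```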
We made our own implementation by referring to the enc_dec models. For anyone interested, the things you need to modify are basically: (1) the model inputs, following the enc_dec implementation; (2) `plugin_config`, to set `remove_input_padding` to true; (3) the forward of the final pooler layer in `BertForSequenceClassification`, so that it selects the first token of each sample according to `input_lengths` (a minimal sketch of this indexing follows below). Closing this.
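To illustrate step (3): with padding removed, the hidden states are a packed `[total_tokens, hidden_size]` tensor, so the pooler can no longer take `hidden_states[:, 0]`. A minimal PyTorch sketch of the indexing idea (the actual change in `tensorrt_llm/models/bert/model.py` would have to be expressed with TensorRT-LLM network ops instead):

```python
import torch

def select_first_tokens(hidden_states: torch.Tensor,
                        input_lengths: torch.Tensor) -> torch.Tensor:
    """Pick the first ([CLS]) token of each sample from a packed tensor.

    hidden_states: [total_tokens, hidden_size], samples concatenated
                   back-to-back (remove_input_padding layout).
    input_lengths: [batch], length of each sample in the packed tensor.
    """
    # Sample i starts at sum(input_lengths[:i]), i.e. an exclusive prefix sum.
    starts = torch.cumsum(input_lengths, dim=0) - input_lengths
    return hidden_states[starts]  # [batch, hidden_size]
```

For example, with `input_lengths = [2, 7, 3]` the start offsets are `[0, 2, 9]`, and the result feeds into the pooler's dense + activation as before.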
@QiJune hi, I made a PR here: #1834. Note that this is only implemented and tested for `BertForSequenceClassification` models; maybe the official team can take it further :)