
[Feature]: Support sentence-transformers configuration files #9388

Closed · maxdebayser opened this issue Oct 15, 2024 · 0 comments · Fixed by #9506

🚀 The feature, motivation and pitch

Support for sentence-transformers models is currently being added in PRs such as #9056. However, these models ship with extra configuration files for settings such as the pooling method and whether the embeddings have to be normalized.

There is a modules.json file, such as [this one](https://huggingface.co/sentence-transformers/all-MiniLM-L6-v2/blob/main/modules.json), containing the configuration of the layers:

[
  {
    "idx": 0,
    "name": "0",
    "path": "",
    "type": "sentence_transformers.models.Transformer"
  },
  {
    "idx": 1,
    "name": "1",
    "path": "1_Pooling",
    "type": "sentence_transformers.models.Pooling"
  },
  {
    "idx": 2,
    "name": "2",
    "path": "2_Normalize",
    "type": "sentence_transformers.models.Normalize"
  }
]
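
As an illustration, and assuming a hypothetical helper (this is not existing vLLM code), a loader could read modules.json roughly like this to discover which modules a checkpoint declares:

```python
# Illustrative sketch only: discover which sentence-transformers modules a
# checkpoint declares by reading modules.json (helper name is hypothetical).
import json
from pathlib import Path


def read_st_modules(model_dir: str) -> list[dict]:
    """Return the module list from modules.json, or [] if the file is absent."""
    modules_file = Path(model_dir) / "modules.json"
    if not modules_file.is_file():
        return []
    return json.loads(modules_file.read_text())


modules = read_st_modules("/path/to/all-MiniLM-L6-v2")
# e.g. decide whether embeddings should be L2-normalized after pooling
needs_normalize = any(
    m.get("type") == "sentence_transformers.models.Normalize" for m in modules
)
```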

The path field refers to a directory that can be empty or non-existent when default parameters are used. For the model above, for example, 1_Pooling contains a config.json file with the following contents:

{
  "word_embedding_dimension": 384,
  "pooling_mode_cls_token": false,
  "pooling_mode_mean_tokens": true,
  "pooling_mode_max_tokens": false,
  "pooling_mode_mean_sqrt_len_tokens": false
}

Currently the BERT and RoBERTa model implementations hard-code the most common values of these parameters, but to properly support sentence-transformers models we need to load the settings from these files.
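
A minimal sketch of what loading the pooling settings could look like, assuming the files are available locally; the function name, mapping, and fallback default are illustrative, not vLLM's actual API:

```python
# Illustrative sketch only: map a Pooling module's config.json flags to a
# pooling mode instead of hard-coding it (names and default are hypothetical).
import json
from pathlib import Path


def read_pooling_mode(model_dir: str, pooling_path: str = "1_Pooling") -> str:
    cfg_file = Path(model_dir) / pooling_path / "config.json"
    if not cfg_file.is_file():
        return "cls"  # assumed fallback when no pooling config is shipped
    cfg = json.loads(cfg_file.read_text())
    if cfg.get("pooling_mode_mean_tokens"):
        return "mean"
    if cfg.get("pooling_mode_max_tokens"):
        return "max"
    return "cls"


# For all-MiniLM-L6-v2 this would return "mean", matching the config shown above.
print(read_pooling_mode("/path/to/all-MiniLM-L6-v2"))
```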

cc: @robertgshaw2-neuralmagic @flaviabeo

Alternatives

No response

Additional context

No response

Before submitting a new issue...

  • Make sure you already searched for relevant issues, and asked the chatbot living at the bottom right corner of the documentation page, which can answer lots of frequently asked questions.