/v1/inference/embeddings input and output shape mismatch #922

mattf · 2025-02-01T14:35:55Z

/v1/inference/embeddings: model x List[InterleavedContent] -> List[List[float]]

the shape mismatch comes from InterleavedContent allowing for List[InterleavedContentItem].

example: [string, [text0, text1], image] -?-> [embedding of string, embedding of text0, embedding of text1, embedding of image]
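a minimal sketch of the mismatch, assuming a provider flattens nested lists before embedding (the issue leaves the actual behavior open, hence the `-?->`). the `Item` and `Content` aliases and the dict-shaped items are hypothetical stand-ins for illustration, not the real llama-stack types.

```python
from typing import List, Union

# Hypothetical stand-ins: a str is untyped text, a dict is a typed text/image item.
Item = Union[str, dict]
# Mirrors InterleavedContent also accepting a list of items.
Content = Union[Item, List[Item]]


def count_embeddings_if_flattened(contents: List[Content]) -> int:
    """Count the vectors produced if every nested item is embedded on its own."""
    total = 0
    for content in contents:
        # A nested list contributes one embedding per inner item.
        total += len(content) if isinstance(content, list) else 1
    return total


contents: List[Content] = [
    "string",
    [{"type": "text", "text": "text0"}, {"type": "text", "text": "text1"}],
    {"type": "image", "url": "https://example.com/image.png"},  # placeholder URL
]

n_inputs = len(contents)
n_outputs = count_embeddings_if_flattened(contents)
print(n_inputs, n_outputs)
```

with three top-level inputs and four flattened items, the caller cannot map output index i back to input index i.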

i suggest aligning the shapes.

my preference is to change the input shape, and use an input of array of string | array of InterleavedContentItem, which keeps string (untyped) and text / image (typed) inputs separate.

a further enhancement: embedding is often done in two modes, batch and query. in batch mode many items are embedded for storage. in query mode a single item is embedded for lookup. allowing input of string | array of string | array of InterleavedContentItem facilitates this use case.
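a rough sketch of the proposed input shape, assuming hypothetical `TextItem` / `ImageItem` dataclasses in place of the real InterleavedContentItem types. `normalize_input` is an illustrative helper, not an existing llama-stack function; it shows how string | array of string | array of InterleavedContentItem could be reduced to a flat list with exactly one entry per output embedding.

```python
from dataclasses import dataclass
from typing import List, Union


@dataclass
class TextItem:  # hypothetical typed text item
    text: str


@dataclass
class ImageItem:  # hypothetical typed image item
    url: str


ContentItem = Union[TextItem, ImageItem]
EmbeddingInput = Union[str, List[str], List[ContentItem]]


def normalize_input(contents: EmbeddingInput) -> List[Union[str, ContentItem]]:
    """Return a flat list whose length equals the number of embeddings to produce."""
    if isinstance(contents, str):
        # Query mode: a single string yields a single embedding.
        return [contents]
    if not isinstance(contents, list):
        raise TypeError("contents must be a string or a list")
    if all(isinstance(c, str) for c in contents):
        # Batch mode over untyped strings.
        return list(contents)
    if all(isinstance(c, (TextItem, ImageItem)) for c in contents):
        # Batch mode over typed items.
        return list(contents)
    # Mixed string/typed lists and nested lists are rejected.
    raise TypeError("contents must be all strings or all typed items, with no nesting")
```

keeping the untyped and typed lists separate means the length of the normalized list always matches the length of the returned `List[List[float]]`.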


mattf · 2025-02-01T14:36:12Z

cc @raghotham @ashwinb @yanxi0830

ashwinb · 2025-02-03T14:14:33Z

Good spot. I definitely agree with at least changing it to List[InterleavedContentItem] immediately; otherwise the contract is broken.

#1161)

See Issue #922

The change is slightly backwards incompatible, but no callsite (in our client codebases or stack-apps) ever passes a depth-2 `List[List[InterleavedContentItem]]` (which is now disallowed).

## Test Plan

```bash
$ cd llama_stack/providers/tests/inference
$ pytest -s -v -k fireworks test_embeddings.py \
   --inference-model nomic-ai/nomic-embed-text-v1.5 --env EMBEDDING_DIMENSION=784
$ pytest -s -v -k together test_embeddings.py \
   --inference-model togethercomputer/m2-bert-80M-8k-retrieval --env EMBEDDING_DIMENSION=784
$ pytest -s -v -k ollama test_embeddings.py \
   --inference-model all-minilm:latest --env EMBEDDING_DIMENSION=784
```

Also ran `tests/client-sdk/inference/test_embeddings.py`

hardikjshah added this to the v0.1.4 milestone Feb 12, 2025

hardikjshah assigned ashwinb Feb 14, 2025

ashwinb linked a pull request Feb 20, 2025 that will close this issue

fix(api): update embeddings signature so inputs and outputs list align #1161

Merged

ashwinb mentioned this issue Feb 20, 2025

fix(api): update embeddings signature so inputs and outputs list align #1161

Merged

ashwinb closed this as completed in #1161 Feb 21, 2025
