TEI embedding error with Milvus data-prep file upload #564

Closed
CharleneHu-42 opened this issue Aug 26, 2024 · 1 comment

@CharleneHu-42

After starting dataprep-milvus-server with the TEI embedding service, uploading a file to the Milvus database fails. The upload command was:

curl -X POST "http://${host_ip}:6010/v1/dataprep" \
     -H "Content-Type: multipart/form-data" \
     -F "files=@./nke-10k-2023.pdf"

The TEI embedding service throws this error:

2024-08-26T07:43:48.857102Z ERROR embed: text_embeddings_router::http::server: router/src/http/server.rs:525: batch size 270 > maximum allowed batch size 32
@CharleneHu-42 CharleneHu-42 changed the title TEI embedding error with Milvus data-prep TEI embedding error with Milvus data-prep file upload Aug 26, 2024
@letonghan letonghan self-assigned this Aug 27, 2024
@preethivenkatesh preethivenkatesh added the DEV features label Aug 28, 2024
@letonghan
Collaborator

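This is caused by the batch size limit of the TEI service: the log shows a single embedding request with a batch of 270 inputs, while the maximum allowed batch size is 32.
Fixed by PR #570. Closing this issue.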
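
For anyone hitting the same error, here is a minimal sketch of client-side batching that keeps each embedding request within the limit. It is not taken from PR #570, whose actual change is not shown in this thread. `embed_fn` is a hypothetical stand-in for whatever function sends a list of texts to the embedding service, and the value 32 is simply the limit reported in the log above.

```python
from typing import Callable, List, Sequence

# Limit reported in the TEI error log; adjust if the service is configured differently.
MAX_BATCH_SIZE = 32


def split_into_batches(
    items: Sequence[str], batch_size: int = MAX_BATCH_SIZE
) -> List[List[str]]:
    """Split items into consecutive batches of at most batch_size elements."""
    if batch_size < 1:
        raise ValueError("batch_size must be at least 1")
    return [list(items[i:i + batch_size]) for i in range(0, len(items), batch_size)]


def embed_in_batches(
    texts: Sequence[str],
    embed_fn: Callable[[List[str]], List[List[float]]],  # hypothetical embedding call
    batch_size: int = MAX_BATCH_SIZE,
) -> List[List[float]]:
    """Call embed_fn once per batch and return the embeddings in input order."""
    embeddings: List[List[float]] = []
    for batch in split_into_batches(texts, batch_size):
        embeddings.extend(embed_fn(batch))
    return embeddings
```

With 270 text chunks, as in the log, this would issue nine requests: eight of 32 inputs and one of 14.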

lkk12014402 pushed a commit that referenced this issue Sep 19, 2024
* Convert HTML to markdown format.

Signed-off-by: zepan <[email protected]>

* [pre-commit.ci] auto fixes from pre-commit.com hooks

for more information, see https://pre-commit.ci

---------

Signed-off-by: zepan <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>