Upgrade FastEmbed Version #493

NirantK · 2024-02-15T04:24:43Z

All Submissions:

Contributions should target the dev branch. Did you create your branch from dev?
Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?

New Feature Submissions:

Does your submission pass tests?
Have you installed pre-commit with pip3 install pre-commit and set up hooks with pre-commit install?

Changes to Core Features:

Have you added an explanation of what your changes do and why you'd like us to include them?
Have you written new tests for your core changes, as applicable?
Have you successfully ran tests with your changes locally?

netlify · 2024-02-15T04:24:47Z

✅ Deploy Preview for poetic-froyo-8baba7 ready!

Name	Link
🔨 Latest commit	`28881be`
🔍 Latest deploy log	https://app.netlify.com/sites/poetic-froyo-8baba7/deploys/65e778d0e802e10008677e55
😎 Deploy Preview	https://deploy-preview-493--poetic-froyo-8baba7.netlify.app
📱 Preview on mobile	Toggle QR Code... Use your smartphone camera to open QR code link.

To edit notification comments on pull requests, go to your Netlify site configuration.

joein · 2024-02-15T11:59:20Z

qdrant_client/qdrant_fastembed.py

    "BAAI/bge-small-en-v1.5": (384, models.Distance.COSINE),
    "BAAI/bge-base-en-v1.5": (768, models.Distance.COSINE),
    "intfloat/multilingual-e5-large": (1024, models.Distance.COSINE),
 }


 class QdrantFastembedMixin(QdrantBase):
-    DEFAULT_EMBEDDING_MODEL = "BAAI/bge-small-en"
+    DEFAULT_EMBEDDING_MODEL = "BAAI/bge-small-en-v1.5"


I don't think we can just silently replace the default model

We need to make at least one major release which warns that we are going to change the model

joein · 2024-02-15T12:01:48Z

qdrant_client/qdrant_fastembed.py


 SUPPORTED_EMBEDDING_MODELS: Dict[str, Tuple[int, models.Distance]] = {
-    "BAAI/bge-base-en": (768, models.Distance.COSINE),


we can't just silently remove models which could have already been used by users

joein · 2024-03-02T23:06:33Z

We need to fix our tests to run them without fastembed installed as well

joein · 2024-03-03T01:05:31Z

#522

…ate fastembed

generall · 2024-02-15T08:50:39Z

qdrant_client/async_qdrant_fastembed.py

    "BAAI/bge-small-en-v1.5": (384, models.Distance.COSINE),
    "BAAI/bge-base-en-v1.5": (768, models.Distance.COSINE),
    "intfloat/multilingual-e5-large": (1024, models.Distance.COSINE),
 }


 class AsyncQdrantFastembedMixin(AsyncQdrantBase):
-    DEFAULT_EMBEDDING_MODEL = "BAAI/bge-small-en"
-    embedding_models: Dict[str, "DefaultEmbedding"] = {}
+    DEFAULT_EMBEDDING_MODEL = "BAAI/bge-small-en-v1.5"


I don't feel so good about this change. We should deprecate things, not suddenly disable defaults

* Update fastembed to v0.2.1 * chore(qdrant_fastembed.py): update DEFAULT_EMBEDDING_MODEL * fix(fastembed integration): upgrade to latest version * Prefer black over ruff * Prefer black over ruff * Remove hardcoded directory structure from Qdrant Client checks * new: deprecate current default model, deprecate max token length, update fastembed * fix: make embedding_model_name method sync * fix: update poetry lock * refactor: use list_supported_models() (#501) * fix: fix fastembed check * fix: fix fastembed class var assignment * fix: remove fastembed deprecation from qdrant client (#524) --------- Co-authored-by: George Panchuk <[email protected]> Co-authored-by: Anush <[email protected]>

joein requested changes Feb 15, 2024

View reviewed changes

Anush008 mentioned this pull request Feb 21, 2024

refactor: use list_supported_models() from FastEmbed #501

Merged

2 tasks

joein force-pushed the nirant/upgrade-fastembed-version branch from 0803fb5 to 34f6037 Compare March 2, 2024 22:32

joein mentioned this pull request Mar 3, 2024

Tracking issue: local mode for Qdrant v1.8 #490

Closed

3 tasks

NirantK changed the title ~~Nirant/upgrade-fastembed-version~~ Upgrade FastEmbed Version Mar 4, 2024

NirantK and others added 12 commits March 4, 2024 16:39

Update fastembed to v0.2.1

1d4ad22

chore(qdrant_fastembed.py): update DEFAULT_EMBEDDING_MODEL

7c9913f

fix(fastembed integration): upgrade to latest version

c55fa46

Prefer black over ruff

269bb6c

Prefer black over ruff

4a73d91

Remove hardcoded directory structure from Qdrant Client checks

2c1a481

new: deprecate current default model, deprecate max token length, upd…

f966c9b

…ate fastembed

fix: make embedding_model_name method sync

ad73511

fix: update poetry lock

acdc38f

refactor: use list_supported_models() (#501)

d36603b

fix: fix fastembed check

dac7b0c

fix: fix fastembed class var assignment

9323c0a

joein force-pushed the nirant/upgrade-fastembed-version branch from 510efbc to 9323c0a Compare March 4, 2024 15:40

generall approved these changes Mar 5, 2024

View reviewed changes

generall requested a review from joein March 5, 2024 18:39

fix: remove fastembed deprecation from qdrant client (#524)

28881be

joein approved these changes Mar 5, 2024

View reviewed changes

joein merged commit fc4b3cf into dev Mar 5, 2024
14 checks passed

NirantK deleted the nirant/upgrade-fastembed-version branch March 6, 2024 09:05

geetu040 mentioned this pull request Mar 6, 2024

Upgrade fastembed version from 0.1.1 to 0.2.1 (latest) #486

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Upgrade FastEmbed Version #493

Upgrade FastEmbed Version #493

NirantK commented Feb 15, 2024

netlify bot commented Feb 15, 2024 •

edited

Loading

joein Feb 15, 2024

joein Feb 15, 2024

joein Feb 15, 2024

joein commented Mar 2, 2024

joein commented Mar 3, 2024

generall Feb 15, 2024


		SUPPORTED_EMBEDDING_MODELS: Dict[str, Tuple[int, models.Distance]] = {
		"BAAI/bge-base-en": (768, models.Distance.COSINE),

Upgrade FastEmbed Version #493

Upgrade FastEmbed Version #493

Conversation

NirantK commented Feb 15, 2024

All Submissions:

New Feature Submissions:

Changes to Core Features:

netlify bot commented Feb 15, 2024 • edited Loading

✅ Deploy Preview for poetic-froyo-8baba7 ready!

joein Feb 15, 2024

Choose a reason for hiding this comment

joein Feb 15, 2024

Choose a reason for hiding this comment

joein Feb 15, 2024

Choose a reason for hiding this comment

joein commented Mar 2, 2024

joein commented Mar 3, 2024

generall Feb 15, 2024

Choose a reason for hiding this comment

netlify bot commented Feb 15, 2024 •

edited

Loading