Implement UINT8 vector type - [MOD-8230, MOD-8408] #584

GuyAv46 · 2024-12-26T14:17:27Z

Implement support for new type `UINT8`

This PR follows the implementation of the INT8 type and implements all the functionality required for the UINT8 type.

Mark if applicable

This PR introduces API changes
This PR introduces serialization changes

codecov · 2024-12-30T15:54:52Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 97.19%. Comparing base (3291f63) to head (727b86c).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #584      +/-   ##
==========================================
+ Coverage   97.09%   97.19%   +0.10%     
==========================================
  Files         104      106       +2     
  Lines        5503     5713     +210     
==========================================
+ Hits         5343     5553     +210     
  Misses        160      160

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

src/VecSim/spaces/IP/IP_AVX512F_BW_VL_VNNI_INT8.h

src/VecSim/spaces/functions/implementation_chooser.h

src/VecSim/spaces/normalize/normalize_naive.h

tests/unit/unit_test_utils.cpp

src/VecSim/spaces/L2/L2_AVX512F_BW_VL_VNNI_UINT8.h

alonre24 · 2025-01-01T10:28:45Z

src/VecSim/spaces/IP_space.cpp

+        // For uint8 vectors with cosine distance, the extra float for the norm shifts alignment to
+        // `(dim + sizeof(float)) % 32`.
+        // Vectors satisfying this have a residual, causing offset loads during calculation.
+        // To avoid complexity, we skip alignment here, assuming the performance impact is
+        // negligible.


I need to better understand the considerations here (I see it follows INT8 as well)

👍🏼. I also thought about it and if we put the norm at the beginning of the vector we will be able to use alignment

As discussed - let's open a (small) spike to try that

tests/flow/test_bruteforce.py

meiravgri

cool

src/VecSim/spaces/IP/IP.cpp

src/VecSim/spaces/IP/IP_AVX512F_BW_VL_VNNI_UINT8.h

src/VecSim/spaces/L2/L2.cpp

src/VecSim/spaces/functions/AVX512F_BW_VL_VNNI.cpp

src/VecSim/spaces/normalize/normalize_naive.h

tests/benchmark/spaces_benchmarks/bm_spaces_int8.cpp

tests/benchmark/benchmarks.sh

tests/flow/test_bruteforce.py

alonre24 · 2025-01-02T22:08:08Z

src/VecSim/spaces/IP/IP.cpp

+// The type should be able to hold `dimension * MAX_VAL(int_elem_t) * MAX_VAL(int_elem_t)`.
+// To support dimension up to 2^16, we need the difference between the type and int_elem_t to be at
+// least 2 bytes. We assert that in the implementation.
+template <typename int_elem_t>
+using ret_t = std::conditional_t<sizeof(int_elem_t) == 1, int, long long>;
+
+template <typename int_elem_t>
+static inline ret_t<int_elem_t>
+INTEGER_InnerProductImp(const int_elem_t *pVect1, const int_elem_t *pVect2, size_t dimension) {
+    static_assert(sizeof(ret_t<int_elem_t>) - sizeof(int_elem_t) * 2 >= sizeof(uint16_t));
+    ret_t<int_elem_t> res = 0;


This generic introduced for possible future integer types larger than 1 byte?

GuyAv46 added 8 commits December 25, 2024 17:28

cleanup implementation chooser

db8be36

defining new API and naive implementation

36b8fa1

define new uint8 type definitions

f91ff30

add new type to all the factories

1a3f11f

add new type tp the python bindings

bd571af

format

a8cee7b

first attempt of implementing optimized implementation

a7c6075

implement benchmarks for uint8

433eedf

GuyAv46 added the bm-spaces label Dec 26, 2024

GuyAv46 added 10 commits December 26, 2024 18:16

implement unit tests

cd6b58e

fix L2 implementation

496d52f

fix uint8 range test

86a948e

format

4d1a837

cleanup

8034476

added flow/bindings tests

c7935cf

fix int benchmarks files

9c99d19

added cosine to naive int implementations

4be3396

Merge remote-tracking branch 'origin/main' into guyav-implement_uint8

e78c017

fix flow tests

dfd51f3

GuyAv46 changed the title ~~Implement UINT8 vector type - [MOD-8230]~~ Implement UINT8 vector type - [MOD-8230, MOD-8408] Dec 30, 2024

GuyAv46 added the backport 8.0 label Dec 30, 2024

GuyAv46 marked this pull request as ready for review December 30, 2024 14:48

GuyAv46 requested a review from meiravgri December 30, 2024 14:48

GuyAv46 added 2 commits December 30, 2024 16:53

unpack lo before high

6443294

remove todos

aa21e23

GuyAv46 requested a review from alonre24 December 31, 2024 09:14

alonre24 reviewed Jan 1, 2025

View reviewed changes

Merge remote-tracking branch 'origin/main' into guyav-implement_uint8

4c73a3b

meiravgri reviewed Jan 1, 2025

View reviewed changes

GuyAv46 added 3 commits January 2, 2025 13:51

address some review comments

8274a2b

alternative implementation of residual handling

9ee0d45

extend test coverage

727b86c

GuyAv46 requested review from meiravgri and alonre24 January 2, 2025 15:28

alonre24 reviewed Jan 2, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement UINT8 vector type - [MOD-8230, MOD-8408] #584

Implement UINT8 vector type - [MOD-8230, MOD-8408] #584

GuyAv46 commented Dec 26, 2024 •

edited

Loading

codecov bot commented Dec 30, 2024 •

edited

Loading

alonre24 Jan 1, 2025

GuyAv46 Jan 1, 2025

alonre24 Jan 1, 2025

meiravgri left a comment

alonre24 Jan 2, 2025

Implement UINT8 vector type - [MOD-8230, MOD-8408] #584

Are you sure you want to change the base?

Implement UINT8 vector type - [MOD-8230, MOD-8408] #584

Conversation

GuyAv46 commented Dec 26, 2024 • edited Loading

Implement support for new type UINT8

codecov bot commented Dec 30, 2024 • edited Loading

Codecov Report

alonre24 Jan 1, 2025

Choose a reason for hiding this comment

GuyAv46 Jan 1, 2025

Choose a reason for hiding this comment

alonre24 Jan 1, 2025

Choose a reason for hiding this comment

meiravgri left a comment

Choose a reason for hiding this comment

alonre24 Jan 2, 2025

Choose a reason for hiding this comment

GuyAv46 commented Dec 26, 2024 •

edited

Loading

Implement support for new type `UINT8`

codecov bot commented Dec 30, 2024 •

edited

Loading