[FEATURE] Latency metrics around graph creation and other KNN operation. #999

dhwanilpatel · 2023-07-20T12:36:29Z

Problem statement
KNN index performs graph creations during refresh/flush of the index. Graph creation is considered as very expensive operation and can take long time based on different parameters like refresh interval (i.e size of translog).

User don't have visibility around how much time does it take for graph creation. This will provide better visibility to user which can help tune various parameters.

Refresh/flush is not only triggered in background activities but it can be triggered during other operations as well like bulk/recovery/etc. Graph creation can increase the overall latency for these operation as well. These metrics can help us triaging such issues as well like long running bulk/recovery.

What solution would you like?

We should expose the overall latency metrics for graph creation. We can see how better we can expose this as with cumulative metrics or individual metrics for each graph creation.

We can plan to expose it per index or per shard level, as this can change based on data or configuration of index/shard as well.

We should add latency metrics around other KNN operations as well and not just for graph creation, it will provide better visibility in KNN operations.

navneet1v · 2023-07-21T16:31:09Z

@dhwanilpatel
Some questions on this:

Apart from graph creation, what are the other operations where we need latency metrics? Can you please provide some details around that.
Graphs are created per segments of a shard, and where we should put metrics depends on what use case we want to solve. So can you please add some details around what is the exact customer need.

dhwanilpatel added enhancement untriaged labels Jul 20, 2023

navneet1v removed the untriaged label Jul 21, 2023

vamshin added the v2.10.0 Issues targeting release v2.10.0 label Jul 21, 2023

vamshin assigned navneet1v Jul 21, 2023

navneet1v mentioned this issue Sep 21, 2023

Add graph creation stats to the KNNStats API #1141

Merged

5 tasks

navneet1v added v2.11.0 and removed v2.10.0 Issues targeting release v2.10.0 labels Sep 22, 2023

heemin32 closed this as completed Oct 4, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEATURE] Latency metrics around graph creation and other KNN operation. #999

[FEATURE] Latency metrics around graph creation and other KNN operation. #999

dhwanilpatel commented Jul 20, 2023

navneet1v commented Jul 21, 2023

[FEATURE] Latency metrics around graph creation and other KNN operation. #999

[FEATURE] Latency metrics around graph creation and other KNN operation. #999

Comments

dhwanilpatel commented Jul 20, 2023

navneet1v commented Jul 21, 2023