Adds Open Distro Elastic Search's KNN plugin support. Closes #174. #202

stephenleo · 2020-12-12T02:54:26Z

Adding Open Distro Elasticsearch KNN plugin support by borrowing from the ES setup of elasticsearch and elastiknn. Below is a comparison on fashion-mnist between this opendistroknn and elastiknn, where opendistroknn has ~3X better queries/s at comparable recall.

stephenleo · 2020-12-12T03:02:11Z

@alexklibisz could you please help review?
I have 2 questions:

1. I tried increasing the JVM memory (Xms and Xmx) together with an increase in translog.flush_threshold_size as discussed in Link, but it didn't seem to impact the speed here. So it seems we are CPU bottlenecked rather than memory bottlenecked, right?
2. We cannot increase knn.algo_param.index_thread_qty since we want to limit it to only 1 core, right?

erikbern · 2020-12-12T13:25:57Z

Yeah, you're limited in terms of memory and cores by Docker, so I don't think it will bring anything to change config within the container. Almost all algorithms benefit from more memory/CPU/cores so we try to keep the benchmark simple.

erikbern · 2020-12-12T13:26:58Z

This looks great – thanks!

install/Dockerfile.opendistroknn

ann_benchmarks/algorithms/opendistroknn.py

alexklibisz · 2020-12-12T16:54:19Z

> I tried increasing the JVM memory (Xms and Xmx) together with an increase in translog.flush_threshold_size as discussed in Link but didn't seem to impact the speed here. So seems like we are CPU bottlenecked instead of memory right?

Almost certainly CPU bottlenecked, but you never know without profiling. You can add ports={"8097":"8097"} as an arg to this method call, run the runner, open VisualVM, and it should recognize that there's an Elasticsearch JVM running and let you view memory usage and sample CPU usage. I was able to run Elasticsearch and Elastiknn comfortably with a 3GB heap, but opendistro is running an additional process for HNSW. I'd also suggest booting up an EC2 c5.xlarge or something with a similar memory profile and making sure you can run all the way through with --parallelism 3, since that's the setting @erikbern uses.
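As a rough illustration of the suggestion above, here is a minimal sketch of how the extra port mapping could sit alongside the core and memory limits. It assumes the runner starts containers through the Docker SDK for Python (`client.containers.run`), whose `cpuset_cpus`, `mem_limit`, `detach` and `ports` keyword arguments are used here. The helper name `build_run_kwargs` and its parameters are hypothetical and are not taken from the benchmark code.

```python
def build_run_kwargs(cpu: int, mem_limit_bytes: int, expose_profiler_port: bool = False) -> dict:
    """Build keyword arguments for docker-py's `client.containers.run`.

    Hypothetical helper for illustration; the real runner may be structured differently.
    """
    kwargs = {
        "cpuset_cpus": str(cpu),        # pin the container to a single core
        "mem_limit": mem_limit_bytes,   # hard memory cap for the container, in bytes
        "detach": True,                 # return immediately with a container handle
    }
    if expose_profiler_port:
        # Publish container port 8097 on host port 8097 so a tool running on the
        # host (VisualVM in the comment above) can reach the JVM inside.
        kwargs["ports"] = {"8097": "8097"}
    return kwargs


# Example: one core, a 4 GiB cap, profiler port published.
print(build_run_kwargs(cpu=0, mem_limit_bytes=4 * 1024**3, expose_profiler_port=True))
```

The port mapping is only needed while profiling, so the sketch keeps it behind a flag and leaves the default benchmark configuration unchanged.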

> We cannot increase knn.algo_param.index_thread_qty since we want to limit it to only 1 core right?

I'm not sure about that setting, but yes you generally want to specify parallelism=1 at every level. The runner pins your container to a single core, so if you try to do parallel CPU-bound work on many threads you'll either get killed by the container runtime or waste time context switching.
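For reference, a minimal sketch of how that setting could be held at 1 explicitly. It assumes `knn.algo_param.index_thread_qty` can be updated through Elasticsearch's standard `_cluster/settings` endpoint and that the node listens on `http://localhost:9200`; both are assumptions, and the helper name is hypothetical. The request is only built here, not sent.

```python
import json
from urllib.request import Request


def knn_thread_settings_request(base_url: str = "http://localhost:9200", threads: int = 1) -> Request:
    """Build (but do not send) a PUT request that sets the k-NN index thread count.

    Assumption: the setting is accepted as a persistent cluster setting.
    """
    body = {"persistent": {"knn.algo_param.index_thread_qty": threads}}
    return Request(
        url=f"{base_url}/_cluster/settings",
        data=json.dumps(body).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="PUT",
    )


req = knn_thread_settings_request()
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Passing the returned object to `urllib.request.urlopen` would send it once the node is up.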

stephenleo · 2020-12-13T08:18:12Z

Thank you for reviewing, @alexklibisz. I think EC2 c5.xlarge is insufficient for --parallelism 3 since it only has 8GB RAM while we need 3*3GB. Instead, I've run it successfully on a GCP n1-standard-4 (4 vCPUs, 15 GB RAM) machine.

Here is a snapshot of docker stats during the query (I wasn't successful in getting VisualVM to show the stats). I'm not sure why the memory usage for each container is exceeding 4GB despite setting -Xmx3G. What do you think?

CONTAINER ID        NAME                CPU %               MEM USAGE / LIMIT     MEM %               NET I/O             BLOCK I/O           PIDS
21d5eae537df        beautiful_hermann   98.53%              4.019GiB / 4.511GiB   89.10%              1.44kB / 0B         23.3MB / 1.11GB     59
73b22733b190        dazzling_swartz     98.79%              4.209GiB / 4.512GiB   93.29%              1.44kB / 0B         43.5MB / 942MB      59
5d08d3625cfa        pedantic_ishizaka   98.95%              4.24GiB / 4.511GiB    93.98%              1.44kB / 0B         722MB / 571MB       59

alexklibisz · 2020-12-13T14:49:30Z

Oops, I meant 4xlarge! Must’ve been a typo. Opendistro is using nmslib to run the HNSW model. That likely consumes a chunk of memory. Also, the JVM itself uses memory. The 3GB is just what’s allocated for ES.
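The arithmetic behind this explanation can be sketched from the docker stats snapshot quoted earlier in the thread. The snippet below only reuses the three MEM USAGE / LIMIT values from that snapshot and treats -Xmx3G as a 3 GiB heap. The split of the remainder between nmslib and JVM overhead is not measured here.

```python
import re

# MEM USAGE / LIMIT columns copied from the docker stats snapshot in this thread.
DOCKER_STATS = """\
beautiful_hermann   4.019GiB / 4.511GiB
dazzling_swartz     4.209GiB / 4.512GiB
pedantic_ishizaka   4.24GiB / 4.511GiB
"""

HEAP_GIB = 3.0  # -Xmx3G caps the Java heap at 3 GiB


def parse_usage(text: str) -> dict:
    """Map container name -> (used GiB, limit GiB)."""
    rows = {}
    for line in text.splitlines():
        m = re.match(r"(\S+)\s+([\d.]+)GiB\s*/\s*([\d.]+)GiB", line)
        if m:
            rows[m.group(1)] = (float(m.group(2)), float(m.group(3)))
    return rows


usage = parse_usage(DOCKER_STATS)
total = sum(used for used, _ in usage.values())

for name, (used, limit) in usage.items():
    # Memory above the heap cap is native/off-heap usage plus JVM overhead.
    print(f"{name}: {used - HEAP_GIB:.3f} GiB beyond the heap, "
          f"{limit - used:.3f} GiB below the container limit")
print(f"total across containers: {total:.3f} GiB")
```

Each container sits roughly 1.0 to 1.2 GiB above the heap cap, which is the portion the comment attributes to nmslib and the JVM itself.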

erikbern · 2020-12-13T18:53:12Z

is this ready to be merged?

stephenleo · 2020-12-14T01:17:33Z

I'm good to merge unless @alexklibisz has any concerns?

alexklibisz · 2020-12-14T01:21:18Z

LGTM. Excited to see all the results side-by-side.

alexklibisz · 2020-12-14T01:22:04Z

@erikbern slight tangent: did travis get removed for the repo? If you'd be interested, I've recently converted some other repos to use Github Actions. I could take a pass at that here.

erikbern · 2020-12-15T20:00:48Z

i'm not sure what happened to travis. it's still running, but something is broken with the PR integration, I think

will merge this!

stephenleo · 2020-12-16T02:27:34Z

Strange, the master build failed, though the PR itself passed.
Please let me know if there is anything I should fix.

erikbern · 2020-12-16T16:37:15Z

i'll take a look at it

stephenleo · 2020-12-17T09:23:52Z

Found the issue, @erikbern: Open Distro 1.12.0 was released on the 14th and breaks the installation. I've submitted a PR with the fix.

maumueller · 2020-12-17T20:18:15Z

> @erikbern slight tangent: did travis get removed for the repo? If you'd be interested, I've recently converted some other repos to use Github Actions. I could take a pass at that here.

I would actually like to see CI via github actions. I like that we could just upload the produced plot artifacts to the builds.

alexklibisz · 2020-12-17T20:45:38Z

I'm not sure if I follow this part:

> I like that we could just upload the produced plot artifacts to the builds.

maumueller · 2020-12-17T21:37:23Z

Sorry for the imprecision.

I would like the plots generated via https://github.com/erikbern/ann-benchmarks/blob/master/.travis.yml#L43-L44 to be uploaded to the CI build via https://github.com/actions/upload-artifact. This could be useful for performance bug hunting.

erikbern · 2020-12-17T23:32:59Z

that would be a good idea – I'd be supportive of the change!

stephenleo added 3 commits December 4, 2020 21:33

Added Dockerfile and installs without issue

07b4646

initial working version

57c0d83

fixing the timeouts, warmup and alignning names

e6bf39b

stephenleo added 3 commits December 12, 2020 11:04

add warning to docker file

6c43408

updating README

ac4fe1f

updating travis

58a572b

alexklibisz reviewed Dec 12, 2020

install/Dockerfile.opendistroknn

alexklibisz reviewed Dec 12, 2020

ann_benchmarks/algorithms/opendistroknn.py (outdated)

import es_wait from .elasticsearch instead

eb97504

erikbern merged commit f126b20 into erikbern:master Dec 15, 2020

erikbern added a commit that referenced this pull request Apr 14, 2023

Merge pull request #202 from stephenleo/master

50ce86b

Adds Open Distro Elastic Search's KNN plugin support. Closes #174.
