We've been asked by several folks for a matrix of benchmarks covering different algorithms at different dataset scales. We should just go ahead and produce one and publish it in RAFT's documentation (preferably using RAFT's bench-ann Python scripts so the results are consistently reproducible).
Also, I think this is a good case to advertise my gbench PR #1661, which should make writing the configs much easier and speed up the benchmarks by orders of magnitude.
@achirkin, if we can prioritize getting the gbench changes to the point where the outputs behave like the existing benchmarks (which dump their results to files), then I'm definitely on board w/ getting those merged in. I'd like to avoid breaking the Python scripts in the meantime, now that we're pointing users to them.
I would propose the following matrix:
Batch Size: 1, 100, 10k
Scales: 100k, 10M, 100M
K: 1, 10, 1k
Hardware: T4, V100, A100, H100 (potentially L4, A10)
Algorithms: Brute-force (for smaller scale), IVF-PQ, IVF-Flat, CAGRA
Datasets:
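To get a feel for how many runs the proposed matrix implies, here is a minimal Python sketch that enumerates the combinations listed above. It is only an illustration and does not use the bench-ann scripts or any RAFT API. The algorithm labels, the `benchmark_matrix` helper, and the `BRUTE_FORCE_MAX_SCALE` cutoff are assumptions made for the example: the proposal only says brute-force is "for smaller scale", so limiting it to 100k is a guess. Datasets are left out because the list above does not name any, and the potential L4 and A10 GPUs are not included.

```python
from itertools import product

# Values copied from the proposed matrix.
BATCH_SIZES = [1, 100, 10_000]
SCALES = [100_000, 10_000_000, 100_000_000]
KS = [1, 10, 1_000]
HARDWARE = ["T4", "V100", "A100", "H100"]

# Illustrative labels only; these are not bench-ann algorithm identifiers.
ALGORITHMS = ["brute_force", "ivf_pq", "ivf_flat", "cagra"]

# Assumption: "smaller scale" for brute-force means the 100k scale only.
BRUTE_FORCE_MAX_SCALE = 100_000


def benchmark_matrix():
    """Yield one dict per (algorithm, hardware, scale, batch size, k) combination."""
    for algo, hw, scale, batch, k in product(
        ALGORITHMS, HARDWARE, SCALES, BATCH_SIZES, KS
    ):
        # Skip brute-force at scales above the assumed cutoff.
        if algo == "brute_force" and scale > BRUTE_FORCE_MAX_SCALE:
            continue
        yield {
            "algorithm": algo,
            "hardware": hw,
            "scale": scale,
            "batch_size": batch,
            "k": k,
        }


configs = list(benchmark_matrix())
print(f"{len(configs)} runs per dataset")
```

Under these assumptions the matrix works out to 360 runs per dataset, which gives a rough sense of how much benchmark time a faster harness would save.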