diff --git a/README.md b/README.md index 2c7f83ad02..0e8b2d9c89 100755 --- a/README.md +++ b/README.md @@ -2,7 +2,23 @@ ![RAFT tech stack](img/raft-tech-stack-vss.png) -## Resources + +## Contents +
+ +1. [Useful Resources](#useful-resources) +2. [What is RAFT?](#what-is-raft) +2. [Use cases](#use-cases) +3. [Is RAFT right for me?](#is-raft-right-for-me) +4. [Getting Started](#getting-started) +5. [Installing RAFT](#installing) +6. [Codebase structure and contents](#folder-structure-and-contents) +7. [Contributing](#contributing) +8. [References](#references) + +
+ +## Useful Resources - [RAFT Reference Documentation](https://docs.rapids.ai/api/raft/stable/): API Documentation. - [RAFT Getting Started](./docs/source/quick_start.md): Getting started with RAFT. @@ -12,7 +28,9 @@ - [GitHub repository](https://github.com/rapidsai/raft): Download the RAFT source code. - [Issue tracker](https://github.com/rapidsai/raft/issues): Report issues or request features. -## Overview + + +## What is RAFT? RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications. @@ -23,16 +41,16 @@ By taking a primitives-based approach to algorithm development, RAFT While not exhaustive, the following general categories help summarize the accelerated functions in RAFT: ##### -| Category | Examples | -| --- |-----------------------------------------------------------------------------------------------------------------------------------| -| **Data Formats** | sparse & dense, conversions, data generation | -| **Dense Operations** | linear algebra, matrix and vector operations, reductions, slicing, norms, factorization, least squares, svd & eigenvalue problems | +| Category | Accelerated Functions in RAFT | +|-----------------------|-----------------------------------------------------------------------------------------------------------------------------------| +| **Nearest Neighbors** | vector search, neighborhood graph construction, epsilon neighborhoods, pairwise distances | +| **Basic Clustering** | spectral clustering, hierarchical clustering, k-means | +| **Solvers** | combinatorial optimization, iterative solvers | +| **Data Formats** | sparse & dense, conversions, data generation | +| **Dense Operations** | linear algebra, matrix and vector operations, reductions, slicing, norms, factorization, least squares, svd & eigenvalue problems | | **Sparse Operations** | linear algebra, eigenvalue problems, slicing, norms, reductions, factorization, symmetrization, components & labeling | -| **Spatial** | pairwise distances, nearest neighbors and vector search, neighborhood graph construction | -| **Basic Clustering** | spectral clustering, hierarchical clustering, k-means | -| **Solvers** | combinatorial optimization, iterative solvers | -| **Statistics** | sampling, moments and summary statistics, metrics | -| **Tools & Utilities** | common utilities for developing CUDA applications, multi-node multi-gpu infrastructure | +| **Statistics** | sampling, moments and summary statistics, metrics, model evaluation | +| **Tools & Utilities** | common tools and utilities for developing CUDA applications, multi-node multi-gpu infrastructure | RAFT is a C++ header-only template library with an optional shared library that @@ -49,24 +67,40 @@ In addition being a C++ library, RAFT also provides 2 Python libraries: ### Vector Similarity Search -RAFT contains state-of-the-art implementations of approximate nearest neighbors algorithms on the GPU that enable vector similarity search. Vector similarity search applications often require fast online queries done one-at-a-time and RAFT's graph-based [CAGRA](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#cagra) algorithm outperforms the state-of-the art on the CPU (hierarchical navigable small-world graph or HNSW). +RAFT contains state-of-the-art implementations of approximate nearest neighbors search (ANNS) algorithms on the GPU, such as: -In addition to CAGRA, RAFT contains other state-of-the-art GPU-accelerated implementations of popular algorithms for vector similarity search, such as [IVF-Flat](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#ivf-flat) and [IVF-PQ](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#ivf-pq) algorithms originally popularized by the [FAISS](https://github.com/facebookresearch/faiss) library. +* [Brute force](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#brute-force). Performs a brute force nearest neighbors search without an index. +* [IVF-Flat](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#ivf-flat) and [IVF-PQ](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#ivf-pq). Use an inverted file index structure to map contents to their locations. IVF-PQ additionally uses product quantization to reduce the memory usage of vectors. These methods were originally popularized by the [FAISS](https://github.com/facebookresearch/faiss) library. +* [CAGRA](https://docs.rapids.ai/api/raft/nightly/pylibraft_api/neighbors/#cagra) (Cuda Anns GRAph-based). Uses a fast ANNS graph construction and search implementation optimized for the GPU. CAGRA outperforms state-of-the art CPU methods (i.e. HNSW) for large batch queries, single queries, and graph construction time. + +Projects that use the RAFT ANNS algorithms for accelerating vector search include: [Milvus](https://milvus.io/), [Redis](https://redis.io/), and [Faiss](https://github.com/facebookresearch/faiss). + +Please see the example [Jupyter notebook](https://github.com/rapidsai/raft/blob/HEAD/notebooks/VectorSearch_QuestionRetrieval.ipynb) to get started RAFT for vector search in Python. ### Information Retrieval -RAFT also contains a catalog of reusable primitives for composing algorithms that require fast neighborhood computations, such as +RAFT contains a catalog of reusable primitives for composing algorithms that require fast neighborhood computations, such as 1. Computing distances between vectors and computing kernel gramm matrices 2. Performing ball radius queries for constructing epsilon neighborhoods 3. Clustering points to partition a space for smaller and faster searches 4. Constructing neighborhood "connectivities" graphs from dense vectors -As an example, computations such as the above list are critical for information retrieval, data mining, and machine learning applications such as clustering, manifold learning, and dimensionality reduction. +### Machine Learning + +RAFT's primitives are used in several RAPIDS libraries, including [cuML](https://github.com/rapidsai/cuml), [cuGraph](https://github.com/rapidsai/cugraph), and [cuOpt](https://github.com/rapidsai/cuopt) to build many end-to-end machine learning algorithms that span a large spectrum of different applications, including +- data generation +- model evaluation +- classification and regression +- clustering +- manifold learning +- dimensionality reduction. + +RAFT is also used by the popular collaborative filtering library [implicit](https://github.com/benfred/implicit) for recommender systems. ## Is RAFT right for me? -RAFT contains low level primitives for accelerating applications and workflows. Data source providers and application developers may find specific tools -- like ANN algorithms -- very useful. RAFT is not intended to be used directly by data scientists for discovery and experimentation. For data science tools, please see the [RAPIDS website](https://rapids.ai/). +RAFT contains low-level primitives for accelerating applications and workflows. Data source providers and application developers may find specific tools -- like ANN algorithms -- very useful. RAFT is not intended to be used directly by data scientists for discovery and experimentation. For data science tools, please see the [RAPIDS website](https://rapids.ai/). ## Getting started @@ -357,4 +391,4 @@ If citing the single-linkage agglomerative clustering APIs, please consider the eprint={2306.16354}, archivePrefix={arXiv}, primaryClass={cs.LG} -} \ No newline at end of file +} diff --git a/conda/environments/all_cuda-118_arch-x86_64.yaml b/conda/environments/all_cuda-118_arch-x86_64.yaml index 5fd808f07c..057e33faf3 100644 --- a/conda/environments/all_cuda-118_arch-x86_64.yaml +++ b/conda/environments/all_cuda-118_arch-x86_64.yaml @@ -18,7 +18,7 @@ dependencies: - cudatoolkit - cupy>=12.0.0 - cxx-compiler -- cython>=0.29,<0.30 +- cython>=3.0.0 - dask-core>=2023.5.1 - dask-cuda==23.10.* - dask>=2023.5.1 diff --git a/conda/environments/all_cuda-120_arch-x86_64.yaml b/conda/environments/all_cuda-120_arch-x86_64.yaml index a87ce4428e..403afe7413 100644 --- a/conda/environments/all_cuda-120_arch-x86_64.yaml +++ b/conda/environments/all_cuda-120_arch-x86_64.yaml @@ -18,7 +18,7 @@ dependencies: - cuda-version=12.0 - cupy>=12.0.0 - cxx-compiler -- cython>=0.29,<0.30 +- cython>=3.0.0 - dask-core>=2023.5.1 - dask-cuda==23.10.* - dask>=2023.5.1 diff --git a/conda/environments/bench_ann_cuda-118_arch-x86_64.yaml b/conda/environments/bench_ann_cuda-118_arch-x86_64.yaml index e39e722224..278d8c4d5a 100644 --- a/conda/environments/bench_ann_cuda-118_arch-x86_64.yaml +++ b/conda/environments/bench_ann_cuda-118_arch-x86_64.yaml @@ -15,7 +15,7 @@ dependencies: - cuda-version=11.8 - cudatoolkit - cxx-compiler -- cython>=0.29,<0.30 +- cython>=3.0.0 - faiss-proc=*=cuda - gcc_linux-64=11.* - glog>=0.6.0 diff --git a/conda/recipes/pylibraft/meta.yaml b/conda/recipes/pylibraft/meta.yaml index 7468039539..cc781d0cba 100644 --- a/conda/recipes/pylibraft/meta.yaml +++ b/conda/recipes/pylibraft/meta.yaml @@ -45,7 +45,7 @@ requirements: - cuda-python >=12.0,<13.0a0 {% endif %} - cuda-version ={{ cuda_version }} - - cython >=0.29,<0.30 + - cython >=3.0.0 - libraft {{ version }} - libraft-headers {{ version }} - numpy >=1.21 diff --git a/conda/recipes/raft-dask/meta.yaml b/conda/recipes/raft-dask/meta.yaml index 940a301755..b189abb313 100644 --- a/conda/recipes/raft-dask/meta.yaml +++ b/conda/recipes/raft-dask/meta.yaml @@ -45,7 +45,7 @@ requirements: - cuda-python >=12.0,<13.0a0 {% endif %} - cuda-version ={{ cuda_version }} - - cython >=0.29,<0.30 + - cython >=3.0.0 - nccl >=2.9.9 - pylibraft {{ version }} - python x.x diff --git a/dependencies.yaml b/dependencies.yaml index 190c52d954..4b825b48f0 100644 --- a/dependencies.yaml +++ b/dependencies.yaml @@ -110,7 +110,7 @@ dependencies: - output_types: [conda, requirements, pyproject] packages: - &cmake_ver cmake>=3.26.4 - - cython>=0.29,<0.30 + - cython>=3.0.0 - ninja - scikit-build>=0.13.1 - output_types: [conda] diff --git a/docs/source/index.rst b/docs/source/index.rst index 37235c2f25..b5d6abbbab 100644 --- a/docs/source/index.rst +++ b/docs/source/index.rst @@ -5,19 +5,19 @@ RAPIDS RAFT: Reusable Accelerated Functions and Tools for Vector Search and More :width: 800 :alt: RAFT Tech Stack -Resources -######### +Useful Resources +################ .. _raft_reference: https://docs.rapids.ai/api/raft/stable/ -- `Example Notebooks `_: Example jupyer notebooks +- `Example Notebooks `_: Example Jupyter notebooks - `RAPIDS Community `_: Get help, contribute, and collaborate. - `GitHub repository `_: Download the RAFT source code. - `Issue tracker `_: Report issues or request features. -Overview -######## +What is RAFT? +############# RAFT contains fundamental widely-used algorithms and primitives for machine learning and information retrieval. The algorithms are CUDA-accelerated and form building blocks for more easily writing high performance applications. @@ -41,8 +41,8 @@ While not exhaustive, the following general categories help summarize the accele - linear algebra, matrix and vector operations, slicing, norms, factorization, least squares, svd & eigenvalue problems * - Sparse Operations - linear algebra, eigenvalue problems, slicing, norms, reductions, factorization, symmetrization, components & labeling - * - Spatial - - pairwise distances, nearest neighbors and vector search, neighborhood graph construction + * - Nearest Neighbors + - pairwise distances, vector search, epsilon neighborhoods, neighborhood graph construction * - Basic Clustering - spectral clustering, hierarchical clustering, k-means * - Solvers diff --git a/docs/source/raft_ann_benchmarks.md b/docs/source/raft_ann_benchmarks.md index 91958c0bcd..3a3db0f3ea 100644 --- a/docs/source/raft_ann_benchmarks.md +++ b/docs/source/raft_ann_benchmarks.md @@ -48,7 +48,7 @@ python scripts/ann-benchmarks/plot.py --result_csv out.csv ``` ### End-to-end example: Billion-scale -`scripts/get_dataset.py` cannot be used to download the [billion-scale datasets](ann_benchmarks_dataset.html#billion-scale) +`scripts/get_dataset.py` cannot be used to download the [billion-scale datasets](ann_benchmarks_dataset.md#billion-scale) because they are so large. You should instead use our billion-scale datasets guide to download and prepare them. All other python scripts mentioned below work as intended once the billion-scale dataset has been downloaded. diff --git a/python/pylibraft/pyproject.toml b/python/pylibraft/pyproject.toml index 5c3a50f50c..bbf4c1d3cb 100644 --- a/python/pylibraft/pyproject.toml +++ b/python/pylibraft/pyproject.toml @@ -17,7 +17,7 @@ requires = [ "cmake>=3.26.4", "cuda-python>=11.7.1,<12.0a0", - "cython>=0.29,<0.30", + "cython>=3.0.0", "ninja", "rmm==23.10.*", "scikit-build>=0.13.1", diff --git a/python/raft-dask/pyproject.toml b/python/raft-dask/pyproject.toml index ab9cad0e8f..68b6e7e2ad 100644 --- a/python/raft-dask/pyproject.toml +++ b/python/raft-dask/pyproject.toml @@ -16,7 +16,7 @@ requires = [ "cmake>=3.26.4", - "cython>=0.29,<0.30", + "cython>=3.0.0", "ninja", "scikit-build>=0.13.1", "setuptools",