diff --git a/dataset_preparation/wiki_filters/readme.md b/dataset_preparation/wiki_filters/readme.md new file mode 100644 index 00000000..774fb507 --- /dev/null +++ b/dataset_preparation/wiki_filters/readme.md @@ -0,0 +1,8 @@ +The wikipedia filtered dataset can be found here: + +Base vectors: https://comp21storage.blob.core.windows.net/$web/wiki-cohere-35M/wikipedia_base.bin +Base labels: https://comp21storage.blob.core.windows.net/$web/wiki-cohere-35M/wikipedia_35m_base_labels.txt +Query vectors: https://comp21storage.blob.core.windows.net/$web/wiki-cohere-35M/wikipedia_query.bin +Single query filters: https://comp21storage.blob.core.windows.net/$web/wiki-cohere-35M/wikipedia_35m_query_labels_single.txt +AND of two query filters: https://comp21storage.blob.core.windows.net/$web/wiki-cohere-35M/wikipedia_35m_query_labels.txt +