Skip to content

Commit

Permalink
Add Vector Search Samples and Tests (#35820)
Browse files Browse the repository at this point in the history
Add Vector Search Samples and Tests
  • Loading branch information
alzimmermsft authored Jul 11, 2023
1 parent fd5cbfe commit e6ed3b0
Show file tree
Hide file tree
Showing 42 changed files with 20,380 additions and 297 deletions.
2 changes: 1 addition & 1 deletion eng/code-quality-reports/pom.xml
Original file line number Diff line number Diff line change
@@ -1,4 +1,4 @@
<!-- Copyright (c) Microsoft Corporation. All rights reserved.
<!-- Copyright (c) Microsoft Corporation. All rights reserved.
Licensed under the MIT License. -->
<project xmlns="http://maven.apache.org/POM/4.0.0"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
Expand Down
10 changes: 8 additions & 2 deletions sdk/search/azure-search-documents/CHANGELOG.md
Original file line number Diff line number Diff line change
@@ -1,14 +1,20 @@
# Release History

## 11.6.0-beta.7 (Unreleased)
## 11.6.0-beta.7 (2023-07-11)

### Features Added

- Added support for [Vector Search](https://learn.microsoft.com/azure/search/vector-search-overview) ([Examples](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/search/azure-search-documents/src/samples/java/com/azure/search/documents/VectorSearchExample.java)).

### Breaking Changes

### Bugs Fixed
- Deprecated `EntityRecognitionSkillVersion.V1` and `SentimentSkillVersion.V1`, and corresponding constructors in
`EntityRecognitionSkill` and `SentimentSkill`, use `EntityRecognitionSkillVersion.V3` and `SentimentSkillVersion.V3`
instead. See [Cognitive Search skill deprecated](https://learn.microsoft.com/azure/search/cognitive-search-skill-deprecated)
for more details.

### Other Changes

- Migrate test recordings to assets repo.

## 11.5.8 (2023-06-09)
Expand Down
88 changes: 81 additions & 7 deletions sdk/search/azure-search-documents/README.md
Original file line number Diff line number Diff line change
Expand Up @@ -95,15 +95,20 @@ See [choosing a pricing tier](https://docs.microsoft.com/azure/search/search-sku

### Authenticate the client

In order to interact with the Azure Cognitive Search service you'll need to create an instance of the Search Client class.
To make this possible you will need,
To interact with the Search service, you'll need to create an instance of the appropriate client class: `SearchClient`
for searching indexed documents, `SearchIndexClient` for managing indexes, or `SearchIndexerClient` for crawling data
sources and loading search documents into an index. To instantiate a client object, you'll need an **endpoint** and
**API key**. You can refer to the documentation for more information on [supported authenticating approaches](https://learn.microsoft.com/azure/search/search-security-overview#authentication)
with the Search service.

1. [URL endpoint](https://docs.microsoft.com/azure/search/search-create-service-portal#get-a-key-and-url-endpoint)
1. [API key](https://docs.microsoft.com/azure/search/search-create-service-portal#get-a-key-and-url-endpoint)
#### Get an API Key

for your service. [The api-key is the sole mechanism for authenticating access to
your search service endpoint.](https://docs.microsoft.com/azure/search/search-security-api-keys)
You can obtain your api-key from the [Azure portal](https://portal.azure.com/) or via the Azure CLI:
You can get the **endpoint** and an **API key** from the Search service in the [Azure Portal](https://portal.azure.com/).
Please refer the [documentation](https://docs.microsoft.com/azure/search/search-security-api-keys) for instructions on
how to get an API key.

Alternatively, you can use the following [Azure CLI](https://learn.microsoft.com/cli/azure/) command to retrieve the
API key from the Search service:

```bash
az search admin-key show --service-name <mysearch> --resource-group <mysearch-rg>
Expand Down Expand Up @@ -188,6 +193,46 @@ SearchAsyncClient searchAsyncClient = new SearchClientBuilder()
.buildAsyncClient();
```

#### Create a client using Azure Active Directory authentication

You can also create a `SearchClient`, `SearchIndexClient`, or `SearchIndexerClient` using Azure Active Directory (AAD)
authentication. Your user or service principal must be assigned the "Search Index Data Reader" role.
Using the [DefaultAzureCredential](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/identity/azure-identity/README.md#defaultazurecredential)
you can authenticate a service using Managed Identity or a service principal, authenticate as a developer working on an
application, and more all without changing code. Please refer the [documentation](https://learn.microsoft.com/azure/search/search-security-rbac?tabs=config-svc-portal%2Croles-portal%2Ctest-portal%2Ccustom-role-portal%2Cdisable-keys-portal)
for instructions on how to connect to Azure Cognitive Search using Azure role-based access control (Azure RBAC).

Before you can use the `DefaultAzureCredential`, or any credential type from [Azure.Identity](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/identity/azure-identity/README.md),
you'll first need to [install the Azure.Identity package](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/identity/azure-identity/README.md#include-the-package).

To use `DefaultAzureCredential` with a client ID and secret, you'll need to set the `AZURE_TENANT_ID`,
`AZURE_CLIENT_ID`, and `AZURE_CLIENT_SECRET` environment variables; alternatively, you can pass those values
to the `ClientSecretCredential` also in `azure-identity`.

Make sure you use the right namespace for `DefaultAzureCredential` at the top of your source file:

```java
import com.azure.identity.DefaultAzureCredential;
import com.azure.identity.DefaultAzureCredentialBuilder;
```

Then you can create an instance of `DefaultAzureCredential` and pass it to a new instance of your client:

```java readme-sample-searchClientWithTokenCredential
String indexName = "nycjobs";

// Get the service endpoint from the environment
String endpoint = Configuration.getGlobalConfiguration().get("SEARCH_ENDPOINT");
DefaultAzureCredential credential = new DefaultAzureCredentialBuilder().build();

// Create a client
SearchClient client = new SearchClientBuilder()
.endpoint(endpoint)
.indexName(indexName)
.credential(credential)
.buildClient();
```

### Send your first search query

To get running with Azure Cognitive Search first create an index following this [guide][search-get-started-portal].
Expand Down Expand Up @@ -220,6 +265,35 @@ tables.)_ The `azure-search-documents` client library exposes operations on thes
* [Start indexers to automatically crawl data sources](https://docs.microsoft.com/rest/api/searchservice/indexer-operations)
* [Define AI powered Skillsets to transform and enrich your data](https://docs.microsoft.com/rest/api/searchservice/skillset-operations)

Azure Cognitive Search provides two powerful features:

### Semantic Search

Semantic search enhances the quality of search results for text-based queries. By enabling Semantic Search on your
search service, you can improve the relevance of search results in two ways:

- It applies secondary ranking to the initial result set, promoting the most semantically relevant results to the top.
- It extracts and returns captions and answers in the response, which can be displayed on a search page to enhance the
user's search experience.

To learn more about Semantic Search, you can refer to the [documentation](https://learn.microsoft.com/azure/search/vector-search-overview).

### Vector Search

Vector Search is an information retrieval technique that overcomes the limitations of traditional keyword-based search.
Instead of relying solely on lexical analysis and matching individual query terms, Vector Search utilizes machine
learning models to capture the contextual meaning of words and phrases. It represents documents and queries as vectors
in a high-dimensional space called an embedding. By understanding the intent behind the query, Vector Search can deliver
more relevant results that align with the user's requirements, even if the exact terms are not present in the document.
Moreover, Vector Search can be applied to various types of content, including images and videos, not just text.

To learn how to index vector fields and perform vector search, you can refer to the [sample](https://github.com/Azure/azure-sdk-for-java/blob/main/sdk/search/azure-search-documents/src/samples/java/com/azure/search/documents/VectorSearchExample.java).
This sample provides detailed guidance on indexing vector fields and demonstrates how to perform vector search.

Additionally, for more comprehensive information about Vector Search, including its concepts and usage, you can refer
to the [documentation](https://learn.microsoft.com/azure/search/vector-search-overview). The documentation provides
in-depth explanations and guidance on leveraging the power of Vector Search in Azure Cognitive Search.

## Examples

The following examples all use a simple [Hotel data set](https://github.com/Azure-Samples/azure-search-sample-data)
Expand Down
2 changes: 1 addition & 1 deletion sdk/search/azure-search-documents/assets.json
Original file line number Diff line number Diff line change
Expand Up @@ -2,5 +2,5 @@
"AssetsRepo": "Azure/azure-sdk-assets",
"AssetsRepoPrefixPath": "java",
"TagPrefix": "java/search/azure-search-documents",
"Tag": "java/search/azure-search-documents_67bdd0527a"
"Tag": "java/search/azure-search-documents_501669cb17"
}
6 changes: 0 additions & 6 deletions sdk/search/azure-search-documents/pom.xml
Original file line number Diff line number Diff line change
Expand Up @@ -115,12 +115,6 @@
<version>5.9.3</version> <!-- {x-version-update;org.junit.jupiter:junit-jupiter-params;external_dependency} -->
<scope>test</scope>
</dependency>
<dependency>
<groupId>org.mockito</groupId>
<artifactId>mockito-core</artifactId>
<version>4.11.0</version> <!-- {x-version-update;org.mockito:mockito-core;external_dependency} -->
<scope>test</scope>
</dependency>
<dependency>
<groupId>io.projectreactor</groupId>
<artifactId>reactor-test</artifactId>
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -1072,11 +1072,15 @@ static SearchRequest createSearchRequest(String searchText, SearchOptions option
.setSkip(options.getSkip())
.setTop(options.getTop())
.setCaptions(createSearchRequestCaptions(options))
.setSemanticFields(nullSafeStringJoin(options.getSemanticFields()));
.setSemanticFields(nullSafeStringJoin(options.getSemanticFields()))
.setSemanticErrorHandling(options.getSemanticErrorHandling())
.setSemanticMaxWaitInMilliseconds(options.getSemanticMaxWaitInMilliseconds())
.setDebug(options.getDebug())
.setVector(options.getVector());
}

static String createSearchRequestAnswers(SearchOptions searchOptions) {
QueryAnswerType answer = searchOptions.getAnswers();
QueryAnswerType answer = searchOptions.getQueryAnswer();
Integer answersCount = searchOptions.getAnswersCount();
Double answerThreshold = searchOptions.getAnswerThreshold();

Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -17,7 +17,12 @@ public enum SearchServiceVersion implements ServiceVersion {
/**
* {@code 2021-04-30-Preview} service version.
*/
V2021_04_30_PREVIEW("2021-04-30-Preview");
V2021_04_30_PREVIEW("2021-04-30-Preview"),

/**
* {@code 2023-07-01-Preview} service version.
*/
V2023_07_01_PREVIEW("2023-07-01-Preview");

private final String version;

Expand All @@ -39,6 +44,6 @@ public String getVersion() {
* @return The latest version supported by this client library.
*/
public static SearchServiceVersion getLatest() {
return V2021_04_30_PREVIEW;
return V2023_07_01_PREVIEW;
}
}

This file was deleted.

Original file line number Diff line number Diff line change
Expand Up @@ -17,6 +17,7 @@
import com.azure.search.documents.models.QueryType;
import com.azure.search.documents.models.ScoringStatistics;
import com.azure.search.documents.models.SearchMode;
import com.azure.search.documents.models.SearchQueryVector;
import com.azure.search.documents.models.SemanticErrorHandling;
import java.io.IOException;
import java.util.List;
Expand Down Expand Up @@ -205,7 +206,7 @@ public final class SearchRequest implements JsonSerializable<SearchRequest> {
/*
* The query parameters for vector and hybrid search queries.
*/
private Vector vector;
private SearchQueryVector vector;

/** Creates an instance of SearchRequest class. */
public SearchRequest() {}
Expand Down Expand Up @@ -865,7 +866,7 @@ public SearchRequest setSemanticFields(String semanticFields) {
*
* @return the vector value.
*/
public Vector getVector() {
public SearchQueryVector getVector() {
return this.vector;
}

Expand All @@ -875,7 +876,7 @@ public Vector getVector() {
* @param vector the vector value to set.
* @return the SearchRequest object itself.
*/
public SearchRequest setVector(Vector vector) {
public SearchRequest setVector(SearchQueryVector vector) {
this.vector = vector;
return this;
}
Expand Down Expand Up @@ -995,7 +996,7 @@ public static SearchRequest fromJson(JsonReader jsonReader) throws IOException {
} else if ("semanticFields".equals(fieldName)) {
deserializedSearchRequest.semanticFields = reader.getString();
} else if ("vector".equals(fieldName)) {
deserializedSearchRequest.vector = Vector.fromJson(reader);
deserializedSearchRequest.vector = SearchQueryVector.fromJson(reader);
} else {
reader.skipChildren();
}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -44,10 +44,18 @@ public final class EntityRecognitionSkill extends SearchIndexerSkill {

/**
* Creates an instance of EntityRecognitionSkill class.
* <p>
* The instance of SentimentSkill uses {@link EntityRecognitionSkillVersion#V1}, to set the specific version of the
* skill use {@link #EntityRecognitionSkill(List, List, EntityRecognitionSkillVersion)}.
*
* @param inputs the inputs value to set.
* @param outputs the outputs value to set.
* @deprecated Use {@link #EntityRecognitionSkill(List, List, EntityRecognitionSkillVersion)} as
* {@link EntityRecognitionSkillVersion#V1} is deprecated. See
* <a href="https://learn.microsoft.com/azure/search/cognitive-search-skill-deprecated">skill deprecation</a> for
* more information.
*/
@Deprecated
public EntityRecognitionSkill(List<InputFieldMappingEntry> inputs, List<OutputFieldMappingEntry> outputs) {
this(inputs, outputs, EntityRecognitionSkillVersion.V1);
}
Expand Down
Original file line number Diff line number Diff line change
Expand Up @@ -9,7 +9,12 @@
public enum EntityRecognitionSkillVersion {
/**
* Version 1 of {@link EntityRecognitionSkill}.
*
* @deprecated This version of the skill is deprecated, please use {@link #V3}. See
* <a href="https://learn.microsoft.com/azure/search/cognitive-search-skill-deprecated">skill deprecation</a> for
* more information.
*/
@Deprecated
V1("#Microsoft.Skills.Text.EntityRecognitionSkill"),

/**
Expand Down
Loading

0 comments on commit e6ed3b0

Please sign in to comment.