Azure AI Inference SDK - Beta 2 updates #36163

dargilco · 2024-06-19T18:28:10Z

The main reason for this release, shortly after the first release:

Add strongly-typed model as an optional input argument to the complete method of ChatCompletionsClient. This is required for a high-visiblity project. For this project, developers must set model.

Breaking change (noted in CHANGELOG.md):

The field input_tokens was removed from class EmbeddingsUsage, as this was never defined in the
REST API and the service never returned this value.

Other changes in this release:

Addressing some test dept (work in progress)
- Add tests for setting model_extras for sync and async clients. Make sure the additional parameters appear at the root of the JSON request payload, and make sure the unknown_parameters HTTP request header was set to pass_through.
- Add tests to validate serialization of a dummy chat completion request that includes all type of input objects. This is a regression test (no service response needed), as the test looks at the JSON request payload and compared to a hard-coded expected string, that was previously verified by hand. This test includes the new model argument, as well as all other arguments defined by the REST API. It will catch any regressions in hand-written code.
Update ref docs to remove mentioning of the old extras input argument to chat completions in hand-written code. The name was changed to model_extras before the first release, but looks like we still had some left-over ref-doc comments that describe the no-longer-existing argument.
Remove unused function from the sample sample_chat_completions_with_image_data.py. Forgot to do that in the first release.
Minor changes to root README.md
Indicate that complete method with stream=True returns Iterable[StreamingChatCompletionsUpdate] for
the synchronous ChatComletionsClient, and Iterable[StreamingChatCompletionsUpdate] for the asynchronous
ChatCompletionsClient. Per feedback from Anna T.
Update environment variable names used by sample code and test to start with "AZURE_AI" as common elsewhere, per feedback from Rob C.

azure-sdk · 2024-06-19T19:01:10Z

API change check

API changes are not detected in this pull request.

…erence-6-14

dargilco added 2 commits June 18, 2024 16:25

Minor fix to root README.md and one sample

c4d3951

Add tests for model_extras

353c875

github-actions bot added the AI Model Inference Issues related to the client library for Azure AI Model Inference (\sdk\ai\azure-ai-inference) label Jun 19, 2024

dargilco self-assigned this Jun 19, 2024

dargilco added 5 commits June 19, 2024 16:02

Add test for full chat completions payload

71c382d

Merge remote-tracking branch 'origin/main' into dargilco/azure-ai-inf…

486340d

…erence-6-14

Reemit with 'model' support

189f988

Env. variables start with AZURE_AI

60d10fa

Fix Python 3.8 error in test

705c712

dargilco marked this pull request as ready for review June 21, 2024 14:15

dargilco requested review from annatisch and johanste June 21, 2024 14:16

Remove some text from CHANGELOG.md

81e6e25

dargilco requested a review from robch June 21, 2024 14:32

dargilco added 2 commits June 21, 2024 08:44

Remove input_tokens from EmbeddingsUsage

cf5fcfa

Fix return type hint of load_client

5f8696d

dargilco enabled auto-merge (squash) June 21, 2024 17:54

johanste approved these changes Jun 22, 2024

View reviewed changes

dargilco merged commit 444ed8b into main Jun 22, 2024
17 checks passed

dargilco deleted the dargilco/azure-ai-inference-6-14 branch June 22, 2024 00:24

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Azure AI Inference SDK - Beta 2 updates #36163

Azure AI Inference SDK - Beta 2 updates #36163

dargilco commented Jun 19, 2024 •

edited

Loading

azure-sdk commented Jun 19, 2024

Azure AI Inference SDK - Beta 2 updates #36163

Azure AI Inference SDK - Beta 2 updates #36163

Conversation

dargilco commented Jun 19, 2024 • edited Loading

azure-sdk commented Jun 19, 2024

dargilco commented Jun 19, 2024 •

edited

Loading