feat: added tokenizer params to the listing #47

Merged
merged 15 commits into development on Nov 27, 2023

Conversation

adubovik
Contributor

@adubovik adubovik commented Nov 22, 2023

Added tokenization-related params to the listing as part of #54

DIAL core config:

  • features.tokenizeEndpoint (URL, optional) - the endpoint that tokenizes a prompt
  • features.truncatePromptEndpoint (URL, optional) - the endpoint that serves the context-trimming API call. It trims the chat history so that it fits the number of tokens specified in the request field max_prompt_tokens.
  • tokenizerModel (string, optional) - the reference model whose tokenization algorithm matches that of the given model. It allows a user to perform tokenization on their side, which is possible for models whose tokenization algorithm is publicly known and implemented in an SDK (e.g. GPT and Anthropic). Models fall into families that share the same tokenization algorithm; tokenizerModel points to a representative model from the corresponding family. As a rule of thumb, choose the oldest representative from a family. See the families below.
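For illustration, a hypothetical core config fragment with the three fields above might look like this (the nesting under `models` and the adapter URLs are assumptions; only the field names come from the description):

```yaml
models:
  gpt-4:
    features:
      # assumed adapter URLs, shown for shape only
      tokenizeEndpoint: "http://adapter/openai/deployments/gpt-4/tokenize"
      truncatePromptEndpoint: "http://adapter/openai/deployments/gpt-4/truncate_prompt"
    # oldest representative of the model's tokenization family (see below)
    tokenizerModel: "gpt-4-0314"
```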

Listing:

  • features.tokenize (boolean, optional) - true means that the core exposes the <server host>/v1/deployments/<deployment name>/tokenize endpoint
  • features.truncate_prompt (boolean, optional) - true means that the core exposes the <server host>/v1/deployments/<deployment name>/truncate_prompt endpoint
  • tokenizer_model (string, optional) - the same value as in the core config
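A client can use these flags to decide whether a per-deployment endpoint exists before calling it. The helper below is a minimal sketch (not part of DIAL core); the URL layout follows the listing description above:

```python
# Sketch: resolve a per-deployment feature endpoint from a listing entry.
# `endpoint_for` is a hypothetical helper; the URL shape
# <host>/v1/deployments/<deployment>/<feature> follows the listing fields above.
from typing import Optional


def endpoint_for(listing: dict, host: str, deployment: str, feature: str) -> Optional[str]:
    """Return the endpoint URL if the listing advertises the feature, else None."""
    if listing.get("features", {}).get(feature):
        return f"{host}/v1/deployments/{deployment}/{feature}"
    return None


listing = {
    "features": {"tokenize": True, "truncate_prompt": False},
    "tokenizer_model": "gpt-3.5-turbo-0613",
}

print(endpoint_for(listing, "https://dial.example", "gpt-4", "tokenize"))
# -> https://dial.example/v1/deployments/gpt-4/tokenize
print(endpoint_for(listing, "https://dial.example", "gpt-4", "truncate_prompt"))
# -> None
```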
Model families
```yaml
_tokenization_families:
  - gpt:
    - family_1:
      - gpt-3.5-turbo-0613
      - gpt-3.5-turbo-16k-0613
      - gpt-4-0314
      - gpt-4-32k-0314
      - gpt-4-0613
      - gpt-4-32k-0613
      # tokens_per_message = 3, tokens_per_name = 1
    - family_2:
      - gpt-3.5-turbo-0301
      # tokens_per_message = 4, tokens_per_name = -1
    - family_3:
      - text-embedding-ada-002
      # just a string
  - gpt_encoding: cl100k_base
  - gpt_refs:
    - https://github.com/openai/openai-cookbook/blob/main/examples/How_to_count_tokens_with_tiktoken.ipynb
    - https://platform.openai.com/docs/models
    - https://tiktokenizer.vercel.app/
  - palm:
    - family_1:
      - chat-bison@001
      - codechat-bison@001
      - textembedding-gecko@001
  - anthropic:
    - family_1:
      - anthropic.claude-instant-v1
      - anthropic.claude-v1
      - anthropic.claude-v2
  - anthropic_refs:
    - https://github.com/anthropics/anthropic-sdk-python/blob/main/src/anthropic/_tokenizers.py
```
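The `tokens_per_message` / `tokens_per_name` comments are what distinguish the GPT families: chat token counting adds a fixed per-message and per-name overhead on top of the encoded text, per the OpenAI cookbook linked above. A sketch of that formula, with the encoder injected (a real client would use tiktoken's cl100k_base; the whitespace splitter below is only a deterministic stand-in):

```python
# Sketch of the cookbook's chat token-counting formula. The encoder is a
# parameter; `fake_encode` below is a stand-in, NOT a real tokenizer.
def count_chat_tokens(messages, encode, tokens_per_message=3, tokens_per_name=1):
    """Count prompt tokens for a list of {"role", "content", ["name"]} messages."""
    total = 0
    for message in messages:
        total += tokens_per_message
        for key, value in message.items():
            total += len(encode(value))
            if key == "name":
                total += tokens_per_name
    return total + 3  # every reply is primed with <|start|>assistant<|message|>


fake_encode = lambda s: s.split()  # stand-in for a real BPE encoder

messages = [{"role": "user", "content": "hello there"}]
# tokens_per_message(3) + encode("user")(1) + encode("hello there")(2) + 3 = 9
print(count_chat_tokens(messages, fake_encode))  # -> 9
```

family_1 models use the defaults above; family_2 (gpt-3.5-turbo-0301) would be called with `tokens_per_message=4, tokens_per_name=-1`.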

@adubovik adubovik self-assigned this Nov 22, 2023
@adubovik adubovik added the enhancement New feature or request label Nov 22, 2023
artsiomkorzun previously approved these changes Nov 22, 2023
@astsiapanay astsiapanay self-requested a review November 22, 2023 16:20
@adubovik adubovik linked an issue Nov 24, 2023 that may be closed by this pull request
@adubovik adubovik merged commit 5a07076 into development Nov 27, 2023
5 checks passed
@adubovik adubovik deleted the feat/add-tokenizer-param-to-listing branch November 27, 2023 17:23
Development

Successfully merging this pull request may close these issues.

Extend model listing API with limits (tokenize, rate endpoint)
3 participants