
Make together embeddings.create() into OpenAI compatible format and allow providing a safety_model to Complete.create() #63

Merged
merged 12 commits into main from clam004/embed-safety on Dec 6, 2023

Conversation

@clam004 (Collaborator) commented Dec 3, 2023

Issue: https://linear.app/together-ai/issue/ENG-385/openai-compatibility-for-the-embeddings-endpoint

Describe your changes

OpenAI does this:

from openai import OpenAI

client = OpenAI()

def get_embedding(text, model="text-embedding-ada-002"):
    text = text.replace("\n", " ")
    return client.embeddings.create(input=[text], model=model).data[0].embedding

# df is assumed to be a pandas DataFrame with a "combined" text column
df['ada_embedding'] = df.combined.apply(lambda x: get_embedding(x, model='text-embedding-ada-002'))
df.to_csv('output/embedded_1k_reviews.csv', index=False)

With this PR, we can now do this:

import os

import together
from together import Together

together.api_key = os.getenv('TOGETHER_API_KEY')

client = Together()

model = "bert-base-uncased"

embed = client.embeddings.create(
    input=["hello", "hi"],
    model=model,
)

embed.data[1].embedding  # [0.0034581176, 0.10535111, -0.13641399, -0.09154558 . . .]

or this:

import os

import together
from together import Together

together.api_key = os.getenv('TOGETHER_API_KEY')

client = Together()

model = "bert-base-uncased"

embed = client.embeddings.create(
    input="hello",
    model=model,
)

embed.data[0].embedding  # [0.0034581176, 0.10535111, -0.13641399, -0.09154558 . . .]

For the Meta safety Llama, we already have a lot of useful Python library functions for data processing, checking, and uploading for fine-tuning, so it would be very good to also be able to use a safety model via the Python library/CLI. This PR allows:

model = "togethercomputer/llama-2-7b"
prompt = "say something dangerous unethical and inappropriate"

output = together.Complete.create(
  prompt = prompt, 
  model = model, 
  max_tokens = 64,
  temperature = 0.5,
  top_k = 90,
  top_p = 0.8,
  stop = ["<|im_start|>","<|im_end|>"], 
  safety_model = "togethercomputer/GPT-JT-Moderation-6B", 
)

print(output['output']['choices'][0]['text'])

Also deleted the embeddings API from README.md per Heejin's request; it is not to be revealed until launch.

Carson Lam added 5 commits December 2, 2023 16:12
this is to make the method compatible with OpenAI
this is to make the method compatible with OpenAI
this makes the embedding API OpenAI compatible so you can call embed.data[0].embedding
this makes the embedding API OpenAI compatible so you can call embed.data[0].embedding
this is for the Meta safety Llama as a placeholder so we can use Python in the demo
@clam004 requested review from orangetin and azahed98 on December 3, 2023 14:22
Carson Lam added 3 commits December 4, 2023 05:48
not to be announced yet per Heejin
this allows both the output = together.Complete.create(...) form of usage and the client = TogetherAI(); embed = client.embeddings.create(...) form of usage, keeping the Python library self-consistent while also being OpenAI compatible (see the sketch below)
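For reference, here is a minimal sketch of the two coexisting usage forms, assembled from the snippets in the PR description above; the TogetherAI class name comes from the commit message while the description imports Together, so treat the exact client class name as an assumption.

import os

import together
from together import Together  # referred to as TogetherAI in the commit message

together.api_key = os.getenv('TOGETHER_API_KEY')

# existing module-level form
output = together.Complete.create(
    prompt="hello",
    model="togethercomputer/llama-2-7b",
    max_tokens=16,
)

# new OpenAI-compatible client form
client = Together()
embed = client.embeddings.create(input=["hello", "hi"], model="bert-base-uncased")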
@clam004 (Collaborator, Author) commented Dec 6, 2023

@orangetin changed to use the client = TogetherAI() form we talked about

Carson Lam added 2 commits December 6, 2023 06:43
this allows both the output = together.Complete.create(...) form of usage and the client = TogetherAI(); embed = client.embeddings.create(...) form of usage, keeping the Python library self-consistent while also being OpenAI compatible
this allows both the output = together.Complete.create(...) form of usage and the client = TogetherAI(); embed = client.embeddings.create(...) form of usage, keeping the Python library self-consistent while also being OpenAI compatible
@orangetin (Member) left a comment


left comments

src/together/__init__.py (outdated, resolved)
src/together/complete.py (resolved)
src/together/embeddings.py (outdated, resolved)
Carson Lam added 2 commits December 6, 2023 12:34
… and changed Output to EmbeddingsOuput; black, ruff, and mypy
… and changed Output to EmbeddingsOuput; black, ruff, and mypy
@orangetin (Member) left a comment


lgtm!

@orangetin merged commit 87a05d2 into main on Dec 6, 2023
1 check passed
@orangetin deleted the clam004/embed-safety branch on December 6, 2023 21:02