CleverBird
is a Swift Package that provides a convenient way to interact with OpenAI's chat APIs and perform various tasks, including token counting and encoding. The package is designed to deliver a superior Developer Experience (DX) by making the chat thread the center of the interactions.
CleverBird
includes support for document embeddings and similarity queries. This makes it a versatile tool for a broad range of applications, especially cases where chat prompts need enhanced contextual memory.
CleverBird
is focused narrowly on chat-based interactions, and making them awesome.
Please note that CleverBird
is an unofficial package, not provided by OpenAI itself.
- Asynchronous API calls with Swift's async/await syntax
- Streamed responses for real-time generated content
- Built-in token counting for usage limit calculations
- Token Encoding: Facilitates token counting and encoding through the
TokenEncoder
class. - Document Embedding and Similarity Queries: Utilize the
EmbeddedDocumentStore
class for managing and querying document similarities.
Import the CleverBird
package:
import CleverBird
Initialize an OpenAIAPIConnection
with your API key. Please note that API keys should always be loaded from environment variables, and not hard-coded into your source. After you have loaded your API key, pass it to the initializer of the connection:
let openAIAPIConnection = OpenAIAPIConnection(apiKey: <OPENAI_API_KEY>)
Create a ChatThread
instance with the connection, and add system, user, or assistant messages to the chat thread:
let chatThread = ChatThread(connection: openAIAPIConnection)
.addSystemMessage(content: "You are a helpful assistant.")
.addUserMessage(content: "Who won the world series in 2020?")
The ChatThread
initializer includes a mandatory connection
parameter and various optional parameters. You can set defaults for your thread by using any subset of these optional parameters:
let chatThread = ChatThread(
connection: openAIAPIConnection,
model: .gpt4,
temperature: 0.7,
maxTokens: 500
)
In the example above, we initialized a ChatThread
with a specific model, temperature, and maximum number of tokens. All parameters except connection
are optional. The full list of parameters is as follows:
connection
: The API connection object (required).model
: The model to use for the completion.temperature
: Controls randomness. Higher values (up to 1) generate more random outputs, while lower values generate more deterministic outputs.topP
: This is the nucleus sampling parameter. It specifies the probability mass to cover with the prediction.stop
: An array of strings. The model will stop generating when it encounters any of these strings.maxTokens
: The maximum number of tokens to generate.presencePenalty
: A penalty for using tokens that have already been used.frequencyPenalty
: A penalty for using frequent tokens.user
: The user ID associated with the chat.
Generate a completion using the chat thread:
let completion = try await chatThread.complete()
The response messages are automatically appended onto the thread, so you can continue interacting with it by just adding new user messages and requesting additional completions.
You can customize each call to complete()
with the same parameters as the ChatThread
initializer, allowing you to override the defaults set during initialization:
let completion = try await chatThread.complete(
model: .gpt35Turbo,
temperature: 0.5,
maxTokens: 300
)
In this example, we override the model, temperature, and maximum number of tokens for this specific completion. The parameters you can use in the complete()
method include:
model
: The model to use for the completion.temperature
: Controls randomness. Higher values (up to 1) generate more random outputs, while lower values generate more deterministic outputs.topP
: This is the nucleus sampling parameter. It specifies the probability mass to cover with the prediction.stop
: An array of strings. The model will stop generating when it encounters any of these strings.maxTokens
: The maximum number of tokens to generate.presencePenalty
: A penalty for using tokens that have already been used.frequencyPenalty
: A penalty for using frequent tokens.
All parameters are optional and default to the values set during ChatThread
initialization if not specified.
Generate a completion with streaming using the streaming version of a chat thread:
let chatThread = ChatThread(connection: openAIAPIConnection).withStreaming()
let completionStream = try await chatThread.complete()
for try await messageChunk in completionStream {
print("Received message chunk: \(messageChunk)")
}
Just like with the non-streamed completion, the message will be automatically appended onto the thread after it has finished streaming, but the stream allows you to see it as it's coming through.
Calculate the token count for messages in the chat thread:
let tokenCount = try chatThread.tokenCount()
If you need to count tokens or encode/decode text outside of a chat thread,
use the TokenEncoder
class:
let tokenEncoder = try TokenEncoder(model: .gpt3)
let encodedTokens = try tokenEncoder.encode(text: "Hello, world!")
let decodedText = try tokenEncoder.decode(tokens: encodedTokens)
CleverBird
supports Function Calls. This powerful feature allows developers to define their own custom commands, making it easier to control the behavior of the AI. Function Calls can be included in the ChatThread
and used in the complete()
method.
First, define your function parameters and the function itself. The Function.Parameters
class is used to set the properties and required parameters of your function.
let getCurrentWeatherParameters = Function.Parameters(
properties: [
"location": Function.Parameters.Property(type: .string,
description: "The city and state, e.g. San Francisco, CA"),
"format": Function.Parameters.Property(type: .string,
description: "The temperature unit to use. Infer this from the user's location.",
enumCases: ["celsius", "fahrenheit"])
],
required: ["location", "format"])
let getCurrentWeather = Function(name: "get_current_weather",
description: "Get the current weather",
parameters: getCurrentWeatherParameters)
Then, initialize your ChatThread
with your API connection and an array of functions:
let openAIAPIConnection = OpenAIAPIConnection(apiKey: "your_api_key_here")
let chatThread = ChatThread(connection: openAIAPIConnection,
functions: [getCurrentWeather])
.addSystemMessage(content: "You are a helpful assistant.")
Finally, call the complete()
function to generate a response. If the assistant needs to perform a function during the conversation, it will use the function definitions you provided.
Please note that functions are only supported in non-streaming completions at this time.
The EmbeddedDocumentStore
class provides a convenient way to manage and query a collection of documents based on their similarity. This class allows you to:
- Add documents to an internal store.
- Generate embeddings for those documents using a specified model.
- Query the store for similar documents to a given input document.
First, add an instance of the EmbeddedDocumentStore
to your code:
let openAIAPIConnection = OpenAIAPIConnection(apiKey: "your_api_key_here")
let embeddedDocumentStore = EmbeddedDocumentStore(connection: connection)
You can add a single document or a batch of documents to the store.
let singleDocument = "My single document"
try await embeddedDocumentStore.embedAndStore(singleDocument)
let documentCollection = ["First document", "Second document", "Third document"]
try await embeddedDocumentStore.embedAndStore(documentCollection)
You can query the store for documents that are similar to an input document.
let similarityResults = try await embeddedDocumentStore.queryDocumentSimilarity("Query text here")
let mostSimilarResult = similarityResults.first?.document ?? "No result returned"
The store can be saved to and loaded from a file (represented in JSON format) for persistent storage.
embeddedDocumentStore.save(to: fileURL)
embeddedDocumentStore.load(from: fileURL)
CleverBird
was written by B.T. Franklin (@btfranklin) from 2023 onward and is licensed under the MIT license. See LICENSE.md.