Support prompt caching for Claude AI chat models #546
Labels: `area:backend` (related to the backend server: everything in aux-records and aux-server/aux-backend), `enhancement` (new feature or request)
Anthropic has added the ability to cache input prompts for Claude models, which improves performance and reduces costs. The caveat is that writing to the cache costs more than a regular input token, but the benefit is that a prompt that hits the cache costs 90% less than it would without caching. CasualOS should support specifying whether a message can be cached, and the `AnthropicAIChatInterface` should pass the corresponding values along to the Anthropic API.
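
A minimal sketch of what this could look like. The `cacheable` flag, the `ChatMessage` shape, and `sendWithCaching` are hypothetical placeholders, not the actual CasualOS types; the Anthropic Messages API itself marks cache breakpoints with a `cache_control: { type: 'ephemeral' }` field on individual content blocks:

```typescript
import Anthropic from '@anthropic-ai/sdk';

// Hypothetical message shape with an opt-in caching flag.
interface ChatMessage {
    role: 'user' | 'assistant';
    content: string;
    cacheable?: boolean;
}

async function sendWithCaching(client: Anthropic, messages: ChatMessage[]) {
    return client.messages.create({
        model: 'claude-3-5-sonnet-20241022',
        max_tokens: 1024,
        messages: messages.map((m) => ({
            role: m.role,
            content: [
                {
                    type: 'text' as const,
                    text: m.content,
                    // When the message is flagged as cacheable, mark this
                    // block as a cache breakpoint. The entire prompt prefix
                    // up to and including this block becomes eligible for
                    // caching on subsequent requests.
                    ...(m.cacheable
                        ? { cache_control: { type: 'ephemeral' as const } }
                        : {}),
                },
            ],
        })),
    });
}
```

Note that the API only caches prefixes above a model-specific minimum length (e.g. 1024 tokens for Claude 3.5 Sonnet) and allows a limited number of cache breakpoints per request (currently 4), so the interface may want to validate or limit where the flag is honored.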