
core+partners/anthropic: Anthropic prompt caching #25644

Conversation

mrdrprofuroboros

Description: Adds support for Anthropic prompt caching; see #25625.
Issue: #25625
Dependencies: bump anthropic>=0.34.0

  • Just found that it fails a test; I'll fix it and add a usage example to the notebook. For now, here's an example:
from langchain_anthropic import ChatAnthropic
from langchain_core.messages import HumanMessage, SystemMessage

model = ChatAnthropic(model="claude-3-5-sonnet-20240620", beta=True)
model.invoke([
    SystemMessage([{
        "type": "text",
        "text": "foo" * 1000,
        # mark this block (and the prefix before it) for prompt caching
        "cache_control": {"type": "ephemeral"}
    }]),
    HumanMessage("hi!"),
])
AIMessage(content='Hello! How can I assist you today?', response_metadata={'id': 'msg_01XnYziv7oZtaRw23d45ivSi', 'model': 'claude-3-5-sonnet-20240620', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'cache_creation_input_tokens': 1500, 'cache_read_input_tokens': 0, 'input_tokens': 9, 'output_tokens': 12}}, id='run-31a51a4f-bb0c-4b24-9c75-d7f7e5f99a89-0', usage_metadata={'input_tokens': 9, 'output_tokens': 12, 'total_tokens': 21, 'cache_creation_input_tokens': 1500, 'cache_read_input_tokens': 0})

@efriis efriis added the partner label Aug 21, 2024
@efriis efriis self-assigned this Aug 21, 2024

@dosubot dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. Ɑ: core Related to langchain-core 🔌: anthropic Primarily related to Anthropic integrations 🤖:improvement Medium size change to existing code to handle new use-cases labels Aug 21, 2024
@dosubot dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels Aug 22, 2024
@mrdrprofuroboros
Author

@efriis oof, main is moving ahead fast. My changes passed tests/lint before I updated the branch with "Update branch". Would you mind taking a look and helping me figure out the next steps, or how I can improve this PR to get it merged?

Collaborator

@baskaryan baskaryan left a comment


Thanks for the contribution! I'm very much in favor of including cache token usage in ChatAnthropic outputs, but I think we'll want to make sure we do it in a future-proof, generalizable way.
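As a purely illustrative sketch of what a more generalizable shape could look like (hypothetical; not what this PR or the review proposes), the cache counters could be nested under a provider-agnostic details mapping instead of being added as top-level provider-specific fields:

from typing_extensions import NotRequired, TypedDict

class InputTokenDetails(TypedDict, total=False):
    # hypothetical, provider-agnostic cache counters
    cache_creation: int  # tokens written to the prompt cache
    cache_read: int  # tokens served from the prompt cache

class UsageMetadata(TypedDict):
    input_tokens: int
    output_tokens: int
    total_tokens: int
    # optional details bucket; providers without caching simply omit it
    input_token_details: NotRequired[InputTokenDetails]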

@@ -51,6 +51,10 @@ class UsageMetadata(TypedDict):
    """Count of output (or completion) tokens."""
    total_tokens: int
    """Total token count."""
    cache_creation_input_tokens: NotRequired[int]
Collaborator


I don't think we want to add this to core until at least one or two other providers support a similar feature.

@property
def _messages_client(self) -> Messages:
    if self.beta:
        return self._client.beta.prompt_caching.messages  # type: ignore[attr-defined]
Collaborator


This feels more specific than a plain "beta" flag indicates. Are we going to update the client to beta.{x}.messages every time there's a new beta feature?

Also, is cache usage not returned if you use the regular client with the beta headers?

Author


Oh nice, it actually works. Here's an example:

from langchain_anthropic import ChatAnthropic
from langchain_core.messages import HumanMessage, SystemMessage

# regular (non-beta) client; prompt caching is enabled via the beta header
model = ChatAnthropic(
    model="claude-3-opus-20240229",
    temperature=0,
    extra_headers={"anthropic-beta": "prompt-caching-2024-07-31"}
)

chat = [
    SystemMessage([{
        "type": "text",
        "text": "foo" * 1000,
        "cache_control": {"type": "ephemeral"},
    }]),
    HumanMessage("Hi"),
]

model.invoke(chat)

returning

AIMessage(content='Hello! How can I assist you today?', response_metadata={'id': 'msg_01EuihUPN9JrbzZXuZd6oEu8', 'model': 'claude-3-opus-20240229', 'stop_reason': 'end_turn', 'stop_sequence': None, 'usage': {'input_tokens': 8, 'output_tokens': 12, 'cache_creation_input_tokens': 1500, 'cache_read_input_tokens': 0}}, id='run-a13ecd02-d669-4028-b8a2-56e5113d2417-0', usage_metadata={'input_tokens': 8, 'output_tokens': 12, 'total_tokens': 20})
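For reference, a minimal way to read the cache counters from that response, based on the output above (with this approach they show up under response_metadata["usage"] rather than usage_metadata):

response = model.invoke(chat)

usage = response.response_metadata["usage"]
print(usage["cache_creation_input_tokens"])  # 1500 on this first, cache-writing call
print(usage["cache_read_input_tokens"])      # 0 here; >0 once the prefix is served from the cache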


Is there a way to include variables in the system prompt while still using the "cache_control": {"type": "ephemeral"} parameter?
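A minimal sketch of one way to do this (assuming the model instance from the example above; the variable user_name is hypothetical): keep the large static text in its own cache-controlled block and append the variable text as a separate block after it, so the cached prefix stays identical across calls:

from langchain_core.messages import HumanMessage, SystemMessage

static_context = "foo" * 1000  # large, unchanging text worth caching
user_name = "Alice"            # hypothetical per-request variable

system = SystemMessage([
    {
        "type": "text",
        "text": static_context,
        # the cache covers the prefix up to and including this block
        "cache_control": {"type": "ephemeral"},
    },
    {
        # variable text goes after the cached block, so the cached prefix never changes
        "type": "text",
        "text": f"The user's name is {user_name}.",
    },
])

model.invoke([system, HumanMessage("Hi")])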

@baskaryan baskaryan removed the Ɑ: core Related to langchain-core label Aug 23, 2024
@efriis efriis assigned baskaryan and unassigned efriis Aug 24, 2024
@mrdrprofuroboros mrdrprofuroboros deleted the anthropic-prompt-cache branch October 14, 2024 17:11
Labels
🔌: anthropic Primarily related to Anthropic integrations 🤖:improvement Medium size change to existing code to handle new use-cases partner size:L This PR changes 100-499 lines, ignoring generated files.
4 participants