Add caching to BaseChatModel (issue #1644) #5089
Conversation
…A/langchain into 1644-BaseChatModel-Caching
Any comments on this? Would be great to have caching included!
Someone please take a look at this. Really need this :) Thanks
- Resolved merge conflict
- Implemented general version of _combine_llm_outputs
- Cleaned up
Need this too.
I hope it's reviewed soon, we need caching for ChatModels!
import time

import langchain
from langchain.cache import InMemoryCache, SQLiteCache
from langchain.chat_models import ChatOpenAI
from langchain.schema import HumanMessage, SystemMessage

langchain.llm_cache = SQLiteCache(database_path=".langchain.db")
chat = ChatOpenAI(temperature=0, openai_api_key=get_openai_api_key())  # get_openai_api_key() is a helper defined elsewhere in my code
messages = [
    SystemMessage(content="You are a helpful assistant that translates English to French."),
    HumanMessage(content="I love programming."),
]
start = time.time()
print(chat(messages))
print(f"first time = {time.time() - start}")
start = time.time()
print(chat(messages))
print(f"second time = {time.time() - start}")

I tested this code with this PR. The first request misses the cache, so it works. But the second request hits the cache, and an error occurs.
With InMemoryCache, the test code works fine:

langchain.llm_cache = InMemoryCache()
chat = ChatOpenAI(temperature=0, openai_api_key=get_openai_api_key())
messages = [
    SystemMessage(content="You are a helpful assistant that translates English to French."),
    HumanMessage(content="I love programming."),
]
start = time.time()
print(chat(messages))
print(f"first time = {time.time() - start}")
start = time.time()
print(chat(messages))
print(f"second time = {time.time() - start}")

My guess is that InMemoryCache is just a Python dictionary, so it stores the data as ChatGeneration objects. However, SQLiteCache is a local database, so it stores the data as Generation objects. On a cache hit with SQLiteCache (and the other cache types), the loaded data is of type Generation, not ChatGeneration, so there is no "message" property on the loaded data.
Hey @Rienkim, thanks for pointing that out! I'll take a look & add tests that cover more of the caching options. ETA should be this week. In the meantime, I'll turn this PR into a draft.
@Rienkim Fixed it & added more tests
@UmerHA thank you for working on this. I found that
@deepblue could you elaborate on your concerns? I am waiting for the feature :)
Just realized I made an error in my code. I tested and confirmed that it's working as expected.
Good to go?
Any update on this?
langchain/chat_models/base.py
@@ -59,7 +110,24 @@ class Config:
        arbitrary_types_allowed = True

    def _combine_llm_outputs(self, llm_outputs: List[Optional[dict]]) -> dict:
        return {}
        """Combine general llm outputs by aggregating them into lists
this seems separate? am going to revert this
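For readers without the full diff, a rough sketch of what a list-aggregating _combine_llm_outputs could look like (my reconstruction from the docstring above, not the PR's actual code):

from typing import Any, Dict, List, Optional

def _combine_llm_outputs(self, llm_outputs: List[Optional[dict]]) -> dict:
    # Hypothetical generalization: collect each key's values across all outputs into a list.
    combined: Dict[str, List[Any]] = {}
    for output in llm_outputs:
        if not output:
            continue
        for key, value in output.items():
            combined.setdefault(key, []).append(value)
    return combined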
            )
        else:
            result = self._generate(messages, stop=stop, **kwargs)
            langchain.llm_cache.update(prompt, llm_string, result.generations)
Won't we be storing ChatGenerations if _generate creates ChatGenerations? In which case, do we need to do the extra parsing in line 218?
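As a side check, the round trip can be exercised in isolation; a small sketch (assumed usage, with stand-in strings for the serialized prompt and model params) showing that InMemoryCache returns exactly what was stored:

from langchain.cache import InMemoryCache
from langchain.schema import AIMessage, ChatGeneration

cache = InMemoryCache()
prompt = "[SystemMessage(...), HumanMessage(...)]"  # stand-in for the serialized messages
llm_string = "openai-chat temperature=0"            # stand-in for the serialized model params

# Store ChatGenerations and read them back: in memory, the type survives the round trip.
cache.update(prompt, llm_string, [ChatGeneration(message=AIMessage(content="Bonjour"))])
hit = cache.lookup(prompt, llm_string)
print(type(hit[0]).__name__)  # ChatGeneration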
@@ -14,7 +14,7 @@ from langchain.cache import InMemoryCache
langchain.llm_cache = InMemoryCache()

# The first time, it is not yet in cache, so it should take longer
llm("Tell me a joke")
llm.predict("Tell me a joke")
Why is this changing?
to use the consistent predict/predict_messages interface
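For context, the two call styles side by side (illustrative only; assumes an OpenAI key is configured in the environment):

from langchain.llms import OpenAI

llm = OpenAI(temperature=0)
llm("Tell me a joke")          # older __call__ style the example used before
llm.predict("Tell me a joke")  # predict style the example was switched to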
@@ -163,7 +179,7 @@ def update(self, prompt: str, llm_string: str, return_val: RETURN_VAL_TYPE) -> None:
    def clear(self, **kwargs: Any) -> None:
        """Clear cache."""
        with Session(self.engine) as session:
            session.execute(self.cache_schema.delete())
            session.query(self.cache_schema).delete()
Why is this changing?
Yeah, not sure; it's from a previous PR. I'll revert.
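For reference, a small standalone sketch (the table and engine here are mine, not langchain's actual cache schema) of the two SQLAlchemy delete styles being compared:

from sqlalchemy import Column, String, create_engine, delete
from sqlalchemy.orm import Session, declarative_base

Base = declarative_base()

class FakeCache(Base):  # stand-in for the real cache_schema
    __tablename__ = "fake_cache"
    prompt = Column(String, primary_key=True)

engine = create_engine("sqlite://")
Base.metadata.create_all(engine)

with Session(engine) as session:
    session.execute(delete(FakeCache))  # core-style delete of all rows
    session.query(FakeCache).delete()   # ORM query-style delete of all rows
    session.commit()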
Getting the below error when I use MomentoCache. Edit: The error pops up for all calls after the cache has at least 1 key set.

@root_validator
def set_text(cls, values: Dict[str, Any]) -> Dict[str, Any]:
>       values["text"] = values["message"].content
E       KeyError: 'message'

Code:

import langchain
from datetime import timedelta
from langchain.cache import MomentoCache

langchain.llm_cache = MomentoCache.from_client_params("langchain_momento", timedelta(days=1))
# Further code for constructing and calling the chain using ChatOpenAI
Can you post the full code, error message, and stack trace?
Add caching to BaseChatModel
Fixes #1644
(Sidenote: While testing, I noticed we have multiple implementations of Fake LLMs used for testing. I consolidated them.)
Who can review?
Community members can review the PR once tests pass. Tag maintainers/contributors who might be interested:
Models
Twitter: @UmerHAdil | Discord: RicChilligerDude#7589