core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` #22613

thdesc · 2024-06-06T13:18:46Z

Description: Add support for path and detail keys in ImagePromptTemplate. Previously, only variables associated with the url key were considered. This PR allows for the inclusion of a local image path and a detail parameter as input to the format method.
Issues:
- fixes How to use ImagePromptTemplate #20820
- related to Dinamically format HumanMessage list of dictionaries for multimodal LLM #22024
Dependencies: None
Twitter handle: @DeschampsTho5

…" key

vercel · 2024-06-06T13:18:49Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

1 Skipped Deployment

Name	Status	Preview	Comments	Updated (UTC)
langchain	⬜️ Ignored (Inspect)	Visit Preview		Jul 3, 2024 6:43pm

thdesc · 2024-06-10T15:44:17Z

Hi @eyurtsev do you have some time to review this PR soon ? I'm planning to take some time off, so I won't be able to iterate on your reviews afterward. Thanks!

thdesc · 2024-06-12T13:17:33Z

Hi @baskaryan @eyurtsev @efriis anyone of you have some time to review this PR please ? This should allow users to pass a path (local path to an image file) and a detail parameter (to control how the LLM processes the image) when formatting an HumanMessagePromptTemplate with a image_url.

Here is an example of how it would work:

from langchain_core.prompts import HumanMessagePromptTemplate, ChatPromptTemplate
from langchain_core.messages import HumanMessage

image_path = 'path_to_your_image.jpg'
detail_parameter = 'high'

chat_prompt_template = ChatPromptTemplate.from_messages(
    messages=[
        HumanMessage(content='Describe the following image.'),
        HumanMessagePromptTemplate.from_template(
            [{'image_url': {'path': {image_path}, 'detail': {detail_parameter}}]
        )
    ]
)

prompt = chat_prompt_template.format(image_path=image_path, detail_parameter=detail_parameter)

As you can see in this discussion #20820 there are several questions about how to use the ImagePromptTemplate with a HumanMessagePromptTemplate and I think this PR could help.

Thank you !

thdesc · 2024-06-12T13:29:53Z

Currently to upload a base 64 encoded image, users have to do something like this:

import base64

from langchain_core.prompts import HumanMessagePromptTemplate, ChatPromptTemplate
from langchain_core.messages import HumanMessage

def encode_image(image_path):
  with open(image_path, "rb") as image_file:
    return base64.b64encode(image_file.read()).decode('utf-8')

# Path to your image
image_path = "path_to_your_image.jpg"

# Getting the base64 string
base64_image = encode_image(image_path)

chat_prompt_template = ChatPromptTemplate.from_messages(
    messages=[
        HumanMessage(content='Describe the following image.'),
        HumanMessagePromptTemplate.from_template(
            [{'image_url': {'url': 'data:image/jpeg;base64,{base64_image}', 'detail': 'high'}]  # the detail parameter has to be hard-coded and can't be formatted at runtime.
        )
    ]
)

prompt = chat_prompt_template.format(base64_image=base64_image)

eyurtsev · 2024-06-13T14:00:11Z

On mobile right now so won't be able to review properly but from the PR description the API doesn't look correct

chat_prompt_template = ChatPromptTemplate.from_messages(
    messages=[
        HumanMessage(content='Describe the following image.'),
        HumanMessagePromptTemplate.from_template(
            [{'image_url': {'path': {image_path}, 'detail': {detail_parameter}}]
        )
    ]
)

prompt = chat_prompt_template.format(image_path=image_path, detail_parameter=detail_parameter)

Image path has already been set when defining the human prompt template. It doesn't appear that format does anything?

thdesc · 2024-06-13T22:33:01Z

@eyurtsev Oh yes you are 100% right sorry for that. It is a mistake in the comment only not in the PR I think. I should have wrote this:

from langchain_core.prompts import HumanMessagePromptTemplate, ChatPromptTemplate
from langchain_core.messages import HumanMessage

image_path = 'path_to_your_image.jpg'
detail_parameter = 'high'

chat_prompt_template = ChatPromptTemplate.from_messages(
    messages=[
        HumanMessage(content='Describe the following image.'),
        HumanMessagePromptTemplate.from_template(
            [{'image_url': {'path': '{image_path}', 'detail': '{detail_parameter}'}]
        )
    ]
)

prompt = chat_prompt_template.format(image_path=image_path, detail_parameter=detail_parameter)

thdesc · 2024-06-20T01:03:17Z

Hi @eyurtsev, did you have some time to review this PR please ?

libs/core/tests/unit_tests/prompts/test_chat.py

eyurtsev · 2024-07-01T18:26:00Z

@thdesc thanks for updating the PR! Merging now :)

thdesc · 2024-07-08T15:06:12Z

@eyurtsev Thanks for the merge! Sorry I wasn't available last week and didn't have my computer to fix the broken test. I want to create another PR related to the ImagePromptTemplate. I will do it soon. Thanks again!

tdeschamps added 4 commits June 6, 2024 12:52

get templates variables for keys "paths" and "detail" along with "url…

af5b8d9

…" key

use path format in unittest

69f3c59

fix lint and format

c49f46f

fix lint and format

7a5a772

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. 🤖:improvement Medium size change to existing code to handle new use-cases labels Jun 6, 2024

thdesc added 2 commits June 6, 2024 15:53

Merge branch 'langchain-ai:master' into dict_template_for_images

0830d21

Merge branch 'master' into dict_template_for_images

ee4c855

thdesc changed the title ~~core: extract input variables for path and detail keys in order to format an ImagePromptTemplate~~ langchain-core[minor]: extract input variables for path and detail keys in order to format an ImagePromptTemplate Jun 10, 2024

ccurme added the Ɑ: core Related to langchain-core label Jun 19, 2024

eyurtsev self-assigned this Jun 20, 2024

eyurtsev added 2 commits June 25, 2024 12:07

Merge branch 'master' into fork/thdesc/dict_template_for_images

4d6b1cb

x

4d060e4

dosubot bot added size:L This PR changes 100-499 lines, ignoring generated files. and removed size:M This PR changes 30-99 lines, ignoring generated files. labels Jun 25, 2024

eyurtsev reviewed Jun 25, 2024

View reviewed changes

libs/core/tests/unit_tests/prompts/test_chat.py Show resolved Hide resolved

eyurtsev reviewed Jun 25, 2024

View reviewed changes

libs/core/tests/unit_tests/prompts/test_chat.py Outdated Show resolved Hide resolved

Update libs/core/tests/unit_tests/prompts/test_chat.py

e9a0bee

eyurtsev changed the title ~~langchain-core[minor]: extract input variables for path and detail keys in order to format an ImagePromptTemplate~~ langchain-core[patch]: extract input variables for path and detail keys in order to format an ImagePromptTemplate Jun 25, 2024

eyurtsev approved these changes Jun 25, 2024

View reviewed changes

dosubot bot added the lgtm PR looks good. Use to confirm that a PR is ready for merging. label Jun 25, 2024

thdesc added 4 commits June 26, 2024 13:12

Merge branch 'master' into dict_template_for_images

0dd8ade

Merge branch 'master' into dict_template_for_images

3849f82

Merge branch 'master' into dict_template_for_images

4a7398e

Merge branch 'master' into dict_template_for_images

af91008

eyurtsev enabled auto-merge (squash) July 1, 2024 18:25

eyurtsev disabled auto-merge July 1, 2024 18:25

eyurtsev changed the title ~~langchain-core[patch]: extract input variables for path and detail keys in order to format an ImagePromptTemplate~~ core[patch]: extract input variables for path and detail keys in order to format an ImagePromptTemplate Jul 1, 2024

eyurtsev enabled auto-merge (squash) July 1, 2024 18:25

baskaryan and others added 2 commits July 3, 2024 11:28

Merge branch 'master' into dict_template_for_images

91729ac

fix test

4a59700

dosubot bot added size:M This PR changes 30-99 lines, ignoring generated files. and removed size:L This PR changes 100-499 lines, ignoring generated files. labels Jul 3, 2024

x

4284869

eyurtsev merged commit 39b19cf into langchain-ai:master Jul 3, 2024
134 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` #22613

core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` #22613

thdesc commented Jun 6, 2024

vercel bot commented Jun 6, 2024 •

edited

Loading

thdesc commented Jun 10, 2024

thdesc commented Jun 12, 2024 •

edited

Loading

thdesc commented Jun 12, 2024

eyurtsev commented Jun 13, 2024

thdesc commented Jun 13, 2024 •

edited

Loading

thdesc commented Jun 20, 2024

eyurtsev commented Jul 1, 2024 •

edited

Loading

thdesc commented Jul 8, 2024

core[patch]: extract input variables for path and detail keys in order to format an ImagePromptTemplate #22613

core[patch]: extract input variables for path and detail keys in order to format an ImagePromptTemplate #22613

Conversation

thdesc commented Jun 6, 2024

vercel bot commented Jun 6, 2024 • edited Loading

thdesc commented Jun 10, 2024

thdesc commented Jun 12, 2024 • edited Loading

thdesc commented Jun 12, 2024

eyurtsev commented Jun 13, 2024

thdesc commented Jun 13, 2024 • edited Loading

thdesc commented Jun 20, 2024

eyurtsev commented Jul 1, 2024 • edited Loading

thdesc commented Jul 8, 2024

core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` #22613

core[patch]: extract input variables for `path` and `detail` keys in order to format an `ImagePromptTemplate` #22613

vercel bot commented Jun 6, 2024 •

edited

Loading

thdesc commented Jun 12, 2024 •

edited

Loading

thdesc commented Jun 13, 2024 •

edited

Loading

eyurtsev commented Jul 1, 2024 •

edited

Loading