-
Notifications
You must be signed in to change notification settings - Fork 981
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Writing picture enrichment annotations to Markdown file #625
Comments
hi @theobgbd , What is the meaning of Picture Enrichment in Docling ? I'm searching for a way to add context to images of my pdfs which will eventually help in image summarization. Is picture enrichment the answer ? |
Picture enrichment currently has specific typing for classification, description (like model captioning), charts, chemical strucrures, and a generic one We have some idea on how to enable the serialization of that data when exporting to markdown or other formats. |
@gauravmindzk What I meant by "picture enrichment" was model captionning with a vision LLM, with added context from the document. For my use case, I used [this template] https://ds4sd.github.io/docling/examples/develop_picture_enrichment/ ) as a base to set up a call to an on-premise Pixtral instance to caption the images, and then store the answer in the
@dolfim-ibm Thanks for your feedback, I didn't know about the |
Image annotations to MD
Following the Figure Enrichment tutorial it is easy to add classification metadata to an image through the
element.annotations.append(data)
function.However this data is not stored during the export to Markdown format. Would there be a way to write it alongside the image path in the final MD document ? Would be great for our RAG application.
The text was updated successfully, but these errors were encountered: