Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PDF-processing feature #2505

Open
Am0stafa opened this issue Dec 19, 2024 · 0 comments
Open

PDF-processing feature #2505

Am0stafa opened this issue Dec 19, 2024 · 0 comments

Comments

@Am0stafa
Copy link

I have requested this feature from 8 month and cursor team wasn’t able to develop this feature which is basically the functionality for me to upload a pdf in the chat.
Anthropic has introduced a powerful new PDF-processing feature in its Claude API, surpassing basic text extraction, and it has largely flown under the radar.

Historically, many LLMs stumble when documents include complex elements like images, charts, and LaTeX formulas. But Anthropic’s latest upgrade manages to parse both textual and visual content within a PDF—no extra coding wizardry needed.

Key capabilities include:
(1) Automatically parsing PDF text, images, and tables for further analysis, from answering questions about the attached PDF to turning unstructured data into formatted JSONs

(2) Providing insight on charts and diagrams by evaluating visual context, not just textual tags

(3) Extracting and interpreting LaTeX for scientific or technical documentation

It works by splitting each PDF into two components: the text is extracted as normal, and the entire page is converted into an image. Claude then merges text and visual context for a more holistic understanding. It’s essentially combining LLM intelligence with basic computer vision techniques.

The API supports up to 32MB or 100 pages of PDF content and pricing is similar to the LLM pricing so there’s no premium cost for PDF analysis.

This API could dramatically streamline how we handle financial reports, legal docs, or any PDF requiring detailed interpretation.

Ready to run notebook analyzing Anthropic's constitutional AI paper here https://lnkd.in/ekyThDTC

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant