Improve File Uploads, Vision Always On #1210

Josh-XT · 2024-06-15T11:38:48Z

Improve File Uploads to Memory

Some file types were identified as needing improvement when chunking into memory while testing. As result, we have several improvements.

Add PowerPoint (PPT/PPTX) upload support

When PowerPoints are uploaded, they will be converted to PDF and handled as PDFs are handled.

Improve PDF uploads

When a PDF file is uploaded, we typically grab the text from it using pdfplumber and chunk the information into memory, which has great results. In addition to that strategy, if a vision_provider is selected for the agent, it will also break the PDF up into images per page for the vision model to answer questions about, and any questions answered about images will be retained in conversational memory.

Improve XLS/XLSX uploads

Uploading XLS/XLSX previously would upload the first sheet to memory, it will now iterate over each sheet, convert it to CSV, and then handle each sheet as CSVs are handled.

Improve CSV uploads

When uploading a CSV or XLS/XLSX file, it will now turn each item into json and add that information to memory to create a new memory per item with reference to where it came from and when. This will greatly improve data analysis, which has also been improved with this update. If a spreadsheet is uploaded at the chat completions endpoint, it will autonomously do data analysis based on user input and output results of executed code for things like graphs from the data.

Vision Always On

With PDFs also splitting into images, it makes sense for context to keep vision on when necessary rather than only when the image is uploaded initially. If you upload an image in a conversation and have a vision_provider defined for your agent, it will send your input to the vision model + the image, get a description, add that to memories for the conversation to be injected by context from the user's input. If relevant enough to the conversational memories, it will use the vision model with each interaction with the image in context essentially now.

agixt/Prompts.py

Josh-XT added 3 commits June 15, 2024 07:01

add convert ppt to pdf

c2216bd

add pdf2img

4cee87e

Recall images and persist vision through conversations

7e8dbbd

Josh-XT changed the title ~~Add PowerPoint Upload Support~~ Add PowerPoint Upload Support, Conversational Vision Persistence Jun 15, 2024

Josh-XT added 4 commits June 15, 2024 12:10

fix broken ref

dc949d2

remove images ref

dff3408

fix error

31afb38

add data analysis function

16ab540

github-advanced-security bot found potential problems Jun 15, 2024

View reviewed changes

agixt/Prompts.py Fixed Show fixed Hide fixed

agixt/Prompts.py Fixed Show fixed Hide fixed

Josh-XT added 4 commits June 15, 2024 14:35

add multifile data analysis support

46dc8f5

improve prompt

b8acd13

add analyze_csv to pipeline

da35a51

make sure path starts with current dir

f9d5395

github-advanced-security bot found potential problems Jun 15, 2024

View reviewed changes

agixt/Prompts.py Fixed Show fixed Hide fixed

agixt/Prompts.py Fixed Show fixed Hide fixed

Josh-XT added 5 commits June 15, 2024 16:29

handle path properly

242f503

improve handling of xls and xlsx

94cae66

improve xls handling

7d4b8fb

always convert xls to csv

00d7834

improve csv item format

5cbf1b6

Josh-XT changed the title ~~Add PowerPoint Upload Support, Conversational Vision Persistence~~ Improve File Uploads, Vision Always On Jun 15, 2024

Josh-XT marked this pull request as ready for review June 15, 2024 21:31

Josh-XT merged commit d139bae into main Jun 15, 2024
7 checks passed

Josh-XT deleted the add-powerpoint-support branch June 15, 2024 21:37

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Improve File Uploads, Vision Always On #1210

Improve File Uploads, Vision Always On #1210

Josh-XT commented Jun 15, 2024 •

edited

Loading

Improve File Uploads, Vision Always On #1210

Improve File Uploads, Vision Always On #1210

Conversation

Josh-XT commented Jun 15, 2024 • edited Loading

Improve File Uploads to Memory

Add PowerPoint (PPT/PPTX) upload support

Improve PDF uploads

Improve XLS/XLSX uploads

Improve CSV uploads

Vision Always On

Josh-XT commented Jun 15, 2024 •

edited

Loading