You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The EverNoteLoader treats an export from Evernote as a very large text document by combining the content from all notes into a single long text string.
This isn't terribly useful when interrogating data from Evernote whereby you might export an entire notebook which contains many notes, see an example notebook export below which has two notes. Ideally we should treat each note as an independent document with it's own richer metadata e.g. created, updated, title etc. to make retrieval more effective.
@eyurtsev / @dev2049 I have submitted a PR to make this improvement. Thanks for all your work on Langchain, really enjoying using it. Looking forward to your feedback!
# Improve Evernote Document Loader
When exporting from Evernote you may export more than one note.
Currently the Evernote loader concatenates the content of all notes in
the export into a single document and only attaches the name of the
export file as metadata on the document.
This change ensures that each note is loaded as an independent document
and all available metadata on the note e.g. author, title, created,
updated are added as metadata on each document.
It also uses an existing optional dependency of `html2text` instead of
`pypandoc` to remove the need to download the pandoc application via
`download_pandoc()` to be able to use the `pypandoc` python bindings.
Fixes#4493
Co-authored-by: Mike McGarry <[email protected]>
Co-authored-by: Dev 2049 <[email protected]>
Feature request
The
EverNoteLoader
treats an export from Evernote as a very large text document by combining the content from all notes into a single long text string.It also only saves the name of the export file as metadata on this large document.
This isn't terribly useful when interrogating data from Evernote whereby you might export an entire notebook which contains many notes, see an example notebook export below which has two notes. Ideally we should treat each note as an independent document with it's own richer metadata e.g. created, updated, title etc. to make retrieval more effective.
Motivation
Looking to add a tool to an agent which can interrogate my Evernote journal entries.
Your contribution
I can put together a PR for this.
The text was updated successfully, but these errors were encountered: