Improve BibTeX-from-PDF import #11999
Labels
📍 Assigned
Assigned by assign-issue-action (or manually assigned)
good first issue
An issue intended for project-newcomers. Varies in difficulty.
!! This is more an issue to experiment with heuristics. How can a machine with "traditional" (non-AI) code create useful information. !!
When importing the PDF se2paper.pdf
one gets following BibTeX entry
However, the title should be better:
The properties of the file show
Tasks:
org.jabref.logic.importer.fileformat.PdfMergeMetadataImporter#importDatabase(java.nio.file.Path)
)stripper.setSortByPosition(true);
needs to be removed fromorg.jabref.logic.importer.fileformat.PdfContentImporter#getFirstPageContents
. Maybe, two methods need to be done to be able to parse the title (depending on positon) and parsing the abstract (more on content)Hint:
org.jabref.logic.importer.fileformat.PdfContentImporterTest
and ´org.jabref.logic.importer.fileformat.PdfMergeMetadataImporterTest`The text was updated successfully, but these errors were encountered: