Describe the bug
PDFToTextConverter supports text extraction in either stream order or layout order. Some PDFs, because of the way they were built, have a content stream that is completely out of order relative to the physical layout.
Error message
The extracted text comes out completely unordered, so the text (the primary material of our work) is split into out-of-context fragments.
Expected behavior
Let the user choose layout-based extraction through a parameter on the public method.
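As a rough illustration of the requested behavior, the sketch below shows how a `layout` switch could select between the two orderings when calling the `pdftotext` command-line tool directly. The helper names `build_pdftotext_command` and `extract_text` are hypothetical and are not part of PDFToTextConverter; the sketch only assumes that `pdftotext` is installed and accepts its `-layout`, `-raw`, and `-enc` options.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_pdftotext_command(
    pdf_path: Path, layout: bool = True, encoding: Optional[str] = "UTF-8"
) -> List[str]:
    """Build a pdftotext command line (hypothetical helper, not the converter's API)."""
    command = ["pdftotext"]
    if encoding:
        command += ["-enc", encoding]
    # -layout keeps the physical page layout; -raw keeps content stream order.
    command.append("-layout" if layout else "-raw")
    # "-" as the output file sends the extracted text to stdout.
    command += [str(pdf_path), "-"]
    return command


def extract_text(pdf_path: Path, layout: bool = True) -> str:
    """Run pdftotext and return the extracted text (requires pdftotext on PATH)."""
    result = subprocess.run(
        build_pdftotext_command(pdf_path, layout=layout),
        capture_output=True,
        check=True,
    )
    return result.stdout.decode("utf-8", errors="replace")
```

With something like this, a caller whose PDFs have a scrambled content stream could pass `layout=True`, while `layout=False` would keep the stream order for documents where that works better.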
Additional context
To Reproduce
FAQ Check
System: