Original and Converted Document Dimensions Don't Match #626
Comments
It turns out that this is caused by a mismatch in pixel density between converting the document into pixels and reassembling it back into a PDF. This is further confirmed by the difference in document size when doing OCR (uses
In Doc to Pixels, it uses However, when reassembling it, with OCR it uses To fix this I've had success in setting the tesseract-ocr to
Fortunately, in my case I already have this package installed. In the general case, however, I think it should be one of the deb package's dependencies.
Oh, that part is fine because tesseract comes pre-installed in the container image we use. So even if one's system doesn't have tesseract, it will still work.
The converted document was larger in dimensions than the original one due to a mismatch in DPI settings. When converting documents to pixels, we were setting the resolution to 150 DPI. Then, when converting back into a PDF, we were using 70 DPI. This difference resulted in a document with larger overall dimensions (though not necessarily a larger file size). Fixes #626
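To see why the two DPI values change the page size, here is a small arithmetic sketch. It is not the project's code: the helper `page_size_inches` and the US Letter input page are assumptions made for illustration, and only the 150 and 70 DPI figures come from the description above.

```python
def page_size_inches(width_px, height_px, dpi):
    """Physical page size implied by a pixel buffer read at a given DPI."""
    return width_px / dpi, height_px / dpi


RASTER_DPI = 150      # DPI used when converting the document to pixels
REASSEMBLE_DPI = 70   # DPI used when converting the pixels back into a PDF

# Hypothetical input: a US Letter page, 8.5 x 11 inches.
width_px = round(8.5 * RASTER_DPI)    # 1275 pixels
height_px = round(11 * RASTER_DPI)    # 1650 pixels

# Reading the same pixels at a lower DPI yields a physically larger page.
out_w, out_h = page_size_inches(width_px, height_px, REASSEMBLE_DPI)
scale = RASTER_DPI / REASSEMBLE_DPI

print(f"output page: {out_w:.1f} x {out_h:.1f} in")   # about 18.2 x 23.6 in
print(f"linear scale factor: {scale:.2f}")            # about 2.14
```

The pixel count is unchanged between the two steps, which is consistent with the note that the file size does not necessarily grow; only the declared physical dimensions do. Using the same DPI in both steps makes the scale factor 1.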
Comment by @j75 from another issue:
I had just come across this as well: