UnicodeDecodeError: 'utf-16-be' codec can't decode byte 0x45 in position 0: truncated data #1293
Labels
workflow-text-extraction
From a users perspective, text extraction is the affected feature/workflow
See #1269 for further details, this reports another issue I've come accross.
Environment
Which environment were you using when you encountered the problem?
$ python -m platform Linux-5.4.0-122-generic-x86_64-with-glibc2.29 $ python -c "import PyPDF2;print(PyPDF2.__version__)" 2.10.3
Code + PDF
This is a minimal, complete example that shows the issue:
PDF used above: 2007,ASurveyofImageClassificationBasedTechniques.pdf
Traceback
This is the complete Traceback I see:
The PDF can be read using a normal PDF viewer.
This may be related to #969 (comment)
The text was updated successfully, but these errors were encountered: