ENH: Arabic Language Support (diacritics) #1578
Labels
is-feature
A feature request
workflow-arabic-text-extraction
Related to text extraction, but with a focus on Arabic text
workflow-text-extraction
From a user's perspective, text extraction is the affected feature/workflow
This issue summarizes feedback from Karen McNeil: #1547 (reply in thread)
Environment
$ python -m platform
Linux-5.4.0-137-generic-x86_64-with-glibc2.31
$ python -c "import pypdf;print(pypdf.__version__)"
3.2.0
Code + PDF
https://github.com/py-pdf/pypdf/files/10485679/arabic_sample.pdf
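The reproduction code is not preserved here, so the following is only a minimal sketch of how the extraction could be checked, not the reporter's original script. It assumes the PDF above has been saved locally as `arabic_sample.pdf`. The helpers `extract_first_page`, `count_diacritics` and `strip_diacritics` are hypothetical names introduced for illustration. Counting Unicode category `Mn` characters is an assumed proxy for Arabic diacritics (it also matches non-Arabic combining marks).

```python
import unicodedata


def extract_first_page(pdf_path: str) -> str:
    """Extract the text of the first page using pypdf's PdfReader."""
    # Imported lazily so the diacritic helpers below work without pypdf installed.
    from pypdf import PdfReader

    reader = PdfReader(pdf_path)
    return reader.pages[0].extract_text()


def count_diacritics(text: str) -> int:
    """Count nonspacing combining marks (category Mn), e.g. fatha, kasra, shadda."""
    return sum(1 for ch in text if unicodedata.category(ch) == "Mn")


def strip_diacritics(text: str) -> str:
    """Return the text with all nonspacing combining marks removed."""
    return "".join(ch for ch in text if unicodedata.category(ch) != "Mn")


# Possible usage (assumes the sample file is present locally):
#   actual = extract_first_page("arabic_sample.pdf")
#   print(actual)
#   print("diacritics found:", count_diacritics(actual))
```

Comparing `count_diacritics` on the pypdf output against the same count on text copied from another viewer would show whether diacritics are being dropped. Comparing the `strip_diacritics` versions of both would show whether the base letters agree.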
Actual Extract
Expected (via Google Chrome)