Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PyPDF2 doesn't read Cyrillic letters #375

Closed
artsiomkaltovich opened this issue Nov 2, 2017 · 4 comments
Closed

PyPDF2 doesn't read Cyrillic letters #375

artsiomkaltovich opened this issue Nov 2, 2017 · 4 comments
Labels
is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF PdfReader The PdfReader component is affected

Comments

@artsiomkaltovich
Copy link

artsiomkaltovich commented Nov 2, 2017

Hello, all.
When I tried to read this file http://rec.gov.by/sites/default/files/pdf/Elections-PRB2015-itog1.pdf
I got only digits, but no letters.
I used following code:

import PyPDF2

reader = PyPDF2.PdfFileReader("Elections-PRB2015-itog1.pdf")
page = reader.pages[0]
print(page.extract_text())

Is it my mistake or bug?

@ildarcheg
Copy link

@SlavaSTiwari
Copy link

Any solutions to this issue?
Please do share.

@MartinThoma
Copy link
Member

#464 might help

@MartinThoma MartinThoma added is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF PdfReader The PdfReader component is affected labels Apr 8, 2022
@MartinThoma MartinThoma changed the title pyPDF2 doesn't read Cyrillic letters PyPDF2 doesn't read Cyrillic letters Jun 27, 2022
@MartinThoma
Copy link
Member

Using PyPDF2==2.4.0, it seems to work. I'm closing this now - please let me know if you encounter issues with the latest version of PyPDF2.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF PdfReader The PdfReader component is affected
Projects
None yet
Development

No branches or pull requests

4 participants