PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported #656

ghost · 2022-03-13T04:21:26Z

My task is to find and replace the text in pdf, I used pyPDF2 package to replace the text, but when I try to replace I'm receiving an error like

Traceback (most recent call last):
  File "c:\practice_python\sample.py", line 41, in <module>
    page.getContents().setData(replaced_text)
  File "C:\Users\Win\AppData\Local\Programs\Python\Python310\lib\site-packages\PyPDF2\generic.py", line 852, in setData
    raise utils.PdfReadError("Creating EncodedStreamObject is not currently supported")
PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported

and my code is

from PyPDF2 import PdfFileReader, PdfFileWriter

replacements = [
    ("HARIHARAN S", "<your name>")
]

pdf = PdfFileReader(open("samplenda.pdf", "rb"))
writer = PdfFileWriter() 

for page in pdf.pages:
    contents = page.getContents().getData()
    print(type(contents))
    for (a,b) in replacements:
        replaced_text = contents.replace(bytes(a,'utf-8'), bytes(b,'utf-8'))  # .encode('utf-8')
    print(type(replaced_text))
    page.getContents().setData(replaced_text)
    writer.addPage(page)
    
with open("modified.pdf", "wb") as f:
     writer.write(f)

I tried lots of way many times, please help me to solve this error

The text was updated successfully, but these errors were encountered:

add set_data() for encoded streams also, complete FlateEncode to get all requierd attributes Ease data manipulation without going through ContentStream (slow) closes py-pdf#656

pubpub-zz · 2023-05-22T14:39:12Z

I've produced a PR to introduce set_data() into EncodedStreamObject() however not that get_contents() returns a ContentStream Object which where data is processed through operations(). If you want to get content as EncodedStreamObject, you have to access ["/Contents"] data, holding the possible array decomposition.

Closes #656

SpastBanana · 2024-01-29T12:42:43Z

Hi,

Is there any update on this? I got the same error as @ghost

stefan6419846 · 2024-01-29T12:43:56Z

Please open a new issue, filling all the necessary details.

MartinThoma added the is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF label Apr 6, 2022

MartinThoma added the is-feature A feature request label Apr 16, 2022

MartinThoma removed the is-bug From a users perspective, this is a bug - a violation of the expected behavior with a compliant PDF label Mar 15, 2023

pubpub-zz mentioned this issue May 22, 2023

ENH: Add set_data to EncodedStreamObject #1854

Merged

MartinThoma closed this as completed in #1854 Jun 11, 2023

MartinThoma pushed a commit that referenced this issue Jun 11, 2023

ENH: Add set_data to EncodedStreamObject (#1854)

56b33cc

Closes #656

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported #656

PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported #656

ghost commented Mar 13, 2022 •

edited by MartinThoma

Loading

pubpub-zz commented May 22, 2023 •

edited by MartinThoma

Loading

SpastBanana commented Jan 29, 2024

stefan6419846 commented Jan 29, 2024

PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported #656

PyPDF2.utils.PdfReadError: Creating EncodedStreamObject is not currently supported #656

Comments

ghost commented Mar 13, 2022 • edited by MartinThoma Loading

pubpub-zz commented May 22, 2023 • edited by MartinThoma Loading

SpastBanana commented Jan 29, 2024

stefan6419846 commented Jan 29, 2024

ghost commented Mar 13, 2022 •

edited by MartinThoma

Loading

pubpub-zz commented May 22, 2023 •

edited by MartinThoma

Loading