Segmentation fault when calling get_cdrawings(extended=True) #2556

pulsar314 · 2023-07-25T15:04:10Z

Description

If a document contains sequences like

q
100 100 m
W n

invocation of page.get_cdrawings(extended=True) results in a segfault.

In this case dev_pathdict is cleared because the commands list is empty, but jm_lineart_clip_path and jm_lineart_clip_stroke_path still try to read a value from the dict.
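For readers who want a test file of their own, the following is a minimal sketch that assembles a one-page PDF around the sequence quoted above, using only the standard library. It is not the reporter's segfault.pdf; the helper name `build_pdf`, the page size, and the object layout are assumptions made for illustration.

```python
# The sequence quoted in the issue: save state, a lone moveto, then clip.
CLIP_SEQUENCE = b"q\n100 100 m\nW n\n"


def build_pdf(content: bytes) -> bytes:
    """Assemble a one-page PDF whose page content stream is `content`.

    Hypothetical helper for illustration; not part of PyMuPDF.
    """
    objects = [
        b"<< /Type /Catalog /Pages 2 0 R >>",
        b"<< /Type /Pages /Kids [3 0 R] /Count 1 >>",
        b"<< /Type /Page /Parent 2 0 R /MediaBox [0 0 200 200] "
        b"/Resources << >> /Contents 4 0 R >>",
        b"<< /Length %d >>\nstream\n%b\nendstream" % (len(content), content),
    ]

    out = bytearray(b"%PDF-1.4\n")
    offsets = []
    for number, body in enumerate(objects, start=1):
        offsets.append(len(out))  # byte offset of "N 0 obj" for the xref table
        out += b"%d 0 obj\n%b\nendobj\n" % (number, body)

    xref_pos = len(out)
    out += b"xref\n0 %d\n" % (len(objects) + 1)
    out += b"0000000000 65535 f \n"  # mandatory free entry for object 0
    for offset in offsets:
        out += b"%010d 00000 n \n" % offset
    out += b"trailer\n<< /Size %d /Root 1 0 R >>\n" % (len(objects) + 1)
    out += b"startxref\n%d\n%%%%EOF\n" % xref_pos
    return bytes(out)


pdf_bytes = build_pdf(CLIP_SEQUENCE)
```

The resulting bytes can be opened with `fitz.open(stream=pdf_bytes, filetype="pdf")`, after which `get_cdrawings(extended=True)` can be called on the first page. On an affected version such as 1.22.5 this may crash the interpreter, so running it in a subprocess is the safer choice.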

Configuration

Linux MINT
Python 3.8
PyMuPDF 1.22.5, installed via pip

JorjMcKie · 2023-07-26T14:20:09Z

I think it is a duplicate of #2462 / #2539.
To confirm this, do you have an example / reproducing file at hand please?

pulsar314 · 2023-07-27T11:20:44Z

Unfortunately, I cannot provide you with the original PDF, but there is a reproduction of the failing sequence
segfault.pdf

JorjMcKie · 2023-07-28T12:31:40Z

> Unfortunately, I cannot provide you with the original PDF, but there is a reproduction of the failing sequence segfault.pdf

Thanks a lot! Indeed, this error has not been fixed yet.

For text extraction `get_text("words")`, or extractWORDS, words are defined as strings not containing white space. This change allows adding up to 64 characters to also function as delimiters. This makes it possible, for instance, to separate words from punctuation or to decompose an e-mail address into its components.

Other changes:

- Fixing #2522: correcting the typo.
- Remove some unnecessary setting of flags when creating annotations.
- Fixing #2553: Adjust plain text extraction to use the same approach as other variants. This entails using Unicode escape strings on output instead of using the output of fz_chartorune. Another consequence is that standard text output is directed to a fz_buffer instead of a fz_output.
- Fixing #2556: Check for the existence of path dictionaries at every possible place. Includes an additional test function.
- Add functions JM_ignore_rect / JM_ignore_irect which return a bool. The functions return True if the rectangle should be ignored. This is the case for infinite and empty rectangles, but also for any rectangle that has a common edge with the infinite rectangle.
- Support variable setting of character border widths for insert_text() / insert_textbox(). This is a factor to be multiplied with the font size. Default is 0.05 (read: 5% of the font size). This value is relevant for text rendering modes 1 and 2 only.
- Fixing #2637: In Page.insert_textbox, when the last word of a line did not fit in the line buffer, the line position was not increased. This is now handled correctly.
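The #2556 item comes down to a simple guard: do not touch the path dictionary unless the path actually produced one. The sketch below models that idea in plain Python. It is not the C code from the fix; the function names, the tuple-based command format, and the dictionary keys are assumptions chosen for illustration.

```python
from typing import Optional


def build_pathdict(commands: list) -> Optional[dict]:
    """Return a path dictionary, or None if the path draws nothing.

    A lone moveto ("m") produces no drawing items, which mirrors the
    "empty commands list" case described in the issue.
    """
    items = [cmd for cmd in commands if cmd[0] != "m"]
    if not items:
        return None
    return {"items": items}


def clip_path(commands: list, even_odd: bool, out: list) -> bool:
    """Record a clip entry in `out`; return False if the path was skipped."""
    pathdict = build_pathdict(commands)
    if pathdict is None:
        # The guard: without it, the lines below would operate on a
        # dictionary that does not exist.
        return False
    pathdict["type"] = "clip"
    pathdict["even_odd"] = even_odd
    out.append(pathdict)
    return True


drawings: list = []
# Incomplete clip path, as in "100 100 m / W n": silently skipped.
skipped = clip_path([("m", 100, 100)], False, drawings)
# A path with real segments is recorded as usual.
recorded = clip_path([("m", 0, 0), ("l", 10, 0), ("l", 10, 10)], True, drawings)
```

Returning early keeps the incomplete clip out of the results instead of raising an error, so a malformed page still yields whatever valid drawings it contains.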

julian-smith-artifex-com · 2023-09-27T08:28:21Z

Fixed in 1.23.4.

JorjMcKie added the bug label Jul 28, 2023

JorjMcKie added a commit that referenced this issue Jul 28, 2023

Fix #2556

63d8309

Guard against incompletely specified clip paths by checking whether any drawing items have been generated.

JorjMcKie added a commit that referenced this issue Jul 28, 2023

test fix of #2556

df830db

Ensure #2556 is fixed properly.

JorjMcKie mentioned this issue Jul 28, 2023

Fix #2556 #2564

Closed

JorjMcKie mentioned this issue Sep 11, 2023

Word delimiter support, fixes #2637, #2556, #2553, #2522 #2661

Closed

JorjMcKie added a commit that referenced this issue Sep 19, 2023

Fix #2556

8669ed7

Immunize against wrong path specifications by checking whether the current path dictionary actually exists.

JorjMcKie closed this as completed in 545defe Sep 20, 2023
