SuperFences produces different/"invalid" HTML after 10.11 #2471

mvelikikh · 2024-09-29T10:21:15Z

Description

SuperFences started producing "invalid" HTML after 10.11.
It happens for code blocks without language that use hl_lines, e.g.:

``` hl_lines="1"
some
code
```

I am aware that it got changed in that version but it breaks the code that is working fine in 10.10.2.

Minimal Reproduction

use the following python code

import markdown
text = ('``` hl_lines="1"\n'
        "test\n"
        "data\n"
        "```\n"
        "\n"
        "some text\n"
        "```sql\n"
        "select * from dual;\n"
        "```")
html = markdown.markdown(text, extensions=['pymdownx.superfences'])
print(html)

run in 10.10.2:

<pre class="highlight"><code>test
data</code></pre>
<p>some text
<pre class="highlight"><code class="language-sql">select * from dual;</code></pre></p>

run in 10.11:

<p>``` hl_lines="1"
test
data
<pre class="highlight"><code>
some text
```sql
select * from dual;</code></pre></p>

Version(s) & System Info

Operating System: Windows 10
Python Version: 3.12
Package Version: pymdown-extensions-10.11

The text was updated successfully, but these errors were encountered:

facelessuser · 2024-09-29T14:46:31Z

This was not an explicitly tested case, so that is most likely why this regression occurred. It is perfectly reasonable for this to be expected to work though.

Most likely, the language is now getting parsed from hl_lines in this case, and then we have a weird option of = remaining. The language match should enforce that we have a word boundary and no trailing = to be considered a language \b(?!=).

We adjusted the header but also cleaned up the regex to be a bit more precise with the reworked logic. Where before we incidentally allowed an omitted language, now we must explicitly expect it and define it such that it won't be confused with attributes or options: option="value".

Fixes #2471

facelessuser · 2024-09-29T15:34:20Z

It ends up being slightly more complicated pattern update as we have to account for names with non-word chars, so a simple check of (?=[\t ]|$) ends up validating the language specifier much better. So now we don't run into issue when using things like c++ etc.

facelessuser · 2024-09-29T17:03:19Z

The fix has been released. Thanks for the report!

mvelikikh added the T: bug Bug. label Sep 29, 2024

gir-bot added the S: triage Issue needs triage. label Sep 29, 2024

mvelikikh added a commit to mvelikikh/mvelikikh.github.io that referenced this issue Sep 29, 2024

Fix pymdown-extensions version: facelessuser/pymdown-extensions#2471

29b4382

facelessuser added a commit that referenced this issue Sep 29, 2024

Fix omitted language case

58f8e24

Fixes #2471

facelessuser mentioned this issue Sep 29, 2024

Fix omitted language case #2472

Merged

facelessuser closed this as completed in #2472 Sep 29, 2024

facelessuser closed this as completed in d43141d Sep 29, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SuperFences produces different/"invalid" HTML after 10.11 #2471

SuperFences produces different/"invalid" HTML after 10.11 #2471

mvelikikh commented Sep 29, 2024

facelessuser commented Sep 29, 2024

facelessuser commented Sep 29, 2024

facelessuser commented Sep 29, 2024

SuperFences produces different/"invalid" HTML after 10.11 #2471

SuperFences produces different/"invalid" HTML after 10.11 #2471

Comments

mvelikikh commented Sep 29, 2024

Description

Minimal Reproduction

Version(s) & System Info

facelessuser commented Sep 29, 2024

facelessuser commented Sep 29, 2024

facelessuser commented Sep 29, 2024