feat: Add Markdown languages #268

nzakas · 2024-08-06T15:16:15Z

This pull request adds two Markdown languages to the plugin:

commonmark - CommonMark syntax
gfm - GitHub-Flavored Markdown syntax

It also adds several rules that apply to Markdown content. These are all intended to find problems and not to format the document. I've included documentation for each rule.

I changed the recommended config to be for linting Markdown content, not using the processor. There is a new processor config that implements the old recommended config.

Note on types: Because the @types/eslint package defines types specific to ESTree, I can't really add more types into the rules until we have a more generic type definition for rules. I use RuleModule to ensure that the plugin itself will type check correctly, but that definition doesn't allow visitor methods for non-ESTree values.

Refs #160

fasttime

Nice work! I noted a few things we may want to get fixed before releasing the changes.

src/language/markdown-source-code.js

src/rules/fenced-code-language.js

src/rules/no-html.js

README.md

fasttime · 2024-08-11T20:47:14Z

src/rules/no-html.js

+                    return;
+                }
+
+                const tagName = node.value.match(/<([a-zA-Z0-9]+)/u)?.[1];


This will not correctly match custom tag names with non-alphanumeric characters, if someone accidentally inserts them in the markdown. For example:

<ng-template> Hello, World! </ng-template>

Will give the error:

HTML element "ng" is not allowed.

I'm not sure what tag names are considered valid by parsers, but I think it would be better to make the regexp more permissive. Maybe like this?

Suggested change

-                const tagName = node.value.match(/<([a-zA-Z0-9]+)/u)?.[1];
+                const tagName = node.value.match(/<([^\/>\s]+)/u)?.[1];

Good point. I'll need to test the parser a bit to see what happens.
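
To make the difference concrete, here is a minimal sketch that runs both patterns from this thread against the ng-template example. The helper names strictTagName and permissiveTagName are hypothetical and not part of the plugin; only the two regular expressions are taken from the suggestion. How the parser itself treats unusual tag names is not covered here.

```javascript
// Hypothetical helpers for illustration; only the two regexes come from the review thread.
const strictTagName = value => value.match(/<([a-zA-Z0-9]+)/u)?.[1];
const permissiveTagName = value => value.match(/<([^\/>\s]+)/u)?.[1];

const html = "<ng-template> Hello, World! </ng-template>";

// The original pattern stops at the first non-alphanumeric character.
console.log(strictTagName(html)); // "ng"

// The suggested pattern reads up to the first "/", ">", or whitespace.
console.log(permissiveTagName(html)); // "ng-template"
```

The permissive pattern accepts almost anything after the opening angle bracket, so it reports whatever the author typed rather than deciding whether the tag name is valid.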

docs/processors/markdown.md

fasttime · 2024-08-11T22:16:13Z

tests/rules/no-missing-label-refs.test.js

+        "[foo][]\n\n[foo]: http://bar.com/image.jpg",
+        "![foo][]\n\n[foo]: http://bar.com/image.jpg"


It looks like GitHub is trimming spaces from within brackets, and even a single line break is ignored. So the following should all be valid equivalents of [foo][]\n\n[foo]: http://bar.com/image.jpg but they are being reported as errors:

"[ foo ][]\n\n[foo]: http://bar.com/image.jpg",
"[foo][ ]\n\n[foo]: http://bar.com/image.jpg",
"[\nfoo\n][\n]\n\n[foo]: http://bar.com/image.jpg",

I'm not sure if this is different in standard CommonMark.

Thanks for the heads up. I'll investigate.

I've tested this with the parser in both CommonMark and GFM modes.

For the first one, it correctly parses in both modes with the label of "foo", so it's trimming the label. The rule is not flagging this as an error.

The second and third are not being parsed as link references in either CommonMark or GFM mode. When I remove the whitespace between the [ and ], then they are recognized as link references. This appears to be a bug in the parser. I'll need to follow up with them, but I can still flag it in the rule.

According to the parser maintainers, [] with any amount of whitespace in between is not supposed to create a link. It seems that CommonMark forbids this and the fact that it's working on GitHub is actually a bug in GitHub's implementation.

So really, only the first example is a valid link. The other two are not considered links and therefore shouldn't check the link reference. I'll need to update accordingly.

We probably also need a no-sloppy-link-ref rule to flag those as not compatible with CommonMark.

Ref: syntax-tree/mdast-util-from-markdown#39 (comment)

Thanks for digging into this. The discussion in the mdast-util-from-markdown issue is helpful. If it's true that the different behavior in GFM is a bug and not the specified behavior, then we can probably use https://spec.commonmark.org/dingus/ as a reference to test how links should behave regardless of the Markdown language.
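
As a rough illustration of the cases discussed in this thread, the sketch below scans raw text for a bracketed label followed by a second pair of brackets that contains only whitespace. The function name findWhitespaceOnlyBrackets and its return shape are hypothetical, and this is not necessarily how the rule in this pull request is implemented; it only shows which of the three examples would be caught by a text search.

```javascript
// Hypothetical helper: finds "[label][ ]" style references where the second
// pair of brackets holds only whitespace (spaces, tabs, or line breaks).
function findWhitespaceOnlyBrackets(text) {
    const pattern = /\[[^\]]*\](\[\s+\])/gu;
    const results = [];

    for (const match of text.matchAll(pattern)) {
        const brackets = match[1];

        // Report the offset of the second "[" rather than the label.
        results.push({
            index: match.index + match[0].length - brackets.length,
            brackets
        });
    }

    return results;
}

const definition = "\n\n[foo]: http://bar.com/image.jpg";

console.log(findWhitespaceOnlyBrackets("[ foo ][]" + definition).length); // 0
console.log(findWhitespaceOnlyBrackets("[foo][ ]" + definition)); // [ { index: 5, brackets: "[ ]" } ]
console.log(findWhitespaceOnlyBrackets("[\nfoo\n][\n]" + definition).length); // 1
```

A plain text search like this ignores context such as code spans and fenced code blocks, so a real rule would need to combine it with the parsed tree.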

Co-authored-by: Francesco Trotta <[email protected]>

nzakas · 2024-08-13T16:08:24Z

Added a new no-invalid-label-refs rule to account for the cases mentioned in #268 (comment)

src/rules/no-invalid-label-refs.js

nzakas · 2024-08-16T19:17:23Z

I updated no-invalid-label-refs so that it highlights the brackets rather than the label. I think that makes more sense because it's the brackets that are the problem.

fasttime · 2024-08-20T10:18:10Z

tests/rules/no-missing-label-refs.test.js

In this rule, the column calculation is incorrect when the link starts after the beginning of the line. For example with this test case:

{
    code: "- - - [foo]",
    errors: [
        {
            messageId: "notFound",
            data: { label: "foo" },
            line: 1,
            column: 8,
            endLine: 1,
            endColumn: 11
        }
    ]
}

This is similar to the issue we had with no-invalid-label-refs.

Ah right, same logic. Thanks.
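
For reference, the expected location in that test case can be derived from character offsets in the full text. The sketch below is only an illustration under that assumption; offsetToLocation is a hypothetical helper, not the utility added in this pull request, and it assumes "\n" line endings.

```javascript
// Hypothetical helper: converts a 0-based offset into a 1-based line and column.
function offsetToLocation(text, offset) {
    const lines = text.slice(0, offset).split("\n");

    return {
        line: lines.length,
        column: lines[lines.length - 1].length + 1
    };
}

const text = "- - - [foo]";
const startOffset = text.indexOf("foo");
const endOffset = startOffset + "foo".length;

console.log(offsetToLocation(text, startOffset)); // { line: 1, column: 8 }
console.log(offsetToLocation(text, endOffset)); // { line: 1, column: 11 }
```

Because the column is counted from the start of the line in the whole document, content before the link on the same line is included automatically.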

fasttime

Out of curiosity, I ran the new rules against the .md files in https://github.com/eslint/eslint.org/. This reported a few hundred violations, some of them a bit unexpected, like no-missing-label-refs on [object Object]. But on a closer look, those were all legitimate violations. Duplicate headings are also very common and probably not something we want to address. Overall, I think this is working well and it's a useful reference for how to add language capability to a plugin.

fasttime · 2024-08-21T09:50:36Z

src/rules/no-invalid-label-refs.js

+
+        /*
+         * Search the entire document text to find the preceding open bracket.
+         * We add one to the start index to account for a preceding ! in an image.


We are no longer adding one.

fasttime · 2024-08-21T10:43:05Z

Just the above note, then LGTM.

nzakas · 2024-08-21T14:00:57Z

Thanks! Yeah Markdownlint has some other options for these rules that allow for more edge cases, but I wanted to start simple. We can always add more options to the rules as we go.

nzakas added 5 commits July 30, 2024 16:01

chore: Add type checking

fd92f37

feat: Add type checking

5dd8e74

feat: Add language

ce6279a

Merge files

0592f5a

feat: Add Markdown language

cd13b0c

eslint-github-bot bot added the feature label Aug 6, 2024

Update prepare script

99f8b7c

fasttime added the accepted label Aug 11, 2024

fasttime reviewed Aug 12, 2024

nzakas and others added 10 commits August 12, 2024 11:11

Update src/language/markdown-source-code.js

c332c6b

Co-authored-by: Francesco Trotta <[email protected]>

Update src/rules/fenced-code-language.js

59918a0

Co-authored-by: Francesco Trotta <[email protected]>

Update src/rules/no-html.js

c7530f1

Co-authored-by: Francesco Trotta <[email protected]>

Update docs/processors/markdown.md

35a516f

Co-authored-by: Francesco Trotta <[email protected]>

Update docs/processors/markdown.md

5281a3f

Co-authored-by: Francesco Trotta <[email protected]>

Update docs/processors/markdown.md

8d03f48

Co-authored-by: Francesco Trotta <[email protected]>

Fix link

d7f96cc

fix no-html

c618611

Fix no-missing-label-refs

47d94e7

Add no-invalid-label-ref; add util

21b5226

fasttime reviewed Aug 15, 2024

nzakas added 2 commits August 15, 2024 13:50

Fix linting error

7b6e770

Fix no-invalid-label-ref

48cfd9a

fasttime reviewed Aug 16, 2024

Fix bugs in no-invalid-label-refs

548fdab

fasttime reviewed Aug 20, 2024

fix no-missing-label-refs locations

5b0c2cc

fasttime reviewed Aug 21, 2024

Update comment

aa625a1

nzakas merged commit d79c42b into main Aug 21, 2024
11 checks passed

nzakas deleted the language2 branch August 21, 2024 14:34

github-actions bot mentioned this pull request Aug 5, 2024

chore: release 6.0.0 🚀 #267

Merged
