Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat apilinks.json generator #153

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

feat apilinks.json generator #153

wants to merge 4 commits into from

Conversation

flakey5
Copy link
Member

@flakey5 flakey5 commented Nov 29, 2024

Closes #152

Opening this as a draft currently to get feedback on the approach since it's a bit non-trivial relative to the other generators. The apilinks.json file maps things exported by modules to their source locations on Github.

Example:

{
  "SomeClass.prototype.var": "github.com/nodejs/node/tree/<hash>/lib/file.js#L100"
}

This means we need to parse the module's javascript source in addition to its markdown.

What the current approach does is doing:

  • Adds another loading & parsing step for the js source files
    • acorn is used for parsing the source files
    • This intakes .js files, so to run it you need to do pass in node/lib/*.js as the input
    • This is dependent on the Markdown source parsing since it uses the source_link metadata in the docs
  • Exposes the parsed js ast to other generators by adding the ast-js generator
  • api-links generator is based off of the ast-js result

@flakey5 flakey5 requested a review from a team as a code owner November 29, 2024 07:59
@flakey5 flakey5 marked this pull request as draft November 29, 2024 07:59
src/utils/git.mjs Fixed Show fixed Hide fixed
src/utils/git.mjs Fixed Show fixed Hide fixed
src/utils/git.mjs Fixed Show fixed Hide fixed
src/utils/git.mjs Fixed Show fixed Hide fixed
src/utils/git.mjs Outdated Show resolved Hide resolved
src/parser.mjs Outdated Show resolved Hide resolved
@ovflowd
Copy link
Member

ovflowd commented Dec 19, 2024

@flakey5 do you need review here? I think I forgot to review this PR!

@flakey5
Copy link
Member Author

flakey5 commented Dec 20, 2024

do you need review here

As far as the approach yes please

.filter(path => path !== undefined && path.endsWith('.js'))
);

const parsedJsFiles = await parseJsSources(sourceFiles);
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Does this process needs to be blocking? Do we need to parse all these sources beforehand and not on-demand?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We don't need too, this is just consistent to how to we parse markdown sources. We could definitely just store the paths of the sources and parse them in the generator instead though. It'd definitely lower the amount of memory that's allocated for the lifetime of the program

bin/cli.mjs Outdated Show resolved Hide resolved
bin/cli.mjs Outdated Show resolved Hide resolved
src/generators.mjs Outdated Show resolved Hide resolved
src/generators.mjs Outdated Show resolved Hide resolved
@@ -0,0 +1,215 @@
// @ts-check
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What is this comment?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Temporary comment just enabling type checking in the file

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I wonder how expensive these functions are, as it seems that this can quickly get bloated depending on the size of source files.

BTW, could you use https://unifiedjs.com/explore/package/estree-util-visit/ to walk/visit the nodes? This is what I was referring to, instead of plain forEaches and while loops.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Also, it might be worth to move "estree" moving logic to a dedicated "estree parser" IDK

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Since we have a parser for the Markdown files, having one just for Estree makes sense I guess.

Copy link
Member Author

@flakey5 flakey5 Jan 7, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

How about having a createMarkdownLoader and createJsLoader? Then the same with createMarkdownParser and createJsParser

src/loader.mjs Outdated Show resolved Hide resolved
return filePaths.map(async filePath => {
const fileContents = await readFile(filePath, 'utf-8');

return new VFile({ path: filePath, value: fileContents });
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am a bit concerned of the memory allocation that this will cause, some of the source files can be humungous. Have you made some comparisons of heap size differences?

src/parser.mjs Outdated Show resolved Hide resolved
src/utils/git.mjs Outdated Show resolved Hide resolved
Copy link
Member

@ovflowd ovflowd left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Approach in general sounds good; Just, some of these utilities are way too complex and big and the git manipulation scripts are possibly dangerous or at least an easy vulnerability hell.

@ovflowd
Copy link
Member

ovflowd commented Jan 6, 2025

Also, @flakey5 this https://github.com/syntax-tree/estree-util-visit is what I was referring to use to walk through the Nodes of the source code. Since acorn generates an AST that is compatible with estree.

Signed-off-by: flakey5 <[email protected]>
@flakey5 flakey5 marked this pull request as ready for review January 15, 2025 23:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

urgent: feat: add apilinks.json generator
3 participants