Refactor the scanner #459

ianlewis · 2023-08-29T09:33:17Z

Rob Pike gave a good talk on this subject, "Lexical Scanning in Go":
https://www.youtube.com/watch?v=HxaD_trXwRE
https://go.dev/talks/2011/lex.slide

Some thoughts:

Use a more standard lexer/parser architecture. The current code has a similar CommentScanner/TODOScanner split, but it could be cleaner.
Don't use regular expressions to parse TODO comments. Instead, have a lexer generate lexemes that a parser then assembles into full TODOs.
Rob Pike's idea of making states functions is neat, but I'm not sure I like it when a state has to hold data. Instead, maybe make a state a simple interface with a Run method, so that states can hold data more easily:
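```
// state is one step of the lexer's state machine. Run does the work for
// this state and returns the next state, or nil when lexing is done.
type state interface {
	Run() state
}
```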
Consider building a reusable lexer/parser package using Go generics.

Some alternative implementations:

https://github.com/bbuck/go-lexer - Only simple lexing.
https://github.com/db47h/lex - Only simple lexing.
https://github.com/zalgonoise/lex - Uses generics all the way down to the reader. Also implements a parser interface.


ianlewis · 2023-11-09T21:17:39Z

Alternatively, I could just read from a byte reader and check whether individual bytes match the starting characters of comments, strings, etc. This works because pretty much all languages use ASCII characters for these delimiters, and each ASCII character is a single byte. I could then scan to the end of the line, or to the end of a multi-line comment, to get the comment bytes and decode only those as UTF-8. That way I don't have to decode every byte in every file.

ianlewis added the refactor label (A code refactor or cleanup) Aug 29, 2023

This was referenced Aug 30, 2023:

Concurrency support #121 (Open)
Refactor RuneReader #441 (Closed)

ianlewis changed the title ~~refactor: Refactor the scanner~~ Refactor the scanner Sep 1, 2023

ianlewis added the performance label (An issue with performance) Sep 4, 2023
