Added cursor to parser #287

Razican · 2020-03-30T15:57:01Z

This adds a simple cursor structure to the new parser (#281), which simplifies the parsing operations and makes the parser easier to understand.

This also brings token referencing to the parser, to avoid much of the cloning. Most of the remaining cloning only happens when we need to return an error, in which case it doesn't matter if it takes a bit longer. For the rest, I think this should be faster. This closes #284.

I also took the opportunity to further improve the documentation and the parser errors.

Razican · 2020-03-30T15:59:08Z

boa/src/syntax/parser/mod.rs

 }

 macro_rules! expression { ( $name:ident, $lower:ident, [ $( $op:path ),* ] ) => {
    fn $name (&mut self) -> ParseResult {
        let mut lhs = self. $lower ()?;
-        while let Ok(tok) = self.peek_skip_lineterminator() {
+        while let Some(tok) = self.peek_skip_lineterminator().cloned() {


Could we somehow avoid this? It seems that Rust gives a "multiple mutable reference" error when you return a read-only reference from a mutable reference more than once. Is this something the compiler should be able to understand is safe?

Hmm, without checking this out and playing with it I’m not sure. I will need to take a look.

Razican · 2020-03-30T15:59:19Z

boa/src/syntax/parser/mod.rs

            let args = self.read_arguments()?;
            let call_node = Node::Call(Box::new(lhs), args);

            Node::New(Box::new(call_node))
        } else {
            self.read_primary_expression()?
        };
-        while let Ok(tok) = self.peek_skip_lineterminator() {
-            match tok.kind {
+        while let Some(tok) = self.peek_skip_lineterminator().cloned() {


Could we somehow avoid this? It seems that Rust gives a "multiple mutable reference" error when you return a read-only reference from a mutable reference more than once. Is this something the compiler should be able to understand is safe?

Razican · 2020-03-30T15:59:26Z

boa/src/syntax/parser/mod.rs

        }

-        while let Ok(tok) = self.peek_skip_lineterminator() {
+        while let Some(tok) = self.peek_skip_lineterminator().cloned() {


Could we somehow avoid this? It seems that Rust gives a "multiple mutable reference" error when you return a read-only reference from a mutable reference more than once. Is this something the compiler should be able to understand is safe?

I found a way. If Cursor takes a &'a [Token] instead of an owning Vec<Token>, and the functions that return a token reference (next, next_if, etc., also in Parser) are annotated to return &'a Token, you can eliminate the clone.
BTW I'm still new to Rust lifetimes, so there are probably better ways of doing this.

Yeah, I think this is correct. It's the same thing I would have done.
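
To make the suggestion above concrete, here is a minimal sketch of a cursor that borrows its tokens. The `Token` type, the field names, and the `count_leading_plus` helper are simplified placeholders invented for this illustration, not the actual code in this PR. The point it tries to show is that a `&'a Token` return value is tied to the token slice rather than to the `&mut self` borrow, so the cursor can be advanced while an earlier token reference is still in use.

```rust
/// Hypothetical, simplified token type used only for this sketch.
#[derive(Debug, Clone, PartialEq)]
struct Token {
    kind: String,
}

/// A cursor that borrows the tokens for `'a` instead of owning a `Vec<Token>`.
struct Cursor<'a> {
    tokens: &'a [Token],
    pos: usize,
}

impl<'a> Cursor<'a> {
    fn new(tokens: &'a [Token]) -> Self {
        Self { tokens, pos: 0 }
    }

    /// Returns the current token and advances. The reference lives for `'a`
    /// (the token slice), not for the duration of the `&mut self` borrow.
    fn next(&mut self) -> Option<&'a Token> {
        let tok = self.tokens.get(self.pos);
        if tok.is_some() {
            self.pos += 1;
        }
        tok
    }

    /// Returns the current token without advancing.
    fn peek(&self) -> Option<&'a Token> {
        self.tokens.get(self.pos)
    }
}

/// Hypothetical helper: consumes leading "+" tokens and counts them.
fn count_leading_plus(cursor: &mut Cursor<'_>) -> usize {
    let mut count = 0;
    while let Some(tok) = cursor.peek() {
        if tok.kind != "+" {
            break;
        }
        // Mutating the cursor is allowed here even though `tok` is still
        // alive, because `tok` borrows the slice and not the cursor.
        cursor.next();
        debug_assert_eq!(tok.kind, "+");
        count += 1;
    }
    count
}
```

With an elided lifetime on the return type (`Option<&Token>`), the reference would instead be tied to the borrow of `self`, which is the situation that forces the `.cloned()` call in the loops above.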

Razican · 2020-03-30T16:28:16Z

It seems that this new implementation is giving a test error:

---- exec::tests::spread_with_arguments stdout ----
thread 'exec::tests::spread_with_arguments' panicked at 'assertion failed: `(left == right)`
  left: `"undefined"`,
 right: `"1"`', boa/src/exec/tests.rs:55:5

Razican · 2020-03-30T16:45:34Z

boa/src/syntax/parser/cursor.rs

+    /// // Do some stuff that might change the cursor position...
+    /// cursor.seek(pos_save);
+    /// ```
+    pub(super) fn seek(&mut self, pos: usize) {


Maybe it would make sense to abstract over this and provide a stack with breakpoints. So that we just say cursor.create_breakpoint() and then cursor.to_last_breakpoint(). But we need to remember to go back in all cases, or to remove the breakpoint if we won't use it anymore.

It's worth pointing out that in the future I would like to have the lexer send the parser tokens through a channel (most likely Crossbeam). This will allow us to have concurrent parsing (or streaming parsing, as V8/SpiderMonkey call it). It offers a big performance boost.

So you need to imagine we may not even have all the tokens yet, in which case seek may fail (we jump too far ahead, past the current buffer).

I don't think we need to change things around now, but it's worth building with that feature in mind.

I might just make an issue to discuss streaming parsing. Then we can tackle it afterwards.

Moved into an issue here:
jasonwilliams#288
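
As a rough illustration of the concern above, the sketch below shows a cursor fed by a channel whose `seek` can fail when the target lies beyond the tokens received so far. Everything here is an assumption made for the example: it uses `std::sync::mpsc` as a stand-in for Crossbeam, and `StreamCursor`, `SeekError`, and `spawn_lexer` are hypothetical names, not part of this PR or of issue #288.

```rust
use std::sync::mpsc::{channel, Receiver};
use std::thread;

/// Hypothetical, simplified token type used only for this sketch.
#[derive(Debug, Clone, PartialEq)]
struct Token(String);

/// Error returned when seeking past the tokens buffered so far.
#[derive(Debug, PartialEq)]
struct SeekError {
    requested: usize,
    buffered: usize,
}

/// A cursor that receives tokens from a lexer running on another thread.
struct StreamCursor {
    rx: Receiver<Token>,
    buffer: Vec<Token>,
    pos: usize,
}

impl StreamCursor {
    fn new(rx: Receiver<Token>) -> Self {
        Self { rx, buffer: Vec::new(), pos: 0 }
    }

    fn pos(&self) -> usize {
        self.pos
    }

    /// Returns the next token, blocking until the lexer sends one.
    /// Returns `None` once the lexer has finished and the channel is drained.
    fn next(&mut self) -> Option<Token> {
        if self.pos == self.buffer.len() {
            let tok = self.rx.recv().ok()?;
            self.buffer.push(tok);
        }
        // Cloned for simplicity; this sketch does not address borrowing.
        let tok = self.buffer[self.pos].clone();
        self.pos += 1;
        Some(tok)
    }

    /// Seeking only works within the tokens that have already arrived.
    fn seek(&mut self, pos: usize) -> Result<(), SeekError> {
        if pos > self.buffer.len() {
            return Err(SeekError { requested: pos, buffered: self.buffer.len() });
        }
        self.pos = pos;
        Ok(())
    }
}

/// Stand-in for a lexer: sends each word as a token from a separate thread.
fn spawn_lexer(words: Vec<String>) -> Receiver<Token> {
    let (tx, rx) = channel();
    thread::spawn(move || {
        for word in words {
            if tx.send(Token(word)).is_err() {
                break; // The parser side hung up.
            }
        }
    });
    rx
}
```

Seeking backwards to a saved position still works in this model, because every received token stays in the buffer; only jumps ahead of the buffer are rejected.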

I think seek() is fine for now.
Like you said with the breakpoint idea, it could open a huge rabbit hole.
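
For reference, the breakpoint idea from this thread could be sketched roughly as follows. This was not implemented in the PR. `create_breakpoint` and `to_last_breakpoint` are the names proposed in the comment; `discard_breakpoint`, the `char` tokens, and the field names are assumptions added for the example.

```rust
/// A cursor over borrowed tokens with a stack of saved positions.
/// `char` stands in for a real token type in this sketch.
struct Cursor<'a> {
    tokens: &'a [char],
    pos: usize,
    breakpoints: Vec<usize>,
}

impl<'a> Cursor<'a> {
    fn new(tokens: &'a [char]) -> Self {
        Self { tokens, pos: 0, breakpoints: Vec::new() }
    }

    /// Returns the current token and advances.
    fn next(&mut self) -> Option<&'a char> {
        let tok = self.tokens.get(self.pos);
        if tok.is_some() {
            self.pos += 1;
        }
        tok
    }

    /// Saves the current position on the breakpoint stack.
    fn create_breakpoint(&mut self) {
        self.breakpoints.push(self.pos);
    }

    /// Rewinds to the most recent breakpoint and removes it.
    /// Returns `false` if there was no breakpoint to return to.
    fn to_last_breakpoint(&mut self) -> bool {
        match self.breakpoints.pop() {
            Some(pos) => {
                self.pos = pos;
                true
            }
            None => false,
        }
    }

    /// Drops the most recent breakpoint without moving the cursor
    /// (hypothetical name). Returns `false` if the stack was empty.
    fn discard_breakpoint(&mut self) -> bool {
        self.breakpoints.pop().is_some()
    }
}
```

The drawback raised in the comment is visible here: every `create_breakpoint` needs a matching `to_last_breakpoint` or `discard_breakpoint` on every code path, otherwise stale positions accumulate on the stack.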

boa/src/syntax/parser/mod.rs

Co-Authored-By: HalidOdat <[email protected]>

Razican · 2020-03-31T10:09:28Z

Thanks @HalidOdat! This fixes the test, and I think this is ready for a merge :)

Razican added 2 commits March 30, 2020 13:04

First implementation of a cursor

614e9f2

Finished parsing with cursor

7d78f5b

Razican commented Mar 30, 2020

Remove example in private doc

97aff10

Razican commented Mar 30, 2020

jasonwilliams mentioned this pull request Mar 30, 2020

Streaming Parsing (from Lexer to Parser) #288

Closed

HalidOdat reviewed Mar 31, 2020

Fix failing test with spread operator

e24c3d2

Co-Authored-By: HalidOdat <[email protected]>

jasonwilliams merged commit e251c8f into boa-dev:parser Mar 31, 2020
