build error: character too large for enclosing character literal type #1

patrickt · 2020-01-07T16:04:59Z

Running tree-sitter test produces the following error when compiling scanner.cc:

emcc command failed - src/scanner.cc:88:32: error: character too large for enclosing character literal type
           lexer->lookahead == '\uFEFF' ||
                               ^
src/scanner.cc:89:32: error: character too large for enclosing character literal type
           lexer->lookahead == '\u2060' ||
                               ^
src/scanner.cc:90:32: error: character too large for enclosing character literal type
           lexer->lookahead == '\u200B';

The text was updated successfully, but these errors were encountered:

simonrepp · 2020-01-07T17:08:18Z

I've encountered this myself - if I remember correctly:

You can comment out the affected lines for the time being if you just want to work on stuff (that's what I did so far), these are more exotic whitespace cases which of course matter, but are irrelevant for the overall implementation and general testing.
It only occurrs in certain compilation scenarios (when building for the wasm target?)
I don't have a solution yet!
If you find one I'd be much obliged :)

In any case, thanks for the report! Glad about your interest in this, let me know if you have more questions and/or issues!

patrickt · 2020-01-07T22:03:19Z

I fixed it by removing the character syntax:

  inline bool is_horizontal_whitespace(TSLexer *lexer) {
    return lexer->lookahead == ' ' ||
           lexer->lookahead == '\t' ||
           lexer->lookahead == 0xFEFF ||
           lexer->lookahead == 0x2060 ||
           lexer->lookahead == 0x200B;
  }

maxbrunsfeld · 2020-01-07T22:04:23Z

Yeah, I don't think '\uFEFF' is a valid C/C++ expression. Single quoted character literals are of type char, which is almost always an 8-bit value. The largest numerical char value is 0xFF.

In more recent versions of C++, you can use a U prefix (e.g. U'\uFEFF'). I think the normal way to write numbers like this is to just use integer syntax, like @patrickt said ☝️ .

simonrepp · 2020-01-08T09:22:27Z

Very cool, thanks patrick for the fix and max for the additional insight, appreciate it!

Commited in c059967

simonrepp closed this as completed Jan 8, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

build error: character too large for enclosing character literal type #1

build error: character too large for enclosing character literal type #1

patrickt commented Jan 7, 2020

simonrepp commented Jan 7, 2020

patrickt commented Jan 7, 2020

maxbrunsfeld commented Jan 7, 2020 •

edited

Loading

simonrepp commented Jan 8, 2020

build error: character too large for enclosing character literal type #1

build error: character too large for enclosing character literal type #1

Comments

patrickt commented Jan 7, 2020

simonrepp commented Jan 7, 2020

patrickt commented Jan 7, 2020

maxbrunsfeld commented Jan 7, 2020 • edited Loading

simonrepp commented Jan 8, 2020

maxbrunsfeld commented Jan 7, 2020 •

edited

Loading