Implement lexer int128 support #11571

BlobCodes · 2021-12-11T14:42:04Z

Supersedes #11196
Related to #8373
Closes #7915
Related to #5545

This PR implements int128 support in the lexer.

HertzDevil · 2021-12-12T03:58:30Z

What is UInt64 doing here?

In src\lib_c\x86_64-windows-msvc\c\dbghelp.cr:6:33

 6 | SYMOPT_LOAD_LINES           = 0x00000010
                                   ^---------
Error: 0x00000010 doesn't fit in an UInt64

BlobCodes · 2021-12-12T10:42:31Z

Quoting HertzDevil: "What is UInt64 doing here?"

In src\lib_c\x86_64-windows-msvc\c\dbghelp.cr:6:33

 6 | SYMOPT_LOAD_LINES           = 0x00000010
                                   ^---------
Error: 0x00000010 doesn't fit in an UInt64

On Windows, the hex-to-decimal conversion fails because it is now handled using .to_u128?(base: 16).to_s (requires #11551).

When @token.value is nil and no suffix is given, the default error message says the literal doesn't fit in the largest integer type. That type should now be UInt128, so I should change the message.
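A minimal sketch of the conversion described above, assuming a Crystal version where String#to_u128? is available (the comment says this requires #11551). The helper name hex_digits_to_decimal is hypothetical and is not the lexer's actual code; it only illustrates why a failed conversion leaves a nil value behind.

```crystal
# Hypothetical helper (not the lexer's actual code): converts the digits of a
# hexadecimal literal to a decimal string, or returns nil when the value does
# not fit in a UInt128.
def hex_digits_to_decimal(digits : String) : String?
  digits.to_u128?(base: 16).try(&.to_s)
end

p hex_digits_to_decimal("00000010")       # the literal from the dbghelp.cr report
p hex_digits_to_decimal("f" * 32)         # 32 hex digits: the largest UInt128
p hex_digits_to_decimal("1" + "0" * 32)   # 2**128 is out of range, so nil
```

If String#to_u128? is missing or broken on a platform, the call cannot produce a value, which would be consistent with the "doesn't fit" error reported on Windows.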

BlobCodes · 2021-12-12T10:45:59Z

I could also fall back to u64 on Windows and report that Windows doesn't support u128 integers yet, but I think it's better to wait until they are supported everywhere.

src/compiler/crystal/syntax/lexer.cr

BlobCodes · 2021-12-12T12:30:35Z

Does this need to be changed to include U/Int128? (I don't really know much about codegen)

crystal/src/compiler/crystal/codegen/types.cr, line 199 in a9ee750:

  @compile_time_value : (Int16 | Int32 | Int64 | Int8 | UInt16 | UInt32 | UInt64 | UInt8 | Bool | Char | Nil)

EDIT: Oh, these are the integers coming from the MathInterpreter, which doesn't support Int128 yet.

oprypin · 2021-12-12T12:47:53Z

I think it's not fully obvious what type should be deduced for literals that don't have an explicit suffix.
I kicked off a discussion of that in the issue, please check it out:
#8373 (comment)

BlobCodes · 2021-12-12T12:54:19Z

The newest commit allows constant 128-bit math expressions to be evaluated at compile time.
Basically, the LLVM IR before:

  call void @"~A:init"(), !dbg !24
  %57 = load i128, i128* @A, !dbg !24
  call void @"*puts<Int128>:Nil"(i128 %57), !dbg !91
  ret void, !dbg !91

After:

  call void @"*puts<Int128>:Nil"(i128 3), !dbg !97
  ret void, !dbg !97

Code:

A = 1_i128 + 2_i128
puts A

I hope the CI won't fail (except on Windows).

straight-shoota · 2021-12-12T13:09:25Z

Please move the 128-bit math interpreter changes to a separate PR. They are unrelated to lexer support, and they need specs.

BlobCodes · 2021-12-12T18:48:24Z

The MathInterpreter changes (and a few more things) have been extracted to #11576

spec/compiler/lexer/lexer_spec.cr

oprypin · 2021-12-13T00:22:13Z

In terms of functionality, this is rock-solid. I just tested a huge range of numbers and it all matches expectations as per #8373 (comment), Alternative 4.

See all 4004 passing specs' names

oprypin@3bb8a1a#diff-40baf545fe5167c85593fa7b286af7017cd0140411b0c12def7c995523d74b31R476

I codified all the number ranges in a shape that directly mirrors the tables from that comment.

crystal/spec/compiler/lexer/lexer_spec.cr, lines 477 to 503 in 3bb8a1a:

  test_cases("", [
    {nil, "-2**127 - 1", "{} doesn't fit in an Int64"},
    {"-2**127", "-2**63 - 1", "{} doesn't fit in an Int64, try using the suffix i128"},
    {"-2**63", "-2**31 - 1", :i64},
    {"-2**31", "2**31 - 1", :i32},
    {"2**31", "2**63 - 1", :i64},
    {"2**63", "2**64 - 1", :u64},
    {"2**64", "2**127 - 1", "{} doesn't fit in an UInt64, try using the suffix i128"},
    {"2**127", "2**128 - 1", "{} doesn't fit in an UInt64, try using the suffix u128"},
    {"2**128", nil, "{} doesn't fit in an UInt64"},
  ])
  [:i8, :i16, :i32, :i64, :i128].each do |suf|
    size = suf.to_s[1..].to_i
    test_cases(suf.to_s, [
      {nil, "-2**#{size - 1} - 1", "{} doesn't fit in an Int#{size}"},
      {"-2**#{size - 1}", "2**#{size - 1} - 1", suf},
      {"2**#{size - 1}", nil, "{} doesn't fit in an Int#{size}"},
    ])
  end
  [:u8, :u16, :u32, :u64, :u128].each do |suf|
    size = suf.to_s[1..].to_i
    test_cases(suf.to_s, [
      {nil, "-1", "Invalid negative value {} for UInt#{size}"},
      {"0", "2**#{size} - 1", suf},
      {"2**#{size}", nil, "{} doesn't fit in an UInt#{size}"},
    ])
  end
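The first table above (literals without a suffix) can be read as a single range lookup. The following is a rough sketch of that rule, not the lexer's implementation: the method name deduce_unsuffixed is hypothetical, and BigInt is used only so that values outside the 128-bit range can be represented.

```crystal
require "big"

# Hypothetical helper mirroring the unsuffixed-literal table from the spec:
# returns the deduced type as a Symbol, or the expected error message as a String.
def deduce_unsuffixed(value : BigInt) : Symbol | String
  two = BigInt.new(2)
  if value < -(two ** 127)
    "#{value} doesn't fit in an Int64"
  elsif value < -(two ** 63)
    "#{value} doesn't fit in an Int64, try using the suffix i128"
  elsif value < -(two ** 31)
    :i64
  elsif value < two ** 31
    :i32
  elsif value < two ** 63
    :i64
  elsif value < two ** 64
    :u64
  elsif value < two ** 127
    "#{value} doesn't fit in an UInt64, try using the suffix i128"
  elsif value < two ** 128
    "#{value} doesn't fit in an UInt64, try using the suffix u128"
  else
    "#{value} doesn't fit in an UInt64"
  end
end

p deduce_unsuffixed(BigInt.new(0))       # inside the Int32 range
p deduce_unsuffixed(BigInt.new(2) ** 31) # first value past Int32
p deduce_unsuffixed(BigInt.new(2) ** 63) # first value past Int64
```

Values that would need 128 bits are never deduced silently in this table; they produce an error that names the suffix to add.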


oprypin

After #11551 is merged (which hasn't quite happened yet),
please merge master into this PR to double-check CI on Windows,
and then let's merge this one :)
Thanks much! 🎉

straight-shoota · 2021-12-16T11:41:56Z

#11551 has been merged.

BlobCodes · 2021-12-16T16:04:53Z

All specs pass with #11551 🎉

Commit 128d1ac: Implement lexer int128 support

BlobCodes mentioned this pull request Dec 11, 2021: Int128 Parsing support [PART 1] #11196 (Closed)

Commit 043f9f1: format code

Blacksmoke16 added kind:feature topic:compiler:parser topic:stdlib:numeric labels Dec 11, 2021

oprypin self-requested a review December 12, 2021 00:15

oprypin reviewed Dec 12, 2021: src/compiler/crystal/syntax/lexer.cr (outdated)

oprypin reviewed Dec 12, 2021: src/compiler/crystal/syntax/lexer.cr

BlobCodes added 2 commits December 12, 2021 12:46

Commit 3e5399b: Show integer not fitting in U/Int128 on failed conversion

Commit 6e1f0c3: Try conversion to decimal using u64 before using u128

oprypin mentioned this pull request Dec 12, 2021: Int128 support #8373 (Closed)

Commit 7c58a92: Add i128 support in MathInterpreter

Commit 7e7e172: Remove everything not lexer-related

oprypin reviewed Dec 12, 2021: spec/compiler/lexer/lexer_spec.cr (outdated)

Commit e25b754: Apply rules from discussion in crystal-lang#8373

oprypin reviewed Dec 12, 2021: spec/compiler/lexer/lexer_spec.cr (outdated)

BlobCodes added 2 commits December 13, 2021 13:25

Commit 92728ef: Add comment about reasoning behind using u64 for base 10 conversion first

Commit a9dc57c: Add old specs

straight-shoota approved these changes Dec 15, 2021

straight-shoota mentioned this pull request Dec 15, 2021: Add support for Int128 in codegen and macros #11576 (Merged)

oprypin approved these changes Dec 15, 2021

Commit 36707d0: Merge branch 'crystal-lang:master' into feature/lexer-int128-support

straight-shoota added this to the 1.3.0 milestone Dec 16, 2021

straight-shoota merged commit bfddd26 into crystal-lang:master Dec 18, 2021

BlobCodes deleted the feature/lexer-int128-support branch January 29, 2022 22:40

This was referenced Feb 4, 2022: Big UInt128 literals don't work #7448 (Closed); Crystal compiler bug (UInt128 as Hex) #7449 (Closed)
