Add support for torch FP8 dtypes #445
Conversation
Thanks @riccardofelluga for the PR, this will enable a lot of interesting opportunities for FP8 support in thunder.
Overall the PR looks really good. I just think FP8 should not be involved in type promotion, for the following reasons:
- For the gradient computation of linear it is common to keep the gradient in E5M2 (to avoid underflow/overflow) and the weights/input in E4M3 for higher precision. In that case we don't want E4M3 to be upcast, since there are GEMM kernels for mixed FP8 dtypes.
- When we use an operator with FP8, we want to enforce that FP8 inputs stay in FP8 for performance reasons (we don't want the input to be upcast to higher precision and miss the special FP8 GEMMs).
- Except for matmul, there is no math-operation support for FP8. (I also don't think the coverage will increase soon.)
So, I think we should remove the type promotion logic. (cc: @mruberry for his thoughts on the same)
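For illustration, a minimal sketch (not code from this PR) of the mixed-dtype setup described above, assuming a PyTorch build that ships the fp8 dtypes (2.1+):

import torch

# Weights/activations are kept in E4M3 for precision, gradients in E5M2 for range.
weight = torch.randn(128, 256).to(torch.float8_e4m3fn)
grad_out = torch.randn(64, 128).to(torch.float8_e5m2)

# A mixed-dtype scaled GEMM kernel (e.g. torch._scaled_mm, a private API whose
# exact signature varies across PyTorch versions) can consume E5M2 x E4M3
# operands directly; upcasting one side would forfeit that kernel.
print(weight.dtype, grad_out.dtype)  # torch.float8_e4m3fn torch.float8_e5m2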
Also, I think we should not enable testing with FP8 by default: FP8 will have very limited op support, and every test would have to disable it for now. It is likely that we will have separate tests for selected ops with fp8. So we should probably tweak instantiate so it does not enable FP8 types by default and only uses them when they are explicitly passed to the dtypes argument (see the sketch after the quoted snippet below).
lightning-thunder/thunder/tests/framework.py
Lines 416 to 438 in 82185e3
class instantiate:
    # TODO: support other kinds of dtype specifications
    def __init__(
        self,
        *,
        executors=None,
        devicetypes=None,
        dtypes=None,
        num_devices: int = 1,
        decorators: None | Sequence = None,
        scope=None,
        as_name: str | None = None,
    ):
        self.executors = set(executors) if executors is not None else set(_all_test_executors())
        self.devicetypes = set(devicetypes) if devicetypes is not None else set(available_devicetypes())
        self.devicetypes = set(filter_ci_devicetypes(self.devicetypes))

        if dtypes == NOTHING:
            self.dtypes = (None,)
        else:
            self.dtypes = datatypes.resolve_dtypes(dtypes) if dtypes is not None else datatypes.all_dtypes
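For concreteness, a hedged sketch of the suggested tweak (not thunder's actual API): keep fp8 dtypes opt-in when resolving the default test dtypes. The set names below are placeholders for whatever thunder.core.dtypes actually exposes.

# Sketch only: choose test dtypes while keeping fp8 variants opt-in.
def default_test_dtypes(requested, all_dtypes: set, fp8_dtypes: set) -> set:
    if requested is not None:
        # fp8 is used only when a test explicitly passes it via `dtypes`
        return set(requested)
    # default: every supported dtype except the fp8 variants
    return all_dtypes - fp8_dtypes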
Hey @riccardofelluga! Cool stuff. I made a few small suggestions. The big things I'm curious about before merging are:
- Should we delay implementing type promotion logic for fp8 dtypes? The logic update seems pretty reasonable, but maybe different fp8 dtypes should promote to fp16 for now? (A rough sketch of that interim rule is below.)
- A convenient set of datatypes so test authors can select the floating point types except the fp8 types would be nice for now.
Curious to hear your thoughts!
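To make the first question concrete, a rough sketch (an illustration, not code from the PR) of an interim rule where differing fp8 dtypes fall back to fp16:

import torch

_FP8 = {torch.float8_e4m3fn, torch.float8_e5m2}

def interim_promote(a: torch.dtype, b: torch.dtype) -> torch.dtype:
    # Placeholder rule: a matching fp8 pair stays fp8, two different fp8
    # dtypes promote to float16; everything else is left to the existing table.
    if a in _FP8 and b in _FP8:
        return a if a is b else torch.float16
    raise NotImplementedError("non-fp8 promotion is handled by the existing table")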
Thanks everybody for your comments! I thought about it a bit and I mainly agree with your points: the support for this dtype in torch is so low that I agree we should delay the type promotion. Regarding the tests, I also had the idea of disabling them from the start, but I didn't realize how ugly the …
Thank you @riccardofelluga @crcrpar @kshitij12345 @mruberry @lantiga
What does this PR do?
This PR fixes #254 and adds native thunder support for the torch float8 dtypes. Since the float8 dtype is implemented in 4 different variants, I added a variant mechanism for Thunder dtypes so that we can differentiate between them.
This PR also adds the option to create fp8 test tensors with make_tensor so that we can start testing fp8 operations. After running the existing operator tests it is evident that torch's support for this dtype is scarce, since the majority of tests fail with "not implemented" runtime errors. Because of that, I decided to skip the operator testing for all the fp8 dtypes. Furthermore, I updated the type promotion table; please take a look and don't hesitate to comment if you think some promotions are not in the right place.
Did you have fun?
Oh yes!