
Enable Flan-T5 inference in float16 #296

Merged · 21 commits · Apr 24, 2023

Conversation

hmellor (Member) commented Mar 24, 2023

What does this PR do?

  • Enable floating point exceptions (probably temporarily) so we can track down the source of over/underflows
  • Fix overflow coming from attention masking
  • Fix overflow coming from tanh approximation of GeLU
  • Fix overflow coming from pre-norm residual structure
  • Fix overflow coming from FeedForward down projection
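The attention-masking and clamping fixes can be sketched in a minimal NumPy illustration (this is not the PR's actual diff; the mask value, array shapes, and `clamp` margin are assumptions chosen for demonstration):

```python
import numpy as np

# float16 overflows beyond roughly +/-65504, so the common additive
# attention-mask constant of -1e9 becomes -inf in half precision.
assert np.isinf(np.float16(-1e9))

# Fix: fill masked positions with the most negative *finite* value
# of the working dtype instead of a hard-coded large constant.
dtype = np.float16
safe_fill = np.finfo(dtype).min  # -65504.0 for float16

scores = np.array([2.0, 3.0], dtype=dtype)
mask = np.array([0.0, safe_fill], dtype=dtype)  # mask out position 1
masked = scores + mask

# A max-subtracted softmax over the masked scores stays finite,
# and the masked position receives (near-)zero weight.
e = np.exp(masked - masked.max())
probs = e / e.sum()

# The residual and down-projection overflows can be handled similarly,
# by clamping activations just inside the representable range before
# casting down to float16 (margin of 1000 is an illustrative choice).
clamp = np.finfo(dtype).max - 1000
hidden = np.array([70000.0, -70000.0, 1.0], dtype=np.float32)
hidden16 = np.clip(hidden, -clamp, clamp).astype(dtype)
```

The key idea in both cases is the same: derive limits from `np.finfo(dtype)` rather than hard-coding constants that only fit float32.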

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

HuggingFaceDocBuilderDev commented Mar 24, 2023

The documentation is not available anymore as the PR was closed or merged.

hmellor mentioned this pull request Apr 13, 2023
hmellor marked this pull request as ready for review April 17, 2023 12:31
hmellor changed the title from "Address T5 float16 precision issues" to "Enable Flan-T5 inference in float16" Apr 17, 2023
hmellor force-pushed the t5-numerical-issues branch from c40be80 to 9116d9d April 18, 2023 13:34
jimypbr merged commit 5a68bc6 into huggingface:main Apr 24, 2023
jimypbr deleted the t5-numerical-issues branch April 24, 2023 12:11
4 participants