
Allow gelu approximations #1911
Merged: 5 commits merged into apple:main on Jul 14, 2023

Conversation

@pcuenca (Contributor) commented on Jul 11, 2023

The MIL gelu implementation accepts tanh or sigmoid approximations, but the frontend asserts that no approximation is requested.

This PR allows the supported approximations to be specified. The tanh approximation is used in models such as those with the gpt_bigcode architecture, so conversion can now proceed by applying the same approximation.
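
For context, a minimal sketch of converting a model that uses the tanh-approximated GELU (hypothetical module and input shape, assuming a coremltools build that includes this change):

import torch
import coremltools as ct

class Block(torch.nn.Module):
    def __init__(self):
        super().__init__()
        # gpt_bigcode-style activation: GELU with the tanh approximation
        self.act = torch.nn.GELU(approximate="tanh")

    def forward(self, x):
        return self.act(x)

traced = torch.jit.trace(Block().eval(), torch.rand(1, 8))
# Before this change, the frontend asserted approximate == "none" and failed here.
mlmodel = ct.convert(traced, inputs=[ct.TensorType(shape=(1, 8))])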

@pcuenca (Contributor, Author) commented on Jul 12, 2023

I added a test for tanh and removed support for sigmoid in the frontend, as I realized it's not yet supported in PyTorch: pytorch/pytorch#102447
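
For reference, torch's gelu currently accepts only the "none" and "tanh" modes; a quick sketch (standalone PyTorch, no coremltools involved):

import torch
import torch.nn.functional as F

x = torch.randn(4)
F.gelu(x, approximate="none")  # exact GELU (the default)
# tanh approximation: 0.5 * x * (1 + tanh(sqrt(2/pi) * (x + 0.044715 * x**3)))
F.gelu(x, approximate="tanh")
# F.gelu(x, approximate="sigmoid") would raise a RuntimeError;
# sigmoid support is tracked in pytorch/pytorch#102447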

res = mb.gelu(x=inputs[0], name=node.name)
if approximate == "tanh":
    approximate = "TANH_APPROXIMATION"
elif approximate == "none":

A Collaborator commented on the diff above:
If PyTorch only supports two modes right now, it would be better to do:

else:
    assert approximate == "none"

to handle possible future changes in the torch frontend (for example, if they add support for more modes).

@pcuenca (Author) replied:

I didn't do it because an unsupported approximation would still fail in the mb.gelu implementation. But you are right that this is a better place to signal where the conversion needs to happen; changing it now!
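
A sketch of what the updated check might look like after that change (a hypothetical fragment continuing the snippet above; the exact code in the PR may differ):

if approximate == "tanh":
    mode = "TANH_APPROXIMATION"
else:
    # Fail here, at the conversion layer, if torch ever adds an
    # approximation mode the frontend doesn't handle yet.
    assert approximate == "none", f"unsupported gelu approximation: {approximate}"
    mode = "EXACT"
res = mb.gelu(x=inputs[0], mode=mode, name=node.name)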

@jakesabathia2 (Collaborator) commented:

https://gitlab.com/coremltools1/coremltools/-/pipelines/929504950

@pcuenca (Contributor, Author) commented on Jul 13, 2023

Not sure why the Python 3.10 tests fail; I checked some (the Spectrogram ones, for instance), and they passed locally.

@TobyRoseman (Collaborator) commented:

> Not sure why the Python 3.10 tests fail; I checked some (the Spectrogram ones, for instance), and they passed locally.

This is curious. I also cannot reproduce it locally using this pull request.

I've restarted the failed job. Perhaps this is a non-deterministic issue.

A similar unit test is also inexplicably failing in an unrelated pull request (#1897). I cannot reproduce that one locally either, although it passes on main.

The failure doesn't seem very serious: only one element is mismatched (out of thousands).

I don't think this unit test failure should block merging this pull request.

@jakesabathia2 (Collaborator) commented:

I think it might be some flaky issue that is triggered by certain random inputs...

@jakesabathia2 (Collaborator) commented:

@TobyRoseman I can put up another PR to fix that; I'm guessing we just need to do some input data "surgery" for that particular test.
@pcuenca Your PR looks awesome, I am going to merge it.

@jakesabathia2 merged commit 36544e2 into apple:main on Jul 14, 2023
@pcuenca deleted the allow-gelu-approximations branch on Jul 15, 2023
pcuenca added a commit to huggingface/exporters that referenced this pull request on Aug 23, 2023