
gpt-bigcode: avoid zero_ to support Core ML #24755

Merged: 1 commit into huggingface:main from gpt-bigcode-coreml-support on Jul 12, 2023

Conversation

@pcuenca (Member) commented on Jul 11, 2023

What does this PR do?

In-place `zero_` is not supported by the Core ML conversion process. This PR replaces it with `zeros_like` so conversion can proceed.

The change only affects a workaround for a PyTorch bug on the `cpu` device.
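
For reference, here is a minimal sketch of the pattern in question (hypothetical shapes and variable names, not the exact gpt-bigcode modeling code):

```python
import torch

# Hypothetical stand-ins for the values used in the modeling code.
query = torch.randn(1, 4, 8)
shape = (1, 4, 4)
softmax_dtype = torch.float32

# Before: in-place zeroing of an uninitialized buffer, which the
# Core ML conversion process cannot handle.
attn_weights = torch.empty(shape, device=query.device, dtype=softmax_dtype)
if query.device.type == "cpu":
    attn_weights.zero_()  # in-place op breaks Core ML conversion

# After: functional replacement with the same result, which Core ML
# can convert.
attn_weights = torch.empty(shape, device=query.device, dtype=softmax_dtype)
if query.device.type == "cpu":
    attn_weights = torch.zeros_like(attn_weights)
```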


Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

Anyone in the community is free to review the PR once the tests have passed. Feel free to tag
members/contributors who may be interested in your PR.

@younesbelkada, @loubnabnl, @jlamypoirier

Commit: gpt-bigcode: avoid `zero_` to support Core ML

In-place `zero_` is not supported by the Core ML conversion process.
This PR replaces it with `zeros_like` so conversion can proceed.

The change only affects a workaround for a PyTorch bug on the `cpu`
device.
@pcuenca (Member, Author) commented on Jul 11, 2023

Note: to fully test conversion of gpt-bigcode models, the following coremltools PRs (or equivalent workarounds) need to be applied as well: apple/coremltools#1910, apple/coremltools#1911.

@HuggingFaceDocBuilderDev commented on Jul 11, 2023

The documentation is not available anymore as the PR was closed or merged.

@jlamypoirier (Contributor) left a comment:

I suggest moving the `torch.empty` call to the `else` branch to avoid double allocation; otherwise LGTM.
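
One way to implement that suggestion (a sketch, reusing the hypothetical names from the snippet above) is to allocate once per branch, zeroed on `cpu` and uninitialized elsewhere:

```python
import torch

query = torch.randn(1, 4, 8)  # hypothetical input, as above
shape = (1, 4, 4)
softmax_dtype = torch.float32

if query.device.type == "cpu":
    # The cpu path needs a zero-initialized buffer (PyTorch bug workaround),
    # so allocate it zeroed directly instead of empty-then-zeros_like.
    attn_weights = torch.zeros(shape, device=query.device, dtype=softmax_dtype)
else:
    # Other devices can start from an uninitialized buffer.
    attn_weights = torch.empty(shape, device=query.device, dtype=softmax_dtype)
```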

@younesbelkada (Contributor) left a comment:

Thanks @pcuenca!

@younesbelkada requested a review from @sgugger on Jul 12, 2023 at 07:14
@mayank31398 (Contributor) commented on Jul 12, 2023

@younesbelkada I think this is already fixed in PyTorch. Should we just drop this logic?
I opened a PR, #24768, which supersedes this one.

@sgugger (Collaborator) commented on Jul 12, 2023

We support PyTorch versions from 1.10 onward, so we need to keep the workaround for the bug.

@younesbelkada (Contributor) commented:

Merging to unblock @pcuenca; let's address @jlamypoirier's comments in a follow-up PR!

@younesbelkada merged commit 395e566 into huggingface:main on Jul 12, 2023
@pcuenca deleted the gpt-bigcode-coreml-support branch on Jul 12, 2023 at 14:40
Lorenzobattistela pushed a commit to Lorenzobattistela/transformers that referenced this pull request on Jul 13, 2023: gpt-bigcode: avoid `zero_` to support Core ML

blbadger pushed a commit to blbadger/transformers that referenced this pull request on Nov 8, 2023: gpt-bigcode: avoid `zero_` to support Core ML