avoid calling `gc.collect` and `cuda.empty_cache` #34514
Conversation
Yes, this seems like a good speed fix! cc @LysandreJik @ArthurZucker for core maintainer review
Smart! Should a helper method be made that only runs on CPU? Both the `gc.collect` call and the torch device checks could be moved into the `backend_empty_cache` method (or another method that wraps both).
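The wrapper suggested above could look roughly like the following. This is a minimal sketch, not the PR's actual implementation: the function name `cleanup_test_memory` and the `device` string parameter are assumptions made for illustration.

```python
import gc


def cleanup_test_memory(device: str) -> bool:
    """Hypothetical helper wrapping gc.collect and the device check.

    Skips all cleanup when the tests run on CPU, which is where the
    speedup in this PR comes from. Returns True if cleanup ran.
    """
    if device == "cpu":
        # Nothing to free on CPU; skipping gc.collect saves test time.
        return False

    gc.collect()
    try:
        import torch  # torch may not be installed in every environment

        if device == "cuda" and torch.cuda.is_available():
            torch.cuda.empty_cache()
    except ImportError:
        pass
    return True
```

Callers (e.g. test `tearDown` methods) would then invoke this single helper instead of sprinkling `gc.collect()` and `torch.cuda.empty_cache()` through the test files.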
Yes, a helper method is nice. Will update.
Force-pushed from 82e3add to 6620320
Updated. So far it doesn't call
Force-pushed from 6620320 to 4403c5a
Force-pushed from 1f47700 to 18d6d5d
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
* update * update * update * update * update --------- Co-authored-by: ydshieh <[email protected]>
What does this PR do?
Let's avoid calling `gc.collect` and `cuda.empty_cache` while the tests are running on CPU.

Running the GPT2 tests: 60 seconds on main, 20 seconds on this PR.
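The pattern the PR describes can be sketched as a test teardown that skips memory cleanup entirely on CPU. This is an illustrative sketch, not the repository's actual test code; the `TORCH_DEVICE` constant and the class name are assumptions.

```python
import gc
import unittest

# Assumption: mirrors the suite's configured test device ("cpu" or "cuda").
TORCH_DEVICE = "cpu"


class ExampleModelTest(unittest.TestCase):
    """Sketch of the PR's idea: only pay for cleanup on accelerators."""

    def tearDown(self):
        if TORCH_DEVICE == "cpu":
            # gc.collect() / empty_cache() only pay off on accelerators,
            # so returning early here is what speeds up CPU test runs.
            return
        gc.collect()
        # On a CUDA device, torch.cuda.empty_cache() would also run here.

    def test_addition(self):
        self.assertEqual(1 + 1, 2)
```

With hundreds of tests in a model suite, dropping an unconditional `gc.collect()` per test is consistent with the roughly 3x wall-clock improvement reported above.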