Fix MistralIntegrationTest
#31231
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
```diff
@@ -526,7 +526,7 @@ def test_model_7b_logits(self):
         # Note: Key 9 is currently set for MI300, but may need potential future adjustments for H100s,
         # considering differences in hardware processing and potential deviations in output.
         EXPECTED_SLICE = {
-            7: torch.tensor([-5.8781, -5.8616, -0.1052, -4.7200, -5.8781, -5.8774, -5.8773, -5.8777, -5.8781, -5.8780, -5.8781, -5.8779, -1.0787, 1.7583, -5.8779, -5.8780, -5.8783, -5.8778, -5.8776, -5.8781, -5.8784, -5.8778, -5.8778, -5.8777, -5.8779, -5.8778, -5.8776, -5.8780, -5.8779, -5.8781]),
+            7: torch.tensor([-5.8828, -5.8633, -0.1042, -4.7266, -5.8828, -5.8789, -5.8789, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -1.0801, 1.7598, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828, -5.8828]),
```
should update in #29905 but forgot
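For context, the dict-of-expected-values pattern these tests rely on (integer hardware keys such as 7 for the T4-class runner, per the note in the diff) can be sketched without torch. The keys, truncated values, and tolerance below are illustrative assumptions, not the test suite's actual settings:

```python
import math

# Hypothetical per-hardware expected logits, mirroring the tests'
# EXPECTED_SLICE dict keyed by an integer hardware identifier
# (7 corresponds to the T4 values in the diff above; slices truncated).
EXPECTED_SLICE = {
    7: [-5.8828, -5.8633, -0.1042, -4.7266],
    9: [-5.8781, -5.8616, -0.1052, -4.7200],  # hypothetical values for key 9
}

def logits_match(actual, hardware_key, abs_tol=1e-2):
    """Compare a logits slice against the values recorded for this hardware."""
    expected = EXPECTED_SLICE[hardware_key]
    return len(actual) == len(expected) and all(
        math.isclose(a, e, abs_tol=abs_tol) for a, e in zip(actual, expected)
    )

print(logits_match([-5.8828, -5.8633, -0.1042, -4.7266], 7))  # True
```

The per-key dict lets one test body serve several CI hardware pools; the cost, as this PR shows, is that every dtype or quantization change requires re-recording the values for each key.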
```diff
     @slow
     @require_bitsandbytes
     def test_model_7b_generation(self):
         EXPECTED_TEXT_COMPLETION = {
-            7: "My favourite condiment is 100% ketchup. I love it on everything. I'm not a big",
+            7: "My favourite condiment is 100% ketchup. I’m not a fan of mustard, mayo,",
```
should update in #29905 but forgot
```diff
-        del model
-        backend_empty_cache(torch_device)
-        gc.collect()
-
```
These calls don't help and, worse, cause GPU OOM in subsequent tests.
Happy to have this deleted but very confused why this would cause OOM 😭
Got to say I am confused too. `torch.cuda.empty_cache` is not really magic:

> `empty_cache()` doesn’t increase the amount of GPU memory available for PyTorch.

but I was not expecting it to have an undesired side effect like this (even if it is not helpful). I didn't check whether `del model` and `gc.collect()` play a role here, though.
Out of curiosity, and to keep the info here for the record:

- it is `test_model_7b_long_prompt` that gets OOM.
- previously, with those empty-cache calls, at the beginning of `test_model_7b_long_prompt` `nvidia-smi` shows `150MiB / 15360MiB`, which looks nice, but we get OOM afterward inside this test.
- without the empty-cache calls, `nvidia-smi` shows `9066MiB / 15360MiB`, which looks not great, but we DON'T get OOM afterward inside this test.
It's very mysterious to me.
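The teardown pattern under discussion can be sketched as a small helper; this is my illustration (not the repo's actual helper), with torch treated as an optional import so the sketch degrades gracefully on a CPU-only machine:

```python
import gc

try:
    import torch  # optional: the sketch still runs without torch installed
except ImportError:
    torch = None

def release_gpu_memory():
    """Force a GC pass, then ask the CUDA caching allocator to return
    unused cached blocks to the driver.

    Caveat (per the discussion above): empty_cache() only releases
    *cached* blocks, never memory that is still referenced, and handing
    blocks back to the driver can leave later allocations worse off
    than simply reusing the warm cache would have.
    """
    collected = gc.collect()  # number of unreachable objects collected
    if torch is not None and torch.cuda.is_available():
        torch.cuda.empty_cache()
    return collected

release_gpu_memory()  # safe no-op without a GPU
```

The caller must still drop its own references (e.g. `del model`) before calling this; `gc.collect()` cannot free a tensor that a live name still points to.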
```diff
@@ -635,7 +622,7 @@ def test_speculative_generation(self):
         # Note: Key 9 is currently set for MI300, but may need potential future adjustments for H100s,
         # considering differences in hardware processing and potential deviations in generated text.
         EXPECTED_TEXT_COMPLETION = {
-            7: "My favourite condiment is 100% Sriracha. I love the heat, the tang and the fact costs",
+            7: "My favourite condiment is 100% ketchup. I love it on everything. I’m not a big",
```
see PR description
Thanks for fixing!
```diff
-        del model
-        backend_empty_cache(torch_device)
-        gc.collect()
-
```
Happy to have this deleted but very confused why this would cause OOM 😭
* fix

Co-authored-by: ydshieh <[email protected]>
What does this PR do?

- `test_speculative_generation`: it failed due to 9efec11 (Jan 19, 2024) and then 2e27291 (May 13, 2024). I would trust those 2 PRs and simply update the expected outputs (cc @gante).
- `test_model_7b_generation` and `test_model_7b_logits`: the failures come from my PR "Fix slow tests for important models to be compatible with A10 runners" #29905, where I changed `dtype` and/or `load_in_4bit` in `from_pretrained` in the tests but forgot to update the expected output values for T4.

Also, the `del model` / `backend_empty_cache(torch_device)` / `gc.collect()` teardown is not helping and, worse, with it we actually get GPU OOM (here for `test_model_7b_long_prompt_sdpa`). After removing those calls, the tests are all passing now.
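As an aside on why a dtype change forces re-recording expected values: rounding through a lower-precision format perturbs every logit, so exact-valued expectations recorded under the old dtype stop matching. A stdlib-only illustration (the float16 round-trip here is my analogy for the effect of a dtype change, not what `from_pretrained` does internally):

```python
import struct

def to_float16(x):
    """Round-trip a Python float through IEEE-754 half precision ('e')."""
    return struct.unpack('e', struct.pack('e', x))[0]

old_expected = [-5.8781, -5.8616, -0.1052]  # pre-PR T4 values from the diff
rounded = [to_float16(v) for v in old_expected]
# None of the old values sit exactly on the float16 grid, so each one
# moves slightly after rounding -- exact comparisons would start failing.
print(all(r != v for r, v in zip(rounded, old_expected)))  # True
```

Notably, the updated T4 values in the diff (e.g. -5.8828125 = 1506 × 2⁻⁸, printed as -5.8828) sit on the half-precision grid, which is consistent with a lower-precision dtype being involved in the new expected outputs.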