Jamba: update integration tests #32250
Conversation
cc @ydshieh
# This variable is used to determine which CUDA device we are using for our runners (A10 or T4)
# Depending on the hardware we get different logits / generations
cuda_compute_capability_major_version = None
This `cuda_compute_capability_major_version` pattern is copied from other models, e.g. gemma.
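For context, a minimal sketch of how such a flag is typically initialized at module level in these test files (assuming `torch` is installed; the A10/T4 mapping follows from their sm_86/sm_75 compute capabilities):

```python
import torch

# None on CPU-only runners; otherwise the GPU's major compute capability
cuda_compute_capability_major_version = None

if torch.cuda.is_available():
    # get_device_capability() returns (major, minor): (8, 6) on A10, (7, 5) on T4
    cuda_compute_capability_major_version = torch.cuda.get_device_capability()[0]
```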
(thank you for triggering the tests on the runner 🙏)
Thanks for digging into this and fixing, and for writing a detailed PR description ❤️
Agreed it's not worth digging into further given Jamba's usage, and as the generated texts appear similar despite the logits differences.
torch.testing.assert_close(logits[0, -1, :40].cpu(), EXPECTED_LOGITS_NO_GRAD_0, rtol=1e-3, atol=1e-3)
torch.testing.assert_close(logits[1, -1, :40].cpu(), EXPECTED_LOGITS_NO_GRAD_1, rtol=1e-3, atol=1e-3)
# TODO: there are significant differences in the logits across major cuda versions, which shouldn't exist
if self.cuda_compute_capability_major_version == 8:
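For readers unfamiliar with the tolerance semantics: `torch.testing.assert_close` passes when `|actual - expected| <= atol + rtol * |expected|` holds elementwise, so `rtol=atol=1e-3` absorbs small hardware-dependent wobble in the logits. A tiny self-contained illustration (the values are made up):

```python
import torch

expected = torch.tensor([1.0, 2.0, -3.0])
actual = torch.tensor([1.0005, 2.001, -3.002])

# Each |actual - expected| is within atol + rtol * |expected|, so this passes
torch.testing.assert_close(actual, expected, rtol=1e-3, atol=1e-3)
```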
Maybe better to use `self.skipTest(reason="Skipping for T4 runners because ...")`
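For context, `skipTest` reports the test as skipped instead of letting it silently pass without asserting anything; a minimal sketch of the suggestion (class name and reason text are illustrative, not from the PR):

```python
import unittest

class JambaLogitsTest(unittest.TestCase):
    cuda_compute_capability_major_version = 7  # illustrative: a T4 runner

    def test_logits(self):
        if self.cuda_compute_capability_major_version != 8:
            # Shows up as "skipped" in the report rather than as a hollow pass
            self.skipTest(reason="Skipping for T4 runners: expected logits were captured on A10")
        # ... logits assertions would run here on A10 runners
```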
Oops, merged before seeing this comment!
You have a good point; in fact, we should split the test in two to test (/skip) the logits separately.
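A rough sketch of that split, with hypothetical test names (only the logits half is hardware-gated, so the text check still runs everywhere):

```python
import unittest

class JambaIntegrationTest(unittest.TestCase):
    cuda_compute_capability_major_version = 8  # set from the runner's GPU in practice

    def test_generated_text(self):
        # Text comparison: runs on every runner
        ...

    def test_logits(self):
        # Logits comparison: only meaningful on the hardware the expectations came from
        if self.cuda_compute_capability_major_version != 8:
            self.skipTest(reason="Expected logits were captured on an A10 runner")
        ...
```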
* try test updates
* a few more changes
* a few more changes
* a few more changes
* [run slow] jamba
* skip logits checks on older gpus
* [run slow] jamba
* oops
* [run slow] jamba
* Update tests/models/jamba/test_modeling_jamba.py
Co-authored-by: amyeroberts <[email protected]>
* Update tests/models/jamba/test_modeling_jamba.py
Co-authored-by: amyeroberts <[email protected]>
---------
Co-authored-by: amyeroberts <[email protected]>
What does this PR do?
🟢 Fixes `generate`-related integration tests for jamba 🟢

I've checked them against:
Detective work 🕵️
Since the tests run on `ai21labs/Jamba-tiny-random`, the generation text quality doesn't matter.
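For anyone reproducing locally, a minimal sketch of exercising that checkpoint (the prompt and generation settings are illustrative; a random-weights model emits gibberish by design, so the tests compare exact tokens and logits rather than text quality):

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ai21labs/Jamba-tiny-random"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

inputs = tokenizer("Hey how are you doing on this lovely evening?", return_tensors="pt")
output_ids = model.generate(**inputs, max_new_tokens=10, do_sample=False)
print(tokenizer.decode(output_ids[0]))  # gibberish is expected from random weights
```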