Merge OpenAI Triton commit 86a2ac7
#2630
Conversation
…odegen bug (#4873)" (#4973) After investigating the differences caused by triton-lang/triton#4774 in the internal tests, we concluded that they were introduced by a change in the layouts selected for the reduce operations. Re-introducing that change, as it is functionally correct and should be beneficial for performance.
This commit adds initial support for scaled_dot with an mxfp8 LHS and fp8 RHS. It supports both the mfma32 and mfma16 intrinsic variants. Right now we are missing software emulation for the `Float8E4M3FN` type, so this is only enabled for `Float8E5M2`.
…`interpreter.cc` (#4976) `#include <atomic>` is already used in other Triton files, so I believe this is not a drastic change. Changes come from triton-lang/triton#4045
Signed-off-by: Anatoly Myachev <[email protected]>
@whitneywhtsang I am ending this and stopping this activity for now, as agreed with you offline :)
@whitneywhtsang ready for review
Is the pass rate degradation solely due to test_scaled_dot? Can we open an issue to fix that?
Yes, simply because the number of parameter combinations has increased; before this PR this test also did not work on XPU. Will open one.
Looks like the number of test cases is unchanged, but this PR marks the failures as skipped instead of xfailed; that's why the pass rate is affected.
Ah, it increased only for AMD, I see. |
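To make the skip-vs-xfail point above concrete, here is a small sketch of one plausible way a CI dashboard's pass rate shifts when failing tests are marked as skipped instead of xfailed. The formula and all of the test counts below are hypothetical illustrations, not taken from this PR's actual CI; the assumption is that xfailed tests count toward the passing number while skipped tests do not.

```python
def pass_rate(passed, failed, xfailed, skipped, xfail_counts_as_pass=True):
    """Hypothetical pass-rate formula (an assumption, not this CI's real one):
    xfailed tests count as 'good' outcomes, skipped tests do not, and both
    remain in the total."""
    total = passed + failed + xfailed + skipped
    good = passed + (xfailed if xfail_counts_as_pass else 0)
    return 100.0 * good / total

# Hypothetical counts chosen only to illustrate how the same failures,
# reclassified from xfail to skip, lower the reported rate.
before = pass_rate(passed=9700, failed=17, xfailed=283, skipped=0)
after = pass_rate(passed=9700, failed=17, xfailed=41, skipped=242)
print(f"{before:.2f} -> {after:.2f}")
```

Under this assumption, moving a test from xfail to skip removes it from the "good" count without shrinking the total, so the rate drops even though no test outcome actually changed.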
This PR changes the Triton base from 1d5fdfe to 86a2ac7 (Oct 28).
Pass rate: 99.83% -> 97.41%
Please do not squash and merge this PR.