Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix Sparse Attention with Packed QKV inputs #20591

Merged
merged 2 commits into from
May 7, 2024

Conversation

tianleiwu
Copy link
Contributor

@tianleiwu tianleiwu commented May 7, 2024

Description

(1) Fix UnpackQKV kernel
(2) Update test_sparse_attention.py with packed QKV option
CI build with unit test enabled in CI pipeline : https://dev.azure.com/onnxruntime/onnxruntime/_build/results?buildId=1373473&view=results

Motivation and Context

@tianleiwu tianleiwu merged commit d693aef into main May 7, 2024
95 checks passed
@tianleiwu tianleiwu deleted the tlwu/sparse_attention_test_packed_qkv branch May 7, 2024 17:50
@sophies927 sophies927 added the triage:approved Approved for cherrypicks for release label May 7, 2024
@yihonglyu yihonglyu added the cherry-picked Cherry-picked for a cherrypicks branch label May 9, 2024
yihonglyu pushed a commit that referenced this pull request May 9, 2024
### Description
(1) Fix UnpackQKV kernel
(2) Update test_sparse_attention.py with packed QKV option
@yihonglyu yihonglyu added the rel-merged Cherrypicks merged into release label May 10, 2024
poweiw pushed a commit to poweiw/onnxruntime that referenced this pull request Jun 25, 2024
### Description
(1) Fix UnpackQKV kernel
(2) Update test_sparse_attention.py with packed QKV option
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
cherry-picked Cherry-picked for a cherrypicks branch rel-merged Cherrypicks merged into release release:1.18.0 triage:approved Approved for cherrypicks for release
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants