[V1] Do not use inductor for piecewise CUDA graphs #10225

WoosukKwon · 2024-11-11T19:02:40Z

This PR disables the use of TorchInductor, to avoid the compilation time and issue. We can flip the flag again once the compiler becomes more stable.

Signed-off-by: Woosuk Kwon <[email protected]>

github-actions · 2024-11-11T19:02:52Z

👋 Hi! Thank you for contributing to the vLLM project.
Just a reminder: PRs would not trigger full CI run by default. Instead, it would only run fastcheck CI which starts running only a small and essential subset of CI tests to quickly catch errors. You can run other CI tests on top of those by going to your fastcheck build on Buildkite UI (linked in the PR checks section) and unblock them. If you do not have permission to unblock, ping simon-mo or khluu to add you in our Buildkite org.

Once the PR is approved and ready to go, your PR reviewer(s) can run CI to test the changes comprehensively before merging.

To run CI, PR reviewers can do one of these:

Add ready label to the PR
Enable auto-merge.

🚀

Signed-off-by: Woosuk Kwon <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>

Signed-off-by: Woosuk Kwon <[email protected]>

WoosukKwon added 2 commits November 11, 2024 10:55

[V1] Do not use inductor for piecewise CUDA graphs

195b833

Signed-off-by: Woosuk Kwon <[email protected]>

minor

6c2750c

Signed-off-by: Woosuk Kwon <[email protected]>

WoosukKwon requested a review from youkaichao November 11, 2024 19:02

youkaichao approved these changes Nov 11, 2024

View reviewed changes

WoosukKwon merged commit d7a4f22 into main Nov 11, 2024
14 of 17 checks passed

WoosukKwon deleted the v1-no-inductor branch November 11, 2024 19:05

WoosukKwon mentioned this pull request Nov 11, 2024

[V1] Use custom ops for piecewise CUDA graphs #10227

Merged

rickyyx pushed a commit to rickyyx/vllm that referenced this pull request Nov 13, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

eb0bc81

Signed-off-by: Woosuk Kwon <[email protected]>

sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request Nov 14, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

abdc210

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Sumit Dubey <[email protected]>

KuntaiDu pushed a commit to KuntaiDu/vllm that referenced this pull request Nov 20, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

0e493f6

Signed-off-by: Woosuk Kwon <[email protected]>

mfournioux pushed a commit to mfournioux/vllm that referenced this pull request Nov 20, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

cf8fe07

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Maxime Fournioux <[email protected]>

tlrmchlsmth pushed a commit to neuralmagic/vllm that referenced this pull request Nov 23, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

dfede7c

Signed-off-by: Woosuk Kwon <[email protected]> Signed-off-by: Tyler Michael Smith <[email protected]>

sleepwalker2017 pushed a commit to sleepwalker2017/vllm that referenced this pull request Dec 13, 2024

[V1] Do not use inductor for piecewise CUDA graphs (vllm-project#10225)

e7cef58

Signed-off-by: Woosuk Kwon <[email protected]>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[V1] Do not use inductor for piecewise CUDA graphs #10225

[V1] Do not use inductor for piecewise CUDA graphs #10225

WoosukKwon commented Nov 11, 2024 •

edited

Loading

github-actions bot commented Nov 11, 2024

[V1] Do not use inductor for piecewise CUDA graphs #10225

[V1] Do not use inductor for piecewise CUDA graphs #10225

Conversation

WoosukKwon commented Nov 11, 2024 • edited Loading

github-actions bot commented Nov 11, 2024

WoosukKwon commented Nov 11, 2024 •

edited

Loading