[Misc]: VLLM docker image size #363
Comments
The size of these images depends on the size of the ROCm release images; vLLM adds only the bare minimum on top of them.
I see vllm-dev comes in at around 7GB, but VLLM (https://hub.docker.com/r/rocm/vllm/tags) is around 22GB.
The vllm-dev images are the nightly builds: latest and greatest, but prone to intermittent regressions.
Functionally, are both of them meant to provide the same tooling?
Why is there such a difference in the image sizes, though?
The current ones in rocm/vllm are built on rocm/pytorch release images, which are quite large. vLLM functionality is also frozen at the time of the ROCm 6.3 release. So, regarding
tl;dr, no
Anything you want to discuss about vllm.
The VLLM and vllm-dev image builds for ROCm on Docker Hub are quite large (~22GB). I wanted to know whether there is interest in optimizing the image size to reduce download times for end users, or whether efforts in that direction are already underway.
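For anyone who wants to compare the images on their own machine, here is a minimal sketch, assuming Docker is installed and the images have already been pulled. It relies on `docker image inspect --format '{{.Size}}'`, which reports the local size in bytes. The helper names `human_size` and `local_image_size` are made up for this example, and the image references are whatever you pass on the command line. Note that the locally reported size is typically larger than the compressed size listed on Docker Hub, so the numbers may not match the ones quoted in this thread.

```python
import subprocess
import sys


def human_size(num_bytes: int) -> str:
    """Format a byte count using decimal units (1 GB = 10**9 bytes)."""
    size = float(num_bytes)
    for unit in ("B", "kB", "MB", "GB"):
        if size < 1000:
            return f"{size:.1f} {unit}"
        size /= 1000
    return f"{size:.1f} TB"


def local_image_size(image: str) -> int:
    """Return the size in bytes of a locally pulled image, as reported by Docker.

    Assumes the `docker` CLI is on PATH and the image is present locally.
    """
    result = subprocess.run(
        ["docker", "image", "inspect", "--format", "{{.Size}}", image],
        check=True,
        capture_output=True,
        text=True,
    )
    return int(result.stdout.strip())


if __name__ == "__main__":
    # Image references come from the command line; none are hard-coded here.
    for image in sys.argv[1:]:
        print(image, human_size(local_image_size(image)))
```

Saved under a hypothetical name such as `image_sizes.py`, it could be run as `python image_sizes.py rocm/vllm:<tag> rocm/vllm-dev:<tag>`, with `<tag>` replaced by the tags you actually pulled.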