[Misc]: How to access the KV cache directly? #4156

Open
BDHU opened this issue Apr 18, 2024 · 6 comments

Comments

@BDHU

BDHU commented Apr 18, 2024

Anything you want to discuss about vLLM.

I'm looking to conduct an experiment that involves copying the contents of the KV cache between nodes. I'm not very familiar with the codebase. Is there any way to access the page table/KV cache directly? Where should I start? Any suggestions would be helpful!

@BDHU BDHU added the misc label Apr 18, 2024
@duanzhaol

Curious about this topic too. I want to implement a simple request transfer (including the KV cache) between nodes. #2809 seems to have done it, but it only supports InfiniBand and has a dependency on MSCCL++.

@BDHU
Author

BDHU commented May 7, 2024

Any updates on this?

@tanejaaryan

Interested in this as well. Can anyone suggest a few first steps?

@CSEEduanyu

Just use a CUDA IPC handle (cudaIpcGetMemHandle / cudaIpcOpenMemHandle) and cudaMemcpy.
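
As a rough illustration of what a handle-and-copy transfer would have to move: a paged KV cache is addressed through a per-sequence block table, so the sender needs to gather the physical blocks a sequence owns, and the receiver needs to place them into its own free blocks and rebuild the table. The sketch below is a toy, pure-Python model of that idea only. `ToyPagedCache`, `export_seq`, `import_seq`, and `BLOCK_SIZE = 4` are hypothetical names and values chosen for illustration, not vLLM's API, and the list copies stand in for what would be device memory copies in a real setup.

```python
BLOCK_SIZE = 4  # tokens per block (hypothetical value for this sketch)


class ToyPagedCache:
    """Toy stand-in for a paged KV cache. NOT vLLM's real data structure."""

    def __init__(self, num_blocks):
        # Physical storage: each block holds BLOCK_SIZE per-token KV entries.
        self.blocks = [[None] * BLOCK_SIZE for _ in range(num_blocks)]
        self.free = list(range(num_blocks))  # indices of unused blocks
        self.block_table = {}  # seq_id -> ordered physical block indices
        self.seq_len = {}      # seq_id -> number of tokens stored

    def append(self, seq_id, kv):
        """Store the KV entry for the next token of `seq_id`."""
        table = self.block_table.setdefault(seq_id, [])
        pos = self.seq_len.get(seq_id, 0)
        if pos % BLOCK_SIZE == 0:  # current block is full (or none yet)
            table.append(self.free.pop())
        self.blocks[table[-1]][pos % BLOCK_SIZE] = kv
        self.seq_len[seq_id] = pos + 1

    def read(self, seq_id, pos):
        """Translate a logical token position through the block table."""
        table = self.block_table[seq_id]
        return self.blocks[table[pos // BLOCK_SIZE]][pos % BLOCK_SIZE]

    def export_seq(self, seq_id):
        """Gather a sequence's blocks in logical order (the payload to send)."""
        data = [list(self.blocks[b]) for b in self.block_table[seq_id]]
        return self.seq_len[seq_id], data

    def import_seq(self, seq_id, payload):
        """Scatter received blocks into local free blocks, rebuild the table."""
        length, data = payload
        table = []
        for block in data:
            b = self.free.pop()
            self.blocks[b] = list(block)
            table.append(b)
        self.block_table[seq_id] = table
        self.seq_len[seq_id] = length


# Usage: fill a sequence on the "source node", then move it to the "destination".
src = ToyPagedCache(num_blocks=8)
dst = ToyPagedCache(num_blocks=8)
for i in range(6):
    src.append("req-0", (f"k{i}", f"v{i}"))

dst.append("other", ("k", "v"))  # destination already holds another request
payload = src.export_seq("req-0")
dst.import_seq("req-0", payload)
```

The physical block indices differ on the two sides, which is why the block table has to be rebuilt on the receiver rather than copied verbatim; only the logical order of the blocks is preserved.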

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

@github-actions github-actions bot added the stale label Oct 29, 2024
@J1nLo

J1nLo commented Nov 19, 2024

Any updates on this?

@github-actions github-actions bot added unstale and removed stale labels Nov 21, 2024