[Misc]: How to access the KV cache directly? #4156

BDHU · 2024-04-18T02:09:45Z

Anything you want to discuss about vllm.

I'm looking to conduct an experiment, which involves copying the contents of KV cache between nodes. I'm not super familiar with the codebase, is there any way to access the page table/KV cache directly? Where do I start? Any suggestions are helpful!

duanzhaol · 2024-04-20T08:00:14Z

Curios about this topic too, I want to implement a simple request transfer (including kv cache) between nodes. #2809 seems did it, but only support with infiniband, and has a dependency on MSCCL++.

BDHU · 2024-05-07T22:16:55Z

Any updates on this?

tanejaaryan · 2024-06-03T15:18:34Z

interested in this as well, can anyone guide a few first steps?

CSEEduanyu · 2024-07-23T11:55:11Z

just use cudaIPChandle and cudamemcopy

github-actions · 2024-10-29T02:01:19Z

This issue has been automatically marked as stale because it has not had any activity within 90 days. It will be automatically closed if no further activity occurs within 30 days. Leave a comment if you feel this issue should remain open. Thank you!

J1nLo · 2024-11-19T08:01:29Z

Any updates on this?

BDHU added the misc label Apr 18, 2024

github-actions bot added the stale label Oct 29, 2024

github-actions bot added unstale and removed stale labels Nov 21, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Misc]: How to access the KV cache directly? #4156

[Misc]: How to access the KV cache directly? #4156

BDHU commented Apr 18, 2024

duanzhaol commented Apr 20, 2024

BDHU commented May 7, 2024

tanejaaryan commented Jun 3, 2024

CSEEduanyu commented Jul 23, 2024

github-actions bot commented Oct 29, 2024

J1nLo commented Nov 19, 2024

[Misc]: How to access the KV cache directly? #4156

[Misc]: How to access the KV cache directly? #4156

Comments

BDHU commented Apr 18, 2024

Anything you want to discuss about vllm.

duanzhaol commented Apr 20, 2024

BDHU commented May 7, 2024

tanejaaryan commented Jun 3, 2024

CSEEduanyu commented Jul 23, 2024

github-actions bot commented Oct 29, 2024

J1nLo commented Nov 19, 2024