[V1] Support cross-layer KV sharing by sarckk · Pull Request #18212 · vllm-project/vllm (original) (raw)

@mergify Bot added v1 tpu

Related to Google TPUs

labels

May 15, 2025

@sarckk sarckk marked this pull request as ready for review

May 15, 2025 17:31

luccafong

luccafong

luccafong

heheda12345

heheda12345

heheda12345

heheda12345

heheda12345

Yong Hoon Shin added 6 commits

June 3, 2025 07:09

Signed-off-by: Yong Hoon Shin yhshin@meta.com

Signed-off-by: Yong Hoon Shin yhshin@meta.com

Signed-off-by: Yong Hoon Shin yhshin@meta.com

Signed-off-by: Yong Hoon Shin yhshin@meta.com

Signed-off-by: Yong Hoon Shin yhshin@meta.com

Signed-off-by: Yong Hoon Shin yhshin@meta.com

auto-merge was automatically disabled

June 3, 2025 15:22

Head branch was pushed to by a user without write access

This was referenced

Jun 19, 2025

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@sarckk

Signed-off-by: Yong Hoon Shin yhshin@meta.com

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})