[V1] Support cross-layer KV sharing by sarckk · Pull Request #18212 · vllm-project/vllm (original) (raw)
Related to Google TPUs
labels
sarckk marked this pull request as ready for review
Yong Hoon Shin added 6 commits
Signed-off-by: Yong Hoon Shin yhshin@meta.com
Signed-off-by: Yong Hoon Shin yhshin@meta.com
Signed-off-by: Yong Hoon Shin yhshin@meta.com
Signed-off-by: Yong Hoon Shin yhshin@meta.com
Signed-off-by: Yong Hoon Shin yhshin@meta.com
Signed-off-by: Yong Hoon Shin yhshin@meta.com
auto-merge was automatically disabled
Head branch was pushed to by a user without write access
This was referenced
Jun 19, 2025
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Yong Hoon Shin yhshin@meta.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})