Eagle: MM Cuda Graphs with MRope by IzzyPutterman · Pull Request #28896 · vllm-project/vllm (original) (raw)
[](/apps/gemini-code-assist)
[](/apps/chatgpt-codex-connector)
benchislett added the ready
ONLY add when PR is ready to merge/full CI is needed
label
Signed-off-by: Izzy Putterman iputterman@nvidia.com
devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Izzy Putterman iputterman@nvidia.com Co-authored-by: Cyrus Leung tlleungac@connect.ust.hk
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})