[Model Runner V2] Implement multi-step Eagle with CUDA graph by WoosukKwon · Pull Request #29559 · vllm-project/vllm

added 2 commits

November 27, 2025 00:39

@WoosukKwon

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>

[gemini-code-assist[bot]](/apps/gemini-code-assist)

[chatgpt-codex-connector[bot]](/apps/chatgpt-codex-connector)

kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request

Dec 1, 2025

@WoosukKwon @kitaekatt

…oject#29559)

amd-hhashemi pushed a commit to amd-hhashemi/vllm that referenced this pull request

Dec 2, 2025

@WoosukKwon @amd-hhashemi

…oject#29559)

Signed-off-by: Woosuk Kwon <woosuk.kwon@berkeley.edu>
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request

May 10, 2026

@WoosukKwon

…oject#29559)

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@WoosukKwon

…oject#29559)