[Bugfix] Fix FA3 full cuda graph correctness by WoosukKwon · Pull Request #19106 · vllm-project/vllm (original) (raw)

@WoosukKwon

[gemini-code-assist[bot]](/apps/gemini-code-assist)

@WoosukKwon WoosukKwon added the ready

ONLY add when PR is ready to merge/full CI is needed

label

Jun 3, 2025

[gemini-code-assist[bot]](/apps/gemini-code-assist)

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

tlrmchlsmth

ProExpertProg

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

houseroad

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

@vllm-bot vllm-bot deleted the fix-fa3-full-cuda-graph branch

June 4, 2025 06:10

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@WoosukKwon

Signed-off-by: Woosuk Kwon woosuk.kwon@berkeley.edu

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})