[V1] Use FlashInfer by default on Blackwell GPUs by mgoin · Pull Request #19118 · vllm-project/vllm (original) (raw)

@mgoin

Signed-off-by: mgoin mgoin64@gmail.com

[gemini-code-assist[bot]](/apps/gemini-code-assist)

[gemini-code-assist[bot]](/apps/gemini-code-assist)

@mgoin

Signed-off-by: mgoin mgoin64@gmail.com

houseroad

@mgoin

Signed-off-by: mgoin mgoin64@gmail.com

@mgoin

@mgoin

Signed-off-by: mgoin mgoin64@gmail.com

@mgoin mgoin changed the titleUse FlashInfer by default on Blackwell GPUs [V1] Use FlashInfer by default on Blackwell GPUs

Jun 4, 2025

simon-mo

youkaichao

@mgoin mgoin mentioned this pull request

Jun 11, 2025

leo-li-opus pushed a commit to leo-li-opus/vllm that referenced this pull request

Jul 22, 2025

@mgoin @leo-li-opus

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@mgoin

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})