[v1] Add fp32 support to v1 engine through flex attn by Isotr0py · Pull Request #19319 · vllm-project/vllm

@Isotr0py

Signed-off-by: Isotr0py 2037008807@qq.com

[gemini-code-assist[bot]](/apps/gemini-code-assist)

houseroad

houseroad added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Jun 8, 2025

@Isotr0py

Signed-off-by: Isotr0py 2037008807@qq.com

@Isotr0py

Signed-off-by: Isotr0py mozf@mail2.sysu.edu.cn

houseroad

Isotr0py added a commit that referenced this pull request on Jun 10, 2025

@Isotr0py

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request on May 19, 2026

@Isotr0py

Signed-off-by: Isotr0py 2037008807@qq.com
Signed-off-by: Isotr0py mozf@mail2.sysu.edu.cn
