Enable bitsandbytes quantization on AMD GPUs that use warp size 32 by sstamenk · Pull Request #27307 · vllm-project/vllm (original) (raw)

Bot added the rocm

Related to AMD ROCm

label

Oct 21, 2025

sstamenk marked this pull request as ready for review

November 16, 2025 03:06

[ chatgpt-codex-connector[bot] ](/apps/chatgpt-codex-connector)

Signed-off-by: sstamenk strahinja.stamenkovic@amd.com

tjtanaa added the ready

ONLY add when PR is ready to merge/full CI is needed

label

Nov 18, 2025

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request

Nov 29, 2025

kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request

Dec 1, 2025

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request

May 10, 2026

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})