Enable bitsandbytes quantization on AMD GPUs that use warp size 32 by sstamenk · Pull Request #27307 · vllm-project/vllm

@mergify bot added the rocm label (Related to AMD ROCm) on Oct 21, 2025

@sstamenk marked this pull request as ready for review on November 16, 2025 03:06

@sstamenk

Signed-off-by: sstamenk <strahinja.stamenkovic@amd.com>

tjtanaa

@tjtanaa added the ready label (ONLY add when PR is ready to merge/full CI is needed) on Nov 18, 2025

@tjtanaa

devpatelio pushed a commit to SumanthRH/vllm that referenced this pull request on Nov 29, 2025 (@sstamenk, @devpatelio)

kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request on Dec 1, 2025 (@sstamenk, @kitaekatt)

mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request on May 10, 2026 (@sstamenk)

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request on May 19, 2026 (@sstamenk)
