[Misc] Support quantization of MllamaForCausalLM by mgoin · Pull Request #8822 · vllm-project/vllm (original) (raw)

@mgoin

ywang96

approved these changes Sep 25, 2024

@comaniac comaniac added the ready

ONLY add when PR is ready to merge/full CI is needed

label

Sep 25, 2024

Alvant pushed a commit to compressa-ai/vllm that referenced this pull request

Oct 26, 2024

@mgoin @Alvant

Signed-off-by: Alvant alvasian@yandex.ru

garg-amit pushed a commit to garg-amit/vllm that referenced this pull request

Oct 28, 2024

@mgoin @garg-amit

Signed-off-by: Amit Garg mitgarg17495@gmail.com

sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request

Nov 14, 2024

@mgoin @sumitd2

Signed-off-by: Sumit Dubey sumit.dubey2@ibm.com

LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request

Mar 26, 2025

@mgoin @LeiWang1999

Signed-off-by: LeiWang1999 leiwang1999@outlook.com

This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters

[ Show hidden characters]({{ revealButtonHref }})