[Misc] Support quantization of MllamaForCausalLM by mgoin · Pull Request #8822 · vllm-project/vllm (original) (raw)
approved these changes Sep 25, 2024
ONLY add when PR is ready to merge/full CI is needed
label
Alvant pushed a commit to compressa-ai/vllm that referenced this pull request
Signed-off-by: Alvant alvasian@yandex.ru
garg-amit pushed a commit to garg-amit/vllm that referenced this pull request
Signed-off-by: Amit Garg mitgarg17495@gmail.com
sumitd2 pushed a commit to sumitd2/vllm that referenced this pull request
Signed-off-by: Sumit Dubey sumit.dubey2@ibm.com
LeiWang1999 pushed a commit to LeiWang1999/vllm-bitblas that referenced this pull request
Signed-off-by: LeiWang1999 leiwang1999@outlook.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})