[Neuron] Support quantization on neuron by aws-satyajith · Pull Request #18283 · vllm-project/vllm

mgoin

@aws-satyajith

Co-authored-by: Elaine Zhao <elaineyz@amazon.com>
Co-authored-by: Tailin Pan <tailinpa@amazon.com>

Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>

@aws-satyajith aws-satyajith changed the title from "Support quantization on neuron" to "[Neuron] Support quantization on neuron"

May 22, 2025

@mgoin mgoin enabled auto-merge (squash)

May 27, 2025 20:13

amitm02 pushed a commit to amitm02/vllm that referenced this pull request

Jun 1, 2025

@aws-satyajith @amitm02

Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
Signed-off-by: amit <amit.man@gmail.com>

0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request

May 19, 2026

@aws-satyajith

Signed-off-by: Satyajith Chilappagari <satchill@amazon.com>
