[Kernel][Quantization] add w4a8 support for marlin kernel by jinzhen-lin · Pull Request #24722 · vllm-project/vllm
added 10 commits
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
jinzhen-lin changed the title from "Marlin a8" to "[Kernel][Quantization] add w4a8 support for marlin kernel"
kitaekatt pushed a commit to kitaekatt/vllm that referenced this pull request
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
amd-hhashemi pushed a commit to amd-hhashemi/vllm that referenced this pull request
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
Signed-off-by: Hashem Hashemi <hashem.hashemi@amd.com>
Tmn07 mentioned this pull request
mgoin mentioned this pull request
ehfd mentioned this pull request
mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
my-other-github-account pushed a commit to my-other-github-account/vllm that referenced this pull request
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Jinzhen Lin <jinzhen.ljz@antgroup.com>
Signed-off-by: Michael Goin <mgoin64@gmail.com>
Signed-off-by: Jinzhen Lin <linjinzhen@hotmail.com>
Co-authored-by: Michael Goin <mgoin64@gmail.com>
Co-authored-by: Michael Goin <mgoin@redhat.com>