[Hardware][SM100] Add TRTLLM Kernel for INT4 W4A16 Kernel. by pavanimajety · Pull Request #32437 · vllm-project/vllm (original) (raw)
[](/apps/gemini-code-assist)
[](/apps/cursor)
Add test at kernel level
Signed-off-by: Pavani Majety pmajety@nvidia.com
Signed-off-by: Pavani Majety pmajety@nvidia.com
Add logging
Signed-off-by: Pavani Majety pmajety@nvidia.com
Signed-off-by: Pavani Majety pmajety@nvidia.com
Signed-off-by: Pavani Majety pmajety@nvidia.com
Signed-off-by: Pavani Majety pmajety@nvidia.com
PiratePai pushed a commit to PiratePai/epd_shm that referenced this pull request
Signed-off-by: Pavani Majety pmajety@nvidia.com Signed-off-by: Pai 416932041@qq.com
mystous pushed a commit to mystous/vllm_hybrid that referenced this pull request
Signed-off-by: Pavani Majety pmajety@nvidia.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Pavani Majety pmajety@nvidia.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})