[Kernel] Integrate batched/masked deepgemm kernel by varun-sundar-rabindranath · Pull Request #19111 · vllm-project/vllm
Signed-off-by: Varun <vsundarr@redhat.com>
tlrmchlsmth added the ready label (ONLY add when PR is ready to merge/full CI is needed)
leo-li-opus pushed a commit to leo-li-opus/vllm that referenced this pull request
Signed-off-by: Varun <vsundarr@redhat.com>
Co-authored-by: Varun <vsundarr@redhat.com>
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: Varun <vsundarr@redhat.com>
Co-authored-by: Varun <vsundarr@redhat.com>