[Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix by hongxiayang · Pull Request #18100 · vllm-project/vllm (original) (raw)
changed the title
[Bugfix] fix the power of 2 exception when running llama4 model in tr… [Bugfix] fix the power of 2 exception when running llama4 model in triton_unified_attention.py
hongxiayang changed the title
[Bugfix] fix the power of 2 exception when running llama4 model in triton_unified_attention.py [Bugfix] fix the power of 2 exception from triton_unified_attention.py when running llama4 models
This was referenced
May 15, 2025
…iton_unified_attention.py
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
Signed-off-by: tjtanaa tunjian.tan@embeddedllm.com
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
hongxiayang added the ready
ONLY add when PR is ready to merge/full CI is needed
label
Signed-off-by: Hongxia Yang hongxia.yang@amd.com
hongxiayang changed the title
[Bugfix] fix the power of 2 exception from triton_unified_attention.py when running llama4 models [Bugfix][ROCm] fix the power of 2 exception from triton_unified_attention.py when running llama4 models and unit test fix
amitm02 pushed a commit to amitm02/vllm that referenced this pull request
…tion.py when running llama4 models and unit test fix (vllm-project#18100)
Signed-off-by: Hongxia Yang hongxia.yang@amd.com Signed-off-by: tjtanaa tunjian.tan@embeddedllm.com Co-authored-by: tjtanaa tunjian.tan@embeddedllm.com Signed-off-by: amit amit.man@gmail.com
bringlein added a commit to bringlein/vllm that referenced this pull request
Signed-off-by: Burkhard Ringlein ngl@zurich.ibm.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
…tion.py when running llama4 models and unit test fix (vllm-project#18100)
Signed-off-by: Hongxia Yang hongxia.yang@amd.com Signed-off-by: tjtanaa tunjian.tan@embeddedllm.com Co-authored-by: tjtanaa tunjian.tan@embeddedllm.com
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})