Update test_flash_attn.py by ShuaibinLi · Pull Request #17102 · vllm-project/vllm (original) (raw)
fix flash_attn_fp8 test
Signed-off-by: ShuaibinLi lishuaibin@live.cn
jikunshang pushed a commit to jikunshang/vllm that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn
lk-chen pushed a commit to lk-chen/vllm that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn
adobrzyn pushed a commit to HabanaAI/vllm-fork that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn Signed-off-by: Agata Dobrzyniewicz adobrzyniewicz@habana.ai
RichardoMrMu pushed a commit to RichardoMrMu/vllm that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn Signed-off-by: Mu Huai tianbowen.tbw@antgroup.com
zzzyq pushed a commit to zzzyq/vllm that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn Signed-off-by: Yuqi Zhang yuqizhang@google.com
0826joyce pushed a commit to 0826joyce/vllm-serving-optimization that referenced this pull request
Signed-off-by: ShuaibinLi lishuaibin@live.cn
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.Learn more about bidirectional Unicode characters
[ Show hidden characters]({{ revealButtonHref }})