Blockwise Scaling for FP8 by manishucsd · Pull Request #1932 · NVIDIA/cutlass (original) (raw)

sijialouintel added a commit to sijialouintel/cutlass that referenced this pull request

Co-authored-by: Haicheng Wu 57973641+hwu36@users.noreply.github.com


Co-authored-by: Haicheng Wu 57973641+hwu36@users.noreply.github.com

fix compile with cmake .. -DCUTLASS_ENABLE_TESTS=ON -DCUTLASS_TEST_LEVEL=2

Co-authored-by: Siyuan Fu siyuanf@nvidia.com


Co-authored-by: Siyuan Fu siyuanf@nvidia.com

Previously we had this error:

  File "/storage/home/cutlass/python/cutlass/backend/operation.py", line 39, in <listcomp>
    _version_splits = [int(x) for x in __version__.split("rc")[0].split(".")]
                       ^^^^^^
ValueError: invalid literal for int() with base 10: 'post1'

Co-authored-by: Jack Kosaian jackkosaian@gmail.com

Co-authored-by: Jack Kosaian jackkosaian@gmail.com


Co-authored-by: Jack Kosaian jackkosaian@gmail.com


Co-authored-by: yuzhai yuzhai@nvidia.com Co-authored-by: Haicheng Wu haichengw@nvidia.com

Shouldn't this be BLK_M, BLK_K, k

Co-authored-by: yuzhai yuzhai@nvidia.com


Co-authored-by: Haicheng Wu haichengw@nvidia.com


Co-authored-by: Haicheng Wu haichengw@nvidia.com


Co-authored-by: yuzhai yuzhai@nvidia.com Co-authored-by: Haicheng Wu haichengw@nvidia.com


Co-authored-by: yuzhai yuzhai@nvidia.com

This reverts commit b353e36.


Co-authored-by: Haicheng Wu 57973641+hwu36@users.noreply.github.com Co-authored-by: Haicheng Wu haichengw@nvidia.com


Co-authored-by: zl zl@deepseek.com Co-authored-by: Haicheng Wu haichengw@nvidia.com

Co-authored-by: Ali Hassani 68103095+alihassanijr@users.noreply.github.com

Co-authored-by: Ali Hassani 68103095+alihassanijr@users.noreply.github.com


Co-authored-by: Ali Hassani 68103095+alihassanijr@users.noreply.github.com


Co-authored-by: Haicheng Wu haichengw@nvidia.com


Co-authored-by: yuzhai yuzhai@nvidia.com


Co-authored-by: Saagar Jha saagar@saagarjha.com Co-authored-by: Haicheng Wu haichengw@nvidia.com Co-authored-by: Sergey Klevtsov 141879860+sklevtsov-nvidia@users.noreply.github.com Co-authored-by: Tri Dao tridao@users.noreply.github.com Co-authored-by: Xinyu Yang ltyxy@buaa.edu.cn Co-authored-by: sijialou sijia.lou@intel.com Co-authored-by: Bogumil Sapinski Mobica 48835513+Bogumil-Sapinski-Mobica@users.noreply.github.com Co-authored-by: Haicheng Wu 57973641+hwu36@users.noreply.github.com Co-authored-by: Lei Mao dukeleimao@gmail.com Co-authored-by: 103yiran 1039105206@qq.com Co-authored-by: MaxAkaAltmer MaxAkaAltmer@yandex.ru Co-authored-by: 侯奇 houqi1993@gmail.com Co-authored-by: Lain 28486541+IwakuraRein@users.noreply.github.com Co-authored-by: Siyuan Fu siyuanf@nvidia.com Co-authored-by: Caleb_Du 59528230+CalebDu@users.noreply.github.com Co-authored-by: LiYu Lu luliyucoordinate@outlook.com Co-authored-by: azhurkevich 101208641+azhurkevich@users.noreply.github.com Co-authored-by: chenwei 15601910741@163.com Co-authored-by: Wenlei Bao 142055114+wenlei-bao@users.noreply.github.com Co-authored-by: LiuQiang thorneliu@gmail.com Co-authored-by: dan_the_3rd 43445237+danthe3rd@users.noreply.github.com Co-authored-by: Jack Kosaian jackkosaian@gmail.com Co-authored-by: Yujia Zhai yzhai015@ucr.edu Co-authored-by: yuzhai yuzhai@nvidia.com Co-authored-by: Andrew O'Neill foolusion@gmail.com Co-authored-by: Dongxu.Wang wangdongxuking61@gmail.com Co-authored-by: ZZK 359521840@qq.com Co-authored-by: Driss Guessous 32754868+drisspg@users.noreply.github.com Co-authored-by: ZincCat 52513999+zinccat@users.noreply.github.com Co-authored-by: Manish Gupta mgupta.iitr@gmail.com Co-authored-by: bobliao codechaser@163.com Co-authored-by: mihir-awatramani 162148077+mihir-awatramani@users.noreply.github.com Co-authored-by: Liang 44948473+soundOfDestiny@users.noreply.github.com Co-authored-by: zl zl@deepseek.com Co-authored-by: Tadej Ciglarič tadej.c@gmail.com Co-authored-by: Ali Hassani 68103095+alihassanijr@users.noreply.github.com Co-authored-by: Josh Fromm jwfromm@meta.com