Fix compilation bug with CUDA 12.1 by Edenzzzz · Pull Request #949 · NVIDIA/TransformerEngine
Description
Fixes # (issue)
Type of change
- Documentation change (change only to the documentation, either a fix or a new content)
- Bug fix (non-breaking change which fixes an issue)
- New feature (non-breaking change which adds functionality)
- Breaking change (fix or feature that would cause existing functionality to not work as expected)
- Infra/Build change
- Code refactor
Changes
This was already mentioned in #560, but the change was later reverted... Including <cuda_fp8.h>, which itself includes <cuda_bf16.h>, after nv_bfloat16 has been defined triggers a redeclaration error.
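The include-order point can be illustrated with a minimal sketch. It is not the actual diff from this PR: the alias `sketch_bf16`, the macro `SKETCH_HAS_CUDA_FP8`, and both helper functions are hypothetical names, and the fallback `struct` exists only so the snippet builds on a machine without the CUDA toolkit. The idea is that the vendor header comes first, so `nv_bfloat16` is declared exactly once before any local code refers to it.

```cpp
#include <cassert>

#if __has_include(<cuda_fp8.h>)
// Vendor header first: cuda_fp8.h includes cuda_bf16.h itself,
// so nv_bfloat16 is declared here and nowhere earlier.
#include <cuda_fp8.h>
#define SKETCH_HAS_CUDA_FP8 1
#else
#define SKETCH_HAS_CUDA_FP8 0
// Assumption: stand-in type used only when the CUDA headers are absent.
struct nv_bfloat16 { unsigned short bits; };
#endif

// Local aliases or macros that mention nv_bfloat16 go after the vendor
// includes. sketch_bf16 is a hypothetical name for illustration.
using sketch_bf16 = nv_bfloat16;

// Size of the bf16 storage type in bytes.
int bf16_size_bytes() { return static_cast<int>(sizeof(sketch_bf16)); }

// Runtime sanity check: a bf16 value occupies 16 bits.
void check_layout() { assert(sizeof(sketch_bf16) == 2); }
```

Reversing the order, that is, introducing a local definition named `nv_bfloat16` and only then including `<cuda_fp8.h>`, is the pattern the description above reports as failing with CUDA 12.1.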
Checklist:
- I have read and followed the contributing guidelines
- The functionality is complete
- I have commented my code, particularly in hard-to-understand areas
- I have made corresponding changes to the documentation
- My changes generate no new warnings
- I have added tests that prove my fix is effective or that my feature works
- New and existing unit tests pass locally with my changes