Converting CUDA program to mlir (gpu, linalg etc.) MLIR cuda , gpu , llvm |
|
9 |
161 |
May 21, 2025 |
Help with MLIR CUDA Stream Management for Multiple CUDNN Convolutions MLIR cuda , gpu , runtime |
|
2 |
74 |
May 13, 2025 |
In clang cuda compiling, Can I call gcc with host code build, and use clang+llvm with device code build? Clang Frontend cuda |
|
3 |
54 |
March 14, 2025 |
[RFC] Use the 'new' offloding driver for CUDA and HIP compilation by default Clang Frontend cuda , hip , gpu |
|
27 |
816 |
January 10, 2025 |
Std::invoke_result_t of functor's __device__ operator() Using Clang cuda |
|
5 |
60 |
December 20, 2024 |
CMake CUDA opinions Beginners cuda , gpu , clang , cmake |
|
2 |
88 |
December 9, 2024 |
Copy capture rules for [=, *this] Clang Frontend cuda , gpu |
|
5 |
153 |
March 12, 2024 |
Showcasing LLVM/Offload Runtimes cuda , offload |
|
0 |
883 |
December 15, 2023 |
How to implement vectorloadStore pass for a new GPU backend? Code Generation cuda , gpu , nvptx , llvm |
|
1 |
326 |
June 28, 2023 |
About the plan for CUDA Fortran support in Flang Flang cuda , gpu |
|
0 |
367 |
July 3, 2023 |
LLVM reordering blocks breaks ptxas divergence analysis IR & Optimizations cuda , ptx , gpu |
|
31 |
748 |
June 13, 2023 |
Compiling CUDA code fails Using Clang cuda , gpu |
|
16 |
3046 |
May 25, 2023 |
Does MLIR supports CUDA source code generation? MLIR cuda , gpu |
|
3 |
1112 |
May 17, 2023 |
CUCLANG struggle with cooperative groups headers Using Clang cuda , gpu , clang , llvm |
|
4 |
486 |
February 28, 2023 |
[RFC] Floating-point accuracy control IR & Optimizations cuda , gpu , rfc |
|
32 |
2828 |
February 11, 2023 |
NVPTX: SyncScope/AtomicOrdering of atomicrmw support? LLVM Project cuda , nvptx |
|
1 |
222 |
February 7, 2023 |
CUDA Support for clang-tidy clang-tidy cuda |
|
6 |
733 |
July 25, 2022 |
Cannot pass __device__ function as template parameter in CUDA? Using Clang cuda , gpu |
|
3 |
999 |
June 28, 2022 |
Clang++ 15.0.0 with OpenMP offloading to nVidia GPU on Windows with VS2022CE - too many errors Using Clang cuda |
|
4 |
1556 |
June 5, 2022 |
LLVM@14.0.0 doesn't support well on CUDA@11.5.0 about variadic function and other definitions Clang Frontend cuda |
|
8 |
1044 |
May 13, 2022 |
[CUDA] CUDA device code does not support variadic functions in clang Clang Frontend cuda , clang |
|
1 |
1087 |
February 24, 2022 |
NVPTX: Calling convention for aggregate arguments passed by value Clang Frontend cuda , nvptx |
|
13 |
678 |
January 24, 2022 |