Topics tagged cuda

| Topic | Category | Tags | Replies | Views | Activity |
|---|---|---|---|---|---|
| Converting CUDA program to mlir (gpu, linalg etc.) | MLIR | cuda, gpu, llvm | 9 | 161 | May 21, 2025 |
| Help with MLIR CUDA Stream Management for Multiple CUDNN Convolutions | MLIR | cuda, gpu, runtime | 2 | 74 | May 13, 2025 |
| In clang cuda compiling, Can I call gcc with host code build, and use clang+llvm with device code build? | Clang Frontend | cuda | 3 | 54 | March 14, 2025 |
| [RFC] Use the 'new' offloding driver for CUDA and HIP compilation by default | Clang Frontend | cuda, hip, gpu | 27 | 816 | January 10, 2025 |
| Std::invoke_result_t of functor's __device__ operator() | Using Clang | cuda | 5 | 60 | December 20, 2024 |
| CMake CUDA opinions | Beginners | cuda, gpu, clang, cmake | 2 | 88 | December 9, 2024 |
| Copy capture rules for [=, *this] | Clang Frontend | cuda, gpu | 5 | 153 | March 12, 2024 |
| Showcasing LLVM/Offload | Runtimes | cuda, offload | 0 | 883 | December 15, 2023 |
| How to implement vectorloadStore pass for a new GPU backend? | Code Generation | cuda, gpu, nvptx, llvm | 1 | 326 | June 28, 2023 |
| About the plan for CUDA Fortran support in Flang | Flang | cuda, gpu | 0 | 367 | July 3, 2023 |
| LLVM reordering blocks breaks ptxas divergence analysis | IR & Optimizations | cuda, ptx, gpu | 31 | 748 | June 13, 2023 |
| Compiling CUDA code fails | Using Clang | cuda, gpu | 16 | 3046 | May 25, 2023 |
| Does MLIR supports CUDA source code generation? | MLIR | cuda, gpu | 3 | 1112 | May 17, 2023 |
| CUCLANG struggle with cooperative groups headers | Using Clang | cuda, gpu, clang, llvm | 4 | 486 | February 28, 2023 |
| [RFC] Floating-point accuracy control | IR & Optimizations | cuda, gpu, rfc | 32 | 2828 | February 11, 2023 |
| NVPTX: SyncScope/AtomicOrdering of atomicrmw support? | LLVM Project | cuda, nvptx | 1 | 222 | February 7, 2023 |
| CUDA Support for clang-tidy | clang-tidy | cuda | 6 | 733 | July 25, 2022 |
| Cannot pass __device__ function as template parameter in CUDA? | Using Clang | cuda, gpu | 3 | 999 | June 28, 2022 |
| Clang++ 15.0.0 with OpenMP offloading to nVidia GPU on Windows with VS2022CE - too many errors | Using Clang | cuda | 4 | 1556 | June 5, 2022 |
| LLVM@14.0.0 doesn't support well on CUDA@11.5.0 about variadic function and other definitions | Clang Frontend | cuda | 8 | 1044 | May 13, 2022 |
| [CUDA] CUDA device code does not support variadic functions in clang | Clang Frontend | cuda, clang | 1 | 1087 | February 24, 2022 |
| NVPTX: Calling convention for aggregate arguments passed by value | Clang Frontend | cuda, nvptx | 13 | 678 | January 24, 2022 |