Topics tagged gpu (original) (raw)
16
72
June 2, 2025
[RFC] LLVM policy for top level directories and language runtimes
17
459
May 30, 2025
[mlir][vector distribution] WarpOpScfForOp fails when scf.for has results that are unused
2
79
May 28, 2025
Converting CUDA program to mlir (gpu, linalg etc.)
9
162
May 21, 2025
[RFC] Add GPU operations to permute data in 2 loaded mma_matrix
5
171
May 19, 2025
Help with MLIR CUDA Stream Management for Multiple CUDNN Convolutions
2
74
May 13, 2025
[libc][GSoC 2025] Direct I/O from the GPU with io_uring
13
759
May 11, 2025
[RFC] Proposal for Offload Execution Test Suite
17
484
May 7, 2025
Tablegen pattern that uses the same load twice
0
40
May 7, 2025
How do I get the future index of a symbol in the AsmPrinter stage?
4
102
April 29, 2025
[RFC] Adding opaque types to LLVM IR
31
3598
April 28, 2025
How to lower the combination of async gpu ops in `gpu` Dialect
18
816
April 22, 2025
About ParallelLoopMapper Pass Design issues
0
38
April 18, 2025
Question about GPU Dialect Async Tokens in MLIR
4
84
April 15, 2025
[RFC] MLIR types with encoding
37
675
April 14, 2025
2
390
April 18, 2024
10
708
April 6, 2025
[libc][GSoC 2025] Profiling and testing of the LLVM libc GPU math
17
723
April 6, 2025
Are there components in MLIR for analyzing GPU kernel dependencies and scheduling?
0
46
March 31, 2025
Seeking Guidance on Executing MLIR Code with GPU Dialect on GPU
2
98
March 28, 2025
OpenMP Offload Fortran Tests Pass with Flang-new
6
296
March 27, 2025
2
67
March 20, 2025
RFC: SPIRV IR as a vendor agnostic GPU representation
9
226
March 17, 2025
"An exception was thrown: Native API failed. Native API returns: 20 (UR_RESULT_ERROR_DEVICE_LOST)."
0
41
March 14, 2025
1
52
March 14, 2025
How to handle host-side global data automatically when lowering to GPU with MLIR?
2
81
January 15, 2025
How to Implement Asynchronous Concurrent Execution Between gpu.launch Operations?
4
78
March 11, 2025
Support mgpuMemcpy runtime call in SyclRuntimeWrappers
0
34
March 6, 2025
2
81
March 6, 2025
How to better implement operation-level parallelism?
10
208
March 6, 2025