Topics tagged gpu

| Topic | Replies | Views | Activity |
| --- | --- | --- | --- |
| GPU Offloading Docker Image | 16 | 72 | June 2, 2025 |
| [RFC] LLVM policy for top level directories and language runtimes | 17 | 459 | May 30, 2025 |
| [mlir][vector distribution] WarpOpScfForOp fails when scf.for has results that are unused | 2 | 79 | May 28, 2025 |
| Converting CUDA program to mlir (gpu, linalg etc.) | 9 | 162 | May 21, 2025 |
| [RFC] Add GPU operations to permute data in 2 loaded mma_matrix | 5 | 171 | May 19, 2025 |
| Help with MLIR CUDA Stream Management for Multiple CUDNN Convolutions | 2 | 74 | May 13, 2025 |
| [libc][GSoC 2025] Direct I/O from the GPU with io_uring | 13 | 759 | May 11, 2025 |
| [RFC] Proposal for Offload Execution Test Suite | 17 | 484 | May 7, 2025 |
| Tablegen pattern that uses the same load twice | 0 | 40 | May 7, 2025 |
| How do I get the future index of a symbol in the AsmPrinter stage? | 4 | 102 | April 29, 2025 |
| [RFC] Adding opaque types to LLVM IR | 31 | 3598 | April 28, 2025 |
| How to lower the combination of async gpu ops in `gpu` Dialect | 18 | 816 | April 22, 2025 |
| About ParallelLoopMapper Pass Design issues | 0 | 38 | April 18, 2025 |
| Question about GPU Dialect Async Tokens in MLIR | 4 | 84 | April 15, 2025 |
| [RFC] MLIR types with encoding | 37 | 675 | April 14, 2025 |
| Run linalg.matmul on gpu | 2 | 390 | April 18, 2024 |
| [GSoC 2024] Offloading libcxx | 10 | 708 | April 6, 2025 |
| [libc][GSoC 2025] Profiling and testing of the LLVM libc GPU math | 17 | 723 | April 6, 2025 |
| Are there components in MLIR for analyzing GPU kernel dependencies and scheduling? | 0 | 46 | March 31, 2025 |
| Seeking Guidance on Executing MLIR Code with GPU Dialect on GPU | 2 | 98 | March 28, 2025 |
| OpenMP Offload Fortran Tests Pass with Flang-new | 6 | 296 | March 27, 2025 |
| [MLIR][GPU] Failure to Generate Vectorized PTX Instructions from MLIR vector.load/store During GPU Lowering | 2 | 67 | March 20, 2025 |
| RFC: SPIRV IR as a vendor agnostic GPU representation | 9 | 226 | March 17, 2025 |
| "An exception was thrown: Native API failed. Native API returns: 20 (UR_RESULT_ERROR_DEVICE_LOST)." | 0 | 41 | March 14, 2025 |
| Why Do Same-Named Kernels via gpu.launch_func Execute Concurrently but Different Kernels Execute Serially? | 1 | 52 | March 14, 2025 |
| How to handle host-side global data automatically when lowering to GPU with MLIR? | 2 | 81 | January 15, 2025 |
| How to Implement Asynchronous Concurrent Execution Between gpu.launch Operations? | 4 | 78 | March 11, 2025 |
| Support mgpuMemcpy runtime call in SyclRuntimeWrappers | 0 | 34 | March 6, 2025 |
| Memref to SPIRV Conversion | 2 | 81 | March 6, 2025 |
| How to better implement operation-level parallelism? | 10 | 208 | March 6, 2025 |