Topics tagged gpu

| Topic | Replies | Views | Activity |
| --- | --- | --- | --- |
| Help with MLIR CUDA Stream Management for Multiple CUDNN Convolutions | 0 | 6 | May 10, 2025 |
| [RFC] Add GPU operations to permute data in 2 loaded mma_matrix | 4 | 107 | May 9, 2025 |
| [libc][GSoC 2025] Direct I/O from the GPU with io_uring | 11 | 683 | May 8, 2025 |
| [RFC] Proposal for Offload Execution Test Suite | 17 | 450 | May 7, 2025 |
| [RFC] LLVM policy for top level directories and language runtimes | 1 | 121 | May 7, 2025 |
| Tablegen pattern that uses the same load twice | 0 | 32 | May 7, 2025 |
| How do I get the future index of a symbol in the AsmPrinter stage? | 4 | 100 | April 29, 2025 |
| [RFC] Adding opaque types to LLVM IR | 31 | 3529 | April 28, 2025 |
| How to lower the combination of async gpu ops in `gpu` Dialect | 18 | 811 | April 22, 2025 |
| About ParallelLoopMapper Pass Design issues | 0 | 36 | April 18, 2025 |
| Question about GPU Dialect Async Tokens in MLIR | 4 | 79 | April 15, 2025 |
| [RFC] MLIR types with encoding | 37 | 669 | April 14, 2025 |
| Run linalg.matmul on gpu | 2 | 377 | April 18, 2024 |
| [GSoC 2024] Offloading libcxx | 10 | 699 | April 6, 2025 |
| [libc][GSoC 2025] Profiling and testing of the LLVM libc GPU math | 17 | 695 | April 6, 2025 |
| Are there components in MLIR for analyzing GPU kernel dependencies and scheduling? | 0 | 42 | March 31, 2025 |
| Seeking Guidance on Executing MLIR Code with GPU Dialect on GPU | 2 | 91 | March 28, 2025 |
| OpenMP Offload Fortran Tests Pass with Flang-new | 6 | 285 | March 27, 2025 |
| [MLIR][GPU] Failure to Generate Vectorized PTX Instructions from MLIR vector.load/store During GPU Lowering | 2 | 58 | March 20, 2025 |
| RFC: SPIRV IR as a vendor agnostic GPU representation | 9 | 211 | March 17, 2025 |
| "An exception was thrown: Native API failed. Native API returns: 20 (UR_RESULT_ERROR_DEVICE_LOST)." | 0 | 33 | March 14, 2025 |
| Why Do Same-Named Kernels via gpu.launch_func Execute Concurrently but Different Kernels Execute Serially? | 1 | 49 | March 14, 2025 |
| How to handle host-side global data automatically when lowering to GPU with MLIR? | 2 | 72 | January 15, 2025 |
| How to Implement Asynchronous Concurrent Execution Between gpu.launch Operations? | 4 | 75 | March 11, 2025 |
| Support mgpuMemcpy runtime call in SyclRuntimeWrappers | 0 | 33 | March 6, 2025 |
| Memref to SPIRV Conversion | 2 | 75 | March 6, 2025 |
| How to better implement operation-level parallelism? | 10 | 204 | March 6, 2025 |
| Low Parallelism in GPU Mapping for Nested Parallel Loops in MLIR | 3 | 118 | February 20, 2025 |
| LLVM optimizations during PGOs | 10 | 236 | February 19, 2025 |
| How to organize pass ordering to transform a 1-D affine.parallel into nested multi-dimensional SCF or affine parallel loops | 0 | 28 | February 18, 2025 |