Enabling predictable parallelism in single-GPU systems with persistent CUDA threads
2023, arXiv (Cornell University)