SFU-Driven Transparent Approximation Acceleration on GPUs