GNNIE: GNN Inference Engine with Load-balancing and Graph-Specific Caching (original) (raw)
Related papers
AWB-GCN: A Graph Convolutional Network Accelerator with Runtime Workload Rebalancing
2020 53rd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)
LW-GCN: A Lightweight FPGA-based Graph Convolutional Network Accelerator
ACM Transactions on Reconfigurable Technology and Systems
GenGNN: A Generic FPGA Framework for Graph Neural Network Acceleration
Rishov Sarkar, Stefan Abi-Karam
2022
Computing Graph Neural Networks: A Survey from Algorithms to Accelerators
ACM Computing Surveys
G3: When Graph Neural Networks Meet Parallel Graph Processing Systems on GPUs
2020
2022
2022
MICRO-54: 54th Annual IEEE/ACM International Symposium on Microarchitecture, 2021
Deep Graph Library Optimizations for Intel(R) x86 Architecture
2020
COIN: Communication-Aware In-Memory Acceleration for Graph Convolutional Networks
IEEE Journal on Emerging and Selected Topics in Circuits and Systems
GSplit: Scaling Graph Neural Network Training on Large Graphs via Split-Parallelism
arXiv (Cornell University), 2023
Accelerating DNN Inference with GraphBLAS and the GPU
2019 IEEE High Performance Extreme Computing Conference (HPEC), 2019
Efficient Inference on GPUs for the Sparse Deep Neural Network Graph Challenge 2020
ArXiv, 2020
Analyzing the Performance of Graph Neural Networks with Pipe Parallelism
ArXiv, 2020
GDLL: A Scalable and Share Nothing Architecture based Distributed Graph Neural Networks Framework
IEEE Access
Hyperscale FPGA-as-a-service architecture for large-scale distributed graph neural network
Proceedings of the 49th Annual International Symposium on Computer Architecture
DistGNN: Scalable Distributed Training for Large-Scale Graph Neural Networks
2021
Enabling massive deep neural networks with the GraphBLAS
2017 IEEE High Performance Extreme Computing Conference (HPEC)
BlockGNN: Towards Efficient GNN Acceleration Using Block-Circulant Weight Matrices
2021 58th ACM/IEEE Design Automation Conference (DAC), 2021
Energy Efficient Architecture for Graph Analytics Accelerators
2016 ACM/IEEE 43rd Annual International Symposium on Computer Architecture (ISCA), 2016
Scalable Graph Convolutional Network Training on Distributed-Memory Systems
Proceedings of the VLDB Endowment
Large Graph Convolutional Network Training with GPU-Oriented Data Communication Architecture
2021
Proceedings of the 28th ACM International Conference on Architectural Support for Programming Languages and Operating Systems, Volume 2
DRAGON: Dynamic Recurrent Accelerator for Graph Online Convolution
ACM Transactions on Design Automation of Electronic Systems
First-Generation Inference Accelerator Deployment at Facebook
ArXiv, 2021
TF-GNN: Graph Neural Networks in TensorFlow
arXiv (Cornell University), 2022
Graphicionado: A high-performance and energy-efficient accelerator for graph analytics
2016 49th Annual IEEE/ACM International Symposium on Microarchitecture (MICRO)
PolyGraph: Exposing the Value of Flexibility for Graph Processing Accelerators
2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA), 2021
P3: Distributed Deep Graph Learning at Scale
2021
Intel nGraph: An Intermediate Representation, Compiler, and Executor for Deep Learning
2018
Understanding and bridging the gaps in current GNN performance optimizations
Proceedings of the 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 2021
TaxoNN: A Light-Weight Accelerator for Deep Neural Network Training
2020 IEEE International Symposium on Circuits and Systems (ISCAS), 2020
MG-GCN: A Scalable multi-GPU GCN Training Framework
Proceedings of the 51st International Conference on Parallel Processing