Convolution engine (original) (raw)

CARLA: A Convolution Accelerator With a Reconfigurable and Low-Energy Architecture

Shervin Vakili

IEEE Transactions on Circuits and Systems I: Regular Papers

View PDFchevron_right

YaConv: Convolution with Low Cache Footprint

José Amaral

ACM Transactions on Architecture and Code Optimization

View PDFchevron_right

Reconfigurable Convolution Architecture for Heterogeneous Systems-on-Chip

Stefania Perri

2020

View PDFchevron_right

Deep Convolutional Neural Network Architecture With Reconfigurable Computation Patterns

karthika gidijala

IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2017

View PDFchevron_right

EcoFlow: Efficient Convolutional Dataflows for Low-Power Neural Network Accelerators

Michaela Blott

2022

View PDFchevron_right

Communication-Minimizing 2D Convolution in GPU Registers

Forrest Iandola

View PDFchevron_right

An FPGA-based Solution for Convolution Operation Acceleration

Khoa Anh

arXiv (Cornell University), 2022

View PDFchevron_right

14.5 Eyeriss: An energy-efficient reconfigurable accelerator for deep convolutional neural networks

Joel Emer

2016 IEEE International Solid-State Circuits Conference (ISSCC), 2016

View PDFchevron_right

Extensible Embedded Processor for Convolutional Neural Networks

shrikant jadhav

Sci. Program., 2021

View PDFchevron_right

Optimizing the Convolution Operation to Accelerate Deep Neural Networks on FPGA

Sarma Vrudhula

IEEE Transactions on Very Large Scale Integration (VLSI) Systems, 2018

View PDFchevron_right

Eyeriss: A Spatial Architecture for Energy-Efficient Dataflow for Convolutional Neural Networks

Joel Emer

IEEE Micro

View PDFchevron_right

An Energy-Efficient Accelerator Architecture with Serial Accumulation Dataflow for Deep CNNs

Shervin Vakili

2020 18th IEEE International New Circuits and Systems Conference (NEWCAS), 2020

View PDFchevron_right

A high-performance fully reconfigurable FPGA-based 2D convolution processor

Marco Lanuzza

View PDFchevron_right

p-im2col: Simple Yet Efficient Convolution Algorithm With Flexibly Controlled Memory Overhead

Vladimir V Arlazarov

View PDFchevron_right

Optimally Scheduling CNN Convolutions for Efficient Memory Access

Arthur Stoutchinin

ArXiv, 2019

View PDFchevron_right

Using Dataflow to Optimize Energy Efficiency of Deep Neural Network Accelerators

Joel Emer

IEEE Micro, 2017

View PDFchevron_right

Fast and Scalable 2D Convolutions and Cross-correlations for Processing Image Databases and Videos on CPUs

Marios S Pattichis

2020 IEEE Southwest Symposium on Image Analysis and Interpretation (SSIAI), 2020

View PDFchevron_right

On the AER convolution processors for FPGA

Manuel Rivas

ISCAS 2010 - 2010 IEEE International Symposium on Circuits and Systems: Nano-Bio Circuit Fabrics and Systems, 2010

View PDFchevron_right

The Design and Implementation of Convolution into FPGA.

Ijesrt Journal

International Journal of Engineering Sciences & Research Technology, 2014

View PDFchevron_right

DycSe: A Low-Power, Dynamic Reconfiguration Column Streaming-Based Convolution Engine for Resource-Aware Edge AI Accelerators

Weison Lin

Journal of Low Power Electronics and Applications

View PDFchevron_right

A Simple Method to Reduce Off-chip Memory Accesses on Convolutional Neural Networks

SangHyuck Ha

ArXiv, 2019

View PDFchevron_right

MulNet: A Flexible CNN Processor With Higher Resource Utilization Efficiency for Constrained Devices

Rafay Hasan

IEEE Access, 2019

View PDFchevron_right

Architecture Exploration for Energy-Efficient Embedded Vision Applications: From General Purpose Processor to Domain Specific Accelerator

Maria Malik

2016 IEEE Computer Society Annual Symposium on VLSI (ISVLSI), 2016

View PDFchevron_right

A Multithreaded CGRA for Convolutional Neural Network Processing

Masayuki Ikebe

Circuits and Systems, 2017

View PDFchevron_right

Comparing Energy Efficiency of CPU, GPU and FPGA Implementations for Vision Kernels

Murad Qasaimeh

View PDFchevron_right

Energy-Efficient Convolutional Neural Networks via Recurrent Data Reuse

Luca Mocerino

2019

View PDFchevron_right

A memory based architecture for real-time convolution with variable kernels

Vasily Moshnyaga

View PDFchevron_right

Design Exploration of Machine Learning Data-Flows onto Heterogeneous Reconfigurable Hardware

Michael Canesche

Anais do XXI Simpósio em Sistemas Computacionais de Alto Desempenho (WSCAD 2020), 2020

View PDFchevron_right

Cnp: An fpga-based processor for convolutional networks

Clement Farabet

… Programmable Logic and …, 2009

View PDFchevron_right

A streaming accelerator of Convolutional Neural Networks for resource-limited applications

JUAN PRADO OLIVAREZ

IEICE Electronics Express, 2019

View PDFchevron_right

High Performance and Portable Convolution Operators for Multicore Processors

Adrián Castelló

2020 IEEE 32nd International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), 2020

View PDFchevron_right

An energy-efficient memory-based high-throughput VLSI architecture for convolutional networks

Min-Sun Keel

2015

View PDFchevron_right