Group Sparsity: The Hinge Between Filter Pruning and Decomposition for Network Compression (original) (raw)

View PDF

Abstract:In this paper, we analyze two popular network compression techniques, i.e. filter pruning and low-rank decomposition, in a unified sense. By simply changing the way the sparsity regularization is enforced, filter pruning and low-rank decomposition can be derived accordingly. This provides another flexible choice for network compression because the techniques complement each other. For example, in popular network architectures with shortcut connections (e.g. ResNet), filter pruning cannot deal with the last convolutional layer in a ResBlock while the low-rank decomposition methods can. In addition, we propose to compress the whole network jointly instead of in a layer-wise manner. Our approach proves its potential as it compares favorably to the state-of-the-art on several benchmarks.

Submission history

From: Yawei Li [view email]
[v1] Thu, 19 Mar 2020 17:57:26 UTC (3,028 KB)