Mask-Soft Filter Pruning for Lightweight CNN Inference

Nam Joon Kim, Hyun Kim

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

5 Scopus citations

Abstract

Pruning is a network compression and acceleration technique that reduces the computation and memory footprint of Convolutional Neural Networks (CNNs), and many pruning techniques have been proposed. Weight pruning (i.e., unstructured pruning) can remove many parameters by eliminating redundant weights, but it requires special software or hardware support to actually accelerate neural networks in a GPU environment. Filter pruning, by contrast, removes entire filters and requires no special software or hardware support, so it enables actual acceleration of CNNs in a GPU environment. Inspired by soft filter pruning, a prior method that prunes filters in a soft manner, this paper proposes the Mask-Soft Filter Pruning (M-SFP) method. M-SFP masks the output feature maps instead of zeroing out the weights, which preserves the weight parameters of the pruned filters. Applying the proposed technique to ResNet on the CIFAR-10 and CIFAR-100 datasets achieves a reduction of more than 40% in BFLOPs with an acceptable accuracy drop of only 0.17%.
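
The masking idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the abstract does not say how filters are selected or when the mask is updated, so the L2-norm ranking below (borrowed from soft filter pruning), the function names `build_channel_mask` and `mask_feature_maps`, the tensor shapes, and the 25% pruning ratio are all assumptions made for illustration.

```python
import numpy as np


def build_channel_mask(weights, prune_ratio):
    """Return a 0/1 mask over output channels.

    weights has shape (out_channels, in_channels, kH, kW).
    Assumption: filters with the smallest L2 norm are the ones masked,
    as in soft filter pruning; the abstract does not state the criterion.
    """
    out_channels = weights.shape[0]
    norms = np.sqrt((weights.reshape(out_channels, -1) ** 2).sum(axis=1))
    num_pruned = int(out_channels * prune_ratio)
    mask = np.ones(out_channels, dtype=weights.dtype)
    mask[np.argsort(norms)[:num_pruned]] = 0
    return mask


def mask_feature_maps(feature_maps, mask):
    """Zero the selected channels of a (batch, channels, H, W) output.

    The filter weights themselves are never modified.
    """
    return feature_maps * mask[None, :, None, None]


# Hypothetical layer with 8 filters and a batch of 2 output feature maps.
rng = np.random.default_rng(0)
weights = rng.normal(size=(8, 3, 3, 3))
feature_maps = rng.normal(size=(2, 8, 4, 4))
weights_before = weights.copy()

mask = build_channel_mask(weights, prune_ratio=0.25)
masked = mask_feature_maps(feature_maps, mask)
```

Because only the output is masked, the weights of the selected filters keep their values and could keep being updated if the mask were recomputed later in training, which is the property the abstract describes as preserving weight parameters without zeroing them out.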

Original language: English
Title of host publication: Proceedings - International SoC Design Conference, ISOCC 2020
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 316-317
Number of pages: 2
ISBN (Electronic): 9781728183312
State: Published - 21 Oct 2020
Event: 17th International System-on-Chip Design Conference, ISOCC 2020 - Yeosu, Korea, Republic of
Duration: 21 Oct 2020 - 24 Oct 2020

Publication series

Name: Proceedings - International SoC Design Conference, ISOCC 2020

Conference

Conference: 17th International System-on-Chip Design Conference, ISOCC 2020
Country/Territory: Korea, Republic of
City: Yeosu
Period: 21/10/20 - 24/10/20

Keywords

  • Deep learning
  • image classification
  • network compression
  • pruning
