FB-SKP: Front-Back Structured Kernel Pruning via Group Convolution without Increasing Memory

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Recently, deep convolutional neural network (DCNN) models have achieved remarkable results in the field of computer vision; however, as the size of the network has gradually increased, the application of compression techniques has become essential for practical use. Among compression methods, pruning is a widely used technique, and weight pruning and filter pruning are mainly being studied. Weight pruning is a representative unstructured pruning technique with high compression rate; however, it has a limitation in that it is difficult to lead to real network acceleration. Filter pruning is a structured pruning technique with high acceleration performance because it directly removes parameters; however, it has the limitation of having a relatively low compression rate. Kernel pruning is an intermediate technique and has the advantages of both methods but has the problem of being difficult to implement as structured pruning. To solve this problem, this paper proposed Front-Back Structured Kernel Pruning (FB-SKP). FB-SKP can implement structured kernel pruning without increasing network memory through two convolution groups. Additionally, to avoid increasing the complexity of pruning implementation, the filter was divided into two parts, front and back, and one of the two parts was removed. As a result, 45.2% and 47.4% of parameters were removed from ResNet56 and ResNet110, respectively, without performance degradation on CIFAR-100.

Original languageEnglish
Title of host publicationProceedings - International SoC Design Conference 2024, ISOCC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages181-182
Number of pages2
ISBN (Electronic)9798350377088
DOIs
StatePublished - 2024
Event21st International System-on-Chip Design Conference, ISOCC 2024 - Sapporo, Japan
Duration: 19 Aug 202422 Aug 2024

Publication series

NameProceedings - International SoC Design Conference 2024, ISOCC 2024

Conference

Conference21st International System-on-Chip Design Conference, ISOCC 2024
Country/TerritoryJapan
CitySapporo
Period19/08/2422/08/24

Keywords

  • compression
  • convolutional neural network
  • deep learning
  • kernel pruning
  • structured pruning

Cite this