Implementation of Tiled Point-Wise Convolution in MobileNet for Parallel Processing

Hyeon Seok Hong, Hyun Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Convolutional neural networks (CNNs) have demonstrated outstanding performance in computer vision tasks. However, their massive computation makes the utilization of CNNs difficult on edge and mobile devices. To address this, lightweight CNNs (e.g., MobileNet) and dedicated FPGA accelerator designs have gained attention. However, the point-wise convolution (PWC) in MobileNet, which accounts for a significant portion of the computations, has a critical impact on latency, making optimization crucial for inference acceleration. In this study, we present a tiled PWC capable of parallel processing by partitioning feature maps into tiles and optimize this operation efficiently. The proposed design enables parallel PWC processing without additional controller modifications. As a result, we implement the proposed design on the Xilinx ZCU102 board platform and observe 4.0 x and 3.1 x improvements in throughput and power efficiency, respectively.

Original languageEnglish
Title of host publication2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350371888
DOIs
StatePublished - 2024
Event2024 International Conference on Electronics, Information, and Communication, ICEIC 2024 - Taipei, Taiwan, Province of China
Duration: 28 Jan 202431 Jan 2024

Publication series

Name2024 International Conference on Electronics, Information, and Communication, ICEIC 2024

Conference

Conference2024 International Conference on Electronics, Information, and Communication, ICEIC 2024
Country/TerritoryTaiwan, Province of China
CityTaipei
Period28/01/2431/01/24

Keywords

  • CNN accelerator
  • FPGA
  • MobileNetV1

Fingerprint

Dive into the research topics of 'Implementation of Tiled Point-Wise Convolution in MobileNet for Parallel Processing'. Together they form a unique fingerprint.

Cite this