TETRIS: On-Device Trainable Energy-Efficient FPGA Accelerator for Trustworthy and Real-Time Instance Segmentation

  • Seungil Lee
  • , Juntae Park
  • , Kwanghyun Koo
  • , Gilha Lee
  • , Sangbeom Jeong
  • , Junoh Park
  • , Hyun Kim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Instance segmentation plays a critical role in high-precision computer vision applications, such as autonomous driving and medical image analysis. As demands for both model accuracy and privacy-preserving solutions continue to rise, on-device training is gaining traction for enabling secure, adaptive learning directly on edge devices. However, the intensive computation and complex dataflows inherent to instance segmentation models pose a major barrier to achieving real-time training in resource-constrained environments. While prior research on on-device training has focused largely on image classification, efficient hardware acceleration for instance segmentation training remains largely unexplored. In this work, we present TETRIS, the first FPGA-based hardware accelerator specifically tailored for training instance segmentation models. TETRIS introduces structural and quantization-aware optimizations, including channel-aware smoothing quantization and distribution-aware tie quantization, to significantly reduce both computational and memory overhead. Moreover, TETRIS adopts a heterogeneous hardware architecture, incorporating a reconfigurable convolution processing unit (RC-PU) that supports variable kernel sizes and a fully pipelined auxiliary processing unit to handle specialized operations efficiently. Experimental evaluation on YOLACT-based instance segmentation tasks demonstrates that TETRIS achieves a peak performance of 805.9 GOPS and an energy efficiency of 69.5 GOPS/W, confirming its ability to support real-time training even in resource-constrained edge environments.

Original languageEnglish
Title of host publication2025 IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2025 - Conference Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331515607
DOIs
StatePublished - 2025
Event44th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2025 - Munich, Germany
Duration: 26 Oct 202530 Oct 2025

Publication series

NameIEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD
ISSN (Print)1092-3152

Conference

Conference44th IEEE/ACM International Conference on Computer-Aided Design, ICCAD 2025
Country/TerritoryGermany
CityMunich
Period26/10/2530/10/25

UN SDGs

This output contributes to the following UN Sustainable Development Goals (SDGs)

  1. SDG 7 - Affordable and Clean Energy
    SDG 7 Affordable and Clean Energy

Keywords

  • Convolutional neural network
  • FPGA
  • Instance segmentation
  • Quantization
  • Training accelerator

Fingerprint

Dive into the research topics of 'TETRIS: On-Device Trainable Energy-Efficient FPGA Accelerator for Trustworthy and Real-Time Instance Segmentation'. Together they form a unique fingerprint.

Cite this