MixLoss: Table Structure Recognition in Industrial Documents via Collaborative Learning

Gunoh Jung, Qing Tang, Hongdon Lee, Hail Jung

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Table Structure Recognition (TSR) is a fundamental challenge in document analysis, especially for industrial documents where tables often contain complex layouts and noisy formatting. Although recent image-to-markup approaches have made progress using end-to-end learning, they commonly suffer from misalignment between predicted cell bounding boxes and actual text regions, leading to structural parsing errors. To address this issue, we propose MixLoss, a collaborative learning framework designed to improve TSR performance by enforcing position-wise consistency between HTML structure prediction and bounding box detection. MixLoss combines HTML tokens with bounding box coordinates in a unified sequence, inserting coordinate tokens directly after filled cell tokens. This design ensures structural alignment while maintaining computational efficiency. Extensive experiments show that MixLoss delivers significant improvements on real-world industrial datasets, including a 2.2% gain on IX DocBench, while maintaining strong performance on standard benchmarks. These results demonstrate the effectiveness of collaborative learning in enhancing table structure recognition for practical industrial document parsing.

Original languageEnglish
Title of host publicationProceeding - 17th International Conference on Human System Interaction, HSI 2025
PublisherIEEE Computer Society
ISBN (Electronic)9798331538583
DOIs
StatePublished - 2025
Event17th International Conference on Human System Interaction, HSI 2025 - Ulsan, Korea, Republic of
Duration: 16 Jul 202519 Jul 2025

Publication series

NameInternational Conference on Human System Interaction, HSI
ISSN (Print)2158-2246
ISSN (Electronic)2158-2254

Conference

Conference17th International Conference on Human System Interaction, HSI 2025
Country/TerritoryKorea, Republic of
CityUlsan
Period16/07/2519/07/25

Keywords

  • Deep Learning
  • Industrial Document Analysis
  • Table Structure Recognition

Fingerprint

Dive into the research topics of 'MixLoss: Table Structure Recognition in Industrial Documents via Collaborative Learning'. Together they form a unique fingerprint.

Cite this