OL4TeX: Adaptive Online Learning for Text Classification under Distribution Shifts

Min Seon Kim, Ling Liu, Hyuk Yoon Kwon

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This study presents an adaptive online learning method for text classification under distribution shifts. We formulate a typical neural network-based text classification model as multiple logical modules. By leveraging the characteristics of the modules, we introduce three novel indicators to effectively measure the degree of dynamic distribution shifts without evaluating the model. To enhance online learning, we tactically trade off between learning efficiency and accuracy based on distribution shifts measured in real time. To the best of our knowledge, this is the first effort to adapt the model to the preference of learning efficiency or accuracy for online text classification. Extensive experiments on real-world streaming text datasets show that our method outperforms the best static update strategy and state-of-the-art online text classification models. Our code and data are available at https://github.com/bigbases/online-learning-text.

Original languageEnglish
Title of host publicationProceedings - 2024 IEEE International Conference on Big Data, BigData 2024
EditorsWei Ding, Chang-Tien Lu, Fusheng Wang, Liping Di, Kesheng Wu, Jun Huan, Raghu Nambiar, Jundong Li, Filip Ilievski, Ricardo Baeza-Yates, Xiaohua Hu
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1340-1345
Number of pages6
ISBN (Electronic)9798350362480
DOIs
StatePublished - 2024
Event2024 IEEE International Conference on Big Data, BigData 2024 - Washington, United States
Duration: 15 Dec 202418 Dec 2024

Publication series

NameProceedings - 2024 IEEE International Conference on Big Data, BigData 2024

Conference

Conference2024 IEEE International Conference on Big Data, BigData 2024
Country/TerritoryUnited States
CityWashington
Period15/12/2418/12/24

Keywords

  • Distribution shifts
  • Online learning
  • Streaming text classification

Fingerprint

Dive into the research topics of 'OL4TeX: Adaptive Online Learning for Text Classification under Distribution Shifts'. Together they form a unique fingerprint.

Cite this