Explainability-Based Mix-Up Approach for Text Data Augmentation

Soonki Kwon, Younghoon Lee

Research output: Contribution to journalArticlepeer-review

18 Scopus citations

Abstract

Text augmentation is a strategy for increasing the diversity of training examples without explicitly collecting new data. Owing to the efficiency and effectiveness of text augmentation, numerous augmentation methodologies have been proposed. Among them, the method based on modification, particularly the mix-up method of swapping words between two or more sentences, is widely used because it can be applied simply and shows good levels of performance. However, the existing mix-up approaches are limited; they do not reflect the importance of the manipulated word. That is, even if a word that has a critical effect on the classification result is manipulated, it is not considered significant in labeling the augmented data. Therefore, in this study, we propose an effective text augmentation technique that explicitly derives the importance of manipulated words and reflects this importance in the labeling of augmented data. The importance of each word, in other words, explainability, is calculated, and this is explicitly reflected in the labeling process of the augmented data. The results of the experiment confirmed that when the importance of the manipulated word was reflected in the labeling, the performance was significantly higher than that of the existing methods.

Original languageEnglish
Article number13
JournalACM Transactions on Knowledge Discovery from Data
Volume17
Issue number1
DOIs
StatePublished - 20 Feb 2023

Keywords

  • Text augmentation
  • XAI
  • mix-up approach
  • soft-labeling
  • word-explainability

Fingerprint

Dive into the research topics of 'Explainability-Based Mix-Up Approach for Text Data Augmentation'. Together they form a unique fingerprint.

Cite this