TY - JOUR
T1 - Advanced pseudo-labeling approach in mixing-based text data augmentation method
AU - Park, Jungmin
AU - Lee, Younghoon
N1 - Publisher Copyright:
© The Author(s), under exclusive licence to Springer-Verlag London Ltd., part of Springer Nature 2024.
PY - 2024/12
Y1 - 2024/12
N2 - Text augmentation methods facilitate an increase in the amount of training data, without having to collect new training data, by generating transformed versions of real datasets. Among such methods, mixing-based approaches, which swap words between two or more sentences, are widely applied owing to their simplicity and noteworthy performance. However, existing mixing-based approaches do not consider the importance of manipulated words during the pseudo-labeling process because they utilize a naive linear interpolation method. Thus, this paper proposes an advanced mixing-based text augmentation approach based on artificial intelligence methods that explicitly reflect the importance of manipulated words in the pseudo-labeling process. In addition, to avoid overdependence on the pseudo-labeling quality in the training process, the difference between the original label and prediction is also reflected in the loss function. Experimental results indicate that the performance of the proposed method is significantly higher than that of existing approaches.
AB - Text augmentation methods facilitate an increase in the amount of training data, without having to collect new training data, by generating transformed versions of real datasets. Among such methods, mixing-based approaches, which swap words between two or more sentences, are widely applied owing to their simplicity and noteworthy performance. However, existing mixing-based approaches do not consider the importance of manipulated words during the pseudo-labeling process because they utilize a naive linear interpolation method. Thus, this paper proposes an advanced mixing-based text augmentation approach based on artificial intelligence methods that explicitly reflect the importance of manipulated words in the pseudo-labeling process. In addition, to avoid overdependence on the pseudo-labeling quality in the training process, the difference between the original label and prediction is also reflected in the loss function. Experimental results indicate that the performance of the proposed method is significantly higher than that of existing approaches.
KW - Explainable artificial intelligence
KW - Mix-up approach
KW - Over-fitting prevention
KW - Text augmentation
KW - Word-explainability
UR - http://www.scopus.com/inward/record.url?scp=85205924771&partnerID=8YFLogxK
U2 - 10.1007/s10044-024-01340-6
DO - 10.1007/s10044-024-01340-6
M3 - Article
AN - SCOPUS:85205924771
SN - 1433-7541
VL - 27
JO - Pattern Analysis and Applications
JF - Pattern Analysis and Applications
IS - 4
M1 - 129
ER -