TY - JOUR
T1 - Improving generalization performance of electrocardiogram classification models
AU - Han, Hyeongrok
AU - Park, Seongjae
AU - Min, Seonwoo
AU - Kim, Eunji
AU - Kim, Hyun Gi
AU - Park, Sangha
AU - Kim, Jin Kook
AU - Park, Junsang
AU - An, Junho
AU - Lee, Kwanglo
AU - Jeong, Wonsun
AU - Chon, Sangil
AU - Ha, Kwon Woo
AU - Han, Myungkyu
AU - Choi, Hyun Soo
AU - Yoon, Sungroh
N1 - Publisher Copyright:
© 2023 Institute of Physics and Engineering in Medicine
PY - 2023/5/1
Y1 - 2023/5/1
N2 - Objective. Recently, many electrocardiogram (ECG) classification algorithms using deep learning have been proposed. Because the ECG characteristics vary across datasets owing to variations in factors such as recorded hospitals and the race of participants, the model needs to have a consistently high generalization performance across datasets. In this study, as part of the PhysioNet/Computing in Cardiology Challenge (PhysioNet Challenge) 2021, we present a model to classify cardiac abnormalities from the 12- and the reduced-lead ECGs. Approach. To improve the generalization performance of our earlier proposed model, we adopted a practical suite of techniques, i.e. constant-weighted cross-entropy loss, additional features, mixup augmentation, squeeze/excitation block, and OneCycle learning rate scheduler. We evaluated its generalization performance using the leave-one-dataset-out cross-validation setting. Furthermore, we demonstrate that the knowledge distillation from the 12-lead and large-teacher models improved the performance of the reduced-lead and small-student models. Main results. With the proposed model, our DSAIL SNU team has received Challenge scores of 0.55, 0.58, 0.58, 0.57, and 0.57 (ranked 2nd, 1st, 1st, 2nd, and 2nd of 39 teams) for the 12-, 6-, 4-, 3-, and 2-lead versions of the hidden test set, respectively. Significance. The proposed model achieved a higher generalization performance over six different hidden test datasets than the one we submitted to the PhysioNet Challenge 2020.
AB - Objective. Recently, many electrocardiogram (ECG) classification algorithms using deep learning have been proposed. Because the ECG characteristics vary across datasets owing to variations in factors such as recorded hospitals and the race of participants, the model needs to have a consistently high generalization performance across datasets. In this study, as part of the PhysioNet/Computing in Cardiology Challenge (PhysioNet Challenge) 2021, we present a model to classify cardiac abnormalities from the 12- and the reduced-lead ECGs. Approach. To improve the generalization performance of our earlier proposed model, we adopted a practical suite of techniques, i.e. constant-weighted cross-entropy loss, additional features, mixup augmentation, squeeze/excitation block, and OneCycle learning rate scheduler. We evaluated its generalization performance using the leave-one-dataset-out cross-validation setting. Furthermore, we demonstrate that the knowledge distillation from the 12-lead and large-teacher models improved the performance of the reduced-lead and small-student models. Main results. With the proposed model, our DSAIL SNU team has received Challenge scores of 0.55, 0.58, 0.58, 0.57, and 0.57 (ranked 2nd, 1st, 1st, 2nd, and 2nd of 39 teams) for the 12-, 6-, 4-, 3-, and 2-lead versions of the hidden test set, respectively. Significance. The proposed model achieved a higher generalization performance over six different hidden test datasets than the one we submitted to the PhysioNet Challenge 2020.
KW - artificial intelligence
KW - biomedical engineering
KW - cardiovascular disease
KW - deep learning
KW - ECG
KW - knowledge distillation
UR - http://www.scopus.com/inward/record.url?scp=85158906098&partnerID=8YFLogxK
U2 - 10.1088/1361-6579/acb30f
DO - 10.1088/1361-6579/acb30f
M3 - Article
C2 - 36638544
AN - SCOPUS:85158906098
SN - 0967-3334
VL - 44
JO - Physiological Measurement
JF - Physiological Measurement
IS - 5
M1 - 054003
ER -