TY - JOUR
T1 - Constrained Adaptive Distillation Based on Topological Persistence for Wearable Sensor Data
AU - Jeon, Eun Som
AU - Choi, Hongjun
AU - Shukla, Ankita
AU - Wang, Yuan
AU - Buman, Matthew P.
AU - Turaga, Pavan
N1 - Publisher Copyright:
© 1963-2012 IEEE.
PY - 2023
Y1 - 2023
N2 - Wearable sensor data analysis with persistence features generated by topological data analysis (TDA) has achieved great success in various applications, and however, it suffers from large computational and time resources for extracting topological features. In this article, our approach utilizes knowledge distillation (KD) that involves the use of multiple teacher networks trained with the raw time series and persistence images (PIs) generated by TDA. However, direct transfer of knowledge from the teacher models utilizing different characteristics as inputs to the student model results in a knowledge gap and limited performance. To address this problem, we introduce a robust framework that integrates multimodal features from two different teachers and enables a student to learn desirable knowledge effectively. To account for statistical differences in multimodalities, an entropy-based constrained adaptive weighting mechanism is leveraged to automatically balance the effects of teachers and encourage the student model to adequately adopt the knowledge from two teachers. To assimilate dissimilar structural information generated by different style models for distillation, batch and channel similarities within a mini-batch are used. We demonstrate the effectiveness of the proposed method on wearable sensor data.
AB - Wearable sensor data analysis with persistence features generated by topological data analysis (TDA) has achieved great success in various applications, and however, it suffers from large computational and time resources for extracting topological features. In this article, our approach utilizes knowledge distillation (KD) that involves the use of multiple teacher networks trained with the raw time series and persistence images (PIs) generated by TDA. However, direct transfer of knowledge from the teacher models utilizing different characteristics as inputs to the student model results in a knowledge gap and limited performance. To address this problem, we introduce a robust framework that integrates multimodal features from two different teachers and enables a student to learn desirable knowledge effectively. To account for statistical differences in multimodalities, an entropy-based constrained adaptive weighting mechanism is leveraged to automatically balance the effects of teachers and encourage the student model to adequately adopt the knowledge from two teachers. To assimilate dissimilar structural information generated by different style models for distillation, batch and channel similarities within a mini-batch are used. We demonstrate the effectiveness of the proposed method on wearable sensor data.
KW - Knowledge distillation (KD)
KW - topological data analysis (TDA)
KW - wearable sensor data
UR - https://www.scopus.com/pages/publications/85177196189
U2 - 10.1109/TIM.2023.3329818
DO - 10.1109/TIM.2023.3329818
M3 - Article
AN - SCOPUS:85177196189
SN - 0018-9456
VL - 72
SP - 1
EP - 14
JO - IEEE Transactions on Instrumentation and Measurement
JF - IEEE Transactions on Instrumentation and Measurement
M1 - 2532014
ER -