TY - JOUR
T1 - A Partially Labeled Anomaly Data Detection Approach Based on Prioritized Deep Reinforcement Learning for Consumer Electronics Security
AU - Qin, Shuqi
AU - Liu, Shenghao
AU - Ye, Shengjie
AU - Fan, Xiaoxuan
AU - Cheng, Minmin
AU - He, Yuanyuan
AU - Deng, Xianjun
AU - Hyuk Park, Jong
N1 - Publisher Copyright:
© 1975-2011 IEEE.
PY - 2024
Y1 - 2024
N2 - Anomalies within data flows in the Internet of Things environment can potentially result in security vulnerabilities in consumer electronics. Therefore, it is crucial to effectively detect anomaly data to safeguard the reliability and continuous functionality of consumer electronics. Existing related works either learn from unlabeled data using unsupervised methods or leverage the limited labeled data to improve detection performance by semi-supervised methods. However, these methods usually overfit specific types of known anomalies or ignore the uncertainty when model training. To this end, we design a novel approach to jointly optimize the end-to-end detection of labeled and unlabeled anomalies. Specifically, the anomaly data detection problem investigated is first reformulated as a Markov decision process. Then, a partially labeled anomaly data detection approach (PANDA) based on prioritized deep deterministic policy gradient is proposed, which considers uncertainty when the agent makes decisions and can learn from the labeled known anomalies while continuously exploring and detecting prospective anomalies in unlabeled data. Extensive experiments on 13 datasets show that PANDA improves the AUC-ROC and AUC-PR by 3.0%-10.3% and 10.0%-73.5% and its robustness under the impact of anomaly contamination rates compared with four state-of-the-art competing methods.
AB - Anomalies within data flows in the Internet of Things environment can potentially result in security vulnerabilities in consumer electronics. Therefore, it is crucial to effectively detect anomaly data to safeguard the reliability and continuous functionality of consumer electronics. Existing related works either learn from unlabeled data using unsupervised methods or leverage the limited labeled data to improve detection performance by semi-supervised methods. However, these methods usually overfit specific types of known anomalies or ignore the uncertainty when model training. To this end, we design a novel approach to jointly optimize the end-to-end detection of labeled and unlabeled anomalies. Specifically, the anomaly data detection problem investigated is first reformulated as a Markov decision process. Then, a partially labeled anomaly data detection approach (PANDA) based on prioritized deep deterministic policy gradient is proposed, which considers uncertainty when the agent makes decisions and can learn from the labeled known anomalies while continuously exploring and detecting prospective anomalies in unlabeled data. Extensive experiments on 13 datasets show that PANDA improves the AUC-ROC and AUC-PR by 3.0%-10.3% and 10.0%-73.5% and its robustness under the impact of anomaly contamination rates compared with four state-of-the-art competing methods.
KW - Consumer electronics security
KW - anomaly data detection
KW - deep reinforcement learning
UR - http://www.scopus.com/inward/record.url?scp=85201779274&partnerID=8YFLogxK
U2 - 10.1109/TCE.2024.3445629
DO - 10.1109/TCE.2024.3445629
M3 - Article
AN - SCOPUS:85201779274
SN - 0098-3063
VL - 70
SP - 6452
EP - 6462
JO - IEEE Transactions on Consumer Electronics
JF - IEEE Transactions on Consumer Electronics
IS - 4
ER -