TY - JOUR
T1 - A unified benchmark for the unknown detection capability of deep neural networks
AU - Kim, Jihyo
AU - Koo, Jiin
AU - Hwang, Sangheum
N1 - Publisher Copyright:
© 2023 Elsevier Ltd
PY - 2023/11/1
Y1 - 2023/11/1
N2 - Deep neural networks have achieved outstanding performance over various tasks, but they have a critical issue: over-confident predictions even for completely unknown samples. Many studies have been proposed to successfully filter out these unknown samples, but they only considered narrow and specific tasks, referred to as misclassification detection, open-set recognition, or out-of-distribution detection. In this work, we argue that these tasks should be treated as fundamentally an identical problem because an ideal model should possess detection capability for all those tasks. Therefore, we introduce the unknown detection task, an integration of previous individual tasks, for a rigorous examination of the detection capability of deep neural networks on a wide spectrum of unknown samples. To this end, unified benchmark datasets on different scales were constructed and the unknown detection capabilities of existing popular methods were subject to comparison. We found that Deep Ensemble consistently outperforms the other approaches in detecting unknowns; however, all methods are only successful for a specific type of unknown. The reproducible code and benchmark datasets are available at https://github.com/daintlab/unknown-detection-benchmarks.
AB - Deep neural networks have achieved outstanding performance over various tasks, but they have a critical issue: over-confident predictions even for completely unknown samples. Many studies have been proposed to successfully filter out these unknown samples, but they only considered narrow and specific tasks, referred to as misclassification detection, open-set recognition, or out-of-distribution detection. In this work, we argue that these tasks should be treated as fundamentally an identical problem because an ideal model should possess detection capability for all those tasks. Therefore, we introduce the unknown detection task, an integration of previous individual tasks, for a rigorous examination of the detection capability of deep neural networks on a wide spectrum of unknown samples. To this end, unified benchmark datasets on different scales were constructed and the unknown detection capabilities of existing popular methods were subject to comparison. We found that Deep Ensemble consistently outperforms the other approaches in detecting unknowns; however, all methods are only successful for a specific type of unknown. The reproducible code and benchmark datasets are available at https://github.com/daintlab/unknown-detection-benchmarks.
KW - Misclassification detection
KW - Open-set recognition
KW - Out-of-distribution detection
KW - Unknown detection
UR - http://www.scopus.com/inward/record.url?scp=85160574042&partnerID=8YFLogxK
U2 - 10.1016/j.eswa.2023.120461
DO - 10.1016/j.eswa.2023.120461
M3 - Article
AN - SCOPUS:85160574042
SN - 0957-4174
VL - 229
JO - Expert Systems with Applications
JF - Expert Systems with Applications
M1 - 120461
ER -