A study on evaluation measures for unsupervised outlier detection

Sunmin La, Nam Wook Cho

Research output: Contribution to journalArticlepeer-review

2 Scopus citations

Abstract

Outlier detection is a data analysis method based on data mining techniques and is used to identify outlying observations which might have significance in a dataset. Research on outlier detection, however, has mainly focused on supervised approaches, which require labeled training and test datasets. Unsupervised approaches are more appropriate for many applications such as network intrusion detection and fraud detection, but the suitability of these methods to determine the degree of outlierness of a dataset has not been fully addressed because the ground truth is usually unavailable. In this paper, evaluation measures for unsupervised outlier detection, which can effectively measure the outlierness of a dataset, are proposed. To verify the effectiveness of the proposed methods, experiments were conducted with University of California Irvine machine learning datasets using a k-nearest neighbors (k-NN) algorithm.

Original languageEnglish
Pages (from-to)515-520
Number of pages6
JournalICIC Express Letters
Volume14
Issue number5
DOIs
StatePublished - 2020

Keywords

  • External measure
  • Gini index
  • K-nearest neighbors (k-NN)
  • Outlierness
  • Unsupervised outlier detection

Fingerprint

Dive into the research topics of 'A study on evaluation measures for unsupervised outlier detection'. Together they form a unique fingerprint.

Cite this