혼합변수를 갖는 데이터의 분류를 위한 하이브리드 분류기

Translated title of the contribution: Hybrid Classifiers of Classification Techniques for Mixed Data

Jung Sik Hong

Research output: Contribution to journalArticlepeer-review

Abstract

Mixed data with numeric variables and categorical variables are appearing in many areas such as credit scoring, medical diagnosis and manufacturing products. Most Classification techniques are suitable for each variable type of mixed data. For example, techniques using Euclidean distance are suitable for numerical variables and another techniques using symbolic logic are suitable for categorical variables. In this paper, we propose a hybrid method of classifiers to improve performance of the classification algorithm. Main idea is to deal with the categorical and numerical attributes separately with appropriate techniques. First, a whole data is partitioned into several subsets by applying decision tree only to categorical variables. Next, posterior probability is obtained by applying either k-NN or SVM to numerical variables in each leaf node of decision tree. Six data (Australian credit, German credit, Japan credit, Mammographic mass, churn, bank) of the UCI Machine Learning Repository are used to evaluate performance of the proposed hybrid classifier. Performance of a hybrid k-NN classifier is improved comparing with the k-NN. Performance of a hybrid SVM is slightly better than that of SVM.
Translated title of the contributionHybrid Classifiers of Classification Techniques for Mixed Data
Original languageKorean
Pages (from-to)341-349
Number of pages9
Journal대한산업공학회지
Volume43
Issue number5
DOIs
StatePublished - Oct 2017

Fingerprint

Dive into the research topics of 'Hybrid Classifiers of Classification Techniques for Mixed Data'. Together they form a unique fingerprint.

Cite this