Product failure prediction with missing data

Seokho Kang, Eunji Kim, Jaewoong Shim, Wonsang Chang, Sungzoon Cho

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

In production data, missing values commonly appear for several reasons including changes in measurement and inspection items, sampling inspections, and unexpected process events. When applied to product failure prediction, the incompleteness of data should be properly addressed to avoid performance degradation in prediction models. Well-known approaches for missing data treatment, such as elimination and imputation, would not perform well under usual scenarios in production data, including high missing rate, systematic missing and class imbalance. To address these limitations, here we present a method for predictive modelling with missing data by considering the characteristics of production data. It builds multiple prediction models on different complete data subsets derived from the original data-set, each of which has different coverage of instances and input variables. These models are selectively used to make predictions for new instances with missing values. We demonstrate the effectiveness of the proposed method through a case study using actual data-sets from a home appliance manufacturer.

Original languageEnglish
Pages (from-to)4849-4859
Number of pages11
JournalInternational Journal of Production Research
Volume56
Issue number14
DOIs
StatePublished - 18 Jul 2018

Keywords

  • data mining
  • failure prediction
  • missing value
  • predictive modelling
  • production data

Fingerprint

Dive into the research topics of 'Product failure prediction with missing data'. Together they form a unique fingerprint.

Cite this