Sentiment visualization and classification via semi-supervised nonlinear dimensionality reduction

Kyoungok Kim, Jaewook Lee

Research output: Contribution to journal › Article › peer-review

61 Scopus citations

Abstract

Sentiment analysis, which detects the subjectivity or polarity of documents, is one of the fundamental tasks in text data analytics. The number of documents available online and offline has recently increased dramatically, and preprocessed text data now contain more features. This growth makes the data harder to analyze effectively. This paper proposes a novel semi-supervised Laplacian eigenmap (SS-LE). SS-LE removes redundant features effectively by decreasing sentiment detection errors. Moreover, it enables visualization of documents in a perceptible low-dimensional embedded space, providing a useful tool for text analytics. The proposed method is evaluated on a multi-domain review data set for sentiment visualization and classification, in comparison with other dimensionality reduction methods. SS-LE provides a better similarity measure in the visualization results by properly separating positive and negative documents. Sentiment classification models trained on data reduced by SS-LE show higher accuracy. Overall, the experimental results suggest that SS-LE has the potential to be used to visualize documents for ease of analysis and to train predictive models in sentiment analysis. SS-LE can also be applied to any other partially annotated text data set.
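The abstract does not describe how SS-LE constructs its graph, so the following is only a minimal sketch of one generic way to make a Laplacian eigenmap semi-supervised, not the authors' formulation. It assumes cosine similarity between document vectors, a k-nearest-neighbor graph, and a simple label rule: similarities between labeled documents of the same class are scaled up and those between labeled documents of different classes are scaled down. The function name and the `boost` and `damp` parameters are hypothetical, and unlabeled documents are marked with the label -1.

```python
import numpy as np


def ss_laplacian_eigenmap_sketch(X, labels, n_components=2, n_neighbors=5,
                                 boost=2.0, damp=0.1):
    """Illustrative label-informed Laplacian eigenmap (not the paper's SS-LE).

    X      : (n, d) array of document feature vectors.
    labels : (n,) integer array; -1 marks an unlabeled document.
    """
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    n = X.shape[0]

    # Cosine similarity between documents, clipped to be non-negative.
    norms = np.linalg.norm(X, axis=1, keepdims=True)
    Xn = X / np.maximum(norms, 1e-12)
    S = np.clip(Xn @ Xn.T, 0.0, None)

    # Assumed label rule: strengthen same-class pairs, weaken
    # different-class pairs, leave pairs with an unlabeled member alone.
    labeled = labels >= 0
    both = labeled[:, None] & labeled[None, :]
    same = both & (labels[:, None] == labels[None, :])
    S = np.where(same, S * boost, S)
    S = np.where(both & ~same, S * damp, S)
    np.fill_diagonal(S, 0.0)

    # Symmetric k-nearest-neighbor graph over the adjusted similarities.
    nearest = np.argsort(-S, axis=1)[:, :n_neighbors]
    rows = np.repeat(np.arange(n), n_neighbors)
    W = np.zeros_like(S)
    W[rows, nearest.ravel()] = S[rows, nearest.ravel()]
    W = np.maximum(W, W.T)

    # Solve L y = lambda D y through the symmetric normalized Laplacian.
    d = np.maximum(W.sum(axis=1), 1e-12)
    d_inv_sqrt = 1.0 / np.sqrt(d)
    L_sym = np.eye(n) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]
    _, eigvecs = np.linalg.eigh(L_sym)  # eigenvalues in ascending order

    # Skip the first (trivial) eigenvector and map back with D^{-1/2}.
    return eigvecs[:, 1:n_components + 1] * d_inv_sqrt[:, None]


# Toy usage: six "documents" over four terms, two of them unlabeled.
X_demo = np.array([[3, 1, 0, 0], [2, 2, 0, 1], [4, 0, 1, 0],
                   [0, 0, 3, 2], [1, 0, 2, 3], [0, 1, 4, 1]], dtype=float)
labels_demo = np.array([1, -1, 1, 0, 0, -1])
Y_demo = ss_laplacian_eigenmap_sketch(X_demo, labels_demo, n_neighbors=2)
```

Each row of `Y_demo` is a two-dimensional coordinate that could be plotted or passed to a classifier. If the neighbor graph is disconnected, more than one eigenvector is trivial, so a real implementation would need to handle that case.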

Original language: English
Pages (from-to): 758-768
Number of pages: 11
Journal: Pattern Recognition
Volume: 47
Issue number: 2
DOIs
State: Published - Feb 2014

Keywords

  • Laplacian eigenmaps
  • Semi-supervised dimensionality reduction
  • Sentiment classification
  • Text visualization
