Multi-document summarization exploiting semantic analysis based on tag cluster

Jee Uk Heu, Jin Woo Jeong, Iqbal Qasim, Young Do Joo, Joon Myun Cho, Dong Ho Lee

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

Multi-document summarization techniques aim to reduce the documents into a small set of words or paragraphs that convey the main meaning of the original documents. Many approaches for multi-document summarization have used probability based methods and machine learning techniques to summarize multiple documents sharing a common topic at the same time. However, these techniques fail to semantically analyze proper nouns and newly-coined words because most of them depend on old-fashioned dictionary or thesaurus. To overcome these drawbacks, we propose a novel multi-document summarization technique which employs the tag cluster on Flickr, a kind of folksonomy systems, for detecting key sentences from multiple documents. We first create a word frequency table for analyzing the semantics and contribution of words by using HITS algorithm. Then, by exploiting tag clusters, we analyze the semantic relationship between words in the word frequency table. The experimental results on TAC 2008, 2009 data sets demonstrate the improvement of our proposed framework over existing summarization systems.

Original languageEnglish
Title of host publicationAdvances in Multimedia Modeling - 19th International Conference, MMM 2013, Proceedings
Pages479-489
Number of pages11
EditionPART 2
DOIs
StatePublished - 2013
Event19th International Conference on Advances in Multimedia Modeling, MMM 2013 - Huangshan, China
Duration: 7 Jan 20139 Jan 2013

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
NumberPART 2
Volume7733 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference19th International Conference on Advances in Multimedia Modeling, MMM 2013
Country/TerritoryChina
CityHuangshan
Period7/01/139/01/13

Keywords

  • Multi-document summarization
  • Semantic analysis
  • Tag cluster

Fingerprint

Dive into the research topics of 'Multi-document summarization exploiting semantic analysis based on tag cluster'. Together they form a unique fingerprint.

Cite this