TY - JOUR
T1 - Concept map construction from text documents using affinity propagation
AU - Qasim, Iqbal
AU - Jeong, Jin Woo
AU - Heu, Jee Uk
AU - Lee, Dong Ho
PY - 2013/12
Y1 - 2013/12
N2 - Concept maps are playing an increasingly important role in various computing fields. In particular, they have been popularly used for organizing and representing knowledge. However, constructing concept maps manually is a complex and time-consuming task. Therefore, the creation of concept maps automatically or semi-automatically from text documents is a worthwhile research challenge. Recently, various approaches for automatic or semi-automatic construction of concept maps have been proposed. However, these approaches suffer from several limitations. First, only the noun phrases in text documents are included without resolution of the anaphora problems for pronouns. This omission causes important propositions available in the text documents to be missed, resulting in decreased recall. Second, although some approaches label the relationship to form propositions, they do not show the direction of the relationship between the subject and object in the form of Subject-Relationship- Object, leading to ambiguous propositions. In this paper, we present a cluster-based approach to semi-automatically construct concept maps from text documents. First, we extract the candidate terms from documents using typed dependency linguistic rules. Anaphoric resolution for pronouns is introduced to map the pronouns with candidate terms. Second, the similarities are calculated between the pairs of extracted candidate terms of a document and clusters are made through affinity propagation by providing the calculated similarities between the candidate terms. Finally, the extracted relationships are assigned between the candidate terms in each cluster. Our empirical results show that the semi-automatically constructed concept maps conform to the outputs generated manually by domain experts, since the degree of difference between them is proportionally small based on a Likert scale. Furthermore, domain experts verified that the constructed concept maps are in accordance with their knowledge of the information system domain.
AB - Concept maps are playing an increasingly important role in various computing fields. In particular, they have been popularly used for organizing and representing knowledge. However, constructing concept maps manually is a complex and time-consuming task. Therefore, the creation of concept maps automatically or semi-automatically from text documents is a worthwhile research challenge. Recently, various approaches for automatic or semi-automatic construction of concept maps have been proposed. However, these approaches suffer from several limitations. First, only the noun phrases in text documents are included without resolution of the anaphora problems for pronouns. This omission causes important propositions available in the text documents to be missed, resulting in decreased recall. Second, although some approaches label the relationship to form propositions, they do not show the direction of the relationship between the subject and object in the form of Subject-Relationship- Object, leading to ambiguous propositions. In this paper, we present a cluster-based approach to semi-automatically construct concept maps from text documents. First, we extract the candidate terms from documents using typed dependency linguistic rules. Anaphoric resolution for pronouns is introduced to map the pronouns with candidate terms. Second, the similarities are calculated between the pairs of extracted candidate terms of a document and clusters are made through affinity propagation by providing the calculated similarities between the candidate terms. Finally, the extracted relationships are assigned between the candidate terms in each cluster. Our empirical results show that the semi-automatically constructed concept maps conform to the outputs generated manually by domain experts, since the degree of difference between them is proportionally small based on a Likert scale. Furthermore, domain experts verified that the constructed concept maps are in accordance with their knowledge of the information system domain.
KW - affinity propagation
KW - concept map
KW - concept map learning
KW - knowledge acquisition
KW - text clustering
UR - https://www.scopus.com/pages/publications/84887583194
U2 - 10.1177/0165551513494645
DO - 10.1177/0165551513494645
M3 - Article
AN - SCOPUS:84887583194
SN - 0165-5515
VL - 39
SP - 719
EP - 736
JO - Journal of Information Science
JF - Journal of Information Science
IS - 6
ER -