TY - JOUR
T1 - Using Multi-Modal Semantic Association Rules to fuse keywords and visual features automatically for Web image retrieval
AU - He, Ruhan
AU - Xiong, Naixue
AU - Yang, Laurence T.
AU - Park, Jong Hyuk
PY - 2011/7
Y1 - 2011/7
N2 - A recent trend for image search is to fuse the two basic modalities of Web images, i.e., textual features (usually represented by keywords) and visual features for retrieval. The key issue is how to associate the two modalities for fusion. In this paper, a new approach based on Multi-Modal Semantic Association Rule (MMSAR) is proposed to fuse keywords and visual features automatically for Web image retrieval. A MMSAR contains a single keyword and several visual feature clusters, which crosses and associates the two modalities of Web images. A customized frequent itemsets mining algorithm is designed for the particular MMSARs based on the existing inverted file, and a new support-confidence framework is defined for the mining algorithm. Based on the mined MMSARs, the keywords and the visual features are fused automatically in the retrieval process. The proposed approach not only remarkably improves the retrieval precision, but also has fast response time. The experiments are carried out in a Web image retrieval system, VAST (VisuAl & SemanTic image search), and the results show the superiority and effectiveness of the proposed approach.
AB - A recent trend for image search is to fuse the two basic modalities of Web images, i.e., textual features (usually represented by keywords) and visual features for retrieval. The key issue is how to associate the two modalities for fusion. In this paper, a new approach based on Multi-Modal Semantic Association Rule (MMSAR) is proposed to fuse keywords and visual features automatically for Web image retrieval. A MMSAR contains a single keyword and several visual feature clusters, which crosses and associates the two modalities of Web images. A customized frequent itemsets mining algorithm is designed for the particular MMSARs based on the existing inverted file, and a new support-confidence framework is defined for the mining algorithm. Based on the mined MMSARs, the keywords and the visual features are fused automatically in the retrieval process. The proposed approach not only remarkably improves the retrieval precision, but also has fast response time. The experiments are carried out in a Web image retrieval system, VAST (VisuAl & SemanTic image search), and the results show the superiority and effectiveness of the proposed approach.
KW - Association rule mining
KW - Inverted file
KW - Multi-Modal Semantic Association Rule (MMSAR)
KW - Relevance Feedback (RF)
KW - Web image retrieval
UR - https://www.scopus.com/pages/publications/79954567261
U2 - 10.1016/j.inffus.2010.02.001
DO - 10.1016/j.inffus.2010.02.001
M3 - Article
AN - SCOPUS:79954567261
SN - 1566-2535
VL - 12
SP - 223
EP - 230
JO - Information Fusion
JF - Information Fusion
IS - 3
ER -