TY - GEN
T1 - A knowledge-driven approach to interactive event recognition for semantic video understanding
AU - Moon, Jinyoung
AU - Kwon, Yongjin
AU - Kang, Kyuchang
AU - Park, Jongyoul
AU - Han, Yong Jin
AU - Lee, Young Wha
N1 - Publisher Copyright:
© 2016 IEEE.
PY - 2016/11/9
Y1 - 2016/11/9
N2 - Since early 1990, event recognition has been one of the most attractive research topics for video understanding, in company with object recognition. Most studies on video event recognition, which are based on data-driven approaches, should train a model for a newly-added event without using human knowledge and existing models for similar events. Because it is impossible to define all events required for video understanding in advance, this paper proposed a hierarchical recognition method for general events based on dynamic spatial relations between two objects and specialized events determined by the related objects. The general events are useful for describing interactions between objects of interest regardless of video domain. The specialized events can be provided to users as familiar terms in video interpretation or visual question answering for user-friendly interaction. For two general events and their specialized four events, the proposed recognition method performed the F-score of 82.31% and 88.61% based on object-based and region-based event matching, respectively.
AB - Since early 1990, event recognition has been one of the most attractive research topics for video understanding, in company with object recognition. Most studies on video event recognition, which are based on data-driven approaches, should train a model for a newly-added event without using human knowledge and existing models for similar events. Because it is impossible to define all events required for video understanding in advance, this paper proposed a hierarchical recognition method for general events based on dynamic spatial relations between two objects and specialized events determined by the related objects. The general events are useful for describing interactions between objects of interest regardless of video domain. The specialized events can be provided to users as familiar terms in video interpretation or visual question answering for user-friendly interaction. For two general events and their specialized four events, the proposed recognition method performed the F-score of 82.31% and 88.61% based on object-based and region-based event matching, respectively.
UR - https://www.scopus.com/pages/publications/85006272105
U2 - 10.1109/ICITCS.2016.7740305
DO - 10.1109/ICITCS.2016.7740305
M3 - Conference contribution
AN - SCOPUS:85006272105
T3 - 2016 6th International Conference on IT Convergence and Security, ICITCS 2016
BT - 2016 6th International Conference on IT Convergence and Security, ICITCS 2016
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 6th International Conference on IT Convergence and Security, ICITCS 2016
Y2 - 26 September 2016 through 29 September 2016
ER -