TY - JOUR
T1 - Extensible hierarchical method of detecting interactive actions for video understanding
AU - Moon, Jinyoung
AU - Jin, Junho
AU - Kwon, Yongjin
AU - Kang, Kyuchang
AU - Park, Jongyoul
AU - Park, Kyoung
N1 - Publisher Copyright:
© ETRI.
PY - 2017/8
Y1 - 2017/8
N2 - For video understanding, namely analyzing who did what in a video, actions along with objects are primary elements. Most studies on actions have handled recognition problems for a well-trimmed video and focused on enhancing their classification performance. However, action detection, including localization as well as recognition, is required because, in general, actions intersect in time and space. In addition, most studies have not considered extensibility for a newly added action that has been previously trained. Therefore, proposed in this paper is an extensible hierarchical method for detecting generic actions, which combine object movements and spatial relations between two objects, and inherited actions, which are determined by the related objects through an ontology and rule based methodology. The hierarchical design of the method enables it to detect any interactive actions based on the spatial relations between two objects. The method using object information achieves an F-measure of 90.27%. Moreover, this paper describes the extensibility of the method for a new action contained in a video from a video domain that is different from the dataset used.
AB - For video understanding, namely analyzing who did what in a video, actions along with objects are primary elements. Most studies on actions have handled recognition problems for a well-trimmed video and focused on enhancing their classification performance. However, action detection, including localization as well as recognition, is required because, in general, actions intersect in time and space. In addition, most studies have not considered extensibility for a newly added action that has been previously trained. Therefore, proposed in this paper is an extensible hierarchical method for detecting generic actions, which combine object movements and spatial relations between two objects, and inherited actions, which are determined by the related objects through an ontology and rule based methodology. The hierarchical design of the method enables it to detect any interactive actions based on the spatial relations between two objects. The method using object information achieves an F-measure of 90.27%. Moreover, this paper describes the extensibility of the method for a new action contained in a video from a video domain that is different from the dataset used.
KW - Action detection
KW - Generic action
KW - Hierarchical action composition
KW - Inherited action
KW - Video understanding
UR - https://www.scopus.com/pages/publications/85032477067
U2 - 10.4218/etrij.17.0116.0054
DO - 10.4218/etrij.17.0116.0054
M3 - Article
AN - SCOPUS:85032477067
SN - 1225-6463
VL - 39
SP - 502
EP - 513
JO - ETRI Journal
JF - ETRI Journal
IS - 4
ER -