비디오 프레임 선택을 통한 주거 공간 인간 행동 인식 모델 경량화 방안 제안

Translated title of the contribution: A Proposal for Lightweight Human Action Recognition Model with Video Frame Selection for Residential Area

Research output: Contribution to journalArticlepeer-review

Abstract

Residential area closed-circuit televisions (CCTVs) need human action recognition (HAR) to predict any accidents and crucial problems. HAR model must be not only accurate but also light and fast to apply in the real world. Therefore, in this paper, a cross-modal PoseC3D model with a frame selection method is proposed. The proposed cross-modal PoseC3D model integrates multi-modality inputs (i.e., RGB image and human skeleton data) and trains them in a single model. Thus, the proposed model is lighter and faster than previous works such as two-pathway PoseC3D. Moreover, we apply the frame selection method to use only the meaningful frames based on differences between frames instead of using the whole frame of a video. AI Hub open dataset was used to verify the performance of proposed method. The experimental results showed that the proposed method achieves similar or better performance and is much lighter and faster than those in the previous works.
Translated title of the contributionA Proposal for Lightweight Human Action Recognition Model with Video Frame Selection for Residential Area
Original languageKorean
Pages (from-to)1111-1120
Number of pages10
Journal정보과학회논문지
Volume50
Issue number12
DOIs
StatePublished - 2023

Fingerprint

Dive into the research topics of 'A Proposal for Lightweight Human Action Recognition Model with Video Frame Selection for Residential Area'. Together they form a unique fingerprint.

Cite this