TY - GEN
T1 - Summarizing long-length videos with GAN-enhanced audio/visual features
AU - Lee, Hansol
AU - Lee, Gyemin
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/10
Y1 - 2019/10
N2 - In this paper, we propose a novel supervised method for summarizing long-length videos. Many recent works presented successful results in video summarization. However, most videos in those works are short in duration (~5 minutes), and the methods often break down on very long videos (~30 minutes). Moreover, most works only use visual features, while audios provide useful features for the task. Based on these observations, we present a model that exploits both visual and audio features. To handle long videos, our model also refines the extracted features using adversarial networks. To demonstrate our model, we have collected a new dataset of 63 e-sports (~30 minutes) videos, each accompanied by an editorial summary video that is about 10% in length of the original video. Evaluation on this dataset suggests that our method produces quality summaries for very long videos.
AB - In this paper, we propose a novel supervised method for summarizing long-length videos. Many recent works presented successful results in video summarization. However, most videos in those works are short in duration (~5 minutes), and the methods often break down on very long videos (~30 minutes). Moreover, most works only use visual features, while audios provide useful features for the task. Based on these observations, we present a model that exploits both visual and audio features. To handle long videos, our model also refines the extracted features using adversarial networks. To demonstrate our model, we have collected a new dataset of 63 e-sports (~30 minutes) videos, each accompanied by an editorial summary video that is about 10% in length of the original video. Evaluation on this dataset suggests that our method produces quality summaries for very long videos.
KW - Audio
KW - GAN
KW - Multimodal
KW - Summarization
KW - Video
UR - https://www.scopus.com/pages/publications/85082444293
U2 - 10.1109/ICCVW.2019.00462
DO - 10.1109/ICCVW.2019.00462
M3 - Conference contribution
AN - SCOPUS:85082444293
T3 - Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019
SP - 3727
EP - 3731
BT - Proceedings - 2019 International Conference on Computer Vision Workshop, ICCVW 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 17th IEEE/CVF International Conference on Computer Vision Workshop, ICCVW 2019
Y2 - 27 October 2019 through 28 October 2019
ER -