ActionNet-VE Dataset: A Dataset for Describing Visual Events by Extending VIRAT Ground 2.0

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

This paper introduces a dataset for recognizing and describing interactive events between objects of interest including persons, cars, bikes, and carried objects. Although there have been many video datasets for human activity recognition, most of them focus on persons and their actions and sometimes ignore the specific information on related objects, such as their object type and minimum bounding boxes, in annotations. ActionNet-VE dataset was designed to include full annotations on all objects and events of interest occurred in a video clip for describing the semantics of the event. The dataset adopt 75 video clips from VIRAT Ground 2.0, and extend annotations on the events and their related objects. In addition, the dataset describes semantics of each events by using elements of sentences, such as verb, subject, and objects.

Original languageEnglish
Title of host publicationProceedings - 8th International Conference on Signal Processing, Image Processing and Pattern Recognition, SIP 2015
EditorsByeong-Ho Kang, Carlos Ramos
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1-4
Number of pages4
ISBN (Electronic)9781467398558
DOIs
StatePublished - 11 Mar 2016
Event8th International Conference on Signal Processing, Image Processing and Pattern Recognition, SIP 2015 - Jeju Island, Korea, Republic of
Duration: 25 Nov 201528 Nov 2015

Publication series

NameProceedings - 8th International Conference on Signal Processing, Image Processing and Pattern Recognition, SIP 2015

Conference

Conference8th International Conference on Signal Processing, Image Processing and Pattern Recognition, SIP 2015
Country/TerritoryKorea, Republic of
CityJeju Island
Period25/11/1528/11/15

Keywords

  • and VIRAT
  • interactive events
  • Video dataset
  • video interpretation
  • visual events

Fingerprint

Dive into the research topics of 'ActionNet-VE Dataset: A Dataset for Describing Visual Events by Extending VIRAT Ground 2.0'. Together they form a unique fingerprint.

Cite this