|Title:||Multi-Camera Action Dataset (MCAD)|
Wong Yong Kang
|NUS Contact:||Wong Yong Kang|
Learning (artificial intelligence)
Multicamera action dataset
Cross-camera action recognition
Action recognition has received increasing attentions from the computer vision and machine learning community in the last decades. Ever since then, the recognition task has evolved from single view recording under controlled laboratory environment to unconstrained environment (i.e., surveillance environment or user generated videos). Furthermore, recent work focused on other aspect of action recognition problem, such as cross-view classification, cross domain learning, multi-modality learning, and action localization. Despite the large variations of studies, we observed limited works that explore the open-set and open-view classification problem, which is a genuine inherited properties in action recognition problem. In other words, a well designed algorithm should robustly identify an unfamiliar action as “unknown” and achieved similar performance across sensors with similar field of view. The Multi-Camera Action Dataset (MCAD) is designed to evaluate the open-view classification problem under surveillance environment.
In our multi-camera action dataset, different from common action datasets we use a total of five cameras, which can be divided into two types of cameras (StaticandPTZ), to record actions. Particularly, there are three Static cameras (Cam04 & Cam05 & Cam06) with fish eye effect and two PanTilt-Zoom (PTZ) cameras (PTZ04 & PTZ06). Static camera has a resolution of 1280×960 pixels, while PTZ camera has a resolution of 704×576 pixels and a smaller field of view than Static camera. What’s more, we don’t control the illumination environment. We even set two contrasting conditions (Daytime and Nighttime environment) which makes our dataset more challenge than many controlled datasets with strongly controlled illumination environment.The distribution of the cameras is shown in the picture on the right.
We identified 18 units single person daily actions with/without object which are inherited from the KTH, IXMAS, and TRECIVD datasets etc. The list and the definition of actions are shown in the table. These actions can also be divided into 4 types actions. Micro action without object (action ID of 01, 02 ,05) and with object (action ID of 10, 11, 12 ,13). Intense action with object (action ID of 03, 04 ,06, 07, 08, 09) and with object (action ID of 14, 15, 16, 17, 18). We recruited a total of 20 human subjects. Each candidate repeats 8 times (4 times during the day and 4 times in the evening) of each action under one camera. In the recording process, we use five cameras to record each action sample separately. During recording stage we just tell candidates the action name then they could perform the action freely with their own habit, only if they do the action in the field of view of the current camera. This can make our dataset much closer to reality. As a results there is high intra action class variation among different action samples as shown in picture of action samples.
Resources:IDXXXX.mp4.tar.gz contains video data for each individual; boundingbox.tar.gz contains person bounding box for all videos; protocol.json contains the evaluation protocol; img_list.txt contains the download URLs for the images version of the video data; idt_list.txt contians the download URLs for the improved Dense Trajectory feature; stip_list.txt contians the download URLs for the STIP feature. Manual annotated 2D joints for selected camera view and action class (available via http://zju-capg.org/heightmap/)
This dataset is a part of the following research paper. Please ensure the research paper is cited appropriately if you use the MCAD dataset in your work (papers, articles, reports, books, software, etc). For more details, please refer to Citation field.
|Citation:||When using this data, please cite the original publication and also the dataset.|
|License:||Attribution-NonCommercial 4.0 International|
|Appears in Collections:||Staff Dataset|
Show full item record
Files in This Item:
|stip_list.txt||contians the download URLs for the STIP feature||1.05 kB||Text||View/Download|
|protocol.json||contains the evaluation protocol||2.53 kB||Unknown||View/Download|
|img_list.txt||contains the download URLs for the images version of the video data||1.09 kB||Text||View/Download|
|idt_list.txt||contians the download URLs for the improved Dense Trajectory feature||1.09 kB||Text||View/Download|
|boundingbox.tar.gz||contains person bounding box for all videos||9.59 MB||Unknown||View/Download|
This item is licensed under a Creative Commons License