relation: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/3123/
title: Selecting active frames for action recognition with 3D convolutional network
creator: Hoang, Tieu Binh
creator: Ma, Thi Chau
creator: Akihiro, Sugimoto
creator: Bui, The Duy
subject: Information Technology (IT)
description: Recent applications of Convolutional Neural Networks, especially 3-Dimensional Convoltutional Neural Networks (3DCNNs) for human action recognition  (HAR) in videos have widely used. In this paper, we  use a multi-stream framework which is a combination  from separated networks with different kind of input  generated from unique video dataset. To achieve the  high results, firstly, we proposed a method to extract  the active frames (called Selected Active Frames -  SAF) from a videos to build datasets for 3DCNNs in  video classifying problem. Second, we deploy a new  approach called Vote fusion which considered as an  effective fusion method for ensembling multi-stream  networks. From the various datasets generated from  videos, we extract frames by our method and feed  into 3DCNNs for feature extraction, then we carry out  training and then fuse the results of softmax layers  of these streams. We evaluate the proposed methods  on solving action recognition problem. These method  are carried on three well-known datasets (HMFB51,  UCF101, and KTH). The results are also compared to  the state-of-the-art results to illustrate the efficiency  and effectiveness in our approach
date: 2018-09
type: Conference or Workshop Item
type: PeerReviewed
format: application/pdf
language: en
identifier: https://eprints.uet.vnu.edu.vn/eprints/id/eprint/3123/1/ICCCE2018_BinhHT.pdf
identifier:   Hoang, Tieu Binh and Ma, Thi Chau and Akihiro, Sugimoto and Bui, The Duy  (2018) Selecting active frames for action recognition with 3D convolutional network.  In: 2018 the 7th International Conference on Computer and Communication Engineering (ICCCE), September 2018, Malaysia.