Auxiliary criterion conversion via spatiotemporal semantic encoding and feature entropy for action recognition

Author：

Meng, Xiaoyan | Zhang, Guoliang | Jia, Songmin (贾松敏) | Li, Xiuzhi | Zhang, Xiangyin

Indexed by：

EI Scopus SCIE

Abstract：

Video-based action recognition in realistic scenes is a core technology for human-computer interaction and smart surveillance. Although the trajectory features with the bag of visual words have confirmed promising performance, spatiotemporal interactive information cannot be effectively encoded which is valuable for classification. To address this issue, we propose a spatiotemporal semantic feature (ST-SF) and implement the conversion of it to the auxiliary criterion based on the information entropy theory. First, we present a text-based relevance analysis method to estimate the textual labels of objects most relevant to actions, which are employed to train the more targeted detectors based on the deep network. False detections are optimized by the inter-frame cooperativity and dynamic programming to construct the valid tubes. Then, we design the ST-SF to encode the interactive information, and the concept and calculation of feature entropy are defined based on the spatial distribution of ST-SFs on the training set. Finally, we achieve a two-stage classification strategy using the resulting decision gains. Experimental results on three publicly available datasets demonstrate that our method is robust and improves upon the state-of-the-art algorithms.

Keyword：

Spatiotemporal semantic feature; Feature entropy; Action recognition; Text-based relevance analysis; Bag-of-visual-words model

Author Community：

[ 1 ] [Meng, Xiaoyan]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Zhang, Guoliang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Jia, Songmin]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Li, Xiuzhi]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 5 ] [Zhang, Xiangyin]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 6 ] [Meng, Xiaoyan]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 7 ] [Zhang, Guoliang]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 8 ] [Jia, Songmin]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 9 ] [Li, Xiuzhi]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 10 ] [Zhang, Xiangyin]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China
[ 11 ] [Meng, Xiaoyan]Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China
[ 12 ] [Zhang, Guoliang]Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China
[ 13 ] [Jia, Songmin]Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China
[ 14 ] [Li, Xiuzhi]Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China

Reprint Author's Address：

[Zhang, Guoliang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China; [Zhang, Guoliang]Beijing Key Lab Computat Intelligence & Intellige, Beijing 100124, Peoples R China; [Zhang, Guoliang]Minist Educ, Engn Res Ctr Digital Community, Beijing 100124, Peoples R China

Email：

zhangglmxy@foxmail.com

Related Articles：

Undo the codebook bias by linear transformation for visual applications
2013，21st ACM International Conference on Multimedia, MM 2013
Beyond Explicit Codebook Generation: Visual Representation Using Implicitly Transferred Codebooks
2015，IEEE TRANSACTIONS ON IMAGE PROCESSING
Undoing the codebook bias by linear transformation with sparsity and F-norm constraints for image classification
2014，PATTERN RECOGNITION LETTERS
An Improved Body Action Recognition Method Based on Manifold Learning
2015，IEEE International Conference on Mechatronics & Automation

Source ：

VISUAL COMPUTER

ISSN： 0178-2789

Year： 2020

Issue： 7

Volume： 37

Page： 1673-1690

Impact Factor： 3.500 (JCR@2022)

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：132

Cited Count：

WoS CC Cited Count： 2

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

Faculty of Information Technology
