• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Wu, Lifang (Wu, Lifang.) (学者:毋立芳) | Yang, Zhou (Yang, Zhou.) | Jian, Meng (Jian, Meng.) | Shen, Jialie (Shen, Jialie.) | Yang, Yuchen (Yang, Yuchen.) | Lang, Xianglong (Lang, Xianglong.)

收录:

EI Scopus SCIE

摘要:

Motion information used in the existed video action recognition schemes is mixing of global motion(GM) and local motion(LM). In fact, GM & LM have their respective semantic concepts. Thus, it is promising to decouple GM and LM from the mixed motions. Numerous efforts have been made on the design of global motion models for video encoding, video dejittering, video denoising, and so on. Nevertheless, some of the models are too basic to cover the camera motions in action recognition while others are over-complicated. In this paper, we focus on the characteristic of the action recognition and propose a novel independent univariate GM model. It ignores camera rotation, which appears rarely in action recognition videos, and represents the GM in x and y direction respectively. Furthermore, GM is position invariant because it is from the universal camera motion. Pixels with global motions are subjected to the same parametric model and pixels with mixed motion can be seen as outliers. Motivated by this, we develop an iterative optimization scheme for GM estimation which removes the outlier points step by step and estimates global motions in a coarse-to-fine manner. Finally, the LM is estimated through a Spatio-temporal threshold-based method. Experimental results demonstrate that the proposed GM model makes a better trade-off between the model complexity and the robustness. And the iterative optimization scheme is more effective than the existed algorithms. The compared experiments using four popular action recognition models on UCF-101 (for action recognition) and NCAA (for group activity recognition) demonstrate that local motions are more effective than the mixed motions. © 2021

关键词:

Motion estimation Semantics Statistics Cameras Iterative methods Pixels Image coding Video signal processing Economic and social effects

作者机构:

  • [ 1 ] [Wu, Lifang]Beijing University of Technology, Beijing, China
  • [ 2 ] [Wu, Lifang]Beijing Municipal Key Lab of Computation Intelligence and Intelligent Systems, Beijing, China
  • [ 3 ] [Yang, Zhou]Beijing University of Technology, Beijing, China
  • [ 4 ] [Jian, Meng]Beijing University of Technology, Beijing, China
  • [ 5 ] [Jian, Meng]Beijing Municipal Key Lab of Computation Intelligence and Intelligent Systems, Beijing, China
  • [ 6 ] [Shen, Jialie]School of Electronics, Electrical Engineering and Computer Science, Queen's University, Belfast, United Kingdom
  • [ 7 ] [Yang, Yuchen]Beijing University of Technology, Beijing, China
  • [ 8 ] [Lang, Xianglong]Beijing University of Technology, Beijing, China

通讯作者信息:

  • [jian, meng]beijing municipal key lab of computation intelligence and intelligent systems, beijing, china;;[jian, meng]beijing university of technology, beijing, china

电子邮件地址:

查看成果更多字段

相关关键词:

相关文章:

来源 :

Pattern Recognition

ISSN: 0031-3203

年份: 2021

卷: 116

8 . 0 0 0

JCR@2022

ESI学科: ENGINEERING;

ESI高被引阀值:87

JCR分区:1

被引次数:

WoS核心集被引频次: 0

SCOPUS被引频次: 24

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:190/4518101
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司