GLOCAL: A self-supervised learning framework for global and local motion estimation

Indexed by：

EI Scopus SCIE

Abstract：

Motions in videos are typically a mixture of local dynamic object motions and global camera motion. The two are sometimes inconsistent and can even interfere with each other, which causes difficulties in downstream applications such as video stabilization, which requires the global motion, and action recognition, which consumes local motions. It is therefore crucial to estimate them separately. Existing methods separate the two motions from a mixed motion field, such as optical flow, so the quality of the mixed motion sets the upper bound on their performance. In this work, we propose a framework, GLOCAL, that directly estimates global and local motions simultaneously from adjacent frames in a self-supervised manner. GLOCAL consists of a Global Motion Estimation (GME) module and a Local Motion Estimation (LME) module. The GME module comprises a mixed motion estimation backbone, an implicit bottleneck structure for feature dimension reduction, and an explicit bottleneck that recovers global motion from global motion bases with a foreground mask, trained under the guidance of the proposed global reconstruction loss. An attention U-Net is adopted for LME; it produces local motions while excluding the motion of irrelevant regions, guided by the proposed local reconstruction loss. Our method achieves better performance than existing methods on the homography estimation dataset DHE and on the action recognition datasets NCAA and UCF-101.
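
The decomposition that the abstract attributes to existing methods (separating global and local motion from an already-estimated mixed motion field) can be illustrated with a toy sketch. This is not GLOCAL or any method from the paper: it assumes the camera motion is a pure translation and that background pixels form the majority, so a per-component median recovers the global motion. The function name `split_motion` and the toy flow field are hypothetical.

```python
from statistics import median


def split_motion(flow):
    """Split a mixed flow field into a global translation and local residuals.

    flow: list of rows, each row a list of (dx, dy) tuples, one per pixel.
    Assumption (illustrative only): the global motion is a single translation
    and most pixels belong to the background, so the median is a robust
    estimate of it.
    """
    dxs = [dx for row in flow for dx, _ in row]
    dys = [dy for row in flow for _, dy in row]
    global_motion = (median(dxs), median(dys))
    # Local motion is whatever remains after removing the global translation.
    local = [
        [(dx - global_motion[0], dy - global_motion[1]) for dx, dy in row]
        for row in flow
    ]
    return global_motion, local


# Toy 3x4 field: the camera pans by (2, -1); a 1x2 object moves by an extra (3, 0).
flow = [[(2.0, -1.0)] * 4 for _ in range(3)]
flow[1][1] = (5.0, -1.0)
flow[1][2] = (5.0, -1.0)

global_motion, local = split_motion(flow)
print(global_motion)  # global translation estimate
print(local[1][1])    # residual motion on the moving object
```

The sketch also shows the limitation the abstract points out: any error in the input flow field carries straight into both outputs, which is the motivation for estimating the two motions directly from the frames.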

Keyword：

motion estimation; motion pattern; video understanding; optical flow

Author Community：

[ 1 ] [Zheng, Yihao]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 2 ] [Li, Zun]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 3 ] [Xiang, Ye]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 4 ] [Wu, Lifang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
[ 5 ] [Luo, Kunming]Megvii Technol, Beijing 100190, Peoples R China
[ 6 ] [Liu, Shuaicheng]Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[ 7 ] [Zeng, Bing]Univ Elect Sci & Technol China, Sch Informat & Commun Engn, Chengdu 611731, Peoples R China
[ 8 ] [Chen, Chang Wen]Hong Kong Polytech Univ, Dept Comp, Hong Kong 999077, Peoples R China

Reprint Author's Address：

毋立芳
[Wu, Lifang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China

Related Keywords：

Variational optical flow based Velocity Estimation for Omni-Directional Intelligent Wheelchair
2017，10th International Symposium on Computational Intelligence and Design (ISCID)
AN IMPROVED INTER-PREDICTION MOTION ESTIMATION ALGORITHM OF AVS
2009，2nd IEEE International Conference on Broadband Network and Multimedia Technology
A Based Motion Prediction Algorithm of the Motion Estimation for H.264
2009，International Symposium on Image Analysis and Signal Processing
A fast stereoscopic video coding algorithm based on JMVM
2011，SCIENCE CHINA-INFORMATION SCIENCES

Source ：

PATTERN RECOGNITION LETTERS

ISSN： 0167-8655

Year： 2024

Volume： 178

Page： 91-97

Impact Factor： 5.100 (JCR@2022)

Cited Count：

WoS CC Cited Count： 3

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0
