Author:

Liu, Yuxuan | Ai, Haizhou | Xing, Junliang | Li, Xuri | Wang, Xiaoyi | Tao, Pin

Indexed by:

EI Scopus

Abstract:

Multiple views play a vital role in 3D pose estimation tasks. Ideally, multi-view 3D pose estimation should work directly on naturally collected videos. However, because the videos must be synchronized, existing methods often rely on expensive hardware to start the cameras simultaneously, which restricts most 3D pose collection scenarios to indoor settings. Some recent works train deep neural networks to align desynchronized datasets derived from synchronized cameras, but they achieve only frame-level accuracy. For fractional-frame video synchronization, this work proposes the Inter-Frame and Intra-Frame Desynchronized Dataset (IFID), which labels fractional time intervals between two video clips. IFID is the first dataset to annotate both inter-frame and intra-frame intervals; with 382,500 annotated video clips in total, it is the largest dataset to date. We also develop a novel Transformer-based model, named InSynFormer, for inter-frame and intra-frame synchronization. Extensive experimental evaluations demonstrate its promising performance. The dataset and source code of the model are available at https://github.com/yuxuan-cser/InSynFormer. Copyright © 2024, Association for the Advancement of Artificial Intelligence (www.aaai.org). All rights reserved.

Keyword:

Deep neural networks | Video cameras | Synchronization

Author Community:

  • [ 1 ] [Liu, Yuxuan]Key Laboratory of Pervasive Computing, Ministry of Education, China
  • [ 2 ] [Liu, Yuxuan]Department of Computer Science and Technology, Tsinghua University, Beijing; 100084, China
  • [ 3 ] [Ai, Haizhou]Key Laboratory of Pervasive Computing, Ministry of Education, China
  • [ 4 ] [Ai, Haizhou]Department of Computer Science and Technology, Tsinghua University, Beijing; 100084, China
  • [ 5 ] [Xing, Junliang]Key Laboratory of Pervasive Computing, Ministry of Education, China
  • [ 6 ] [Xing, Junliang]Department of Computer Science and Technology, Tsinghua University, Beijing; 100084, China
  • [ 7 ] [Li, Xuri]Beijing University of Technology, Beijing; 100124, China
  • [ 8 ] [Wang, Xiaoyi]Unaffiliated Scholar, Haidian District, Beijing, China
  • [ 9 ] [Tao, Pin]Key Laboratory of Pervasive Computing, Ministry of Education, China
  • [ 10 ] [Tao, Pin]Department of Computer Science and Technology, Tsinghua University, Beijing; 100084, China

Source:

ISSN: 2159-5399

Year: 2024

Issue: 4

Volume: 38

Page: 3828-3836

Language: English

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0
