Task-Adaptive End-to-End Networks for Stereo Matching - Details

Author：

Li, Tong (Li, Tong.) | Ma, Wei (Ma, Wei.) | Xu, Shibiao (Xu, Shibiao.) | Zhang, Xiaopeng (Zhang, Xiaopeng.)

Indexed by：

EI CSCD

Abstract：

Estimating　depth/disparity　information　from　stereo　pairs　via　stereo　matching　is　a　classical　research　topic　in　computer　vision.　Recently,　along　with　the　development　of　deep　learning　technologies,　many　end-to-end　deep　networks　have　been　proposed　for　stereo　matching.　These　networks　generally　borrow　convolutional　neural　network　(CNN)　structures　originally　designed　for　other　tasks　to　extract　features.　These　structures　are　generally　redundant　for　the　task　of　stereo　matching.　Besides,　3D　convolutions　in　these　networks　are　too　complex　to　be　extended　for　large　perception　fields　which　are　helpful　for　disparity　estimation.　In　order　to　overcome　these　problems,　we　propose　a　deep　network　structure　based　on　the　properties　of　stereo　matching.　In　the　proposed　network,　a　concise　and　effective　feature　extraction　module　is　presented.　Moreover,　a　separated　3D　convolution　is　introduced　to　avoid　parameter　explosion　caused　by　increasing　the　size　of　convolution　kernels.　We　validate　our　network　on　the　dataset　of　SceneFlow　in　aspects　of　both　accuracy　and　computation　costs.　Results　show　that　the　proposed　network　obtains　state-of-the-art　performance.　Compared　with　the　other　structures,　our　feature　extraction　module　can　reduce　90%　parameters　and　25%　time　cost　while　achieving　comparable　accuracy.　At　the　same　time,　our　separated　3D　convolution,　accompanied　by　group　normalization　(GN),　achieves　lower　end-point-error　(EPE)　than　baseline　methods.　©　2020,　Science　Press.　All　right　reserved.

Keyword：

Stereo image processing Deep learning Convolution Extraction Stereo vision Feature extraction Convolutional neural networks

Author Community：

[ 1 ] [Li, Tong]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 2 ] [Ma, Wei]Faculty of Information Technology, Beijing University of Technology, Beijing; 100124, China
[ 3 ] [Xu, Shibiao]Institute of Automation, Chinese Academy of Sciences, Beijing; 100190, China
[ 4 ] [Zhang, Xiaopeng]Institute of Automation, Chinese Academy of Sciences, Beijing; 100190, China

Reprint Author's Address：

[ma, wei]faculty of information technology, beijing university of technology, beijing; 100124, china

Email：

mawei@bjut.edu.cn

Show more details

Related Keywords：

High accurate and real-time feature extraction in visual servoing systems for robotic arms
2009，Control and Decision
Study on high accurate and real-time feature extraction in visual servomechanism for robotic arms
2009，1st International Conference on Information Science and Engineering, ICISE2009
A new algorithm of quick calibration for binocular camera
2013，Journal of Central South University (Science and Technology)
3D reconstruction of tongue surface based on photometric stereo
2012，2012 11th International Conference on Signal Processing, ICSP 2012
Video flame detection method based on twostream convolutional neural network
2019，8th IEEE Joint International Information Technology and Artificial Intelligence Conference, ITAIC 2019

Source ：

Computer Research and Development

ISSN： 1000-1239

Year： 2020

Issue： 7

Volume： 57

Page： 1531-1538

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 4

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 0

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to