Multi-channel and multi-scale mid-level image representation for scene classification - Details

Author：

Yang, Jinfu (Yang, Jinfu.) (Scholars：杨金福) | Yang, Fei (Yang, Fei.) | Wang, Guanghui (Wang, Guanghui.) | Li, Mingai (Li, Mingai.) (Scholars：李明爱)

Indexed by：

EI Scopus SCIE

Abstract：

Convolutional　neural　network　(CNN)-based　approaches　have　received　state-of-the-art　results　in　scene　classification.　Features　from　the　output　of　fully　connected　(FC)　layers　express　one-dimensional　semantic　information　but　lose　the　detailed　information　of　objects　and　the　spatial　information　of　scene　categories.　On　the　contrary,　deep　convolutional　features　have　been　proved　to　be　more　suitable　for　describing　an　object　itself　and　the　spatial　relations　among　objects　in　an　image.　In　addition,　the　feature　map　from　each　layer　is　max-pooled　within　local　neighborhoods,　which　weakens　the　invariance　of　global　consistency　and　is　unfavorable　to　scenes　with　highly　complicated　variation.　To　cope　with　the　above　issues,　an　orderless　multi-channel　mid-level　image　representation　on　pre-trained　CNN　features　is　proposed　to　improve　the　classification　performance.　The　mid-level　image　representation　of　two　channels　from　the　FC　layer　and　the　deep　convolutional　layer　are　integrated　at　multi-scale　levels.　A　sum　pooling　approach　is　also　employed　to　aggregate　multi-scale　mid-level　image　representation　to　highlight　the　importance　of　the　descriptors　beneficial　for　scene　classification.　Extensive　experiments　on　SUN397　and　MIT　67　indoor　datasets　demonstrate　that　the　proposed　method　achieves　promising　classification　performance.　(C)　2017　SPIE　and　IS&T

Keyword：

mid-level representation scene classification convolutional neural network multi-channel

Author Community：

[ 1 ] [Yang, Jinfu]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 2 ] [Yang, Fei]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 3 ] [Li, Mingai]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China
[ 4 ] [Wang, Guanghui]Univ Kansas, Dept Elect Engn & Comp Sci, Lawrence, KS 66045 USA

Reprint Author's Address：

[Yang, Fei]Beijing Univ Technol, Fac Informat Technol, Beijing, Peoples R China

Email：

yangfei199217@emails.bjut.edu.cn

Show more details

Related Keywords：

A light intelligent diagnosis model based on improved Online Dictionary Learning sample-making and simplified convolutional neural network
2021，MEASUREMENT
Multi-level attention model with deep scattering spectrum for acoustic scene classification
2019，2019 IEEE International Conference on Multimedia and Expo Workshops, ICMEW 2019
Evaluation of convolutional neural network for recognizing uterine contractions with electrohysterogram
2019，COMPUTERS IN BIOLOGY AND MEDICINE
3DCNN-Based Real-Time Driver Fatigue Behavior Detection in Urban Rail Transit
2019，IEEE ACCESS

Source ：

JOURNAL OF ELECTRONIC IMAGING

ISSN： 1017-9909

Year： 2017

Issue： 2

Volume： 26

1 . 1 0 0

JCR@2022

ESI Discipline： ENGINEERING;

ESI HC Threshold：165

CAS Journal Grade：4

Cited Count：

WoS CC Cited Count： 4

SCOPUS Cited Count： 7

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to