Context-aware relation enhancement and similarity reasoning for image-text retrieval - Details

Author：

Cui, Zheng (Cui, Zheng.) | Hu, Yongli (Hu, Yongli.) (Scholars：胡永利) | Sun, Yanfeng (Sun, Yanfeng.) | Yin, Baocai (Yin, Baocai.)

Indexed by：

EI Scopus SCIE

Abstract：

Image-text　retrieval　is　a　fundamental　yet　challenging　task,　which　aims　to　bridge　a　semantic　gap　between　heterogeneous　data　to　achieve　precise　measurements　of　semantic　similarity.　The　technique　of　fine-grained　alignment　between　cross-modal　features　plays　a　key　role　in　various　successful　methods　that　have　been　proposed.　Nevertheless,　existing　methods　cannot　effectively　utilise　intra-modal　information　to　enhance　feature　representation　and　lack　powerful　similarity　reasoning　to　get　a　precise　similarity　score.　Intending　to　tackle　these　issues,　a　context-aware　Relation　Enhancement　and　Similarity　Reasoning　model,　called　RESR,　is　proposed,　which　conducts　both　intra-modal　relation　enhancement　and　inter-modal　similarity　reasoning　while　considering　the　global-context　information.　For　intra-modal　relation　enhancement,　a　novel　context-aware　graph　convolutional　network　is　introduced　to　enhance　local　feature　representations　by　utilising　relation　and　global-context　information.　For　inter-modal　similarity　reasoning,　local　and　global　similarity　features　are　exploited　by　the　bidirectional　alignment　of　image　and　text,　and　the　similarity　reasoning　is　implemented　among　multi-granularity　similarity　features.　Finally,　refined　local　and　global　similarity　features　are　adaptively　fused　to　get　a　precise　similarity　score.　The　experimental　results　show　that　our　effective　model　outperforms　some　state-of-the-art　approaches,　achieving　average　improvements　of　2.5%　and　6.3%　in　R@sum　on　the　Flickr30K　and　MS-COCO　dataset.　A　novel　context-aware　relation　enhancement　and　similarity　reasoning　model　is　proposed　to　achieve　precise　image-text　retrieval,　which　conducts　both　intra-modal　relation　enhancement　and　inter-modal　similarity　reasoning　while　considering　the　global-context　information.　image

Keyword：

image retrieval multimedia systems

Author Community：

[ 1 ] [Cui, Zheng]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 2 ] [Hu, Yongli]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 3 ] [Sun, Yanfeng]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 4 ] [Yin, Baocai]Beijing Univ Technol, Beijing Inst Artificial Intelligence, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, Peoples R China
[ 5 ] [Hu, Yongli]Beijing Univ Technol, 100 Pingleyuan, Beijing, Peoples R China

Reprint Author's Address：

胡永利
[Hu, Yongli]Beijing Univ Technol, 100 Pingleyuan, Beijing, Peoples R China

Email：

huyongli@bjut.edu.cn

Show more details

Related Keywords：

Large Scale Image Retrieval Based on Adaptive Dense-SIFT
2015，3rd IEEE International Conference on Progress in Informatcs and Computing (IEEE PIC)
Medical Image Retrieval Based on the Deep Convolution Network and Hash Coding
2017，10th International Congress on Image and Signal Processing, BioMedical Engineering and Informatics (CISP-BMEI)
Secure Index Construction for Privacy-Preserving Large-scale Image Retrieval
2014，IEEE Fourth International Conference on Big Data and Cloud Computing (BdCloud)
Medical Image Retrieval Based on Convolutional Neural Network and Supervised Hashing
2019，IEEE ACCESS

Source ：

IET COMPUTER VISION

ISSN： 1751-9632

Year： 2024

Issue： 5

Volume： 18

Page： 652-665

1 . 7 0 0

JCR@2022

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count：

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to