Towards Efficient Coarse-grained Dialogue Response Selection

Author：

Lan, Tian | Mao, Xian-Ling | Wei, Wei | Gao, Xiaoyan | Huang, Heyan

Indexed by：

EI Scopus

Abstract：

Coarse-grained response selection is a fundamental and essential subsystem for the widely used retrieval-based chatbots, aiming to recall a coarse-grained candidate set from a large-scale dataset. The dense retrieval technique has recently been proven very effective in building such a subsystem. However, dialogue dense retrieval models face two problems in real scenarios: (1) the multi-turn dialogue history is re-computed in each turn, leading to inefficient inference; (2) the index storage of the offline index is enormous, significantly increasing the deployment cost. To address these problems, we propose an efficient coarse-grained response selection subsystem consisting of two novel methods. Specifically, to address the first problem, we propose the Hierarchical Dense Retrieval. It caches rich multi-vector representations of the dialogue history and only encodes the latest user’s utterance, leading to better inference efficiency. Then, to address the second problem, we design the Deep Semantic Hashing to reduce the index storage while effectively saving its recall accuracy notably. Extensive experimental results prove the advantages of the two proposed methods over previous works. Specifically, with the limited performance loss, our proposed coarse-grained response selection model achieves over 5x FLOPs speedup and over 192x storage compression ratio. Moreover, our source codes have been publicly released. © 2023 Association for Computing Machinery. All rights reserved.
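
As a rough illustration of why hashing a dense index shrinks storage, the sketch below binarizes float vectors by sign and searches by Hamming distance. It is a generic sign-hashing example, not the Deep Semantic Hashing model from the paper; the names `binarize` and `hamming_search`, the random data, and the 128-dimensional size are all hypothetical. For scale, compressing a 768-dimensional float32 vector (24,576 bits) to a 128-bit code would give 768 × 32 / 128 = 192x, but this record does not state the dimensions actually used, so that configuration is an assumption.

```python
import numpy as np

def binarize(vectors: np.ndarray) -> np.ndarray:
    """Keep one bit per dimension (1 if the value is positive) and pack 8 bits per byte."""
    bits = (vectors > 0).astype(np.uint8)
    return np.packbits(bits, axis=1)

def hamming_search(query_code: np.ndarray, index_codes: np.ndarray, k: int) -> np.ndarray:
    """Return the indices of the k codes closest to query_code in Hamming distance."""
    xor = np.bitwise_xor(index_codes, query_code)       # differing bits, byte by byte
    dists = np.unpackbits(xor, axis=1).sum(axis=1)      # count the differing bits per row
    return np.argsort(dists, kind="stable")[:k]

# Hypothetical index: 1000 random float32 vectors of dimension 128.
rng = np.random.default_rng(0)
dense = rng.standard_normal((1000, 128)).astype(np.float32)
codes = binarize(dense)

# float32 uses 32 bits per dimension, the binary code uses 1 bit per dimension.
ratio = dense.nbytes / codes.nbytes

# Querying with the code of item 42 should retrieve item 42 itself (distance 0).
top = hamming_search(codes[42:43], codes, k=5)
```

Plain sign hashing like this usually loses recall; a learned hashing model such as the one the abstract describes is meant to limit that loss.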

Keyword：

Speech processing; Semantics; Large dataset; Coarse-grained modeling

Author Community：

[ 1 ] [Lan, Tian] Beijing Institute of Technology, 5 South Zhongguancun Street, Haidian District, Beijing, Beijing; 100081, China
[ 2 ] [Mao, Xian-Ling] Beijing Institute of Technology, 5 South Zhongguancun Street, Haidian District, Beijing, Beijing; 100081, China
[ 3 ] [Wei, Wei] Huazhong University of Science and Technology, Hubei, Wuhan; 430074, China
[ 4 ] [Gao, Xiaoyan] Beijing University of Technology, 5 South Zhongguancun Street, Haidian District, Beijing, Beijing; 100081, China
[ 5 ] [Huang, Heyan] Beijing Institute of Technology, 5 South Zhongguancun Street, Haidian District, Beijing, Beijing; 100081, China

Source ：

ACM Transactions on Information Systems

ISSN： 1046-8188

Year： 2023

Issue： 2

Volume： 42

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0
