Using probabilistic topic models for document similarity computation - Details

Author：

He, Ming (He, Ming.) | Zheng, Wei (Zheng, Wei.)

Indexed by：

EI Scopus

Abstract：

Document　similarity　computation　is　an　exciting　research　topic　in　Information　Retrieval　(IR)　and　it　is　a　key　issue　for　automatic　document　categorization,　clustering　analysis,　fuzzy　query,　and　question　answering.　Topic　model　is　an　emerging　field　in　Natural　Language　Processing　(NLP),　IR,　and　Machine　Learning　(ML).　In　this　paper,　we　apply　a　Latent　Dirichlet　Allocation　(LDA)　topic　model-based　method　to　compute　similarity　between　documents.　By　mapping　a　document　with　term　space　representation　into　a　topic　space,　a　distribution　over　topics　is　derived　for　computing　document　similarity.　An　empirical　study　using　real　data　set　demonstrates　the　efficiency　of　our　method.　©　2015　Taylor　&　Francis　Group,　London.

Keyword：

Statistics Natural language processing systems

Author Community：

[ 1 ] [He, Ming]College of Computer Science, Beijing University of Technology, Beijing, China
[ 2 ] [Zheng, Wei]College of Computer Science, Beijing University of Technology, Beijing, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

U-Statistics and Ensemble Learning Based Method for Gene-Gene Interaction Detection
2018，Computer Research and Development
Variational Bayesian with Embedded Laplace Approximation for AR Model with Outliers
2020，4th International Conference on Artificial Intelligence, Automation and Control Technologies, AIACT 2020
Polynomial normal transform based on l-moments and its application to structural reliability
2019，13th International Conference on Applications of Statistics and Probability in Civil Engineering, ICASP 2019
Public opinion analysis based on probabilistic topic modeling and deep learning
2016，20th Pacific Asia Conference on Information Systems, PACIS 2016

Source ：

Year： 2015

Page： 303-311

Language： English

Cited Count：

WoS CC Cited Count： 0

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

信息学部计算机学院

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to