• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Du, Jinlian (Du, Jinlian.) | Jin, Xueyun (Jin, Xueyun.) | Wang, Peng (Wang, Peng.)

Indexed by:

EI Scopus

Abstract:

Word segmentation is a basic topic in the field of natural language processing, and improving the accuracy of word segmentation is a key problem. With the popularity of microblog, accurate word segmentation for microblog text has become a hot spot. However, microblog texts often contain information about multiple related domains, ambiguous words in multi-domain will lead to the decline of word segmentation accuracy. Based on the model theory of word vector and branching entropy, this paper proposes a multi-domain global correlation degree branching entropy method for microblog text word segmentation. This model is applied to microblog text about house price topic in Beijing. The precision, recall and F-measure of this method are compared with branching entropy model proposed by Zhang[6], and the experimental results show that our method outperforms it. © 2020 ACM.

Keyword:

Natural language processing systems Computational linguistics

Author Community:

  • [ 1 ] [Du, Jinlian]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 2 ] [Jin, Xueyun]Faculty of Information Technology, Beijing University of Technology, Beijing, China
  • [ 3 ] [Wang, Peng]Information Technology Department, HUAXIA Bank Beijing Branch, Beijing, China

Reprint Author's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

Year: 2020

Page: 71-75

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count:

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 7

Affiliated Colleges:

Online/Total:609/5313462
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.