• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Song, Bingyan (Song, Bingyan.) | Bao, Zhenshan (Bao, Zhenshan.) | Wang, YueZhang (Wang, YueZhang.) | Zhang, Wenbo (Zhang, Wenbo.) | Sun, Chao (Sun, Chao.)

收录:

EI Scopus

摘要:

Little research has been done on the Named Entity Recognition (NER) of Traditional Chinese Medicine (TCM) books and most of them use statistical models such as Conditional Random Fields (CRFs). However, in these methods, lexicon information and large-scale of unlabeled corpus data are not fully exploited. In order to improve the performance of NER for TCM books, we propose a method which is based on biLSTM-CRF model and can incorporate lexicon information into representation layer to enrich its semantic information. We compared our approach with several previous character-based and word-based methods. Experiments on 'Shanghan Lun' dataset show that our method outperforms previous models. In addition, we collected 376 TCM books to construct a large-scale of corpus to obtain the pre-trained vectors since there is no large available corpus in this field before. We have released the corpus and pre-trained vectors to the public. © 2020, Springer Nature Switzerland AG.

关键词:

Medicine Semantics Natural language processing systems Random processes

作者机构:

  • [ 1 ] [Song, Bingyan]College of Computer Science, Beijing University of Technology, Beijing; 100124, China
  • [ 2 ] [Bao, Zhenshan]College of Computer Science, Beijing University of Technology, Beijing; 100124, China
  • [ 3 ] [Wang, YueZhang]College of Computer Science, Beijing University of Technology, Beijing; 100124, China
  • [ 4 ] [Zhang, Wenbo]College of Computer Science, Beijing University of Technology, Beijing; 100124, China
  • [ 5 ] [Sun, Chao]College of Chinese Medicine, Capital Medical University, Beijing; 100069, China

通讯作者信息:

  • [zhang, wenbo]college of computer science, beijing university of technology, beijing; 100124, china

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

ISSN: 0302-9743

年份: 2020

卷: 12431 LNAI

页码: 481-489

语种: 英文

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

在线人数/总访问数:3602/4252460
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司