• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Han, Zongwang (Han, Zongwang.) | Lin, Shaofu (Lin, Shaofu.) | Huang, Zhisheng (Huang, Zhisheng.) | Guo, Chaohui (Guo, Chaohui.)

收录:

CPCI-S EI

摘要:

In recent years, with the exploration of pathological mechanisms and treatments of Long COVID, there has been a dramatic increase in related scientific publications. Effective extraction of key information from these texts is of great importance for public health and research progress. In the Long COVID context, Named Entity Recognition (NER) can be used to identify disease names as well as symptoms, which can help to analyze the sequelae caused by COVID-19 and its relationship with other diseases. Distinguished from molecular biomedical text mining, which focuses on the identification of entities such as genes, proteins, and chemistries and their relationships, Long COVID text mining faces problems such as the lack of publicly labeled datasets and the heavy workload of manual annotation. Moreover, due to the strong domain characteristics of Long COVID relevant named entities, models and methods that have achieved great performance in the generic domain will have significantly degraded named entity recognition performance on this domain. Based on the above problems, we constructed a Long COVID literature abstract NER dataset (LNER) and proposed a Long COVID biomedical literature NER model Bert-BiLSTM-IDCNN-ATT-CRF (BBIAC). First, the BERT-BiLSTM-CRF model is constructed on the LNER dataset. Then, the inflated convolutional neural network (IDCNN) is added between the BiLSTM and the CRF layers to obtain the local features in the text sequences. Finally, feature enhancement is performed by fusing the features of global and local information using the attention mechanism. The experimental results show that the method proposed in this paper for Long COVID literature can accurately extract the characteristic information of Long COVID symptoms and diseases, and has better performance compared to other baseline models.

关键词:

LNER Dataset Long COVID Biomedical literature Bert-BiLSTM-IDCNNATT-CRF Named Entity Recognition

作者机构:

  • [ 1 ] [Han, Zongwang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Lin, Shaofu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Huang, Zhisheng]Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
  • [ 4 ] [Guo, Chaohui]Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands

通讯作者信息:

查看成果更多字段

相关关键词:

相关文章:

来源 :

PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023

年份: 2023

页码: 1200-1205

被引次数:

WoS核心集被引频次:

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:445/4985151
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司