• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
搜索

Author:

Han, Zongwang (Han, Zongwang.) | Lin, Shaofu (Lin, Shaofu.) | Huang, Zhisheng (Huang, Zhisheng.) | Guo, Chaohui (Guo, Chaohui.)

Indexed by:

CPCI-S EI

Abstract:

In recent years, with the exploration of pathological mechanisms and treatments of Long COVID, there has been a dramatic increase in related scientific publications. Effective extraction of key information from these texts is of great importance for public health and research progress. In the Long COVID context, Named Entity Recognition (NER) can be used to identify disease names as well as symptoms, which can help to analyze the sequelae caused by COVID-19 and its relationship with other diseases. Distinguished from molecular biomedical text mining, which focuses on the identification of entities such as genes, proteins, and chemistries and their relationships, Long COVID text mining faces problems such as the lack of publicly labeled datasets and the heavy workload of manual annotation. Moreover, due to the strong domain characteristics of Long COVID relevant named entities, models and methods that have achieved great performance in the generic domain will have significantly degraded named entity recognition performance on this domain. Based on the above problems, we constructed a Long COVID literature abstract NER dataset (LNER) and proposed a Long COVID biomedical literature NER model Bert-BiLSTM-IDCNN-ATT-CRF (BBIAC). First, the BERT-BiLSTM-CRF model is constructed on the LNER dataset. Then, the inflated convolutional neural network (IDCNN) is added between the BiLSTM and the CRF layers to obtain the local features in the text sequences. Finally, feature enhancement is performed by fusing the features of global and local information using the attention mechanism. The experimental results show that the method proposed in this paper for Long COVID literature can accurately extract the characteristic information of Long COVID symptoms and diseases, and has better performance compared to other baseline models.

Keyword:

LNER Dataset Long COVID Biomedical literature Bert-BiLSTM-IDCNNATT-CRF Named Entity Recognition

Author Community:

  • [ 1 ] [Han, Zongwang]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Lin, Shaofu]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Huang, Zhisheng]Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands
  • [ 4 ] [Guo, Chaohui]Vrije Univ Amsterdam, Dept Comp Sci, Amsterdam, Netherlands

Reprint Author's Address:

Show more details

Related Keywords:

Related Article:

Source :

PROCEEDINGS OF 2023 4TH INTERNATIONAL SYMPOSIUM ON ARTIFICIAL INTELLIGENCE FOR MEDICINE SCIENCE, ISAIMS 2023

Year: 2023

Page: 1200-1205

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:784/5321999
Address:BJUT Library(100 Pingleyuan,Chaoyang District,Beijing 100124, China Post Code:100124) Contact Us:010-67392185
Copyright:BJUT Library Technical Support:Beijing Aegean Software Co., Ltd.