• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Du, Yongping (Du, Yongping.) (学者:杜永萍) | Li, Qingxiao (Li, Qingxiao.) | Wang, Lulin (Wang, Lulin.) | He, Yanqing (He, Yanqing.)

收录:

EI Scopus SCIE

摘要:

In recent years, the performance of deep neural network in extractive summarization task has been improved significantly compared with traditional methods. However, in the field of biomedical extractive summarization, existing methods cannot make good use of the domain-aware external knowledge; furthermore, the document structural feature is omitted by existing deep neural network model. In this paper, we propose a novel model called BioBERTSum to better capture token-level and sentence-level contextual representation, which uses a domain-aware bidirectional language model pre-trained on large-scale biomedical corpora as encoder, and further fine-tunes the language model for extractive text summarization task on single biomedical document. Especially, we adopt a sentence position embedding mechanism, which enables the model to learn the position information of sentences and achieve the structural feature of document. To the best of our knowledge, this is the first work to use the pre-trained language model and fine-tuning strategy for extractive summarization task in the biomedical domain. Experiments on PubMed dataset show that our proposed model outperforms the recent SOTA (state-of-the-art) model by ROUGE-1/2/L. (C) 2020 Elsevier B.V. All rights reserved.

关键词:

Document representation Pre-trained language model Fine-tuning Extractive biomedical summarization

作者机构:

  • [ 1 ] [Du, Yongping]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 2 ] [Li, Qingxiao]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 3 ] [Wang, Lulin]Beijing Univ Technol, Fac Informat Technol, Beijing 100124, Peoples R China
  • [ 4 ] [He, Yanqing]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China

通讯作者信息:

  • [He, Yanqing]Inst Sci & Tech Informat China, Beijing 100038, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

KNOWLEDGE-BASED SYSTEMS

ISSN: 0950-7051

年份: 2020

卷: 199

8 . 8 0 0

JCR@2022

ESI学科: COMPUTER SCIENCE;

ESI高被引阀值:132

被引次数:

WoS核心集被引频次: 32

SCOPUS被引频次: 45

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:497/3908215
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司