• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Wang, Xiaoyu (Wang, Xiaoyu.) | Zhai, Yujia (Zhai, Yujia.) | Lin, Yuanhai (Lin, Yuanhai.) | Wang, Fang (Wang, Fang.)

收录:

SSCI EI Scopus SCIE

摘要:

Tech mining is the application of text mining tools to science and technology information resources. The ever-increasing volume of scientific outputs is a boom to technological innovation, but it also complicates efforts to obtain useful and concise information for problem solving. This challenge extends to tech mining, where the development of techniques compatible with big data is an urgent issue. This article introduces a semi-supervised method for extracting layered technological information from scientific papers in order to extend the reach of tech mining. Our method starts with several pre-set seed patterns used to extract candidate phrases by matching the dependency tree of each sentence. Then, after a series of judgements, phrases are divided into two categories: 'main technique' and 'tech-component'. (A technique, for the purposes of this study, is a method or tool used in the article being analysed.) In order to generate new patterns for subsequent iterations, a weighted pattern learning method is also adopted. Finally, multiple iterations of the method are applied to extract technological information from each paper. A dataset from the field of optical switcher is used to verify the method's effectiveness. Our findings are that (1) by two loops of extraction process in each iteration, our method realises the layered technological information extraction, which contains the 'part-whole' relationships between main techniques and tech-components; (2) the recall rate for main techniques is superior to the baseline after iterating 23 rounds; (3) when layering is disregarded, in the aspect of the precision and the volume of techniques, the new method is higher than that for the baseline; and (4) adjusting another two parameters can optimise the efficiency - however, the effect is neither pronounced nor straightforward.

关键词:

semi-supervised learning information extraction Dependency tree tech mining

作者机构:

  • [ 1 ] [Wang, Xiaoyu]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China
  • [ 2 ] [Wang, Fang]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China
  • [ 3 ] [Wang, Xiaoyu]CETC Big Data Res Inst Co Ltd, Guiyang, Guizhou, Peoples R China
  • [ 4 ] [Zhai, Yujia]Tianjin Normal Univ, Sch Management, Dept Informat Resource Management, Tianjin, Peoples R China
  • [ 5 ] [Lin, Yuanhai]Beijing Univ Technol, Inst Informat Photon Technol, Beijing, Peoples R China
  • [ 6 ] [Lin, Yuanhai]Beijing Univ Technol, Coll Appl Sci, Beijing, Peoples R China

通讯作者信息:

  • [Wang, Fang]Nankai Univ, Business Sch, Dept Informat Resource Management, 94 Weijin Rd, Tianjin 300071, Peoples R China

电子邮件地址:

查看成果更多字段

相关关键词:

来源 :

JOURNAL OF INFORMATION SCIENCE

ISSN: 0165-5515

年份: 2019

期: 6

卷: 45

页码: 779-793

2 . 4 0 0

JCR@2022

ESI学科: SOCIAL SCIENCES, GENERAL;

ESI高被引阀值:84

JCR分区:3

被引次数:

WoS核心集被引频次: 2

SCOPUS被引频次: 5

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

在线人数/总访问数:204/4298528
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司