Abstract:
Scientific publications and patents are usually viewed as proxies for scientific research and technical development, respectively. Considerable effort has been spent on establishing topic linkages between science and technology with lexical- or topic-based approaches. However, because scholarly articles and patents differ in purpose, statement, and quality, the performance of these approaches is not satisfactory. To understand the difficulties of topic linkage and improve performance, a framework is proposed to detect the commonalities and specialties between scientific publications and patents from two perspectives: linguistic characteristics and thematic structures. Extensive experiments on the DrugBank dataset reveal five commonalities and five significant differences in terms of linguistic characteristics. For example, nouns are the most frequently used word class in both, and scientific publications contain more word tokens than patent documents, but patents usually have longer sentences and use more clauses. Meanwhile, common and special thematic structures are also uncovered between scientific publications and patents. Themes giving a general description of the pharmaceutical field are shared by the two heterogeneous resources. Scientific publications tend to explain disease mechanisms and medication content, while patents lean towards the preparation and practical application of drugs.
Source:
SCIENTOMETRICS
ISSN: 0138-9130
Year: 2021
Issue: 9
Volume: 126
Page: 7445-7475
Impact Factor: 3.900 (JCR@2022)
ESI Discipline: SOCIAL SCIENCES, GENERAL
ESI HC Threshold: 53
JCR Journal Grade: 2
Cited Count:
WoS CC Cited Count: 11
SCOPUS Cited Count: 21
ESI Highly Cited Papers on the List: 0
30 Days PV: 0