Research and Exploration on the Construction Method of Knowledge Graph of Water Field Based on Text - Details

Author：

Yan, Jianzhuo (Yan, Jianzhuo.) | Gao, Kaili (Gao, Kaili.)

Indexed by：

Abstract：

With　the　development　of　the　Internet　age,　collected　data　have　become　an　important　source　of　knowledge.　The　field　of　unstructured　text　contains　many　named　entities,　but　includes　very　little　detailed　information　about　those　entities.　However,　the　Baidu　encyclopedia　website　is　a　type　of　semistructured　data　that　in　many　cases　includes　a　detailed　introduction　of　entities.　By　combining　the　advantages　of　these　two　kinds　of　data,　we　can　enrich　the　knowledge　base　of　a　knowledge　graph.　This　paper　aims　to　extract　semistructured　data　consisting　of　named　entities　starting　from　raw　text　data.　On　one　hand,　this　paper　extracts　named　entities　with　the　help　of　the　Harbin　Institute　of　Technology　model,　parses　semistructured　content　about　the　named　entities　using　the　Octopus　tool,　constructs　a　local　ontology,　and　merges　the　ontology　using　Python＇s　built-in　difflib.　SequenceMatcher　tool　and　the　Deckard　similarity　algorithm.　On　the　other　hand,　we　create　an　XPath-based　wrapper　to　extract　the　attributes　and　attribute　values　of　named　entities　from　semistructured　data.　The　experimental　results　show　that　this　approach　can　extract　information　related　to　named　entities　from　the　Baidu　encyclopedia　automatically　to　supplement　the　knowledge　base　of　a　water　domain　knowledge　graph.　This　article　can　also　serve　as　a　reference　for　constructing　domain　knowledge　graphs　in　other　fields.　©　2019　IEEE.

Keyword：

Knowledge representation Knowledge based systems Computer aided instruction Data mining Ontology Information use

Author Community：

[ 1 ] [Yan, Jianzhuo]Beijing University of Technology, Engineering Research Center of Digital Community, Ministry of Education, Faculty of Information Technology, China
[ 2 ] [Gao, Kaili]Beijing University of Technology, Engineering Research Center of Digital Community, Ministry of Education, Faculty of Information Technology, China

Reprint Author's Address：

Email：

Show more details

Related Keywords：

Research of data mining system
2005，Journal of Beijing University of Technology
Constraints-based semantic mapping method from natural language questions to OWL
2007，Acta Electronica Sinica
Research on knowledge representation based on ontology for warning system and its application
2008，2008 International Conference on Wireless Communications, Networking and Mobile Computing, WiCOM 2008
Building the knowledge base to support the automatic animation generation of Chinese traditional architecture
2010，

Source ：

Year： 2019

Page： 71-77

Language： English

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 2

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 2

Affiliated Colleges：

信息学部

Get Fulltext

DOI Library Discovery Baidu Scholar Search Engineering Village

Type
Departments

All Years Choose Year From to