英文标题
Research on Web information extraction technology based on coal safety incidents
英文摘要
In this paper,by taking into account characteristics of safety issue information in coal mining,we propose a Web information retrieval strategy based on coal mining safety issues?Our method includes cleaning webpage,generating extraction rule,and retrieving Web informaG tion? Webpage cleaning,as a pretreatment,is used for useless information such as HTML laG bels,scripts,and css parts in webpages?Extraction rule generation is used for finding the exact positions of the information in the webpage,which is based on the characteristics of the informaG tion?Moreover,experimental results in this paper demonstrate the possibility of Web informaG tion retrieval based on coal mining safety issues,and the experimental results further show the accuracy and efficiency of our proposed information retrieval method,in particular for small and middle scale information retrieval tasks.
翻译关键词
coal mining safety issues
web information retrieval
extracG tion rule
webpage cleaning
获取号
WF:perioarticalzgmt201410020