Authors:

Dong Chen | Fang Liying | Yan Jianzhuo | Shi Bin

Indexed in:

CPCI-S

Abstract:

A semantic focused crawler is an important part of a semantic vertical search engine. It is receiving increasing attention as a well-founded alternative to searching the entire web when the task is to locate topical resources. To retrieve documents related to a given topic, this paper proposes the QBLP algorithm, which enables the crawler to adapt to a changing environment. This adaptivity makes it possible to change the behavior of the focused crawler during the search according to the particular environment and its relationship to the given input parameters. QBLP combines Q-learning, which features lifelong learning and delayed reward, with a Bayes classifier. It enables the crawler to accumulate experience while crawling and to adjust its strategy, with the goal of making the best decision under any circumstances. We compare QBLP with Best-First and Breadth-First crawling. According to the experimental statistics, QBLP achieves higher precision than the other two in long-running crawls.
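
The abstract does not give QBLP's formulas, so the following is only a minimal sketch of the general idea it describes: a standard tabular Q-learning update whose reward stands in for a Bayes classifier's relevance score, with learned Q-values used to order the crawl frontier. The state and action labels, the `ALPHA` and `GAMMA` values, the helper names, and the example URLs are all assumptions made for illustration and are not taken from the paper.

```python
from collections import defaultdict
import heapq

ALPHA = 0.5   # learning rate (assumed value, not from the paper)
GAMMA = 0.8   # discount factor for delayed reward (assumed value)

def q_update(q, state, action, reward, next_state, next_actions):
    """Apply one standard tabular Q-learning update and return the new value."""
    # Best value reachable from the next state; 0.0 if it has no outgoing actions.
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    old = q[(state, action)]
    q[(state, action)] = old + ALPHA * (reward + GAMMA * best_next - old)
    return q[(state, action)]

def pop_best_link(frontier):
    """Pop the URL with the highest Q-value from the frontier (a min-heap of negated values)."""
    neg_q, url = heapq.heappop(frontier)
    return url, -neg_q

q = defaultdict(float)
# Hypothetical labels: a state is a coarse page type, an action is a link type.
# Each reward stands in for a Bayes classifier's relevance probability of the fetched page.
q_update(q, "hub", "topic_link", reward=0.9,
         next_state="topic_page", next_actions=["topic_link"])
q_update(q, "hub", "ad_link", reward=0.1,
         next_state="off_topic", next_actions=[])

# Order the frontier by learned Q-value so the most promising link is fetched first.
frontier = []
heapq.heappush(frontier, (-q[("hub", "topic_link")], "http://example.org/a"))
heapq.heappush(frontier, (-q[("hub", "ad_link")], "http://example.org/b"))
best_url, best_q = pop_best_link(frontier)
```

In this sketch, links that have led to relevant pages accumulate higher Q-values over time, which is one plausible reading of the abstract's claim that the crawler accumulates experience and adjusts its strategy during long crawls.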

Keywords:

Bayes classifier; focused crawler; Q-learning; Semantic Web

Author affiliations:

  • [ 1 ] [Dong Chen] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 2 ] [Fang Liying] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 3 ] [Yan Jianzhuo] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing, Peoples R China
  • [ 4 ] [Shi Bin] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing, Peoples R China

Corresponding author:

  • [Dong Chen] Beijing Univ Technol, Coll Elect Informat & Control Engn, Beijing, Peoples R China

Source:

PROCEEDINGS OF 2010 3RD IEEE INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND INFORMATION TECHNOLOGY (ICCSIT 2010), VOL 8

ISSN: 2381-3458

Year: 2010

Pages: 420-423

Language: English

Citation counts:

WoS Core Collection citations: 1

ESI Highly Cited Papers listings: 0
