• 综合
  • 标题
  • 关键词
  • 摘要
  • 学者
  • 期刊-刊名
  • 期刊-ISSN
  • 会议名称
搜索

作者:

Shi, YuLiang (Shi, YuLiang.) | Zhang, Ti (Zhang, Ti.)

收录:

CPCI-S

摘要:

In this article, an efficient and scalable distributed web crawler system based on Hadoop will be design and implement. In the paper, firstly the application of cloud computing in reptile field is introduced briefly, and then according to the current status of the crawler system, the specific use of Hadoop distributed and cloud computing features detailed design of a highly scalable crawler system, and finally the system Data statistics, under the same conditions, compared with the existing mature system, it is clear that the superiority of distributed web crawler. This advantage in the context of large data era of massive data is particularly important to climb.

关键词:

hadoop big data distributed crawler cloud computing

作者机构:

  • [ 1 ] [Shi, YuLiang]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China
  • [ 2 ] [Zhang, Ti]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China

通讯作者信息:

  • [Shi, YuLiang]BJUT, Sch Beijing Univ Technol, Beijing, Peoples R China

查看成果更多字段

相关关键词:

相关文章:

来源 :

2017 IEEE 2ND INTERNATIONAL CONFERENCE ON BIG DATA ANALYSIS (ICBDA)

年份: 2017

页码: 537-541

语种: 英文

被引次数:

WoS核心集被引频次: 2

SCOPUS被引频次:

ESI高被引论文在榜: 0 展开所有

万方被引频次:

中文被引频次:

近30日浏览量: 0

归属院系:

在线人数/总访问数:662/5059798
地址:北京工业大学图书馆(北京市朝阳区平乐园100号 邮编:100124) 联系我们:010-67392185
版权所有:北京工业大学图书馆 站点建设与维护:北京爱琴海乐之技术有限公司