An implementation of ABLE-based distributed Web search techniques

Author：

Zhai Dongsheng (翟东升) | Li Li | Liu Zhe

Indexed by：

CPCI-S CPCI-SSH

Abstract：

Due to the unlimited amount of information available on the Web, it is a burdensome task for users to navigate the ever-increasing Internet. Therefore, Web search engines have become necessary tools, supporting information searching and retrieval. However, it is a challenge to build an efficient system to search the highly dynamic World Wide Web (WWW). The aim of this paper is to develop a high-performance Web search system. In this paper, we propose the architecture of a distributed and multi-threaded search system, and develop this system in Java on the IBM ABLE platform. With the assistance of various techniques concerning crawling, XML, and databases, the paper places special emphasis on the design and implementation of the Web crawling model and the page extraction model.

Keyword：

page extraction; Web crawler; ABLE; search engine

Author Community：

[ 1 ] [Zhai Dongsheng]Beijing Univ Technol, Econ & Management Sch, Dept Management Sci & Engn, Beijing 100022, Peoples R China
[ 2 ] [Li Li]Beijing Univ Technol, Econ & Management Sch, Dept Management Sci & Engn, Beijing 100022, Peoples R China
[ 3 ] [Liu Zhe]Beijing Univ Technol, Econ & Management Sch, Dept Management Sci & Engn, Beijing 100022, Peoples R China

Related Keywords：

Design and Implementation of a Topic-Focused Search Engine based on Multi-Agent System
2008, IEEE International Conference on Service Operations and Logistics and Informatics
Image Semantic Search Engine
2009, 1st International Workshop on Database Technology and Applications
Design and implementation of a scalable distributed web crawler based on Hadoop
2017, 2nd IEEE International Conference on Big Data Analysis, ICBDA 2017
Research on Semi-Automatic Domain Ontology Construction Framework Based on Web Crawler
2013, International Conference on Computer, Networks and Communication Engineering (ICCNCE)

Source ：

PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INNOVATION & MANAGEMENT, VOLS I AND II

Year： 2007

Page： 1837-1842

Language： English

Cited Count：

WoS CC Cited Count： 0

ESI Highly Cited Papers on the List： 0

Affiliated Colleges：

School of Economics and Management
