Abstract:
Because the amount of information available on the Web is effectively unlimited, navigating the ever-growing Internet is a burdensome task for users. Web search engines have therefore become necessary tools that support information searching and retrieval. However, building an efficient system to search the highly dynamic World Wide Web (WWW) is a challenge. The aim of this paper is to develop a high-performance Web search system. We propose the architecture of a distributed, multi-threaded search system and implement it in Java on the IBM ABLE platform. Drawing on techniques from crawling, XML, and databases, the paper places special emphasis on the design and implementation of the Web crawling model and the page extraction model.
Source:
PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON INNOVATION & MANAGEMENT, VOLS I AND II
Year: 2007
Page: 1837-1842
Language: English
WoS CC Cited Count: 0
ESI Highly Cited Papers on the List: 0