J4

Previous Articles     Next Articles

Implementation of a Web Page Indexer JU_Indexer with High Performance

HE Feng-ling, ZUO Wan-li, ZHANG Xue-song   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China; Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University, Changchun 130012, China
  • Received:2005-03-24 Revised:1900-01-01 Online:2006-01-26 Published:2006-01-26
  • Contact: ZUO Wan-li

Abstract: JU_Indexer, a web page indexer with high performance, uses the inverted index table to store index data for web pages, creates index data by multiple threads concurrently, utilizes the optimized search algorithms to implement the queries posed by users. First, the system architecture of JU_Indexer is given. Then, the data structure used to store the index data in JU_Indexer, the algorithm employed to create web page index, and the strategy utilized to complete the queries proposed by users are described in great detail. The experiment result demonstrates that the queries proposed by users can be completed by JU_Indexer within milliseconds. The optimized phrase search algorithm emplo yed by JU_Indexer shortens the time to complete the phrase querying.

Key words: web pages, Internet search engine, web page indexer

CLC Number: 

  • TP393.09