J4
• 计算机科学 • Previous Articles Next Articles
LI Xiaoya, HE Fengling, ZUO Wanli
Received:
Revised:
Online:
Published:
Contact:
Abstract: In the light of result returned currently by generalpurpose search engines being excessive, and having no strong similarity with the topic, this paper covers a technique of dividing the web page to chunks to implement a focused crawler. With this method, Crawler1, a prototype of a focused crawler has been realized. Experimental results indicate that Crawler1 has better performance. The number of topic web pages crawled by Crawler1 attains more than 55%.
Key words: topicspecific search, focused crawling, relevance analysis, page segmentation
CLC Number:
LI Xiaoya, HE Fengling, ZUO Wanli. Realization of Focused Crawler Based on Page Segmentation[J].J4, 2007, 45(06): 959-965.
0 / / Recommend
Add to citation manager EndNote|Reference Manager|ProCite|BibTeX|RefWorks
URL: http://xuebao.jlu.edu.cn/lxb/EN/
http://xuebao.jlu.edu.cn/lxb/EN/Y2007/V45/I06/959
Cited