J4 ›› 2011, Vol. 49 ›› Issue (03): 487-492.

Previous Articles     Next Articles

Classification of Deep Web Based on Model Matching

GUO Dongwei, LI Sanyi, ZHANG Zhongming, LIU Miao   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2010-01-24 Online:2011-05-26 Published:2011-06-15
  • Contact: ZHANG Zhongming E-mail:zhangzm@jlu.edu.cn

Abstract:

The present paper presents a new method of information extraction from the Deep Web based on model matching. It extracts the characteristic vector of the Deep Web query interface by means of analysising the depth of feature of web page structure automatically. The frequency and concentration rate are both considered when the weight in vector space model is defined. The characteristic word vector is used to construct the database query interface with the number of characteristic word taken into account. At last, model matching is used to classify different databases. This method is validated by experiment results.

Key words: Deep Web, data integration, model matching

CLC Number: 

  • TP391.1