J4 ›› 2010, Vol. 48 ›› Issue (02): 277-283.

Previous Articles     Next Articles

A Novel Text Clustering Method Based on Ontology

ZHU Huifeng, ZUO Wanli, HE Fengling, PENG Tao, JI Wenyan   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China|Key Laboratory ofSymbol Computation and Knowledge Engineering of |Ministry of Education, Jilin University, |Changchun 130012, China
  • Received:2009-04-09 Online:2010-03-26 Published:2010-03-22
  • Contact: ZUO Wanli E-mail:wanly@mail.jlu.edu.cn

Abstract:

The text clustering method based on ontology applies WordNet and key concept set during text reprensentation, and the concept nodes and the semantic relations between the concepts in the ontology WordNet are used to reduce the number of features so as to improve clustering effect. And during text clustering, the algorithm uses the key concept set and the concept feature vector to calculate the similarity and uses key concept set to provide an explanation for every cluster of the result. The experimental results show that the method can effectively reduce the dimension number of the text feature vector and im
prove the text clustering effect compared with other text clustering algorithm and the novel method for text clustering can come up with a good explanation for the clusters.

Key words: ontology, WordNet, key concept set, concept feature vector

CLC Number: 

  • TP391