Journal of Jilin University Science Edition

Previous Articles     Next Articles

Centroid Classifier Based on Empirical Risk for Text Categorization

ZHOU Xiaotang, OUYANG Jihong, LI Ximing   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2012-12-19 Online:2013-09-26 Published:2013-09-17
  • Contact: OUYANG Jihong E-mail:ouyangjihong@yahoo.com.cn

Abstract:

Empirical risk minimization inductive principle and gradient descent method were used to fix classcentroidvectors in traditional centroidbased text classification algorithms so as to improve the poor expression ability of classcentroidvectors in traditional centroidbased text classification algorithm caused by ignoring the weighting factors of training texts. Then, an improved centroidbased text classification algorithm was obtained, the performance of which is as well as those of support vector machines. Experimental results show that the method adopted in this article is an effective mean to improve the performance of traditional centroidbased text classification algorithms.

Key words: text classification, centroidbased text classification algorithms, empirical risk minimization

CLC Number: 

  • TP391.1