Journal of Jilin University Science Edition

Previous Articles     Next Articles

Feature Selection Algorithm Based on ComprehensiveMeasurement for Text Categorization

YANG Jieming1,2, LIU Yuanning2, QU Zhaoyang1, LIU Zhiying1   

  1. 1. School of Information Engineering, Northeast Dianli University, Jilin 132012, Jilin Province, China;2. College of Computer Science and Technology, Jilin University, Changchun 130012, China
  • Received:2012-09-07 Online:2013-09-26 Published:2013-09-17
  • Contact: YANG Jieming E-mail:yjmlzy@gmail.com

Abstract:

In view of the disadvantages of traditional feature selection algorithm, we proposed a new feature selection algorithm, which simultaneously measures the importance of one feature both in intracategory and intercategory. The proposed algorithm was compared with five feature selection algorithms via two classification algorithms on three benchmark document collections. The experimental results show the proposed method can reduce the dimensionality
 of the text representation and significantly improve the performance of the text categorization.

Key words: feature selection, text categorization, dimensionality reduction

CLC Number: 

  • TP301.6