J4

• 数学 • Previous Articles     Next Articles

Text Automatic Classification Based on Multiple HypothesisTesting in the Mayor’s Public Access Line Project

HAO Lizhu, ZHAO Shishun, HAO Lili   

  1. Institute of Mathematics, Jilin University, Changchun 130012, China
  • Received:2008-01-23 Revised:1900-01-01 Online:2008-11-26 Published:2008-11-26
  • Contact: ZHAO Shishun

Abstract: On the basis of multiple hypothesis testing, we proposed a feature weighted naive Bayesian algorithm, which outputs many sets of feature words by means of feature selection, and assigns a coefficient to each set of feature words which is used to construct the classifier in terms of the error rate of multiple hypothesis testing. This algorithm was used in the text classification of the mayor’s public access line project, where we realized the automatic classification of complaint texts by constructing three feature weighted naive Bayesian classifiers. Compared with those of the traditional methods, the efficiency and accuracy of our classifier are higher.

Key words: multiple hypothesis testing, text classification, fea ture weighted, the mayor’s public access line project

CLC Number: 

  • O235