J4 ›› 2009, Vol. 47 ›› Issue (6): 1260-1263.

• 计算机 • 上一篇    下一篇

使用机器学习对汉语评论进行情感分类

白鸽1, 左万利1, 赵乾坤1| 曲仁镜2   

  1. 1. 吉林大学 计算机科学与技术学院, 长春 130012|2. 中国网络通信有限公司 长春分公司| 长春 |130022
  • 收稿日期:2008-12-15 出版日期:2009-11-26 发布日期:2010-01-07
  • 通讯作者: 左万利 E-mail:wanli@mail.jlu.edu.cn.

Sentiment Classification for Chinese Reviews withMachine Learning

BAI Ge1, ZUO Wanli1, ZHAO Qiankun1, QU Ren jing2   

  1. 1. College of Computer Science and Technology, Jilin University, Changchun 130012, China;2. Changchun Filiale, China Netcom Corp, Changchun 130022, China
  • Received:2008-12-15 Online:2009-11-26 Published:2010-01-07
  • Contact: ZUO Wanli E-mail:wanli@mail.jlu.edu.cn.

摘要:

针对汉语评论的多种特征使用机器学习方法(如贝叶斯、 最大熵和支持向量机), 解决了汉语评论的情感分类问题. 实验结果表明, 机器学习方法对汉语评论的分类效果较好, 支持向量机的表现最好. 句子级别和评论级别的准确率分别达到88.26%和91.79%.

关键词: 情感分类, 贝叶斯分类器, 最大熵, 支持向量机

Abstract:

We solved the Chinese review sentiment classification problem via describing and evaluating several machine learning approaches (Nave Bayes, maximum entropy and support vector machines) on some features of the Chinese reviews. The experiment shows the three machine learning methods perform well especially support vector machines, and the accuracy of sentence level is up to 88.26%, the accuracy of review level is up to 91.79%.

Key words: sentiment classification, Nave Bayes, maximum entropy, support vector machines

中图分类号: 

  • TP391.12