吉林大学学报(理学版)

• 计算机科学 • 上一篇    下一篇

基于特征选择和聚类的分类算法

郭凯文, 潘宏亮, 侯阿临   

  1. 长春工业大学 计算机科学与工程学院, 长春 130012
  • 收稿日期:2016-09-21 出版日期:2018-03-26 发布日期:2018-03-27
  • 通讯作者: 侯阿临 E-mail:alinhou@163.com

Classification Algorithm Based on Feature Selection and Clustering

GUO Kaiwen, PAN Hongliang, HOU Alin   

  1. College of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
  • Received:2016-09-21 Online:2018-03-26 Published:2018-03-27
  • Contact: HOU Alin E-mail:alinhou@163.com

摘要: 针对目前特征选择算法应用于数据分类精度不理想的问题, 提出一种基于最大相关最小冗余的特征选择算法, 该算法结合特征选择算法和聚类分析算法对特征进行处理, 将分类中冗余的特征去除. 利用支持向量机对一组心脏病患者实际测量得到的数据进行分类实验, 实验结果表明, 该方法可有效筛选影响分类的特征, 进而提高分类准确率.

关键词: 特征选择, 分类, 支持向量机, 聚类

Abstract: Aiming at the problem that the current feature selection algorithm was applied to the data classification accuracy was not ideal, we proposed a feature selection algorithm based on maximum correlation and minimum redundancy. The algorithm combined feature selection algorithm and clustering analysis algorithm to process the feature, and eliminated redundant features in the classification. We used support vector machine (SVM) to classify the data obtained from a group of patients with heart disease. The experimental results show that this method can effectively screen the features that affect classification, and then improve the classification accuracy.

Key words: clustering, classification, support vector machine, feature selection

中图分类号: 

  • TP391