吉林大学学报(理学版)

• 计算机科学 • 上一篇    下一篇

一种基于聚类和AdaBoost的自适应集成算法

王玲娣, 徐华   

  1. 江南大学 化工物联网工程学院, 江苏 无锡 214122
  • 收稿日期:2017-05-12 出版日期:2018-07-26 发布日期:2018-07-31
  • 通讯作者: 徐华 E-mail:joanxh2003@163.com

An Adaptive Ensemble Algorithm Based on Clustering and AdaBoost

WANG Lingdi, XU Hua   

  1. School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, Jiangsu Province, China
  • Received:2017-05-12 Online:2018-07-26 Published:2018-07-31
  • Contact: XU Hua E-mail:joanxh2003@163.com

摘要: 为同时保证基分类器的准确性和差异性, 提出一种基于聚类和AdaBoost的自适应集成算法. 首先利用聚类算法将训练样本分成多个类簇; 然后分别在每个类簇上进行AdaBoost训练并得到一组分类器; 最后按加权投票策略进行分类器的集成. 每个分类器的权重是自适应的, 且为基于测试样本与每个类簇的相似性及分类器对此测试样本的分类置信度计算得到. 实验结果表明, 与AdaBoost,Bagging(bootstrap aggregating)和随机森林等代表性集成算法相比, 该算法可取得更高的分类精度.

关键词: 自适应权重, 聚类, 集成学习, AdaBoost算法

Abstract: In order to ensure the accuracy and diversity of the base classifier at the same time, we proposed an adaptive ensemble algorithm based on clustering and AdaBoost. Firstly, the training samples were divided into multiple clusters by clustering algorithm. Secondly, AdaBoost training was performed on each cluster to get a set of classifiers. Finally, these classifiers were combined according to the weighted voting strategy. The weights of each classifier were adaptive, we calculated the similarity between the test samples and each cluster and got the test samples’ classification confidence given by classifiers. The experimental results show that the algorithm can achieve a higher classification accuracy compared with representative ensemble algorithms such as AdaBoost, Bagging (bootstrap aggregating) and Random Forest.

Key words: clustering, adaptive weight, AdaBoost algorithm, ensemble learning

中图分类号: 

  • TP181