吉林大学学报(信息科学版) ›› 2021, Vol. 39 ›› Issue (2): 205-214.

• • 上一篇    下一篇

基于 Shapelets 的多变量 D-S 证据加权集成分类

宋奎勇1,2, 王念滨1, 王红滨1   

  1. 1. 哈尔滨工程大学 计算机科学与技术学院, 哈尔滨 150000; 2. 呼伦贝尔职业技术学院 信息工程系, 内蒙古 呼伦贝尔 021000
  • 收稿日期:2020-09-19 出版日期:2021-04-19 发布日期:2021-04-28
  • 作者简介:宋奎勇(1979— ), 男, 内蒙古赤峰人, 哈尔滨工程大学副教授, 主要从事信息融合和深度学习研究, ( Tel) 86-17745169546 (E-mail)songkuiyong@hrbeu.edu.cn; 王念滨(1967— ),男,哈尔滨人, 哈尔滨工程大学教授, 博士生导师, 主要从事数据处理和数据分析研究,(Tel)86-13904807219(E-mail)wangnianbin@hrbeu.edu.cn
  • 基金资助:
    国家自然科学基金资助项目(61772152); 国家重点研发计划基金资助项目(2018YFC0806800); 技术基础科研基金资助项目(JSQB2017206C002); 中国博士后科学基金资助项目 (2019M651262 ); 教育部人文社科研究青年基金资助项目(20YJCZH172); 黑龙江省博士后基金会基金资助项目(LBH-Z19015)

Multivariate D-S Evidence Weighted Ensemble Classification Based on Shapelets

SONG Kuiyong1,2, WANG Nianbin1, WANG Hongbin1   

  1. 1. College of Computer Science and Technology, Harbin Engineering University, Harbin 150000, China;2. Department of Information Engineering, Hulunbuir Vocational Technical College, Hulun Buir 021000, China
  • Received:2020-09-19 Online:2021-04-19 Published:2021-04-28

摘要: 集成学习是分类多变量时间序列的有效方法。 然而集成学习对基分类器性能要求较高, 基分类器组合算法优劣对分类效果影响较大。 为此, 提出一种基于 Shapelets 的多变量 D-S(Dempster/ Shafer)证据加权集成分类方法。 首先, 在单变量时间序列上学习得到基分类器 Shapelets, 基分类器的分类准确率确定为其在多分类器的权重。 Shapelets 是时间序列的子序列, 不同变量 Shapelets 间不存在依赖关系, 且单个 Shapelets 分类准确度较高, 能得到“好而不同冶的基分类器。 然后, 提出一种加权概率指派算法, 增加分类准确率高的基分类器权重,减少分类准确率低的基分类器权重; 添加了 2 个组合策略, 即消除证据冲突, 又提高了效率。 在标准数据集上与多个最新算法进行比较, 笔者算法取得了较好的分类结果。

关键词: Shapelets 分类, 多变量时间序列, 集成学习, D-S 证据理论

Abstract: Ensemble learning is an effective method to classify multivariate time series. However, ensemble learning requires higher performance of the base classifier, and the combination of base classifier algorithms has a greater impact on the classification effect. This paper proposes a multivariate D-S(Dempster/ Shafer) evidence weighted ensemble classification method based on shapelets. First, learning the base classifier Shapelets on the univariate time series, the classification accuracy of the base classifier is determined as its weight in the multi-classifier. Shapelets are sub-sequences of time series. There is no dependency between different variable Shapelets, and the classification accuracy of a single Shapelets is high, and a “good but different” base classifier can be obtained. Then, a weighted probability assignment algorithm is proposed to increase the weight of the base classifier with high classification accuracy and reduce the weight of the base classifier with low classification accuracy. Two combination strategies are added to eliminate evidence conflicts and improve efficiency. Compared to some state-of-the-art algorithms on the standard dataset, our algorithm can obtain a better classification result.

Key words: shapelets, multivariate time series, ensemble learning, dempster/ shafer(D-S) evidence theory

中图分类号: 

  • TP301. 6