吉林大学学报(理学版) ›› 2021, Vol. 59 ›› Issue (6): 1517-1524.

• • 上一篇    下一篇

 基于生成对抗单分类网络的异常声音检测

薛英杰1, 韩威2, 周松斌2, 刘忆森2   

  1. 1. 昆明理工大学 信息工程与自动化学院, 昆明 650504;
    2. 广东省科学院智能制造研究所 广东省现代控制技术重点实验室, 广州 510070
  • 收稿日期:2021-02-13 出版日期:2021-11-26 发布日期:2021-11-26
  • 通讯作者: 周松斌 E-mail:Sb.zhou@giim.ac.cn

Abnormal Sound Detection Based on Generative Adversarial Single Classification Network

XUE Yingjie1, HAN Wei2, ZHOU Songbin2, LIU Yisen2   

  1. 1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China; 
    2. Institute of Intelligent Manufacturing, Guangdong Academy of Sciences, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510070, China
  • Received:2021-02-13 Online:2021-11-26 Published:2021-11-26

摘要: 针对正常和异常声音可能具有较大的相似性, 有时无法利用自编码器重构误差大小区分的问题, 提出一种生成对抗单分类网络方法进行异常声音检测, 通过多次训练, 该方法学习正常样本的分布特征. 在测试过程中, 测试正常样本能以极小的误差进行重构, 而异常样本重构效果较差, 在某些频率段会发生畸变, 从而给出判别分类结果. 实验采用UrbanSound8K公开数据集和实测电机声音数据集进行了测试, 获得该方法的准确率分别为86.3%和98.1%, 比卷积自动编码器等主要深度学习方法分别提高了5.0%和3.0%.

关键词: 自编码器, 生成对抗网络, 声音异常检测

Abstract: Aiming at the problem that normal and abnormal sounds might have great similarity and sometimes could not distinguish normal and abnormal sounds by the size of reconstruction error of autoencoder, we proposed a generative adversarial single classification network method for abnormal sound detection. Through multiple training, the method learned the distribution characteristics of normal samples. In the test process, the normal sample could be reconstructed with minimal error, while the abnormal sample had poor reconstruction effect, and distortion occurred in some frequency bands, so as to give the discriminant classification results. In the experiment, UrbanSound8K public data set and measured motor sound data set were used for testing, and the accuracy of this method is 86.3% and 98.1%, respectively, which is 5.0% and 3.0% higher than main deep learning methods such as convolutional autoencoder.

Key words:  , autoencoder, generative adversarial network, sound anomaly detection

中图分类号: 

  • TP391