基于生成对抗单分类网络的异常声音检测

吉林大学学报(理学版) ›› 2021, Vol. 59 ›› Issue (6): 1517-1524.

基于生成对抗单分类网络的异常声音检测

薛英杰¹, 韩威², 周松斌², 刘忆森²

1. 昆明理工大学信息工程与自动化学院, 昆明 650504；
2. 广东省科学院智能制造研究所广东省现代控制技术重点实验室, 广州 510070

收稿日期:2021-02-13 出版日期:2021-11-26 发布日期:2021-11-26
通讯作者: 周松斌 E-mail:Sb.zhou@giim.ac.cn

Abnormal Sound Detection Based on Generative Adversarial Single Classification Network

XUE Yingjie¹, HAN Wei², ZHOU Songbin², LIU Yisen²

1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China;
2. Institute of Intelligent Manufacturing, Guangdong Academy of Sciences, Guangdong Key Laboratory of Modern Control Technology, Guangzhou 510070, China

Received:2021-02-13 Online:2021-11-26 Published:2021-11-26

摘要/Abstract

摘要： 针对正常和异常声音可能具有较大的相似性, 有时无法利用自编码器重构误差大小区分的问题, 提出一种生成对抗单分类网络方法进行异常声音检测, 通过多次训练, 该方法学习正常样本的分布特征. 在测试过程中, 测试正常样本能以极小的误差进行重构, 而异常样本重构效果较差, 在某些频率段会发生畸变, 从而给出判别分类结果. 实验采用UrbanSound8K公开数据集和实测电机声音数据集进行了测试, 获得该方法的准确率分别为86.3%和98.1%, 比卷积自动编码器等主要深度学习方法分别提高了5.0%和3.0%.

关键词: 自编码器, 生成对抗网络, 声音异常检测

Abstract: Aiming at the problem that normal and abnormal sounds might have great similarity and sometimes could not distinguish normal and abnormal sounds by the size of reconstruction error of autoencoder, we proposed a generative adversarial single classification network method for abnormal sound detection. Through multiple training, the method learned the distribution characteristics of normal samples. In the test process, the normal sample could be reconstructed with minimal error, while the abnormal sample had poor reconstruction effect, and distortion occurred in some frequency bands, so as to give the discriminant classification results. In the experiment, UrbanSound8K public data set and measured motor sound data set were used for testing, and the accuracy of this method is 86.3% and 98.1%, respectively, which is 5.0% and 3.0% higher than main deep learning methods such as convolutional autoencoder.

Key words: , autoencoder, generative adversarial network, sound anomaly detection

中图分类号:

TP391

薛英杰, 韩威, 周松斌, 刘忆森. 基于生成对抗单分类网络的异常声音检测[J]. 吉林大学学报(理学版), 2021, 59(6): 1517-1524.

XUE Yingjie, HAN Wei, ZHOU Songbin, LIU Yisen. Abnormal Sound Detection Based on Generative Adversarial Single Classification Network[J]. Journal of Jilin University Science Edition, 2021, 59(6): 1517-1524.

[1]	朱海琦, 李宏, 李定文, 李富. 基于生成对抗网络的单图像超分辨率重建[J]. 吉林大学学报(理学版), 2021, 59(6): 1491-1498.
[2]	李二强, 陈凯健, 周漾. 可控多重纹理扩展合成与迁移[J]. 吉林大学学报(理学版), 2021, 59(3): 577-586.