›› 2012, Vol. 42 ›› Issue (05): 1331-1335.

• 论文 • 上一篇    下一篇

基于Mel频率倒谱参数相似度的语音端点检测算法

王宏志, 徐玉超, 李美静   

  1. 长春工业大学 计算机科学与工程学院,长春 130012
  • 收稿日期:2011-11-13 出版日期:2012-09-01 发布日期:2012-09-01
  • 基金资助:
    国家自然科学基金项目(11071026);教育部春晖计划项目(403-004077003);吉林省教育厅科学计划项目(2010087).

Voice activity detection algorithm based on Mel frequency cepstrum coefficient(MFCC) similarity

WANG Hong-zhi, XU Yu-chao, LI Mei-jing   

  1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
  • Received:2011-11-13 Online:2012-09-01 Published:2012-09-01

摘要: 为提高噪声环境下语音端点检测的精确性,提出了一种基于Mel频率倒谱参数(MFCC)相似度的端点检测方法。提取了每帧语音信号的Mel频率倒谱参数,然后将前十帧作为背景噪声,计算测试帧和背景噪声的MFCC相关系数距离,最后用得到的MFCC相似度距离曲线进行端点检测。实验结果表明,该方法在白噪声和粉噪声环境下均可得到理想的端点检测效果,并且在低信噪比时仍然有效。

关键词: 通信技术, 端点检测, Mel频率倒谱参数, 相关系数

Abstract: To improve the accuracy of Voice Activity Detection (VAD) under noisy Environment, a voice activity detection algorithm based on Mel Frequency Cepstrum Coefficient (MFCC) similarity is proposed. First, the MFCC from each frame of the voice signals is extracted. Then, the first ten frames are taken as the background noises. Finally, by calculating the MFCC correlation coefficient distance under the above noisy condition, the voice-activity parameters are detected from the MFCC similarity curves. The experiment results show that the proposed algorithm is effective under both white noise and pink noise conditions and at low signal to noise ratio.

Key words: communication, voice activity detection, Mel frequency cepstrum coefficient (MFCC), correlation coefficient

中图分类号: 

  • TN912
[1] 赵彦平,赵晓晖. 用于语音端点检测的鲁棒性特征提取新方法[J].吉林大学学报:工学版,2006,36(1):77-81. Zhao Yan-ping, Zhao Xiao-hui. New robust feature extraction method for speech endpoint detection[J]. Journal of Jilin University (Engineering and Technology Edition), 2006, 36(1):77-81.
[2] 刘伯森,卢志茂.基于希尔伯特-黄变换的低信噪比语音端点检测[J].吉林大学学报:工学版,2011,41(3):844-848. Liu Bo-sen, Lu Zhi-mao. Voice activity detection with low signal-to-noise ratio based on Hilbert-Huang transform[J].Journal of Jilin University(Engineering and Technology Edition), 2011, 41(3):844-848.
[3] 朱晓晶,侯旭初. 基于LPCC和能量熵的端点检测[J].电讯技术,2010,50(6):41-45. Zhu Xiao-jing, Hou Xu-chu. Voice activity detection based on LPCC and spectrum entropy[J]. Telecommunication Engineering, 2010, 50(6):41-45.
[4] 王纲金,赵欢. 基于小波变换C0复杂度的语音端点检测方法[J]. 计算机工程与应用,2010,46(29):134-136. Wang Gang-jin, Zhao Huan. Voice activity detection based on wavelet transform C0 complexity[J]. Computer Engineering and Applications, 2010, 46(29):134-136.
[5] Koichi Yamamoto, Firas Jabloun, Klaus Reinhard,et al. Robust endpoint detection for speech recognition based on discriminative feature extraction//IEEE International Conference on Acoustics, Speech and Signal Processing, Toulouse, France, 2006.
[6] Lu X, Unoki M, Isotani R,et al. Voice activity detection in a regularized reproducing kernel Hilbert space//INTERSPEECH, Makuhari, Japan, 2010.
[7] Chang J K, Kim N S, Mitra S K. Voice activity detection based on multiple statistical models[J]. IEEE Trans Signal Process, 2006, 54(6):1965-1976.
[8] Hyeopwoo Lee, Dongsuk Yook. Space-time voice activity detection[J]. IEEE Trans Consumer Electronics, 2009,55(3):1471-1476.
[9] Li K,Swamy M N S, Ahmad M O. An improved voice activity detection using higher order statistics[J]. IEEE Trans Speech and Audio Processing, 2005, 13(5): 965-974.
[10] Cho Namgook, Kim Eun-Kyoung. Enhanced voice activity detection using acoustic event detection and classification[J]. IEEE Trans Consumer Electronics, 2011, 57(1):196-202.
[1] 周彦果,张海林,陈瑞瑞,周韬. 协作网络中采用双层博弈的资源分配方案[J]. 吉林大学学报(工学版), 2018, 48(6): 1879-1886.
[2] 孙晓颖, 扈泽正, 杨锦鹏. 基于分层贝叶斯网络的车辆发动机系统电磁脉冲敏感度评估[J]. 吉林大学学报(工学版), 2018, 48(4): 1254-1264.
[3] 董颖, 崔梦瑶, 吴昊, 王雨后. 基于能量预测的分簇可充电无线传感器网络充电调度[J]. 吉林大学学报(工学版), 2018, 48(4): 1265-1273.
[4] 牟宗磊, 宋萍, 翟亚宇, 陈晓笑. 分布式测试系统同步触发脉冲传输时延的高精度测量方法[J]. 吉林大学学报(工学版), 2018, 48(4): 1274-1281.
[5] 丁宁, 常玉春, 赵健博, 王超, 杨小天. 基于USB 3.0的高速CMOS图像传感器数据采集系统[J]. 吉林大学学报(工学版), 2018, 48(4): 1298-1304.
[6] 陈瑞瑞, 张海林. 三维毫米波通信系统的性能分析[J]. 吉林大学学报(工学版), 2018, 48(2): 605-609.
[7] 张超逸, 李金海, 阎跃鹏. 双门限唐检测改进算法[J]. 吉林大学学报(工学版), 2018, 48(2): 610-617.
[8] 关济实, 石要武, 邱建文, 单泽彪, 史红伟. α稳定分布特征指数估计算法[J]. 吉林大学学报(工学版), 2018, 48(2): 618-624.
[9] 李炜, 李亚洁. 基于离散事件触发通信机制的非均匀传输网络化控制系统故障调节与通信满意协同设计[J]. 吉林大学学报(工学版), 2018, 48(1): 245-258.
[10] 孙晓颖, 王震, 杨锦鹏, 扈泽正, 陈建. 基于贝叶斯网络的电子节气门电磁敏感度评估[J]. 吉林大学学报(工学版), 2018, 48(1): 281-289.
[11] 武伟, 王世刚, 赵岩, 韦健, 钟诚. 蜂窝式立体元图像阵列的生成[J]. 吉林大学学报(工学版), 2018, 48(1): 290-294.
[12] 袁建国, 张锡若, 邱飘玉, 王永, 庞宇, 林金朝. OFDM系统中利用循环前缀的非迭代相位噪声抑制算法[J]. 吉林大学学报(工学版), 2018, 48(1): 295-300.
[13] 王金鹏, 曹帆, 贺晓阳, 邹念育. 基于多址干扰和蜂窝间互扰分布的多载波系统联合接收方法[J]. 吉林大学学报(工学版), 2018, 48(1): 301-305.
[14] 石文孝, 孙浩然, 王少博. 无线Mesh网络信道分配与路由度量联合优化算法[J]. 吉林大学学报(工学版), 2017, 47(6): 1918-1925.
[15] 姜来为, 沙学军, 吴宣利, 张乃通. LTE-A异构网络中新的用户选择接入和资源分配联合方法[J]. 吉林大学学报(工学版), 2017, 47(6): 1926-1932.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!