2012, Vol. 42, Issue (05): 1331-1335.


Voice activity detection algorithm based on Mel frequency cepstrum coefficient (MFCC) similarity

WANG Hong-zhi, XU Yu-chao, LI Mei-jing   

  1. School of Computer Science and Engineering, Changchun University of Technology, Changchun 130012, China
  • Received: 2011-11-13 Online: 2012-09-01 Published: 2012-09-01

Abstract: To improve the accuracy of voice activity detection (VAD) in noisy environments, a VAD algorithm based on Mel frequency cepstrum coefficient (MFCC) similarity is proposed. First, the MFCCs of each frame of the voice signal are extracted. Then, the first ten frames are taken as the background noise. Finally, the MFCC correlation coefficient distance relative to this background noise is calculated, and the voice-activity parameters are detected from the resulting MFCC similarity curves. The experimental results show that the proposed algorithm is effective under both white noise and pink noise conditions and at low signal-to-noise ratios.
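
The similarity step described in the abstract can be illustrated with a minimal sketch. It assumes the per-frame MFCC vectors have already been extracted, uses the mean of the first ten frames as the noise template, and takes 1 minus the Pearson correlation coefficient as the distance. The abstract does not give the exact distance formula or decision rule, so the function names, the fixed `threshold`, and the averaging of the noise frames are illustrative assumptions, not the authors' method.

```python
import math


def pearson(a, b):
    """Pearson correlation coefficient between two equal-length vectors."""
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    cov = sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b))
    var_a = sum((x - mean_a) ** 2 for x in a)
    var_b = sum((y - mean_b) ** 2 for y in b)
    denom = math.sqrt(var_a * var_b)
    # A constant vector has no defined correlation; treat it as uncorrelated.
    return cov / denom if denom > 0 else 0.0


def correlation_distances(mfcc_frames, noise_frames=10):
    """Return 1 - r for every frame against the background-noise template.

    Assumption: the template is the element-wise mean of the first
    `noise_frames` MFCC vectors (the abstract only says these frames
    are "taken as the background noise").
    """
    head = mfcc_frames[:noise_frames]
    dims = len(head[0])
    template = [sum(f[d] for f in head) / len(head) for d in range(dims)]
    return [1.0 - pearson(f, template) for f in mfcc_frames]


def detect_speech(mfcc_frames, noise_frames=10, threshold=0.5):
    """Flag frames whose distance from the noise template exceeds threshold.

    Assumption: a fixed threshold stands in for whatever decision rule
    the paper applies to its similarity curve.
    """
    distances = correlation_distances(mfcc_frames, noise_frames)
    return [d > threshold for d in distances]
```

Calling `detect_speech(frames)` on a list of MFCC vectors returns one boolean per frame; runs of `True` mark candidate speech segments, and the first and last index of each run would serve as its endpoints.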

Key words: communication, voice activity detection, Mel frequency cepstrum coefficient (MFCC), correlation coefficient

CLC Number: TN912