吉林大学学报(工学版) ›› 2013, Vol. 43 ›› Issue (增刊1): 435-439.

Previous Articles     Next Articles

Research on VAD feature of exponent function warping group delay function

WANG Jin-fang, GUO Ming   

  1. College of Communications Engineering, Jilin University, Changchun 130012, China
  • Received:2012-06-07 Published:2013-06-01

Abstract:

Although the robustness to noise of group delay function (GDF) was proved,the spiky effects caused by the resonance seriously undermined its further application.The ability to represent the original acoustic space was by no means guaranteed for the current improvement solutions.On the premise reducing the information loss in the process of feature extraction and aiming at decreasing its dynamic range,the feature of exponent function warping group delay function was produced.The test experiments of voice activity detection indicate that the robustness and detection accuracy is raised remarkably compared to the present GDF versions.

Key words: speech signal processing, voice activity detection, exponent function warping, group delay function

CLC Number: 

  • TN912.3

[1] Dong E,Liu G,Zhou Y,et al.Voice activity detection based on short-time energy and noise spectrum adaptation[C]//In 2002 6th International Conference on Signal Processing(ICSP'02),Beijing,China,2002:464-467.

[2] Sangwan A,Chiranth M C,Jamadagni H S,et al.VAD techniques for real-time speech transmission on the Internet[C]//In 5th IEEE International Conference on High Speed Networks and Multimedia Communications,Jeju Island,Korea,2002:46-50.

[3] Nemer E,Goubran R,Mahmoud S.Robust voice activity detection using higher-order statistics in the LPC residual domain[J].IEEE Transactions on Speech and Audio Processing,2001,9:217-231.

[4] Alsteris L D,Paliwal K K.Short-time phase spectrum in speech processing:a review and some experimental results[J].Digital Signal Processing,2007,17:578-616.

[5] Murthy H A,Gadde V.The modified group delay function and its application to phoneme recognition[C]//In 2003 IEEE International Conference on Acoustics,Speech,and Signal Processing (ICASSP'03),Hong Kong,China,2003:68-71.

[6] Murthy H A,Madhu Murthy K V,Yegnanarayana B.Formant extraction from phase using weighted group delay function[J].Electronics Letters,1989,25:1609-1611.

[7] Yegnanarayana B,Murthy H A.Significance of group delay functions in spectrum estimation[J].IEEE Transactions on Signal Processing,1992,40:2281-2289.

[8] Murthy H A,Yegnanarayana B.Group delay functions and its applications in speech technology[J].Springer,2011,36:745-782.

[1] WANG Hong-zhi, XU Yu-chao, LI Mei-jing. Voice activity detection algorithm based on Mel frequency cepstrum coefficient(MFCC) similarity [J]. , 2012, 42(05): 1331-1335.
[2] WANG Hai-yan, ZHAO Xiao-hui, GU Hai-jun. Probability distribution estimation of speech amplitude spectrum based on Rayleigh mixture hidden Markov model [J]. , 2012, 42(05): 1327-1330.
[3] LIU Bai-sen,LU Zhi-mao,SHEN Li-ran,JIN Hui.  Voice activity detection with low signal-to-noise ratio based on Hilbert-Huang transform [J]. 吉林大学学报(工学版), 2011, 41(03): 844-848.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!