2012, Vol. 42, Issue 05: 1327-1330.


Probability distribution estimation of speech amplitude spectrum based on Rayleigh mixture hidden Markov model

WANG Hai-yan, ZHAO Xiao-hui, GU Hai-jun   

  1. Laboratory of Information Science, College of Communication Engineering, Jilin University, Changchun 130012, China
  • Received: 2011-08-23   Online: 2012-09-01   Published: 2012-09-01

Abstract: To address the oversimplified models of the short-time spectral amplitude distribution used in speech signal processing, a method for estimating the speech amplitude spectrum distribution based on a hidden Markov model is proposed. The method models the speech amplitude spectrum with a Rayleigh mixture model and uses a hidden Markov model to divide the speech signal into different states, each with its own set of Rayleigh mixture model parameters. By dividing the speech signal into states, the method classifies speech and builds a more accurate model of the short-time speech spectrum.
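The abstract does not give the model equations, so the following is only a minimal sketch of the general idea, not the authors' implementation. It assumes the standard Rayleigh density p(a) = (a / sigma2) exp(-a^2 / (2 sigma2)), one set of mixture weights and scale parameters per hidden state, and a textbook scaled forward recursion. All function and parameter names are hypothetical, and parameter estimation (for example by EM) is not shown.

```python
import numpy as np


def rayleigh_mixture_pdf(a, weights, sigma2s):
    """Density of a Rayleigh mixture at amplitude(s) a >= 0.

    weights: (K,) mixture weights summing to 1 (assumed).
    sigma2s: (K,) Rayleigh scale parameters sigma^2 (assumed).
    """
    a = np.asarray(a, dtype=float)[..., None]          # (..., 1)
    comps = (a / sigma2s) * np.exp(-a**2 / (2.0 * sigma2s))  # (..., K)
    return comps @ weights                              # weighted sum over K


def forward(amplitudes, pi, A, weights, sigma2s):
    """Scaled forward pass of an HMM with Rayleigh-mixture emissions.

    pi: (S,) initial state probabilities; A: (S, S) transition matrix;
    weights, sigma2s: (S, K) per-state mixture parameters.
    Returns the log-likelihood and the filtered state posteriors (T, S).
    """
    amplitudes = np.asarray(amplitudes, dtype=float)
    n_states = len(pi)
    # b[t, s] = p(a_t | state s)
    b = np.stack(
        [rayleigh_mixture_pdf(amplitudes, weights[s], sigma2s[s])
         for s in range(n_states)],
        axis=1,
    )
    posteriors = np.empty_like(b)
    log_lik = 0.0
    alpha = pi * b[0]
    for t in range(len(amplitudes)):
        if t > 0:
            alpha = (alpha @ A) * b[t]   # predict, then weight by emission
        scale = alpha.sum()
        log_lik += np.log(scale)
        alpha = alpha / scale            # rescale to avoid underflow
        posteriors[t] = alpha
    return log_lik, posteriors
```

Taking the most probable state per frame from the returned posteriors gives a rough state assignment for each spectral amplitude. This is the sense in which dividing the signal into states lets each state keep its own mixture parameters.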

Key words: speech signal processing, Rayleigh mixture model, hidden Markov model

CLC Number: TN912.3