吉林大学学报(工学版) ›› 2015, Vol. 45 ›› Issue (5): 1665-1670.doi: 10.13229/j.cnki.jdxbgxb201505041

Previous Articles     Next Articles

Voiced speech blind signal separation algorithm based on signal energy

LI Hong-yan, QU Jun-ling, ZHANG Xue-ying   

  1. College of Information Engineering, Taiyuan University of Technology, Taiyuan 030024, China
  • Received:2013-12-16 Online:2015-09-01 Published:2015-09-01

Abstract: Considering the shortcoming of instability and low SNR in existing monaural voiced speech separation algorithms, a new voiced speech separation algorithm based on signal energy is proposed, which introduces the signal energy as another important voiced speech feature. This new algorithm is based on the improvement of the classical Hu-Wang algorithm, applying energy feature to the auditory reorganization part. It further improves the reorganization performance of the target speech auditory stream as well as reduces the influence of noise and improves its stability by applying the energy feature. The experiment results show that compared with Hu-Wang algorithm, this improved algorithm can improve the segmental SNR of the target speech segmentation and improve the separation performance obviously.

Key words: communication technology, computational auditory scene analysis, speech separation, signal energy, auditory segment, auditory stream

CLC Number: 

  • TN912.3
[1] Hu K, Wang D L. Unvoiced speech segregation from nonspeech interference via CASA and spectral subtraction[J]. IEEE Trans on Audio,Speech and Lang Process, 2011, 19(6): 1600-1609.
[2] Hu K,Wang D L.An unsupervised approach to cochannel speech separation[J].IEEE Trans on Audio,Speech and Lang Process,2013, 21(1): 120-129.
[3] Hu G N, Wang D L. Auditory segmentation based on onset and offset analysis[J]. IEEE Trans on Audio,Speech and Lang Process, 2007, 15(2): 396-405.
[4] Narayanan A, Wang D L. A CASA based system for long-term SNR estimation[J]. IEEE Trans on Audio,Speech and Lang Process, 2012, 20(9): 2518-2527.
[5] Wang Y, Han K, Wang D L. Exploring monaural features for classification-based speech segregation[J]. IEEE Trans on Audio Speech and Lang Process, 2013, 21(2): 270-279.
[6] Hu G N, Wang D L. A tandem algorithm for pitch estimation and voiced speech segregation[J]. IEEE Trans on Audio Speech and Lang Process, 2010, 18(8):2067-2079.
[7] 李从清, 孙立新, 龙东,等. 语音分离技术的研究现状与展望[J]. 声学技术, 2008, 27(13):779-785. Li Cong-qing, Sun Li-xin, Long Dong,et al. State and frontiers of research in speech separation[J]. Technical Acoustics, 2008, 27(13):779-785.
[8] 马建芬.语音信号盲分离与增强算法的研究[M].北京: 电子工业出版社, 2012. [9] Hu G N, Wang D L. Monaural speech segregation based on pitch tracking and amplitude modulation[J]. IEEE Trans on Neural Networks, 2004, 15(5):1135-1150.
[10] 赵立恒,汪增福. 基于谐波和能量特征的单声道浊语音分离方法[J]. 声学学报,2012, 37(2):218-224. Zhao Li-heng,Wang Zeng-fu.Monaural voiced speech separation based on harmonic and energy features[J]. Acta Acustica, 2012, 37(2): 218-224.
[11] Wang D L,Brown G J. Computational Auditory Scene Analysis[M]. USA: IEEE Press, 2006.
[12] 张学良,刘文举,李鹏,等. 改进谐波组织规则的单通道浊语音分离系统[J]. 声学学报,2011, 36(1): 88-96. Zhang Xue-liang,Liu Wen-ju,Li Peng,et al.Monaural voiced speech separation based on improved harmonic grouping rules[J]. Acta Acustica, 2011, 36(1): 88-96.
[13] Wang D L, Brown G J. Separation of speech from interfering sounds based on oscillatory correlation[J]. IEEE Trans on Neural Networks, 1999, 10(3): 684-697.
[14] 蒋毅,梁维谦,周宏. 低信噪比下二值掩蔽算法性能分析[J]. 清华大学学报:自然科学版,2012, 52(5): 636-641. Jiang Yi, Liang Wei-qian, Zhou Hong. Performance of binary time-frequency masks in low signal to noise ratio environments[J]. Journal of Tsinghua University (Natural science edition), 2012, 52(5):636-641.
[1] ZHOU Yan-guo,ZHANG Hai-lin,CHEN Rui-rui,ZHOU Tao. Two-level game approach based resource allocation scheme in cooperative networks [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1879-1886.
[2] SUN Xiao-ying, HU Ze-zheng, YANG Jin-peng. Assessment method of electromagnetic pulse sensitivity of vehicle engine system based on hierarchical Bayesian networks [J]. 吉林大学学报(工学版), 2018, 48(4): 1254-1264.
[3] DONG Ying, CUI Meng-yao, WU Hao, WANG Yu-hou. Clustering wireless rechargeable sensor networks charging schedule based on energy prediction [J]. 吉林大学学报(工学版), 2018, 48(4): 1265-1273.
[4] MOU Zong-lei, SONG Ping, ZHAI Ya-yu, CHEN Xiao-xiao. High accuracy measurement method for synchronous triggering pulse transmission delay in distributed test system [J]. 吉林大学学报(工学版), 2018, 48(4): 1274-1281.
[5] DING Ning, CHANG Yu-chun, ZHAO Jian-bo, WANG Chao, YANG Xiao-tian. High-speed CMOS image sensor data acquisition system based on USB 3.0 [J]. 吉林大学学报(工学版), 2018, 48(4): 1298-1304.
[6] CHEN Rui-rui, ZHANG Hai-lin. Performance analysis of 3D millimeter wave communications [J]. 吉林大学学报(工学版), 2018, 48(2): 605-609.
[7] ZHANG Chao-yi, LI Jin-hai, YAN Yue-peng. Improved Tong detection algorithm with double thresholds [J]. 吉林大学学报(工学版), 2018, 48(2): 610-617.
[8] GUAN Ji-shi, SHI Yao-wu, QIU Jian-wen, SHAN Ze-biao, SHI Hong-wei. New algorithm to estimate characteristic exponent of α-stable distribution [J]. 吉林大学学报(工学版), 2018, 48(2): 618-624.
[9] LI Wei, LI Ya-jie. Satisfactory integrated design between failure accommodation and communication for non-uniform transmission networked control system under discrete event-triggered communication scheme [J]. 吉林大学学报(工学版), 2018, 48(1): 245-258.
[10] SUN Xiao-ying, WANG Zhen, YANG Jin-peng, HU Ze-zheng, CHEN Jian. Electromagnetic susceptibility assessment of electronic throttle based on Bayesian network [J]. 吉林大学学报(工学版), 2018, 48(1): 281-289.
[11] WU Wei, WANG Shi-gang, ZHAO Yan, WEI Jian, ZHONG Cheng. Hexagonal elemental image array generation [J]. 吉林大学学报(工学版), 2018, 48(1): 290-294.
[12] YUAN Jian-guo, ZHANG Xi-ruo, QIU Piao-yu, WANG Yong, PANG Yu, LIN Jin-zhao. Non-iterative phase noise suppression algorithm utilizing cyclic prefix in OFDM systems [J]. 吉林大学学报(工学版), 2018, 48(1): 295-300.
[13] WANG Jin-peng, CAO Fan, HE Xiao-yang, ZOU Nian-yu. Multi carrier system joint receiving method based on MAI and ICI [J]. 吉林大学学报(工学版), 2018, 48(1): 301-305.
[14] SHI Wen-xiao, SUN Hao-ran, WANG Shao-bo. Joint channel allocation and routing algorithm in wireless mesh network [J]. 吉林大学学报(工学版), 2017, 47(6): 1918-1925.
[15] JIANG Lai-wei, SHA Xue-jun, WU Xuan-li, ZHANG Nai-tong. Novel joint user association and resource allocation method in LTE-A HetNets [J]. 吉林大学学报(工学版), 2017, 47(6): 1926-1932.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!