吉林大学学报(工学版) ›› 2016, Vol. 46 ›› Issue (3): 870-875.doi: 10.13229/j.cnki.jdxbgxb201603029

• Orginal Article • Previous Articles     Next Articles

Speaker recognition algorithm based on channel compensation

SHEN Xuan-jing1, 2, ZHAI Yu-jie1, 2, LU Yu-tong3, WANG Yu1, 2, 4, CHEN Hai-peng1, 2   

  1. 1.College of Computer Science and Technology, Jilin University, Changchun 130012, China;
    2.Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education,Jilin University, Changchun 130012, China;
    3.Faculty of Engineering,The Hong Kong Polytechnic University,Hong Kong 999077,China;
    4.Applied Technology College of Jilin University, Changchun 130012, China
  • Received:2014-08-28 Online:2016-06-20 Published:2016-06-20

Abstract: Channel interference factor for the identification results is prevalent among the existing speaker recognition algorithm. In order to improve the accuracy of the system, in this paper, feature warping is used to compensate the channel factor of Mel-Frequency Cepstral Coefficient (MFCC) features. Then, factor analysis technique is applied to deal with the channel factors of the speaker's Gaussian Mixture Model (GMM). In the endpoint detection phase of speech of this recognition system, the GMM for speech modeling is built to accurately determine the beginning and end points of the speech segment, and then the features after feature warping are used to establish speaker GMM. Using factor analysis technique to fit the differences between the speaker characteristics space and the channel space, the algorithm removes channel factor from GMM, and then extracts GMM super-vectors as input of the Support Vector Machine (SVM) to obtain recognition results. Experimental results show that the combination of channel compensation technique and SVM can obtain better recognition rate, and ensure the robustness of the system.

Key words: computer application, speaker recognition, support vector machine(SVM), Gaussian mixture model(GMM), feature warp, latent factor analysis(LFA)

CLC Number: 

  • TP391
[1] Takiguchi T, Nakamura S, Shikano K. HMM-separation-based speech recognition for a distant moving speaker[J]. IEEE Transactions on Speech and Audio Processing,2001,9(2):127-140.
[2] 吴迪,曹洁,王进花.基于自适应高斯混合模型与静动态听觉特征融合的说话人识别[J]. 光学精密工程,2013,21(6):1598-1604.
Wu Di,Cao Jie,Wang Jin-hua. Speaker recognition based on adapted Gaussian mixture model and static and dynamic auditory feature fusion[J]. Optics and Precision Engineering,2013,21(6):1598-1604.
[3] Johnson M, Sinha P. A compact model for speaker-adaptive training[J]. Powder Technology,2013,237(3):506-513.
[4] Kinnunen T, Li H. An overview of text-independent speaker recognition:from features to supervectors[J]. Speech Communication,2010,52(1):12-40.
[5] Kasuriya S,Wutiwiwatchai C,Achariyakulporn V,et al.Comparative study of continuous hidden Markov
models (CHMM) and artificial neural network (ANN) on speaker identification system[J]. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems,2001,9(6):673-683.
[6] Campbell W M, Sturim D E, Reynolds D A. Support vector machines using GMM supervectors for speaker verification[J]. Signal Processing Letters,2006,13(5):308-311.
[7] Munteanu D P, Toma S A. Automatic speaker verification experiments using HMM[C]∥8th International Conference on Communications, Bucharest,Romanian,2010:107-110.
[8] Badran E F M F, Selim H. Speaker recognition using artificial neural networks based on vowel phonemes[C]∥5th International Conference on Signal Processing, Beijing,China, 2000:796-802.
[9] 张素敏,苏东林,王炜. 改进的基于决策树的说话人在线聚类[J]. 光学精密工程,2010,18(1):227-233.
Zhang Su-min,Su Dong-lin,Wang Wei. Improved online speaker clustering based on decision tree[J]. Optics and Precision Engineering, 2010,18(1):227-233.
[10] Ding I J, Yen C T. Enhancing GMM speaker identification by incorporating SVM speaker verification for intelligent web-based speech applications[J]. Multimedia Tools and Applications,2015,74(14):5131-5140.
[11] Sen N, Patil H A, Mandal S K D, et al. Importance of Utterance Partitioning in SVM Classifier with GMM Supervectors for Text-Independent Speaker Verification[M]. Heidelberg:Springer International Publishing,2013:780-789.
[12] 王玉,申铉京,陈海鹏,等. 多角度特征融合的视频人脸纹理表示及识别[J]. 吉林大学学报:工学版,2015,45(6):1954-1960.
Wang Yu,Shen Xuan-jing,Chen Hai-peng,et al. Video-based face texture representation and recognitionwith fusion features from multi-view[J]. Journal of Jilin University(Engineering and Technology Edition), 2015,45(6):1954-1960.
[13] Neff M, Kipp M, Albrecht I, et al. Gesture modeling and animation based on a probabilistic re-creation of speaker style[J]. Acm Transactions on Graphics,2008,27(1):329-339.
[14] Chang C C, Lin C J. LIBSVM: a library for support vector machines[DB/OL].[2014-07-26].http:∥www.csie.ntu.edu.tw/~cjlin/papers/libsvm.pdf.
[1] LIU Fu,ZONG Yu-xuan,KANG Bing,ZHANG Yi-meng,LIN Cai-xia,ZHAO Hong-wei. Dorsal hand vein recognition system based on optimized texture features [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1844-1850.
[2] WANG Li-min,LIU Yang,SUN Ming-hui,LI Mei-hui. Ensemble of unrestricted K-dependence Bayesian classifiers based on Markov blanket [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1851-1858.
[3] JIN Shun-fu,WANG Bao-shuai,HAO Shan-shan,JIA Xiao-guang,HUO Zhan-qiang. Synchronous sleeping based energy saving strategy of reservation virtual machines in cloud data centers and its performance research [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1859-1866.
[4] ZHAO Dong,SUN Ming-yu,ZHU Jin-long,YU Fan-hua,LIU Guang-jie,CHEN Hui-ling. Improved moth-flame optimization method based on combination of particle swarm optimization and simplex method [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1867-1872.
[5] LIU En-ze,WU Wen-fu. Agricultural surface multiple feature decision fusion disease judgment algorithm based on machine vision [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(6): 1873-1878.
[6] OUYANG Dan-tong, FAN Qi. Clause-level context-aware open information extraction [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1563-1570.
[7] LIU Fu, LAN Xu-teng, HOU Tao, KANG Bing, LIU Yun, LIN Cai-xia. Metagenomic clustering method based on k-mer frequency optimization [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1593-1599.
[8] GUI Chun, HUANG Wang-xing. Network clustering method based on improved label propagation algorithm [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1600-1605.
[9] LIU Yuan-ning, LIU Shuai, ZHU Xiao-dong, CHEN Yi-hao, ZHENG Shao-ge, SHEN Chun-zhuang. LOG operator and adaptive optimization Gabor filtering for iris recognition [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1606-1613.
[10] CHE Xiang-jiu, WANG Li, GUO Xiao-xin. Improved boundary detection based on multi-scale cues fusion [J]. Journal of Jilin University(Engineering and Technology Edition), 2018, 48(5): 1621-1628.
[11] ZHAO Hong-wei, LIU Yu-qi, DONG Li-yan, WANG Yu, LIU Pei. Dynamic route optimization algorithm based on hybrid in ITS [J]. 吉林大学学报(工学版), 2018, 48(4): 1214-1223.
[12] HUANG Hui, FENG Xi-an, WEI Yan, XU Chi, CHEN Hui-ling. An intelligent system based on enhanced kernel extreme learning machine for choosing the second major [J]. 吉林大学学报(工学版), 2018, 48(4): 1224-1230.
[13] FU Wen-bo, ZHANG Jie, CHEN Yong-le. Network topology discovery algorithm against routing spoofing attack in Internet of things [J]. 吉林大学学报(工学版), 2018, 48(4): 1231-1236.
[14] CAO Jie, SU Zhe, LI Xiao-xu. Image annotation method based on Corr-LDA model [J]. 吉林大学学报(工学版), 2018, 48(4): 1237-1243.
[15] HOU Yong-hong, WANG Li-wei, XING Jia-ming. HTTP-based dynamic adaptive streaming video transmission algorithm [J]. 吉林大学学报(工学版), 2018, 48(4): 1244-1253.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
[1] LIU Song-shan, WANG Qing-nian, WANG Wei-hua, LIN Xin. Influence of inertial mass on damping and amplitude-frequency characteristic of regenerative suspension[J]. 吉林大学学报(工学版), 2013, 43(03): 557 -563 .
[2] CHU Liang, WANG Yan-bo, QI Fu-wei, ZHANG Yong-sheng. Control method of inlet valves for brake pressure fine regulation[J]. 吉林大学学报(工学版), 2013, 43(03): 564 -570 .
[3] LI Jing, WANG Zi-han, YU Chun-xian, HAN Zuo-yue, SUN Bo-hua. Design of control system to follow vehicle state with HIL test beach[J]. 吉林大学学报(工学版), 2013, 43(03): 577 -583 .
[4] HU Xing-jun, LI Teng-fei, WANG Jing-yu, YANG Bo, GUO Peng, LIAO Lei. Numerical simulation of the influence of rear-end panels on the wake flow field of a heavy-duty truck[J]. 吉林大学学报(工学版), 2013, 43(03): 595 -601 .
[5] WANG Tong-jian, CHEN Jin-shi, ZHAO Feng, ZHAO Qing-bo, LIU Xin-hui, YUAN Hua-shan. Mechanical-hydraulic co-simulation and experiment of full hydraulic steering systems[J]. 吉林大学学报(工学版), 2013, 43(03): 607 -612 .
[6] ZHANG Chun-qin, JIANG Gui-yan, WU Zheng-yan. Factors influencing motor vehicle travel departure time choice behavior[J]. 吉林大学学报(工学版), 2013, 43(03): 626 -632 .
[7] MA Wan-jing, XIE Han-zhou. Integrated control of main-signal and pre-signal on approach of intersection with double stop line[J]. 吉林大学学报(工学版), 2013, 43(03): 633 -639 .
[8] YU De-xin, TONG Qian, YANG Zhao-sheng, GAO Peng. Forecast model of emergency traffic evacuation time under major disaster[J]. 吉林大学学报(工学版), 2013, 43(03): 654 -658 .
[9] XIAO Yun, LEI Jun-qing, ZHANG Kun, LI Zhong-san. Fatigue stiffness degradation of prestressed concrete beam under multilevel amplitude cycle loading[J]. 吉林大学学报(工学版), 2013, 43(03): 665 -670 .
[10] XIAO Rui, DENG Zong-cai, LAN Ming-zhang, SHEN Chen-liang. Experiment research on proportions of reactive powder concrete without silica fume[J]. 吉林大学学报(工学版), 2013, 43(03): 671 -676 .