Journal of Jilin University (Engineering and Technology Edition) ›› 2017, Vol. 47 ›› Issue (1): 294-300. doi: 10.13229/j.cnki.jdxbgxb201701043

• Original Article

Recognition to specific two words Chinese vocabulary based on projection matrix of spectrogram

LIANG Shi-li1, WEI Ying1, PAN Di1, ZHANG Ling2, XU Ting-fa3, WANG Shuang-wei1   

  1. College of Physics, Northeast Normal University, Changchun 130024, China;
  2. College of Science, Changchun University of Science and Technology, Changchun 130022, China;
  3. Key Laboratory of Photoelectronic Imaging Technology and System of Ministry of Education, Beijing Institute of Technology, Beijing 100081, China
  • Received:2015-05-30 Online:2017-01-20 Published:2017-01-20

Abstract: Image processing techniques are applied to speech recognition by extracting image features from the spectrogram. First, equal-width zoning line projection and binary-width zoning line projection are applied to the spectrogram, giving the first and second feature sets, respectively. Equal-width zoning line projection is then applied again to the Fourier-transformed spectrogram, giving the third feature set. The three feature sets are used together as feature vectors for a Support Vector Machine (SVM) classifier, which performs overall recognition of specific two-word Chinese vocabulary. A simulation experiment with 1000 voice samples shows a correct recognition rate of 92.8%, which suggests a new approach to the overall recognition of Chinese vocabulary.
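The abstract does not define the zoning schemes, so the following is only a minimal sketch of how zoned line projection of a spectrogram might look, under stated assumptions: zones partition the time axis (columns), "line projection" sums each frequency row inside a zone, and "binary width" means each zone takes half of the remaining columns. All function names are hypothetical, and the Fourier-transform feature set and the SVM classifier are left out.

```python
# Sketch of zoned line projection on a spectrogram stored as a list of rows
# (rows = frequency bins, columns = time frames). The zoning rules below are
# assumptions for illustration, not the definitions used in the paper.

def zone_bounds_equal(n_cols, n_zones):
    """Split range(n_cols) into n_zones contiguous zones of near-equal width."""
    if not 1 <= n_zones <= n_cols:
        raise ValueError("need 1 <= n_zones <= n_cols")
    return [(k * n_cols // n_zones, (k + 1) * n_cols // n_zones)
            for k in range(n_zones)]


def zone_bounds_binary(n_cols, n_zones):
    """Assumed 'binary width' zoning: each zone takes half of the columns
    that remain; the last zone takes whatever is left."""
    if n_zones < 1 or n_cols < 2 ** (n_zones - 1):
        raise ValueError("need n_cols >= 2 ** (n_zones - 1)")
    bounds, start = [], 0
    for k in range(n_zones):
        remaining = n_cols - start
        width = remaining if k == n_zones - 1 else remaining // 2
        bounds.append((start, start + width))
        start += width
    return bounds


def line_projection(spec, bounds):
    """Sum every row of spec inside each zone; result is rows x zones."""
    return [[sum(row[a:b]) for (a, b) in bounds] for row in spec]


def flatten(matrix):
    """Concatenate the rows of a projection matrix into one feature vector."""
    return [v for row in matrix for v in row]


# Toy 2 x 8 "spectrogram" used only to show the shapes involved.
toy_spec = [
    [1, 2, 3, 4, 5, 6, 7, 8],
    [1, 1, 1, 1, 1, 1, 1, 1],
]
equal_features = line_projection(toy_spec, zone_bounds_equal(8, 4))
binary_features = line_projection(toy_spec, zone_bounds_binary(8, 3))
feature_vector = flatten(equal_features) + flatten(binary_features)
```

In a full pipeline, vectors like `feature_vector` (together with the projection of the Fourier-transformed spectrogram) would be the inputs to the SVM classifier; the numbers of zones used here are arbitrary.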

Key words: communication, speech recognition, spectrogram, line projection, support vector machine

CLC Number: TN912