基于融合特征ADRMFCC的语音识别方法

Journal of Jilin University Science Edition ›› 2024, Vol. 62 ›› Issue (4): 943-950.

Previous Articles Next Articles

Speech Recognition Method Based on Fusion Feature ADRMFCC

DUO Lin, MA Jian, WEI Guixiang, TANG Jian

Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China

Received:2023-07-12 Online:2024-07-26 Published:2024-07-26

Abstract

Abstract: Aiming at the problem of low accuracy and poor robustness of speech recognition in complex noise environment, we proposed a speech recognition method based on Mel cepstrum fusion feature of increasing and decreasing residuals. This method first used the increase and decrease component method to screen the key speech features, and then mapped them to the Mel domain-residual domain spatial coordinate system to generate the increase and decrease residual Mel cepstral coefficients. Finally, these fusion features were used to train the end-to-end model. The experimental results show that the proposed method significantly improves the accuracy and performance of speech recognition under different noise types and signal-to-noise ratio conditions. Under the low signal-to-noise ratio condition of -5 dB, the speech recognition accuracy reaches 73.13%, while the average speech
recognition accuracy under other noise conditions reaches 88.67%, which fully proves the effectiveness and robustness of the proposed method.

Key words: speech recognition, residual Mel cepstral coefficient, feature screening, increase and decrease , component method

CLC Number:

TP391

DUO Lin, MA Jian, WEI Guixiang, TANG Jian. Speech Recognition Method Based on Fusion Feature ADRMFCC[J].Journal of Jilin University Science Edition, 2024, 62(4): 943-950.

[1]	MA Jian, DUO Lin, WEI Guixiang, TANG Jian. End-to-End Speech Recognition Based on Threshold-Based BPE-Dropout Multi-task Learning [J]. Journal of Jilin University Science Edition, 2024, 62(3): 674-682.
[2]	JIANG Nan, PANG Yongheng, GAO Shuang. Speech Recognition Based on Attention Mechanism and Spectrogram Feature Extraction [J]. Journal of Jilin University Science Edition, 2024, 62(2): 320-0330.
[3]	SHI Xiaohu, YUAN Yuping, LV Guilin, CHANG Zhiyong, ZOU Yuanjun. Compression Algorithms for Automatic Speech Recognition Models: A Survey [J]. Journal of Jilin University Science Edition, 2024, 62(1): 122-0131.
[4]	LIU Yanxiu, SUN Yiming, YANG Huamin. Noise Robust Continuous Speech Recognition Based on Normalization [J]. Journal of Jilin University Science Edition, 2015, 53(03): 519-524.
[5]	WU Xi hong, WU Hao, GAO Qin, LIN Xiao jun, WANG Xin hao. Latent Semantic Analysis Language Model and Its Application in Chinese Large Vocabulary Continuous Speech Recognition [J]. J4, 2006, 44(06): 16-20.
[6]	WANG Peng, LIU Jia, LIU Run-sheng. Discrete HMM Based Speaker Independent Keyword Spotting Speech Recognition Syste m [J]. J4, 2003, 41(03): 347-351.

Speech Recognition Method Based on Fusion Feature ADRMFCC

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 6

Metrics

Comments

Recommended 0