Journal of Jilin University Science Edition, 2024, Vol. 62, Issue (1): 122-131.


Compression Algorithms for Automatic Speech Recognition Models: A Survey

SHI Xiaohu1, YUAN Yuping2, LV Guilin3, CHANG Zhiyong4, ZOU Yuanjun5   

1. College of Computer Science and Technology, Jilin University, Changchun 130012, China; 2. Management Center of Big Data and Network, Jilin University, Changchun 130012, China; 3. Intelligent Network Development Institute, R&D Institute of China FAW Group Co., Ltd., Changchun 130011, China; 4. College of Biological and Agricultural Engineering, Jilin University, Changchun 130022, China; 5. School of Medical Information, Changchun University of Chinese Medicine, Changchun 130117, China
Received: 2023-02-23; Online: 2024-01-26; Published: 2024-01-26

Abstract: With the development of deep learning technology, the number of parameters in automatic speech recognition models has become increasingly large, which gradually increases the computing overhead, storage requirements, and power consumption of the models and makes them difficult to deploy on resource-constrained devices. It is therefore of great value to compress deep-learning-based automatic speech recognition models, reducing model size while preserving the original performance as much as possible. Aiming at the above problems, this paper presents a comprehensive survey of the main work in this field in recent years, organized into several categories of methods, including knowledge distillation, model quantization, low-rank decomposition, network pruning, parameter sharing, and combined models, and conducts a systematic review to provide alternative solutions for deploying models on resource-constrained devices.
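
As a concrete illustration of one of the surveyed technique families, the sketch below shows how low-rank decomposition can shrink a dense weight matrix by factoring it with a truncated SVD, a common building block in the compression literature. This example is not taken from the paper; the function name low_rank_factors and all parameter choices are hypothetical, and it uses only NumPy.

# A minimal sketch of low-rank decomposition for model compression
# (illustrative only, not code from the surveyed paper): a dense weight
# matrix W (m x n) is approximated by two thin factors A (m x r) and
# B (r x n), cutting the parameter count from m*n to r*(m + n).
import numpy as np

def low_rank_factors(W: np.ndarray, rank: int):
    """Return thin factors (A, B) with W ~= A @ B via truncated SVD."""
    U, s, Vt = np.linalg.svd(W, full_matrices=False)
    A = U[:, :rank] * s[:rank]   # absorb singular values into the left factor
    B = Vt[:rank, :]
    return A, B

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    W = rng.standard_normal((1024, 512))
    A, B = low_rank_factors(W, rank=64)
    orig_params = W.size                 # 1024 * 512 = 524288
    comp_params = A.size + B.size        # 64 * (1024 + 512) = 98304
    rel_err = np.linalg.norm(W - A @ B) / np.linalg.norm(W)
    # A random matrix is nearly full-rank, so rel_err is large here;
    # trained weight matrices are typically much closer to low-rank,
    # which is what makes this factorization useful in practice.
    print(f"params: {orig_params} -> {comp_params} "
          f"({comp_params / orig_params:.1%}), rel. error {rel_err:.3f}")

The compression pays off whenever the chosen rank r satisfies r*(m + n) < m*n; in practice r is tuned per layer to balance size reduction against recognition accuracy.
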

Key words: speech recognition, model compression, knowledge distillation, model quantization, low-rank decomposition, network pruning, parameter sharing

CLC Number: TP391