Journal of Jilin University Science Edition ›› 2026, Vol. 64 ›› Issue (2): 403-0410.

Previous Articles     Next Articles

Adaptive Speech Enhancement Algorithm Based on Improved Multi-metric Optimization

FU Chunyu, LIU Jun   

  1. School of Physics and Electronic Engineering, Northeast Petroleum University, Daqing 163318, Heilongjiang Province, China
  • Received:2024-10-15 Online:2026-03-26 Published:2026-03-26

Abstract: Aiming at  the problem of susceptibility to outlier interference and unstable optimization during training process of  multi-index speech enhancement algorithms, we proposed an adaptive speech enhancement algorithm based on a multi-head attention mechanism. Firstly, by  introducing a multi-head attention structure into the intermediate layer of the discriminator network, we  enhanced the joint modeling ability of the  model for local features and overall structure of speech spectrum, and  combined it with an online knowledge distillation strategy to achieve information sharing among multiple generators, thereby improving the collaborative optimization effect under multi-index conditions. Secondly, in order to reduce the impact of outliers on the training process, we replaced the loss function with a logarithmic mean-squared error form to improve  stability and robustness of the model. Experimental results on the publicly available speech dataset VoiceBank-DEMAND show that this method outperforms existing multi-index speech enhancement models in terms of speech quality, background noise suppression, and speech intelligibility metrics. Therefore,  introducing an attention mechanism and a stabilizing loss function can significantly improve the overall performance of multi-index speech enhancement algorithms.

Key words: speech enhancement, frequency domain, multi-head attention mechanism, online knowledge distillation, logarithmic mean-squared error loss

CLC Number: 

  • TP391