J4

• 计算机科学 • Previous Articles     Next Articles

Latent Semantic Analysis Language Model and Its Application in Chinese Large Vocabulary Continuous Speech Recognition

WU Xi hong, WU Hao, GAO Qin, LIN Xiao jun, WANG Xin hao   

  1. National Laboratory on Machine Perception, College of Information Science and Technology, Peking University, Beijing 100871, China
  • Received:2006-05-28 Revised:1900-01-01 Online:2006-08-26 Published:2006-08-26
  • Contact: WU Xi hong

Abstract: Integrating high level semantic knowledge in speech recognition has been a hot topic in the field. Latent semantic analysis (LSA) technology can model long span correlation of words in the language efficiently. How to utilize LSA in language modeling and guide searching procedure more accurately is an urgent task. We analyzed major problems on the modeling and decoding procedure of LSA language model, realized modeling of LSA language model, and proposed a method to use LSA language model in the first-pass decoding of continuous speech recognition system. Experimental result shows that the proposed method can improve the performance of large vocabulary continuous speech recognition of Chinese language significantly.

Key words: language model, vector space model, latent semantic an alysis, speech recognition

CLC Number: 

  • TP391