Journal of Jilin University Science Edition

Short Text Classification Model Based on Integrated Neural Networks

GAO Yunlong1,2, ZUO Wanli1,2, WANG Ying1,2, WANG Xin2,3   

  1. College of Computer Science and Technology, Jilin University, Changchun 130012, China; 2. Key Laboratory of Symbol Computation and Knowledge Engineering of Ministry of Education, Changchun 130012, China; 3. School of Computer Technology and Engineering, Changchun Institute of Technology, Changchun 130012, China
  • Received: 2017-05-25 Online: 2018-07-26 Published: 2018-07-31
  • Contact: ZUO Wanli E-mail: zuowl@jlu.edu.cn

Abstract: Short texts are sparse and contain very few words, which makes them difficult to classify. To address this problem, we propose a short text classification model based on integrated neural networks. First, extended word vectors are used as the model input, so that the numerical word vectors effectively describe the morphological, syntactic, and semantic features of short text. Second, a recurrent neural network (RNN) is used to model the semantics of short text and capture the dependencies within its internal structure. Finally, a regularization term is used during training to select the model that jointly minimizes empirical risk and model complexity. Short text classification experiments on the corpus verify that the proposed model classifies well, handles short text input of variable length, and is robust.
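
To make the last two steps concrete, the following is a minimal illustrative sketch, not the model from the paper. It assumes a plain Elman-style RNN, a softmax output layer, cross-entropy as the empirical risk, and an L2 penalty as the regularization term; the abstract specifies none of these. All names (`rnn_class_probs`, `regularized_loss`, `lam`), the dimensions, and the random stand-in word vectors are hypothetical.

```python
import math
import random


def matvec(W, x):
    """Multiply matrix W (a list of rows) by vector x."""
    return [sum(w * v for w, v in zip(row, x)) for row in W]


def rand_matrix(rows, cols, rng, scale=0.1):
    """Small random weights; a stand-in for trained parameters."""
    return [[rng.uniform(-scale, scale) for _ in range(cols)] for _ in range(rows)]


def rnn_class_probs(word_vectors, W_xh, W_hh, W_hy):
    """Run an Elman-style RNN over the word vectors and return class probabilities."""
    h = [0.0] * len(W_hh)
    for x in word_vectors:  # one step per word, so any text length is accepted
        a = [p + q for p, q in zip(matvec(W_xh, x), matvec(W_hh, h))]
        h = [math.tanh(v) for v in a]
    logits = matvec(W_hy, h)
    m = max(logits)  # subtract the max for a numerically stable softmax
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def regularized_loss(probs, label, weights, lam):
    """Cross-entropy on one example plus an L2 penalty on all weights (assumed form)."""
    penalty = sum(w * w for W in weights for row in W for w in row)
    return -math.log(probs[label]) + lam * penalty


rng = random.Random(0)
EMB, HID, CLASSES = 4, 5, 3  # hypothetical sizes
W_xh = rand_matrix(HID, EMB, rng)
W_hh = rand_matrix(HID, HID, rng)
W_hy = rand_matrix(CLASSES, HID, rng)

# Random vectors stand in for the extended word vectors of a 2-word and a 7-word text.
short_text = [[rng.uniform(-1, 1) for _ in range(EMB)] for _ in range(2)]
longer_text = [[rng.uniform(-1, 1) for _ in range(EMB)] for _ in range(7)]

probs_short = rnn_class_probs(short_text, W_xh, W_hh, W_hy)
probs_long = rnn_class_probs(longer_text, W_xh, W_hh, W_hy)
loss = regularized_loss(probs_short, 1, [W_xh, W_hh, W_hy], lam=0.01)
```

Because the recurrence consumes one word vector per step, the same weights serve both the 2-word and the 7-word input. The `lam` coefficient sets the trade-off between fitting the training data and keeping the weights small.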

Key words: short text, classification, extended word vector, integrated neural network

CLC Number: 

  • TP181