Journal of Jilin University Science Edition ›› 2021, Vol. 59 ›› Issue (4): 922-928.

Previous Articles     Next Articles

Stack Overflow Question Post Classification Method Based on Deep Learning

YANG Guang1, JIA Yanxin1, CHEN Xiang1,2, XU Shuyuan1   

  1. 1. School of Information Science and Technology, Nantong University, Nantong 226019, Jiangsu Province, China;
    2. State Key Laboratory for Novel Software Technology, Nanjing University, Nanjing 210023, China
  • Received:2020-06-10 Online:2021-07-26 Published:2021-07-26

Abstract: The classification methods based on regular expressions and traditional machine learning had the problems of manual extraction of patterns and performance bottleneck, we proposed deep learning-based classification methods for question post, the deep text mining model TextCNN and integrating attention mechanism—TextRNN were used to construct a classification model. The experimental results show that the classification performance of deep learning-based methods is better than the existing benchmark methods on most of the question purpose categories, and the Adam optimizer is better than the SGD optimizer, and the Glove pre-trained word vector is better than randomly generated word vectors. The method classifies posts for the purpose of asking question, which can add a new dimension to the analysis of post discussion topics on Stack Overflow (SO).

Key words: post question purpose, deep learning, text mining, word vector

CLC Number: 

  • TP311.5