Journal of Jilin University Science Edition ›› 2022, Vol. 60 ›› Issue (6): 1399-1406.

Previous Articles     Next Articles

Semantic Similarity Calculation Method of News Text and Comment Integrated with News Title Information

LI Yitong1, WANG Hongbin1, CHENG Liang2   

  1. 1. Faculty of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650504, China; 2. College of City, Kunming University of Science and Technology, Kunming 650051, China
  • Received:2021-08-27 Online:2022-11-26 Published:2022-11-26

Abstract: Aiming at the problem that the pre-training model would cut off part of text when dealing with long text such as news, which led to the loss of text infomation, we  proposed a  method to build a model by combining  TextRank algorithm, implicit Dirichlet distribution topic model and pre-training model on the basis of integrating news title information, and  compared the model with other  semantic similarity calculation methods. The results show that the accuracy rate of the model is 82.46%, the recall rate is 87.43%, the accuracy rate is 82.68%, and the F1 value is 84.99%,  the optimal results are obtained, which effectively improves the performance of semantic similarity calculation between news texts and comments.

Key words: semantic similarity, pre-training model, implicit Dirichlet distribution,  , news comment

CLC Number: 

  • TP391.1