Journal of Jilin University (Information Science Edition) ›› 2025, Vol. 43 ›› Issue (2): 384-393.

Previous Articles     Next Articles

Chinese Named Entity Recognition Based on BERT-BiLSTM-CRF

LONG Xingquan, LI Jia   

  1. College of Information and Control Engineering, Jilin Institute of Chemical Technology, Jilin 132022, China
  • Received:2024-02-23 Online:2025-04-08 Published:2025-04-10

Abstract: Existing Chinese named entity recognition algorithms inadequately consider the data features of entity recognition tasks, leading to imbalance in the categories of Chinese sample data, excessive noise in the training data, and significant differences in the distribution of generated data. An improved Chinese named entity recognition model based on BERT-BiLSTM-CRF ( Bidirectional Encoder Representations from Transformers-Bidirectional Long Short-Term Memory-Conditional Random Field) is proposed. The first improvement involves combining the P-Tuning v2 technology with BERT-BiLSTM-CRF to accurately extract data features. And three
loss functions, including Focal Loss, Label Smoothing, and KL Loss(Kullback-Leibler divergence loss), are utilized as regularization terms in the loss calculation to address the problems. The improved model achieves F1 scores of 71. 13% ,96. 31% , and 95. 90% on the Weibo, Resume, and MSRA( Microsoft Research Asia)datasets, respectively. The results validate that the proposed algorithm outperforms previous research achievements in terms of performance and is easy to combine and extend with other neural networks for various downstream tasks.

Key words: Chinese named entity recognition, bidirectional encoder representations from transformers-bidirectional long short-term memory-conditional random field ( BERT-BiLSTM-CRF ) model, P-tuning v2 technology, loss function

CLC Number: 

  • TP391. 1