基于XLM-RoBERTa-Large-Finetuned-Conll03-English模型结合CRF的中文命名实体识别微调优化方法

Journal of Jilin University Science Edition ›› 2026, Vol. 64 ›› Issue (2): 370-0376.

Previous Articles Next Articles

Fine-Tuning Optimization Method of Chinese Named Entity Recognition Based on XLM-RoBERTa-Large-Finetuned-Conll03-English Model Combined with CRF

LIAN Xiongjie, DONG Zhen

Center of Information Technology, Yanbian University, Yanji 133002, Jilin Province, China

Received:2024-09-11 Online:2026-03-26 Published:2026-03-26

Abstract

Abstract: Aiming at the problem that there was no obvious space separation between words in Chinese, which led to unclear vocabulary boundaries, and it was difficult to accurately capture the relationship between entities and surrounding words, resulting in low accuracy of Chinese named entity recognition, we proposed a fine-tuning optimization method of Chinese named entity
recognition based on XLM-RoBERTa-Large-Finetuned-Conll03-English model combined with conditional random field (CRF). Firstly, we established a Chinese named entity indicator lexicon, determined the scope of named entities, sorted the entities, and used probability calculation to obtain the optimal features of named entities. Secondly, we introduced the features obtained by CRF into the XLM-RoBERTa-Large-Finetuned-Conll03-English model to capture the feature sequences of named entities and their dependencies. Finally, by adding CRF layer to the multi-language model, the fine-tuning optimization of Chinese named entity recognition was realized. The experimental results show that this fine-tuning optimization method significantly improves the performance of Chinese named entity recognition, enabling the model to have higher accuracy and lower loss value, and better applicability in Chinese named entity recognition (NER) task.

Key words: XLM-RoBERTa model, named entity recognition, fine-tuning optimization, conditional random field

CLC Number:

TP391.1

LIAN Xiongjie, DONG Zhen. Fine-Tuning Optimization Method of Chinese Named Entity Recognition Based on XLM-RoBERTa-Large-Finetuned-Conll03-English Model Combined with CRF[J].Journal of Jilin University Science Edition, 2026, 64(2): 370-0376.

[1]	YANG Chenju, SUN Jun, PI Qiandong, SHAO Yubin, LONG Hua. Hierarchical Parsing Based on CRF and Multiple Rules [J]. Journal of Jilin University Science Edition, 2020, 58(6): 1452-1460.
[2]	XU Zhongyi, HU Qian, LIU Lei. Chinese Text Chunking Based CRF [J]. J4, 2007, 45(03): 416-420.

Fine-Tuning Optimization Method of Chinese Named Entity Recognition Based on XLM-RoBERTa-Large-Finetuned-Conll03-English Model Combined with CRF

PDF (PC)

Like

Knowledge

Abstract

Cite this article

share this article

References

Related Articles 2

Metrics

Comments

Recommended 0