Journal of Jilin University(Engineering and Technology Edition) ›› 2021, Vol. 51 ›› Issue (3): 989-995.doi: 10.13229/j.cnki.jdxbgxb20200640

Previous Articles     Next Articles

Chinese named entity recognition based on Transformer encoder

Xiao-ran GUO1(),Ping LUO2,Wei-lan WANG3   

  1. 1.School of Mathematics and Computer Science,Northwest Minzu University,Lanzhou 730030,China
    2.School of Electronic and Information Engineering,Lanzhou Jiaotong University,Lanzhou 730070,China
    3.Key Laboratory of China's Ethnic Languages and Information Technology,Ministry of Education,Northwest Minzu University,Lanzhou 730030,China
  • Received:2020-08-20 Online:2021-05-01 Published:2021-05-07

Abstract:

This paper proposes a Chinese named entity recognition method based on Transformer encoder and BiLSTM. This method uses a joint vector as the word representation layer by combining the word embedding and the position coding vector to avoid the losses of the word embedding information and the position information. The directional information is integrated into the joint vector using BiLSTM. The Transformer encoder is introduced to further extract the word relationship features. The experimental results show that the F value of this method on the general MSRA and Thangka domain data sets reaches 81.39% and 86.99% respectively, which effectively improve the effect of Chinese named entity recognition.

Key words: named entity recognition, Transformer encoder, BiLSTM, position coding

CLC Number: 

  • TP391

Fig.1

Bilstm transformer CRF model"

Fig.2

LSTM unit"

Fig.3

Transformer coding layer"

Table 1

Statistical information of experimentaldata set"

语料类别训练集验证集测试集
MSRA句子32 4218 1054 631
PER11 2742 8181 973
LOC23 3725 8422 877
ORG13 1663 2911 331
唐卡句子1 591397692
NSL3 1547881 056

Table 2

Comparison of recognition effects ofdifferent models on dataset"

CoupsModelPRF1
MSRACRF52.9631.8939.81
BiLSTM-CRF81.2476.3678.73
BiLSTM-Attention-CRF83.9774.6279.02
Transformer-CRF66.7060.3363.35
Transformer*-CRF67.3161.3564.19
Ours*81.8379.0180.40
Ours87.0576.4381.39
唐卡CRF80.3368.9074.18
BiLSTM-CRF87.5680.0283.63
BiLSTM-Attention-CRF87.6780.7884.09
Transformer-CRF80.3663.9271.20
Transformer*-CRF90.1581.4485.57
Ours*90.1580.6885.15
Ours93.8681.0686.99

Table 3

Identification results of model1 andmodel2 in Thangka test set"

命名实体长度总计
123~56~10>10
测试集实体个数0559306921056
Model1识别实体个数5446821520973
Model1识别正确实体个数028774510853
Model1正确率/%060.894.298.1-87.7
Model2识别实体个数1151797521912
Model2识别正确实体个数033773500856
Model2正确率/%064.796.996.1093.9
1 张晓艳, 王挺, 陈火旺. 命名实体识别研究[J]. 计算机科学, 2005, 32(4):44-48.
Zhang Xiao-yan, Wang Ting, Chen Huo-wang. Research on named entity recognition[J]. Computer Science, 2005, 32(4):44-48.
2 刘浏, 王东波. 命名实体识别研究综述[J]. 情报学报, 2018, 37(3): 329-340.
Liu Liu, Wang Dong-bo. A review on named entity recognition[J]. Journal of the China Society for Scientific and Technical Information, 2018, 37(3):329-340.
3 张玥杰, 徐智婷, 薛向阳. 融合多特征的最大熵汉语命名实体识别模型[J]. 计算机研究与发展, 2008, 45(6):1004-1010.
Zhang Yue-jie, Xu Zhi-ting, Xue Xiang-yang. Fusion of multiple features for Chinese named entity recognition based on maximum entropy model[J]. Journal of Computer Research and Development, 2008, 45(6):1004-1010.
4 Morwal S, Jahan N, Chopra D. Named entity recognition using hidden Markov model[J]. International Journal on Natural Language Computing,2012, 1(4):15-23.
5 Ju Zhen-fei, Wang Jian, Zhu Fei. Named entity recognition from biomedical text using SVM[C]∥International Conference on Bioinformatics & Biomedical Engineering, Wuhan,China, 2011:1-4.
6 王路路, 艾山·吾买尔, 买合木提·买买提,等. 基于CRF和半监督学习的维吾尔文命名实体识别[J]. 中文信息学报, 2018, 32(11):16-26, 33.
Wang Lu-lu, Wumaier Aishan, Maimaiti Maihemuti, et al. A semi-supervised approach to uyghur named entity recognition based on CRF[J]. Journal of Chinese Information Processing, 2018, 32(11):16-26, 33.
7 Maryam Habibi, Leon Weber, Mariana Neves, et al. Deep learning with word embeddings improves biomedical named entity recognition[J]. Bioinformatics, 2017, 33(14):37-48.
8 Lei J, Tang B, Lu X, et al. Research and applications: a comprehensive study of named entity recognition in Chinese clinical text[J]. Journal of the American Medical Informatics Association, 2014, 21(5):808-814.
9 Ji Y, Tong C, Liang J, et al. A deep learning method for named entity recognition in bidding document[J]. Journal of Physics:Conference Series, 2019, 1168(3):032076.
10 Levy O, Goldberg Y. Neural word embedding asimplicit matrix factorization[J/OL].[2020-08-12].
11 Hochreiter S, Schmidhuber J. Long short-term memory[J]. Neural Computation, 1997, 9(8):1735-1780.
12 Graves A, Jurgen S. Framewise phoneme classification with bidirectional LSTM and other neural network architectures[J]. Neural Networks, 2005, 18(5/6):602-610.
13 Huang Z H, Xu W, Yu K. Bidirectional LSTM-CRF models for sequence tagging[J/OL].[2020-08-15].
14 李明浩, 刘忠, 姚远哲. 基于LSTM-CRF的中医医案症状术语识别[J]. 计算机应用, 2018, 38(2):42-46.
Li Ming-hao, Liu Zhong, Yao Yuan-zhe. LSTM-CRF based symptom term recognition on traditional Chinese medical case[J]. Journal of Computer Applications, 2018, 38(2):42-46.
15 Ma X Z, Hovy E. End-to-end sequence labeling via bidirectional LSTM-CNNs-CRF[J/OL]. [2020-08-15].
16 韩鑫鑫,贲可荣,张献. 军用软件测试领域的命名实体识别技术研究[J]. 计算机科学与探索, 2020, 14(5):740-748.
Han Xin-xin, Ke-rong Ben, Zhang Xian. Research on named entity recognition technology in military software testing[J]. Journal of Frontiers of Computer Science & Technology, 2020, 14(5):740-748.
17 Vaswani A, Shazeer N, Parmar N, et al. Attention is all you need[J]. Proceedings of Advances in Neural Information Processing Systems, 2017(12):6000-6010.
18 Devlin J, Chang M W, Lee K, et al. BERT: pre-training of deep bidirectional transformers for language understanding[J/OL].[2020-08-17].
19 Rei M, Crichton G, Pyysalo S. Attending to characters in neural sequence labeling models[C]∥International Conference on Computational Linguistics, Osaka, Japan, 2016: 309-318.
20 李明扬, 孔芳. 融入自注意力机制的社交媒体命名实体识别[J]. 清华大学学报:自然科学版, 2019, 59(6):461-467.
Li Ming-yang, Kong Fang. Combined self-attention mechanism for named entity recognition in social media[J]. Journal of Tsinghua University(Science and Technology), 2019, 59(6): 461-467.
21 Yan H, Deng B, Li X N, et al. TENER: adapting transformer encoder for named entity recognition[J]. Computation and Language, 2019, 1:04474.
22 Li X Y, Meng Y X, Sun X F, et al. Is word segmentation necessary for deep learning of Chinese representations?[C]∥Proceedings of the 57th Annual Meeting of the Association for Computational Linguistics, Florence, Italy, 2019:3242-3252.
[1] Tao NI,Hai-qiang LIU,Lin-lin WANG,Shao-yuan ZOU,Hong-yan ZHANG,Ling-tao HUANG. Intelligent manipulation method of crane based on BiLSTM model [J]. Journal of Jilin University(Engineering and Technology Edition), 2020, 50(2): 445-453.
[2] YAN Yang, WEN Dun-wei, WANG Yun-ji, WANG Ke. Named entity recognition in Chinese medical records based on cascaded conditional random field [J]. 吉林大学学报(工学版), 2014, 44(6): 1843-1848.
Viewed
Full text


Abstract

Cited

  Shared   
  Discussed   
No Suggested Reading articles found!