Journal of Jilin University (Information Science Edition) ›› 2023, Vol. 41 ›› Issue (6): 1048-1053.


Research on Short Text Classification Based on BERT-BiGRU-CNN Model

CHEN Xuesong, ZOU Meng    

  1. School of Electrical and Information Engineering, Northeast Petroleum University, Daqing 163318, China
  • Received: 2022-11-11  Online: 2023-11-30  Published: 2023-12-01

Abstract: To address the problems that traditional language models cannot learn deep bidirectional representations and that classification models fail to adequately capture the salient features of text, a text classification model based on BERT-BiGRU-CNN (Bidirectional Encoder Representations from Transformers-Bidirectional Gated Recurrent Unit-Convolutional Neural Network) is proposed. First, the pre-trained BERT model is used for text representation; second, the BERT output is fed into a BiGRU to capture the global semantic information of the text; the BiGRU output is then fed into a CNN to capture the local semantic features of the text; finally, the feature vectors are passed to a Softmax layer to obtain the classification results. Experiments on a Chinese news headline dataset show that the BERT-BiGRU-CNN text classification model achieves an F1 value of 0.9485, outperforming the other baseline models and demonstrating that the BERT-BiGRU-CNN model can improve short text classification performance.
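
The pipeline described in the abstract can be illustrated in code. Below is a minimal PyTorch sketch of the BERT-BiGRU-CNN architecture; the checkpoint name and all hyperparameters (hidden sizes, kernel sizes, number of classes) are illustrative assumptions, not values taken from the paper.

    # Minimal sketch of the BERT -> BiGRU -> CNN -> Softmax pipeline.
    # Hyperparameter values below are assumptions for illustration only.
    import torch
    import torch.nn as nn
    from transformers import BertModel

    class BertBiGRUCNN(nn.Module):
        def __init__(self, num_classes=10, gru_hidden=128,
                     num_filters=100, kernel_sizes=(2, 3, 4)):
            super().__init__()
            # 1) BERT provides deep bidirectional contextual token representations.
            self.bert = BertModel.from_pretrained("bert-base-chinese")
            # 2) BiGRU captures global semantic information over the token sequence.
            self.bigru = nn.GRU(self.bert.config.hidden_size, gru_hidden,
                                batch_first=True, bidirectional=True)
            # 3) Parallel 1-D convolutions extract local n-gram features
            #    from the BiGRU output.
            self.convs = nn.ModuleList(
                nn.Conv1d(2 * gru_hidden, num_filters, k) for k in kernel_sizes
            )
            # 4) Linear layer producing class logits; softmax is applied in the
            #    loss function during training or at inference.
            self.fc = nn.Linear(num_filters * len(kernel_sizes), num_classes)

        def forward(self, input_ids, attention_mask):
            token_vecs = self.bert(input_ids,
                                   attention_mask=attention_mask).last_hidden_state
            gru_out, _ = self.bigru(token_vecs)      # (batch, seq_len, 2*gru_hidden)
            x = gru_out.transpose(1, 2)              # Conv1d expects (batch, channels, seq_len)
            pooled = [torch.relu(conv(x)).max(dim=2).values for conv in self.convs]
            features = torch.cat(pooled, dim=1)      # concatenated local features
            return self.fc(features)                 # class logits

Max-over-time pooling after each convolution is a common choice for selecting the most salient local feature per filter; the paper may use a different pooling or fusion scheme.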

Key words: text classification, bidirectional encoder representations from transformers (BERT) word embedding, bidirectional gated recurrent unit (BiGRU), convolutional neural network (CNN)

CLC Number: 

  • TP391.1