一种工件表面压印字符识别网络

doi:10.13229/j.cnki.jdxbgxb.20230150

吉林大学学报(工学版) ›› 2024, Vol. 54 ›› Issue (7): 2072-2079.doi: 10.13229/j.cnki.jdxbgxb.20230150

• 计算机科学与技术 • 上一篇

一种工件表面压印字符识别网络

游新冬(),郭磊,韩晶(),吕学强

北京信息科技大学网络文化与数字传播北京市重点实验室，北京 100101

收稿日期:2023-02-21 出版日期:2024-07-01 发布日期:2024-08-05
通讯作者: 韩晶 E-mail:youxindong7895@126.com;hanjing@bistu.edu.cn
作者简介:游新冬（1979-），女，副教授，博士.研究方向：计算机视觉.E-mail： youxindong7895@126.com
基金资助:
国家自然科学基金项目(62171043);北京市自然科学基金项目(4212020)

An character recognition network for imprint character

Xin-dong YOU(),Lei GUO,Jing HAN(),Xue-qiang LYU

Beijing Key Laboratory of Internet Culture and Digital Communication，Beijing Information Science and Technology University，Beijing 100101，China

Received:2023-02-21 Online:2024-07-01 Published:2024-08-05
Contact: Jing HAN E-mail:youxindong7895@126.com;hanjing@bistu.edu.cn

摘要/Abstract

摘要：

工件表面的压印字符存在凹凸不平、锈蚀、风化等问题，导致传统的字符识别算法难以取得满意的效果。针对这一问题，将工件表面压印字符的识别视为一类特殊的目标检测问题，并针对其特性设计了一种两阶段识别网络：定位-分类网络。定位网络使用无锚框的方法提取字符感兴趣区域，有效解决了字符区域提取困难的问题。分类网络采用特征解耦的卷积模块和结构重参数化技术，能够在不增加额外参数的情况下提升分类的准确率。此外，分类网络采用跨域迁移学习的训练策略，能够有效解决实际应用中的小样本和类别不平衡问题。在自建螺栓数据集和SynthText数据集上的实验结果表明，该算法的整体精度能够达到98%和92%，优于对比算法。

关键词: 压印字符, 字符识别, 无锚框, 小样本, 目标检测

Abstract:

The imprint characters on the surface of the workpiece are uneven， rusty， and weathered， which the traditional character recognition methods hard to achieve satisfactory results. This paper regards the characters recognition task as a particular detection problem and designs a two-stage recognition network according to its characteristics： location and classification network. The location newtork uses the anchor-free method to extract the region of interest of characters， which effectively solves the problem of character region extraction. The classification network uses the Feature Decoupled Convolution Block and the Structural Re-parameterization technology， which can significantly improve the classification accuracy without any extra parameter. The transferring learning is used to solve the small sample problem and imbalance problem in the training stage. The experimental results on the self-built bolt dataset and the SynthText dataset show that the algorithm can achieve overall accuracies of 98% and 92%， respectively， which is superior to the compared algorithms.

Key words: imprint character, character recognition, anchor-free, small sample, object detection

中图分类号:

TP391

游新冬,郭磊,韩晶,吕学强. 一种工件表面压印字符识别网络[J]. 吉林大学学报(工学版), 2024, 54(7): 2072-2079.

Xin-dong YOU,Lei GUO,Jing HAN,Xue-qiang LYU. An character recognition network for imprint character[J]. Journal of Jilin University(Engineering and Technology Edition), 2024, 54(7): 2072-2079.

图/表 12

图1

图2

图3

图4

图5

图6

表1

表2

表3

表4

表5

图7

参考文献 16

1	黄慧宁, 张学军, 黄菊, 等. 基于深度学习YOLOv2算法的钢材压印字符识别研究[J]. 计算机科学与应用, 2020, 10(1): 126-135.
	Huang Hui-ning, Zhang Xue-jun, Huang Ju. Research on steel stamping character recognition on deep learning YOLOv2 algorithm[J]. Computer Science and Application, 2020, 10(1): 126-135.
2	Chen X, Jin L, Zhu Y, et al. Text recognition in the wild: a survey[J]. ACM Computing Surveys(CSUR), 2021, 54(2): 1-35.
3	Ding X, Guo Y, Ding G, et al. ACNet: strengthening the kernel skeletons for powerful CNN via asymmetric convolution blocks[C]∥IEEE/CVF International Conference on Computer Vision(ICCV), Seoul, Korea(South), 2019: 1911-1920.
4	Chen Y, Liu S, Shen X, et al. Fast point R-CNN[C]//Proceedings of the IEEE/CVF International Conference on Computer Vision,Seoul, Korea (South),2019: 9774-9783.
5	Qiao L, Zhao Y, Li Z, et al. Defrcn: decoupled faster r-cnn for few-shot object detection[C]∥Proceedings of the IEEE/CVF International Conference on Computer Vision, Montreal, Canada, 2021: 8681-8690.
6	Oksuz K, Cam B C, Kalkan S, et al. Imbalance problems in object detection: a review[J]. IEEE Trans Pattern Anal Mach Intell, 2021(10): 3388-3415.
7	Cheng T, Wang X, Huang L, et al. Boundary-preserving mask R-CNN[C]∥European Conference on Computer Vision, Berlin: Springer, 2020: 660-676.
8	He K, Girshick R, Dollár P. Rethinking imagenet pre-training[DB/OL].[2023-01-26]..
9	Simonyan K, Zisserman A. Very deep convolutional networks for large-scale image recognition[DB/OL].[2023-01-26]..
10	Ding X, Zhang X, Ma N, et al. RepVGG: making VGG-style convnets great again[C]//IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Nashville, USA,2021:No.01352.
11	Zhou Z, Siddiquee M, Tajbakhsh N, et al. UNet++: redesigning skip connections to exploit multiscale features in image segmentation[J]. IEEE Transactions on Medical Imaging, 2020, 39(6): 1856-1867.
12	Gupta A, Vedaldi A, Zisserman A. Synthetic data for text localisation in natural images[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 2315-2324.
13	Redmon J, Farhadi A. YOLOv3: an incremental improvement[DB/OL].[2023-01-27]..
14	Lin T Y, Goyal P, Girshick R, et al. Focal loss for dense object detection[DB/OL].[2023-01-27]..
15	Duan K, Bai S, Xie L, et al. Centernet: keypoint triplets for object detection[DB/OL].[2023-01-27]..
16	He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]∥Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, USA, 2016: 770-778.

相关文章 15

[1]	高云龙,任明,吴川,高文. 基于注意力机制改进的无锚框舰船检测模型[J]. 吉林大学学报(工学版), 2024, 54(5): 1407-1416.
[2]	陈仁祥,胡超超,胡小林,杨黎霞,张军,何家乐. 基于改进YOLOv5的驾驶员分心驾驶检测[J]. 吉林大学学报(工学版), 2024, 54(4): 959-968.
[3]	张云佐,郭威,李文博. 遥感图像密集小目标全方位精准检测算法[J]. 吉林大学学报(工学版), 2024, 54(4): 1105-1113.
[4]	王宏志,宋明轩,程超,解东旋. 基于改进YOLOv4-tiny算法的车距预警方法[J]. 吉林大学学报(工学版), 2024, 54(3): 741-748.
[5]	李晓旭,安文娟,武继杰,李真,张珂,马占宇. 通道注意力双线性度量网络[J]. 吉林大学学报(工学版), 2024, 54(2): 524-532.
[6]	王春华,李恩泽,肖敏. 多特征融合和孪生注意力网络的高分辨率遥感图像目标检测[J]. 吉林大学学报(工学版), 2024, 54(1): 240-250.
[7]	薛珊,张亚亮,吕琼莹,曹国华. 复杂背景下的反无人机系统目标检测算法[J]. 吉林大学学报(工学版), 2023, 53(3): 891-901.
[8]	陶博,颜伏伍,尹智帅,武冬梅. 基于高精度地图增强的三维目标检测算法[J]. 吉林大学学报(工学版), 2023, 53(3): 802-809.
[9]	刘晶红,邓安平,陈琪琪,彭佳琦,左羽佳. 基于多重注意力机制的无锚框目标跟踪算法[J]. 吉林大学学报(工学版), 2023, 53(12): 3518-3528.
[10]	黄彭奇子,段晓君,黄文伟,晏良. 基于元学习的小样本图像非对称缺陷检测方法[J]. 吉林大学学报(工学版), 2023, 53(1): 234-240.
[11]	高明华,杨璨. 基于改进卷积神经网络的交通目标检测方法[J]. 吉林大学学报(工学版), 2022, 52(6): 1353-1361.
[12]	曲优,李文辉. 基于锚框变换的单阶段旋转目标检测方法[J]. 吉林大学学报(工学版), 2022, 52(1): 162-173.
[13]	曹洁,屈雪,李晓旭. 基于滑动特征向量的小样本图像分类方法[J]. 吉林大学学报(工学版), 2021, 51(5): 1785-1791.
[14]	潘德伦,冀隽,张跃进. 基于运动矢量空间编码的视频监控动态目标检测方法[J]. 吉林大学学报(工学版), 2021, 51(4): 1370-1374.
[15]	金立生,郭柏苍,王芳荣,石健. 基于改进YOLOv3的车辆前方动态多目标检测算法[J]. 吉林大学学报(工学版), 2021, 51(4): 1427-1436.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed

数据集	算法	准确率	召回率	F₁值
自建螺栓数据集	Faster R-CNN	0.975	0.962	0.969
	YOLOv5	0.979	0.970	0.974
	CenterNet	0.980	0.949	0.964
	RetinaNet	0.943	0.906	0.924
	本文	0.996	0.997	0.996
SynthText数据集	Faster R-CNN	0.955	0.937	0.946
	YOLOv5	0.962	0.939	0.950
	CenterNet	0.945	0.920	0.932
	RetinaNet	0.898	0.886	0.891
	本文	0.973	0.963	0.968

数据集	算法	基线准确率	替换后准确率	基线帧率/（帧·s^-1）	替换后帧率/（帧·s^-1）
自建螺栓数据集	VGG16	0.963	0.974	112.7	113.9
自建螺栓数据集	ResNet50	0.975	0.981	95.1	95.8
SynthText数据集	VGG16	0.915	0.922	184.8	186.3
SynthText数据集	ResNet50	0.923	0.932	148.5	149.1

算法	准确率	召回率	F₁值
Faster R-CNN	0.947	0.934	0.941
YOLOv5	0.967	0.956	0.962
CenterNet	0.961	0.931	0.946
RetinaNet	0.923	0.887	0.905
本文	0.985	0.986	0.985

算法	准确率	召回率	F₁值
Faster R-CNN	0.904	0.887	0.895
YOLOv5	0.915	0.893	0.903
CenterNet	0.877	0.864	0.870
RetinaNet	0.843	0.827	0.835
本文	0.923	0.914	0.918

一种工件表面压印字符识别网络

An character recognition network for imprint character

RICH HTML

PDF (PC)

摘要/Abstract

引用本文

使用本文

图/表 12

参考文献 16

相关文章 15

Metrics

本文评价

推荐阅读 0