基于角点类别特征和边缘幅值方向梯度直方图统计特征的复杂场景文字定位算法

吉林大学学报(工学版) ›› 2013, Vol. 43 ›› Issue (01): 250-255.

基于角点类别特征和边缘幅值方向梯度直方图统计特征的复杂场景文字定位算法

姜维, 卢朝阳, 李静, 刘晓佩

西安电子科技大学综合业务网国家重点实验室, 西安 710071

收稿日期:2011-12-14 出版日期:2013-01-01 发布日期:2013-01-01
通讯作者: 卢朝阳(1963-),男,教授,博士生导师.研究方向:图像分析与图像理解,图像与视频编码,生物特征,图文分析,基于图像分析的智能交通系统应用.E-mail:zhylu@xidian.edu.cn E-mail:mail:zhylu@xidian.edu.cn
作者简介:姜维(1981-),男,博士研究生.研究方向:自然场景文字定位和识别.E-mail:jwmianzu@gmail.com
基金资助:
国家自然科学基金项目(60872141);中央高校基本科研业务费专项基金项目(K50510010007);华为科技基金项目(HITC2011023).

Text localization algorithm in complex scene based on corner-type feature and histogram of oriented gradients of edge magnitude statistical feature

JIANG Wei, LU Zhao-yang, LI Jing, LIU Xiao-pei

State Key Laboratory of Integrated Service Networks, Xidian University, Xi'an 710071, China

Received:2011-12-14 Online:2013-01-01 Published:2013-01-01

摘要/Abstract

摘要： 针对复杂场景中纹理丰富的非文字区对文字定位算法的干扰,提出了基于光度不变量的角点类别特征和边缘幅值方向梯度直方图(Histogram of oriented gradients of edge magnitude,HOG-EM)统计特征两种新特征,并据此设计了一种两级多层复杂场景文字定位算法。首先获取边缘图像并提取根据HSL颜色空间特性划分的8层二值化图像,将其组成9层子图并做连通域分析提取文字候选区。然后提取文字候选区的角点类别特征和HOG-EM统计特征,将二者分别用于剔除非文字候选区和获取文字。实验表明:本文算法可以较为准确地剔除纹理丰富的非文字区,有效地降低复杂场景文字定位算法的虚警率,取得比较理想的准确率和召回率。

关键词: 信息处理技术, 文字定位, 角点类别, 方向梯度直方图, 光度不变量

Abstract: The corner-type feature based on photometric invariants and the Histogram of Oriented Gradients of Edge Magnitude (HOG-EM) statistical feature are proposed to overcome the interference of the texture-rich non-text regions to the text localization algorithm. A two-stage multilayer text localization algorithm in complex scene is presented on the basis of the two novel features. In the proposed method, first, edge map is obtained and eight layers of binary maps in the Hue Saturation and Lightness (HSL) space are generated according to the characteristics of the HSL space. Then, nine layers of sub-maps are formed to gain text candidate blocks with multilayer connected component analysis. Finally, the two novel features mentioned above are extracted to remove the non-text block from the text candidate blocks and keep the text. Experiments indicate that the proposed scheme can efficiently remove non-text texture-rich regions, decrease false-alarm rate and obtain reasonable accuracy and recall rate.

Key words: information processing technology, text localization, corner-type, histogram of oriented gradients, photometric invariants

中图分类号:

TN911.72

姜维, 卢朝阳, 李静, 刘晓佩. 基于角点类别特征和边缘幅值方向梯度直方图统计特征的复杂场景文字定位算法[J]. 吉林大学学报(工学版), 2013, 43(01): 250-255.

JIANG Wei, LU Zhao-yang, LI Jing, LIU Xiao-pei. Text localization algorithm in complex scene based on corner-type feature and histogram of oriented gradients of edge magnitude statistical feature[J]. 吉林大学学报(工学版), 2013, 43(01): 250-255.

参考文献

[1] Jung K, Kim K I, Jain A K. Text information extraction in images and video: a survey[J]. Pattern Recogntion, 2004, 37(5): 977-997.

[2] Liang J, Doermann D, Li H P. Camera-based analysis of text and documents: a survey[J]. International Journal Document Analysis and Recognition, 2005, 7(2-3):84-104.

[3] Anoual H, El Fkihi Sanaa, Jilbab Abdelilah, et al. New approach based on texture and geometric features for text detection//Proceedings of the 4th International Conference on Image and Signal Processing, Berlin, Heidelberg, 2010:157-164.

[4] Wang X, Huang L, Liu C. A video text location method based on background classification[J]. International Journal on Document Analysis and Recognition,2010,13(3): 173-186.

[5] Pan Yi-feng, Hou Xin-wen, Liu Cheng-lin. Text localization in natural scene images based on conditional random field//International Conference on Document Analysis and Recognition, Barcelona, Spain, 2009: 6-10.

[6] Roy P P, Pal U, Lladós J. Text line extraction in graphical documents using background and foreground information[J]. International Journal on Document Analysis and Recognition, 2011,15:227-241.

[7] Yi Chu-cai, Tian Ying-li. Text string detection from natural scenes by structure-based partition and grouping[J]. IEEE Transactions on Image Processing, 2011, 20(9): 2594-2605.

[8] Kunishige Y, Yaokai F, Uchida S. Scenery character detection with environmental context//International Conference on Document Analysis and Recognition,Beijing, China, 2011:1049-1053.

[9] Zhao X, Lin K H, Fu Y, et al. Text from corners: a novel approach to detect text and caption in videos[J]. IEEE Transactions on Image Processing, 2011, 20(3): 790-799.

[10] Gevers T. Reflectance-based classification of color edges//Proceedings of 9th IEEE International Conference on Computer Vision, 2003: 856-861.

[11] Shafer S A. Using color to separate reflection components[J]. Color Research and Applications, 1985, 10(4): 210-218.

[12] Rosten E, Porter R, Drummond T. Faster and better: a machine learning approach to corner detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(1): 105-119.

[13] Dalal N, Triggs B. Histograms of oriented gradients for human detection//IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005: 886-893.

[14] Lucas S M. ICDAR 2005 text locating competition results//The 8th International Conference on Document Analysis and Recognition, 2005: 80-84.

相关文章 15

[1]	苏寒松,代志涛,刘高华,张倩芳. 结合吸收Markov链和流行排序的显著性区域检测[J]. 吉林大学学报(工学版), 2018, 48(6): 1887-1894.
[2]	徐岩,孙美双. 基于卷积神经网络的水下图像增强方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1895-1903.
[3]	黄勇,杨德运,乔赛,慕振国. 高分辨合成孔径雷达图像的耦合传统恒虚警目标检测[J]. 吉林大学学报(工学版), 2018, 48(6): 1904-1909.
[4]	李居朋,张祖成,李墨羽,缪德芳. 基于Kalman滤波的电容屏触控轨迹平滑算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1910-1916.
[5]	应欢,刘松华,唐博文,韩丽芳,周亮. 基于自适应释放策略的低开销确定性重放方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1917-1924.
[6]	陆智俊,钟超,吴敬玉. 星载合成孔径雷达图像小特征的准确分割方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1925-1930.
[7]	刘仲民,王阳,李战明,胡文瑾. 基于简单线性迭代聚类和快速最近邻区域合并的图像分割算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1931-1937.
[8]	单泽彪,刘小松,史红伟,王春阳,石要武. 动态压缩感知波达方向跟踪算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1938-1944.
[9]	姚海洋, 王海燕, 张之琛, 申晓红. 双Duffing振子逆向联合信号检测模型[J]. 吉林大学学报(工学版), 2018, 48(4): 1282-1290.
[10]	全薇, 郝晓明, 孙雅东, 柏葆华, 王禹亭. 基于实际眼结构的个性化投影式头盔物镜研制[J]. 吉林大学学报(工学版), 2018, 48(4): 1291-1297.
[11]	陈绵书, 苏越, 桑爱军, 李培鹏. 基于空间矢量模型的图像分类方法[J]. 吉林大学学报(工学版), 2018, 48(3): 943-951.
[12]	陈涛, 崔岳寒, 郭立民. 适用于单快拍的多重信号分类改进算法[J]. 吉林大学学报(工学版), 2018, 48(3): 952-956.
[13]	孟广伟, 李荣佳, 王欣, 周立明, 顾帅. 压电双材料界面裂纹的强度因子分析[J]. 吉林大学学报(工学版), 2018, 48(2): 500-506.
[14]	林金花, 王延杰, 孙宏海. 改进的自适应特征细分方法及其对Catmull-Clark曲面的实时绘制[J]. 吉林大学学报(工学版), 2018, 48(2): 625-632.
[15]	王柯, 刘富, 康冰, 霍彤彤, 周求湛. 基于沙蝎定位猎物的仿生震源定位方法[J]. 吉林大学学报(工学版), 2018, 48(2): 633-639.

Metrics

Viewed

Full text

Abstract

Cited

Shared

Discussed