吉林大学学报(工学版) ›› 2013, Vol. 43 ›› Issue (01): 250-255.
姜维, 卢朝阳, 李静, 刘晓佩
JIANG Wei, LU Zhao-yang, LI Jing, LIU Xiao-pei
摘要: 针对复杂场景中纹理丰富的非文字区对文字定位算法的干扰,提出了基于光度不变量的角点类别特征和边缘幅值方向梯度直方图(Histogram of oriented gradients of edge magnitude,HOG-EM)统计特征两种新特征,并据此设计了一种两级多层复杂场景文字定位算法。首先获取边缘图像并提取根据HSL颜色空间特性划分的8层二值化图像,将其组成9层子图并做连通域分析提取文字候选区。然后提取文字候选区的角点类别特征和HOG-EM统计特征,将二者分别用于剔除非文字候选区和获取文字。实验表明:本文算法可以较为准确地剔除纹理丰富的非文字区,有效地降低复杂场景文字定位算法的虚警率,取得比较理想的准确率和召回率。
中图分类号:
| [1] Jung K, Kim K I, Jain A K. Text information extraction in images and video: a survey[J]. Pattern Recogntion, 2004, 37(5): 977-997.[2] Liang J, Doermann D, Li H P. Camera-based analysis of text and documents: a survey[J]. International Journal Document Analysis and Recognition, 2005, 7(2-3):84-104.[3] Anoual H, El Fkihi Sanaa, Jilbab Abdelilah, et al. New approach based on texture and geometric features for text detection//Proceedings of the 4th International Conference on Image and Signal Processing, Berlin, Heidelberg, 2010:157-164.[4] Wang X, Huang L, Liu C. A video text location method based on background classification[J]. International Journal on Document Analysis and Recognition,2010,13(3): 173-186.[5] Pan Yi-feng, Hou Xin-wen, Liu Cheng-lin. Text localization in natural scene images based on conditional random field//International Conference on Document Analysis and Recognition, Barcelona, Spain, 2009: 6-10.[6] Roy P P, Pal U, Lladós J. Text line extraction in graphical documents using background and foreground information[J]. International Journal on Document Analysis and Recognition, 2011,15:227-241.[7] Yi Chu-cai, Tian Ying-li. Text string detection from natural scenes by structure-based partition and grouping[J]. IEEE Transactions on Image Processing, 2011, 20(9): 2594-2605.[8] Kunishige Y, Yaokai F, Uchida S. Scenery character detection with environmental context//International Conference on Document Analysis and Recognition,Beijing, China, 2011:1049-1053.[9] Zhao X, Lin K H, Fu Y, et al. Text from corners: a novel approach to detect text and caption in videos[J]. IEEE Transactions on Image Processing, 2011, 20(3): 790-799.[10] Gevers T. Reflectance-based classification of color edges//Proceedings of 9th IEEE International Conference on Computer Vision, 2003: 856-861.[11] Shafer S A. Using color to separate reflection components[J]. Color Research and Applications, 1985, 10(4): 210-218.[12] Rosten E, Porter R, Drummond T. Faster and better: a machine learning approach to corner detection[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2010, 32(1): 105-119.[13] Dalal N, Triggs B. Histograms of oriented gradients for human detection//IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2005: 886-893.[14] Lucas S M. ICDAR 2005 text locating competition results//The 8th International Conference on Document Analysis and Recognition, 2005: 80-84. |
| [1] | 苏寒松,代志涛,刘高华,张倩芳. 结合吸收Markov链和流行排序的显著性区域检测[J]. 吉林大学学报(工学版), 2018, 48(6): 1887-1894. |
| [2] | 徐岩,孙美双. 基于卷积神经网络的水下图像增强方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1895-1903. |
| [3] | 黄勇,杨德运,乔赛,慕振国. 高分辨合成孔径雷达图像的耦合传统恒虚警目标检测[J]. 吉林大学学报(工学版), 2018, 48(6): 1904-1909. |
| [4] | 李居朋,张祖成,李墨羽,缪德芳. 基于Kalman滤波的电容屏触控轨迹平滑算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1910-1916. |
| [5] | 应欢,刘松华,唐博文,韩丽芳,周亮. 基于自适应释放策略的低开销确定性重放方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1917-1924. |
| [6] | 陆智俊,钟超,吴敬玉. 星载合成孔径雷达图像小特征的准确分割方法[J]. 吉林大学学报(工学版), 2018, 48(6): 1925-1930. |
| [7] | 刘仲民,王阳,李战明,胡文瑾. 基于简单线性迭代聚类和快速最近邻区域合并的图像分割算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1931-1937. |
| [8] | 单泽彪,刘小松,史红伟,王春阳,石要武. 动态压缩感知波达方向跟踪算法[J]. 吉林大学学报(工学版), 2018, 48(6): 1938-1944. |
| [9] | 姚海洋, 王海燕, 张之琛, 申晓红. 双Duffing振子逆向联合信号检测模型[J]. 吉林大学学报(工学版), 2018, 48(4): 1282-1290. |
| [10] | 全薇, 郝晓明, 孙雅东, 柏葆华, 王禹亭. 基于实际眼结构的个性化投影式头盔物镜研制[J]. 吉林大学学报(工学版), 2018, 48(4): 1291-1297. |
| [11] | 陈绵书, 苏越, 桑爱军, 李培鹏. 基于空间矢量模型的图像分类方法[J]. 吉林大学学报(工学版), 2018, 48(3): 943-951. |
| [12] | 陈涛, 崔岳寒, 郭立民. 适用于单快拍的多重信号分类改进算法[J]. 吉林大学学报(工学版), 2018, 48(3): 952-956. |
| [13] | 孟广伟, 李荣佳, 王欣, 周立明, 顾帅. 压电双材料界面裂纹的强度因子分析[J]. 吉林大学学报(工学版), 2018, 48(2): 500-506. |
| [14] | 林金花, 王延杰, 孙宏海. 改进的自适应特征细分方法及其对Catmull-Clark曲面的实时绘制[J]. 吉林大学学报(工学版), 2018, 48(2): 625-632. |
| [15] | 王柯, 刘富, 康冰, 霍彤彤, 周求湛. 基于沙蝎定位猎物的仿生震源定位方法[J]. 吉林大学学报(工学版), 2018, 48(2): 633-639. |
|
||