中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Integrated Method for Text Detection in Natural Scene Images

文献类型:期刊论文

作者Yang Zheng1; Jie Liu2; Heping Liu1; Qing Li1; Gen Li2
刊名KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS
出版日期2016-11-30
卷号10期号:11页码:5583-5604
关键词Stroke Color Extension Character Classifier Character Color Transform Convolutional Neural Network
英文摘要In this paper, we present a novel image operator to extract textual information in natural scene images. First, a powerful refiner called the Stroke Color Extension, which extends the widely used Stroke Width Transform by incorporating color information of strokes, is proposed to achieve significantly enhanced performance on intra-character connection and non-character removal. Second, a character classifier is trained by using gradient features. The classifier not only eliminates non-character components but also remains a large number of characters. Third, an effective extractor called the Character Color Transform combines color information of characters and geometry features. It is used to extract potential characters which are not correctly extracted in previous steps. Fourth, a Convolutional Neural Network model is used to verify text candidates, improving the performance of text detection. The proposed technique is tested on two public datasets, i.e., ICDAR2011 dataset and ICDAR2013 dataset. The experimental results show that our approach achieves state-of-the-art performance.; In this paper, we present a novel image operator to extract textual information in natural scene images. First, a powerful refiner called the Stroke Color Extension, which extends the widely used Stroke Width Transform by incorporating color information of strokes, is proposed to achieve significantly enhanced performance on intra-character connection and non-character removal. Second, a character classifier is trained by using gradient features. The classifier not only eliminates non-character components but also remains a large number of characters. Third, an effective extractor called the Character Color Transform combines color information of characters and geometry features. It is used to extract potential characters which are not correctly extracted in previous steps. Fourth, a Convolutional Neural Network model is used to verify text candidates, improving the performance of text detection. The proposed technique is tested on two public datasets, i.e., ICDAR2011 dataset and ICDAR2013 dataset. The experimental results show that our approach achieves state-of-the-art performance.
源URL[http://ir.ia.ac.cn/handle/173211/19529]  
专题数字内容技术与服务研究中心_新媒体服务与管理技术
作者单位1.北京科技大学
2.中国科学院自动化研究所
推荐引用方式
GB/T 7714
Yang Zheng,Jie Liu,Heping Liu,et al. Integrated Method for Text Detection in Natural Scene Images[J]. KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS,2016,10(11):5583-5604.
APA Yang Zheng,Jie Liu,Heping Liu,Qing Li,&Gen Li.(2016).Integrated Method for Text Detection in Natural Scene Images.KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS,10(11),5583-5604.
MLA Yang Zheng,et al."Integrated Method for Text Detection in Natural Scene Images".KSII TRANSACTIONS ON INTERNET AND INFORMATION SYSTEMS 10.11(2016):5583-5604.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。