中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
End-to-end scene text recognition using tree-structured models

文献类型:期刊论文

作者Shi, Cunzhao; Wang, Chunheng; Xiao, Baihua; Gao, Song; Hu, Jinlong; Wang Chunheng
刊名PATTERN RECOGNITION
出版日期2014-09-01
卷号47期号:9页码:2853-2866
关键词End-to-end Scene text recognition Part-based tree-structured models (TSMs) Normalized pictorial structure
英文摘要Detecting and recognizing text in natural images are quite challenging and have received much attention from the computer vision community in recent years. In this paper, we propose a robust end-to-end scene text recognition method, which utilizes tree-structured character models and normalized pictorial structured word models. For each category of characters, we build a part-based tree-structured model (TSM) so as to make use of the character-specific structure information as well as the local appearance information. The TSM could detect each part of the character and recognize the unique structure as well, seamlessly combining character detection and recognition together. As the TSMs could accurately detect characters from complex background, for text localization, we apply TSMs for all the characters on the coarse text detection regions to eliminate the false positives and search the possible missing characters as well. While for word recognition, we propose a normalized pictorial structure (PS) framework to deal with the bias caused by words of different lengths. Experimental results on a range of challenging public datasets (ICDAR 2003, ICDAR 2011, SVT) demonstrate that the proposed method outperforms state-of-the-art methods both for text localization and word recognition. (C) 2014 Elsevier Ltd. All rights reserved.
WOS标题词Science & Technology ; Technology
类目[WOS]Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
研究领域[WOS]Computer Science ; Engineering
关键词[WOS]POSE ESTIMATION ; SEGMENTATION ; IMAGES ; LOCALIZATION ; DETECT ; WILD
收录类别SCI
语种英语
WOS记录号WOS:000336872000006
源URL[http://ir.ia.ac.cn/handle/173211/3759]  
专题自动化研究所_复杂系统管理与控制国家重点实验室_影像分析与机器视觉团队
通讯作者Wang Chunheng
作者单位Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Shi, Cunzhao,Wang, Chunheng,Xiao, Baihua,et al. End-to-end scene text recognition using tree-structured models[J]. PATTERN RECOGNITION,2014,47(9):2853-2866.
APA Shi, Cunzhao,Wang, Chunheng,Xiao, Baihua,Gao, Song,Hu, Jinlong,&Wang Chunheng.(2014).End-to-end scene text recognition using tree-structured models.PATTERN RECOGNITION,47(9),2853-2866.
MLA Shi, Cunzhao,et al."End-to-end scene text recognition using tree-structured models".PATTERN RECOGNITION 47.9(2014):2853-2866.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。