An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition
Document type: Journal article
Authors | MingMing Yu (于明明); Zhang H (张恒) |
Journal | Pattern Recognition |
Publication date | 2024 |
Pages | 110373 |
Abstract | Text line recognition methods can be categorized into explicit-segmentation-based and implicit-segmentation-based methods. Explicit-segmentation-based methods require character-level annotation during training, while implicit-segmentation-based methods, trained on line-level annotated data, face alignment drift challenges. Though some methods have been proposed to address these challenges using weakly supervised object detection, they often rely on cumbersome pseudo-box generation processes and complex decoding. In this paper, we propose a unified framework to overcome these challenges, achieving high accuracy in text recognition and character segmentation. To eliminate the need for character-level annotated real text line data in training, we introduce a novel training paradigm that jointly uses character-level annotated synthetic data and line-level annotated real data. For synthetic data, candidate characters are explicitly aligned with labeled characters to generate hard labels for supervising model training. For real data, implicit alignment is produced by Connectionist Temporal Classification (CTC) mapping to provide soft labels for weakly supervised model training. For inference, we propose two decoding strategies that leverage the advantages of Non-Maximum Suppression (NMS) and CTC decoding. Extensive experiments on benchmark datasets demonstrate the superior performance of our method in text recognition and character localization, even with minimal amounts of character-level annotated line data. |
Language | English |
Source URL | [http://ir.ia.ac.cn/handle/173211/57526] |
Collection | Institute of Automation_National Laboratory of Pattern Recognition_Pattern Analysis and Learning Group |
Corresponding author | Cheng-Lin Liu (刘成林) |
Recommended citation GB/T 7714 | MingMing Yu,Zhang H,Fei Yin,et al. An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition[J]. Pattern Recognition,2024:110373. |
APA | MingMing Yu,Zhang H,Fei Yin,&Cheng-Lin Liu.(2024).An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition.Pattern Recognition,110373. |
MLA | MingMing Yu,et al."An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition".Pattern Recognition (2024):110373. |
Ingestion method: OAI harvesting
Source: Institute of Automation