中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Handwritten Chinese text line segmentation by clustering with distance metric learning

文献类型:期刊论文

作者Yin, Fei; Liu, Cheng-Lin
刊名PATTERN RECOGNITION
出版日期2009-12-01
卷号42期号:12页码:3146-3157
关键词Handwritten text line segmentation Clustering Minimal spanning tree (MST) Distance metric learning Hypervolume reduction
英文摘要Separating text lines in unconstrained handwritten documents remains a challenge because the handwritten text lines are often un-uniformly skewed and curved, and the space between lines is not obvious. In this paper, we propose a novel text line segmentation algorithm based on minimal spanning tree (MST) clustering with distance metric learning. Given a distance metric, the connected components (CCs) of document image are grouped into a tree structure, from which text lines are extracted by dynamically cutting the edges using a new hypervolume reduction criterion and a straightness measure. By learning the distance metric in supervised learning on a dataset of pairs of CCs, the proposed algorithm is made robust to handle various documents with multi-skewed and curved text lines. In experiments on a database with 803 unconstrained handwritten Chinese document images containing a total of 8,169 lines, the proposed algorithm achieved a correct rate 98.02% of line detection, and compared favorably to other competitive algorithms. (C) 2009 Elsevier Ltd. All rights reserved.
WOS标题词Science & Technology ; Technology
类目[WOS]Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
研究领域[WOS]Computer Science ; Engineering
关键词[WOS]EMPIRICAL PERFORMANCE EVALUATION ; LAYOUT ANALYSIS ; RECOGNITION ; ALGORITHMS ; DOCUMENTS ; EXTRACTION
收录类别SCI ; ISTP
语种英语
WOS记录号WOS:000269727800004
源URL[http://ir.ia.ac.cn/handle/173211/3064]  
专题自动化研究所_模式识别国家重点实验室_模式分析与学习团队
作者单位Chinese Acad Sci, Inst Automat, NLPR, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Yin, Fei,Liu, Cheng-Lin. Handwritten Chinese text line segmentation by clustering with distance metric learning[J]. PATTERN RECOGNITION,2009,42(12):3146-3157.
APA Yin, Fei,&Liu, Cheng-Lin.(2009).Handwritten Chinese text line segmentation by clustering with distance metric learning.PATTERN RECOGNITION,42(12),3146-3157.
MLA Yin, Fei,et al."Handwritten Chinese text line segmentation by clustering with distance metric learning".PATTERN RECOGNITION 42.12(2009):3146-3157.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。