Chinese Academy of Sciences Institutional Repositories Grid
Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks

Document type: Journal article

Authors: Jun-Yu Ye (2,3); Yan-Ming Zhang (2); Qing Yang (2); Cheng-Lin Liu (1,2,3)
Journal: Pattern Recognition
Publication date: 2021-02
Volume: 114; Issue: 114; Pages: 107859
Keywords: Online handwritten documents; Stroke classification; Text line grouping; Graph neural networks; Edge pooling attention networks
ISSN: 0031-3203
DOI: 10.1016/j.patcog.2021.107859
Corresponding author: Liu, Cheng-Lin (liucl@nlpr.ia.ac.cn)
Document subtype: Original research
Abstract

Stroke classification and text line grouping are important tasks in online handwritten document segmentation. In the past, the two tasks were usually handled by separate models that were trained independently and applied sequentially. Such a pipeline cannot optimize the integration of contextual information, and the system may suffer from accumulated errors from stroke classification. In this paper, we propose a method for joint text/non-text stroke classification and text line grouping in online handwritten documents using an attention-based graph neural network. In our framework, stroke classification and text line grouping are formulated as node classification and node clustering problems in a relational graph, which is constructed from the temporal and spatial relationships between strokes. We propose a new graph network architecture, called the edge pooling attention network (EPAT), to efficiently aggregate information between the features of neighboring nodes and edges. The proposed model is trained by multi-task learning, with a cross-entropy loss for node classification and a distance metric loss for node clustering. In experiments on two online handwritten document datasets, IAMOnDo and Kondate, the proposed method proves effective, yielding superior performance in both stroke classification and text line grouping.
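
The multi-task objective described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the form of the distance metric loss, so a standard contrastive pairwise loss is swapped in here as an assumption, and every name (`cross_entropy`, `contrastive_loss`, `joint_loss`, `weight`, `margin`) is hypothetical. The sketch takes per-stroke class logits and embeddings as given (in the paper these would come from the graph network) and combines a node classification term with a clustering term computed over graph edges.

```python
import math

def cross_entropy(logits, label):
    """Softmax cross-entropy for one node, computed in a numerically stable way."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[label]

def contrastive_loss(emb_a, emb_b, same_line, margin=1.0):
    """Assumed stand-in for the distance metric loss: pull strokes of the same
    text line together, push other pairs at least `margin` apart."""
    d = math.dist(emb_a, emb_b)
    if same_line:
        return d ** 2
    return max(0.0, margin - d) ** 2

def joint_loss(logits, labels, embeddings, edges, line_ids, weight=1.0, margin=1.0):
    """Classification loss averaged over nodes plus a weighted clustering loss
    averaged over graph edges. `line_ids[i]` is None for non-text strokes."""
    cls = sum(cross_entropy(l, y) for l, y in zip(logits, labels)) / len(labels)
    pair_terms = []
    for i, j in edges:
        # Two strokes count as "same line" only if both belong to a text line.
        same = line_ids[i] is not None and line_ids[i] == line_ids[j]
        pair_terms.append(contrastive_loss(embeddings[i], embeddings[j], same, margin))
    grp = sum(pair_terms) / len(pair_terms) if pair_terms else 0.0
    return cls + weight * grp

# Toy example: three strokes, two on the same text line, one non-text stroke.
logits = [[2.0, 0.0], [2.0, 0.0], [0.0, 2.0]]       # class 0 = text, 1 = non-text
labels = [0, 0, 1]
embeddings = [[0.0, 0.0], [0.1, 0.0], [2.0, 2.0]]
edges = [(0, 1), (1, 2)]                             # temporal/spatial neighbors
line_ids = [0, 0, None]
print(joint_loss(logits, labels, embeddings, edges, line_ids))
```

Summing the two terms with a single `weight` is one simple way to balance the tasks; the balance used in the paper is not stated in this record.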

Funding projects: National Key Research and Development Program [2018YFB1005000]; National Natural Science Foundation of China (NSFC) [61773376]; National Natural Science Foundation of China (NSFC) [61721004]
WOS research areas: Computer Science; Engineering
Language: English
WOS accession number: WOS:000632385300013
Publisher: ELSEVIER SCI LTD
Funding organizations: National Key Research and Development Program; National Natural Science Foundation of China (NSFC)
Source URL: http://ir.ia.ac.cn/handle/173211/43291
Collection: Institute of Automation / National Laboratory of Pattern Recognition / Pattern Analysis and Learning Group
Corresponding author: Cheng-Lin Liu
Author affiliations:
1. School of Artificial Intelligence, University of Chinese Academy of Sciences
2. National Laboratory of Pattern Recognition, Institute of Automation of Chinese Academy of Sciences
3. CAS Center for Excellence of Brain Science and Intelligence Technology
Recommended citation:
GB/T 7714: Jun-Yu Ye, Yan-Ming Zhang, Qing Yang, et al. Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks[J]. Pattern Recognition, 2021, 114(114): 107859.
APA: Jun-Yu Ye, Yan-Ming Zhang, Qing Yang, & Cheng-Lin Liu. (2021). Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks. Pattern Recognition, 114(114), 107859.
MLA: Jun-Yu Ye, et al. "Joint stroke classification and text line grouping in online handwritten documents with edge pooling attention networks." Pattern Recognition 114.114 (2021): 107859.

Ingestion method: OAI harvesting

Source: Institute of Automation

