中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Semi-supervised Long-tail Endoscopic Image Classification

文献类型:期刊论文

作者Runnan Cao2,3; Mengjie Fang2,3; Hailing Li4; Jie Tian1,2,4; Di Dong2,3
刊名Chinese Medical Sciences Journal
出版日期2022-09-28
卷号37期号:3页码:171-181
关键词endoscopic image artificial intelligence semi-supervised learning long-tail distribution image classification
ISSN号1001-9294
DOI10.24920/003834
英文摘要

Objective To explore the semi-supervised learning (SSL) algorithm for long-tail endoscopic image classification with limited annotations.

Method We explored semi-supervised long-tail endoscopic image classification in HyperKvasir, the largest gastrointestinal public dataset with 23 diverse classes. Semi-supervised learning algorithm FixMatch was applied based on consistency regularization and pseudo-labeling. After splitting the training dataset and the test dataset at a ratio of 4:1, we sampled 20%, 50%, and 100% labeled training data to test the classification with limited annotations.

Results The classification performance was evaluated by micro-average and macro-average evaluation metrics, with the Mathews correlation coefficient (MCC) as the overall evaluation. SSL algorithm improved the classification performance with MCC increased from 0.8761 to 0.8850, from 0.8983 to 0.8994, and from 0.9075 to 0.9095 with 20%, 50%, and 100% ratio of labeled training data, respectively. With a 20% ratio of labeled training data, SSL improved both the micro-average and macro-average classification performance; while for the ratio of 50% and 100%, SSL improved the micro-average performance but hurt macro-average performance. Through analyzing the confusion matrix and labeling bias in each class, we found that the pseudo-based SSL algorithm exacerbated the classifier’s preference for the head class, resulting in improved performance in the head class and degenerated performance in the tail class.

Conclusion SSL can improve the classification performance for semi-supervised long-tail endoscopic image classification, especially when the labeled data is extremely limited, which may benefit the building of assisted diagnosis systems for low-volume hospitals. However, the pseudo-labeling strategy may amplify the effect of class imbalance, which hurts the classification performance for the tail class.

语种英语
源URL[http://ir.ia.ac.cn/handle/173211/51985]  
专题自动化研究所_中国科学院分子影像重点实验室
通讯作者Di Dong
作者单位1.Engineering Research Center of Molecular and Neuro Imaging of Ministry of Education, School of Life Science and Technology, Xidian University, Xi’an, Shanxi, 710126, China
2.CAS Key Laboratory of Molecular Imaging, Beijing Key Laboratory of Molecular Imaging, the State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences, Beijing 100190, China
3.School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing 100049, China
4.Beijing Advanced Innovation Center for Big Data-Based Precision Medicine, School of Engineering Medicine, Beihang University, Beijing, 100191, China
推荐引用方式
GB/T 7714
Runnan Cao,Mengjie Fang,Hailing Li,et al. Semi-supervised Long-tail Endoscopic Image Classification[J]. Chinese Medical Sciences Journal,2022,37(3):171-181.
APA Runnan Cao,Mengjie Fang,Hailing Li,Jie Tian,&Di Dong.(2022).Semi-supervised Long-tail Endoscopic Image Classification.Chinese Medical Sciences Journal,37(3),171-181.
MLA Runnan Cao,et al."Semi-supervised Long-tail Endoscopic Image Classification".Chinese Medical Sciences Journal 37.3(2022):171-181.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。