中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
AMDO: An Over-Sampling Technique for Multi-Class Imbalanced Problems

文献类型:期刊论文

作者Yang, Xuebing1,2; Kuang, Qiuming1,2; Zhang, Wensheng1,2; Zhang, Guoping3; Wensheng Zhang
刊名IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
出版日期2018-09-01
卷号30期号:9页码:1672-1685
关键词Multi-class Imbalanced Problems Over-sampling Mdo Mixed-type Data
DOI10.1109/TKDE.2017.2761347
文献子类Article
英文摘要 ; Multi-class imbalanced problems have attracted growing attention from the real-world classification tasks in engineering. The underlying skewed distribution of multiple classes poses difficulties for learning algorithms, which becomes more challenging when considering overlapping between classes, lack of representative data, and mixed-type data. In this work, we address this problem in a data-oriented way. Motivated by a recently proposed over-sampling technique designed for numeric data sets, Mahalanobis Distance-based Over-sampling (MDO), we use this technique to capture the covariance structure of the minority class and to generate synthetic samples along the probability contours for learning algorithms. Based on MDO, we further improve the over-sampling strategy and generalize it for mixed-type data sets. The established technique, Adaptive Mahalanobis Distance-based Over-sampling (AMDO), introduces GSVD (Generalized Singular Value Decomposition) for mixed-type data, develops a partially balanced resampling scheme and optimizes the sample synthesis. Theoretical analysis is conducted to demonstrate the reasonability of AMDO. Extensive experimental testing is performed on 15 multi-class imbalanced benchmarks and two data sets for precipitation phase recognition in comparison with several state-of-the-art multi-class imbalanced learning methods. The results validate the effectiveness and robustness of our proposal.
WOS关键词DATA-SETS ; NEURAL-NETWORKS ; CLASSIFICATION ; CLASSIFIERS ; SMOTE ; ALGORITHMS ; SYSTEM
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000440853500004
资助机构National Natural Science Foundation of China(61432008 ; 61472423 ; U1636220)
源URL[http://ir.ia.ac.cn/handle/173211/20866]  
专题精密感知与控制研究中心_人工智能与机器学习
通讯作者Wensheng Zhang
作者单位1.Chinese Acad Sci, Res Ctr Precis Sensing & Control, Inst Automat, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 101408, Peoples R China
3.CMA, Joint Lab Meteorol Data & Machine Learning, Publ Meteorol Serv Ctr, Beijing 100081, Peoples R China
推荐引用方式
GB/T 7714
Yang, Xuebing,Kuang, Qiuming,Zhang, Wensheng,et al. AMDO: An Over-Sampling Technique for Multi-Class Imbalanced Problems[J]. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING,2018,30(9):1672-1685.
APA Yang, Xuebing,Kuang, Qiuming,Zhang, Wensheng,Zhang, Guoping,&Wensheng Zhang.(2018).AMDO: An Over-Sampling Technique for Multi-Class Imbalanced Problems.IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING,30(9),1672-1685.
MLA Yang, Xuebing,et al."AMDO: An Over-Sampling Technique for Multi-Class Imbalanced Problems".IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING 30.9(2018):1672-1685.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。