中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Multiset Feature Learning for Highly Imbalanced Data Classification

文献类型:期刊论文

作者Jing, Xiao-Yuan3,4,5; Zhang, Xinyu3; Zhu, Xiaoke6; Wu, Fei5; You, Xinge7; Gao, Yang1; Shan, Shiguang8; Yang, Jing-Yu2
刊名IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
出版日期2021
卷号43期号:1页码:139-156
关键词Highly imbalanced data classification multiset feature learning deep metric learning generative adversarial network cost-sensitive factor weighted uncorrelated constraint
ISSN号0162-8828
DOI10.1109/TPAMI.2019.2929166
英文摘要With the expansion of data, increasing imbalanced data has emerged. When the imbalance ratio (IR) of data is high, most existing imbalanced learning methods decline seriously in classification performance. In this paper, we systematically investigate the highly imbalanced data classification problem, and propose an uncorrelated cost-sensitive multiset learning (UCML) approach for it. Specifically, UCML first constructs multiple balanced subsets through random partition, and then employs the multiset feature learning (MFL) to learn discriminant features from the constructed multiset. To enhance the usability of each subset and deal with the nonlinearity issue existed in each subset, we further propose a deep metric based UCML (DM-UCML) approach. DM-UCML introduces the generative adversarial network technique into the multiset constructing process, such that each subset can own similar distribution with the original dataset. To cope with the non-linearity issue, DM-UCML integrates deep metric learning with MFL, such that more favorable performance can be achieved. In addition, DM-UCML designs a new discriminant term to enhance the discriminability of learned metrics. Experiments on eight traditional highly class-imbalanced datasets and two large-scale datasets indicate that: the proposed approaches outperform state-of-the-art highly imbalanced learning methods and are more robust to high IR.
资助项目NSFC-Key Project of General Technology Fundamental Research United Fund[U1736211] ; National Natural Science Foundation of China[61672208] ; National Natural Science Foundation of China[61702280] ; National Natural Science Foundation of China[61772220] ; National Natural Science Foundation of China[61432008] ; key research and development program of China[2016YFE0121200] ; Key Science and Technology Innovation Program of Hubei Province[2017AAA017] ; Key Science and Technology Innovation Program of Hubei Province[2018ACA135] ; Natural Science Foundation Key Project for Innovation Group of Hubei Province[2018CFA024] ; Natural Science Foundation of Jiangsu Province[BK20170900] ; National Postdoctoral Program for Innovative Talents[BX20180146] ; Higher Education Institution Key Research Projects of Henan Province[19A520001]
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000597206900010
出版者IEEE COMPUTER SOC
源URL[http://119.78.100.204/handle/2XEOYT63/16517]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Jing, Xiao-Yuan; Zhu, Xiaoke; Wu, Fei
作者单位1.Nanjing Univ, State Key Lab Novel Software Technol, Nanjing 210094, Peoples R China
2.Nanjing Univ Sci & Technol, Coll Comp Sci, Nanjing 210094, Peoples R China
3.Wuhan Univ, Sch Comp Sci, Wuhan 430072, Peoples R China
4.Guangdong Univ Petrochem Technol, Sch Comp, Maoming 525000, Peoples R China
5.Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210003, Peoples R China
6.Henan Univ, Henan Key Lab Big Data Anal & Proc, Kaifeng 475004, Peoples R China
7.Huazhong Univ Sci & Technol, Dept Elect & Informat Engn, Wuhan 430074, Peoples R China
8.Chinese Acad Sci, Inst Comp Technol, CAS, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Jing, Xiao-Yuan,Zhang, Xinyu,Zhu, Xiaoke,et al. Multiset Feature Learning for Highly Imbalanced Data Classification[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2021,43(1):139-156.
APA Jing, Xiao-Yuan.,Zhang, Xinyu.,Zhu, Xiaoke.,Wu, Fei.,You, Xinge.,...&Yang, Jing-Yu.(2021).Multiset Feature Learning for Highly Imbalanced Data Classification.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,43(1),139-156.
MLA Jing, Xiao-Yuan,et al."Multiset Feature Learning for Highly Imbalanced Data Classification".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 43.1(2021):139-156.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。