中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder

文献类型:期刊论文

作者Liu PJ(刘朋杰)1,2,3,4; Pan FC(潘福成)2,3,4; Zhou XF(周晓锋)2,3,4; Li S(李帅)1,2,3,4; Zeng PY(曾鹏宇)1,2,3,4; Liu SR(刘舒锐)2,3,4; Jin L(金樑)2,3,4
刊名Neural Computing and Applications
出版日期2022
页码1-22
关键词Automated machine learning Parallel automated system Dual-stacked autoencoder Selective ensemble
ISSN号0941-0643
产权排序1
英文摘要

Finding a high-performance machine learning pipeline (ML pipeline) for a supervised learning task takes much time. It requires many choices, including preprocessing datasets, selecting algorithms, tuning hyperparameters, and ensembling candidate models. With increasing pipelines arises a combination explosion problem. This work presents a new automated machine learning (AutoML) system called Dsa-PAML to address this challenge by recommending, training, and ensembling suitable models for supervised learning tasks. Dsa-PAML is a parallel automated system based on a dual-stacked autoencoder (Dsa). Firstly, meta-features of datasets and ML pipelines are used to alleviate cold-start recommendation problems. Secondly, a novel dual-stacked autoencoder is used to simultaneously learn the latent features of datasets and ML pipelines, efficiently learning collaborations of both datasets and ML pipelines and recommending suitable ML pipelines for a new dataset. Thirdly, Dsa-PAML can train the recommended ML pipelines on the new dataset in a parallel method, which substantially reduces the time complexity of the proposed method. Finally, a parallel selective ensemble system is embedded into Dsa-PAML. It selects base models from candidate ML pipelines according to their runtime, classification performance, and diversity on the validation set, enhancing Dsa-PAML’s stability for most datasets. Amounts of experiments on 30 UCI datasets show that our approach outperforms current state-of-the-art methods.

语种英语
WOS记录号WOS:000773831700002
资助机构National Key R&D Program of China under Grant No. 2019B090916002
源URL[http://ir.sia.cn/handle/173321/30743]  
专题沈阳自动化研究所_数字工厂研究室
通讯作者Zhou XF(周晓锋)
作者单位1.University of Chinese Academy of Sciences, Beijing 100049, China
2.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China
3.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China
4.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China
推荐引用方式
GB/T 7714
Liu PJ,Pan FC,Zhou XF,et al. Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder[J]. Neural Computing and Applications,2022:1-22.
APA Liu PJ.,Pan FC.,Zhou XF.,Li S.,Zeng PY.,...&Jin L.(2022).Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder.Neural Computing and Applications,1-22.
MLA Liu PJ,et al."Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder".Neural Computing and Applications (2022):1-22.

入库方式: OAI收割

来源:沈阳自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。