Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder
文献类型:期刊论文
作者 | Liu PJ(刘朋杰)1,2,3,4; Pan FC(潘福成)2,3,4![]() ![]() ![]() ![]() |
刊名 | Neural Computing and Applications
![]() |
出版日期 | 2022 |
页码 | 1-22 |
关键词 | Automated machine learning Parallel automated system Dual-stacked autoencoder Selective ensemble |
ISSN号 | 0941-0643 |
产权排序 | 1 |
英文摘要 | Finding a high-performance machine learning pipeline (ML pipeline) for a supervised learning task takes much time. It requires many choices, including preprocessing datasets, selecting algorithms, tuning hyperparameters, and ensembling candidate models. With increasing pipelines arises a combination explosion problem. This work presents a new automated machine learning (AutoML) system called Dsa-PAML to address this challenge by recommending, training, and ensembling suitable models for supervised learning tasks. Dsa-PAML is a parallel automated system based on a dual-stacked autoencoder (Dsa). Firstly, meta-features of datasets and ML pipelines are used to alleviate cold-start recommendation problems. Secondly, a novel dual-stacked autoencoder is used to simultaneously learn the latent features of datasets and ML pipelines, efficiently learning collaborations of both datasets and ML pipelines and recommending suitable ML pipelines for a new dataset. Thirdly, Dsa-PAML can train the recommended ML pipelines on the new dataset in a parallel method, which substantially reduces the time complexity of the proposed method. Finally, a parallel selective ensemble system is embedded into Dsa-PAML. It selects base models from candidate ML pipelines according to their runtime, classification performance, and diversity on the validation set, enhancing Dsa-PAML’s stability for most datasets. Amounts of experiments on 30 UCI datasets show that our approach outperforms current state-of-the-art methods. |
语种 | 英语 |
WOS记录号 | WOS:000773831700002 |
资助机构 | National Key R&D Program of China under Grant No. 2019B090916002 |
源URL | [http://ir.sia.cn/handle/173321/30743] ![]() |
专题 | 沈阳自动化研究所_数字工厂研究室 |
通讯作者 | Zhou XF(周晓锋) |
作者单位 | 1.University of Chinese Academy of Sciences, Beijing 100049, China 2.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China 3.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China 4.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China |
推荐引用方式 GB/T 7714 | Liu PJ,Pan FC,Zhou XF,et al. Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder[J]. Neural Computing and Applications,2022:1-22. |
APA | Liu PJ.,Pan FC.,Zhou XF.,Li S.,Zeng PY.,...&Jin L.(2022).Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder.Neural Computing and Applications,1-22. |
MLA | Liu PJ,et al."Dsa-PAML: a parallel automated machine learning system via dual-stacked autoencoder".Neural Computing and Applications (2022):1-22. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。