中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Mining Cross Features for Financial Credit Risk Assessment

文献类型:会议论文

作者Liu, Qiang3,4; Liu, Zhaocheng2; Zhang, Haoli2; Chen, Yuntian5; Zhu, Jun1
出版日期2021-11
会议日期2021.11.01-2021.11.05
会议地点Queensland, Australia
英文摘要

For reliability, machine learning models in some areas, e.g., finance and healthcare, require to be both accurate and globally interpretable. Among them, credit risk assessment is a major application of machine learning for financial institutions to evaluate credit of users and detect default or fraud. Simple white-box models, such as Logistic Regression (LR), are usually used for credit risk assessment, but not powerful enough to model complex nonlinear interactions among features. In contrast, complex black-box models are powerful at modeling, but lack of interpretability, especially global interpretability. Fortunately, automatic feature crossing is a promising way to find cross features to make simple classifiers to be more accurate without heavy handcrafted feature engineering. However, existing automatic feature crossing methods have problems in efficiency on credit risk assessment, for corresponding data usually contains hundreds of feature fields.

In this work, we find local interpretations in Deep Neural Networks (DNNs) of a specific feature are usually inconsistent among different samples. We demonstrate this is caused by nonlinear feature interactions in the hidden layers of DNN. Thus, we can mine feature interactions in DNN, and use them as cross features in LR. This will result in mining cross features more efficiently. Accordingly, we propose a novel automatic feature crossing method called DNN2LR. The final model, which is a LR model empowered with cross features, generated by DNN2LR is a white-box model. We conduct experiments on both public and business datasets from real-world credit risk assessment applications, which show that, DNN2LR outperform both conventional models used for credit assessment and several feature crossing methods. Moreover, comparing with state-of-the-art feature crossing methods, i.e., AutoCross, the proposed DNN2LR method accelerates the speed by about 10 to 40 times on financial credit assessment datasets, which contain hundreds of feature fields.

语种英语
源URL[http://ir.ia.ac.cn/handle/173211/47492]  
专题自动化研究所_智能感知与计算研究中心
作者单位1.Tsinghua University
2.RealAI
3.University of Chinese Academy of Sciences
4.Institute of Automation, Chinese Academy of Sciences
5.Peng Cheng Laboratory
推荐引用方式
GB/T 7714
Liu, Qiang,Liu, Zhaocheng,Zhang, Haoli,et al. Mining Cross Features for Financial Credit Risk Assessment[C]. 见:. Queensland, Australia. 2021.11.01-2021.11.05.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。