A comprehensive comparison of two variable importance analysis techniques in high dimensions: Application to an environmental multi-indicators system

Wei, Pengfei<sup>*</sup>; Lu, Zhenzhou; Song, Jingwen

doi:10.1016/j.envsoft.2015.04.015

摘要

Permutation variable importance measure (PVIM) based on random forest and Morris' screening design are two effective techniques for measuring the variable importance in high dimensions. The former technique is developed in the machine learning discipline and widely used in bioinformatics, while the latter technique is popular in scientific computing. We present three main contributions to variable importance analysis (VIA). First, through theoretical derivation, we show that the PVIM converges to double the non-standardized Sobol' total effect index. This observation indicates that the PVIM is especially useful for variable screening as it captures both the individual and interaction effects. Second, three numerical examples with different types of model behavior are presented for comparing the performances of these two techniques. The main conclusions are as follows. For high-dimensional additive or approximately additive models, the PVIM is much more efficient than Morris' screening design when used for both variable importance ranking and variable screening. For high-dimensional models mainly governed by interaction effects, the performance of PVIM degrades, but it is still a competitive technique. Finally, the two techniques are applied to an environmental multi-indicators system for improving the robustness of the partial order structure of this system.

出版日期2015-8
单位西北工业大学

全文

访问全文

收藏分享被引(8) 浏览

更新时间：2021-11-22 02:05

A comprehensive comparison of two variable importance analysis techniques in high dimensions: Application to an environmental multi-indicators system

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友