A modification of the ICOMP criterion for estimation of optimum complexity of PCR models

Capron X; Walczak B; de Noord OE; Massart DL<sup>*</sup>

doi:10.1002/cem.934

摘要

The predictive ability of a PCR bilinear regression model is highly dependent on the number of latent variables selected. A non-optimal complexity is likely to result in a model yielding unsatisfactory predictions, due to a high bias or high variance of the coefficients of regression. The popular cross-validation methods such as leave-one-out cross-validation (LOOCV) and Monte-Carlo cross-validation (MCCV) are not always able to retain the proper number of latent variables, especially when atypical samples are present in the data. Also, they are computationally intensive, particularly for large data sets. In this study, the information complexity criterion ICOMP is modified in order to select the optimal PCR model. The results obtained demonstrate that this information criterion behaves at least as good as the cross-validation approaches, and usually outperforms them in terms of model selection and computation time, whether atypical samples are present in the data or not.

出版日期2005-7

全文

访问全文

收藏分享被引(2) 浏览

更新时间：2018-08-03 12:09

A modification of the ICOMP criterion for estimation of optimum complexity of PCR models

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友