Comparing the CORAL and Random Forest Approaches for Modelling the In Vitro Cytotoxicity of Silica Nanomaterials

作者:Cassano Antonio; Robinson Richard L Marchese; Palczewska Anna; Puzyn Tomasz; Gajewicz Agnieszka; Tran Lang; Manganelli Serena; Cronin Mark T D
来源:ATLA-Alternatives to Laboratory Animals, 2016, 44(6): 533-556.
DOI:10.1177/026119291604400603

摘要

Nanotechnology is one of the most important technological developments of the 21st cen-tury. In silico methods to predict toxicity, such as quantitative structure-activity relationships (QSARs), pro-mote the safe-by-design approach for the development of new materials, including nanomaterials. In this study, a set of cytotoxicity experimental data corresponding to 19 data points for silica nanomaterials were investigated, to compare the widely employed CORAL and Random Forest approaches in terms of their use-fulness for developing so-called 'nano-QSAR' models. 'External' leave-one-out cross-validation (LOO) analysis was performed, to validate the two different approaches. An analysis of variable importance mea-sures and signed feature contributions for both algorithms was undertaken, in order to interpret the mod-els developed. CORAL showed a more pronounced difference between the average coefficient of determination (R2) for training and for LOO (0.83 and 0.65 for training and LOO, respectively), compared to Random Forest (0.87 and 0.78 without bootstrap sampling, 0.90 and 0.78 with bootstrap sampling), which may be due to overfitting. With regard to the physicochemical properties of the nanomaterials, the aspect ratio and zeta potential were found to be the two most important variables for Random Forest, and the average feature contributions calculated for the corresponding descriptors were consistent with the clear trends observed in the data set: less negative zeta potential values and lower aspect ratio values were associated with higher cytotoxicity. In contrast, CORAL failed to capture these trends.

  • 出版日期2016-12