Data Set Modelability by QSAR

Authors: Golbraikh Alexander; Muratov Eugene; Fourches Denis; Tropsha Alexander*
Source: Journal of Chemical Information and Modeling, 2014, 54(1): 1-4.
DOI: 10.1021/ci400572x

Abstract

We introduce a simple MODelability Index (MODI) that estimates the feasibility of obtaining predictive QSAR models (correct classification rate above 0.7) for a binary data set of bioactive compounds. MODI is defined as an activity class-weighted ratio of the number of nearest-neighbor pairs of compounds with the same activity class versus the total number of pairs. The MODI values were calculated for more than 100 data sets, and the threshold of 0.65 was found to separate the nonmodelable and modelable data sets.
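The definition above can be illustrated with a minimal sketch. It is not the authors' implementation, and it rests on several assumptions not stated in the abstract: each compound is represented by a numeric descriptor vector, the nearest neighbor is found by Euclidean distance, and "activity class-weighted" is read as giving every class equal weight, so that MODI is the mean over classes of the fraction of compounds whose first nearest neighbor belongs to the same class. The function name `modi` and the toy data are hypothetical.

```python
from math import dist  # Euclidean distance (Python 3.8+)


def modi(descriptors, labels):
    """Sketch of a modelability index under the assumptions stated above.

    descriptors: sequence of equal-length numeric tuples, one per compound
    labels:      sequence of activity class labels (e.g. 0 = inactive, 1 = active)
    Returns the mean, over classes, of the fraction of compounds whose
    first nearest neighbor has the same class label.
    """
    n = len(descriptors)
    if n < 2 or n != len(labels):
        raise ValueError("need at least two compounds and one label per compound")

    same = {}   # class -> compounds whose nearest neighbor shares the class
    total = {}  # class -> number of compounds in the class
    for i in range(n):
        # First nearest neighbor of compound i, excluding i itself
        # (ties are resolved in favor of the lowest index).
        j = min(
            (k for k in range(n) if k != i),
            key=lambda k: dist(descriptors[i], descriptors[k]),
        )
        cls = labels[i]
        total[cls] = total.get(cls, 0) + 1
        same[cls] = same.get(cls, 0) + (labels[j] == cls)

    # Equal weight per class, so a large class cannot dominate the index.
    return sum(same[c] / total[c] for c in total) / len(total)


# Toy one-dimensional data: the inactive compound at 2.5 sits closest to an active one.
toy_descriptors = [(0.0,), (1.0,), (2.5,), (10.0,), (11.0,)]
toy_labels = [1, 1, 0, 0, 0]
score = modi(toy_descriptors, toy_labels)
print(score, "modelable" if score > 0.65 else "nonmodelable")
```

The brute-force neighbor search is quadratic in the number of compounds, which is acceptable for a sketch; the comparison against 0.65 uses the threshold reported in the abstract.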

  • Publication date: 2014-1