摘要

One of the prime reasons for the patients suffering from the acute liver damage or dysfunction is drug. Thus, it is imperative to carry out the further research on hepatotoxicity affected by the drugs before the drugs enter the market. In order to obtain better classification accuracy of hepatotoxicity prediction, support vector machine (SVM) associates with a coarse-grained parallel genetic algorithm (CGPGA) to complete the process of selecting feature and optimizing parameter of SVM simultaneously. The proposed model shortly named as CGPGA-SVM approach. To compare with the previous learning algorithms, the same dataset is necessary which includes 1087 compounds. Firstly, in the total dataset with the external validation, the overall accuracy for the prediction is 75.63% by 10-fold cross-validation. As the result, the figure is 12% higher than the previous congeneric prediction. Next, training set and test set are produced to form the resource in this experiment. The final prediction accuracies are 78.21% and 70.89% by CGPGA-SVM approach. From the aspect of the efficiency, there is a contrast among the serial genetic algorithm (SGA), Grid search and CGPGA-SVM by the program. Meanwhile, Our proposed approach reduces the running time obviously. The result implicates the performance of the proposed CGPGA-SVM approach is better in hepatotoxicity prediction.