Machine learning for LC-MS medicinal plants identification

作者:Nazarenko D V; Kharyuk P V; Oseledets I V; Rodin I A; Shpigun O A
来源:Chemometrics and Intelligent Laboratory Systems, 2016, 156: 174-180.
DOI:10.1016/j.chemolab.2016.06.003

摘要

Herbal medicines are vigorously marketed, but poorly regulated. Analysis methodology for this field is still forming. One particular analytical task is confirmation of plant species identity for medicinal plants used as ingredients. In this work, machine learning approach has been implemented for LC-MS plant species identification. Samples for 36 plant species have been analyzed. Peak data (m/z, abundance) from respective samples have been used for development of classification algorithms. Namely, logistic regression (LR), support vector machine (SVM) and random forest (RF) techniques were used. For most of used machine learning algorithms, classification accuracy of 95% higher were obtained on cross-validation dataset. Now, massive training datasets are needed for full-scale application of this approach.

  • 出版日期2016-8-15