Continuous wavelet transform-based feature selection applied to near-infrared spectral diagnosis of cancer

作者:Chen Hui; Lin Zan; Mo Lin; Wu Hegang; Wu Tong; Tan Chao*
来源:Spectrochimica Acta Part A: Molecular and Biomolecular Spectroscopy , 2015, 151: 286-291.
DOI:10.1016/j.saa.2015.06.109

摘要

Spectrum is inherently local in nature since it can be thought of as a signal being composed of various frequency components. Wavelet transform (WT) is a powerful tool that partitions a signal into components with different frequency. The property of multi-resolution enables WT a very effective and natural tool for analyzing spectrum-like signal. In this study, a continuous wavelet transform (CWT)-based variable selection procedure was proposed to search for a set of informative wavelet coefficients for constructing a near-infrared (NIR) spectral diagnosis model of cancer. The CWT provided a fine multi-resolution feature space for selecting best predictors. A measure of discriminating power (DP) was defined to evaluate the coefficients. Partial least squares-discriminant analysis (PLS-DA) was used as the classification algorithm. A NIR spectral dataset associated to cancer diagnosis was used for experiment. The optimal results obtained correspond to the wavelet of db2. It revealed that on condition of having better performance on the training set, the optimal PLS-DA model using only 40 wavelet coefficients in 10 scales achieved the same performance as the one using all the variables in the original space on the test set: an overall accuracy of 93.8%, sensitivity of 92.5% and specificity of 96.3%. It confirms that the CWT-based feature selection coupled with PLS-DA is feasible and effective for constructing models of diagnostic cancer by NIR spectroscopy.