Sparse network modeling and metscape-based visualization methods for the analysis of large-scale metabolomics data

作者:Basu Sumanta; Duren William; Evans Charles R; Burant Charles F; Michailidis George*; Karnovsky Alla*
来源:Bioinformatics, 2017, 33(10): 1545-1553.
DOI:10.1093/bioinformatics/btx012

摘要

Motivation: Recent technological advances in mass spectrometry, development of richer mass spectral libraries and data processing tools have enabled large scale metabolic profiling. Biological interpretation of metabolomics studies heavily relies on knowledge-based tools that contain information about metabolic pathways. Incomplete coverage of different areas of metabolism and lack of information about non-canonical connections between metabolites limits the scope of applications of such tools. Furthermore, the presence of a large number of unknown features, which cannot be readily identified, but nonetheless can represent bona fide compounds, also considerably complicates biological interpretation of the data. Results: Leveraging recent developments in the statistical analysis of high-dimensional data, we developed a new Debiased Sparse Partial Correlation algorithm (DSPC) for estimating partial correlation networks and implemented it as a Java-based CorrelationCalculator program. We also introduce a new version of our previously developed tool Metscape that enables building and visualization of correlation networks. We demonstrate the utility of these tools by constructing biologically relevant networks and in aiding identification of unknown compounds.

  • 出版日期2017-5-15