mapDIA: Preprocessing and statistical analysis of quantitative proteomics data from data independent acquisition mass spectrometry

作者:Teo Guoshou; Kim Sinae; Tsou Chih Chiang; Collins Ben; Gingras Anne Claude; Nesvizhskii Alexey I; Choi Hyungwon*
来源:Journal of Proteomics, 2015, 129: 108-120.
DOI:10.1016/j.jprot.2015.09.013

摘要

Data independent acquisition (DIA) mass spectrometry is an emerging technique that offers more complete detection and quantification of peptides and proteins across multiple samples. DIA allows fragment-level quantification, which can be considered as repeated measurements of the abundance of the corresponding peptides and proteins in the downstream statistical analysis. However, few statistical approaches are available for aggregating these complex fragment-level data into peptide- or protein-level statistical summaries. In this work, we describe a software package, mapDIA, for statistical analysis of differential protein expression using DIA fragment-level intensities. The workflow consists of three major steps: intensity normalization, peptide/ fragment selection, and statistical analysis. First, mapDIA offers normalization of fragment-level intensities by total intensity sums as well as a novel alternative normalization by local intensity sums in retention time space. Second, mapDIA removes outlier observations and selects peptides/fragments that preserve the major quantitative patterns across all samples for each protein. Last, using the selected fragments and peptides, mapDIA performs model-based statistical significance analysis of protein-level differential expression between specified groups of samples. Using a comprehensive set of simulation datasets, we show that mapDIA detects differentially expressed proteins with accurate control of the false discovery rates. We also describe the analysis procedure in detail using two recently published DIA datasets generated for 14-3-313 dynamic interaction network and prostate cancer glycoproteome. Availability: The software was written in C + + language and the source code is available for free through SourceForge website http://sourceforge.net/projects/mapdia/. This article is part of a Special Issue entitled: Computational Proteomics.

  • 出版日期2015-11-3
  • 单位rutgers