Use of ENCODE Resources to Characterize Novel Proteoforms and Missing Proteins in the Human Proteome

作者:Nilsson Carol L; Mostovenko Ekaterina; Lichti Cheryl F; Ruggles Kelly; Fenyoe David; Rosenbloom Kate R; Hancock William S; Paik Young Ki; Omenn Gilbert S; LaBaer Joshua; Kroes Roger A; Uhlen Matthias; Hober Sophia; Vegvari Akos; Andren Per E; Sulman Erik P; Lang Frederick F; Fuentes Manuel; Carlsohn Elisabet; Emmett Mark R; Moskal Joseph R; Berven Frode S; Fehniger Thomas E; Marko Varga Gyorgy*
来源:Journal of Proteome Research, 2015, 14(2): 603-608.
DOI:10.1021/pr500564q

摘要

We describe the utility of integrated strategies that employ both translation of ENCODE data and major proteomic technology pillars to improve the identification of the "missing proteins", novel proteoforms, and PTMs. On one hand, databases in combination with bioinformatic tools are efficiently utilized to establish microarray-based transcript analysis and supply rapid protein identifications in clinical samples. On the other hand, sequence libraries are the foundation of targeted protein identification and quantification using mass spectrometric and immunoaffinity techniques. The results from combining proteoENCODEdb searches with experimental mass spectral data indicate that some alternative splicing forms detected at the transcript level are in fact translated to proteins. Our results provide a step toward the directives of the C-HPP initiative and related biomedical research.