Aligning the proteome and genome of the silkworm, Bombyx mori

作者:Zhang Yaozhou*; Xia Qingyou; Xu Jie; Chen Jian; Nie Zuoming; Wang Dan; Zhang Wenping; Chen Jianqing; Zheng Qingliang; Chen Qing; Kong Lingying; Ren Xiaoyuan; Wang Jiang; Lv Zhengbing; Yu Wei; Jiang Caiying; Liu Lili; Sheng Qing; Jin Yongfeng; Wu Xiangfu
来源:Functional & Integrative Genomics, 2009, 9(4): 447-454.
DOI:10.1007/s10142-009-0127-x

摘要

A technology of mass spectrometry (MS) was used in this study for the large-scale proteomic identification and verification of protein-encoding genes present in the silkworm (Bombyx mori) genome. Peptide sequences identified by MS were compared with those from an open reading frame (ORF) library of the B. mori genome and a cDNA library, to validate the coding attributes of ORFs. Two databases were created. The first was based on a 9x draft sequence of the silkworm genome and contained 14,632 putative proteins. The second was based on a B. mori pupal cDNA library containing 3,187 putative proteins of at least 30 amino acid residues in length. A total of 81,000 peptide sequences with a threshold score of 60% were generated by the MS/MS analysis, and 55,400 of these were chosen for a sequence alignment. By searching these two databases, 6,649 and 250 proteins were matched, which accounted for approximately 45.4% and 7.8% of the peptide sequences and putative proteins, respectively. Further analyses carried out by several bioinformatic tools suggested that the matches included proteins with predicted transmembrane domains (1,393) and preproteins with a signal peptide (976). These results provide a fundamental understanding of the expression and function of silkworm proteins.