Deep Convolutional Neural Networks for Predicting Hydroxyproline in Proteins

Long, HaiXia; Wang, Mi*; Fu, HaiYan

doi:10.2174/1574893612666170221152848

Abstract

Background: Hydroxyproline is one type of post-translational modification (PTM) of proteins. Because protein sequences contain many uncharacterized proline (P) residues, the question that needs to be answered is which of them can be hydroxylated and which cannot. The answer would not only give a deeper understanding of the hydroxylation mechanism but could also support drug development. The ever-growing demand for better handling of protein sequences in the post-genomic age presents new prediction challenges.

Objective: To address these challenges, we aim to develop computational methods that identify these sites quickly and accurately.

Method: We propose a new approach for predicting hydroxyproline using a deep learning model, the convolutional neural network (CNN). We employ pseudo amino acid composition (PseAAC) to identify these proteins and use the position-specific scoring matrix (PSSM) to represent samples as input to the CNN model.

Results and Conclusion: In our experiments, K-fold cross-validation on benchmark datasets demonstrated the potential of the CNN for identifying protein hydroxyproline as well as other types of PTM proteins.
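The abstract does not give implementation details, so the following is only a minimal sketch of two surrounding steps: collecting a fixed-length window around every candidate P residue, and splitting samples for K-fold cross-validation. The function names, the flank size, the `X` padding symbol, and the unshuffled fold assignment are all illustrative assumptions and are not taken from the paper. The PSSM lookup and the CNN itself are not shown.

```python
from typing import List, Tuple


def proline_windows(sequence: str, flank: int = 6, pad: str = "X") -> List[Tuple[int, str]]:
    """Return (position, window) for every proline (P) in the sequence.

    `flank` and `pad` are assumed values chosen for illustration only.
    """
    padded = pad * flank + sequence + pad * flank
    windows = []
    for i, residue in enumerate(sequence):
        if residue == "P":
            # Residue i sits at index i + flank in the padded string, so
            # the slice [i, i + 2*flank] is centred on it.
            windows.append((i, padded[i:i + 2 * flank + 1]))
    return windows


def k_fold_indices(n_samples: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Split range(n_samples) into k (train, test) index pairs, without shuffling."""
    # Sample i is assigned to fold i % k (a simple assumed scheme).
    folds = [list(range(start, n_samples, k)) for start in range(k)]
    splits = []
    for held_out in range(k):
        test = folds[held_out]
        train = sorted(i for f, fold in enumerate(folds) if f != held_out for i in fold)
        splits.append((train, test))
    return splits


if __name__ == "__main__":
    for position, window in proline_windows("MKPAGPL", flank=2):
        print(position, window)
    for train, test in k_fold_indices(5, 2):
        print(train, test)
```

In a fuller pipeline, each window would be mapped to the corresponding rows of the protein's PSSM before being passed to the network. That step depends on choices the abstract does not state.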

Publication date: 2017
Affiliation: Hainan Normal University