A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites

Long, Haixia; Liao, Bo<sup>*</sup>; Xu, Xingyu; Yang, Jialiang<sup>*</sup>

doi:10.3390/ijms19092817

摘要

Protein hydroxylation is one type of post-translational modifications (PTMs) playing critical roles in human diseases. It is known that protein sequence contains many uncharacterized residues of proline and lysine. The question that needs to be answered is: which residue can be hydroxylated, and which one cannot. The answer will not only help understand the mechanism of hydroxylation but can also benefit the development of new drugs. In this paper, we proposed a novel approach for predicting hydroxylation using a hybrid deep learning model integrating the convolutional neural network (CNN) and long short-term memory network (LSTM). We employed a pseudo amino acid composition (PseAAC) method to construct valid benchmark datasets based on a sliding window strategy and used the position-specific scoring matrix (PSSM) to represent samples as inputs to the deep learning model. In addition, we compared our method with popular predictors including CNN, iHyd-PseAAC, and iHyd-PseCp. The results for 5-fold cross-validations all demonstrated that our method significantly outperforms the other methods in prediction accuracy.

出版日期2018-9
单位浙江理工大学; 海南师范大学

全文

访问全文

收藏分享被引(26) 浏览

更新时间：2024-04-23 04:34

A Hybrid Deep Learning Model for Predicting Protein Hydroxylation Sites

摘要

全文

产品服务

站内浏览

服务支持

联系方式

科研之友