摘要

Predicting information regarding the structure of the protein from its sequence still remains an uphill task. Though both are intimately linked, it has been found difficult so far to get a direct correlation between the two. In our present approach we use a simple method based on multiple alignment and phylogeny to derive a correlation between the protein structure and sequence via motif search. The protein families which we have considered are SH2 like, Homeodomain, Leucine rich repeat, Alphabeta knot trefoilknot and ferritin like helix bundle. We have been able to successfully predict the protein families with an average prediction of accuracy of 81%, the highest being 89% and the lowest being 73% on our test data set.

  • 出版日期2009-9

全文