A novel hybrid gene prediction method employing protein multiple sequence alignments

Authors: Keller Oliver; Kollmar Martin; Stanke Mario*; Waack Stephan
Source: Bioinformatics, 2011, 27(6): 757-763.
DOI:10.1093/bioinformatics/btr010

Abstract

Motivation: As improved DNA sequencing techniques have enormously increased the speed at which new eukaryotic genome assemblies are produced, the further development of automated gene prediction methods continues to be essential. While the classification of proteins into families relies heavily on correct gene predictions, it can at the same time provide a source of additional information for the prediction, complementary to the sources presently used. Results: We extended the gene prediction software AUGUSTUS with a method that employs block profiles generated from multiple sequence alignments as a protein signature to improve the accuracy of the prediction. Equipped with profiles modelling human dynein heavy chain (DHC) proteins and other families, AUGUSTUS was run on genomic sequences known to contain members of these families. Compared with the ab initio version of AUGUSTUS, the rate of genes predicted with high accuracy showed a dramatic increase.

  • Publication date: 2011-3-15