|MEDINFO 2001: Proceedings of the 10th World Congress on Medical Informatics (2001) 84:965-9|
|Northeast Structural Genomics Consortium|
(click to unfold)
Domain parsing, or the detection of signals of protein structural domains from sequence data, is a complex and difficult problem. ...
If carried out reliably it would be a powerful interpretive and predictive tool for genomic and proteomic studies. We report on a novel approach to domain parsing using consensus techniques based on Hidden Markov Models (HMMs) and BLAST searches built from a training set of 1471 continuous structural domains from the Dali Domain Dictionary (DDD). Validation on an independent test sample of family-matched structural domain sequences from the Scop database yields a consensus prediction performance rate of 75.5%, well above the 58% obtained by simple agreement of methods.
|Computational Biology Protein Structure, Tertiary Sequence Analysis Proteins Algorithms Markov Chains |
|0 (Last update: 05/27/2017 12:21:58pm)|
|Medinfo. 2001;10(Pt 2):965-9.|