|Proteins (2009) 74:655-68|
|Northeast Structural Genomics Consortium|
(click to unfold)
Machine-learning techniques can classify functionally related proteins where homology-transfer as well as sequence and structure motifs fail. ...
Here, we present a method that aimed at complementing homology-transfer in the identification of cell cycle control kinases from sequence alone. First, we identified functionally significant residues in cell cycle proteins through their high sequence conservation and biophysical properties. We then incorporated these residues and their features into support vector machines (SVM) to identify new kinases and more specifically to differentiate cell cycle kinases from other kinases and other proteins. As expected, the most informative residues tend to be highly conserved and tend to localize in the ATP binding regions of the kinases. Another observation confirmed that ATP binding regions are typically not found on the surface but in partially buried sites, and that this fact is correctly captured by accessibility predictions. Using these highly conserved, semi-buried residues and their biophysical properties, we could distinguish cell cycle S/T kinases from other kinase families at levels around 70-80% accuracy and 62-81% coverage. An application to the entire human proteome predicted at least 97 human proteins with limited previous annotations to be candidates for cell cycle kinases.
|chemistry metabolism |
|Models, Molecular Conserved Sequence Humans Databases, Protein Protein Conformation Evolution, Molecular Protein-Serine-Threonine Kinases Cell Cycle Proteins Binding Sites Cell Cycle Artificial Intelligence Amino Acid Motifs |
|1 (Last update: 10/28/2017 11:42:31am)|
Cell cycle kinases predicted from conserved biophysical properties