|Journal of Structural and Functional Genomics (2007) 8:217-26|
|Center for Eukaryotic Structural Genomics|
(click to unfold)
It has been previously shown that protein sequences containing a quasi-repetitive assortment of amino acids are common in genomes and databases such as Swiss-Prot but are under-represented in the structure-based Protein Data Bank (PDB). ...
Structural genomics groups have been using the absence of these "low-complexity" sequences for several years as a way to select proteins that have a good chance of successful structure determination. In this study, we examine the data deposited in the PDB as well as the available data from structural genomics groups in TargetDB and PepcDB to reveal interesting trends that could be taken into consideration when using low-complexity sequences as part of the target selection process.
|metabolism chemistry methods |
|Crystallography, X-Ray Algorithms Amino Acid Sequence Databases, Protein Molecular Sequence Data Proteins Sequence Alignment Genome Database Management Systems |
|9 (Last update: 02/09/2019 4:44:38pm)|
Evaluation of low-complexity in amino acid sequences for target selection in structural genomics