TY - JOUR
T1 - Loopy proteins appear conserved in evolution
AU - Liu, Jinfeng
AU - Tan, Hepan
AU - Rost, Burkhard
N1 - Funding Information:
We thank Henrik Nielsen (CBS, Denmark) for providing the source code for SignalP and for his generous help in using this program, to Andrei Lupas (MPI Tübingen) for helpful suggestions about using the COILS program, and to Henry Bigelow (Columbia) for crucial comments on the manuscript. We thank Florencio Pazos, Damien Devos and Alfonso Valencia (all CNB Madrid) for supplying and helping with the program EUCLID.; and Alexei Murzin (MRC Cambridge) for useful discussions. Finally, we thank to the undisclosed reviewer who suggested analysing the hydrogen bonding networks of NORS regions. The work of J.L. and B.R. was supported by grants 1-P50-GM62413-01 and RO1-GM63029-01 from the National Institute of Health. Last, not least, thanks to all those who deposit their experimental data into public databases, and to those who maintain these databases.
PY - 2002
Y1 - 2002
N2 - Over the last decade, structural biologists have unravelled many proteins that appear natively disordered. Common assumptions are that many of these proteins adopt structure through binding and that the structural flexibility enables them to adopt different functions. Here, we investigated regions of more than 70 sequence-consecutive residues that have no regular secondary structure (NORS). Analysing 31 entirely sequenced organisms, we predicted five times as many proteins with NORS regions (loopy proteins) in eukaryotes (20%) than in prokaryotes and archaeas (4%). Thousands of these NORS regions were over 150 residues long. The amino acid composition of NORS regions differed from that of loops in PDB. Although NORS proteins had significantly more residues in low-complexity regions than other proteins, simple cut-off thresholds for sequence bias missed most NORS regions. On average, NORS regions were evolutionarily at least as conserved as their flanking regions. Furthermore, yeast proteins with NORS regions had more protein-protein interaction partners than other proteins. Regulatory and transcription-related functions were over-represented in loopy proteins, biosynthesis and energy metabolism were under-represented. Overall, our analysis confirmed that proteins with non-regular structures appear to play important functional roles, and they may adopt as yet unknown types of protein structures.
AB - Over the last decade, structural biologists have unravelled many proteins that appear natively disordered. Common assumptions are that many of these proteins adopt structure through binding and that the structural flexibility enables them to adopt different functions. Here, we investigated regions of more than 70 sequence-consecutive residues that have no regular secondary structure (NORS). Analysing 31 entirely sequenced organisms, we predicted five times as many proteins with NORS regions (loopy proteins) in eukaryotes (20%) than in prokaryotes and archaeas (4%). Thousands of these NORS regions were over 150 residues long. The amino acid composition of NORS regions differed from that of loops in PDB. Although NORS proteins had significantly more residues in low-complexity regions than other proteins, simple cut-off thresholds for sequence bias missed most NORS regions. On average, NORS regions were evolutionarily at least as conserved as their flanking regions. Furthermore, yeast proteins with NORS regions had more protein-protein interaction partners than other proteins. Regulatory and transcription-related functions were over-represented in loopy proteins, biosynthesis and energy metabolism were under-represented. Overall, our analysis confirmed that proteins with non-regular structures appear to play important functional roles, and they may adopt as yet unknown types of protein structures.
KW - Disordered regions
KW - Natively unstructured proteins
KW - Protein function
KW - Protein-protein interactions
KW - Regular secondary structure
UR - http://www.scopus.com/inward/record.url?scp=0036968309&partnerID=8YFLogxK
U2 - 10.1016/S0022-2836(02)00736-2
DO - 10.1016/S0022-2836(02)00736-2
M3 - Article
C2 - 12215414
AN - SCOPUS:0036968309
SN - 0022-2836
VL - 322
SP - 53
EP - 64
JO - Journal of Molecular Biology
JF - Journal of Molecular Biology
IS - 1
ER -