TY - JOUR
T1 - Structural genomics target selection for the New York consortium on membrane protein structure
AU - Punta, Marco
AU - Love, James
AU - Handelman, Samuel
AU - Hunt, John F.
AU - Shapiro, Lawrence
AU - Hendrickson, Wayne A.
AU - Rost, Burkhard
N1 - Funding Information:
Acknowledgments This work was supported by the grant U54-GM75026-01 to the NYCOMPS from PSI of the NIH. Thanks to all NYCOMPS collaborators who contribute to making NYCOMPS a wonderful experience, in particular thanks to Ann McDermott, Filippo Mancia, Ming Zhou, Francesca Gubellini (all Columbia), Da-Neng Wang (New York University), Mark Girvin (Albert Einstein), Guy Montelione (Rutgers), Renato Bruni and Brian Kloss (both NYCOMPS). Specific thanks to Jinfeng Liu (Genentech), Jessica Locke (Rutgers) and Rajesh Nair (Food and Drug Administration) for providing preliminary information and programs; to Ta-tsen Soong (Columbia), Henry Bigelow and Dariusz Przybylski (both Broad Institute), Kaz Wrzeszczynski (Cold Spring Harbor Laboratory), Zsuzsanna Dosztanyi (Hungarian Academy of Sciences, Budapest) and Zsolt Zolnai (University of Wisconsin—Madison) for useful discussions; to Guy Yachdav and Laszlo Kajan (both Columbia) for computer assistance and the collection of genome data sets. Last, not least, thanks to all those who deposit their experimental data in public databases, and to those who maintain these databases.
Funding Information:
The protein structure initiative (PSI) is the leading structural genomics (SG) initiative in the USA; it is funded by the National Institute of General Medical Sciences (NIGMS) at the National Institutes of Health (NIH). The PSI currently supports four large production centers and six specialized centers as well as other activities [1–3]. Two of these specialized centers focus on developing new technologies for membrane protein structure determination: the Center for Structures of Membrane Proteins (CSMP) [4] and the New York Consortium on Membrane Protein Structure (NYCOMPS). At NYCOMPS (http://www.nycomps.org/) we have established a high-throughput pipeline beginning with target selection and further including protein purification, protein expression and scale-up. Scaled-up proteins are sent to individual participating labs (within or outside of the consortium) for structure determination trials. While most of our resources are channeled into X-ray crystallography, we also pursue structure determination by NMR, solid-state NMR and cryo-electron microscopy. Here, we describe target selection, the first stage of the NYCOMPS pipeline.
PY - 2009/12
Y1 - 2009/12
N2 - The New York Consortium on Membrane Protein Structure (NYCOMPS), a part of the Protein Structure Initiative (PSI) in the USA, has as its mission to establish a high-throughput pipeline for determination of novel integral membrane protein structures. Here we describe our current target selection protocol, which applies structural genomics approaches informed by the collective experience of our team of investigators. We first extract all annotated proteins from our reagent genomes, i.e. the 96 fully sequenced prokaryotic genomes from which we clone DNA. We filter this initial pool of sequences and obtain a list of valid targets. NYCOMPS defines valid targets as those that, among other features, have at least two predicted transmembrane helices, no predicted long disordered regions and, except for community nominated targets, no significant sequence similarity in the predicted transmembrane region to any known protein structure. Proteins that feed our experimental pipeline are selected by defining a protein seed and searching the set of all valid targets for proteins that are likely to have a transmembrane region structurally similar to that of the seed. We require sequence similarity aligning at least half of the predicted transmembrane region of seed and target. Seeds are selected according to their feasibility and/or biological interest, and they include both centrally selected targets and community nominated targets. As of December 2008, over 6,000 targets have been selected and are currently being processed by the experimental pipeline. We discuss how our target list may impact structural coverage of the membrane protein space.
AB - The New York Consortium on Membrane Protein Structure (NYCOMPS), a part of the Protein Structure Initiative (PSI) in the USA, has as its mission to establish a high-throughput pipeline for determination of novel integral membrane protein structures. Here we describe our current target selection protocol, which applies structural genomics approaches informed by the collective experience of our team of investigators. We first extract all annotated proteins from our reagent genomes, i.e. the 96 fully sequenced prokaryotic genomes from which we clone DNA. We filter this initial pool of sequences and obtain a list of valid targets. NYCOMPS defines valid targets as those that, among other features, have at least two predicted transmembrane helices, no predicted long disordered regions and, except for community nominated targets, no significant sequence similarity in the predicted transmembrane region to any known protein structure. Proteins that feed our experimental pipeline are selected by defining a protein seed and searching the set of all valid targets for proteins that are likely to have a transmembrane region structurally similar to that of the seed. We require sequence similarity aligning at least half of the predicted transmembrane region of seed and target. Seeds are selected according to their feasibility and/or biological interest, and they include both centrally selected targets and community nominated targets. As of December 2008, over 6,000 targets have been selected and are currently being processed by the experimental pipeline. We discuss how our target list may impact structural coverage of the membrane protein space.
KW - Membrane proteins
KW - Structural genomics
KW - Structure determination
KW - Target selection
UR - http://www.scopus.com/inward/record.url?scp=71049118492&partnerID=8YFLogxK
U2 - 10.1007/s10969-009-9071-1
DO - 10.1007/s10969-009-9071-1
M3 - Article
C2 - 19859826
AN - SCOPUS:71049118492
SN - 1345-711X
VL - 10
SP - 255
EP - 268
JO - Journal of Structural and Functional Genomics
JF - Journal of Structural and Functional Genomics
IS - 4
ER -