Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS) - PubMed (original) (raw)

doi: 10.1038/nmeth.1353. Epub 2009 Jul 20.

Angeli L Menon, Michal Hammel, Robert P Rambo, Farris L Poole 2nd, Susan E Tsutakawa, Francis E Jenney Jr, Scott Classen, Kenneth A Frankel, Robert C Hopkins, Sung-Jae Yang, Joseph W Scott, Bret D Dillard, Michael W W Adams, John A Tainer

Affiliations

Robust, high-throughput solution structural analyses by small angle X-ray scattering (SAXS)

Greg L Hura et al. Nat Methods. 2009 Aug.

Abstract

We present an efficient pipeline enabling high-throughput analysis of protein structure in solution with small angle X-ray scattering (SAXS). Our SAXS pipeline combines automated sample handling of microliter volumes, temperature and anaerobic control, rapid data collection and data analysis, and couples structural analysis with automated archiving. We subjected 50 representative proteins, mostly from Pyrococcus furiosus, to this pipeline and found that 30 were multimeric structures in solution. SAXS analysis allowed us to distinguish aggregated and unfolded proteins, define global structural parameters and oligomeric states for most samples, identify shapes and similar structures for 25 unknown structures, and determine envelopes for 41 proteins. We believe that high-throughput SAXS is an enabling technology that may change the way that structural genomics research is done.

PubMed Disclaimer

Figures

Figure 1

Figure 1

High-throughput SAXS pipeline. (a) Configuration of the SAXS endstation shows X-ray beam path, sample position, pipetting robot, and area detector. (b) Schematic of the sample area showing how the sample is loaded by the robot into a temperature-controlled cell. Positive helium pressure reduces air scatter and oxidative damage. (c) SAXS analysis tree for rapid and robust data processing and analysis. Proteins are first categorized as aggregated (using either the scattering curve itself or dynamic light scattering (DLS)), mixtures (based on native gel electrophoresis or multi-angle light scattering (MALS)), or mono-disperse samples. For monodisperse samples, SAXS data next defines global solution structural parameters radius of gyration, maximum dimension, and calculated mass. Sequence-based homology search discovers existing structures that can be used to analyze both mixtures and monodisperse samples. Approximate time scales are noted in each step. Perl scripts are used to collect information and begin processes for dashed paths. Both primary data and derived shapes are stored at the BioIsis internet accessible utility.

Figure 2

Figure 2

SAXS analysis provides feedback on challenging samples that are polydisperse or inhomogeneous. (a) PF0230 and PF1548 were mixtures by native gel electrophoresis. Overlaying the SAXS-predicted PF0230 envelope with a close homolog (PDB 2CWE) revealed consistency to the homolog dimer with additional density indicating a larger species. (b) SAXS results directly discerned aggregation based on low angle Guinier regions (insert) for three protein samples PF0418 (red), PF1733 (blue) and PF1281 (green). Features (oscillations) in the SAXS scattering curve for PF0418 and PF1281 suggest that small adjustments in sample preparation may yield workable data, e.g. PF1281 was markedly improved after passing through a filter (purple). (c) Probable multimers may be identified when atomic resolution results are available of the protein or a homolog. Here, multimers in crystal lattices (PF0094 homolog PDB 1J08, PF0380 PDB 1VK1, PF0930 homolog PDB 1V7L, and PF1090 PDB 1SJ1) are used to identify a best fit to the SAXS data.

Figure 3

Figure 3

SAXS provides accurate shape and assembly in solution for most samples. (a) For the ten proteins with structural homologs or existing structures, the experimental scattering data (colors) were compared with the scattering curve calculated for the matching structure (black). (b) For monodisperse samples, the envelope determinations (colored as in a) were overlaid with the existing structures (ribbons). All monomeric units had a seven amino-acid His-tag attached. (c) For the 9 proteins with no pre-existing structural information, envelope predictions from two independent programs were compared and generally agree. The DAMMIN results (black mesh) were generated without symmetry. The GASBOR results used 2-fold symmetry for PF0014/0015, PF0965/0966/0967/0971, PF1911 (dimer), PF00716 (dimer), PF0699 (dimer) and PF1950 (dimer). Four-fold symmetry was imposed on tetrameric PF1291 and PF1372. (d) Plotting the SAXS data as I*q2 vs. q (Kratky plot) highlights proteins with large unfolded regions. The Kratky plot of PF0715 is shown for comparison of a folded protein and shows characteristic parabolic behavior at wide angles. In contrast PF0706.1, PF2047.1, and PF1282/1205 have SAXS data consistent with unfolded regions as reflected in the non-parabolic wide-angle properties.

Figure 4

Figure 4

SAXS determines accurate assembly state in solution, as shown for acetyl-CoA synthetase subunit (PF1787). The experimental scattering curve for PF1787 (black) is shown with calculated scattering curves for monomeric (magenta dots) and dimeric (green dashes) atomic resolution structures of homologs. The best fit (red) to the experimental SAXS data is calculated from a 3-fold symmetric trimer derived from a monomeric homologue (PDB 1WR2). The trimeric form of PF1787 was confirmed using I(0), the extrapolated intensity at 0 scattering angle, normalized for concentration (inset). Proteins standards lysozyme (Lys), xylanase (Xyl), PF1281, bovine serum albumin (BSA) and glucose isomerase (GI) were used to place the data on a relative scale. Relevant structures from analysis of PF1787 are shown on the right. The crystallographic dimer (green) is a flexibly-linked 2-domain protein. Models with 3-fold symmetry enforced (blue) match the SAXS results.

Figure 5

Figure 5

SAXS defines accurate shape and assembly in solution for unknown structures and can uncover unsuspected structural similarity. Experimental scattering curves for proteins with no known structural homolog (left, color) were compared with calculated scattering (black curves on left) from PDB structures identified by DARA, a database of scattering curves calculated from the PDB database. Results from the shape reconstruction program GASBOR (colored envelopes) are overlaid onto the structures identified by DARA (ribbon models, right). In addition, PF1674 and PF1281 with known structures show a limitation in the DARA search (see text) and the need for better comparative algorithms.

Similar articles

Cited by

References

    1. Robinson CV, Sali A, Baumeister W. The molecular sociology of the cell. Nature. 2007;450:973. - PubMed
    1. Green BD, Keller M. Capturing the uncultivated majority. Curr. Opin. Biotechnol. 2006;17:236. - PubMed
    1. Wilmes P, Bond PL. Metaproteomics: studying functional gene expression in microbial ecosystems. Trends Microbiol. 2006;14:92. - PubMed
    1. Putnam CD, Hammel M, Hura GL, Tainer JA. X-ray solution scattering (SAXS) combined with crystallography and computation: defining accurate macromolecular structures, conformations and assemblies in solution. Q. Rev. Biophys. 2007;40:191. - PubMed
    1. Fox BG, et al. Structural genomics: from genes to structures with valuable materials and many questions in between. Nat. Methods. 2008;5:129. - PubMed

Publication types

MeSH terms

Substances

Grants and funding

LinkOut - more resources