New tools for automated high-resolution cryo-EM structure determination in RELION-3 - PubMed (original) (raw)

New tools for automated high-resolution cryo-EM structure determination in RELION-3

Jasenko Zivanov et al. Elife. 2018.

Abstract

Here, we describe the third major release of RELION. CPU-based vector acceleration has been added in addition to GPU support, which provides flexibility in use of resources and avoids memory limitations. Reference-free autopicking with Laplacian-of-Gaussian filtering and execution of jobs from python allows non-interactive processing during acquisition, including 2D-classification, de novo model generation and 3D-classification. Per-particle refinement of CTF parameters and correction of estimated beam tilt provides higher resolution reconstructions when particles are at different heights in the ice, and/or coma-free alignment has not been optimal. Ewald sphere curvature correction improves resolution for large particles. We illustrate these developments with publicly available data sets: together with a Bayesian approach to beam-induced motion correction it leads to resolution improvements of 0.2-0.7 Å compared to previous RELION versions.

Keywords: Cryo-EM; molecular biophysics; none; single-particle analysis; software; structural biology.

© 2018, Zivanov et al.

PubMed Disclaimer

Conflict of interest statement

JZ, TN, BF, DK, WH, EL No competing interests declared, SS Reviewing editor, eLife

Figures

Figure 1.

Figure 1.. Comparison of the traditional power-spectrum-based CTF estimation technique with reference-map-based CTF refinement.

Top: Sum over the power spectra ||Xk|| of all polished particle images in micrograph EMD-2984_0000 of EMPIAR-10061 (left) and the averages and standard deviations of that sum over Fourier rings between 10 Å and 2.5 Å (right). The dashed grey curves are located one std. deviation of each ring above and below the average. The overall rise in spectral power as a function of frequency is a consequence of particle polishing. Bottom: The corresponding plot for the sum over the real components of Vk*⁢Xk, instead of the power spectra. Note that the latter term is not only centered around zero, obviating the need to estimate the background intensity, but it also offers a higher signal-to-noise ratio.

Figure 2.

Figure 2.. Accelerated CPU performance (A) Even when specific vector instructions are disabled, RELION-3 runs faster than RELION-2 even on the previous-generation Broadwell processors that are ubiquitous in many cryo-EM clusters worldwide.

Enabling vectorisation during compilation with the Intel compiler benefits the new streamlined code path, but not the legacy code. (B) For latest-generation Skylake CPUs, the difference is much larger even with only AVX2 vectorisation enabled, and when enabling the new AVX512 instructions the performance is roughly 4.5x higher than the legacy code path. (C) The accelerated CPU code executing on dual-socket x86 nodes provides cost-efficiency that is at least approaching that of professional-class GPU hardware (but not consumer GPUs).

Figure 2—figure supplement 1.

Figure 2—figure supplement 1.. FCSs comparing half-sets in each type of run, legacy-CPU, acc-CPU, and acc-GPU, all reach the same resolution and sampling accuracy.

Additional FSCs compare the reconstruction of the acc-CPU and acc-GPU using the legacy-CPU as a base of comparison, showing numerical agreement beyond the reconstructed signal threshold. This validates that the quality of results has not been compromised from using lower precision or compiler optimisation.

Figure 3.

Figure 3.. Per-particle defocus correction (A) FSC curves between independently refined half-maps for the different stages of processing as explained in the main text.

(B) As in A, but FSC curves are between the cryo-EM maps and the corresponding atomic model (PDB-4FNK) (Ekiert et al., 2012). (C) Representative density features for some of the maps for which FSC curves are shown in A and B.

Figure 3—figure supplement 1.

Figure 3—figure supplement 1.. Per-particle defocus estimates (along Z) are plotted against the X,Y-coordinates of the particles in a representative micrograph.

Figure 4.

Figure 4.. Beam tilt correction (A) FSC curves between independently refined half-maps for the different stages of processing as explained in the main text.

(B) As in A, but FSC curves are between the cryo-EM maps and the corresponding atomic model (PDB-2W0O) (de Val et al., 2012). (C) Representative density features from the 2.2 Å map from the data set with active beam tilt compensation (left); the 2.9 Å map from the data set without active beam tilt compensation and without beam tilt correction (middle); and the 2.2 Å map from the data set without active beam tilt compensation but with beam tilt correction (right).

Figure 4—figure supplement 1.

Figure 4—figure supplement 1.. The average per-pixel phase differences (in radiants) between reference projections and the individual experimental particle images (i.e. the phase angle of qk in Equation 8).

The nine plots correspond to subsets of images collected at the nine different holes without active beam-tilt compensation. Note that the applied beam shift values have been recorded in stage coordinates giving a 175 degree X-axis rotation for the magnification used, which is reflected in the relative position of the nine plots. The inside regions showing the opposite slope to the global trend result from a systematic error in particle alignment caused by beam tilt.

Figure 4—figure supplement 2.

Figure 4—figure supplement 2.. The estimated beam tilt is plotted against the beam-shift values from serialEM in the X-direction (black circles) and in the Y-direction (red circles).

Linear fits through these points indicate there is 0.19 mrad of beam tilt for each μm of beam shift.

Figure 5.

Figure 5.. Ewald sphere correction (A) FSC curves between independently refined half-maps without Ewald sphere correction (grey); with Ewald sphere correction with the correct curvature (green) and with Ewald sphere correction with the inverse curvature (orange).

(B) As in A, but FSC curves are between the cryo-EM and the corresponding atomic model (PDB-5UU5) (Hryc et al., 2017). (C) Representative density features without Ewald sphere correction (grey) with Ewald sphere correction and the correct curvature (green) and with Ewald sphere correction and the reverse curvature (orange).

Figure 6.

Figure 6.. Non-interactive data pre-processing.

(A) Particle positions as selected by the LoG-based auto-picking algorithm are shown as yellow dots. (B) The 2D class averages for the eight largest classes after LoG-based autopicking of the first batch of 10,000 particles. (C) 3D models generated by the script: after SGD initial model generation (left), and after 3D classification for the largest class (middle) and the second largest class (right). The two maps after 3D classification are thresholded at the same intensity level. (D) FSC curves between independently refined half-maps for EMD-3061 (grey), for the map obtained after non-interactive pre-processing (orange) and for the map obtained after processing of the originally published subset of particles in RELION-3. (E) FSC curves between PDB-5A63 and the same maps as in D. (F) Representative density features for the maps in E and F.

Figure 7.

Figure 7.. High-resolution refinement: β-galactosidase (A) FSC curves between independently refined half-maps for the different stages of processing as explained in the main text.

(B) FSC curves between PDB-5A1A and the maps at the same stages of processing as in A. (C) Representative density features for some of the maps for which FSC curves are shown in A and B.

Figure 7—figure supplement 1.

Figure 7—figure supplement 1.. FSC curves between PDB-5A1A and EMD-4116, EMD-7770 and the new map reconstructed in RELION-3.

Figure 7—figure supplement 2.

Figure 7—figure supplement 2.. The B-factor plot that was generated automatically by the bfactor_plot.py script.

The script performed nine 3D refinement and post-processing jobs with subsets of increasing sizes for the beta-galactosidase data set. Fitting a straight line through the inverse of the resolution squared of each refinement against the natural logarithm of the number of particles in the corresponding subset yielded an estimated B-factor of 56 Å2.

Figure 8.

Figure 8.. High-resolution refinement: apo-ferritin (A) FSC curves between independently refined half-maps at different stages of the processing.

(B) FSC curves between PDB-5N27 (Ferraro et al., 2017) and the cryo-EM map at different stages of the processing. (C) Representative density features at different stages of the processing.

Figure 8—figure supplement 1.

Figure 8—figure supplement 1.. The B-factor plot that was generated automatically by the bfactor_plot.py script.

The script performed ten 3D refinement and post-processing jobs with subsets of increasing sizes for the high-resolution apo-ferritin data set. Fitting a straight line through the inverse of the resolution squared of each refinement against the natural logarithm of the number of particles in the corresponding subset yielded an estimated B-factor of 66 Å2.

Similar articles

Cited by

References

    1. Abrishami V, Vargas J, Li X, Cheng Y, Marabini R, Sorzano CÓ, Carazo JM. Alignment of direct detection device micrographs using a robust Optical Flow approach. Journal of Structural Biology. 2015;189:163–176. doi: 10.1016/j.jsb.2015.02.001. - DOI - PubMed
    1. Bai XC, Rajendra E, Yang G, Shi Y, Scheres SH. Sampling the conformational space of the catalytic subunit of human gamma-secretase. eLife. 2015a;4:e11182. doi: 10.7554/eLife.11182. - DOI - PMC - PubMed
    1. Bai XC, Yan C, Yang G, Lu P, Ma D, Sun L, Zhou R, Scheres SHW, Shi Y. An atomic structure of human γ-secretase. Nature. 2015b;525:212–217. doi: 10.1038/nature14892. - DOI - PMC - PubMed
    1. Bartesaghi A, Merk A, Banerjee S, Matthies D, Wu X, Milne JL, Subramaniam S. 2.2 Å resolution cryo-EM structure of β-galactosidase in complex with a cell-permeant inhibitor. Science. 2015;348:1147–1151. doi: 10.1126/science.aab1576. - DOI - PMC - PubMed
    1. Bartesaghi A, Aguerrebere C, Falconieri V, Banerjee S, Earl LA, Zhu X, Grigorieff N, Milne JLS, Sapiro G, Wu X, Subramaniam S. Atomic Resolution Cryo-EM Structure of β-Galactosidase. Structure. 2018;26:848–856. doi: 10.1016/j.str.2018.04.004. - DOI - PMC - PubMed

Publication types

MeSH terms

LinkOut - more resources