Conference Agenda

Overview and details of the sessions of this conference. Please select a date or location to show only sessions at that day or location. Please select a single session for detailed view (with abstracts and downloads if available).

 
 
Session Overview
Session
MS-2: Validation of cryoEM structures and maps
Time:
Sunday, 15/Aug/2021:
10:20am - 12:45pm

Session Chair: Jose-Maria Carazo
Session Chair: Elena Orlova
Location: Club A

170 1st floor

Invited: Florence Tama (Japan), Pavel Afonine (USA), G. Kleywegt (Germany)


Session Abstract

New achievements in Cryo-electron microscopy enabled a revolution in structural biology and over the last few years reached resolutions comparable with crystallography. Still, quantitative assessment of these structures is urgently needed, and essential for the validation of all these new EMDB and PDB entries. New algorithms tailored to the nature of these data need to be developed and made widely available to the community. This session will present the advancements in this field.

For all abstracts of the session as prepared for Acta Crystallographica see external PDF, or individual abstracts below.


Show help for 'Increase or decrease the abstract text size'
Presentations
10:20am - 10:50am

Protein hydrogen bond parameters as a new validation tool

Pavel Afonine1, Oleg Sobolev1, Nigel Moriarty1, Yanting Xu1,2, Thomas Terwilliger3,4, Paul Adams1,5

1Lawrence Berkeley National Laboratory, Berkeley, United States of America; 2International Center for Quantum and Molecular Structures, Shanghai University, Shanghai 200444, People's Republic of China; 3Bioscience Division, Los Alamos National Laboratory, Mail Stop M888, Los Alamos, NM 87545, USA; 4New Mexico Consortium, Los Alamos, NM 87544, USA; 5Department of Bioengineering, University of California Berkeley, Berkeley, CA 94720, USA

Atomic model refinement and completion at low resolution (cryo-EM or crystallographic) is often a challenging task. This is mostly because the experimental data aren’t sufficiently detailed to describe using atomic models. To make refinement practical and ensure a refined model is geometrically meaningful additional a priori information about model geometry needs to be used. This information includes restraints on Ramachandran plot distributions or side chain rotameric states. However, using Ramachandran plot or rotameric states as refinement targets diminish the validating power of these tools. Therefore finding additional model validation criteria that are not used or difficult to use as refinement goals is desirable. Hydrogen bonds are one of most important non-covalent interactions that shape and maintain protein structure. These interactions can be characterized by specific geometry of hydrogen donor and acceptor atoms. Systematic analysis of these geometries performed for all quality-filtered high-resolution models of proteins from PDB shows they have distinct and conserved distribution that can be characterized by only two parameters. Here we demonstrate how these two parameters can serve as unique validation metrics and how they can pinpoint severe modeling problems that no other validation tools can detect. This tool is now a part of Phenix model validation suite; guidelines to its use and interpretation will be given.



10:50am - 11:20am

Integrative modeling to characterize structure and dynamics of biomolecules

Florence Tama

Nagoya University & RIKEN, Nagoya, Japan

Hybrid and integrative modeling methods that combine computational molecular mechanics simulations with experimental data are powerful in describing the structure and dynamics of large biomolecules. In particular, flexible fitting is a powerful technique to build the 3D structures of biomolecules from cryo-electron microscopy (cryo-EM) density maps. While flexible fitting methods work nicely with very high-resolution maps, there are limitations for medium resolution maps (~5-10 angstrom) in the case of complex conformational transitions. To overcome such issues, we proposed a refinement based on conformational ensemble, i.e., performing multiple fittings trials using various parameters. An automatic adjustment of the biasing force constants during the fitting process was introduced via a replica-exchange scheme to improve the success rate. From such multiple fittings, clustering analysis of the models obtained can be an effective approach to avoid over‐fitting. In addition, we have looked into the pixel size parameter as it can impact the resolution and accuracy of a cryo-EM map, and we proposed a computational protocol to estimate the appropriate pixel size parameter. In our protocol, we fit and refine atomic structures against cryo-EM maps at multiple pixel sizes. The resulting fitted and refined structures are evaluated using the GOAP score. We have demonstrated the efficacy of this protocol in retrieving appropriate pixel sizes via several examples.



11:20am - 11:50am

Community recommendations on validating cryo-EM models and data

Gerard Kleywegt

EMBL EBI, Cambridge, United Kingdom

Structural biology, the study of the 3D structures of biological entities on scales from small molecules to cells, has had an enormous impact on our understanding of biology and biological processes in health and disease. The results of these structural studies (mainly by MX, NMR and 3DEM) have been captured in the single global archive of atomistic models of biomacromolecules and their complexes, the PDB, operated by the wwPDB consortium. In addition, since 2002 the cryo-EM community has been depositing their maps and tomograms in EMDB.

A few years before the resolution revolution, wwPDB and EMDB jointly convened an EM Validation Task Force (VTF) which met in 2010 to discuss initial recommendations (published in 2012) regarding validation of cryo-EM data and models. In the following decade, the resolution revolution happened, EMDB grew from 1,000 to 15,000 entries, an archive for raw cryo-EM data was established (EMPIAR), community challenges related to EM validation were organised, and many labs began to develop new approaches to validating EM structures. This made it clear that a second EM VTF meeting was urgently needed. This meeting took place (in person!) in January 2020. During two days several dozen experts from all over the globe discussed cryo-EM data
management, deposition and validation.

A white paper summarising the discussions and recommendations of the second EM VTF meeting is currently in preparation. I will provide an overview of the major consensus recommendations emanating from the meeting and also address how wwPDB and EMDB are implementing these.



11:50am - 12:05pm

Cryo-EM Validation Metrics in EMDA

Rangana Warshamanage, Keitaro Yamashita, Garib N. Murshudov

MRC Laboratory of Molecular Biology, Francis Crick Avenue, Cambridge, Biomedical Campus, Cambridge, CB2 0QH, UK

Cryo-EM is becoming an increasingly popular method of structure determination in structural biology. As the number of cryo-EM structures increases, it is important to maintain standards that measure the quality of those structures. The correctness of atomic models is very important because they often serve as targets for novel drugs or the knowledge base of such developments. Also, such standards are important to prevent the accumulation of errors of the structures in the databases. Thus, careful curation and validation of cryo-EM maps and derived atomic models are of utmost importance.

We have developed the EMDA Python package [1] that includes tools for cryo-EM map and model manipulation. In this presentation, the emphasis is given to those for validation. The majority of the current validation tools used in single-particle cryo-EM analyses are global metrics. They provide summaries of the global quality of maps or map-model fits. In order to reveal the local variation of the signal in maps and map-model fits, a new set of tools based on the local correlation have been developed. To calculate the local correlation, a spherical kernel is convolved with the map in image space to yield a correlation value at each voxel resulting in a three-dimensional (3D) correlation map. The variation of calculated correlation depends on the size of the kernel. The local correlation calculated using half maps captures the local variations in the signal, whereas the local correlation calculated between a map and a model indicates the quality of their fit. Map-model local correlation can be used to identify model regions outside the density or poorly fitted. Also, it can highlight unmodeled regions on the map. While the half map local correlation is useful to identify the presence/absence of the signal, its comparison with the map-model local correlation can be used to validate the map-model fit. In this presentation, we will demonstrate the use of local correlation through several examples. Also, EMDA includes several tools based on the maximum likelihood method. EMDA’s map-overlay and map magnification refinement are based on the maximisation of the joint probability distribution between two maps by a quasi-Newton method. We will demonstrate the use of map overlay and magnification refinement implemented in EMDA through examples.



12:05pm - 12:20pm

FSC-Q: A method for quality analysis of cryoEM-derived models

Erney Ramírez-Aportela, David Maluenda, Yunior C. Fonseca, Pablo Conesa, Roberto Marabini, Carlos Oscar S. Sorzano, Jose M. Carazo

CNB-CSIC, Madrid, Spain

To obtain more accurate atomic models from cryoEM and increase their impact on biomedical research, metrics are needed that carefully evaluate these constructed models. In this poster we present further developments on FSC-Q, a map-to-model quality issue recently introduced [1], with the capability to detect those areas of the model that are better supported by the experimental data (Figure1). The algorithm performs a careful analysis of the Signal-to-Noise Ratio in the half maps and in map generated from the proposed model through local resolution. It is intuitive and, yet, very precise, introducing quality information that we have quantitatively shown is new, in the sense that some of it was not captured in previous quality assessment metrics.



12:20pm - 12:35pm

Outcomes from EMDataResource model challenges

Catherine Lawson1, Andriy Kryshstafovych2, Wah Chiu3

1Rutgers University, NJ, USA; 2University of California Davis, CA, USA; 3Stanford University/SLAC, Stanford, CA, USA

Electron cryo-microscopy (cryo-EM) is rapidly becoming a mainstream area of structural biology and medicine, enabling visualization and modelling of a wide variety of biologically important complexes. This recent explosion of new cryo-EM structures raises several important questions. How accurate are these maps and their model interpretations? What criteria are currently being used and are they good enough? This paper describes the outcomes of the 2019 Model Metrics Challenge sponsored by EMDataResource (https://challenges.emdataresource.org). The goals of this challenge were two-fold: (1) to evaluate the quality of models that can be produced using current modelling software, and (2) to assess the performance of metrics currently in use to evaluate cryo-EM models. In both instances the focus was on map targets selected the near-atomic resolution regime (1.8-3.1 Å), with an innovative twist: three of four maps formed a resolution series from the same specimen/imaging experiment.The results permit several specific recommendations to be made about validating near-atomic cryo-EM structures, both in the context of an individual laboratory experiment and for in the context of a structure data archive. We will also touch on preliminary results from our ongoing 2021 Ligand Model Challenge.



12:35pm - 12:50pm

Quantifiying resolvability of atomic features in cryo-EM maps using Q-scores

Greg Pintilie1, Michael Schmid2, Wah Chiu3

1Stanford University; 2Stanford University, SLAC National Accelerator Laboratory; 3Stanford University, SLAC National Accelerator Laboratory

Q-scores are calculated locally for individual atoms in a model fitted or built into a cryo-EM map. They can be averaged over groups of atoms to represent resolvability of larger features such as residues in proteins, nucleotides in nucleic acids, and ligands. Plotting of residue or nucleotide Q-scores helps to identify which parts of a model are resolved in the map, and which parts may be unresolved or may need further refinement. A useful property of Q-scores is that for well-fitted models, they correlate strongly to the resolution of the map estimated by FSC; this answers the question ‘what is a good score’ for a map at a certain resolution. Several examples and related structural insights are shown with models and maps ranging from 2 to 5Å resolution. The connection between Q-scores and atomic B-factors is also explored. Finally, Q-scores are used to help detect and assess water and ion molecules in maps at 3Å and higher resolutions.