Woody Sherman | Schrodinger - Academia.edu (original) (raw)

Papers by Woody Sherman

Research paper thumbnail of Novel Inverse Binding Mode of Indirubin Derivatives Yields Improved Selectivity for DYRK Kinases

ACS Medicinal Chemistry Letters, 2013

DYRK kinases are involved in alternative pre-mRNA splicing as well as in neuropathological states... more DYRK kinases are involved in alternative pre-mRNA splicing as well as in neuropathological states such as Alzheimer's disease and Down syndrome. In this study, we present the design, synthesis, and biological evaluation of indirubins as DYRK inhibitors with enhanced selectivity. Modifications of the bis-indole included polar or acidic functionalities at positions 5′ and 6′ and a bromine or a trifluoromethyl group at position 7, affording analogues that possess high activity and pronounced specificity. Compound 6i carrying a 5′-carboxylate moiety demonstrated the best inhibitory profile. A novel inverse binding mode, which forms the basis for the improved selectivity, was suggested by molecular modeling and confirmed by determining the crystal structure of DYRK2 in complex with 6i. Structure−activity relationships were further established, including a thermodynamic analysis of binding site water molecules, offering a structural explanation for the selective DYRK inhibition.

Research paper thumbnail of Docking and Virtual Screening Strategies for GPCR Drug Discovery

Methods in Molecular Biology, 2015

Progress in structure determination of G protein-coupled receptors (GPCRs) has made it possible t... more Progress in structure determination of G protein-coupled receptors (GPCRs) has made it possible to apply structure-based drug design (SBDD) methods to this pharmaceutically important target class. The quality of GPCR structures available for SBDD projects fall on a spectrum ranging from high resolution crystal structures (<2 Å), where all water molecules in the binding pocket are resolved, to lower resolution (>3 Å) where some protein residues are not resolved, and finally to homology models that are built using distantly related templates. Each GPCR project involves a distinct set of opportunities and challenges, and requires different approaches to model the interaction between the receptor and the ligands. In this review we will discuss docking and virtual screening to GPCRs, and highlight several refinement and post-processing steps that can be used to improve the accuracy of these calculations. Several examples are discussed that illustrate specific steps that can be taken to improve upon the docking and virtual screening accuracy. While GPCRs are a unique target class, many of the methods and strategies outlined in this review are general and therefore applicable to other protein families.

Research paper thumbnail of ja4075776

Research paper thumbnail of 365784 Ion-Induced Perturbations of Solvation Structure Reverse the Thermodynamic Mechanism of Hydrophobic Association Between Human Carbonic Anhydrase and Arylsulfonamides

The hydrophobic effect—the tendency of nonpolar solutes to aggregate in aqueous solution, and the... more The hydrophobic effect—the tendency of nonpolar solutes to aggregate in aqueous solution, and the driving force of many biomolecular recognition events—arises from the energetically favorable rearrangement of molecules of water. A detailed understanding of the mechanism by which these rearrangements alter the thermodynamics of ligand-protein interactions is essential for (i) predicting the thermodynamic influence of conditions that alter solvation structure (e.g. the presence of other solutes, structural changes in interacting species) and for (ii) exploiting the hydrophobic effect in the rational design of tight-binding ligands. In this work, we studied the thermodynamic repercussions of incremental perturbations to solvation structure by examining the influence of Hofmeister anions on the thermodynamics of hydrophobic interactions between Human Carbonic Anhydrase II (HCA, EC 4.2.11) and arylsulfonamide ligands. Using a combination of calorimetry, x-ray crystallography, and molecul...

Research paper thumbnail of Generation of receptor structural ensembles for virtual screening using binding site shape analysis and clustering

Chemical Biology &amp Drug Design

Accounting for protein flexibility is an essential yet challenging component of structure-based v... more Accounting for protein flexibility is an essential yet challenging component of structure-based virtual screening. Whereas an ideal approach would account for full protein and ligand flexibility during the virtual screening process, this is currently intractable using available computational resources. An alternative is ensemble docking, where calculations are performed on a set of individual rigid receptor conformations and the results combined. The primary challenge associated with this approach is the choice of receptor structures to use for the docking calculations. In this work, we show that selection of a small set of structures based on clustering on binding site volume overlaps provides an efficient and effective way to account for protein flexibility in virtual screening. We first apply the method to crystal structures of cyclin-dependent kinase 2 and HIV protease and show that virtual screening for ensembles of four cluster representative structures yields consistently hig...

Research paper thumbnail of Simulation Study of Stapled Alpha-Helical P53 Peptide Analogs:probing the Relationship between Structural Stability and Biological Potency

Biophysical Journal

computational protein-protein docking approaches can provide a useful alternative to address this... more computational protein-protein docking approaches can provide a useful alternative to address this issue. We present a novel protein docking algorithm, VDOCK, based on the use of 3D Zernike descriptors (3DZD) as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. In our previous works we have shown that 3DZD are suitable for comparing global/local protein surface shape and surface physicochemical properties to quantify their similarity. Here we apply 3DZD for quantifying surface complementarity. Docking decoys are generated using geometric hashing, which are then initially screened by a shape-based scoring function that incorporates buried surface area and 3DZD. The benchmark studies show that 3DZD are not only efficient in identifying shape complementarity for bound docking cases but superior to other existing methods in accommodating a certain level of flexibility of the protein surface in unbound docking cases, taking advantage of 3DZD's controlable resolution of the surface description. In the next stage, generated docking decoys are evaluated using a physics-based scoring function. The weighting factors to combine these terms are trained using several different target metrics on a large dataset of docking decoys. Additional information and steps for selecting models are also employed, which include protein-protein interaction site predictions and optimization of global and side-chain conformations. One of the challenges in protein structure prediction is the development of an accurate scoring function that yields a global minimum free energy for the native state. Although the knowledge-based scoring function has proven to be a successful scoring approach to protein structure prediction, there exists a hurdle in deriving knowledge-based potentials. Namely, the ideal reference state is inaccessible. In this work, we have developed a general physics-based iterative method to extract distance-dependent all-atom potentials for protein structure prediction. Our method circumvented the long-standing reference state problem. The derived scoring function was extensively evaluated with three diverse test sets, and showed significant improvement over other wellknown scoring functions. The results suggest the efficacy of our scoring function for protein structure prediction.

Research paper thumbnail of Interactions between Hofmeister Anions and the Binding Pocket of a Protein

Journal of the American Chemical Society, 2015

This paper uses the binding pocket of human carbonic anhydrase II (HCAII, EC 4.2.1.1) as a tool t... more This paper uses the binding pocket of human carbonic anhydrase II (HCAII, EC 4.2.1.1) as a tool to examine the properties of Hofmeister anions that determine (i) where, and how strongly, they associate with concavities on the surfaces of proteins and (ii) how, upon binding, they alter the structure of water within those concavities. Results from X-ray crystallography and isothermal titration calorimetry show that most anions associate with the binding pocket of HCAII by forming inner-sphere ion pairs with the Zn 2+ cofactor. In these ion pairs, the free energy of anion−Zn 2+ association is inversely proportional to the free energetic cost of anion dehydration; this relationship is consistent with the mechanism of ion pair formation suggested by the "law of matching water affinities". Iodide and bromide anions also associate with a hydrophobic declivity in the wall of the binding pocket. Molecular dynamics simulations suggest that anions, upon associating with Zn 2+ , trigger rearrangements of water that extend up to 8 Å away from their surfaces. These findings expand the range of interactions previously thought to occur between ions and proteins by suggesting that (i) weakly hydrated anions can bind complementarily shaped hydrophobic declivities, and that (ii) ion-induced rearrangements of water within protein concavities can (in contrast with similar rearrangements in bulk water) extend well beyond the first hydration shells of the ions that trigger them. This study paints a picture of Hofmeister anions as a set of structurally varied ligands that differ in size, shape, and affinity for water and, thus, in their ability to bind toand to alter the charge and hydration structure ofpolar, nonpolar, and topographically complex concavities on the surfaces of proteins. , Cl − , Br − , NO 3 − , I − , ClO 4 − , and SCN − . (B) The association of anions with the Zn 2+ cofactor involves two states: an initial state (left) with the anion and protein in aqueous solution, and a final state (right) with the anion−protein complex in aqueous solution. Thermodynamic parameters measured with ITC (ΔJ°b ind , where J = H, TS, or G) represent a difference between the initial and final states.

Research paper thumbnail of Exploring Protein Flexibility: Incorporating Structural Ensembles From Crystal Structures and Simulation into Virtual Screening Protocols

The Journal of Physical Chemistry B, 2012

The capacity of proteins to adapt their structure in response to various perturbations including ... more The capacity of proteins to adapt their structure in response to various perturbations including covalent modifications, and interactions with ligands and other proteins plays a key role in biological processes. Here, we explore the ability of molecular dynamics (MD), replica exchange molecular dynamics (REMD), and a library of structures of crystal-ligand complexes, to sample the protein conformational landscape and especially the accessible ligand binding site geometry. The extent of conformational space sampled is measured by the diversity of the shapes of the ligand binding sites. Since our focus here is the effect of this plasticity on the ability to identify active compounds through virtual screening, we use the structures generated by these techniques to generate a small ensemble for further docking studies, using binding site shape hierarchical clustering to determine four structures for each ensemble. These are then assessed for their capacity to optimize enrichment and diversity in docking. We test these protocols on three different receptors: androgen receptor (AR), HIV protease, and CDK2. We show that REMD enhances structural sampling slightly as compared both to MD, and the distortions induced by ligand binding as reflected in the crystal structures. The improved sampling of the simulation methods does not translate directly into improved docking performance, however. The ensemble approach did improve enrichment and diversity, and the ensemble derived from the crystal structures performed somewhat better than those derived from the simulations.

Research paper thumbnail of Improved Docking of Polypeptides with Glide

Journal of Chemical Information and Modeling, 2013

Predicting the binding mode of flexible polypeptides to proteins is an important task that falls ... more Predicting the binding mode of flexible polypeptides to proteins is an important task that falls outside the domain of applicability of most small molecule and protein−protein docking tools. Here, we test the small molecule flexible ligand docking program Glide on a set of 19 non-α-helical peptides and systematically improve pose prediction accuracy by enhancing Glide sampling for flexible polypeptides. In addition, scoring of the poses was improved by post-processing with physics-based implicit solvent MM-GBSA calculations. Using the best RMSD among the top 10 scoring poses as a metric, the success rate (RMSD ≤ 2.0 Å for the interface backbone atoms) increased from 21% with default Glide SP settings to 58% with the enhanced peptide sampling and scoring protocol in the case of redocking to the native protein structure. This approaches the accuracy of the recently developed Rosetta FlexPepDock method (63% success for these 19 peptides) while being over 100 times faster. Cross-docking was performed for a subset of cases where an unbound receptor structure was available, and in that case, 40% of peptides were docked successfully. We analyze the results and find that the optimized polypeptide protocol is most accurate for extended peptides of limited size and number of formal charges, defining a domain of applicability for this approach.

Research paper thumbnail of Glide XP fragment docking and structurebased pharmacophores

Chemistry Central Journal, 2009

In recent years, fragment-based drug design has become increasingly popular. Common computational... more In recent years, fragment-based drug design has become increasingly popular. Common computational approaches include building fragments up sequentially, or linking disparate fragments. However, the former approach can restrict the exploration of chemical space and may produce ligands that are not sufficiently druglike, whereas the latter approach may result in difficulties when linking together the individual fragments.

Research paper thumbnail of Fast and accurate method for flexible ligand superposition and shapebased screening

Chemistry Central Journal, 2009

The use of 3-dimensional molecular shapes has been demonstrated to be useful in comparing small m... more The use of 3-dimensional molecular shapes has been demonstrated to be useful in comparing small molecules. We have developed a novel method that allows for rapid superposition and scoring of flexible molecules.

Research paper thumbnail of Generation of Receptor Structural Ensembles for Virtual Screening Using Binding Site Shape Analysis and Clustering

Chemical Biology & Drug Design, 2012

Accounting for protein flexibility is an essential yet challenging component of structure-based v... more Accounting for protein flexibility is an essential yet challenging component of structure-based virtual screening. Whereas an ideal approach would account for full protein and ligand flexibility during the virtual screening process, this is currently intractable using available computational resources. An alternative is ensemble docking, where calculations are performed on a set of individual rigid receptor conformations and the results combined. The primary challenge associated with this approach is the choice of receptor structures to use for the docking calculations. In this work, we show that selection of a small set of structures based on clustering on binding site volume overlaps provides an efficient and effective way to account for protein flexibility in virtual screening. We first apply the method to crystal structures of cyclin-dependent kinase 2 and HIV protease and show that virtual screening for ensembles of four cluster representative structures yields consistently high enrichments and diverse actives. We then apply the method to a structural ensemble of the androgen receptor generated with molecular dynamics and obtain results that are in agreement with those from the crystal structures of cyclin-dependent kinase 2 and HIV protease. This work provides a step forward in the incorporation of protein flexibility into structure-based virtual screening.

Research paper thumbnail of Allosteric Inhibition of the NS2B-NS3 Protease from Dengue Virus

ACS Chemical Biology, 2013

Dengue virus is the flavivirus that causes dengue fever, dengue hemorrhagic disease, and dengue s... more Dengue virus is the flavivirus that causes dengue fever, dengue hemorrhagic disease, and dengue shock syndrome, which are currently increasing in incidence worldwide. Dengue virus protease (NS2B-NS3pro) is essential for dengue virus infection and is thus a target of therapeutic interest. To date, attention has focused on developing active-site inhibitors of NS2B-NS3pro. The flat and charged nature of the NS2B-NS3pro active site may contribute to difficulties in developing inhibitors and suggests that a strategy of identifying allosteric sites may be useful. We report an approach that allowed us to scan the NS2B-NS3pro surface by cysteine mutagenesis and use cysteine reactive probes to identify regions of the protein that are susceptible to allosteric inhibition. This method identified a new allosteric site utilizing a circumscribed panel of just eight cysteine variants and only five cysteine reactive probes. The allosterically sensitive site is centered at Ala125, between the 120s loop and the 150s loop. The crystal structures of WT and modified NS2B-NS3pro demonstrate that the 120s loop is flexible. Our work suggests that binding at this site prevents a conformational rearrangement of the NS2B region of the protein, which is required for activation. Preventing this movement locks the protein into the open, inactive conformation, suggesting that this site may be useful in the future development of therapeutic allosteric inhibitors.

Research paper thumbnail of Simulation Study of Stapled Alpha-Helical P53 Peptide Analogs: probing the Relationship between Structural Stability and Biological Potency

computational protein-protein docking approaches can provide a useful alternative to address this... more computational protein-protein docking approaches can provide a useful alternative to address this issue. We present a novel protein docking algorithm, VDOCK, based on the use of 3D Zernike descriptors (3DZD) as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. In our previous works we have shown that 3DZD are suitable for comparing global/local protein surface shape and surface physicochemical properties to quantify their similarity. Here we apply 3DZD for quantifying surface complementarity. Docking decoys are generated using geometric hashing, which are then initially screened by a shape-based scoring function that incorporates buried surface area and 3DZD. The benchmark studies show that 3DZD are not only efficient in identifying shape complementarity for bound docking cases but superior to other existing methods in accommodating a certain level of flexibility of the protein surface in unbound docking cases, taking advantage of 3DZD's controlable resolution of the surface description. In the next stage, generated docking decoys are evaluated using a physics-based scoring function. The weighting factors to combine these terms are trained using several different target metrics on a large dataset of docking decoys. Additional information and steps for selecting models are also employed, which include protein-protein interaction site predictions and optimization of global and side-chain conformations. One of the challenges in protein structure prediction is the development of an accurate scoring function that yields a global minimum free energy for the native state. Although the knowledge-based scoring function has proven to be a successful scoring approach to protein structure prediction, there exists a hurdle in deriving knowledge-based potentials. Namely, the ideal reference state is inaccessible. In this work, we have developed a general physics-based iterative method to extract distance-dependent all-atom potentials for protein structure prediction. Our method circumvented the long-standing reference state problem. The derived scoring function was extensively evaluated with three diverse test sets, and showed significant improvement over other wellknown scoring functions. The results suggest the efficacy of our scoring function for protein structure prediction.

Research paper thumbnail of Accurate and Reliable Prediction of Relative Ligand Binding Potency in Prospective Drug Discovery by way of a Modern Free Energy Calculation Protocol and Force Field

Journal of the American Chemical Society, 2015

Designing tight-binding ligands is a primary objective of small-molecule drug discovery. Over the... more Designing tight-binding ligands is a primary objective of small-molecule drug discovery. Over the past few decades, free-energy calculations have benefited from improved force fields and sampling algorithms, as well as the advent of low-cost parallel computing. However, it has proven to be challenging to reliably achieve the level of accuracy that would be needed to guide lead optimization (∼5× in binding affinity) for a wide range of ligands and protein targets. Not surprisingly, widespread commercial application of free-energy simulations has been limited due to the lack of large-scale validation coupled with the technical challenges traditionally associated with running these types of calculations. Here, we report an approach that achieves an unprecedented level of accuracy across a broad range of target classes and ligands, with retrospective results encompassing 200 ligands and a wide variety of chemical perturbations, many of which involve significant changes in ligand chemical structures. In addition, we have applied the method in prospective drug discovery projects and found a significant improvement in the quality of the compounds synthesized that have been predicted to be potent. Compounds predicted to be potent by this approach have a substantial reduction in false positives relative to compounds synthesized on the basis of other computational or medicinal chemistry approaches. Furthermore, the results are consistent with those obtained from our retrospective studies, demonstrating the robustness and broad range of applicability of this approach, which can be used to drive decisions in lead optimization.

Research paper thumbnail of Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA

Journal of chemical information and modeling, Jan 27, 2014

There is a tendency in the literature to be critical of scoring functions when docking programs p... more There is a tendency in the literature to be critical of scoring functions when docking programs perform poorly. The assumption is that existing scoring functions need to be enhanced or new ones developed in order to improve the performance of docking programs for tasks such as pose prediction and virtual screening. However, failures can result from either sampling or scoring (or a combination of the two), although less emphasis tends to be given to the former. In this work, we use the programs GOLD and Glide on a high-quality data set to explore whether failures in pose prediction and binding affinity estimation can be attributable more to sampling or scoring. We show that identification of the correct pose (docking power) can be improved by incorporating ligand strain into the scoring function or rescoring an ensemble of diverse docking poses with MM-GBSA in a postprocessing step. We explore the use of nondefault docking settings and find that enhancing ligand sampling also improve...

Research paper thumbnail of Physics-based enzyme design: predicting binding affinity and catalytic activity

Proteins, 2014

Computational enzyme design is an emerging field that has yielded promising success stories, but ... more Computational enzyme design is an emerging field that has yielded promising success stories, but where numerous challenges remain. Accurate methods to rapidly evaluate possible enzyme design variants could provide significant value when combined with experimental efforts by reducing the number of variants needed to be synthesized and speeding the time to reach the desired endpoint of the design. To that end, extending our computational methods to model the fundamental physical-chemical principles that regulate activity in a protocol that is automated and accessible to a broad population of enzyme design researchers is essential. Here, we apply a physics-based implicit solvent MM-GBSA scoring approach to enzyme design and benchmark the computational predictions against experimentally determined activities. Specifically, we evaluate the ability of MM-GBSA to predict changes in affinity for a steroid binder protein, catalytic turnover for a Kemp eliminase, and catalytic activity for α-...

Research paper thumbnail of Contributions of water transfer energy to protein-ligand association and dissociation barriers: Watermap analysis of a series of p38α MAP kinase inhibitors

Proteins, 2013

In our previous work, we proposed that desolvation and resolvation of the binding sites of protei... more In our previous work, we proposed that desolvation and resolvation of the binding sites of proteins can serve as the slowest steps during ligand association and dissociation, respectively, and tested this hypothesis on two protein-ligand systems with known binding kinetics behavior. In the present work, we test this hypothesis on another kinetically-determined protein-ligand system-that of p38α and eight Type II BIRB 796 inhibitor analogs. The kon values among the inhibitor analogs are narrowly distributed (10⁴ ≤ kon ≤ 10⁵ M⁻¹ s⁻¹), suggesting a common rate-determining step, whereas the koff values are widely distributed (10⁻¹ ≤ koff ≤ 10⁻⁶ s⁻¹), suggesting a spectrum of rate-determining steps. We calculated the solvation properties of the DFG-out protein conformation using an explicit solvent molecular dynamics simulation and thermodynamic analysis method implemented in WaterMap to predict the enthalpic and entropic costs of water transfer to and from bulk solvent incurred upon ass...

Research paper thumbnail of Cambridge Healthtech Institute Drug Discovery Chemistry. San Diego, CA, USA, 12-14 April, 2011

Future medicinal chemistry, 2011

Research paper thumbnail of Structure-based approach to the prediction of disulfide bonds in proteins

Protein engineering, design & selection : PEDS, 2014

Protein engineering remains an area of growing importance in pharmaceutical and biotechnology res... more Protein engineering remains an area of growing importance in pharmaceutical and biotechnology research. Stabilization of a folded protein conformation is a frequent goal in projects that deal with affinity optimization, enzyme design, protein construct design, and reducing the size of functional proteins. Indeed, it can be desirable to assess and improve protein stability in order to avoid liabilities such as aggregation, degradation, and immunogenic response that may arise during development. One way to stabilize a protein is through the introduction of disulfide bonds. Here, we describe a method to predict pairs of protein residues that can be mutated to form a disulfide bond. We combine a physics-based approach that incorporates implicit solvent molecular mechanics with a knowledge-based approach. We first assign relative weights to the terms that comprise our scoring function using a genetic algorithm applied to a set of 75 wild-type structures that each contains a disulfide bon...

Research paper thumbnail of Novel Inverse Binding Mode of Indirubin Derivatives Yields Improved Selectivity for DYRK Kinases

ACS Medicinal Chemistry Letters, 2013

DYRK kinases are involved in alternative pre-mRNA splicing as well as in neuropathological states... more DYRK kinases are involved in alternative pre-mRNA splicing as well as in neuropathological states such as Alzheimer's disease and Down syndrome. In this study, we present the design, synthesis, and biological evaluation of indirubins as DYRK inhibitors with enhanced selectivity. Modifications of the bis-indole included polar or acidic functionalities at positions 5′ and 6′ and a bromine or a trifluoromethyl group at position 7, affording analogues that possess high activity and pronounced specificity. Compound 6i carrying a 5′-carboxylate moiety demonstrated the best inhibitory profile. A novel inverse binding mode, which forms the basis for the improved selectivity, was suggested by molecular modeling and confirmed by determining the crystal structure of DYRK2 in complex with 6i. Structure−activity relationships were further established, including a thermodynamic analysis of binding site water molecules, offering a structural explanation for the selective DYRK inhibition.

Research paper thumbnail of Docking and Virtual Screening Strategies for GPCR Drug Discovery

Methods in Molecular Biology, 2015

Progress in structure determination of G protein-coupled receptors (GPCRs) has made it possible t... more Progress in structure determination of G protein-coupled receptors (GPCRs) has made it possible to apply structure-based drug design (SBDD) methods to this pharmaceutically important target class. The quality of GPCR structures available for SBDD projects fall on a spectrum ranging from high resolution crystal structures (<2 Å), where all water molecules in the binding pocket are resolved, to lower resolution (>3 Å) where some protein residues are not resolved, and finally to homology models that are built using distantly related templates. Each GPCR project involves a distinct set of opportunities and challenges, and requires different approaches to model the interaction between the receptor and the ligands. In this review we will discuss docking and virtual screening to GPCRs, and highlight several refinement and post-processing steps that can be used to improve the accuracy of these calculations. Several examples are discussed that illustrate specific steps that can be taken to improve upon the docking and virtual screening accuracy. While GPCRs are a unique target class, many of the methods and strategies outlined in this review are general and therefore applicable to other protein families.

Research paper thumbnail of ja4075776

Research paper thumbnail of 365784 Ion-Induced Perturbations of Solvation Structure Reverse the Thermodynamic Mechanism of Hydrophobic Association Between Human Carbonic Anhydrase and Arylsulfonamides

The hydrophobic effect—the tendency of nonpolar solutes to aggregate in aqueous solution, and the... more The hydrophobic effect—the tendency of nonpolar solutes to aggregate in aqueous solution, and the driving force of many biomolecular recognition events—arises from the energetically favorable rearrangement of molecules of water. A detailed understanding of the mechanism by which these rearrangements alter the thermodynamics of ligand-protein interactions is essential for (i) predicting the thermodynamic influence of conditions that alter solvation structure (e.g. the presence of other solutes, structural changes in interacting species) and for (ii) exploiting the hydrophobic effect in the rational design of tight-binding ligands. In this work, we studied the thermodynamic repercussions of incremental perturbations to solvation structure by examining the influence of Hofmeister anions on the thermodynamics of hydrophobic interactions between Human Carbonic Anhydrase II (HCA, EC 4.2.11) and arylsulfonamide ligands. Using a combination of calorimetry, x-ray crystallography, and molecul...

Research paper thumbnail of Generation of receptor structural ensembles for virtual screening using binding site shape analysis and clustering

Chemical Biology &amp Drug Design

Accounting for protein flexibility is an essential yet challenging component of structure-based v... more Accounting for protein flexibility is an essential yet challenging component of structure-based virtual screening. Whereas an ideal approach would account for full protein and ligand flexibility during the virtual screening process, this is currently intractable using available computational resources. An alternative is ensemble docking, where calculations are performed on a set of individual rigid receptor conformations and the results combined. The primary challenge associated with this approach is the choice of receptor structures to use for the docking calculations. In this work, we show that selection of a small set of structures based on clustering on binding site volume overlaps provides an efficient and effective way to account for protein flexibility in virtual screening. We first apply the method to crystal structures of cyclin-dependent kinase 2 and HIV protease and show that virtual screening for ensembles of four cluster representative structures yields consistently hig...

Research paper thumbnail of Simulation Study of Stapled Alpha-Helical P53 Peptide Analogs:probing the Relationship between Structural Stability and Biological Potency

Biophysical Journal

computational protein-protein docking approaches can provide a useful alternative to address this... more computational protein-protein docking approaches can provide a useful alternative to address this issue. We present a novel protein docking algorithm, VDOCK, based on the use of 3D Zernike descriptors (3DZD) as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. In our previous works we have shown that 3DZD are suitable for comparing global/local protein surface shape and surface physicochemical properties to quantify their similarity. Here we apply 3DZD for quantifying surface complementarity. Docking decoys are generated using geometric hashing, which are then initially screened by a shape-based scoring function that incorporates buried surface area and 3DZD. The benchmark studies show that 3DZD are not only efficient in identifying shape complementarity for bound docking cases but superior to other existing methods in accommodating a certain level of flexibility of the protein surface in unbound docking cases, taking advantage of 3DZD's controlable resolution of the surface description. In the next stage, generated docking decoys are evaluated using a physics-based scoring function. The weighting factors to combine these terms are trained using several different target metrics on a large dataset of docking decoys. Additional information and steps for selecting models are also employed, which include protein-protein interaction site predictions and optimization of global and side-chain conformations. One of the challenges in protein structure prediction is the development of an accurate scoring function that yields a global minimum free energy for the native state. Although the knowledge-based scoring function has proven to be a successful scoring approach to protein structure prediction, there exists a hurdle in deriving knowledge-based potentials. Namely, the ideal reference state is inaccessible. In this work, we have developed a general physics-based iterative method to extract distance-dependent all-atom potentials for protein structure prediction. Our method circumvented the long-standing reference state problem. The derived scoring function was extensively evaluated with three diverse test sets, and showed significant improvement over other wellknown scoring functions. The results suggest the efficacy of our scoring function for protein structure prediction.

Research paper thumbnail of Interactions between Hofmeister Anions and the Binding Pocket of a Protein

Journal of the American Chemical Society, 2015

This paper uses the binding pocket of human carbonic anhydrase II (HCAII, EC 4.2.1.1) as a tool t... more This paper uses the binding pocket of human carbonic anhydrase II (HCAII, EC 4.2.1.1) as a tool to examine the properties of Hofmeister anions that determine (i) where, and how strongly, they associate with concavities on the surfaces of proteins and (ii) how, upon binding, they alter the structure of water within those concavities. Results from X-ray crystallography and isothermal titration calorimetry show that most anions associate with the binding pocket of HCAII by forming inner-sphere ion pairs with the Zn 2+ cofactor. In these ion pairs, the free energy of anion−Zn 2+ association is inversely proportional to the free energetic cost of anion dehydration; this relationship is consistent with the mechanism of ion pair formation suggested by the "law of matching water affinities". Iodide and bromide anions also associate with a hydrophobic declivity in the wall of the binding pocket. Molecular dynamics simulations suggest that anions, upon associating with Zn 2+ , trigger rearrangements of water that extend up to 8 Å away from their surfaces. These findings expand the range of interactions previously thought to occur between ions and proteins by suggesting that (i) weakly hydrated anions can bind complementarily shaped hydrophobic declivities, and that (ii) ion-induced rearrangements of water within protein concavities can (in contrast with similar rearrangements in bulk water) extend well beyond the first hydration shells of the ions that trigger them. This study paints a picture of Hofmeister anions as a set of structurally varied ligands that differ in size, shape, and affinity for water and, thus, in their ability to bind toand to alter the charge and hydration structure ofpolar, nonpolar, and topographically complex concavities on the surfaces of proteins. , Cl − , Br − , NO 3 − , I − , ClO 4 − , and SCN − . (B) The association of anions with the Zn 2+ cofactor involves two states: an initial state (left) with the anion and protein in aqueous solution, and a final state (right) with the anion−protein complex in aqueous solution. Thermodynamic parameters measured with ITC (ΔJ°b ind , where J = H, TS, or G) represent a difference between the initial and final states.

Research paper thumbnail of Exploring Protein Flexibility: Incorporating Structural Ensembles From Crystal Structures and Simulation into Virtual Screening Protocols

The Journal of Physical Chemistry B, 2012

The capacity of proteins to adapt their structure in response to various perturbations including ... more The capacity of proteins to adapt their structure in response to various perturbations including covalent modifications, and interactions with ligands and other proteins plays a key role in biological processes. Here, we explore the ability of molecular dynamics (MD), replica exchange molecular dynamics (REMD), and a library of structures of crystal-ligand complexes, to sample the protein conformational landscape and especially the accessible ligand binding site geometry. The extent of conformational space sampled is measured by the diversity of the shapes of the ligand binding sites. Since our focus here is the effect of this plasticity on the ability to identify active compounds through virtual screening, we use the structures generated by these techniques to generate a small ensemble for further docking studies, using binding site shape hierarchical clustering to determine four structures for each ensemble. These are then assessed for their capacity to optimize enrichment and diversity in docking. We test these protocols on three different receptors: androgen receptor (AR), HIV protease, and CDK2. We show that REMD enhances structural sampling slightly as compared both to MD, and the distortions induced by ligand binding as reflected in the crystal structures. The improved sampling of the simulation methods does not translate directly into improved docking performance, however. The ensemble approach did improve enrichment and diversity, and the ensemble derived from the crystal structures performed somewhat better than those derived from the simulations.

Research paper thumbnail of Improved Docking of Polypeptides with Glide

Journal of Chemical Information and Modeling, 2013

Predicting the binding mode of flexible polypeptides to proteins is an important task that falls ... more Predicting the binding mode of flexible polypeptides to proteins is an important task that falls outside the domain of applicability of most small molecule and protein−protein docking tools. Here, we test the small molecule flexible ligand docking program Glide on a set of 19 non-α-helical peptides and systematically improve pose prediction accuracy by enhancing Glide sampling for flexible polypeptides. In addition, scoring of the poses was improved by post-processing with physics-based implicit solvent MM-GBSA calculations. Using the best RMSD among the top 10 scoring poses as a metric, the success rate (RMSD ≤ 2.0 Å for the interface backbone atoms) increased from 21% with default Glide SP settings to 58% with the enhanced peptide sampling and scoring protocol in the case of redocking to the native protein structure. This approaches the accuracy of the recently developed Rosetta FlexPepDock method (63% success for these 19 peptides) while being over 100 times faster. Cross-docking was performed for a subset of cases where an unbound receptor structure was available, and in that case, 40% of peptides were docked successfully. We analyze the results and find that the optimized polypeptide protocol is most accurate for extended peptides of limited size and number of formal charges, defining a domain of applicability for this approach.

Research paper thumbnail of Glide XP fragment docking and structurebased pharmacophores

Chemistry Central Journal, 2009

In recent years, fragment-based drug design has become increasingly popular. Common computational... more In recent years, fragment-based drug design has become increasingly popular. Common computational approaches include building fragments up sequentially, or linking disparate fragments. However, the former approach can restrict the exploration of chemical space and may produce ligands that are not sufficiently druglike, whereas the latter approach may result in difficulties when linking together the individual fragments.

Research paper thumbnail of Fast and accurate method for flexible ligand superposition and shapebased screening

Chemistry Central Journal, 2009

The use of 3-dimensional molecular shapes has been demonstrated to be useful in comparing small m... more The use of 3-dimensional molecular shapes has been demonstrated to be useful in comparing small molecules. We have developed a novel method that allows for rapid superposition and scoring of flexible molecules.

Research paper thumbnail of Generation of Receptor Structural Ensembles for Virtual Screening Using Binding Site Shape Analysis and Clustering

Chemical Biology & Drug Design, 2012

Accounting for protein flexibility is an essential yet challenging component of structure-based v... more Accounting for protein flexibility is an essential yet challenging component of structure-based virtual screening. Whereas an ideal approach would account for full protein and ligand flexibility during the virtual screening process, this is currently intractable using available computational resources. An alternative is ensemble docking, where calculations are performed on a set of individual rigid receptor conformations and the results combined. The primary challenge associated with this approach is the choice of receptor structures to use for the docking calculations. In this work, we show that selection of a small set of structures based on clustering on binding site volume overlaps provides an efficient and effective way to account for protein flexibility in virtual screening. We first apply the method to crystal structures of cyclin-dependent kinase 2 and HIV protease and show that virtual screening for ensembles of four cluster representative structures yields consistently high enrichments and diverse actives. We then apply the method to a structural ensemble of the androgen receptor generated with molecular dynamics and obtain results that are in agreement with those from the crystal structures of cyclin-dependent kinase 2 and HIV protease. This work provides a step forward in the incorporation of protein flexibility into structure-based virtual screening.

Research paper thumbnail of Allosteric Inhibition of the NS2B-NS3 Protease from Dengue Virus

ACS Chemical Biology, 2013

Dengue virus is the flavivirus that causes dengue fever, dengue hemorrhagic disease, and dengue s... more Dengue virus is the flavivirus that causes dengue fever, dengue hemorrhagic disease, and dengue shock syndrome, which are currently increasing in incidence worldwide. Dengue virus protease (NS2B-NS3pro) is essential for dengue virus infection and is thus a target of therapeutic interest. To date, attention has focused on developing active-site inhibitors of NS2B-NS3pro. The flat and charged nature of the NS2B-NS3pro active site may contribute to difficulties in developing inhibitors and suggests that a strategy of identifying allosteric sites may be useful. We report an approach that allowed us to scan the NS2B-NS3pro surface by cysteine mutagenesis and use cysteine reactive probes to identify regions of the protein that are susceptible to allosteric inhibition. This method identified a new allosteric site utilizing a circumscribed panel of just eight cysteine variants and only five cysteine reactive probes. The allosterically sensitive site is centered at Ala125, between the 120s loop and the 150s loop. The crystal structures of WT and modified NS2B-NS3pro demonstrate that the 120s loop is flexible. Our work suggests that binding at this site prevents a conformational rearrangement of the NS2B region of the protein, which is required for activation. Preventing this movement locks the protein into the open, inactive conformation, suggesting that this site may be useful in the future development of therapeutic allosteric inhibitors.

Research paper thumbnail of Simulation Study of Stapled Alpha-Helical P53 Peptide Analogs: probing the Relationship between Structural Stability and Biological Potency

computational protein-protein docking approaches can provide a useful alternative to address this... more computational protein-protein docking approaches can provide a useful alternative to address this issue. We present a novel protein docking algorithm, VDOCK, based on the use of 3D Zernike descriptors (3DZD) as regional features of molecular shape. The key motivation of using these descriptors is their invariance to transformation, in addition to a compact representation of local surface shape characteristics. In our previous works we have shown that 3DZD are suitable for comparing global/local protein surface shape and surface physicochemical properties to quantify their similarity. Here we apply 3DZD for quantifying surface complementarity. Docking decoys are generated using geometric hashing, which are then initially screened by a shape-based scoring function that incorporates buried surface area and 3DZD. The benchmark studies show that 3DZD are not only efficient in identifying shape complementarity for bound docking cases but superior to other existing methods in accommodating a certain level of flexibility of the protein surface in unbound docking cases, taking advantage of 3DZD's controlable resolution of the surface description. In the next stage, generated docking decoys are evaluated using a physics-based scoring function. The weighting factors to combine these terms are trained using several different target metrics on a large dataset of docking decoys. Additional information and steps for selecting models are also employed, which include protein-protein interaction site predictions and optimization of global and side-chain conformations. One of the challenges in protein structure prediction is the development of an accurate scoring function that yields a global minimum free energy for the native state. Although the knowledge-based scoring function has proven to be a successful scoring approach to protein structure prediction, there exists a hurdle in deriving knowledge-based potentials. Namely, the ideal reference state is inaccessible. In this work, we have developed a general physics-based iterative method to extract distance-dependent all-atom potentials for protein structure prediction. Our method circumvented the long-standing reference state problem. The derived scoring function was extensively evaluated with three diverse test sets, and showed significant improvement over other wellknown scoring functions. The results suggest the efficacy of our scoring function for protein structure prediction.

Research paper thumbnail of Accurate and Reliable Prediction of Relative Ligand Binding Potency in Prospective Drug Discovery by way of a Modern Free Energy Calculation Protocol and Force Field

Journal of the American Chemical Society, 2015

Designing tight-binding ligands is a primary objective of small-molecule drug discovery. Over the... more Designing tight-binding ligands is a primary objective of small-molecule drug discovery. Over the past few decades, free-energy calculations have benefited from improved force fields and sampling algorithms, as well as the advent of low-cost parallel computing. However, it has proven to be challenging to reliably achieve the level of accuracy that would be needed to guide lead optimization (∼5× in binding affinity) for a wide range of ligands and protein targets. Not surprisingly, widespread commercial application of free-energy simulations has been limited due to the lack of large-scale validation coupled with the technical challenges traditionally associated with running these types of calculations. Here, we report an approach that achieves an unprecedented level of accuracy across a broad range of target classes and ligands, with retrospective results encompassing 200 ligands and a wide variety of chemical perturbations, many of which involve significant changes in ligand chemical structures. In addition, we have applied the method in prospective drug discovery projects and found a significant improvement in the quality of the compounds synthesized that have been predicted to be potent. Compounds predicted to be potent by this approach have a substantial reduction in false positives relative to compounds synthesized on the basis of other computational or medicinal chemistry approaches. Furthermore, the results are consistent with those obtained from our retrospective studies, demonstrating the robustness and broad range of applicability of this approach, which can be used to drive decisions in lead optimization.

Research paper thumbnail of Improving docking results via reranking of ensembles of ligand poses in multiple X-ray protein conformations with MM-GBSA

Journal of chemical information and modeling, Jan 27, 2014

There is a tendency in the literature to be critical of scoring functions when docking programs p... more There is a tendency in the literature to be critical of scoring functions when docking programs perform poorly. The assumption is that existing scoring functions need to be enhanced or new ones developed in order to improve the performance of docking programs for tasks such as pose prediction and virtual screening. However, failures can result from either sampling or scoring (or a combination of the two), although less emphasis tends to be given to the former. In this work, we use the programs GOLD and Glide on a high-quality data set to explore whether failures in pose prediction and binding affinity estimation can be attributable more to sampling or scoring. We show that identification of the correct pose (docking power) can be improved by incorporating ligand strain into the scoring function or rescoring an ensemble of diverse docking poses with MM-GBSA in a postprocessing step. We explore the use of nondefault docking settings and find that enhancing ligand sampling also improve...

Research paper thumbnail of Physics-based enzyme design: predicting binding affinity and catalytic activity

Proteins, 2014

Computational enzyme design is an emerging field that has yielded promising success stories, but ... more Computational enzyme design is an emerging field that has yielded promising success stories, but where numerous challenges remain. Accurate methods to rapidly evaluate possible enzyme design variants could provide significant value when combined with experimental efforts by reducing the number of variants needed to be synthesized and speeding the time to reach the desired endpoint of the design. To that end, extending our computational methods to model the fundamental physical-chemical principles that regulate activity in a protocol that is automated and accessible to a broad population of enzyme design researchers is essential. Here, we apply a physics-based implicit solvent MM-GBSA scoring approach to enzyme design and benchmark the computational predictions against experimentally determined activities. Specifically, we evaluate the ability of MM-GBSA to predict changes in affinity for a steroid binder protein, catalytic turnover for a Kemp eliminase, and catalytic activity for α-...

Research paper thumbnail of Contributions of water transfer energy to protein-ligand association and dissociation barriers: Watermap analysis of a series of p38α MAP kinase inhibitors

Proteins, 2013

In our previous work, we proposed that desolvation and resolvation of the binding sites of protei... more In our previous work, we proposed that desolvation and resolvation of the binding sites of proteins can serve as the slowest steps during ligand association and dissociation, respectively, and tested this hypothesis on two protein-ligand systems with known binding kinetics behavior. In the present work, we test this hypothesis on another kinetically-determined protein-ligand system-that of p38α and eight Type II BIRB 796 inhibitor analogs. The kon values among the inhibitor analogs are narrowly distributed (10⁴ ≤ kon ≤ 10⁵ M⁻¹ s⁻¹), suggesting a common rate-determining step, whereas the koff values are widely distributed (10⁻¹ ≤ koff ≤ 10⁻⁶ s⁻¹), suggesting a spectrum of rate-determining steps. We calculated the solvation properties of the DFG-out protein conformation using an explicit solvent molecular dynamics simulation and thermodynamic analysis method implemented in WaterMap to predict the enthalpic and entropic costs of water transfer to and from bulk solvent incurred upon ass...

Research paper thumbnail of Cambridge Healthtech Institute Drug Discovery Chemistry. San Diego, CA, USA, 12-14 April, 2011

Future medicinal chemistry, 2011

Research paper thumbnail of Structure-based approach to the prediction of disulfide bonds in proteins

Protein engineering, design & selection : PEDS, 2014

Protein engineering remains an area of growing importance in pharmaceutical and biotechnology res... more Protein engineering remains an area of growing importance in pharmaceutical and biotechnology research. Stabilization of a folded protein conformation is a frequent goal in projects that deal with affinity optimization, enzyme design, protein construct design, and reducing the size of functional proteins. Indeed, it can be desirable to assess and improve protein stability in order to avoid liabilities such as aggregation, degradation, and immunogenic response that may arise during development. One way to stabilize a protein is through the introduction of disulfide bonds. Here, we describe a method to predict pairs of protein residues that can be mutated to form a disulfide bond. We combine a physics-based approach that incorporates implicit solvent molecular mechanics with a knowledge-based approach. We first assign relative weights to the terms that comprise our scoring function using a genetic algorithm applied to a set of 75 wild-type structures that each contains a disulfide bon...