WorldWideScience

Sample records for binding holo-structure prediction

  1. Modeling holo-ACP:DH and holo-ACP:KR complexes of modular polyketide synthases: a docking and molecular dynamics study

    Directory of Open Access Journals (Sweden)

    Anand Swadha

    2012-05-01

    Full Text Available Abstract Background Modular polyketide synthases are multifunctional megasynthases which biosynthesize a variety of secondary metabolites using various combinations of dehydratase (DH, ketoreductase (KR and enoyl-reductase (ER domains. During the catalysis of various reductive steps these domains act on a substrate moiety which is covalently attached to the phosphopantetheine (P-pant group of the holo-Acyl Carrier Protein (holo-ACP domain, thus necessitating the formation of holo-ACP:DH and holo-ACP:KR complexes. Even though three dimensional structures are available for DH, KR and ACP domains, no structures are available for DH or KR domains in complex with ACP or substrate moieties. Since Ser of holo-ACP is covalently attached to a large phosphopantetheine group, obtaining complexes involving holo-ACP by standard protein-protein docking has been a difficult task. Results We have modeled the holo-ACP:DH and holo-ACP:KR complexes for identifying specific residues on DH and KR domains which are involved in interaction with ACP, phosphopantetheine and substrate moiety. A novel combination of protein-protein and protein-ligand docking has been used to first model complexes involving apo-ACP and then dock the phosphopantetheine and substrate moieties using covalent connectivity between ACP, phosphopantetheine and substrate moiety as constraints. The holo-ACP:DH and holo-ACP:KR complexes obtained from docking have been further refined by restraint free explicit solvent MD simulations to incorporate effects of ligand and receptor flexibilities. The results from 50 ns MD simulations reveal that substrate enters into a deep tunnel in DH domain while in case of KR domain the substrate binds a shallow surface exposed cavity. Interestingly, in case of DH domain the predicted binding site overlapped with the binding site in the inhibitor bound crystal structure of FabZ, the DH domain from E.Coli FAS. In case of KR domain, the substrate binding site

  2. Structure, High Affinity, and Negative Cooperativity of the Escherichia coli Holo-(Acyl Carrier Protein):Holo-(Acyl Carrier Protein) Synthase Complex

    Energy Technology Data Exchange (ETDEWEB)

    Marcella, Aaron M.; Culbertson, Sannie J.; Shogren-Knaak, Michael A.; Barb, Adam W.

    2017-11-01

    The Escherichia coli holo-(acyl carrier protein) synthase (ACPS) catalyzes the coenzyme A-dependent activation of apo-ACPP to generate holo-(acyl carrier protein) (holo-ACPP) in an early step of fatty acid biosynthesis. E. coli ACPS is sufficiently different from the human fatty acid synthase to justify the development of novel ACPS-targeting antibiotics. Models of E. coli ACPS in unliganded and holo-ACPP-bound forms solved by X-ray crystallography to 2.05 and 4.10 Å, respectively, revealed that ACPS bound three product holo-ACPP molecules to form a 3:3 hexamer. Solution NMR spectroscopy experiments validated the ACPS binding interface on holo-ACPP using chemical shift perturbations and by determining the relative orientation of holo-ACPP to ACPS by fitting residual dipolar couplings. The binding interface is organized to arrange contacts between positively charged ACPS residues and the holo-ACPP phosphopantetheine moiety, indicating product contains more stabilizing interactions than expected in the enzyme:substrate complex. Indeed, holo-ACPP bound the enzyme with greater affinity than the substrate, apo-ACPP, and with negative cooperativity. The first equivalent of holo-ACPP bound with a KD = 62 ± 13 nM, followed by the binding of two more equivalents of holo-ACPP with KD = 1.2 ± 0.2 μM. Cooperativity was not observed for apo-ACPP which bound with KD = 2.4 ± 0.1 μM. Strong product binding and high levels of holo-ACPP in the cell identify a potential regulatory role of ACPS in fatty acid biosynthesis.

  3. Structure, High Affinity, and Negative Cooperativity of the Escherichia coli Holo-(Acyl Carrier Protein):Holo-(Acyl Carrier Protein) Synthase Complex.

    Science.gov (United States)

    Marcella, Aaron M; Culbertson, Sannie J; Shogren-Knaak, Michael A; Barb, Adam W

    2017-11-24

    The Escherichia coli holo-(acyl carrier protein) synthase (ACPS) catalyzes the coenzyme A-dependent activation of apo-ACPP to generate holo-(acyl carrier protein) (holo-ACPP) in an early step of fatty acid biosynthesis. E. coli ACPS is sufficiently different from the human fatty acid synthase to justify the development of novel ACPS-targeting antibiotics. Models of E. coli ACPS in unliganded and holo-ACPP-bound forms solved by X-ray crystallography to 2.05and 4.10Å, respectively, revealed that ACPS bound three product holo-ACPP molecules to form a 3:3 hexamer. Solution NMR spectroscopy experiments validated the ACPS binding interface on holo-ACPP using chemical shift perturbations and by determining the relative orientation of holo-ACPP to ACPS by fitting residual dipolar couplings. The binding interface is organized to arrange contacts between positively charged ACPS residues and the holo-ACPP phosphopantetheine moiety, indicating product contains more stabilizing interactions than expected in the enzyme:substrate complex. Indeed, holo-ACPP bound the enzyme with greater affinity than the substrate, apo-ACPP, and with negative cooperativity. The first equivalent of holo-ACPP bound with a K D =62±13nM, followed by the binding of two more equivalents of holo-ACPP with K D =1.2±0.2μM. Cooperativity was not observed for apo-ACPP which bound with K D =2.4±0.1μM. Strong product binding and high levels of holo-ACPP in the cell identify a potential regulatory role of ACPS in fatty acid biosynthesis. Copyright © 2017 Elsevier Ltd. All rights reserved.

  4. PockDrug-Server: a new web server for predicting pocket druggability on holo and apo proteins.

    Science.gov (United States)

    Hussein, Hiba Abi; Borrel, Alexandre; Geneix, Colette; Petitjean, Michel; Regad, Leslie; Camproux, Anne-Claude

    2015-07-01

    Predicting protein pocket's ability to bind drug-like molecules with high affinity, i.e. druggability, is of major interest in the target identification phase of drug discovery. Therefore, pocket druggability investigations represent a key step of compound clinical progression projects. Currently computational druggability prediction models are attached to one unique pocket estimation method despite pocket estimation uncertainties. In this paper, we propose 'PockDrug-Server' to predict pocket druggability, efficient on both (i) estimated pockets guided by the ligand proximity (extracted by proximity to a ligand from a holo protein structure) and (ii) estimated pockets based solely on protein structure information (based on amino atoms that form the surface of potential binding cavities). PockDrug-Server provides consistent druggability results using different pocket estimation methods. It is robust with respect to pocket boundary and estimation uncertainties, thus efficient using apo pockets that are challenging to estimate. It clearly distinguishes druggable from less druggable pockets using different estimation methods and outperformed recent druggability models for apo pockets. It can be carried out from one or a set of apo/holo proteins using different pocket estimation methods proposed by our web server or from any pocket previously estimated by the user. PockDrug-Server is publicly available at: http://pockdrug.rpbs.univ-paris-diderot.fr. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Predicting DNA-binding proteins and binding residues by complex structure prediction and application to human proteome.

    Directory of Open Access Journals (Sweden)

    Huiying Zhao

    Full Text Available As more and more protein sequences are uncovered from increasingly inexpensive sequencing techniques, an urgent task is to find their functions. This work presents a highly reliable computational technique for predicting DNA-binding function at the level of protein-DNA complex structures, rather than low-resolution two-state prediction of DNA-binding as most existing techniques do. The method first predicts protein-DNA complex structure by utilizing the template-based structure prediction technique HHblits, followed by binding affinity prediction based on a knowledge-based energy function (Distance-scaled finite ideal-gas reference state for protein-DNA interactions. A leave-one-out cross validation of the method based on 179 DNA-binding and 3797 non-binding protein domains achieves a Matthews correlation coefficient (MCC of 0.77 with high precision (94% and high sensitivity (65%. We further found 51% sensitivity for 82 newly determined structures of DNA-binding proteins and 56% sensitivity for the human proteome. In addition, the method provides a reasonably accurate prediction of DNA-binding residues in proteins based on predicted DNA-binding complex structures. Its application to human proteome leads to more than 300 novel DNA-binding proteins; some of these predicted structures were validated by known structures of homologous proteins in APO forms. The method [SPOT-Seq (DNA] is available as an on-line server at http://sparks-lab.org.

  6. Conformational dynamics of Escherichia coli flavodoxins in apo- and holo-states by solution NMR spectroscopy.

    Directory of Open Access Journals (Sweden)

    Qian Ye

    Full Text Available Flavodoxins are a family of small FMN-binding proteins that commonly exist in prokaryotes. They utilize a non-covalently bound FMN molecule to act as the redox center during the electron transfer processes in various important biological pathways. Although extensive investigations were performed, detailed molecular mechanisms of cofactor binding and electron transfer remain elusive. Herein we report the solution NMR studies on Escherichia coli flavodoxins FldA and YqcA, belonging to the long-chain and short-chain flavodoxin subfamilies respectively. Our structural studies demonstrate that both proteins show the typical flavodoxin fold, with extensive conformational exchanges observed near the FMN binding pocket in their apo-forms. Cofactor binding significantly stabilizes both proteins as revealed by the extension of secondary structures in the holo-forms, and the overall rigidity shown by the backbone dynamics data. However, the 50 s loops of both proteins in the holo-form still show conformational exchanges on the µs-ms timescales, which appears to be a common feature in the flavodoxin family, and might play an important role in structural fine-tuning during the electron transfer reactions.

  7. An unexpected phosphate binding site in Glyceraldehyde 3-Phosphate Dehydrogenase: Crystal structures of apo, holo and ternary complex of Cryptosporidium parvum enzyme

    Energy Technology Data Exchange (ETDEWEB)

    Cook, William J; Senkovich, Olga; Chattopadhyay, Debasish; (UAB)

    2009-06-08

    The structure, function and reaction mechanism of glyceraldehyde 3-phosphate dehydrogenase (GAPDH) have been extensively studied. Based on these studies, three anion binding sites have been identified, one 'Ps' site (for binding the C-3 phosphate of the substrate) and two sites, 'Pi' and 'new Pi', for inorganic phosphate. According to the original flip-flop model, the substrate phosphate group switches from the 'Pi' to the 'Ps' site during the multistep reaction. In light of the discovery of the 'new Pi' site, a modified flip-flop mechanism, in which the C-3 phosphate of the substrate binds to the 'new Pi' site and flips to the 'Ps' site before the hydride transfer, was proposed. An alternative model based on a number of structures of B. stearothermophilus GAPDH ternary complexes (non-covalent and thioacyl intermediate) proposes that in the ternary Michaelis complex the C-3 phosphate binds to the 'Ps' site and flips from the 'Ps' to the 'new Pi' site during or after the redox step. We determined the crystal structure of Cryptosporidium parvum GAPDH in the apo and holo (enzyme + NAD) state and the structure of the ternary enzyme-cofactor-substrate complex using an active site mutant enzyme. The C. parvum GAPDH complex was prepared by pre-incubating the enzyme with substrate and cofactor, thereby allowing free movement of the protein structure and substrate molecules during their initial encounter. Sulfate and phosphate ions were excluded from purification and crystallization steps. The quality of the electron density map at 2{angstrom} resolution allowed unambiguous positioning of the substrate. In three subunits of the homotetramer the C-3 phosphate group of the non-covalently bound substrate is in the 'new Pi' site. A concomitant movement of the phosphate binding loop is observed in these three subunits. In the fourth subunit the C-3 phosphate

  8. An unexpected phosphate binding site in Glyceraldehyde 3-Phosphate Dehydrogenase: Crystal structures of apo, holo and ternary complex of Cryptosporidium parvum enzyme

    Directory of Open Access Journals (Sweden)

    Chattopadhyay Debasish

    2009-02-01

    Full Text Available Abstract Background The structure, function and reaction mechanism of glyceraldehyde 3-phosphate dehydrogenase (GAPDH have been extensively studied. Based on these studies, three anion binding sites have been identified, one 'Ps' site (for binding the C-3 phosphate of the substrate and two sites, 'Pi' and 'new Pi', for inorganic phosphate. According to the original flip-flop model, the substrate phosphate group switches from the 'Pi' to the 'Ps' site during the multistep reaction. In light of the discovery of the 'new Pi' site, a modified flip-flop mechanism, in which the C-3 phosphate of the substrate binds to the 'new Pi' site and flips to the 'Ps' site before the hydride transfer, was proposed. An alternative model based on a number of structures of B. stearothermophilus GAPDH ternary complexes (non-covalent and thioacyl intermediate proposes that in the ternary Michaelis complex the C-3 phosphate binds to the 'Ps' site and flips from the 'Ps' to the 'new Pi' site during or after the redox step. Results We determined the crystal structure of Cryptosporidium parvum GAPDH in the apo and holo (enzyme + NAD state and the structure of the ternary enzyme-cofactor-substrate complex using an active site mutant enzyme. The C. parvum GAPDH complex was prepared by pre-incubating the enzyme with substrate and cofactor, thereby allowing free movement of the protein structure and substrate molecules during their initial encounter. Sulfate and phosphate ions were excluded from purification and crystallization steps. The quality of the electron density map at 2Å resolution allowed unambiguous positioning of the substrate. In three subunits of the homotetramer the C-3 phosphate group of the non-covalently bound substrate is in the 'new Pi' site. A concomitant movement of the phosphate binding loop is observed in these three subunits. In the fourth subunit the C-3 phosphate occupies an unexpected site not seen before and the phosphate binding loop remains in

  9. Backbone and sidechain 1H, 13C and 15N resonance assignments of the human brain-type fatty acid binding protein (FABP7) in its apo form and the holo forms binding to DHA, oleic acid, linoleic acid and elaidic acid

    DEFF Research Database (Denmark)

    Oeemig, Jesper S; Jørgensen, Mathilde L; Hansen, Mikka S

    2009-01-01

    In this manuscript, we present the backbone and side chain assignments of human brain-type fatty acid binding protein, also known as FABP7, in its apo form and in four different holo forms, bound to DHA, oleic acid, linoleic acid and elaidic acid.......In this manuscript, we present the backbone and side chain assignments of human brain-type fatty acid binding protein, also known as FABP7, in its apo form and in four different holo forms, bound to DHA, oleic acid, linoleic acid and elaidic acid....

  10. Predicting binding within disordered protein regions to structurally characterised peptide-binding domains.

    Directory of Open Access Journals (Sweden)

    Waqasuddin Khan

    Full Text Available Disordered regions of proteins often bind to structured domains, mediating interactions within and between proteins. However, it is difficult to identify a priori the short disordered regions involved in binding. We set out to determine if docking such peptide regions to peptide binding domains would assist in these predictions.We assembled a redundancy reduced dataset of SLiM (Short Linear Motif containing proteins from the ELM database. We selected 84 sequences which had an associated PDB structures showing the SLiM bound to a protein receptor, where the SLiM was found within a 50 residue region of the protein sequence which was predicted to be disordered. First, we investigated the Vina docking scores of overlapping tripeptides from the 50 residue SLiM containing disordered regions of the protein sequence to the corresponding PDB domain. We found only weak discrimination of docking scores between peptides involved in binding and adjacent non-binding peptides in this context (AUC 0.58.Next, we trained a bidirectional recurrent neural network (BRNN using as input the protein sequence, predicted secondary structure, Vina docking score and predicted disorder score. The results were very promising (AUC 0.72 showing that multiple sources of information can be combined to produce results which are clearly superior to any single source.We conclude that the Vina docking score alone has only modest power to define the location of a peptide within a larger protein region known to contain it. However, combining this information with other knowledge (using machine learning methods clearly improves the identification of peptide binding regions within a protein sequence. This approach combining docking with machine learning is primarily a predictor of binding to peptide-binding sites, and is not intended as a predictor of specificity of binding to particular receptors.

  11. Predicting nucleic acid binding interfaces from structural models of proteins.

    Science.gov (United States)

    Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

    2012-02-01

    The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.

  12. Optical architecture of HoloLens mixed reality headset

    Science.gov (United States)

    Kress, Bernard C.; Cummings, William J.

    2017-06-01

    HoloLens by Microsoft Corp. is the world's first untethered Mixed Reality (MR) Head Mounted Display (HMD) system, released to developers in March 2016 as a Development Kit. We review in this paper the various display requirements and subsequent optical hardware choices we made for HoloLens. Its main achievements go along performance and comfort for the user: it is the first fully untethered MR headset, with the highest angular resolution and the industry's largest eyebox. It has the first inside-out global sensor fusion system including precise head tracking and 3D mapping all controlled by a fully custom on-board GPU. Based on such achievements, HoloLens came out as the most advanced MR system today. Additional features may be implemented in next generations MR headsets, leading to the ultimate experience for the user, and securing the upcoming fabulous AR/MR market predicted by most analysts.

  13. HemeBIND: a novel method for heme binding residue prediction by combining structural and sequence information

    Directory of Open Access Journals (Sweden)

    Hu Jianjun

    2011-05-01

    Full Text Available Abstract Background Accurate prediction of binding residues involved in the interactions between proteins and small ligands is one of the major challenges in structural bioinformatics. Heme is an essential and commonly used ligand that plays critical roles in electron transfer, catalysis, signal transduction and gene expression. Although much effort has been devoted to the development of various generic algorithms for ligand binding site prediction over the last decade, no algorithm has been specifically designed to complement experimental techniques for identification of heme binding residues. Consequently, an urgent need is to develop a computational method for recognizing these important residues. Results Here we introduced an efficient algorithm HemeBIND for predicting heme binding residues by integrating structural and sequence information. We systematically investigated the characteristics of binding interfaces based on a non-redundant dataset of heme-protein complexes. It was found that several sequence and structural attributes such as evolutionary conservation, solvent accessibility, depth and protrusion clearly illustrate the differences between heme binding and non-binding residues. These features can then be separately used or combined to build the structure-based classifiers using support vector machine (SVM. The results showed that the information contained in these features is largely complementary and their combination achieved the best performance. To further improve the performance, an attempt has been made to develop a post-processing procedure to reduce the number of false positives. In addition, we built a sequence-based classifier based on SVM and sequence profile as an alternative when only sequence information can be used. Finally, we employed a voting method to combine the outputs of structure-based and sequence-based classifiers, which demonstrated remarkably better performance than the individual classifier alone

  14. Adsorption of apo- and holo-tear lipocalin to a bovine Meibomian lipid film.

    Science.gov (United States)

    Mudgil, Poonam; Millar, Thomas J

    2008-04-01

    Adsorption of apo- and holo-tear lipocalin (Tlc) to bovine Meibomian lipid film was studied. A Langmuir trough was used for these studies and the adsorption of protein was observed by recording changes in the pressure with time (pi-T profile). The films were photographed at different stages of adsorption by doping Meibomian lipids with a fluorescently tagged lipid. The results indicated that apo-Tlc adsorbed much more quickly than holo-Tlc to the Meibomian lipid film. Contrary to the expectation that holo-Tlc would release lipids to the surface and surface pressure would be higher, it was found that the surface pressure was higher with the adsorption of apo-Tlc to the surface. Photography of the films showed that apo- and holo-Tlc interacted differently with the Meibomian lipid layer. Adsorption of holo-Tlc resulted in big bright patches and adsorption of apo-Tlc resulted in many small patches along with the big patches. Both forms of Tlc produced a more stable film as indicated by decreased movement of the protein adsorbed films, and a higher maximum surface pressure upon compression of these films compared with Meibomian lipid films alone. Isocyles of apo-Tlc adsorbed films gave a higher surface pressure than that of holo-Tlc. From these results, it is concluded that both apo- and holo-Tlc adsorbed to the Meibomian lipid layer and the delivery of the lipids from Tlc to the outer lipid layer could not be detected by our techniques. Its scavenging role to remove lipids from the corneal surface and bind with them might be beneficial for increasing tear viscosity but whether those lipids are delivered to the outermost lipid layer still remains unclear.

  15. Multiscale high-order/low-order (HOLO) algorithms and applications

    International Nuclear Information System (INIS)

    Chacón, L.; Chen, G.; Knoll, D.A.; Newman, C.; Park, H.; Taitano, W.; Willert, J.A.; Womeldorff, G.

    2017-01-01

    We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models (the high-order model, HO) with low-complexity ones (the low-order model, LO). The primary goal of HOLO algorithms is to achieve nonlinear convergence between HO and LO components while minimizing memory footprint and managing the computational complexity in a practical manner. Key to the HOLO approach is the use of the LO representations to address temporal stiffness, effectively accelerating the convergence of the HO/LO coupled system. The HOLO approach is broadly underpinned by the concept of nonlinear elimination, which enables segregation of the HO and LO components in ways that can effectively use heterogeneous architectures. The accuracy and efficiency benefits of HOLO algorithms are demonstrated with specific applications to radiation transport, gas dynamics, plasmas (both Eulerian and Lagrangian formulations), and ocean modeling. Across this broad application spectrum, HOLO algorithms achieve significant accuracy improvements at a fraction of the cost compared to conventional approaches. It follows that HOLO algorithms hold significant potential for high-fidelity system scale multiscale simulations leveraging exascale computing.

  16. Multiscale high-order/low-order (HOLO) algorithms and applications

    Energy Technology Data Exchange (ETDEWEB)

    Chacón, L., E-mail: chacon@lanl.gov [Los Alamos National Laboratory, Los Alamos, NM 87545 (United States); Chen, G.; Knoll, D.A.; Newman, C.; Park, H.; Taitano, W. [Los Alamos National Laboratory, Los Alamos, NM 87545 (United States); Willert, J.A. [Institute for Defense Analyses, Alexandria, VA 22311 (United States); Womeldorff, G. [Los Alamos National Laboratory, Los Alamos, NM 87545 (United States)

    2017-02-01

    We review the state of the art in the formulation, implementation, and performance of so-called high-order/low-order (HOLO) algorithms for challenging multiscale problems. HOLO algorithms attempt to couple one or several high-complexity physical models (the high-order model, HO) with low-complexity ones (the low-order model, LO). The primary goal of HOLO algorithms is to achieve nonlinear convergence between HO and LO components while minimizing memory footprint and managing the computational complexity in a practical manner. Key to the HOLO approach is the use of the LO representations to address temporal stiffness, effectively accelerating the convergence of the HO/LO coupled system. The HOLO approach is broadly underpinned by the concept of nonlinear elimination, which enables segregation of the HO and LO components in ways that can effectively use heterogeneous architectures. The accuracy and efficiency benefits of HOLO algorithms are demonstrated with specific applications to radiation transport, gas dynamics, plasmas (both Eulerian and Lagrangian formulations), and ocean modeling. Across this broad application spectrum, HOLO algorithms achieve significant accuracy improvements at a fraction of the cost compared to conventional approaches. It follows that HOLO algorithms hold significant potential for high-fidelity system scale multiscale simulations leveraging exascale computing.

  17. CDOCKER and lambda λ -dynamics for prospective prediction in D3R Grand Challenge 2

    Science.gov (United States)

    Ding, Xinqiang; Hayes, Ryan L.; Vilseck, Jonah Z.; Charles, Murchtricia K.; Brooks, Charles L.

    2018-01-01

    The opportunity to prospectively predict ligand bound poses and free energies of binding to the Farnesoid X Receptor in the D3R Grand Challenge 2 provided a useful exercise to evaluate CHARMM based docking (CDOCKER) and λ-dynamics methodologies for use in "real-world" applications in computer aided drug design. In addition to measuring their current performance, several recent methodological developments have been analyzed retrospectively to highlight best procedural practices in future applications. For pose prediction with CDOCKER, when the protein structure used for rigid receptor docking was close to the crystallographic holo structure, reliable poses were obtained. Benzimidazoles, with a known holo receptor structure, were successfully docked with an average RMSD of 0.97 Å. Other non-benzimidazole ligands displayed less accuracy largely because the receptor structures we chose for docking were too different from the experimental holo structures. However, retrospective analysis has shown that when these ligands were re-docked into their holo structures, the average RMSD dropped to 1.18 Å for all ligands. When sulfonamides and spiros were docked with the apo structure, which agrees more with their holo structure than the structures we chose, five out of six ligands were correctly docked. These docking results emphasize the need for flexible receptor docking approaches. For λ-dynamics techniques, including multisite λ-dynamics (MSλD), reasonable agreement with experiment was observed for the 33 ligands investigated; root mean square errors of 2.08 and 1.67 kcal/mol were obtained for free energy sets 1 and 2, respectively. Retrospectively, soft-core potentials, adaptive landscape flattening, and biasing potential replica exchange (BP-REX) algorithms were critical to model large substituent perturbations with sufficient precision and within restrictive timeframes, such as was required with participation in Grand Challenge 2. These developments, their

  18. Internationalization strategy: HOLOS Mobile - Angola

    OpenAIRE

    Almeida, Diogo Boto Machado Carneiro de

    2009-01-01

    A Work Project, presented as part of the requirements for the Award of a Masters Degree in Management from the NOVA – School of Business and Economics Nowadays, direct marketing tools are being used by companies that want to expand their businesses, aim to distinguish its customer service and improve its stakeholders’ relationship. HOLOS Mobile, developed by the Portuguese software company HOLOS S.A. throughout a partnership with Google, is an innovative product that can be used in any ...

  19. Structures of holo wild-type human cellular retinol-binding protein II (hCRBPII) bound to retinol and retinal.

    Science.gov (United States)

    Nossoni, Zahra; Assar, Zahra; Yapici, Ipek; Nosrati, Meisam; Wang, Wenjing; Berbasova, Tetyana; Vasileiou, Chrysoula; Borhan, Babak; Geiger, James

    2014-12-01

    Cellular retinol-binding proteins (CRBPs) I and II, which are members of the intracellular lipid-binding protein (iLBP) family, are retinoid chaperones that are responsible for the intracellular transport and delivery of both retinol and retinal. Although structures of retinol-bound CRBPI and CRBPII are known, no structure of a retinal-bound CRBP has been reported. In addition, the retinol-bound human CRBPII (hCRBPII) structure shows partial occupancy of a noncanonical conformation of retinol in the binding pocket. Here, the structure of retinal-bound hCRBPII and the structure of retinol-bound hCRBPII with retinol fully occupying the binding pocket are reported. It is further shown that the retinoid derivative seen in both the zebrafish CRBP and the hCRBPII structures is likely to be the product of flux-dependent and wavelength-dependent X-ray damage during data collection. The structures of retinoid-bound CRBPs are compared and contrasted, and rationales for the differences in binding affinities for retinal and retinol are provided.

  20. Electrostatics, structure prediction, and the energy landscapes for protein folding and binding.

    Science.gov (United States)

    Tsai, Min-Yeh; Zheng, Weihua; Balamurugan, D; Schafer, Nicholas P; Kim, Bobby L; Cheung, Margaret S; Wolynes, Peter G

    2016-01-01

    While being long in range and therefore weakly specific, electrostatic interactions are able to modulate the stability and folding landscapes of some proteins. The relevance of electrostatic forces for steering the docking of proteins to each other is widely acknowledged, however, the role of electrostatics in establishing specifically funneled landscapes and their relevance for protein structure prediction are still not clear. By introducing Debye-Hückel potentials that mimic long-range electrostatic forces into the Associative memory, Water mediated, Structure, and Energy Model (AWSEM), a transferable protein model capable of predicting tertiary structures, we assess the effects of electrostatics on the landscapes of thirteen monomeric proteins and four dimers. For the monomers, we find that adding electrostatic interactions does not improve structure prediction. Simulations of ribosomal protein S6 show, however, that folding stability depends monotonically on electrostatic strength. The trend in predicted melting temperatures of the S6 variants agrees with experimental observations. Electrostatic effects can play a range of roles in binding. The binding of the protein complex KIX-pKID is largely assisted by electrostatic interactions, which provide direct charge-charge stabilization of the native state and contribute to the funneling of the binding landscape. In contrast, for several other proteins, including the DNA-binding protein FIS, electrostatics causes frustration in the DNA-binding region, which favors its binding with DNA but not with its protein partner. This study highlights the importance of long-range electrostatics in functional responses to problems where proteins interact with their charged partners, such as DNA, RNA, as well as membranes. © 2015 The Protein Society.

  1. G-LoSA for Prediction of Protein-Ligand Binding Sites and Structures.

    Science.gov (United States)

    Lee, Hui Sun; Im, Wonpil

    2017-01-01

    Recent advances in high-throughput structure determination and computational protein structure prediction have significantly enriched the universe of protein structure. However, there is still a large gap between the number of available protein structures and that of proteins with annotated function in high accuracy. Computational structure-based protein function prediction has emerged to reduce this knowledge gap. The identification of a ligand binding site and its structure is critical to the determination of a protein's molecular function. We present a computational methodology for predicting small molecule ligand binding site and ligand structure using G-LoSA, our protein local structure alignment and similarity measurement tool. All the computational procedures described here can be easily implemented using G-LoSA Toolkit, a package of standalone software programs and preprocessed PDB structure libraries. G-LoSA and G-LoSA Toolkit are freely available to academic users at http://compbio.lehigh.edu/GLoSA . We also illustrate a case study to show the potential of our template-based approach harnessing G-LoSA for protein function prediction.

  2. The interrelationship between ligand binding and thermal unfolding of the folate binding protein. The role of self-association and pH

    DEFF Research Database (Denmark)

    Holm, Jan; Babol, Linnea N.; Markova, Natalia

    2014-01-01

    The present study utilized a combination of DLS (dynamic light scattering) and DSC (differential scanning calorimetry) to address thermostability of high-affinity folate binding protein (FBP), a transport protein and cellular receptor for the vitamin folate. At pH7.4 (pI=7-8) ligand binding......, intermolecular forces involved in concentration-dependent multimerization thus contribute to the thermostability of holo-FBP. Hence, thermal unfolding and dissociation of holo-FBP multimers occur simultaneously consistent with a gradual decrease from octameric to monomeric holo-FBP (10μM) in DLS after a step-wise...

  3. Structural and binding studies of a C-type galactose-binding lectin from Bothrops jararacussu snake venom.

    Science.gov (United States)

    Sartim, Marco A; Pinheiro, Matheus P; de Pádua, Ricardo A P; Sampaio, Suely V; Nonato, M Cristina

    2017-02-01

    BJcuL is a snake venom galactoside-binding lectin (SVgalL) isolated from Bothrops jararacussu and is involved in a wide variety of biological activities including triggering of pro-inflammatory response, disruption of microbial biofilm structure and induction of apoptosis. In the present work, we determined the crystallographic structure of BJcuL, the first holo structure of a SVgalL, and introduced the fluorescence-based thermal stability assay (Thermofluor) as a tool for screening and characterization of the binding mechanism of SVgalL ligands. BJcuL structure revealed the existence of a porous and flexible decameric arrangement composed of disulfide-linked dimers related by a five-fold symmetry. Each monomer contains the canonical carbohydrate recognition domain, a calcium ion required for BJcuL lectinic activity and a sodium ion required for protein stabilization. BJcuL thermostability was found to be induced by calcium ion and galactoside sugars which exhibit hyperbolic saturation profiles dependent on ligand concentration. Serendipitously, the gentamicin group of aminoglycoside antibiotics (gAGAs) was also identified as BJcuL ligands. On contrast, gAGAs exhibited a sigmoidal saturation profile compatible with a cooperative mechanism of binding. Thermofluor, hemagglutination inhibition assay and molecular docking strategies were used to identify a distinct binding site in BJcuL localized at the dimeric interface near the fully conserved intermolecular Cys86-Cys86 disulfide bond. The hybrid approach used in the present work provided novel insights into structural behavior and functional diversification of SVgaLs. Copyright © 2016 Elsevier Ltd. All rights reserved.

  4. Augmented Reality Technology Using Microsoft HoloLens in Anatomic Pathology.

    Science.gov (United States)

    Hanna, Matthew G; Ahmed, Ishtiaque; Nine, Jeffrey; Prajapati, Shyam; Pantanowitz, Liron

    2018-05-01

    Context Augmented reality (AR) devices such as the Microsoft HoloLens have not been well used in the medical field. Objective To test the HoloLens for clinical and nonclinical applications in pathology. Design A Microsoft HoloLens was tested for virtual annotation during autopsy, viewing 3D gross and microscopic pathology specimens, navigating whole slide images, telepathology, as well as real-time pathology-radiology correlation. Results Pathology residents performing an autopsy wearing the HoloLens were remotely instructed with real-time diagrams, annotations, and voice instruction. 3D-scanned gross pathology specimens could be viewed as holograms and easily manipulated. Telepathology was supported during gross examination and at the time of intraoperative consultation, allowing users to remotely access a pathologist for guidance and to virtually annotate areas of interest on specimens in real-time. The HoloLens permitted radiographs to be coregistered on gross specimens and thereby enhanced locating important pathologic findings. The HoloLens also allowed easy viewing and navigation of whole slide images, using an AR workstation, including multiple coregistered tissue sections facilitating volumetric pathology evaluation. Conclusions The HoloLens is a novel AR tool with multiple clinical and nonclinical applications in pathology. The device was comfortable to wear, easy to use, provided sufficient computing power, and supported high-resolution imaging. It was useful for autopsy, gross and microscopic examination, and ideally suited for digital pathology. Unique applications include remote supervision and annotation, 3D image viewing and manipulation, telepathology in a mixed-reality environment, and real-time pathology-radiology correlation.

  5. Binding and Endocytosis of Bovine Hololactoferrin by the Parasite Entamoeba histolytica

    Directory of Open Access Journals (Sweden)

    Guillermo Ortíz-Estrada

    2015-01-01

    Full Text Available Entamoeba histolytica is a human parasite that requires iron (Fe for its metabolic function and virulence. Bovine lactoferrin (B-Lf and its peptides can be found in the digestive tract after dairy products are ingested. The aim of this study was to compare virulent trophozoites recently isolated from hamster liver abscesses with nonvirulent trophozoites maintained for more than 30 years in cultures in vitro regarding their interaction with iron-charged B-Lf (B-holo-Lf. We performed growth kinetics analyses of trophozoites in B-holo-Lf and throughout several consecutive transfers. The virulent parasites showed higher growth and tolerance to iron than nonvirulent parasites. Both amoeba variants specifically bound B-holo-Lf with a similar Kd. However, averages of 9.45 × 105 and 6.65 × 106 binding sites/cell were found for B-holo-Lf in nonvirulent and virulent amoebae, respectively. Virulent amoebae bound more efficiently to human and bovine holo-Lf, human holo-transferrin, and human and bovine hemoglobin than nonvirulent amoebae. Virulent amoebae showed two types of B-holo-Lf binding proteins. Although both amoebae endocytosed this glycoprotein through clathrin-coated vesicles, the virulent amoebae also endocytosed B-holo-Lf through a cholesterol-dependent mechanism. Both amoeba variants secreted cysteine proteases cleaving B-holo-Lf. These data demonstrate that the B-Lf endocytosis is more efficient in virulent amoebae.

  6. Binding and Endocytosis of Bovine Hololactoferrin by the Parasite Entamoeba histolytica.

    Science.gov (United States)

    Ortíz-Estrada, Guillermo; Calderón-Salinas, Víctor; Shibayama-Salas, Mineko; León-Sicairos, Nidia; de la Garza, Mireya

    2015-01-01

    Entamoeba histolytica is a human parasite that requires iron (Fe) for its metabolic function and virulence. Bovine lactoferrin (B-Lf) and its peptides can be found in the digestive tract after dairy products are ingested. The aim of this study was to compare virulent trophozoites recently isolated from hamster liver abscesses with nonvirulent trophozoites maintained for more than 30 years in cultures in vitro regarding their interaction with iron-charged B-Lf (B-holo-Lf). We performed growth kinetics analyses of trophozoites in B-holo-Lf and throughout several consecutive transfers. The virulent parasites showed higher growth and tolerance to iron than nonvirulent parasites. Both amoeba variants specifically bound B-holo-Lf with a similar K d . However, averages of 9.45 × 10(5) and 6.65 × 10(6) binding sites/cell were found for B-holo-Lf in nonvirulent and virulent amoebae, respectively. Virulent amoebae bound more efficiently to human and bovine holo-Lf, human holo-transferrin, and human and bovine hemoglobin than nonvirulent amoebae. Virulent amoebae showed two types of B-holo-Lf binding proteins. Although both amoebae endocytosed this glycoprotein through clathrin-coated vesicles, the virulent amoebae also endocytosed B-holo-Lf through a cholesterol-dependent mechanism. Both amoeba variants secreted cysteine proteases cleaving B-holo-Lf. These data demonstrate that the B-Lf endocytosis is more efficient in virulent amoebae.

  7. Ligand Binding Induces Conformational Changes in Human Cellular Retinol-binding Protein 1 (CRBP1) Revealed by Atomic Resolution Crystal Structures.

    Science.gov (United States)

    Silvaroli, Josie A; Arne, Jason M; Chelstowska, Sylwia; Kiser, Philip D; Banerjee, Surajit; Golczak, Marcin

    2016-04-15

    Important in regulating the uptake, storage, and metabolism of retinoids, cellular retinol-binding protein 1 (CRBP1) is essential for trafficking vitamin A through the cytoplasm. However, the molecular details of ligand uptake and targeted release by CRBP1 remain unclear. Here we report the first structure of CRBP1 in a ligand-free form as well as ultra-high resolution structures of this protein bound to either all-trans-retinol or retinylamine, the latter a therapeutic retinoid that prevents light-induced retinal degeneration. Superpositioning of human apo- and holo-CRBP1 revealed major differences within segments surrounding the entrance to the retinoid-binding site. These included α-helix II and hairpin turns between β-strands βC-βD and βE-βF as well as several side chains, such as Phe-57, Tyr-60, and Ile-77, that change their orientations to accommodate the ligand. Additionally, we mapped hydrogen bond networks inside the retinoid-binding cavity and demonstrated their significance for the ligand affinity. Analyses of the crystallographic B-factors indicated several regions with higher backbone mobility in the apoprotein that became more rigid upon retinoid binding. This conformational flexibility of human apo-CRBP1 facilitates interaction with the ligands, whereas the more rigid holoprotein structure protects the labile retinoid moiety during vitamin A transport. These findings suggest a mechanism of induced fit upon ligand binding by mammalian cellular retinol-binding proteins. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  8. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Science.gov (United States)

    Tsai, Zing Tsung-Yeh; Shiu, Shin-Han; Tsai, Huai-Kuang

    2015-08-01

    Transcription factor (TF) binding is determined by the presence of specific sequence motifs (SM) and chromatin accessibility, where the latter is influenced by both chromatin state (CS) and DNA structure (DS) properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy) that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  9. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Directory of Open Access Journals (Sweden)

    Zing Tsung-Yeh Tsai

    2015-08-01

    Full Text Available Transcription factor (TF binding is determined by the presence of specific sequence motifs (SM and chromatin accessibility, where the latter is influenced by both chromatin state (CS and DNA structure (DS properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  10. Canadian Whole-Farm Model Holos - Development, Stakeholder Involvement, and Model Application

    Science.gov (United States)

    Kroebel, R.; Janzen, H.; Beauchemin, K. A.

    2017-12-01

    modelling approach based on the ICBM model. Also under development are sub-models to predict ammonia volatilization and water budgets. Development of Holos is expected to continue, forging an interactive link between ongoing research and the interests of stakeholders in an ever-changing agricultural environment.

  11. PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

    Directory of Open Access Journals (Sweden)

    Adeel Malik

    2010-01-01

    Full Text Available Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins as well as noncovalently (protein-carbohydrate complexes is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB, (ii Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.

  12. Structure of the plasminogen kringle 4 binding calcium-free form of the C-type lectin-like domain of tetranectin

    DEFF Research Database (Denmark)

    Nielbo, Steen Günther; Thomsen, J.K.; Graversen, J. H.

    2004-01-01

    . A conserved proline, which was found to be in the cis conformation in holoTN3, is in apoTN3 predominantly in the trans conformation. Backbone dynamics indicate that, in apoTN3 especially, two of the three calcium-binding loops and two of the three K4-binding residues exhibit increased flexibility, whereas...

  13. Using sequence-specific chemical and structural properties of DNA to predict transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Amy L Bauer

    2010-11-01

    Full Text Available An important step in understanding gene regulation is to identify the DNA binding sites recognized by each transcription factor (TF. Conventional approaches to prediction of TF binding sites involve the definition of consensus sequences or position-specific weight matrices and rely on statistical analysis of DNA sequences of known binding sites. Here, we present a method called SiteSleuth in which DNA structure prediction, computational chemistry, and machine learning are applied to develop models for TF binding sites. In this approach, binary classifiers are trained to discriminate between true and false binding sites based on the sequence-specific chemical and structural features of DNA. These features are determined via molecular dynamics calculations in which we consider each base in different local neighborhoods. For each of 54 TFs in Escherichia coli, for which at least five DNA binding sites are documented in RegulonDB, the TF binding sites and portions of the non-coding genome sequence are mapped to feature vectors and used in training. According to cross-validation analysis and a comparison of computational predictions against ChIP-chip data available for the TF Fis, SiteSleuth outperforms three conventional approaches: Match, MATRIX SEARCH, and the method of Berg and von Hippel. SiteSleuth also outperforms QPMEME, a method similar to SiteSleuth in that it involves a learning algorithm. The main advantage of SiteSleuth is a lower false positive rate.

  14. Machine-learning scoring functions to improve structure-based binding affinity prediction and virtual screening.

    Science.gov (United States)

    Ain, Qurrat Ul; Aleksandrova, Antoniya; Roessler, Florian D; Ballester, Pedro J

    2015-01-01

    Docking tools to predict whether and how a small molecule binds to a target can be applied if a structural model of such target is available. The reliability of docking depends, however, on the accuracy of the adopted scoring function (SF). Despite intense research over the years, improving the accuracy of SFs for structure-based binding affinity prediction or virtual screening has proven to be a challenging task for any class of method. New SFs based on modern machine-learning regression models, which do not impose a predetermined functional form and thus are able to exploit effectively much larger amounts of experimental data, have recently been introduced. These machine-learning SFs have been shown to outperform a wide range of classical SFs at both binding affinity prediction and virtual screening. The emerging picture from these studies is that the classical approach of using linear regression with a small number of expert-selected structural features can be strongly improved by a machine-learning approach based on nonlinear regression allied with comprehensive data-driven feature selection. Furthermore, the performance of classical SFs does not grow with larger training datasets and hence this performance gap is expected to widen as more training data becomes available in the future. Other topics covered in this review include predicting the reliability of a SF on a particular target class, generating synthetic data to improve predictive performance and modeling guidelines for SF development. WIREs Comput Mol Sci 2015, 5:405-424. doi: 10.1002/wcms.1225 For further resources related to this article, please visit the WIREs website.

  15. An overview of the prediction of protein DNA-binding sites.

    Science.gov (United States)

    Si, Jingna; Zhao, Rui; Wu, Rongling

    2015-03-06

    Interactions between proteins and DNA play an important role in many essential biological processes such as DNA replication, transcription, splicing, and repair. The identification of amino acid residues involved in DNA-binding sites is critical for understanding the mechanism of these biological activities. In the last decade, numerous computational approaches have been developed to predict protein DNA-binding sites based on protein sequence and/or structural information, which play an important role in complementing experimental strategies. At this time, approaches can be divided into three categories: sequence-based DNA-binding site prediction, structure-based DNA-binding site prediction, and homology modeling and threading. In this article, we review existing research on computational methods to predict protein DNA-binding sites, which includes data sets, various residue sequence/structural features, machine learning methods for comparison and selection, evaluation methods, performance comparison of different tools, and future directions in protein DNA-binding site prediction. In particular, we detail the meta-analysis of protein DNA-binding sites. We also propose specific implications that are likely to result in novel prediction methods, increased performance, or practical applications.

  16. Holographic Rovers: Augmented Reality and the Microsoft HoloLens

    Science.gov (United States)

    Toler, Laura

    2017-01-01

    Augmented Reality is an emerging field in technology, and encompasses Head Mounted Displays, smartphone apps, and even projected images. HMDs include the Meta 2, Magic Leap, Avegant Light Field, and the Microsoft HoloLens, which is evaluated specifically. The Microsoft HoloLens is designed to be used as an AR personal computer, and is being optimized with that goal in mind. Microsoft allied with the Unity3D game engine to create an SDK for interested application developers that can be used in the Unity environment.

  17. The calcium binding properties and structure prediction of the Hax-1 protein.

    Science.gov (United States)

    Balcerak, Anna; Rowinski, Sebastian; Szafron, Lukasz M; Grzybowska, Ewa A

    2017-01-01

    Hax-1 is a protein involved in regulation of different cellular processes, but its properties and exact mechanisms of action remain unknown. In this work, using purified, recombinant Hax-1 and by applying an in vitro autoradiography assay we have shown that this protein binds Ca 2+ . Additionally, we performed structure prediction analysis which shows that Hax-1 displays definitive structural features, such as two α-helices, short β-strands and four disordered segments.

  18. An Overview of the Prediction of Protein DNA-Binding Sites

    Directory of Open Access Journals (Sweden)

    Jingna Si

    2015-03-01

    Full Text Available Interactions between proteins and DNA play an important role in many essential biological processes such as DNA replication, transcription, splicing, and repair. The identification of amino acid residues involved in DNA-binding sites is critical for understanding the mechanism of these biological activities. In the last decade, numerous computational approaches have been developed to predict protein DNA-binding sites based on protein sequence and/or structural information, which play an important role in complementing experimental strategies. At this time, approaches can be divided into three categories: sequence-based DNA-binding site prediction, structure-based DNA-binding site prediction, and homology modeling and threading. In this article, we review existing research on computational methods to predict protein DNA-binding sites, which includes data sets, various residue sequence/structural features, machine learning methods for comparison and selection, evaluation methods, performance comparison of different tools, and future directions in protein DNA-binding site prediction. In particular, we detail the meta-analysis of protein DNA-binding sites. We also propose specific implications that are likely to result in novel prediction methods, increased performance, or practical applications.

  19. Interactive Molecular Graphics for Augmented Reality Using HoloLens.

    Science.gov (United States)

    Müller, Christoph; Krone, Michael; Huber, Markus; Biener, Verena; Herr, Dominik; Koch, Steffen; Reina, Guido; Weiskopf, Daniel; Ertl, Thomas

    2018-06-13

    Immersive technologies like stereo rendering, virtual reality, or augmented reality (AR) are often used in the field of molecular visualisation. Modern, comparably lightweight and affordable AR headsets like Microsoft's HoloLens open up new possibilities for immersive analytics in molecular visualisation. A crucial factor for a comprehensive analysis of molecular data in AR is the rendering speed. HoloLens, however, has limited hardware capabilities due to requirements like battery life, fanless cooling and weight. Consequently, insights from best practises for powerful desktop hardware may not be transferable. Therefore, we evaluate the capabilities of the HoloLens hardware for modern, GPU-enabled, high-quality rendering methods for the space-filling model commonly used in molecular visualisation. We also assess the scalability for large molecular data sets. Based on the results, we discuss ideas and possibilities for immersive molecular analytics. Besides more obvious benefits like the stereoscopic rendering offered by the device, this specifically includes natural user interfaces that use physical navigation instead of the traditional virtual one. Furthermore, we consider different scenarios for such an immersive system, ranging from educational use to collaborative scenarios.

  20. Low-Quality Structural and Interaction Data Improves Binding Affinity Prediction via Random Forest.

    Science.gov (United States)

    Li, Hongjian; Leung, Kwong-Sak; Wong, Man-Hon; Ballester, Pedro J

    2015-06-12

    Docking scoring functions can be used to predict the strength of protein-ligand binding. It is widely believed that training a scoring function with low-quality data is detrimental for its predictive performance. Nevertheless, there is a surprising lack of systematic validation experiments in support of this hypothesis. In this study, we investigated to which extent training a scoring function with data containing low-quality structural and binding data is detrimental for predictive performance. We actually found that low-quality data is not only non-detrimental, but beneficial for the predictive performance of machine-learning scoring functions, though the improvement is less important than that coming from high-quality data. Furthermore, we observed that classical scoring functions are not able to effectively exploit data beyond an early threshold, regardless of its quality. This demonstrates that exploiting a larger data volume is more important for the performance of machine-learning scoring functions than restricting to a smaller set of higher data quality.

  1. Modulation of microtubule assembly by the HIV-1 Tat protein is strongly dependent on zinc binding to Tat

    Directory of Open Access Journals (Sweden)

    Muller Sylviane

    2008-07-01

    Full Text Available Abstract Background During HIV-1 infection, the Tat protein plays a key role by transactivating the transcription of the HIV-1 proviral DNA. In addition, Tat induces apoptosis of non-infected T lymphocytes, leading to a massive loss of immune competence. This apoptosis is notably mediated by the interaction of Tat with microtubules, which are dynamic components essential for cell structure and division. Tat binds two Zn2+ ions through its conserved cysteine-rich region in vitro, but the role of zinc in the structure and properties of Tat is still controversial. Results To investigate the role of zinc, we first characterized Tat apo- and holo-forms by fluorescence correlation spectroscopy and time-resolved fluorescence spectroscopy. Both of the Tat forms are monomeric and poorly folded but differ by local conformational changes in the vicinity of the cysteine-rich region. The interaction of the two Tat forms with tubulin dimers and microtubules was monitored by analytical ultracentrifugation, turbidity measurements and electron microscopy. At 20°C, both of the Tat forms bind tubulin dimers, but only the holo-Tat was found to form discrete complexes. At 37°C, both forms promoted the nucleation and increased the elongation rates of tubulin assembly. However, only the holo-Tat increased the amount of microtubules, decreased the tubulin critical concentration, and stabilized the microtubules. In contrast, apo-Tat induced a large amount of tubulin aggregates. Conclusion Our data suggest that holo-Tat corresponds to the active form, responsible for the Tat-mediated apoptosis.

  2. FEI Titan G2 60-300 HOLO

    Directory of Open Access Journals (Sweden)

    Chris Boothroyd

    2016-02-01

    Full Text Available The FEI Titan G2 60-300 HOLO is a unique fourth generation transmission electron microscope, which has been specifically designed for the investigation of electromagnetic fields of materials using off-axis electron holography. It has a Lorentz lens to allow magnetic field free imaging plus two electron biprisms, which in combination enable more uniform holographic fringes to be used. The instrument also has an ultra-wide objective lens pole piece gap which is ideal for in situ experiments. For these purposes, the FEI Titan G2 60-300 HOLO is equipped with a Schottky type high-brightness electron gun (FEI X-FEG, an image Cs corrector (CEOS, a post-column energy filter system (Gatan Tridiem 865 ER as well as a 4 megapixel CCD system (Gatan UltraScan 1000 XP. Typical examples of use and technical specifications for the instrument are given below.

  3. Predicting protein-binding RNA nucleotides with consideration of binding partners.

    Science.gov (United States)

    Tuvshinjargal, Narankhuu; Lee, Wook; Park, Byungkyu; Han, Kyungsook

    2015-06-01

    most performance measures. To the best of our knowledge, this is the first sequence-based prediction of protein-binding nucleotides in RNA which considers the binding partner of RNA. The new model will provide valuable information for designing biochemical experiments to find putative protein-binding sites in RNA with unknown structure. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  4. Prediction of GPCR-Ligand Binding Using Machine Learning Algorithms

    Directory of Open Access Journals (Sweden)

    Sangmin Seo

    2018-01-01

    Full Text Available We propose a novel method that predicts binding of G-protein coupled receptors (GPCRs and ligands. The proposed method uses hub and cycle structures of ligands and amino acid motif sequences of GPCRs, rather than the 3D structure of a receptor or similarity of receptors or ligands. The experimental results show that these new features can be effective in predicting GPCR-ligand binding (average area under the curve [AUC] of 0.944, because they are thought to include hidden properties of good ligand-receptor binding. Using the proposed method, we were able to identify novel ligand-GPCR bindings, some of which are supported by several studies.

  5. Knowledge-based Fragment Binding Prediction

    Science.gov (United States)

    Tang, Grace W.; Altman, Russ B.

    2014-01-01

    Target-based drug discovery must assess many drug-like compounds for potential activity. Focusing on low-molecular-weight compounds (fragments) can dramatically reduce the chemical search space. However, approaches for determining protein-fragment interactions have limitations. Experimental assays are time-consuming, expensive, and not always applicable. At the same time, computational approaches using physics-based methods have limited accuracy. With increasing high-resolution structural data for protein-ligand complexes, there is now an opportunity for data-driven approaches to fragment binding prediction. We present FragFEATURE, a machine learning approach to predict small molecule fragments preferred by a target protein structure. We first create a knowledge base of protein structural environments annotated with the small molecule substructures they bind. These substructures have low-molecular weight and serve as a proxy for fragments. FragFEATURE then compares the structural environments within a target protein to those in the knowledge base to retrieve statistically preferred fragments. It merges information across diverse ligands with shared substructures to generate predictions. Our results demonstrate FragFEATURE's ability to rediscover fragments corresponding to the ligand bound with 74% precision and 82% recall on average. For many protein targets, it identifies high scoring fragments that are substructures of known inhibitors. FragFEATURE thus predicts fragments that can serve as inputs to fragment-based drug design or serve as refinement criteria for creating target-specific compound libraries for experimental or computational screening. PMID:24762971

  6. CaMELS: In silico prediction of calmodulin binding proteins and their binding sites.

    Science.gov (United States)

    Abbasi, Wajid Arshad; Asif, Amina; Andleeb, Saiqa; Minhas, Fayyaz Ul Amir Afsar

    2017-09-01

    Due to Ca 2+ -dependent binding and the sequence diversity of Calmodulin (CaM) binding proteins, identifying CaM interactions and binding sites in the wet-lab is tedious and costly. Therefore, computational methods for this purpose are crucial to the design of such wet-lab experiments. We present an algorithm suite called CaMELS (CalModulin intEraction Learning System) for predicting proteins that interact with CaM as well as their binding sites using sequence information alone. CaMELS offers state of the art accuracy for both CaM interaction and binding site prediction and can aid biologists in studying CaM binding proteins. For CaM interaction prediction, CaMELS uses protein sequence features coupled with a large-margin classifier. CaMELS models the binding site prediction problem using multiple instance machine learning with a custom optimization algorithm which allows more effective learning over imprecisely annotated CaM-binding sites during training. CaMELS has been extensively benchmarked using a variety of data sets, mutagenic studies, proteome-wide Gene Ontology enrichment analyses and protein structures. Our experiments indicate that CaMELS outperforms simple motif-based search and other existing methods for interaction and binding site prediction. We have also found that the whole sequence of a protein, rather than just its binding site, is important for predicting its interaction with CaM. Using the machine learning model in CaMELS, we have identified important features of protein sequences for CaM interaction prediction as well as characteristic amino acid sub-sequences and their relative position for identifying CaM binding sites. Python code for training and evaluating CaMELS together with a webserver implementation is available at the URL: http://faculty.pieas.edu.pk/fayyaz/software.html#camels. © 2017 Wiley Periodicals, Inc.

  7. Holo-transcobalamin is an indicator of vitamin B-12 absorption in healthy adults with adequate vitamin B-12 status

    DEFF Research Database (Denmark)

    von Castel-Roberts, Kristina M; Mørkbak, Anne Louise; Nexo, Ebba

    2007-01-01

    BACKGROUND: It has been hypothesized that the response of holo-transcobalamin (holo-TC) to oral vitamin B-12 may be used to assess absorption. To develop a reliable clinical absorption test that uses holo-TC, it is necessary to determine the optimal timeline for vitamin B-12 administration...... and postdose assessment. OBJECTIVE: The objective of this study was to assess the magnitude and patterns of change in the postabsorption response of holo-TC to oral vitamin B-12. DESIGN: Adult (18-49 y) male and female participants (n = 21) with normal vitamin B-12 status were given three 9-mug doses...... of vitamin B-12 at 6-h intervals beginning early morning (baseline) on day 1. Blood was drawn at 17 timed intervals over the course of 3 d for the analysis of holo-TC and other indicators of vitamin B-12 status. RESULTS: Mean holo-TC increased significantly (P

  8. Probing the 3-D Structure, Dynamics, and Stability of Bacterial Collagenase Collagen Binding Domain (apo- versus holo-) by Limited Proteolysis MALDI-TOF MS

    Science.gov (United States)

    Sides, Cynthia R.; Liyanage, Rohana; Lay, Jackson O.; Philominathan, Sagaya Theresa Leena; Matsushita, Osamu; Sakon, Joshua

    2012-03-01

    Pairing limited proteolysis and matrix-assisted laser desorption/ionization-time of flight mass spectrometry (MALDI-TOF MS) to probe clostridial collagenase collagen binding domain (CBD) reveals the solution dynamics and stability of the protein, as these factors are crucial to CBD effectiveness as a drug-delivery vehicle. MS analysis of proteolytic digests indicates initial cleavage sites, thereby specifying the less stable and highly accessible regions of CBD. Modulation of protein structure and stability upon metal binding is shown through MS analysis of calcium-bound and cobalt-bound CBD proteolytic digests. Previously determined X-ray crystal structures illustrate that calcium binding induces secondary structure transformation in the highly mobile N-terminal arm and increases protein stability. MS-based detection of exposed residues confirms protein flexibility, accentuates N-terminal dynamics, and demonstrates increased global protein stability exported by calcium binding. Additionally, apo- and calcium-bound CBD proteolysis sites correlate well with crystallographic B-factors, accessibility, and enzyme specificity. MS-observed cleavage sites with no clear correlations are explained either by crystal contacts of the X-ray crystal structures or by observed differences between Molecules A and B in the X-ray crystal structures. The study newly reveals the absence of the βA strand and thus the very dynamic N-terminal linker, as corroborated by the solution X-ray scattering results. Cobalt binding has a regional effect on the solution phase stability of CBD, as limited proteolysis data implies the capture of an intermediate-CBD solution structure when cobalt is bound.

  9. The crystal structures of apo and cAMP-bound GlxR from Corynebacterium glutamicum reveal structural and dynamic changes upon cAMP binding in CRP/FNR family transcription factors.

    Directory of Open Access Journals (Sweden)

    Philip D Townsend

    Full Text Available The cyclic AMP-dependent transcriptional regulator GlxR from Corynebacterium glutamicum is a member of the super-family of CRP/FNR (cyclic AMP receptor protein/fumarate and nitrate reduction regulator transcriptional regulators that play central roles in bacterial metabolic regulatory networks. In C. glutamicum, which is widely used for the industrial production of amino acids and serves as a non-pathogenic model organism for members of the Corynebacteriales including Mycobacterium tuberculosis, the GlxR homodimer controls the transcription of a large number of genes involved in carbon metabolism. GlxR therefore represents a key target for understanding the regulation and coordination of C. glutamicum metabolism. Here we investigate cylic AMP and DNA binding of GlxR from C. glutamicum and describe the crystal structures of apo GlxR determined at a resolution of 2.5 Å, and two crystal forms of holo GlxR at resolutions of 2.38 and 1.82 Å, respectively. The detailed structural analysis and comparison of GlxR with CRP reveals that the protein undergoes a distinctive conformational change upon cyclic AMP binding leading to a dimer structure more compatible to DNA-binding. As the two binding sites in the GlxR homodimer are structurally identical dynamic changes upon binding of the first ligand are responsible for the allosteric behavior. The results presented here show how dynamic and structural changes in GlxR lead to optimization of orientation and distance of its two DNA-binding helices for optimal DNA recognition.

  10. HoloR: Interactive Mixed-Reality Rooms

    OpenAIRE

    Schwede, Carsten; Hermann, Thomas

    2015-01-01

    Existing virtual reality technologies only cover certain areas of the mixed-reality spectrum: Augmented reality goggles are unable to provide immersion while head-mounted displays make it difficult to interact with the real world. In this paper we introduce HoloR - short for Holographic Room: A stereoscopic, multi-person, multi-viewer, spatial projected augmented reality system, which enables applications to fade between different parts of the mixed-reality spectrum. By using web-technologies...

  11. A sequence-based dynamic ensemble learning system for protein ligand-binding site prediction

    KAUST Repository

    Chen, Peng

    2015-12-03

    Background: Proteins have the fundamental ability to selectively bind to other molecules and perform specific functions through such interactions, such as protein-ligand binding. Accurate prediction of protein residues that physically bind to ligands is important for drug design and protein docking studies. Most of the successful protein-ligand binding predictions were based on known structures. However, structural information is not largely available in practice due to the huge gap between the number of known protein sequences and that of experimentally solved structures

  12. A sequence-based dynamic ensemble learning system for protein ligand-binding site prediction

    KAUST Repository

    Chen, Peng; Hu, ShanShan; Zhang, Jun; Gao, Xin; Li, Jinyan; Xia, Junfeng; Wang, Bing

    2015-01-01

    Background: Proteins have the fundamental ability to selectively bind to other molecules and perform specific functions through such interactions, such as protein-ligand binding. Accurate prediction of protein residues that physically bind to ligands is important for drug design and protein docking studies. Most of the successful protein-ligand binding predictions were based on known structures. However, structural information is not largely available in practice due to the huge gap between the number of known protein sequences and that of experimentally solved structures

  13. Statistical Profiling of One Promiscuous Protein Binding Site: Illustrated by Urokinase Catalytic Domain.

    Science.gov (United States)

    Cerisier, Natacha; Regad, Leslie; Triki, Dhoha; Petitjean, Michel; Flatters, Delphine; Camproux, Anne-Claude

    2017-10-01

    While recent literature focuses on drug promiscuity, the characterization of promiscuous binding sites (ability to bind several ligands) remains to be explored. Here, we present a proteochemometric modeling approach to analyze diverse ligands and corresponding multiple binding sub-pockets associated with one promiscuous binding site to characterize protein-ligand recognition. We analyze both geometrical and physicochemical profile correspondences. This approach was applied to examine the well-studied druggable urokinase catalytic domain inhibitor binding site, which results in a large number of complex structures bound to various ligands. This approach emphasizes the importance of jointly characterizing pocket and ligand spaces to explore the impact of ligand diversity on sub-pocket properties and to establish their main profile correspondences. This work supports an interest in mining available 3D holo structures associated with a promiscuous binding site to explore its main protein-ligand recognition tendency. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  14. Nucleos: a web server for the identification of nucleotide-binding sites in protein structures.

    Science.gov (United States)

    Parca, Luca; Ferré, Fabrizio; Ausiello, Gabriele; Helmer-Citterich, Manuela

    2013-07-01

    Nucleos is a web server for the identification of nucleotide-binding sites in protein structures. Nucleos compares the structure of a query protein against a set of known template 3D binding sites representing nucleotide modules, namely the nucleobase, carbohydrate and phosphate. Structural features, clustering and conservation are used to filter and score the predictions. The predicted nucleotide modules are then joined to build whole nucleotide-binding sites, which are ranked by their score. The server takes as input either the PDB code of the query protein structure or a user-submitted structure in PDB format. The output of Nucleos is composed of ranked lists of predicted nucleotide-binding sites divided by nucleotide type (e.g. ATP-like). For each ranked prediction, Nucleos provides detailed information about the score, the template structure and the structural match for each nucleotide module composing the nucleotide-binding site. The predictions on the query structure and the template-binding sites can be viewed directly on the web through a graphical applet. In 98% of the cases, the modules composing correct predictions belong to proteins with no homology relationship between each other, meaning that the identification of brand-new nucleotide-binding sites is possible using information from non-homologous proteins. Nucleos is available at http://nucleos.bio.uniroma2.it/nucleos/.

  15. MicroRNA-target binding structures mimic microRNA duplex structures in humans.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available Traditionally, researchers match a microRNA guide strand to mRNA sequences using sequence comparisons to predict its potential target genes. However, many of the predictions can be false positives due to limitations in sequence comparison alone. In this work, we consider the association of two related RNA structures that share a common guide strand: the microRNA duplex and the microRNA-target binding structure. We have analyzed thousands of such structure pairs and found many of them share high structural similarity. Therefore, we conclude that when predicting microRNA target genes, considering just the microRNA guide strand matches to gene sequences may not be sufficient--the microRNA duplex structure formed by the guide strand and its companion passenger strand must also be considered. We have developed software to translate RNA binding structure into encoded representations, and we have also created novel automatic comparison methods utilizing such encoded representations to determine RNA structure similarity. Our software and methods can be utilized in the other RNA secondary structure comparisons as well.

  16. Binding Ligand Prediction for Proteins Using Partial Matching of Local Surface Patches

    Directory of Open Access Journals (Sweden)

    Lee Sael

    2010-12-01

    Full Text Available Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group.

  17. Binding ligand prediction for proteins using partial matching of local surface patches.

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2010-01-01

    Functional elucidation of uncharacterized protein structures is an important task in bioinformatics. We report our new approach for structure-based function prediction which captures local surface features of ligand binding pockets. Function of proteins, specifically, binding ligands of proteins, can be predicted by finding similar local surface regions of known proteins. To enable partial comparison of binding sites in proteins, a weighted bipartite matching algorithm is used to match pairs of surface patches. The surface patches are encoded with the 3D Zernike descriptors. Unlike the existing methods which compare global characteristics of the protein fold or the global pocket shape, the local surface patch method can find functional similarity between non-homologous proteins and binding pockets for flexible ligand molecules. The proposed method improves prediction results over global pocket shape-based method which was previously developed by our group.

  18. A high pressure study of calmodulin-ligand interactions using small-angle X-ray and elastic incoherent neutron scattering.

    Science.gov (United States)

    Cinar, Süleyman; Al-Ayoubi, Samy; Sternemann, Christian; Peters, Judith; Winter, Roland; Czeslik, Claus

    2018-01-31

    Calmodulin (CaM) is a Ca 2+ sensor and mediates Ca 2+ signaling through binding of numerous target ligands. The binding of ligands by Ca 2+ -saturated CaM (holo-CaM) is governed by attractive hydrophobic and electrostatic interactions that are weakened under high pressure in aqueous solutions. Moreover, the potential formation of void volumes upon ligand binding creates a further source of pressure sensitivity. Hence, high pressure is a suitable thermodynamic variable to probe protein-ligand interactions. In this study, we compare the binding of two different ligands to holo-CaM as a function of pressure by using X-ray and neutron scattering techniques. The two ligands are the farnesylated hypervariable region (HVR) of the K-Ras4B protein, which is a natural binding partner of holo-CaM, and the antagonist trifluoperazine (TFP), which is known to inhibit holo-CaM activity. From small-angle X-ray scattering experiments performed up to 3000 bar, we observe a pressure-induced partial unfolding of the free holo-CaM in the absence of ligands, where the two lobes of the dumbbell-shaped protein are slightly swelled. In contrast, upon binding TFP, holo-CaM forms a closed globular conformation, which is pressure stable at least up to 3000 bar. The HVR of K-Ras4B shows a different binding behavior, and the data suggest the dissociation of the holo-CaM/HVR complex under high pressure, probably due to a less dense protein contact of the HVR as compared to TFP. The elastic incoherent neutron scattering experiments corroborate these findings. Below 2000 bar, pressure induces enhanced atomic fluctuations in both holo-CaM/ligand complexes, but those of the holo-CaM/HVR complex seem to be larger. Thus, the inhibition of holo-CaM by TFP is supported by a low-volume ligand binding, albeit this is not associated with a rigidification of the complex structure on the sub-ns Å-scale.

  19. Characterization of a monoclonal antibody with specificity for holo-transcobalamin

    Directory of Open Access Journals (Sweden)

    Fedosov Sergey N

    2006-01-01

    Full Text Available Abstract Background Holotranscobalamin, cobalamin-saturated transcobalamin, is the minor fraction of circulating cobalamin (vitamin B12, which is available for cellular uptake and hence is physiologically relevant. Currently, no method allows simple, direct quantification of holotranscobalamin. We now report on the identification and characterization of a monoclonal antibody with a unique specificity for holotranscobalamin. Methods The specificity and affinity of the monoclonal antibodies were determined using surface plasmon resonance and recombinant transcobalamin as well as by immobilizing the antibodies on magnetic microspheres and using native transcobalamin in serum. The epitope of the holotranscobalamin specific antibody was identified using phage display and comparison to a de novo generated three-dimensional model of transcobalamin using the program Rosetta. A direct assay for holotrnscobalamin in the ELISA format was developed using the specific antibody and compared to the commercial assay HoloTC RIA. Results An antibody exhibiting >100-fold specificity for holotranscobalamin over apotranscobalamin was identified. The affinity but not the specificity varied inversely with ionic strength and pH, indicating importance of electrostatic interactions. The epitope was discontinuous and epitope mapping of the antibody by phage display identified two similar motifs with no direct sequence similarity to transcobalamin. A comparison of the motifs with a de novo generated three-dimensional model of transcobalamin identified two structures in the N-terminal part of transcobalamin that resembled the motif. Using this antibody an ELISA based prototype assay was developed and compared to the only available commercial assay for measuring holotranscobalamin, HoloTC RIA. Conclusion The identified antibody possesses a unique specificity for holotranscobalamin and can be used to develop a direct assay for the quantification of holotranscobalamin.

  20. RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.

    Directory of Open Access Journals (Sweden)

    Hilal Kazan

    2010-07-01

    Full Text Available Metazoan genomes encode hundreds of RNA-binding proteins (RBPs. These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequence associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.

  1. Implicit ligand theory for relative binding free energies

    Science.gov (United States)

    Nguyen, Trung Hai; Minh, David D. L.

    2018-03-01

    Implicit ligand theory enables noncovalent binding free energies to be calculated based on an exponential average of the binding potential of mean force (BPMF)—the binding free energy between a flexible ligand and rigid receptor—over a precomputed ensemble of receptor configurations. In the original formalism, receptor configurations were drawn from or reweighted to the apo ensemble. Here we show that BPMFs averaged over a holo ensemble yield binding free energies relative to the reference ligand that specifies the ensemble. When using receptor snapshots from an alchemical simulation with a single ligand, the new statistical estimator outperforms the original.

  2. Predicting Ligand Binding Sites on Protein Surfaces by 3-Dimensional Probability Density Distributions of Interacting Atoms

    Science.gov (United States)

    Jian, Jhih-Wei; Elumalai, Pavadai; Pitti, Thejkiran; Wu, Chih Yuan; Tsai, Keng-Chang; Chang, Jeng-Yih; Peng, Hung-Pin; Yang, An-Suei

    2016-01-01

    Predicting ligand binding sites (LBSs) on protein structures, which are obtained either from experimental or computational methods, is a useful first step in functional annotation or structure-based drug design for the protein structures. In this work, the structure-based machine learning algorithm ISMBLab-LIG was developed to predict LBSs on protein surfaces with input attributes derived from the three-dimensional probability density maps of interacting atoms, which were reconstructed on the query protein surfaces and were relatively insensitive to local conformational variations of the tentative ligand binding sites. The prediction accuracy of the ISMBLab-LIG predictors is comparable to that of the best LBS predictors benchmarked on several well-established testing datasets. More importantly, the ISMBLab-LIG algorithm has substantial tolerance to the prediction uncertainties of computationally derived protein structure models. As such, the method is particularly useful for predicting LBSs not only on experimental protein structures without known LBS templates in the database but also on computationally predicted model protein structures with structural uncertainties in the tentative ligand binding sites. PMID:27513851

  3. Structure prediction and binding sites analysis of curcin protein of Jatropha curcas using computational approaches.

    Science.gov (United States)

    Srivastava, Mugdha; Gupta, Shishir K; Abhilash, P C; Singh, Nandita

    2012-07-01

    Ribosome inactivating proteins (RIPs) are defense proteins in a number of higher-plant species that are directly targeted toward herbivores. Jatropha curcas is one of the biodiesel plants having RIPs. The Jatropha seed meal, after extraction of oil, is rich in curcin, a highly toxic RIP similar to ricin, which makes it unsuitable for animal feed. Although the toxicity of curcin is well documented in the literature, the detailed toxic properties and the 3D structure of curcin has not been determined by X-ray crystallography, NMR spectroscopy or any in silico techniques to date. In this pursuit, the structure of curcin was modeled by a composite approach of 3D structure prediction using threading and ab initio modeling. Assessment of model quality was assessed by methods which include Ramachandran plot analysis and Qmean score estimation. Further, we applied the protein-ligand docking approach to identify the r-RNA binding residue of curcin. The present work provides the first structural insight into the binding mode of r-RNA adenine to the curcin protein and forms the basis for designing future inhibitors of curcin. Cloning of a future peptide inhibitor within J. curcas can produce non-toxic varieties of J. curcas, which would make the seed-cake suitable as animal feed without curcin detoxification.

  4. Application of quantitative structure-activity relationship to the determination of binding constant based on fluorescence quenching

    Energy Technology Data Exchange (ETDEWEB)

    Wen Yingying [Department of Applied Chemistry, Yantai University, Yantai 264005 (China); Liu Huitao, E-mail: liuht-ytu@163.co [Department of Applied Chemistry, Yantai University, Yantai 264005 (China); Luan Feng; Gao Yuan [Department of Applied Chemistry, Yantai University, Yantai 264005 (China)

    2011-01-15

    Quantitative structure-activity relationship (QSAR) model was used to predict and explain binding constant (log K) determined by fluorescence quenching. This method allowed us to predict binding constants of a variety of compounds with human serum albumin (HSA) based on their structures alone. Stepwise multiple linear regression (MLR) and nonlinear radial basis function neural network (RBFNN) were performed to build the models. The statistical parameters provided by the MLR model (R{sup 2}=0.8521, RMS=0.2678) indicated satisfactory stability and predictive ability while the RBFNN predictive ability is somewhat superior (R{sup 2}=0.9245, RMS=0.1736). The proposed models were used to predict the binding constants of two bioactive components in traditional Chinese medicines (isoimperatorin and chrysophanol) whose experimental results were obtained in our laboratory and the predicted results were in good agreement with the experimental results. This QSAR approach can contribute to a better understanding of structural factors of the compounds responsible for drug-protein interactions, and can be useful in predicting the binding constants of other compounds. - Research Highlights: QSAR models for binding constants of some compounds to HSA were developed. The models provide a simple and straightforward way to predict binding constant. QSAR can give some insight into structural features related to binding behavior.

  5. Calcium-induced conformational changes of Thrombospondin-1 signature domain: implications for vascular disease.

    Science.gov (United States)

    Gupta, Akanksha; Agarwal, Rahul; Singh, Ashutosh; Bhatnagar, Sonika

    2017-06-01

    Thrombospondin1 (TSP1) participates in numerous signaling pathways critical for vascular physiology and disease. The conserved signature domain of thrombospondin 1 (TSP1-Sig1) comprises three epidermal growth factor (EGF), 13 calcium-binding type 3 thrombospondin (T3) repeats, and one lectin-like module arranged in a stalk-wire-globe topology. TSP1 is known to be present in both calcium-replete (Holo-) and calcium-depleted (Apo-) state, each with distinct downstream signaling effects. To prepare a homology model of TSP1-Sig1 and investigate the effect of calcium on its dynamic structure and interactions. A homology model of Holo-TSP1-Sig1 was prepared with TSP2 as template in Swissmodel workspace. The Apo-form of the model was obtained by omitting the bound calcium ions from the homology model. Molecular dynamics (MD) simulation studies (100 ns) were performed on the Holo- and Apo- forms of TSP1 using Gromacs4.6.5. After simulation, Holo-TSP1-Sig1 showed significant reorientation at the interface of the EGF1-2 and EGF2-3 modules. The T3 wire is predicted to show the maximum mobility and deviation from the initial model. In Apo-TSP1-Sig1 model, the T3 repeats unfolded and formed coils with predicted increase in flexibility. Apo-TSP1-Sig1model also predicted the exposure of the binding sites for neutrophil elastase, integrin and fibroblast growth factor 2. We present a structural model and hypothesis for the role of TSP1-Sig1 interactions in the development of vascular disorders. The simulated model of the fully calcium-loaded and calcium-depleted TSP1-Sig1 may enable the development of its interactions as a novel therapeutic target for the treatment of vascular diseases.

  6. Predicting accurate absolute binding energies in aqueous solution

    DEFF Research Database (Denmark)

    Jensen, Jan Halborg

    2015-01-01

    Recent predictions of absolute binding free energies of host-guest complexes in aqueous solution using electronic structure theory have been encouraging for some systems, while other systems remain problematic. In this paper I summarize some of the many factors that could easily contribute 1-3 kcal......-represented by continuum models. While I focus on binding free energies in aqueous solution the approach also applies (with minor adjustments) to any free energy difference such as conformational or reaction free energy differences or activation free energies in any solvent....

  7. Structural and functional investigation of flavin binding center of the NqrC subunit of sodium-translocating NADH:quinone oxidoreductase from Vibrio harveyi.

    Directory of Open Access Journals (Sweden)

    Valentin Borshchevskiy

    Full Text Available Na+-translocating NADH:quinone oxidoreductase (NQR is a redox-driven sodium pump operating in the respiratory chain of various bacteria, including pathogenic species. The enzyme has a unique set of redox active prosthetic groups, which includes two covalently bound flavin mononucleotide (FMN residues attached to threonine residues in subunits NqrB and NqrC. The reason of FMN covalent bonding in the subunits has not been established yet. In the current work, binding of free FMN to the apo-form of NqrC from Vibrio harveyi was studied showing very low affinity of NqrC to FMN in the absence of its covalent bonding. To study structural aspects of flavin binding in NqrC, its holo-form was crystallized and its 3D structure was solved at 1.56 Å resolution. It was found that the isoalloxazine moiety of the FMN residue is buried in a hydrophobic cavity and that its pyrimidine ring is squeezed between hydrophobic amino acid residues while its benzene ring is extended from the protein surroundings. This structure of the flavin-binding pocket appears to provide flexibility of the benzene ring, which can help the FMN residue to take the bended conformation and thus to stabilize the one-electron reduced form of the prosthetic group. These properties may also lead to relatively weak noncovalent binding of the flavin. This fact along with periplasmic location of the FMN-binding domains in the vast majority of NqrC-like proteins may explain the necessity of the covalent bonding of this prosthetic group to prevent its loss to the external medium.

  8. Use of Microsoft HoloLens to survey and visualize buried networks

    CERN Multimedia

    CERN. Geneva

    2017-01-01

    Survey and positioning of buried infrastructure networks are crucial issues for their maintenance and a starting point for every new Civil Engineering project. 3DCity is a research & development project which consists in a development of software providing a method for quick underground pipe networks surveying and holographic visualization, by using Microsoft HoloLens devices.

  9. Using physics-based pose predictions and free energy perturbation calculations to predict binding poses and relative binding affinities for FXR ligands in the D3R Grand Challenge 2

    Science.gov (United States)

    Athanasiou, Christina; Vasilakaki, Sofia; Dellis, Dimitris; Cournia, Zoe

    2018-01-01

    Computer-aided drug design has become an integral part of drug discovery and development in the pharmaceutical and biotechnology industry, and is nowadays extensively used in the lead identification and lead optimization phases. The drug design data resource (D3R) organizes challenges against blinded experimental data to prospectively test computational methodologies as an opportunity for improved methods and algorithms to emerge. We participated in Grand Challenge 2 to predict the crystallographic poses of 36 Farnesoid X Receptor (FXR)-bound ligands and the relative binding affinities for two designated subsets of 18 and 15 FXR-bound ligands. Here, we present our methodology for pose and affinity predictions and its evaluation after the release of the experimental data. For predicting the crystallographic poses, we used docking and physics-based pose prediction methods guided by the binding poses of native ligands. For FXR ligands with known chemotypes in the PDB, we accurately predicted their binding modes, while for those with unknown chemotypes the predictions were more challenging. Our group ranked #1st (based on the median RMSD) out of 46 groups, which submitted complete entries for the binding pose prediction challenge. For the relative binding affinity prediction challenge, we performed free energy perturbation (FEP) calculations coupled with molecular dynamics (MD) simulations. FEP/MD calculations displayed a high success rate in identifying compounds with better or worse binding affinity than the reference (parent) compound. Our studies suggest that when ligands with chemical precedent are available in the literature, binding pose predictions using docking and physics-based methods are reliable; however, predictions are challenging for ligands with completely unknown chemotypes. We also show that FEP/MD calculations hold predictive value and can nowadays be used in a high throughput mode in a lead optimization project provided that crystal structures of

  10. Conformational Dynamics and Binding Free Energies of Inhibitors of BACE-1: From the Perspective of Protonation Equilibria.

    Directory of Open Access Journals (Sweden)

    M Olivia Kim

    2015-10-01

    Full Text Available BACE-1 is the β-secretase responsible for the initial amyloidogenesis in Alzheimer's disease, catalyzing hydrolytic cleavage of substrate in a pH-sensitive manner. The catalytic mechanism of BACE-1 requires water-mediated proton transfer from aspartyl dyad to the substrate, as well as structural flexibility in the flap region. Thus, the coupling of protonation and conformational equilibria is essential to a full in silico characterization of BACE-1. In this work, we perform constant pH replica exchange molecular dynamics simulations on both apo BACE-1 and five BACE-1-inhibitor complexes to examine the effect of pH on dynamics and inhibitor binding properties of BACE-1. In our simulations, we find that solution pH controls the conformational flexibility of apo BACE-1, whereas bound inhibitors largely limit the motions of the holo enzyme at all levels of pH. The microscopic pKa values of titratable residues in BACE-1 including its aspartyl dyad are computed and compared between apo and inhibitor-bound states. Changes in protonation between the apo and holo forms suggest a thermodynamic linkage between binding of inhibitors and protons localized at the dyad. Utilizing our recently developed computational protocol applying the binding polynomial formalism to the constant pH molecular dynamics (CpHMD framework, we are able to obtain the pH-dependent binding free energy profiles for various BACE-1-inhibitor complexes. Our results highlight the importance of correctly addressing the binding-induced protonation changes in protein-ligand systems where binding accompanies a net proton transfer. This work comprises the first application of our CpHMD-based free energy computational method to protein-ligand complexes and illustrates the value of CpHMD as an all-purpose tool for obtaining pH-dependent dynamics and binding free energies of biological systems.

  11. Post processing of protein-compound docking for fragment-based drug discovery (FBDD): in-silico structure-based drug screening and ligand-binding pose prediction.

    Science.gov (United States)

    Fukunishi, Yoshifumi

    2010-01-01

    For fragment-based drug development, both hit (active) compound prediction and docking-pose (protein-ligand complex structure) prediction of the hit compound are important, since chemical modification (fragment linking, fragment evolution) subsequent to the hit discovery must be performed based on the protein-ligand complex structure. However, the naïve protein-compound docking calculation shows poor accuracy in terms of docking-pose prediction. Thus, post-processing of the protein-compound docking is necessary. Recently, several methods for the post-processing of protein-compound docking have been proposed. In FBDD, the compounds are smaller than those for conventional drug screening. This makes it difficult to perform the protein-compound docking calculation. A method to avoid this problem has been reported. Protein-ligand binding free energy estimation is useful to reduce the procedures involved in the chemical modification of the hit fragment. Several prediction methods have been proposed for high-accuracy estimation of protein-ligand binding free energy. This paper summarizes the various computational methods proposed for docking-pose prediction and their usefulness in FBDD.

  12. The Holo-Transcriptome of the Zoantharian Protopalythoa variabilis (Cnidaria: Anthozoa: A Plentiful Source of Enzymes for Potential Application in Green Chemistry, Industrial and Pharmaceutical Biotechnology

    Directory of Open Access Journals (Sweden)

    Jean-Étienne R. L. Morlighem

    2018-06-01

    Full Text Available Marine invertebrates, such as sponges, tunicates and cnidarians (zoantharians and scleractinian corals, form functional assemblages, known as holobionts, with numerous microbes. This type of species-specific symbiotic association can be a repository of myriad valuable low molecular weight organic compounds, bioactive peptides and enzymes. The zoantharian Protopalythoa variabilis (Cnidaria: Anthozoa is one such example of a marine holobiont that inhabits the coastal reefs of the tropical Atlantic coast and is an interesting source of secondary metabolites and biologically active polypeptides. In the present study, we analyzed the entire holo-transcriptome of P. variabilis, looking for enzyme precursors expressed in the zoantharian-microbiota assemblage that are potentially useful as industrial biocatalysts and biopharmaceuticals. In addition to hundreds of predicted enzymes that fit into the classes of hydrolases, oxidoreductases and transferases that were found, novel enzyme precursors with multiple activities in single structures and enzymes with incomplete Enzyme Commission numbers were revealed. Our results indicated the predictive expression of thirteen multifunctional enzymes and 694 enzyme sequences with partially characterized activities, distributed in 23 sub-subclasses. These predicted enzyme structures and activities can prospectively be harnessed for applications in diverse areas of industrial and pharmaceutical biotechnology.

  13. Structural insights into Cydia pomonella pheromone binding protein 2 mediated prediction of potentially active semiochemicals

    Science.gov (United States)

    Tian, Zhen; Liu, Jiyuan; Zhang, Yalin

    2016-03-01

    Given the advantages of behavioral disruption application in pest control and the damage of Cydia pomonella, due progresses have not been made in searching active semiochemicals for codling moth. In this research, 31 candidate semiochemicals were ranked for their binding potential to Cydia pomonella pheromone binding protein 2 (CpomPBP2) by simulated docking, and this sorted result was confirmed by competitive binding assay. This high predicting accuracy of virtual screening led to the construction of a rapid and viable method for semiochemicals searching. By reference to binding mode analyses, hydrogen bond and hydrophobic interaction were suggested to be two key factors in determining ligand affinity, so is the length of molecule chain. So it is concluded that semiochemicals of appropriate chain length with hydroxyl group or carbonyl group at one head tended to be favored by CpomPBP2. Residues involved in binding with each ligand were pointed out as well, which were verified by computational alanine scanning mutagenesis. Progress made in the present study helps establish an efficient method for predicting potentially active compounds and prepares for the application of high-throughput virtual screening in searching semiochemicals by taking insights into binding mode analyses.

  14. Improving binding mode and binding affinity predictions of docking by ligand-based search of protein conformations: evaluation in D3R grand challenge 2015

    Science.gov (United States)

    Xu, Xianjin; Yan, Chengfei; Zou, Xiaoqin

    2017-08-01

    The growing number of protein-ligand complex structures, particularly the structures of proteins co-bound with different ligands, in the Protein Data Bank helps us tackle two major challenges in molecular docking studies: the protein flexibility and the scoring function. Here, we introduced a systematic strategy by using the information embedded in the known protein-ligand complex structures to improve both binding mode and binding affinity predictions. Specifically, a ligand similarity calculation method was employed to search a receptor structure with a bound ligand sharing high similarity with the query ligand for the docking use. The strategy was applied to the two datasets (HSP90 and MAP4K4) in recent D3R Grand Challenge 2015. In addition, for the HSP90 dataset, a system-specific scoring function (ITScore2_hsp90) was generated by recalibrating our statistical potential-based scoring function (ITScore2) using the known protein-ligand complex structures and the statistical mechanics-based iterative method. For the HSP90 dataset, better performances were achieved for both binding mode and binding affinity predictions comparing with the original ITScore2 and with ensemble docking. For the MAP4K4 dataset, although there were only eight known protein-ligand complex structures, our docking strategy achieved a comparable performance with ensemble docking. Our method for receptor conformational selection and iterative method for the development of system-specific statistical potential-based scoring functions can be easily applied to other protein targets that have a number of protein-ligand complex structures available to improve predictions on binding.

  15. Effect of free cysteine on the denaturation and aggregation of holo α-lactalbumin

    DEFF Research Database (Denmark)

    Nielsen, Line R.; Lund, Marianne N.; Davies, Michael J.

    2018-01-01

    α-Lactalbumin (α-LA) is a key commercial whey protein for nutritional purposes. The holo protein (calcium saturated) is considered the most heat stable whey protein, capable of refolding from unfolded states under many conditions. This is due to the absence of free thiols (cysteine residues......) that are typically involved in thermal aggregation and thiol–disulphide exchange reactions of other whey proteins. Heating (0–120 min at 90 °C, pH 7.0) holo α-LA generates free thiols through thermal cleavage of disulphide bonds, resulting in aggregates comprising unfolded α-LA species. The addition of free cysteine...... promotes the formation of soluble aggregates, effectively decreasing the holding time required to reach a particular aggregate size in a dose-dependent manner (0.35–1.4 mM cysteine). Excess cysteine (≥14 mM) causes a destabilisation of α-LA, shown by decreased denaturation temperature and gel formation...

  16. Characterization of the mycobacterial acyl-CoA carboxylase holo complexes reveals their functional expansion into amino acid catabolism.

    Directory of Open Access Journals (Sweden)

    Matthias T Ehebauer

    2015-02-01

    Full Text Available Biotin-mediated carboxylation of short-chain fatty acid coenzyme A esters is a key step in lipid biosynthesis that is carried out by multienzyme complexes to extend fatty acids by one methylene group. Pathogenic mycobacteria have an unusually high redundancy of carboxyltransferase genes and biotin carboxylase genes, creating multiple combinations of protein/protein complexes of unknown overall composition and functional readout. By combining pull-down assays with mass spectrometry, we identified nine binary protein/protein interactions and four validated holo acyl-coenzyme A carboxylase complexes. We investigated one of these--the AccD1-AccA1 complex from Mycobacterium tuberculosis with hitherto unknown physiological function. Using genetics, metabolomics and biochemistry we found that this complex is involved in branched amino-acid catabolism with methylcrotonyl coenzyme A as the substrate. We then determined its overall architecture by electron microscopy and found it to be a four-layered dodecameric arrangement that matches the overall dimensions of a distantly related methylcrotonyl coenzyme A holo complex. Our data argue in favor of distinct structural requirements for biotin-mediated γ-carboxylation of α-β unsaturated acid esters and will advance the categorization of acyl-coenzyme A carboxylase complexes. Knowledge about the underlying structural/functional relationships will be crucial to make the target category amenable for future biomedical applications.

  17. First trimester serum levels of the soluble transcobalamin receptor, holo-transcobalamin, and total transcobalamin in relation to preeclampsia risk

    DEFF Research Database (Denmark)

    Abuyaman, Omar; Torring, Niels; Obeid, Rima

    2016-01-01

    transcobalamin (TC) with the risk of subsequent preeclampsia using serum samples from asymptomatic first trimester pregnant women. Moreover, we aimed to establish reference intervals of the aforementioned biomarkers for first trimester pregnant women who remained healthy throughout pregnancy. STUDY DESIGN...... preeclampsia while the controls remained normotensive throughout pregnancy. We measured the serum concentration of sCD320, holoTC, and total TC by using in-house ELISA methods. RESULTS: First trimester median concentrations of sCD320, holoTC and total TC were not significantly different between cases...... and controls. The odd ratio for developing preeclampsia based on exposure to low or high levels of sCD320, holoTC or total TC at first trimester was not significant. The reference intervals (2.5-97.5% percentiles (median)) derived from the controls were 50-170 (90) pmol\\L for sCD320, 20-140 (70) pmol...

  18. Predicting binding affinities of protein ligands from three-dimensional models: application to peptide binding to class I major histocompatibility proteins

    DEFF Research Database (Denmark)

    Rognan, D; Lauemoller, S L; Holm, A

    1999-01-01

    A simple and fast free energy scoring function (Fresno) has been developed to predict the binding free energy of peptides to class I major histocompatibility (MHC) proteins. It differs from existing scoring functions mainly by the explicit treatment of ligand desolvation and of unfavorable protein...... coordinates of the MHC-bound peptide have first been determined with an accuracy of about 1-1.5 A. Furthermore, it may be easily recalibrated for any protein-ligand complex.......) and of a series of 16 peptides to H-2K(k). Predictions were more accurate for HLA-A2-binding peptides as the training set had been built from experimentally determined structures. The average error in predicting the binding free energy of the test peptides was 3.1 kJ/mol. For the homology model-derived equation...

  19. Structural Fingerprints of Transcription Factor Binding Site Regions

    Directory of Open Access Journals (Sweden)

    Peter Willett

    2009-03-01

    Full Text Available Fourier transforms are a powerful tool in the prediction of DNA sequence properties, such as the presence/absence of codons. We have previously compiled a database of the structural properties of all 32,896 unique DNA octamers. In this work we apply Fourier techniques to the analysis of the structural properties of human chromosomes 21 and 22 and also to three sets of transcription factor binding sites within these chromosomes. We find that, for a given structural property, the structural property power spectra of chromosomes 21 and 22 are strikingly similar. We find common peaks in their power spectra for both Sp1 and p53 transcription factor binding sites. We use the power spectra as a structural fingerprint and perform similarity searching in order to find transcription factor binding site regions. This approach provides a new strategy for searching the genome data for information. Although it is difficult to understand the relationship between specific functional properties and the set of structural parameters in our database, our structural fingerprints nevertheless provide a useful tool for searching for function information in sequence data. The power spectrum fingerprints provide a simple, fast method for comparing a set of functional sequences, in this case transcription factor binding site regions, with the sequences of whole chromosomes. On its own, the power spectrum fingerprint does not find all transcription factor binding sites in a chromosome, but the results presented here show that in combination with other approaches, this technique will improve the chances of identifying functional sequences hidden in genomic data.

  20. ProBiS-ligands: a web server for prediction of ligands by examination of protein binding sites.

    Science.gov (United States)

    Konc, Janez; Janežič, Dušanka

    2014-07-01

    The ProBiS-ligands web server predicts binding of ligands to a protein structure. Starting with a protein structure or binding site, ProBiS-ligands first identifies template proteins in the Protein Data Bank that share similar binding sites. Based on the superimpositions of the query protein and the similar binding sites found, the server then transposes the ligand structures from those sites to the query protein. Such ligand prediction supports many activities, e.g. drug repurposing. The ProBiS-ligands web server, an extension of the ProBiS web server, is open and free to all users at http://probis.cmm.ki.si/ligands. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Predicting Flavin and Nicotinamide Adenine Dinucleotide-Binding Sites in Proteins Using the Fragment Transformation Method

    Directory of Open Access Journals (Sweden)

    Chih-Hao Lu

    2015-01-01

    Full Text Available We developed a computational method to identify NAD- and FAD-binding sites in proteins. First, we extracted from the Protein Data Bank structures of proteins that bind to at least one of these ligands. NAD-/FAD-binding residue templates were then constructed by identifying binding residues through the ligand-binding database BioLiP. The fragment transformation method was used to identify structures within query proteins that resembled the ligand-binding templates. By comparing residue types and their relative spatial positions, potential binding sites were identified and a ligand-binding potential for each residue was calculated. Setting the false positive rate at 5%, our method predicted NAD- and FAD-binding sites at true positive rates of 67.1% and 68.4%, respectively. Our method provides excellent results for identifying FAD- and NAD-binding sites in proteins, and the most important is that the requirement of conservation of residue types and local structures in the FAD- and NAD-binding sites can be verified.

  2. LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation

    Directory of Open Access Journals (Sweden)

    Schroeder Michael

    2006-09-01

    Full Text Available Abstract Background Identifying pockets on protein surfaces is of great importance for many structure-based drug design applications and protein-ligand docking algorithms. Over the last ten years, many geometric methods for the prediction of ligand-binding sites have been developed. Results We present LIGSITEcsc, an extension and implementation of the LIGSITE algorithm. LIGSITEcsc is based on the notion of surface-solvent-surface events and the degree of conservation of the involved surface residues. We compare our algorithm to four other approaches, LIGSITE, CAST, PASS, and SURFNET, and evaluate all on a dataset of 48 unbound/bound structures and 210 bound-structures. LIGSITEcsc performs slightly better than the other tools and achieves a success rate of 71% and 75%, respectively. Conclusion The use of the Connolly surface leads to slight improvements, the prediction re-ranking by conservation to significant improvements of the binding site predictions. A web server for LIGSITEcsc and its source code is available at scoppi.biotec.tu-dresden.de/pocket.

  3. The structure of apo and holo forms of xylose reductase, a dimeric aldo-keto reductase from Candida tenuis.

    Science.gov (United States)

    Kavanagh, Kathryn L; Klimacek, Mario; Nidetzky, Bernd; Wilson, David K

    2002-07-16

    Xylose reductase is a homodimeric oxidoreductase dependent on NADPH or NADH and belongs to the largely monomeric aldo-keto reductase superfamily of proteins. It catalyzes the first step in the assimilation of xylose, an aldose found to be a major constituent monosaccharide of renewable plant hemicellulosic material, into yeast metabolic pathways. It does this by reducing open chain xylose to xylitol, which is reoxidized to xylulose by xylitol dehydrogenase and metabolically integrated via the pentose phosphate pathway. No structure has yet been determined for a xylose reductase, a dimeric aldo-keto reductase or a family 2 aldo-keto reductase. The structures of the Candida tenuis xylose reductase apo- and holoenzyme, which crystallize in spacegroup C2 with different unit cells, have been determined to 2.2 A resolution and an R-factor of 17.9 and 20.8%, respectively. Residues responsible for mediating the novel dimeric interface include Asp-178, Arg-181, Lys-202, Phe-206, Trp-313, and Pro-319. Alignments with other superfamily members indicate that these interactions are conserved in other dimeric xylose reductases but not throughout the remainder of the oligomeric aldo-keto reductases, predicting alternate modes of oligomerization for other families. An arrangement of side chains in a catalytic triad shows that Tyr-52 has a conserved function as a general acid. The loop that folds over the NAD(P)H cosubstrate is disordered in the apo form but becomes ordered upon cosubstrate binding. A slow conformational isomerization of this loop probably accounts for the observed rate-limiting step involving release of cosubstrate. Xylose binding (K(m) = 87 mM) is mediated by interactions with a binding pocket that is more polar than a typical aldo-keto reductase. Modeling of xylose into the active site of the holoenzyme using ordered waters as a guide for sugar hydroxyls suggests a convincing mode of substrate binding.

  4. SAAMBE: Webserver to Predict the Charge of Binding Free Energy Caused by Amino Acids Mutations.

    Science.gov (United States)

    Petukh, Marharyta; Dai, Luogeng; Alexov, Emil

    2016-04-12

    Predicting the effect of amino acid substitutions on protein-protein affinity (typically evaluated via the change of protein binding free energy) is important for both understanding the disease-causing mechanism of missense mutations and guiding protein engineering. In addition, researchers are also interested in understanding which energy components are mostly affected by the mutation and how the mutation affects the overall structure of the corresponding protein. Here we report a webserver, the Single Amino Acid Mutation based change in Binding free Energy (SAAMBE) webserver, which addresses the demand for tools for predicting the change of protein binding free energy. SAAMBE is an easy to use webserver, which only requires that a coordinate file be inputted and the user is provided with various, but easy to navigate, options. The user specifies the mutation position, wild type residue and type of mutation to be made. The server predicts the binding free energy change, the changes of the corresponding energy components and provides the energy minimized 3D structure of the wild type and mutant proteins for download. The SAAMBE protocol performance was tested by benchmarking the predictions against over 1300 experimentally determined changes of binding free energy and a Pearson correlation coefficient of 0.62 was obtained. How the predictions can be used for discriminating disease-causing from harmless mutations is discussed. The webserver can be accessed via http://compbio.clemson.edu/saambe_webserver/.

  5. Three-dimensional (3D) structure prediction and function analysis of the chitin-binding domain 3 protein HD73_3189 from Bacillus thuringiensis HD73.

    Science.gov (United States)

    Zhan, Yiling; Guo, Shuyuan

    2015-01-01

    Bacillus thuringiensis (Bt) is capable of producing a chitin-binding protein believed to be functionally important to bacteria during the stationary phase of its growth cycle. In this paper, the chitin-binding domain 3 protein HD73_3189 from B. thuringiensis has been analyzed by computer technology. Primary and secondary structural analyses demonstrated that HD73_3189 is negatively charged and contains several α-helices, aperiodical coils and β-strands. Domain and motif analyses revealed that HD73_3189 contains a signal peptide, an N-terminal chitin binding 3 domains, two copies of a fibronectin-like domain 3 and a C-terminal carbohydrate binding domain classified as CBM_5_12. Moreover, analysis predicted the protein's associated localization site to be the cell wall. Ligand site prediction determined that amino acid residues GLU-312, TRP-334, ILE-341 and VAL-382 exposed on the surface of the target protein exhibit polar interactions with the substrate.

  6. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.

    2013-11-11

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and \\'through-DNA\\' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  7. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.; Doxey, A. C.; Wenger, A. M.; Bejerano, G.

    2013-01-01

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  8. Interaction between holo transferrin and HSA-PPIX complex in the presence of lomefloxacin: An evaluation of PPIX aggregation in protein-protein interactions

    Science.gov (United States)

    Sattar, Zohreh; Iranfar, Hediye; Asoodeh, Ahmad; Saberi, Mohammad Reza; Mazhari, Mahboobeh; Chamani, Jamshidkhan

    2012-11-01

    Human serum albumin (HSA) and holo transferrin (TF) are two serum carrier proteins that are able to interact with each other, thereby altering their binding behavior toward their ligands. During the course of this study, the interaction between HSA-PPIX and TF, in the presence and absence of lomefloxacin (LMF), was for the first time investigated using different spectroscopic and molecular modeling techniques. Fluorescence spectroscopy experiments were performed in order to study conformational changes of proteins. The RLS technique was utilized to investigate the effect of LMF on J-aggregation of PPIX, which is the first report of its kind. Our findings present clear-cut evidence for the alteration of interactions between HSA and TF in the presence of PPIX and changes in drug-binding to HSA and HSA-PPIX complex upon interaction with TF. Moreover, molecular modeling studies suggested that the binding site for LMF became switched in the presence of PPIX, and that LMF bound to the site IIA of HSA. The obtained results should give new insight into research in this field and may cast some light on the dynamics of drugs in biological systems.

  9. Ligand Binding Site Detection by Local Structure Alignment and Its Performance Complementarity

    Science.gov (United States)

    Lee, Hui Sun; Im, Wonpil

    2013-01-01

    Accurate determination of potential ligand binding sites (BS) is a key step for protein function characterization and structure-based drug design. Despite promising results of template-based BS prediction methods using global structure alignment (GSA), there is a room to improve the performance by properly incorporating local structure alignment (LSA) because BS are local structures and often similar for proteins with dissimilar global folds. We present a template-based ligand BS prediction method using G-LoSA, our LSA tool. A large benchmark set validation shows that G-LoSA predicts drug-like ligands’ positions in single-chain protein targets more precisely than TM-align, a GSA-based method, while the overall success rate of TM-align is better. G-LoSA is particularly efficient for accurate detection of local structures conserved across proteins with diverse global topologies. Recognizing the performance complementarity of G-LoSA to TM-align and a non-template geometry-based method, fpocket, a robust consensus scoring method, CMCS-BSP (Complementary Methods and Consensus Scoring for ligand Binding Site Prediction), is developed and shows improvement on prediction accuracy. The G-LoSA source code is freely available at http://im.bioinformatics.ku.edu/GLoSA. PMID:23957286

  10. Mixed Reality with HoloLens: Where Virtual Reality Meets Augmented Reality in the Operating Room.

    Science.gov (United States)

    Tepper, Oren M; Rudy, Hayeem L; Lefkowitz, Aaron; Weimer, Katie A; Marks, Shelby M; Stern, Carrie S; Garfein, Evan S

    2017-11-01

    Virtual reality and augmented reality devices have recently been described in the surgical literature. The authors have previously explored various iterations of these devices, and although they show promise, it has become clear that virtual reality and/or augmented reality devices alone do not adequately meet the demands of surgeons. The solution may lie in a hybrid technology known as mixed reality, which merges many virtual reality and augmented realty features. Microsoft's HoloLens, the first commercially available mixed reality device, provides surgeons intraoperative hands-free access to complex data, the real environment, and bidirectional communication. This report describes the use of HoloLens in the operating room to improve decision-making and surgical workflow. The pace of mixed reality-related technological development will undoubtedly be rapid in the coming years, and plastic surgeons are ideally suited to both lead and benefit from this advance.

  11. Facilitating RNA structure prediction with microarrays.

    Science.gov (United States)

    Kierzek, Elzbieta; Kierzek, Ryszard; Turner, Douglas H; Catrina, Irina E

    2006-01-17

    Determining RNA secondary structure is important for understanding structure-function relationships and identifying potential drug targets. This paper reports the use of microarrays with heptamer 2'-O-methyl oligoribonucleotides to probe the secondary structure of an RNA and thereby improve the prediction of that secondary structure. When experimental constraints from hybridization results are added to a free-energy minimization algorithm, the prediction of the secondary structure of Escherichia coli 5S rRNA improves from 27 to 92% of the known canonical base pairs. Optimization of buffer conditions for hybridization and application of 2'-O-methyl-2-thiouridine to enhance binding and improve discrimination between AU and GU pairs are also described. The results suggest that probing RNA with oligonucleotide microarrays can facilitate determination of secondary structure.

  12. Cloud computing approaches for prediction of ligand binding poses and pathways.

    Science.gov (United States)

    Lawrenz, Morgan; Shukla, Diwakar; Pande, Vijay S

    2015-01-22

    We describe an innovative protocol for ab initio prediction of ligand crystallographic binding poses and highly effective analysis of large datasets generated for protein-ligand dynamics. We include a procedure for setup and performance of distributed molecular dynamics simulations on cloud computing architectures, a model for efficient analysis of simulation data, and a metric for evaluation of model convergence. We give accurate binding pose predictions for five ligands ranging in affinity from 7 nM to > 200 μM for the immunophilin protein FKBP12, for expedited results in cases where experimental structures are difficult to produce. Our approach goes beyond single, low energy ligand poses to give quantitative kinetic information that can inform protein engineering and ligand design.

  13. Target and Tissue Selectivity Prediction by Integrated Mechanistic Pharmacokinetic-Target Binding and Quantitative Structure Activity Modeling.

    Science.gov (United States)

    Vlot, Anna H C; de Witte, Wilhelmus E A; Danhof, Meindert; van der Graaf, Piet H; van Westen, Gerard J P; de Lange, Elizabeth C M

    2017-12-04

    Selectivity is an important attribute of effective and safe drugs, and prediction of in vivo target and tissue selectivity would likely improve drug development success rates. However, a lack of understanding of the underlying (pharmacological) mechanisms and availability of directly applicable predictive methods complicates the prediction of selectivity. We explore the value of combining physiologically based pharmacokinetic (PBPK) modeling with quantitative structure-activity relationship (QSAR) modeling to predict the influence of the target dissociation constant (K D ) and the target dissociation rate constant on target and tissue selectivity. The K D values of CB1 ligands in the ChEMBL database are predicted by QSAR random forest (RF) modeling for the CB1 receptor and known off-targets (TRPV1, mGlu5, 5-HT1a). Of these CB1 ligands, rimonabant, CP-55940, and Δ 8 -tetrahydrocanabinol, one of the active ingredients of cannabis, were selected for simulations of target occupancy for CB1, TRPV1, mGlu5, and 5-HT1a in three brain regions, to illustrate the principles of the combined PBPK-QSAR modeling. Our combined PBPK and target binding modeling demonstrated that the optimal values of the K D and k off for target and tissue selectivity were dependent on target concentration and tissue distribution kinetics. Interestingly, if the target concentration is high and the perfusion of the target site is low, the optimal K D value is often not the lowest K D value, suggesting that optimization towards high drug-target affinity can decrease the benefit-risk ratio. The presented integrative structure-pharmacokinetic-pharmacodynamic modeling provides an improved understanding of tissue and target selectivity.

  14. Supervised machine learning techniques to predict binding affinity. A study for cyclin-dependent kinase 2.

    Science.gov (United States)

    de Ávila, Maurício Boff; Xavier, Mariana Morrone; Pintro, Val Oliveira; de Azevedo, Walter Filgueira

    2017-12-09

    Here we report the development of a machine-learning model to predict binding affinity based on the crystallographic structures of protein-ligand complexes. We used an ensemble of crystallographic structures (resolution better than 1.5 Å resolution) for which half-maximal inhibitory concentration (IC 50 ) data is available. Polynomial scoring functions were built using as explanatory variables the energy terms present in the MolDock and PLANTS scoring functions. Prediction performance was tested and the supervised machine learning models showed improvement in the prediction power, when compared with PLANTS and MolDock scoring functions. In addition, the machine-learning model was applied to predict binding affinity of CDK2, which showed a better performance when compared with AutoDock4, AutoDock Vina, MolDock, and PLANTS scores. Copyright © 2017 Elsevier Inc. All rights reserved.

  15. Predicting protein-ATP binding sites from primary sequence through fusing bi-profile sampling of multi-view features

    Directory of Open Access Journals (Sweden)

    Zhang Ya-Nan

    2012-05-01

    Full Text Available Abstract Background Adenosine-5′-triphosphate (ATP is one of multifunctional nucleotides and plays an important role in cell biology as a coenzyme interacting with proteins. Revealing the binding sites between protein and ATP is significantly important to understand the functionality of the proteins and the mechanisms of protein-ATP complex. Results In this paper, we propose a novel framework for predicting the proteins’ functional residues, through which they can bind with ATP molecules. The new prediction protocol is achieved by combination of sequence evolutional information and bi-profile sampling of multi-view sequential features and the sequence derived structural features. The hypothesis for this strategy is single-view feature can only represent partial target’s knowledge and multiple sources of descriptors can be complementary. Conclusions Prediction performances evaluated by both 5-fold and leave-one-out jackknife cross-validation tests on two benchmark datasets consisting of 168 and 227 non-homologous ATP binding proteins respectively demonstrate the efficacy of the proposed protocol. Our experimental results also reveal that the residue structural characteristics of real protein-ATP binding sites are significant different from those normal ones, for example the binding residues do not show high solvent accessibility propensities, and the bindings prefer to occur at the conjoint points between different secondary structure segments. Furthermore, results also show that performance is affected by the imbalanced training datasets by testing multiple ratios between positive and negative samples in the experiments. Increasing the dataset scale is also demonstrated useful for improving the prediction performances.

  16. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d

    Directory of Open Access Journals (Sweden)

    Moffatt Barbara A

    2010-08-01

    Full Text Available Abstract Background Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB for coplanar aromatic motifs similar to those found in known glycan-binding proteins. Results The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192 in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Conclusions Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  17. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d.

    Science.gov (United States)

    Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J

    2010-08-03

    Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  18. Combining structural modeling with ensemble machine learning to accurately predict protein fold stability and binding affinity effects upon mutation.

    Directory of Open Access Journals (Sweden)

    Niklas Berliner

    Full Text Available Advances in sequencing have led to a rapid accumulation of mutations, some of which are associated with diseases. However, to draw mechanistic conclusions, a biochemical understanding of these mutations is necessary. For coding mutations, accurate prediction of significant changes in either the stability of proteins or their affinity to their binding partners is required. Traditional methods have used semi-empirical force fields, while newer methods employ machine learning of sequence and structural features. Here, we show how combining both of these approaches leads to a marked boost in accuracy. We introduce ELASPIC, a novel ensemble machine learning approach that is able to predict stability effects upon mutation in both, domain cores and domain-domain interfaces. We combine semi-empirical energy terms, sequence conservation, and a wide variety of molecular details with a Stochastic Gradient Boosting of Decision Trees (SGB-DT algorithm. The accuracy of our predictions surpasses existing methods by a considerable margin, achieving correlation coefficients of 0.77 for stability, and 0.75 for affinity predictions. Notably, we integrated homology modeling to enable proteome-wide prediction and show that accurate prediction on modeled structures is possible. Lastly, ELASPIC showed significant differences between various types of disease-associated mutations, as well as between disease and common neutral mutations. Unlike pure sequence-based prediction methods that try to predict phenotypic effects of mutations, our predictions unravel the molecular details governing the protein instability, and help us better understand the molecular causes of diseases.

  19. An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.

    Directory of Open Access Journals (Sweden)

    Cheng Zheng

    Full Text Available Zinc-binding proteins are the most abundant metalloproteins in the Protein Data Bank where the zinc ions usually have catalytic, regulatory or structural roles critical for the function of the protein. Accurate prediction of zinc-binding sites is not only useful for the inference of protein function but also important for the prediction of 3D structure. Here, we present a new integrative framework that combines multiple sequence and structural properties and graph-theoretic network features, followed by an efficient feature selection to improve prediction of zinc-binding sites. We investigate what information can be retrieved from the sequence, structure and network levels that is relevant to zinc-binding site prediction. We perform a two-step feature selection using random forest to remove redundant features and quantify the relative importance of the retrieved features. Benchmarking on a high-quality structural dataset containing 1,103 protein chains and 484 zinc-binding residues, our method achieved >80% recall at a precision of 75% for the zinc-binding residues Cys, His, Glu and Asp on 5-fold cross-validation tests, which is a 10%-28% higher recall at the 75% equal precision compared to SitePredict and zincfinder at residue level using the same dataset. The independent test also indicates that our method has achieved recall of 0.790 and 0.759 at residue and protein levels, respectively, which is a performance better than the other two methods. Moreover, AUC (the Area Under the Curve and AURPC (the Area Under the Recall-Precision Curve by our method are also respectively better than those of the other two methods. Our method can not only be applied to large-scale identification of zinc-binding sites when structural information of the target is available, but also give valuable insights into important features arising from different levels that collectively characterize the zinc-binding sites. The scripts and datasets are available at http://protein.cau.edu.cn/zincidentifier/.

  20. DNABP: Identification of DNA-Binding Proteins Based on Feature Selection Using a Random Forest and Predicting Binding Residues.

    Science.gov (United States)

    Ma, Xin; Guo, Jing; Sun, Xiao

    2016-01-01

    DNA-binding proteins are fundamentally important in cellular processes. Several computational-based methods have been developed to improve the prediction of DNA-binding proteins in previous years. However, insufficient work has been done on the prediction of DNA-binding proteins from protein sequence information. In this paper, a novel predictor, DNABP (DNA-binding proteins), was designed to predict DNA-binding proteins using the random forest (RF) classifier with a hybrid feature. The hybrid feature contains two types of novel sequence features, which reflect information about the conservation of physicochemical properties of the amino acids, and the binding propensity of DNA-binding residues and non-binding propensities of non-binding residues. The comparisons with each feature demonstrated that these two novel features contributed most to the improvement in predictive ability. Furthermore, to improve the prediction performance of the DNABP model, feature selection using the minimum redundancy maximum relevance (mRMR) method combined with incremental feature selection (IFS) was carried out during the model construction. The results showed that the DNABP model could achieve 86.90% accuracy, 83.76% sensitivity, 90.03% specificity and a Matthews correlation coefficient of 0.727. High prediction accuracy and performance comparisons with previous research suggested that DNABP could be a useful approach to identify DNA-binding proteins from sequence information. The DNABP web server system is freely available at http://www.cbi.seu.edu.cn/DNABP/.

  1. Transcription factor binding sites prediction based on modified nucleosomes.

    Directory of Open Access Journals (Sweden)

    Mohammad Talebzadeh

    Full Text Available In computational methods, position weight matrices (PWMs are commonly applied for transcription factor binding site (TFBS prediction. Although these matrices are more accurate than simple consensus sequences to predict actual binding sites, they usually produce a large number of false positive (FP predictions and so are impoverished sources of information. Several studies have employed additional sources of information such as sequence conservation or the vicinity to transcription start sites to distinguish true binding regions from random ones. Recently, the spatial distribution of modified nucleosomes has been shown to be associated with different promoter architectures. These aligned patterns can facilitate DNA accessibility for transcription factors. We hypothesize that using data from these aligned and periodic patterns can improve the performance of binding region prediction. In this study, we propose two effective features, "modified nucleosomes neighboring" and "modified nucleosomes occupancy", to decrease FP in binding site discovery. Based on these features, we designed a logistic regression classifier which estimates the probability of a region as a TFBS. Our model learned each feature based on Sp1 binding sites on Chromosome 1 and was tested on the other chromosomes in human CD4+T cells. In this work, we investigated 21 histone modifications and found that only 8 out of 21 marks are strongly correlated with transcription factor binding regions. To prove that these features are not specific to Sp1, we combined the logistic regression classifier with the PWM, and created a new model to search TFBSs on the genome. We tested the model using transcription factors MAZ, PU.1 and ELF1 and compared the results to those using only the PWM. The results show that our model can predict Transcription factor binding regions more successfully. The relative simplicity of the model and capability of integrating other features make it a superior method

  2. A tool for calculating binding-site residues on proteins from PDB structures

    Directory of Open Access Journals (Sweden)

    Hu Jing

    2009-08-01

    Full Text Available Abstract Background In the research on protein functional sites, researchers often need to identify binding-site residues on a protein. A commonly used strategy is to find a complex structure from the Protein Data Bank (PDB that consists of the protein of interest and its interacting partner(s and calculate binding-site residues based on the complex structure. However, since a protein may participate in multiple interactions, the binding-site residues calculated based on one complex structure usually do not reveal all binding sites on a protein. Thus, this requires researchers to find all PDB complexes that contain the protein of interest and combine the binding-site information gleaned from them. This process is very time-consuming. Especially, combing binding-site information obtained from different PDB structures requires tedious work to align protein sequences. The process becomes overwhelmingly difficult when researchers have a large set of proteins to analyze, which is usually the case in practice. Results In this study, we have developed a tool for calculating binding-site residues on proteins, TCBRP http://yanbioinformatics.cs.usu.edu:8080/ppbindingsubmit. For an input protein, TCBRP can quickly find all binding-site residues on the protein by automatically combining the information obtained from all PDB structures that consist of the protein of interest. Additionally, TCBRP presents the binding-site residues in different categories according to the interaction type. TCBRP also allows researchers to set the definition of binding-site residues. Conclusion The developed tool is very useful for the research on protein binding site analysis and prediction.

  3. Computational predictions of zinc oxide hollow structures

    Science.gov (United States)

    Tuoc, Vu Ngoc; Huan, Tran Doan; Thao, Nguyen Thi

    2018-03-01

    Nanoporous materials are emerging as potential candidates for a wide range of technological applications in environment, electronic, and optoelectronics, to name just a few. Within this active research area, experimental works are predominant while theoretical/computational prediction and study of these materials face some intrinsic challenges, one of them is how to predict porous structures. We propose a computationally and technically feasible approach for predicting zinc oxide structures with hollows at the nano scale. The designed zinc oxide hollow structures are studied with computations using the density functional tight binding and conventional density functional theory methods, revealing a variety of promising mechanical and electronic properties, which can potentially find future realistic applications.

  4. Structure-based prediction of free energy changes of binding of PTP1B inhibitors

    Science.gov (United States)

    Wang, Jing; Ling Chan, Shek; Ramnarayan, Kal

    2003-08-01

    The goals were (1) to understand the driving forces in the binding of small molecule inhibitors to the active site of PTP1B and (2) to develop a molecular mechanics-based empirical free energy function for compound potency prediction. A set of compounds with known activities was docked onto the active site. The related energy components and molecular surface areas were calculated. The bridging water molecules were identified and their contributions were considered. Linear relationships were explored between the above terms and the binding free energies of compounds derived based on experimental inhibition constants. We found that minimally three terms are required to give rise to a good correlation (0.86) with predictive power in five-group cross-validation test (q2 = 0.70). The dominant terms are the electrostatic energy and non-electrostatic energy stemming from the intra- and intermolecular interactions of solutes and from those of bridging water molecules in complexes.

  5. Substituting random forest for multiple linear regression improves binding affinity prediction of scoring functions: Cyscore as a case study.

    Science.gov (United States)

    Li, Hongjian; Leung, Kwong-Sak; Wong, Man-Hon; Ballester, Pedro J

    2014-08-27

    State-of-the-art protein-ligand docking methods are generally limited by the traditionally low accuracy of their scoring functions, which are used to predict binding affinity and thus vital for discriminating between active and inactive compounds. Despite intensive research over the years, classical scoring functions have reached a plateau in their predictive performance. These assume a predetermined additive functional form for some sophisticated numerical features, and use standard multivariate linear regression (MLR) on experimental data to derive the coefficients. In this study we show that such a simple functional form is detrimental for the prediction performance of a scoring function, and replacing linear regression by machine learning techniques like random forest (RF) can improve prediction performance. We investigate the conditions of applying RF under various contexts and find that given sufficient training samples RF manages to comprehensively capture the non-linearity between structural features and measured binding affinities. Incorporating more structural features and training with more samples can both boost RF performance. In addition, we analyze the importance of structural features to binding affinity prediction using the RF variable importance tool. Lastly, we use Cyscore, a top performing empirical scoring function, as a baseline for comparison study. Machine-learning scoring functions are fundamentally different from classical scoring functions because the former circumvents the fixed functional form relating structural features with binding affinities. RF, but not MLR, can effectively exploit more structural features and more training samples, leading to higher prediction performance. The future availability of more X-ray crystal structures will further widen the performance gap between RF-based and MLR-based scoring functions. This further stresses the importance of substituting RF for MLR in scoring function development.

  6. Structural and Histone Binding Ability Characterizations of Human PWWP Domains

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Hong; Zeng, Hong; Lam, Robert; Tempel, Wolfram; Amaya, Maria F.; Xu, Chao; Dombrovski, Ludmila; Qiu, Wei; Wang, Yanming; Min, Jinrong (Toronto); (Penn)

    2013-09-25

    The PWWP domain was first identified as a structural motif of 100-130 amino acids in the WHSC1 protein and predicted to be a protein-protein interaction domain. It belongs to the Tudor domain 'Royal Family', which consists of Tudor, chromodomain, MBT and PWWP domains. While Tudor, chromodomain and MBT domains have long been known to bind methylated histones, PWWP was shown to exhibit histone binding ability only until recently. The PWWP domain has been shown to be a DNA binding domain, but sequence analysis and previous structural studies show that the PWWP domain exhibits significant similarity to other 'Royal Family' members, implying that the PWWP domain has the potential to bind histones. In order to further explore the function of the PWWP domain, we used the protein family approach to determine the crystal structures of the PWWP domains from seven different human proteins. Our fluorescence polarization binding studies show that PWWP domains have weak histone binding ability, which is also confirmed by our NMR titration experiments. Furthermore, we determined the crystal structures of the BRPF1 PWWP domain in complex with H3K36me3, and HDGF2 PWWP domain in complex with H3K79me3 and H4K20me3. PWWP proteins constitute a new family of methyl lysine histone binders. The PWWP domain consists of three motifs: a canonical {beta}-barrel core, an insertion motif between the second and third {beta}-strands and a C-terminal {alpha}-helix bundle. Both the canonical {beta}-barrel core and the insertion motif are directly involved in histone binding. The PWWP domain has been previously shown to be a DNA binding domain. Therefore, the PWWP domain exhibits dual functions: binding both DNA and methyllysine histones.

  7. Linear Interaction Energy Based Prediction of Cytochrome P450 1A2 Binding Affinities with Reliability Estimation.

    Directory of Open Access Journals (Sweden)

    Luigi Capoferri

    Full Text Available Prediction of human Cytochrome P450 (CYP binding affinities of small ligands, i.e., substrates and inhibitors, represents an important task for predicting drug-drug interactions. A quantitative assessment of the ligand binding affinity towards different CYPs can provide an estimate of inhibitory activity or an indication of isoforms prone to interact with the substrate of inhibitors. However, the accuracy of global quantitative models for CYP substrate binding or inhibition based on traditional molecular descriptors can be limited, because of the lack of information on the structure and flexibility of the catalytic site of CYPs. Here we describe the application of a method that combines protein-ligand docking, Molecular Dynamics (MD simulations and Linear Interaction Energy (LIE theory, to allow for quantitative CYP affinity prediction. Using this combined approach, a LIE model for human CYP 1A2 was developed and evaluated, based on a structurally diverse dataset for which the estimated experimental uncertainty was 3.3 kJ mol-1. For the computed CYP 1A2 binding affinities, the model showed a root mean square error (RMSE of 4.1 kJ mol-1 and a standard error in prediction (SDEP in cross-validation of 4.3 kJ mol-1. A novel approach that includes information on both structural ligand description and protein-ligand interaction was developed for estimating the reliability of predictions, and was able to identify compounds from an external test set with a SDEP for the predicted affinities of 4.6 kJ mol-1 (corresponding to 0.8 pKi units.

  8. Predicting "Hot" and "Warm" Spots for Fragment Binding.

    Science.gov (United States)

    Rathi, Prakash Chandra; Ludlow, R Frederick; Hall, Richard J; Murray, Christopher W; Mortenson, Paul N; Verdonk, Marcel L

    2017-05-11

    Computational fragment mapping methods aim to predict hotspots on protein surfaces where small fragments will bind. Such methods are popular for druggability assessment as well as structure-based design. However, to date researchers developing or using such tools have had no clear way of assessing the performance of these methods. Here, we introduce the first diverse, high quality validation set for computational fragment mapping. The set contains 52 diverse examples of fragment binding "hot" and "warm" spots from the Protein Data Bank (PDB). Additionally, we describe PLImap, a novel protocol for fragment mapping based on the Protein-Ligand Informatics force field (PLIff). We evaluate PLImap against the new fragment mapping test set, and compare its performance to that of simple shape-based algorithms and fragment docking using GOLD. PLImap is made publicly available from https://bitbucket.org/AstexUK/pli .

  9. Predicting Binding Free Energy Change Caused by Point Mutations with Knowledge-Modified MM/PBSA Method.

    Directory of Open Access Journals (Sweden)

    Marharyta Petukh

    2015-07-01

    Full Text Available A new methodology termed Single Amino Acid Mutation based change in Binding free Energy (SAAMBE was developed to predict the changes of the binding free energy caused by mutations. The method utilizes 3D structures of the corresponding protein-protein complexes and takes advantage of both approaches: sequence- and structure-based methods. The method has two components: a MM/PBSA-based component, and an additional set of statistical terms delivered from statistical investigation of physico-chemical properties of protein complexes. While the approach is rigid body approach and does not explicitly consider plausible conformational changes caused by the binding, the effect of conformational changes, including changes away from binding interface, on electrostatics are mimicked with amino acid specific dielectric constants. This provides significant improvement of SAAMBE predictions as indicated by better match against experimentally determined binding free energy changes over 1300 mutations in 43 proteins. The final benchmarking resulted in a very good agreement with experimental data (correlation coefficient 0.624 while the algorithm being fast enough to allow for large-scale calculations (the average time is less than a minute per mutation.

  10. Structural properties of MHC class II ligands, implications for the prediction of MHC class II epitopes.

    Directory of Open Access Journals (Sweden)

    Kasper Winther Jørgensen

    2010-12-01

    Full Text Available Major Histocompatibility class II (MHC-II molecules sample peptides from the extracellular space allowing the immune system to detect the presence of foreign microbes from this compartment. Prediction of MHC class II ligands is complicated by the open binding cleft of the MHC class II molecule, allowing binding of peptides extending out of the binding groove. Furthermore, only a few HLA-DR alleles have been characterized with a sufficient number of peptides (100-200 peptides per allele to derive accurate description of their binding motif. Little work has been performed characterizing structural properties of MHC class II ligands. Here, we perform one such large-scale analysis. A large set of SYFPEITHI MHC class II ligands covering more than 20 different HLA-DR molecules was analyzed in terms of their secondary structure and surface exposure characteristics in the context of the native structure of the corresponding source protein. We demonstrated that MHC class II ligands are significantly more exposed and have significantly more coil content than other peptides in the same protein with similar predicted binding affinity. We next exploited this observation to derive an improved prediction method for MHC class II ligands by integrating prediction of MHC- peptide binding with prediction of surface exposure and protein secondary structure. This combined prediction method was shown to significantly outperform the state-of-the-art MHC class II peptide binding prediction method when used to identify MHC class II ligands. We also tried to integrate N- and O-glycosylation in our prediction methods but this additional information was found not to improve prediction performance. In summary, these findings strongly suggest that local structural properties influence antigen processing and/or the accessibility of peptides to the MHC class II molecule.

  11. Structural insights into conserved L-arabinose metabolic enzymes reveal the substrate binding site of a thermophilic L-arabinose isomerase.

    Science.gov (United States)

    Lee, Yong-Jik; Lee, Sang-Jae; Kim, Seong-Bo; Lee, Sang Jun; Lee, Sung Haeng; Lee, Dong-Woo

    2014-03-18

    Structural genomics demonstrates that despite low levels of structural similarity of proteins comprising a metabolic pathway, their substrate binding regions are likely to be conserved. Herein based on the 3D-structures of the α/β-fold proteins involved in the ara operon, we attempted to predict the substrate binding residues of thermophilic Geobacillus stearothermophilus L-arabinose isomerase (GSAI) with no 3D-structure available. Comparison of the structures of L-arabinose catabolic enzymes revealed a conserved feature to form the substrate-binding modules, which can be extended to predict the substrate binding site of GSAI (i.e., D195, E261 and E333). Moreover, these data implicated that proteins in the l-arabinose metabolic pathway might retain their substrate binding niches as the modular structure through conserved molecular evolution even with totally different structural scaffolds. Copyright © 2014 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  12. Structure of Drosophila Oskar reveals a novel RNA binding protein

    Science.gov (United States)

    Yang, Na; Yu, Zhenyu; Hu, Menglong; Wang, Mingzhu; Lehmann, Ruth; Xu, Rui-Ming

    2015-01-01

    Oskar (Osk) protein plays critical roles during Drosophila germ cell development, yet its functions in germ-line formation and body patterning remain poorly understood. This situation contrasts sharply with the vast knowledge about the function and mechanism of osk mRNA localization. Osk is predicted to have an N-terminal LOTUS domain (Osk-N), which has been suggested to bind RNA, and a C-terminal hydrolase-like domain (Osk-C) of unknown function. Here, we report the crystal structures of Osk-N and Osk-C. Osk-N shows a homodimer of winged-helix–fold modules, but without detectable RNA-binding activity. Osk-C has a lipase-fold structure but lacks critical catalytic residues at the putative active site. Surprisingly, we found that Osk-C binds the 3′UTRs of osk and nanos mRNA in vitro. Mutational studies identified a region of Osk-C important for mRNA binding. These results suggest possible functions of Osk in the regulation of stability, regulation of translation, and localization of relevant mRNAs through direct interaction with their 3′UTRs, and provide structural insights into a novel protein–RNA interaction motif involving a hydrolase-related domain. PMID:26324911

  13. Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

    Science.gov (United States)

    Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

    2014-11-01

    As many structures of protein-DNA complexes have been known in the past years, several computational methods have been developed to predict DNA-binding sites in proteins. However, its inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure 66.3% and a MCC of 0.324. Another SVM model that uses both DNA and protein sequences achieved an accuracy of 69.6%, an F-measure of 69.6%, and a MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5% and a MCC of 0.329. Both in cross-validation and independent testing, the second SVM model that used both DNA and protein sequence data showed better performance than the first model that used DNA sequence data. To the best of

  14. Visualisation of variable binding pockets on protein surfaces by probabilistic analysis of related structure sets

    Directory of Open Access Journals (Sweden)

    Ashford Paul

    2012-03-01

    Full Text Available Abstract Background Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. Results We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures. We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i analysis of a kinase superfamily highlights the

  15. Visualisation of variable binding pockets on protein surfaces by probabilistic analysis of related structure sets.

    Science.gov (United States)

    Ashford, Paul; Moss, David S; Alex, Alexander; Yeap, Siew K; Povia, Alice; Nobeli, Irene; Williams, Mark A

    2012-03-14

    Protein structures provide a valuable resource for rational drug design. For a protein with no known ligand, computational tools can predict surface pockets that are of suitable size and shape to accommodate a complementary small-molecule drug. However, pocket prediction against single static structures may miss features of pockets that arise from proteins' dynamic behaviour. In particular, ligand-binding conformations can be observed as transiently populated states of the apo protein, so it is possible to gain insight into ligand-bound forms by considering conformational variation in apo proteins. This variation can be explored by considering sets of related structures: computationally generated conformers, solution NMR ensembles, multiple crystal structures, homologues or homology models. It is non-trivial to compare pockets, either from different programs or across sets of structures. For a single structure, difficulties arise in defining particular pocket's boundaries. For a set of conformationally distinct structures the challenge is how to make reasonable comparisons between them given that a perfect structural alignment is not possible. We have developed a computational method, Provar, that provides a consistent representation of predicted binding pockets across sets of related protein structures. The outputs are probabilities that each atom or residue of the protein borders a predicted pocket. These probabilities can be readily visualised on a protein using existing molecular graphics software. We show how Provar simplifies comparison of the outputs of different pocket prediction algorithms, of pockets across multiple simulated conformations and between homologous structures. We demonstrate the benefits of use of multiple structures for protein-ligand and protein-protein interface analysis on a set of complexes and consider three case studies in detail: i) analysis of a kinase superfamily highlights the conserved occurrence of surface pockets at the active

  16. PRODIGY : a web server for predicting the binding affinity of protein-protein complexes

    NARCIS (Netherlands)

    Xue, Li; Garcia Lopes Maia Rodrigues, João; Kastritis, Panagiotis L; Bonvin, Alexandre Mjj; Vangone, Anna

    2016-01-01

    Gaining insights into the structural determinants of protein-protein interactions holds the key for a deeper understanding of biological functions, diseases and development of therapeutics. An important aspect of this is the ability to accurately predict the binding strength for a given

  17. RCK: accurate and efficient inference of sequence- and structure-based protein-RNA binding models from RNAcompete data.

    Science.gov (United States)

    Orenstein, Yaron; Wang, Yuhao; Berger, Bonnie

    2016-06-15

    Protein-RNA interactions, which play vital roles in many processes, are mediated through both RNA sequence and structure. CLIP-based methods, which measure protein-RNA binding in vivo, suffer from experimental noise and systematic biases, whereas in vitro experiments capture a clearer signal of protein RNA-binding. Among them, RNAcompete provides binding affinities of a specific protein to more than 240 000 unstructured RNA probes in one experiment. The computational challenge is to infer RNA structure- and sequence-based binding models from these data. The state-of-the-art in sequence models, Deepbind, does not model structural preferences. RNAcontext models both sequence and structure preferences, but is outperformed by GraphProt. Unfortunately, GraphProt cannot detect structural preferences from RNAcompete data due to the unstructured nature of the data, as noted by its developers, nor can it be tractably run on the full RNACompete dataset. We develop RCK, an efficient, scalable algorithm that infers both sequence and structure preferences based on a new k-mer based model. Remarkably, even though RNAcompete data is designed to be unstructured, RCK can still learn structural preferences from it. RCK significantly outperforms both RNAcontext and Deepbind in in vitro binding prediction for 244 RNAcompete experiments. Moreover, RCK is also faster and uses less memory, which enables scalability. While currently on par with existing methods in in vivo binding prediction on a small scale test, we demonstrate that RCK will increasingly benefit from experimentally measured RNA structure profiles as compared to computationally predicted ones. By running RCK on the entire RNAcompete dataset, we generate and provide as a resource a set of protein-RNA structure-based models on an unprecedented scale. Software and models are freely available at http://rck.csail.mit.edu/ bab@mit.edu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by

  18. The role of subcutaneous adipose tissue in supporting the copper balance in rats with a chronic deficiency in holo-ceruloplasmin.

    Directory of Open Access Journals (Sweden)

    Ekaterina Y Ilyechova

    Full Text Available We have previously shown that (1 an acute deficiency in blood serum holo-ceruloplasmin (Cp developed in rats that were fed fodder containing silver ions (Ag-fodder for one month and (2 the deficiency in holo-Cp was compensated by non-hepatic holo-Cp synthesis in rats that were chronically fed Ag-fodder for 6 months (Ag-rats. The purpose of the present study is to identify the organ(s that compensate for the hepatic holo-Cp deficiency in the circulation. This study was performed on rats that were fed Ag-fodder (40 mg Ag·kg-1 body mass daily for 6 months. The relative expression levels of the genes responsible for copper status were measured by RT-PCR. The in vitro synthesis and secretion of [14C]Cp were analyzed using a metabolic labeling approach. Oxidase activity was determined using a gel assay with o-dianisidine. Copper status and some hematological indexes were measured. Differential centrifugation, immunoblotting, immunoelectrophoresis, and atomic absorption spectrometry were included in the investigation. In the Ag-rats, silver accumulation was tissue-specific. Skeletal muscles and internal (IAT and subcutaneous (SAT adipose tissues did not accumulate silver significantly. In SAT, the mRNAs for the soluble and glycosylphosphatidylinositol-anchored ceruloplasmin isoforms were expressed, and their relative levels were increased two-fold in the Ag-rats. In parallel, the levels of the genes responsible for Cp metallation (Ctr1 and Atp7a/b increased correspondingly. In the SAT of the Ag-rats, Cp oxidase activity was observed in the Golgi complex and plasma membrane. Moreover, full-length [14C]Cp polypeptides were released into the medium by slices of SAT. The possibilities that SAT is part of a system that controls the copper balance in mammals, and it plays a significant role in supporting copper homeostasis throughout the body are discussed.

  19. Binding free energy analysis of protein-protein docking model structures by evERdock.

    Science.gov (United States)

    Takemura, Kazuhiro; Matubayasi, Nobuyuki; Kitao, Akio

    2018-03-14

    To aid the evaluation of protein-protein complex model structures generated by protein docking prediction (decoys), we previously developed a method to calculate the binding free energies for complexes. The method combines a short (2 ns) all-atom molecular dynamics simulation with explicit solvent and solution theory in the energy representation (ER). We showed that this method successfully selected structures similar to the native complex structure (near-native decoys) as the lowest binding free energy structures. In our current work, we applied this method (evERdock) to 100 or 300 model structures of four protein-protein complexes. The crystal structures and the near-native decoys showed the lowest binding free energy of all the examined structures, indicating that evERdock can successfully evaluate decoys. Several decoys that show low interface root-mean-square distance but relatively high binding free energy were also identified. Analysis of the fraction of native contacts, hydrogen bonds, and salt bridges at the protein-protein interface indicated that these decoys were insufficiently optimized at the interface. After optimizing the interactions around the interface by including interfacial water molecules, the binding free energies of these decoys were improved. We also investigated the effect of solute entropy on binding free energy and found that consideration of the entropy term does not necessarily improve the evaluations of decoys using the normal model analysis for entropy calculation.

  20. Bacillus cereus Fnr binds a [4Fe-4S] cluster and forms a ternary complex with ResD and PlcR

    Directory of Open Access Journals (Sweden)

    Esbelin Julia

    2012-06-01

    Full Text Available Abstract Background Bacillus cereus is a facultative anaerobe that causes diarrheal disease in humans. Diarrheal syndrome may result from the secretion of various virulence factors including hemolysin BL and nonhemolytic enterotoxin Nhe. Expression of genes encoding Hbl and Nhe is regulated by the two redox systems, ResDE and Fnr, and the virulence regulator PlcR. B. cereus Fnr is a member of the Crp/Fnr family of iron-sulfur (Fe-S proteins. Only its apo-form has so far been studied. A major goal in deciphering the Fnr-dependent regulation of enterotoxin genes is thus to obtain and characterize holoFnr. Results Fnr has been subjected to in vitro Fe-S cluster reconstitution under anoxic conditions. UV-visible and EPR spectroscopic analyses together with the chemical estimation of the iron content indicated that Fnr binds one [4Fe-4S]2+ cluster per monomer. Atmospheric O2 causes disassembly of the Fe-S cluster, which exhibited a half-life of 15 min in air. Holo- and apoFnr have similar affinities for the nhe and hbl promoter regions, while holoFnr has a higher affinity for fnr promoter region than apoFnr. Both the apo- and holo-form of Fnr interact with ResD and PlcR to form a ternary complex. Conclusions Overall, this work shows that incorporation of the [4Fe-4S]2+ cluster is not required for DNA binding of Fnr to promoter regions of hbl and nhe enterotoxin genes or for the formation of a ternary complex with ResD and PlcR. This points to some new unusual properties of Fnr that may have physiological relevance in the redox regulation of enterotoxin gene regulation.

  1. Prediction of Water Binding to Protein Hydration Sites with a Discrete, Semiexplicit Solvent Model.

    Science.gov (United States)

    Setny, Piotr

    2015-12-08

    Buried water molecules are ubiquitous in protein structures and are found at the interface of most protein-ligand complexes. Determining their distribution and thermodynamic effect is a challenging yet important task, of great of practical value for the modeling of biomolecular structures and their interactions. In this study, we present a novel method aimed at the prediction of buried water molecules in protein structures and estimation of their binding free energies. It is based on a semiexplicit, discrete solvation model, which we previously introduced in the context of small molecule hydration. The method is applicable to all macromolecular structures described by a standard all-atom force field, and predicts complete solvent distribution within a single run with modest computational cost. We demonstrate that it indicates positions of buried hydration sites, including those filled by more than one water molecule, and accurately differentiates them from sterically accessible to water but void regions. The obtained estimates of water binding free energies are in fair agreement with reference results determined with the double decoupling method.

  2. Binding Mode and Induced Fit Predictions for Prospective Computational Drug Design.

    Science.gov (United States)

    Grebner, Christoph; Iegre, Jessica; Ulander, Johan; Edman, Karl; Hogner, Anders; Tyrchan, Christian

    2016-04-25

    Computer-aided drug design plays an important role in medicinal chemistry to obtain insights into molecular mechanisms and to prioritize design strategies. Although significant improvement has been made in structure based design, it still remains a key challenge to accurately model and predict induced fit mechanisms. Most of the current available techniques either do not provide sufficient protein conformational sampling or are too computationally demanding to fit an industrial setting. The current study presents a systematic and exhaustive investigation of predicting binding modes for a range of systems using PELE (Protein Energy Landscape Exploration), an efficient and fast protein-ligand sampling algorithm. The systems analyzed (cytochrome P, kinase, protease, and nuclear hormone receptor) exhibit different complexities of ligand induced fit mechanisms and protein dynamics. The results are compared with results from classical molecular dynamics simulations and (induced fit) docking. This study shows that ligand induced side chain rearrangements and smaller to medium backbone movements are captured well in PELE. Large secondary structure rearrangements, however, remain challenging for all employed techniques. Relevant binding modes (ligand heavy atom RMSD PELE method within a few hours of simulation, positioning PELE as a tool applicable for rapid drug design cycles.

  3. HoloMonitor M4: holographic imaging cytometer for real-time kinetic label-free live-cell analysis of adherent cells

    Science.gov (United States)

    Sebesta, Mikael; Egelberg, Peter J.; Langberg, Anders; Lindskov, Jens-Henrik; Alm, Kersti; Janicke, Birgit

    2016-03-01

    Live-cell imaging enables studying dynamic cellular processes that cannot be visualized in fixed-cell assays. An increasing number of scientists in academia and the pharmaceutical industry are choosing live-cell analysis over or in addition to traditional fixed-cell assays. We have developed a time-lapse label-free imaging cytometer HoloMonitorM4. HoloMonitor M4 assists researchers to overcome inherent disadvantages of fluorescent analysis, specifically effects of chemical labels or genetic modifications which can alter cellular behavior. Additionally, label-free analysis is simple and eliminates the costs associated with staining procedures. The underlying technology principle is based on digital off-axis holography. While multiple alternatives exist for this type of analysis, we prioritized our developments to achieve the following: a) All-inclusive system - hardware and sophisticated cytometric analysis software; b) Ease of use enabling utilization of instrumentation by expert- and entrylevel researchers alike; c) Validated quantitative assay end-points tracked over time such as optical path length shift, optical volume and multiple derived imaging parameters; d) Reliable digital autofocus; e) Robust long-term operation in the incubator environment; f) High throughput and walk-away capability; and finally g) Data management suitable for single- and multi-user networks. We provide examples of HoloMonitor applications of label-free cell viability measurements and monitoring of cell cycle phase distribution.

  4. Automated benchmarking of peptide-MHC class I binding predictions

    Science.gov (United States)

    Trolle, Thomas; Metushi, Imir G.; Greenbaum, Jason A.; Kim, Yohan; Sidney, John; Lund, Ole; Sette, Alessandro; Peters, Bjoern; Nielsen, Morten

    2015-01-01

    Motivation: Numerous in silico methods predicting peptide binding to major histocompatibility complex (MHC) class I molecules have been developed over the last decades. However, the multitude of available prediction tools makes it non-trivial for the end-user to select which tool to use for a given task. To provide a solid basis on which to compare different prediction tools, we here describe a framework for the automated benchmarking of peptide-MHC class I binding prediction tools. The framework runs weekly benchmarks on data that are newly entered into the Immune Epitope Database (IEDB), giving the public access to frequent, up-to-date performance evaluations of all participating tools. To overcome potential selection bias in the data included in the IEDB, a strategy was implemented that suggests a set of peptides for which different prediction methods give divergent predictions as to their binding capability. Upon experimental binding validation, these peptides entered the benchmark study. Results: The benchmark has run for 15 weeks and includes evaluation of 44 datasets covering 17 MHC alleles and more than 4000 peptide-MHC binding measurements. Inspection of the results allows the end-user to make educated selections between participating tools. Of the four participating servers, NetMHCpan performed the best, followed by ANN, SMM and finally ARB. Availability and implementation: Up-to-date performance evaluations of each server can be found online at http://tools.iedb.org/auto_bench/mhci/weekly. All prediction tool developers are invited to participate in the benchmark. Sign-up instructions are available at http://tools.iedb.org/auto_bench/mhci/join. Contact: mniel@cbs.dtu.dk or bpeters@liai.org Supplementary information: Supplementary data are available at Bioinformatics online. PMID:25717196

  5. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Directory of Open Access Journals (Sweden)

    Xiaoxia Yang

    Full Text Available Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  6. SNBRFinder: A Sequence-Based Hybrid Algorithm for Enhanced Prediction of Nucleic Acid-Binding Residues.

    Science.gov (United States)

    Yang, Xiaoxia; Wang, Jia; Sun, Jun; Liu, Rong

    2015-01-01

    Protein-nucleic acid interactions are central to various fundamental biological processes. Automated methods capable of reliably identifying DNA- and RNA-binding residues in protein sequence are assuming ever-increasing importance. The majority of current algorithms rely on feature-based prediction, but their accuracy remains to be further improved. Here we propose a sequence-based hybrid algorithm SNBRFinder (Sequence-based Nucleic acid-Binding Residue Finder) by merging a feature predictor SNBRFinderF and a template predictor SNBRFinderT. SNBRFinderF was established using the support vector machine whose inputs include sequence profile and other complementary sequence descriptors, while SNBRFinderT was implemented with the sequence alignment algorithm based on profile hidden Markov models to capture the weakly homologous template of query sequence. Experimental results show that SNBRFinderF was clearly superior to the commonly used sequence profile-based predictor and SNBRFinderT can achieve comparable performance to the structure-based template methods. Leveraging the complementary relationship between these two predictors, SNBRFinder reasonably improved the performance of both DNA- and RNA-binding residue predictions. More importantly, the sequence-based hybrid prediction reached competitive performance relative to our previous structure-based counterpart. Our extensive and stringent comparisons show that SNBRFinder has obvious advantages over the existing sequence-based prediction algorithms. The value of our algorithm is highlighted by establishing an easy-to-use web server that is freely accessible at http://ibi.hzau.edu.cn/SNBRFinder.

  7. Development of estrogen receptor beta binding prediction model using large sets of chemicals.

    Science.gov (United States)

    Sakkiah, Sugunadevi; Selvaraj, Chandrabose; Gong, Ping; Zhang, Chaoyang; Tong, Weida; Hong, Huixiao

    2017-11-03

    We developed an ER β binding prediction model to facilitate identification of chemicals specifically bind ER β or ER α together with our previously developed ER α binding model. Decision Forest was used to train ER β binding prediction model based on a large set of compounds obtained from EADB. Model performance was estimated through 1000 iterations of 5-fold cross validations. Prediction confidence was analyzed using predictions from the cross validations. Informative chemical features for ER β binding were identified through analysis of the frequency data of chemical descriptors used in the models in the 5-fold cross validations. 1000 permutations were conducted to assess the chance correlation. The average accuracy of 5-fold cross validations was 93.14% with a standard deviation of 0.64%. Prediction confidence analysis indicated that the higher the prediction confidence the more accurate the predictions. Permutation testing results revealed that the prediction model is unlikely generated by chance. Eighteen informative descriptors were identified to be important to ER β binding prediction. Application of the prediction model to the data from ToxCast project yielded very high sensitivity of 90-92%. Our results demonstrated ER β binding of chemicals could be accurately predicted using the developed model. Coupling with our previously developed ER α prediction model, this model could be expected to facilitate drug development through identification of chemicals that specifically bind ER β or ER α .

  8. The helical structure of DNA facilitates binding

    International Nuclear Information System (INIS)

    Berg, Otto G; Mahmutovic, Anel; Marklund, Emil; Elf, Johan

    2016-01-01

    The helical structure of DNA imposes constraints on the rate of diffusion-limited protein binding. Here we solve the reaction–diffusion equations for DNA-like geometries and extend with simulations when necessary. We find that the helical structure can make binding to the DNA more than twice as fast compared to a case where DNA would be reactive only along one side. We also find that this rate advantage remains when the contributions from steric constraints and rotational diffusion of the DNA-binding protein are included. Furthermore, we find that the association rate is insensitive to changes in the steric constraints on the DNA in the helix geometry, while it is much more dependent on the steric constraints on the DNA-binding protein. We conclude that the helical structure of DNA facilitates the nonspecific binding of transcription factors and structural DNA-binding proteins in general. (paper)

  9. Structural characterization and comparison of three acyl-carrier-protein synthases from pathogenic bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Halavaty, Andrei S. [Center for Structural Genomics of Infectious Diseases, (United States); Northwestern University, Chicago, IL 60611 (United States); Kim, Youngchang [Center for Structural Genomics of Infectious Diseases, (United States); Argonne National Laboratory, Argonne, IL 60439 (United States); University of Chicago, Chicago, IL 60637 (United States); Minasov, George; Shuvalova, Ludmilla; Dubrovska, Ievgeniia; Winsor, James [Center for Structural Genomics of Infectious Diseases, (United States); Northwestern University, Chicago, IL 60611 (United States); Zhou, Min [Center for Structural Genomics of Infectious Diseases, (United States); Argonne National Laboratory, Argonne, IL 60439 (United States); University of Chicago, Chicago, IL 60637 (United States); Onopriyenko, Olena; Skarina, Tatiana [Center for Structural Genomics of Infectious Diseases, (United States); University of Toronto, Toronto, Ontario M5G 1L6 (Canada); Papazisi, Leka; Kwon, Keehwan; Peterson, Scott N. [Center for Structural Genomics of Infectious Diseases, (United States); J. Craig Venter Institute, Rockville, MD 20850 (United States); Joachimiak, Andrzej [Center for Structural Genomics of Infectious Diseases, (United States); Argonne National Laboratory, Argonne, IL 60439 (United States); University of Chicago, Chicago, IL 60637 (United States); Savchenko, Alexei [Center for Structural Genomics of Infectious Diseases, (United States); University of Toronto, Toronto, Ontario M5G 1L6 (Canada); Anderson, Wayne F., E-mail: wf-anderson@northwestern.edu [Center for Structural Genomics of Infectious Diseases, (United States); Northwestern University, Chicago, IL 60611 (United States)

    2012-10-01

    The structural characterization of acyl-carrier-protein synthase (AcpS) from three different pathogenic microorganisms is reported. One interesting finding of the present work is a crystal artifact related to the activity of the enzyme, which fortuitously represents an opportunity for a strategy to design a potential inhibitor of a pathogenic AcpS. Some bacterial type II fatty-acid synthesis (FAS II) enzymes have been shown to be important candidates for drug discovery. The scientific and medical quest for new FAS II protein targets continues to stimulate research in this field. One of the possible additional candidates is the acyl-carrier-protein synthase (AcpS) enzyme. Its holo form post-translationally modifies the apo form of an acyl carrier protein (ACP), which assures the constant delivery of thioester intermediates to the discrete enzymes of FAS II. At the Center for Structural Genomics of Infectious Diseases (CSGID), AcpSs from Staphylococcus aureus (AcpS{sub SA}), Vibrio cholerae (AcpS{sub VC}) and Bacillus anthracis (AcpS{sub BA}) have been structurally characterized in their apo, holo and product-bound forms, respectively. The structure of AcpS{sub BA} is emphasized because of the two 3′, 5′-adenosine diphosphate (3′, 5′-ADP) product molecules that are found in each of the three coenzyme A (CoA) binding sites of the trimeric protein. One 3′, 5′-ADP is bound as the 3′, 5′-ADP part of CoA in the known structures of the CoA–AcpS and 3′, 5′-ADP–AcpS binary complexes. The position of the second 3′, 5′-ADP has never been described before. It is in close proximity to the first 3′, 5′-ADP and the ACP-binding site. The coordination of two ADPs in AcpS{sub BA} may possibly be exploited for the design of AcpS inhibitors that can block binding of both CoA and ACP.

  10. Structural characterization and comparison of three acyl-carrier-protein synthases from pathogenic bacteria

    International Nuclear Information System (INIS)

    Halavaty, Andrei S.; Kim, Youngchang; Minasov, George; Shuvalova, Ludmilla; Dubrovska, Ievgeniia; Winsor, James; Zhou, Min; Onopriyenko, Olena; Skarina, Tatiana; Papazisi, Leka; Kwon, Keehwan; Peterson, Scott N.; Joachimiak, Andrzej; Savchenko, Alexei; Anderson, Wayne F.

    2012-01-01

    The structural characterization of acyl-carrier-protein synthase (AcpS) from three different pathogenic microorganisms is reported. One interesting finding of the present work is a crystal artifact related to the activity of the enzyme, which fortuitously represents an opportunity for a strategy to design a potential inhibitor of a pathogenic AcpS. Some bacterial type II fatty-acid synthesis (FAS II) enzymes have been shown to be important candidates for drug discovery. The scientific and medical quest for new FAS II protein targets continues to stimulate research in this field. One of the possible additional candidates is the acyl-carrier-protein synthase (AcpS) enzyme. Its holo form post-translationally modifies the apo form of an acyl carrier protein (ACP), which assures the constant delivery of thioester intermediates to the discrete enzymes of FAS II. At the Center for Structural Genomics of Infectious Diseases (CSGID), AcpSs from Staphylococcus aureus (AcpS SA ), Vibrio cholerae (AcpS VC ) and Bacillus anthracis (AcpS BA ) have been structurally characterized in their apo, holo and product-bound forms, respectively. The structure of AcpS BA is emphasized because of the two 3′, 5′-adenosine diphosphate (3′, 5′-ADP) product molecules that are found in each of the three coenzyme A (CoA) binding sites of the trimeric protein. One 3′, 5′-ADP is bound as the 3′, 5′-ADP part of CoA in the known structures of the CoA–AcpS and 3′, 5′-ADP–AcpS binary complexes. The position of the second 3′, 5′-ADP has never been described before. It is in close proximity to the first 3′, 5′-ADP and the ACP-binding site. The coordination of two ADPs in AcpS BA may possibly be exploited for the design of AcpS inhibitors that can block binding of both CoA and ACP

  11. Computational prediction of binding affinity for CYP1A2-ligand complexes using empirical free energy calculations

    DEFF Research Database (Denmark)

    Poongavanam, Vasanthanathan; Olsen, Lars; Jørgensen, Flemming Steen

    2010-01-01

    , and methods based on statistical mechanics. In the present investigation, we started from an LIE model to predict the binding free energy of structurally diverse compounds of cytochrome P450 1A2 ligands, one of the important human metabolizing isoforms of the cytochrome P450 family. The data set includes both...... substrates and inhibitors. It appears that the electrostatic contribution to the binding free energy becomes negligible in this particular protein and a simple empirical model was derived, based on a training set of eight compounds. The root mean square error for the training set was 3.7 kJ/mol. Subsequent......Predicting binding affinities for receptor-ligand complexes is still one of the challenging processes in computational structure-based ligand design. Many computational methods have been developed to achieve this goal, such as docking and scoring methods, the linear interaction energy (LIE) method...

  12. Integrating water exclusion theory into βcontacts to predict binding free energy changes and binding hot spots

    Science.gov (United States)

    2014-01-01

    Background Binding free energy and binding hot spots at protein-protein interfaces are two important research areas for understanding protein interactions. Computational methods have been developed previously for accurate prediction of binding free energy change upon mutation for interfacial residues. However, a large number of interrupted and unimportant atomic contacts are used in the training phase which caused accuracy loss. Results This work proposes a new method, βACV ASA , to predict the change of binding free energy after alanine mutations. βACV ASA integrates accessible surface area (ASA) and our newly defined β contacts together into an atomic contact vector (ACV). A β contact between two atoms is a direct contact without being interrupted by any other atom between them. A β contact’s potential contribution to protein binding is also supposed to be inversely proportional to its ASA to follow the water exclusion hypothesis of binding hot spots. Tested on a dataset of 396 alanine mutations, our method is found to be superior in classification performance to many other methods, including Robetta, FoldX, HotPOINT, an ACV method of β contacts without ASA integration, and ACV ASA methods (similar to βACV ASA but based on distance-cutoff contacts). Based on our data analysis and results, we can draw conclusions that: (i) our method is powerful in the prediction of binding free energy change after alanine mutation; (ii) β contacts are better than distance-cutoff contacts for modeling the well-organized protein-binding interfaces; (iii) β contacts usually are only a small fraction number of the distance-based contacts; and (iv) water exclusion is a necessary condition for a residue to become a binding hot spot. Conclusions βACV ASA is designed using the advantages of both β contacts and water exclusion. It is an excellent tool to predict binding free energy changes and binding hot spots after alanine mutation. PMID:24568581

  13. Prediction of consensus binding mode geometries for related chemical series of positive allosteric modulators of adenosine and muscarinic acetylcholine receptors.

    Science.gov (United States)

    Sakkal, Leon A; Rajkowski, Kyle Z; Armen, Roger S

    2017-06-05

    Following insights from recent crystal structures of the muscarinic acetylcholine receptor, binding modes of Positive Allosteric Modulators (PAMs) were predicted under the assumption that PAMs should bind to the extracellular surface of the active state. A series of well-characterized PAMs for adenosine (A 1 R, A 2A R, A 3 R) and muscarinic acetylcholine (M 1 R, M 5 R) receptors were modeled using both rigid and flexible receptor CHARMM-based molecular docking. Studies of adenosine receptors investigated the molecular basis of the probe-dependence of PAM activity by modeling in complex with specific agonist radioligands. Consensus binding modes map common pharmacophore features of several chemical series to specific binding interactions. These models provide a rationalization of how PAM binding slows agonist radioligand dissociation kinetics. M 1 R PAMs were predicted to bind in the analogous M 2 R PAM LY2119620 binding site. The M 5 R NAM (ML-375) was predicted to bind in the PAM (ML-380) binding site with a unique induced-fit receptor conformation. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  14. Coarse-grained/molecular mechanics of the TAS2R38 bitter taste receptor: experimentally-validated detailed structural prediction of agonist binding.

    Directory of Open Access Journals (Sweden)

    Alessandro Marchiori

    Full Text Available Bitter molecules in humans are detected by ∼25 G protein-coupled receptors (GPCRs. The lack of atomic resolution structure for any of them is complicating an in depth understanding of the molecular mechanisms underlying bitter taste perception. Here, we investigate the molecular determinants of the interaction of the TAS2R38 bitter taste receptor with its agonists phenylthiocarbamide (PTC and propylthiouracil (PROP. We use the recently developed hybrid Molecular Mechanics/Coarse Grained (MM/CG method tailored specifically for GPCRs. The method, through an extensive exploration of the conformational space in the binding pocket, allows the identification of several residues important for agonist binding that would have been very difficult to capture from the standard bioinformatics/docking approach. Our calculations suggest that both agonists bind to Asn103, Phe197, Phe264 and Trp201, whilst they do not interact with the so-called extra cellular loop 2, involved in cis-retinal binding in the GPCR rhodopsin. These predictions are consistent with data sets based on more than 20 site-directed mutagenesis and functional calcium imaging experiments of TAS2R38. The method could be readily used for other GPCRs for which experimental information is currently lacking.

  15. Physics holo.lab learning experience: using smartglasses for augmented reality labwork to foster the concepts of heat conduction

    Science.gov (United States)

    Strzys, M. P.; Kapp, S.; Thees, M.; Klein, P.; Lukowicz, P.; Knierim, P.; Schmidt, A.; Kuhn, J.

    2018-05-01

    Fundamental concepts of thermodynamics rely on abstract physical quantities such as energy, heat and entropy, which play an important role in the process of interpreting thermal phenomena and statistical mechanics. However, these quantities are not covered by human visual perception, and since heat sensation is purely qualitative and easy to deceive, an intuitive understanding often is lacking. Today immersive technologies like head-mounted displays of the newest generation, especially HoloLens, allow for high-quality augmented reality learning experiences, which can overcome this gap in human perception by presenting different representations of otherwise invisible quantities directly in the field of view of the user on the experimental apparatus, which simultaneously avoids a split-attention effect. In a mixed reality (MR) scenario as presented in this paper—which we call a holo.lab—human perception can be extended to the thermal regime by presenting false-color representations of the temperature of objects as a virtual augmentation directly on the real object itself in real-time. Direct feedback to experimental actions of the users in the form of different representations allows for immediate comparison to theoretical principles and predictions and therefore is supposed to intensify the theory–experiment interactions and to increase students’ conceptual understanding. We tested this technology for an experiment on thermal conduction of metals in the framework of undergraduate laboratories. A pilot study with treatment and control groups (N = 59) showed a small positive effect of MR on students’ performance measured with a standardized concept test for thermodynamics, pointing to an improvement of the understanding of the underlying physical concepts. These findings indicate that complex experiments could benefit even more from augmentation. This motivates us to enrich further experiments with MR.

  16. Solution Structure of 4′-Phosphopantetheine - GmACP3 from Geobacter metallireducens: A Specialized Acyl Carrier Protein with Atypical Structural Features and a Putative Role in Lipopolysaccharide Biosynthesis†

    Science.gov (United States)

    Ramelot, Theresa A.; Smola, Matthew J.; Lee, Hsiau-Wei; Ciccosanti, Colleen; Hamilton, Keith; Acton, Thomas B.; Xiao, Rong; Everett, John K.; Prestegard, James H.; Montelione, Gaetano T.; Kennedy, Michael A.

    2011-01-01

    GmACP3 from Geobacter metallireducens is a specialized acyl carrier protein (ACP) whose gene, gmet_2339, is located near genes encoding many proteins involved in lipopolysaccharide (LPS) biosynthesis, indicating a likely function for GmACP3 in LPS production. By overexpression in Escherichia coli, about 50% holo-GmACP3 and 50% apo-GmACP3 were obtained. Apo-GmACP3 exhibited slow precipitation and non-monomeric behavior by 15N NMR relaxation measurements. Addition of 4′-phosphopantetheine (4′-PP) via enzymatic conversion by E. coli holo-ACP synthase, resulted in stable >95% holo-GmACP3 that was characterized as monomeric by 15N relaxation measurements and had no indication of conformational exchange. We have determined a high-resolution solution structure of holo-GmACP3 by standard NMR methods, including refinement with two sets of NH residual dipolar couplings, allowing for a detailed structural analysis of the interactions between 4′-PP and GmACP3. Whereas the overall four helix bundle topology is similar to previously solved ACP structures, this structure has unique characteristics, including an ordered 4′-PP conformation that places the thiol at the entrance to a central hydrophobic cavity near a conserved hydrogen-bonded Trp-His pair. These residues are part of a conserved WDSLxH/N motif found in GmACP3 and it’s orthologs. The helix locations and the large hydrophobic cavity are more similar to medium- and long-chain acyl-ACPs than to other apo- and holo-ACP structures. Taken together, structural characterization along with bioinformatic analysis of nearby genes suggest that GmACP3 is involved in lipid A acylation, possibly by atypical long-chain hydroxy fatty acids, and potentially involved in synthesis of secondary metabolites. PMID:21235239

  17. Structural and energetic effects of A2A adenosine receptor mutations on agonist and antagonist binding.

    Directory of Open Access Journals (Sweden)

    Henrik Keränen

    Full Text Available To predict structural and energetic effects of point mutations on ligand binding is of considerable interest in biochemistry and pharmacology. This is not only useful in connection with site-directed mutagenesis experiments, but could also allow interpretation and prediction of individual responses to drug treatment. For G-protein coupled receptors systematic mutagenesis has provided the major part of functional data as structural information until recently has been very limited. For the pharmacologically important A(2A adenosine receptor, extensive site-directed mutagenesis data on agonist and antagonist binding is available and crystal structures of both types of complexes have been determined. Here, we employ a computational strategy, based on molecular dynamics free energy simulations, to rationalize and interpret available alanine-scanning experiments for both agonist and antagonist binding to this receptor. These computer simulations show excellent agreement with the experimental data and, most importantly, reveal the molecular details behind the observed effects which are often not immediately evident from the crystal structures. The work further provides a distinct validation of the computational strategy used to assess effects of point-mutations on ligand binding. It also highlights the importance of considering not only protein-ligand interactions but also those mediated by solvent water molecules, in ligand design projects.

  18. A systems biology approach to transcription factor binding site prediction.

    Directory of Open Access Journals (Sweden)

    Xiang Zhou

    2010-03-01

    Full Text Available The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs, identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates.We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data.Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct-interaction detection and TFBS-discovery accuracy. We estimated the accuracy

  19. Hardware device to physical structure binding and authentication

    Science.gov (United States)

    Hamlet, Jason R.; Stein, David J.; Bauer, Todd M.

    2013-08-20

    Detection and deterrence of device tampering and subversion may be achieved by including a cryptographic fingerprint unit within a hardware device for authenticating a binding of the hardware device and a physical structure. The cryptographic fingerprint unit includes an internal physically unclonable function ("PUF") circuit disposed in or on the hardware device, which generate an internal PUF value. Binding logic is coupled to receive the internal PUF value, as well as an external PUF value associated with the physical structure, and generates a binding PUF value, which represents the binding of the hardware device and the physical structure. The cryptographic fingerprint unit also includes a cryptographic unit that uses the binding PUF value to allow a challenger to authenticate the binding.

  20. Towards Automated Binding Affinity Prediction Using an Iterative Linear Interaction Energy Approach

    Directory of Open Access Journals (Sweden)

    C. Ruben Vosmeer

    2014-01-01

    Full Text Available Binding affinity prediction of potential drugs to target and off-target proteins is an essential asset in drug development. These predictions require the calculation of binding free energies. In such calculations, it is a major challenge to properly account for both the dynamic nature of the protein and the possible variety of ligand-binding orientations, while keeping computational costs tractable. Recently, an iterative Linear Interaction Energy (LIE approach was introduced, in which results from multiple simulations of a protein-ligand complex are combined into a single binding free energy using a Boltzmann weighting-based scheme. This method was shown to reach experimental accuracy for flexible proteins while retaining the computational efficiency of the general LIE approach. Here, we show that the iterative LIE approach can be used to predict binding affinities in an automated way. A workflow was designed using preselected protein conformations, automated ligand docking and clustering, and a (semi-automated molecular dynamics simulation setup. We show that using this workflow, binding affinities of aryloxypropanolamines to the malleable Cytochrome P450 2D6 enzyme can be predicted without a priori knowledge of dominant protein-ligand conformations. In addition, we provide an outlook for an approach to assess the quality of the LIE predictions, based on simulation outcomes only.

  1. Identification of antibody glycosylation structures that predict monoclonal antibody Fc-effector function.

    Science.gov (United States)

    Chung, Amy W; Crispin, Max; Pritchard, Laura; Robinson, Hannah; Gorny, Miroslaw K; Yu, Xiaojie; Bailey-Kellogg, Chris; Ackerman, Margaret E; Scanlan, Chris; Zolla-Pazner, Susan; Alter, Galit

    2014-11-13

    To determine monoclonal antibody (mAb) features that predict fragment crystalizable (Fc)-mediated effector functions against HIV. Monoclonal antibodies, derived from Chinese hamster ovary cells or Epstein-Barr virus-immortalized mouse heteromyelomas, with specificity to key regions of the HIV envelope including gp120-V2, gp120-V3 loop, gp120-CD4(+) binding site, and gp41-specific antibodies, were functionally profiled to determine the relative contribution of the variable and constant domain features of the antibodies in driving robust Fc-effector functions. Each mAb was assayed for antibody-binding affinity to gp140(SR162), antibody-dependent cellular cytotoxicity (ADCC), antibody-dependent cellular phagocytosis (ADCP) and for the ability to bind to FcγRIIa, FcγRIIb and FcγRIIIa receptors. Antibody glycan profiles were determined by HPLC. Neither the specificity nor the affinity of the mAbs determined the potency of Fc-effector function. FcγRIIIa binding strongly predicted ADCC and decreased galactose content inversely correlated with ADCP, whereas N-glycolylneuraminic acid-containing structures exhibited enhanced ADCP. Additionally, the bi-antenary glycan arm onto which galactose was added predicted enhanced binding to FcγRIIIa and ADCC activity, independent of the specificity of the mAb. Our studies point to the specific Fc-glycan structures that can selectively promote Fc-effector functions independently of the antibody specificity. Furthermore, we demonstrated antibody glycan structures associated with enhanced ADCP activity, an emerging Fc-effector function that may aid in the control and clearance of HIV infection.

  2. Comparison of S. cerevisiae F-BAR domain structures reveals a conserved inositol phosphate binding site

    Science.gov (United States)

    Moravcevic, Katarina; Alvarado, Diego; Schmitz, Karl R.; Kenniston, Jon A.; Mendrola, Jeannine M.; Ferguson, Kathryn M.; Lemmon, Mark A.

    2015-01-01

    SUMMARY F-BAR domains control membrane interactions in endocytosis, cytokinesis, and cell signaling. Although generally thought to bind curved membranes containing negatively charged phospholipids, numerous functional studies argue that differences in lipid-binding selectivities of F-BAR domains are functionally important. Here, we compare membrane-binding properties of the S. cerevisiae F-BAR domains in vitro and in vivo. Whereas some F-BAR domains (such as Bzz1p and Hof1p F-BARs) bind equally well to all phospholipids, the F-BAR domain from the RhoGAP Rgd1p preferentially binds phosphoinositides. We determined X-ray crystal structures of F-BAR domains from Hof1p and Rgd1p, the latter bound to an inositol phosphate. The structures explain phospholipid-binding selectivity differences, and reveal an F-BAR phosphoinositide binding site that is fully conserved in a mammalian RhoGAP called Gmip, and is partly retained in certain other F-BAR domains. Our findings reveal previously unappreciated determinants of F-BAR domain lipid-binding specificity, and provide a basis for its prediction from sequence. PMID:25620000

  3. PatchSurfers: Two methods for local molecular property-based binding ligand prediction.

    Science.gov (United States)

    Shin, Woong-Hee; Bures, Mark Gregory; Kihara, Daisuke

    2016-01-15

    Protein function prediction is an active area of research in computational biology. Function prediction can help biologists make hypotheses for characterization of genes and help interpret biological assays, and thus is a productive area for collaboration between experimental and computational biologists. Among various function prediction methods, predicting binding ligand molecules for a target protein is an important class because ligand binding events for a protein are usually closely intertwined with the proteins' biological function, and also because predicted binding ligands can often be directly tested by biochemical assays. Binding ligand prediction methods can be classified into two types: those which are based on protein-protein (or pocket-pocket) comparison, and those that compare a target pocket directly to ligands. Recently, our group proposed two computational binding ligand prediction methods, Patch-Surfer, which is a pocket-pocket comparison method, and PL-PatchSurfer, which compares a pocket to ligand molecules. The two programs apply surface patch-based descriptions to calculate similarity or complementarity between molecules. A surface patch is characterized by physicochemical properties such as shape, hydrophobicity, and electrostatic potentials. These properties on the surface are represented using three-dimensional Zernike descriptors (3DZD), which are based on a series expansion of a 3 dimensional function. Utilizing 3DZD for describing the physicochemical properties has two main advantages: (1) rotational invariance and (2) fast comparison. Here, we introduce Patch-Surfer and PL-PatchSurfer with an emphasis on PL-PatchSurfer, which is more recently developed. Illustrative examples of PL-PatchSurfer performance on binding ligand prediction as well as virtual drug screening are also provided. Copyright © 2015 Elsevier Inc. All rights reserved.

  4. The IntFOLD server: an integrated web resource for protein fold recognition, 3D model quality assessment, intrinsic disorder prediction, domain prediction and ligand binding site prediction.

    Science.gov (United States)

    Roche, Daniel B; Buenavista, Maria T; Tetchner, Stuart J; McGuffin, Liam J

    2011-07-01

    The IntFOLD server is a novel independent server that integrates several cutting edge methods for the prediction of structure and function from sequence. Our guiding principles behind the server development were as follows: (i) to provide a simple unified resource that makes our prediction software accessible to all and (ii) to produce integrated output for predictions that can be easily interpreted. The output for predictions is presented as a simple table that summarizes all results graphically via plots and annotated 3D models. The raw machine readable data files for each set of predictions are also provided for developers, which comply with the Critical Assessment of Methods for Protein Structure Prediction (CASP) data standards. The server comprises an integrated suite of five novel methods: nFOLD4, for tertiary structure prediction; ModFOLD 3.0, for model quality assessment; DISOclust 2.0, for disorder prediction; DomFOLD 2.0 for domain prediction; and FunFOLD 1.0, for ligand binding site prediction. Predictions from the IntFOLD server were found to be competitive in several categories in the recent CASP9 experiment. The IntFOLD server is available at the following web site: http://www.reading.ac.uk/bioinf/IntFOLD/.

  5. Analysis of electric moments of RNA-binding proteins: implications for mechanism and prediction

    Directory of Open Access Journals (Sweden)

    Sarai Akinori

    2011-02-01

    Full Text Available Abstract Background Protein-RNA interactions play important role in many biological processes such as gene regulation, replication, protein synthesis and virus assembly. Although many structures of various types of protein-RNA complexes have been determined, the mechanism of protein-RNA recognition remains elusive. We have earlier shown that the simplest electrostatic properties viz. charge, dipole and quadrupole moments, calculated from backbone atomic coordinates of proteins are biased relative to other proteins, and these quantities can be used to identify DNA-binding proteins. Closely related, RNA-binding proteins are investigated in this study. In particular, discrimination between various types of RNA-binding proteins, evolutionary conservation of these bulk electrostatic features and effect of conformational changes by complex formation are investigated. Basic binding mechanism of a putative RNA-binding protein (HI1333 from Haemophilus influenza is suggested as a potential application of this study. Results We found that similar to DNA-binding proteins (DBPs, RNA-binding proteins (RBPs also show significantly higher values of electric moments. However, higher moments in RBPs are found to strongly depend on their functional class: proteins binding to ribosomal RNA (rRNA constitute the only class with all three of the properties (charge, dipole and quadrupole moments being higher than control proteins. Neural networks were trained using leave-one-out cross-validation to predict RBPs from control data as well as pair-wise classification capacity between proteins binding to various RNA types. RBPs and control proteins reached up to 78% accuracy measured by the area under the ROC curve. Proteins binding to rRNA are found to be best distinguished (AUC = 79%. Changes in dipole and quadrupole moments between unbound and bound structures were small and these properties are found to be robust under complex formation. Conclusions Bulk electric

  6. On Holo-Hilbert spectral analysis: a full informational spectral representation for nonlinear and non-stationary data

    OpenAIRE

    Huang, Norden E.; Hu, Kun; Yang, Albert C. C.; Chang, Hsing-Chih; Jia, Deng; Liang, Wei-Kuang; Yeh, Jia Rong; Kao, Chu-Lan; Juan, Chi-Hung; Peng, Chung Kang; Meijer, Johanna H.; Wang, Yung-Hung; Long, Steven R.; Wu, Zhauhua

    2016-01-01

    The Holo-Hilbert spectral analysis (HHSA) method is introduced to cure the deficiencies of traditional spectral analysis and to give a full informational representation of nonlinear and non-stationary data. It uses a nested empirical mode decomposition and Hilbert–Huang transform (HHT) approach to identify intrinsic amplitude and frequency modulations often present in nonlinear systems. Comparisons are first made with traditional spectrum analysis, which usually achieved its results through c...

  7. Diverse binding site structures revealed in homology models of polyreactive immunoglobulins

    Science.gov (United States)

    Ramsland, Paul A.; Guddat, Luke W.; Edmundson, Allen B.; Raison, Robert L.

    1997-09-01

    We describe here computer-assisted homology models of the combiningsite structure of three polyreactive immunoglobulins. Template-based modelsof Fv (VL-VH) fragments were derived forthe surface IgM expressed by the malignant CD5 positive B cells from threepatients with chronic lymphocytic leukaemia (CLL). The conserved frameworkregions were constructed using crystal coordinates taken from highlyhomologous human variable domain structures (Pot and Hil). Complementaritydetermining regions (CDRs) were predicted by grafting loops, taken fromknown immunoglobulin structures, onto the Fv framework models. The CDRtemplates were chosen, where possible, to be of the same length and of highresidue identity or similarity. LCDR1, 2 and 3 as well as HCDR1 and 2 forthe Fv were constructed using this strategy. For HCDR3 prediction, adatabase containing the Cartesian coordinates of 30 of these loops wascompiled from unliganded antibody X-ray crystallographic structures and anHCDR3 of the same length as that of the B CLL Fv was selected as a template.In one case (Yar), the resulting HCDR3 model gave unfavourable interactionswhen incorporated into the Fv model. This HCDR3 was therefore modelled usingan alternative strategy of construction of the loop stems, using apreviously described HCDR3 conformation (Pot), followed by chain closurewith a β-turn. The template models were subjected to positionalrefinement using energy minimisation and molecular dynamics simulations(X-PLOR). An electrostatic surface description (GRASP) did not reveal acommon structural feature within the binding sites of the three polyreactiveFv. Thus, polyreactive immunoglobulins may recognise similar and multipleantigens through a diverse array of binding site structures.

  8. Automatic generation of bioinformatics tools for predicting protein-ligand binding sites.

    Science.gov (United States)

    Komiyama, Yusuke; Banno, Masaki; Ueki, Kokoro; Saad, Gul; Shimizu, Kentaro

    2016-03-15

    Predictive tools that model protein-ligand binding on demand are needed to promote ligand research in an innovative drug-design environment. However, it takes considerable time and effort to develop predictive tools that can be applied to individual ligands. An automated production pipeline that can rapidly and efficiently develop user-friendly protein-ligand binding predictive tools would be useful. We developed a system for automatically generating protein-ligand binding predictions. Implementation of this system in a pipeline of Semantic Web technique-based web tools will allow users to specify a ligand and receive the tool within 0.5-1 day. We demonstrated high prediction accuracy for three machine learning algorithms and eight ligands. The source code and web application are freely available for download at http://utprot.net They are implemented in Python and supported on Linux. shimizu@bi.a.u-tokyo.ac.jp Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  9. RBscore&NBench: a high-level web server for nucleic acid binding residues prediction with a large-scale benchmarking database.

    Science.gov (United States)

    Miao, Zhichao; Westhof, Eric

    2016-07-08

    RBscore&NBench combines a web server, RBscore and a database, NBench. RBscore predicts RNA-/DNA-binding residues in proteins and visualizes the prediction scores and features on protein structures. The scoring scheme of RBscore directly links feature values to nucleic acid binding probabilities and illustrates the nucleic acid binding energy funnel on the protein surface. To avoid dataset, binding site definition and assessment metric biases, we compared RBscore with 18 web servers and 3 stand-alone programs on 41 datasets, which demonstrated the high and stable accuracy of RBscore. A comprehensive comparison led us to develop a benchmark database named NBench. The web server is available on: http://ahsoka.u-strasbg.fr/rbscorenbench/. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Structural studies of Pseudomonas and Chromobacterium ω-aminotransferases provide insights into their differing substrate specificity

    International Nuclear Information System (INIS)

    Sayer, Christopher; Isupov, Michail N.; Westlake, Aaron; Littlechild, Jennifer A.

    2013-01-01

    The X-ray structures of two ω-aminotransferases from P. aeruginosa and C. violaceum in complex with an inhibitor offer the first detailed insight into the structural basis of the substrate specificity of these industrially important enzymes. The crystal structures and inhibitor complexes of two industrially important ω-aminotransferase enzymes from Pseudomonas aeruginosa and Chromobacterium violaceum have been determined in order to understand the differences in their substrate specificity. The two enzymes share 30% sequence identity and use the same amino acceptor, pyruvate; however, the Pseudomonas enzyme shows activity towards the amino donor β-alanine, whilst the Chromobacterium enzyme does not. Both enzymes show activity towards S-α-methylbenzylamine (MBA), with the Chromobacterium enzyme having a broader substrate range. The crystal structure of the P. aeruginosa enzyme has been solved in the holo form and with the inhibitor gabaculine bound. The C. violaceum enzyme has been solved in the apo and holo forms and with gabaculine bound. The structures of the holo forms of both enzymes are quite similar. There is little conformational difference observed between the inhibitor complex and the holoenzyme for the P. aeruginosa aminotransferase. In comparison, the crystal structure of the C. violaceum gabaculine complex shows significant structural rearrangements from the structures of both the apo and holo forms of the enzyme. It appears that the different rigidity of the protein scaffold contributes to the substrate specificity observed for the two ω-aminotransferases

  11. Prediction of RNA-Binding Proteins by Voting Systems

    Directory of Open Access Journals (Sweden)

    C. R. Peng

    2011-01-01

    Full Text Available It is important to identify which proteins can interact with RNA for the purpose of protein annotation, since interactions between RNA and proteins influence the structure of the ribosome and play important roles in gene expression. This paper tries to identify proteins that can interact with RNA using voting systems. Firstly through Weka, 34 learning algorithms are chosen for investigation. Then simple majority voting system (SMVS is used for the prediction of RNA-binding proteins, achieving average ACC (overall prediction accuracy value of 79.72% and MCC (Matthew’s correlation coefficient value of 59.77% for the independent testing dataset. Then mRMR (minimum redundancy maximum relevance strategy is used, which is transferred into algorithm selection. In addition, the MCC value of each classifier is assigned to be the weight of the classifier’s vote. As a result, best average MCC values are attained when 22 algorithms are selected and integrated through weighted votes, which are 64.70% for the independent testing dataset, and ACC value is 82.04% at this moment.

  12. Impact of domain knowledge on blinded predictions of binding energies by alchemical free energy calculations

    Science.gov (United States)

    Mey, Antonia S. J. S.; Jiménez, Jordi Juárez; Michel, Julien

    2018-01-01

    The Drug Design Data Resource (D3R) consortium organises blinded challenges to address the latest advances in computational methods for ligand pose prediction, affinity ranking, and free energy calculations. Within the context of the second D3R Grand Challenge several blinded binding free energies predictions were made for two congeneric series of Farsenoid X Receptor (FXR) inhibitors with a semi-automated alchemical free energy calculation workflow featuring FESetup and SOMD software tools. Reasonable performance was observed in retrospective analyses of literature datasets. Nevertheless, blinded predictions on the full D3R datasets were poor due to difficulties encountered with the ranking of compounds that vary in their net-charge. Performance increased for predictions that were restricted to subsets of compounds carrying the same net-charge. Disclosure of X-ray crystallography derived binding modes maintained or improved the correlation with experiment in a subsequent rounds of predictions. The best performing protocols on D3R set1 and set2 were comparable or superior to predictions made on the basis of analysis of literature structure activity relationships (SAR)s only, and comparable or slightly inferior, to the best submissions from other groups.

  13. Ab-initio conformational epitope structure prediction using genetic algorithm and SVM for vaccine design.

    Science.gov (United States)

    Moghram, Basem Ameen; Nabil, Emad; Badr, Amr

    2018-01-01

    T-cell epitope structure identification is a significant challenging immunoinformatic problem within epitope-based vaccine design. Epitopes or antigenic peptides are a set of amino acids that bind with the Major Histocompatibility Complex (MHC) molecules. The aim of this process is presented by Antigen Presenting Cells to be inspected by T-cells. MHC-molecule-binding epitopes are responsible for triggering the immune response to antigens. The epitope's three-dimensional (3D) molecular structure (i.e., tertiary structure) reflects its proper function. Therefore, the identification of MHC class-II epitopes structure is a significant step towards epitope-based vaccine design and understanding of the immune system. In this paper, we propose a new technique using a Genetic Algorithm for Predicting the Epitope Structure (GAPES), to predict the structure of MHC class-II epitopes based on their sequence. The proposed Elitist-based genetic algorithm for predicting the epitope's tertiary structure is based on Ab-Initio Empirical Conformational Energy Program for Peptides (ECEPP) Force Field Model. The developed secondary structure prediction technique relies on Ramachandran Plot. We used two alignment algorithms: the ROSS alignment and TM-Score alignment. We applied four different alignment approaches to calculate the similarity scores of the dataset under test. We utilized the support vector machine (SVM) classifier as an evaluation of the prediction performance. The prediction accuracy and the Area Under Receiver Operating Characteristic (ROC) Curve (AUC) were calculated as measures of performance. The calculations are performed on twelve similarity-reduced datasets of the Immune Epitope Data Base (IEDB) and a large dataset of peptide-binding affinities to HLA-DRB1*0101. The results showed that GAPES was reliable and very accurate. We achieved an average prediction accuracy of 93.50% and an average AUC of 0.974 in the IEDB dataset. Also, we achieved an accuracy of 95

  14. Modeling the binding affinity of structurally diverse industrial chemicals to carbon using the artificial intelligence approaches.

    Science.gov (United States)

    Gupta, Shikha; Basant, Nikita; Rai, Premanjali; Singh, Kunwar P

    2015-11-01

    Binding affinity of chemical to carbon is an important characteristic as it finds vast industrial applications. Experimental determination of the adsorption capacity of diverse chemicals onto carbon is both time and resource intensive, and development of computational approaches has widely been advocated. In this study, artificial intelligence (AI)-based ten different qualitative and quantitative structure-property relationship (QSPR) models (MLPN, RBFN, PNN/GRNN, CCN, SVM, GEP, GMDH, SDT, DTF, DTB) were established for the prediction of the adsorption capacity of structurally diverse chemicals to activated carbon following the OECD guidelines. Structural diversity of the chemicals and nonlinear dependence in the data were evaluated using the Tanimoto similarity index and Brock-Dechert-Scheinkman statistics. The generalization and prediction abilities of the constructed models were established through rigorous internal and external validation procedures performed employing a wide series of statistical checks. In complete dataset, the qualitative models rendered classification accuracies between 97.04 and 99.93%, while the quantitative models yielded correlation (R(2)) values of 0.877-0.977 between the measured and the predicted endpoint values. The quantitative prediction accuracies for the higher molecular weight (MW) compounds (class 4) were relatively better than those for the low MW compounds. Both in the qualitative and quantitative models, the Polarizability was the most influential descriptor. Structural alerts responsible for the extreme adsorption behavior of the compounds were identified. Higher number of carbon and presence of higher halogens in a molecule rendered higher binding affinity. Proposed QSPR models performed well and outperformed the previous reports. A relatively better performance of the ensemble learning models (DTF, DTB) may be attributed to the strengths of the bagging and boosting algorithms which enhance the predictive accuracies. The

  15. Properties and crystal structure of methylenetetrahydrofolate reductase from Thermus thermophilus HB8.

    Directory of Open Access Journals (Sweden)

    Sayaka Igari

    Full Text Available Methylenetetrahydrofolate reductase (MTHFR is one of the enzymes involved in homocysteine metabolism. Despite considerable genetic and clinical attention, the reaction mechanism and regulation of this enzyme are not fully understood because of difficult production and poor stability. While recombinant enzymes from thermophilic organisms are often stable and easy to prepare, properties of thermostable MTHFRs have not yet been reported.MTHFR from Thermus thermophilus HB8, a homologue of Escherichia coli MetF, has been expressed in E. coli and purified. The purified MTHFR was chiefly obtained as a heterodimer of apo- and holo-subunits, that is, one flavin adenine dinucleotide (FAD prosthetic group bound per dimer. The crystal structure of the holo-subunit was quite similar to the β(8α(8 barrel of E. coli MTHFR, while that of the apo-subunit was a previously unobserved closed form. In addition, the intersubunit interface of the dimer in the crystals was different from any of the subunit interfaces of the tetramer of E. coli MTHFR. Free FAD could be incorporated into the apo-subunit of the purified Thermus enzyme after purification, forming a homodimer of holo-subunits. Comparison of the crystal structures of the heterodimer and the homodimer revealed different intersubunit interfaces, indicating a large conformational change upon FAD binding. Most of the biochemical properties of the heterodimer and the homodimer were the same, except that the homodimer showed ≈50% activity per FAD-bound subunit in folate-dependent reactions.The different intersubunit interfaces and rearrangement of subunits of Thermus MTHFR may be related to human enzyme properties, such as the allosteric regulation by S-adenosylmethionine and the enhanced instability of the Ala222Val mutant upon loss of FAD. Whereas E. coli MTHFR was the only structural model for human MTHFR to date, our findings suggest that Thermus MTHFR will be another useful model for this important enzyme.

  16. Solving the crystal structure of human calcium-free S100Z: the siege and conquer of one of the last S100 family strongholds.

    Science.gov (United States)

    Calderone, V; Fragai, M; Gallo, G; Luchinat, C

    2017-06-01

    The X-ray structure of human apo-S100Z has been solved and compared with that of the zebrafish calcium-bound S100Z, which is the closest in sequence. Human apo-S100A12, which shows only 43% sequence identity to human S100Z, has been used as template model to solve the crystallographic phase problem. Although a significant buried surface area between the two physiological dimers is present in the asymmetric unit of human apo-S100Z, the protein does not form the superhelical arrangement in the crystal as observed for the zebrafish calcium-bound S100Z and human calcium-bound S100A4. These findings further demonstrate that calcium plays a fundamental role in triggering quaternary structure formation in several S100s. Solving the X-ray structure of human apo-S100Z by standard molecular replacement procedures turned out to be a challenge and required trying different models and different software tools among which only one was successful. The model that allowed structure solution was that with one of the lowest sequence identity with the target protein among the S100 family in the apo state. Based on the previously solved zebrafish holo-S100Z, a putative human holo-S100Z structure has been then calculated through homology modeling; the differences between the experimental human apo and calculated holo structure have been compared to those existing for other members of the family.

  17. HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features.

    Science.gov (United States)

    Zaman, Rianon; Chowdhury, Shahana Yasmin; Rashid, Mahmood A; Sharma, Alok; Dehzangi, Abdollah; Shatabda, Swakkhar

    2017-01-01

    DNA-binding proteins often play important role in various processes within the cell. Over the last decade, a wide range of classification algorithms and feature extraction techniques have been used to solve this problem. In this paper, we propose a novel DNA-binding protein prediction method called HMMBinder. HMMBinder uses monogram and bigram features extracted from the HMM profiles of the protein sequences. To the best of our knowledge, this is the first application of HMM profile based features for the DNA-binding protein prediction problem. We applied Support Vector Machines (SVM) as a classification technique in HMMBinder. Our method was tested on standard benchmark datasets. We experimentally show that our method outperforms the state-of-the-art methods found in the literature.

  18. HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features

    Directory of Open Access Journals (Sweden)

    Rianon Zaman

    2017-01-01

    Full Text Available DNA-binding proteins often play important role in various processes within the cell. Over the last decade, a wide range of classification algorithms and feature extraction techniques have been used to solve this problem. In this paper, we propose a novel DNA-binding protein prediction method called HMMBinder. HMMBinder uses monogram and bigram features extracted from the HMM profiles of the protein sequences. To the best of our knowledge, this is the first application of HMM profile based features for the DNA-binding protein prediction problem. We applied Support Vector Machines (SVM as a classification technique in HMMBinder. Our method was tested on standard benchmark datasets. We experimentally show that our method outperforms the state-of-the-art methods found in the literature.

  19. Deep convolutional neural networks for pan-specific peptide-MHC class I binding prediction.

    Science.gov (United States)

    Han, Youngmahn; Kim, Dongsup

    2017-12-28

    Computational scanning of peptide candidates that bind to a specific major histocompatibility complex (MHC) can speed up the peptide-based vaccine development process and therefore various methods are being actively developed. Recently, machine-learning-based methods have generated successful results by training large amounts of experimental data. However, many machine learning-based methods are generally less sensitive in recognizing locally-clustered interactions, which can synergistically stabilize peptide binding. Deep convolutional neural network (DCNN) is a deep learning method inspired by visual recognition process of animal brain and it is known to be able to capture meaningful local patterns from 2D images. Once the peptide-MHC interactions can be encoded into image-like array(ILA) data, DCNN can be employed to build a predictive model for peptide-MHC binding prediction. In this study, we demonstrated that DCNN is able to not only reliably predict peptide-MHC binding, but also sensitively detect locally-clustered interactions. Nonapeptide-HLA-A and -B binding data were encoded into ILA data. A DCNN, as a pan-specific prediction model, was trained on the ILA data. The DCNN showed higher performance than other prediction tools for the latest benchmark datasets, which consist of 43 datasets for 15 HLA-A alleles and 25 datasets for 10 HLA-B alleles. In particular, the DCNN outperformed other tools for alleles belonging to the HLA-A3 supertype. The F1 scores of the DCNN were 0.86, 0.94, and 0.67 for HLA-A*31:01, HLA-A*03:01, and HLA-A*68:01 alleles, respectively, which were significantly higher than those of other tools. We found that the DCNN was able to recognize locally-clustered interactions that could synergistically stabilize peptide binding. We developed ConvMHC, a web server to provide user-friendly web interfaces for peptide-MHC class I binding predictions using the DCNN. ConvMHC web server can be accessible via http://jumong.kaist.ac.kr:8080/convmhc

  20. CaFE: a tool for binding affinity prediction using end-point free energy methods.

    Science.gov (United States)

    Liu, Hui; Hou, Tingjun

    2016-07-15

    Accurate prediction of binding free energy is of particular importance to computational biology and structure-based drug design. Among those methods for binding affinity predictions, the end-point approaches, such as MM/PBSA and LIE, have been widely used because they can achieve a good balance between prediction accuracy and computational cost. Here we present an easy-to-use pipeline tool named Calculation of Free Energy (CaFE) to conduct MM/PBSA and LIE calculations. Powered by the VMD and NAMD programs, CaFE is able to handle numerous static coordinate and molecular dynamics trajectory file formats generated by different molecular simulation packages and supports various force field parameters. CaFE source code and documentation are freely available under the GNU General Public License via GitHub at https://github.com/huiliucode/cafe_plugin It is a VMD plugin written in Tcl and the usage is platform-independent. tingjunhou@zju.edu.cn. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  1. eMatchSite: sequence order-independent structure alignments of ligand binding pockets in protein models.

    Directory of Open Access Journals (Sweden)

    Michal Brylinski

    2014-09-01

    Full Text Available Detecting similarities between ligand binding sites in the absence of global homology between target proteins has been recognized as one of the critical components of modern drug discovery. Local binding site alignments can be constructed using sequence order-independent techniques, however, to achieve a high accuracy, many current algorithms for binding site comparison require high-quality experimental protein structures, preferably in the bound conformational state. This, in turn, complicates proteome scale applications, where only various quality structure models are available for the majority of gene products. To improve the state-of-the-art, we developed eMatchSite, a new method for constructing sequence order-independent alignments of ligand binding sites in protein models. Large-scale benchmarking calculations using adenine-binding pockets in crystal structures demonstrate that eMatchSite generates accurate alignments for almost three times more protein pairs than SOIPPA. More importantly, eMatchSite offers a high tolerance to structural distortions in ligand binding regions in protein models. For example, the percentage of correctly aligned pairs of adenine-binding sites in weakly homologous protein models is only 4-9% lower than those aligned using crystal structures. This represents a significant improvement over other algorithms, e.g. the performance of eMatchSite in recognizing similar binding sites is 6% and 13% higher than that of SiteEngine using high- and moderate-quality protein models, respectively. Constructing biologically correct alignments using predicted ligand binding sites in protein models opens up the possibility to investigate drug-protein interaction networks for complete proteomes with prospective systems-level applications in polypharmacology and rational drug repositioning. eMatchSite is freely available to the academic community as a web-server and a stand-alone software distribution at http://www.brylinski.org/ematchsite.

  2. NetMHCpan, a method for MHC class I binding prediction beyond humans

    DEFF Research Database (Denmark)

    Hoof, Ilka; Peters, B; Sidney, J

    2009-01-01

    molecules. We show that the NetMHCpan-2.0 method can accurately predict binding to uncharacterized HLA molecules, including HLA-C and HLA-G. Moreover, NetMHCpan-2.0 is demonstrated to accurately predict peptide binding to chimpanzee and macaque MHC class I molecules. The power of NetMHCpan-2.0 to guide...

  3. Using TESS to predict transcription factor binding sites in DNA sequence.

    Science.gov (United States)

    Schug, Jonathan

    2008-03-01

    This unit describes how to use the Transcription Element Search System (TESS). This Web site predicts transcription factor binding sites (TFBS) in DNA sequence using two different kinds of models of sites, strings and positional weight matrices. The binding of transcription factors to DNA is a major part of the control of gene expression. Transcription factors exhibit sequence-specific binding; they form stronger bonds to some DNA sequences than to others. Identification of a good binding site in the promoter for a gene suggests the possibility that the corresponding factor may play a role in the regulation of that gene. However, the sequences transcription factors recognize are typically short and allow for some amount of mismatch. Because of this, binding sites for a factor can typically be found at random every few hundred to a thousand base pairs. TESS has features to help sort through and evaluate the significance of predicted sites.

  4. The structural flexibility of the human copper chaperone Atox1: Insights from combined pulsed EPR studies and computations.

    Science.gov (United States)

    Levy, Ariel R; Turgeman, Meital; Gevorkyan-Aiapetov, Lada; Ruthstein, Sharon

    2017-08-01

    Metallochaperones are responsible for shuttling metal ions to target proteins. Thus, a metallochaperone's structure must be sufficiently flexible both to hold onto its ion while traversing the cytoplasm and to transfer the ion to or from a partner protein. Here, we sought to shed light on the structure of Atox1, a metallochaperone involved in the human copper regulation system. Atox1 shuttles copper ions from the main copper transporter, Ctr1, to the ATP7b transporter in the Golgi apparatus. Conventional biophysical tools such as X-ray or NMR cannot always target the various conformational states of metallochaperones, owing to a requirement for crystallography or low sensitivity and resolution. Electron paramagnetic resonance (EPR) spectroscopy has recently emerged as a powerful tool for resolving biological reactions and mechanisms in solution. When coupled with computational methods, EPR with site-directed spin labeling and nanoscale distance measurements can provide structural information on a protein or protein complex in solution. We use these methods to show that Atox1 can accommodate at least four different conformations in the apo state (unbound to copper), and two different conformations in the holo state (bound to copper). We also demonstrate that the structure of Atox1 in the holo form is more compact than in the apo form. Our data provide insight regarding the structural mechanisms through which Atox1 can fulfill its dual role of copper binding and transfer. © 2017 The Protein Society.

  5. A web server for analysis, comparison and prediction of protein ligand binding sites.

    Science.gov (United States)

    Singh, Harinder; Srivastava, Hemant Kumar; Raghava, Gajendra P S

    2016-03-25

    One of the major challenges in the field of system biology is to understand the interaction between a wide range of proteins and ligands. In the past, methods have been developed for predicting binding sites in a protein for a limited number of ligands. In order to address this problem, we developed a web server named 'LPIcom' to facilitate users in understanding protein-ligand interaction. Analysis, comparison and prediction modules are available in the "LPIcom' server to predict protein-ligand interacting residues for 824 ligands. Each ligand must have at least 30 protein binding sites in PDB. Analysis module of the server can identify residues preferred in interaction and binding motif for a given ligand; for example residues glycine, lysine and arginine are preferred in ATP binding sites. Comparison module of the server allows comparing protein-binding sites of multiple ligands to understand the similarity between ligands based on their binding site. This module indicates that ATP, ADP and GTP ligands are in the same cluster and thus their binding sites or interacting residues exhibit a high level of similarity. Propensity-based prediction module has been developed for predicting ligand-interacting residues in a protein for more than 800 ligands. In addition, a number of web-based tools have been integrated to facilitate users in creating web logo and two-sample between ligand interacting and non-interacting residues. In summary, this manuscript presents a web-server for analysis of ligand interacting residue. This server is available for public use from URL http://crdd.osdd.net/raghava/lpicom .

  6. Peptide binding predictions for HLA DR, DP and DQ molecules

    DEFF Research Database (Denmark)

    Wang, P.; Sidney, J.; Kim, Y.

    2010-01-01

    a significant gap in knowledge as HLA DP and DQ molecules are presumably equally important, and have only been studied less because they are more difficult to handle experimentally. RESULTS: In this study, we aimed to narrow this gap by providing a large scale dataset of over 17,000 HLA-peptide binding...... affinities for a set of 11 HLA DP and DQ alleles. We also expanded our dataset for HLA DR alleles resulting in a total of 40,000 MHC class II binding affinities covering 26 allelic variants. Utilizing this dataset, we generated prediction tools utilizing several machine learning algorithms and evaluated...... include all training data for maximum performance. 4) The recently developed NN-align prediction method significantly outperformed all other algorithms, including a naïve consensus based on all prediction methods. A new consensus method dropping the comparably weak ARB prediction method could outperform...

  7. Toward the prediction of class I and II mouse major histocompatibility complex-peptide-binding affinity: in silico bioinformatic step-by-step guide using quantitative structure-activity relationships.

    Science.gov (United States)

    Hattotuwagama, Channa K; Doytchinova, Irini A; Flower, Darren R

    2007-01-01

    Quantitative structure-activity relationship (QSAR) analysis is a cornerstone of modern informatics. Predictive computational models of peptide-major histocompatibility complex (MHC)-binding affinity based on QSAR technology have now become important components of modern computational immunovaccinology. Historically, such approaches have been built around semiqualitative, classification methods, but these are now giving way to quantitative regression methods. We review three methods--a 2D-QSAR additive-partial least squares (PLS) and a 3D-QSAR comparative molecular similarity index analysis (CoMSIA) method--which can identify the sequence dependence of peptide-binding specificity for various class I MHC alleles from the reported binding affinities (IC50) of peptide sets. The third method is an iterative self-consistent (ISC) PLS-based additive method, which is a recently developed extension to the additive method for the affinity prediction of class II peptides. The QSAR methods presented here have established themselves as immunoinformatic techniques complementary to existing methodology, useful in the quantitative prediction of binding affinity: current methods for the in silico identification of T-cell epitopes (which form the basis of many vaccines, diagnostics, and reagents) rely on the accurate computational prediction of peptide-MHC affinity. We have reviewed various human and mouse class I and class II allele models. Studied alleles comprise HLA-A*0101, HLA-A*0201, HLA-A*0202, HLA-A*0203, HLA-A*0206, HLA-A*0301, HLA-A*1101, HLA-A*3101, HLA-A*6801, HLA-A*6802, HLA-B*3501, H2-K(k), H2-K(b), H2-D(b) HLA-DRB1*0101, HLA-DRB1*0401, HLA-DRB1*0701, I-A(b), I-A(d), I-A(k), I-A(S), I-E(d), and I-E(k). In this chapter we show a step-by-step guide into predicting the reliability and the resulting models to represent an advance on existing methods. The peptides used in this study are available from the AntiJen database (http://www.jenner.ac.uk/AntiJen). The PLS method

  8. Concurrent Increases and Decreases in Local Stability and Conformational Heterogeneity in Cu, Zn Superoxide Dismutase Variants Revealed by Temperature-Dependence of Amide Chemical Shifts.

    Science.gov (United States)

    Doyle, Colleen M; Rumfeldt, Jessica A; Broom, Helen R; Sekhar, Ashok; Kay, Lewis E; Meiering, Elizabeth M

    2016-03-08

    The chemical shifts of backbone amide protons in proteins are sensitive reporters of local structural stability and conformational heterogeneity, which can be determined from their readily measured linear and nonlinear temperature-dependences, respectively. Here we report analyses of amide proton temperature-dependences for native dimeric Cu, Zn superoxide dismutase (holo pWT SOD1) and structurally diverse mutant SOD1s associated with amyotrophic lateral sclerosis (ALS). Holo pWT SOD1 loses structure with temperature first at its periphery and, while having extremely high global stability, nevertheless exhibits extensive conformational heterogeneity, with ∼1 in 5 residues showing evidence for population of low energy alternative states. The holo G93A and E100G ALS mutants have moderately decreased global stability, whereas V148I is slightly stabilized. Comparison of the holo mutants as well as the marginally stable immature monomeric unmetalated and disulfide-reduced (apo(2SH)) pWT with holo pWT shows that changes in the local structural stability of individual amides vary greatly, with average changes corresponding to differences in global protein stability measured by differential scanning calorimetry. Mutants also exhibit altered conformational heterogeneity compared to pWT. Strikingly, substantial increases as well as decreases in local stability and conformational heterogeneity occur, in particular upon maturation and for G93A. Thus, the temperature-dependence of amide shifts for SOD1 variants is a rich source of information on the location and extent of perturbation of structure upon covalent changes and ligand binding. The implications for potential mechanisms of toxic misfolding of SOD1 in disease and for general aspects of protein energetics, including entropy-enthalpy compensation, are discussed.

  9. Structure and Stability of Molecular Crystals with Many-Body Dispersion-Inclusive Density Functional Tight Binding.

    Science.gov (United States)

    Mortazavi, Majid; Brandenburg, Jan Gerit; Maurer, Reinhard J; Tkatchenko, Alexandre

    2018-01-18

    Accurate prediction of structure and stability of molecular crystals is crucial in materials science and requires reliable modeling of long-range dispersion interactions. Semiempirical electronic structure methods are computationally more efficient than their ab initio counterparts, allowing structure sampling with significant speedups. We combine the Tkatchenko-Scheffler van der Waals method (TS) and the many-body dispersion method (MBD) with third-order density functional tight-binding (DFTB3) via a charge population-based method. We find an overall good performance for the X23 benchmark database of molecular crystals, despite an underestimation of crystal volume that can be traced to the DFTB parametrization. We achieve accurate lattice energy predictions with DFT+MBD energetics on top of vdW-inclusive DFTB3 structures, resulting in a speedup of up to 3000 times compared with a full DFT treatment. This suggests that vdW-inclusive DFTB3 can serve as a viable structural prescreening tool in crystal structure prediction.

  10. Assessing the model transferability for prediction of transcription factor binding sites based on chromatin accessibility.

    Science.gov (United States)

    Liu, Sheng; Zibetti, Cristina; Wan, Jun; Wang, Guohua; Blackshaw, Seth; Qian, Jiang

    2017-07-27

    Computational prediction of transcription factor (TF) binding sites in different cell types is challenging. Recent technology development allows us to determine the genome-wide chromatin accessibility in various cellular and developmental contexts. The chromatin accessibility profiles provide useful information in prediction of TF binding events in various physiological conditions. Furthermore, ChIP-Seq analysis was used to determine genome-wide binding sites for a range of different TFs in multiple cell types. Integration of these two types of genomic information can improve the prediction of TF binding events. We assessed to what extent a model built upon on other TFs and/or other cell types could be used to predict the binding sites of TFs of interest. A random forest model was built using a set of cell type-independent features such as specific sequences recognized by the TFs and evolutionary conservation, as well as cell type-specific features derived from chromatin accessibility data. Our analysis suggested that the models learned from other TFs and/or cell lines performed almost as well as the model learned from the target TF in the cell type of interest. Interestingly, models based on multiple TFs performed better than single-TF models. Finally, we proposed a universal model, BPAC, which was generated using ChIP-Seq data from multiple TFs in various cell types. Integrating chromatin accessibility information with sequence information improves prediction of TF binding.The prediction of TF binding is transferable across TFs and/or cell lines suggesting there are a set of universal "rules". A computational tool was developed to predict TF binding sites based on the universal "rules".

  11. Prediction of the binding affinities of peptides to class II MHC using a regularized thermodynamic model

    Directory of Open Access Journals (Sweden)

    Mittelmann Hans D

    2010-01-01

    Full Text Available Abstract Background The binding of peptide fragments of extracellular peptides to class II MHC is a crucial event in the adaptive immune response. Each MHC allotype generally binds a distinct subset of peptides and the enormous number of possible peptide epitopes prevents their complete experimental characterization. Computational methods can utilize the limited experimental data to predict the binding affinities of peptides to class II MHC. Results We have developed the Regularized Thermodynamic Average, or RTA, method for predicting the affinities of peptides binding to class II MHC. RTA accounts for all possible peptide binding conformations using a thermodynamic average and includes a parameter constraint for regularization to improve accuracy on novel data. RTA was shown to achieve higher accuracy, as measured by AUC, than SMM-align on the same data for all 17 MHC allotypes examined. RTA also gave the highest accuracy on all but three allotypes when compared with results from 9 different prediction methods applied to the same data. In addition, the method correctly predicted the peptide binding register of 17 out of 18 peptide-MHC complexes. Finally, we found that suboptimal peptide binding registers, which are often ignored in other prediction methods, made significant contributions of at least 50% of the total binding energy for approximately 20% of the peptides. Conclusions The RTA method accurately predicts peptide binding affinities to class II MHC and accounts for multiple peptide binding registers while reducing overfitting through regularization. The method has potential applications in vaccine design and in understanding autoimmune disorders. A web server implementing the RTA prediction method is available at http://bordnerlab.org/RTA/.

  12. Structural characterization of the interactions between calmodulin and skeletal muscle myosin light chain kinase: Effect of peptide (576-594)G binding on the Ca2+-binding domains

    International Nuclear Information System (INIS)

    Seeholzer, S.H.; Wand, A.J.

    1989-01-01

    Calcium-containing calmodulin (CaM) and its complex with a peptide corresponding to the calmodulin-binding domain of skeletal muscle myosin light chain kinase [skMLCK(576-594)G] have been studied by one- and two-dimensional 1 H NMR techniques. Resonances arising from the antiparallel β-sheet structures associated with the calcium-binding domains of CaM and their counterparts in the CaM-skMLCK(576-594)G complex have been assigned. The assignments were initiated by application of the main chain directed assignment strategy. It is found that, despite significant changes in chemical shifts of resonances arising from amino acid residues in this region upon binding of the peptide, the β-sheets have virtually the same structure in the complex as in CaM. Hydrogen exchange rates of amide NH within the β-sheet structures are significantly slowed upon binding of peptide. These data, in conjunction with the observed nuclear Overhauser effect (NOE) patterns and relative intensities and the downfield shifts of associated amide and α resonances upon binding of peptide, show that the peptide stabilizes the Ca 2+ -bound state of calmodulin. The observed pattern of NOEs within the β-sheets and their structural similarity correspond closely to those predicted by the crystal structure. These findings imply that the apparent inconsistency of the crystal structure with recently reported low-angle X-ray scattering profiles of CaM may lie within the putative central helix bridging the globular domains

  13. Plasticity of the Binding Site of Renin: Optimized Selection of Protein Structures for Ensemble Docking.

    Science.gov (United States)

    Strecker, Claas; Meyer, Bernd

    2018-05-02

    Protein flexibility poses a major challenge to docking of potential ligands in that the binding site can adopt different shapes. Docking algorithms usually keep the protein rigid and only allow the ligand to be treated as flexible. However, a wrong assessment of the shape of the binding pocket can prevent a ligand from adapting a correct pose. Ensemble docking is a simple yet promising method to solve this problem: Ligands are docked into multiple structures, and the results are subsequently merged. Selection of protein structures is a significant factor for this approach. In this work we perform a comprehensive and comparative study evaluating the impact of structure selection on ensemble docking. We perform ensemble docking with several crystal structures and with structures derived from molecular dynamics simulations of renin, an attractive target for antihypertensive drugs. Here, 500 ns of MD simulations revealed binding site shapes not found in any available crystal structure. We evaluate the importance of structure selection for ensemble docking by comparing binding pose prediction, ability to rank actives above nonactives (screening utility), and scoring accuracy. As a result, for ensemble definition k-means clustering appears to be better suited than hierarchical clustering with average linkage. The best performing ensemble consists of four crystal structures and is able to reproduce the native ligand poses better than any individual crystal structure. Moreover this ensemble outperforms 88% of all individual crystal structures in terms of screening utility as well as scoring accuracy. Similarly, ensembles of MD-derived structures perform on average better than 75% of any individual crystal structure in terms of scoring accuracy at all inspected ensembles sizes.

  14. Crystal structure of the gamma-2 herpesvirus LANA DNA binding domain identifies charged surface residues which impact viral latency.

    Directory of Open Access Journals (Sweden)

    Bruno Correia

    Full Text Available Latency-associated nuclear antigen (LANA mediates γ2-herpesvirus genome persistence and regulates transcription. We describe the crystal structure of the murine gammaherpesvirus-68 LANA C-terminal domain at 2.2 Å resolution. The structure reveals an alpha-beta fold that assembles as a dimer, reminiscent of Epstein-Barr virus EBNA1. A predicted DNA binding surface is present and opposite this interface is a positive electrostatic patch. Targeted DNA recognition substitutions eliminated DNA binding, while certain charged patch mutations reduced bromodomain protein, BRD4, binding. Virus containing LANA abolished for DNA binding was incapable of viable latent infection in mice. Virus with mutations at the charged patch periphery exhibited substantial deficiency in expansion of latent infection, while central region substitutions had little effect. This deficiency was independent of BRD4. These results elucidate the LANA DNA binding domain structure and reveal a unique charged region that exerts a critical role in viral latent infection, likely acting through a host cell protein(s.

  15. Real-Time Ligand Binding Pocket Database Search Using Local Surface Descriptors

    Science.gov (United States)

    Chikhi, Rayan; Sael, Lee; Kihara, Daisuke

    2010-01-01

    Due to the increasing number of structures of unknown function accumulated by ongoing structural genomics projects, there is an urgent need for computational methods for characterizing protein tertiary structures. As functions of many of these proteins are not easily predicted by conventional sequence database searches, a legitimate strategy is to utilize structure information in function characterization. Of a particular interest is prediction of ligand binding to a protein, as ligand molecule recognition is a major part of molecular function of proteins. Predicting whether a ligand molecule binds a protein is a complex problem due to the physical nature of protein-ligand interactions and the flexibility of both binding sites and ligand molecules. However, geometric and physicochemical complementarity is observed between the ligand and its binding site in many cases. Therefore, ligand molecules which bind to a local surface site in a protein can be predicted by finding similar local pockets of known binding ligands in the structure database. Here, we present two representations of ligand binding pockets and utilize them for ligand binding prediction by pocket shape comparison. These representations are based on mapping of surface properties of binding pockets, which are compactly described either by the two dimensional pseudo-Zernike moments or the 3D Zernike descriptors. These compact representations allow a fast real-time pocket searching against a database. Thorough benchmark study employing two different datasets show that our representations are competitive with the other existing methods. Limitations and potentials of the shape-based methods as well as possible improvements are discussed. PMID:20455259

  16. Predicting peptides binding to MHC class II molecules using multi-objective evolutionary algorithms

    Directory of Open Access Journals (Sweden)

    Feng Lin

    2007-11-01

    Full Text Available Abstract Background Peptides binding to Major Histocompatibility Complex (MHC class II molecules are crucial for initiation and regulation of immune responses. Predicting peptides that bind to a specific MHC molecule plays an important role in determining potential candidates for vaccines. The binding groove in class II MHC is open at both ends, allowing peptides longer than 9-mer to bind. Finding the consensus motif facilitating the binding of peptides to a MHC class II molecule is difficult because of different lengths of binding peptides and varying location of 9-mer binding core. The level of difficulty increases when the molecule is promiscuous and binds to a large number of low affinity peptides. In this paper, we propose two approaches using multi-objective evolutionary algorithms (MOEA for predicting peptides binding to MHC class II molecules. One uses the information from both binders and non-binders for self-discovery of motifs. The other, in addition, uses information from experimentally determined motifs for guided-discovery of motifs. Results The proposed methods are intended for finding peptides binding to MHC class II I-Ag7 molecule – a promiscuous binder to a large number of low affinity peptides. Cross-validation results across experiments on two motifs derived for I-Ag7 datasets demonstrate better generalization abilities and accuracies of the present method over earlier approaches. Further, the proposed method was validated and compared on two publicly available benchmark datasets: (1 an ensemble of qualitative HLA-DRB1*0401 peptide data obtained from five different sources, and (2 quantitative peptide data obtained for sixteen different alleles comprising of three mouse alleles and thirteen HLA alleles. The proposed method outperformed earlier methods on most datasets, indicating that it is well suited for finding peptides binding to MHC class II molecules. Conclusion We present two MOEA-based algorithms for finding motifs

  17. A Rat α-Fetoprotein Binding Activity Prediction Model to Facilitate Assessment of the Endocrine Disruption Potential of Environmental Chemicals.

    Science.gov (United States)

    Hong, Huixiao; Shen, Jie; Ng, Hui Wen; Sakkiah, Sugunadevi; Ye, Hao; Ge, Weigong; Gong, Ping; Xiao, Wenming; Tong, Weida

    2016-03-25

    Endocrine disruptors such as polychlorinated biphenyls (PCBs), diethylstilbestrol (DES) and dichlorodiphenyltrichloroethane (DDT) are agents that interfere with the endocrine system and cause adverse health effects. Huge public health concern about endocrine disruptors has arisen. One of the mechanisms of endocrine disruption is through binding of endocrine disruptors with the hormone receptors in the target cells. Entrance of endocrine disruptors into target cells is the precondition of endocrine disruption. The binding capability of a chemical with proteins in the blood affects its entrance into the target cells and, thus, is very informative for the assessment of potential endocrine disruption of chemicals. α-fetoprotein is one of the major serum proteins that binds to a variety of chemicals such as estrogens. To better facilitate assessment of endocrine disruption of environmental chemicals, we developed a model for α-fetoprotein binding activity prediction using the novel pattern recognition method (Decision Forest) and the molecular descriptors calculated from two-dimensional structures by Mold² software. The predictive capability of the model has been evaluated through internal validation using 125 training chemicals (average balanced accuracy of 69%) and external validations using 22 chemicals (balanced accuracy of 71%). Prediction confidence analysis revealed the model performed much better at high prediction confidence. Our results indicate that the model is useful (when predictions are in high confidence) in endocrine disruption risk assessment of environmental chemicals though improvement by increasing number of training chemicals is needed.

  18. Structure and self-assembly of the calcium binding matrix protein of human metapneumovirus.

    Science.gov (United States)

    Leyrat, Cedric; Renner, Max; Harlos, Karl; Huiskonen, Juha T; Grimes, Jonathan M

    2014-01-07

    The matrix protein (M) of paramyxoviruses plays a key role in determining virion morphology by directing viral assembly and budding. Here, we report the crystal structure of the human metapneumovirus M at 2.8 Å resolution in its native dimeric state. The structure reveals the presence of a high-affinity Ca²⁺ binding site. Molecular dynamics simulations (MDS) predict a secondary lower-affinity site that correlates well with data from fluorescence-based thermal shift assays. By combining small-angle X-ray scattering with MDS and ensemble analysis, we captured the structure and dynamics of M in solution. Our analysis reveals a large positively charged patch on the protein surface that is involved in membrane interaction. Structural analysis of DOPC-induced polymerization of M into helical filaments using electron microscopy leads to a model of M self-assembly. The conservation of the Ca²⁺ binding sites suggests a role for calcium in the replication and morphogenesis of pneumoviruses. Copyright © 2014 The Authors. Published by Elsevier Inc. All rights reserved.

  19. Predicting and analyzing DNA-binding domains using a systematic approach to identifying a set of informative physicochemical and biochemical properties

    Science.gov (United States)

    2011-01-01

    Background Existing methods of predicting DNA-binding proteins used valuable features of physicochemical properties to design support vector machine (SVM) based classifiers. Generally, selection of physicochemical properties and determination of their corresponding feature vectors rely mainly on known properties of binding mechanism and experience of designers. However, there exists a troublesome problem for designers that some different physicochemical properties have similar vectors of representing 20 amino acids and some closely related physicochemical properties have dissimilar vectors. Results This study proposes a systematic approach (named Auto-IDPCPs) to automatically identify a set of physicochemical and biochemical properties in the AAindex database to design SVM-based classifiers for predicting and analyzing DNA-binding domains/proteins. Auto-IDPCPs consists of 1) clustering 531 amino acid indices in AAindex into 20 clusters using a fuzzy c-means algorithm, 2) utilizing an efficient genetic algorithm based optimization method IBCGA to select an informative feature set of size m to represent sequences, and 3) analyzing the selected features to identify related physicochemical properties which may affect the binding mechanism of DNA-binding domains/proteins. The proposed Auto-IDPCPs identified m=22 features of properties belonging to five clusters for predicting DNA-binding domains with a five-fold cross-validation accuracy of 87.12%, which is promising compared with the accuracy of 86.62% of the existing method PSSM-400. For predicting DNA-binding sequences, the accuracy of 75.50% was obtained using m=28 features, where PSSM-400 has an accuracy of 74.22%. Auto-IDPCPs and PSSM-400 have accuracies of 80.73% and 82.81%, respectively, applied to an independent test data set of DNA-binding domains. Some typical physicochemical properties discovered are hydrophobicity, secondary structure, charge, solvent accessibility, polarity, flexibility, normalized Van Der

  20. Molecular basis of calcium-sensitizing and desensitizing mutations of the human cardiac troponin C regulatory domain: a multi-scale simulation study.

    Directory of Open Access Journals (Sweden)

    Peter Michael Kekenes-Huskey

    Full Text Available Troponin C (TnC is implicated in the initiation of myocyte contraction via binding of cytosolic Ca²⁺ and subsequent recognition of the Troponin I switch peptide. Mutations of the cardiac TnC N-terminal regulatory domain have been shown to alter both calcium binding and myofilament force generation. We have performed molecular dynamics simulations of engineered TnC variants that increase or decrease Ca²⁺ sensitivity, in order to understand the structural basis of their impact on TnC function. We will use the distinction for mutants that are associated with increased Ca²⁺ affinity and for those mutants with reduced affinity. Our studies demonstrate that for GOF mutants V44Q and L48Q, the structure of the physiologically-active site II Ca²⁺ binding site in the Ca²⁺-free (apo state closely resembled the Ca²⁺-bound (holo state. In contrast, site II is very labile for LOF mutants E40A and V79Q in the apo form and bears little resemblance with the holo conformation. We hypothesize that these phenomena contribute to the increased association rate, k(on, for the GOF mutants relative to LOF. Furthermore, we observe significant positive and negative positional correlations between helices in the GOF holo mutants that are not found in the LOF mutants. We anticipate these correlations may contribute either directly to Ca²⁺ affinity or indirectly through TnI association. Our observations based on the structure and dynamics of mutant TnC provide rationale for binding trends observed in GOF and LOF mutants and will guide the development of inotropic drugs that target TnC.

  1. Predicting DNA binding proteins using support vector machine with hybrid fractal features.

    Science.gov (United States)

    Niu, Xiao-Hui; Hu, Xue-Hai; Shi, Feng; Xia, Jing-Bo

    2014-02-21

    DNA-binding proteins play a vitally important role in many biological processes. Prediction of DNA-binding proteins from amino acid sequence is a significant but not fairly resolved scientific problem. Chaos game representation (CGR) investigates the patterns hidden in protein sequences, and visually reveals previously unknown structure. Fractal dimensions (FD) are good tools to measure sizes of complex, highly irregular geometric objects. In order to extract the intrinsic correlation with DNA-binding property from protein sequences, CGR algorithm, fractal dimension and amino acid composition are applied to formulate the numerical features of protein samples in this paper. Seven groups of features are extracted, which can be computed directly from the primary sequence, and each group is evaluated by the 10-fold cross-validation test and Jackknife test. Comparing the results of numerical experiments, the group of amino acid composition and fractal dimension (21-dimension vector) gets the best result, the average accuracy is 81.82% and average Matthew's correlation coefficient (MCC) is 0.6017. This resulting predictor is also compared with existing method DNA-Prot and shows better performances. © 2013 The Authors. Published by Elsevier Ltd All rights reserved.

  2. AutoSite: an automated approach for pseudo-ligands prediction—from ligand-binding sites identification to predicting key ligand atoms

    Science.gov (United States)

    Ravindranath, Pradeep Anand; Sanner, Michel F.

    2016-01-01

    Motivation: The identification of ligand-binding sites from a protein structure facilitates computational drug design and optimization, and protein function assignment. We introduce AutoSite: an efficient software tool for identifying ligand-binding sites and predicting pseudo ligand corresponding to each binding site identified. Binding sites are reported as clusters of 3D points called fills in which every point is labelled as hydrophobic or as hydrogen bond donor or acceptor. From these fills AutoSite derives feature points: a set of putative positions of hydrophobic-, and hydrogen-bond forming ligand atoms. Results: We show that AutoSite identifies ligand-binding sites with higher accuracy than other leading methods, and produces fills that better matches the ligand shape and properties, than the fills obtained with a software program with similar capabilities, AutoLigand. In addition, we demonstrate that for the Astex Diverse Set, the feature points identify 79% of hydrophobic ligand atoms, and 81% and 62% of the hydrogen acceptor and donor hydrogen ligand atoms interacting with the receptor, and predict 81.2% of water molecules mediating interactions between ligand and receptor. Finally, we illustrate potential uses of the predicted feature points in the context of lead optimization in drug discovery projects. Availability and Implementation: http://adfr.scripps.edu/AutoDockFR/autosite.html Contact: sanner@scripps.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27354702

  3. A novel method for improved accuracy of transcription factor binding site prediction

    KAUST Repository

    Khamis, Abdullah M.; Motwalli, Olaa Amin; Oliva, Romina; Jankovic, Boris R.; Medvedeva, Yulia; Ashoor, Haitham; Essack, Magbubah; Gao, Xin; Bajic, Vladimir B.

    2018-01-01

    Identifying transcription factor (TF) binding sites (TFBSs) is important in the computational inference of gene regulation. Widely used computational methods of TFBS prediction based on position weight matrices (PWMs) usually have high false positive rates. Moreover, computational studies of transcription regulation in eukaryotes frequently require numerous PWM models of TFBSs due to a large number of TFs involved. To overcome these problems we developed DRAF, a novel method for TFBS prediction that requires only 14 prediction models for 232 human TFs, while at the same time significantly improves prediction accuracy. DRAF models use more features than PWM models, as they combine information from TFBS sequences and physicochemical properties of TF DNA-binding domains into machine learning models. Evaluation of DRAF on 98 human ChIP-seq datasets shows on average 1.54-, 1.96- and 5.19-fold reduction of false positives at the same sensitivities compared to models from HOCOMOCO, TRANSFAC and DeepBind, respectively. This observation suggests that one can efficiently replace the PWM models for TFBS prediction by a small number of DRAF models that significantly improve prediction accuracy. The DRAF method is implemented in a web tool and in a stand-alone software freely available at http://cbrc.kaust.edu.sa/DRAF.

  4. A novel method for improved accuracy of transcription factor binding site prediction

    KAUST Repository

    Khamis, Abdullah M.

    2018-03-20

    Identifying transcription factor (TF) binding sites (TFBSs) is important in the computational inference of gene regulation. Widely used computational methods of TFBS prediction based on position weight matrices (PWMs) usually have high false positive rates. Moreover, computational studies of transcription regulation in eukaryotes frequently require numerous PWM models of TFBSs due to a large number of TFs involved. To overcome these problems we developed DRAF, a novel method for TFBS prediction that requires only 14 prediction models for 232 human TFs, while at the same time significantly improves prediction accuracy. DRAF models use more features than PWM models, as they combine information from TFBS sequences and physicochemical properties of TF DNA-binding domains into machine learning models. Evaluation of DRAF on 98 human ChIP-seq datasets shows on average 1.54-, 1.96- and 5.19-fold reduction of false positives at the same sensitivities compared to models from HOCOMOCO, TRANSFAC and DeepBind, respectively. This observation suggests that one can efficiently replace the PWM models for TFBS prediction by a small number of DRAF models that significantly improve prediction accuracy. The DRAF method is implemented in a web tool and in a stand-alone software freely available at http://cbrc.kaust.edu.sa/DRAF.

  5. MHC2NNZ: A novel peptide binding prediction approach for HLA DQ molecules

    Science.gov (United States)

    Xie, Jiang; Zeng, Xu; Lu, Dongfang; Liu, Zhixiang; Wang, Jiao

    2017-07-01

    The major histocompatibility complex class II (MHC-II) molecule plays a crucial role in immunology. Computational prediction of MHC-II binding peptides can help researchers understand the mechanism of immune systems and design vaccines. Most of the prediction algorithms for MHC-II to date have made large efforts in human leukocyte antigen (HLA, the name of MHC in Human) molecules encoded in the DR locus. However, HLA DQ molecules are equally important and have only been made less progress because it is more difficult to handle them experimentally. In this study, we propose an artificial neural network-based approach called MHC2NNZ to predict peptides binding to HLA DQ molecules. Unlike previous artificial neural network-based methods, MHC2NNZ not only considers sequence similarity features but also captures the chemical and physical properties, and a novel method incorporating these properties is proposed to represent peptide flanking regions (PFR). Furthermore, MHC2NNZ improves the prediction accuracy by combining with amino acid preference at more specific positions of the peptides binding core. By evaluating on 3549 peptides binding to six most frequent HLA DQ molecules, MHC2NNZ is demonstrated to outperform other state-of-the-art MHC-II prediction methods.

  6. Theoretical prediction of low-density hexagonal ZnO hollow structures

    Energy Technology Data Exchange (ETDEWEB)

    Tuoc, Vu Ngoc, E-mail: tuoc.vungoc@hust.edu.vn [Institute of Engineering Physics, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi (Viet Nam); Huan, Tran Doan [Institute of Materials Science, University of Connecticut, Storrs, Connecticut 06269-3136 (United States); Thao, Nguyen Thi [Institute of Engineering Physics, Hanoi University of Science and Technology, 1 Dai Co Viet Road, Hanoi (Viet Nam); Hong Duc University, 307 Le Lai, Thanh Hoa City (Viet Nam); Tuan, Le Manh [Hong Duc University, 307 Le Lai, Thanh Hoa City (Viet Nam)

    2016-10-14

    Along with wurtzite and zinc blende, zinc oxide (ZnO) has been found in a large number of polymorphs with substantially different properties and, hence, applications. Therefore, predicting and synthesizing new classes of ZnO polymorphs are of great significance and have been gaining considerable interest. Herein, we perform a density functional theory based tight-binding study, predicting several new series of ZnO hollow structures using the bottom-up approach. The geometry of the building blocks allows for obtaining a variety of hexagonal, low-density nanoporous, and flexible ZnO hollow structures. Their stability is discussed by means of the free energy computed within the lattice-dynamics approach. Our calculations also indicate that all the reported hollow structures are wide band gap semiconductors in the same fashion with bulk ZnO. The electronic band structures of the ZnO hollow structures are finally examined in detail.

  7. A community resource benchmarking predictions of peptide binding to MHC-I molecules.

    Science.gov (United States)

    Peters, Bjoern; Bui, Huynh-Hoa; Frankild, Sune; Nielson, Morten; Lundegaard, Claus; Kostem, Emrah; Basch, Derek; Lamberth, Kasper; Harndahl, Mikkel; Fleri, Ward; Wilson, Stephen S; Sidney, John; Lund, Ole; Buus, Soren; Sette, Alessandro

    2006-06-09

    Recognition of peptides bound to major histocompatibility complex (MHC) class I molecules by T lymphocytes is an essential part of immune surveillance. Each MHC allele has a characteristic peptide binding preference, which can be captured in prediction algorithms, allowing for the rapid scan of entire pathogen proteomes for peptide likely to bind MHC. Here we make public a large set of 48,828 quantitative peptide-binding affinity measurements relating to 48 different mouse, human, macaque, and chimpanzee MHC class I alleles. We use this data to establish a set of benchmark predictions with one neural network method and two matrix-based prediction methods extensively utilized in our groups. In general, the neural network outperforms the matrix-based predictions mainly due to its ability to generalize even on a small amount of data. We also retrieved predictions from tools publicly available on the internet. While differences in the data used to generate these predictions hamper direct comparisons, we do conclude that tools based on combinatorial peptide libraries perform remarkably well. The transparent prediction evaluation on this dataset provides tool developers with a benchmark for comparison of newly developed prediction methods. In addition, to generate and evaluate our own prediction methods, we have established an easily extensible web-based prediction framework that allows automated side-by-side comparisons of prediction methods implemented by experts. This is an advance over the current practice of tool developers having to generate reference predictions themselves, which can lead to underestimating the performance of prediction methods they are not as familiar with as their own. The overall goal of this effort is to provide a transparent prediction evaluation allowing bioinformaticians to identify promising features of prediction methods and providing guidance to immunologists regarding the reliability of prediction tools.

  8. Machine learning competition in immunology – Prediction of HLA class I binding peptides

    DEFF Research Database (Denmark)

    Zhang, Guang Lan; Ansari, Hifzur Rahman; Bradley, Phil

    2011-01-01

    of peptide binding, therefore, determines the accuracy of the overall method. Computational predictions of peptide binding to HLA, both class I and class II, use a variety of algorithms ranging from binding motifs to advanced machine learning techniques ( [Brusic et al., 2004] and [Lafuente and Reche, 2009...

  9. Subfamily-specific adaptations in the structures of two penicillin-binding proteins from Mycobacterium tuberculosis.

    Directory of Open Access Journals (Sweden)

    Daniil M Prigozhin

    Full Text Available Beta-lactam antibiotics target penicillin-binding proteins including several enzyme classes essential for bacterial cell-wall homeostasis. To better understand the functional and inhibitor-binding specificities of penicillin-binding proteins from the pathogen, Mycobacterium tuberculosis, we carried out structural and phylogenetic analysis of two predicted D,D-carboxypeptidases, Rv2911 and Rv3330. Optimization of Rv2911 for crystallization using directed evolution and the GFP folding reporter method yielded a soluble quadruple mutant. Structures of optimized Rv2911 bound to phenylmethylsulfonyl fluoride and Rv3330 bound to meropenem show that, in contrast to the nonspecific inhibitor, meropenem forms an extended interaction with the enzyme along a conserved surface. Phylogenetic analysis shows that Rv2911 and Rv3330 belong to different clades that emerged in Actinobacteria and are not represented in model organisms such as Escherichia coli and Bacillus subtilis. Clade-specific adaptations allow these enzymes to fulfill distinct physiological roles despite strict conservation of core catalytic residues. The characteristic differences include potential protein-protein interaction surfaces and specificity-determining residues surrounding the catalytic site. Overall, these structural insights lay the groundwork to develop improved beta-lactam therapeutics for tuberculosis.

  10. STRUCTURAL FEATURES OF PLANT CHITINASES AND CHITIN-BINDING PROTEINS

    NARCIS (Netherlands)

    BEINTEMA, JJ

    1994-01-01

    Structural features of plant chitinases and chitin-binding proteins are discussed. Many of these proteins consist of multiple domains,of which the chitin-binding hevein domain is a predominant one. X-ray and NMR structures of representatives of the major classes of these proteins are available now,

  11. Predicting the binding patterns of hub proteins: a study using yeast protein interaction networks.

    Directory of Open Access Journals (Sweden)

    Carson M Andorf

    Full Text Available Protein-protein interactions are critical to elucidating the role played by individual proteins in important biological pathways. Of particular interest are hub proteins that can interact with large numbers of partners and often play essential roles in cellular control. Depending on the number of binding sites, protein hubs can be classified at a structural level as singlish-interface hubs (SIH with one or two binding sites, or multiple-interface hubs (MIH with three or more binding sites. In terms of kinetics, hub proteins can be classified as date hubs (i.e., interact with different partners at different times or locations or party hubs (i.e., simultaneously interact with multiple partners.Our approach works in 3 phases: Phase I classifies if a protein is likely to bind with another protein. Phase II determines if a protein-binding (PB protein is a hub. Phase III classifies PB proteins as singlish-interface versus multiple-interface hubs and date versus party hubs. At each stage, we use sequence-based predictors trained using several standard machine learning techniques.Our method is able to predict whether a protein is a protein-binding protein with an accuracy of 94% and a correlation coefficient of 0.87; identify hubs from non-hubs with 100% accuracy for 30% of the data; distinguish date hubs/party hubs with 69% accuracy and area under ROC curve of 0.68; and SIH/MIH with 89% accuracy and area under ROC curve of 0.84. Because our method is based on sequence information alone, it can be used even in settings where reliable protein-protein interaction data or structures of protein-protein complexes are unavailable to obtain useful insights into the functional and evolutionary characteristics of proteins and their interactions.We provide a web server for our three-phase approach: http://hybsvm.gdcb.iastate.edu.

  12. Predicting binding poses and affinities for protein - ligand complexes in the 2015 D3R Grand Challenge using a physical model with a statistical parameter estimation

    Science.gov (United States)

    Grudinin, Sergei; Kadukova, Maria; Eisenbarth, Andreas; Marillet, Simon; Cazals, Frédéric

    2016-09-01

    The 2015 D3R Grand Challenge provided an opportunity to test our new model for the binding free energy of small molecules, as well as to assess our protocol to predict binding poses for protein-ligand complexes. Our pose predictions were ranked 3-9 for the HSP90 dataset, depending on the assessment metric. For the MAP4K dataset the ranks are very dispersed and equal to 2-35, depending on the assessment metric, which does not provide any insight into the accuracy of the method. The main success of our pose prediction protocol was the re-scoring stage using the recently developed Convex-PL potential. We make a thorough analysis of our docking predictions made with AutoDock Vina and discuss the effect of the choice of rigid receptor templates, the number of flexible residues in the binding pocket, the binding pocket size, and the benefits of re-scoring. However, the main challenge was to predict experimentally determined binding affinities for two blind test sets. Our affinity prediction model consisted of two terms, a pairwise-additive enthalpy, and a non pairwise-additive entropy. We trained the free parameters of the model with a regularized regression using affinity and structural data from the PDBBind database. Our model performed very well on the training set, however, failed on the two test sets. We explain the drawback and pitfalls of our model, in particular in terms of relative coverage of the test set by the training set and missed dynamical properties from crystal structures, and discuss different routes to improve it.

  13. Cell-type specificity of ChIP-predicted transcription factor binding sites

    Directory of Open Access Journals (Sweden)

    Håndstad Tony

    2012-08-01

    Full Text Available Abstract Background Context-dependent transcription factor (TF binding is one reason for differences in gene expression patterns between different cellular states. Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq identifies genome-wide TF binding sites for one particular context—the cells used in the experiment. But can such ChIP-seq data predict TF binding in other cellular contexts and is it possible to distinguish context-dependent from ubiquitous TF binding? Results We compared ChIP-seq data on TF binding for multiple TFs in two different cell types and found that on average only a third of ChIP-seq peak regions are common to both cell types. Expectedly, common peaks occur more frequently in certain genomic contexts, such as CpG-rich promoters, whereas chromatin differences characterize cell-type specific TF binding. We also find, however, that genotype differences between the cell types can explain differences in binding. Moreover, ChIP-seq signal intensity and peak clustering are the strongest predictors of common peaks. Compared with strong peaks located in regions containing peaks for multiple transcription factors, weak and isolated peaks are less common between the cell types and are less associated with data that indicate regulatory activity. Conclusions Together, the results suggest that experimental noise is prevalent among weak peaks, whereas strong and clustered peaks represent high-confidence binding events that often occur in other cellular contexts. Nevertheless, 30-40% of the strongest and most clustered peaks show context-dependent regulation. We show that by combining signal intensity with additional data—ranging from context independent information such as binding site conservation and position weight matrix scores to context dependent chromatin structure—we can predict whether a ChIP-seq peak is likely to be present in other cellular contexts.

  14. Use of Spectroscopic, Zeta Potential and Molecular Dynamic Techniques to Study the Interaction between Human Holo-Transferrin and Two Antagonist Drugs: Comparison of Binary and Ternary Systems

    Directory of Open Access Journals (Sweden)

    Mohammad Reza Saberi

    2012-03-01

    Full Text Available For the first time, the binding of ropinirole hydrochloride (ROP and aspirin (ASA to human holo-transferrin (hTf has been investigated by spectroscopic approaches (fluorescence quenching, synchronous fluorescence, time-resolved fluorescence, three-dimensional fluorescence, UV-vis absorption, circular dichroism, resonance light scattering, as well as zeta potential and molecular modeling techniques, under simulated physiological conditions. Fluorescence analysis was used to estimate the effect of the ROP and ASA drugs on the fluorescence of hTf as well as to define the binding and quenching properties of binary and ternary complexes. The synchronized fluorescence and three-dimensional fluorescence spectra demonstrated some micro-environmental and conformational changes around the Trp and Tyr residues with a faint red shift. Thermodynamic analysis displayed the van der Waals forces and hydrogen bonds interactions are the major acting forces in stabilizing the complexes. Steady-state and time-resolved fluorescence data revealed that the fluorescence quenching of complexes are static mechanism. The effect of the drugs aggregating on the hTf resulted in an enhancement of the resonance light scattering (RLS intensity. The average binding distance between were computed according to the forster non-radiation energy transfer theory. The circular dichroism (CD spectral examinations indicated that the binding of the drugs induced a conformational change of hTf. Measurements of the zeta potential indicated that the combination of electrostatic and hydrophobic interactions between ROP, ASA and hTf formed micelle-like clusters. The molecular modeling confirmed the experimental results. This study is expected to provide important insight into the interaction of hTf with ROP and ASA to use in various toxicological and therapeutic processes.

  15. The necessity of connection structures in neural models of variable binding.

    Science.gov (United States)

    van der Velde, Frank; de Kamps, Marc

    2015-08-01

    In his review of neural binding problems, Feldman (Cogn Neurodyn 7:1-11, 2013) addressed two types of models as solutions of (novel) variable binding. The one type uses labels such as phase synchrony of activation. The other ('connectivity based') type uses dedicated connections structures to achieve novel variable binding. Feldman argued that label (synchrony) based models are the only possible candidates to handle novel variable binding, whereas connectivity based models lack the flexibility required for that. We argue and illustrate that Feldman's analysis is incorrect. Contrary to his conclusion, connectivity based models are the only viable candidates for models of novel variable binding because they are the only type of models that can produce behavior. We will show that the label (synchrony) based models analyzed by Feldman are in fact examples of connectivity based models. Feldman's analysis that novel variable binding can be achieved without existing connection structures seems to result from analyzing the binding problem in a wrong frame of reference, in particular in an outside instead of the required inside frame of reference. Connectivity based models can be models of novel variable binding when they possess a connection structure that resembles a small-world network, as found in the brain. We will illustrate binding with this type of model with episode binding and the binding of words, including novel words, in sentence structures.

  16. Specificity of anion-binding in the substrate-pocket ofbacteriorhodopsin

    Energy Technology Data Exchange (ETDEWEB)

    Facciotti, Marc T.; Cheung, Vincent S.; Lunde, Christopher S.; Rouhani, Shahab; Baliga, Nitin S.; Glaeser, Robert M.

    2003-08-30

    The structure of the D85S mutant of bacteriorhodopsin with a nitrate anion bound in the Schiff-base binding site, and the structure of the anion-free protein have been obtained in the same crystal form. Together with the previously solved structures of this anion pump, in both the anion-free state and bromide-bound state, these new structures provide insight into how this mutant of bacteriorhodopsin is able to bind a variety of different anions in the same binding pocket. The structural analysis reveals that the main structural change that accommodates different anions is the repositioning of the polar side-chain of S85. On the basis of these x-ray crystal structures, the prediction is then made that the D85S/D212N double mutant might bind similar anions and do so over a broader pH range than does the single mutant. Experimental comparison of the dissociation constants, K{sub d}, for a variety of anions confirms this prediction and demonstrates, in addition, that the binding affinity is dramatically improved by the D212N substitution.

  17. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method

    DEFF Research Database (Denmark)

    Nielsen, Morten; Lundegaard, Claus; Lund, Ole

    2007-01-01

    the correct alignment of a peptide in the binding groove a crucial part of identifying the core of an MHC class II binding motif. Here, we present a novel stabilization matrix alignment method, SMM-align, that allows for direct prediction of peptide:MHC binding affinities. The predictive performance...... of the method is validated on a large MHC class II benchmark data set covering 14 HLA-DR (human MHC) and three mouse H2-IA alleles. RESULTS: The predictive performance of the SMM-align method was demonstrated to be superior to that of the Gibbs sampler, TEPITOPE, SVRMHC, and MHCpred methods. Cross validation...... between peptide data set obtained from different sources demonstrated that direct incorporation of peptide length potentially results in over-fitting of the binding prediction method. Focusing on amino terminal peptide flanking residues (PFR), we demonstrate a consistent gain in predictive performance...

  18. Prediction of Nucleotide Binding Peptides Using Star Graph Topological Indices.

    Science.gov (United States)

    Liu, Yong; Munteanu, Cristian R; Fernández Blanco, Enrique; Tan, Zhiliang; Santos Del Riego, Antonino; Pazos, Alejandro

    2015-11-01

    The nucleotide binding proteins are involved in many important cellular processes, such as transmission of genetic information or energy transfer and storage. Therefore, the screening of new peptides for this biological function is an important research topic. The current study proposes a mixed methodology to obtain the first classification model that is able to predict new nucleotide binding peptides, using only the amino acid sequence. Thus, the methodology uses a Star graph molecular descriptor of the peptide sequences and the Machine Learning technique for the best classifier. The best model represents a Random Forest classifier based on two features of the embedded and non-embedded graphs. The performance of the model is excellent, considering similar models in the field, with an Area Under the Receiver Operating Characteristic Curve (AUROC) value of 0.938 and true positive rate (TPR) of 0.886 (test subset). The prediction of new nucleotide binding peptides with this model could be useful for drug target studies in drug development. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Two unique ligand-binding clamps of Rhizopus oryzae starch binding domain for helical structure disruption of amylose.

    Directory of Open Access Journals (Sweden)

    Ting-Ying Jiang

    Full Text Available The N-terminal starch binding domain of Rhizopus oryzae glucoamylase (RoSBD has a high binding affinity for raw starch. RoSBD has two ligand-binding sites, each containing a ligand-binding clamp: a polyN clamp residing near binding site I is unique in that it is expressed in only three members of carbohydrate binding module family 21 (CBM21 members, and a Y32/F58 clamp located at binding site II is conserved in several CBMs. Here we characterized different roles of these sites in the binding of insoluble and soluble starches using an amylose-iodine complex assay, atomic force microscopy, isothermal titration calorimetry, site-directed mutagenesis, and structural bioinformatics. RoSBD induced the release of iodine from the amylose helical cavity and disrupted the helical structure of amylose type III, thereby significantly diminishing the thickness and length of the amylose type III fibrils. A point mutation in the critical ligand-binding residues of sites I and II, however, reduced both the binding affinity and amylose helix disruption. This is the first molecular model for structure disruption of the amylose helix by a non-hydrolytic CBM21 member. RoSBD apparently twists the helical amylose strands apart to expose more ligand surface for further SBD binding. Repeating the process triggers the relaxation and unwinding of amylose helices to generate thinner and shorter amylose fibrils, which are more susceptible to hydrolysis by glucoamylase. This model aids in understanding the natural roles of CBMs in protein-glycan interactions and contributes to potential molecular engineering of CBMs.

  20. Positively-charged semi-tunnel is a structural and surface characteristic of polyphosphate-binding proteins: an in-silico study.

    Directory of Open Access Journals (Sweden)

    Zheng Zachory Wei

    Full Text Available Phosphate is essential for all major life processes, especially energy metabolism and signal transduction. A linear phosphate polymer, polyphosphate (polyP, linked by high-energy phosphoanhydride bonds, can interact with various proteins, playing important roles as an energy source and regulatory factor. However, polyP-binding structures are largely unknown. Here we proposed a putative polyP binding site, a positively-charged semi-tunnel (PCST, identified by surface electrostatics analyses in polyP kinases (PPKs and many other polyP-related proteins. We found that the PCSTs in varied proteins were folded in different secondary structure compositions. Molecular docking calculations revealed a significant value for binding affinity to polyP in PCST-containing proteins. Utilizing the PCST identified in the β subunit of PPK3, we predicted the potential polyP-binding domain of PPK3. The discovery of this feature facilitates future searches for polyP-binding proteins and discovery of the mechanisms for polyP-binding activities. This should greatly enhance the understanding of the many physiological functions of protein-bound polyP and the involvement of polyP and polyP-binding proteins in various human diseases.

  1. Domain-based small molecule binding site annotation

    Directory of Open Access Journals (Sweden)

    Dumontier Michel

    2006-03-01

    Full Text Available Abstract Background Accurate small molecule binding site information for a protein can facilitate studies in drug docking, drug discovery and function prediction, but small molecule binding site protein sequence annotation is sparse. The Small Molecule Interaction Database (SMID, a database of protein domain-small molecule interactions, was created using structural data from the Protein Data Bank (PDB. More importantly it provides a means to predict small molecule binding sites on proteins with a known or unknown structure and unlike prior approaches, removes large numbers of false positive hits arising from transitive alignment errors, non-biologically significant small molecules and crystallographic conditions that overpredict ion binding sites. Description Using a set of co-crystallized protein-small molecule structures as a starting point, SMID interactions were generated by identifying protein domains that bind to small molecules, using NCBI's Reverse Position Specific BLAST (RPS-BLAST algorithm. SMID records are available for viewing at http://smid.blueprint.org. The SMID-BLAST tool provides accurate transitive annotation of small-molecule binding sites for proteins not found in the PDB. Given a protein sequence, SMID-BLAST identifies domains using RPS-BLAST and then lists potential small molecule ligands based on SMID records, as well as their aligned binding sites. A heuristic ligand score is calculated based on E-value, ligand residue identity and domain entropy to assign a level of confidence to hits found. SMID-BLAST predictions were validated against a set of 793 experimental small molecule interactions from the PDB, of which 472 (60% of predicted interactions identically matched the experimental small molecule and of these, 344 had greater than 80% of the binding site residues correctly identified. Further, we estimate that 45% of predictions which were not observed in the PDB validation set may be true positives. Conclusion By

  2. Crystal structure of the botulinum neurotoxin type G binding domain: insight into cell surface binding.

    Science.gov (United States)

    Stenmark, Pål; Dong, Min; Dupuy, Jérôme; Chapman, Edwin R; Stevens, Raymond C

    2010-04-16

    Botulinum neurotoxins (BoNTs) typically bind the neuronal cell surface via dual interactions with both protein receptors and gangliosides. We present here the 1.9-A X-ray structure of the BoNT serotype G (BoNT/G) receptor binding domain (residues 868-1297) and a detailed view of protein receptor and ganglioside binding regions. The ganglioside binding motif (SxWY) has a conserved structure compared to the corresponding regions in BoNT serotype A and BoNT serotype B (BoNT/B), but several features of interactions with the hydrophilic face of the ganglioside are absent at the opposite side of the motif in the BoNT/G ganglioside binding cleft. This may significantly reduce the affinity between BoNT/G and gangliosides. BoNT/G and BoNT/B share the protein receptor synaptotagmin (Syt) I/II. The Syt binding site has a conserved hydrophobic plateau located centrally in the proposed protein receptor binding interface (Tyr1189, Phe1202, Ala1204, Pro1205, and Phe1212). Interestingly, only 5 of 14 residues that are important for binding between Syt-II and BoNT/B are conserved in BoNT/G, suggesting that the means by which BoNT/G and BoNT/B bind Syt diverges more than previously appreciated. Indeed, substitution of Syt-II Phe47 and Phe55 with alanine residues had little effect on the binding of BoNT/G, but strongly reduced the binding of BoNT/B. Furthermore, an extended solvent-exposed hydrophobic loop, located between the Syt binding site and the ganglioside binding cleft, may serve as a third membrane association and binding element to contribute to high-affinity binding to the neuronal membrane. While BoNT/G and BoNT/B are homologous to each other and both utilize Syt-I/Syt-II as their protein receptor, the precise means by which these two toxin serotypes bind to Syt appears surprisingly divergent. Copyright (c) 2010. Published by Elsevier Ltd.

  3. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method.

    Science.gov (United States)

    Nielsen, Morten; Lundegaard, Claus; Lund, Ole

    2007-07-04

    Antigen presenting cells (APCs) sample the extra cellular space and present peptides from here to T helper cells, which can be activated if the peptides are of foreign origin. The peptides are presented on the surface of the cells in complex with major histocompatibility class II (MHC II) molecules. Identification of peptides that bind MHC II molecules is thus a key step in rational vaccine design and developing methods for accurate prediction of the peptide:MHC interactions play a central role in epitope discovery. The MHC class II binding groove is open at both ends making the correct alignment of a peptide in the binding groove a crucial part of identifying the core of an MHC class II binding motif. Here, we present a novel stabilization matrix alignment method, SMM-align, that allows for direct prediction of peptide:MHC binding affinities. The predictive performance of the method is validated on a large MHC class II benchmark data set covering 14 HLA-DR (human MHC) and three mouse H2-IA alleles. The predictive performance of the SMM-align method was demonstrated to be superior to that of the Gibbs sampler, TEPITOPE, SVRMHC, and MHCpred methods. Cross validation between peptide data set obtained from different sources demonstrated that direct incorporation of peptide length potentially results in over-fitting of the binding prediction method. Focusing on amino terminal peptide flanking residues (PFR), we demonstrate a consistent gain in predictive performance by favoring binding registers with a minimum PFR length of two amino acids. Visualizing the binding motif as obtained by the SMM-align and TEPITOPE methods highlights a series of fundamental discrepancies between the two predicted motifs. For the DRB1*1302 allele for instance, the TEPITOPE method favors basic amino acids at most anchor positions, whereas the SMM-align method identifies a preference for hydrophobic or neutral amino acids at the anchors. The SMM-align method was shown to outperform other

  4. Prediction of MHC class II binding affinity using SMM-align, a novel stabilization matrix alignment method

    Directory of Open Access Journals (Sweden)

    Lund Ole

    2007-07-01

    Full Text Available Abstract Background Antigen presenting cells (APCs sample the extra cellular space and present peptides from here to T helper cells, which can be activated if the peptides are of foreign origin. The peptides are presented on the surface of the cells in complex with major histocompatibility class II (MHC II molecules. Identification of peptides that bind MHC II molecules is thus a key step in rational vaccine design and developing methods for accurate prediction of the peptide:MHC interactions play a central role in epitope discovery. The MHC class II binding groove is open at both ends making the correct alignment of a peptide in the binding groove a crucial part of identifying the core of an MHC class II binding motif. Here, we present a novel stabilization matrix alignment method, SMM-align, that allows for direct prediction of peptide:MHC binding affinities. The predictive performance of the method is validated on a large MHC class II benchmark data set covering 14 HLA-DR (human MHC and three mouse H2-IA alleles. Results The predictive performance of the SMM-align method was demonstrated to be superior to that of the Gibbs sampler, TEPITOPE, SVRMHC, and MHCpred methods. Cross validation between peptide data set obtained from different sources demonstrated that direct incorporation of peptide length potentially results in over-fitting of the binding prediction method. Focusing on amino terminal peptide flanking residues (PFR, we demonstrate a consistent gain in predictive performance by favoring binding registers with a minimum PFR length of two amino acids. Visualizing the binding motif as obtained by the SMM-align and TEPITOPE methods highlights a series of fundamental discrepancies between the two predicted motifs. For the DRB1*1302 allele for instance, the TEPITOPE method favors basic amino acids at most anchor positions, whereas the SMM-align method identifies a preference for hydrophobic or neutral amino acids at the anchors. Conclusion

  5. New horizons in mouse immunoinformatics: reliable in silico prediction of mouse class I histocompatibility major complex peptide binding affinity.

    Science.gov (United States)

    Hattotuwagama, Channa K; Guan, Pingping; Doytchinova, Irini A; Flower, Darren R

    2004-11-21

    Quantitative structure-activity relationship (QSAR) analysis is a main cornerstone of modern informatic disciplines. Predictive computational models, based on QSAR technology, of peptide-major histocompatibility complex (MHC) binding affinity have now become a vital component of modern day computational immunovaccinology. Historically, such approaches have been built around semi-qualitative, classification methods, but these are now giving way to quantitative regression methods. The additive method, an established immunoinformatics technique for the quantitative prediction of peptide-protein affinity, was used here to identify the sequence dependence of peptide binding specificity for three mouse class I MHC alleles: H2-D(b), H2-K(b) and H2-K(k). As we show, in terms of reliability the resulting models represent a significant advance on existing methods. They can be used for the accurate prediction of T-cell epitopes and are freely available online ( http://www.jenner.ac.uk/MHCPred).

  6. Evidence of chemical exchange in recombinant Major Urinary Protein and quenching thereof upon pheromone binding

    Energy Technology Data Exchange (ETDEWEB)

    Perazzolo, Chiara, E-mail: Chiara.Perazzolo@epfl.ch; Verde, Mariachiara [Ecole Polytechnique Federale de Lausanne, Institut des Sciences et Ingenierie Chimiques (Switzerland); Homans, Steve W. [University of Leeds, Institute of Molecular and Cellular Biology (United Kingdom); Bodenhausen, Geoffrey [Ecole Polytechnique Federale de Lausanne, Institut des Sciences et Ingenierie Chimiques (Switzerland)

    2007-05-15

    The internal dynamics of recombinant Major Urinary Protein (rMUP) have been investigated by monitoring transverse nitrogen-15 relaxation using multiple-echo Carr-Purcell-Meiboom-Gill (CPMG) experiments. While the ligand-free protein (APO-rMUP) features extensive evidence of motions on the milliseconds time scale, the complex with 2-methoxy-3-isobutylpyrazine (HOLO-rMUP) appears to be much less mobile on this time scale. At 308 K, exchange rates k{sub ex} = 500-2000 s{sup -1} were typically observed in APO-rMUP for residues located adjacent to a {beta}-turn comprising residues 83-87. These residues occlude an entry to the binding pocket and have been proposed to be a portal for ligand entry in other members of the lipocalin family, such as the retinol binding protein and the human fatty-acid binding protein. Exchange rates and populations are largely uncorrelated, suggesting local 'breathing' motions rather than a concerted global conformational change.

  7. Evidence of chemical exchange in recombinant Major Urinary Protein and quenching thereof upon pheromone binding

    International Nuclear Information System (INIS)

    Perazzolo, Chiara; Verde, Mariachiara; Homans, Steve W.; Bodenhausen, Geoffrey

    2007-01-01

    The internal dynamics of recombinant Major Urinary Protein (rMUP) have been investigated by monitoring transverse nitrogen-15 relaxation using multiple-echo Carr-Purcell-Meiboom-Gill (CPMG) experiments. While the ligand-free protein (APO-rMUP) features extensive evidence of motions on the milliseconds time scale, the complex with 2-methoxy-3-isobutylpyrazine (HOLO-rMUP) appears to be much less mobile on this time scale. At 308 K, exchange rates k ex = 500-2000 s -1 were typically observed in APO-rMUP for residues located adjacent to a β-turn comprising residues 83-87. These residues occlude an entry to the binding pocket and have been proposed to be a portal for ligand entry in other members of the lipocalin family, such as the retinol binding protein and the human fatty-acid binding protein. Exchange rates and populations are largely uncorrelated, suggesting local 'breathing' motions rather than a concerted global conformational change

  8. Large-scale binding ligand prediction by improved patch-based method Patch-Surfer2.0.

    Science.gov (United States)

    Zhu, Xiaolei; Xiong, Yi; Kihara, Daisuke

    2015-03-01

    Ligand binding is a key aspect of the function of many proteins. Thus, binding ligand prediction provides important insight in understanding the biological function of proteins. Binding ligand prediction is also useful for drug design and examining potential drug side effects. We present a computational method named Patch-Surfer2.0, which predicts binding ligands for a protein pocket. By representing and comparing pockets at the level of small local surface patches that characterize physicochemical properties of the local regions, the method can identify binding pockets of the same ligand even if they do not share globally similar shapes. Properties of local patches are represented by an efficient mathematical representation, 3D Zernike Descriptor. Patch-Surfer2.0 has significant technical improvements over our previous prototype, which includes a new feature that captures approximate patch position with a geodesic distance histogram. Moreover, we constructed a large comprehensive database of ligand binding pockets that will be searched against by a query. The benchmark shows better performance of Patch-Surfer2.0 over existing methods. http://kiharalab.org/patchsurfer2.0/ CONTACT: dkihara@purdue.edu Supplementary data are available at Bioinformatics online. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  9. Structure-based function prediction of the expanding mollusk tyrosinase family

    Science.gov (United States)

    Huang, Ronglian; Li, Li; Zhang, Guofan

    2017-11-01

    Tyrosinase (Ty) is a common enzyme found in many different animal groups. In our previous study, genome sequencing revealed that the Ty family is expanded in the Pacific oyster ( Crassostrea gigas). Here, we examine the larger number of Ty family members in the Pacific oyster by high-level structure prediction to obtain more information about their function and evolution, especially the unknown role in biomineralization. We verified 12 Ty gene sequences from Crassostrea gigas genome and Pinctada fucata martensii transcriptome. By using phylogenetic analysis of these Tys with functionally known Tys from other molluscan species, eight subgroups were identified (CgTy_s1, CgTy_s2, MolTy_s1, MolTy-s2, MolTy-s3, PinTy-s1, PinTy-s2 and PviTy). Structural data and surface pockets of the dinuclear copper center in the eight subgroups of molluscan Ty were obtained using the latest versions of prediction online servers. Structural comparison with other Ty proteins from the protein databank revealed functionally important residues (HA1, HA2, HA3, HB1, HB2, HB3, Z1-Z9) and their location within these protein structures. The structural and chemical features of these pockets which may related to the substrate binding showed considerable variability among mollusks, which undoubtedly defines Ty substrate binding. Finally, we discuss the potential driving forces of Ty family evolution in mollusks. Based on these observations, we conclude that the Ty family has rapidly evolved as a consequence of substrate adaptation in mollusks.

  10. MCTBI: a web server for predicting metal ion effects in RNA structures.

    Science.gov (United States)

    Sun, Li-Zhen; Zhang, Jing-Xiang; Chen, Shi-Jie

    2017-08-01

    Metal ions play critical roles in RNA structure and function. However, web servers and software packages for predicting ion effects in RNA structures are notably scarce. Furthermore, the existing web servers and software packages mainly neglect ion correlation and fluctuation effects, which are potentially important for RNAs. We here report a new web server, the MCTBI server (http://rna.physics.missouri.edu/MCTBI), for the prediction of ion effects for RNA structures. This server is based on the recently developed MCTBI, a model that can account for ion correlation and fluctuation effects for nucleic acid structures and can provide improved predictions for the effects of metal ions, especially for multivalent ions such as Mg 2+ effects, as shown by extensive theory-experiment test results. The MCTBI web server predicts metal ion binding fractions, the most probable bound ion distribution, the electrostatic free energy of the system, and the free energy components. The results provide mechanistic insights into the role of metal ions in RNA structure formation and folding stability, which is important for understanding RNA functions and the rational design of RNA structures. © 2017 Sun et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  11. Accurate and Reliable Prediction of the Binding Affinities of Macrocycles to Their Protein Targets.

    Science.gov (United States)

    Yu, Haoyu S; Deng, Yuqing; Wu, Yujie; Sindhikara, Dan; Rask, Amy R; Kimura, Takayuki; Abel, Robert; Wang, Lingle

    2017-12-12

    Macrocycles have been emerging as a very important drug class in the past few decades largely due to their expanded chemical diversity benefiting from advances in synthetic methods. Macrocyclization has been recognized as an effective way to restrict the conformational space of acyclic small molecule inhibitors with the hope of improving potency, selectivity, and metabolic stability. Because of their relatively larger size as compared to typical small molecule drugs and the complexity of the structures, efficient sampling of the accessible macrocycle conformational space and accurate prediction of their binding affinities to their target protein receptors poses a great challenge of central importance in computational macrocycle drug design. In this article, we present a novel method for relative binding free energy calculations between macrocycles with different ring sizes and between the macrocycles and their corresponding acyclic counterparts. We have applied the method to seven pharmaceutically interesting data sets taken from recent drug discovery projects including 33 macrocyclic ligands covering a diverse chemical space. The predicted binding free energies are in good agreement with experimental data with an overall root-mean-square error (RMSE) of 0.94 kcal/mol. This is to our knowledge the first time where the free energy of the macrocyclization of linear molecules has been directly calculated with rigorous physics-based free energy calculation methods, and we anticipate the outstanding accuracy demonstrated here across a broad range of target classes may have significant implications for macrocycle drug discovery.

  12. Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information

    Directory of Open Access Journals (Sweden)

    Panwar Bharat

    2013-02-01

    Full Text Available Abstract Background The vitamins are important cofactors in various enzymatic-reactions. In past, many inhibitors have been designed against vitamin binding pockets in order to inhibit vitamin-protein interactions. Thus, it is important to identify vitamin interacting residues in a protein. It is possible to detect vitamin-binding pockets on a protein, if its tertiary structure is known. Unfortunately tertiary structures of limited proteins are available. Therefore, it is important to develop in-silico models for predicting vitamin interacting residues in protein from its primary structure. Results In this study, first we compared protein-interacting residues of vitamins with other ligands using Two Sample Logo (TSL. It was observed that ATP, GTP, NAD, FAD and mannose preferred {G,R,K,S,H}, {G,K,T,S,D,N}, {T,G,Y}, {G,Y,W} and {Y,D,W,N,E} residues respectively, whereas vitamins preferred {Y,F,S,W,T,G,H} residues for the interaction with proteins. Furthermore, compositional information of preferred and non-preferred residues along with patterns-specificity was also observed within different vitamin-classes. Vitamins A, B and B6 preferred {F,I,W,Y,L,V}, {S,Y,G,T,H,W,N,E} and {S,T,G,H,Y,N} interacting residues respectively. It suggested that protein-binding patterns of vitamins are different from other ligands, and motivated us to develop separate predictor for vitamins and their sub-classes. The four different prediction modules, (i vitamin interacting residues (VIRs, (ii vitamin-A interacting residues (VAIRs, (iii vitamin-B interacting residues (VBIRs and (iv pyridoxal-5-phosphate (vitamin B6 interacting residues (PLPIRs have been developed. We applied various classifiers of SVM, BayesNet, NaiveBayes, ComplementNaiveBayes, NaiveBayesMultinomial, RandomForest and IBk etc., as machine learning techniques, using binary and Position-Specific Scoring Matrix (PSSM features of protein sequences. Finally, we selected best performing SVM modules and

  13. Prediction of vitamin interacting residues in a vitamin binding protein using evolutionary information.

    Science.gov (United States)

    Panwar, Bharat; Gupta, Sudheer; Raghava, Gajendra P S

    2013-02-07

    The vitamins are important cofactors in various enzymatic-reactions. In past, many inhibitors have been designed against vitamin binding pockets in order to inhibit vitamin-protein interactions. Thus, it is important to identify vitamin interacting residues in a protein. It is possible to detect vitamin-binding pockets on a protein, if its tertiary structure is known. Unfortunately tertiary structures of limited proteins are available. Therefore, it is important to develop in-silico models for predicting vitamin interacting residues in protein from its primary structure. In this study, first we compared protein-interacting residues of vitamins with other ligands using Two Sample Logo (TSL). It was observed that ATP, GTP, NAD, FAD and mannose preferred {G,R,K,S,H}, {G,K,T,S,D,N}, {T,G,Y}, {G,Y,W} and {Y,D,W,N,E} residues respectively, whereas vitamins preferred {Y,F,S,W,T,G,H} residues for the interaction with proteins. Furthermore, compositional information of preferred and non-preferred residues along with patterns-specificity was also observed within different vitamin-classes. Vitamins A, B and B6 preferred {F,I,W,Y,L,V}, {S,Y,G,T,H,W,N,E} and {S,T,G,H,Y,N} interacting residues respectively. It suggested that protein-binding patterns of vitamins are different from other ligands, and motivated us to develop separate predictor for vitamins and their sub-classes. The four different prediction modules, (i) vitamin interacting residues (VIRs), (ii) vitamin-A interacting residues (VAIRs), (iii) vitamin-B interacting residues (VBIRs) and (iv) pyridoxal-5-phosphate (vitamin B6) interacting residues (PLPIRs) have been developed. We applied various classifiers of SVM, BayesNet, NaiveBayes, ComplementNaiveBayes, NaiveBayesMultinomial, RandomForest and IBk etc., as machine learning techniques, using binary and Position-Specific Scoring Matrix (PSSM) features of protein sequences. Finally, we selected best performing SVM modules and obtained highest MCC of 0.53, 0.48, 0.61, 0

  14. Structural analysis of site-directed mutants of cellular retinoic acid-binding protein II addresses the relationship between structural integrity and ligand binding

    International Nuclear Information System (INIS)

    Vaezeslami, Soheila; Jia, Xiaofei; Vasileiou, Chrysoula; Borhan, Babak; Geiger, James H.

    2008-01-01

    A water network stabilizes the structure of cellular retionic acid binding protein II. The structural integrity of cellular retinoic acid-binding protein II (CRABPII) has been investigated using the crystal structures of CRABPII mutants. The overall fold was well maintained by these CRABPII mutants, each of which carried multiple different mutations. A water-mediated network is found to be present across the large binding cavity, extending from Arg111 deep inside the cavity to the α2 helix at its entrance. This chain of interactions acts as a ‘pillar’ that maintains the integrity of the protein. The disruption of the water network upon loss of Arg111 leads to decreased structural integrity of the protein. A water-mediated network can be re-established by introducing the hydrophilic Glu121 inside the cavity, which results in a rigid protein with the α2 helix adopting an altered conformation compared with wild-type CRABPII

  15. Proteochemometric model for predicting the inhibition of penicillin-binding proteins

    Science.gov (United States)

    Nabu, Sunanta; Nantasenamat, Chanin; Owasirikul, Wiwat; Lawung, Ratana; Isarankura-Na-Ayudhya, Chartchalerm; Lapins, Maris; Wikberg, Jarl E. S.; Prachayasittikul, Virapong

    2015-02-01

    Neisseria gonorrhoeae infection threatens to become an untreatable sexually transmitted disease in the near future owing to the increasing emergence of N. gonorrhoeae strains with reduced susceptibility and resistance to the extended-spectrum cephalosporins (ESCs), i.e. ceftriaxone and cefixime, which are the last remaining option for first-line treatment of gonorrhea. Alteration of the penA gene, encoding penicillin-binding protein 2 (PBP2), is the main mechanism conferring penicillin resistance including reduced susceptibility and resistance to ESCs. To predict and investigate putative amino acid mutations causing β-lactam resistance particularly for ESCs, we applied proteochemometric modeling to generalize N. gonorrhoeae susceptibility data for predicting the interaction of PBP2 with therapeutic β-lactam antibiotics. This was afforded by correlating publicly available data on antimicrobial susceptibility of wild-type and mutant N. gonorrhoeae strains for penicillin-G, cefixime and ceftriaxone with 50 PBP2 protein sequence data using partial least-squares projections to latent structures. The generated model revealed excellent predictability ( R 2 = 0.91, Q 2 = 0.77, Q Ext 2 = 0.78). Moreover, our model identified amino acid mutations in PBP2 with the highest impact on antimicrobial susceptibility and provided information on physicochemical properties of amino acid mutations affecting antimicrobial susceptibility. Our model thus provided insight into the physicochemical basis for resistance development in PBP2 suggesting its use for predicting and monitoring novel PBP2 mutations that may emerge in the future.

  16. Mapping small molecule binding data to structural domains.

    Science.gov (United States)

    Kruger, Felix A; Rostom, Raghd; Overington, John P

    2012-01-01

    Large-scale bioactivity/SAR Open Data has recently become available, and this has allowed new analyses and approaches to be developed to help address the productivity and translational gaps of current drug discovery. One of the current limitations of these data is the relative sparsity of reported interactions per protein target, and complexities in establishing clear relationships between bioactivity and targets using bioinformatics tools. We detail in this paper the indexing of targets by the structural domains that bind (or are likely to bind) the ligand within a full-length protein. Specifically, we present a simple heuristic to map small molecule binding to Pfam domains. This profiling can be applied to all proteins within a genome to give some indications of the potential pharmacological modulation and regulation of all proteins. In this implementation of our heuristic, ligand binding to protein targets from the ChEMBL database was mapped to structural domains as defined by profiles contained within the Pfam-A database. Our mapping suggests that the majority of assay targets within the current version of the ChEMBL database bind ligands through a small number of highly prevalent domains, and conversely the majority of Pfam domains sampled by our data play no currently established role in ligand binding. Validation studies, carried out firstly against Uniprot entries with expert binding-site annotation and secondly against entries in the wwPDB repository of crystallographic protein structures, demonstrate that our simple heuristic maps ligand binding to the correct domain in about 90 percent of all assessed cases. Using the mappings obtained with our heuristic, we have assembled ligand sets associated with each Pfam domain. Small molecule binding has been mapped to Pfam-A domains of protein targets in the ChEMBL bioactivity database. The result of this mapping is an enriched annotation of small molecule bioactivity data and a grouping of activity classes

  17. Prediction of Carbohydrate-Binding Proteins from Sequences Using Support Vector Machines

    Directory of Open Access Journals (Sweden)

    Seizi Someya

    2010-01-01

    Full Text Available Carbohydrate-binding proteins are proteins that can interact with sugar chains but do not modify them. They are involved in many physiological functions, and we have developed a method for predicting them from their amino acid sequences. Our method is based on support vector machines (SVMs. We first clarified the definition of carbohydrate-binding proteins and then constructed positive and negative datasets with which the SVMs were trained. By applying the leave-one-out test to these datasets, our method delivered 0.92 of the area under the receiver operating characteristic (ROC curve. We also examined two amino acid grouping methods that enable effective learning of sequence patterns and evaluated the performance of these methods. When we applied our method in combination with the homology-based prediction method to the annotated human genome database, H-invDB, we found that the true positive rate of prediction was improved.

  18. Prediction of small molecule binding property of protein domains with Bayesian classifiers based on Markov chains.

    Science.gov (United States)

    Bulashevska, Alla; Stein, Martin; Jackson, David; Eils, Roland

    2009-12-01

    Accurate computational methods that can help to predict biological function of a protein from its sequence are of great interest to research biologists and pharmaceutical companies. One approach to assume the function of proteins is to predict the interactions between proteins and other molecules. In this work, we propose a machine learning method that uses a primary sequence of a domain to predict its propensity for interaction with small molecules. By curating the Pfam database with respect to the small molecule binding ability of its component domains, we have constructed a dataset of small molecule binding and non-binding domains. This dataset was then used as training set to learn a Bayesian classifier, which should distinguish members of each class. The domain sequences of both classes are modelled with Markov chains. In a Jack-knife test, our classification procedure achieved the predictive accuracies of 77.2% and 66.7% for binding and non-binding classes respectively. We demonstrate the applicability of our classifier by using it to identify previously unknown small molecule binding domains. Our predictions are available as supplementary material and can provide very useful information to drug discovery specialists. Given the ubiquitous and essential role small molecules play in biological processes, our method is important for identifying pharmaceutically relevant components of complete proteomes. The software is available from the author upon request.

  19. Activator Protein-1: redox switch controlling structure and DNA-binding

    Energy Technology Data Exchange (ETDEWEB)

    Yin, Zhou; Machius, Mischa; Nestler, Eric J.; Rudenko, Gabby (Texas-MED); (Icahn)

    2017-09-07

    The transcription factor, activator protein-1 (AP-1), binds to cognate DNA under redox control; yet, the underlying mechanism has remained enigmatic. A series of crystal structures of the AP-1 FosB/JunD bZIP domains reveal ordered DNA-binding regions in both FosB and JunD even in absence DNA. However, while JunD is competent to bind DNA, the FosB bZIP domain must undergo a large conformational rearrangement that is controlled by a ‘redox switch’ centered on an inter-molecular disulfide bond. Solution studies confirm that FosB/JunD cannot undergo structural transition and bind DNA when the redox-switch is in the ‘OFF’ state, and show that the mid-point redox potential of the redox switch affords it sensitivity to cellular redox homeostasis. The molecular and structural studies presented here thus reveal the mechanism underlying redox-regulation of AP-1 Fos/Jun transcription factors and provide structural insight for therapeutic interventions targeting AP-1 proteins.

  20. Structural Analysis of Botulinum Neurotoxin Type G Receptor Binding

    Energy Technology Data Exchange (ETDEWEB)

    Schmitt, John; Karalewitz, Andrew; Benefield, Desire A.; Mushrush, Darren J.; Pruitt, Rory N.; Spiller, Benjamin W.; Barbieri, Joseph T.; Lacy, D. Borden (Vanderbilt); (MCW)

    2010-10-19

    Botulinum neurotoxin (BoNT) binds peripheral neurons at the neuromuscular junction through a dual-receptor mechanism that includes interactions with ganglioside and protein receptors. The receptor identities vary depending on BoNT serotype (A-G). BoNT/B and BoNT/G bind the luminal domains of synaptotagmin I and II, homologous synaptic vesicle proteins. We observe conditions under which BoNT/B binds both Syt isoforms, but BoNT/G binds only SytI. Both serotypes bind ganglioside G{sub T1b}. The BoNT/G receptor-binding domain crystal structure provides a context for examining these binding interactions and a platform for understanding the physiological relevance of different Syt receptor isoforms in vivo.

  1. ANALYSIS OF STRUCTURAL ELEMENT OF FAMILY 6 CARBOHYDRATE BINDING MODULE (CTCBM6B OF ALPHA-L-ARABINOFURANOSIDASE FROM CLOSTRIDIUM THERMOCELLUM

    Directory of Open Access Journals (Sweden)

    Shadab Ahmed

    2013-06-01

    Full Text Available The amino acid sequence of a family 6 carbohydrate binding module (CtCBM6B from Clostridium thermocellum alpha-L-arabinofuranosidase showed close evolutionary relationship with some other member of family 6 carbohydrate binding modules. The CD spectrum analysis confirmed the secondary structure prediction of CtCBM6B as both showed beta-sheets (44-48% and random coils (52-54% and no alpha-helix. The hydrogen bonding plot of CtCBM6B showed many segments of parallel and anti-parallel beta-strands which was similar to the secondary structure prediction by PSIPRED VIEW. The three dimensional structure of CtCBM6B generated by MODELLER revealed a typical beta-sandwich architecture at its core, characteristic of beta-jelly roll CBM superfamily. The Ramachandran plot analysis by PROCHECK showed that out of 134 residues, 92.9% were in most favoured region, 6.2% in additionally allowed region and only 0.9% in generously allowed region which indicated a stable conformation of 3D model of CtCBM6B. The docking analysis of CtCBM6B for finding putative ligand binding sites showed that it has high binding affinity for arabinobiose, beta-L-arabinofuranose and beta-D-xylopyranose indicated by lower ligand binding energy (-14.28 kcal mol–1, -12.5 kcal mol–1 and -11.3 kcal mol–1, respectively. CtCBM6B also showed appreciable binding affinity with alpha-D-xylopyranose (–10.8 kcal mol–1, beta-L-arabinopyranose (–10.2 kcal mol-1, alpha-L-arabinopyranose (–10.0 kcal mol–1 and alpha-L-arabinofuranose (–8.75 kcal mol–1. The results indicated that CtCBM6B has high potential for binding arabinan, xylans and substituted xylans.

  2. Protein docking prediction using predicted protein-protein interface

    Directory of Open Access Journals (Sweden)

    Li Bin

    2012-01-01

    Full Text Available Abstract Background Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. Results We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm, is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. Conclusion We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.

  3. Protein docking prediction using predicted protein-protein interface.

    Science.gov (United States)

    Li, Bin; Kihara, Daisuke

    2012-01-10

    Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm), is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.

  4. Insights on Structural Characteristics and Ligand Binding Mechanisms of CDK2

    Directory of Open Access Journals (Sweden)

    Yan Li

    2015-04-01

    Full Text Available Cyclin-dependent kinase 2 (CDK2 is a crucial regulator of the eukaryotic cell cycle. However it is well established that monomeric CDK2 lacks regulatory activity, which needs to be aroused by its positive regulators, cyclins E and A, or be phosphorylated on the catalytic segment. Interestingly, these activation steps bring some dynamic changes on the 3D-structure of the kinase, especially the activation segment. Until now, in the monomeric CDK2 structure, three binding sites have been reported, including the adenosine triphosphate (ATP binding site (Site I and two non-competitive binding sites (Site II and III. In addition, when the kinase is subjected to the cyclin binding process, the resulting structural changes give rise to a variation of the ATP binding site, thus generating an allosteric binding site (Site IV. All the four sites are demonstrated as being targeted by corresponding inhibitors, as is illustrated by the allosteric binding one which is targeted by inhibitor ANS (fluorophore 8-anilino-1-naphthalene sulfonate. In the present work, the binding mechanisms and their fluctuations during the activation process attract our attention. Therefore, we carry out corresponding studies on the structural characterization of CDK2, which are expected to facilitate the understanding of the molecular mechanisms of kinase proteins. Besides, the binding mechanisms of CDK2 with its relevant inhibitors, as well as the changes of binding mechanisms following conformational variations of CDK2, are summarized and compared. The summary of the conformational characteristics and ligand binding mechanisms of CDK2 in the present work will improve our understanding of the molecular mechanisms regulating the bioactivities of CDK2.

  5. Prediction of chloride ingress and binding in cement paste

    DEFF Research Database (Denmark)

    Geiker, Mette Rica; Nielsen, Erik Pram; Herforth, Duncan

    2007-01-01

    This paper summarizes recent work on an analytical model for predicting the ingress rate of chlorides in cement-based materials. An integral part of this is a thermodynamic model for predicting the phase equilibria in hydrated Portland cement. The model’s ability to predict chloride binding...... in Portland cement pastes at any content of chloride, alkalis, sulfates and carbonate was verified experimentally and found to be equally valid when applied to other data in the literature. The thermodynamic model for predicting the phase equilibria in hydrated Portland cement was introduced into an existing...... Finite Difference Model for the ingress of chlorides into concrete which takes into account its multi-component nature. The “composite theory” was then used to predict the diffusivity of each ion based on the phase assemblage present in the hydrated Portland cement paste. Agreement was found between...

  6. Crystal complexes of a predicted S-adenosylmethionine-dependent methyltransferase reveal a typical AdoMet binding domain and a substrate recognition domain

    Energy Technology Data Exchange (ETDEWEB)

    Miller, D.J.; Ouellette, N.; Evodokimova, E.; Savchenko, A.; Edwards, A.; Anderson, W.F. (Toronto); (NWU)

    2010-03-08

    S-adenosyl-L-methionine-dependent methyltransferases (MTs) are abundant, and highly conserved across phylogeny. These enzymes use the cofactor AdoMet to methylate a wide variety of molecular targets, thereby modulating important cellular and metabolic activities. Thermotoga maritima protein 0872 (TM0872) belongs to a large sequence family of predicted MTs, ranging phylogenetically from relatively simple bacteria to humans. The genes for many of the bacterial homologs are located within operons involved in cell wall synthesis and cell division. Despite preliminary biochemical studies in E. coli and B. subtilis, the substrate specificity of this group of more than 150 proteins is unknown. As part of the Midwest Center for Structural Genomics initiative (www.mcsg.anl.gov), we have determined the structure of TM0872 in complexes with AdoMet and with S-adenosyl-L-homocysteine (AdoHcy). As predicted, TM0872 has a typical MT domain, and binds endogenous AdoMet, or co-crystallized AdoHcy, in a manner consistent with other known MT structures. In addition, TM0872 has a second domain that is novel among MTs in both its location in the sequence and its structure. The second domain likely acts in substrate recognition and binding, and there is a potential substrate-binding cleft spanning the two domains. This long and narrow cleft is lined with positively charged residues which are located opposite the S{sup +}-CH{sub 3} bond, suggesting that a negatively charged molecule might be targeted for catalysis. However, AdoMet and AdoHcy are both buried, and access to the methyl group would presumably require structural rearrangement. These TM0872 crystal structures offer the first structural glimpses at this phylogenetically conserved sequence family.

  7. Structural prediction in aphasia

    Directory of Open Access Journals (Sweden)

    Tessa Warren

    2015-05-01

    Full Text Available There is considerable evidence that young healthy comprehenders predict the structure of upcoming material, and that their processing is facilitated when they encounter material matching those predictions (e.g., Staub & Clifton, 2006; Yoshida, Dickey & Sturt, 2013. However, less is known about structural prediction in aphasia. There is evidence that lexical prediction may be spared in aphasia (Dickey et al., 2014; Love & Webb, 1977; cf. Mack et al, 2013. However, predictive mechanisms supporting facilitated lexical access may not necessarily support structural facilitation. Given that many people with aphasia (PWA exhibit syntactic deficits (e.g. Goodglass, 1993, PWA with such impairments may not engage in structural prediction. However, recent evidence suggests that some PWA may indeed predict upcoming structure (Hanne, Burchert, De Bleser, & Vashishth, 2015. Hanne et al. tracked the eyes of PWA (n=8 with sentence-comprehension deficits while they listened to reversible subject-verb-object (SVO and object-verb-subject (OVS sentences in German, in a sentence-picture matching task. Hanne et al. manipulated case and number marking to disambiguate the sentences’ structure. Gazes to an OVS or SVO picture during the unfolding of a sentence were assumed to indicate prediction of the structure congruent with that picture. According to this measure, the PWA’s structural prediction was impaired compared to controls, but they did successfully predict upcoming structure when morphosyntactic cues were strong and unambiguous. Hanne et al.’s visual-world evidence is suggestive, but their forced-choice sentence-picture matching task places tight constraints on possible structural predictions. Clearer evidence of structural prediction would come from paradigms where the content of upcoming material is not as constrained. The current study used self-paced reading study to examine structural prediction among PWA in less constrained contexts. PWA (n=17 who

  8. Memory Binding Test Predicts Incident Amnestic Mild Cognitive Impairment.

    Science.gov (United States)

    Mowrey, Wenzhu B; Lipton, Richard B; Katz, Mindy J; Ramratan, Wendy S; Loewenstein, David A; Zimmerman, Molly E; Buschke, Herman

    2016-07-14

    The Memory Binding Test (MBT), previously known as Memory Capacity Test, has demonstrated discriminative validity for distinguishing persons with amnestic mild cognitive impairment (aMCI) and dementia from cognitively normal elderly. We aimed to assess the predictive validity of the MBT for incident aMCI. In a longitudinal, community-based study of adults aged 70+, we administered the MBT to 246 cognitively normal elderly adults at baseline and followed them annually. Based on previous work, a subtle reduction in memory binding at baseline was defined by a Total Items in the Paired (TIP) condition score of ≤22 on the MBT. Cox proportional hazards models were used to assess the predictive validity of the MBT for incident aMCI accounting for the effects of covariates. The hazard ratio of incident aMCI was also assessed for different prediction time windows ranging from 4 to 7 years of follow-up, separately. Among 246 controls who were cognitively normal at baseline, 48 developed incident aMCI during follow-up. A baseline MBT reduction was associated with an increased risk for developing incident aMCI (hazard ratio (HR) = 2.44, 95% confidence interval: 1.30-4.56, p = 0.005). When varying the prediction window from 4-7 years, the MBT reduction remained significant for predicting incident aMCI (HR range: 2.33-3.12, p: 0.0007-0.04). Persons with poor performance on the MBT are at significantly greater risk for developing incident aMCI. High hazard ratios up to seven years of follow-up suggest that the MBT is sensitive to early disease.

  9. Structural modeling and DNA binding autoinhibition analysis of Ergp55, a critical transcription factor in prostate cancer.

    Directory of Open Access Journals (Sweden)

    Shanti P Gangwar

    Full Text Available BACKGROUND: The Ergp55 protein belongs to Ets family of transcription factor. The Ets proteins are highly conserved in their DNA binding domain and involved in various development processes and regulation of cancer metabolism. To study the structure and DNA binding autoinhibition mechanism of Ergp55 protein, we have produced full length and smaller polypeptides of Ergp55 protein in E. coli and characterized using various biophysical techniques. RESULTS: The Ergp55 polypeptides contain large amount of α-helix and random coil structures as measured by circular dichorism spectroscopy. The full length Ergp55 forms a flexible and elongated molecule as revealed by molecular modeling, dynamics simulation and structural prediction algorithms. The binding analyses of Ergp55 polypeptides with target DNA sequences of E74 and cfos promoters indicate that longer fragments of Ergp55 (beyond the Ets domain showed the evidence of auto-inhibition. This study also revealed the parts of Ergp55 protein that mediate auto-inhibition. SIGNIFICANCE: The current study will aid in designing the compounds that stabilize the inhibited form of Ergp55 and inhibit its binding to promoter DNA. It will contribute in the development of drugs targeting Ergp55 for the prostate cancer treatment.

  10. Recent improvements to Binding MOAD: a resource for protein–ligand binding affinities and structures

    Science.gov (United States)

    Ahmed, Aqeel; Smith, Richard D.; Clark, Jordan J.; Dunbar, James B.; Carlson, Heather A.

    2015-01-01

    For over 10 years, Binding MOAD (Mother of All Databases; http://www.BindingMOAD.org) has been one of the largest resources for high-quality protein–ligand complexes and associated binding affinity data. Binding MOAD has grown at the rate of 1994 complexes per year, on average. Currently, it contains 23 269 complexes and 8156 binding affinities. Our annual updates curate the data using a semi-automated literature search of the references cited within the PDB file, and we have recently upgraded our website and added new features and functionalities to better serve Binding MOAD users. In order to eliminate the legacy application server of the old platform and to accommodate new changes, the website has been completely rewritten in the LAMP (Linux, Apache, MySQL and PHP) environment. The improved user interface incorporates current third-party plugins for better visualization of protein and ligand molecules, and it provides features like sorting, filtering and filtered downloads. In addition to the field-based searching, Binding MOAD now can be searched by structural queries based on the ligand. In order to remove redundancy, Binding MOAD records are clustered in different families based on 90% sequence identity. The new Binding MOAD, with the upgraded platform, features and functionalities, is now equipped to better serve its users. PMID:25378330

  11. Activator Protein-1: redox switch controlling structure and DNA-binding.

    Science.gov (United States)

    Yin, Zhou; Machius, Mischa; Nestler, Eric J; Rudenko, Gabby

    2017-11-02

    The transcription factor, activator protein-1 (AP-1), binds to cognate DNA under redox control; yet, the underlying mechanism has remained enigmatic. A series of crystal structures of the AP-1 FosB/JunD bZIP domains reveal ordered DNA-binding regions in both FosB and JunD even in absence DNA. However, while JunD is competent to bind DNA, the FosB bZIP domain must undergo a large conformational rearrangement that is controlled by a 'redox switch' centered on an inter-molecular disulfide bond. Solution studies confirm that FosB/JunD cannot undergo structural transition and bind DNA when the redox-switch is in the 'OFF' state, and show that the mid-point redox potential of the redox switch affords it sensitivity to cellular redox homeostasis. The molecular and structural studies presented here thus reveal the mechanism underlying redox-regulation of AP-1 Fos/Jun transcription factors and provide structural insight for therapeutic interventions targeting AP-1 proteins. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. NN-align. An artificial neural network-based alignment algorithm for MHC class II peptide binding prediction

    Directory of Open Access Journals (Sweden)

    Lund Ole

    2009-09-01

    Full Text Available Abstract Background The major histocompatibility complex (MHC molecule plays a central role in controlling the adaptive immune response to infections. MHC class I molecules present peptides derived from intracellular proteins to cytotoxic T cells, whereas MHC class II molecules stimulate cellular and humoral immunity through presentation of extracellularly derived peptides to helper T cells. Identification of which peptides will bind a given MHC molecule is thus of great importance for the understanding of host-pathogen interactions, and large efforts have been placed in developing algorithms capable of predicting this binding event. Results Here, we present a novel artificial neural network-based method, NN-align that allows for simultaneous identification of the MHC class II binding core and binding affinity. NN-align is trained using a novel training algorithm that allows for correction of bias in the training data due to redundant binding core representation. Incorporation of information about the residues flanking the peptide-binding core is shown to significantly improve the prediction accuracy. The method is evaluated on a large-scale benchmark consisting of six independent data sets covering 14 human MHC class II alleles, and is demonstrated to outperform other state-of-the-art MHC class II prediction methods. Conclusion The NN-align method is competitive with the state-of-the-art MHC class II peptide binding prediction algorithms. The method is publicly available at http://www.cbs.dtu.dk/services/NetMHCII-2.0.

  13. Sequence-Based Prediction of RNA-Binding Proteins Using Random Forest with Minimum Redundancy Maximum Relevance Feature Selection

    Directory of Open Access Journals (Sweden)

    Xin Ma

    2015-01-01

    Full Text Available The prediction of RNA-binding proteins is one of the most challenging problems in computation biology. Although some studies have investigated this problem, the accuracy of prediction is still not sufficient. In this study, a highly accurate method was developed to predict RNA-binding proteins from amino acid sequences using random forests with the minimum redundancy maximum relevance (mRMR method, followed by incremental feature selection (IFS. We incorporated features of conjoint triad features and three novel features: binding propensity (BP, nonbinding propensity (NBP, and evolutionary information combined with physicochemical properties (EIPP. The results showed that these novel features have important roles in improving the performance of the predictor. Using the mRMR-IFS method, our predictor achieved the best performance (86.62% accuracy and 0.737 Matthews correlation coefficient. High prediction accuracy and successful prediction performance suggested that our method can be a useful approach to identify RNA-binding proteins from sequence information.

  14. Crystal structure of mouse coronavirus receptor-binding domain complexed with its murine receptor

    Energy Technology Data Exchange (ETDEWEB)

    Peng, Guiqing; Sun, Dawei; Rajashankar, Kanagalaghatta R.; Qian, Zhaohui; Holmes, Kathryn V.; Li, Fang (Cornell); (UMM-MED); (Colorado)

    2011-09-28

    Coronaviruses have evolved diverse mechanisms to recognize different receptors for their cross-species transmission and host-range expansion. Mouse hepatitis coronavirus (MHV) uses the N-terminal domain (NTD) of its spike protein as its receptor-binding domain. Here we present the crystal structure of MHV NTD complexed with its receptor murine carcinoembryonic antigen-related cell adhesion molecule 1a (mCEACAM1a). Unexpectedly, MHV NTD contains a core structure that has the same {beta}-sandwich fold as human galectins (S-lectins) and additional structural motifs that bind to the N-terminal Ig-like domain of mCEACAM1a. Despite its galectin fold, MHV NTD does not bind sugars, but instead binds mCEACAM1a through exclusive protein-protein interactions. Critical contacts at the interface have been confirmed by mutagenesis, providing a structural basis for viral and host specificities of coronavirus/CEACAM1 interactions. Sugar-binding assays reveal that galectin-like NTDs of some coronaviruses such as human coronavirus OC43 and bovine coronavirus bind sugars. Structural analysis and mutagenesis localize the sugar-binding site in coronavirus NTDs to be above the {beta}-sandwich core. We propose that coronavirus NTDs originated from a host galectin and retained sugar-binding functions in some contemporary coronaviruses, but evolved new structural features in MHV for mCEACAM1a binding.

  15. Ligand binding and crystal structures of the substrate-binding domain of the ABC transporter OpuA.

    Directory of Open Access Journals (Sweden)

    Justina C Wolters

    2010-04-01

    Full Text Available The ABC transporter OpuA from Lactococcus lactis transports glycine betaine upon activation by threshold values of ionic strength. In this study, the ligand binding characteristics of purified OpuA in a detergent-solubilized state and of its substrate-binding domain produced as soluble protein (OpuAC was characterized.The binding of glycine betaine to purified OpuA and OpuAC (K(D = 4-6 microM did not show any salt dependence or cooperative effects, in contrast to the transport activity. OpuAC is highly specific for glycine betaine and the related proline betaine. Other compatible solutes like proline and carnitine bound with affinities that were 3 to 4 orders of magnitude lower. The low affinity substrates were not noticeably transported by membrane-reconstituted OpuA. OpuAC was crystallized in an open (1.9 A and closed-liganded (2.3 A conformation. The binding pocket is formed by three tryptophans (Trp-prism coordinating the quaternary ammonium group of glycine betaine in the closed-liganded structure. Even though the binding site of OpuAC is identical to that of its B. subtilis homolog, the affinity for glycine betaine is 4-fold higher.Ionic strength did not affect substrate binding to OpuA, indicating that regulation of transport is not at the level of substrate binding, but rather at the level of translocation. The overlap between the crystal structures of OpuAC from L.lactis and B.subtilis, comprising the classical Trp-prism, show that the differences observed in the binding affinities originate from outside of the ligand binding site.

  16. Crystallographic structure and substrate-binding interactions of the molybdate-binding protein of the phytopathogen Xanthomonas axonopodis pv. citri.

    Science.gov (United States)

    Balan, Andrea; Santacruz-Pérez, Carolina; Moutran, Alexandre; Ferreira, Luís Carlos Souza; Neshich, Goran; Gonçalves Barbosa, João Alexandre Ribeiro

    2008-02-01

    In Xanthomonas axonopodis pv. citri (Xac or X. citri), the modA gene codes for a periplasmic protein (ModA) that is capable of binding molybdate and tungstate as part of the ABC-type transporter required for the uptake of micronutrients. In this study, we report the crystallographic structure of the Xac ModA protein with bound molybdate. The Xac ModA structure is similar to orthologs with known three-dimensional structures and consists of two nearly symmetrical domains separated by a hinge region where the oxyanion-binding site lies. Phylogenetic analysis of different ModA orthologs based on sequence alignments revealed three groups of molybdate-binding proteins: bacterial phytopathogens, enterobacteria and soil bacteria. Even though the ModA orthologs are segregated into different groups, the ligand-binding hydrogen bonds are mostly conserved, except for Archaeglobus fulgidus ModA. A detailed discussion of hydrophobic interactions in the active site is presented and two new residues, Ala38 and Ser151, are shown to be part of the ligand-binding pocket.

  17. Two distinct calmodulin binding sites in the third intracellular loop and carboxyl tail of angiotensin II (AT(1A receptor.

    Directory of Open Access Journals (Sweden)

    Renwen Zhang

    Full Text Available In this study, we present data that support the presence of two distinct calmodulin binding sites within the angiotensin II receptor (AT(1A, at juxtamembrane regions of the N-terminus of the third intracellular loop (i3, amino acids 214-231 and carboxyl tail of the receptor (ct, 302-317. We used bioluminescence resonance energy transfer assays to document interactions of calmodulin with the AT(1A holo-receptor and GST-fusion protein pull-downs to demonstrate that i3 and ct interact with calmodulin in a Ca²⁺-dependent fashion. The former is a 1-12 motif and the latter belongs to 1-5-10 calmodulin binding motif. The apparent Kd of calmodulin for i3 is 177.0±9.1 nM, and for ct is 79.4±7.9 nM as assessed by dansyl-calmodulin fluorescence. Replacement of the tryptophan (W219 for alanine in i3, and phenylalanine (F309 or F313 for alanine in ct reduced their binding affinities for calmodulin, as predicted by computer docking simulations. Exogenously applied calmodulin attenuated interactions between G protein βγ subunits and i3 and ct, somewhat more so for ct than i3. Mutations W219A, F309A, and F313A did not alter Gβγ binding, but reduced the ability of calmodulin to compete with Gβγ, suggesting that calmodulin and Gβγ have overlapping, but not identical, binding requirements for i3 and ct. Calmodulin interference with the Gβγ binding to i3 and ct regions of the AT(1A receptor strongly suggests that calmodulin plays critical roles in regulating Gβγ-dependent signaling of the receptor.

  18. Two Distinct Calmodulin Binding Sites in the Third Intracellular Loop and Carboxyl Tail of Angiotensin II (AT1A) Receptor

    Science.gov (United States)

    Zhang, Renwen; Liu, Zhijie; Qu, Youxing; Xu, Ying; Yang, Qing

    2013-01-01

    In this study, we present data that support the presence of two distinct calmodulin binding sites within the angiotensin II receptor (AT1A), at juxtamembrane regions of the N-terminus of the third intracellular loop (i3, amino acids 214–231) and carboxyl tail of the receptor (ct, 302–317). We used bioluminescence resonance energy transfer assays to document interactions of calmodulin with the AT1A holo-receptor and GST-fusion protein pull-downs to demonstrate that i3 and ct interact with calmodulin in a Ca2+-dependent fashion. The former is a 1–12 motif and the latter belongs to 1-5-10 calmodulin binding motif. The apparent Kd of calmodulin for i3 is 177.0±9.1 nM, and for ct is 79.4±7.9 nM as assessed by dansyl-calmodulin fluorescence. Replacement of the tryptophan (W219) for alanine in i3, and phenylalanine (F309 or F313) for alanine in ct reduced their binding affinities for calmodulin, as predicted by computer docking simulations. Exogenously applied calmodulin attenuated interactions between G protein βγ subunits and i3 and ct, somewhat more so for ct than i3. Mutations W219A, F309A, and F313A did not alter Gβγ binding, but reduced the ability of calmodulin to compete with Gβγ, suggesting that calmodulin and Gβγ have overlapping, but not identical, binding requirements for i3 and ct. Calmodulin interference with the Gβγ binding to i3 and ct regions of the AT1A receptor strongly suggests that calmodulin plays critical roles in regulating Gβγ-dependent signaling of the receptor. PMID:23755207

  19. Recent improvements to Binding MOAD: a resource for protein-ligand binding affinities and structures.

    Science.gov (United States)

    Ahmed, Aqeel; Smith, Richard D; Clark, Jordan J; Dunbar, James B; Carlson, Heather A

    2015-01-01

    For over 10 years, Binding MOAD (Mother of All Databases; http://www.BindingMOAD.org) has been one of the largest resources for high-quality protein-ligand complexes and associated binding affinity data. Binding MOAD has grown at the rate of 1994 complexes per year, on average. Currently, it contains 23,269 complexes and 8156 binding affinities. Our annual updates curate the data using a semi-automated literature search of the references cited within the PDB file, and we have recently upgraded our website and added new features and functionalities to better serve Binding MOAD users. In order to eliminate the legacy application server of the old platform and to accommodate new changes, the website has been completely rewritten in the LAMP (Linux, Apache, MySQL and PHP) environment. The improved user interface incorporates current third-party plugins for better visualization of protein and ligand molecules, and it provides features like sorting, filtering and filtered downloads. In addition to the field-based searching, Binding MOAD now can be searched by structural queries based on the ligand. In order to remove redundancy, Binding MOAD records are clustered in different families based on 90% sequence identity. The new Binding MOAD, with the upgraded platform, features and functionalities, is now equipped to better serve its users. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Sequence based prediction of DNA-binding proteins based on hybrid feature selection using random forest and Gaussian naïve Bayes.

    Directory of Open Access Journals (Sweden)

    Wangchao Lou

    Full Text Available Developing an efficient method for determination of the DNA-binding proteins, due to their vital roles in gene regulation, is becoming highly desired since it would be invaluable to advance our understanding of protein functions. In this study, we proposed a new method for the prediction of the DNA-binding proteins, by performing the feature rank using random forest and the wrapper-based feature selection using forward best-first search strategy. The features comprise information from primary sequence, predicted secondary structure, predicted relative solvent accessibility, and position specific scoring matrix. The proposed method, called DBPPred, used Gaussian naïve Bayes as the underlying classifier since it outperformed five other classifiers, including decision tree, logistic regression, k-nearest neighbor, support vector machine with polynomial kernel, and support vector machine with radial basis function. As a result, the proposed DBPPred yields the highest average accuracy of 0.791 and average MCC of 0.583 according to the five-fold cross validation with ten runs on the training benchmark dataset PDB594. Subsequently, blind tests on the independent dataset PDB186 by the proposed model trained on the entire PDB594 dataset and by other five existing methods (including iDNA-Prot, DNA-Prot, DNAbinder, DNABIND and DBD-Threader were performed, resulting in that the proposed DBPPred yielded the highest accuracy of 0.769, MCC of 0.538, and AUC of 0.790. The independent tests performed by the proposed DBPPred on completely a large non-DNA binding protein dataset and two RNA binding protein datasets also showed improved or comparable quality when compared with the relevant prediction methods. Moreover, we observed that majority of the selected features by the proposed method are statistically significantly different between the mean feature values of the DNA-binding and the non DNA-binding proteins. All of the experimental results indicate that

  1. Effects of mutants in bHLH region on structure stability and protein-DNA binding energy in DECs.

    Science.gov (United States)

    Kong, Yi; Wang, Zhen; Jia, Yanfei; Li, Ping; Hao, Shuhua; Wang, Yunshan

    2017-07-01

    The human DEC subfamily contains two highly conserved members belonging to basic helix-loop-helix (bHLH) transcription factors. This conserved family is spread widely among various species with the function of regulating various crucial molecular signaling pathways. Due to the significance of DECs for important biological processes, their relationship with diseases and the lack of experimentally proven structures, we have implemented a comparative modeling for the bHLH region of DECs as homodimers with themselves and heterodimers with HES-1. Three mutants with predicted roles in reducing intramolecular binding (H57A, R65A, and LL7879AA in DEC1 and LL7071AA in DEC2) were investigated on DEC monomers. Molecular dynamics (MD) simulations were also employed to evaluate the behavior of the mutant molecules in aqueous solution. The monomer was divided into subregions for accurate investigation. The fluctuation in the basic region of mutants was higher than that of wild-type molecules. The binding energy value between protein and DNA obviously increased in the homodimer harboring R65A mutants, which led to more unstable status between protein and DNA. Thus, the mutant R65A interfered DNA-binding affinity. A study on the spatial structures of wild-type and mutant DECs may facilitate functional prediction for mutation effects and dynamic behavior under various conditions and may ultimately help in targeted drug design.

  2. LigandRFs: random forest ensemble to identify ligand-binding residues from sequence information alone

    KAUST Repository

    Chen, Peng

    2014-12-03

    Background Protein-ligand binding is important for some proteins to perform their functions. Protein-ligand binding sites are the residues of proteins that physically bind to ligands. Despite of the recent advances in computational prediction for protein-ligand binding sites, the state-of-the-art methods search for similar, known structures of the query and predict the binding sites based on the solved structures. However, such structural information is not commonly available. Results In this paper, we propose a sequence-based approach to identify protein-ligand binding residues. We propose a combination technique to reduce the effects of different sliding residue windows in the process of encoding input feature vectors. Moreover, due to the highly imbalanced samples between the ligand-binding sites and non ligand-binding sites, we construct several balanced data sets, for each of which a random forest (RF)-based classifier is trained. The ensemble of these RF classifiers forms a sequence-based protein-ligand binding site predictor. Conclusions Experimental results on CASP9 and CASP8 data sets demonstrate that our method compares favorably with the state-of-the-art protein-ligand binding site prediction methods.

  3. Mechanisms of Intentional Binding and Sensory Attenuation: The Role of Temporal Prediction, Temporal Control, Identity Prediction, and Motor Prediction

    Science.gov (United States)

    Hughes, Gethin; Desantis, Andrea; Waszak, Florian

    2013-01-01

    Sensory processing of action effects has been shown to differ from that of externally triggered stimuli, with respect both to the perceived timing of their occurrence (intentional binding) and to their intensity (sensory attenuation). These phenomena are normally attributed to forward action models, such that when action prediction is consistent…

  4. Predictions of RNA-binding ability and aggregation propensity of proteins

    OpenAIRE

    Agostini, Federico, 1985-

    2014-01-01

    RNA-binding proteins (RBPs) control the fate of a multitude of coding and non-coding transcripts. Formation of ribonucleoprotein (RNP) complexes fine-tunes regulation of post-transcriptional events and influences gene expression. Recently, it has been observed that non-canonical proteins with RNA-binding ability are enriched in structurally disordered and low-complexity regions that are generally involved in functional and dysfunctional associations. Therefore, it is possible that interaction...

  5. Solution structure of an archaeal DNA binding protein with an eukaryotic zinc finger fold.

    Directory of Open Access Journals (Sweden)

    Florence Guillière

    Full Text Available While the basal transcription machinery in archaea is eukaryal-like, transcription factors in archaea and their viruses are usually related to bacterial transcription factors. Nevertheless, some of these organisms show predicted classical zinc fingers motifs of the C2H2 type, which are almost exclusively found in proteins of eukaryotes and most often associated with transcription regulators. In this work, we focused on the protein AFV1p06 from the hyperthermophilic archaeal virus AFV1. The sequence of the protein consists of the classical eukaryotic C2H2 motif with the fourth histidine coordinating zinc missing, as well as of N- and C-terminal extensions. We showed that the protein AFV1p06 binds zinc and solved its solution structure by NMR. AFV1p06 displays a zinc finger fold with a novel structure extension and disordered N- and C-termini. Structure calculations show that a glutamic acid residue that coordinates zinc replaces the fourth histidine of the C2H2 motif. Electromobility gel shift assays indicate that the protein binds to DNA with different affinities depending on the DNA sequence. AFV1p06 is the first experimentally characterised archaeal zinc finger protein with a DNA binding activity. The AFV1p06 protein family has homologues in diverse viruses of hyperthermophilic archaea. A phylogenetic analysis points out a common origin of archaeal and eukaryotic C2H2 zinc fingers.

  6. Structure solution of DNA-binding proteins and complexes with ARCIMBOLDO libraries

    Energy Technology Data Exchange (ETDEWEB)

    Pröpper, Kevin [University of Göttingen, (Germany); Instituto de Biologia Molecular de Barcelona (IBMB-CSIC), (Spain); Meindl, Kathrin; Sammito, Massimo [Instituto de Biologia Molecular de Barcelona (IBMB-CSIC), (Spain); Dittrich, Birger; Sheldrick, George M. [University of Göttingen, (Germany); Pohl, Ehmke, E-mail: ehmke.pohl@durham.ac.uk [Durham University, (United Kingdom); Usón, Isabel, E-mail: ehmke.pohl@durham.ac.uk [Instituto de Biologia Molecular de Barcelona (IBMB-CSIC), (Spain); Institucio Catalana de Recerca i Estudis Avancats (ICREA), (Spain); University of Göttingen, (Germany)

    2014-06-01

    The structure solution of DNA-binding protein structures and complexes based on the combination of location of DNA-binding protein motif fragments with density modification in a multi-solution frame is described. Protein–DNA interactions play a major role in all aspects of genetic activity within an organism, such as transcription, packaging, rearrangement, replication and repair. The molecular detail of protein–DNA interactions can be best visualized through crystallography, and structures emphasizing insight into the principles of binding and base-sequence recognition are essential to understanding the subtleties of the underlying mechanisms. An increasing number of high-quality DNA-binding protein structure determinations have been witnessed despite the fact that the crystallographic particularities of nucleic acids tend to pose specific challenges to methods primarily developed for proteins. Crystallographic structure solution of protein–DNA complexes therefore remains a challenging area that is in need of optimized experimental and computational methods. The potential of the structure-solution program ARCIMBOLDO for the solution of protein–DNA complexes has therefore been assessed. The method is based on the combination of locating small, very accurate fragments using the program Phaser and density modification with the program SHELXE. Whereas for typical proteins main-chain α-helices provide the ideal, almost ubiquitous, small fragments to start searches, in the case of DNA complexes the binding motifs and DNA double helix constitute suitable search fragments. The aim of this work is to provide an effective library of search fragments as well as to determine the optimal ARCIMBOLDO strategy for the solution of this class of structures.

  7. Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach

    DEFF Research Database (Denmark)

    Buus, S.; Lauemoller, S.L.; Worning, Peder

    2003-01-01

    We have generated Artificial Neural Networks (ANN) capable of performing sensitive, quantitative predictions of peptide binding to the MHC class I molecule, HLA-A*0204. We have shown that such quantitative ANN are superior to conventional classification ANN, that have been trained to predict bind...... of an iterative feedback loop whereby advanced, computational bioinformatics optimize experimental strategy, and vice versa....

  8. Molecular simulations and Markov state modeling reveal the structural diversity and dynamics of a theophylline-binding RNA aptamer in its unbound state.

    Directory of Open Access Journals (Sweden)

    Becka M Warfield

    Full Text Available RNA aptamers are oligonucleotides that bind with high specificity and affinity to target ligands. In the absence of bound ligand, secondary structures of RNA aptamers are generally stable, but single-stranded and loop regions, including ligand binding sites, lack defined structures and exist as ensembles of conformations. For example, the well-characterized theophylline-binding aptamer forms a highly stable binding site when bound to theophylline, but the binding site is unstable and disordered when theophylline is absent. Experimental methods have not revealed at atomic resolution the conformations that the theophylline aptamer explores in its unbound state. Consequently, in the present study we applied 21 microseconds of molecular dynamics simulations to structurally characterize the ensemble of conformations that the aptamer adopts in the absence of theophylline. Moreover, we apply Markov state modeling to predict the kinetics of transitions between unbound conformational states. Our simulation results agree with experimental observations that the theophylline binding site is found in many distinct binding-incompetent states and show that these states lack a binding pocket that can accommodate theophylline. The binding-incompetent states interconvert with binding-competent states through structural rearrangement of the binding site on the nanosecond to microsecond timescale. Moreover, we have simulated the complete theophylline binding pathway. Our binding simulations supplement prior experimental observations of slow theophylline binding kinetics by showing that the binding site must undergo a large conformational rearrangement after the aptamer and theophylline form an initial complex, most notably, a major rearrangement of the C27 base from a buried to solvent-exposed orientation. Theophylline appears to bind by a combination of conformational selection and induced fit mechanisms. Finally, our modeling indicates that when Mg2+ ions are

  9. Improved methods for predicting peptide binding affinity to MHC class II molecules.

    Science.gov (United States)

    Jensen, Kamilla Kjaergaard; Andreatta, Massimo; Marcatili, Paolo; Buus, Søren; Greenbaum, Jason A; Yan, Zhen; Sette, Alessandro; Peters, Bjoern; Nielsen, Morten

    2018-01-06

    Major histocompatibility complex class II (MHC-II) molecules are expressed on the surface of professional antigen-presenting cells where they display peptides to T helper cells, which orchestrate the onset and outcome of many host immune responses. Understanding which peptides will be presented by the MHC-II molecule is therefore important for understanding the activation of T helper cells and can be used to identify T-cell epitopes. We here present updated versions of two MHC-II-peptide binding affinity prediction methods, NetMHCII and NetMHCIIpan. These were constructed using an extended data set of quantitative MHC-peptide binding affinity data obtained from the Immune Epitope Database covering HLA-DR, HLA-DQ, HLA-DP and H-2 mouse molecules. We show that training with this extended data set improved the performance for peptide binding predictions for both methods. Both methods are publicly available at www.cbs.dtu.dk/services/NetMHCII-2.3 and www.cbs.dtu.dk/services/NetMHCIIpan-3.2. © 2018 John Wiley & Sons Ltd.

  10. Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

    Directory of Open Access Journals (Sweden)

    Bradley Michael E

    2006-02-01

    Full Text Available Abstract Background When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. Results The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1 multiple sequence alignments, 2 mapping of alignment sites to crystal structure sites, 3 phylogenetic trees, 4 inferred ancestral sequences at internal tree nodes, and 5 amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. Conclusion We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural

  11. A Structural Model for Binding of the Serine-Rich Repeat Adhesin GspB to Host Carbohydrate Receptors

    Energy Technology Data Exchange (ETDEWEB)

    Pyburn, Tasia M.; Bensing, Barbara A.; Xiong, Yan Q.; Melancon, Bruce J.; Tomasiak, Thomas M.; Ward, Nicholas J.; Yankovskaya, Victoria; Oliver, Kevin M.; Cecchini, Gary; Sulikowski, Gary A.; Tyska, Matthew J.; Sullam, Paul M.; Iverson, T.M. (VA); (UCLA); (Vanderbilt); (UCSF)

    2014-10-02

    GspB is a serine-rich repeat (SRR) adhesin of Streptococcus gordonii that mediates binding of this organism to human platelets via its interaction with sialyl-T antigen on the receptor GPIb{alpha}. This interaction appears to be a major virulence determinant in the pathogenesis of infective endocarditis. To address the mechanism by which GspB recognizes its carbohydrate ligand, we determined the high-resolution x-ray crystal structure of the GspB binding region (GspB{sub BR}), both alone and in complex with a disaccharide precursor to sialyl-T antigen. Analysis of the GspB{sub BR} structure revealed that it is comprised of three independently folded subdomains or modules: (1) an Ig-fold resembling a CnaA domain from prokaryotic pathogens; (2) a second Ig-fold resembling the binding region of mammalian Siglecs; (3) a subdomain of unique fold. The disaccharide was found to bind in a pocket within the Siglec subdomain, but at a site distinct from that observed in mammalian Siglecs. Confirming the biological relevance of this binding pocket, we produced three isogenic variants of S. gordonii, each containing a single point mutation of a residue lining this binding pocket. These variants have reduced binding to carbohydrates of GPIb{alpha}. Further examination of purified GspB{sub BR}-R484E showed reduced binding to sialyl-T antigen while S. gordonii harboring this mutation did not efficiently bind platelets and showed a significant reduction in virulence, as measured by an animal model of endocarditis. Analysis of other SRR proteins revealed that the predicted binding regions of these adhesins also had a modular organization, with those known to bind carbohydrate receptors having modules homologous to the Siglec and Unique subdomains of GspBBR. This suggests that the binding specificity of the SRR family of adhesins is determined by the type and organization of discrete modules within the binding domains, which may affect the tropism of organisms for different tissues.

  12. Linguistic Structure Prediction

    CERN Document Server

    Smith, Noah A

    2011-01-01

    A major part of natural language processing now depends on the use of text data to build linguistic analyzers. We consider statistical, computational approaches to modeling linguistic structure. We seek to unify across many approaches and many kinds of linguistic structures. Assuming a basic understanding of natural language processing and/or machine learning, we seek to bridge the gap between the two fields. Approaches to decoding (i.e., carrying out linguistic structure prediction) and supervised and unsupervised learning of models that predict discrete structures as outputs are the focus. W

  13. Large scale free energy calculations for blind predictions of protein-ligand binding: the D3R Grand Challenge 2015.

    Science.gov (United States)

    Deng, Nanjie; Flynn, William F; Xia, Junchao; Vijayan, R S K; Zhang, Baofeng; He, Peng; Mentes, Ahmet; Gallicchio, Emilio; Levy, Ronald M

    2016-09-01

    We describe binding free energy calculations in the D3R Grand Challenge 2015 for blind prediction of the binding affinities of 180 ligands to Hsp90. The present D3R challenge was built around experimental datasets involving Heat shock protein (Hsp) 90, an ATP-dependent molecular chaperone which is an important anticancer drug target. The Hsp90 ATP binding site is known to be a challenging target for accurate calculations of ligand binding affinities because of the ligand-dependent conformational changes in the binding site, the presence of ordered waters and the broad chemical diversity of ligands that can bind at this site. Our primary focus here is to distinguish binders from nonbinders. Large scale absolute binding free energy calculations that cover over 3000 protein-ligand complexes were performed using the BEDAM method starting from docked structures generated by Glide docking. Although the ligand dataset in this study resembles an intermediate to late stage lead optimization project while the BEDAM method is mainly developed for early stage virtual screening of hit molecules, the BEDAM binding free energy scoring has resulted in a moderate enrichment of ligand screening against this challenging drug target. Results show that, using a statistical mechanics based free energy method like BEDAM starting from docked poses offers better enrichment than classical docking scoring functions and rescoring methods like Prime MM-GBSA for the Hsp90 data set in this blind challenge. Importantly, among the three methods tested here, only the mean value of the BEDAM binding free energy scores is able to separate the large group of binders from the small group of nonbinders with a gap of 2.4 kcal/mol. None of the three methods that we have tested provided accurate ranking of the affinities of the 147 active compounds. We discuss the possible sources of errors in the binding free energy calculations. The study suggests that BEDAM can be used strategically to discriminate

  14. Validation of tautomeric and protomeric binding modes by free energy calculations. A case study for the structure based optimization of d-amino acid oxidase inhibitors

    Science.gov (United States)

    Orgován, Zoltán; Ferenczy, György G.; Steinbrecher, Thomas; Szilágyi, Bence; Bajusz, Dávid; Keserű, György M.

    2018-02-01

    Optimization of fragment size d-amino acid oxidase (DAAO) inhibitors was investigated using a combination of computational and experimental methods. Retrospective free energy perturbation (FEP) calculations were performed for benzo[d]isoxazole derivatives, a series of known inhibitors with two potential binding modes derived from X-ray structures of other DAAO inhibitors. The good agreement between experimental and computed binding free energies in only one of the hypothesized binding modes strongly support this bioactive conformation. Then, a series of 1-H-indazol-3-ol derivatives formerly not described as DAAO inhibitors was investigated. Binding geometries could be reliably identified by structural similarity to benzo[d]isoxazole and other well characterized series and FEP calculations were performed for several tautomers of the deprotonated and protonated compounds since all these forms are potentially present owing to the experimental pKa values of representative compounds in the series. Deprotonated compounds are proposed to be the most important bound species owing to the significantly better agreement between their calculated and measured affinities compared to the protonated forms. FEP calculations were also used for the prediction of the affinities of compounds not previously tested as DAAO inhibitors and for a comparative structure-activity relationship study of the benzo[d]isoxazole and indazole series. Selected indazole derivatives were synthesized and their measured binding affinity towards DAAO was in good agreement with FEP predictions.

  15. Expression, purification, crystallization and structure of human adipocyte lipid-binding protein (aP2)

    International Nuclear Information System (INIS)

    Marr, Eric; Tardie, Mark; Carty, Maynard; Brown Phillips, Tracy; Wang, Ing-Kae; Soeller, Walt; Qiu, Xiayang; Karam, George

    2006-01-01

    The crystal structure of human adipocyte lipid-binding protein (aP2) with a bound palmitate is reported at 1.5 Å resolution. Human adipocyte lipid-binding protein (aP2) belongs to a family of intracellular lipid-binding proteins involved in the transport and storage of lipids. Here, the crystal structure of human aP2 with a bound palmitate is described at 1.5 Å resolution. Unlike the known crystal structure of murine aP2 in complex with palmitate, this structure shows that the fatty acid is in a folded conformation and that the loop containing Phe57 acts as a lid to regulate ligand binding by excluding solvent exposure to the central binding cavity

  16. Dynamic fluctuations provide the basis of a conformational switch mechanism in apo cyclic AMP receptor protein.

    Directory of Open Access Journals (Sweden)

    Burcu Aykaç Fas

    Full Text Available Escherichia coli cyclic AMP Receptor Protein (CRP undergoes conformational changes with cAMP binding and allosterically promotes CRP to bind specifically to the DNA. In that, the structural and dynamic properties of apo CRP prior to cAMP binding are of interest for the comprehension of the activation mechanism. Here, the dynamics of apo CRP monomer/dimer and holo CRP dimer were studied by Molecular Dynamics (MD simulations and Gaussian Network Model (GNM. The interplay of the inter-domain hinge with the cAMP and DNA binding domains are pre-disposed in the apo state as a conformational switch in the CRP's allosteric communication mechanism. The hinge at L134-D138 displaying intra- and inter-subunit coupled fluctuations with the cAMP and DNA binding domains leads to the emergence of stronger coupled fluctuations between the two domains and describes an on state. The flexible regions at K52-E58, P154/D155 and I175 maintain the dynamic coupling of the two domains. With a shift in the inter-domain hinge position towards the N terminus, nevertheless, the latter correlations between the domains loosen and become disordered; L134-D138 dynamically interacts only with the cAMP and DNA binding domains of its own subunit, and an off state is assumed. We present a mechanistic view on how the structural dynamic units are hierarchically built for the allosteric functional mechanism; from apo CRP monomer to apo-to-holo CRP dimers.

  17. Probing the structural basis of oxygen binding in a cofactor-independent dioxygenase.

    Science.gov (United States)

    Li, Kunhua; Fielding, Elisha N; Condurso, Heather L; Bruner, Steven D

    2017-07-01

    The enzyme DpgC is included in the small family of cofactor-independent dioxygenases. The chemistry of DpgC is uncommon as the protein binds and utilizes dioxygen without the aid of a metal or organic cofactor. Previous structural and biochemical studies identified the substrate-binding mode and the components of the active site that are important in the catalytic mechanism. In addition, the results delineated a putative binding pocket and migration pathway for the co-substrate dioxygen. Here, structural biology is utilized, along with site-directed mutagenesis, to probe the assigned dioxygen-binding pocket. The key residues implicated in dioxygen trafficking were studied to probe the process of binding, activation and chemistry. The results support the proposed chemistry and provide insight into the general mechanism of dioxygen binding and activation.

  18. Nucleotide sequence of Phaseolus vulgaris L. alcohol dehydrogenase encoding cDNA and three-dimensional structure prediction of the deduced protein.

    Science.gov (United States)

    Amelia, Kassim; Khor, Chin Yin; Shah, Farida Habib; Bhore, Subhash J

    2015-01-01

    Common beans (Phaseolus vulgaris L.) are widely consumed as a source of proteins and natural products. However, its yield needs to be increased. In line with the agenda of Phaseomics (an international consortium), work of expressed sequence tags (ESTs) generation from bean pods was initiated. Altogether, 5972 ESTs have been isolated. Alcohol dehydrogenase (AD) encoding gene cDNA was a noticeable transcript among the generated ESTs. This AD is an important enzyme; therefore, to understand more about it this study was undertaken. The objective of this study was to elucidate P. vulgaris L. AD (PvAD) gene cDNA sequence and to predict the three-dimensional (3D) structure of deduced protein. positive and negative strands of the PvAD cDNA clone were sequenced using M13 forward and M13 reverse primers to elucidate the nucleotide sequence. Deduced PvAD cDNA and protein sequence was analyzed for their basic features using online bioinformatics tools. Sequence comparison was carried out using bl2seq program, and tree-view program was used to construct a phylogenetic tree. The secondary structures and 3D structure of PvAD protein were predicted by using the PHYRE automatic fold recognition server. The sequencing results analysis showed that PvAD cDNA is 1294 bp in length. It's open reading frame encodes for a protein that contains 371 amino acids. Deduced protein sequence analysis showed the presence of putative substrate binding, catalytic Zn binding, and NAD binding sites. Results indicate that the predicted 3D structure of PvAD protein is analogous to the experimentally determined crystal structure of s-nitrosoglutathione reductase from an Arabidopsis species. The 1294 bp long PvAD cDNA encodes for 371 amino acid long protein that contains conserved domains required for biological functions of AD. The predicted deduced PvAD protein's 3D structure reflects the analogy with the crystal structure of Arabidopsis thaliana s-nitrosoglutathione reductase. Further study is required

  19. Phosphorus Binding Sites in Proteins: Structural Preorganization and Coordination

    DEFF Research Database (Denmark)

    Gruber, Mathias Felix; Greisen, Per Junior; Junker, Märta Caroline

    2014-01-01

    to individual structures that bind to phosphate groups; here, we investigate a total of 8307 structures obtained from the RCSB Protein Data Bank (PDB). An analysis of the binding site amino acid propensities reveals very characteristic first shell residue distributions, which are found to be influenced...... by the characteristics of the phosphorus compound and by the presence of cobound cations. The second shell, which supports the coordinating residues in the first shell, is found to consist mainly of protein backbone groups. Our results show how the second shell residue distribution is dictated mainly by the first shell...

  20. An Augmented Pocketome: Detection and Analysis of Small-Molecule Binding Pockets in Proteins of Known 3D Structure.

    Science.gov (United States)

    Bhagavat, Raghu; Sankar, Santhosh; Srinivasan, Narayanaswamy; Chandra, Nagasuma

    2018-03-06

    Protein-ligand interactions form the basis of most cellular events. Identifying ligand binding pockets in proteins will greatly facilitate rationalizing and predicting protein function. Ligand binding sites are unknown for many proteins of known three-dimensional (3D) structure, creating a gap in our understanding of protein structure-function relationships. To bridge this gap, we detect pockets in proteins of known 3D structures, using computational techniques. This augmented pocketome (PocketDB) consists of 249,096 pockets, which is about seven times larger than what is currently known. We deduce possible ligand associations for about 46% of the newly identified pockets. The augmented pocketome, when subjected to clustering based on similarities among pockets, yielded 2,161 site types, which are associated with 1,037 ligand types, together providing fold-site-type-ligand-type associations. The PocketDB resource facilitates a structure-based function annotation, delineation of the structural basis of ligand recognition, and provides functional clues for domains of unknown functions, allosteric proteins, and druggable pockets. Copyright © 2018 Elsevier Ltd. All rights reserved.

  1. Structural determination of functional units of the nucleotide binding domain (NBD94 of the reticulocyte binding protein Py235 of Plasmodium yoelii.

    Directory of Open Access Journals (Sweden)

    Ardina Grüber

    2010-02-01

    Full Text Available Invasion of the red blood cells (RBC by the merozoite of malaria parasites involves a large number of receptor ligand interactions. The reticulocyte binding protein homologue family (RH plays an important role in erythrocyte recognition as well as virulence. Recently, it has been shown that members of RH in addition to receptor binding may also have a role as ATP/ADP sensor. A 94 kDa region named Nucleotide-Binding Domain 94 (NBD94 of Plasmodium yoelii YM, representative of the putative nucleotide binding region of RH, has been demonstrated to bind ATP and ADP selectively. Binding of ATP or ADP induced nucleotide-dependent structural changes in the C-terminal hinge-region of NBD94, and directly impacted on the RBC binding ability of RH.In order to find the smallest structural unit, able to bind nucleotides, and its coupling module, the hinge region, three truncated domains of NBD94 have been generated, termed NBD94(444-547, NBD94(566-663 and NBD94(674-793, respectively. Using fluorescence correlation spectroscopy NBD94(444-547 has been identified to form the smallest nucleotide binding segment, sensitive for ATP and ADP, which became inhibited by 4-Chloro-7-nitrobenzofurazan. The shape of NBD94(444-547 in solution was calculated from small-angle X-ray scattering data, revealing an elongated molecule, comprised of two globular domains, connected by a spiral segment of about 73.1 A in length. The high quality of the constructs, forming the hinge-region, NBD94(566-663 and NBD94(674-793 enabled to determine the first crystallographic and solution structure, respectively. The crystal structure of NBD94(566-663 consists of two helices with 97.8 A and 48.6 A in length, linked by a loop. By comparison, the low resolution structure of NBD94(674-793 in solution represents a chair-like shape with three architectural segments.These structures give the first insight into how nucleotide binding impacts on the overall structure of RH and demonstrates the

  2. Structural Changes of Creatine Kinase upon Substrate Binding

    OpenAIRE

    Forstner, Michael; Kriechbaum, Manfred; Laggner, Peter; Wallimann, Theo

    1998-01-01

    Small-angle x-ray scattering was used to investigate structural changes upon binding of individual substrates or a transition state analog complex (TSAC; Mg-ADP, creatine, and KNO3) to creatine kinase (CK) isoenzymes (dimeric muscle-type (M)-CK and octameric mitochondrial (Mi)-CK) and monomeric arginine kinase (AK). Considerable changes in the shape and the size of the molecules occurred upon binding of Mg-nucleotide or TSAC. The radius of gyration of Mi-CK was reduced from 55.6 A (free enzym...

  3. Ensemble Architecture for Prediction of Enzyme-ligand Binding Residues Using Evolutionary Information.

    Science.gov (United States)

    Pai, Priyadarshini P; Dattatreya, Rohit Kadam; Mondal, Sukanta

    2017-11-01

    Enzyme interactions with ligands are crucial for various biochemical reactions governing life. Over many years attempts to identify these residues for biotechnological manipulations have been made using experimental and computational techniques. The computational approaches have gathered impetus with the accruing availability of sequence and structure information, broadly classified into template-based and de novo methods. One of the predominant de novo methods using sequence information involves application of biological properties for supervised machine learning. Here, we propose a support vector machines-based ensemble for prediction of protein-ligand interacting residues using one of the most important discriminative contributing properties in the interacting residue neighbourhood, i. e., evolutionary information in the form of position-specific- scoring matrix (PSSM). The study has been performed on a non-redundant dataset comprising of 9269 interacting and 91773 non-interacting residues for prediction model generation and further evaluation. Of the various PSSM-based models explored, the proposed method named ROBBY (pRediction Of Biologically relevant small molecule Binding residues on enzYmes) shows an accuracy of 84.0 %, Matthews Correlation Coefficient of 0.343 and F-measure of 39.0 % on 78 test enzymes. Further, scope of adding domain knowledge such as pocket information has also been investigated; results showed significant enhancement in method precision. Findings are hoped to boost the reliability of small-molecule ligand interaction prediction for enzyme applications and drug design. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Structural Basis for Sialoglycan Binding by the Streptococcus sanguinis SrpA Adhesin.

    Science.gov (United States)

    Bensing, Barbara A; Loukachevitch, Lioudmila V; McCulloch, Kathryn M; Yu, Hai; Vann, Kendra R; Wawrzak, Zdzislaw; Anderson, Spencer; Chen, Xi; Sullam, Paul M; Iverson, T M

    2016-04-01

    Streptococcus sanguinisis a leading cause of infective endocarditis, a life-threatening infection of the cardiovascular system. An important interaction in the pathogenesis of infective endocarditis is attachment of the organisms to host platelets.S. sanguinisexpresses a serine-rich repeat adhesin, SrpA, similar in sequence to platelet-binding adhesins associated with increased virulence in this disease. In this study, we determined the first crystal structure of the putative binding region of SrpA (SrpABR) both unliganded and in complex with a synthetic disaccharide ligand at 1.8 and 2.0 Å resolution, respectively. We identified a conserved Thr-Arg motif that orients the sialic acid moiety and is required for binding to platelet monolayers. Furthermore, we propose that sequence insertions in closely related family members contribute to the modulation of structural and functional properties, including the quaternary structure, the tertiary structure, and the ligand-binding site. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Insight into mitochondrial structure and function from electron tomography.

    Science.gov (United States)

    Frey, T G; Renken, C W; Perkins, G A

    2002-09-10

    In recent years, electron tomography has provided detailed three-dimensional models of mitochondria that have redefined our concept of mitochondrial structure. The models reveal an inner membrane consisting of two components, the inner boundary membrane (IBM) closely apposed to the outer membrane and the cristae membrane that projects into the matrix compartment. These two components are connected by tubular structures of relatively uniform size called crista junctions. The distribution of crista junction sizes and shapes is predicted by a thermodynamic model based upon the energy of membrane bending, but proteins likely also play a role in determining the conformation of the inner membrane. Results of structural studies of mitochondria during apoptosis demonstrate that cytochrome c is released without detectable disruption of the outer membrane or extensive swelling of the mitochondrial matrix, suggesting the formation of an outer membrane pore large enough to allow passage of holo-cytochrome c. The possible compartmentation of inner membrane function between the IBM and the cristae membrane is also discussed.

  6. Six independent fucose-binding sites in the crystal structure of Aspergillus oryzae lectin

    Energy Technology Data Exchange (ETDEWEB)

    Makyio, Hisayoshi [Structural Biology Research Center, Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki, 305-0801 (Japan); Shimabukuro, Junpei; Suzuki, Tatsuya [Department of Applied Bioorganic Chemistry, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu 501-1193 (Japan); Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Yoshida Ushinomiya-cho, Sakyo-ku, Kyoto 606-8501 (Japan); Imamura, Akihiro; Ishida, Hideharu [Department of Applied Bioorganic Chemistry, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu 501-1193 (Japan); Kiso, Makoto [Department of Applied Bioorganic Chemistry, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu 501-1193 (Japan); Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Yoshida Ushinomiya-cho, Sakyo-ku, Kyoto 606-8501 (Japan); Ando, Hiromune, E-mail: hando@gifu-u.ac.jp [Department of Applied Bioorganic Chemistry, Gifu University, 1-1 Yanagido, Gifu-shi, Gifu 501-1193 (Japan); Institute for Integrated Cell-Material Sciences (WPI-iCeMS), Kyoto University, Yoshida Ushinomiya-cho, Sakyo-ku, Kyoto 606-8501 (Japan); Kato, Ryuichi, E-mail: ryuichi.kato@kek.jp [Structural Biology Research Center, Photon Factory, Institute of Materials Structure Science, High Energy Accelerator Research Organization (KEK), 1-1 Oho, Tsukuba, Ibaraki, 305-0801 (Japan)

    2016-08-26

    The crystal structure of AOL (a fucose-specific lectin of Aspergillus oryzae) has been solved by SAD (single-wavelength anomalous diffraction) and MAD (multi-wavelength anomalous diffraction) phasing of seleno-fucosides. The overall structure is a six-bladed β-propeller similar to that of other fucose-specific lectins. The fucose moieties of the seleno-fucosides are located in six fucose-binding sites. Although the Arg and Glu/Gln residues bound to the fucose moiety are common to all fucose-binding sites, the amino-acid residues involved in fucose binding at each site are not identical. The varying peak heights of the seleniums in the electron density map suggest that each fucose-binding site has a different carbohydrate binding affinity. - Highlights: • The six-bladed β-propeller structure of AOL was solved by seleno-sugar phasing. • The mode of fucose binding is essentially conserved at all six binding sites. • The seleno-fucosides exhibit slightly different interactions and electron densities. • These findings suggest that the affinity for fucose is not identical at each site.

  7. Six independent fucose-binding sites in the crystal structure of Aspergillus oryzae lectin

    International Nuclear Information System (INIS)

    Makyio, Hisayoshi; Shimabukuro, Junpei; Suzuki, Tatsuya; Imamura, Akihiro; Ishida, Hideharu; Kiso, Makoto; Ando, Hiromune; Kato, Ryuichi

    2016-01-01

    The crystal structure of AOL (a fucose-specific lectin of Aspergillus oryzae) has been solved by SAD (single-wavelength anomalous diffraction) and MAD (multi-wavelength anomalous diffraction) phasing of seleno-fucosides. The overall structure is a six-bladed β-propeller similar to that of other fucose-specific lectins. The fucose moieties of the seleno-fucosides are located in six fucose-binding sites. Although the Arg and Glu/Gln residues bound to the fucose moiety are common to all fucose-binding sites, the amino-acid residues involved in fucose binding at each site are not identical. The varying peak heights of the seleniums in the electron density map suggest that each fucose-binding site has a different carbohydrate binding affinity. - Highlights: • The six-bladed β-propeller structure of AOL was solved by seleno-sugar phasing. • The mode of fucose binding is essentially conserved at all six binding sites. • The seleno-fucosides exhibit slightly different interactions and electron densities. • These findings suggest that the affinity for fucose is not identical at each site.

  8. UPF201 Archaeal Specific Family Members Reveals Structural Similarity to RNA-Binding Proteins but Low Likelihood for RNA-Binding Function

    Energy Technology Data Exchange (ETDEWEB)

    Rao, K.N.; Swaminathan, S.; Burley, S. K.

    2008-12-11

    We have determined X-ray crystal structures of four members of an archaeal specific family of proteins of unknown function (UPF0201; Pfam classification: DUF54) to advance our understanding of the genetic repertoire of archaea. Despite low pairwise amino acid sequence identities (10-40%) and the absence of conserved sequence motifs, the three-dimensional structures of these proteins are remarkably similar to one another. Their common polypeptide chain fold, encompassing a five-stranded antiparallel {beta}-sheet and five {alpha}-helices, proved to be quite unexpectedly similar to that of the RRM-type RNA-binding domain of the ribosomal L5 protein, which is responsible for binding the 5S- rRNA. Structure-based sequence alignments enabled construction of a phylogenetic tree relating UPF0201 family members to L5 ribosomal proteins and other structurally similar RNA binding proteins, thereby expanding our understanding of the evolutionary purview of the RRM superfamily. Analyses of the surfaces of these newly determined UPF0201 structures suggest that they probably do not function as RNA binding proteins, and that this domain specific family of proteins has acquired a novel function in archaebacteria, which awaits experimental elucidation.

  9. Structural and functional characterization of solute binding proteins for aromatic compounds derived from lignin: p-coumaric acid and related aromatic acids.

    Science.gov (United States)

    Tan, Kemin; Chang, Changsoo; Cuff, Marianne; Osipiuk, Jerzy; Landorf, Elizabeth; Mack, Jamey C; Zerbs, Sarah; Joachimiak, Andrzej; Collart, Frank R

    2013-10-01

    Lignin comprises 15-25% of plant biomass and represents a major environmental carbon source for utilization by soil microorganisms. Access to this energy resource requires the action of fungal and bacterial enzymes to break down the lignin polymer into a complex assortment of aromatic compounds that can be transported into the cells. To improve our understanding of the utilization of lignin by microorganisms, we characterized the molecular properties of solute binding proteins of ATP-binding cassette transporter proteins that interact with these compounds. A combination of functional screens and structural studies characterized the binding specificity of the solute binding proteins for aromatic compounds derived from lignin such as p-coumarate, 3-phenylpropionic acid and compounds with more complex ring substitutions. A ligand screen based on thermal stabilization identified several binding protein clusters that exhibit preferences based on the size or number of aromatic ring substituents. Multiple X-ray crystal structures of protein-ligand complexes for these clusters identified the molecular basis of the binding specificity for the lignin-derived aromatic compounds. The screens and structural data provide new functional assignments for these solute-binding proteins which can be used to infer their transport specificity. This knowledge of the functional roles and molecular binding specificity of these proteins will support the identification of the specific enzymes and regulatory proteins of peripheral pathways that funnel these compounds to central metabolic pathways and will improve the predictive power of sequence-based functional annotation methods for this family of proteins. Copyright © 2013 Wiley Periodicals, Inc.

  10. The development of real-time stability supports visual working memory performance: Young children's feature binding can be improved through perceptual structure.

    Science.gov (United States)

    Simmering, Vanessa R; Wood, Chelsey M

    2017-08-01

    Working memory is a basic cognitive process that predicts higher-level skills. A central question in theories of working memory development is the generality of the mechanisms proposed to explain improvements in performance. Prior theories have been closely tied to particular tasks and/or age groups, limiting their generalizability. The cognitive dynamics theory of visual working memory development has been proposed to overcome this limitation. From this perspective, developmental improvements arise through the coordination of cognitive processes to meet demands of different behavioral tasks. This notion is described as real-time stability, and can be probed through experiments that assess how changing task demands impact children's performance. The current studies test this account by probing visual working memory for colors and shapes in a change detection task that compares detection of changes to new features versus swaps in color-shape binding. In Experiment 1, 3- to 4-year-old children showed impairments specific to binding swaps, as predicted by decreased real-time stability early in development; 5- to 6-year-old children showed a slight advantage on binding swaps, but 7- to 8-year-old children and adults showed no difference across trial types. Experiment 2 tested the proposed explanation of young children's binding impairment through added perceptual structure, which supported the stability and precision of feature localization in memory-a process key to detecting binding swaps. This additional structure improved young children's binding swap detection, but not new-feature detection or adults' performance. These results provide further evidence for the cognitive dynamics and real-time stability explanation of visual working memory development. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  11. Neural Networks for protein Structure Prediction

    DEFF Research Database (Denmark)

    Bohr, Henrik

    1998-01-01

    This is a review about neural network applications in bioinformatics. Especially the applications to protein structure prediction, e.g. prediction of secondary structures, prediction of surface structure, fold class recognition and prediction of the 3-dimensional structure of protein backbones...

  12. Structure and ligand-binding properties of the biogenic amine-binding protein from the saliva of a blood-feeding insect vector of Trypanosoma cruzi

    Energy Technology Data Exchange (ETDEWEB)

    Xu, Xueqing; Chang, Bianca W. [NIH/NIAID, 12735 Twinbrook Parkway, Rockville, MD 20852 (United States); Mans, Ben J. [NIH/NIAID, 12735 Twinbrook Parkway, Rockville, MD 20852 (United States); Agricultural Research Council, Onderstepoort 0110 (South Africa); Ribeiro, Jose M. C.; Andersen, John F., E-mail: jandersen@niaid.nih.gov [NIH/NIAID, 12735 Twinbrook Parkway, Rockville, MD 20852 (United States)

    2013-01-01

    Biogenic amine-binding proteins mediate the anti-inflammatory and antihemostatic activities of blood-feeding insect saliva. The structure of the amine-binding protein from R. prolixus reveals the interaction of biogenic amine ligands with the protein. Proteins that bind small-molecule mediators of inflammation and hemostasis are essential for blood-feeding by arthropod vectors of infectious disease. In ticks and triatomine insects, the lipocalin protein family is greatly expanded and members have been shown to bind biogenic amines, eicosanoids and ADP. These compounds are potent mediators of platelet activation, inflammation and vascular tone. In this paper, the structure of the amine-binding protein (ABP) from Rhodnius prolixus, a vector of the trypanosome that causes Chagas disease, is described. ABP binds the biogenic amines serotonin and norepinephrine with high affinity. A complex with tryptamine shows the presence of a binding site for a single ligand molecule in the central cavity of the β-barrel structure. The cavity contains significant additional volume, suggesting that this protein may have evolved from the related nitrophorin proteins, which bind a much larger heme ligand in the central cavity.

  13. Structure and ligand-binding properties of the biogenic amine-binding protein from the saliva of a blood-feeding insect vector of Trypanosoma cruzi

    International Nuclear Information System (INIS)

    Xu, Xueqing; Chang, Bianca W.; Mans, Ben J.; Ribeiro, Jose M. C.; Andersen, John F.

    2013-01-01

    Biogenic amine-binding proteins mediate the anti-inflammatory and antihemostatic activities of blood-feeding insect saliva. The structure of the amine-binding protein from R. prolixus reveals the interaction of biogenic amine ligands with the protein. Proteins that bind small-molecule mediators of inflammation and hemostasis are essential for blood-feeding by arthropod vectors of infectious disease. In ticks and triatomine insects, the lipocalin protein family is greatly expanded and members have been shown to bind biogenic amines, eicosanoids and ADP. These compounds are potent mediators of platelet activation, inflammation and vascular tone. In this paper, the structure of the amine-binding protein (ABP) from Rhodnius prolixus, a vector of the trypanosome that causes Chagas disease, is described. ABP binds the biogenic amines serotonin and norepinephrine with high affinity. A complex with tryptamine shows the presence of a binding site for a single ligand molecule in the central cavity of the β-barrel structure. The cavity contains significant additional volume, suggesting that this protein may have evolved from the related nitrophorin proteins, which bind a much larger heme ligand in the central cavity

  14. SCOWLP classification: Structural comparison and analysis of protein binding regions

    Directory of Open Access Journals (Sweden)

    Anders Gerd

    2008-01-01

    Full Text Available Abstract Background Detailed information about protein interactions is critical for our understanding of the principles governing protein recognition mechanisms. The structures of many proteins have been experimentally determined in complex with different ligands bound either in the same or different binding regions. Thus, the structural interactome requires the development of tools to classify protein binding regions. A proper classification may provide a general view of the regions that a protein uses to bind others and also facilitate a detailed comparative analysis of the interacting information for specific protein binding regions at atomic level. Such classification might be of potential use for deciphering protein interaction networks, understanding protein function, rational engineering and design. Description Protein binding regions (PBRs might be ideally described as well-defined separated regions that share no interacting residues one another. However, PBRs are often irregular, discontinuous and can share a wide range of interacting residues among them. The criteria to define an individual binding region can be often arbitrary and may differ from other binding regions within a protein family. Therefore, the rational behind protein interface classification should aim to fulfil the requirements of the analysis to be performed. We extract detailed interaction information of protein domains, peptides and interfacial solvent from the SCOWLP database and we classify the PBRs of each domain family. For this purpose, we define a similarity index based on the overlapping of interacting residues mapped in pair-wise structural alignments. We perform our classification with agglomerative hierarchical clustering using the complete-linkage method. Our classification is calculated at different similarity cut-offs to allow flexibility in the analysis of PBRs, feature especially interesting for those protein families with conflictive binding regions

  15. Structural characterization of the binding interactions of various endogenous estrogen metabolites with human estrogen receptor α and β subtypes: a molecular modeling study.

    Directory of Open Access Journals (Sweden)

    Pan Wang

    Full Text Available In the present study, we used the molecular docking approach to study the binding interactions of various derivatives of 17β-estradiol (E2 with human estrogen receptor (ER α and β. First, we determined the suitability of the molecular docking method to correctly predict the binding modes and interactions of two representative agonists (E2 and diethylstilbesterol in the ligand binding domain (LBD of human ERα. We showed that the docked structures of E2 and diethylstilbesterol in the ERα LBD were almost exactly the same as the known crystal structures of ERα in complex with these two estrogens. Using the same docking approach, we then characterized the binding interactions of 27 structurally similar E2 derivatives with the LBDs of human ERα and ERβ. While the binding modes of these E2 derivatives are very similar to that of E2, there are distinct subtle differences, and these small differences contribute importantly to their differential binding affinities for ERs. In the case of A-ring estrogen derivatives, there is a strong inverse relationship between the length of the hydrogen bonds formed with ERs and their binding affinity. We found that a better correlation between the computed binding energy values and the experimentally determined logRBA values could be achieved for various A-ring derivatives by re-adjusting the relative weights of the van der Waals interaction energy and the Coulomb interaction energy in computing the overall binding energy values.

  16. Interleukin-11 binds specific EF-hand proteins via their conserved structural motifs.

    Science.gov (United States)

    Kazakov, Alexei S; Sokolov, Andrei S; Vologzhannikova, Alisa A; Permyakova, Maria E; Khorn, Polina A; Ismailov, Ramis G; Denessiouk, Konstantin A; Denesyuk, Alexander I; Rastrygina, Victoria A; Baksheeva, Viktoriia E; Zernii, Evgeni Yu; Zinchenko, Dmitry V; Glazatov, Vladimir V; Uversky, Vladimir N; Mirzabekov, Tajib A; Permyakov, Eugene A; Permyakov, Sergei E

    2017-01-01

    Interleukin-11 (IL-11) is a hematopoietic cytokine engaged in numerous biological processes and validated as a target for treatment of various cancers. IL-11 contains intrinsically disordered regions that might recognize multiple targets. Recently we found that aside from IL-11RA and gp130 receptors, IL-11 interacts with calcium sensor protein S100P. Strict calcium dependence of this interaction suggests a possibility of IL-11 interaction with other calcium sensor proteins. Here we probed specificity of IL-11 to calcium-binding proteins of various types: calcium sensors of the EF-hand family (calmodulin, S100B and neuronal calcium sensors: recoverin, NCS-1, GCAP-1, GCAP-2), calcium buffers of the EF-hand family (S100G, oncomodulin), and a non-EF-hand calcium buffer (α-lactalbumin). A specific subset of the calcium sensor proteins (calmodulin, S100B, NCS-1, GCAP-1/2) exhibits metal-dependent binding of IL-11 with dissociation constants of 1-19 μM. These proteins share several amino acid residues belonging to conservative structural motifs of the EF-hand proteins, 'black' and 'gray' clusters. Replacements of the respective S100P residues by alanine drastically decrease its affinity to IL-11, suggesting their involvement into the association process. Secondary structure and accessibility of the hinge region of the EF-hand proteins studied are predicted to control specificity and selectivity of their binding to IL-11. The IL-11 interaction with the EF-hand proteins is expected to occur under numerous pathological conditions, accompanied by disintegration of plasma membrane and efflux of cellular components into the extracellular milieu.

  17. Discovery of novel membrane binding structures and functions

    Science.gov (United States)

    Kufareva, Irina; Lenoir, Marc; Dancea, Felician; Sridhar, Pooja; Raush, Eugene; Bissig, Christin; Gruenberg, Jean; Abagyan, Ruben; Overduin, Michael

    2014-01-01

    The function of a protein is determined by its intrinsic activity in the context of its subcellular distribution. Membranes localize proteins within cellular compartments and govern their specific activities. Discovering such membrane-protein interactions is important for understanding biological mechanisms, and could uncover novel sites for therapeutic intervention. Here we present a method for detecting membrane interactive proteins and their exposed residues that insert into lipid bilayers. Although the development process involved analysis of how C1b, C2, ENTH, FYVE, Gla, pleckstrin homology (PH) and PX domains bind membranes, the resulting Membrane Optimal Docking Area (MODA) method yields predictions for a given protein of known three dimensional structures without referring to canonical membrane-targeting modules. This approach was tested on the Arf1 GTPase, ATF2 acetyltransferase, von Willebrand factor A3 domain and Neisseria gonorrhoeae MsrB protein, and further refined with membrane interactive and non-interactive FAPP1 and PKD1 pleckstrin homology domains, respectively. Furthermore we demonstrate how this tool can be used to discover unprecedented membrane binding functions as illustrated by the Bro1 domain of Alix, which was revealed to recognize lysobisphosphatidic acid (LBPA). Validation of novel membrane-protein interactions relies on other techniques such as nuclear magnetic resonance spectroscopy (NMR) which was used here to map the sites of micelle interaction. Together this indicates that genome-wide identification of known and novel membrane interactive proteins and sites is now feasible, and provides a new tool for functional annotation of the proteome. PMID:25394204

  18. Artemin Crystal Structure Reveals Insights into Heparan Sulfate Binding

    Energy Technology Data Exchange (ETDEWEB)

    Silvian,L.; Jin, P.; Carmillo, P.; Boriack-Sjodin, P.; Pelletier, C.; Rushe, M.; Gong, B.; Sah, D.; Pepinsky, B.; Rossomando, A.

    2006-01-01

    Artemin (ART) promotes the growth of developing peripheral neurons by signaling through a multicomponent receptor complex comprised of a transmembrane tyrosine kinase receptor (cRET) and a specific glycosylphosphatidylinositol-linked co-receptor (GFR{alpha}3). Glial cell line-derived neurotrophic factor (GDNF) signals through a similar ternary complex but requires heparan sulfate proteoglycans (HSPGs) for full activity. HSPG has not been demonstrated as a requirement for ART signaling. We crystallized ART in the presence of sulfate and solved its structure by isomorphous replacement. The structure reveals ordered sulfate anions bound to arginine residues in the pre-helix and amino-terminal regions that were organized in a triad arrangement characteristic of heparan sulfate. Three residues in the pre-helix were singly or triply substituted with glutamic acid, and the resulting proteins were shown to have reduced heparin-binding affinity that is partly reflected in their ability to activate cRET. This study suggests that ART binds HSPGs and identifies residues that may be involved in HSPG binding.

  19. Structural basis for the ligand-binding specificity of fatty acid-binding proteins (pFABP4 and pFABP5) in gentoo penguin.

    Science.gov (United States)

    Lee, Chang Woo; Kim, Jung Eun; Do, Hackwon; Kim, Ryeo-Ok; Lee, Sung Gu; Park, Hyun Ho; Chang, Jeong Ho; Yim, Joung Han; Park, Hyun; Kim, Il-Chan; Lee, Jun Hyuck

    2015-09-11

    Fatty acid-binding proteins (FABPs) are involved in transporting hydrophobic fatty acids between various aqueous compartments of the cell by directly binding ligands inside their β-barrel cavities. Here, we report the crystal structures of ligand-unbound pFABP4, linoleate-bound pFABP4, and palmitate-bound pFABP5, obtained from gentoo penguin (Pygoscelis papua), at a resolution of 2.1 Å, 2.2 Å, and 2.3 Å, respectively. The pFABP4 and pFABP5 proteins have a canonical β-barrel structure with two short α-helices that form a cap region and fatty acid ligand binding sites in the hydrophobic cavity within the β-barrel structure. Linoleate-bound pFABP4 and palmitate-bound pFABP5 possess different ligand-binding modes and a unique ligand-binding pocket due to several sequence dissimilarities (A76/L78, T30/M32, underlining indicates pFABP4 residues) between the two proteins. Structural comparison revealed significantly different conformational changes in the β3-β4 loop region (residues 57-62) as well as the flipped Phe60 residue of pFABP5 than that in pFABP4 (the corresponding residue is Phe58). A ligand-binding study using fluorophore displacement assays shows that pFABP4 has a relatively strong affinity for linoleate as compared to pFABP5. In contrast, pFABP5 exhibits higher affinity for palmitate than that for pFABP4. In conclusion, our high-resolution structures and ligand-binding studies provide useful insights into the ligand-binding preferences of pFABPs based on key protein-ligand interactions. Copyright © 2015 Elsevier Inc. All rights reserved.

  20. Inference of expanded Lrp-like feast/famine transcription factor targets in a non-model organism using protein structure-based prediction.

    Science.gov (United States)

    Ashworth, Justin; Plaisier, Christopher L; Lo, Fang Yin; Reiss, David J; Baliga, Nitin S

    2014-01-01

    Widespread microbial genome sequencing presents an opportunity to understand the gene regulatory networks of non-model organisms. This requires knowledge of the binding sites for transcription factors whose DNA-binding properties are unknown or difficult to infer. We adapted a protein structure-based method to predict the specificities and putative regulons of homologous transcription factors across diverse species. As a proof-of-concept we predicted the specificities and transcriptional target genes of divergent archaeal feast/famine regulatory proteins, several of which are encoded in the genome of Halobacterium salinarum. This was validated by comparison to experimentally determined specificities for transcription factors in distantly related extremophiles, chromatin immunoprecipitation experiments, and cis-regulatory sequence conservation across eighteen related species of halobacteria. Through this analysis we were able to infer that Halobacterium salinarum employs a divergent local trans-regulatory strategy to regulate genes (carA and carB) involved in arginine and pyrimidine metabolism, whereas Escherichia coli employs an operon. The prediction of gene regulatory binding sites using structure-based methods is useful for the inference of gene regulatory relationships in new species that are otherwise difficult to infer.

  1. Multiple binding modes of ibuprofen in human serum albumin identified by absolute binding free energy calculations

    KAUST Repository

    Evoli, Stefania

    2016-11-10

    Human serum albumin possesses multiple binding sites and transports a wide range of ligands that include the anti-inflammatory drug ibuprofen. A complete map of the binding sites of ibuprofen in albumin is difficult to obtain in traditional experiments, because of the structural adaptability of this protein in accommodating small ligands. In this work, we provide a set of predictions covering the geometry, affinity of binding and protonation state for the pharmaceutically most active form (S-isomer) of ibuprofen to albumin, by using absolute binding free energy calculations in combination with classical molecular dynamics (MD) simulations and molecular docking. The most favorable binding modes correctly reproduce several experimentally identified binding locations, which include the two Sudlow\\'s drug sites (DS2 and DS1) and the fatty acid binding sites 6 and 2 (FA6 and FA2). Previously unknown details of the binding conformations were revealed for some of them, and formerly undetected binding modes were found in other protein sites. The calculated binding affinities exhibit trends which seem to agree with the available experimental data, and drastically degrade when the ligand is modeled in a protonated (neutral) state, indicating that ibuprofen associates with albumin preferentially in its charged form. These findings provide a detailed description of the binding of ibuprofen, help to explain a wide range of results reported in the literature in the last decades, and demonstrate the possibility of using simulation methods to predict ligand binding to albumin.

  2. Molecular phylogeny and predicted 3D structure of plant beta-D-N-acetylhexosaminidase.

    Science.gov (United States)

    Hossain, Md Anowar; Roslan, Hairul Azman

    2014-01-01

    beta-D-N-Acetylhexosaminidase, a family 20 glycosyl hydrolase, catalyzes the removal of β-1,4-linked N-acetylhexosamine residues from oligosaccharides and their conjugates. We constructed phylogenetic tree of β-hexosaminidases to analyze the evolutionary history and predicted functions of plant hexosaminidases. Phylogenetic analysis reveals the complex history of evolution of plant β-hexosaminidase that can be described by gene duplication events. The 3D structure of tomato β-hexosaminidase (β-Hex-Sl) was predicted by homology modeling using 1now as a template. Structural conformity studies of the best fit model showed that more than 98% of the residues lie inside the favoured and allowed regions where only 0.9% lie in the unfavourable region. Predicted 3D structure contains 531 amino acids residues with glycosyl hydrolase20b domain-I and glycosyl hydrolase20 superfamily domain-II including the (β/α)8 barrel in the central part. The α and β contents of the modeled structure were found to be 33.3% and 12.2%, respectively. Eleven amino acids were found to be involved in ligand-binding site; Asp(330) and Glu(331) could play important roles in enzyme-catalyzed reactions. The predicted model provides a structural framework that can act as a guide to develop a hypothesis for β-Hex-Sl mutagenesis experiments for exploring the functions of this class of enzymes in plant kingdom.

  3. Stability of the octameric structure affects plasminogen-binding capacity of streptococcal enolase.

    Directory of Open Access Journals (Sweden)

    Amanda J Cork

    Full Text Available Group A Streptococcus (GAS is a human pathogen that has the potential to cause invasive disease by binding and activating human plasmin(ogen. Streptococcal surface enolase (SEN is an octameric α-enolase that is localized at the GAS cell surface. In addition to its glycolytic role inside the cell, SEN functions as a receptor for plasmin(ogen on the bacterial surface, but the understanding of the molecular basis of plasmin(ogen binding is limited. In this study, we determined the crystal and solution structures of GAS SEN and characterized the increased plasminogen binding by two SEN mutants. The plasminogen binding ability of SENK312A and SENK362A is ~2- and ~3.4-fold greater than for the wild-type protein. A combination of thermal stability assays, native mass spectrometry and X-ray crystallography approaches shows that increased plasminogen binding ability correlates with decreased stability of the octamer. We propose that decreased stability of the octameric structure facilitates the access of plasmin(ogen to its binding sites, leading to more efficient plasmin(ogen binding and activation.

  4. Proteome scale identification, classification and structural analysis of iron-binding proteins in bread wheat.

    Science.gov (United States)

    Verma, Shailender Kumar; Sharma, Ankita; Sandhu, Padmani; Choudhary, Neha; Sharma, Shailaja; Acharya, Vishal; Akhter, Yusuf

    2017-05-01

    Bread wheat is one of the major staple foods of worldwide population and iron plays a significant role in growth and development of the plant. In this report, we are presenting the genome wide identification of iron-binding proteins in bread wheat. The wheat genome derived putative proteome was screened for identification of iron-binding sequence motifs. Out of 602 putative iron-binding proteins, 130 were able to produce reliable structural models by homology techniques and further analyzed for the presence of iron-binding structural motifs. The computationally identified proteins appear to bind to ferrous and ferric ions and showed diverse coordination geometries. Glu, His, Asp and Cys amino acid residues were found to be mostly involved in iron binding. We have classified these proteins on the basis of their localization in the different cellular compartments. The identified proteins were further classified into their protein folds, families and functional classes ranging from structure maintenance of cellular components, regulation of gene expression, post translational modification, membrane proteins, enzymes, signaling and storage proteins. This comprehensive report regarding structural iron binding proteome provides useful insights into the diversity of iron binding proteins of wheat plants and further utilized to study their roles in plant growth, development and physiology. Copyright © 2017 Elsevier Inc. All rights reserved.

  5. Multiple binding modes of ibuprofen in human serum albumin identified by absolute binding free energy calculations

    KAUST Repository

    Evoli, Stefania; Mobley, David L.; Guzzi, Rita; Rizzuti, Bruno

    2016-01-01

    experiments, because of the structural adaptability of this protein in accommodating small ligands. In this work, we provide a set of predictions covering the geometry, affinity of binding and protonation state for the pharmaceutically most active form (S

  6. Comparative structural analysis of lipid binding START domains.

    Directory of Open Access Journals (Sweden)

    Ann-Gerd Thorsell

    Full Text Available Steroidogenic acute regulatory (StAR protein related lipid transfer (START domains are small globular modules that form a cavity where lipids and lipid hormones bind. These domains can transport ligands to facilitate lipid exchange between biological membranes, and they have been postulated to modulate the activity of other domains of the protein in response to ligand binding. More than a dozen human genes encode START domains, and several of them are implicated in a disease.We report crystal structures of the human STARD1, STARD5, STARD13 and STARD14 lipid transfer domains. These represent four of the six functional classes of START domains.Sequence alignments based on these and previously reported crystal structures define the structural determinants of human START domains, both those related to structural framework and those involved in ligand specificity.This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1.

  7. Conversion of pre-RISC to holo-RISC by Ago2 during assembly of RNAi complexes

    Science.gov (United States)

    Kim, Kevin; Lee, Young Sik; Carthew, Richard W.

    2007-01-01

    In the Drosophila RNA interference (RNAi) pathway, small interfering RNAs (siRNAs) direct Argonaute2 (Ago2), an endonuclease, within the RNA-induced silencing complex (RISC) to cleave complementary mRNA targets. In vitro studies have shown that, for each siRNA duplex, RISC retains only one strand, the guide, and releases the other, the passenger, to form a holo-RISC complex. Here, we have isolated a new Ago2 mutant allele and provide, for the first time, in vivo evidence that endogenous Ago2 slicer activity is important to mount an RNAi response in Drosophila. We demonstrate in vivo that efficient removal of the passenger strand from RISC requires the cleavage activity of Ago2. We have also identified a new intermediate complex in the RISC assembly pathway, pre-RISC, in which Ago2 is stably bound to double-stranded siRNA. PMID:17123955

  8. Visualizing RNA Secondary Structure Base Pair Binding Probabilities using Nested Concave Hulls

    OpenAIRE

    Sansen , Joris; Bourqui , Romain; Thebault , Patricia; Allali , Julien; Auber , David

    2015-01-01

    International audience; The challenge 1 of the BIOVIS 2015 design contest consists in designing an intuitive visual depiction of base pairs binding probabilities for secondary structure of ncRNA. Our representation depicts the potential nucleotide pairs binding using nested concave hulls over the computed MFE ncRNA secondary structure. Thus, it allows to identify regions with a high level of uncertainty in the MFE computation and the structures which seem to match to reality.

  9. Molecular characterization of the receptor binding structure-activity relationships of influenza B virus hemagglutinin.

    Science.gov (United States)

    Carbone, V; Kim, H; Huang, J X; Baker, M A; Ong, C; Cooper, M A; Li, J; Rockman, S; Velkov, T

    2013-01-01

    Selectivity of α2,6-linked human-like receptors by B hemagglutinin (HA) is yet to be fully understood. This study integrates binding data with structure-recognition models to examine the impact of regional-specific sequence variations within the receptor-binding pocket on selectivity and structure activity relationships (SAR). The receptor-binding selectivity of influenza B HAs corresponding to either B/Victoria/2/1987 or the B/Yamagata/16/88 lineages was examined using surface plasmon resonance, solid-phase ELISA and gel-capture assays. Our SAR data showed that the presence of asialyl sugar units is the main determinant of receptor preference of α2,6 versus α2,3 receptor binding. Changes to the type of sialyl-glycan linkage present on receptors exhibit only a minor effect upon binding affinity. Homology-based structural models revealed that structural properties within the HA pocket, such as a glyco-conjugate at Asn194 on the 190-helix, sterically interfere with binding to avian receptor analogs by blocking the exit path of the asialyl sugars. Similarly, naturally occurring substitutions in the C-terminal region of the 190-helix and near the N-terminal end of the 140-loop narrows the horizontal borders of the binding pocket, which restricts access of the avian receptor analog LSTa. This study helps bridge the gap between ligand structure and receptor recognition for influenza B HA; and provides a consensus SAR model for the binding of human and avian receptor analogs to influenza B HA.

  10. Key structural features of nonsteroidal ligands for binding and activation of the androgen receptor.

    Science.gov (United States)

    Yin, Donghua; He, Yali; Perera, Minoli A; Hong, Seoung Soo; Marhefka, Craig; Stourman, Nina; Kirkovsky, Leonid; Miller, Duane D; Dalton, James T

    2003-01-01

    The purposes of the present studies were to examine the androgen receptor (AR) binding ability and in vitro functional activity of multiple series of nonsteroidal compounds derived from known antiandrogen pharmacophores and to investigate the structure-activity relationships (SARs) of these nonsteroidal compounds. The AR binding properties of sixty-five nonsteroidal compounds were assessed by a radioligand competitive binding assay with the use of cytosolic AR prepared from rat prostates. The AR agonist and antagonist activities of high-affinity ligands were determined by the ability of the ligand to regulate AR-mediated transcriptional activation in cultured CV-1 cells, using a cotransfection assay. Nonsteroidal compounds with diverse structural features demonstrated a wide range of binding affinity for the AR. Ten compounds, mainly from the bicalutamide-related series, showed a binding affinity superior to the structural pharmacophore from which they were derived. Several SARs regarding nonsteroidal AR binding were revealed from the binding data, including stereoisomeric conformation, steric effect, and electronic effect. The functional activity of high-affinity ligands ranged from antagonist to full agonist for the AR. Several structural features were found to be determinative of agonist and antagonist activities. The nonsteroidal AR agonists identified from the present studies provided a pool of candidates for further development of selective androgen receptor modulators (SARMs) for androgen therapy. Also, these studies uncovered or confirmed numerous important SARs governing AR binding and functional properties by nonsteroidal molecules, which would be valuable in the future structural optimization of SARMs.

  11. Rapid and accurate prediction and scoring of water molecules in protein binding sites.

    Directory of Open Access Journals (Sweden)

    Gregory A Ross

    Full Text Available Water plays a critical role in ligand-protein interactions. However, it is still challenging to predict accurately not only where water molecules prefer to bind, but also which of those water molecules might be displaceable. The latter is often seen as a route to optimizing affinity of potential drug candidates. Using a protocol we call WaterDock, we show that the freely available AutoDock Vina tool can be used to predict accurately the binding sites of water molecules. WaterDock was validated using data from X-ray crystallography, neutron diffraction and molecular dynamics simulations and correctly predicted 97% of the water molecules in the test set. In addition, we combined data-mining, heuristic and machine learning techniques to develop probabilistic water molecule classifiers. When applied to WaterDock predictions in the Astex Diverse Set of protein ligand complexes, we could identify whether a water molecule was conserved or displaced to an accuracy of 75%. A second model predicted whether water molecules were displaced by polar groups or by non-polar groups to an accuracy of 80%. These results should prove useful for anyone wishing to undertake rational design of new compounds where the displacement of water molecules is being considered as a route to improved affinity.

  12. Probing binding hot spots at protein-RNA recognition sites.

    Science.gov (United States)

    Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad

    2016-01-29

    We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionary better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation site are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. SPOT-ligand 2: improving structure-based virtual screening by binding-homology search on an expanded structural template library.

    Science.gov (United States)

    Litfin, Thomas; Zhou, Yaoqi; Yang, Yuedong

    2017-04-15

    The high cost of drug discovery motivates the development of accurate virtual screening tools. Binding-homology, which takes advantage of known protein-ligand binding pairs, has emerged as a powerful discrimination technique. In order to exploit all available binding data, modelled structures of ligand-binding sequences may be used to create an expanded structural binding template library. SPOT-Ligand 2 has demonstrated significantly improved screening performance over its previous version by expanding the template library 15 times over the previous one. It also performed better than or similar to other binding-homology approaches on the DUD and DUD-E benchmarks. The server is available online at http://sparks-lab.org . yaoqi.zhou@griffith.edu.au or yuedong.yang@griffith.edu.au. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Secbase: database module to retrieve secondary structure elements with ligand binding motifs.

    Science.gov (United States)

    Koch, Oliver; Cole, Jason; Block, Peter; Klebe, Gerhard

    2009-10-01

    Secbase is presented as a novel extension module of Relibase. It integrates the information about secondary structure elements into the retrieval facilities of Relibase. The data are accessible via the extended Relibase user interface, and integrated retrieval queries can be addressed using an extended version of Reliscript. The primary information about alpha-helices and beta-sheets is used as provided by the PDB. Furthermore, a uniform classification of all turn families, based on recent clustering methods, and a new helix assignment that is based on this turn classification has been included. Algorithms to analyze the geometric features of helices and beta-strands were also implemented. To demonstrate the performance of the Secbase implementation, some application examples are given. They provide new insights into the involvement of secondary structure elements in ligand binding. A survey of water molecules detected next to the N-terminus of helices is analyzed to show their involvement in ligand binding. Additionally, the parallel oriented NH groups at the alpha-helix N-termini provide special binding motifs to bind particular ligand functional groups with two adjacent oxygen atoms, e.g., as found in negatively charged carboxylate or phosphate groups, respectively. The present study also shows that the specific structure of the first turn of alpha-helices provides a suitable explanation for stabilizing charged structures. The magnitude of the overall helix macrodipole seems to have no or only a minor influence on binding. Furthermore, an overview of the involvement of secondary structure elements with the recognition of some important endogenous ligands such as cofactors shows some distinct preference for particular binding motifs and amino acids.

  15. Structural study of LEDGF/p75 binding partners

    Czech Academy of Sciences Publication Activity Database

    Těšina, Petr; Čermáková, Kateřina; Procházková, Kateřina; Hořejší, Magdalena; Christ, F.; De Rijck, J.; Veverka, Václav; Řezáčová, Pavlína

    2013-01-01

    Roč. 20, č. 1 (2013), s. 12-12 ISSN 1211-5894. [Discussions in Structural Molecular Biology. Annual Meeting of the Czech Society for Structural Biology /11./. 14.03.2013-16.03.2013, Nové Hrady] R&D Projects: GA MŠk(CZ) LK11205 Institutional support: RVO:61388963 ; RVO:68378050 Keywords : LEDGF/p75 * HIV * integrase-binding domain Subject RIV: EB - Genetics ; Molecular Biology

  16. The Puf family of RNA-binding proteins in plants: phylogeny, structural modeling, activity and subcellular localization

    Directory of Open Access Journals (Sweden)

    Tam Michael WC

    2010-03-01

    Full Text Available Abstract Background Puf proteins have important roles in controlling gene expression at the post-transcriptional level by promoting RNA decay and repressing translation. The Pumilio homology domain (PUM-HD is a conserved region within Puf proteins that binds to RNA with sequence specificity. Although Puf proteins have been well characterized in animal and fungal systems, little is known about the structural and functional characteristics of Puf-like proteins in plants. Results The Arabidopsis and rice genomes code for 26 and 19 Puf-like proteins, respectively, each possessing eight or fewer Puf repeats in their PUM-HD. Key amino acids in the PUM-HD of several of these proteins are conserved with those of animal and fungal homologs, whereas other plant Puf proteins demonstrate extensive variability in these amino acids. Three-dimensional modeling revealed that the predicted structure of this domain in plant Puf proteins provides a suitable surface for binding RNA. Electrophoretic gel mobility shift experiments showed that the Arabidopsis AtPum2 PUM-HD binds with high affinity to BoxB of the Drosophila Nanos Response Element I (NRE1 RNA, whereas a point mutation in the core of the NRE1 resulted in a significant reduction in binding affinity. Transient expression of several of the Arabidopsis Puf proteins as fluorescent protein fusions revealed a dynamic, punctate cytoplasmic pattern of localization for most of these proteins. The presence of predicted nuclear export signals and accumulation of AtPuf proteins in the nucleus after treatment of cells with leptomycin B demonstrated that shuttling of these proteins between the cytosol and nucleus is common among these proteins. In addition to the cytoplasmically enriched AtPum proteins, two AtPum proteins showed nuclear targeting with enrichment in the nucleolus. Conclusions The Puf family of RNA-binding proteins in plants consists of a greater number of members than any other model species studied to

  17. Phocid seal leptin: tertiary structure and hydrophobic receptor binding site preservation during distinct leptin gene evolution.

    Directory of Open Access Journals (Sweden)

    John A Hammond

    Full Text Available The cytokine hormone leptin is a key signalling molecule in many pathways that control physiological functions. Although leptin demonstrates structural conservation in mammals, there is evidence of positive selection in primates, lagomorphs and chiropterans. We previously reported that the leptin genes of the grey and harbour seals (phocids have significantly diverged from other mammals. Therefore we further investigated the diversification of leptin in phocids, other marine mammals and terrestrial taxa by sequencing the leptin genes of representative species. Phylogenetic reconstruction revealed that leptin diversification was pronounced within the phocid seals with a high dN/dS ratio of 2.8, indicating positive selection. We found significant evidence of positive selection along the branch leading to the phocids, within the phocid clade, but not over the dataset as a whole. Structural predictions indicate that the individual residues under selection are away from the leptin receptor (LEPR binding site. Predictions of the surface electrostatic potential indicate that phocid seal leptin is notably different to other mammalian leptins, including the otariids. Cloning the grey seal leptin binding domain of LEPR confirmed that this was structurally conserved. These data, viewed in toto, support a hypothesis that phocid leptin divergence is unlikely to have arisen by random mutation. Based upon these phylogenetic and structural assessments, and considering the comparative physiology and varying life histories among species, we postulate that the unique phocid diving behaviour has produced this selection pressure. The Phocidae includes some of the deepest diving species, yet have the least modified lung structure to cope with pressure and volume changes experienced at depth. Therefore, greater surfactant production is required to facilitate rapid lung re-inflation upon surfacing, while maintaining patent airways. We suggest that this additional

  18. Prediction of protein-protein interaction sites in sequences and 3D structures by random forests.

    Directory of Open Access Journals (Sweden)

    Mile Sikić

    2009-01-01

    Full Text Available Identifying interaction sites in proteins provides important clues to the function of a protein and is becoming increasingly relevant in topics such as systems biology and drug discovery. Although there are numerous papers on the prediction of interaction sites using information derived from structure, there are only a few case reports on the prediction of interaction residues based solely on protein sequence. Here, a sliding window approach is combined with the Random Forests method to predict protein interaction sites using (i a combination of sequence- and structure-derived parameters and (ii sequence information alone. For sequence-based prediction we achieved a precision of 84% with a 26% recall and an F-measure of 40%. When combined with structural information, the prediction performance increases to a precision of 76% and a recall of 38% with an F-measure of 51%. We also present an attempt to rationalize the sliding window size and demonstrate that a nine-residue window is the most suitable for predictor construction. Finally, we demonstrate the applicability of our prediction methods by modeling the Ras-Raf complex using predicted interaction sites as target binding interfaces. Our results suggest that it is possible to predict protein interaction sites with quite a high accuracy using only sequence information.

  19. Structural Basis for Sialoglycan Binding by the Streptococcus sanguinis SrpA Adhesin*♦

    Science.gov (United States)

    Bensing, Barbara A.; Loukachevitch, Lioudmila V.; McCulloch, Kathryn M.; Yu, Hai; Vann, Kendra R.; Wawrzak, Zdzislaw; Anderson, Spencer; Chen, Xi; Sullam, Paul M.; Iverson, T. M.

    2016-01-01

    Streptococcus sanguinis is a leading cause of infective endocarditis, a life-threatening infection of the cardiovascular system. An important interaction in the pathogenesis of infective endocarditis is attachment of the organisms to host platelets. S. sanguinis expresses a serine-rich repeat adhesin, SrpA, similar in sequence to platelet-binding adhesins associated with increased virulence in this disease. In this study, we determined the first crystal structure of the putative binding region of SrpA (SrpABR) both unliganded and in complex with a synthetic disaccharide ligand at 1.8 and 2.0 Å resolution, respectively. We identified a conserved Thr-Arg motif that orients the sialic acid moiety and is required for binding to platelet monolayers. Furthermore, we propose that sequence insertions in closely related family members contribute to the modulation of structural and functional properties, including the quaternary structure, the tertiary structure, and the ligand-binding site. PMID:26833566

  20. Membrane proteins bind lipids selectively to modulate their structure and function.

    Science.gov (United States)

    Laganowsky, Arthur; Reading, Eamonn; Allison, Timothy M; Ulmschneider, Martin B; Degiacomi, Matteo T; Baldwin, Andrew J; Robinson, Carol V

    2014-06-05

    Previous studies have established that the folding, structure and function of membrane proteins are influenced by their lipid environments and that lipids can bind to specific sites, for example, in potassium channels. Fundamental questions remain however regarding the extent of membrane protein selectivity towards lipids. Here we report a mass spectrometry approach designed to determine the selectivity of lipid binding to membrane protein complexes. We investigate the mechanosensitive channel of large conductance (MscL) from Mycobacterium tuberculosis and aquaporin Z (AqpZ) and the ammonia channel (AmtB) from Escherichia coli, using ion mobility mass spectrometry (IM-MS), which reports gas-phase collision cross-sections. We demonstrate that folded conformations of membrane protein complexes can exist in the gas phase. By resolving lipid-bound states, we then rank bound lipids on the basis of their ability to resist gas phase unfolding and thereby stabilize membrane protein structure. Lipids bind non-selectively and with high avidity to MscL, all imparting comparable stability; however, the highest-ranking lipid is phosphatidylinositol phosphate, in line with its proposed functional role in mechanosensation. AqpZ is also stabilized by many lipids, with cardiolipin imparting the most significant resistance to unfolding. Subsequently, through functional assays we show that cardiolipin modulates AqpZ function. Similar experiments identify AmtB as being highly selective for phosphatidylglycerol, prompting us to obtain an X-ray structure in this lipid membrane-like environment. The 2.3 Å resolution structure, when compared with others obtained without lipid bound, reveals distinct conformational changes that re-position AmtB residues to interact with the lipid bilayer. Our results demonstrate that resistance to unfolding correlates with specific lipid-binding events, enabling a distinction to be made between lipids that merely bind from those that modulate membrane

  1. Convergence of Domain Architecture, Structure, and Ligand Affinity in Animal and Plant RNA-Binding Proteins.

    Science.gov (United States)

    Dias, Raquel; Manny, Austin; Kolaczkowski, Oralia; Kolaczkowski, Bryan

    2017-06-01

    Reconstruction of ancestral protein sequences using phylogenetic methods is a powerful technique for directly examining the evolution of molecular function. Although ancestral sequence reconstruction (ASR) is itself very efficient, downstream functional, and structural studies necessary to characterize when and how changes in molecular function occurred are often costly and time-consuming, currently limiting ASR studies to examining a relatively small number of discrete functional shifts. As a result, we have very little direct information about how molecular function evolves across large protein families. Here we develop an approach combining ASR with structure and function prediction to efficiently examine the evolution of ligand affinity across a large family of double-stranded RNA binding proteins (DRBs) spanning animals and plants. We find that the characteristic domain architecture of DRBs-consisting of 2-3 tandem double-stranded RNA binding motifs (dsrms)-arose independently in early animal and plant lineages. The affinity with which individual dsrms bind double-stranded RNA appears to have increased and decreased often across both animal and plant phylogenies, primarily through convergent structural mechanisms involving RNA-contact residues within the β1-β2 loop and a small region of α2. These studies provide some of the first direct information about how protein function evolves across large gene families and suggest that changes in molecular function may occur often and unassociated with major phylogenetic events, such as gene or domain duplications. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  2. Module structure of interphotoreceptor retinoid-binding protein (IRBP may provide bases for its complex role in the visual cycle – structure/function study of Xenopus IRBP

    Directory of Open Access Journals (Sweden)

    Ghosh Debashis

    2007-08-01

    Full Text Available Abstract Background Interphotoreceptor retinoid-binding protein's (IRBP remarkable module structure may be critical to its role in mediating the transport of all-trans and 11-cis retinol, and 11-cis retinal between rods, cones, RPE and Müller cells during the visual cycle. We isolated cDNAs for Xenopus IRBP, and expressed and purified its individual modules, module combinations, and the full-length polypeptide. Binding of all-trans retinol, 11-cis retinal and 9-(9-anthroyloxy stearic acid were characterized by fluorescence spectroscopy monitoring ligand-fluorescence enhancement, quenching of endogenous protein fluorescence, and energy transfer. Finally, the X-ray crystal structure of module-2 was used to predict the location of the ligand-binding sites, and compare their structures among modules using homology modeling. Results The full-length Xenopus IRBP cDNA codes for a polypeptide of 1,197 amino acid residues beginning with a signal peptide followed by four homologous modules each ~300 amino acid residues in length. Modules 1 and 3 are more closely related to each other than either is to modules 2 and 4. Modules 1 and 4 are most similar to the N- and C-terminal modules of the two module IRBP of teleosts. Our data are consistent with the model that vertebrate IRBPs arose through two genetic duplication events, but that the middle two modules were lost during the evolution of the ray finned fish. The sequence of the expressed full-length IRBP was confirmed by liquid chromatography-tandem mass spectrometry. The recombinant full-length Xenopus IRBP bound all-trans retinol and 11-cis retinaldehyde at 3 to 4 sites with Kd's of 0.2 to 0.3 μM, and was active in protecting all-trans retinol from degradation. Module 2 showed selectivity for all-trans retinol over 11-cis retinaldehyde. The binding data are correlated to the results of docking of all-trans-retinol to the crystal structure of Xenopus module 2 suggesting two ligand-binding sites

  3. Intrinsic Thermodynamics and Structures of 2,4- and 3,4-Substituted Fluorinated Benzenesulfonamides Binding to Carbonic Anhydrases.

    Science.gov (United States)

    Zubrienė, Asta; Smirnov, Alexey; Dudutienė, Virginija; Timm, David D; Matulienė, Jurgita; Michailovienė, Vilma; Zakšauskas, Audrius; Manakova, Elena; Gražulis, Saulius; Matulis, Daumantas

    2017-01-20

    The goal of rational drug design is to understand structure-thermodynamics correlations in order to predict the chemical structure of a drug that would exhibit excellent affinity and selectivity for a target protein. In this study we explored the contribution of added functionalities of benzenesulfonamide inhibitors to the intrinsic binding affinity, enthalpy, and entropy for recombinant human carbonic anhydrases (CA) CA I, CA II, CA VII, CA IX, CA XII, and CA XIII. The binding enthalpies of compounds possessing similar chemical structures and affinities were found to be very different, spanning a range from -90 to +10 kJ mol -1 , and are compensated by a similar opposing entropy contribution. The intrinsic parameters of binding were determined by subtracting the linked protonation reactions. The sulfonamide group pK a values of the compounds were measured spectrophotometrically, and the protonation enthalpies were measured by isothermal titration calorimetry (ITC). Herein we describe the development of meta- or ortho-substituted fluorinated benzenesulfonamides toward the highly potent compound 10 h, which exhibits an observed dissociation constant value of 43 pm and an intrinsic dissociation constant value of 1.1 pm toward CA IX, an anticancer target that is highly overexpressed in various tumors. Fluorescence thermal shift assays, ITC, and X-ray crystallography were all applied in this work. © 2017 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. Computational identification of antigen-binding antibody fragments.

    Science.gov (United States)

    Burkovitz, Anat; Leiderman, Olga; Sela-Culang, Inbal; Byk, Gerardo; Ofran, Yanay

    2013-03-01

    Determining which parts of the Ab are essential for Ag recognition and binding is crucial for understanding B cell-mediated immunity. Identification of fragments of Abs that maintain specificity to the Ag will also allow for the development of improved Ab-based therapy and diagnostics. In this article, we show that structural analysis of Ab-Ag complexes reveals which fragments of the Ab may bind the Ag on their own. In particular, it is possible to predict whether a given CDR is likely to bind the Ag as a peptide by analyzing the energetic contribution of each CDR to Ag binding and by assessing to what extent the interaction between that CDR and the Ag depends on other CDRs. To demonstrate this, we analyzed five Ab-Ag complexes and predicted for each of them which of the CDRs may bind the Ag on its own as a peptide. We then show that these predictions are in agreement with our experimental analysis and with previously published experimental results. These findings promote our understanding of the modular nature of Ab-Ag interactions and lay the foundation for the rational design of active CDR-derived peptides.

  5. Computational prediction of cAMP receptor protein (CRP binding sites in cyanobacterial genomes

    Directory of Open Access Journals (Sweden)

    Su Zhengchang

    2009-01-01

    Full Text Available Abstract Background Cyclic AMP receptor protein (CRP, also known as catabolite gene activator protein (CAP, is an important transcriptional regulator widely distributed in many bacteria. The biological processes under the regulation of CRP are highly diverse among different groups of bacterial species. Elucidation of CRP regulons in cyanobacteria will further our understanding of the physiology and ecology of this important group of microorganisms. Previously, CRP has been experimentally studied in only two cyanobacterial strains: Synechocystis sp. PCC 6803 and Anabaena sp. PCC 7120; therefore, a systematic genome-scale study of the potential CRP target genes and binding sites in cyanobacterial genomes is urgently needed. Results We have predicted and analyzed the CRP binding sites and regulons in 12 sequenced cyanobacterial genomes using a highly effective cis-regulatory binding site scanning algorithm. Our results show that cyanobacterial CRP binding sites are very similar to those in E. coli; however, the regulons are very different from that of E. coli. Furthermore, CRP regulons in different cyanobacterial species/ecotypes are also highly diversified, ranging from photosynthesis, carbon fixation and nitrogen assimilation, to chemotaxis and signal transduction. In addition, our prediction indicates that crp genes in modern cyanobacteria are likely inherited from a common ancestral gene in their last common ancestor, and have adapted various cellular functions in different environments, while some cyanobacteria lost their crp genes as well as CRP binding sites during the course of evolution. Conclusion The CRP regulons in cyanobacteria are highly diversified, probably as a result of divergent evolution to adapt to various ecological niches. Cyanobacterial CRPs may function as lineage-specific regulators participating in various cellular processes, and are important in some lineages. However, they are dispensable in some other lineages. The

  6. Prediction of molecular crystal structures

    International Nuclear Information System (INIS)

    Beyer, Theresa

    2001-01-01

    The ab initio prediction of molecular crystal structures is a scientific challenge. Reliability of first-principle prediction calculations would show a fundamental understanding of crystallisation. Crystal structure prediction is also of considerable practical importance as different crystalline arrangements of the same molecule in the solid state (polymorphs)are likely to have different physical properties. A method of crystal structure prediction based on lattice energy minimisation has been developed in this work. The choice of the intermolecular potential and of the molecular model is crucial for the results of such studies and both of these criteria have been investigated. An empirical atom-atom repulsion-dispersion potential for carboxylic acids has been derived and applied in a crystal structure prediction study of formic, benzoic and the polymorphic system of tetrolic acid. As many experimental crystal structure determinations at different temperatures are available for the polymorphic system of paracetamol (acetaminophen), the influence of the variations of the molecular model on the crystal structure lattice energy minima, has also been studied. The general problem of prediction methods based on the assumption that the experimental thermodynamically stable polymorph corresponds to the global lattice energy minimum, is that more hypothetical low lattice energy structures are found within a few kJ mol -1 of the global minimum than are likely to be experimentally observed polymorphs. This is illustrated by the results for molecule I, 3-oxabicyclo(3.2.0)hepta-1,4-diene, studied for the first international blindtest for small organic crystal structures organised by the Cambridge Crystallographic Data Centre (CCDC) in May 1999. To reduce the number of predicted polymorphs, additional factors to thermodynamic criteria have to be considered. Therefore the elastic constants and vapour growth morphologies have been calculated for the lowest lattice energy

  7. Prediction of molecular crystal structures

    Energy Technology Data Exchange (ETDEWEB)

    Beyer, Theresa

    2001-07-01

    The ab initio prediction of molecular crystal structures is a scientific challenge. Reliability of first-principle prediction calculations would show a fundamental understanding of crystallisation. Crystal structure prediction is also of considerable practical importance as different crystalline arrangements of the same molecule in the solid state (polymorphs)are likely to have different physical properties. A method of crystal structure prediction based on lattice energy minimisation has been developed in this work. The choice of the intermolecular potential and of the molecular model is crucial for the results of such studies and both of these criteria have been investigated. An empirical atom-atom repulsion-dispersion potential for carboxylic acids has been derived and applied in a crystal structure prediction study of formic, benzoic and the polymorphic system of tetrolic acid. As many experimental crystal structure determinations at different temperatures are available for the polymorphic system of paracetamol (acetaminophen), the influence of the variations of the molecular model on the crystal structure lattice energy minima, has also been studied. The general problem of prediction methods based on the assumption that the experimental thermodynamically stable polymorph corresponds to the global lattice energy minimum, is that more hypothetical low lattice energy structures are found within a few kJ mol{sup -1} of the global minimum than are likely to be experimentally observed polymorphs. This is illustrated by the results for molecule I, 3-oxabicyclo(3.2.0)hepta-1,4-diene, studied for the first international blindtest for small organic crystal structures organised by the Cambridge Crystallographic Data Centre (CCDC) in May 1999. To reduce the number of predicted polymorphs, additional factors to thermodynamic criteria have to be considered. Therefore the elastic constants and vapour growth morphologies have been calculated for the lowest lattice energy

  8. Structure of the nucleotide-binding domain of a dipeptide ABC transporter reveals a novel iron-sulfur cluster-binding domain.

    Science.gov (United States)

    Li, Xiaolu; Zhuo, Wei; Yu, Jie; Ge, Jingpeng; Gu, Jinke; Feng, Yue; Yang, Maojun; Wang, Linfang; Wang, Na

    2013-02-01

    Dipeptide permease (Dpp), which belongs to an ABC transport system, imports peptides consisting of two or three L-amino acids from the matrix to the cytoplasm in microbes. Previous studies have indicated that haem competes with dipeptides to bind DppA in vitro and in vivo and that the Dpp system can also translocate haem. Here, the crystal structure of DppD, the nucleotide-binding domain (NBD) of the ABC-type dipeptide/oligopeptide/nickel-transport system from Thermoanaerobacter tengcongensis, bound with ATP, Mg(2+) and a [4Fe-4S] iron-sulfur cluster is reported. The N-terminal domain of DppD shares a similar structural fold with the NBDs of other ABC transporters. Interestingly, the C-terminal domain of DppD contains a [4Fe-4S] cluster. The UV-visible absorbance spectrum of DppD was consistent with the presence of a [4Fe-4S] cluster. A search with DALI revealed that the [4Fe-4S] cluster-binding domain is a novel structural fold. Structural analysis and comparisons with other ABC transporters revealed that this iron-sulfur cluster may act as a mediator in substrate (dipeptide or haem) binding by electron transfer and may regulate the transport process in Dpp ABC transport systems. The crystal structure provides a basis for understanding the properties of ABC transporters and will be helpful in investigating the functions of NBDs in the regulation of ABC transporter activity.

  9. Crystal Structures and Binding Dynamics of Odorant-Binding Protein 3 from two aphid species Megoura viciae and Nasonovia ribisnigri.

    Science.gov (United States)

    Northey, Tom; Venthur, Herbert; De Biasio, Filomena; Chauviac, Francois-Xavier; Cole, Ambrose; Ribeiro, Karlos Antonio Lisboa; Grossi, Gerarda; Falabella, Patrizia; Field, Linda M; Keep, Nicholas H; Zhou, Jing-Jiang

    2016-04-22

    Aphids use chemical cues to locate hosts and find mates. The vetch aphid Megoura viciae feeds exclusively on the Fabaceae, whereas the currant-lettuce aphid Nasonovia ribisnigri alternates hosts between the Grossulariaceae and Asteraceae. Both species use alarm pheromones to warn of dangers. For N. ribisnigri this pheromone is a single component (E)-β-farnesene but M. viciae uses a mixture of (E)-β-farnesene, (-)-α-pinene, β-pinene, and limonene. Odorant-binding proteins (OBP) are believed to capture and transport such semiochemicals to their receptors. Here, we report the first aphid OBP crystal structures and examine their molecular interactions with the alarm pheromone components. Our study reveals some unique structural features: 1) the lack of an internal ligand binding site; 2) a striking groove in the surface of the proteins as a putative binding site; 3) the N-terminus rather than the C-terminus occupies the site closing off the conventional OBP pocket. The results from fluorescent binding assays, molecular docking and dynamics demonstrate that OBP3 from M. viciae can bind to all four alarm pheromone components and the differential ligand binding between these very similar OBP3s from the two aphid species is determined mainly by the direct π-π interactions between ligands and the aromatic residues of OBP3s in the binding pocket.

  10. Structure of the C-terminal heme-binding domain of THAP domain containing protein 4 from Homo sapiens

    Energy Technology Data Exchange (ETDEWEB)

    Bianchetti, Christopher M.; Bingman, Craig A.; Phillips, Jr., George N. (UW)

    2012-03-15

    The thanatos (the Greek god of death)-associated protein (THAP) domain is a sequence-specific DNA-binding domain that contains a C2-CH (Cys-Xaa{sub 2-4}-Cys-Xaa{sub 35-50}-Cys-Xaa{sub 2}-His) zinc finger that is similar to the DNA domain of the P element transposase from Drosophila. THAP-containing proteins have been observed in the proteome of humans, pigs, cows, chickens, zebrafish, Drosophila, C. elegans, and Xenopus. To date, there are no known THAP domain proteins in plants, yeast, or bacteria. There are 12 identified human THAP domain-containing proteins (THAP0-11). In all human THAP protein, the THAP domain is located at the N-terminus and is {approx}90 residues in length. Although all of the human THAP-containing proteins have a homologous N-terminus, there is extensive variation in both the predicted structure and length of the remaining protein. Even though the exact function of these THAP proteins is not well defined, there is evidence that they play a role in cell proliferation, apoptosis, cell cycle modulation, chromatin modification, and transcriptional regulation. THAP-containing proteins have also been implicated in a number of human disease states including heart disease, neurological defects, and several types of cancers. Human THAP4 is a 577-residue protein of unknown function that is proposed to bind DNA in a sequence-specific manner similar to THAP1 and has been found to be upregulated in response to heat shock. THAP4 is expressed in a relatively uniform manner in a broad range of tissues and appears to be upregulated in lymphoma cells and highly expressed in heart cells. The C-terminal domain of THAP4 (residues 415-577), designated here as cTHAP4, is evolutionarily conserved and is observed in all known THAP4 orthologs. Several single-domain proteins lacking a THAP domain are found in plants and bacteria and show significant levels of homology to cTHAP4. It appears that cTHAP4 belongs to a large class of proteins that have yet to be fully

  11. Binding of matrix metalloproteinase inhibitors to extracellular matrix: 3D-QSAR analysis.

    Science.gov (United States)

    Zhang, Yufen; Lukacova, Viera; Bartus, Vladimir; Nie, Xiaoping; Sun, Guorong; Manivannan, Ethirajan; Ghorpade, Sandeep R; Jin, Xiaomin; Manyem, Shankar; Sibi, Mukund P; Cook, Gregory R; Balaz, Stefan

    2008-10-01

    Binding to the extracellular matrix, one of the most abundant human protein complexes, significantly affects drug disposition. Specifically, the interactions with extracellular matrix determine the free concentrations of small molecules acting in tissues, including signaling peptides, inhibitors of tissue remodeling enzymes such as matrix metalloproteinases, and other drug candidates. The nature of extracellular matrix binding was elucidated for 63 matrix metalloproteinase inhibitors, for which the association constants to an extracellular matrix mimic were reported here. The data did not correlate with lipophilicity as a common determinant of structure-nonspecific, orientation-averaged binding. A hypothetical structure of the binding site of the solidified extracellular matrix surrogate was analyzed using the Comparative Molecular Field Analysis, which needed to be applied in our multi-mode variant. This fact indicates that the compounds bind to extracellular matrix in multiple modes, which cannot be considered as completely orientation-averaged and exhibit structural dependence. The novel comparative molecular field analysis models, exhibiting satisfactory descriptive and predictive abilities, are suitable for prediction of the extracellular matrix binding for the untested chemicals, which are within applicability domains. The results contribute to a better prediction of the pharmacokinetic parameters such as the distribution volume and the tissue-blood partition coefficients, in addition to a more imminent benefit for the development of more effective matrix metalloproteinase inhibitors.

  12. Relationship of Structure and Function of DNA-Binding Domain in Vitamin D Receptor

    Directory of Open Access Journals (Sweden)

    Lin-Yan Wan

    2015-07-01

    Full Text Available While the structure of the DNA-binding domain (DBD of the vitamin D receptor (VDR has been determined in great detail, the roles of its domains and how to bind the motif of its target genes are still under debate. The VDR DBD consists of two zinc finger modules and a C-terminal extension (CTE, at the end of the C-terminal of each structure presenting α-helix. For the first zinc finger structure, N37 and S-box take part in forming a dimer with 9-cis retinoid X receptor (RXR, while V26, R50, P-box and S-box participate in binding with VDR response elements (VDRE. For the second zinc finger structure, P61, F62 and H75 are essential in the structure of the VDR homodimer with the residues N37, E92 and F93 of the downstream of partner VDR, which form the inter-DBD interface. T-box of the CTE, especially the F93 and I94, plays a critical role in heterodimerization and heterodimers–VDRE binding. Six essential residues (R102, K103, M106, I107, K109, and R110 of the CTE α-helix of VDR construct one interaction face, which packs against the DBD core of the adjacent symmetry mate. In 1,25(OH2D3-activated signaling, the VDR-RXR heterodimer may bind to DR3-type VDRE and ER9-type VDREs of its target gene directly resulting in transactivation and also bind to DR3-liked nVDRE of its target gene directly resulting in transrepression. Except for this, 1α,25(OH2D3 ligand VDR-RXR may bind to 1αnVDRE indirectly through VDIR, resulting in transrepression of the target gene. Upon binding of 1α,25(OH2D3, VDR can transactivate and transrepress its target genes depending on the DNA motif that DBD binds.

  13. Structural and mutational analyses of the receptor binding domain of botulinum D/C mosaic neurotoxin: Insight into the ganglioside binding mechanism

    Energy Technology Data Exchange (ETDEWEB)

    Nuemket, Nipawan [Graduate School of Life Sciences, Hokkaido University, Sapporo 060-0810 (Japan); Tanaka, Yoshikazu [Creative Research Institution ' Sousei,' Hokkaido University, Sapporo 001-0021 (Japan); Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810 (Japan); Tsukamoto, Kentaro; Tsuji, Takao [Department of Microbiology, Fujita Health University School of Medicine, Toyoake, Aichi 470-1192 (Japan); Nakamura, Keiji; Kozaki, Shunji [Department of Veterinary Science, Graduate School of Life and Environmental Sciences, Osaka Prefecture University, Osaka 598-8531 (Japan); Yao, Min [Graduate School of Life Sciences, Hokkaido University, Sapporo 060-0810 (Japan); Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810 (Japan); Tanaka, Isao, E-mail: tanaka@castor.sci.hokudai.ac.jp [Graduate School of Life Sciences, Hokkaido University, Sapporo 060-0810 (Japan); Faculty of Advanced Life Science, Hokkaido University, Sapporo 060-0810 (Japan)

    2011-07-29

    Highlights: {yields} We determined the crystal structure of the receptor binding domain of BoNT in complex with 3'-sialyllactose. {yields} An electron density derived from the 3'-sialyllactose was confirmed at the cleft in the C-terminal subdomain. {yields} Alanine site-directed mutagenesis showed that GBS and GBL are important for ganglioside binding. {yields} A cell binding mechanism, which involves cooperative contribution of two sites, was proposed. -- Abstract: Clostridium botulinum type D strain OFD05, which produces the D/C mosaic neurotoxin, was isolated from cattle killed by the recent botulism outbreak in Japan. The D/C mosaic neurotoxin is the most toxic of the botulinum neurotoxins (BoNT) characterized to date. Here, we determined the crystal structure of the receptor binding domain of BoNT from strain OFD05 in complex with 3'-sialyllactose at a resolution of 3.0 A. In the structure, an electron density derived from the 3'-sialyllactose was confirmed at the cleft in the C-terminal subdomain. Alanine site-directed mutagenesis showed the significant contribution of the residues surrounding the cleft to ganglioside recognition. In addition, a loop adjoining the cleft also plays an important role in ganglioside recognition. In contrast, little effect was observed when the residues located around the surface previously identified as the protein receptor binding site in other BoNTs were substituted. The results of cell binding analysis of the mutants were significantly correlated with the ganglioside binding properties. Based on these observations, a cell binding mechanism of BoNT from strain OFD05 is proposed, which involves cooperative contribution of two ganglioside binding sites.

  14. Optimizing Stem Length To Improve Ligand Selectivity in a Structure-Switching Cocaine-Binding Aptamer.

    Science.gov (United States)

    Neves, Miguel A D; Shoara, Aron A; Reinstein, Oren; Abbasi Borhani, Okty; Martin, Taylor R; Johnson, Philip E

    2017-10-27

    Understanding how aptamer structure and function are related is crucial in the design and development of aptamer-based biosensors. We have analyzed a series of cocaine-binding aptamers with different lengths of their stem 1 in order to understand the role that this stem plays in the ligand-induced structure-switching binding mechanism utilized in many of the sensor applications of this aptamer. In the cocaine-binding aptamer, the length of stem 1 controls whether the structure-switching binding mechanism for this aptamer occurs or not. We varied the length of stem 1 from being one to seven base pairs long and found that the structural transition from unfolded to folded in the unbound aptamer is when the aptamer elongates from 3 to 4 base pairs in stem 1. We then used this knowledge to achieve new binding selectivity of this aptamer for quinine over cocaine by using an aptamer with a stem 1 two base pairs long. This selectivity is achieved by means of the greater affinity quinine has for the aptamer compared with cocaine. Quinine provides enough free energy to both fold and bind the 2-base pair-long aptamer while cocaine does not. This tuning of binding selectivity of an aptamer by reducing its stability is likely a general mechanism that could be used to tune aptamer specificity for tighter binding ligands.

  15. Oligosaccharide binding to barley alpha-amylase 1

    DEFF Research Database (Denmark)

    Robert, X.; Haser, R.; Mori, H.

    2005-01-01

    Enzymatic subsite mapping earlier predicted 10 binding subsites in the active site substrate binding cleft of barley alpha-amylase isozymes. The three-dimensional structures of the oligosaccharide complexes with barley alpha-amylase isozyme 1 (AMY1) described here give for the first time a thorough...... in barley alpha-amylase isozyme 2 (AMY2), and the sugar binding modes are compared between the two isozymes. The "sugar tongs" surface binding site discovered in the AMY1-thio-DP4 complex is confirmed in the present work. A site that putatively serves as an entrance for the substrate to the active site...

  16. Atomic structure of nitrate-binding protein crucial for photosynthetic productivity

    Energy Technology Data Exchange (ETDEWEB)

    Koropatkin, Nicole M.; Pakrasi, Himadri B.; Smith, Thomas J.

    2006-06-27

    Cyanobacteria, blue-green algae, are the most abundant autotrophs in aquatic environments and form the base of all aquatic food chains by fixing carbon and nitrogen into cellular biomass. The single most important nutrient for photosynthesis and growth is nitrate, which is severely limiting in many aquatic environments particularly the open ocean (1, 2). It is therefore not surprising that NrtA, the solute-binding component of the high-affinity nitrate ABC transporter, is the single-most abundant protein in the plasma membrane of these bacteria (3). Here we describe the first structure of a nitratespecific receptor, NrtA from Synechocystis sp. PCC 6803, complexed with nitrate and determined to a resolution of 1.5Å. NrtA is significantly larger than other oxyanionbinding proteins, representing a new class of transport proteins. From sequence alignments, the only other solute-binding protein in this class is CmpA, a bicarbonatebinding protein. Therefore, these organisms created a novel solute-binding protein for two of the most important nutrients; inorganic nitrogen and carbon. The electrostatic charge distribution of NrtA appears to force the protein off of the membrane while the flexible tether facilitates the delivery of nitrate to the membrane pore. The structure not only details the determinants for nitrate selectivity in NrtA, but also the bicarbonate specificity in CmpA. Nitrate and bicarbonate transport are regulated by the cytoplasmic proteins NrtC and CmpC, respectively. Interestingly, the residues lining the ligand binding pockets suggest that they both bind nitrate. This implies that the nitrogen and carbon uptake pathways are synchronized by intracellular nitrate and nitrite.3 The nitrate ABC transporter of cyanobacteria is composed of four polypeptides (Figure 1): a high-affinity periplasmic solute-binding lipoprotein (NrtA), an integral membrane permease (NrtB), a cytoplasmic ATPase (NrtD), and a unique ATPase/solute-binding fusion protein (Nrt

  17. Computational analysis and prediction of the binding motif and protein interacting partners of the Abl SH3 domain.

    Directory of Open Access Journals (Sweden)

    Tingjun Hou

    2006-01-01

    Full Text Available Protein-protein interactions, particularly weak and transient ones, are often mediated by peptide recognition domains, such as Src Homology 2 and 3 (SH2 and SH3 domains, which bind to specific sequence and structural motifs. It is important but challenging to determine the binding specificity of these domains accurately and to predict their physiological interacting partners. In this study, the interactions between 35 peptide ligands (15 binders and 20 non-binders and the Abl SH3 domain were analyzed using molecular dynamics simulation and the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. The calculated binding free energies correlated well with the rank order of the binding peptides and clearly distinguished binders from non-binders. Free energy component analysis revealed that the van der Waals interactions dictate the binding strength of peptides, whereas the binding specificity is determined by the electrostatic interaction and the polar contribution of desolvation. The binding motif of the Abl SH3 domain was then determined by a virtual mutagenesis method, which mutates the residue at each position of the template peptide relative to all other 19 amino acids and calculates the binding free energy difference between the template and the mutated peptides using the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. A single position mutation free energy profile was thus established and used as a scoring matrix to search peptides recognized by the Abl SH3 domain in the human genome. Our approach successfully picked ten out of 13 experimentally determined binding partners of the Abl SH3 domain among the top 600 candidates from the 218,540 decapeptides with the PXXP motif in the SWISS-PROT database. We expect that this physical-principle based method can be applied to other protein domains as well.

  18. Molecular Phylogeny and Predicted 3D Structure of Plant beta-D-N-Acetylhexosaminidase

    Directory of Open Access Journals (Sweden)

    Md. Anowar Hossain

    2014-01-01

    Full Text Available beta-D-N-Acetylhexosaminidase, a family 20 glycosyl hydrolase, catalyzes the removal of β-1,4-linked N-acetylhexosamine residues from oligosaccharides and their conjugates. We constructed phylogenetic tree of β-hexosaminidases to analyze the evolutionary history and predicted functions of plant hexosaminidases. Phylogenetic analysis reveals the complex history of evolution of plant β-hexosaminidase that can be described by gene duplication events. The 3D structure of tomato β-hexosaminidase (β-Hex-Sl was predicted by homology modeling using 1now as a template. Structural conformity studies of the best fit model showed that more than 98% of the residues lie inside the favoured and allowed regions where only 0.9% lie in the unfavourable region. Predicted 3D structure contains 531 amino acids residues with glycosyl hydrolase20b domain-I and glycosyl hydrolase20 superfamily domain-II including the (β/α8 barrel in the central part. The α and β contents of the modeled structure were found to be 33.3% and 12.2%, respectively. Eleven amino acids were found to be involved in ligand-binding site; Asp(330 and Glu(331 could play important roles in enzyme-catalyzed reactions. The predicted model provides a structural framework that can act as a guide to develop a hypothesis for β-Hex-Sl mutagenesis experiments for exploring the functions of this class of enzymes in plant kingdom.

  19. Advancing viral RNA structure prediction: measuring the thermodynamics of pyrimidine-rich internal loops.

    Science.gov (United States)

    Phan, Andy; Mailey, Katherine; Saeki, Jessica; Gu, Xiaobo; Schroeder, Susan J

    2017-05-01

    Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites. © 2017 Phan et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  20. Homology modeling and docking analyses of M. leprae Mur ligases reveals the common binding residues for structure based drug designing to eradicate leprosy.

    Science.gov (United States)

    Shanmugam, Anusuya; Natarajan, Jeyakumar

    2012-06-01

    Multi drug resistance capacity for Mycobacterium leprae (MDR-Mle) demands the profound need for developing new anti-leprosy drugs. Since most of the drugs target a single enzyme, mutation in the active site renders the antibiotic ineffective. However, structural and mechanistic information on essential bacterial enzymes in a pathway could lead to the development of antibiotics that targets multiple enzymes. Peptidoglycan is an important component of the cell wall of M. leprae. The biosynthesis of bacterial peptidoglycan represents important targets for the development of new antibacterial drugs. Biosynthesis of peptidoglycan is a multi-step process that involves four key Mur ligase enzymes: MurC (EC:6.3.2.8), MurD (EC:6.3.2.9), MurE (EC:6.3.2.13) and MurF (EC:6.3.2.10). Hence in our work, we modeled the three-dimensional structure of the above Mur ligases using homology modeling method and analyzed its common binding features. The residues playing an important role in the catalytic activity of each of the Mur enzymes were predicted by docking these Mur ligases with their substrates and ATP. The conserved sequence motifs significant for ATP binding were predicted as the probable residues for structure based drug designing. Overall, the study was successful in listing significant and common binding residues of Mur enzymes in peptidoglycan pathway for multi targeted therapy.

  1. Conservation of transcription factor binding events predicts gene expression across species

    Science.gov (United States)

    Hemberg, Martin; Kreiman, Gabriel

    2011-01-01

    Recent technological advances have made it possible to determine the genome-wide binding sites of transcription factors (TFs). Comparisons across species have suggested a relatively low degree of evolutionary conservation of experimentally defined TF binding events (TFBEs). Using binding data for six different TFs in hepatocytes and embryonic stem cells from human and mouse, we demonstrate that evolutionary conservation of TFBEs within orthologous proximal promoters is closely linked to function, defined as expression of the target genes. We show that (i) there is a significantly higher degree of conservation of TFBEs when the target gene is expressed in both species; (ii) there is increased conservation of binding events for groups of TFs compared to individual TFs; and (iii) conserved TFBEs have a greater impact on the expression of their target genes than non-conserved ones. These results link conservation of structural elements (TFBEs) to conservation of function (gene expression) and suggest a higher degree of functional conservation than implied by previous studies. PMID:21622661

  2. Structure of Alzheimer’s disease amyloid precursor protein copper-binding domain at atomic resolution

    Energy Technology Data Exchange (ETDEWEB)

    Kong, Geoffrey Kwai-Wai; Adams, Julian J. [Biota Structural Biology Laboratory, St Vincent’s Institute, 9 Princes Street, Fitzroy, Victoria 3065 (Australia); Cappai, Roberto [Department of Pathology and Centre for Neuroscience, The University of Melbourne, Victoria 3010 (Australia); The Mental Health Research Institute of Victoria, Parkville, Victoria 3052 (Australia); Bio21 Institute, The University of Melbourne, Victoria 3010 (Australia); Parker, Michael W., E-mail: mparker@svi.edu.au [Biota Structural Biology Laboratory, St Vincent’s Institute, 9 Princes Street, Fitzroy, Victoria 3065 (Australia); Bio21 Institute, The University of Melbourne, Victoria 3010 (Australia)

    2007-10-01

    An atomic resolution structure of the copper-binding domain of the Alzheimer’s disease amyloid precursor protein is presented. Amyloid precursor protein (APP) plays a central role in the pathogenesis of Alzheimer’s disease, as its cleavage generates the Aβ peptide that is toxic to cells. APP is able to bind Cu{sup 2+} and reduce it to Cu{sup +} through its copper-binding domain (CuBD). The interaction between Cu{sup 2+} and APP leads to a decrease in Aβ production and to alleviation of the symptoms of the disease in mouse models. Structural studies of CuBD have been undertaken in order to better understand the mechanism behind the process. Here, the crystal structure of CuBD in the metal-free form determined to ultrahigh resolution (0.85 Å) is reported. The structure shows that the copper-binding residues of CuBD are rather rigid but that Met170, which is thought to be the electron source for Cu{sup 2+} reduction, adopts two different side-chain conformations. These observations shed light on the copper-binding and redox mechanisms of CuBD. The structure of CuBD at atomic resolution provides an accurate framework for structure-based design of molecules that will deplete Aβ production.

  3. Prediction of protein binding sites using physical and chemical descriptors and the support vector machine regression method

    International Nuclear Information System (INIS)

    Sun Zhong-Hua; Jiang Fan

    2010-01-01

    In this paper a new continuous variable called core-ratio is defined to describe the probability for a residue to be in a binding site, thereby replacing the previous binary description of the interface residue using 0 and 1. So we can use the support vector machine regression method to fit the core-ratio value and predict the protein binding sites. We also design a new group of physical and chemical descriptors to characterize the binding sites. The new descriptors are more effective, with an averaging procedure used. Our test shows that much better prediction results can be obtained by the support vector regression (SVR) method than by the support vector classification method. (rapid communication)

  4. Structural study and thermodynamic characterization of inhibitor binding to lumazine synthase from Bacillus anthracis

    Energy Technology Data Exchange (ETDEWEB)

    Morgunova, Ekaterina [Karolinska Institutet NOVUM, Center of Structural Biochemistry, Hälsovägen 7-9, 141 57 Huddinge (Sweden); Illarionov, Boris; Saller, Sabine [Institut für Lebensmittelchemie, Universität Hamburg, Grindelallee 117, 20146 Hamburg (Germany); Popov, Aleksander [European Synchrotron Radiation Facility, BP 220, F-38043 Grenoble CEDEX 09 (France); Sambaiah, Thota [Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University (United States); Bacher, Adelbert [Chemistry Department, Technical University of Munich, 85747 Garching (Germany); Cushman, Mark [Department of Medicinal Chemistry and Molecular Pharmacology, Purdue University (United States); Fischer, Markus [Institut für Lebensmittelchemie, Universität Hamburg, Grindelallee 117, 20146 Hamburg (Germany); Ladenstein, Rudolf, E-mail: rudolf.ladenstein@ki.se [Karolinska Institutet NOVUM, Center of Structural Biochemistry, Hälsovägen 7-9, 141 57 Huddinge (Sweden)

    2010-09-01

    Crystallographic studies of lumazine synthase, the penultimate enzyme of the riboflavin-biosynthetic pathway in B. anthracis, provide a structural framework for the design of antibiotic inhibitors, together with calorimetric and kinetic investigations of inhibitor binding. The crystal structure of lumazine synthase from Bacillus anthracis was solved by molecular replacement and refined to R{sub cryst} = 23.7% (R{sub free} = 28.4%) at a resolution of 3.5 Å. The structure reveals the icosahedral symmetry of the enzyme and specific features of the active site that are unique in comparison with previously determined orthologues. The application of isothermal titration calorimetry in combination with enzyme kinetics showed that three designed pyrimidine derivatives bind to lumazine synthase with micromolar dissociation constants and competitively inhibit the catalytic reaction. Structure-based modelling suggested the binding modes of the inhibitors in the active site and allowed an estimation of the possible contacts formed upon binding. The results provide a structural framework for the design of antibiotics active against B. anthracis.

  5. Structure and DNA-binding of meiosis-specific protein Hop2

    Science.gov (United States)

    Zhou, Donghua; Moktan, Hem; Pezza, Roberto

    2014-03-01

    Here we report structure elucidation of the DNA binding domain of homologous pairing protein 2 (Hop2), which is important to gene diversity when sperms and eggs are produced. Together with another protein Mnd1, Hop2 enhances the strand invasion activity of recombinase Dmc1 by over 30 times, facilitating proper synapsis of homologous chromosomes. However, the structural and biochemical bases for the function of Hop2 and Mnd1 have not been well understood. As a first step toward such understanding, we recently solved the structure for the N-terminus of Hop2 (1-84) using solution NMR. This fragment shows a typical winged-head conformation with recognized DNA binding activity. DNA interacting sites were then investigated by chemical shift perturbations in a titration experiment. Information of these sites was used to guide protein-DNA docking with MD simulation, revealing that helix 3 is stably lodged in the DNA major groove and that wing 1 (connecting strands 2 and 3) transiently comes in contact with the minor groove in nanosecond time scale. Mutagenesis analysis further confirmed the DNA binding sites in this fragment of the protein.

  6. Structure of a retro-binding peptide inhibitor complexed with human alpha-thrombin.

    Science.gov (United States)

    Tabernero, L; Chang, C Y; Ohringer, S L; Lau, W F; Iwanowicz, E J; Han, W C; Wang, T C; Seiler, S M; Roberts, D G; Sack, J S

    1995-02-10

    The crystallographic structure of the ternary complex between human alpha-thrombin, hirugen and the peptidyl inhibitor Phe-alloThr-Phe-O-CH3, which is acylated at its N terminus with 4-guanidino butanoic acid (BMS-183507), has been determined at 2.6 A resolution. The structure reveals a unique "retro-binding" mode for this tripeptide active site inhibitor. The inhibitor binds with its alkyl-guanidine moiety in the primary specificity pocket and its two phenyl rings occupying the hydrophobic proximal and distal pockets of the thrombin active site. In this arrangement the backbone of the tripeptide forms a parallel beta-strand to the thrombin main-chain at the binding site. This is opposite to the orientation of the natural substrate, fibrinogen, and all the small active site-directed thrombin inhibitors whose bound structures have been previously reported. BMS-183507 is the first synthetic inhibitor proved to bind in a retro-binding fashion to thrombin, in a fashion similar to that of the N-terminal residues of the natural inhibitor hirudin. Furthermore, this new potent thrombin inhibitor (Ki = 17.2 nM) is selective for thrombin over other serine proteases tested and may be a template to be considered in designing hirudin-based thrombin inhibitors with interactions at the specificity pocket.

  7. LIBRA: LIgand Binding site Recognition Application.

    Science.gov (United States)

    Hung, Le Viet; Caprari, Silvia; Bizai, Massimiliano; Toti, Daniele; Polticelli, Fabio

    2015-12-15

    In recent years, structural genomics and ab initio molecular modeling activities are leading to the availability of a large number of structural models of proteins whose biochemical function is not known. The aim of this study was the development of a novel software tool that, given a protein's structural model, predicts the presence and identity of active sites and/or ligand binding sites. The algorithm implemented by ligand binding site recognition application (LIBRA) is based on a graph theory approach to find the largest subset of similar residues between an input protein and a collection of known functional sites. The algorithm makes use of two predefined databases for active sites and ligand binding sites, respectively, derived from the Catalytic Site Atlas and the Protein Data Bank. Tests indicate that LIBRA is able to identify the correct binding/active site in 90% of the cases analyzed, 90% of which feature the identified site as ranking first. As far as ligand binding site recognition is concerned, LIBRA outperforms other structure-based ligand binding sites detection tools with which it has been compared. The application, developed in Java SE 7 with a Swing GUI embedding a JMol applet, can be run on any OS equipped with a suitable Java Virtual Machine (JVM), and is available at the following URL: http://www.computationalbiology.it/software/LIBRAv1.zip. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. Structure of a periplasmic glucose-binding protein from Thermotoga maritima

    International Nuclear Information System (INIS)

    Palani, Kandavelu; Kumaran, Desigan; Burley, Stephen K.; Swaminathan, Subramanyam

    2012-01-01

    The periplasmic glucose-binding protein from T. maritima consists of two domains with the ligand β-d-glucose buried between them. The two domains adopt a closed conformation. ABC transport systems have been characterized in organisms ranging from bacteria to humans. In most bacterial systems, the periplasmic component is the primary determinant of specificity of the transport complex as a whole. Here, the X-ray crystal structure of a periplasmic glucose-binding protein (GBP) from Thermotoga maritima determined at 2.4 Å resolution is reported. The molecule consists of two similar α/β domains connected by a three-stranded hinge region. In the current structure, a ligand (β-d-glucose) is buried between the two domains, which have adopted a closed conformation. Details of the substrate-binding sites revealed features that determine substrate specificity. In toto, ten residues from both domains form eight hydrogen bonds to the bound sugar and four aromatic residues (two from each domain) stabilize the substrate through stacking interactions

  9. MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.

    Science.gov (United States)

    Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka

    2018-05-08

    Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on

  10. In Silico Mechanistic Profiling to Probe Small Molecule Binding to Sulfotransferases

    Science.gov (United States)

    Martiny, Virginie Y.; Carbonell, Pablo; Lagorce, David; Villoutreix, Bruno O.; Moroy, Gautier; Miteva, Maria A.

    2013-01-01

    Drug metabolizing enzymes play a key role in the metabolism, elimination and detoxification of xenobiotics, drugs and endogenous molecules. While their principal role is to detoxify organisms by modifying compounds, such as pollutants or drugs, for a rapid excretion, in some cases they render their substrates more toxic thereby inducing severe side effects and adverse drug reactions, or their inhibition can lead to drug–drug interactions. We focus on sulfotransferases (SULTs), a family of phase II metabolizing enzymes, acting on a large number of drugs and hormones and showing important structural flexibility. Here we report a novel in silico structure-based approach to probe ligand binding to SULTs. We explored the flexibility of SULTs by molecular dynamics (MD) simulations in order to identify the most suitable multiple receptor conformations for ligand binding prediction. Then, we employed structure-based docking-scoring approach to predict ligand binding and finally we combined the predicted interaction energies by using a QSAR methodology. The results showed that our protocol successfully prioritizes potent binders for the studied here SULT1 isoforms, and give new insights on specific molecular mechanisms for diverse ligands’ binding related to their binding sites plasticity. Our best QSAR models, introducing predicted protein-ligand interaction energy by using docking, showed accuracy of 67.28%, 78.00% and 75.46%, for the isoforms SULT1A1, SULT1A3 and SULT1E1, respectively. To the best of our knowledge our protocol is the first in silico structure-based approach consisting of a protein-ligand interaction analysis at atomic level that considers both ligand and enzyme flexibility, along with a QSAR approach, to identify small molecules that can interact with II phase dug metabolizing enzymes. PMID:24039991

  11. In silico mechanistic profiling to probe small molecule binding to sulfotransferases.

    Directory of Open Access Journals (Sweden)

    Virginie Y Martiny

    Full Text Available Drug metabolizing enzymes play a key role in the metabolism, elimination and detoxification of xenobiotics, drugs and endogenous molecules. While their principal role is to detoxify organisms by modifying compounds, such as pollutants or drugs, for a rapid excretion, in some cases they render their substrates more toxic thereby inducing severe side effects and adverse drug reactions, or their inhibition can lead to drug-drug interactions. We focus on sulfotransferases (SULTs, a family of phase II metabolizing enzymes, acting on a large number of drugs and hormones and showing important structural flexibility. Here we report a novel in silico structure-based approach to probe ligand binding to SULTs. We explored the flexibility of SULTs by molecular dynamics (MD simulations in order to identify the most suitable multiple receptor conformations for ligand binding prediction. Then, we employed structure-based docking-scoring approach to predict ligand binding and finally we combined the predicted interaction energies by using a QSAR methodology. The results showed that our protocol successfully prioritizes potent binders for the studied here SULT1 isoforms, and give new insights on specific molecular mechanisms for diverse ligands' binding related to their binding sites plasticity. Our best QSAR models, introducing predicted protein-ligand interaction energy by using docking, showed accuracy of 67.28%, 78.00% and 75.46%, for the isoforms SULT1A1, SULT1A3 and SULT1E1, respectively. To the best of our knowledge our protocol is the first in silico structure-based approach consisting of a protein-ligand interaction analysis at atomic level that considers both ligand and enzyme flexibility, along with a QSAR approach, to identify small molecules that can interact with II phase dug metabolizing enzymes.

  12. RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information.

    Science.gov (United States)

    Ghosh, Pritha; Mathew, Oommen K; Sowdhamini, Ramanathan

    2016-10-07

    RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information. The web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises of 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. In case of association of the protein with a family of known structures, output features like, multiple structure-based sequence alignment (MSSA) of the query with all others members of that family is provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any and a homology model of the protein can be obtained. The users can also browse through the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search. RStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential

  13. Structural Basis for a Ribofuranosyl Binding Protein: Insights into the Furanose Specific Transport

    Energy Technology Data Exchange (ETDEWEB)

    Bagaria, A.; Swaminathan, S.; Kumaran, D.; Burley, S. K.

    2011-04-01

    The ATP-binding cassette transporters (ABC-transporters) are members of one of the largest protein superfamilies, with representatives in all extant phyla. These integral membrane proteins utilize the energy of ATP hydrolysis to carry out certain biological processes, including translocation of various substrates across membranes and non-transport related processes such as translation of RNA and DNA repair. Typically, such transport systems in bacteria consist of an ATP binding component, a transmembrane permease, and a periplasmic receptor or binding protein. Soluble proteins found in the periplasm of gram-negative bacteria serve as the primary receptors for transport of many compounds, such as sugars, small peptides, and some ions. Ligand binding activates these periplasmic components, permitting recognition by the membrane spanning domain, which supports for transport and, in some cases, chemotaxis. Transport and chemotaxis processes appear to be independent of one another, and a few mutants of bifunctional periplasmic components reveal the absence of one or the other function. Previously published high-resolution X-ray structures of various periplasmic ligand binding proteins include Arabinose binding protein (ABP), Allose binding protein (ALBP), Glucose-galactose binding protein (GBP) and Ribose binding protein (RBP). Each of these proteins consists of two structurally similar domains connected by a three-stranded hinge region, with ligand buried between the domains. Upon ligand binding and release, various conformational changes have been observed. For RBP, open (apo) and closed (ligand bound) conformations have been reported and so for MBP. The closed/active form of the protein interacts with the integral membrane component of the system in both transport and chemotaxis. Herein, we report 1.9{angstrom} resolution X-ray structure of the R{sub f}BP periplasmic component of an ABC-type sugar transport system from Hahella chejuensis (UniProt Id Q2S7D2) bound to

  14. Structural basis for ubiquitin recognition by ubiquitin-binding zinc finger of FAAP20.

    Directory of Open Access Journals (Sweden)

    Aya Toma

    Full Text Available Several ubiquitin-binding zinc fingers (UBZs have been reported to preferentially bind K63-linked ubiquitin chains. In particular, the UBZ domain of FAAP20 (FAAP20-UBZ, a member of the Fanconi anemia core complex, seems to recognize K63-linked ubiquitin chains, in order to recruit the complex to DNA interstrand crosslinks and mediate DNA repair. By contrast, it is reported that the attachment of a single ubiquitin to Rev1, a translesion DNA polymerase, increases binding of Rev1 to FAAP20. To clarify the specificity of FAAP20-UBZ, we determined the crystal structure of FAAP20-UBZ in complex with K63-linked diubiquitin at 1.9 Å resolution. In this structure, FAAP20-UBZ interacts only with one of the two ubiquitin moieties. Consistently, binding assays using surface plasmon resonance spectrometry showed that FAAP20-UBZ binds ubiquitin and M1-, K48- and K63-linked diubiquitin chains with similar affinities. Residues in the vicinity of Ala168 within the α-helix and the C-terminal Trp180 interact with the canonical Ile44-centered hydrophobic patch of ubiquitin. Asp164 within the α-helix and the C-terminal loop mediate a hydrogen bond network, which reinforces ubiquitin-binding of FAAP20-UBZ. Mutations of the ubiquitin-interacting residues disrupted binding to ubiquitin in vitro and abolished the accumulation of FAAP20 to DNA damage sites in vivo. Finally, structural comparison among FAAP20-UBZ, WRNIP1-UBZ and RAD18-UBZ revealed distinct modes of ubiquitin binding. UBZ family proteins could be divided into at least three classes, according to their ubiquitin-binding modes.

  15. In silico predictive studies of mAHR congener binding using homology modelling and molecular docking.

    Science.gov (United States)

    Panda, Roshni; Cleave, A Suneetha Susan; Suresh, P K

    2014-09-01

    The aryl hydrocarbon receptor (AHR) is one of the principal xenobiotic, nuclear receptor that is responsible for the early events involved in the transcription of a complex set of genes comprising the CYP450 gene family. In the present computational study, homology modelling and molecular docking were carried out with the objective of predicting the relationship between the binding efficiency and the lipophilicity of different polychlorinated biphenyl (PCB) congeners and the AHR in silico. Homology model of the murine AHR was constructed by several automated servers and assessed by PROCHECK, ERRAT, VERIFY3D and WHAT IF. The resulting model of the AHR by MODWEB was used to carry out molecular docking of 36 PCB congeners using PatchDock server. The lipophilicity of the congeners was predicted using the XLOGP3 tool. The results suggest that the lipophilicity influences binding energy scores and is positively correlated with the same. Score and Log P were correlated with r = +0.506 at p = 0.01 level. In addition, the number of chlorine (Cl) atoms and Log P were highly correlated with r = +0.900 at p = 0.01 level. The number of Cl atoms and scores also showed a moderate positive correlation of r = +0.481 at p = 0.01 level. To the best of our knowledge, this is the first study employing PatchDock in the docking of AHR to the environmentally deleterious congeners and attempting to correlate structural features of the AHR with its biochemical properties with regards to PCBs. The result of this study are consistent with those of other computational studies reported in the previous literature that suggests that a combination of docking, scoring and ranking organic pollutants could be a possible predictive tool for investigating ligand-mediated toxicity, for their subsequent validation using wet lab-based studies. © The Author(s) 2012.

  16. LDA+U and tight-binding electronic structure of InN nanowires

    Science.gov (United States)

    Molina-Sánchez, A.; García-Cristóbal, A.; Cantarero, A.; Terentjevs, A.; Cicero, G.

    2010-10-01

    In this paper we employ a combined ab initio and tight-binding approach to obtain the electronic and optical properties of hydrogenated Indium nitride (InN) nanowires. We first discuss InN band structure for the wurtzite structure calculated at the LDA+U level and use this information to extract the parameters needed for an empirical tight-binging implementation. These parameters are then employed to calculate the electronic and optical properties of InN nanowires in a diameter range that would not be affordable by ab initio techniques. The reliability of the large nanowires results is assessed by explicitly comparing the electronic structure of a small diameter wire studied both at LDA+U and tight-binding level.

  17. Structural basis of nonribosomal peptide macrocyclization in fungi.

    Science.gov (United States)

    Zhang, Jinru; Liu, Nicholas; Cacho, Ralph A; Gong, Zhou; Liu, Zhu; Qin, Wenming; Tang, Chun; Tang, Yi; Zhou, Jiahai

    2016-12-01

    Nonribosomal peptide synthetases (NRPSs) in fungi biosynthesize important pharmaceutical compounds, including penicillin, cyclosporine and echinocandin. To understand the fungal strategy of forging the macrocyclic peptide linkage, we determined the crystal structures of the terminal condensation-like (C T ) domain and the holo thiolation (T)-C T complex of Penicillium aethiopicum TqaA. The first, to our knowledge, structural depiction of the terminal module in a fungal NRPS provides a molecular blueprint for generating new macrocyclic peptide natural products.

  18. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    Science.gov (United States)

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.

  19. Acetylcholine-Binding Protein Engineered to Mimic the α4-α4 Binding Pocket in α4β2 Nicotinic Acetylcholine Receptors Reveals Interface Specific Interactions Important for Binding and Activity

    DEFF Research Database (Denmark)

    Shahsavar, Azadeh; Ahring, Philip K; Olsen, Jeppe A

    2015-01-01

    Neuronal α4β2 nicotinic acetylcholine receptors are attractive drug targets for psychiatric and neurodegenerative disorders and smoking cessation aids. Recently, a third agonist binding site between two α4 subunits in the (α4)(3)(β2)(2) receptor subpopulation was discovered. In particular, three......-yl)-1,4-diazepane], highlights the roles of the three residues in determining binding affinities and functional properties of ligands at the α4-α4 interface. Confirmed by mutational studies, our structures suggest a unique ligand-specific role of residue H142 on the α4 subunit. In the cocrystal...... that could not be predicted based on wild-type Ls-AChBP structures in complex with the same agonists. The results show that an unprecedented correlation between binding in engineered AChBPs and functional receptors can be obtained and provide new opportunities for structure-based design of drugs targeting...

  20. Dataset size and composition impact the reliability of performance benchmarks for peptide-MHC binding predictions

    DEFF Research Database (Denmark)

    Kim, Yohan; Sidney, John; Buus, Søren

    2014-01-01

    Background: It is important to accurately determine the performance of peptide: MHC binding predictions, as this enables users to compare and choose between different prediction methods and provides estimates of the expected error rate. Two common approaches to determine prediction performance...... are cross-validation, in which all available data are iteratively split into training and testing data, and the use of blind sets generated separately from the data used to construct the predictive method. In the present study, we have compared cross-validated prediction performances generated on our last...

  1. Sensitive quantitative predictions of peptide-MHC binding by a 'Query by Committee' artificial neural network approach

    DEFF Research Database (Denmark)

    Buus, S.; Lauemoller, S.L.; Worning, Peder

    2003-01-01

    We have generated Artificial Neural Networks (ANN) capable of performing sensitive, quantitative predictions of peptide binding to the MHC class I molecule, HLA-A*0204. We have shown that such quantitative ANN are superior to conventional classification ANN, that have been trained to predict...

  2. The host-binding domain of the P2 phage tail spike reveals a trimeric iron-binding structure

    International Nuclear Information System (INIS)

    Yamashita, Eiki; Nakagawa, Atsushi; Takahashi, Junichi; Tsunoda, Kin-ichi; Yamada, Seiko; Takeda, Shigeki

    2011-01-01

    The C-terminal domain of a bacteriophage P2 tail-spike protein, gpV, was crystallized and its structure was solved at 1.27 Å resolution. The refined model showed a triple β-helix structure and the presence of iron, calcium and chloride ions. The adsorption and infection of bacteriophage P2 is mediated by tail fibres and tail spikes. The tail spikes on the tail baseplate are used to irreversibly adsorb to the host cells. Recently, a P2 phage tail-spike protein, gpV, was purified and it was shown that a C-terminal domain, Ser87–Leu211, is sufficient for the binding of gpV to host Escherichia coli membranes [Kageyama et al. (2009 ▶), Biochemistry, 48, 10129–10135]. In this paper, the crystal structure of the C-terminal domain of P2 gpV is reported. The structure is a triangular pyramid and looks like a spearhead composed of an intertwined β-sheet, a triple β-helix and a metal-binding region containing iron, calcium and chloride ions

  3. Structural insights into human peroxisome proliferator activated receptor delta (PPAR-delta selective ligand binding.

    Directory of Open Access Journals (Sweden)

    Fernanda A H Batista

    Full Text Available Peroxisome proliferator activated receptors (PPARs δ, α and γ are closely related transcription factors that exert distinct effects on fatty acid and glucose metabolism, cardiac disease, inflammatory response and other processes. Several groups developed PPAR subtype specific modulators to trigger desirable effects of particular PPARs without harmful side effects associated with activation of other subtypes. Presently, however, many compounds that bind to one of the PPARs cross-react with others and rational strategies to obtain highly selective PPAR modulators are far from clear. GW0742 is a synthetic ligand that binds PPARδ more than 300-fold more tightly than PPARα or PPARγ but the structural basis of PPARδ:GW0742 interactions and reasons for strong selectivity are not clear. Here we report the crystal structure of the PPARδ:GW0742 complex. Comparisons of the PPARδ:GW0742 complex with published structures of PPARs in complex with α and γ selective agonists and pan agonists suggests that two residues (Val312 and Ile328 in the buried hormone binding pocket play special roles in PPARδ selective binding and experimental and computational analysis of effects of mutations in these residues confirms this and suggests that bulky substituents that line the PPARα and γ ligand binding pockets as structural barriers for GW0742 binding. This analysis suggests general strategies for selective PPARδ ligand design.

  4. Prediction of trypsin/molecular fragment binding affinities by free energy decomposition and empirical scores

    Science.gov (United States)

    Benson, Mark L.; Faver, John C.; Ucisik, Melek N.; Dashti, Danial S.; Zheng, Zheng; Merz, Kenneth M.

    2012-05-01

    Two families of binding affinity estimation methodologies are described which were utilized in the SAMPL3 trypsin/fragment binding affinity challenge. The first is a free energy decomposition scheme based on a thermodynamic cycle, which included separate contributions from enthalpy and entropy of binding as well as a solvent contribution. Enthalpic contributions were estimated with PM6-DH2 semiempirical quantum mechanical interaction energies, which were modified with a statistical error correction procedure. Entropic contributions were estimated with the rigid-rotor harmonic approximation, and solvent contributions to the free energy were estimated with several different methods. The second general methodology is the empirical score LISA, which contains several physics-based terms trained with the large PDBBind database of protein/ligand complexes. Here we also introduce LISA+, an updated version of LISA which, prior to scoring, classifies systems into one of four classes based on a ligand's hydrophobicity and molecular weight. Each version of the two methodologies (a total of 11 methods) was trained against a compiled set of known trypsin binders available in the Protein Data Bank to yield scaling parameters for linear regression models. Both raw and scaled scores were submitted to SAMPL3. Variants of LISA showed relatively low absolute errors but also low correlation with experiment, while the free energy decomposition methods had modest success when scaling factors were included. Nonetheless, re-scaled LISA yielded the best predictions in the challenge in terms of RMS error, and six of these models placed in the top ten best predictions by RMS error. This work highlights some of the difficulties of predicting binding affinities of small molecular fragments to protein receptors as well as the benefit of using training data.

  5. Modification of DNA radiolysis by DNA-binding proteins: Structural aspects

    International Nuclear Information System (INIS)

    Davidkova, M.; Stisova, V.; Goffinont, S.; Gillard, N.; Castaing, B.; Spotheim-Maurizot, M.

    2006-01-01

    Formation of specific complexes between proteins and their cognate DNA modulates the yields and the location of radiation damage on both partners of the complex. The radiolysis of DNA-protein complexes is studied for: (1) the Escherichia coli lactose operator-repressor complex, (2) the complex between DNA bearing an analogue of an abasic site and the repair protein Fpg of Lactococcus lactis. Experimental patterns of DNA damages are presented and compared to predicted damage distribution obtained using an improved version of the stochastic model RADACK. The same method is used for predicting the location of damages on the proteins. At doses lower than a threshold that depends on the system, proteins protect their specific binding site on DNA while at high doses, the studied complexes are disrupted mainly through protein damage. The loss of binding ability is the functional consequence of the amino-acids modification by OH . radicals. Many of the most probably damaged amino acids are essential for the DNA-protein interaction and within a complex are protected by DNA. (authors)

  6. Isoforms of retinol binding protein 4 (RBP4) are increased in chronic diseases of the kidney but not of the liver

    DEFF Research Database (Denmark)

    Frey, Simone K; Nagl, Britta; Henze, Andrea

    2008-01-01

    disease (CLD) RBP4 levels decrease. Little is known about RBP4 isoforms including apo-RBP4, holo-RBP4 as well as RBP4 truncated at the C-terminus (RBP4-L and RBP4-LL) except that RBP4 isoforms have been reported to be increased in hemodialysis patients. Since it is not known whether CLD influence RBP4...... isoforms, we investigated RBP4 levels, apo- and holo-RBP4 as well as RBP4-L and RBP4-LL in plasma of 36 patients suffering from CKD, in 55 CLD patients and in 50 control subjects. RBP4 was determined by ELISA and apo- and holo-RBP4 by native polyacrylamide gel electrophoresis (PAGE). RBP4-L and RBP4-LL...

  7. Structures of Orf Virus Chemokine Binding Protein in Complex with Host Chemokines Reveal Clues to Broad Binding Specificity.

    Science.gov (United States)

    Couñago, Rafael M; Knapp, Karen M; Nakatani, Yoshio; Fleming, Stephen B; Corbett, Michael; Wise, Lyn M; Mercer, Andrew A; Krause, Kurt L

    2015-07-07

    The chemokine binding protein (CKBP) from orf virus (ORFV) binds with high affinity to chemokines from three classes, C, CC, and CXC, making it unique among poxvirus CKBPs described to date. We present its crystal structure alone and in complex with three CC chemokines, CCL2, CCL3, and CCL7. ORFV CKBP possesses a β-sandwich fold that is electrostatically and sterically complementary to its binding partners. Chemokines bind primarily through interactions involving the N-terminal loop and a hydrophobic recess on the ORFV CKBP β-sheet II surface, and largely polar interactions between the chemokine 20s loop and a negatively charged surface groove located at one end of the CKBP β-sheet II surface. ORFV CKBP interacts with leukocyte receptor and glycosaminoglycan binding sites found on the surface of bound chemokines. SEC-MALLS and chromatographic evidence is presented supporting that ORFV CKBP is a dimer in solution over a broad range of protein concentrations. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. SVM prediction of ligand-binding sites in bacterial lipoproteins employing shape and physio-chemical descriptors.

    Science.gov (United States)

    Kadam, Kiran; Prabhakar, Prashant; Jayaraman, V K

    2012-11-01

    Bacterial lipoproteins play critical roles in various physiological processes including the maintenance of pathogenicity and numbers of them are being considered as potential candidates for generating novel vaccines. In this work, we put forth an algorithm to identify and predict ligand-binding sites in bacterial lipoproteins. The method uses three types of pocket descriptors, namely fpocket descriptors, 3D Zernike descriptors and shell descriptors, and combines them with Support Vector Machine (SVM) method for the classification. The three types of descriptors represent shape-based properties of the pocket as well as its local physio-chemical features. All three types of descriptors, along with their hybrid combinations are evaluated with SVM and to improve classification performance, WEKA-InfoGain feature selection is applied. Results obtained in the study show that the classifier successfully differentiates between ligand-binding and non-binding pockets. For the combination of three types of descriptors, 10 fold cross-validation accuracy of 86.83% is obtained for training while the selected model achieved test Matthews Correlation Coefficient (MCC) of 0.534. Individually or in combination with new and existing methods, our model can be a very useful tool for the prediction of potential ligand-binding sites in bacterial lipoproteins.

  9. Structural aspects of inotropic bipyridine binding. Crystal structure determination to 1.9 A of the human serum transthyretin-milrinone complex.

    Science.gov (United States)

    Wojtczak, A; Luft, J R; Cody, V

    1993-03-25

    The crystal structure of human transthyretin (TTR) complexed with milrinone (2-methyl-5-cyano-3,4'-bipyridin-6(1H)-one), a positive inotropic cardiac agent, has been refined to R = 17.4% for 8-1.9-A resolution data. This report provides the first detailed description of protein interactions for an inotropic bipyridine agent which is an effective thyroid hormone binding competitor to transthyretin. Milrinone is bound along the 2-fold axis in the binding site with its substituted pyridone ring located deep within the channel of the two identical binding domains of the TTR tetramer. In this orientation the 5-cyano group occupies the same site as the 3'-iodine in the TTR complex with 3,3'-diiodothyronine (Wojtczak, A., Luft, J., and Cody, V. (1992) J. Biol. Chem. 267, 353-357), which is 3.5 A deeper in the channel than thyroxine (Blake, C. C. F., and Oately, S. J., (1977) Nature 268, 115-120). These structural results confirm computer modeling studies of milrinone structural homology with thyroxine and its TTR binding interactions and explain the effectiveness of milrinone competition for thyroxine binding to TTR. To understand the weaker binding affinity of the parent inotropic drug, amrinone (5-amino-3,4'-bipyridin-6(1H)-one), modeling studies of its TTR binding were carried out which indicate that the 5-amino group cannot participate in strong interactions with TTR and the lack of the 2-methyl further weakens amrinone binding.

  10. Structural characterization of Staphylococcus aureus biotin protein ligase and interaction partners: an antibiotic target.

    Science.gov (United States)

    Pendini, Nicole R; Yap, Min Y; Traore, D A K; Polyak, Steven W; Cowieson, Nathan P; Abell, Andrew; Booker, Grant W; Wallace, John C; Wilce, Jacqueline A; Wilce, Matthew C J

    2013-06-01

    The essential metabolic enzyme biotin protein ligase (BPL) is a potential target for the development of new antibiotics required to combat drug-resistant pathogens. Staphylococcus aureus BPL (SaBPL) is a bifunctional protein, possessing both biotin ligase and transcription repressor activities. This positions BPL as a key regulator of several important metabolic pathways. Here, we report the structural analysis of both holo- and apo-forms of SaBPL using X-ray crystallography. We also present small-angle X-ray scattering data of SaBPL in complex with its biotin-carboxyl carrier protein substrate as well as the SaBPL:DNA complex that underlies repression. This has revealed the molecular basis of ligand (biotinyl-5'-AMP) binding and conformational changes associated with catalysis and repressor function. These data provide new information to better understand the bifunctional activities of SaBPL and to inform future strategies for antibiotic discovery. © 2013 The Protein Society.

  11. Factors correlating with significant differences between X-ray structures of myoglobin

    International Nuclear Information System (INIS)

    Rashin, Alexander A.; Domagalski, Marcin J.; Zimmermann, Michael T.; Minor, Wladek; Chruszcz, Maksymilian; Jernigan, Robert L.

    2014-01-01

    thresholds. The binding of unusual ligands by myoglobin, leading to crystal-induced distortions, suggests that some of the conformational differences between the apo and holo structures might not be ‘functionally important’ but rather artifacts caused by the binding of ‘unusual’ substrate analogs. The causes of P6 symmetry in myoglobin crystals and the relationship between crystal and solution structures are also discussed

  12. Factors correlating with significant differences between X-ray structures of myoglobin

    Energy Technology Data Exchange (ETDEWEB)

    Rashin, Alexander A., E-mail: alexander-rashin@hotmail.com [BioChemComp Inc., 543 Sagamore Avenue, Teaneck, NJ 07666 (United States); Iowa State University, 112 Office and Lab Bldg, Ames, IA 50011-3020 (United States); Domagalski, Marcin J. [University of Virginia, 1340 Jefferson Park Avenue, Jordan Hall, Room 4223, Charlottesville, VA 22908 (United States); Zimmermann, Michael T. [Iowa State University, 112 Office and Lab Bldg, Ames, IA 50011-3020 (United States); Minor, Wladek [University of Virginia, 1340 Jefferson Park Avenue, Jordan Hall, Room 4223, Charlottesville, VA 22908 (United States); Chruszcz, Maksymilian [University of Virginia, 1340 Jefferson Park Avenue, Jordan Hall, Room 4223, Charlottesville, VA 22908 (United States); University of South Carolina, 631 Sumter Street, Columbia, SC 29208 (United States); Jernigan, Robert L. [Iowa State University, 112 Office and Lab Bldg, Ames, IA 50011-3020 (United States); BioChemComp Inc., 543 Sagamore Avenue, Teaneck, NJ 07666 (United States)

    2014-02-01

    thresholds. The binding of unusual ligands by myoglobin, leading to crystal-induced distortions, suggests that some of the conformational differences between the apo and holo structures might not be ‘functionally important’ but rather artifacts caused by the binding of ‘unusual’ substrate analogs. The causes of P6 symmetry in myoglobin crystals and the relationship between crystal and solution structures are also discussed.

  13. Structural and binding studies of SAP-1 protein with heparin.

    Science.gov (United States)

    Yadav, Vikash K; Mandal, Rahul S; Puniya, Bhanwar L; Kumar, Rahul; Dey, Sharmistha; Singh, Sarman; Yadav, Savita

    2015-03-01

    SAP-1 is a low molecular weight cysteine protease inhibitor (CPI) which belongs to type-2 cystatins family. SAP-1 protein purified from human seminal plasma (HuSP) has been shown to inhibit cysteine and serine proteases and exhibit interesting biological properties, including high temperature and pH stability. Heparin is a naturally occurring glycosaminoglycan (with varied chain length) which interacts with a number of proteins and regulates multiple steps in different biological processes. As an anticoagulant, heparin enhances inhibition of thrombin by the serpin antithrombin III. Therefore, we have employed surface plasmon resonance (SPR) to improve our understanding of the binding interaction between heparin and SAP-1 (protease inhibitor). SPR data suggest that SAP-1 binds to heparin with a significant affinity (KD = 158 nm). SPR solution competition studies using heparin oligosaccharides showed that the binding of SAP-1 to heparin is dependent on chain length. Large oligosaccharides show strong binding affinity for SAP-1. Further to get insight into the structural aspect of interactions between SAP-1 and heparin, we used modelled structure of the SAP-1 and docked with heparin and heparin-derived polysaccharides. The results suggest that a positively charged residue lysine plays important role in these interactions. Such information should improve our understanding of how heparin, present in the reproductive tract, regulates cystatins activity. © 2014 John Wiley & Sons A/S.

  14. Structure of the caspase-recruitment domain from a zebrafish guanylate-binding protein

    International Nuclear Information System (INIS)

    Jin, Tengchuan; Huang, Mo; Smith, Patrick; Jiang, Jiansheng; Xiao, T. Sam

    2013-01-01

    The crystal structure of the first zebrafish caspase-recruitment domain at 1.47 Å resolution illustrates a six-helix bundle fold similar to that of the human NLRP1 CARD. The caspase-recruitment domain (CARD) mediates homotypic protein–protein interactions that assemble large oligomeric signaling complexes such as the inflammasomes during innate immune responses. Structural studies of the mammalian CARDs demonstrate that their six-helix bundle folds belong to the death-domain superfamily, whereas such studies have not been reported for other organisms. Here, the zebrafish interferon-induced guanylate-binding protein 1 (zIGBP1) was identified that contains an N-terminal GTPase domain and a helical domain typical of the mammalian guanylate-binding proteins, followed by a FIIND domain and a C-terminal CARD similar to the mammalian inflammasome proteins NLRP1 and CARD8. The structure of the zIGBP1 CARD as a fusion with maltose-binding protein was determined at 1.47 Å resolution. This revealed a six-helix bundle fold similar to the NLRP1 CARD structure with the bent α1 helix typical of all known CARD structures. The zIGBP1 CARD surface contains a positively charged patch near its α1 and α4 helices and a negatively charged patch near its α2, α3 and α5 helices, which may mediate its interaction with partner domains. Further studies using binding assays and other analyses will be required in order to address the physiological function(s) of this zebrafish protein

  15. Crystal structure of Arabidopsis thaliana Dawdle forkhead-associated domain reveals a conserved phospho-threonine recognition cleft for dicer-like 1 binding.

    Science.gov (United States)

    Machida, Satoru; Yuan, Y Adam

    2013-07-01

    Dawdle (DDL) is a microRNA processing protein essential for the development of Arabidopsis. DDL contains a putative nuclear localization signal at its amino-terminus and forkhead-associated (FHA) domain at the carboxyl-terminus. Here, we report the crystal structure of the FHA domain of Arabidopsis Dawdle, determined by multiple-wavelength anomalous dispersion method at 1.7-Å resolution. DDL FHA structure displays a seven-stranded β-sandwich architecture that contains a unique structural motif comprising two long anti-parallel strands. Strikingly, crystal packing of the DDL FHA domain reveals that a glutamate residue from the symmetry-related DDL FHA domain, a structural mimic of the phospho-threonine, is specifically recognized by the structurally conserved phospho-threonine binding cleft. Consistently with the structural observations, co-immuno-precipitation experiments performed in Nicotiana benthamiana show that the DDL FHA domain co-immuno-precipitates with DCL1 fragments containing the predicted pThr+3(Ile/Val/Leu/Asp) motif. Taken together, we count the recognition of the target residue by the canonical binding cleft of the DDL FHA domain as the key molecular event to instate FHA domain-mediated protein-protein interaction in plant miRNA processing.

  16. The structure of cytomegalovirus immune modulator UL141 highlights structural Ig-fold versatility for receptor binding

    Energy Technology Data Exchange (ETDEWEB)

    Nemčovičová, Ivana [La Jolla Institute for Allergy and Immunology, 9420 Athena Circle, La Jolla, CA 92037 (United States); Slovak Academy of Sciences, Dúbravská cesta 9, SK 84505 Bratislava (Slovakia); Zajonc, Dirk M., E-mail: dzajonc@liai.org [La Jolla Institute for Allergy and Immunology, 9420 Athena Circle, La Jolla, CA 92037 (United States)

    2014-03-01

    The crystal structure of Human cytomegalovirus immune modulator UL141 was solved at 3.25 Å resolution. Here, a detailed analysis of its intimate dimerization interface and the biophysical properties of its receptor (TRAIL-R2 and CD155) binding interactions are presented. Natural killer (NK) cells are critical components of the innate immune system as they rapidly detect and destroy infected cells. To avoid immune recognition and to allow long-term persistence in the host, Human cytomegalovirus (HCMV) has evolved a number of genes to evade or inhibit immune effector pathways. In particular, UL141 can inhibit cell-surface expression of both the NK cell-activating ligand CD155 as well as the TRAIL death receptors (TRAIL-R1 and TRAIL-R2). The crystal structure of unliganded HCMV UL141 refined to 3.25 Å resolution allowed analysis of its head-to-tail dimerization interface. A ‘dimerization-deficient’ mutant of UL141 (ddUL141) was further designed, which retained the ability to bind to TRAIL-R2 or CD155 while losing the ability to cross-link two receptor monomers. Structural comparison of unliganded UL141 with UL141 bound to TRAIL-R2 further identified a mobile loop that makes intimate contacts with TRAIL-R2 upon receptor engagement. Superposition of the Ig-like domain of UL141 on the CD155 ligand T-cell immunoreceptor with Ig and ITIM domains (TIGIT) revealed that UL141 can potentially engage CD155 similar to TIGIT by using the C′C′′ and GF loops. Further mutations in the TIGIT binding site of CD155 (Q63R and F128R) abrogated UL141 binding, suggesting that the Ig-like domain of UL141 is a viral mimic of TIGIT, as it targets the same binding site on CD155 using similar ‘lock-and-key’ interactions. Sequence alignment of the UL141 gene and its orthologues also showed conservation in this highly hydrophobic (L/A)X{sub 6}G ‘lock’ motif for CD155 binding as well as conservation of the TRAIL-R2 binding patches, suggesting that these host

  17. The structure of cytomegalovirus immune modulator UL141 highlights structural Ig-fold versatility for receptor binding

    International Nuclear Information System (INIS)

    Nemčovičová, Ivana; Zajonc, Dirk M.

    2014-01-01

    The crystal structure of Human cytomegalovirus immune modulator UL141 was solved at 3.25 Å resolution. Here, a detailed analysis of its intimate dimerization interface and the biophysical properties of its receptor (TRAIL-R2 and CD155) binding interactions are presented. Natural killer (NK) cells are critical components of the innate immune system as they rapidly detect and destroy infected cells. To avoid immune recognition and to allow long-term persistence in the host, Human cytomegalovirus (HCMV) has evolved a number of genes to evade or inhibit immune effector pathways. In particular, UL141 can inhibit cell-surface expression of both the NK cell-activating ligand CD155 as well as the TRAIL death receptors (TRAIL-R1 and TRAIL-R2). The crystal structure of unliganded HCMV UL141 refined to 3.25 Å resolution allowed analysis of its head-to-tail dimerization interface. A ‘dimerization-deficient’ mutant of UL141 (ddUL141) was further designed, which retained the ability to bind to TRAIL-R2 or CD155 while losing the ability to cross-link two receptor monomers. Structural comparison of unliganded UL141 with UL141 bound to TRAIL-R2 further identified a mobile loop that makes intimate contacts with TRAIL-R2 upon receptor engagement. Superposition of the Ig-like domain of UL141 on the CD155 ligand T-cell immunoreceptor with Ig and ITIM domains (TIGIT) revealed that UL141 can potentially engage CD155 similar to TIGIT by using the C′C′′ and GF loops. Further mutations in the TIGIT binding site of CD155 (Q63R and F128R) abrogated UL141 binding, suggesting that the Ig-like domain of UL141 is a viral mimic of TIGIT, as it targets the same binding site on CD155 using similar ‘lock-and-key’ interactions. Sequence alignment of the UL141 gene and its orthologues also showed conservation in this highly hydrophobic (L/A)X 6 G ‘lock’ motif for CD155 binding as well as conservation of the TRAIL-R2 binding patches, suggesting that these host–receptor interactions

  18. Recognition of anesthetic barbiturates by a protein binding site: a high resolution structural analysis.

    Directory of Open Access Journals (Sweden)

    Simon Oakley

    Full Text Available Barbiturates potentiate GABA actions at the GABA(A receptor and act as central nervous system depressants that can induce effects ranging from sedation to general anesthesia. No structural information has been available about how barbiturates are recognized by their protein targets. For this reason, we tested whether these drugs were able to bind specifically to horse spleen apoferritin, a model protein that has previously been shown to bind many anesthetic agents with affinities that are closely correlated with anesthetic potency. Thiopental, pentobarbital, and phenobarbital were all found to bind to apoferritin with affinities ranging from 10-500 µM, approximately matching the concentrations required to produce anesthetic and GABAergic responses. X-ray crystal structures were determined for the complexes of apoferritin with thiopental and pentobarbital at resolutions of 1.9 and 2.0 Å, respectively. These structures reveal that the barbiturates bind to a cavity in the apoferritin shell that also binds haloalkanes, halogenated ethers, and propofol. Unlike these other general anesthetics, however, which rely entirely upon van der Waals interactions and the hydrophobic effect for recognition, the barbiturates are recognized in the apoferritin site using a mixture of both polar and nonpolar interactions. These results suggest that any protein binding site that is able to recognize and respond to the chemically and structurally diverse set of compounds used as general anesthetics is likely to include a versatile mixture of both polar and hydrophobic elements.

  19. Synthesis, X-ray crystal structure, DNA binding and Nuclease activity ...

    Indian Academy of Sciences (India)

    s12039-016-1125-x. Synthesis, X-ray crystal structure, DNA binding and Nuclease activity of lanthanide(III) complexes of 2-benzoylpyridine acetylhydrazone. KARREDDULA RAJA, AKKILI SUSEELAMMA and KATREDDI HUSSAIN REDDY. ∗.

  20. Predicting RNA Structure Using Mutual Information

    DEFF Research Database (Denmark)

    Freyhult, E.; Moulton, V.; Gardner, P. P.

    2005-01-01

    , to display and predict conserved RNA secondary structure (including pseudoknots) from an alignment. Results: We show that MIfold can be used to predict simple pseudoknots, and that the performance can be adjusted to make it either more sensitive or more selective. We also demonstrate that the overall...... package. Conclusion: MIfold provides a useful supplementary tool to programs such as RNA Structure Logo, RNAalifold and COVE, and should be useful for automatically generating structural predictions for databases such as Rfam. Availability: MIfold is freely available from http......Background: With the ever-increasing number of sequenced RNAs and the establishment of new RNA databases, such as the Comparative RNA Web Site and Rfam, there is a growing need for accurately and automatically predicting RNA structures from multiple alignments. Since RNA secondary structure...

  1. The crystal structure and RNA-binding of an orthomyxovirus nucleoprotein.

    Directory of Open Access Journals (Sweden)

    Wenjie Zheng

    2013-09-01

    Full Text Available Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV (genus Isavirus. As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV NP forms stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ~12 nts of RNA, shorter than the 24-28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions.

  2. Structure of the Nucleoprotein Binding Domain of Mokola Virus Phosphoprotein▿

    Science.gov (United States)

    Assenberg, René; Delmas, Olivier; Ren, Jingshan; Vidalain, Pierre-Olivier; Verma, Anil; Larrous, Florence; Graham, Stephen C.; Tangy, Frédéric; Grimes, Jonathan M.; Bourhy, Hervé

    2010-01-01

    Mokola virus (MOKV) is a nonsegmented, negative-sense RNA virus that belongs to the Lyssavirus genus and Rhabdoviridae family. MOKV phosphoprotein P is an essential component of the replication and transcription complex and acts as a cofactor for the viral RNA-dependent RNA polymerase. P recruits the viral polymerase to the nucleoprotein-bound viral RNA (N-RNA) via an interaction between its C-terminal domain and the N-RNA complex. Here we present a structure for this domain of MOKV P, obtained by expression of full-length P in Escherichia coli, which was subsequently truncated during crystallization. The structure has a high degree of homology with P of rabies virus, another member of Lyssavirus genus, and to a lesser degree with P of vesicular stomatitis virus (VSV), a member of the related Vesiculovirus genus. In addition, analysis of the crystal packing of this domain reveals a potential binding site for the nucleoprotein N. Using both site-directed mutagenesis and yeast two-hybrid experiments to measure P-N interaction, we have determined the relative roles of key amino acids involved in this interaction to map the region of P that binds N. This analysis also reveals a structural relationship between the N-RNA binding domain of the P proteins of the Rhabdoviridae and the Paramyxoviridae. PMID:19906936

  3. Solution structure of telomere binding domain of AtTRB2 derived from Arabidopsis thaliana

    International Nuclear Information System (INIS)

    Yun, Ji-Hye; Lee, Won Kyung; Kim, Heeyoun; Kim, Eunhee; Cheong, Chaejoon; Cho, Myeon Haeng; Lee, Weontae

    2014-01-01

    Highlights: • We have determined solution structure of Myb domain of AtTRB2. • The Myb domain of AtTRB2 is located in the N-terminal region. • The Myb domain of AtTRB2 binds to plant telomeric DNA without fourth helix. • Helix 2 and 3 of the Myb domain of AtTRB2 are involved in DNA recognition. • AtTRB2 is a novel protein distinguished from other known plant TBP. - Abstract: Telomere homeostasis is regulated by telomere-associated proteins, and the Myb domain is well conserved for telomere binding. AtTRB2 is a member of the SMH (Single-Myb-Histone)-like family in Arabidopsis thaliana, having an N-terminal Myb domain, which is responsible for DNA binding. The Myb domain of AtTRB2 contains three α-helices and loops for DNA binding, which is unusual given that other plant telomere-binding proteins have an additional fourth helix that is essential for DNA binding. To understand the structural role for telomeric DNA binding of AtTRB2, we determined the solution structure of the Myb domain of AtTRB2 (AtTRB2 1–64 ) using nuclear magnetic resonance (NMR) spectroscopy. In addition, the inter-molecular interaction between AtTRB2 1–64 and telomeric DNA has been characterized by the electrophoretic mobility shift assay (EMSA) and NMR titration analyses for both plant (TTTAGGG)n and human (TTAGGG)n telomere sequences. Data revealed that Trp28, Arg29, and Val47 residues located in Helix 2 and Helix 3 are crucial for DNA binding, which are well conserved among other plant telomere binding proteins. We concluded that although AtTRB2 is devoid of the additional fourth helix in the Myb-extension domain, it is able to bind to plant telomeric repeat sequences as well as human telomeric repeat sequences

  4. Molecular mechanisms in the activation of abscisic acid receptor PYR1.

    Directory of Open Access Journals (Sweden)

    Lyudmyla Dorosh

    Full Text Available The pyrabactin resistance 1 (PYR1/PYR1-like (PYL/regulatory component of abscisic acid (ABA response (RCAR proteins comprise a well characterized family of ABA receptors. Recent investigations have revealed two subsets of these receptors that, in the absence of ABA, either form inactive homodimers (PYR1 and PYLs 1-3 or mediate basal inhibition of downstream target type 2C protein phosphatases (PP2Cs; PYLs 4-10 respectively in vitro. Addition of ABA has been shown to release the apo-homodimers yielding ABA-bound monomeric holo-receptors that can interact with PP2Cs; highlighting a competitive-interaction process. Interaction selectivity has been shown to be mediated by subtle structural variations of primary sequence and ligand binding effects. Now, the dynamical contributions of ligand binding on interaction selectivity are investigated through extensive molecular dynamics (MD simulations of apo and holo-PYR1 in monomeric and dimeric form as well as in complex with a PP2C, homology to ABA insensitive 1 (HAB1. Robust comparative interpretations were enabled by a novel essential collective dynamics approach. In agreement with recent experimental findings, our analysis indicates that ABA-bound PYR1 should efficiently bind to HAB1. However, both ABA-bound and ABA-extracted PYR1-HAB1 constructs have demonstrated notable similarities in their dynamics, suggesting that apo-PYR1 should also be able to make a substantial interaction with PP2Cs, albeit likely with slower complex formation kinetics. Further analysis indicates that both ABA-bound and ABA-free PYR1 in complex with HAB1 exhibit a higher intra-molecular structural stability and stronger inter-molecular dynamic correlations, in comparison with either holo- or apo-PYR1 dimers, supporting a model that includes apo-PYR1 in complex with HAB1. This possibility of a conditional functional apo-PYR1-PP2C complex was validated in vitro. These findings are generally consistent with the competitive

  5. Structure of photosystem II and substrate binding at room temperature.

    Science.gov (United States)

    Young, Iris D; Ibrahim, Mohamed; Chatterjee, Ruchira; Gul, Sheraz; Fuller, Franklin; Koroidov, Sergey; Brewster, Aaron S; Tran, Rosalie; Alonso-Mori, Roberto; Kroll, Thomas; Michels-Clark, Tara; Laksmono, Hartawan; Sierra, Raymond G; Stan, Claudiu A; Hussein, Rana; Zhang, Miao; Douthit, Lacey; Kubin, Markus; de Lichtenberg, Casper; Long Vo, Pham; Nilsson, Håkan; Cheah, Mun Hon; Shevela, Dmitriy; Saracini, Claudio; Bean, Mackenzie A; Seuffert, Ina; Sokaras, Dimosthenis; Weng, Tsu-Chien; Pastor, Ernest; Weninger, Clemens; Fransson, Thomas; Lassalle, Louise; Bräuer, Philipp; Aller, Pierre; Docker, Peter T; Andi, Babak; Orville, Allen M; Glownia, James M; Nelson, Silke; Sikorski, Marcin; Zhu, Diling; Hunter, Mark S; Lane, Thomas J; Aquila, Andy; Koglin, Jason E; Robinson, Joseph; Liang, Mengning; Boutet, Sébastien; Lyubimov, Artem Y; Uervirojnangkoorn, Monarin; Moriarty, Nigel W; Liebschner, Dorothee; Afonine, Pavel V; Waterman, David G; Evans, Gwyndaf; Wernet, Philippe; Dobbek, Holger; Weis, William I; Brunger, Axel T; Zwart, Petrus H; Adams, Paul D; Zouni, Athina; Messinger, Johannes; Bergmann, Uwe; Sauter, Nicholas K; Kern, Jan; Yachandra, Vittal K; Yano, Junko

    2016-12-15

    Light-induced oxidation of water by photosystem II (PS II) in plants, algae and cyanobacteria has generated most of the dioxygen in the atmosphere. PS II, a membrane-bound multi-subunit pigment protein complex, couples the one-electron photochemistry at the reaction centre with the four-electron redox chemistry of water oxidation at the Mn 4 CaO 5 cluster in the oxygen-evolving complex (OEC). Under illumination, the OEC cycles through five intermediate S-states (S 0 to S 4 ), in which S 1 is the dark-stable state and S 3 is the last semi-stable state before O-O bond formation and O 2 evolution. A detailed understanding of the O-O bond formation mechanism remains a challenge, and will require elucidation of both the structures of the OEC in the different S-states and the binding of the two substrate waters to the catalytic site. Here we report the use of femtosecond pulses from an X-ray free electron laser (XFEL) to obtain damage-free, room temperature structures of dark-adapted (S 1 ), two-flash illuminated (2F; S 3 -enriched), and ammonia-bound two-flash illuminated (2F-NH 3 ; S 3 -enriched) PS II. Although the recent 1.95 Å resolution structure of PS II at cryogenic temperature using an XFEL provided a damage-free view of the S 1 state, measurements at room temperature are required to study the structural landscape of proteins under functional conditions, and also for in situ advancement of the S-states. To investigate the water-binding site(s), ammonia, a water analogue, has been used as a marker, as it binds to the Mn 4 CaO 5 cluster in the S 2 and S 3 states. Since the ammonia-bound OEC is active, the ammonia-binding Mn site is not a substrate water site. This approach, together with a comparison of the native dark and 2F states, is used to discriminate between proposed O-O bond formation mechanisms.

  6. Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

    Science.gov (United States)

    Kinjo, Akira R.; Nakamura, Haruki

    2012-01-01

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478

  7. Structural Determinants of Specific Lipid Binding to Potassium Channels

    NARCIS (Netherlands)

    Weingarth, M.H.|info:eu-repo/dai/nl/330985655; Prokofyev, A.; van der Cruijsen, E.A.W.|info:eu-repo/dai/nl/330826743; Nand, D.|info:eu-repo/dai/nl/337731403; Bonvin, A.M.J.J.|info:eu-repo/dai/nl/113691238; Pongs, O.; Baldus, M.|info:eu-repo/dai/nl/314410864

    2013-01-01

    We have investigated specific lipid binding to the pore domain of potassium channels KcsA and chimeric KcsAKv1.3 on the structural and functional level using extensive coarse-grained and atomistic molecular dynamics simulations, solid-state NMR, and single channel measurements. We show that, while

  8. Predicted MHC peptide binding promiscuity explains MHC class I 'hotspots' of antigen presentation defined by mass spectrometry eluted ligand data.

    Science.gov (United States)

    Jappe, Emma Christine; Kringelum, Jens; Trolle, Thomas; Nielsen, Morten

    2018-02-15

    Peptides that bind to and are presented by MHC class I and class II molecules collectively make up the immunopeptidome. In the context of vaccine development, an understanding of the immunopeptidome is essential, and much effort has been dedicated to its accurate and cost-effective identification. Current state-of-the-art methods mainly comprise in silico tools for predicting MHC binding, which is strongly correlated with peptide immunogenicity. However, only a small proportion of the peptides that bind to MHC molecules are, in fact, immunogenic, and substantial work has been dedicated to uncovering additional determinants of peptide immunogenicity. In this context, and in light of recent advancements in mass spectrometry (MS), the existence of immunological hotspots has been given new life, inciting the hypothesis that hotspots are associated with MHC class I peptide immunogenicity. We here introduce a precise terminology for defining these hotspots and carry out a systematic analysis of MS and in silico predicted hotspots. We find that hotspots defined from MS data are largely captured by peptide binding predictions, enabling their replication in silico. This leads us to conclude that hotspots, to a great degree, are simply a result of promiscuous HLA binding, which disproves the hypothesis that the identification of hotspots provides novel information in the context of immunogenic peptide prediction. Furthermore, our analyses demonstrate that the signal of ligand processing, although present in the MS data, has very low predictive power to discriminate between MS and in silico defined hotspots. © 2018 John Wiley & Sons Ltd.

  9. The primary structure of fatty-acid-binding protein from nurse shark liver. Structural and evolutionary relationship to the mammalian fatty-acid-binding protein family.

    Science.gov (United States)

    Medzihradszky, K F; Gibson, B W; Kaur, S; Yu, Z H; Medzihradszky, D; Burlingame, A L; Bass, N M

    1992-02-01

    The primary structure of a fatty-acid-binding protein (FABP) isolated from the liver of the nurse shark (Ginglymostoma cirratum) was determined by high-performance tandem mass spectrometry (employing multichannel array detection) and Edman degradation. Shark liver FABP consists of 132 amino acids with an acetylated N-terminal valine. The chemical molecular mass of the intact protein determined by electrospray ionization mass spectrometry (Mr = 15124 +/- 2.5) was in good agreement with that calculated from the amino acid sequence (Mr = 15121.3). The amino acid sequence of shark liver FABP displays significantly greater similarity to the FABP expressed in mammalian heart, peripheral nerve myelin and adipose tissue (61-53% sequence similarity) than to the FABP expressed in mammalian liver (22% similarity). Phylogenetic trees derived from the comparison of the shark liver FABP amino acid sequence with the members of the mammalian fatty-acid/retinoid-binding protein gene family indicate the initial divergence of an ancestral gene into two major subfamilies: one comprising the genes for mammalian liver FABP and gastrotropin, the other comprising the genes for mammalian cellular retinol-binding proteins I and II, cellular retinoic-acid-binding protein myelin P2 protein, adipocyte FABP, heart FABP and shark liver FABP, the latter having diverged from the ancestral gene that ultimately gave rise to the present day mammalian heart-FABP, adipocyte FABP and myelin P2 protein sequences. The sequence for intestinal FABP from the rat could be assigned to either subfamily, depending on the approach used for phylogenetic tree construction, but clearly diverged at a relatively early evolutionary time point. Indeed, sequences proximately ancestral or closely related to mammalian intestinal FABP, liver FABP, gastrotropin and the retinoid-binding group of proteins appear to have arisen prior to the divergence of shark liver FABP and should therefore also be present in elasmobranchs

  10. Structure prediction and activity analysis of human heme oxygenase-1 and its mutant.

    Science.gov (United States)

    Xia, Zhen-Wei; Zhou, Wen-Pu; Cui, Wen-Jun; Zhang, Xue-Hong; Shen, Qing-Xiang; Li, Yun-Zhu; Yu, Shan-Chang

    2004-08-15

    To predict wild human heme oxygenase-1 (whHO-1) and hHO-1 His25Ala mutant (delta hHO-1) structures, to clone and express them and analyze their activities. Swiss-PdbViewer and Antheprot 5.0 were used for the prediction of structure diversity and physical-chemical changes between wild and mutant hHO-1. hHO-1 His25Ala mutant cDNA was constructed by site-directed mutagenesis in two plasmids of E. coli DH5alpha. Expression products were purified by ammonium sulphate precipitation and Q-Sepharose Fast Flow column chromatography, and their activities were measured. rHO-1 had the structure of a helical fold with the heme sandwiched between heme-heme oxygenase-1 helices. Bond angle, dihedral angle and chemical bond in the active pocket changed after Ala25 was replaced by His25, but Ala25 was still contacting the surface and the electrostatic potential of the active pocket was negative. The mutated enzyme kept binding activity to heme. Two vectors pBHO-1 and pBHO-1(M) were constructed and expressed. Ammonium sulphate precipitation and column chromatography yielded 3.6-fold and 30-fold higher purities of whHO-1, respectively. The activity of delta hHO-1 was reduced 91.21% after mutation compared with whHO-1. Proximal His25 ligand is crucial for normal hHO-1 catalytic activity. delta hHO-1 is deactivated by mutation but keeps the same binding site as whHO-1. delta hHO-1 might be a potential inhibitor of whHO-1 for preventing neonatal hyperbilirubinemia.

  11. Structure and function of A41, a vaccinia virus chemokine binding protein.

    Directory of Open Access Journals (Sweden)

    Mohammad W Bahar

    2008-01-01

    Full Text Available The vaccinia virus (VACV A41L gene encodes a secreted 30 kDa glycoprotein that is nonessential for virus replication but affects the host response to infection. The A41 protein shares sequence similarity with another VACV protein that binds CC chemokines (called vCKBP, or viral CC chemokine inhibitor, vCCI, and strains of VACV lacking the A41L gene induced stronger CD8+ T-cell responses than control viruses expressing A41. Using surface plasmon resonance, we screened 39 human and murine chemokines and identified CCL21, CCL25, CCL26 and CCL28 as A41 ligands, with Kds of between 8 nM and 118 nM. Nonetheless, A41 was ineffective at inhibiting chemotaxis induced by these chemokines, indicating it did not block the interaction of these chemokines with their receptors. However the interaction of A41 and chemokines was inhibited in a dose-dependent manner by heparin, suggesting that A41 and heparin bind to overlapping sites on these chemokines. To better understand the mechanism of action of A41 its crystal structure was solved to 1.9 A resolution. The protein has a globular beta sandwich structure similar to that of the poxvirus vCCI family of proteins, but there are notable structural differences, particularly in surface loops and electrostatic charge distribution. Structural modelling suggests that the binding paradigm as defined for the vCCI-chemokine interaction is likely to be conserved between A41 and its chemokine partners. Additionally, sequence analysis of chemokines binding to A41 identified a signature for A41 binding. The biological and structural data suggest that A41 functions by forming moderately strong (nM interactions with certain chemokines, sufficient to interfere with chemokine-glycosaminoglycan interactions at the cell surface (microM-nM and thereby to destroy the chemokine concentration gradient, but not strong enough to disrupt the (pM chemokine-chemokine receptor interactions.

  12. The Structure of the Iron Binding Protein, FutA1, from Synechocystis 6803*

    International Nuclear Information System (INIS)

    Koropatkin, Nicole; Randich, Amelia M.; Bhattacharyya-Pakrasi, Maitrayee; Pakrasi, Himadri B.; Smith, Thomas J.

    2007-01-01

    Cyanobacteria account for a significant percentage of aquatic primary productivity even in areas where the concentrations of essential micronutrients are extremely low. To better understand the mechanism of iron selectivity and transport, the structure of the solute-binding domain of an ABC iron transporter, FutA1, was determined in the presence and absence of iron. The iron ion is bound within the 'C-clamp' structure via four tyrosine and one histidine residues. There are extensive interactions between these ligating residues and the rest of the protein such that the conformations of the side chains remain relatively unchanged as the iron is released by the opening of the metal binding cleft. This is in stark contrast to the zinc binding protein, ZnuA, where the domains of the metal binding protein remain relatively fixed while the ligating residues rotate out of the binding pocket upon metal release. The rotation of the domains in FutA1 is facilitated by two flexible β-strands running along the back of the protein that act like a hinge during domain motion. This motion may require relatively little energy since total contact area between the domains is the same whether the protein is in the open or closed conformation. Consistent with the pH dependency of iron binding, the main trigger for iron release is likely the histidine in the iron-binding site. Finally, neither FutA1 nor FutA2 binds iron as a siderophore complex or in the presence of anions and both preferentially bind ferrous over ferric ions

  13. Using remote substituents to control solution structure and anion binding in lanthanide complexes

    DEFF Research Database (Denmark)

    Tropiano, Manuel; Blackburn, Octavia A.; Tilney, James A.

    2013-01-01

    A study of the anion-binding properties of three structurally related lanthanide complexes, which all contain chemically identical anion-binding motifs, has revealed dramatic differences in their anion affinity. These arise as a consequence of changes in the substitution pattern on the periphery ...

  14. Structural consequences of cutting a binding loop: two circularly permuted variants of streptavidin

    International Nuclear Information System (INIS)

    Le Trong, Isolde; Chu, Vano; Xing, Yi; Lybrand, Terry P.; Stayton, Patrick S.; Stenkamp, Ronald E.

    2013-01-01

    The crystal structures of two circularly permuted streptavidins probe the role of a flexible loop in the tight binding of biotin. Molecular-dynamics calculations for one of the mutants suggests that increased fluctuations in a hydrogen bond between the protein and biotin are associated with cleavage of the binding loop. Circular permutation of streptavidin was carried out in order to investigate the role of a main-chain amide in stabilizing the high-affinity complex of the protein and biotin. Mutant proteins CP49/48 and CP50/49 were constructed to place new N-termini at residues 49 and 50 in a flexible loop involved in stabilizing the biotin complex. Crystal structures of the two mutants show that half of each loop closes over the binding site, as observed in wild-type streptavidin, while the other half adopts the open conformation found in the unliganded state. The structures are consistent with kinetic and thermodynamic data and indicate that the loop plays a role in enthalpic stabilization of the bound state via the Asn49 amide–biotin hydrogen bond. In wild-type streptavidin, the entropic penalties of immobilizing a flexible portion of the protein to enhance binding are kept to a manageable level by using a contiguous loop of medium length (six residues) which is already constrained by its anchorage to strands of the β-barrel protein. A molecular-dynamics simulation for CP50/49 shows that cleavage of the binding loop results in increased structural fluctuations for Ser45 and that these fluctuations destabilize the streptavidin–biotin complex

  15. Structure-based methods to predict mutational resistance to diarylpyrimidine non-nucleoside reverse transcriptase inhibitors.

    Science.gov (United States)

    Azeem, Syeda Maryam; Muwonge, Alecia N; Thakkar, Nehaben; Lam, Kristina W; Frey, Kathleen M

    2018-01-01

    Resistance to non-nucleoside reverse transcriptase inhibitors (NNRTIs) is a leading cause of HIV treatment failure. Often included in antiviral therapy, NNRTIs are chemically diverse compounds that bind an allosteric pocket of enzyme target reverse transcriptase (RT). Several new NNRTIs incorporate flexibility in order to compensate for lost interactions with amino acid conferring mutations in RT. Unfortunately, even successful inhibitors such as diarylpyrimidine (DAPY) inhibitor rilpivirine are affected by mutations in RT that confer resistance. In order to aid drug design efforts, it would be efficient and cost effective to pre-evaluate NNRTI compounds in development using a structure-based computational approach. As proof of concept, we applied a residue scan and molecular dynamics strategy using RT crystal structures to predict mutations that confer resistance to DAPYs rilpivirine, etravirine, and investigational microbicide dapivirine. Our predictive values, changes in affinity and stability, are correlative with fold-resistance data for several RT mutants. Consistent with previous studies, mutation K101P is predicted to confer high-level resistance to DAPYs. These findings were further validated using structural analysis, molecular dynamics, and an enzymatic reverse transcription assay. Our results confirm that changes in affinity and stability for mutant complexes are predictive parameters of resistance as validated by experimental and clinical data. In future work, we believe that this computational approach may be useful to predict resistance mutations for inhibitors in development. Published by Elsevier Inc.

  16. Improving the accuracy of protein secondary structure prediction using structural alignment

    Directory of Open Access Journals (Sweden)

    Gallin Warren J

    2006-06-01

    Full Text Available Abstract Background The accuracy of protein secondary structure prediction has steadily improved over the past 30 years. Now many secondary structure prediction methods routinely achieve an accuracy (Q3 of about 75%. We believe this accuracy could be further improved by including structure (as opposed to sequence database comparisons as part of the prediction process. Indeed, given the large size of the Protein Data Bank (>35,000 sequences, the probability of a newly identified sequence having a structural homologue is actually quite high. Results We have developed a method that performs structure-based sequence alignments as part of the secondary structure prediction process. By mapping the structure of a known homologue (sequence ID >25% onto the query protein's sequence, it is possible to predict at least a portion of that query protein's secondary structure. By integrating this structural alignment approach with conventional (sequence-based secondary structure methods and then combining it with a "jury-of-experts" system to generate a consensus result, it is possible to attain very high prediction accuracy. Using a sequence-unique test set of 1644 proteins from EVA, this new method achieves an average Q3 score of 81.3%. Extensive testing indicates this is approximately 4–5% better than any other method currently available. Assessments using non sequence-unique test sets (typical of those used in proteome annotation or structural genomics indicate that this new method can achieve a Q3 score approaching 88%. Conclusion By using both sequence and structure databases and by exploiting the latest techniques in machine learning it is possible to routinely predict protein secondary structure with an accuracy well above 80%. A program and web server, called PROTEUS, that performs these secondary structure predictions is accessible at http://wishart.biology.ualberta.ca/proteus. For high throughput or batch sequence analyses, the PROTEUS programs

  17. Crystal structure and DNA binding of the homeodomain of the stem cell transcription factor Nanog.

    Science.gov (United States)

    Jauch, Ralf; Ng, Calista Keow Leng; Saikatendu, Kumar Singh; Stevens, Raymond C; Kolatkar, Prasanna R

    2008-02-22

    The transcription factor Nanog is an upstream regulator in early mammalian development and a key determinant of pluripotency in embryonic stem cells. Nanog binds to promoter elements of hundreds of target genes and regulates their expression by an as yet unknown mechanism. Here, we report the crystal structure of the murine Nanog homeodomain (HD) and analysis of its interaction with a DNA element derived from the Tcf3 promoter. Two Nanog amino acid pairs, unique among HD sequences, appear to affect the mechanism of nonspecific DNA recognition as well as maintain the integrity of the structural scaffold. To assess selective DNA recognition by Nanog, we performed electrophoretic mobility shift assays using a panel of modified DNA binding sites and found that Nanog HD preferentially binds the TAAT(G/T)(G/T) motif. A series of rational mutagenesis experiments probing the role of six variant residues of Nanog on its DNA binding function establish their role in affecting binding affinity but not binding specificity. Together, the structural and functional evidence establish Nanog as a distant member of a Q50-type HD despite having considerable variation at the sequence level.

  18. Crystal Structure and DNA Binding of the Homeodomain of the Stem Cell Transcription Factor Nanog

    Energy Technology Data Exchange (ETDEWEB)

    Jauch, Ralf; Ng, Calista Keow Leng; Saikatendu, Kumar Singh; Stevens, Raymond C.; Kolatkar, Prasanna R. (GI-Singapore); (Scripps)

    2010-02-08

    The transcription factor Nanog is an upstream regulator in early mammalian development and a key determinant of pluripotency in embryonic stem cells. Nanog binds to promoter elements of hundreds of target genes and regulates their expression by an as yet unknown mechanism. Here, we report the crystal structure of the murine Nanog homeodomain (HD) and analysis of its interaction with a DNA element derived from the Tcf3 promoter. Two Nanog amino acid pairs, unique among HD sequences, appear to affect the mechanism of nonspecific DNA recognition as well as maintain the integrity of the structural scaffold. To assess selective DNA recognition by Nanog, we performed electrophoretic mobility shift assays using a panel of modified DNA binding sites and found that Nanog HD preferentially binds the TAAT(G/T)(G/T) motif. A series of rational mutagenesis experiments probing the role of six variant residues of Nanog on its DNA binding function establish their role in affecting binding affinity but not binding specificity. Together, the structural and functional evidence establish Nanog as a distant member of a Q50-type HD despite having considerable variation at the sequence level.

  19. Ensemble-based prediction of RNA secondary structures.

    Science.gov (United States)

    Aghaeepour, Nima; Hoos, Holger H

    2013-04-24

    Accurate structure prediction methods play an important role for the understanding of RNA function. Energy-based, pseudoknot-free secondary structure prediction is one of the most widely used and versatile approaches, and improved methods for this task have received much attention over the past five years. Despite the impressive progress that as been achieved in this area, existing evaluations of the prediction accuracy achieved by various algorithms do not provide a comprehensive, statistically sound assessment. Furthermore, while there is increasing evidence that no prediction algorithm consistently outperforms all others, no work has been done to exploit the complementary strengths of multiple approaches. In this work, we present two contributions to the area of RNA secondary structure prediction. Firstly, we use state-of-the-art, resampling-based statistical methods together with a previously published and increasingly widely used dataset of high-quality RNA structures to conduct a comprehensive evaluation of existing RNA secondary structure prediction procedures. The results from this evaluation clarify the performance relationship between ten well-known existing energy-based pseudoknot-free RNA secondary structure prediction methods and clearly demonstrate the progress that has been achieved in recent years. Secondly, we introduce AveRNA, a generic and powerful method for combining a set of existing secondary structure prediction procedures into an ensemble-based method that achieves significantly higher prediction accuracies than obtained from any of its component procedures. Our new, ensemble-based method, AveRNA, improves the state of the art for energy-based, pseudoknot-free RNA secondary structure prediction by exploiting the complementary strengths of multiple existing prediction procedures, as demonstrated using a state-of-the-art statistical resampling approach. In addition, AveRNA allows an intuitive and effective control of the trade-off between

  20. Binding mode prediction and MD/MMPBSA-based free energy ranking for agonists of REV-ERBα/NCoR.

    Science.gov (United States)

    Westermaier, Yvonne; Ruiz-Carmona, Sergio; Theret, Isabelle; Perron-Sierra, Françoise; Poissonnet, Guillaume; Dacquet, Catherine; Boutin, Jean A; Ducrot, Pierre; Barril, Xavier

    2017-08-01

    The knowledge of the free energy of binding of small molecules to a macromolecular target is crucial in drug design as is the ability to predict the functional consequences of binding. We highlight how a molecular dynamics (MD)-based approach can be used to predict the free energy of small molecules, and to provide priorities for the synthesis and the validation via in vitro tests. Here, we study the dynamics and energetics of the nuclear receptor REV-ERBα with its co-repressor NCoR and 35 novel agonists. Our in silico approach combines molecular docking, molecular dynamics (MD), solvent-accessible surface area (SASA) and molecular mechanics poisson boltzmann surface area (MMPBSA) calculations. While docking yielded initial hints on the binding modes, their stability was assessed by MD. The SASA calculations revealed that the presence of the ligand led to a higher exposure of hydrophobic REV-ERB residues for NCoR recruitment. MMPBSA was very successful in ranking ligands by potency in a retrospective and prospective manner. Particularly, the prospective MMPBSA ranking-based validations for four compounds, three predicted to be active and one weakly active, were confirmed experimentally.

  1. Sequence and structural analysis of the chitinase insertion domain reveals two conserved motifs involved in chitin-binding.

    Directory of Open Access Journals (Sweden)

    Hai Li

    2010-01-01

    Full Text Available Chitinases are prevalent in life and are found in species including archaea, bacteria, fungi, plants, and animals. They break down chitin, which is the second most abundant carbohydrate in nature after cellulose. Hence, they are important for maintaining a balance between carbon and nitrogen trapped as insoluble chitin in biomass. Chitinases are classified into two families, 18 and 19 glycoside hydrolases. In addition to a catalytic domain, which is a triosephosphate isomerase barrel, many family 18 chitinases contain another module, i.e., chitinase insertion domain. While numerous studies focus on the biological role of the catalytic domain in chitinase activity, the function of the chitinase insertion domain is not completely understood. Bioinformatics offers an important avenue in which to facilitate understanding the role of residues within the chitinase insertion domain in chitinase function.Twenty-seven chitinase insertion domain sequences, which include four experimentally determined structures and span five kingdoms, were aligned and analyzed using a modified sequence entropy parameter. Thirty-two positions with conserved residues were identified. The role of these conserved residues was explored by conducting a structural analysis of a number of holo-enzymes. Hydrogen bonding and van der Waals calculations revealed a distinct subset of four conserved residues constituting two sequence motifs that interact with oligosaccharides. The other conserved residues may be key to the structure, folding, and stability of this domain.Sequence and structural studies of the chitinase insertion domains conducted within the framework of evolution identified four conserved residues which clearly interact with the substrates. Furthermore, evolutionary studies propose a link between the appearance of the chitinase insertion domain and the function of family 18 chitinases in the subfamily A.

  2. Identification and Structural Basis of Binding to Host Lung Glycogen by Streptococcal Virulence Factors

    Energy Technology Data Exchange (ETDEWEB)

    Lammerts van Bueren,A.; Higgins, M.; Wang, D.; Burke, R.; Boraston, A.

    2007-01-01

    The ability of pathogenic bacteria to recognize host glycans is often essential to their virulence. Here we report structure-function studies of previously uncharacterized glycogen-binding modules in the surface-anchored pullulanases from Streptococcus pneumoniae (SpuA) and Streptococcus pyogenes (PulA). Multivalent binding to glycogen leads to a strong interaction with alveolar type II cells in mouse lung tissue. X-ray crystal structures of the binding modules reveal a novel fusion of tandem modules into single, bivalent functional domains. In addition to indicating a structural basis for multivalent attachment, the structure of the SpuA modules in complex with carbohydrate provides insight into the molecular basis for glycogen specificity. This report provides the first evidence that intracellular lung glycogen may be a novel target of pathogenic streptococci and thus provides a rationale for the identification of the streptococcal {alpha}-glucan-metabolizing machinery as virulence factors.

  3. Applications of contact predictions to structural biology

    Directory of Open Access Journals (Sweden)

    Felix Simkovic

    2017-05-01

    Full Text Available Evolutionary pressure on residue interactions, intramolecular or intermolecular, that are important for protein structure or function can lead to covariance between the two positions. Recent methodological advances allow much more accurate contact predictions to be derived from this evolutionary covariance signal. The practical application of contact predictions has largely been confined to structural bioinformatics, yet, as this work seeks to demonstrate, the data can be of enormous value to the structural biologist working in X-ray crystallography, cryo-EM or NMR. Integrative structural bioinformatics packages such as Rosetta can already exploit contact predictions in a variety of ways. The contribution of contact predictions begins at construct design, where structural domains may need to be expressed separately and contact predictions can help to predict domain limits. Structure solution by molecular replacement (MR benefits from contact predictions in diverse ways: in difficult cases, more accurate search models can be constructed using ab initio modelling when predictions are available, while intermolecular contact predictions can allow the construction of larger, oligomeric search models. Furthermore, MR using supersecondary motifs or large-scale screens against the PDB can exploit information, such as the parallel or antiparallel nature of any β-strand pairing in the target, that can be inferred from contact predictions. Contact information will be particularly valuable in the determination of lower resolution structures by helping to assign sequence register. In large complexes, contact information may allow the identity of a protein responsible for a certain region of density to be determined and then assist in the orientation of an available model within that density. In NMR, predicted contacts can provide long-range information to extend the upper size limit of the technique in a manner analogous but complementary to experimental

  4. Regulation of Neurexin 1[beta] Tertiary Structure and Ligand Binding through Alternative Splicing

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Kaiser C.; Kuczynska, Dorota A.; Wu, Irene J.; Murray, Beverly H.; Sheckler, Lauren R.; Rudenko, Gabby (Michigan)

    2008-08-04

    Neurexins and neuroligins play an essential role in synapse function, and their alterations are linked to autistic spectrum disorder. Interactions between neurexins and neuroligins regulate inhibitory and excitatory synaptogenesis in vitro through a splice-insert signaling code. In particular, neurexin 1{beta} carrying an alternative splice insert at site SS{number_sign}4 interacts with neuroligin 2 (found predominantly at inhibitory synapses) but much less so with other neuroligins (those carrying an insert at site B and prevalent at excitatory synapses). The structure of neurexin 1{beta}+SS{number_sign}4 reveals dramatic rearrangements to the 'hypervariable surface', the binding site for neuroligins. The splice insert protrudes as a long helix into space, triggers conversion of loop {beta}10-{beta}11 into a helix rearranging the binding site for neuroligins, and rearranges the Ca{sup 2+}-binding site required for ligand binding, increasing its affinity. Our structures reveal the mechanism by which neurexin 1{beta} isoforms acquire neuroligin splice isoform selectivity.

  5. Solution structure of telomere binding domain of AtTRB2 derived from Arabidopsis thaliana

    Energy Technology Data Exchange (ETDEWEB)

    Yun, Ji-Hye [Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul 120-749 (Korea, Republic of); Lee, Won Kyung [Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul 120-749 (Korea, Republic of); Kim, Heeyoun [Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul 120-749 (Korea, Republic of); Kim, Eunhee; Cheong, Chaejoon [Magnetic Resonance Team, Korea Basic Science Institute (KBSI), Ochang, Chungbuk 363-883 (Korea, Republic of); Cho, Myeon Haeng [Department of Systems Biology, College of Life Science and Biotechnology, Yonsei University, Seoul 120-749 (Korea, Republic of); Lee, Weontae, E-mail: wlee@spin.yonsei.ac.kr [Department of Biochemistry, College of Life Science and Biotechnology, Yonsei University, Seoul 120-749 (Korea, Republic of)

    2014-09-26

    Highlights: • We have determined solution structure of Myb domain of AtTRB2. • The Myb domain of AtTRB2 is located in the N-terminal region. • The Myb domain of AtTRB2 binds to plant telomeric DNA without fourth helix. • Helix 2 and 3 of the Myb domain of AtTRB2 are involved in DNA recognition. • AtTRB2 is a novel protein distinguished from other known plant TBP. - Abstract: Telomere homeostasis is regulated by telomere-associated proteins, and the Myb domain is well conserved for telomere binding. AtTRB2 is a member of the SMH (Single-Myb-Histone)-like family in Arabidopsis thaliana, having an N-terminal Myb domain, which is responsible for DNA binding. The Myb domain of AtTRB2 contains three α-helices and loops for DNA binding, which is unusual given that other plant telomere-binding proteins have an additional fourth helix that is essential for DNA binding. To understand the structural role for telomeric DNA binding of AtTRB2, we determined the solution structure of the Myb domain of AtTRB2 (AtTRB2{sub 1–64}) using nuclear magnetic resonance (NMR) spectroscopy. In addition, the inter-molecular interaction between AtTRB2{sub 1–64} and telomeric DNA has been characterized by the electrophoretic mobility shift assay (EMSA) and NMR titration analyses for both plant (TTTAGGG)n and human (TTAGGG)n telomere sequences. Data revealed that Trp28, Arg29, and Val47 residues located in Helix 2 and Helix 3 are crucial for DNA binding, which are well conserved among other plant telomere binding proteins. We concluded that although AtTRB2 is devoid of the additional fourth helix in the Myb-extension domain, it is able to bind to plant telomeric repeat sequences as well as human telomeric repeat sequences.

  6. Nuclear Cartography: Patterns in Binding Energies and Subatomic Structure

    Science.gov (United States)

    Simpson, E. C.; Shelley, M.

    2017-01-01

    Nuclear masses and binding energies are some of the first nuclear properties met in high school physics, and can be used to introduce radioactive decays, fusion, and fission. With relatively little extension, they can also illustrate fundamental concepts in nuclear physics, such as shell structure and pairing, and to discuss how the elements…

  7. Integrating structural and mutagenesis data to elucidate GPCR ligand binding

    DEFF Research Database (Denmark)

    Munk, Christian; Harpsøe, Kasper; Hauser, Alexander S

    2016-01-01

    is reported that exhibit activity through multiple receptors, binding in allosteric sites, and bias towards different intracellular signalling pathways. Furthermore, a wealth of single point mutants has accumulated in literature and public databases. Integrating these structural and mutagenesis data will help...

  8. Effect of the vitamin B12-binding protein haptocorrin present in human milk on a panel of commensal and pathogenic bacteria

    Directory of Open Access Journals (Sweden)

    Nexø Ebba

    2011-06-01

    Full Text Available Abstract Background Haptocorrin is a vitamin B12-binding protein present in high amounts in different body fluids including human milk. Haptocorrin has previously been shown to inhibit the growth of specific E. coli strains, and the aim of the present study was to elucidate whether the antibacterial properties of this protein may exert a general defense against pathogens and/or affect the composition of the developing microbiota in the gastrointestinal tracts of breastfed infants. Findings The present work was the first systematic study of the effect of haptocorrin on bacterial growth, and included 34 commensal and pathogenic bacteria to which infants are likely to be exposed. Well-diffusion assays addressing antibacterial effects were performed with human milk, haptocorrin-free human milk, porcine holo-haptocorrin (saturated with B-12 and human apo-haptocorrin (unsaturated. Human milk inhibited the growth of S. thermophilus and the pathogenic strains L. monocytogenes LO28, L. monocytogenes 4446 and L. monocytogenes 7291, but the inhibition could not be ascribed to haptocorrin. Human apo-haptocorrin inhibited the growth of only a single bacterial strain (Bifidobacterium breve, while porcine holo-haptocorrin did not show any inhibitory effect. Conclusions Our results suggest that haptocorrin does not have a general antibacterial activity, and thereby contradict the existing hypothesis implicating such an effect. The study contributes to the knowledge on the potential impact of breastfeeding on the establishment of a healthy microbiota in infants.

  9. Importance of Accurate Charges in Binding Affinity Calculations: A Case of Neuraminidase Series

    Energy Technology Data Exchange (ETDEWEB)

    Park, Kichul; Kyun, Nack Sung; Cho, Art E. [Korea Univ., Sejong (Korea, Republic of)

    2013-02-15

    It has been shown that calculating atomic charges using quantum mechanical level theory greatly improves the accuracy of docking. A protocol was developed and shown to be effective. That this protocol works is just a manifestation of the fact that electrostatic interactions are important in protein-ligand binding. In order to investigate how the same protocol helps in prediction of binding affinities, we took a series of known cocrystal structures of influenza neuraminidase inhibitors with the receptor and performed docking with Glide SP, Glide XP, and QPLD, the last being a workflow that incorporates QM/MM calculations to replace the fixed atomic charges of force fields with quantum mechanically recalculated ones at a given docking pose, and predicted the binding affinities of each cocrystal. The correlation with experimental binding affinities considerably improved with QPLD compared to Glide SP/XP yielding r{sup 2} = 0.83. The results suggest that for binding sites, such as that of neuraminidase, which are laden with hydrophilic residues, protocols such as QPLD which utilizes QM-based atomic charges can better predict the binding affinities.

  10. Predicting HLA class I non-permissive amino acid residues substitutions.

    Directory of Open Access Journals (Sweden)

    T Andrew Binkowski

    Full Text Available Prediction of peptide binding to human leukocyte antigen (HLA molecules is essential to a wide range of clinical entities from vaccine design to stem cell transplant compatibility. Here we present a new structure-based methodology that applies robust computational tools to model peptide-HLA (p-HLA binding interactions. The method leverages the structural conservation observed in p-HLA complexes to significantly reduce the search space and calculate the system's binding free energy. This approach is benchmarked against existing p-HLA complexes and the prediction performance is measured against a library of experimentally validated peptides. The effect on binding activity across a large set of high-affinity peptides is used to investigate amino acid mismatches reported as high-risk factors in hematopoietic stem cell transplantation.

  11. Structural Modulation of Phosducin by Phosphorylation and 14-3-3 Protein Binding

    Science.gov (United States)

    Rezabkova, Lenka; Kacirova, Miroslava; Sulc, Miroslav; Herman, Petr; Vecer, Jaroslav; Stepanek, Miroslav; Obsilova, Veronika; Obsil, Tomas

    2012-01-01

    Phosducin (Pdc), a highly conserved phosphoprotein, plays an important role in the regulation of G protein signaling, transcriptional control, and modulation of blood pressure. Pdc is negatively regulated by phosphorylation followed by binding to the 14-3-3 protein, whose role is still unclear. To gain insight into the role of 14-3-3 in the regulation of Pdc function, we studied structural changes of Pdc induced by phosphorylation and 14-3-3 protein binding using time-resolved fluorescence spectroscopy. Our data show that the phosphorylation of the N-terminal domain of Pdc at Ser-54 and Ser-73 affects the structure of the whole Pdc molecule. Complex formation with 14-3-3 reduces the flexibility of both the N- and C-terminal domains of phosphorylated Pdc, as determined by time-resolved tryptophan and dansyl fluorescence. Therefore, our data suggest that phosphorylated Pdc undergoes a conformational change when binding to 14-3-3. These changes involve the Gtβγ binding surface within the N-terminal domain of Pdc, and thus could explain the inhibitory effect of 14-3-3 on Pdc function. PMID:23199924

  12. THE INFLUENCE OF BINDING MATERIAL ON POROUS STRUCTURE OF SHAPED HOPCALITE

    Directory of Open Access Journals (Sweden)

    N.K. Kulikov

    2008-06-01

    Full Text Available The authors have investigated the equilibrated adsorption of water vapors on GFG hopcalite, which was obtained using the extrusion shaping method, with bentonite clay as the binding compound. In the frames of the BET model, the values of the monolayer capacity and the size of medium area occupied by the water molecule in the filled monolayer have been determined. The distribution of pores according to their sizes has been evaluated. It has been established that the modification of the bentonitic clay allows directed construction of the hopcalite porous structure,i.e. the formation of the mesoporous structure with a narrow distribution of the pores capacities by sizes, which was achieved varying the sizes of binding compound particles.

  13. Principal component analysis for predicting transcription-factor binding motifs from array-derived data

    Directory of Open Access Journals (Sweden)

    Vincenti Matthew P

    2005-11-01

    Full Text Available Abstract Background The responses to interleukin 1 (IL-1 in human chondrocytes constitute a complex regulatory mechanism, where multiple transcription factors interact combinatorially to transcription-factor binding motifs (TFBMs. In order to select a critical set of TFBMs from genomic DNA information and an array-derived data, an efficient algorithm to solve a combinatorial optimization problem is required. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful to predict TFBMs at nearly no computational cost and evaluate varying modelling conditions. Singular value decomposition (SVD is a powerful method to derive primary components of a given matrix. Applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs. Results The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC-3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3' were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC-BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPARγ, STAF, ROAZ, and NFκB, and their significance was evaluated numerically using Monte Carlo simulation and genetic algorithm. Conclusion The described SVD-based prediction is an analytical method to provide a set of potential TFBMs involved in transcriptional regulation. The results would be useful to evaluate analytically a contribution of individual DNA sequences.

  14. Continuous Automated Model EvaluatiOn (CAMEO) complementing the critical assessment of structure prediction in CASP12.

    Science.gov (United States)

    Haas, Jürgen; Barbato, Alessandro; Behringer, Dario; Studer, Gabriel; Roth, Steven; Bertoni, Martino; Mostaguir, Khaled; Gumienny, Rafal; Schwede, Torsten

    2018-03-01

    Every second year, the community experiment "Critical Assessment of Techniques for Structure Prediction" (CASP) is conducting an independent blind assessment of structure prediction methods, providing a framework for comparing the performance of different approaches and discussing the latest developments in the field. Yet, developers of automated computational modeling methods clearly benefit from more frequent evaluations based on larger sets of data. The "Continuous Automated Model EvaluatiOn (CAMEO)" platform complements the CASP experiment by conducting fully automated blind prediction assessments based on the weekly pre-release of sequences of those structures, which are going to be published in the next release of the PDB Protein Data Bank. CAMEO publishes weekly benchmarking results based on models collected during a 4-day prediction window, on average assessing ca. 100 targets during a time frame of 5 weeks. CAMEO benchmarking data is generated consistently for all participating methods at the same point in time, enabling developers to benchmark and cross-validate their method's performance, and directly refer to the benchmarking results in publications. In order to facilitate server development and promote shorter release cycles, CAMEO sends weekly email with submission statistics and low performance warnings. Many participants of CASP have successfully employed CAMEO when preparing their methods for upcoming community experiments. CAMEO offers a variety of scores to allow benchmarking diverse aspects of structure prediction methods. By introducing new scoring schemes, CAMEO facilitates new development in areas of active research, for example, modeling quaternary structure, complexes, or ligand binding sites. © 2017 Wiley Periodicals, Inc.

  15. Aminoglycosylation can enhance the G-quadruplex binding activity of epigallocatechin.

    Directory of Open Access Journals (Sweden)

    Li-Ping Bai

    Full Text Available With the aim of enhancing G-quadruplex binding activity, two new glucosaminosides (16, 18 of penta-methylated epigallocatechin were synthesized by chemical glycosylation. Subsequent ESI-TOF-MS analysis demonstrated that these two glucosaminoside derivatives exhibit much stronger binding activity to human telomeric DNA and RNA G-quadruplexes than their parent structure (i.e., methylated EGC (14 as well as natural epigallocatechin (EGC, 6. The DNA G-quadruplex binding activity of 16 and 18 is even more potent than strong G-quadruplex binder quercetin, which has a more planar structure. These two synthetic compounds also showed a higher binding strength to human telomeric RNA G-quadruplex than its DNA counterpart. Analysis of the structure-activity relationship revealed that the more basic compound, 16, has a higher binding capacity with DNA and RNA G-quadruplexes than its N-acetyl derivative, 18, suggesting the importance of the basicity of the aminoglycoside for G-quadruplex binding activity. Molecular docking simulation predicted that the aromatic ring of 16 π-stacks with the aromatic ring of guanine nucleotides, with the glucosamine moiety residing in the groove of G-quadruplex. This research indicates that glycosylation of natural products with aminosugar can significantly enhance their G-quadruplex binding activities, thus is an effective way to generate small molecules targeting G-quadruplexes in nucleic acids. In addition, this is the first report that green tea catechin can bind to nucleic acid G-quadruplex structures.

  16. A novel RUNX2 missense mutation predicted to disrupt DNA binding causes cleidocranial dysplasia in a large Chinese family with hyperplastic nails

    Directory of Open Access Journals (Sweden)

    Wang Xiaoqin

    2007-12-01

    Full Text Available Abstract Background Cleidocranial dysplasia (CCD is a dominantly inherited disease characterized by hypoplastic or absent clavicles, large fontanels, dental dysplasia, and delayed skeletal development. The purpose of this study is to investigate the genetic basis of Chinese family with CCD. Methods Here, a large Chinese family with CCD and hyperplastic nails was recruited. The clinical features displayed a significant intrafamilial variation. We sequenced the coding region of the RUNX2 gene for the mutation and phenotype analysis. Results The family carries a c.T407C (p.L136P mutation in the DNA- and CBFβ-binding Runt domain of RUNX2. Based on the crystal structure, we predict this novel missense mutation is likely to disrupt DNA binding by RUNX2, and at least locally affect the Runt domain structure. Conclusion A novel missense mutation was identified in a large Chinese family with CCD with hyperplastic nails. This report further extends the mutation spectrum and clinical features of CCD. The identification of this mutation will facilitate prenatal diagnosis and preimplantation genetic diagnosis.

  17. The structure of the nucleoprotein binding domain of lyssavirus phosphoprotein reveals a structural relationship between the N-RNA binding domains of Rhabdoviridae and Paramyxoviridae.

    Science.gov (United States)

    Delmas, Olivier; Assenberg, Rene; Grimes, Jonathan M; Bourhy, Hervé

    2010-01-01

    The phosphoprotein P of non-segmented negative-sense RNA viruses is an essential component of the replication and transcription complex and acts as a co-factor for the viral RNA-dependent RNA polymerase. P recruits the viral polymerase to the nucleoprotein-bound viral RNA (N-RNA) via an interaction between its C-terminal domain and the N-RNA complex. We have obtained the structure of the C-terminal domain of P of Mokola virus (MOKV), a lyssavirus that belongs to the Rhabdoviridae family and mapped at the amino acid level the crucial positions involved in interaction with N and in the formation of the viral replication complex. Comparison of the N-RNA binding domains of P solved to date suggests that the N-RNA binding domains are structurally conserved among paramyxoviruses and rhabdoviruses in spite of low sequence conservation. We also review the numerous other functions of this domain and more generally of the phosphoprotein.

  18. Binding mode and free energy prediction of fisetin/β-cyclodextrin inclusion complexes

    Directory of Open Access Journals (Sweden)

    Bodee Nutho

    2014-11-01

    Full Text Available In the present study, our aim is to investigate the preferential binding mode and encapsulation of the flavonoid fisetin in the nano-pore of β-cyclodextrin (β-CD at the molecular level using various theoretical approaches: molecular docking, molecular dynamics (MD simulations and binding free energy calculations. The molecular docking suggested four possible fisetin orientations in the cavity through its chromone or phenyl ring with two different geometries of fisetin due to the rotatable bond between the two rings. From the multiple MD results, the phenyl ring of fisetin favours its inclusion into the β-CD cavity, whilst less binding or even unbinding preference was observed in the complexes where the larger chromone ring is located in the cavity. All MM- and QM-PBSA/GBSA free energy predictions supported the more stable fisetin/β-CD complex of the bound phenyl ring. Van der Waals interaction is the key force in forming the complexes. In addition, the quantum mechanics calculations with M06-2X/6-31G(d,p clearly showed that both solvation effect and BSSE correction cannot be neglected for the energy determination of the chosen system.

  19. Composite organization of the cobalamin binding and cubilin recognition sites of intrinsic factor

    DEFF Research Database (Denmark)

    Fedosov, Sergey N; Fedosova, Natalya U; Berglund, Lars

    2005-01-01

    of the ligand. Each isolated fragment of IF was tested for the binding to the specific receptor cubilin in the presence or absence of Cbl. Neither apo nor holo forms of IF(20) and IF(30) were recognized by the receptor. When two fragments were mixed and incubated with Cbl, they associated into a stable complex......; however, efficient retention of the ligand required the presence of both fragments. Detailed schemes of the interaction of Cbl with IF(50) and with IF(30) and IF(20) are presented, where the sequential attachment of Cbl to the IF(20) and IF(30) domains plays the key role in recognition and retention......, IF(30+20).Cbl, which bound to cubilin as well as the noncleaved IF(50).Cbl complex. We suggest that formation of the cubilin recognition site on IF is caused by assembly of two distant domains, which allows the saturated protein to be recognized by the receptor. The obtained parameters for ligand...

  20. GenProBiS: web server for mapping of sequence variants to protein binding sites.

    Science.gov (United States)

    Konc, Janez; Skrlj, Blaz; Erzen, Nika; Kunej, Tanja; Janezic, Dusanka

    2017-07-03

    Discovery of potentially deleterious sequence variants is important and has wide implications for research and generation of new hypotheses in human and veterinary medicine, and drug discovery. The GenProBiS web server maps sequence variants to protein structures from the Protein Data Bank (PDB), and further to protein-protein, protein-nucleic acid, protein-compound, and protein-metal ion binding sites. The concept of a protein-compound binding site is understood in the broadest sense, which includes glycosylation and other post-translational modification sites. Binding sites were defined by local structural comparisons of whole protein structures using the Protein Binding Sites (ProBiS) algorithm and transposition of ligands from the similar binding sites found to the query protein using the ProBiS-ligands approach with new improvements introduced in GenProBiS. Binding site surfaces were generated as three-dimensional grids encompassing the space occupied by predicted ligands. The server allows intuitive visual exploration of comprehensively mapped variants, such as human somatic mis-sense mutations related to cancer and non-synonymous single nucleotide polymorphisms from 21 species, within the predicted binding sites regions for about 80 000 PDB protein structures using fast WebGL graphics. The GenProBiS web server is open and free to all users at http://genprobis.insilab.org. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. Preliminary structural characterization of human SOUL, a haem-binding protein

    International Nuclear Information System (INIS)

    Freire, Filipe; Romão, Maria João; Macedo, Anjos L.; Aveiro, Susana S.; Goodfellow, Brian J.; Carvalho, Ana Luísa

    2009-01-01

    This manuscript describes the overexpression, purification and crystallization of human SOUL protein (hSOUL). hSOUL is a 23 kDa haem-binding protein that was first identified as the PP23 protein isolated from human full-term placenta. Human SOUL (hSOUL) is a 23 kDa haem-binding protein that was first identified as the PP 23 protein isolated from human full-term placentas. Here, the overexpression, purification and crystallization of hSOUL are reported. The crystals belonged to space group P6 4 22, with unit-cell parameters a = b = 145, c = 60 Å and one protein molecule in the asymmetric unit. X-ray diffraction data were collected to 3.5 Å resolution at the ESRF. A preliminary model of the three-dimensional structure of hSOUL was obtained by molecular replacement using the structures of murine p22HBP, obtained by solution NMR, as search models

  2. Crystal structure of CbpF, a bifunctional choline-binding protein and autolysis regulator from Streptococcus pneumoniae.

    Science.gov (United States)

    Molina, Rafael; González, Ana; Stelter, Meike; Pérez-Dorado, Inmaculada; Kahn, Richard; Morales, María; Moscoso, Miriam; Campuzano, Susana; Campillo, Nuria E; Mobashery, Shahriar; García, José L; García, Pedro; Hermoso, Juan A

    2009-03-01

    Phosphorylcholine, a crucial component of the pneumococcal cell wall, is essential in bacterial physiology and in human pathogenesis because it binds to serum components of the immune system and acts as a docking station for the family of surface choline-binding proteins. The three-dimensional structure of choline-binding protein F (CbpF), one of the most abundant proteins in the pneumococcal cell wall, has been solved in complex with choline. CbpF shows a new modular structure composed both of consensus and non-consensus choline-binding repeats, distributed along its length, which markedly alter its shape, charge distribution and binding ability, and organizing the protein into two well-defined modules. The carboxy-terminal module is involved in cell wall binding and the amino-terminal module is crucial for inhibition of the autolytic LytC muramidase, providing a regulatory function for pneumococcal autolysis.

  3. Implementation of structure-mapping inference by event-file binding and action planning: a model of tool-improvisation analogies.

    Science.gov (United States)

    Fields, Chris

    2011-03-01

    Structure-mapping inferences are generally regarded as dependent upon relational concepts that are understood and expressible in language by subjects capable of analogical reasoning. However, tool-improvisation inferences are executed by members of a variety of non-human primate and other species. Tool improvisation requires correctly inferring the motion and force-transfer affordances of an object; hence tool improvisation requires structure mapping driven by relational properties. Observational and experimental evidence can be interpreted to indicate that structure-mapping analogies in tool improvisation are implemented by multi-step manipulation of event files by binding and action-planning mechanisms that act in a language-independent manner. A functional model of language-independent event-file manipulations that implement structure mapping in the tool-improvisation domain is developed. This model provides a mechanism by which motion and force representations commonly employed in tool-improvisation structure mappings may be sufficiently reinforced to be available to inwardly directed attention and hence conceptualization. Predictions and potential experimental tests of this model are outlined.

  4. Structure-dependent binding and activation of perfluorinated compounds on human peroxisome proliferator-activated receptor γ

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Lianying [State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, 18 Shuangqing Road, Beijing 100085 (China); College of Life Science, Dezhou University, Dezhou 253023 (China); Ren, Xiao-Min; Wan, Bin [State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, 18 Shuangqing Road, Beijing 100085 (China); Guo, Liang-Hong, E-mail: LHGuo@rcees.ac.cn [State Key Laboratory of Environmental Chemistry and Ecotoxicology, Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, P.O. Box 2871, 18 Shuangqing Road, Beijing 100085 (China)

    2014-09-15

    Perfluorinated compounds (PFCs) have been shown to disrupt lipid metabolism and even induce cancer in rodents through activation of peroxisome proliferator-activated receptors (PPARs). Lines of evidence showed that PPARα was activated by PFCs. However, the information on the binding interactions between PPARγ and PFCs and subsequent alteration of PPARγ activity is still limited and sometimes inconsistent. In the present study, in vitro binding of 16 PFCs to human PPARγ ligand binding domain (hPPARγ-LBD) and their activity on the receptor in cells were investigated. The results showed that the binding affinity was strongly dependent on their carbon number and functional group. For the eleven perfluorinated carboxylic acids (PFCAs), the binding affinity increased with their carbon number from 4 to 11, and then decreased slightly. The binding affinity of the three perfluorinated sulfonic acids (PFSAs) was stronger than their PFCA counterparts. No binding was detected for the two fluorotelomer alcohols (FTOHs). Circular dichroim spectroscopy showed that PFC binding induced distinctive structural change of the receptor. In dual luciferase reporter assays using transiently transfected Hep G2 cells, PFCs acted as hPPARγ agonists, and their potency correlated with their binding affinity with hPPARγ-LBD. Molecular docking showed that PFCs with different chain length bind with the receptor in different geometry, which may contribute to their differences in binding affinity and transcriptional activity. - Highlights: • Binding affinity between PFCs and PPARγ was evaluated for the first time. • The binding strength was dependent on fluorinated carbon chain and functional group. • PFC binding induced distinctive structural change of the receptor. • PFCs could act as hPPARγ agonists in Hep G2 cells.

  5. Structure-function analysis of peroxisomal ATP-binding cassette transporters using chimeric dimers

    NARCIS (Netherlands)

    Geillon, Flore; Gondcaille, Catherine; Charbonnier, Soëli; van Roermund, Carlo W.; Lopez, Tatiana E.; Dias, Alexandre M. M.; Pais de Barros, Jean-Paul; Arnould, Christine; Wanders, Ronald J.; Trompier, Doriane; Savary, Stéphane

    2014-01-01

    ABCD1 and ABCD2 are two closely related ATP-binding cassette half-transporters predicted to homodimerize and form peroxisomal importers for fatty acyl-CoAs. Available evidence has shown that ABCD1 and ABCD2 display a distinct but overlapping substrate specificity, although much remains to be learned

  6. LIBP-Pred: web server for lipid binding proteins using structural network parameters; PDB mining of human cancer biomarkers and drug targets in parasites and bacteria.

    Science.gov (United States)

    González-Díaz, Humberto; Munteanu, Cristian R; Postelnicu, Lucian; Prado-Prado, Francisco; Gestal, Marcos; Pazos, Alejandro

    2012-03-01

    Lipid-Binding Proteins (LIBPs) or Fatty Acid-Binding Proteins (FABPs) play an important role in many diseases such as different types of cancer, kidney injury, atherosclerosis, diabetes, intestinal ischemia and parasitic infections. Thus, the computational methods that can predict LIBPs based on 3D structure parameters became a goal of major importance for drug-target discovery, vaccine design and biomarker selection. In addition, the Protein Data Bank (PDB) contains 3000+ protein 3D structures with unknown function. This list, as well as new experimental outcomes in proteomics research, is a very interesting source to discover relevant proteins, including LIBPs. However, to the best of our knowledge, there are no general models to predict new LIBPs based on 3D structures. We developed new Quantitative Structure-Activity Relationship (QSAR) models based on 3D electrostatic parameters of 1801 different proteins, including 801 LIBPs. We calculated these electrostatic parameters with the MARCH-INSIDE software and they correspond to the entire protein or to specific protein regions named core, inner, middle, and surface. We used these parameters as inputs to develop a simple Linear Discriminant Analysis (LDA) classifier to discriminate 3D structure of LIBPs from other proteins. We implemented this predictor in the web server named LIBP-Pred, freely available at , along with other important web servers of the Bio-AIMS portal. The users can carry out an automatic retrieval of protein structures from PDB or upload their custom protein structural models from their disk created with LOMETS server. We demonstrated the PDB mining option performing a predictive study of 2000+ proteins with unknown function. Interesting results regarding the discovery of new Cancer Biomarkers in humans or drug targets in parasites have been discussed here in this sense.

  7. In silico engineering and optimization of Transcription Activator-Like Effectors and their derivatives for improved DNA binding predictions.

    KAUST Repository

    Piatek, Marek J.

    2015-12-01

    Transcription Activator-Like Effectors (TALEs) can be used as adaptable DNAbinding modules to create site-specific chimeric nucleases or synthetic transcriptional regulators. The central repeat domain mediates specific DNA binding via hypervariable repeat di-residues (RVDs). This DNA-Binding Domain can be engineered to bind preferentially to any user-selected DNA sequence if engineered appropriately. Therefore, TALEs and their derivatives have become indispensable molecular tools in site-specific manipulation of genes and genomes. This thesis revolves around two problems: in silico design and improved binding site prediction of TALEs. In the first part, a study is shown where TALEs are successfully designed in silico and validated in laboratory to yield the anticipated effects on selected genes. Software is developed to accompany the process of designing and prediction of binding sites. I expanded the functionality of the software to be used as a more generic set of tools for the design, target and offtarget searching. Part two contributes a method and associated toolkit developed to allow users to design in silico optimized synthetic TALEs with user-defined specificities for various experimental purposes. This method is based on a mutual relationship of three consecutive tandem repeats in the DNA-binding domain. This approach revealed positional and compositional bias behind the binding of TALEs to DNA. In conclusion, I developed methods, approaches, and software to enhance the functionality of synthetic TALEs, which should improve understanding of TALEs biology and will further advance genome-engineering applications in various organisms and cell types.

  8. Characteristics and Prediction of RNA Structure

    Directory of Open Access Journals (Sweden)

    Hengwu Li

    2014-01-01

    Full Text Available RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is NP-hard. Most RNAs fold during transcription from DNA into RNA through a hierarchical pathway wherein secondary structures form prior to tertiary structures. Real RNA secondary structures often have local instead of global optimization because of kinetic reasons. The performance of RNA structure prediction may be improved by considering dynamic and hierarchical folding mechanisms. This study is a novel report on RNA folding that accords with the golden mean characteristic based on the statistical analysis of the real RNA secondary structures of all 480 sequences from RNA STRAND, which are validated by NMR or X-ray. The length ratios of domains in these sequences are approximately 0.382L, 0.5L, 0.618L, and L, where L is the sequence length. These points are just the important golden sections of sequence. With this characteristic, an algorithm is designed to predict RNA hierarchical structures and simulate RNA folding by dynamically folding RNA structures according to the above golden section points. The sensitivity and number of predicted pseudoknots of our algorithm are better than those of the Mfold, HotKnots, McQfold, ProbKnot, and Lhw-Zhu algorithms. Experimental results reflect the folding rules of RNA from a new angle that is close to natural folding.

  9. Structural determinants for binding to angiotensin converting enzyme 2 (ACE2 and angiotensin receptors

    Directory of Open Access Journals (Sweden)

    Daniel eClayton

    2015-01-01

    Full Text Available Angiotensin converting enzyme 2 (ACE2 is a zinc carboxypeptidase involved in the renin angiotensin system (RAS and inactivates the potent vasopressive peptide angiotensin II (Ang II by removing the C-terminal phenylalanine residue to yield Ang1-7. This conversion inactivates the vasoconstrictive action of Ang II and yields a peptide that acts as a vasodilatory molecule at the Mas receptor and potentially other receptors. Given the growing complexity of RAS and level of cross-talk between ligands and their corresponding enzymes and receptors, the design of molecules with selectivity for the major RAS binding partners to control cardiovascular tone is an on-going challenge. In previous studies we used single β-amino acid substitutions to modulate the structure of Ang II and its selectivity for ACE2, AT1R and angiotensin type 2 (AT2R receptor. We showed that modification at the C-terminus of Ang II generally resulted in more pronounced changes to secondary structure and ligand binding, and here we further explore this region for the potential to modulate ligand specificity. In this study, 1 a library of forty-seven peptides derived from the C-terminal tetra-peptide sequence (-IHPF of Ang II was synthesised and assessed for ACE2 binding, 2 the terminal group requirements for high affinity ACE2 binding were explored by and N- and C-terminal modification, 3 high affinity ACE2 binding chimeric AngII analogues were then synthesized and assessed, 4 the structure of the full-length Ang II analogues were assessed by circular dichroism, and 5 the Ang II analogues were assessed for AT1R/AT2R selectivity by cell-based assays. Studies on the C-terminus of Ang II demonstrated varied specificity at different residue positions for ACE2 binding and four Ang II chimeric peptides were identified as selective ligands for the AT2 receptor. Overall, these results provide insight into the residue and structural requirements for ACE2 binding and angiotensin receptor

  10. COMPARATIVE MODELLING AND LIGAND BINDING SITE PREDICTION OF A FAMILY 43 GLYCOSIDE HYDROLASE FROM Clostridium thermocellum

    Directory of Open Access Journals (Sweden)

    Shadab Ahmed

    2012-06-01

    Full Text Available The phylogenetic analysis of Clostridium thermocellum family 43 glycoside hydrolase (CtGH43 showed close evolutionary relation with carbohydrate binding family 6 proteins from C. cellulolyticum, C. papyrosolvens, C. cellulyticum, and A. cellulyticum. Comparative modeling of CtGH43 was performed based on crystal structures with PDB IDs 3C7F, 1YIF, 1YRZ, 2EXH and 1WL7. The structure having lowest MODELLER objective function was selected. The three-dimensional structure revealed typical 5-fold beta–propeller architecture. Energy minimization and validation of predicted model with VERIFY 3D indicated acceptability of the proposed atomic structure. The Ramachandran plot analysis by RAMPAGE confirmed that family 43 glycoside hydrolase (CtGH43 contains little or negligible segments of helices. It also showed that out of 301 residues, 267 (89.3% were in most favoured region, 23 (7.7% were in allowed region and 9 (3.0% were in outlier region. IUPred analysis of CtGH43 showed no disordered region. Active site analysis showed presence of two Asp and one Glu, assumed to form a catalytic triad. This study gives us information about three-dimensional structure and reaffirms the fact that it has the similar core 5-fold beta–propeller architecture and so probably has the same inverting mechanism of action with the formation of above mentioned catalytic triad for catalysis of polysaccharides.

  11. NetMHCpan-4.0: Improved Peptide-MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data.

    Science.gov (United States)

    Jurtz, Vanessa; Paul, Sinu; Andreatta, Massimo; Marcatili, Paolo; Peters, Bjoern; Nielsen, Morten

    2017-11-01

    Cytotoxic T cells are of central importance in the immune system's response to disease. They recognize defective cells by binding to peptides presented on the cell surface by MHC class I molecules. Peptide binding to MHC molecules is the single most selective step in the Ag-presentation pathway. Therefore, in the quest for T cell epitopes, the prediction of peptide binding to MHC molecules has attracted widespread attention. In the past, predictors of peptide-MHC interactions have primarily been trained on binding affinity data. Recently, an increasing number of MHC-presented peptides identified by mass spectrometry have been reported containing information about peptide-processing steps in the presentation pathway and the length distribution of naturally presented peptides. In this article, we present NetMHCpan-4.0, a method trained on binding affinity and eluted ligand data leveraging the information from both data types. Large-scale benchmarking of the method demonstrates an increase in predictive performance compared with state-of-the-art methods when it comes to identification of naturally processed ligands, cancer neoantigens, and T cell epitopes. Copyright © 2017 by The American Association of Immunologists, Inc.

  12. Structures of BmrR-Drug Complexes Reveal a Rigid Multidrug Binding Pocket And Transcription Activation Through Tyrosine Expulsion

    Energy Technology Data Exchange (ETDEWEB)

    Newberry, K.J.; Huffman, J.L.; Miller, M.C.; Vazquez-Laslop, N.; Neyfakh, A.A.; Brennan, R.G.

    2009-05-22

    BmrR is a member of the MerR family and a multidrug binding transcription factor that up-regulates the expression of the bmr multidrug efflux transporter gene in response to myriad lipophilic cationic compounds. The structural mechanism by which BmrR binds these chemically and structurally different drugs and subsequently activates transcription is poorly understood. Here, we describe the crystal structures of BmrR bound to rhodamine 6G (R6G) or berberine (Ber) and cognate DNA. These structures reveal each drug stacks against multiple aromatic residues with their positive charges most proximal to the carboxylate group of Glu-253 and that, unlike other multidrug binding pockets, that of BmrR is rigid. Substitution of Glu-253 with either alanine (E253A) or glutamine (E253Q) results in unpredictable binding affinities for R6G, Ber, and tetraphenylphosphonium. Moreover, these drug binding studies reveal that the negative charge of Glu-253 is not important for high affinity binding to Ber and tetraphenylphosphonium but plays a more significant, but unpredictable, role in R6G binding. In vitro transcription data show that E253A and E253Q are constitutively active, and structures of the drug-free E253A-DNA and E253Q-DNA complexes support a transcription activation mechanism requiring the expulsion of Tyr-152 from the multidrug binding pocket. In sum, these data delineate the mechanism by which BmrR binds lipophilic, monovalent cationic compounds and suggest the importance of the redundant negative electrostatic nature of this rigid drug binding pocket that can be used to discriminate against molecules that are not substrates of the Bmr multidrug efflux pump.

  13. Structural requirements of cholesterol for binding to Vibrio cholerae hemolysin.

    Science.gov (United States)

    Ikigai, Hajime; Otsuru, Hiroshi; Yamamoto, Koichiro; Shimamura, Tadakatsu

    2006-01-01

    Cholesterol is necessary for the conversion of Vibrio cholerae hemolysin (VCH) monomers into oligomers in liposome membranes. Using different sterols, we determined the stereochemical structures of the VCH-binding active groups present in cholesterol. The VCH monomers are bound to cholesterol, diosgenin, campesterol, and ergosterol, which have a hydroxyl group at position C-3 (3betaOH) in the A ring and a C-C double bond between positions C-5 and C-6 (C-C Delta(5)) in the B ring. They are not bound to epicholesterol and dihydrocholesterol, which form a covalent link with a 3alphaOH group and a C-C single bond between positions C-5 and C-6, respectively. This result suggests that the 3betaOH group and the C-CDelta(5) bond in cholesterol are required for VCH monomer binding. We further examined VCH oligomer binding to cholesterol. However, this oligomer did not bind to cholesterol, suggesting that the disappearance of the cholesterol-binding potential of the VCH oligomer might be a result of the conformational change caused by the conversion of the monomer into the oligomer. VCH oligomer formation was observed in liposomes containing sterols with the 3betaOH group and the C-C Delta(5) bond, and it correlated with the binding affinity of the monomer to each sterol. Therefore, it seems likely that monomer binding to membrane sterol leads to the assembly of the monomer. However, since oligomer formation was induced by liposomes containing either epicholesterol or dihydrocholesterol, the 3betaOH group and the C-C Delta(5) bond were not essential for conversion into the oligomer.

  14. FoxA1 binding to the MMTV LTR modulates chromatin structure and transcription

    International Nuclear Information System (INIS)

    Holmqvist, Per-Henrik; Belikov, Sergey; Zaret, Kenneth S.; Wrange, Oerjan

    2005-01-01

    Novel binding sites for the forkhead transcription factor family member Forkhead box A (FoxA), previously referred to as Hepatocyte Nuclear Factor 3 (HNF3), were found within the mouse mammary tumor virus long terminal repeat (MMTV LTR). The effect of FoxA1 on MMTV LTR chromatin structure, and expression was evaluated in Xenopus laevis oocytes. Mutagenesis of either of the two main FoxA binding sites showed that the distal site, -232/-221, conferred FoxA1-dependent partial inhibition of glucocorticoid receptor (GR) driven MMTV transcription. The proximal FoxA binding segment consisted of two individual FoxA sites at -57/-46 and -45/-34, respectively, that mediated an increased basal MMTV transcription. FoxA1 binding altered the chromatin structure of both the inactive- and the hormone-activated MMTV LTR. Hydroxyl radical foot printing revealed FoxA1-mediated changes in the nucleosome arrangement. Micrococcal nuclease digestion showed the hormone-dependent sub-nucleosome complex, containing ∼120 bp of DNA, to be expanded by FoxA1 binding to the proximal segment into a larger complex containing ∼200 bp. The potential function of the FoxA1-mediated expression of the MMTV provirus for maintenance of expression in different tissues is discussed

  15. Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Kailang; Li, Weikai; Peng, Guiqing; Li, Fang; (Harvard-Med); (UMM-MED)

    2010-03-04

    NL63 coronavirus (NL63-CoV), a prevalent human respiratory virus, is the only group I coronavirus known to use angiotensin-converting enzyme 2 (ACE2) as its receptor. Incidentally, ACE2 is also used by group II SARS coronavirus (SARS-CoV). We investigated how different groups of coronaviruses recognize the same receptor, whereas homologous group I coronaviruses recognize different receptors. We determined the crystal structure of NL63-CoV spike protein receptor-binding domain (RBD) complexed with human ACE2. NL63-CoV RBD has a novel {beta}-sandwich core structure consisting of 2 layers of {beta}-sheets, presenting 3 discontinuous receptor-binding motifs (RBMs) to bind ACE2. NL63-CoV and SARS-CoV have no structural homology in RBD cores or RBMs; yet the 2 viruses recognize common ACE2 regions, largely because of a 'virus-binding hotspot' on ACE2. Among group I coronaviruses, RBD cores are conserved but RBMs are variable, explaining how these viruses recognize different receptors. These results provide a structural basis for understanding viral evolution and virus-receptor interactions.

  16. NetMHCpan 4.0: Improved peptide-MHC class I interaction predictions integrating eluted ligand and peptide binding affinity data

    OpenAIRE

    Jurtz, Vanessa; Paul, Sinu; Andreatta, Massimo; Marcatili, Paolo; Peters, Bjoern; Nielsen, Morten

    2017-01-01

    Cytotoxic T cells are of central importance in the immune systems response to disease. They recognize defective cells by binding to peptides presented on the cell surface by MHC (major histocompatibility complex) class I molecules. Peptide binding to MHC molecules is the single most selective step in the antigen presentation pathway. On the quest for T cell epitopes, the prediction of peptide binding to MHC molecules has therefore attracted large attention. In the past, predictors of peptide-...

  17. Pathways to Structure-Property Relationships of Peptide-Materials Interfaces: Challenges in Predicting Molecular Structures.

    Science.gov (United States)

    Walsh, Tiffany R

    2017-07-18

    An in-depth appreciation of how to manipulate the molecular-level recognition between peptides and aqueous materials interfaces, including nanoparticles, will advance technologies based on self-organized metamaterials for photonics and plasmonics, biosensing, catalysis, energy generation and harvesting, and nanomedicine. Exploitation of the materials-selective binding of biomolecules is pivotal to success in these areas and may be particularly key to producing new hierarchically structured biobased materials. These applications could be accomplished by realizing preferential adsorption of a given biomolecule onto one materials composition over another, one surface facet over another, or one crystalline polymorph over another. Deeper knowledge of the aqueous abiotic-biotic interface, to establish clear structure-property relationships in these systems, is needed to meet this goal. In particular, a thorough structural characterization of the surface-adsorbed peptides is essential for establishing these relationships but can often be challenging to accomplish via experimental approaches alone. In addition to myriad existing challenges associated with determining the detailed molecular structure of any molecule adsorbed at an aqueous interface, experimental characterization of materials-binding peptides brings new, complex challenges because many materials-binding peptides are thought to be intrinsically disordered. This means that these peptides are not amenable to experimental techniques that rely on the presence of well-defined secondary structure in the peptide when in the adsorbed state. To address this challenge, and in partnership with experiment, molecular simulations at the atomistic level can bring complementary and critical insights into the origins of this abiotic/biotic recognition and suggest routes for manipulating this phenomenon to realize new types of hybrid materials. For the reasons outlined above, molecular simulation approaches also face

  18. PocketMatch: A new algorithm to compare binding sites in protein structures

    Directory of Open Access Journals (Sweden)

    Chandra Nagasuma

    2008-12-01

    Full Text Available Abstract Background Recognizing similarities and deriving relationships among protein molecules is a fundamental requirement in present-day biology. Similarities can be present at various levels which can be detected through comparison of protein sequences or their structural folds. In some cases similarities obscure at these levels could be present merely in the substructures at their binding sites. Inferring functional similarities between protein molecules by comparing their binding sites is still largely exploratory and not as yet a routine protocol. One of the main reasons for this is the limitation in the choice of appropriate analytical tools that can compare binding sites with high sensitivity. To benefit from the enormous amount of structural data that is being rapidly accumulated, it is essential to have high throughput tools that enable large scale binding site comparison. Results Here we present a new algorithm PocketMatch for comparison of binding sites in a frame invariant manner. Each binding site is represented by 90 lists of sorted distances capturing shape and chemical nature of the site. The sorted arrays are then aligned using an incremental alignment method and scored to obtain PMScores for pairs of sites. A comprehensive sensitivity analysis and an extensive validation of the algorithm have been carried out. A comparison with other site matching algorithms is also presented. Perturbation studies where the geometry of a given site was retained but the residue types were changed randomly, indicated that chance similarities were virtually non-existent. Our analysis also demonstrates that shape information alone is insufficient to discriminate between diverse binding sites, unless combined with chemical nature of amino acids. Conclusion A new algorithm has been developed to compare binding sites in accurate, efficient and high-throughput manner. Though the representation used is conceptually simplistic, we demonstrate that

  19. Crystal structure of glucose isomerase in complex with xylitol inhibitor in one metal binding mode.

    Science.gov (United States)

    Bae, Ji-Eun; Kim, In Jung; Nam, Ki Hyun

    2017-11-04

    Glucose isomerase (GI) is an intramolecular oxidoreductase that interconverts aldoses and ketoses. These characteristics are widely used in the food, detergent, and pharmaceutical industries. In order to obtain an efficient GI, identification of novel GI genes and substrate binding/inhibition have been studied. Xylitol is a well-known inhibitor of GI. In Streptomyces rubiginosus, two crystal structures have been reported for GI in complex with xylitol inhibitor. However, a structural comparison showed that xylitol can have variable conformation at the substrate binding site, e.g., a nonspecific binding mode. In this study, we report the crystal structure of S. rubiginosus GI in a complex with xylitol and glycerol. Our crystal structure showed one metal binding mode in GI, which we presumed to represent the inactive form of the GI. The metal ion was found only at the M1 site, which was involved in substrate binding, and was not present at the M2 site, which was involved in catalytic function. The O 2 and O 4 atoms of xylitol molecules contributed to the stable octahedral coordination of the metal in M1. Although there was no metal at the M2 site, no large conformational change was observed for the conserved residues coordinating M2. Our structural analysis showed that the metal at the M2 site was not important when a xylitol inhibitor was bound to the M1 site in GI. Thus, these findings provided important information for elucidation or engineering of GI functions. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Structural Probing and Molecular Modeling of the A₃ Adenosine Receptor: A Focus on Agonist Binding.

    Science.gov (United States)

    Ciancetta, Antonella; Jacobson, Kenneth A

    2017-03-11

    Adenosine is an endogenous modulator exerting its functions through the activation of four adenosine receptor (AR) subtypes, termed A₁, A 2A , A 2B and A₃, which belong to the G protein-coupled receptor (GPCR) superfamily. The human A₃AR (hA₃AR) subtype is implicated in several cytoprotective functions. Therefore, hA₃AR modulators, and in particular agonists, are sought for their potential application as anti-inflammatory, anticancer, and cardioprotective agents. Structure-based molecular modeling techniques have been applied over the years to rationalize the structure-activity relationships (SARs) of newly emerged A₃AR ligands, guide the subsequent lead optimization, and interpret site-directed mutagenesis (SDM) data from a molecular perspective. In this review, we showcase selected modeling-based and guided strategies that were applied to elucidate the binding of agonists to the A₃AR and discuss the challenges associated with an accurate prediction of the receptor extracellular vestibule through homology modeling from the available X-ray templates.

  1. Complex structure of the fission yeast SREBP-SCAP binding domains reveals an oligomeric organization.

    Science.gov (United States)

    Gong, Xin; Qian, Hongwu; Shao, Wei; Li, Jingxian; Wu, Jianping; Liu, Jun-Jie; Li, Wenqi; Wang, Hong-Wei; Espenshade, Peter; Yan, Nieng

    2016-11-01

    Sterol regulatory element-binding protein (SREBP) transcription factors are master regulators of cellular lipid homeostasis in mammals and oxygen-responsive regulators of hypoxic adaptation in fungi. SREBP C-terminus binds to the WD40 domain of SREBP cleavage-activating protein (SCAP), which confers sterol regulation by controlling the ER-to-Golgi transport of the SREBP-SCAP complex and access to the activating proteases in the Golgi. Here, we biochemically and structurally show that the carboxyl terminal domains (CTD) of Sre1 and Scp1, the fission yeast SREBP and SCAP, form a functional 4:4 oligomer and Sre1-CTD forms a dimer of dimers. The crystal structure of Sre1-CTD at 3.5 Å and cryo-EM structure of the complex at 5.4 Å together with in vitro biochemical evidence elucidate three distinct regions in Sre1-CTD required for Scp1 binding, Sre1-CTD dimerization and tetrameric formation. Finally, these structurally identified domains are validated in a cellular context, demonstrating that the proper 4:4 oligomeric complex formation is required for Sre1 activation.

  2. Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1

    Science.gov (United States)

    Wang, Iren; Hennig, Janosch; Jagtap, Pravin Kumar Ankush; Sonntag, Miriam; Valcárcel, Juan; Sattler, Michael

    2014-01-01

    Alternative pre-messenger ribonucleic acid (pre-mRNA) splicing is an essential process in eukaryotic gene regulation. The T-cell intracellular antigen-1 (TIA-1) is an apoptosis-promoting factor that modulates alternative splicing of transcripts, including the pre-mRNA encoding the membrane receptor Fas. TIA-1 is a multi-domain ribonucleic acid (RNA) binding protein that recognizes poly-uridine tract RNA sequences to facilitate 5′ splice site recognition by the U1 small nuclear ribonucleoprotein (snRNP). Here, we characterize the RNA interaction and conformational dynamics of TIA-1 by nuclear magnetic resonance (NMR), isothermal titration calorimetry (ITC) and small angle X-ray scattering (SAXS). Our NMR-derived solution structure of TIA-1 RRM2–RRM3 (RRM2,3) reveals that RRM2 adopts a canonical RNA recognition motif (RRM) fold, while RRM3 is preceded by an non-canonical helix α0. NMR and SAXS data show that all three RRMs are largely independent structural modules in the absence of RNA, while RNA binding induces a compact arrangement. RRM2,3 binds to pyrimidine-rich FAS pre-mRNA or poly-uridine (U9) RNA with nanomolar affinities. RRM1 has little intrinsic RNA binding affinity and does not strongly contribute to RNA binding in the context of RRM1,2,3. Our data unravel the role of binding avidity and the contributions of the TIA-1 RRMs for recognition of pyrimidine-rich RNAs. PMID:24682828

  3. Algorithms for Protein Structure Prediction

    DEFF Research Database (Denmark)

    Paluszewski, Martin

    -trace. Here we present three different approaches for reconstruction of C-traces from predictable measures. In our first approach [63, 62], the C-trace is positioned on a lattice and a tabu-search algorithm is applied to find minimum energy structures. The energy function is based on half-sphere-exposure (HSE......) is more robust than standard Monte Carlo search. In the second approach for reconstruction of C-traces, an exact branch and bound algorithm has been developed [67, 65]. The model is discrete and makes use of secondary structure predictions, HSE, CN and radius of gyration. We show how to compute good lower...... bounds for partial structures very fast. Using these lower bounds, we are able to find global minimum structures in a huge conformational space in reasonable time. We show that many of these global minimum structures are of good quality compared to the native structure. Our branch and bound algorithm...

  4. Structural basis of non-steroidal anti-inflammatory drug diclofenac binding to human serum albumin.

    Science.gov (United States)

    Zhang, Yao; Lee, Philbert; Liang, Shichu; Zhou, Zuping; Wu, Xiaoyang; Yang, Feng; Liang, Hong

    2015-11-01

    Human serum albumin (HSA) is the most abundant protein in plasma, which plays a central role in drug pharmacokinetics because most compounds bound to HSA in blood circulation. To understand binding characterization of non-steroidal anti-inflammatory drugs to HSA, we resolved the structure of diclofenac and HSA complex by X-ray crystallography. HSA-palmitic acid-diclofenac structure reveals two distinct binding sites for three diclofenac in HSA. One diclofenac is located at the IB subdomain, and its carboxylate group projects toward polar environment, forming hydrogen bond with one water molecule. The other two diclofenac molecules cobind in big hydrophobic cavity of the IIA subdomain without interactive association. Among them, one binds in main chamber of big hydrophobic cavity, and its carboxylate group forms hydrogen bonds with Lys199 and Arg218, as well as one water molecule, whereas another diclofenac binds in side chamber, its carboxylate group projects out cavity, forming hydrogen bond with Ser480. © 2015 John Wiley & Sons A/S.

  5. Switch region for pathogenic structural change in conformational disease and its prediction.

    Directory of Open Access Journals (Sweden)

    Xin Liu

    2010-01-01

    Full Text Available Many diseases are believed to be related to abnormal protein folding. In the first step of such pathogenic structural changes, misfolding occurs in regions important for the stability of the native structure. This destabilizes the normal protein conformation, while exposing the previously hidden aggregation-prone regions, leading to subsequent errors in the folding pathway. Sites involved in this first stage can be deemed switch regions of the protein, and can represent perfect binding targets for drugs to block the abnormal folding pathway and prevent pathogenic conformational changes. In this study, a prediction algorithm for the switch regions responsible for the start of pathogenic structural changes is introduced. With an accuracy of 94%, this algorithm can successfully find short segments covering sites significant in triggering conformational diseases (CDs and is the first that can predict switch regions for various CDs. To illustrate its effectiveness in dealing with urgent public health problems, the reason of the increased pathogenicity of H5N1 influenza virus is analyzed; the mechanisms of the pandemic swine-origin 2009 A(H1N1 influenza virus in overcoming species barriers and in infecting large number of potential patients are also suggested. It is shown that the algorithm is a potential tool useful in the study of the pathology of CDs because: (1 it can identify the origin of pathogenic structural conversion with high sensitivity and specificity, and (2 it provides an ideal target for clinical treatment.

  6. The structure of Plasmodium vivax phosphatidylethanolamine-binding protein suggests a functional motif containing a left-handed helix

    International Nuclear Information System (INIS)

    Arakaki, Tracy; Neely, Helen; Boni, Erica; Mueller, Natasha; Buckner, Frederick S.; Van Voorhis, Wesley C.; Lauricella, Angela; DeTitta, George; Luft, Joseph; Hol, Wim G. J.; Merritt, Ethan A.

    2007-01-01

    The crystal structure of a phosphatidylethanolamine-binding protein from P. vivax, a homolog of Raf-kinase inhibitor protein (RKIP), has been solved to a resolution of 1.3 Å. The inferred interaction surface near the anion-binding site is found to include a distinctive left-handed α-helix. The structure of a putative Raf kinase inhibitor protein (RKIP) homolog from the eukaryotic parasite Plasmodium vivax has been studied to a resolution of 1.3 Å using multiple-wavelength anomalous diffraction at the Se K edge. This protozoan protein is topologically similar to previously studied members of the phosphatidylethanolamine-binding protein (PEBP) sequence family, but exhibits a distinctive left-handed α-helical region at one side of the canonical phospholipid-binding site. Re-examination of previously determined PEBP structures suggests that the P. vivax protein and yeast carboxypeptidase Y inhibitor may represent a structurally distinct subfamily of the diverse PEBP-sequence family

  7. Structures of Adnectin/Protein Complexes Reveal an Expanded Binding Footprint

    Energy Technology Data Exchange (ETDEWEB)

    Ramamurthy, Vidhyashankar; Krystek, Jr., Stanley R.; Bush, Alexander; Wei, Anzhi; Emanuel, Stuart L.; Gupta, Ruchira Das; Janjua, Ahsen; Cheng, Lin; Murdock, Melissa; Abramczyk, Bozena; Cohen, Daniel; Lin, Zheng; Morin, Paul; Davis, Jonathan H.; Dabritz, Michael; McLaughlin, Douglas C.; Russo, Katie A.; Chao, Ginger; Wright, Martin C.; Jenny, Victoria A.; Engle, Linda J.; Furfine, Eric; Sheriff, Steven (BMS)

    2014-10-02

    Adnectins are targeted biologics derived from the tenth type III domain of human fibronectin ({sup 10}Fn3), a member of the immunoglobulin superfamily. Target-specific binders are selected from libraries generated by diversifying the three {sup 10}Fn3 loops that are analogous to the complementarity determining regions of antibodies. The crystal structures of two Adnectins were determined, each in complex with its therapeutic target, EGFR or IL-23. Both Adnectins bind different epitopes than those bound by known monoclonal antibodies. Molecular modeling suggests that some of these epitopes might not be accessible to antibodies because of the size and concave shape of the antibody combining site. In addition to interactions from the Adnectin diversified loops, residues from the N terminus and/or the {beta} strands interact with the target proteins in both complexes. Alanine-scanning mutagenesis confirmed the calculated binding energies of these {beta} strand interactions, indicating that these nonloop residues can expand the available binding footprint.

  8. Structural changes of creatine kinase upon substrate binding.

    Science.gov (United States)

    Forstner, M; Kriechbaum, M; Laggner, P; Wallimann, T

    1998-08-01

    Small-angle x-ray scattering was used to investigate structural changes upon binding of individual substrates or a transition state analog complex (TSAC; Mg-ADP, creatine, and KNO3) to creatine kinase (CK) isoenzymes (dimeric muscle-type (M)-CK and octameric mitochondrial (Mi)-CK) and monomeric arginine kinase (AK). Considerable changes in the shape and the size of the molecules occurred upon binding of Mg-nucleotide or TSAC. The radius of gyration of Mi-CK was reduced from 55.6 A (free enzyme) to 48.9 A (enzyme plus Mg-ATP) and to 48.2 A (enzyme plus TSAC). M-CK showed similar changes from 28.0 A (free enzyme) to 25.6 A (enzyme plus Mg-ATP) and to 25.5 A (enzyme plus TSAC). Creatine alone did not lead to significant changes in the radii of gyration, nor did free ATP or ADP. AK also showed a change of the radius of gyration from 21.5 A (free enzyme) to 19.7 A (enzyme plus Mg-ATP), whereas with arginine alone only a minor change could be observed. The primary change in structure as seen with monomeric AK seems to be a Mg-nucleotide-induced domain movement relative to each other, whereas the effect of substrate may be of local order only. In CK, however, additional movements have to be involved.

  9. Structural basis for antagonism of human interleukin 18 by poxvirus interleukin 18-binding protein

    Energy Technology Data Exchange (ETDEWEB)

    Krumm, Brian; Meng, Xiangzhi; Li, Yongchao; Xiang, Yan; Deng, Junpeng (Texas-HSC); (OKLU)

    2009-07-10

    Human interleukin-18 (hIL-18) is a cytokine that plays an important role in inflammation and host defense against microbes. Its activity is regulated in vivo by a naturally occurring antagonist, the human IL-18-binding protein (IL-18BP). Functional homologs of human IL-18BP are encoded by all orthopoxviruses, including variola virus, the causative agent of smallpox. They contribute to virulence by suppressing IL-18-mediated immune responses. Here, we describe the 2.0-{angstrom} resolution crystal structure of an orthopoxvirus IL-18BP, ectromelia virus IL-18BP (ectvIL-18BP), in complex with hIL-18. The hIL-18 structure in the complex shows significant conformational change at the binding interface compared with the structure of ligand-free hIL-18, indicating that the binding is mediated by an induced-fit mechanism. EctvIL-18BP adopts a canonical Ig fold and interacts via one edge of its {beta}-sandwich with 3 cavities on the hIL-18 surface through extensive hydrophobic and hydrogen bonding interactions. Most of the ectvIL-18BP residues that participate in these interactions are conserved in both human and viral homologs, explaining their functional equivalence despite limited sequence homology. EctvIL-18BP blocks a putative receptor-binding site on IL-18, thus preventing IL-18 from engaging its receptor. Our structure provides insights into how IL-18BPs modulate hIL-18 activity. The revealed binding interface provides the basis for rational design of inhibitors against orthopoxvirus IL-18BP (for treating orthopoxvirus infection) or hIL-18 (for treating certain inflammatory and autoimmune diseases).

  10. A Common Structural Motif in the Binding of Virulence Factors to Bacterial Secretion Chaperones

    International Nuclear Information System (INIS)

    Lilic, M.; Vujanac, M.; Stebbins, C.

    2006-01-01

    Salmonella invasion protein A (SipA) is translocated into host cells by a type III secretion system (T3SS) and comprises two regions: one domain binds its cognate type III secretion chaperone, InvB, in the bacterium to facilitate translocation, while a second domain functions in the host cell, contributing to bacterial uptake by polymerizing actin. We present here the crystal structures of the SipA chaperone binding domain (CBD) alone and in complex with InvB. The SipA CBD is found to consist of a nonglobular polypeptide as well as a large globular domain, both of which are necessary for binding to InvB. We also identify a structural motif that may direct virulence factors to their cognate chaperones in a diverse range of pathogenic bacteria. Disruption of this structural motif leads to a destabilization of several chaperone-substrate complexes from different species, as well as an impairment of secretion in Salmonella

  11. From nonspecific DNA-protein encounter complexes to the prediction of DNA-protein interactions.

    Directory of Open Access Journals (Sweden)

    Mu Gao

    2009-03-01

    Full Text Available DNA-protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA-protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA-protein interaction modes without knowing its specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA-protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA-protein interaction modes exhibit some similarity to specific DNA-protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Calpha deviation from native is up to 5 A from their native structures. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA-protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein.

  12. Structure-based engineering to restore high affinity binding of an isoform-selective anti-TGFβ1 antibody

    Science.gov (United States)

    Honey, Denise M.; Best, Annie; Qiu, Huawei

    2018-01-01

    ABSTRACT Metelimumab (CAT192) is a human IgG4 monoclonal antibody developed as a TGFβ1-specific antagonist. It was tested in clinical trials for the treatment of scleroderma but later terminated due to lack of efficacy. Subsequent characterization of CAT192 indicated that its TGFβ1 binding affinity was reduced by ∼50-fold upon conversion from the parental single-chain variable fragment (scFv) to IgG4. We hypothesized this result was due to decreased conformational flexibility of the IgG that could be altered via engineering. Therefore, we designed insertion mutants in the elbow region and screened for binding and potency. Our results indicated that increasing the elbow region linker length in each chain successfully restored the isoform-specific and high affinity binding of CAT192 to TGFβ1. The crystal structure of the high binding affinity mutant displays large conformational rearrangements of the variable domains compared to the wild-type antigen-binding fragment (Fab) and the low binding affinity mutants. Insertion of two glycines in both the heavy and light chain elbow regions provided sufficient flexibility for the variable domains to extend further apart than the wild-type Fab, and allow the CDR3s to make additional interactions not seen in the wild-type Fab structure. These interactions coupled with the dramatic conformational changes provide a possible explanation of how the scFv and elbow-engineered Fabs bind TGFβ1 with high affinity. This study demonstrates the benefits of re-examining both structure and function when converting scFv to IgG molecules, and highlights the potential of structure-based engineering to produce fully functional antibodies. PMID:29333938

  13. Conformational Dynamics of apo-GlnBP Revealed by Experimental and Computational Analysis

    KAUST Repository

    Feng, Yitao

    2016-10-13

    The glutamine binding protein (GlnBP) binds l-glutamine and cooperates with its cognate transporters during glutamine uptake. Crystal structure analysis has revealed an open and a closed conformation for apo- and holo-GlnBP, respectively. However, the detailed conformational dynamics have remained unclear. Herein, we combined NMR spectroscopy, MD simulations, and single-molecule FRET techniques to decipher the conformational dynamics of apo-GlnBP. The NMR residual dipolar couplings of apo-GlnBP were in good agreement with a MD-derived structure ensemble consisting of four metastable states. The open and closed conformations are the two major states. This four-state model was further validated by smFRET experiments and suggests the conformational selection mechanism in ligand recognition of GlnBP. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim

  14. Conformational Dynamics of apo-GlnBP Revealed by Experimental and Computational Analysis

    KAUST Repository

    Feng, Yitao; Zhang, Lu; Wu, Shaowen; Liu, Zhijun; Gao, Xin; Zhang, Xu; Liu, Maili; Liu, Jianwei; Huang, Xuhui; Wang, Wenning

    2016-01-01

    The glutamine binding protein (GlnBP) binds l-glutamine and cooperates with its cognate transporters during glutamine uptake. Crystal structure analysis has revealed an open and a closed conformation for apo- and holo-GlnBP, respectively. However, the detailed conformational dynamics have remained unclear. Herein, we combined NMR spectroscopy, MD simulations, and single-molecule FRET techniques to decipher the conformational dynamics of apo-GlnBP. The NMR residual dipolar couplings of apo-GlnBP were in good agreement with a MD-derived structure ensemble consisting of four metastable states. The open and closed conformations are the two major states. This four-state model was further validated by smFRET experiments and suggests the conformational selection mechanism in ligand recognition of GlnBP. © 2016 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim

  15. Critical Features of Fragment Libraries for Protein Structure Prediction.

    Science.gov (United States)

    Trevizani, Raphael; Custódio, Fábio Lima; Dos Santos, Karina Baptista; Dardenne, Laurent Emmanuel

    2017-01-01

    The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction.

  16. TFpredict and SABINE: sequence-based prediction of structural and functional characteristics of transcription factors.

    Directory of Open Access Journals (Sweden)

    Johannes Eichner

    Full Text Available One of the key mechanisms of transcriptional control are the specific connections between transcription factors (TF and cis-regulatory elements in gene promoters. The elucidation of these specific protein-DNA interactions is crucial to gain insights into the complex regulatory mechanisms and networks underlying the adaptation of organisms to dynamically changing environmental conditions. As experimental techniques for determining TF binding sites are expensive and mostly performed for selected TFs only, accurate computational approaches are needed to analyze transcriptional regulation in eukaryotes on a genome-wide level. We implemented a four-step classification workflow which for a given protein sequence (1 discriminates TFs from other proteins, (2 determines the structural superclass of TFs, (3 identifies the DNA-binding domains of TFs and (4 predicts their cis-acting DNA motif. While existing tools were extended and adapted for performing the latter two prediction steps, the first two steps are based on a novel numeric sequence representation which allows for combining existing knowledge from a BLAST scan with robust machine learning-based classification. By evaluation on a set of experimentally confirmed TFs and non-TFs, we demonstrate that our new protein sequence representation facilitates more reliable identification and structural classification of TFs than previously proposed sequence-derived features. The algorithms underlying our proposed methodology are implemented in the two complementary tools TFpredict and SABINE. The online and stand-alone versions of TFpredict and SABINE are freely available to academics at http://www.cogsys.cs.uni-tuebingen.de/software/TFpredict/ and http://www.cogsys.cs.uni-tuebingen.de/software/SABINE/.

  17. Identification of the bile salt binding site on IpaD from Shigella flexneri and the influence of ligand binding on IpaD structure.

    Science.gov (United States)

    Barta, Michael L; Guragain, Manita; Adam, Philip; Dickenson, Nicholas E; Patil, Mrinalini; Geisbrecht, Brian V; Picking, Wendy L; Picking, William D

    2012-03-01

    Type III secretion (TTS) is an essential virulence factor for Shigella flexneri, the causative agent of shigellosis. The Shigella TTS apparatus (TTSA) is an elegant nanomachine that is composed of a basal body, an external needle to deliver effectors into human cells, and a needle tip complex that controls secretion activation. IpaD is at the tip of the nascent TTSA needle where it controls the first step of TTS activation. The bile salt deoxycholate (DOC) binds to IpaD to induce recruitment of the translocator protein IpaB into the maturing tip complex. We recently used spectroscopic analyses to show that IpaD undergoes a structural rearrangement that accompanies binding to DOC. Here, we report a crystal structure of IpaD with DOC bound and test the importance of the residues that make up the DOC binding pocket on IpaD function. IpaD binds DOC at the interface between helices α3 and α7, with concomitant movement in the orientation of helix α7 relative to its position in unbound IpaD. When the IpaD residues involved in DOC binding are mutated, some are found to lead to altered invasion and secretion phenotypes. These findings suggest that adoption of a DOC bound structural state for IpaD primes the Shigella TTSA for contact with host cells. The data presented here and in the studies leading up to this work provide the foundation for developing a model of the first step in Shigella TTS activation.

  18. Mapping the heparin-binding site of the BMP antagonist gremlin by site-directed mutagenesis based on predictive modelling.

    Science.gov (United States)

    Tatsinkam, Arnold Junior; Mulloy, Barbara; Rider, Christopher C

    2015-08-15

    Gremlin is a member of the CAN (cerberus and DAN) family of secreted BMP (bone morphogenetic protein) antagonists and also an agonist of VEGF (vascular endothelial growth factor) receptor-2. It is critical in limb skeleton and kidney development and is re-expressed during tissue fibrosis. Gremlin binds strongly to heparin and heparan sulfate and, in the present study, we sought to investigate its heparin-binding site. In order to explore a putative non-contiguous binding site predicted by computational molecular modelling, we substituted a total of 11 key arginines and lysines located in three basic residue sequence clusters with homologous sequences from cerberus and DAN (differential screening selected gene abberative in neuroblastoma), CAN proteins which lack basic residues in these positions. A panel of six Myc-tagged gremlin mutants, MGR-1-MGR-6 (MGR, mutant gremlin), each containing different combinations of targeted substitutions, all showed markedly reduced affinity for heparin as demonstrated by their NaCl elution on heparin affinity chromatography, thus verifying our predictions. Both MGR-5 and MGR-6 retained BMP-4-binding activity comparable to that of wild-type gremlin. Low-molecular-mass heparin neither promoted nor inhibited BMP-4 binding. Finally, glutaraldehyde cross-linking demonstrated that gremlin forms non-covalent dimers, similar behaviour to that of DAN and also PRDC (protein related to cerberus and DAN), another CAN protein. The resulting dimer would possess two heparin-binding sites, each running along an exposed surface on the second β-strand finger loop of one of the monomers. © 2015 Authors; published by Portland Press Limited.

  19. Protein Structure Prediction by Protein Threading

    Science.gov (United States)

    Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong

    The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.

  20. Structural insight into the binding interactions of modeled structure of Arabidopsis thaliana urease with urea: an in silico study.

    Science.gov (United States)

    Yata, Vinod Kumar; Thapa, Arun; Mattaparthi, Venkata Satish Kumar

    2015-01-01

    Urease (EC 3.5.1.5., urea amidohydrolase) catalyzes the hydrolysis of urea to ammonia and carbon dioxide. Urease is present to a greater abundance in plants and plays significant role related to nitrogen recycling from urea. But little is known about the structure and function of the urease derived from the Arabidopsis thaliana, the model system of choice for research in plant biology. In this study, a three-dimensional structural model of A. thaliana urease was constructed using computer-aided molecular modeling technique. The characteristic structural features of the modeled structure were then studied using atomistic molecular dynamics simulation. It was observed that the modeled structure was stable and regions between residues index (50-80, 500-700) to be significantly flexible. From the docking studies, we detected the possible binding interactions of modeled urease with urea. Ala399, Ile675, Thr398, and Thr679 residues of A. thaliana urease were observed to be significantly involved in binding with the substrate urea. We also compared the docking studies of ureases from other sources such as Canavalia ensiformis, Helicobacter pylori, and Bacillus pasteurii. In addition, we carried out mutation analysis to find the highly mutable amino acid residues of modeled A. thaliana urease. In this particular study, we observed Met485, Tyr510, Ser786, Val426, and Lys765 to be highly mutable amino acids. These results are significant for the mutagenesis analysis. As a whole, this study expounds the salient structural features as well the binding interactions of the modeled structure of A. thaliana urease.

  1. Antibody structural modeling with prediction of immunoglobulin structure (PIGS)

    DEFF Research Database (Denmark)

    Marcatili, Paolo; Olimpieri, Pier Paolo; Chailyan, Anna

    2014-01-01

    Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful...... applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (∼10 min...... on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together....

  2. LIGAND-BINDING SITES ON THE MYCOBACTERIUM TUBERCULOSIS UREASE

    Directory of Open Access Journals (Sweden)

    Lisnyak Yu. V.

    2017-10-01

    algorithm. To model the reduction in flexibility of allosteric pocket on modulator binding, the unperturbed normal modes are first calculated for the protein. The calculation is then repeated, each time perturbing one of the pockets in the protein. These results are combined with output from Fpocket in a support vector machine (SVM to predict allosteric pockets on proteins. The AlloSite server is similar to the AlloPred method in that it uses the Fpocket algorithm to elucidate allosteric pockets, whereas AlloPred uses an approach that combines flexibility with the Fpocket output. Results and discussion. By computational solvent mapping method FTSite, we have explored M.tuberculosis urease nonamer surface to find sites that tend to bind small organic molecular probes representing fragments of drug molecules with diverse hydrophobic and hydrophilic properties. The predicted three top ranked binding sites were situated at the interfaces between chains C and A, and chain G of neighbour trimer (and at equivalent locations in symmetrical trimers as well. A mapping of enzymes generally yields the most probable sites situated in a subsite of the enzyme active site. This was not the case for MTU which active sites were inaccessible for probes due to the closed conformation of the covering flap, and predicted binding sites were located not far from them at the entrance into a deep pocket. To explore their possible structural and functional role, we correlated the locations of predicted MTU binding sites and its ancillary pockets (which remain open and solvent exposed even while the flap is closed and indicated their partial overlapping. This overlapping may suggest that predicted sites are likely the intermediate binding sites responsible for recruiting a ligand to another binding site deeply buried in the protein. To examine the possibility that predicted binding sites are the sites for allostery binding we carried out the search for probable sites of allostery binding on MTU

  3. Convolutional neural network architectures for predicting DNA–protein binding

    Science.gov (United States)

    Zeng, Haoyang; Edwards, Matthew D.; Liu, Ge; Gifford, David K.

    2016-01-01

    Motivation: Convolutional neural networks (CNN) have outperformed conventional methods in modeling the sequence specificity of DNA–protein binding. Yet inappropriate CNN architectures can yield poorer performance than simpler models. Thus an in-depth understanding of how to match CNN architecture to a given task is needed to fully harness the power of CNNs for computational biology applications. Results: We present a systematic exploration of CNN architectures for predicting DNA sequence binding using a large compendium of transcription factor datasets. We identify the best-performing architectures by varying CNN width, depth and pooling designs. We find that adding convolutional kernels to a network is important for motif-based tasks. We show the benefits of CNNs in learning rich higher-order sequence features, such as secondary motifs and local sequence context, by comparing network performance on multiple modeling tasks ranging in difficulty. We also demonstrate how careful construction of sequence benchmark datasets, using approaches that control potentially confounding effects like positional or motif strength bias, is critical in making fair comparisons between competing methods. We explore how to establish the sufficiency of training data for these learning tasks, and we have created a flexible cloud-based framework that permits the rapid exploration of alternative neural network architectures for problems in computational biology. Availability and Implementation: All the models analyzed are available at http://cnn.csail.mit.edu. Contact: gifford@mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27307608

  4. Structural aspects of catalytic mechanisms of endonucleases and their binding to nucleic acids

    Energy Technology Data Exchange (ETDEWEB)

    Zhukhlistova, N. E.; Balaev, V. V.; Lyashenko, A. V.; Lashkov, A. A., E-mail: alashkov83@gmail.com [Russian Academy of Sciences, Shubnikov Institute of Crystallography (Russian Federation)

    2012-05-15

    Endonucleases (EC 3.1) are enzymes of the hydrolase class that catalyze the hydrolytic cleavage of deoxyribonucleic and ribonucleic acids at any region of the polynucleotide chain. Endonucleases are widely used both in biotechnological processes and in veterinary medicine as antiviral agents. Medical applications of endonucleases in human cancer therapy hold promise. The results of X-ray diffraction studies of the spatial organization of endonucleases and their complexes and the mechanism of their action are analyzed and generalized. An analysis of the structural studies of this class of enzymes showed that the specific binding of enzymes to nucleic acids is characterized by interactions with nitrogen bases and the nucleotide backbone, whereas the nonspecific binding of enzymes is generally characterized by interactions only with the nucleic-acid backbone. It should be taken into account that the specificity can be modulated by metal ions and certain low-molecular-weight organic compounds. To test the hypotheses about specific and nonspecific nucleic-acid-binding proteins, it is necessary to perform additional studies of atomic-resolution three-dimensional structures of enzyme-nucleic-acid complexes by methods of structural biology.

  5. Igs expressed by chronic lymphocytic leukemia B cells show limited binding-site structure variability.

    Science.gov (United States)

    Marcatili, Paolo; Ghiotto, Fabio; Tenca, Claudya; Chailyan, Anna; Mazzarello, Andrea N; Yan, Xiao-Jie; Colombo, Monica; Albesiano, Emilia; Bagnara, Davide; Cutrona, Giovanna; Morabito, Fortunato; Bruno, Silvia; Ferrarini, Manlio; Chiorazzi, Nicholas; Tramontano, Anna; Fais, Franco

    2013-06-01

    Ag selection has been suggested to play a role in chronic lymphocytic leukemia (CLL) pathogenesis, but no large-scale analysis has been performed so far on the structure of the Ag-binding sites (ABSs) of leukemic cell Igs. We sequenced both H and L chain V(D)J rearrangements from 366 CLL patients and modeled their three-dimensional structures. The resulting ABS structures were clustered into a small number of discrete sets, each containing ABSs with similar shapes and physicochemical properties. This structural classification correlates well with other known prognostic factors such as Ig mutation status and recurrent (stereotyped) receptors, but it shows a better prognostic value, at least in the case of one structural cluster for which clinical data were available. These findings suggest, for the first time, to our knowledge, on the basis of a structural analysis of the Ab-binding sites, that selection by a finite quota of antigenic structures operates on most CLL cases, whether mutated or unmutated.

  6. PSPP: a protein structure prediction pipeline for computing clusters.

    Directory of Open Access Journals (Sweden)

    Michael S Lee

    2009-07-01

    Full Text Available Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster.The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML formats. So far, the pipeline has been used to study viral and bacterial proteomes.The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform

  7. Purification, characterization, cloning and structural analysis of Crocodylus siamensis ovotransferrin for insight into functions of iron binding and autocleavage.

    Science.gov (United States)

    Chaipayang, Sukanya; Songsiriritthigul, Chomphunuch; Chen, Chun-Jung; Palacios, Philip M; Pierce, Brad S; Jangpromma, Nisachon; Klaynongsruang, Sompong

    2017-10-01

    Ovotransferrin (OTf), the major protein constituent of egg white, is of great interest due to its pivotal role in biological iron transport and storage processes and its spontaneous autocleavage into peptidic fragments with alternative biological properties, such as antibacterial and antioxidant activities. However, despite being well-investigated in avian, a detailed elucidation of the structure-function relationship of ovotransferrins in the closely related order of Crocodilia has not been reported to date. In this study, electron paramagnetic resonance (EPR) confirmed the presence of two spectroscopically distinct ferric iron binding sites in Crocodylus siamensis OTf (cOTf), but implied a five-fold lower quantity of bound iron than in hen OTf (hOTf). In addition, quantitative estimation of free sulfhydryl groups revealed slight differences to hOTf. To gain a better structural understanding of cOTf, we found a cOTf gene consisting of an open reading frame of 2040bp and encoding a protein of 679 amino acids. In silico prediction of the three-dimensional structure of cOTf and comparison with hOTf revealed four evolutionarily conserved iron-binding sites in both N- and C-lobes, as well as the presence of only 13 of the 15 disulfide bonds in hOTf. This evolutionary loss of disulfide linkages in conjunction with the lack of hydrogen bonding from a dilysine trigger in the C-lobe are presumed to affect the iron binding and autocleavage character of cOTf. As a result, cOTf may be capable of exerting a more diverse array of functions compared to its avian counterparts; for instance, ion buffering, antioxidant and antimicrobial activities. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Limited tryptic proteolysis of the benzodiazepine binding proteins in different species reveals structural homologies.

    Science.gov (United States)

    Friedl, W; Lentes, K U; Schmitz, E; Propping, P; Hebebrand, J

    1988-12-01

    Peptide mapping can be used to elucidate further the structural similarities of the benzodiazepine binding proteins in different vertebrate species. Crude synaptic membrane preparations were photoaffinity-labeled with [3H]flunitrazepam and subsequently degraded with various concentrations of trypsin. Sodium dodecyl sulfate-polyacrylamide gel electrophoresis followed by fluorography allowed a comparison of the molecular weights of photolabeled peptides in different species. Tryptic degradation led to a common peptide of 40K in all species investigated, a finding indicating that the benzodiazepine binding proteins are structurally homologous in higher bony fishes and tetrapods.

  9. Text mining improves prediction of protein functional sites.

    Directory of Open Access Journals (Sweden)

    Karin M Verspoor

    Full Text Available We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites. The structure analysis was carried out using Dynamics Perturbation Analysis (DPA, which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  10. Text Mining Improves Prediction of Protein Functional Sites

    Science.gov (United States)

    Cohn, Judith D.; Ravikumar, Komandur E.

    2012-01-01

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions. PMID:22393388

  11. Prediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.

    Science.gov (United States)

    Liu, Zhi-Ping; Chen, Luonan

    2016-01-01

    Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time consuming and labor costing. Thus, it is of importance to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and recognitions will highly benefit to decipher the interaction mechanisms between protein and RNA, as well as to improve the RNA-related protein engineering and drug design. In this work, we summarize the current bioinformatics strategies of predicting protein-RNA interactions and dissecting protein-RNA interaction mechanisms from local structure binding motifs. In particular, we focus on the feature-based machine learning methods, in which the molecular descriptors of protein and RNA are extracted and integrated as feature vectors of representing the interaction events and recognition residues. In addition, the available methods are classified and compared comprehensively. The molecular descriptors are expected to elucidate the binding mechanisms of protein-RNA interaction and reveal the functional implications from structural complementary perspective.

  12. Consensus of sample-balanced classifiers for identifying ligand-binding residue by co-evolutionary physicochemical characteristics of amino acids

    KAUST Repository

    Chen, Peng

    2013-01-01

    Protein-ligand binding is an important mechanism for some proteins to perform their functions, and those binding sites are the residues of proteins that physically bind to ligands. So far, the state-of-the-art methods search for similar, known structures of the query and predict the binding sites based on the solved structures. However, such structural information is not commonly available. In this paper, we propose a sequence-based approach to identify protein-ligand binding residues. Due to the highly imbalanced samples between the ligand-binding sites and non ligand-binding sites, we constructed several balanced data sets, for each of which a random forest (RF)-based classifier was trained. The ensemble of these RF classifiers formed a sequence-based protein-ligand binding site predictor. Experimental results on CASP9 targets demonstrated that our method compared favorably with the state-of-the-art. © Springer-Verlag Berlin Heidelberg 2013.

  13. RNAstructure: software for RNA secondary structure prediction and analysis.

    Science.gov (United States)

    Reuter, Jessica S; Mathews, David H

    2010-03-15

    To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.

  14. Cloud computing for protein-ligand binding site comparison.

    Science.gov (United States)

    Hung, Che-Lun; Hua, Guan-Jie

    2013-01-01

    The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.

  15. A novel signal transduction protein: Combination of solute binding and tandem PAS-like sensor domains in one polypeptide chain: Periplasmic Ligand Binding Protein Dret_0059

    Energy Technology Data Exchange (ETDEWEB)

    Wu, R. [Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne Illinois 60439; Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Wilton, R. [Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Cuff, M. E. [Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne Illinois 60439; Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Structural Biology Center, Argonne National Laboratory, Argonne Illinois 60439; Endres, M. [Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne Illinois 60439; Babnigg, G. [Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne Illinois 60439; Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Edirisinghe, J. N. [Mathematics and Computer Science Division, Argonne National Laboratory, Argonne Illinois 60439; Computation Institute, University of Chicago, Chicago Illinois 60637; Henry, C. S. [Mathematics and Computer Science Division, Argonne National Laboratory, Argonne Illinois 60439; Computation Institute, University of Chicago, Chicago Illinois 60637; Joachimiak, A. [Midwest Center for Structural Genomics, Argonne National Laboratory, Argonne Illinois 60439; Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Structural Biology Center, Argonne National Laboratory, Argonne Illinois 60439; Department of Biochemistry and Molecular Biology, University of Chicago, Chicago Illinois 60637; Schiffer, M. [Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439; Pokkuluri, P. R. [Biosciences Division, Argonne National Laboratory, Argonne Illinois 60439

    2017-03-06

    We report the structural and biochemical characterization of a novel periplasmic ligand-binding protein, Dret_0059, from Desulfohalobium retbaense DSM 5692, an organism isolated from the Salt Lake Retba in Senegal. The structure of the protein consists of a unique combination of a periplasmic solute binding protein (SBP) domain at the N-terminal and a tandem PAS-like sensor domain at the C-terminal region. SBP domains are found ubiquitously and their best known function is in solute transport across membranes. PAS-like sensor domains are commonly found in signal transduction proteins. These domains are widely observed as parts of many protein architectures and complexes but have not been observed previously within the same polypeptide chain. In the structure of Dret_0059, a ketoleucine moiety is bound to the SBP, whereas a cytosine molecule is bound in the distal PAS-like domain of the tandem PAS-like domain. Differential scanning flourimetry support the binding of ligands observed in the crystal structure. There is significant interaction between the SBP and tandem PAS-like domains, and it is possible that the binding of one ligand could have an effect on the binding of the other. We uncovered three other proteins with this structural architecture in the non-redundant sequence data base, and predict that they too bind the same substrates. The genomic context of this protein did not offer any clues for its function. We did not find any biological process in which the two observed ligands are coupled. The protein Dret_0059 could be involved in either signal transduction or solute transport.

  16. Insights into structural features determining odorant affinities to honey bee odorant binding protein 14.

    Science.gov (United States)

    Schwaighofer, Andreas; Pechlaner, Maria; Oostenbrink, Chris; Kotlowski, Caroline; Araman, Can; Mastrogiacomo, Rosa; Pelosi, Paolo; Knoll, Wolfgang; Nowak, Christoph; Larisika, Melanie

    2014-04-18

    Molecular interactions between odorants and odorant binding proteins (OBPs) are of major importance for understanding the principles of selectivity of OBPs towards the wide range of semiochemicals. It is largely unknown on a structural basis, how an OBP binds and discriminates between odorant molecules. Here we examine this aspect in greater detail by comparing the C-minus OBP14 of the honey bee (Apis mellifera L.) to a mutant form of the protein that comprises the third disulfide bond lacking in C-minus OBPs. Affinities of structurally analogous odorants featuring an aromatic phenol group with different side chains were assessed based on changes of the thermal stability of the protein upon odorant binding monitored by circular dichroism spectroscopy. Our results indicate a tendency that odorants show higher affinity to the wild-type OBP suggesting that the introduced rigidity in the mutant protein has a negative effect on odorant binding. Furthermore, we show that OBP14 stability is very sensitive to the position and type of functional groups in the odorant. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. Antibody structural modeling with prediction of immunoglobulin structure (PIGS)

    KAUST Repository

    Marcatili, Paolo

    2014-11-06

    © 2014 Nature America, Inc. All rights reserved. Antibodies (or immunoglobulins) are crucial for defending organisms from pathogens, but they are also key players in many medical, diagnostic and biotechnological applications. The ability to predict their structure and the specific residues involved in antigen recognition has several useful applications in all of these areas. Over the years, we have developed or collaborated in developing a strategy that enables researchers to predict the 3D structure of antibodies with a very satisfactory accuracy. The strategy is completely automated and extremely fast, requiring only a few minutes (~10 min on average) to build a structural model of an antibody. It is based on the concept of canonical structures of antibody loops and on our understanding of the way light and heavy chains pack together.

  18. Structure and Sequence Search on Aptamer-Protein Docking

    Science.gov (United States)

    Xiao, Jiajie; Bonin, Keith; Guthold, Martin; Salsbury, Freddie

    2015-03-01

    Interactions between proteins and deoxyribonucleic acid (DNA) play a significant role in the living systems, especially through gene regulation. However, short nucleic acids sequences (aptamers) with specific binding affinity to specific proteins exhibit clinical potential as therapeutics. Our capillary and gel electrophoresis selection experiments show that specific sequences of aptamers can be selected that bind specific proteins. Computationally, given the experimentally-determined structure and sequence of a thrombin-binding aptamer, we can successfully dock the aptamer onto thrombin in agreement with experimental structures of the complex. In order to further study the conformational flexibility of this thrombin-binding aptamer and to potentially develop a predictive computational model of aptamer-binding, we use GPU-enabled molecular dynamics simulations to both examine the conformational flexibility of the aptamer in the absence of binding to thrombin, and to determine our ability to fold an aptamer. This study should help further de-novo predictions of aptamer sequences by enabling the study of structural and sequence-dependent effects on aptamer-protein docking specificity.

  19. A structural classification of substrate-binding proteins

    NARCIS (Netherlands)

    Berntsson, Ronnie P. -A.; Smits, Sander H. J.; Schmitt, Lutz; Slotboom, Dirk-Jan; Poolman, Bert

    2010-01-01

    Substrate-binding proteins (SBP) are associated with a wide variety of protein complexes. The proteins are part of ATP-binding cassette transporters for substrate uptake, ion gradient driven transporters, DNA-binding proteins, as well as channels and receptors from both pro-and eukaryotes. A wealth

  20. Loop-to-helix transition in the structure of multidrug regulator AcrR at the entrance of the drug-binding cavity

    Energy Technology Data Exchange (ETDEWEB)

    Manjasetty, Babu A.; Halavaty, Andrei S.; Luan, Chi-Hao; Osipiuk, Jerzy; Mulligan, Rory; Kwon, Keehwan; Anderson, Wayne F.; Joachimiak, Andrzej

    2016-04-01

    Multidrug transcription regulator AcrR from Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 belongs to the tetracycline repressor family, one of the largest groups of bacterial transcription factors. The crystal structure of dimeric AcrR was determined and refined to 1.56 Å resolution. The tertiary and quaternary structures of AcrR are similar to those of its homologs. The multidrug binding site was identified based on structural alignment with homologous proteins and has a di(hydroxyethyl)ether molecule bound. Residues from helices a4 and a7 shape the entry into this binding site. The structure of AcrR reveals that the extended helical conformation of helix a4 is stabilized by the hydrogen bond between Glu67 (helix a4) and Gln130 (helix a7). Based on the structural comparison with the closest homolog structure, the Escherichia coli AcrR, we propose that this hydrogen bond is responsible for control of the loop-to-helix transition within helix a4. This local conformational switch of helix a4 may be a key step in accessing the multidrug binding site and securing ligands at the binding site. Solution smallmolecule binding studies suggest that AcrR binds ligands with their core chemical structure resembling the tetracyclic ring of cholesterol.

  1. Deciphering common recognition principles of nucleoside mono/di and tri-phosphates binding in diverse proteins via structural matching of their binding sites.

    Science.gov (United States)

    Bhagavat, Raghu; Srinivasan, Narayanaswamy; Chandra, Nagasuma

    2017-09-01

    Nucleoside triphosphate (NTP) ligands are of high biological importance and are essential for all life forms. A pre-requisite for them to participate in diverse biochemical processes is their recognition by diverse proteins. It is thus of great interest to understand the basis for such recognition in different proteins. Towards this, we have used a structural bioinformatics approach and analyze structures of 4677 NTP complexes available in Protein Data Bank (PDB). Binding sites were extracted and compared exhaustively using PocketMatch, a sensitive in-house site comparison algorithm, which resulted in grouping the entire dataset into 27 site-types. Each of these site-types represent a structural motif comprised of two or more residue conservations, derived using another in-house tool for superposing binding sites, PocketAlign. The 27 site-types could be grouped further into 9 super-types by considering partial similarities in the sites, which indicated that the individual site-types comprise different combinations of one or more site features. A scan across PDB using the 27 structural motifs determined the motifs to be specific to NTP binding sites, and a computational alanine mutagenesis indicated that residues identified to be highly conserved in the motifs are also most contributing to binding. Alternate orientations of the ligand in several site-types were observed and rationalized, indicating the possibility of some residues serving as anchors for NTP recognition. The presence of multiple site-types and the grouping of multiple folds into each site-type is strongly suggestive of convergent evolution. Knowledge of determinants obtained from this study will be useful for detecting function in unknown proteins. Proteins 2017; 85:1699-1712. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  2. Computational identification of binding energy hot spots in protein-RNA complexes using an ensemble approach.

    Science.gov (United States)

    Pan, Yuliang; Wang, Zixiang; Zhan, Weihua; Deng, Lei

    2018-05-01

    Identifying RNA-binding residues, especially energetically favored hot spots, can provide valuable clues for understanding the mechanisms and functional importance of protein-RNA interactions. Yet, limited availability of experimentally recognized energy hot spots in protein-RNA crystal structures leads to the difficulties in developing empirical identification approaches. Computational prediction of RNA-binding hot spot residues is still in its infant stage. Here, we describe a computational method, PrabHot (Prediction of protein-RNA binding hot spots), that can effectively detect hot spot residues on protein-RNA binding interfaces using an ensemble of conceptually different machine learning classifiers. Residue interaction network features and new solvent exposure characteristics are combined together and selected for classification with the Boruta algorithm. In particular, two new reference datasets (benchmark and independent) have been generated containing 107 hot spots from 47 known protein-RNA complex structures. In 10-fold cross-validation on the training dataset, PrabHot achieves promising performances with an AUC score of 0.86 and a sensitivity of 0.78, which are significantly better than that of the pioneer RNA-binding hot spot prediction method HotSPRing. We also demonstrate the capability of our proposed method on the independent test dataset and gain a competitive advantage as a result. The PrabHot webserver is freely available at http://denglab.org/PrabHot/. leideng@csu.edu.cn. Supplementary data are available at Bioinformatics online.

  3. Structural variation and inhibitor binding in polypeptide deformylase from four different bacterial species.

    Science.gov (United States)

    Smith, Kathrine J; Petit, Chantal M; Aubart, Kelly; Smyth, Martin; McManus, Edward; Jones, Jo; Fosberry, Andrew; Lewis, Ceri; Lonetto, Michael; Christensen, Siegfried B

    2003-02-01

    Polypeptide deformylase (PDF) catalyzes the deformylation of polypeptide chains in bacteria. It is essential for bacterial cell viability and is a potential antibacterial drug target. Here, we report the crystal structures of polypeptide deformylase from four different species of bacteria: Streptococcus pneumoniae, Staphylococcus aureus, Haemophilus influenzae, and Escherichia coli. Comparison of these four structures reveals significant overall differences between the two Gram-negative species (E. coli and H. influenzae) and the two Gram-positive species (S. pneumoniae and S. aureus). Despite these differences and low overall sequence identity, the S1' pocket of PDF is well conserved among the four enzymes studied. We also describe the binding of nonpeptidic inhibitor molecules SB-485345, SB-543668, and SB-505684 to both S. pneumoniae and E. coli PDF. Comparison of these structures shows similar binding interactions with both Gram-negative and Gram-positive species. Understanding the similarities and subtle differences in active site structure between species will help to design broad-spectrum polypeptide deformylase inhibitor molecules.

  4. Structures of minute virus of mice replication initiator protein N-terminal domain: Insights into DNA nicking and origin binding

    International Nuclear Information System (INIS)

    Tewary, Sunil K.; Liang, Lingfei; Lin, Zihan; Lynn, Annie; Cotmore, Susan F.; Tattersall, Peter; Zhao, Haiyan; Tang, Liang

    2015-01-01

    Members of the Parvoviridae family all encode a non-structural protein 1 (NS1) that directs replication of single-stranded viral DNA, packages viral DNA into capsid, and serves as a potent transcriptional activator. Here we report the X-ray structure of the minute virus of mice (MVM) NS1 N-terminal domain at 1.45 Å resolution, showing that sites for dsDNA binding, ssDNA binding and cleavage, nuclear localization, and other functions are integrated on a canonical fold of the histidine-hydrophobic-histidine superfamily of nucleases, including elements specific for this Protoparvovirus but distinct from its Bocaparvovirus or Dependoparvovirus orthologs. High resolution structural analysis reveals a nickase active site with an architecture that allows highly versatile metal ligand binding. The structures support a unified mechanism of replication origin recognition for homotelomeric and heterotelomeric parvoviruses, mediated by a basic-residue-rich hairpin and an adjacent helix in the initiator proteins and by tandem tetranucleotide motifs in the replication origins. - Highlights: • The structure of a parvovirus replication initiator protein has been determined; • The structure sheds light on mechanisms of ssDNA binding and cleavage; • The nickase active site is preconfigured for versatile metal ligand binding; • The binding site for the double-stranded replication origin DNA is identified; • A single domain integrates multiple functions in virus replication

  5. Structures of minute virus of mice replication initiator protein N-terminal domain: Insights into DNA nicking and origin binding

    Energy Technology Data Exchange (ETDEWEB)

    Tewary, Sunil K.; Liang, Lingfei; Lin, Zihan; Lynn, Annie [Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045 (United States); Cotmore, Susan F. [Departments of Laboratory Medicine, Yale University Medical School, New Haven, CT 06510 (United States); Tattersall, Peter [Departments of Laboratory Medicine, Yale University Medical School, New Haven, CT 06510 (United States); Departments of Genetics, Yale University Medical School, New Haven, CT 06510 (United States); Zhao, Haiyan, E-mail: zhaohy@ku.edu [Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045 (United States); Tang, Liang, E-mail: tangl@ku.edu [Department of Molecular Biosciences, University of Kansas, Lawrence, KS 66045 (United States)

    2015-02-15

    Members of the Parvoviridae family all encode a non-structural protein 1 (NS1) that directs replication of single-stranded viral DNA, packages viral DNA into capsid, and serves as a potent transcriptional activator. Here we report the X-ray structure of the minute virus of mice (MVM) NS1 N-terminal domain at 1.45 Å resolution, showing that sites for dsDNA binding, ssDNA binding and cleavage, nuclear localization, and other functions are integrated on a canonical fold of the histidine-hydrophobic-histidine superfamily of nucleases, including elements specific for this Protoparvovirus but distinct from its Bocaparvovirus or Dependoparvovirus orthologs. High resolution structural analysis reveals a nickase active site with an architecture that allows highly versatile metal ligand binding. The structures support a unified mechanism of replication origin recognition for homotelomeric and heterotelomeric parvoviruses, mediated by a basic-residue-rich hairpin and an adjacent helix in the initiator proteins and by tandem tetranucleotide motifs in the replication origins. - Highlights: • The structure of a parvovirus replication initiator protein has been determined; • The structure sheds light on mechanisms of ssDNA binding and cleavage; • The nickase active site is preconfigured for versatile metal ligand binding; • The binding site for the double-stranded replication origin DNA is identified; • A single domain integrates multiple functions in virus replication.

  6. Methods and systems for identifying ligand-protein binding sites

    KAUST Repository

    Gao, Xin

    2016-05-06

    The invention provides a novel integrated structure and system-based approach for drug target prediction that enables the large-scale discovery of new targets for existing drugs Novel computer-readable storage media and computer systems are also provided. Methods and systems of the invention use novel sequence order-independent structure alignment, hierarchical clustering, and probabilistic sequence similarity techniques to construct a probabilistic pocket ensemble (PPE) that captures even promiscuous structural features of different binding sites for a drug on known targets. The drug\\'s PPE is combined with an approximation of the drug delivery profile to facilitate large-scale prediction of novel drug- protein interactions with several applications to biological research and drug development.

  7. A tandem regression-outlier analysis of a ligand cellular system for key structural modifications around ligand binding.

    Science.gov (United States)

    Lin, Ying-Ting

    2013-04-30

    A tandem technique of hard equipment is often used for the chemical analysis of a single cell to first isolate and then detect the wanted identities. The first part is the separation of wanted chemicals from the bulk of a cell; the second part is the actual detection of the important identities. To identify the key structural modifications around ligand binding, the present study aims to develop a counterpart of tandem technique for cheminformatics. A statistical regression and its outliers act as a computational technique for separation. A PPARγ (peroxisome proliferator-activated receptor gamma) agonist cellular system was subjected to such an investigation. Results show that this tandem regression-outlier analysis, or the prioritization of the context equations tagged with features of the outliers, is an effective regression technique of cheminformatics to detect key structural modifications, as well as their tendency of impact to ligand binding. The key structural modifications around ligand binding are effectively extracted or characterized out of cellular reactions. This is because molecular binding is the paramount factor in such ligand cellular system and key structural modifications around ligand binding are expected to create outliers. Therefore, such outliers can be captured by this tandem regression-outlier analysis.

  8. Structural characterisation of Tpx from Yersinia pseudotuberculosis reveals insights into the binding of salicylidene acylhydrazide compounds.

    Directory of Open Access Journals (Sweden)

    Mads Gabrielsen

    Full Text Available Thiol peroxidase, Tpx, has been shown to be a target protein of the salicylidene acylhydrazide class of antivirulence compounds. In this study we present the crystal structures of Tpx from Y. pseudotuberculosis (ypTpx in the oxidised and reduced states, together with the structure of the C61S mutant. The structures solved are consistent with previously solved atypical 2-Cys thiol peroxidases, including that for "forced" reduced states using the C61S mutant. In addition, by investigating the solution structure of ypTpx using small angle X-ray scattering (SAXS, we have confirmed that reduced state ypTpx in solution is a homodimer. The solution structure also reveals flexibility around the dimer interface. Notably, the conformational changes observed between the redox states at the catalytic triad and at the dimer interface have implications for substrate and inhibitor binding. The structural data were used to model the binding of two salicylidene acylhydrazide compounds to the oxidised structure of ypTpx. Overall, the study provides insights into the binding of the salicylidene acylhydrazides to ypTpx, aiding our long-term strategy to understand the mode of action of this class of compounds.

  9. How to deal with multiple binding poses in alchemical relative protein-ligand binding free energy calculations.

    Science.gov (United States)

    Kaus, Joseph W; Harder, Edward; Lin, Teng; Abel, Robert; McCammon, J Andrew; Wang, Lingle

    2015-06-09

    Recent advances in improved force fields and sampling methods have made it possible for the accurate calculation of protein–ligand binding free energies. Alchemical free energy perturbation (FEP) using an explicit solvent model is one of the most rigorous methods to calculate relative binding free energies. However, for cases where there are high energy barriers separating the relevant conformations that are important for ligand binding, the calculated free energy may depend on the initial conformation used in the simulation due to the lack of complete sampling of all the important regions in phase space. This is particularly true for ligands with multiple possible binding modes separated by high energy barriers, making it difficult to sample all relevant binding modes even with modern enhanced sampling methods. In this paper, we apply a previously developed method that provides a corrected binding free energy for ligands with multiple binding modes by combining the free energy results from multiple alchemical FEP calculations starting from all enumerated poses, and the results are compared with Glide docking and MM-GBSA calculations. From these calculations, the dominant ligand binding mode can also be predicted. We apply this method to a series of ligands that bind to c-Jun N-terminal kinase-1 (JNK1) and obtain improved free energy results. The dominant ligand binding modes predicted by this method agree with the available crystallography, while both Glide docking and MM-GBSA calculations incorrectly predict the binding modes for some ligands. The method also helps separate the force field error from the ligand sampling error, such that deviations in the predicted binding free energy from the experimental values likely indicate possible inaccuracies in the force field. An error in the force field for a subset of the ligands studied was identified using this method, and improved free energy results were obtained by correcting the partial charges assigned to the

  10. How To Deal with Multiple Binding Poses in Alchemical Relative Protein–Ligand Binding Free Energy Calculations

    Science.gov (United States)

    2016-01-01

    Recent advances in improved force fields and sampling methods have made it possible for the accurate calculation of protein–ligand binding free energies. Alchemical free energy perturbation (FEP) using an explicit solvent model is one of the most rigorous methods to calculate relative binding free energies. However, for cases where there are high energy barriers separating the relevant conformations that are important for ligand binding, the calculated free energy may depend on the initial conformation used in the simulation due to the lack of complete sampling of all the important regions in phase space. This is particularly true for ligands with multiple possible binding modes separated by high energy barriers, making it difficult to sample all relevant binding modes even with modern enhanced sampling methods. In this paper, we apply a previously developed method that provides a corrected binding free energy for ligands with multiple binding modes by combining the free energy results from multiple alchemical FEP calculations starting from all enumerated poses, and the results are compared with Glide docking and MM-GBSA calculations. From these calculations, the dominant ligand binding mode can also be predicted. We apply this method to a series of ligands that bind to c-Jun N-terminal kinase-1 (JNK1) and obtain improved free energy results. The dominant ligand binding modes predicted by this method agree with the available crystallography, while both Glide docking and MM-GBSA calculations incorrectly predict the binding modes for some ligands. The method also helps separate the force field error from the ligand sampling error, such that deviations in the predicted binding free energy from the experimental values likely indicate possible inaccuracies in the force field. An error in the force field for a subset of the ligands studied was identified using this method, and improved free energy results were obtained by correcting the partial charges assigned to the

  11. 8-anilino-1-naphthaline sulfonate binds at the hemoglobin allosteric regulatory sites: inhibitory analyses

    International Nuclear Information System (INIS)

    Bokut', S.B.; Parul', D.A.; Yachnik, N.N.; Milyutin, A.A.

    2001-01-01

    The present study focused on the localization at least one of the ANS binding sites in the major form of human hemoglobin HbA. High-resolution docking predict ANS binding to the hemoglobin central cavity. Steady-state fluorescence titration data obtained in the absence/presence of natural effector inositol hexaphosphate (IHP) allowed to conclude that IHP competitively inhibited ANS binding to HbA. Thus, we must conclude that one of the ANS binding sites is central cavity, which makes it possible to monitor changes at this region upon ligation/deligation, effector binding and changes in hemoglobin structure

  12. Crystal structure of the anti-(carcinoembryonic antigen) single-chain Fv antibody MFE-23 and a model for antigen binding based on intermolecular contacts.

    Science.gov (United States)

    Boehm, M K; Corper, A L; Wan, T; Sohi, M K; Sutton, B J; Thornton, J D; Keep, P A; Chester, K A; Begent, R H; Perkins, S J

    2000-03-01

    MFE-23 is the first single-chain Fv antibody molecule to be used in patients and is used to target colorectal cancer through its high affinity for carcinoembryonic antigen (CEA), a cell-surface member of the immunoglobulin superfamily. MFE-23 contains an N-terminal variable heavy-chain domain joined by a (Gly(4)Ser)(3) linker to a variable light-chain (V(L)) domain (kappa chain) with an 11-residue C-terminal Myc-tag. Its crystal structure was determined at 2.4 A resolution by molecular replacement with an R(cryst) of 19.0%. Five of the six antigen-binding loops, L1, L2, L3, H1 and H2, conformed to known canonical structures. The sixth loop, H3, displayed a unique structure, with a beta-hairpin loop and a bifurcated apex characterized by a buried Thr residue. In the crystal lattice, two MFE-23 molecules were associated back-to-back in a manner not seen before. The antigen-binding site displayed a large acidic region located mainly within the H2 loop and a large hydrophobic region within the H3 loop. Even though this structure is unliganded within the crystal, there is an unusually large region of contact between the H1, H2 and H3 loops and the beta-sheet of the V(L) domain of an adjacent molecule (strands DEBA) as a result of intermolecular packing. These interactions exhibited remarkably high surface and electrostatic complementarity. Of seven MFE-23 residues predicted to make contact with antigen, five participated in these lattice contacts, and this model for antigen binding is consistent with previously reported site-specific mutagenesis of MFE-23 and its effect on CEA binding.

  13. Communication between the Zinc and Nickel Sites in Dimeric HypA: Metal Recognition and pH Sensing

    Energy Technology Data Exchange (ETDEWEB)

    Herbst, R.; Perovic, I; Martin-Diaconescu, V; O’Brien, K; Chivers, P; Sondej Pochapsky, S; Pochapsky, T; Maroney, M

    2010-01-01

    Helicobacter pylori, a pathogen that colonizes the human stomach, requires the nickel-containing metalloenzymes urease and NiFe-hydrogenase to survive this low pH environment. The maturation of both enzymes depends on the metallochaperone, HypA. HypA contains two metal sites, an intrinsic zinc site and a low-affinity nickel binding site. X-ray absorption spectroscopy (XAS) shows that the structure of the intrinsic zinc site of HypA is dynamic and able to sense both nickel loading and pH changes. At pH 6.3, an internal pH that occurs during acid shock, the zinc site undergoes unprecedented ligand substitutions to convert from a Zn(Cys){sub 4} site to a Zn(His){sub 2}(Cys){sub 2} site. NMR spectroscopy shows that binding of Ni(II) to HypA results in paramagnetic broadening of resonances near the N-terminus. NOEs between the {beta}-CH{sub 2} protons of Zn cysteinyl ligands are consistent with a strand-swapped HypA dimer. Addition of nickel causes resonances from the zinc binding motif and other regions to double, indicating more than one conformation can exist in solution. Although the structure of the high-spin, 5-6 coordinate Ni(II) site is relatively unaffected by pH, the nickel binding stoichiometry is decreased from one per monomer to one per dimer at pH = 6.3. Mutation of any cysteine residue in the zinc binding motif results in a zinc site structure similar to that found for holo-WT-HypA at low pH and is unperturbed by the addition of nickel. Mutation of the histidines that flank the CXXC motifs results in a zinc site structure that is similar to holo-WT-HypA at neutral pH (Zn(Cys){sub 4}) and is no longer responsive to nickel binding or pH changes. Using an in vitro urease activity assay, it is shown that the recombinant protein is sufficient for recovery of urease activity in cell lysate from a HypA deletion mutant, and that mutations in the zinc-binding motif result in a decrease in recovered urease activity. The results are interpreted in terms of a model

  14. Communication between the Zinc and Nickel Sites in Dimeric HypA: Metal Recognition and pH Sensing

    International Nuclear Information System (INIS)

    Herbst, R.; Perovic, I.; Martin-Diaconescu, V.; O'Brien, K.; Chivers, P.; Sondej Pochapsky, S.; Pochapsky, T.; Maroney, M.

    2010-01-01

    Helicobacter pylori, a pathogen that colonizes the human stomach, requires the nickel-containing metalloenzymes urease and NiFe-hydrogenase to survive this low pH environment. The maturation of both enzymes depends on the metallochaperone, HypA. HypA contains two metal sites, an intrinsic zinc site and a low-affinity nickel binding site. X-ray absorption spectroscopy (XAS) shows that the structure of the intrinsic zinc site of HypA is dynamic and able to sense both nickel loading and pH changes. At pH 6.3, an internal pH that occurs during acid shock, the zinc site undergoes unprecedented ligand substitutions to convert from a Zn(Cys) 4 site to a Zn(His) 2 (Cys) 2 site. NMR spectroscopy shows that binding of Ni(II) to HypA results in paramagnetic broadening of resonances near the N-terminus. NOEs between the β-CH 2 protons of Zn cysteinyl ligands are consistent with a strand-swapped HypA dimer. Addition of nickel causes resonances from the zinc binding motif and other regions to double, indicating more than one conformation can exist in solution. Although the structure of the high-spin, 5-6 coordinate Ni(II) site is relatively unaffected by pH, the nickel binding stoichiometry is decreased from one per monomer to one per dimer at pH = 6.3. Mutation of any cysteine residue in the zinc binding motif results in a zinc site structure similar to that found for holo-WT-HypA at low pH and is unperturbed by the addition of nickel. Mutation of the histidines that flank the CXXC motifs results in a zinc site structure that is similar to holo-WT-HypA at neutral pH (Zn(Cys) 4 ) and is no longer responsive to nickel binding or pH changes. Using an in vitro urease activity assay, it is shown that the recombinant protein is sufficient for recovery of urease activity in cell lysate from a HypA deletion mutant, and that mutations in the zinc-binding motif result in a decrease in recovered urease activity. The results are interpreted in terms of a model wherein HypA controls the

  15. Tight binding electronic band structure calculation of achiral boron nitride single wall nanotubes

    International Nuclear Information System (INIS)

    Saxena, Prapti; Sanyal, Sankar P

    2006-01-01

    In this paper we report the Tight-Binding method, for the electronic structure calculations of achiral single wall Boron Nitride nanotubes. We have used the contribution of π electron only to define the electronic band structure for the solid. The Zone-folding method is used for the Brillouin Zone definition. Calculation of tight binding model parameters is done by fitting them to available experimental results of two-dimensional hexagonal monolayers of Boron Nitride. It has been found that all the boron nitride nanotubes (both zigzag and armchair) are constant gap semiconductors with a band gap of 5.27eV. All zigzag BNNTs are found to be direct gap semiconductors while all armchair nanotubes are indirect gap semiconductors. (author)

  16. Structural Basis for Nucleotide Binding and Reaction Catalysis in Mevalonate Diphosphate Decarboxylase

    Energy Technology Data Exchange (ETDEWEB)

    Barta, Michael L.; McWhorter, William J.; Miziorko, Henry M.; Geisbrecht, Brian V. (UMKC)

    2012-09-17

    Mevalonate diphosphate decarboxylase (MDD) catalyzes the final step of the mevalonate pathway, the Mg{sup 2+}-ATP dependent decarboxylation of mevalonate 5-diphosphate (MVAPP), producing isopentenyl diphosphate (IPP). Synthesis of IPP, an isoprenoid precursor molecule that is a critical intermediate in peptidoglycan and polyisoprenoid biosynthesis, is essential in Gram-positive bacteria (e.g., Staphylococcus, Streptococcus, and Enterococcus spp.), and thus the enzymes of the mevalonate pathway are ideal antimicrobial targets. MDD belongs to the GHMP superfamily of metabolite kinases that have been extensively studied for the past 50 years, yet the crystallization of GHMP kinase ternary complexes has proven to be difficult. To further our understanding of the catalytic mechanism of GHMP kinases with the purpose of developing broad spectrum antimicrobial agents that target the substrate and nucleotide binding sites, we report the crystal structures of wild-type and mutant (S192A and D283A) ternary complexes of Staphylococcus epidermidis MDD. Comparison of apo, MVAPP-bound, and ternary complex wild-type MDD provides structural information about the mode of substrate binding and the catalytic mechanism. Structural characterization of ternary complexes of catalytically deficient MDD S192A and D283A (k{sub cat} decreased 10{sup 3}- and 10{sup 5}-fold, respectively) provides insight into MDD function. The carboxylate side chain of invariant Asp{sup 283} functions as a catalytic base and is essential for the proper orientation of the MVAPP C3-hydroxyl group within the active site funnel. Several MDD amino acids within the conserved phosphate binding loop ('P-loop') provide key interactions, stabilizing the nucleotide triphosphoryl moiety. The crystal structures presented here provide a useful foundation for structure-based drug design.

  17. A strategy for interaction site prediction between phospho-binding modules and their partners identified from proteomic data.

    Science.gov (United States)

    Aucher, Willy; Becker, Emmanuelle; Ma, Emilie; Miron, Simona; Martel, Arnaud; Ochsenbein, Françoise; Marsolier-Kergoat, Marie-Claude; Guerois, Raphaël

    2010-12-01

    Small and large scale proteomic technologies are providing a wealth of potential interactions between proteins bearing phospho-recognition modules and their substrates. Resulting interaction maps reveal such a dense network of interactions that the functional dissection and understanding of these networks often require to break specific interactions while keeping the rest intact. Here, we developed a computational strategy, called STRIP, to predict the precise interaction site involved in an interaction with a phospho-recognition module. The method was validated by a two-hybrid screen carried out using the ForkHead Associated (FHA)1 domain of Rad53, a key protein of Saccharomyces cerevisiae DNA checkpoint, as a bait. In this screen we detected 11 partners, including Cdc7 and Cdc45, essential components of the DNA replication machinery. FHA domains are phospho-threonine binding modules and the threonines involved in both interactions could be predicted using the STRIP strategy. The threonines T484 and T189 in Cdc7 and Cdc45, respectively, were mutated and loss of binding could be monitored experimentally with the full-length proteins. The method was further tested for the analysis of 63 known Rad53 binding partners and provided several key insights regarding the threonines likely involved in these interactions. The STRIP method relies on a combination of conservation, phosphorylation likelihood, and binding specificity criteria and can be accessed via a web interface at http://biodev.extra.cea.fr/strip/.

  18. A Strategy for Interaction Site Prediction between Phospho-binding Modules and their Partners Identified from Proteomic Data*

    Science.gov (United States)

    Aucher, Willy; Becker, Emmanuelle; Ma, Emilie; Miron, Simona; Martel, Arnaud; Ochsenbein, Françoise; Marsolier-Kergoat, Marie-Claude; Guerois, Raphaël

    2010-01-01

    Small and large scale proteomic technologies are providing a wealth of potential interactions between proteins bearing phospho-recognition modules and their substrates. Resulting interaction maps reveal such a dense network of interactions that the functional dissection and understanding of these networks often require to break specific interactions while keeping the rest intact. Here, we developed a computational strategy, called STRIP, to predict the precise interaction site involved in an interaction with a phospho-recognition module. The method was validated by a two-hybrid screen carried out using the ForkHead Associated (FHA)1 domain of Rad53, a key protein of Saccharomyces cerevisiae DNA checkpoint, as a bait. In this screen we detected 11 partners, including Cdc7 and Cdc45, essential components of the DNA replication machinery. FHA domains are phospho-threonine binding modules and the threonines involved in both interactions could be predicted using the STRIP strategy. The threonines T484 and T189 in Cdc7 and Cdc45, respectively, were mutated and loss of binding could be monitored experimentally with the full-length proteins. The method was further tested for the analysis of 63 known Rad53 binding partners and provided several key insights regarding the threonines likely involved in these interactions. The STRIP method relies on a combination of conservation, phosphorylation likelihood, and binding specificity criteria and can be accessed via a web interface at http://biodev.extra.cea.fr/strip/. PMID:20733106

  19. Crystal structure of equine serum albumin in complex with cetirizine reveals a novel drug binding site.

    Science.gov (United States)

    Handing, Katarzyna B; Shabalin, Ivan G; Szlachta, Karol; Majorek, Karolina A; Minor, Wladek

    2016-03-01

    Serum albumin (SA) is the main transporter of drugs in mammalian blood plasma. Here, we report the first crystal structure of equine serum albumin (ESA) in complex with antihistamine drug cetirizine at a resolution of 2.1Å. Cetirizine is bound in two sites--a novel drug binding site (CBS1) and the fatty acid binding site 6 (CBS2). Both sites differ from those that have been proposed in multiple reports based on equilibrium dialysis and fluorescence studies for mammalian albumins as cetirizine binding sites. We show that the residues forming the binding pockets in ESA are highly conserved in human serum albumin (HSA), and suggest that binding of cetirizine to HSA will be similar. In support of that hypothesis, we show that the dissociation constants for cetirizine binding to CBS2 in ESA and HSA are identical using tryptophan fluorescence quenching. Presence of lysine and arginine residues that have been previously reported to undergo nonenzymatic glycosylation in CBS1 and CBS2 suggests that cetirizine transport in patients with diabetes could be altered. A review of all available SA structures from the PDB shows that in addition to the novel drug binding site we present here (CBS1), there are two pockets on SA capable of binding drugs that do not overlap with fatty acid binding sites and have not been discussed in published reviews. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. NetMHCpan-4.0: Improved Peptide-MHC Class I Interaction Predictions Integrating Eluted Ligand and Peptide Binding Affinity Data

    DEFF Research Database (Denmark)

    Jurtz, Vanessa Isabell; Paul, Sinu; Andreatta, Massimo

    2017-01-01

    by mass spectrometry have been reported containing information about peptide-processing steps in the presentation pathway and the length distribution of naturally presented peptides. In this article, we present NetMHCpan-4.0, a method trained on binding affinity and eluted ligand data leveraging......Cytotoxic T cells are of central importance in the immune system's response to disease. They recognize defective cells by binding to peptides presented on the cell surface by MHC class I molecules. Peptide binding to MHC molecules is the single most selective step in the Ag-presentation pathway....... Therefore, in the quest for T cell epitopes, the prediction of peptide binding to MHC molecules has attracted widespread attention. In the past, predictors of peptide-MHC interactions have primarily been trained on binding affinity data. Recently, an increasing number of MHC-presented peptides identified...

  1. Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology.

    Science.gov (United States)

    Bakhtiarizadeh, Mohammad Reza; Moradi-Shahrbabak, Mohammad; Ebrahimi, Mansour; Ebrahimie, Esmaeil

    2014-09-07

    Due to the central roles of lipid binding proteins (LBPs) in many biological processes, sequence based identification of LBPs is of great interest. The major challenge is that LBPs are diverse in sequence, structure, and function which results in low accuracy of sequence homology based methods. Therefore, there is a need for developing alternative functional prediction methods irrespective of sequence similarity. To identify LBPs from non-LBPs, the performances of support vector machine (SVM) and neural network were compared in this study. Comprehensive protein features and various techniques were employed to create datasets. Five-fold cross-validation (CV) and independent evaluation (IE) tests were used to assess the validity of the two methods. The results indicated that SVM outperforms neural network. SVM achieved 89.28% (CV) and 89.55% (IE) overall accuracy in identification of LBPs from non-LBPs and 92.06% (CV) and 92.90% (IE) (in average) for classification of different LBPs classes. Increasing the number and the range of extracted protein features as well as optimization of the SVM parameters significantly increased the efficiency of LBPs class prediction in comparison to the only previous report in this field. Altogether, the results showed that the SVM algorithm can be run on broad, computationally calculated protein features and offers a promising tool in detection of LBPs classes. The proposed approach has the potential to integrate and improve the common sequence alignment based methods. Copyright © 2014 Elsevier Ltd. All rights reserved.

  2. A computational tool to predict the evolutionarily conserved protein-protein interaction hot-spot residues from the structure of the unbound protein.

    Science.gov (United States)

    Agrawal, Neeraj J; Helk, Bernhard; Trout, Bernhardt L

    2014-01-21

    Identifying hot-spot residues - residues that are critical to protein-protein binding - can help to elucidate a protein's function and assist in designing therapeutic molecules to target those residues. We present a novel computational tool, termed spatial-interaction-map (SIM), to predict the hot-spot residues of an evolutionarily conserved protein-protein interaction from the structure of an unbound protein alone. SIM can predict the protein hot-spot residues with an accuracy of 36-57%. Thus, the SIM tool can be used to predict the yet unknown hot-spot residues for many proteins for which the structure of the protein-protein complexes are not available, thereby providing a clue to their functions and an opportunity to design therapeutic molecules to target these proteins. Copyright © 2013 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  3. Impact of low-frequency hotspot mutation R282Q on the structure of p53 DNA-binding domain as revealed by crystallography at 1.54 Å resolution

    Energy Technology Data Exchange (ETDEWEB)

    Tu, Chao [Macromolecular Crystallography Laboratory, National Cancer Institute, Frederick, MD 21702 (United States); Tan, Yu-Hong [Department of Molecular Biology and Biochemistry, University of California at Irvine, Irvine, CA 92697 (United States); Shaw, Gary [Macromolecular Crystallography Laboratory, National Cancer Institute, Frederick, MD 21702 (United States); Zhou, Zheng; Bai, Yawen [Laboratory of Biochemistry and Molecular Biology, National Cancer Institute, Bethesda, MD 20892 (United States); Luo, Ray [Department of Molecular Biology and Biochemistry, University of California at Irvine, Irvine, CA 92697 (United States); Ji, Xinhua, E-mail: jix@ncifcrf.gov [Macromolecular Crystallography Laboratory, National Cancer Institute, Frederick, MD 21702 (United States)

    2008-05-01

    The impact of hotspot mutation R282Q on the structure of human p53 DNA-binding domain has been characterized by X-ray crystallography and molecular-dynamics simulations. Tumor suppressor p53 is a sequence-specific DNA-binding protein and its central DNA-binding domain (DBD) harbors six hotspots (Arg175, Gly245, Arg248, Arg249, Arg273 and Arg282) for human cancers. Here, the crystal structure of a low-frequency hotspot mutant, p53DBD(R282Q), is reported at 1.54 Å resolution together with the results of molecular-dynamics simulations on the basis of the structure. In addition to eliminating a salt bridge, the R282Q mutation has a significant impact on the properties of two DNA-binding loops (L1 and L3). The L1 loop is flexible in the wild type, but it is not flexible in the mutant. The L3 loop of the wild type is not flexible, whereas it assumes two conformations in the mutant. Molecular-dynamics simulations indicated that both conformations of the L3 loop are accessible under biological conditions. It is predicted that the elimination of the salt bridge and the inversion of the flexibility of L1 and L3 are directly or indirectly responsible for deactivating the tumor suppressor p53.

  4. Structure and mechanism of calmodulin binding to a signaling sphingolipid reveal new aspects of lipid-protein interactions

    Science.gov (United States)

    Kovacs, Erika; Harmat, Veronika; Tóth, Judit; Vértessy, Beáta G.; Módos, Károly; Kardos, József; Liliom, Károly

    2010-01-01

    Lipid-protein interactions are rarely characterized at a structural molecular level due to technical difficulties; however, the biological significance of understanding the mechanism of these interactions is outstanding. In this report, we provide mechanistic insight into the inhibitory complex formation of the lipid mediator sphingosylphosphorylcholine with calmodulin, the most central and ubiquitous regulator protein in calcium signaling. We applied crystallographic, thermodynamic, kinetic, and spectroscopic approaches using purified bovine calmodulin and bovine cerebral microsomal fraction to arrive at our conclusions. Here we present 1) a 1.6-Å resolution crystal structure of their complex, in which the sphingolipid occupies the conventional hydrophobic binding site on calmodulin; 2) a peculiar stoichiometry-dependent binding process: at low or high protein-to-lipid ratio calmodulin binds lipid micelles or a few lipid molecules in a compact globular conformation, respectively, and 3) evidence that the sphingolipid displaces calmodulin from its targets on cerebral microsomes. We have ascertained the specificity of the interaction using structurally related lipids as controls. Our observations reveal the structural basis of selective calmodulin inhibition by the sphingolipid. On the basis of the crystallographic and biophysical characterization of the calmodulin–sphingosylphosphorylcholine interaction, we propose a novel lipid-protein binding model, which might be applicable to other interactions as well.—Kovacs, E., Harmat, V., Tóth, J., Vértessy, B. G., Módos, K., Kardos, J., Liliom, K. Structure and mechanism of calmodulin binding to a signaling sphingolipid reveal new aspects of lipid-protein interactions. PMID:20522785

  5. Structural basis for high substrate-binding affinity and enantioselectivity of 3-quinuclidinone reductase AtQR

    International Nuclear Information System (INIS)

    Hou, Feng; Miyakawa, Takuya; Kataoka, Michihiko; Takeshita, Daijiro; Kumashiro, Shoko; Uzura, Atsuko; Urano, Nobuyuki; Nagata, Koji; Shimizu, Sakayu; Tanokura, Masaru

    2014-01-01

    Highlights: • Crystal structure of AtQR has been determined at 1.72 Å. • NADH binding induces the formation of substrate binding site. • AtQR possesses a conserved hydrophobic wall for stereospecific binding of substrate. • Additional Glu197 residue is critical to the high binding affinity. - Abstract: (R)-3-Quinuclidinol, a useful compound for the synthesis of various pharmaceuticals, can be enantioselectively produced from 3-quinuclidinone by 3-quinuclidinone reductase. Recently, a novel NADH-dependent 3-quinuclidionone reductase (AtQR) was isolated from Agrobacterium tumefaciens, and showed much higher substrate-binding affinity (>100 fold) than the reported 3-quinuclidionone reductase (RrQR) from Rhodotorula rubra. Here, we report the crystal structure of AtQR at 1.72 Å. Three NADH-bound protomers and one NADH-free protomer form a tetrameric structure in an asymmetric unit of crystals. NADH not only acts as a proton donor, but also contributes to the stability of the α7 helix. This helix is a unique and functionally significant part of AtQR and is related to form a deep catalytic cavity. AtQR has all three catalytic residues of the short-chain dehydrogenases/reductases family and the hydrophobic wall for the enantioselective reduction of 3-quinuclidinone as well as RrQR. An additional residue on the α7 helix, Glu197, exists near the active site of AtQR. This acidic residue is considered to form a direct interaction with the amine part of 3-quinuclidinone, which contributes to substrate orientation and enhancement of substrate-binding affinity. Mutational analyses also support that Glu197 is an indispensable residue for the activity

  6. Structural basis for high substrate-binding affinity and enantioselectivity of 3-quinuclidinone reductase AtQR

    Energy Technology Data Exchange (ETDEWEB)

    Hou, Feng; Miyakawa, Takuya [Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657 (Japan); Kataoka, Michihiko [Division of Applied Life Sciences, Graduate School of Life and Environmental Sciences, Osaka Prefecture University, 1-1 Gakuen-cho, Naka-ku, Sakai 559-8531 (Japan); Division of Applied Life Sciences, Graduate School of Agriculture, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-ku, Kyoto 606-8502 (Japan); Takeshita, Daijiro [Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657 (Japan); Kumashiro, Shoko [Division of Applied Life Sciences, Graduate School of Agriculture, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-ku, Kyoto 606-8502 (Japan); Uzura, Atsuko [Research and Development Center, Nagase and Co., Ltd., 2-2-3 Muratani, Nishi-ku, Kobe 651-2241 (Japan); Urano, Nobuyuki [Division of Applied Life Sciences, Graduate School of Life and Environmental Sciences, Osaka Prefecture University, 1-1 Gakuen-cho, Naka-ku, Sakai 559-8531 (Japan); Division of Applied Life Sciences, Graduate School of Agriculture, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-ku, Kyoto 606-8502 (Japan); Nagata, Koji [Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657 (Japan); Shimizu, Sakayu [Division of Applied Life Sciences, Graduate School of Agriculture, Kyoto University, Kitashirakawa-Oiwakecho, Sakyo-ku, Kyoto 606-8502 (Japan); Faculty of Bioenvironmental Science, Kyoto Gakuen University, Sogabe-cho, Kameoka 621-8555 (Japan); Tanokura, Masaru, E-mail: amtanok@mail.ecc.u-tokyo.ac.jp [Department of Applied Biological Chemistry, Graduate School of Agricultural and Life Sciences, The University of Tokyo, 1-1-1 Yayoi, Bunkyo-ku, Tokyo 113-8657 (Japan)

    2014-04-18

    Highlights: • Crystal structure of AtQR has been determined at 1.72 Å. • NADH binding induces the formation of substrate binding site. • AtQR possesses a conserved hydrophobic wall for stereospecific binding of substrate. • Additional Glu197 residue is critical to the high binding affinity. - Abstract: (R)-3-Quinuclidinol, a useful compound for the synthesis of various pharmaceuticals, can be enantioselectively produced from 3-quinuclidinone by 3-quinuclidinone reductase. Recently, a novel NADH-dependent 3-quinuclidionone reductase (AtQR) was isolated from Agrobacterium tumefaciens, and showed much higher substrate-binding affinity (>100 fold) than the reported 3-quinuclidionone reductase (RrQR) from Rhodotorula rubra. Here, we report the crystal structure of AtQR at 1.72 Å. Three NADH-bound protomers and one NADH-free protomer form a tetrameric structure in an asymmetric unit of crystals. NADH not only acts as a proton donor, but also contributes to the stability of the α7 helix. This helix is a unique and functionally significant part of AtQR and is related to form a deep catalytic cavity. AtQR has all three catalytic residues of the short-chain dehydrogenases/reductases family and the hydrophobic wall for the enantioselective reduction of 3-quinuclidinone as well as RrQR. An additional residue on the α7 helix, Glu197, exists near the active site of AtQR. This acidic residue is considered to form a direct interaction with the amine part of 3-quinuclidinone, which contributes to substrate orientation and enhancement of substrate-binding affinity. Mutational analyses also support that Glu197 is an indispensable residue for the activity.

  7. Igs Expressed by Chronic Lymphocytic Leukemia B Cells Show Limited Binding-Site Structure Variability

    KAUST Repository

    Marcatili, P.

    2013-05-01

    Ag selection has been suggested to play a role in chronic lymphocytic leukemia (CLL) pathogenesis, but no large-scale analysis has been performed so far on the structure of the Ag-binding sites (ABSs) of leukemic cell Igs. We sequenced both H and L chain V(D)J rearrangements from 366 CLL patients and modeled their three-dimensional structures. The resulting ABS structures were clustered into a small number of discrete sets, each containing ABSs with similar shapes and physicochemical properties. This structural classification correlates well with other known prognostic factors such as Ig mutation status and recurrent (stereotyped) receptors, but it shows a better prognostic value, at least in the case of one structural cluster for which clinical data were available. These findings suggest, for the first time, to our knowledge, on the basis of a structural analysis of the Ab-binding sites, that selection by a finite quota of antigenic structures operates on most CLL cases, whether mutated or unmutated. Copyright © 2013 by The American Association of Immunologists, Inc.

  8. Igs Expressed by Chronic Lymphocytic Leukemia B Cells Show Limited Binding-Site Structure Variability

    KAUST Repository

    Marcatili, P.; Ghiotto, F.; Tenca, C.; Chailyan, A.; Mazzarello, A. N.; Yan, X.-J.; Colombo, M.; Albesiano, E.; Bagnara, D.; Cutrona, G.; Morabito, F.; Bruno, S.; Ferrarini, M.; Chiorazzi, N.; Tramontano, A.; Fais, F.

    2013-01-01

    Ag selection has been suggested to play a role in chronic lymphocytic leukemia (CLL) pathogenesis, but no large-scale analysis has been performed so far on the structure of the Ag-binding sites (ABSs) of leukemic cell Igs. We sequenced both H and L chain V(D)J rearrangements from 366 CLL patients and modeled their three-dimensional structures. The resulting ABS structures were clustered into a small number of discrete sets, each containing ABSs with similar shapes and physicochemical properties. This structural classification correlates well with other known prognostic factors such as Ig mutation status and recurrent (stereotyped) receptors, but it shows a better prognostic value, at least in the case of one structural cluster for which clinical data were available. These findings suggest, for the first time, to our knowledge, on the basis of a structural analysis of the Ab-binding sites, that selection by a finite quota of antigenic structures operates on most CLL cases, whether mutated or unmutated. Copyright © 2013 by The American Association of Immunologists, Inc.

  9. A comprehensive comparison of comparative RNA structure prediction approaches

    DEFF Research Database (Denmark)

    Gardner, P. P.; Giegerich, R.

    2004-01-01

    -finding and multiple-sequence-alignment algorithms. Results Here we evaluate a number of RNA folding algorithms using reliable RNA data-sets and compare their relative performance. Conclusions We conclude that comparative data can enhance structure prediction but structure-prediction-algorithms vary widely in terms......Background An increasing number of researchers have released novel RNA structure analysis and prediction algorithms for comparative approaches to structure prediction. Yet, independent benchmarking of these algorithms is rarely performed as is now common practice for protein-folding, gene...

  10. A Physiologically Based Pharmacokinetic Model to Predict the Pharmacokinetics of Highly Protein-Bound Drugs and Impact of Errors in Plasma Protein Binding

    Science.gov (United States)

    Ye, Min; Nagar, Swati; Korzekwa, Ken

    2015-01-01

    Predicting the pharmacokinetics of highly protein-bound drugs is difficult. Also, since historical plasma protein binding data was often collected using unbuffered plasma, the resulting inaccurate binding data could contribute to incorrect predictions. This study uses a generic physiologically based pharmacokinetic (PBPK) model to predict human plasma concentration-time profiles for 22 highly protein-bound drugs. Tissue distribution was estimated from in vitro drug lipophilicity data, plasma protein binding, and blood: plasma ratio. Clearance was predicted with a well-stirred liver model. Underestimated hepatic clearance for acidic and neutral compounds was corrected by an empirical scaling factor. Predicted values (pharmacokinetic parameters, plasma concentration-time profile) were compared with observed data to evaluate model accuracy. Of the 22 drugs, less than a 2-fold error was obtained for terminal elimination half-life (t1/2, 100% of drugs), peak plasma concentration (Cmax, 100%), area under the plasma concentration-time curve (AUC0–t, 95.4%), clearance (CLh, 95.4%), mean retention time (MRT, 95.4%), and steady state volume (Vss, 90.9%). The impact of fup errors on CLh and Vss prediction was evaluated. Errors in fup resulted in proportional errors in clearance prediction for low-clearance compounds, and in Vss prediction for high-volume neutral drugs. For high-volume basic drugs, errors in fup did not propagate to errors in Vss prediction. This is due to the cancellation of errors in the calculations for tissue partitioning of basic drugs. Overall, plasma profiles were well simulated with the present PBPK model. PMID:26531057

  11. Structural Basis for Linezolid Binding Site Rearrangement in the Staphylococcus aureus Ribosome.

    Science.gov (United States)

    Belousoff, Matthew J; Eyal, Zohar; Radjainia, Mazdak; Ahmed, Tofayel; Bamert, Rebecca S; Matzov, Donna; Bashan, Anat; Zimmerman, Ella; Mishra, Satabdi; Cameron, David; Elmlund, Hans; Peleg, Anton Y; Bhushan, Shashi; Lithgow, Trevor; Yonath, Ada

    2017-05-09

    An unorthodox, surprising mechanism of resistance to the antibiotic linezolid was revealed by cryo-electron microscopy (cryo-EM) in the 70S ribosomes from a clinical isolate of Staphylococcus aureus This high-resolution structural information demonstrated that a single amino acid deletion in ribosomal protein uL3 confers linezolid resistance despite being located 24 Å away from the linezolid binding pocket in the peptidyl-transferase center. The mutation induces a cascade of allosteric structural rearrangements of the rRNA that ultimately results in the alteration of the antibiotic binding site. IMPORTANCE The growing burden on human health caused by various antibiotic resistance mutations now includes prevalent Staphylococcus aureus resistance to last-line antimicrobial drugs such as linezolid and daptomycin. Structure-informed drug modification represents a frontier with respect to designing advanced clinical therapies, but success in this strategy requires rapid, facile means to shed light on the structural basis for drug resistance (D. Brown, Nat Rev Drug Discov 14:821-832, 2015, https://doi.org/10.1038/nrd4675). Here, detailed structural information demonstrates that a common mechanism is at play in linezolid resistance and provides a step toward the redesign of oxazolidinone antibiotics, a strategy that could thwart known mechanisms of linezolid resistance. Copyright © 2017 Belousoff et al.

  12. Ceruloplasmin revisited: structural and functional roles of various metal cation-binding sites

    International Nuclear Information System (INIS)

    Bento, Isabel; Peixoto, Cristina; Zaitsev, Vjacheslav N.; Lindley, Peter F.

    2007-01-01

    The three-dimensional molecular structure of human serum ceruloplasmin has been reinvestigated using X-ray synchrotron data collected at 100 K from a crystal frozen to liquid-nitrogen temperature. The three-dimensional molecular structure of human serum ceruloplasmin has been reinvestigated using X-ray synchrotron data collected at 100 K from a crystal frozen to liquid-nitrogen temperature. The resulting model, with an increase in resolution from 3.1 to 2.8 Å, gives an overall improvement of the molecular structure, in particular the side chains. In addition, it enables the clear definition of previously unidentified Ca 2+ -binding and Na + -binding sites. The Ca 2+ cation is located in domain 1 in a configuration very similar to that found in the activated bovine factor Va. The Na + sites appear to play a structural role in providing rigidity to the three protuberances on the top surface of the molecule. These features probably help to steer substrates towards the mononuclear copper sites prior to their oxidation and to restrict the size of the approaching substrate. The trinuclear copper centre appears to differ from the room-temperature structure in that a dioxygen moiety is bound in a similar way to that found in the endospore coat protein CotA from Bacillus subtilis

  13. Loop-to-helix transition in the structure of multidrug regulator AcrR at the entrance of the drug-binding cavity.

    Science.gov (United States)

    Manjasetty, Babu A; Halavaty, Andrei S; Luan, Chi-Hao; Osipiuk, Jerzy; Mulligan, Rory; Kwon, Keehwan; Anderson, Wayne F; Joachimiak, Andrzej

    2016-04-01

    Multidrug transcription regulator AcrR from Salmonella enterica subsp. enterica serovar Typhimurium str. LT2 belongs to the tetracycline repressor family, one of the largest groups of bacterial transcription factors. The crystal structure of dimeric AcrR was determined and refined to 1.56Å resolution. The tertiary and quaternary structures of AcrR are similar to those of its homologs. The multidrug binding site was identified based on structural alignment with homologous proteins and has a di(hydroxyethyl)ether molecule bound. Residues from helices α4 and α7 shape the entry into this binding site. The structure of AcrR reveals that the extended helical conformation of helix α4 is stabilized by the hydrogen bond between Glu67 (helix α4) and Gln130 (helix α7). Based on the structural comparison with the closest homolog structure, the Escherichia coli AcrR, we propose that this hydrogen bond is responsible for control of the loop-to-helix transition within helix α4. This local conformational switch of helix α4 may be a key step in accessing the multidrug binding site and securing ligands at the binding site. Solution small-molecule binding studies suggest that AcrR binds ligands with their core chemical structure resembling the tetracyclic ring of cholesterol. Copyright © 2016. Published by Elsevier Inc.

  14. Tight-binding study of the structural and magnetic properties of vanadium clusters

    International Nuclear Information System (INIS)

    Zhao Jijun; Lain, K.D.

    1995-01-01

    The structural and magnetic properties of small vanadium clusters are studied in the framework of tight-binding theory. According to parameters of the cluster dimer and bulk solid, we developed a tight-binding interatomic potential and calculated the bonding energies for the different possible structures to determine the ground state atomic configurations of the small vanadium clusters. The theoretical bonding energies for the vanadium clusters agree with the experiment much better than the simple droplet model. However, the calculated values for the clusters of odd atomic number are somewhat higher than the measured ones, corresponding to the pair occupation of delocalized 4s 1 electrons. Based on the optimized geometries, we study the magnetic properties of these clusters through a parametrized Hubbard Hamiltonian. We find the small V clusters of ground-state structures exhibit antiferromagnetic behavior while the alignment of local moments in the clusters with the unoptimized structures may show either ferromagnetic or antiferromagnetic characteristics. The average magnetic moments of the clusters decrease nonmonotonically as cluster size increases and the theoretical results are consistent with the upper limits obtained from a recent experiment. (orig.)

  15. Nucleic acid secondary structure prediction and display.

    OpenAIRE

    Stüber, K

    1986-01-01

    A set of programs has been developed for the prediction and display of nucleic acid secondary structures. Information from experimental data can be used to restrict or enforce secondary structural elements. The predictions can be displayed either on normal line printers or on graphic devices like plotters or graphic terminals.

  16. Relative binding affinity prediction of farnesoid X receptor in the D3R Grand Challenge 2 using FEP+

    Science.gov (United States)

    Schindler, Christina; Rippmann, Friedrich; Kuhn, Daniel

    2018-01-01

    Physics-based free energy simulations have increasingly become an important tool for predicting binding affinity and the recent introduction of automated protocols has also paved the way towards a more widespread use in the pharmaceutical industry. The D3R 2016 Grand Challenge 2 provided an opportunity to blindly test the commercial free energy calculation protocol FEP+ and assess its performance relative to other affinity prediction methods. The present D3R free energy prediction challenge was built around two experimental data sets involving inhibitors of farnesoid X receptor (FXR) which is a promising anticancer drug target. The FXR binding site is predominantly hydrophobic with few conserved interaction motifs and strong induced fit effects making it a challenging target for molecular modeling and drug design. For both data sets, we achieved reasonable prediction accuracy (RMSD ≈ 1.4 kcal/mol, rank 3-4 according to RMSD out of 20 submissions) comparable to that of state-of-the-art methods in the field. Our D3R results boosted our confidence in the method and strengthen our desire to expand its applications in future in-house drug design projects.

  17. Relative binding affinity prediction of farnesoid X receptor in the D3R Grand Challenge 2 using FEP.

    Science.gov (United States)

    Schindler, Christina; Rippmann, Friedrich; Kuhn, Daniel

    2018-01-01

    Physics-based free energy simulations have increasingly become an important tool for predicting binding affinity and the recent introduction of automated protocols has also paved the way towards a more widespread use in the pharmaceutical industry. The D3R 2016 Grand Challenge 2 provided an opportunity to blindly test the commercial free energy calculation protocol FEP+ and assess its performance relative to other affinity prediction methods. The present D3R free energy prediction challenge was built around two experimental data sets involving inhibitors of farnesoid X receptor (FXR) which is a promising anticancer drug target. The FXR binding site is predominantly hydrophobic with few conserved interaction motifs and strong induced fit effects making it a challenging target for molecular modeling and drug design. For both data sets, we achieved reasonable prediction accuracy (RMSD ≈ 1.4 kcal/mol, rank 3-4 according to RMSD out of 20 submissions) comparable to that of state-of-the-art methods in the field. Our D3R results boosted our confidence in the method and strengthen our desire to expand its applications in future in-house drug design projects.

  18. Sasquatch: predicting the impact of regulatory SNPs on transcription factor binding from cell- and tissue-specific DNase footprints.

    Science.gov (United States)

    Schwessinger, Ron; Suciu, Maria C; McGowan, Simon J; Telenius, Jelena; Taylor, Stephen; Higgs, Doug R; Hughes, Jim R

    2017-10-01

    In the era of genome-wide association studies (GWAS) and personalized medicine, predicting the impact of single nucleotide polymorphisms (SNPs) in regulatory elements is an important goal. Current approaches to determine the potential of regulatory SNPs depend on inadequate knowledge of cell-specific DNA binding motifs. Here, we present Sasquatch, a new computational approach that uses DNase footprint data to estimate and visualize the effects of noncoding variants on transcription factor binding. Sasquatch performs a comprehensive k -mer-based analysis of DNase footprints to determine any k -mer's potential for protein binding in a specific cell type and how this may be changed by sequence variants. Therefore, Sasquatch uses an unbiased approach, independent of known transcription factor binding sites and motifs. Sasquatch only requires a single DNase-seq data set per cell type, from any genotype, and produces consistent predictions from data generated by different experimental procedures and at different sequence depths. Here we demonstrate the effectiveness of Sasquatch using previously validated functional SNPs and benchmark its performance against existing approaches. Sasquatch is available as a versatile webtool incorporating publicly available data, including the human ENCODE collection. Thus, Sasquatch provides a powerful tool and repository for prioritizing likely regulatory SNPs in the noncoding genome. © 2017 Schwessinger et al.; Published by Cold Spring Harbor Laboratory Press.

  19. Formation Mechanism and Binding Energy for Body-Centred Regular Icosahedral Structure of Li13 Cluster

    International Nuclear Information System (INIS)

    Liu Weina; Li Ping; Gou Qingquan; Zhao Yanping

    2008-01-01

    The formation mechanism for the body-centred regular icosahedral structure of Li 13 cluster is proposed. The curve of the total energy versus the separation R between the nucleus at the centre and nuclei at the apexes for this structure of Li 13 has been calculated by using the method of Gou's modified arrangement channel quantum mechanics (MACQM). The result shows that the curve has a minimal energy of -96.951 39 a.u. at R = 5.46a 0 . When R approaches to infinity, the total energy of thirteen lithium atoms has the value of -96.564 38 a.u. So the binding energy of Li 13 with respect to thirteen lithium atoms is 0.387 01 a.u. Therefore the binding energy per atom for Li 13 is 0.029 77 a.u. or 0.810 eV, which is greater than the binding energy per atom of 0.453 eV for Li 2 , 0.494 eV for Li 3 , 0.7878 eV for Li 4 , 0.632 eV for Li 5 , and 0.674 eV for Li 7 calculated by us previously. This means that the Li 13 cluster may be formed stably in a body-centred regular icosahedral structure with a greater binding energy

  20. Study of binding interaction between anthelmintic 2, 3-dihydroquinazolin-4-ones with bovine serum albumin by spectroscopic methods

    Energy Technology Data Exchange (ETDEWEB)

    Hemalatha, K.; Madhumitha, G., E-mail: madhumitha.g@vit.ac.in

    2016-10-15

    A new series of brominated derivatives of 2, 3-dihydroquinazolin-4(1H)-one were synthesized and their structures were confirmed using IR, NMR and mass spectra. The synthesized derivatives were screened for their in vitro anthelmintic activity. The investigations on interaction of the bioactive compound, 2i with bovine serum albumin (BSA) were evaluated. The quenching mechanism of the compound, 2i was deduced based on the results of Stern–Volmer equation. The number of binding site, prediction of binding site region and the changes in the secondary structure of protein were predicted using various spectroscopic studies.