WorldWideScience

Sample records for binding site prediction

  1. Predicted metal binding sites for phytoremediation.

    Science.gov (United States)

    Sharma, Ashok; Roy, Sudeep; Tripathi, Kumar Parijat; Roy, Pratibha; Mishra, Manoj; Khan, Feroz; Meena, Abha

    2009-09-05

    Metal ion binding domains are found in proteins that mediate transport, buffering or detoxification of metal ions. The objective of the study is to design and analyze metal binding motifs against the genes involved in phytoremediation. This is being done on the basis of certain pre-requisite amino-acid residues known to bind metal ions/metal complexes in medicinal and aromatic plants (MAPs). Earlier work on MAPs has shown that heavy metals accumulated by aromatic and medicinal plants do not appear in the essential oil and that some of these species are able to grow in metal contaminated sites. A pattern search against the UniProtKB/Swiss-Prot and UniProtKB/TrEMBL databases yielded true positives in each case showing the high specificity of the motifs designed for the ions of nickel, lead, molybdenum, manganese, cadmium, zinc, iron, cobalt and xenobiotic compounds. Motifs were also studied against PDB structures. Results of the study suggested the presence of binding sites on the surface of protein molecules involved. PDB structures of proteins were finally predicted for the binding sites functionality in their respective phytoremediation usage. This was further validated through CASTp server to study its physico-chemical properties. Bioinformatics implications would help in designing strategy for developing transgenic plants with increased metal binding capacity. These metal binding factors can be used to restrict metal uptake by plants. This helps in reducing the possibility of metal movement into the food chain.

  2. Computational Prediction of RNA-Binding Proteins and Binding Sites.

    Science.gov (United States)

    Si, Jingna; Cui, Jing; Cheng, Jin; Wu, Rongling

    2015-01-01

    Protein-RNA interactions have vital roles in many cellular processes such as protein synthesis, sequence encoding, RNA transfer, and gene regulation at the transcriptional and post-transcriptional levels. Approximately 6%-8% of all proteins are RNA-binding proteins (RBPs). Distinguishing these RBPs or their binding residues is a major aim of structural biology. Previously, a number of experimental methods were developed for the determination of protein-RNA interactions. However, these experimental methods are expensive, time-consuming, and labor-intensive. Alternatively, researchers have developed many computational approaches to predict RBPs and protein-RNA binding sites, by combining various machine learning methods and abundant sequence and/or structural features. There are three kinds of computational approaches, which are prediction from protein sequence, prediction from protein structure, and protein-RNA docking. In this paper, we review all existing studies of predictions of RNA-binding sites and RBPs and complexes, including data sets used in different approaches, sequence and structural features used in several predictors, prediction method classifications, performance comparisons, evaluation methods, and future directions.

  3. Computational Prediction of RNA-Binding Proteins and Binding Sites

    Directory of Open Access Journals (Sweden)

    Jingna Si

    2015-11-01

    Full Text Available Protein–RNA interactions have vital roles in many cellular processes such as protein synthesis, sequence encoding, RNA transfer, and gene regulation at the transcriptional and post-transcriptional levels. Approximately 6%–8% of all proteins are RNA-binding proteins (RBPs). Distinguishing these RBPs or their binding residues is a major aim of structural biology. Previously, a number of experimental methods were developed for the determination of protein–RNA interactions. However, these experimental methods are expensive, time-consuming, and labor-intensive. Alternatively, researchers have developed many computational approaches to predict RBPs and protein–RNA binding sites, by combining various machine learning methods and abundant sequence and/or structural features. There are three kinds of computational approaches, which are prediction from protein sequence, prediction from protein structure, and protein–RNA docking. In this paper, we review all existing studies of predictions of RNA-binding sites and RBPs and complexes, including data sets used in different approaches, sequence and structural features used in several predictors, prediction method classifications, performance comparisons, evaluation methods, and future directions.

  4. Predicted metal binding sites for phytoremediation

    OpenAIRE

    Sharma, Ashok; Roy, Sudeep; Tripathi, Kumar Parijat; Roy, Pratibha; Mishra, Manoj; Khan, Feroz; Meena, Abha

    2009-01-01

    Metal ion binding domains are found in proteins that mediate transport, buffering or detoxification of metal ions. The objective of the study is to design and analyze metal binding motifs against the genes involved in phytoremediation. This is being done on the basis of certain pre-requisite amino-acid residues known to bind metal ions/metal complexes in medicinal and aromatic plants (MAPs). Earlier work on MAPs has shown that heavy metals accumulated by aromatic and medicinal plants do no...

  5. A systems biology approach to transcription factor binding site prediction.

    Directory of Open Access Journals (Sweden)

    Xiang Zhou

    Full Text Available BACKGROUND: The elucidation of mammalian transcriptional regulatory networks holds great promise for both basic and translational research and remains one of the greatest challenges to systems biology. Recent reverse engineering methods deduce regulatory interactions from large-scale mRNA expression profiles and cross-species conserved regulatory regions in DNA. Technical challenges faced by these methods include distinguishing between direct and indirect interactions, associating transcription regulators with predicted transcription factor binding sites (TFBSs), identifying non-linearly conserved binding sites across species, and providing realistic accuracy estimates. METHODOLOGY/PRINCIPAL FINDINGS: We address these challenges by closely integrating proven methods for regulatory network reverse engineering from mRNA expression data, linearly and non-linearly conserved regulatory region discovery, and TFBS evaluation and discovery. Using an extensive test set of high-likelihood interactions, which we collected in order to provide realistic prediction-accuracy estimates, we show that a careful integration of these methods leads to significant improvements in prediction accuracy. To verify our methods, we biochemically validated TFBS predictions made for both transcription factors (TFs) and co-factors; we validated binding site predictions made using a known E2F1 DNA-binding motif on E2F1 predicted promoter targets, known E2F1 and JUND motifs on JUND predicted promoter targets, and a de novo discovered motif for BCL6 on BCL6 predicted promoter targets. Finally, to demonstrate accuracy of prediction using an external dataset, we showed that sites matching predicted motifs for ZNF263 are significantly enriched in recent ZNF263 ChIP-seq data. CONCLUSIONS/SIGNIFICANCE: Using an integrative framework, we were able to address technical challenges faced by state of the art network reverse engineering methods, leading to significant improvement in direct

  6. Prediction of nucleosome positioning based on transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Xianfu Yi

    Full Text Available BACKGROUND: The DNA of all eukaryotic organisms is packaged into nucleosomes, the basic repeating units of chromatin. The nucleosome consists of a histone octamer around which a DNA core is wrapped and the linker histone H1, which is associated with linker DNA. By altering the accessibility of DNA sequences, the nucleosome has profound effects on all DNA-dependent processes. Understanding the factors that influence nucleosome positioning is of great importance for the study of genomic control mechanisms. Transcription factors (TFs) have been suggested to play a role in nucleosome positioning in vivo. PRINCIPAL FINDINGS: Here, the minimum redundancy maximum relevance (mRMR) feature selection algorithm, the nearest neighbor algorithm (NNA), and the incremental feature selection (IFS) method were used to identify the most important TFs that either favor or inhibit nucleosome positioning by analyzing the numbers of transcription factor binding sites (TFBSs) in 53,021 nucleosomal DNA sequences and 50,299 linker DNA sequences. A total of nine important families of TFs were extracted from 35 families, and the overall prediction accuracy was 87.4% as evaluated by the jackknife cross-validation test. CONCLUSIONS: Our results are consistent with the notion that TFs are more likely to bind linker DNA sequences than the sequences in the nucleosomes. In addition, our results imply that there may be some TFs that are important for nucleosome positioning but that play an insignificant role in discriminating nucleosome-forming DNA sequences from nucleosome-inhibiting DNA sequences. The hypothesis that TFs play a role in nucleosome positioning is, thus, confirmed by the results of this study.

  7. Prediction of calcium-binding sites by combining loop-modeling with machine learning

    Directory of Open Access Journals (Sweden)

    Altman Russ B

    2009-12-01

    Full Text Available Abstract Background Protein ligand-binding sites in the apo state exhibit structural flexibility. This flexibility often frustrates methods for structure-based recognition of these sites because it leads to the absence of electron density for these critical regions, particularly when they are in surface loops. Methods for recognizing functional sites in these missing loops would be useful for recovering additional functional information. Results We report a hybrid approach for recognizing calcium-binding sites in disordered regions. Our approach combines loop modeling with a machine learning method (FEATURE) for structure-based site recognition. For validation, we compared the performance of our method on known calcium-binding sites for which there are both holo and apo structures. When loops in the apo structures are rebuilt using modeling methods, FEATURE identifies 14 out of 20 crystallographically proven calcium-binding sites. It only recognizes 7 out of 20 calcium-binding sites in the initial apo crystal structures. We applied our method to unstructured loops in proteins from SCOP families known to bind calcium in order to discover potential cryptic calcium binding sites. We built 2745 missing loops and evaluated them for potential calcium binding. We made 102 predictions of calcium-binding sites. Ten predictions are consistent with independent experimental verifications. We found indirect experimental evidence for 14 other predictions. The remaining 78 predictions are novel predictions, some with intriguing potential biological significance. In particular, we see an enrichment of beta-sheet folds with predicted calcium binding sites in the connecting loops on the surface that may be important for calcium-mediated function switches. Conclusion Protein crystal structures are a potentially rich source of functional information. When loops are missing in these structures, we may be losing important information about binding sites and active

  8. Genome-wide prediction, display and refinement of binding sites with information theory-based models

    Directory of Open Access Journals (Sweden)

    Leeder J Steven

    2003-09-01

    Full Text Available Abstract Background We present Delila-genome, a software system for identification, visualization and analysis of protein binding sites in complete genome sequences. Binding sites are predicted by scanning genomic sequences with information theory-based (or user-defined) weight matrices. Matrices are refined by adding experimentally-defined binding sites to published binding sites. Delila-Genome was used to examine the accuracy of individual information contents of binding sites detected with refined matrices as a measure of the strengths of the corresponding protein-nucleic acid interactions. The software can then be used to predict novel sites by rescanning the genome with the refined matrices. Results Parameters for genome scans are entered using a Java-based GUI interface and backend scripts in Perl. Multi-processor CPU load-sharing minimized the average response time for scans of different chromosomes. Scans of human genome assemblies required 4–6 hours for transcription factor binding sites and 10–19 hours for splice sites, respectively, on 24- and 3-node Mosix and Beowulf clusters. Individual binding sites are displayed either as high-resolution sequence walkers or in low-resolution custom tracks in the UCSC genome browser. For large datasets, we applied a data reduction strategy that limited displays of binding sites exceeding a threshold information content to specific chromosomal regions within or adjacent to genes. An HTML document is produced listing binding sites ranked by binding site strength or chromosomal location hyperlinked to the UCSC custom track, other annotation databases and binding site sequences. Post-genome scan tools parse binding site annotations of selected chromosome intervals and compare the results of genome scans using different weight matrices. Comparisons of multiple genome scans can display binding sites that are unique to each scan and identify sites with significantly altered binding strengths
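
    As a rough illustration of scanning with an information theory-based weight matrix, the Python sketch below assumes per-position weights of 2 + log2 f(b, l) bits, with no small-sample correction. The aligned sites, the pseudocount, the 6-bit threshold and the function names are all hypothetical; this is not Delila-genome's code or data.

```python
import math

BASES = "ACGT"

def build_ri_matrix(sites, pseudocount=0.5):
    """Return one dict per position with weights 2 + log2 f(b, l), in bits.

    The pseudocount is an illustrative choice that keeps unseen bases finite.
    """
    length = len(sites[0])
    total = len(sites) + 4 * pseudocount
    matrix = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        matrix.append({
            base: 2.0 + math.log2((column.count(base) + pseudocount) / total)
            for base in BASES
        })
    return matrix

def scan(sequence, matrix, threshold_bits):
    """Yield (start, bits) for every window scoring at or above the threshold."""
    width = len(matrix)
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        bits = sum(matrix[i][base] for i, base in enumerate(window))
        if bits >= threshold_bits:
            yield start, bits

# Hypothetical aligned binding sites used to build the matrix.
sites = ["TGACGT", "TGACGT", "TGACGA", "TGATGT"]
matrix = build_ri_matrix(sites)
hits = list(scan("CCTGACGTCC", matrix, threshold_bits=6.0))
for start, bits in hits:
    print(f"site at {start}: {bits:.2f} bits")
```

    Refining a matrix, in this simplified picture, amounts to appending newly validated sites to the list and rebuilding the weights before rescanning.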

  9. An Overview of the Prediction of Protein DNA-Binding Sites

    Directory of Open Access Journals (Sweden)

    Jingna Si

    2015-03-01

    Full Text Available Interactions between proteins and DNA play an important role in many essential biological processes such as DNA replication, transcription, splicing, and repair. The identification of amino acid residues involved in DNA-binding sites is critical for understanding the mechanism of these biological activities. In the last decade, numerous computational approaches have been developed to predict protein DNA-binding sites based on protein sequence and/or structural information, which play an important role in complementing experimental strategies. At this time, approaches can be divided into three categories: sequence-based DNA-binding site prediction, structure-based DNA-binding site prediction, and homology modeling and threading. In this article, we review existing research on computational methods to predict protein DNA-binding sites, which includes data sets, various residue sequence/structural features, machine learning methods for comparison and selection, evaluation methods, performance comparison of different tools, and future directions in protein DNA-binding site prediction. In particular, we detail the meta-analysis of protein DNA-binding sites. We also propose specific implications that are likely to result in novel prediction methods, increased performance, or practical applications.

  10. Using TESS to predict transcription factor binding sites in DNA sequence.

    Science.gov (United States)

    Schug, Jonathan

    2008-03-01

    This unit describes how to use the Transcription Element Search System (TESS). This Web site predicts transcription factor binding sites (TFBS) in DNA sequence using two different kinds of models of sites, strings and positional weight matrices. The binding of transcription factors to DNA is a major part of the control of gene expression. Transcription factors exhibit sequence-specific binding; they form stronger bonds to some DNA sequences than to others. Identification of a good binding site in the promoter for a gene suggests the possibility that the corresponding factor may play a role in the regulation of that gene. However, the sequences transcription factors recognize are typically short and allow for some amount of mismatch. Because of this, binding sites for a factor can typically be found at random every few hundred to a thousand base pairs. TESS has features to help sort through and evaluate the significance of predicted sites.
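
    The remark about random matches can be checked with a short calculation. The Python sketch below assumes uniform, independent base composition and a single strand; the consensus string, the test sequence and the function names are hypothetical, and the code is not TESS's scoring scheme.

```python
from math import comb

def match_probability(site_length, max_mismatches):
    """Chance that a random window matches a string site, assuming each
    base is equally likely (probability 1/4) and positions are independent."""
    return sum(
        comb(site_length, j) * (0.75 ** j) * (0.25 ** (site_length - j))
        for j in range(max_mismatches + 1)
    )

def find_string_sites(sequence, consensus, max_mismatches):
    """Return (start, window, mismatches) for windows within the mismatch limit."""
    width = len(consensus)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        mismatches = sum(a != b for a, b in zip(window, consensus))
        if mismatches <= max_mismatches:
            hits.append((start, window, mismatches))
    return hits

# Expected spacing between chance matches of a 6 bp site with one mismatch allowed.
spacing = 1.0 / match_probability(6, 1)
print(f"one chance match every {spacing:.0f} bp")

# Hypothetical consensus and sequence for the string model.
hits = find_string_sites("AATGACTCATTTGAGTCA", "TGACTCA", 1)
print(hits)
```

    Under these assumptions a 6 bp site with one permitted mismatch is expected about once every 216 bp, and an exact 6 bp match about once every 4096 bp, which is why a predicted site on its own is weak evidence of regulation.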

  11. Predicting DNA-binding sites of proteins based on sequential and 3D structural information.

    Science.gov (United States)

    Li, Bi-Qing; Feng, Kai-Yan; Ding, Juan; Cai, Yu-Dong

    2014-06-01

    Protein-DNA interactions play important roles in many biological processes. To understand the molecular mechanisms of protein-DNA interaction, it is necessary to identify the DNA-binding sites in DNA-binding proteins. In the last decade, computational approaches have been developed to predict protein-DNA-binding sites based solely on protein sequences. In this study, we developed a novel predictor based on support vector machine algorithm coupled with the maximum relevance minimum redundancy method followed by incremental feature selection. We incorporated not only features of physicochemical/biochemical properties, sequence conservation, residual disorder, secondary structure, solvent accessibility, but also five three-dimensional (3D) structural features calculated from PDB data to predict the protein-DNA interaction sites. Feature analysis showed that 3D structural features indeed contributed to the prediction of DNA-binding site and it was demonstrated that the prediction performance was better with 3D structural features than without them. It was also shown via analysis of features from each site that the features of DNA-binding site itself contribute the most to the prediction. Our prediction method may become a useful tool for identifying the DNA-binding sites and the feature analysis described in this paper may provide useful insights for in-depth investigations into the mechanisms of protein-DNA interaction.

  12. Spatial distribution of predicted transcription factor binding sites in Drosophila ChIP peaks.

    Science.gov (United States)

    Pettie, Kade P; Dresch, Jacqueline M; Drewell, Robert A

    2016-08-01

    In the development of the Drosophila embryo, gene expression is directed by the sequence-specific interactions of a large network of protein transcription factors (TFs) and DNA cis-regulatory binding sites. Once the identity of the typically 8-10bp binding sites for any given TF has been determined by one of several experimental procedures, the sequences can be represented in a position weight matrix (PWM) and used to predict the location of additional TF binding sites elsewhere in the genome. Often, alignments of large (>200bp) genomic fragments that have been experimentally determined to bind the TF of interest in Chromatin Immunoprecipitation (ChIP) studies are trimmed under the assumption that the majority of the binding sites are located near the center of all the aligned fragments. In this study, ChIP/chip datasets are analyzed using the corresponding PWMs for the well-studied TFs; CAUDAL, HUNCHBACK, KNIRPS and KRUPPEL, to determine the distribution of predicted binding sites. All four TFs are critical regulators of gene expression along the anterio-posterior axis in early Drosophila development. For all four TFs, the ChIP peaks contain multiple binding sites that are broadly distributed across the genomic region represented by the peak, regardless of the prediction stringency criteria used. This result suggests that ChIP peak trimming may exclude functional binding sites from subsequent analyses.

  13. MBSTAR: multiple instance learning for predicting specific functional binding sites in microRNA targets

    Science.gov (United States)

    Bandyopadhyay, Sanghamitra; Ghosh, Dip; Mitra, Ramkrishna; Zhao, Zhongming

    2015-01-01

    MicroRNA (miRNA) regulates gene expression by binding to specific sites in the 3'untranslated regions of its target genes. Machine learning based miRNA target prediction algorithms first extract a set of features from potential binding sites (PBSs) in the mRNA and then train a classifier to distinguish targets from non-targets. However, they do not consider whether the PBSs are functional or not, and consequently result in high false positive rates. This substantially affects the follow up functional validation by experiments. We present a novel machine learning based approach, MBSTAR (Multiple instance learning of Binding Sites of miRNA TARgets), for accurate prediction of true or functional miRNA binding sites. Multiple instance learning framework is adopted to handle the lack of information about the actual binding sites in the target mRNAs. Biologically validated 9531 interacting and 973 non-interacting miRNA-mRNA pairs are identified from Tarbase 6.0 and confirmed with PAR-CLIP dataset. It is found that MBSTAR achieves the highest number of binding sites overlapping with PAR-CLIP with maximum F-Score of 0.337. Compared to the other methods, MBSTAR also predicts target mRNAs with highest accuracy. The tool and genome wide predictions are available at http://www.isical.ac.in/~bioinfo_miu/MBStar30.htm.

  14. Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

    Directory of Open Access Journals (Sweden)

    John A Capra

    2009-12-01

    Full Text Available Identifying a protein's functional sites is an important step towards characterizing its molecular function. Numerous structure- and sequence-based methods have been developed for this problem. Here we introduce ConCavity, a small molecule binding site prediction algorithm that integrates evolutionary sequence conservation estimates with structure-based methods for identifying protein surface cavities. In large-scale testing on a diverse set of single- and multi-chain protein structures, we show that ConCavity substantially outperforms existing methods for identifying both 3D ligand binding pockets and individual ligand binding residues. As part of our testing, we perform one of the first direct comparisons of conservation-based and structure-based methods. We find that the two approaches provide largely complementary information, which can be combined to improve upon either approach alone. We also demonstrate that ConCavity has state-of-the-art performance in predicting catalytic sites and drug binding pockets. Overall, the algorithms and analysis presented here significantly improve our ability to identify ligand binding sites and further advance our understanding of the relationship between evolutionary sequence conservation and structural and functional attributes of proteins. Data, source code, and prediction visualizations are available on the ConCavity web site (http://compbio.cs.princeton.edu/concavity/).

  15. Predicting protein ligand binding sites by combining evolutionary sequence conservation and 3D structure.

    Science.gov (United States)

    Capra, John A; Laskowski, Roman A; Thornton, Janet M; Singh, Mona; Funkhouser, Thomas A

    2009-12-01

    Identifying a protein's functional sites is an important step towards characterizing its molecular function. Numerous structure- and sequence-based methods have been developed for this problem. Here we introduce ConCavity, a small molecule binding site prediction algorithm that integrates evolutionary sequence conservation estimates with structure-based methods for identifying protein surface cavities. In large-scale testing on a diverse set of single- and multi-chain protein structures, we show that ConCavity substantially outperforms existing methods for identifying both 3D ligand binding pockets and individual ligand binding residues. As part of our testing, we perform one of the first direct comparisons of conservation-based and structure-based methods. We find that the two approaches provide largely complementary information, which can be combined to improve upon either approach alone. We also demonstrate that ConCavity has state-of-the-art performance in predicting catalytic sites and drug binding pockets. Overall, the algorithms and analysis presented here significantly improve our ability to identify ligand binding sites and further advance our understanding of the relationship between evolutionary sequence conservation and structural and functional attributes of proteins. Data, source code, and prediction visualizations are available on the ConCavity web site (http://compbio.cs.princeton.edu/concavity/).

  16. A Large-Scale Assessment of Nucleic Acids Binding Site Prediction Programs.

    Directory of Open Access Journals (Sweden)

    Zhichao Miao

    2015-12-01

    Full Text Available Computational prediction of nucleic acid binding sites in proteins is necessary to disentangle functional mechanisms in most biological processes and to explore the binding mechanisms. Several strategies have been proposed, but the state-of-the-art approaches display a great diversity in (i) the definition of nucleic acid binding sites; (ii) the training and test datasets; (iii) the algorithmic methods for the prediction strategies; (iv) the performance measures and (v) the distribution and availability of the prediction programs. Here we report a large-scale assessment of 19 web servers and 3 stand-alone programs on 41 datasets including more than 5000 proteins derived from 3D structures of protein-nucleic acid complexes. Well-defined binary assessment criteria (specificity, sensitivity, precision, accuracy…) are applied. We found that (i) the tools have been greatly improved over the years; (ii) some of the approaches suffer from theoretical defects and there is still room for sorting out the essential mechanisms of binding; (iii) RNA binding and DNA binding appear to follow similar driving forces and (iv) dataset bias may exist in some methods.
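
    The binary assessment criteria named above have standard definitions in terms of true/false positives and negatives. The Python sketch below computes them for per-residue labels; the label vectors and the function name are hypothetical, and the assessment's own scripts may differ in detail.

```python
def binary_metrics(predicted, actual):
    """Compute sensitivity, specificity, precision and accuracy from
    per-residue labels (1 = binding, 0 = non-binding)."""
    pairs = list(zip(predicted, actual))
    tp = sum(1 for p, a in pairs if p == 1 and a == 1)
    tn = sum(1 for p, a in pairs if p == 0 and a == 0)
    fp = sum(1 for p, a in pairs if p == 1 and a == 0)
    fn = sum(1 for p, a in pairs if p == 0 and a == 1)

    def ratio(numerator, denominator):
        # Undefined ratios (empty denominator) are reported as None.
        return numerator / denominator if denominator else None

    return {
        "sensitivity": ratio(tp, tp + fn),
        "specificity": ratio(tn, tn + fp),
        "precision": ratio(tp, tp + fp),
        "accuracy": ratio(tp + tn, len(pairs)),
    }

# Hypothetical 10-residue protein: three true binding residues.
actual = [1, 1, 1, 0, 0, 0, 0, 0, 0, 0]
predicted = [1, 1, 0, 1, 0, 0, 0, 0, 0, 0]
metrics = binary_metrics(predicted, actual)
print(metrics)
```

    Because binding residues are usually a small minority, accuracy alone can look high for a predictor that labels almost nothing as binding, which is one reason several measures are reported together.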

  17. STarMir Tools for Prediction of microRNA binding sites

    Science.gov (United States)

    Kanoria, Shaveta; Rennie, William; Liu, Chaochun; Carmack, C. Steven; Lu, Jun; Ding, Ye

    2017-01-01

    MicroRNAs (miRNAs) are a class of endogenous short non-coding RNAs that regulate gene expression by targeting messenger RNAs (mRNAs), which results in translational repression and/or mRNA degradation. As regulatory molecules, miRNAs are involved in many mammalian biological processes and also in the manifestation of certain human diseases. As miRNAs play a central role in the regulation of gene expression, understanding miRNA-binding patterns is essential to gain insight into miRNA-mediated gene regulation and also holds promise for therapeutic applications. Computational prediction of miRNA binding sites on target mRNAs facilitates experimental investigation of miRNA functions. This chapter provides protocols for using the STarMir web server for improved predictions of miRNA binding sites on a target mRNA. As an application module of the Sfold RNA package, the current version of STarMir is an implementation of logistic prediction models developed with high throughput miRNA binding data from crosslinking immuno-precipitation (CLIP) studies. The models incorporated comprehensive thermodynamic, structural and sequence features, and were found to make improved predictions of both seed and seedless sites, in comparison to the established algorithms [1]. Their broad applicability was indicated by their good performance in cross-species validation. STarMir is freely available at http://sfold.wadsworth.org/starmir.html (PMID:27665594)

  18. Computational prediction of cAMP receptor protein (CRP) binding sites in cyanobacterial genomes

    Directory of Open Access Journals (Sweden)

    Su Zhengchang

    2009-01-01

    Full Text Available Abstract Background Cyclic AMP receptor protein (CRP), also known as catabolite gene activator protein (CAP), is an important transcriptional regulator widely distributed in many bacteria. The biological processes under the regulation of CRP are highly diverse among different groups of bacterial species. Elucidation of CRP regulons in cyanobacteria will further our understanding of the physiology and ecology of this important group of microorganisms. Previously, CRP has been experimentally studied in only two cyanobacterial strains: Synechocystis sp. PCC 6803 and Anabaena sp. PCC 7120; therefore, a systematic genome-scale study of the potential CRP target genes and binding sites in cyanobacterial genomes is urgently needed. Results We have predicted and analyzed the CRP binding sites and regulons in 12 sequenced cyanobacterial genomes using a highly effective cis-regulatory binding site scanning algorithm. Our results show that cyanobacterial CRP binding sites are very similar to those in E. coli; however, the regulons are very different from that of E. coli. Furthermore, CRP regulons in different cyanobacterial species/ecotypes are also highly diversified, ranging from photosynthesis, carbon fixation and nitrogen assimilation, to chemotaxis and signal transduction. In addition, our prediction indicates that crp genes in modern cyanobacteria are likely inherited from a common ancestral gene in their last common ancestor, and have adapted various cellular functions in different environments, while some cyanobacteria lost their crp genes as well as CRP binding sites during the course of evolution. Conclusion The CRP regulons in cyanobacteria are highly diversified, probably as a result of divergent evolution to adapt to various ecological niches. Cyanobacterial CRPs may function as lineage-specific regulators participating in various cellular processes, and are important in some lineages. However, they are dispensable in some other lineages. The

  19. Improving the prediction of protein binding sites by combining heterogeneous data and Voronoi diagrams

    Directory of Open Access Journals (Sweden)

    Fernandez-Fuentes Narcis

    2011-08-01

    Full Text Available Abstract Background Protein binding site prediction by computational means can yield valuable information that complements and guides experimental approaches to determine the structure of protein complexes. Predictions become even more relevant and timely given the current resolution of protein interaction maps, where there is a very large and still expanding gap between the available information on: (i) which proteins interact and (ii) how proteins interact. Proteins interact through exposed residues that present differential physicochemical properties, and these can be exploited to identify protein interfaces. Results Here we present VORFFIP, a novel method for protein binding site prediction. The method makes use of a broad set of heterogeneous data and a definition of residue environment by means of Voronoi Diagrams, which are integrated by a two-step Random Forest ensemble classifier. Four sets of residue features (structural, energy terms, sequence conservation, and crystallographic B-factors) used in different combinations together with three definitions of residue environment (Voronoi Diagrams, sequence sliding window, and Euclidian distance) have been analyzed in order to maximize the performance of the method. Conclusions The integration of different forms of information such as structural features, energy terms, evolutionary conservation and crystallographic B-factors improves the performance of binding site prediction. Including the information of neighbouring residues also improves the prediction of protein interfaces. Among the different approaches that can be used to define the environment of exposed residues, Voronoi Diagrams provide the most accurate description. Finally, VORFFIP compares favourably to other methods reported in the recent literature.

  20. Proteins and Their Interacting Partners: An Introduction to Protein-Ligand Binding Site Prediction Methods.

    Science.gov (United States)

    Roche, Daniel Barry; Brackenridge, Danielle Allison; McGuffin, Liam James

    2015-12-15

    Elucidating the biological and biochemical roles of proteins, and subsequently determining their interacting partners, can be difficult and time consuming using in vitro and/or in vivo methods, and consequently the majority of newly sequenced proteins will have unknown structures and functions. However, in silico methods for predicting protein-ligand binding sites and protein biochemical functions offer an alternative practical solution. The characterisation of protein-ligand binding sites is essential for investigating new functional roles, which can impact the major biological research spheres of health, food, and energy security. In this review we discuss the role in silico methods play in 3D modelling of protein-ligand binding sites, along with their role in predicting biochemical functionality. In addition, we describe in detail some of the key alternative in silico prediction approaches that are available, as well as discussing the Critical Assessment of Techniques for Protein Structure Prediction (CASP) and the Continuous Automated Model EvaluatiOn (CAMEO) projects, and their impact on developments in the field. Furthermore, we discuss the importance of protein function prediction methods for tackling 21st century problems.

  1. Development of a protein-ligand-binding site prediction method based on interaction energy and sequence conservation.

    Science.gov (United States)

    Tsujikawa, Hiroto; Sato, Kenta; Wei, Cao; Saad, Gul; Sumikoshi, Kazuya; Nakamura, Shugo; Terada, Tohru; Shimizu, Kentaro

    2016-09-01

    We present a new method for predicting protein-ligand-binding sites based on protein three-dimensional structure and amino acid conservation. This method involves calculation of the van der Waals interaction energy between a protein and many probes placed on the protein surface and subsequent clustering of the probes with low interaction energies to identify the most energetically favorable locus. In addition, it uses amino acid conservation among homologous proteins. Ligand-binding sites were predicted by combining the interaction energy and the amino acid conservation score. The performance of our prediction method was evaluated using a non-redundant dataset of 348 ligand-bound and ligand-unbound protein structure pairs, constructed by filtering entries in a ligand-binding site structure database, LigASite. Ligand-bound structure prediction (bound prediction) indicated that 74.0 % of predicted ligand-binding sites overlapped with real ligand-binding sites by over 25 % of their volume. Ligand-unbound structure prediction (unbound prediction) indicated that 73.9 % of predicted ligand-binding residues overlapped with real ligand-binding residues. The amino acid conservation score improved the average prediction accuracy by 17.0 and 17.6 points for the bound and unbound predictions, respectively. These results demonstrate the effectiveness of the combined use of the interaction energy and amino acid conservation in the ligand-binding site prediction.
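
    The probe-energy step described in this abstract can be sketched as follows. The abstract does not state the functional form or parameters used, so the 12-6 Lennard-Jones potential, the `epsilon` and `sigma` values, and the function names below are assumptions. The clustering of probes and the conservation score are not shown.

```python
import math

def lj_energy(probe, atoms, epsilon=0.1, sigma=3.5):
    """Sum of 12-6 Lennard-Jones terms between one probe and all atoms.
    Assumes the probe never sits exactly on an atom (r > 0)."""
    total = 0.0
    for atom in atoms:
        r = math.dist(probe, atom)
        sr6 = (sigma / r) ** 6
        total += 4.0 * epsilon * (sr6 * sr6 - sr6)
    return total

def lowest_energy_probes(probes, atoms, keep=2):
    """Return the `keep` probes with the most favourable (lowest) energy."""
    return sorted(probes, key=lambda p: lj_energy(p, atoms))[:keep]

# Hypothetical toy system: two protein atoms and three candidate probe positions.
atoms = [(0.0, 0.0, 0.0), (4.0, 0.0, 0.0)]
probes = [(2.0, 3.4, 0.0), (2.0, 10.0, 0.0), (2.0, 1.0, 0.0)]

print(lowest_energy_probes(probes, atoms, keep=1))
```

    In this toy system the probe that sits near the energy minimum of both atoms is kept. The distant probe interacts only weakly and the clashing probe is strongly penalised.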

  2. Predicting Polymerase Ⅱ Core Promoters by Cooperating Transcription Factor Binding Sites in Eukaryotic Genes

    Institute of Scientific and Technical Information of China (English)

    Xiao-Tu MA; Min-Ping QIAN; Hai-Xu TANG

    2004-01-01

    Several discriminant functions for predicting core promoters, based on the potential cooperation between transcription factor binding sites (TFBSs), are discussed. It is demonstrated that the promoter prediction accuracy is improved when the cooperation among TFBSs is taken into consideration. The core promoter region of a newly discovered gene, CKLFSF1, is predicted to be located more than 1.5 kb away from the 5′ end of the transcript and in the last intron of its upstream gene, which was later confirmed experimentally. The core promoters of 3402 human RefSeq sequences, obtained by extending the mRNAs in human genome sequences, are predicted by our algorithm, and about 60% of the predicted core promoters are located within the ±500 bp region relative to the annotated transcription start site.

  3. Cell-type specificity of ChIP-predicted transcription factor binding sites

    Directory of Open Access Journals (Sweden)

    Håndstad Tony

    2012-08-01

    Background: Context-dependent transcription factor (TF) binding is one reason for differences in gene expression patterns between different cellular states. Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq) identifies genome-wide TF binding sites for one particular context: the cells used in the experiment. But can such ChIP-seq data predict TF binding in other cellular contexts, and is it possible to distinguish context-dependent from ubiquitous TF binding? Results: We compared ChIP-seq data on TF binding for multiple TFs in two different cell types and found that on average only a third of ChIP-seq peak regions are common to both cell types. As expected, common peaks occur more frequently in certain genomic contexts, such as CpG-rich promoters, whereas chromatin differences characterize cell-type specific TF binding. We also find, however, that genotype differences between the cell types can explain differences in binding. Moreover, ChIP-seq signal intensity and peak clustering are the strongest predictors of common peaks. Compared with strong peaks located in regions containing peaks for multiple transcription factors, weak and isolated peaks are less common between the cell types and are less associated with data that indicate regulatory activity. Conclusions: Together, the results suggest that experimental noise is prevalent among weak peaks, whereas strong and clustered peaks represent high-confidence binding events that often occur in other cellular contexts. Nevertheless, 30-40% of the strongest and most clustered peaks show context-dependent regulation. We show that by combining signal intensity with additional data, ranging from context-independent information such as binding site conservation and position weight matrix scores to context-dependent chromatin structure, we can predict whether a ChIP-seq peak is likely to be present in other cellular contexts.
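
    The "fraction of peak regions common to both cell types" reported in this abstract can be illustrated with a small interval-overlap sketch in Python. The overlap criterion (at least one shared base on the same chromosome), the half-open coordinate convention and the toy peaks are assumptions. The abstract does not say how the authors defined a common peak.

```python
def overlaps(a, b):
    """True if two (chrom, start, end) half-open intervals share at least one base."""
    return a[0] == b[0] and a[1] < b[2] and b[1] < a[2]

def common_fraction(peaks_a, peaks_b):
    """Fraction of peaks in cell type A that overlap any peak in cell type B."""
    if not peaks_a:
        return 0.0
    shared = sum(1 for p in peaks_a if any(overlaps(p, q) for q in peaks_b))
    return shared / len(peaks_a)

# Hypothetical peak calls for the same TF in two cell types.
peaks_a = [("chr1", 100, 200), ("chr1", 500, 600), ("chr2", 100, 200)]
peaks_b = [("chr1", 150, 250), ("chr2", 300, 400)]

print(common_fraction(peaks_a, peaks_b))
```

    The quadratic scan is adequate for a toy example. Genome-scale peak sets would normally be sorted or indexed first.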

  4. Combining features in a graphical model to predict protein binding sites.

    Science.gov (United States)

    Wierschin, Torsten; Wang, Keyu; Welter, Marlon; Waack, Stephan; Stanke, Mario

    2015-05-01

    Large efforts have been made in classifying residues as binding sites in proteins using machine learning methods. The prediction task can be translated into the computational challenge of assigning each residue the label binding site or non-binding site. Observational data comes from various possibly highly correlated sources. It includes the structure of the protein but not the structure of the complex. The model class of conditional random fields (CRFs) has previously successfully been used for protein binding site prediction. Here, a new CRF-approach is presented that models the dependencies of residues using a general graphical structure defined as a neighborhood graph and thus our model makes fewer independence assumptions on the labels than sequential labeling approaches. A novel node feature "change in free energy" is introduced into the model, which is then denoted by ΔF-CRF. Parameters are trained with an online large-margin algorithm. Using the standard feature class relative accessible surface area alone, the general graph-structure CRF already achieves higher prediction accuracy than the linear chain CRF of Li et al. ΔF-CRF performs significantly better on a large range of false positive rates than the support-vector-machine-based program PresCont of Zellner et al. on a homodimer set containing 128 chains. ΔF-CRF has a broader scope than PresCont since it is not constrained to protein subgroups and requires no multiple sequence alignment. The improvement is attributed to the advantageous combination of the novel node feature with the standard feature and to the adopted parameter training method.

  5. SNP2TFBS – a database of regulatory SNPs affecting predicted transcription factor binding site affinity

    Science.gov (United States)

    Kumar, Sunil; Ambrosini, Giovanna; Bucher, Philipp

    2017-01-01

    SNP2TFBS is a computational resource intended to support researchers investigating the molecular mechanisms underlying regulatory variation in the human genome. The database essentially consists of a collection of text files providing specific annotations for human single nucleotide polymorphisms (SNPs), namely whether they are predicted to abolish, create or change the affinity of one or several transcription factor (TF) binding sites. A SNP's effect on TF binding is estimated based on a position weight matrix (PWM) model for the binding specificity of the corresponding factor. These data files are regenerated at regular intervals by an automatic procedure that takes as input a reference genome, a comprehensive SNP catalogue and a collection of PWMs. SNP2TFBS is also accessible over a web interface, enabling users to view the information provided for an individual SNP, to extract SNPs based on various search criteria, to annotate uploaded sets of SNPs or to display statistics about the frequencies of binding sites affected by selected SNPs. Homepage: http://ccg.vital-it.ch/snp2tfbs/. PMID:27899579
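
    How a PWM can be used to estimate a SNP's effect on a binding site is sketched below in Python. This is not the SNP2TFBS pipeline: the pseudocount, the uniform background, the single fixed window and all function names are assumptions, and the count matrix is invented for the example.

```python
import math

def log_odds_pwm(counts, pseudocount=1.0, background=0.25):
    """Turn per-position base counts into a log-odds position weight matrix."""
    pwm = []
    for col in counts:
        total = sum(col.get(b, 0) for b in "ACGT") + 4 * pseudocount
        pwm.append({
            b: math.log2(((col.get(b, 0) + pseudocount) / total) / background)
            for b in "ACGT"
        })
    return pwm

def score(pwm, seq):
    """PWM score of a sequence window with the same length as the matrix."""
    return sum(col[base] for col, base in zip(pwm, seq))

def snp_effect(pwm, ref_seq, offset, alt):
    """Score change when the base at `offset` in the window is replaced by `alt`."""
    alt_seq = ref_seq[:offset] + alt + ref_seq[offset + 1:]
    return score(pwm, alt_seq) - score(pwm, ref_seq)

# Hypothetical 3-bp motif built from eight aligned sites, all reading "ACG".
counts = [{"A": 8}, {"C": 8}, {"G": 8}]
pwm = log_odds_pwm(counts)

print(snp_effect(pwm, "ACG", 1, "T"))
```

    A negative change suggests a weakened site and a positive one a strengthened or newly created site. Any threshold for calling a site abolished or created would be a further assumption.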

  6. LIGSITEcsc: predicting ligand binding sites using the Connolly surface and degree of conservation

    Directory of Open Access Journals (Sweden)

    Schroeder Michael

    2006-09-01

    Background: Identifying pockets on protein surfaces is of great importance for many structure-based drug design applications and protein-ligand docking algorithms. Over the last ten years, many geometric methods for the prediction of ligand-binding sites have been developed. Results: We present LIGSITEcsc, an extension and implementation of the LIGSITE algorithm. LIGSITEcsc is based on the notion of surface-solvent-surface events and the degree of conservation of the involved surface residues. We compare our algorithm to four other approaches, LIGSITE, CAST, PASS, and SURFNET, and evaluate all on a dataset of 48 unbound/bound structures and 210 bound structures. LIGSITEcsc performs slightly better than the other tools and achieves a success rate of 71% and 75%, respectively. Conclusion: The use of the Connolly surface leads to slight improvements, and the re-ranking of predictions by conservation leads to significant improvements of the binding site predictions. A web server for LIGSITEcsc and its source code are available at scoppi.biotec.tu-dresden.de/pocket.

  7. The predicted 3D structure of the human D2 dopamine receptor and the binding site and binding affinities for agonists and antagonists

    Science.gov (United States)

    Kalani, M. Yashar S.; Vaidehi, Nagarajan; Hall, Spencer E.; Trabanino, Rene J.; Freddolino, Peter L.; Kalani, Maziyar A.; Floriano, Wely B.; Tak Kam, Victor Wai; Goddard, William A., III

    2004-03-01

    Dopamine neurotransmitter and its receptors play a critical role in the cell signaling process responsible for information transfer in neurons functioning in the nervous system. Development of improved therapeutics for such disorders as Parkinson's disease and schizophrenia would be significantly enhanced with the availability of the 3D structure for the dopamine receptors and of the binding site for dopamine and other agonists and antagonists. We report here the 3D structure of the long isoform of the human D2 dopamine receptor, predicted from primary sequence using first-principles theoretical and computational techniques (i.e., we did not use bioinformatic or experimental 3D structural information in predicting structures). The predicted 3D structure is validated by comparison of the predicted binding site and the relative binding affinities of dopamine, three known dopamine agonists (antiparkinsonian), and seven known antagonists (antipsychotic) in the D2 receptor to experimentally determined values. These structures correctly predict the critical residues for binding dopamine and several antagonists, identified by mutation studies, and give relative binding affinities that correlate well with experiments. The predicted binding site for dopamine and agonists is located between transmembrane (TM) helices 3, 4, 5, and 6, whereas the best antagonists bind to a site involving TM helices 2, 3, 4, 6, and 7 with minimal contacts to TM helix 5. We identify characteristic differences between the binding sites of agonists and antagonists.

  8. Identification of co-regulated genes through Bayesian clustering of predicted regulatory binding sites.

    Science.gov (United States)

    Qin, Zhaohui S; McCue, Lee Ann; Thompson, William; Mayerhofer, Linda; Lawrence, Charles E; Liu, Jun S

    2003-04-01

    The identification of co-regulated genes and their transcription-factor binding sites (TFBS) are key steps toward understanding transcription regulation. In addition to effective laboratory assays, various computational approaches for the detection of TFBS in promoter regions of coexpressed genes have been developed. The availability of complete genome sequences combined with the likelihood that transcription factors and their cognate sites are often conserved during evolution has led to the development of phylogenetic footprinting. The modus operandi of this technique is to search for conserved motifs upstream of orthologous genes from closely related species. The method can identify hundreds of TFBS without prior knowledge of co-regulation or coexpression. Because many of these predicted sites are likely to be bound by the same transcription factor, motifs with similar patterns can be put into clusters so as to infer the sets of co-regulated genes, that is, the regulons. This strategy utilizes only genome sequence information and is complementary to and confirmative of gene expression data generated by microarray experiments. However, the limited data available to characterize individual binding patterns, the variation in motif alignment, motif width, and base conservation, and the lack of knowledge of the number and sizes of regulons make this inference problem difficult. We have developed a Gibbs sampling-based Bayesian motif clustering (BMC) algorithm to address these challenges. Tests on simulated data sets show that BMC produces many fewer errors than hierarchical and K-means clustering methods. The application of BMC to hundreds of predicted gamma-proteobacterial motifs correctly identified many experimentally reported regulons, inferred the existence of previously unreported members of these regulons, and suggested novel regulons.

  9. A sequence-based dynamic ensemble learning system for protein ligand-binding site prediction

    KAUST Repository

    Chen, Peng

    2015-12-03

    Background: Proteins have the fundamental ability to selectively bind to other molecules and perform specific functions through such interactions, such as protein-ligand binding. Accurate prediction of protein residues that physically bind to ligands is important for drug design and protein docking studies. Most of the successful protein-ligand binding predictions were based on known structures. However, structural information is not largely available in practice due to the huge gap between the number of known protein sequences and that of experimentally solved structures

  10. Genome wide prediction of HNF4alpha functional binding sites by the use of local and global sequence context.

    Science.gov (United States)

    Kel, Alexander E; Niehof, Monika; Matys, Volker; Zemlin, Rüdiger; Borlak, Jürgen

    2008-01-01

    We report an application of machine learning algorithms that enables prediction of the functional context of transcription factor binding sites in the human genome. We demonstrate that our method allowed de novo identification of hepatic nuclear factor (HNF)4alpha binding sites and significantly improved overall recognition of faithful HNF4alpha targets. When applied to published findings, an unprecedentedly high number of false positives was identified. The technique can be applied to any transcription factor.

  11. Interactome-wide prediction of protein-protein binding sites reveals effects of protein sequence variation in Arabidopsis thaliana.

    Directory of Open Access Journals (Sweden)

    Felipe Leal Valentim

    The specificity of protein-protein interactions is encoded in those parts of the sequence that compose the binding interface. Therefore, understanding how changes in protein sequence influence interaction specificity, and possibly the phenotype, requires knowing the location of binding sites in those sequences. However, large-scale detection of protein interfaces remains a challenge. Here, we present a sequence- and interactome-based approach to mine interaction motifs from the recently published Arabidopsis thaliana interactome. The resultant proteome-wide predictions are available via www.ab.wur.nl/sliderbio and set the stage for further investigations of protein-protein binding sites. To assess our method, we first show that, by using a priori information calculated from protein sequences, such as evolutionary conservation and residue surface accessibility, we improve the performance of interface prediction compared to using only interactome data. Next, we present evidence for the functional importance of the predicted sites, which are under stronger selective pressure than the rest of protein sequence. We also observe a tendency for compensatory mutations in the binding sites of interacting proteins. Subsequently, we interrogated the interactome data to formulate testable hypotheses for the molecular mechanisms underlying effects of protein sequence mutations. Examples include proteins relevant for various developmental processes. Finally, we observed, by analysing pairs of paralogs, a correlation between functional divergence and sequence divergence in interaction sites. This analysis suggests that large-scale prediction of binding sites can cast light on evolutionary processes that shape protein-protein interaction networks.

  12. The IntFOLD server: an integrated web resource for protein fold recognition, 3D model quality assessment, intrinsic disorder prediction, domain prediction and ligand binding site prediction.

    Science.gov (United States)

    Roche, Daniel B; Buenavista, Maria T; Tetchner, Stuart J; McGuffin, Liam J

    2011-07-01

    The IntFOLD server is a novel independent server that integrates several cutting edge methods for the prediction of structure and function from sequence. Our guiding principles behind the server development were as follows: (i) to provide a simple unified resource that makes our prediction software accessible to all and (ii) to produce integrated output for predictions that can be easily interpreted. The output for predictions is presented as a simple table that summarizes all results graphically via plots and annotated 3D models. The raw machine readable data files for each set of predictions are also provided for developers, which comply with the Critical Assessment of Methods for Protein Structure Prediction (CASP) data standards. The server comprises an integrated suite of five novel methods: nFOLD4, for tertiary structure prediction; ModFOLD 3.0, for model quality assessment; DISOclust 2.0, for disorder prediction; DomFOLD 2.0 for domain prediction; and FunFOLD 1.0, for ligand binding site prediction. Predictions from the IntFOLD server were found to be competitive in several categories in the recent CASP9 experiment. The IntFOLD server is available at the following web site: http://www.reading.ac.uk/bioinf/IntFOLD/.

  13. Predicting transcription factor binding sites using local over-representation and comparative genomics

    Directory of Open Access Journals (Sweden)

    Touzet Hélène

    2006-08-01

    Background: Identifying cis-regulatory elements is crucial to understanding gene expression, which highlights the importance of the computational detection of overrepresented transcription factor binding sites (TFBSs) in coexpressed or coregulated genes. However, this is a challenging problem, especially when considering higher eukaryotic organisms. Results: We have developed a method, named TFM-Explorer, that searches for locally overrepresented TFBSs in a set of coregulated genes, which are modeled by profiles provided by a database of position weight matrices. The novelty of the method is that it takes advantage of spatial conservation in the sequence and supports multiple species. The efficiency of the underlying algorithm and its robustness to noise allow weak regulatory signals to be detected in large heterogeneous data sets. Conclusion: TFM-Explorer provides an efficient way to predict TFBS overrepresentation in related sequences. Promising results were obtained in a variety of examples in human, mouse, and rat genomes. The software is publicly available at http://bioinfo.lifl.fr/TFM-Explorer.

  14. MetalDetector v2.0: predicting the geometry of metal binding sites from protein sequence

    OpenAIRE

    Passerini, A; Lippi, M.; P. Frasconi

    2011-01-01

    MetalDetector identifies CYS and HIS involved in transition metal protein binding sites, starting from sequence alone. A major new feature of release 2.0 is the ability to predict which residues are jointly involved in the coordination of the same metal ion. The server is available at http://metaldetector.dsi.unifi.it/v2.0/.

  15. Sequence-based prediction of protein-binding sites in DNA: comparative study of two SVM models.

    Science.gov (United States)

    Park, Byungkyu; Im, Jinyong; Tuvshinjargal, Narankhuu; Lee, Wook; Han, Kyungsook

    2014-11-01

    As many structures of protein-DNA complexes have become known in recent years, several computational methods have been developed to predict DNA-binding sites in proteins. However, the inverse problem (i.e., predicting protein-binding sites in DNA) has received much less attention. One of the reasons is that the differences between the interaction propensities of nucleotides are much smaller than those between amino acids. Another reason is that DNA exhibits less diverse sequence patterns than protein. Therefore, predicting protein-binding DNA nucleotides is much harder than predicting DNA-binding amino acids. We computed the interaction propensity (IP) of nucleotide triplets with amino acids using an extensive dataset of protein-DNA complexes, and developed two support vector machine (SVM) models that predict protein-binding nucleotides from sequence data alone. One SVM model predicts protein-binding nucleotides using DNA sequence data alone, and the other SVM model predicts protein-binding nucleotides using both DNA and protein sequences. In a 10-fold cross-validation with 1519 DNA sequences, the SVM model that uses DNA sequence data only predicted protein-binding nucleotides with an accuracy of 67.0%, an F-measure of 67.1%, and a Matthews correlation coefficient (MCC) of 0.340. With an independent dataset of 181 DNAs that were not used in training, it achieved an accuracy of 66.2%, an F-measure of 66.3%, and an MCC of 0.324. The other SVM model, which uses both DNA and protein sequences, achieved an accuracy of 69.6%, an F-measure of 69.6%, and an MCC of 0.383 in a 10-fold cross-validation with 1519 DNA sequences and 859 protein sequences. With an independent dataset of 181 DNAs and 143 proteins, it showed an accuracy of 67.3%, an F-measure of 66.5%, and an MCC of 0.329. Both in cross-validation and in independent testing, the second SVM model, which used both DNA and protein sequence data, showed better performance than the first model, which used DNA sequence data only.
To the best of
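
    The three performance measures quoted in this abstract (accuracy, F-measure and MCC) follow from the counts of a binary confusion matrix. A minimal Python sketch under the standard definitions is given below. The counts are invented for illustration and are not taken from the study.

```python
import math

def binary_metrics(tp, fp, tn, fn):
    """Accuracy, F-measure (F1) and Matthews correlation coefficient
    from the four cells of a binary confusion matrix."""
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f_measure = 2 * precision * recall / (precision + recall)
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom
    return accuracy, f_measure, mcc

# Hypothetical counts of binding / non-binding nucleotide predictions.
acc, f1, mcc = binary_metrics(tp=50, fp=10, tn=30, fn=10)
print(acc, f1, mcc)
```

    MCC uses all four cells, so it stays informative when binding and non-binding nucleotides are unevenly represented. The sketch assumes no row or column of the matrix is empty. Otherwise the divisions are undefined.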

  16. ProBiS-CHARMMing: Web Interface for Prediction and Optimization of Ligands in Protein Binding Sites.

    Science.gov (United States)

    Konc, Janez; Miller, Benjamin T; Štular, Tanja; Lešnik, Samo; Woodcock, H Lee; Brooks, Bernard R; Janežič, Dušanka

    2015-11-23

    Proteins often exist only as apo structures (unligated) in the Protein Data Bank, with their corresponding holo structures (with ligands) unavailable. However, apoproteins may not represent the amino-acid residue arrangement upon ligand binding well, which is especially problematic for molecular docking. We developed the ProBiS-CHARMMing web interface by connecting the ProBiS ( http://probis.cmm.ki.si ) and CHARMMing ( http://www.charmming.org ) web servers into one functional unit that enables prediction of protein-ligand complexes and allows for their geometry optimization and interaction energy calculation. The ProBiS web server predicts ligands (small compounds, proteins, nucleic acids, and single-atom ligands) that may bind to a query protein. This is achieved by comparing its surface structure against a nonredundant database of protein structures and finding those that have binding sites similar to that of the query protein. Existing ligands found in the similar binding sites are then transposed to the query according to predictions from ProBiS. The CHARMMing web server enables, among other things, minimization and potential energy calculation for a wide variety of biomolecular systems, and it is used here to optimize the geometry of the predicted protein-ligand complex structures using the CHARMM force field and to calculate their interaction energies with the corresponding query proteins. We show how ProBiS-CHARMMing can be used to predict ligands and their poses for a particular binding site, and minimize the predicted protein-ligand complexes to obtain representations of holoproteins. The ProBiS-CHARMMing web interface is freely available for academic users at http://probis.nih.gov.

  17. Dopamine transporter comparative molecular modeling and binding site prediction using the LeuT(Aa) leucine transporter as a template.

    Science.gov (United States)

    Indarte, Martín; Madura, Jeffry D; Surratt, Christopher K

    2008-02-15

    Pharmacological and behavioral studies indicate that binding of cocaine and the amphetamines by the dopamine transporter (DAT) protein is principally responsible for initiating the euphoria and addiction associated with these drugs. The lack of an X-ray crystal structure for the DAT or any other member of the neurotransmitter:sodium symporter (NSS) family has hindered understanding of psychostimulant recognition at the atomic level; structural information has been obtained largely from mutagenesis and biophysical studies. The recent publication of a crystal structure for the bacterial leucine transporter LeuT(Aa), a distantly related NSS family homolog, provides for the first time a template for three-dimensional comparative modeling of NSS proteins. A novel computational modeling approach using the capabilities of the Molecular Operating Environment program MOE 2005.06 in conjunction with other comparative modeling servers generated the LeuT(Aa)-directed DAT model. Probable dopamine and amphetamine binding sites were identified within the DAT model using multiple docking approaches. Binding sites for the substrate ligands (dopamine and amphetamine) overlapped substantially with the analogous region of the LeuT(Aa) crystal structure for the substrate leucine. The docking predictions implicated DAT side chains known to be critical for high affinity ligand binding and suggest novel mutagenesis targets in elucidating discrete substrate and inhibitor binding sites. The DAT model may guide DAT ligand QSAR studies, and rational design of novel DAT-binding therapeutics.

  18. Exploiting structural and topological information to improve prediction of RNA-protein binding sites

    Directory of Open Access Journals (Sweden)

    Yuan Zheng

    2009-10-01

    Background: RNA-protein interactions are important for a wide range of biological processes. Current computational methods to predict interacting residues in RNA-protein interfaces predominantly rely on sequence data. It is, however, known that interface residue propensity is closely correlated with structural properties. In this paper we systematically study information obtained from sequences and structures and compare their contributions in this prediction problem. In particular, different geometrical and network topological properties of protein structures are evaluated to improve interface residue prediction accuracy. Results: We have quantified the impact of structural information on the prediction accuracy in comparison to the purely sequence-based approach using two machine learning techniques: Naïve Bayes classifiers and Support Vector Machines. The highest AUC of 0.83 was achieved by a Support Vector Machine, exploiting PSI-BLAST profile, accessible surface area, betweenness-centrality and retention coefficient as input features. Taking into account that our results are based on a larger non-redundant data set, the prediction accuracy is considerably higher than reported in previous, comparable studies. A protein-RNA interface predictor (PRIP) and the data set have been made available at http://www.qfab.org/PRIP. Conclusion: Graph-theoretic properties of residue contact maps derived from protein structures, such as betweenness-centrality, can supplement sequence or structure features to improve the prediction accuracy for binding residues in RNA-protein interactions. While Support Vector Machines perform better on this task, Naïve Bayes classifiers have also been found to achieve good prediction accuracies but require much less training time and are an attractive choice for large-scale predictions.
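
    Betweenness-centrality on a residue contact map, one of the input features named in this abstract, can be sketched in Python as below. The contact definition (one point per residue, 4.0 distance cutoff), the toy coordinates and the function names are assumptions. The centrality itself is computed with Brandes' algorithm for unweighted graphs, which is a standard method and not necessarily the one used by the authors.

```python
import math
from collections import deque

def contact_graph(coords, cutoff):
    """Adjacency lists linking residues whose points lie within `cutoff`."""
    adj = {i: [] for i in range(len(coords))}
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= cutoff:
                adj[i].append(j)
                adj[j].append(i)
    return adj

def betweenness(adj):
    """Unnormalised betweenness centrality (Brandes, unweighted, undirected)."""
    bc = {v: 0.0 for v in adj}
    for s in adj:
        order = []                      # nodes in order of non-decreasing distance
        preds = {v: [] for v in adj}    # shortest-path predecessors
        sigma = {v: 0 for v in adj}     # number of shortest paths from s
        dist = {v: -1 for v in adj}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):       # accumulate dependencies back towards s
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1 + delta[w])
            if w != s:
                bc[w] += delta[w]
    # Each unordered pair was counted from both endpoints.
    return {v: b / 2 for v, b in bc.items()}

# Hypothetical four-residue chain laid out on a line, 3.8 apart.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]
print(betweenness(contact_graph(coords, 4.0)))
```

    Residues that many shortest paths pass through receive high values. In this toy chain those are the two interior residues.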

  19. pkaPS: prediction of protein kinase A phosphorylation sites with the simplified kinase-substrate binding model

    Directory of Open Access Journals (Sweden)

    Schneider Georg

    2007-01-01

    Background: Protein kinase A (cAMP-dependent kinase, PKA) is a serine/threonine kinase, for which ca. 150 substrate proteins are known. Based on a refinement of the recognition motif using the available experimental data, we wished to apply the simplified substrate protein binding model for accurate prediction of PKA phosphorylation sites, an approach that was previously successful for the prediction of lipid posttranslational modifications and of the PTS1 peroxisomal translocation signal. Results: Approximately 20 sequence positions flanking the phosphorylated residue on both sides have been found to be restricted in their sequence variability (region -18...+23 with the site at position 0). The conserved physical pattern can be rationalized in terms of a qualitative binding model with the catalytic cleft of protein kinase A. Positions -6...+4 surrounding the phosphorylation site are influenced to a varying degree by direct interaction with the kinase. This sequence stretch is embedded in an intrinsically disordered region composed preferentially of hydrophilic residues with flexible backbone and small side chain. This knowledge has been incorporated into a simplified analytical model of productive binding of substrate proteins with PKA. Conclusion: The scoring function of the pkaPS predictor can confidently discriminate PKA phosphorylation sites from serines/threonines with non-permissive sequence environments (sensitivity of ~96% at a specificity of ~94%). The tool "pkaPS" has been applied to the whole human proteome. Among the newly predicted PKA targets, there are entirely uncharacterized protein groups as well as apparently well-known families such as those of the ribosomal proteins L21e, L22 and L6. Availability: The supplementary data as well as the prediction tool as a WWW server are available at http://mendel.imp.univie.ac.at/sat/pkaPS.
Reviewers Erik van Nimwegen (Biozentrum, University of Basel, Switzerland, Sandor Pongor (International

  20. Structure prediction and binding sites analysis of curcin protein of Jatropha curcas using computational approaches.

    Science.gov (United States)

    Srivastava, Mugdha; Gupta, Shishir K; Abhilash, P C; Singh, Nandita

    2012-07-01

    Ribosome inactivating proteins (RIPs) are defense proteins in a number of higher-plant species that are directly targeted toward herbivores. Jatropha curcas is one of the biodiesel plants having RIPs. The Jatropha seed meal, after extraction of oil, is rich in curcin, a highly toxic RIP similar to ricin, which makes it unsuitable for animal feed. Although the toxicity of curcin is well documented in the literature, the detailed toxic properties and the 3D structure of curcin have not been determined by X-ray crystallography, NMR spectroscopy or any in silico techniques to date. In this pursuit, the structure of curcin was modeled by a composite approach of 3D structure prediction using threading and ab initio modeling. Model quality was assessed by methods including Ramachandran plot analysis and Qmean score estimation. Further, we applied the protein-ligand docking approach to identify the r-RNA binding residue of curcin. The present work provides the first structural insight into the binding mode of r-RNA adenine to the curcin protein and forms the basis for designing future inhibitors of curcin. Cloning of a future peptide inhibitor within J. curcas can produce non-toxic varieties of J. curcas, which would make the seed-cake suitable as animal feed without curcin detoxification.

  1. Prediction of DtxR regulon: Identification of binding sites and operons controlled by Diphtheria toxin repressor in Corynebacterium diphtheriae

    Directory of Open Access Journals (Sweden)

    Hasnain Seyed

    2004-09-01

    Background: The diphtheria toxin repressor, DtxR, of Corynebacterium diphtheriae has been shown to be an iron-activated transcription regulator that controls not only the expression of diphtheria toxin but also of iron uptake genes. This study aims to identify putative binding sites and operons controlled by DtxR to understand the role of DtxR in patho-physiology of Corynebacterium diphtheriae. Results: The positional Shannon relative entropy method was used to build the DtxR-binding site recognition profile, and the latter was used to identify putative regulatory sites of DtxR within the C. diphtheriae genome. In addition, DtxR-regulated operons were also identified taking into account the predicted DtxR regulatory sites and genome annotation. A few of the predicted motifs were experimentally validated by electrophoretic mobility shift assay. The analysis identifies motifs upstream of the novel iron-regulated genes that code for Formamidopyrimidine-DNA glycosylase (FpG), an enzyme involved in DNA repair, and starvation inducible DNA-binding protein (Dps), which is involved in iron storage and oxidative stress defense. In addition, we have found the DtxR motifs upstream of the genes that code for sortase, which catalyzes anchoring of host-interacting proteins to the cell wall of pathogenic bacteria, and the proteins of the secretory system, which could be involved in translocation of various iron-regulated virulence factors including diphtheria toxin. Conclusions: We have used an in silico approach to identify the putative binding sites and genes controlled by DtxR in Corynebacterium diphtheriae. Our analysis shows that DtxR could provide a molecular link between the Fe2+-induced Fenton's reaction and protection of DNA from oxidative damage. DtxR-regulated Dps prevents the lethal combination of Fe2+ and H2O2 and also protects DNA by nonspecific DNA-binding.
In addition, DtxR could play an important role in host interaction and virulence by regulating the levels of sortase

  2. Computational Characterization and Prediction of Estrogen Receptor Coactivator Binding Site Inhibitors

    Energy Technology Data Exchange (ETDEWEB)

    Bennion, B J; Kulp, K S; Cosman, M; Lightstone, F C

    2005-08-26

    Many carcinogens have been shown to cause tissue specific tumors in animal models. The mechanism for this specificity has not been fully elucidated and is usually attributed to differences in organ metabolism. For heterocyclic amines, potent carcinogens that are formed in well-done meat, the ability to either bind to the estrogen receptor and activate or inhibit an estrogenic response will have a major impact on carcinogenicity. Here we describe our work with the human estrogen receptor alpha (hERa) and the mutagenic/carcinogenic heterocyclic amines PhIP, MeIQx, IFP, and the hydroxylated metabolite of PhIP, N2-hydroxy-PhIP. We found that PhIP, in contrast to the other heterocyclic amines, increased cell-proliferation in MCF-7 human breast cancer cells and activated the hERa receptor. We show mechanistic data supporting this activation both computationally by homology modeling and docking, and by NMR confirmation that PhIP binds with the ligand binding domain (LBD). This binding competes with estradiol (E2) in the native E2 binding cavity of the receptor. We also find that other heterocyclic amines and N2-hydroxy-PhIP inhibit ER activation presumably by binding into another cavity on the LBD. Moreover, molecular dynamics simulations of inhibitory heterocyclic amines reveal a disruption of the surface of the receptor protein involved with protein-protein signaling. We therefore propose that the mechanism for the tissue specific carcinogenicity seen in the rat breast tumors and the presumptive human breast cancer associated with the consumption of well-done meat may be mediated by this receptor activation.

  3. Polymorphisms in MicroRNA Binding Sites Predict Colorectal Cancer Survival

    Science.gov (United States)

    Yang, Ying-Pi; Ting, Wen-Chien; Chen, Lu-Min; Lu, Te-Ling; Bao, Bo-Ying

    2017-01-01

    Background: MicroRNAs (miRNAs) mediate negative regulation of target genes through base pairing, and aberrant miRNA expression has been described in cancers. We hypothesized that single nucleotide polymorphisms (SNPs) within miRNA target sites might influence clinical outcomes in patients with colorectal cancer. Methods: Sixteen common SNPs within miRNA target sites were identified, and the association between these SNPs and overall survival was assessed in colorectal cancer patients using Kaplan-Meier analysis, Cox regression model, and survival tree analysis. Results: Survival tree analysis identified a higher-order genetic interaction profile consisting of the RPS6KB1 rs1051424 and ZNF839 rs11704 that was significantly associated with overall survival. The 5-year survival rates were 74.6%, 62.7%, and 57.1% for the low-, medium-, and high-risk genetic profiles, respectively (P = 0.006). The genetic interaction profile remained significant even after adjusting for potential risk factors. Additional in silico analysis provided evidence that rs1051424 and rs11704 affect RPS6KB1 and ZNF839 expressions, which in turn is significantly correlated with prognosis in colorectal cancer. Conclusion: Our results suggest that the genetic interaction profiles among SNPs within miRNA target sites might be prognostic markers for colorectal cancer survival. PMID:28138309
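    The Kaplan-Meier analysis named in the Methods above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the function name `kaplan_meier` and the toy follow-up times are hypothetical, and no data from the study are used.

```python
def kaplan_meier(times, events):
    """Kaplan-Meier survival estimate.

    times  -- follow-up time for each patient
    events -- 1 if death was observed at that time, 0 if censored
    Returns a list of (time, survival probability) at each event time.
    """
    data = sorted(zip(times, events))
    at_risk = len(data)
    survival = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        deaths = 0
        removed = 0
        # Group all patients sharing the same follow-up time.
        while i < len(data) and data[i][0] == t:
            deaths += data[i][1]
            removed += 1
            i += 1
        if deaths:
            # Multiply by the conditional probability of surviving time t.
            survival *= 1 - deaths / at_risk
            curve.append((t, survival))
        at_risk -= removed
    return curve


# Hypothetical toy cohort: five patients, two of them censored.
toy_curve = kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0])
for t, s in toy_curve:
    print(t, round(s, 3))
```

    Censored patients leave the risk set without lowering the curve, which is what distinguishes this estimate from a simple fraction of survivors.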

  4. Accurate microRNA target prediction using detailed binding site accessibility and machine learning on proteomics data

    Directory of Open Access Journals (Sweden)

    Martin Reczko

    2012-01-01

    MicroRNAs (miRNAs) are a class of small regulatory genes regulating gene expression by targeting messenger RNA. Though computational methods for miRNA target prediction are the prevailing means to analyze their function, they still miss a large fraction of the targeted genes and additionally predict a large number of false positives. Here we introduce a novel algorithm called DIANA-microT-ANN which combines multiple novel target site features through an artificial neural network (ANN) and is trained using recently published high-throughput data measuring the change of protein levels after miRNA overexpression, providing positive and negative targeting examples. The features characterizing each miRNA recognition element include binding structure, conservation level and a specific profile of structural accessibility. The ANN is trained to integrate the features of each recognition element along the 3’ untranslated region into a targeting score, reproducing the relative repression fold change of the protein. Tested on two different sets the algorithm outperforms other widely used algorithms and also predicts a significant number of unique and reliable targets not predicted by the other methods. For 542 human miRNAs DIANA-microT-ANN predicts 120,000 targets not provided by TargetScan 5.0. The algorithm is freely available at http://microrna.gr/microT-ANN.

  5. Interactome-Wide Prediction of Protein-Protein Binding Sites Reveals Effects of Protein Sequence Variation in Arabidopsis thaliana

    NARCIS (Netherlands)

    Valentim, F.L.; Neven, F.; Boyen, P.; Dijk, van A.D.J.

    2012-01-01

    The specificity of protein-protein interactions is encoded in those parts of the sequence that compose the binding interface. Therefore, understanding how changes in protein sequence influence interaction specificity, and possibly the phenotype, requires knowing the location of binding sites in thos

  6. Statistics for Transcription Factor Binding Sites

    OpenAIRE

    2008-01-01

    Transcription factors (TFs) play a key role in gene regulation. They interact with specific binding sites or motifs on the DNA sequence and regulate expression of genes downstream of these binding sites. In silico prediction of potential binding of a TF to a binding site is an important task in computational biology. From a statistical point of view, the DNA sequence is a long text consisting of four different letters ('A','C','G', and 'T'). The binding of a TF to the sequence corresponds to ...

  7. An integrative computational framework based on a two-step random forest algorithm improves prediction of zinc-binding sites in proteins.

    Directory of Open Access Journals (Sweden)

    Cheng Zheng

    Zinc-binding proteins are the most abundant metalloproteins in the Protein Data Bank where the zinc ions usually have catalytic, regulatory or structural roles critical for the function of the protein. Accurate prediction of zinc-binding sites is not only useful for the inference of protein function but also important for the prediction of 3D structure. Here, we present a new integrative framework that combines multiple sequence and structural properties and graph-theoretic network features, followed by an efficient feature selection to improve prediction of zinc-binding sites. We investigate what information can be retrieved from the sequence, structure and network levels that is relevant to zinc-binding site prediction. We perform a two-step feature selection using random forest to remove redundant features and quantify the relative importance of the retrieved features. Benchmarking on a high-quality structural dataset containing 1,103 protein chains and 484 zinc-binding residues, our method achieved >80% recall at a precision of 75% for the zinc-binding residues Cys, His, Glu and Asp on 5-fold cross-validation tests, which is a 10%-28% higher recall at the 75% equal precision compared to SitePredict and zincfinder at residue level using the same dataset. The independent test also indicates that our method has achieved recall of 0.790 and 0.759 at residue and protein levels, respectively, which is a performance better than the other two methods. Moreover, AUC (the Area Under the Curve) and AURPC (the Area Under the Recall-Precision Curve) by our method are also respectively better than those of the other two methods. Our method can not only be applied to large-scale identification of zinc-binding sites when structural information of the target is available, but also give valuable insights into important features arising from different levels that collectively characterize the zinc-binding sites.
The scripts and datasets are available at http://protein.cau.edu.cn/zincidentifier/.
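    The residue-level recall and precision quoted in this abstract follow the usual definitions, which can be sketched as below. This is only an illustration: the function name `precision_recall` and the residue indices are hypothetical and are not taken from the published scripts or dataset.

```python
def precision_recall(predicted, actual):
    """Precision and recall for sets of predicted vs. true binding residues."""
    true_positives = len(predicted & actual)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(actual) if actual else 0.0
    return precision, recall


# Hypothetical residue indices for a single protein chain.
predicted_sites = {12, 15, 40, 77}
true_sites = {12, 15, 40, 52, 90}

precision, recall = precision_recall(predicted_sites, true_sites)
print(f"precision={precision:.2f} recall={recall:.2f}")
```

    Reporting recall at a fixed precision, as the abstract does, amounts to moving the classifier's score threshold until precision reaches the chosen value and then reading off recall.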

  8. Predicting the right spacing between protein immobilization sites on self-assembled monolayers to optimize ligand binding.

    Science.gov (United States)

    Perez, Javier Batista; Tyagi, Deependra; Yang, Mo; Calvo, Loany; Perez, Rolando; Moreno, Ernesto; Zhu, Jinsong

    2015-09-01

    Self-assembled monolayers designed to immobilize capture antibodies are usually prepared using a mixture of functional and inactive linkers. Here, using low molar ratios (1:1 to 1:100) of the two linkers resulted in loss of binding capability of the anti-EGFR (epidermal growth factor receptor) antibody nimotuzumab, as assessed by surface plasmon resonance imaging. We then developed a simple theoretical model to predict the optimal surface density of the functional linker, taking into account the antibody size and linker diameter. A high (1:1000) dilution of the functional linker yielded the best results. As an advantage, this approach does not require chemical modification of the protein.

  9. Predicting zinc binding at the proteome level

    Directory of Open Access Journals (Sweden)

    Rosato Antonio

    2007-02-01

    Background: Metalloproteins are proteins capable of binding one or more metal ions, which may be required for their biological function, for regulation of their activities or for structural purposes. Metal-binding properties remain difficult to predict as well as to investigate experimentally at the whole-proteome level. Consequently, the current knowledge about metalloproteins is only partial. Results: The present work reports on the development of a machine learning method for the prediction of the zinc-binding state of pairs of nearby amino-acids, using predictors based on support vector machines. The predictor was trained using chains containing zinc-binding sites and non-metalloproteins in order to provide positive and negative examples. Results based on strong non-redundancy tests prove that (1) zinc-binding residues can be predicted and (2) modelling the correlation between the binding state of nearby residues significantly improves performance. The trained predictor was then applied to the human proteome. The present results were in good agreement with the outcomes of previous, highly manually curated, efforts for the identification of human zinc-binding proteins. Some unprecedented zinc-binding sites could be identified, and were further validated through structural modelling. The software implementing the predictor is freely available at: http://zincfinder.dsi.unifi.it. Conclusion: The proposed approach constitutes a highly automated tool for the identification of metalloproteins, which provides results of comparable quality with respect to highly manually refined predictions. The ability to model correlations between pairwise residues allows it to obtain a significant improvement over standard 1D based approaches. In addition, the method permits the identification of unprecedented metal sites, providing important hints for the work of experimentalists.

  10. Prediction of altered 3'- UTR miRNA-binding sites from RNA-Seq data: the swine leukocyte antigen complex (SLA) as a model region.

    Directory of Open Access Journals (Sweden)

    Marie-Laure Endale Ahanda

    THE SLA (swine leukocyte antigen, MHC: SLA) genes are the most important determinants of immune, infectious disease and vaccine response in pigs; several genetic associations with immunity and swine production traits have been reported. However, most of the current knowledge on SLA is limited to gene coding regions. MicroRNAs (miRNAs) are small molecules that post-transcriptionally regulate the expression of a large number of protein-coding genes in metazoans, and are suggested to play important roles in fine-tuning immune mechanisms and disease responses. Polymorphisms in either miRNAs or their gene targets may have a significant impact on gene expression by abolishing, weakening or creating miRNA target sites, possibly leading to phenotypic variation. We explored the impact of variants in the 3'-UTR miRNA target sites of genes within the whole SLA region. The combined predictions by TargetScan, PACMIT and TargetSpy, based on different biological parameters, empowered the identification of miRNA target sites and the discovery of polymorphic miRNA target sites (poly-miRTSs). Predictions for three SLA genes characterized by a different range of sequence variation provided proof of principle for the analysis of poly-miRTSs from a total of 144 M RNA-Seq reads collected from different porcine tissues. Twenty-four novel SNPs were predicted to affect miRNA-binding sites in 19 genes of the SLA region. Seven of these genes (SLA-1, SLA-6, SLA-DQA, SLA-DQB1, SLA-DOA, SLA-DOB and TAP1) are linked to antigen processing and presentation functions, which is reminiscent of associations with disease traits reported for altered miRNA binding to MHC genes in humans. An inverse correlation in expression levels was demonstrated between miRNAs and co-expressed SLA targets by exploiting a published dataset (RNA-Seq and small RNA-Seq) of three porcine tissues. Our results support the resource value of RNA-Seq collections to identify SNPs that may lead to altered mi

  11. Prediction of altered 3'- UTR miRNA-binding sites from RNA-Seq data: the swine leukocyte antigen complex (SLA) as a model region.

    Science.gov (United States)

    Endale Ahanda, Marie-Laure; Fritz, Eric R; Estellé, Jordi; Hu, Zhi-Liang; Madsen, Ole; Groenen, Martien A M; Beraldi, Dario; Kapetanovic, Ronan; Hume, David A; Rowland, Robert R R; Lunney, Joan K; Rogel-Gaillard, Claire; Reecy, James M; Giuffra, Elisabetta

    2012-01-01

    THE SLA (swine leukocyte antigen, MHC: SLA) genes are the most important determinants of immune, infectious disease and vaccine response in pigs; several genetic associations with immunity and swine production traits have been reported. However, most of the current knowledge on SLA is limited to gene coding regions. MicroRNAs (miRNAs) are small molecules that post-transcriptionally regulate the expression of a large number of protein-coding genes in metazoans, and are suggested to play important roles in fine-tuning immune mechanisms and disease responses. Polymorphisms in either miRNAs or their gene targets may have a significant impact on gene expression by abolishing, weakening or creating miRNA target sites, possibly leading to phenotypic variation. We explored the impact of variants in the 3'-UTR miRNA target sites of genes within the whole SLA region. The combined predictions by TargetScan, PACMIT and TargetSpy, based on different biological parameters, empowered the identification of miRNA target sites and the discovery of polymorphic miRNA target sites (poly-miRTSs). Predictions for three SLA genes characterized by a different range of sequence variation provided proof of principle for the analysis of poly-miRTSs from a total of 144 M RNA-Seq reads collected from different porcine tissues. Twenty-four novel SNPs were predicted to affect miRNA-binding sites in 19 genes of the SLA region. Seven of these genes (SLA-1, SLA-6, SLA-DQA, SLA-DQB1, SLA-DOA, SLA-DOB and TAP1) are linked to antigen processing and presentation functions, which is reminiscent of associations with disease traits reported for altered miRNA binding to MHC genes in humans. An inverse correlation in expression levels was demonstrated between miRNAs and co-expressed SLA targets by exploiting a published dataset (RNA-Seq and small RNA-Seq) of three porcine tissues. 
Our results support the resource value of RNA-Seq collections to identify SNPs that may lead to altered miRNA regulation patterns.

  12. Adaptive evolution of transcription factor binding sites

    Directory of Open Access Journals (Sweden)

    Berg Johannes

    2004-10-01

    Background: The regulation of a gene depends on the binding of transcription factors to specific sites located in the regulatory region of the gene. The generation of these binding sites and of cooperativity between them are essential building blocks in the evolution of complex regulatory networks. We study a theoretical model for the sequence evolution of binding sites by point mutations. The approach is based on biophysical models for the binding of transcription factors to DNA. Hence we derive empirically grounded fitness landscapes, which enter a population genetics model including mutations, genetic drift, and selection. Results: We show that the selection for factor binding generically leads to specific correlations between nucleotide frequencies at different positions of a binding site. We demonstrate the possibility of rapid adaptive evolution generating a new binding site for a given transcription factor by point mutations. The evolutionary time required is estimated in terms of the neutral (background) mutation rate, the selection coefficient, and the effective population size. Conclusions: The efficiency of binding site formation is seen to depend on two joint conditions: the binding site motif must be short enough and the promoter region must be long enough. These constraints on promoter architecture are indeed seen in eukaryotic systems. Furthermore, we analyse the adaptive evolution of genetic switches and of signal integration through binding cooperativity between different sites. Experimental tests of this picture involving the statistics of polymorphisms and phylogenies of sites are discussed.

  13. Comparison of Transcription Factor Binding Site Models

    KAUST Repository

    Bhuyan, Sharifulislam

    2012-05-01

    Modeling of transcription factor binding sites (TFBSs) and TFBS prediction on genomic sequences are important steps to elucidate transcription regulatory mechanism. Dependency of transcription regulation on a great number of factors such as chemical specificity, molecular structure, genomic and epigenetic characteristics, long distance interaction, makes this a challenging problem. Different experimental procedures generate evidence that DNA-binding domains of transcription factors show considerable DNA sequence specificity. Probabilistic modeling of TFBSs has been moderately successful in identifying patterns from a family of sequences. In this study, we compare performances of different probabilistic models and try to estimate their efficacy over experimental TFBSs data. We build a pipeline to calculate sensitivity and specificity from aligned TFBS sequences for several probabilistic models, such as Markov chains, hidden Markov models, Bayesian networks. Our work, containing relevant statistics and evaluation for the models, can help researchers to choose the most appropriate model for the problem at hand.
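    The kind of evaluation described above (fit a probabilistic model to aligned sites, then measure sensitivity and specificity) can be sketched with the simplest such model, a position weight matrix that treats positions as independent. This is an assumption made for illustration; the abstract's Markov chains, hidden Markov models and Bayesian networks are richer models. All function names and sequences below are hypothetical.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0):
    """Log-odds position weight matrix from equal-length aligned sites.

    Assumes a uniform background (0.25 per base) and adds a pseudocount
    so unseen bases do not produce log(0).
    """
    pwm = []
    for pos in range(len(sites[0])):
        counts = {b: pseudocount for b in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        pwm.append({b: math.log2((counts[b] / total) / 0.25) for b in BASES})
    return pwm


def score(pwm, seq):
    """Sum of per-position log-odds scores for a candidate site."""
    return sum(column[base] for column, base in zip(pwm, seq))


def sensitivity_specificity(pwm, positives, negatives, threshold=0.0):
    """Fraction of true sites called, and fraction of non-sites rejected."""
    true_pos = sum(score(pwm, s) >= threshold for s in positives)
    true_neg = sum(score(pwm, s) < threshold for s in negatives)
    return true_pos / len(positives), true_neg / len(negatives)


# Hypothetical toy data; a real evaluation would score held-out sites.
known_sites = ["TATAAT", "TATAAT", "TACAAT", "TATGAT"]
non_sites = ["GGCCGC", "CGCGGC", "ACGTAC"]

pwm = build_pwm(known_sites)
sens, spec = sensitivity_specificity(pwm, known_sites, non_sites)
print(sens, spec)
```

    Swapping `build_pwm` and `score` for a higher-order model while keeping `sensitivity_specificity` unchanged is one way to compare models on equal terms.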

  14. An improved method for TAL effectors DNA-binding sites prediction reveals functional convergence in TAL repertoires of Xanthomonas oryzae strains.

    Directory of Open Access Journals (Sweden)

    Alvaro L Pérez-Quintero

    Transcription Activators-Like Effectors (TALEs) belong to a family of virulence proteins from the Xanthomonas genus of bacterial plant pathogens that are translocated into the plant cell. In the nucleus, TALEs act as transcription factors inducing the expression of susceptibility genes. A code for TALE-DNA binding specificity and high-resolution three-dimensional structures of TALE-DNA complexes were recently reported. Accurate prediction of TAL Effector Binding Elements (EBEs) is essential to elucidate the biological functions of the many sequenced TALEs as well as for robust design of artificial TALE DNA-binding domains in biotechnological applications. In this work a program with improved EBE prediction performances was developed using an updated specificity matrix and a position weight correction function to account for the matching pattern observed in a validation set of TALE-DNA interactions. To gain a systems perspective on the large TALE repertoires from X. oryzae strains, this program was used to predict rice gene targets for 99 sequenced family members. Integrating predictions and available expression data in a TALE-gene network revealed multiple candidate transcriptional targets for many TALEs as well as several possible instances of functional convergence among TALEs.

  15. Domain-based small molecule binding site annotation

    Directory of Open Access Journals (Sweden)

    Dumontier Michel

    2006-03-01

    Background: Accurate small molecule binding site information for a protein can facilitate studies in drug docking, drug discovery and function prediction, but small molecule binding site protein sequence annotation is sparse. The Small Molecule Interaction Database (SMID), a database of protein domain-small molecule interactions, was created using structural data from the Protein Data Bank (PDB). More importantly it provides a means to predict small molecule binding sites on proteins with a known or unknown structure and unlike prior approaches, removes large numbers of false positive hits arising from transitive alignment errors, non-biologically significant small molecules and crystallographic conditions that overpredict ion binding sites. Description: Using a set of co-crystallized protein-small molecule structures as a starting point, SMID interactions were generated by identifying protein domains that bind to small molecules, using NCBI's Reverse Position Specific BLAST (RPS-BLAST) algorithm. SMID records are available for viewing at http://smid.blueprint.org. The SMID-BLAST tool provides accurate transitive annotation of small-molecule binding sites for proteins not found in the PDB. Given a protein sequence, SMID-BLAST identifies domains using RPS-BLAST and then lists potential small molecule ligands based on SMID records, as well as their aligned binding sites. A heuristic ligand score is calculated based on E-value, ligand residue identity and domain entropy to assign a level of confidence to hits found. SMID-BLAST predictions were validated against a set of 793 experimental small molecule interactions from the PDB, of which 472 (60%) of predicted interactions identically matched the experimental small molecule and of these, 344 had greater than 80% of the binding site residues correctly identified. Further, we estimate that 45% of predictions which were not observed in the PDB validation set may be true positives. Conclusion: By

  16. Tissue specificity of endothelin binding sites

    Energy Technology Data Exchange (ETDEWEB)

    Bolger, G.T.; Liard, F.; Krogsrud, R.; Thibeault, D.; Jaramillo, J. (BioMega, Inc., Laval, Quebec (Canada))

    1990-09-01

    A measurement was made of the binding of 125I-labeled endothelin (125I-ET) to crude membrane fractions prepared from rat aorta, atrium, ventricle, portal vein, trachea, lung parenchyma, vas deferens, ileum, bladder, and guinea-pig taenia coli and lung parenchyma. Scatchard analysis of 125I-ET binding in all tissues indicated binding to a single class of saturable sites. The affinity and density of 125I-ET binding sites varied between tissues. The Kd of 125I-ET binding was approximately 0.5 nM for rat aorta, trachea, lung parenchyma, ventricle, bladder, and vas deferens, and guinea-pig taenia coli and lung parenchyma, 1.8 nM for rat portal vein and atrium, and 3.3 nM for ileum. The Bmax of 125I-ET binding had the following rank order of density in rat tissues: trachea greater than lung parenchyma = vas deferens much greater than aorta = portal vein = atrium greater than bladder greater than ventricle = ileum. The properties of 125I-ET binding were characterized in rat ventricular membranes. 125I-ET binding was time dependent, reaching a maximum within 45-60 min at 25 °C. The calculated microassociation constant was 9.67 × 10^5 s^-1 M^-1. Only 15-20% of 125I-ET dissociated from its binding site even when dissociation was studied as long as 3 h. Preincubation of ventricular membranes with ET prevented binding of 125I-ET. 125I-ET binding was destroyed by boiling of ventricular membranes and was temperature, pH, and cation (Ca2+, Mg2+, and Na+) dependent.
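    For a single class of sites, Scatchard analysis rests on the relation bound/free = (Bmax - bound)/Kd, so a straight-line fit of bound/free against bound gives Kd from the slope and Bmax from the x-intercept. The sketch below illustrates this on synthetic data only: the Kd of 0.5 nM echoes the value quoted in the abstract, while the Bmax of 100 (arbitrary units), the concentrations and the function name `scatchard_fit` are assumptions made for the example.

```python
def scatchard_fit(free, bound):
    """Estimate (Kd, Bmax) from a Scatchard plot.

    Fits bound/free = -bound/Kd + Bmax/Kd by ordinary least squares,
    which is valid for a single class of non-interacting sites.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope            # slope is -1/Kd
    bmax = -intercept / slope    # x-intercept is Bmax
    return kd, bmax


# Synthetic, noise-free single-site data (not measurements from the study).
KD_TRUE = 0.5      # nM
BMAX_TRUE = 100.0  # arbitrary units, hypothetical
free_nM = [0.05, 0.1, 0.25, 0.5, 1.0, 2.0, 5.0]
bound_units = [BMAX_TRUE * f / (KD_TRUE + f) for f in free_nM]

kd, bmax = scatchard_fit(free_nM, bound_units)
print(round(kd, 3), round(bmax, 1))
```

    With real, noisy data the Scatchard transform distorts the error structure, so nonlinear fitting of the untransformed binding curve is often preferred; the linear form is shown here because it is the analysis the abstract names.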

  17. Probing binding hot spots at protein-RNA recognition sites.

    Science.gov (United States)

    Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad

    2016-01-29

    We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein-RNA interfaces to probe the binding hot spots at protein-RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionarily better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein-protein and protein-RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation sites are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein-RNA recognition sites with desired affinity.

  18. Probing binding hot spots at protein–RNA recognition sites

    Science.gov (United States)

    Barik, Amita; Nithin, Chandran; Karampudi, Naga Bhushana Rao; Mukherjee, Sunandan; Bahadur, Ranjit Prasad

    2016-01-01

    We use evolutionary conservation derived from structure alignment of polypeptide sequences along with structural and physicochemical attributes of protein–RNA interfaces to probe the binding hot spots at protein–RNA recognition sites. We find that the degree of conservation varies across the RNA binding proteins; some evolve rapidly compared to others. Additionally, irrespective of the structural class of the complexes, residues at the RNA binding sites are evolutionarily better conserved than those at the solvent exposed surfaces. For recognitions involving duplex RNA, residues interacting with the major groove are better conserved than those interacting with the minor groove. We identify multi-interface residues participating simultaneously in protein–protein and protein–RNA interfaces in complexes where more than one polypeptide is involved in RNA recognition, and show that they are better conserved compared to any other RNA binding residues. We find that the residues at water preservation sites are better conserved than those at hydrated or at dehydrated sites. Finally, we develop a Random Forests model using structural and physicochemical attributes for predicting binding hot spots. The model accurately predicts 80% of the instances of experimental ΔΔG values in a particular class, and provides a stepping-stone towards the engineering of protein–RNA recognition sites with desired affinity. PMID:26365245

  19. Text mining improves prediction of protein functional sites.

    Science.gov (United States)

    Verspoor, Karin M; Cohn, Judith D; Ravikumar, Komandur E; Wall, Michael E

    2012-01-01

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  20. Text mining improves prediction of protein functional sites.

    Directory of Open Access Journals (Sweden)

    Karin M Verspoor

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  1. Genetic polymorphisms in the microRNA binding-sites of the thymidylate synthase gene predict risk and survival in gastric cancer.

    Science.gov (United States)

    Shen, Rong; Liu, Hongliang; Wen, Juyi; Liu, Zhensheng; Wang, Li-E; Wang, Qiming; Tan, Dongfeng; Ajani, Jaffer A; Wei, Qingyi

    2015-09-01

    Thymidylate synthase (TYMS) plays a crucial role in folate metabolism as well as DNA synthesis and repair. We hypothesized that functional polymorphisms in the 3' UTR of TYMS are associated with gastric cancer risk and survival. In the present study, we tested our hypothesis by genotyping three potentially functional (at miRNA binding sites) TYMS SNPs (rs16430 6bp del/ins, rs2790 A>G and rs1059394 C>T) in 379 gastric cancer patients and 431 cancer-free controls. Compared with the rs16430 6bp/6bp + 6bp/0bp genotypes, the 0bp/0bp genotype was associated with significantly increased gastric cancer risk (adjusted OR = 1.72, 95% CI = 1.15-2.58). Similarly, rs2790 GG and rs1059394 TT genotypes were also associated with significantly increased risk (adjusted OR = 2.52, 95% CI = 1.25-5.10 and adjusted OR = 1.57, 95% CI = 1.04-2.35, respectively), compared with AA + AG and CC + CT genotypes, respectively. In the haplotype analysis, the T-G-0bp haplotype was associated with significantly increased gastric cancer risk, compared with the C-A-6bp haplotype (adjusted OR = 1.34, 95% CI = 1.05-1.72). Survival analysis revealed that rs16430 0bp/0bp and rs1059394 TT genotypes were also associated with poor survival in gastric cancer patients who received chemotherapy treatment (adjusted HR = 1.61, 95% CI = 1.05-2.48 and adjusted HR = 1.59, 95% CI = 1.02-2.48, respectively). These results suggest that these three variants in the miRNA binding sites of TYMS may be associated with cancer risk and survival of gastric cancer patients. Larger population studies are warranted to verify these findings.

  2. Computational identification of uncharacterized cruzain binding sites.

    Directory of Open Access Journals (Sweden)

    Jacob D Durrant

    Chagas disease, caused by the unicellular parasite Trypanosoma cruzi, claims 50,000 lives annually and is the leading cause of infectious myocarditis in the world. As current antichagasic therapies like nifurtimox and benznidazole are highly toxic, ineffective at parasite eradication, and subject to increasing resistance, novel therapeutics are urgently needed. Cruzain, the major cysteine protease of Trypanosoma cruzi, is one attractive drug target. In the current work, molecular dynamics simulations and a sequence alignment of a non-redundant, unbiased set of peptidase C1 family members are used to identify uncharacterized cruzain binding sites. The two sites identified may serve as targets for future pharmacological intervention.

  3. Binding-site assessment by virtual fragment screening.

    Directory of Open Access Journals (Sweden)

    Niu Huang

    The accurate prediction of protein druggability (propensity to bind high-affinity drug-like small molecules) would greatly benefit the fields of chemical genomics and drug discovery. We have developed a novel approach to quantitatively assess protein druggability by computationally screening a fragment-like compound library. In analogy to NMR-based fragment screening, we dock approximately 11,000 fragments against a given binding site and compute a computational hit rate based on the fraction of molecules that exceed an empirically chosen score cutoff. We perform a large-scale evaluation of the approach on four datasets, totaling 152 binding sites. We demonstrate that computed hit rates correlate with hit rates measured experimentally in a previously published NMR-based screening method. Secondly, we show that the in silico fragment screening method can be used to distinguish known druggable and non-druggable targets, including both enzymes and protein-protein interaction sites. Finally, we explore the sensitivity of the results to different receptor conformations, including flexible protein-protein interaction sites. Besides its original aim of assessing the druggability of different protein targets, this method could be used to identify druggable conformations of flexible binding sites for lead discovery, and to suggest strategies for growing or joining initial fragment hits to obtain more potent inhibitors.
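
    The computational hit rate described above is the fraction of docked fragments that pass a score cutoff. A minimal sketch follows, assuming that more negative docking scores are better, so that a fragment counts as a hit when its score is at or below the cutoff. The scores, the cutoff value and the function name are hypothetical.

```python
def computational_hit_rate(scores, cutoff):
    """Fraction of fragments whose docking score passes the cutoff.

    Assumption: lower (more negative) scores are more favorable,
    so a hit is any fragment with score <= cutoff.
    """
    if not scores:
        raise ValueError("no docking scores supplied")
    hits = sum(1 for s in scores if s <= cutoff)
    return hits / len(scores)


# Hypothetical docking scores for five fragments against one binding site.
example_scores = [-35.2, -12.0, -41.7, -8.5, -30.1]
print(computational_hit_rate(example_scores, cutoff=-30.0))
```

    If the scoring function in use assigns higher values to better poses, the comparison should be reversed.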

  4. Oxytocin binding sites in bovine mammary tissue

    Energy Technology Data Exchange (ETDEWEB)

    Zhao, Xin.

    1989-01-01

    Oxytocin binding sites were identified and characterized in bovine mammary tissue. (³H)-oxytocin binding reached equilibrium by 50 min at 20 °C and by 8 hr at 4 °C. The half-time of displacement at 20 °C was approximately 1 hr. Thyrotropin releasing hormone, adrenocorticotropin, angiotensin I, angiotensin II, pentagastrin, bradykinin, xenopsin and L-valyl-histidyl-L-leucyl-L-threonyl-L-prolyl-L-valyl-L-glutamyl-L-lysine were not competitive. In the presence of 10 nM LiCl, addition of oxytocin to dispersed bovine mammary cells, in which phosphatidylinositol was pre-labelled, caused a time and dose-dependent increase in radioactive inositol monophosphate incorporation. The possibility that there are distinct vasopressin receptors in bovine mammary tissue was investigated. (³H)-vasopressin binding reached equilibrium by 40 min at 20 °C. The half-time of displacement at 20 °C was approximately 1 hr. The ability of the peptides to inhibit (³H)-vasopressin binding was: (Thr⁴,Gly⁷)-oxytocin > Arg⁸-vasopressin > (lys⁸)-vasopressin > (Deamino¹,D-arg⁸)-vasopressin > oxytocin > d(CH₂)₅Tyr(Me)AVP.

  5. Incorporating evolution of transcription factor binding sites into annotated alignments

    Indian Academy of Sciences (India)

    Abha S Bais; Steffen Grossmann; Martin Vingron

    2007-08-01

    Identifying transcription factor binding sites (TFBSs) is essential to elucidate putative regulatory mechanisms. A common strategy is to combine cross-species conservation with single sequence TFBS annotation to yield "conserved TFBSs". Most current methods in this field adopt a multi-step approach that segregates the two aspects. Again, it is widely accepted that the evolutionary dynamics of binding sites differ from those of the surrounding sequence. Hence, it is desirable to have an approach that explicitly takes this factor into account. Although a plethora of approaches have been proposed for the prediction of conserved TFBSs, very few explicitly model TFBS evolutionary properties, while additionally being multi-step. Recently, we introduced a novel approach to simultaneously align and annotate conserved TFBSs in a pair of sequences. Building upon the standard Smith-Waterman algorithm for local alignments, SimAnn introduces additional states for profiles to output extended alignments or annotated alignments. That is, alignments with parts annotated as gaplessly aligned TFBSs (pair-profile hits) are generated. Moreover, the pair-profile related parameters are derived in a sound statistical framework. In this article, we extend this approach to explicitly incorporate evolution of binding sites in the SimAnn framework. We demonstrate the extension in the theoretical derivations through two position-specific evolutionary models, previously used for modelling TFBS evolution. In a simulated setting, we provide a proof of concept that the approach works given the underlying assumptions, as compared to the original work. Finally, using a real dataset of experimentally verified binding sites in human-mouse sequence pairs, we compare the new approach (eSimAnn) to an existing multi-step tool that also considers TFBS evolution. Although it is widely accepted that binding sites evolve differently from the surrounding sequences, most comparative TFBS identification
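
    The abstract above builds on the standard Smith-Waterman algorithm for local alignment. The sketch below shows only that baseline recurrence with a linear gap penalty. It does not include the additional profile states that SimAnn or eSimAnn introduce, and the scoring values are illustrative assumptions.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Best local alignment score of sequences a and b (linear gap penalty)."""
    prev = [0] * (len(b) + 1)  # previous row of the dynamic-programming matrix
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0]
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            score = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            curr.append(score)
            best = max(best, score)
        prev = curr
    return best


print(smith_waterman_score("TTACGTTT", "GGACGGG"))
```

    Keeping only two rows is enough to obtain the score. Recovering the alignment itself would require storing the full matrix and tracing back from the best cell.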

  6. STUDY OF ESTROGEN BINDING SITE ON HUMAN EJACULATED SPERMATOZOA

    Institute of Scientific and Technical Information of China (English)

    CHU Jin-Shong; WANG Yi-Fei

    1989-01-01

    The specific estrogen binding site for 17β-estradiol has been investigated on human spermatozoa by electron microscopic autoradiography. The results show that the binding sites were distributed over the surface of human spermatozoa: acrosomal cap, equatorial

  7. A Conserved Steroid Binding Site in Cytochrome c Oxidase

    Energy Technology Data Exchange (ETDEWEB)

    Qin, Ling; Mills, Denise A.; Buhrow, Leann; Hiser, Carrie; Ferguson-Miller, Shelagh (Michigan)

    2010-09-02

    Micromolar concentrations of the bile salt deoxycholate are shown to rescue the activity of an inactive mutant, E101A, in the K proton pathway of Rhodobacter sphaeroides cytochrome c oxidase. A crystal structure of the wild-type enzyme reveals, as predicted, deoxycholate bound with its carboxyl group at the entrance of the K path. Since cholate is a known potent inhibitor of bovine oxidase and is seen in a similar position in the bovine structure, the crystallographically defined, conserved steroid binding site could reveal a regulatory site for steroids or structurally related molecules that act on the essential K proton path.

  8. Being a binding site: characterizing residue composition of binding sites on proteins.

    Science.gov (United States)

    Iván, Gábor; Szabadka, Zoltán; Grolmusz, Vince

    2007-12-30

    The Protein Data Bank contains the description of more than 45,000 three-dimensional protein and nucleic-acid structures today. Although the PDB began as a computer-readable depository of crystallographic data complementing printed articles, the proper interpretation of the content of the individual files in the PDB still frequently needs the detailed information found in the citing publication. This fact implies that the fully automatic processing of the whole PDB is a very hard task. We first cleaned and re-structured the PDB data, then analyzed the residue composition of the binding sites in the whole PDB for frequency and for hidden association rules. Main results of the paper: (i) the cleaning and repairing algorithm; (ii) redundancy elimination from the data; (iii) application of association rule mining to the cleaned non-redundant data set. We have found numerous significant relations of the residue-composition of the ligand binding sites on protein surfaces, summarized in two figures. One of the classical data-mining methods for exploring implication-rules, the association-rule mining, is capable of finding previously unknown residue-set preferences of bound ligands on protein surfaces. Since protein-ligand binding is a key step in enzymatic mechanisms and in drug discovery, these uncovered preferences in the study of more than 19,500 binding sites may help in identifying new binding protein-ligand pairs.
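
    Association rule mining, as applied above to the residue composition of binding sites, rests on two quantities: the support of a residue set and the confidence of a rule. The sketch below computes both on a small hypothetical data set. The residue sets are invented for illustration and are not taken from the study.

```python
def support(itemset, transactions):
    """Fraction of transactions (binding sites) that contain every item in itemset."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def confidence(antecedent, consequent, transactions):
    """Estimated P(consequent | antecedent), i.e. support(A and C) / support(A)."""
    joint = support(set(antecedent) | set(consequent), transactions)
    return joint / support(antecedent, transactions)


# Hypothetical binding sites, each described by the residue types lining it.
sites = [
    frozenset({"HIS", "CYS", "ASP"}),
    frozenset({"HIS", "CYS"}),
    frozenset({"HIS", "GLU"}),
    frozenset({"GLY", "SER"}),
]

print(support({"HIS", "CYS"}, sites))       # share of sites with both residues
print(confidence({"HIS"}, {"CYS"}, sites))  # how often HIS sites also contain CYS
```

    A rule is usually reported only when both values exceed chosen thresholds. Note that `confidence` divides by zero if the antecedent never occurs in the data.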

  9. Mechanisms of in vivo binding site selection of the hematopoietic master transcription factor PU.1.

    Science.gov (United States)

    Pham, Thu-Hang; Minderjahn, Julia; Schmidl, Christian; Hoffmeister, Helen; Schmidhofer, Sandra; Chen, Wei; Längst, Gernot; Benner, Christopher; Rehli, Michael

    2013-07-01

    The transcription factor PU.1 is crucial for the development of many hematopoietic lineages and its binding patterns significantly change during differentiation processes. However, the 'rules' for binding or not-binding of potential binding sites are only partially understood. To unveil basic characteristics of PU.1 binding site selection in different cell types, we studied the binding properties of PU.1 during human macrophage differentiation. Using in vivo and in vitro binding assays, as well as computational prediction, we show that PU.1 selects its binding sites primarily based on sequence affinity, which results in the frequent autonomous binding of high affinity sites in DNase I inaccessible regions (25-45% of all occupied sites). Increasing PU.1 concentrations and the availability of cooperative transcription factor interactions during lineage differentiation both decrease affinity thresholds for in vivo binding and fine-tune cell type-specific PU.1 binding, which seems to be largely independent of DNA methylation. Occupied sites were predominantly detected in active chromatin domains, which are characterized by higher densities of PU.1 recognition sites and neighboring motifs for cooperative transcription factors. Our study supports a model of PU.1 binding control that involves motif-binding affinity, PU.1 concentration, cooperativeness with neighboring transcription factor sites and chromatin domain accessibility, which likely applies to all PU.1 expressing cells.

  10. Chloride binding site of neurotransmitter sodium symporters.

    Science.gov (United States)

    Kantcheva, Adriana K; Quick, Matthias; Shi, Lei; Winther, Anne-Marie Lund; Stolzenberg, Sebastian; Weinstein, Harel; Javitch, Jonathan A; Nissen, Poul

    2013-05-21

    Neurotransmitter:sodium symporters (NSSs) play a critical role in signaling by reuptake of neurotransmitters. Eukaryotic NSSs are chloride-dependent, whereas prokaryotic NSS homologs like LeuT are chloride-independent but contain an acidic residue (Glu290 in LeuT) at a site where eukaryotic NSSs have a serine. The LeuT-E290S mutant displays chloride-dependent activity. We show that, in LeuT-E290S cocrystallized with bromide or chloride, the anion is coordinated by side chain hydroxyls from Tyr47, Ser290, and Thr254 and the side chain amide of Gln250. The bound anion and the nearby sodium ion in the Na1 site organize a connection between their coordinating residues and the extracellular gate of LeuT through a continuous H-bond network. The specific insights from the structures, combined with results from substrate binding studies and molecular dynamics simulations, reveal an anion-dependent occlusion mechanism for NSS and shed light on the functional role of chloride binding.

  11. Structural Fingerprints of Transcription Factor Binding Site Regions

    Directory of Open Access Journals (Sweden)

    Peter Willett

    2009-03-01

    Fourier transforms are a powerful tool in the prediction of DNA sequence properties, such as the presence/absence of codons. We have previously compiled a database of the structural properties of all 32,896 unique DNA octamers. In this work we apply Fourier techniques to the analysis of the structural properties of human chromosomes 21 and 22 and also to three sets of transcription factor binding sites within these chromosomes. We find that, for a given structural property, the structural property power spectra of chromosomes 21 and 22 are strikingly similar. We find common peaks in their power spectra for both Sp1 and p53 transcription factor binding sites. We use the power spectra as a structural fingerprint and perform similarity searching in order to find transcription factor binding site regions. This approach provides a new strategy for searching the genome data for information. Although it is difficult to understand the relationship between specific functional properties and the set of structural parameters in our database, our structural fingerprints nevertheless provide a useful tool for searching for function information in sequence data. The power spectrum fingerprints provide a simple, fast method for comparing a set of functional sequences, in this case transcription factor binding site regions, with the sequences of whole chromosomes. On its own, the power spectrum fingerprint does not find all transcription factor binding sites in a chromosome, but the results presented here show that in combination with other approaches, this technique will improve the chances of identifying functional sequences hidden in genomic data.
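
    The power-spectrum fingerprint described above can be illustrated with a plain discrete Fourier transform of a numeric property profile along a sequence. The sketch below is a minimal version under stated assumptions: the profile is a synthetic sine wave rather than a real structural property, and the normalization is one common choice, not necessarily the authors'. The last lines check that 32,896 is consistent with counting an octamer and its reverse complement once.

```python
import cmath
import math


def power_spectrum(values):
    """Power at frequencies k = 0 .. n//2 of a mean-centered numeric profile."""
    n = len(values)
    mean = sum(values) / n
    centered = [v - mean for v in values]
    spectrum = []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centered))
        spectrum.append(abs(coeff) ** 2 / n)
    return spectrum


# Synthetic profile with period 8 over 24 positions, i.e. 3 cycles.
profile = [math.sin(2 * math.pi * 3 * t / 24) for t in range(24)]
spectrum = power_spectrum(profile)
dominant = max(range(len(spectrum)), key=spectrum.__getitem__)
print(dominant)

# 4**8 octamers in total; the 4**4 that equal their own reverse complement are
# counted once, and every other octamer pairs with its reverse complement.
unique_octamers = (4 ** 8 + 4 ** 4) // 2
print(unique_octamers)
```

    The direct sum costs O(n²) operations, which is fine for a short window. Chromosome-scale profiles would call for a fast Fourier transform library instead.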

  12. Cutoff lensing: predicting catalytic sites in enzymes

    Science.gov (United States)

    Aubailly, Simon; Piazza, Francesco

    2015-10-01

    Predicting function-related amino acids in proteins with unknown function or unknown allosteric binding sites in drug-targeted proteins is a task of paramount importance in molecular biomedicine. In this paper we introduce a simple, light and computationally inexpensive structure-based method to identify catalytic sites in enzymes. Our method, termed cutoff lensing, is a general procedure consisting in letting the cutoff used to build an elastic network model increase to large values. A validation of our method against a large database of annotated enzymes shows that optimal values of the cutoff exist such that three different structure-based indicators allow one to recover a maximum of the known catalytic sites. Interestingly, we find that the larger the structures the greater the predictive power afforded by our method. Possible ways to combine the three indicators into a single figure of merit and into a specific sequential analysis are suggested and discussed with reference to the classic case of HIV-protease. Our method could be used as a complement to other sequence- and/or structure-based methods to narrow the results of large-scale screenings.
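
    Cutoff lensing, as summarized above, varies the cutoff used to build an elastic network model. The sketch below shows only that ingredient: building the connectivity (Kirchhoff) matrix of a network for a given cutoff and watching node connectivity grow as the cutoff increases. The coordinates are hypothetical, and the three structure-based indicators used in the paper are not reproduced here.

```python
import math


def kirchhoff(coords, cutoff):
    """Connectivity matrix: -1 for pairs within cutoff, node degree on the diagonal."""
    n = len(coords)
    K = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if math.dist(coords[i], coords[j]) <= cutoff:
                K[i][j] = K[j][i] = -1
                K[i][i] += 1
                K[j][j] += 1
    return K


# Hypothetical positions (in angstroms) of four nodes spaced 3.8 apart on a line.
coords = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (7.6, 0.0, 0.0), (11.4, 0.0, 0.0)]

for cutoff in (4.0, 8.0, 12.0):
    degrees = [row[i] for i, row in enumerate(kirchhoff(coords, cutoff))]
    print(cutoff, degrees)
```

    In a real application the nodes would typically be one representative atom per residue taken from a structure file, and an indicator would be evaluated at each cutoff value.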

  13. Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively.

    Science.gov (United States)

    Clifford, Jacob; Adami, Christoph

    2015-09-02

    Transcription factor binding to the surface of DNA regulatory regions is one of the primary causes of regulating gene expression levels. A probabilistic approach to model protein-DNA interactions at the sequence level is through position weight matrices (PWMs) that estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain about 0.5 bits of information about the presence of Twist transcription factor binding sites in the flanking sequence. We also find that Dorsal binding site detectors conditioned on flanking sequence information make better predictions about what is a Dorsal site relative to background DNA than detection without information about flanking sequence features.
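
    A position weight matrix scores a candidate site as a sum of per-position terms, which is what the positional independence assumption mentioned above amounts to. The sketch below builds a log-odds PWM from aligned sites and scores two sequences. The example sites, the pseudocount and the uniform background are hypothetical choices, not data from the study.

```python
import math


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Per-position log2 odds of each base relative to a uniform background."""
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(len(sites[0])):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def score(pwm, seq):
    """Log-odds score of seq; positions contribute independently (a plain sum)."""
    return sum(column[base] for column, base in zip(pwm, seq))


# Hypothetical aligned binding sites, all of the same length.
known_sites = ["GGGAAA", "GGGAAT", "GGGTAA", "GGAAAA"]
pwm = build_pwm(known_sites)
print(score(pwm, "GGGAAA"), score(pwm, "CCCTTT"))
```

    A conditional PWM in the sense of the abstract could be approximated by calling `build_pwm` separately on the sites that do and do not have a partner site in their flanking sequence, then comparing the two matrices.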

  14. Grafting of protein-protein binding sites

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    A strategy for grafting protein-protein binding sites is described. Firstly, key interaction residues at the interface of the ligand protein to be grafted are identified and suitable positions in the scaffold protein for grafting these key residues are sought. Secondly, the scaffold proteins are superposed onto the ligand protein based on the corresponding Cα and Cβ atoms. The complementarity between the scaffold protein and the receptor protein is evaluated and only matches with a high score are accepted. The relative position between scaffold and receptor proteins is adjusted so that the interface has a reasonable packing density. Then the scaffold protein is mutated to the corresponding residues in the ligand protein at each candidate position. Residues having bad steric contacts with the receptor protein, or buried charged residues not involved in the formation of any salt bridge, are also mutated. Finally, the mutated scaffold protein in complex with the receptor protein is co-minimized with CHARMM. In addition, we deduce a scoring function to evaluate the affinity between mutated scaffold protein and receptor protein by statistical analysis of rigid binding data sets.

  15. Predicting binding free energies in solution

    CERN Document Server

    Jensen, Jan H

    2015-01-01

    Recent predictions of absolute binding free energies of host-guest complexes in aqueous solution using electronic structure theory have been encouraging for some systems, while other systems remain problematic. In this paper I summarize some of the many factors that could easily contribute 1-3 kcal/mol errors at 298 K: three-body dispersion effects, molecular symmetry, anharmonicity, spurious imaginary frequencies, insufficient conformational sampling, wrong or changing ionization states, errors in the solvation free energy of ions, and explicit solvent (and ion) effects that are not well-represented by continuum models. While the paper is primarily a synthesis of previously published work there are two new results: the adaptation of Legendre transformed free energies to electronic structure theory and a use of water clusters that maximizes error cancellation in binding free energies computed using explicit solvent molecules. While I focus on binding free energies in aqueous solution the approach also a...

  16. The Binding Mode Prediction and Similar Ligand Potency in the Active Site of Vitamin D Receptor with QM/MM Interaction, MESP, and MD Simulation.

    Science.gov (United States)

    Selvaraman, Nagamani; Selvam, Saravana Kumar; Muthusamy, Karthikeyan

    2016-08-01

    Non-secosteroidal ligands are well-known vitamin D receptor (VDR) agonists. In this study, we used a combined QM/MM approach to define the protein-ligand interaction energy and found a strong positive correlation of both the QM/MM interaction energy and the binding free energy with the biological activity. A molecular dynamics simulation study was performed, and specific interactions were extensively studied. The molecular docking results and surface analysis shed light on steric and electrostatic complementarities of these non-secosteroidal ligands to VDR. Finally, the drug likeness properties were also calculated and found to be within the acceptable range. The results show that bulky group substitutions in the side chain decrease the VDR activity, whereas a small substitution increases it. Functional analyses of H393A and H301A mutations substantiate their roles in the VDR agonistic and antagonistic activities. Apart from His393 and His301, two other amino acids in the hinge region, viz. Ser233 and Arg270, acted as electron donors/acceptors specific to the agonist in the distinct ligand potency. The results from this study disclose the binding mechanism of VDR agonists and the structural modifications required to improve the selectivity.

  17. Detection of secondary binding sites in proteins using fragment screening.

    Science.gov (United States)

    Ludlow, R Frederick; Verdonk, Marcel L; Saini, Harpreet K; Tickle, Ian J; Jhoti, Harren

    2015-12-29

    Proteins need to be tightly regulated as they control biological processes in most normal cellular functions. The precise mechanisms of regulation are rarely completely understood but can involve binding of endogenous ligands and/or partner proteins at specific locations on a protein that can modulate function. Often, these additional secondary binding sites appear separate from the primary binding site, which, for example for an enzyme, may bind a substrate. In previous work, we have uncovered several examples in which secondary binding sites were discovered on proteins using fragment screening approaches. In each case, we were able to establish that the newly identified secondary binding site was biologically relevant as it was able to modulate function by the binding of a small molecule. In this study, we investigate how often secondary binding sites are located on proteins by analyzing 24 protein targets for which we have performed a fragment screen using X-ray crystallography. Our analysis shows that, surprisingly, the majority of proteins contain secondary binding sites based on their ability to bind fragments. Furthermore, sequence analysis of these previously unknown sites indicates high conservation, which suggests that they may have a biological function, perhaps via an allosteric mechanism. Comparing the physicochemical properties of the secondary sites with known primary ligand binding sites also shows broad similarities, indicating that many of the secondary sites may be druggable in nature with small molecules that could provide new opportunities to modulate potential therapeutic targets.

  18. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    To know the map between transcription factors (TFs) and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs) have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%, far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  19. Shared binding sites in Lepidoptera for Bacillus thuringiensis Cry1Ja and Cry1A toxins.

    Science.gov (United States)

    Herrero, S; González-Cabrera, J; Tabashnik, B E; Ferré, J

    2001-12-01

    Bacillus thuringiensis toxins act by binding to specific target sites in the insect midgut epithelial membrane. The best-known mechanism of resistance to B. thuringiensis toxins is reduced binding to target sites. Because alteration of a binding site shared by several toxins may cause resistance to all of them, knowledge of which toxins share binding sites is useful for predicting cross-resistance. Conversely, cross-resistance among toxins suggests that the toxins share a binding site. At least two strains of diamondback moth (Plutella xylostella) with resistance to Cry1A toxins and reduced binding of Cry1A toxins have strong cross-resistance to Cry1Ja. Thus, we hypothesized that Cry1Ja shares binding sites with Cry1A toxins. We tested this hypothesis in six moth and butterfly species, each from a different family: Cacyreus marshalli (Lycaenidae), Lobesia botrana (Tortricidae), Manduca sexta (Sphingidae), Pectinophora gossypiella (Gelechiidae), P. xylostella (Plutellidae), and Spodoptera exigua (Noctuidae). Although the extent of competition varied among species, experiments with biotinylated Cry1Ja and radiolabeled Cry1Ac showed that Cry1Ja and Cry1Ac competed for binding sites in all six species. A recent report also indicates shared binding sites for Cry1Ja and Cry1A toxins in Heliothis virescens (Noctuidae). Thus, shared binding sites for Cry1Ja and Cry1A occur in all lepidopteran species tested so far.

  20. rVISTA for Comparative Sequence-Based Discovery of Functional Transcription Factor Binding Sites

    Energy Technology Data Exchange (ETDEWEB)

    Loots, Gabriela G.; Ovcharenko, Ivan; Pachter, Lior; Dubchak, Inna; Rubin, Edward M.

    2002-03-08

    Identifying transcriptional regulatory elements represents a significant challenge in annotating the genomes of higher vertebrates. We have developed a computational tool, rVISTA, for high-throughput discovery of cis-regulatory elements that combines transcription factor binding site prediction and the analysis of inter-species sequence conservation. Here, we illustrate the ability of rVISTA to identify true transcription factor binding sites through the analysis of AP-1 and NFAT binding sites in the 1 Mb well-annotated cytokine gene cluster (Hs5q31; Mm11). The exploitation of the orthologous human-mouse data set resulted in the elimination of 95 percent of the 38,000 binding sites predicted upon analysis of the human sequence alone, while it identified 87 percent of the experimentally verified binding sites in this region.
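
    The combination of site prediction with inter-species conservation described above can be pictured as a filter over predicted sites. The sketch below keeps a predicted site only if the aligned human and mouse sequences are sufficiently identical across it. The alignment, the site coordinates and the 80 percent threshold are hypothetical, and the sketch is not the actual rVISTA procedure.

```python
def percent_identity(aln_a, aln_b, start, end):
    """Percent of alignment columns in [start, end) with identical, non-gap bases."""
    pairs = list(zip(aln_a[start:end], aln_b[start:end]))
    same = sum(1 for x, y in pairs if x == y and x != "-")
    return 100.0 * same / len(pairs)


def conserved_sites(aln_a, aln_b, sites, min_identity=80.0):
    """Keep predicted sites (start, end) whose aligned window is conserved."""
    return [(s, e) for s, e in sites
            if percent_identity(aln_a, aln_b, s, e) >= min_identity]


# Hypothetical pairwise alignment; both strings have the same length.
human = "TGACTCAGGGGTGAGTCA"
mouse = "TGACTCAGG-ATCAGACA"
predicted = [(0, 7), (11, 18)]  # sites predicted on the human sequence alone

print(conserved_sites(human, mouse, predicted))
```

    Coordinates here are alignment columns. Mapping them back to genomic positions would require tracking gaps in the human row.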

  1. Mapping of the Neisseria meningitidis NadA cell-binding site: relevance of predicted α-helices in the NH2-terminal and dimeric coiled-coil regions.

    Science.gov (United States)

    Tavano, Regina; Capecchi, Barbara; Montanari, Paolo; Franzoso, Susanna; Marin, Oriano; Sztukowska, Maryta; Cecchini, Paola; Segat, Daniela; Scarselli, Maria; Aricò, Beatrice; Papini, Emanuele

    2011-01-01

    NadA is a trimeric autotransporter protein of Neisseria meningitidis belonging to the group of oligomeric coiled-coil adhesins. It is implicated in the colonization of the human upper respiratory tract by hypervirulent serogroup B N. meningitidis strains and is part of a multiantigen anti-serogroup B vaccine. Structure prediction indicates that NadA is composed of a COOH-terminal membrane anchor (also necessary for autotranslocation to the bacterial surface), an intermediate elongated coiled-coil-rich stalk, and an NH2-terminal region involved in cell interaction. Electron microscopy analysis and structure prediction suggest that the apical region of NadA forms a compact and globular domain. Deletion studies proved that the NH2-terminal sequence (residues 24 to 87) is necessary for cell adhesion. In this study, to better define the NadA cell binding site, we exploited (i) a panel of NadA mutants lacking sequences along the coiled-coil stalk and (ii) several oligoclonal rabbit antibodies, and their relative Fab fragments, directed to linear epitopes distributed along the NadA ectodomain. We identified two critical regions for the NadA-cell receptor interaction with Chang cells: the NH2 globular head domain and the NH2 dimeric intrachain coiled-coil α-helices stemming from the stalk. This raises the importance of different modules within the predicted NadA structure. The identification of linear epitopes involved in receptor binding that are able to induce interfering antibodies reinforces the importance of NadA as a vaccine antigen.

  2. Cloud computing for protein-ligand binding site comparison.

    Science.gov (United States)

    Hung, Che-Lun; Hua, Guan-Jie

    2013-01-01

    The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.

  3. Cloud Computing for Protein-Ligand Binding Site Comparison

    Directory of Open Access Journals (Sweden)

    Che-Lun Hung

    2013-01-01

    The proteome-wide analysis of protein-ligand binding sites and their interactions with ligands is important in structure-based drug design and in understanding ligand cross reactivity and toxicity. The well-known and commonly used software, SMAP, has been designed for 3D ligand binding site comparison and similarity searching of a structural proteome. SMAP can also predict drug side effects and reassign existing drugs to new indications. However, the computing scale of SMAP is limited. We have developed a high availability, high performance system that expands the comparison scale of SMAP. This cloud computing service, called Cloud-PLBS, combines the SMAP and Hadoop frameworks and is deployed on a virtual cloud computing platform. To handle the vast amount of experimental data on protein-ligand binding site pairs, Cloud-PLBS exploits the MapReduce paradigm as a management and parallelizing tool. Cloud-PLBS provides a web portal and scalability through which biologists can address a wide range of computer-intensive questions in biology and drug discovery.

  4. DBD2BS: connecting a DNA-binding protein with its binding sites.

    Science.gov (United States)

    Chien, Ting-Ying; Lin, Chih-Kang; Lin, Chih-Wei; Weng, Yi-Zhong; Chen, Chien-Yu; Chang, Darby Tien-Hao

    2012-07-01

    By binding to short and highly conserved DNA sequences in genomes, DNA-binding proteins initiate, enhance or repress biological processes. Accurately identifying such binding sites, often represented by position weight matrices (PWMs), is an important step in understanding the control mechanisms of cells. When given coordinates of a DNA-binding domain (DBD) bound with DNA, a potential function can be used to estimate the change of binding affinity after base substitutions, where the changes can be summarized as a PWM. This technique provides an effective alternative when the chromatin immunoprecipitation data are unavailable for PWM inference. To facilitate the procedure of predicting PWMs based on protein-DNA complexes or even structures of the unbound state, the web server, DBD2BS, is presented in this study. The DBD2BS uses an atom-level knowledge-based potential function to predict PWMs characterizing the sequences to which the query DBD structure can bind. For unbound queries, a list of 1066 DBD-DNA complexes (including 1813 protein chains) is compiled for use as templates for synthesizing bound structures. The DBD2BS provides users with an easy-to-use interface for visualizing the PWMs predicted based on different templates and the spatial relationships of the query protein, the DBDs and the DNAs. The DBD2BS is the first attempt to predict PWMs of DBDs from unbound structures rather than from bound ones. This approach increases the number of existing protein structures that can be exploited when analyzing protein-DNA interactions. In a recent study, the authors showed that the kernel adopted by the DBD2BS can generate PWMs consistent with those obtained from the experimental data. The use of DBD2BS to predict PWMs can be incorporated with sequence-based methods to discover binding sites in genome-wide studies. Available at: http://dbd2bs.csie.ntu.edu.tw/, http://dbd2bs.csbb.ntu.edu.tw/, and http://dbd2bs.ee.ncku.edu.tw.
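
    The step of summarizing substitution energy changes as a PWM can be illustrated with a small sketch. The Boltzmann weighting used here is an assumption chosen for illustration; the abstract does not state how DBD2BS converts energies into matrix entries, and the function name `energies_to_pwm`, the `kt` parameter, and the example energies are hypothetical.

```python
import math
from typing import Dict, List

BASES = "ACGT"


def energies_to_pwm(
    delta_e: List[Dict[str, float]], kt: float = 1.0
) -> List[Dict[str, float]]:
    """Convert per-position energy changes into base probabilities.

    delta_e[i][b] is the (hypothetical) energy change when position i is
    substituted with base b; lower values mean more favourable binding.
    """
    pwm = []
    for column in delta_e:
        # Boltzmann weight for each base, then normalise the column to sum to 1.
        weights = {b: math.exp(-column[b] / kt) for b in BASES}
        total = sum(weights.values())
        pwm.append({b: weights[b] / total for b in BASES})
    return pwm


# Made-up energy changes for a two-position site (arbitrary units).
example = [
    {"A": 0.0, "C": 2.0, "G": 2.0, "T": 2.0},
    {"A": 1.0, "C": 1.0, "G": 0.0, "T": 1.0},
]
pwm = energies_to_pwm(example)
for column in pwm:
    print({b: round(p, 3) for b, p in column.items()})
```

    A smaller `kt` sharpens each column toward the lowest-energy base, while a larger value flattens it toward a uniform distribution.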

  5. Substrate and drug binding sites in LeuT.

    Science.gov (United States)

    Nyola, Ajeeta; Karpowich, Nathan K; Zhen, Juan; Marden, Jennifer; Reith, Maarten E; Wang, Da-Neng

    2010-08-01

    LeuT is a member of the neurotransmitter/sodium symporter family, which includes the neuronal transporters for serotonin, norepinephrine, and dopamine. The original crystal structure of LeuT shows a primary leucine-binding site at the center of the protein. LeuT is inhibited by different classes of antidepressants that act as potent inhibitors of the serotonin transporter. The newly determined crystal structures of LeuT-antidepressant complexes provide opportunities to probe drug binding in the serotonin transporter, of which the exact position remains controversial. Structure of a LeuT-tryptophan complex shows an overlapping binding site with the primary substrate site. A secondary substrate binding site was recently identified, where the binding of a leucine triggers the cytoplasmic release of the primary substrate. This two binding site model presents opportunities for a better understanding of drug binding and the mechanism of inhibition for mammalian transporters.

  6. Determination of energies and sites of binding of PFOA and PFOS to human serum albumin.

    Science.gov (United States)

    Salvalaglio, Matteo; Muscionico, Isabella; Cavallotti, Carlo

    2010-11-25

    Structure and energies of the binding sites of perfluorooctanoic acid (PFOA) and perfluorooctane sulfonate (PFOS) to human serum albumin (HSA) were determined through molecular modeling. The calculations consisted of a compound approach based on docking, followed by molecular dynamics simulations and by the estimation of the free binding energies adopting WHAM-umbrella sampling and semiempirical methodologies. The binding sites so determined coincide either with known HSA fatty acid sites or with other HSA sites known to bind pharmaceutical compounds such as warfarin, thyroxine, indole, and benzodiazepine. Among the PFOA binding sites, five have interaction energies in excess of -6 kcal/mol, which become nine for PFOS. The calculated binding free energy of PFOA to the Trp 214 binding site is the highest among the PFOA complexes, -8.0 kcal/mol, in good agreement with literature experimental data. The PFOS binding site with the highest energy, -8.8 kcal/mol, is located near the Trp 214 binding site, thus partially affecting its activity. The maximum number of ligands that can be bound to HSA is 9 for PFOA and 11 for PFOS. The calculated data were adopted to predict the level of complexation of HSA as a function of the concentration of PFOA and PFOS found in human blood for different levels of exposure. The analysis of the factors contributing to the complex binding energy made it possible to outline a set of guidelines for the rational design of alternative fluorinated surfactants with a lower bioaccumulation potential.
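
    To make the link between a binding free energy and a complexation level concrete, the sketch below converts the two free energies quoted above into dissociation constants with the standard relation Kd = exp(ΔG/RT) and evaluates a single-site occupancy. Treating each site as independent is a simplifying assumption, and the temperature and the function names are illustrative; the abstract does not describe the authors' actual complexation model.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol*K)


def dissociation_constant(delta_g: float, temperature: float = 298.15) -> float:
    """Return Kd in mol/L for a binding free energy in kcal/mol (1 M standard state)."""
    return math.exp(delta_g / (R_KCAL * temperature))


def fraction_bound(ligand_conc: float, kd: float) -> float:
    """Fraction of one independent site occupied at a free-ligand concentration (mol/L)."""
    return ligand_conc / (ligand_conc + kd)


kd_pfoa = dissociation_constant(-8.0)  # Trp 214 site value quoted in the abstract
kd_pfos = dissociation_constant(-8.8)  # strongest PFOS site quoted in the abstract

# Hypothetical free-ligand concentration of 1 micromolar, for illustration only.
for name, kd in (("PFOA", kd_pfoa), ("PFOS", kd_pfos)):
    print(name, f"Kd = {kd:.2e} M", f"occupancy = {fraction_bound(1e-6, kd):.2f}")
```

    Because the relation is exponential, the 0.8 kcal/mol difference between the two sites corresponds to roughly a fourfold difference in Kd at room temperature.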

  7. SITE-DIRECTED MUTAGENESIS OF PROPOSED ACTIVE-SITE RESIDUES OF PENICILLIN-BINDING PROTEIN-5 FROM ESCHERICHIA-COLI

    NARCIS (Netherlands)

    VANDERLINDEN, MPG; DEHAAN, L; DIDEBERG, O; KECK, W

    1994-01-01

    Alignment of the amino acid sequence of penicillin-binding protein 5 (PBP5) with the sequences of other members of the family of active-site-serine penicillin-interacting enzymes predicted the residues playing a role in the catalytic mechanism of PBP5. Apart from the active-site (Ser(44)), Lys(47),

  8. Protein function annotation by local binding site surface similarity.

    Science.gov (United States)

    Spitzer, Russell; Cleves, Ann E; Varela, Rocco; Jain, Ajay N

    2014-04-01

    Hundreds of protein crystal structures exist for proteins whose function cannot be confidently determined from sequence similarity. Surflex-PSIM, a previously reported surface-based protein similarity algorithm, provides an alternative method for hypothesizing function for such proteins. The method now supports fully automatic binding site detection and is fast enough to screen comprehensive databases of protein binding sites. The binding site detection methodology was validated on apo/holo cognate protein pairs, correctly identifying 91% of ligand binding sites in holo structures and 88% in apo structures where corresponding sites existed. For correctly detected apo binding sites, the cognate holo site was the most similar binding site 87% of the time. PSIM was used to screen a set of proteins that had poorly characterized functions at the time of crystallization, but were later biochemically annotated. Using a fully automated protocol, this set of 8 proteins was screened against ∼60,000 ligand binding sites from the PDB. PSIM correctly identified functional matches that predated query protein biochemical annotation for five out of the eight query proteins. A panel of 12 currently unannotated proteins was also screened, resulting in a large number of statistically significant binding site matches, some of which suggest likely functions for the poorly characterized proteins.

  9. Basis for half-site ligand binding in yeast NAD(+)-specific isocitrate dehydrogenase.

    Science.gov (United States)

    Lin, An-Ping; McAlister-Henn, Lee

    2011-09-27

    Yeast NAD(+)-specific isocitrate dehydrogenase is an allosterically regulated octameric enzyme composed of four heterodimers of a catalytic IDH2 subunit and a regulatory IDH1 subunit. Despite structural predictions that the enzyme would contain eight isocitrate binding sites, four NAD(+) binding sites, and four AMP binding sites, only half of the sites for each ligand can be measured in binding assays. On the basis of a potential interaction between side chains of Cys-150 residues in IDH2 subunits in each tetramer of the enzyme, ligand binding assays of wild-type (IDH1/IDH2) and IDH1/IDH2(C150S) octameric enzymes were conducted in the presence of dithiothreitol. These assays demonstrated the presence of eight isocitrate and four AMP binding sites for the wild-type enzyme in the presence of dithiothreitol and for the IDH1/IDH2(C150S) enzyme in the absence or presence of this reagent, suggesting that interactions between sulfhydryl side chains of IDH2 Cys-150 residues limit access to these sites. However, only two NAD(+) sites could be measured for either enzyme. A tetrameric form of IDH (an IDH1(G15D)/IDH2 mutant enzyme) demonstrated half-site binding for isocitrate (two sites) in the absence of dithiothreitol and full-site binding (four sites) in the presence of dithiothreitol. Only one NAD(+) site could be measured for the tetramer under both conditions. In the context of the structure of the enzyme, these results suggest that an observed asymmetry between heterotetramers in the holoenzyme contributes to interactions between IDH2 Cys-150 residues and to half-site binding of isocitrate, but that a form of negative cooperativity may limit access to apparently equivalent NAD(+) binding sites.

  10. Methods and systems for identifying ligand-protein binding sites

    KAUST Repository

    Gao, Xin

    2016-05-06

    The invention provides a novel integrated structure and system-based approach for drug target prediction that enables the large-scale discovery of new targets for existing drugs. Novel computer-readable storage media and computer systems are also provided. Methods and systems of the invention use novel sequence order-independent structure alignment, hierarchical clustering, and probabilistic sequence similarity techniques to construct a probabilistic pocket ensemble (PPE) that captures even promiscuous structural features of different binding sites for a drug on known targets. The drug's PPE is combined with an approximation of the drug delivery profile to facilitate large-scale prediction of novel drug-protein interactions with several applications to biological research and drug development.

  11. Identification of ligands that target the HCV-E2 binding site on CD81

    Science.gov (United States)

    Olaby, Reem Al; Azzazy, Hassan M.; Harris, Rodney; Chromy, Brett; Vielmetter, Jost; Balhorn, Rod

    2013-04-01

    Hepatitis C is a global health problem. While many drug companies have active R&D efforts to develop new drugs for treating Hepatitis C virus (HCV), most target the viral enzymes. The HCV glycoprotein E2 has been shown to play an essential role in hepatocyte invasion by binding to CD81 and other cell surface receptors. This paper describes the use of AutoDock to identify ligand binding sites on the large extracellular loop of the open conformation of CD81 and to perform virtual screening runs to identify sets of small molecule ligands predicted to bind to two of these sites. The best sites selected by AutoLigand were located in regions identified by mutational studies to be the site of E2 binding. Thirty-six ligands predicted by AutoDock to bind to these sites were subsequently tested experimentally to determine if they bound to CD81-LEL. Binding assays conducted using surface plasmon resonance revealed that 26 out of 36 (72%) of the ligands bound in vitro to the recombinant CD81-LEL protein. Competition experiments performed using dual polarization interferometry showed that one of the ligands predicted to bind to the large cleft between the C and D helices was also effective in blocking E2 binding to CD81-LEL.

  13. Fast dynamics perturbation analysis for prediction of protein functional sites

    Directory of Open Access Journals (Sweden)

    Cohn Judith D

    2008-01-01

    Background: We present a fast version of the dynamics perturbation analysis (DPA) algorithm to predict functional sites in protein structures. The original DPA algorithm finds regions in proteins where interactions cause a large change in the protein conformational distribution, as measured using the relative entropy Dx. Such regions are associated with functional sites. Results: The Fast DPA algorithm, which accelerates DPA calculations, is motivated by an empirical observation that Dx in a normal-modes model is highly correlated with an entropic term that only depends on the eigenvalues of the normal modes. The eigenvalues are accurately estimated using first-order perturbation theory, resulting in an N-fold reduction in the overall computational requirements of the algorithm, where N is the number of residues in the protein. The performance of the original and Fast DPA algorithms was compared using protein structures from a standard small-molecule docking test set. For nominal implementations of each algorithm, top-ranked Fast DPA predictions overlapped the true binding site 94% of the time, compared to 87% of the time for original DPA. In addition, per-protein recall statistics (fraction of binding-site residues that are among predicted residues) were slightly better for Fast DPA. On the other hand, per-protein precision statistics (fraction of predicted residues that are among binding-site residues) were slightly better using original DPA. Overall, the performance of Fast DPA in predicting ligand-binding-site residues was comparable to that of the original DPA algorithm. Conclusion: Compared to the original DPA algorithm, the decreased run time with comparable performance makes Fast DPA well-suited for implementation on a web server and for high-throughput analysis.
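
    The per-protein recall and precision statistics defined in the abstract above reduce to simple set arithmetic. The sketch below is illustrative only; the function name and the residue numbers are hypothetical and are not taken from the paper's test set.

```python
from typing import Set, Tuple


def recall_precision(predicted: Set[int], true_site: Set[int]) -> Tuple[float, float]:
    """Return (recall, precision) for one protein.

    recall    = fraction of binding-site residues that are among predicted residues
    precision = fraction of predicted residues that are among binding-site residues
    """
    overlap = len(predicted & true_site)
    recall = overlap / len(true_site) if true_site else 0.0
    precision = overlap / len(predicted) if predicted else 0.0
    return recall, precision


# Hypothetical residue numbers for a single protein.
predicted_residues = {10, 11, 12, 13, 40}
binding_site_residues = {11, 12, 13, 14}
recall, precision = recall_precision(predicted_residues, binding_site_residues)
print(recall, precision)
```

    Predicting more residues can only raise or hold recall while it tends to lower precision, which is consistent with the two algorithms each being slightly better on one of the two measures.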

  14. Structure and localisation of drug binding sites on neurotransmitter transporters.

    Science.gov (United States)

    Ravna, Aina W; Sylte, Ingebrigt; Dahl, Svein G

    2009-10-01

    The dopamine (DAT), serotonin (SERT) and noradrenalin (NET) transporters are molecular targets for different classes of psychotropic drugs. The crystal structure of Aquifex aeolicus LeuT(Aa) was used as a template for molecular modeling of DAT, SERT and NET, and two putative drug binding sites (pocket 1 and 2) in each transporter were identified. Cocaine was docked into binding pocket 1 of DAT, corresponding to the leucine binding site in LeuT(Aa), which involved transmembrane helices (TMHs) 1, 3, 6 and 8. Clomipramine was docked into binding pocket 2 of DAT, involving TMHs 1, 3, 6, 10 and 11, and extracellular loops 4 and 6, corresponding to the clomipramine binding site in a crystal structure of a LeuT(Aa)-clomipramine complex. The structures of the proposed cocaine- and tricyclic antidepressant-binding sites may be of particular interest for the design of novel DAT interacting ligands.

  15. A 3D-QSAR-driven approach to binding mode and affinity prediction

    DEFF Research Database (Denmark)

    Tosco, Paolo; Balle, Thomas

    2012-01-01

    A method for predicting the binding mode of a series of ligands is proposed. The procedure relies on three-dimensional quantitative structure-activity relationships (3D-QSAR) and does not require structural knowledge of the binding site. Candidate alignments are automatically built and ranked according to a consensus scoring function. 3D-QSAR analysis based on the selected binding mode enables affinity prediction of new drug candidates having fewer than 10 rotatable bonds.

  16. DBD2BS: connecting a DNA-binding protein with its binding sites

    OpenAIRE

    2012-01-01

    By binding to short and highly conserved DNA sequences in genomes, DNA-binding proteins initiate, enhance or repress biological processes. Accurately identifying such binding sites, often represented by position weight matrices (PWMs), is an important step in understanding the control mechanisms of cells. When given coordinates of a DNA-binding domain (DBD) bound with DNA, a potential function can be used to estimate the change of binding affinity after base substitutions, where the changes c...

  17. Identification and characterization of anion binding sites in RNA

    Energy Technology Data Exchange (ETDEWEB)

    Kieft, Jeffrey S.; Chase, Elaine; Costantino, David A.; Golden, Barbara L. (Purdue); (Colorado)

    2010-05-24

    Although RNA molecules are highly negatively charged, anions have been observed bound to RNA in crystal structures. It has been proposed that anion binding sites found within isolated RNAs represent regions of the molecule that could be involved in intermolecular interactions, indicating potential contact points for negatively charged amino acids from proteins or phosphate groups from an RNA. Several types of anion binding sites have been cataloged based on available structures. However, currently there is no method for unambiguously assigning anions to crystallographic electron density, and this has precluded more detailed analysis of RNA-anion interaction motifs and their significance. We therefore soaked selenate into two different types of RNA crystals and used the anomalous signal from these anions to identify binding sites in these RNA molecules unambiguously. Examination of these sites and comparison with other suspected anion binding sites reveals features of anion binding motifs, and shows that selenate may be a useful tool for studying RNA-anion interactions.

  18. SitesIdentify: a protein functional site prediction tool

    Directory of Open Access Journals (Sweden)

    Doig Andrew J

    2009-11-01

    Background: The rate of protein structures being deposited in the Protein Data Bank surpasses the capacity to experimentally characterise them and therefore computational methods to analyse these structures have become increasingly important. Identifying the region of the protein most likely to be involved in function is useful in order to gain information about its potential role. There are many available approaches to predict functional sites, but many are not made available via a publicly-accessible application. Results: Here we present a functional site prediction tool (SitesIdentify), based on combining sequence conservation information with geometry-based cleft identification, that is freely available via a web-server. We have shown that SitesIdentify compares favourably to other functional site prediction tools in a comparison of seven methods on a non-redundant set of 237 enzymes with annotated active sites. Conclusion: SitesIdentify is able to produce comparable accuracy in predicting functional sites to its closest available counterpart, but in addition achieves improved accuracy for proteins with few characterised homologues. SitesIdentify is available via a webserver at http://www.manchester.ac.uk/bioinformatics/sitesidentify/

  19. Position specific variation in the rate of evolution in transcription factor binding sites

    Energy Technology Data Exchange (ETDEWEB)

    Moses, Alan M.; Chiang, Derek Y.; Kellis, Manolis; Lander, Eric S.; Eisen, Michael B.

    2003-08-28

    The binding sites of sequence specific transcription factors are an important and relatively well-understood class of functional non-coding DNAs. Although a wide variety of experimental and computational methods have been developed to characterize transcription factor binding sites, they remain difficult to identify. Comparison of non-coding DNA from related species has shown considerable promise in identifying these functional non-coding sequences, even though relatively little is known about their evolution. Here we analyze the genome sequences of the budding yeasts Saccharomyces cerevisiae, S. bayanus, S. paradoxus and S. mikatae to study the evolution of transcription factor binding sites. As expected, we find that both experimentally characterized and computationally predicted binding sites evolve slower than surrounding sequence, consistent with the hypothesis that they are under purifying selection. We also observe position-specific variation in the rate of evolution within binding sites. We find that the position-specific rate of evolution is positively correlated with degeneracy among binding sites within S. cerevisiae. We test theoretical predictions for the rate of evolution at positions where the base frequencies deviate from background due to purifying selection and find reasonable agreement with the observed rates of evolution. Finally, we show how the evolutionary characteristics of real binding motifs can be used to distinguish them from artifacts of computational motif finding algorithms. As has been observed for protein sequences, the rate of evolution in transcription factor binding sites varies with position, suggesting that some regions are under stronger functional constraint than others. This variation likely reflects the varying importance of different positions in the formation of the protein-DNA complex. The characterization of the pattern of evolution in known binding sites will likely contribute to the effective use of comparative

  20. Impact of Binding Site Comparisons on Medicinal Chemistry and Rational Molecular Design.

    Science.gov (United States)

    Ehrt, Christiane; Brinkjost, Tobias; Koch, Oliver

    2016-05-12

    Modern rational drug design not only deals with the search for ligands binding to interesting and promising validated targets but also aims to identify the function and ligands of yet uncharacterized proteins having impact on different diseases. Additionally, it contributes to the design of inhibitors with distinct selectivity patterns and the prediction of possible off-target effects. The identification of similarities between binding sites of various proteins is a useful approach to cope with those challenges. The main scope of this perspective is to describe applications of different protein binding site comparison approaches to outline their applicability and impact on molecular design. The article deals with various substantial application domains and provides some outstanding examples to show how various binding site comparison methods can be applied to promote in silico drug design workflows. In addition, we will also briefly introduce the fundamental principles of different protein binding site comparison methods.

  1. Prediction of Protein-DNA binding by Monte Carlo method

    Science.gov (United States)

    Deng, Yuefan; Eisenberg, Moises; Korobka, Alex

    1997-08-01

    We present an analysis and prediction of protein-DNA binding specificity based on the hydrogen bonding between DNA, protein, and auxiliary clusters of water molecules. Zif268, glucocorticoid receptor, λ-repressor mutant, HIN-recombinase, and tramtrack protein-DNA complexes are studied. Hydrogen bonds are approximated by the Lennard-Jones potential with a cutoff distance between the hydrogen and the acceptor atoms set to 3.2 Å and an angular component based on a dipole-dipole interaction. We use a three-stage docking algorithm: (1) geometric hashing that matches pairs of hydrogen bonding sites; (2) least-squares minimization of pairwise distances to filter out insignificant matches; and (3) Monte Carlo stochastic search to minimize the energy of the system. More information can be obtained from our first paper on this subject [Y. Deng et al., J. Computational Chemistry (1995)]. Results show that the biologically correct base pair is selected preferentially when there are two or more strong hydrogen bonds (with LJ potential lower than -0.20) that bind it to the protein. Predicted sequences are less stable in the case of weaker bonding sites. In general the inclusion of water bridges does increase the number of base pairs for which correct specificity is predicted.
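
    The third stage, a Monte Carlo search over a Lennard-Jones energy with a distance cutoff, can be illustrated on a one-dimensional toy problem. This is a sketch under several assumptions not found in the abstract: a 12-6 functional form, made-up `epsilon` and `sigma` values, a Metropolis acceptance rule, and no angular term. It does not reproduce the authors' docking code.

```python
import math
import random

CUTOFF = 3.2  # Angstrom, hydrogen-acceptor cutoff quoted in the abstract


def lj_energy(r: float, epsilon: float = 1.0, sigma: float = 1.8) -> float:
    """12-6 Lennard-Jones energy, zero beyond the cutoff (angular term omitted)."""
    if r <= 0.0:
        return math.inf  # non-physical distance
    if r > CUTOFF:
        return 0.0
    ratio6 = (sigma / r) ** 6
    return 4.0 * epsilon * (ratio6 * ratio6 - ratio6)


def metropolis_accept(e_old: float, e_new: float, kt: float, rng: random.Random) -> bool:
    """Accept downhill moves always, uphill moves with Boltzmann probability."""
    if e_new <= e_old:
        return True
    return rng.random() < math.exp(-(e_new - e_old) / kt)


def minimize_distance(r0: float, steps: int = 2000, kt: float = 0.05, seed: int = 0) -> float:
    """Random-walk a single hydrogen-acceptor distance and return the best one seen."""
    rng = random.Random(seed)
    r, e = r0, lj_energy(r0)
    best_r, best_e = r, e
    for _ in range(steps):
        trial = r + rng.uniform(-0.05, 0.05)
        e_trial = lj_energy(trial)
        if metropolis_accept(e, e_trial, kt, rng):
            r, e = trial, e_trial
            if e < best_e:
                best_r, best_e = r, e
    return best_r


best = minimize_distance(2.5)
print(round(best, 2), round(lj_energy(best), 3))
```

    For a 12-6 potential the minimum lies at 2^(1/6)·sigma, so the search should settle near that distance; a real docking run would move many coordinates at once rather than a single distance.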

  2. Discovery and validation of information theory-based transcription factor and cofactor binding site motifs.

    Science.gov (United States)

    Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K

    2016-11-28

    Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes.
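
    As background for the information theory-based matrices mentioned above, the sketch below computes the Shannon information, in bits, of each column of a set of aligned sites. It is a textbook calculation without small-sample correction, not the authors' entropy-minimization pipeline; the function name and the example sites are made up.

```python
import math
from collections import Counter
from typing import List


def information_per_position(sites: List[str]) -> List[float]:
    """Shannon information (bits) at each column of equal-length aligned DNA sites."""
    length = len(sites[0])
    info = []
    for i in range(length):
        counts = Counter(site[i] for site in sites)
        entropy = -sum(
            (n / len(sites)) * math.log2(n / len(sites)) for n in counts.values()
        )
        # Maximum entropy for four equiprobable bases is log2(4) = 2 bits.
        info.append(2.0 - entropy)
    return info


aligned_sites = ["TGACTCA", "TGAGTCA", "TGACTCA", "TGAGTCA"]  # made-up example sites
bits = information_per_position(aligned_sites)
print([round(b, 2) for b in bits])
```

    Fully conserved columns carry 2 bits, and the column that alternates between two bases carries 1 bit; summing the columns gives the total information of the site model.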

  3. DNA-MATRIX: a tool for constructing transcription factor binding sites Weight matrix

    Directory of Open Access Journals (Sweden)

    Chandra Prakash Singh

    2009-12-01

    Despite considerable effort to date, genome-wide prediction of DNA transcription factor binding sites remains a challenge for researchers. Current genome-wide transcription factor binding site prediction tools require either a direct pattern sequence or a weight matrix. Although there are databases of known transcription factor binding site patterns and tools for genome-level prediction, there is no tool for weight matrix construction. Considering this, we developed the DNA-MATRIX tool for searching putative transcription factor binding sites in genomic sequences. DNA-MATRIX uses a simple heuristic approach for weight matrix construction; the matrix can be transformed into different formats as required by researchers for further genome-wide prediction, and it therefore provides the possibility to identify conserved known DNA binding sites in coregulated genes and also to search for a great variety of different regulatory binding patterns. The user may construct and save specific weight or frequency matrices in different formats derived from a user-selected set of known motif sequences.
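
    Building a frequency matrix and a weight matrix from a set of known motif sequences can be sketched as below. The abstract does not specify DNA-MATRIX's heuristic, so this uses a common log-odds construction with a pseudocount and a uniform background as a stand-in; the function names, parameters, and example motifs are hypothetical.

```python
import math
from typing import Dict, List

BASES = "ACGT"


def count_matrix(motifs: List[str]) -> List[Dict[str, int]]:
    """Count each base at each position of equal-length motif sequences."""
    length = len(motifs[0])
    return [{b: sum(m[i] == b for m in motifs) for b in BASES} for i in range(length)]


def weight_matrix(
    motifs: List[str], pseudocount: float = 1.0, background: float = 0.25
) -> List[Dict[str, float]]:
    """Log2-odds weights relative to a uniform background (an assumed scheme)."""
    n = len(motifs)
    weights = []
    for column in count_matrix(motifs):
        weights.append({
            b: math.log2((column[b] + pseudocount) / (n + 4 * pseudocount) / background)
            for b in BASES
        })
    return weights


def score(sequence: str, weights: List[Dict[str, float]]) -> float:
    """Sum the position-specific weights for a candidate site."""
    return sum(weights[i][base] for i, base in enumerate(sequence))


motifs = ["TATAAA", "TATATA", "TATAAT"]  # made-up user-selected motif set
w = weight_matrix(motifs)
print(round(score("TATAAA", w), 2), round(score("GCGCGC", w), 2))
```

    A candidate that matches the motif set scores well above an unrelated sequence, which is the property a genome-wide scan relies on when it thresholds scores.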

  4. Transcription factor binding site positioning in yeast: proximal promoter motifs characterize TATA-less promoters.

    Science.gov (United States)

    Erb, Ionas; van Nimwegen, Erik

    2011-01-01

    The availability of sequence specificities for a substantial fraction of yeast's transcription factors and comparative genomic algorithms for binding site prediction has made it possible to comprehensively annotate transcription factor binding sites genome-wide. Here we use such a genome-wide annotation for comprehensively studying promoter architecture in yeast, focusing on the distribution of transcription factor binding sites relative to transcription start sites, and the architecture of TATA and TATA-less promoters. For most transcription factors, binding sites are positioned further upstream and vary over a wider range in TATA promoters than in TATA-less promoters. In contrast, a group of 6 'proximal promoter motifs' (GAT1/GLN3/DAL80, FKH1/2, PBF1/2, RPN4, NDT80, and ROX1) occur preferentially in TATA-less promoters and show a strong preference for binding close to the transcription start site in these promoters. We provide evidence that suggests that pre-initiation complexes are recruited at TATA sites in TATA promoters and at the sites of the other proximal promoter motifs in TATA-less promoters. TATA-less promoters can generally be classified by the proximal promoter motif they contain, with different classes of TATA-less promoters showing different patterns of transcription factor binding site positioning and nucleosome coverage. These observations suggest that different modes of regulation of transcription initiation may be operating in the different promoter classes. In addition we show that, across all promoter classes, there is a close match between nucleosome free regions and regions of highest transcription factor binding site density. This close agreement between transcription factor binding site density and nucleosome depletion suggests a direct and general competition between transcription factors and nucleosomes for binding to promoters.

  5. Transcription factor binding site positioning in yeast: proximal promoter motifs characterize TATA-less promoters.

    Directory of Open Access Journals (Sweden)

    Ionas Erb

    Full Text Available The availability of sequence specificities for a substantial fraction of yeast's transcription factors and comparative genomic algorithms for binding site prediction has made it possible to comprehensively annotate transcription factor binding sites genome-wide. Here we use such a genome-wide annotation for comprehensively studying promoter architecture in yeast, focusing on the distribution of transcription factor binding sites relative to transcription start sites, and the architecture of TATA and TATA-less promoters. For most transcription factors, binding sites are positioned further upstream and vary over a wider range in TATA promoters than in TATA-less promoters. In contrast, a group of 6 'proximal promoter motifs' (GAT1/GLN3/DAL80, FKH1/2, PBF1/2, RPN4, NDT80, and ROX1) occur preferentially in TATA-less promoters and show a strong preference for binding close to the transcription start site in these promoters. We provide evidence that suggests that pre-initiation complexes are recruited at TATA sites in TATA promoters and at the sites of the other proximal promoter motifs in TATA-less promoters. TATA-less promoters can generally be classified by the proximal promoter motif they contain, with different classes of TATA-less promoters showing different patterns of transcription factor binding site positioning and nucleosome coverage. These observations suggest that different modes of regulation of transcription initiation may be operating in the different promoter classes. In addition we show that, across all promoter classes, there is a close match between nucleosome free regions and regions of highest transcription factor binding site density. This close agreement between transcription factor binding site density and nucleosome depletion suggests a direct and general competition between transcription factors and nucleosomes for binding to promoters.

  6. Influence of sulfhydryl sites on metal binding by bacteria

    Science.gov (United States)

    Nell, Ryan M.; Fein, Jeremy B.

    2017-02-01

    The role of sulfhydryl sites within bacterial cell envelopes is still unknown, but the sites may control the fate and bioavailability of metals. Organic sulfhydryl compounds are important complexing ligands in aqueous systems and they can influence metal speciation in natural waters. Though representing only approximately 5-10% of the total available binding sites on bacterial surfaces, sulfhydryl sites exhibit high binding affinities for some metals. Due to the potential importance of bacterial sulfhydryl sites in natural systems, metal-bacterial sulfhydryl site binding constants must be determined in order to construct accurate models of the fate and distribution of metals in these systems. To date, only Cd-sulfhydryl binding has been quantified. In this study, the thermodynamic stabilities of Mn-, Co-, Ni-, Zn-, Sr- and Pb-sulfhydryl bacterial cell envelope complexes were determined for the bacterial species Shewanella oneidensis MR-1. Metal adsorption experiments were conducted as a function of both pH, ranging from 5.0 to 7.0, and metal loading, from 0.5 to 40.0 μmol/g (wet weight) bacteria, in batch experiments in order to determine if metal-sulfhydryl binding occurs. Initially, the data were used to calculate the value of the stability constants for the important metal-sulfhydryl bacterial complexes for each metal-loading condition studied, assuming a single binding reaction for the dominant metal-binding site type under the pH conditions of the experiments. For most of the metals that we studied, these calculated stability constant values increased significantly with decreasing metal loading, strongly suggesting that our initial assumption was not valid and that more than one type of binding occurs at the assumed binding site. We then modeled each dataset with two distinct site types with identical acidity constants: one site with a high metal-site stability constant value, which we take to represent metal-sulfhydryl binding and which dominates under low

  7. Effect of positional dependence and alignment strategy on modeling transcription factor binding sites

    Directory of Open Access Journals (Sweden)

    Quader Saad

    2012-07-01

    Full Text Available Abstract. Background: Many consensus-based and Position Weight Matrix-based methods for recognizing transcription factor binding sites (TFBS) are not well suited to the variability in the lengths of binding sites. Besides, many methods discard known binding sites while building the model. Moreover, the impact of Information Content (IC) and the positional dependence of nucleotides within an aligned set of TFBS has not been well researched for modeling variable-length binding sites. In this paper, we propose ML-Consensus (Mixed-Length Consensus): a consensus model for variable-length TFBS which does not exclude any reported binding sites. Methods: We consider Pairwise Score (PS) as a measure of positional dependence of nucleotides within an alignment of TFBS. We investigate how the prediction accuracy of ML-Consensus is affected by the incorporation of IC and PS with a particular binding site alignment strategy. We perform cross-validations for datasets of six species from the TRANSFAC public database, and analyze the results using ROC curves and the Wilcoxon matched-pair signed-ranks test. Results: We observe that the incorporation of IC and PS in ML-Consensus results in statistically significant improvement in the prediction accuracy of the model. Moreover, the existence of a core region among the known binding sites (of any length) is witnessed by the pairwise coexistence of nucleotides within the core length. Conclusions: These observations suggest the possibility of an efficient multiple sequence alignment algorithm for aligning TFBS, accommodating known binding sites of any length, for optimal (or near-optimal) TFBS prediction. However, designing such an algorithm is a matter of further investigation.

  8. An additional substrate binding site in a bacterial phenylalanine hydroxylase.

    Science.gov (United States)

    Ronau, Judith A; Paul, Lake N; Fuchs, Julian E; Corn, Isaac R; Wagner, Kyle T; Liedl, Klaus R; Abu-Omar, Mahdi M; Das, Chittaranjan

    2013-09-01

    Phenylalanine hydroxylase (PAH) is a non-heme iron enzyme that catalyzes oxidation of phenylalanine to tyrosine, a reaction that must be kept under tight regulatory control. Mammalian PAH has a regulatory domain in which binding of the substrate leads to allosteric activation of the enzyme. However, the existence of PAH regulation in evolutionarily distant organisms, for example some bacteria in which it occurs, has so far been underappreciated. In an attempt to crystallographically characterize substrate binding by PAH from Chromobacterium violaceum, a single-domain monomeric enzyme, electron density for phenylalanine was observed at a distal site 15.7 Å from the active site. Isothermal titration calorimetry (ITC) experiments revealed a dissociation constant of 24 ± 1.1 μM for phenylalanine. Under the same conditions, ITC revealed no detectable binding for alanine, tyrosine, or isoleucine, indicating the distal site may be selective for phenylalanine. Point mutations of amino acid residues in the distal site that contact phenylalanine (F258A, Y155A, T254A) led to impaired binding, consistent with the presence of distal site binding in solution. Although kinetic analysis revealed that the distal site mutants suffer discernible loss of their catalytic activity, X-ray crystallographic analysis of Y155A and F258A, the two mutants with the most noticeable decrease in activity, revealed no discernible change in the structure of their active sites, suggesting that the effect of distal binding may result from protein dynamics in solution.

  9. Microbes bind complement inhibitor factor H via a common site.

    Directory of Open Access Journals (Sweden)

    T Meri

    Full Text Available To cause infections, microbes need to evade host defense systems, one of these being the evolutionarily old and important arm of innate immunity, the alternative pathway of complement. It can attack all kinds of targets and is tightly controlled in plasma and on host cells by plasma complement regulator factor H (FH). FH binds simultaneously to host cell surface structures such as heparin or glycosaminoglycans via domain 20 and to the main complement opsonin C3b via domain 19. Many pathogenic microbes protect themselves from complement by recruiting host FH. We analyzed how and why different microbes bind FH via domains 19-20 (FH19-20). We used a selection of FH19-20 point mutants to reveal the binding sites of several microbial proteins and whole microbes (Haemophilus influenzae, Bordetella pertussis, Pseudomonas aeruginosa, Streptococcus pneumoniae, Candida albicans, Borrelia burgdorferi, and Borrelia hermsii). We show that all studied microbes use the same binding region located on one side of domain 20. Binding of FH to the microbial proteins was inhibited with heparin showing that the common microbial binding site overlaps with the heparin site needed for efficient binding of FH to host cells. Surprisingly, the microbial proteins enhanced binding of FH19-20 to C3b and down-regulation of complement activation. We show that this is caused by formation of a tripartite complex between the microbial protein, FH, and C3b. In this study we reveal that seven microbes representing different phyla utilize a common binding site on the domain 20 of FH for complement evasion. Binding via this site not only mimics the glycosaminoglycans of the host cells, but also enhances function of FH on the microbial surfaces via the novel mechanism of tripartite complex formation. This is a unique example of convergent evolution resulting in enhanced immune evasion of important pathogens via utilization of a "superevasion site."

  10. HDAC Inhibitors without an Active Site Zn2+-Binding Group

    DEFF Research Database (Denmark)

    Vickers, Chris J.; Olsen, Christian Adam; Leman, Luke J.;

    2012-01-01

    Natural and synthetic histone deacetylase (HDAC) inhibitors generally derive their strong binding affinity and high potency from a key functional group that binds to the Zn2+ ion within the enzyme active site. However, this feature is also thought to carry the potential liability of undesirable o...

  11. Regulation of ryanodine receptor RyR2 by protein-protein interactions: prediction of a PKA binding site on the N-terminal domain of RyR2 and its relation to disease causing mutations [v1; ref status: indexed, http://f1000r.es/4tw]

    Directory of Open Access Journals (Sweden)

    Belinda Nazan Walpoth

    2015-01-01

    Full Text Available Protein-protein interactions are the key processes responsible for signaling and function in complex networks. Determining the correct binding partners and predicting the ligand binding sites in the absence of experimental data require predictive models. Hybrid models that combine quantitative atomistic calculations with statistical thermodynamics formulations are valuable tools for bioinformatics predictions. We present a hybrid prediction and analysis model for determining putative binding partners and interpreting the resulting correlations in the yet functionally uncharacterized interactions of the ryanodine RyR2 N-terminal domain. Using extensive docking calculations and libraries of hexameric peptides generated from regulator proteins of the RyR2 channel, we show that the residues 318-323 of protein kinase A, PKA, have a very high affinity for the N-terminal of RyR2. Using a coarse grained Elastic Net Model, we show that the binding site lies at the end of a pathway of evolutionarily conserved residues in RyR2. The two disease causing mutations are also on this path. The program for the prediction of the energetically responsive residues by the Elastic Net Model is freely available on request from the corresponding author.

  12. Differential Nucleosome Occupancies across Oct4-Sox2 Binding Sites in Murine Embryonic Stem Cells.

    Science.gov (United States)

    Sebeson, Amy; Xi, Liqun; Zhang, Quanwei; Sigmund, Audrey; Wang, Ji-Ping; Widom, Jonathan; Wang, Xiaozhong

    2015-01-01

    The binding sequence for any transcription factor can be found millions of times within a genome, yet only a small fraction of these sequences encode functional transcription factor binding sites. One of the reasons for this dichotomy is that many other factors, such as nucleosomes, compete for binding. To study how the competition between nucleosomes and transcription factors helps determine a functional transcription factor site from a predicted transcription factor site, we compared experimentally-generated in vitro nucleosome occupancy with in vivo nucleosome occupancy and transcription factor binding in murine embryonic stem cells. Using a solution hybridization enrichment technique, we generated a high-resolution nucleosome map from targeted regions of the genome containing predicted sites and functional sites of Oct4/Sox2 regulation. We found that at Pax6 and Nes, which are bivalently poised in stem cells, functional Oct4 and Sox2 sites show high amounts of in vivo nucleosome displacement compared to in vitro. Oct4 and Sox2, which are active, show no significant displacement of in vivo nucleosomes at functional sites, similar to nonfunctional Oct4/Sox2 binding. This study highlights a complex interplay between Oct4 and Sox2 transcription factors and nucleosomes among different target genes, which may result in distinct patterns of stem cell gene regulation.

  13. Differential Nucleosome Occupancies across Oct4-Sox2 Binding Sites in Murine Embryonic Stem Cells.

    Directory of Open Access Journals (Sweden)

    Amy Sebeson

    Full Text Available The binding sequence for any transcription factor can be found millions of times within a genome, yet only a small fraction of these sequences encode functional transcription factor binding sites. One of the reasons for this dichotomy is that many other factors, such as nucleosomes, compete for binding. To study how the competition between nucleosomes and transcription factors helps determine a functional transcription factor site from a predicted transcription factor site, we compared experimentally-generated in vitro nucleosome occupancy with in vivo nucleosome occupancy and transcription factor binding in murine embryonic stem cells. Using a solution hybridization enrichment technique, we generated a high-resolution nucleosome map from targeted regions of the genome containing predicted sites and functional sites of Oct4/Sox2 regulation. We found that at Pax6 and Nes, which are bivalently poised in stem cells, functional Oct4 and Sox2 sites show high amounts of in vivo nucleosome displacement compared to in vitro. Oct4 and Sox2, which are active, show no significant displacement of in vivo nucleosomes at functional sites, similar to nonfunctional Oct4/Sox2 binding. This study highlights a complex interplay between Oct4 and Sox2 transcription factors and nucleosomes among different target genes, which may result in distinct patterns of stem cell gene regulation.

  14. SiteOut: An Online Tool to Design Binding Site-Free DNA Sequences.

    Directory of Open Access Journals (Sweden)

    Javier Estrada

    Full Text Available DNA-binding proteins control many fundamental biological processes such as transcription, recombination and replication. A major goal is to decipher the role that DNA sequence plays in orchestrating the binding and activity of such regulatory proteins. To address this goal, it is useful to rationally design DNA sequences with desired numbers, affinities and arrangements of protein binding sites. However, removing binding sites from DNA is computationally non-trivial since one risks creating new sites in the process of deleting or moving others. Here we present an online binding site removal tool, SiteOut, that enables users to design arbitrary DNA sequences that entirely lack binding sites for factors of interest. SiteOut can also be used to delete sites from a specific sequence, or to introduce site-free spacers between functional sequences without creating new sites at the junctions. In combination with commercial DNA synthesis services, SiteOut provides a powerful and flexible platform for synthetic projects that interrogate regulatory DNA. Here we describe the algorithm and illustrate the ways in which SiteOut can be used; it is publicly available at https://depace.med.harvard.edu/siteout/.

  15. SiteOut: An Online Tool to Design Binding Site-Free DNA Sequences.

    Science.gov (United States)

    Estrada, Javier; Ruiz-Herrero, Teresa; Scholes, Clarissa; Wunderlich, Zeba; DePace, Angela H

    2016-01-01

    DNA-binding proteins control many fundamental biological processes such as transcription, recombination and replication. A major goal is to decipher the role that DNA sequence plays in orchestrating the binding and activity of such regulatory proteins. To address this goal, it is useful to rationally design DNA sequences with desired numbers, affinities and arrangements of protein binding sites. However, removing binding sites from DNA is computationally non-trivial since one risks creating new sites in the process of deleting or moving others. Here we present an online binding site removal tool, SiteOut, that enables users to design arbitrary DNA sequences that entirely lack binding sites for factors of interest. SiteOut can also be used to delete sites from a specific sequence, or to introduce site-free spacers between functional sequences without creating new sites at the junctions. In combination with commercial DNA synthesis services, SiteOut provides a powerful and flexible platform for synthetic projects that interrogate regulatory DNA. Here we describe the algorithm and illustrate the ways in which SiteOut can be used; it is publicly available at https://depace.med.harvard.edu/siteout/.

  16. Identification of a functional hepatocyte nuclear factor 4 binding site in the neutral ceramidase promoter

    DEFF Research Database (Denmark)

    Maltesen, Henrik R; Troelsen, Jesper T; Olsen, Jørgen

    2010-01-01

    in ceramide digestion. It was the purpose of the present work to experimentally verify the functional importance of a HNF-4a binding site predicted by bioinformatic analysis to be present in the Asah2 promoter. Using supershift analysis, HNF-4a overexpression, and HNF-4a knockdown experiments it was confirmed...... that the predicted HNF-4a binding site identified in the Asah2 promoter is functional. The results support the hypothesis that HNF-4a might be important for intestinal glycolipid metabolism....

  17. Cation binding site of cytochrome c oxidase: progress report.

    Science.gov (United States)

    Vygodina, Tatiana V; Kirichenko, Anna; Konstantinov, Alexander A

    2014-07-01

    Cytochrome c oxidase from bovine heart binds Ca(2+) reversibly at a specific Cation Binding Site located near the outer face of the mitochondrial membrane. Ca(2+) shifts the absorption spectrum of heme a, which earlier allowed the determination of the kinetic and equilibrium characteristics of the binding, and, as shown recently, the binding of calcium to the site inhibits cytochrome oxidase activity at low turnover rates of the enzyme [Vygodina, T., Kirichenko, A., Konstantinov, A.A. (2013). Direct Regulation of Cytochrome c Oxidase by Calcium Ions. PLoS ONE 8, e74436]. This paper summarizes further progress in the studies of the Cation Binding Site in this group, presenting the results to be reported at the 18th EBEC Meeting in Lisbon, 2014. The paper revises the specificity of the bovine oxidase Cation Binding Site for different cations, describes the dependence of the Ca(2+)-induced inhibition on the turnover rate of the enzyme and reports very high affinity binding of calcium with the "slow" form of cytochrome oxidase. This article is part of a Special Issue entitled: 18th European Bioenergetic Conference. Guest Editors: Manuela Pereira and Miguel Teixeira.

  18. Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction

    Science.gov (United States)

    Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K.; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G.; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H.

    2017-01-01

    The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively. PMID:27899623

  19. Combining transcription factor binding affinities with open-chromatin data for accurate gene expression prediction.

    Science.gov (United States)

    Schmidt, Florian; Gasparoni, Nina; Gasparoni, Gilles; Gianmoena, Kathrin; Cadenas, Cristina; Polansky, Julia K; Ebert, Peter; Nordström, Karl; Barann, Matthias; Sinha, Anupam; Fröhler, Sebastian; Xiong, Jieyi; Dehghani Amirabad, Azim; Behjati Ardakani, Fatemeh; Hutter, Barbara; Zipprich, Gideon; Felder, Bärbel; Eils, Jürgen; Brors, Benedikt; Chen, Wei; Hengstler, Jan G; Hamann, Alf; Lengauer, Thomas; Rosenstiel, Philip; Walter, Jörn; Schulz, Marcel H

    2017-01-09

    The binding and contribution of transcription factors (TF) to cell specific gene expression is often deduced from open-chromatin measurements to avoid costly TF ChIP-seq assays. Thus, it is important to develop computational methods for accurate TF binding prediction in open-chromatin regions (OCRs). Here, we report a novel segmentation-based method, TEPIC, to predict TF binding by combining sets of OCRs with position weight matrices. TEPIC can be applied to various open-chromatin data, e.g. DNaseI-seq and NOMe-seq. Additionally, Histone-Marks (HMs) can be used to identify candidate TF binding sites. TEPIC computes TF affinities and uses open-chromatin/HM signal intensity as quantitative measures of TF binding strength. Using machine learning, we find low affinity binding sites to improve our ability to explain gene expression variability compared to the standard presence/absence classification of binding sites. Further, we show that both footprints and peaks capture essential TF binding events and lead to a good prediction performance. In our application, gene-based scores computed by TEPIC with one open-chromatin assay nearly reach the quality of several TF ChIP-seq data sets. Finally, these scores correctly predict known transcriptional regulators as illustrated by the application to novel DNaseI-seq and NOMe-seq data for primary human hepatocytes and CD4+ T-cells, respectively.

  20. Opioid binding sites in the guinea pig and rat kidney: Radioligand homogenate binding and autoradiography

    Energy Technology Data Exchange (ETDEWEB)

    Dissanayake, V.U.; Hughes, J.; Hunter, J.C. (Parke-Davis Research Unit, Addenbrookes Hospital Site, Cambridge (England))

    1991-07-01

    The specific binding of the selective μ-, δ-, and κ-opioid ligands [3H][D-Ala2,MePhe4,Gly-ol5]enkephalin ([3H]DAGOL), [3H][D-Pen2,D-Pen5]enkephalin ([3H]DPDPE), and [3H]U69593, respectively, to crude membranes of the guinea pig and rat whole kidney, kidney cortex, and kidney medulla was investigated. In addition, the distribution of specific 3H-opioid binding sites in the guinea pig and rat kidney was visualized by autoradiography. Homogenate binding and autoradiography demonstrated the absence of μ- and κ-opioid binding sites in the guinea pig kidney. No opioid binding sites were demonstrable in the rat kidney. In the guinea pig whole kidney, cortex, and medulla, saturation studies demonstrated that [3H]DPDPE bound with high affinity (KD = 2.6-3.5 nM) to an apparently homogeneous population of binding sites (Bmax = 8.4-30 fmol/mg of protein). Competition studies using several opioid compounds confirmed the nature of the δ-opioid binding site. Autoradiography experiments demonstrated that specific [3H]DPDPE binding sites were distributed radially in regions of the inner and outer medulla and at the corticomedullary junction of the guinea pig kidney. Computer-assisted image analysis of saturation data yielded KD values (4.5-5.0 nM) that were in good agreement with those obtained from the homogenate binding studies. Further investigation of the δ-opioid binding site in medulla homogenates, using agonist ([3H]DPDPE) and antagonist ([3H]diprenorphine) binding in the presence of Na+, Mg2+, and nucleotides, suggested that the δ-opioid site is linked to a second messenger system via a GTP-binding protein. Further studies are required to establish the precise localization of the δ binding site in the guinea pig kidney and to determine the nature of the second messenger linked to the GTP-binding protein in the medulla.
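
    The KD and Bmax values quoted in this record are the two parameters of a saturation binding curve. As a reading aid only, and assuming the standard one-site model (the abstract does not say which model was fitted), specific binding B at free radioligand concentration [L] can be written as:

```latex
% One-site saturation binding (assumed model, not taken from the record).
% B      : specific binding at equilibrium (fmol/mg of protein)
% B_max  : total density of binding sites (fmol/mg of protein)
% K_D    : equilibrium dissociation constant (nM)
% [L]    : free radioligand concentration (nM)
\[
  B = \frac{B_{\max}\,[L]}{K_D + [L]}
\]
```

    Under this model, half of the sites are occupied when [L] equals KD, which is why a KD in the low nanomolar range is described as high affinity.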

  1. Chloride binding site of neurotransmitter sodium symporters

    DEFF Research Database (Denmark)

    Kantcheva, Adriana Krassimirova; Quick, Matthias; Shi, Lei

    2013-01-01

    Neurotransmitter:sodium symporters (NSSs) play a critical role in signaling by reuptake of neurotransmitters. Eukaryotic NSSs are chloride-dependent, whereas prokaryotic NSS homologs like LeuT are chloride-independent but contain an acidic residue (Glu290 in LeuT) at a site where eukaryotic NSSs...... have a serine. The LeuT-E290S mutant displays chloride-dependent activity. We show that, in LeuT-E290S cocrystallized with bromide or chloride, the anion is coordinated by side chain hydroxyls from Tyr47, Ser290, and Thr254 and the side chain amide of Gln250. The bound anion and the nearby sodium ion...

  2. Sequence and structural features of binding site residues in protein-protein complexes: comparison with protein-nucleic acid complexes

    Directory of Open Access Journals (Sweden)

    Selvaraj S

    2011-10-01

    Full Text Available Abstract. Background: Protein-protein interactions are important for several cellular processes. Understanding the mechanism of protein-protein recognition and predicting the binding sites in protein-protein complexes are long-standing goals in molecular and computational biology. Methods: We have developed an energy based approach for identifying the binding site residues in protein–protein complexes. The binding site residues have been analyzed with sequence and structure based parameters such as binding propensity, neighboring residues in the vicinity of binding sites, conservation score and conformational switching. Results: We observed that the binding propensities of amino acid residues are specific for protein-protein complexes. Further, typical dipeptides and tripeptides showed high preference for binding, which is unique to protein-protein complexes. Most of the binding site residues are highly conserved among homologous sequences. Our analysis showed that 7% of residues changed their conformations upon protein-protein complex formation and it is 9.2% and 6.6% in the binding and non-binding sites, respectively. Specifically, the residues Glu, Lys, Leu and Ser changed their conformation from coil to helix/strand and from helix to coil/strand. Leu, Ser, Thr and Val prefer to change their conformation from strand to coil/helix. Conclusions: The results obtained in this study will be helpful for understanding and predicting the binding sites in protein-protein complexes.

  3. Assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data.

    Directory of Open Access Journals (Sweden)

    Yaron Orenstein

    Full Text Available The new technology of protein binding microarrays (PBMs) allows simultaneous measurement of the binding intensities of a transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key computational challenge is inferring the binding motif from these data. We present a systematic comparison of four methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.
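
    For readers unfamiliar with the terms used in this record, the sketch below illustrates what a positional weight matrix, its information content, and probe scoring look like in practice. It is a generic textbook construction, not any of the four methods compared in the paper; the function names, the pseudocount, the uniform background, and the toy sites are all assumptions made for illustration.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0):
    """Column-wise base probabilities from equal-length aligned sites.

    The pseudocount (an assumed value) keeps every probability above zero.
    """
    length = len(sites[0])
    pwm = []
    for i in range(length):
        counts = {b: pseudocount for b in BASES}
        for site in sites:
            counts[site[i]] += 1
        total = sum(counts.values())
        pwm.append({b: counts[b] / total for b in BASES})
    return pwm


def information_content(pwm):
    """Total information content in bits, relative to a uniform background."""
    return sum(2.0 + sum(p * math.log2(p) for p in col.values()) for col in pwm)


def log_odds_score(pwm, seq, background=0.25):
    """Log-odds score of a sequence whose length equals the motif width."""
    return sum(math.log2(col[base] / background) for col, base in zip(pwm, seq))


def best_window(pwm, probe):
    """Highest-scoring motif-width window in a probe, as (score, offset)."""
    width = len(pwm)
    return max(
        (log_odds_score(pwm, probe[i:i + width]), i)
        for i in range(len(probe) - width + 1)
    )


# Hypothetical aligned sites, used only to exercise the functions.
toy_sites = ["TATAAA", "TATAAT", "TATATA", "TACAAA"]
toy_pwm = build_pwm(toy_sites)
toy_ic = information_content(toy_pwm)
toy_best = best_window(toy_pwm, "GGGTATAAAGGG")
```

    Ranking probes by their best-window score is one simple way to compare a motif against measured binding intensities. A motif with lower information content has flatter columns and therefore penalizes mismatches less sharply.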

  4. Modulation of RNase E activity by alternative RNA binding sites.

    Directory of Open Access Journals (Sweden)

    Daeyoung Kim

    Full Text Available Endoribonuclease E (RNase E) affects the composition and balance of the RNA population in Escherichia coli via degradation and processing of RNAs. In this study, we investigated the regulatory effects of an RNA binding site between amino acid residues 25 and 36 (24LYDLDIESPGHEQK37) of RNase E. Tandem mass spectrometry analysis of the N-terminal catalytic domain of RNase E (N-Rne) that was UV crosslinked with a 5'-32P-end-labeled, 13-nt oligoribonucleotide (p-BR13) containing the RNase E cleavage site of RNA I revealed that two amino acid residues, Y25 and Q36, were bound to the cytosine and adenine of BR13, respectively. Based on these results, the Y25A N-Rne mutant was constructed, and was found to be hypoactive in comparison to wild-type and hyperactive Q36R mutant proteins. Mass spectrometry analysis showed that Y25A and Q36R mutations abolished the RNA binding to the uncompetitive inhibition site of RNase E. The Y25A mutation increased the RNA binding to the multimer formation interface between amino acid residues 427 and 433 (427LIEEEALK433), whereas the Q36R mutation enhanced the RNA binding to the catalytic site of the enzyme (65HGFLPL*K71). Electrophoretic mobility shift assays showed that the stable RNA-protein complex formation was positively correlated with the extent of RNA binding to the catalytic site and ribonucleolytic activity of the N-Rne proteins. These mutations exerted similar effects on the ribonucleolytic activity of the full-length RNase E in vivo. Our findings indicate that RNase E has two alternative RNA binding sites for modulating RNA binding to the catalytic site and the formation of a functional catalytic unit.

  5. Bacterial periplasmic sialic acid-binding proteins exhibit a conserved binding site

    Energy Technology Data Exchange (ETDEWEB)

    Gangi Setty, Thanuja [Institute for Stem Cell Biology and Regenerative Medicine, NCBS Campus, GKVK Post, Bangalore, Karnataka 560 065 (India); Cho, Christine [Carver College of Medicine, University of Iowa, Iowa City, IA 52242-1109 (United States); Govindappa, Sowmya [Institute for Stem Cell Biology and Regenerative Medicine, NCBS Campus, GKVK Post, Bangalore, Karnataka 560 065 (India); Apicella, Michael A. [Carver College of Medicine, University of Iowa, Iowa City, IA 52242-1109 (United States); Ramaswamy, S., E-mail: ramas@instem.res.in [Institute for Stem Cell Biology and Regenerative Medicine, NCBS Campus, GKVK Post, Bangalore, Karnataka 560 065 (India)

    2014-07-01

    Structure–function studies of sialic acid-binding proteins from F. nucleatum, P. multocida, V. cholerae and H. influenzae reveal a conserved network of hydrogen bonds involved in conformational change on ligand binding. Sialic acids are a family of related nine-carbon sugar acids that play important roles in both eukaryotes and prokaryotes. These sialic acids are incorporated/decorated onto lipooligosaccharides as terminal sugars in multiple bacteria to evade the host immune system. Many pathogenic bacteria scavenge sialic acids from their host and use them for molecular mimicry. The first step of this process is the transport of sialic acid to the cytoplasm, which often takes place using a tripartite ATP-independent transport system consisting of a periplasmic binding protein and a membrane transporter. In this paper, the structural characterization of periplasmic binding proteins from the pathogenic bacteria Fusobacterium nucleatum, Pasteurella multocida and Vibrio cholerae and their thermodynamic characterization are reported. The binding affinities of several mutations in the Neu5Ac binding site of the Haemophilus influenzae protein are also reported. The structure and the thermodynamics of the binding of sugars suggest that all of these proteins have a very well conserved binding pocket and similar binding affinities. A significant conformational change occurs when these proteins bind the sugar. While the C1 carboxylate has been identified as the primary binding site, a second conserved hydrogen-bonding network is involved in the initiation and stabilization of the conformational states.

  6. Modeling lanthanide series binding sites on humic acid.

    Science.gov (United States)

    Pourret, Olivier; Martinez, Raul E

    2009-02-01

    Lanthanide (Ln) binding to humic acid (HA) has been investigated by combining ultrafiltration and ICP-MS techniques. A Langmuir-sorption-isotherm metal-complexation model was used in conjunction with a linear programming method (LPM) to fit experimental data representing various experimental conditions both in HA/Ln ratio (varying between 5 and 20) and in pH range (from 2 to 10) with an ionic strength of 10⁻³ mol L⁻¹. The LPM approach, not requiring prior knowledge of surface complexation parameters, was used to solve the existing discrepancies in LnHA binding constants and site densities. The application of the LPM to experimental data revealed the presence of two discrete metal binding sites at low humic acid concentrations (5 mg L⁻¹), with log metal complexation constants (log K(S,j)) of 2.65 ± 0.05 and 7.00 (depending on Ln). The corresponding site densities were (2.71 ± 0.57) × 10⁻⁸ and (0.58 ± 0.32) × 10⁻⁸ mol of Ln³⁺/mg of HA (depending on Ln). Total site densities of (3.28 ± 0.28) × 10⁻⁸, (4.99 ± 0.02) × 10⁻⁸, and (5.01 ± 0.01) × 10⁻⁸ mol mg⁻¹ were obtained by LPM for humic acid concentrations of 5, 10, and 20 mg L⁻¹, respectively. These results confirm that lanthanide binding occurs mainly at weak sites (i.e., ca. 80%) and secondarily at strong sites (i.e., ca. 20%). The first group of discrete metal binding sites may be attributed to carboxylic groups (known to be the main binding sites of Ln in HA), and the second metal binding group to phenolic moieties. Moreover, this study provides evidence of heterogeneity in the distribution of the binding sites among Ln. Overall, the LPM approach produced feasible and reasonable results, was less sensitive to error, and did not require an a priori assumption of the number and concentration of binding sites.
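
As a rough illustration of the discrete two-site Langmuir picture described above, the sketch below sums one Langmuir term per site class. The constants are the values quoted in the abstract; the assumed units (K in L/mol, free metal in mol/L, density in mol per mg HA) and the function name are assumptions made for this example, and the LPM fitting step itself is not reproduced.

```python
def langmuir_bound(free_metal, sites):
    """Bound metal per mg of HA for discrete, independent site classes.

    sites: iterable of (log10 K, site density) pairs.
    Each class contributes density * K * [M] / (1 + K * [M]).
    """
    total = 0.0
    for log_k, density in sites:
        k = 10.0 ** log_k
        total += density * k * free_metal / (1.0 + k * free_metal)
    return total


# Values quoted in the abstract for 5 mg/L HA; units are assumed (see lead-in).
SITES = [(2.65, 2.71e-8), (7.00, 0.58e-8)]

weak_fraction = SITES[0][1] / sum(density for _, density in SITES)
print(f"weak-site share of total site density: {weak_fraction:.2f}")
print(langmuir_bound(1e-6, SITES))
```

The weak-site share computed from the two quoted densities is consistent with the "ca. 80%" figure stated in the abstract.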

  7. Exploring the composition of protein-ligand binding sites on a large scale.

    Directory of Open Access Journals (Sweden)

    Nickolay A Khazanov

    Full Text Available The residue composition of a ligand binding site determines the interactions available for diffusion-mediated ligand binding, and understanding the general composition of these sites is of great importance if we are to gain insight into the functional diversity of the proteome. Many structure-based drug design methods utilize such heuristic information for improving prediction or characterization of ligand-binding sites in proteins of unknown function. The Binding MOAD database is one of the largest curated sets of protein-ligand complexes, and provides a source of diverse, high-quality data for establishing general trends of residue composition from currently available protein structures. We present an analysis of 3,295 non-redundant proteins with 9,114 non-redundant binding sites to identify residues over-represented in binding regions versus the rest of the protein surface. The Binding MOAD database delineates biologically-relevant "valid" ligands from "invalid" small-molecule ligands bound to the protein. Invalids are present in the crystallization medium and serve no known biological function. Contacts are found to differ between these classes of ligands, indicating that residue composition of biologically relevant binding sites is distinct not only from the rest of the protein surface, but also from surface regions capable of opportunistic binding of non-functional small molecules. To confirm these trends, we perform a rigorous analysis of the variation of residue propensity with respect to the size of the dataset and the content bias inherent in structure sets obtained from a large protein structure database. The optimal size of the dataset for establishing general trends of residue propensities, as well as strategies for assessing the significance of such trends, are suggested for future studies of binding-site composition.
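
One common way to express "over-represented in binding regions versus the rest of the protein surface" is a frequency ratio. The sketch below assumes that definition (fraction of a residue type among binding-site residues divided by its fraction among the remaining surface residues); the study's exact formula may differ, and the residue strings are invented.

```python
from collections import Counter


def residue_propensity(site_residues, surface_residues):
    """Ratio of residue frequency in binding sites to frequency on the rest of the surface.

    Both arguments are strings of one-letter amino acid codes.
    Residue types absent from the surface set are skipped to avoid division by zero.
    """
    site_counts = Counter(site_residues)
    surface_counts = Counter(surface_residues)
    n_site = sum(site_counts.values())
    n_surface = sum(surface_counts.values())
    return {
        res: (site_counts[res] / n_site) / (count / n_surface)
        for res, count in surface_counts.items()
    }


# Invented toy data: W is enriched in the site, A is depleted.
print(residue_propensity("WWGA", "WGGGAAAA"))
```

Values above 1 indicate enrichment in binding sites and values below 1 indicate depletion; with small datasets such ratios are noisy, which is why the abstract examines their dependence on dataset size.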

  8. Putative hAPN receptor binding sites in SARS_CoV spike protein

    Institute of Scientific and Technical Information of China (English)

    YU Xiao-Jing; LUO Cheng; LIN Jian-Cheng; HAO Pei; HE You-Yu; GUO Zong-Ming; QIN Lei; SU Jiong; LIU Bo-Shu; HUANG Yin; NAN Peng; LI Chuan-Song; XIONG Bin; LUO Xiao-Min; ZHAO Guo-Ping; PEI Gang; CHEN Kai-Xian; SHEN Xu; SHEN Jian-Hua; ZOU Jian-Ping; HE Wei-Zhong; SHI Tie-Liu; ZHONG Yang; JIANG Hua-Liang; LI Yi-Xue

    2003-01-01

    AIM: To obtain information on ligand-receptor binding between the S protein of SARS_CoV and CD13, identify the possible interacting domains or motifs related to binding sites, and provide clues for studying the functions of SARS proteins and designing anti-SARS drugs and vaccines. METHODS: On the basis of comparative genomics, homology search, phylogenetic analyses, and multi-sequence alignment were used to predict CD13-related interacting domains and binding sites in the S protein of SARS_CoV. Molecular modeling and docking simulation methods were employed to address the interaction features between CD13 and the S protein of SARS_CoV in validating the bioinformatics predictions. RESULTS: Possible binding sites in the SARS_CoV S protein to CD13 have been mapped out by using bioinformatics analysis tools. The binding for one protein-protein interaction pair (D757-R761 motif of the SARS_CoV S protein to P585-A653 domain of CD13) has been simulated by molecular modeling and docking simulation methods. CONCLUSION: CD13 may be a possible receptor of the SARS_CoV S protein, which may be associated with the SARS infection. This study also provides a possible strategy for mapping the possible binding receptors of the proteins in a genome.

  9. Pactamycin binding site on archaebacterial and eukaryotic ribosomes

    Energy Technology Data Exchange (ETDEWEB)

    Tejedor, F.; Amils, R.; Ballesta, J.P.G.

    1987-01-27

    The presence of a photoreactive acetophenone group in the protein synthesis inhibitor pactamycin and the possibility of obtaining active iodinated derivatives that retain full biological activity allow the antibiotic binding site on Saccharomyces cerevisiae and archaebacterium Sulfolobus solfataricus ribosomes to be photoaffinity labeled. Four major labeled proteins have been identified in the yeast ribosomes, i.e., YS10, YS18, YS21/24, and YS30, while proteins AL1a, AS10/L8, AS18/20, and AS21/22 appeared as radioactive spots in S. solfataricus. There seems to be a correlation between some of the proteins labeled in yeast and those previously reported in Escherichia coli indicating that the pactamycin binding sites of both species, which are in the small subunit close to the initiation factors and mRNA binding sites, must have similar characteristics.

  10. Autoradiographic localization of estrogen binding sites in human mammary lesions

    Energy Technology Data Exchange (ETDEWEB)

    Buell, R.H.

    1984-01-01

    The biochemical assay of human mammary carcinomas for estrogen receptors is of proven clinical utility, but the cellular localization of estrogen binding sites within these lesions is less certain. The author describes the identification of estrogen binding sites as visualized by thaw-mount autoradiography after in vitro incubation in a series of 17 benign and 40 malignant human female mammary lesions. The results on the in vitro incubation method compared favorably with data from in vivo studies in mouse uterus, a well-characterized estrogen target organ. In noncancerous breast biopsies, a variable proportion of epithelial cells contained specific estrogen binding sites. Histologically identifiable myoepithelial and stromal cells were, in general, unlabeled. In human mammary carcinomas, biochemically estrogen receptor-positive, labeled and unlabeled neoplastic epithelial cells were identified by autoradiography. Quantitative results from the autoradiographic method compared favorably with biochemical data.

  11. Relating the shape of protein binding sites to binding affinity profiles: is there an association?

    Directory of Open Access Journals (Sweden)

    Bitter István

    2010-10-01

    Full Text Available Background: Various pattern-based methods exist that use in vitro or in silico affinity profiles for classification and functional examination of proteins. Nevertheless, the connection between the protein affinity profiles and the structural characteristics of the binding sites is still unclear. Our aim was to investigate the association between virtual drug screening results (calculated binding free energy values) and the geometry of protein binding sites. Molecular Affinity Fingerprints (MAFs) were determined for 154 proteins based on their molecular docking energy results for 1,255 FDA-approved drugs. Protein binding site geometries were characterized by 420 PocketPicker descriptors. The basic underlying component structure of MAFs and binding site geometries, respectively, were examined by principal component analysis; association between principal components extracted from these two sets of variables was then investigated by canonical correlation and redundancy analyses. Results: PCA of the MAF variables provided 30 factors which explained 71.4% of the total variance of the energy values while 13 factors were obtained from the PocketPicker descriptors which cumulatively explained 94.1% of the total variance. Canonical correlation analysis resulted in 3 statistically significant canonical factor pairs with correlation values of 0.87, 0.84 and 0.77, respectively. Redundancy analysis indicated that PocketPicker descriptor factors explain 6.9% of the variance of the MAF factor set while MAF factors explain 15.9% of the total variance of PocketPicker descriptor factors. Based on the salient structures of the factor pairs, we identified a clear-cut association between the shape and bulkiness of the drug molecules and the protein binding site descriptors. Conclusions: This is the first study to investigate complex multivariate associations between affinity profiles and the geometric properties of protein binding sites. We found that

  12. A novel non-opioid binding site for endomorphin-1.

    Science.gov (United States)

    Lengyel, I; Toth, F; Biyashev, D; Szatmari, I; Monory, K; Tomboly, C; Toth, G; Benyhe, S; Borsodi, A

    2016-08-01

    Endomorphins are natural amidated opioid tetrapeptides with the following structure: Tyr-Pro-Trp-Phe-NH2 (endomorphin-1), and Tyr-Pro-Phe-Phe-NH2 (endomorphin-2). Endomorphins interact selectively with the μ-opioid or MOP receptors and exhibit nanomolar or sub-nanomolar receptor binding affinities; therefore they are suggested to be endogenous agonists for the μ-opioid receptors. Endomorphins mediate a number of characteristic opioid effects, such as antinociception; however, there are several physiological functions in which endomorphins appear to act in a fashion that does not involve binding to and activation of the μ-opioid receptor. Our recent data indicate that a radiolabelled [³H]endomorphin-1 with a specific radioactivity of 2.35 TBq/mmol, prepared by catalytic dehalogenation of the diiodinated peptide precursor in the presence of tritium gas, is able to bind to a second, naloxone-insensitive recognition site in rat brain membranes. Binding heterogeneity, i.e., the presence of higher (Kd = 0.4 nM / Bmax = 120 fmol/mg protein) and lower (Kd = 8.2 nM / Bmax = 432 fmol/mg protein) affinity binding components, is observed both in saturation binding experiments followed by Scatchard analysis, and in equilibrium competition binding studies. The signs of receptor multiplicity, e.g., curvilinear Scatchard plots or biphasic displacement curves, are seen only if the non-specific binding is measured in the presence of excess unlabeled endomorphin-1 and not in the presence of excess unlabeled naloxone. The second, lower affinity non-opioid binding site is not recognized by heterocyclic opioid alkaloid ligands, neither agonists such as morphine, nor antagonists such as naloxone. On the contrary, endomorphin-1 is displaced from its lower affinity, higher capacity binding site by several natural neuropeptides, including methionine-enkephalin-Arg-Phe, nociceptin-orphanin FQ, angiotensin and FMRF-amide. This naloxone-insensitive, consequently non-opioid binding site seems

  13. Mutations and binding sites of human transcription factors

    KAUST Repository

    Kamanu, Frederick Kinyua

    2012-06-01

    Mutations in any genome may lead to phenotype characteristics that determine the ability of an individual to cope with adaptation to environmental challenges. In studies of human biology, among the most interesting are phenotype characteristics that determine responses to drug treatments, response to infections, or predisposition to specific inherited diseases. Most of the research in this field has focused on the effects of mutations on the final gene products, peptides, and their alterations. Considerably less attention has been given to mutations that may affect the regulatory mechanism(s) of gene expression, although these may also affect the phenotype characteristics. In this study we make a pilot analysis of mutations observed in the regulatory regions of 24,667 human RefSeq genes. Our study reveals that, out of eight studied mutation types, insertions are the only one that alters predicted transcription factor binding sites (TFBSs) in a statistically significant manner. We also find that 25 families of TFBSs have been altered by mutations in a statistically significant manner in the promoter regions we considered. Moreover, we find that the related transcription factors are, for example, prominent in processes related to intracellular signaling; cell fate; morphogenesis of organs and epithelium; development of urogenital system, epithelium, and tube; neuron fate commitment. Our study highlights the significance of studying mutations within the genes' regulatory regions and opens the way for further detailed investigations on this topic, particularly on the downstream affected pathways. 2012 Kamanu, Medvedeva, Schaefer, Jankovic, Archer and Bajic.

  14. A structural-based strategy for recognition of transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Beisi Xu

    Full Text Available Scanning through genomes for potential transcription factor binding sites (TFBSs) is becoming increasingly important in this post-genomic era. The position weight matrix (PWM) is the standard representation of TFBSs utilized when scanning through sequences for potential binding sites. However, many transcription factor (TF) motifs are short and highly degenerate, and methods utilizing PWMs to scan for sites are plagued by false positives. Furthermore, many important TFs do not have well-characterized PWMs, making identification of potential binding sites even more difficult. One approach to the identification of sites for these TFs has been to use the 3D structure of the TF to predict the DNA structure around the TF and then to generate a PWM from the predicted 3D complex structure. However, this approach is dependent on the similarity of the predicted structure to the native structure. We introduce here a novel approach to identify TFBSs utilizing structure information that can be applied to TFs without characterized PWMs, as long as a 3D complex structure (TF/DNA) exists. This approach utilizes an energy function that is uniquely trained on each structure. Our approach leads to increased prediction accuracy and robustness compared with those using a more general energy function. The software is freely available upon request.
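
For readers unfamiliar with the baseline that the abstract contrasts itself with, the following is a minimal sketch of conventional PWM scanning with log-odds scores. It is not the structure-based energy function of the paper; the toy frequency matrix, the uniform background and the threshold are all assumptions made for illustration.

```python
import math


def log_odds_matrix(freqs, background=0.25):
    """Convert per-position base frequencies (all non-zero) into log2-odds scores."""
    return [{b: math.log2(p / background) for b, p in col.items()} for col in freqs]


def scan(sequence, matrix, threshold):
    """Slide the matrix along the sequence and report windows scoring >= threshold."""
    width = len(matrix)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        score = sum(col[base] for col, base in zip(matrix, window))
        if score >= threshold:
            hits.append((start, window, score))
    return hits


# Toy 3-position motif favouring "ACG"; frequencies are invented.
FREQS = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.7, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
]
matrix = log_odds_matrix(FREQS)
for start, window, score in scan("TTACGTT", matrix, threshold=3.0):
    print(start, window, round(score, 2))
```

Because short, degenerate motifs reach any fixed threshold by chance many times in a genome, this kind of scan illustrates why the abstract describes PWM-based methods as prone to false positives.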

  15. Characterization of Heparin-binding Site of Tissue Transglutaminase

    Science.gov (United States)

    Wang, Zhuo; Collighan, Russell J.; Pytel, Kamila; Rathbone, Daniel L.; Li, Xiaoling; Griffin, Martin

    2012-01-01

    Tissue transglutaminase (TG2) is a multifunctional Ca2+-activated protein cross-linking enzyme secreted into the extracellular matrix (ECM), where it is involved in wound healing and scarring, tissue fibrosis, celiac disease, and metastatic cancer. Extracellular TG2 can also facilitate cell adhesion important in wound healing through a nontransamidating mechanism via its association with fibronectin, heparan sulfates (HS), and integrins. Regulating the mechanism by which TG2 is translocated into the ECM therefore provides a strategy for modulating these physiological and pathological functions of the enzyme. Here, through molecular modeling and mutagenesis, we have identified the HS-binding site of TG2, 202KFLKNAGRDCSRRSSPVYVGR222. We demonstrate the requirement of this binding site for translocation of TG2 into the ECM through a mechanism involving cell surface shedding of HS. By synthesizing a peptide NPKFLKNAGRDCSRRSS corresponding to the HS-binding site within TG2, we also demonstrate how this mimicking peptide can in isolation compensate for the RGD-induced loss of cell adhesion on fibronectin via binding to syndecan-4, leading to activation of PKCα, pFAK-397, and ERK1/2 and the subsequent formation of focal adhesions and actin cytoskeleton organization. A novel regulatory mechanism for TG2 translocation into the extracellular compartment that depends upon TG2 conformation and the binding of HS is proposed. PMID:22298777

  16. Eel calcitonin binding site distribution and antinociceptive activity in rats

    Energy Technology Data Exchange (ETDEWEB)

    Guidobono, F.; Netti, C.; Sibilia, V.; Villa, I.; Zamboni, A.; Pecile, A.

    1986-03-01

    The distribution of binding sites for (¹²⁵I)-eel-calcitonin (ECT) in the rat central nervous system, studied by an autoradiographic technique, showed concentrations of binding in the diencephalon, the brain stem and the spinal cord. Large accumulations of grains were seen in the hypothalamus, the amygdala, in the fasciculus medialis prosencephali, in the fasciculus longitudinalis medialis, in the ventrolateral part of the periventricular gray matter, in the lemniscus medialis and in the raphe nuclei. The density of grains in the reticular formation and in the nucleus tractus spinalis nervi trigemini was more moderate. In the spinal cord, grains were scattered throughout the dorsal horns. Binding of the ligand was displaced equally by cold ECT and by salmon CT (sCT), indicating that both peptides bind to the same receptors. Human CT was much weaker than sCT in displacing (¹²⁵I)-ECT binding. The administration of ECT into the brain ventricles of rats dose-dependently induced a significant and long-lasting enhancement of hot-plate latencies comparable with that obtained with sCT. The antinociceptive activity induced by ECT is compatible with the topographical distribution of binding sites for the peptide and is a further indication that fish CTs are active in the mammalian brain.

  17. Autologous peptides constitutively occupy the antigen binding site on Ia

    DEFF Research Database (Denmark)

    Buus, S; Sette, A; Colon, S M;

    1988-01-01

    Low molecular weight material associated with affinity-purified class II major histocompatibility complex (MHC) molecules of mouse (Ia) had the expected properties of peptides bound to the antigen binding site of Ia. Thus, the low molecular weight material derived from the I-Ad isotype...

  18. Automated benchmarking of peptide-MHC class I binding predictions

    DEFF Research Database (Denmark)

    Trolle, Thomas; Metushi, Imir G.; Greenbaum, Jason;

    2015-01-01

    Motivation: Numerous in silico methods predicting peptide binding to major histocompatibility complex (MHC) class I molecules have been developed over the last decades. However, the multitude of available prediction tools makes it non-trivial for the end-user to select which tool to use for a given...... the public access to frequent, up-to-date performance evaluations of all participating tools. To overcome potential selection bias in the data included in the IEDB, a strategy was implemented that suggests a set of peptides for which different prediction methods give divergent predictions as to their binding...

  19. Study of V2 vasopressin receptor hormone binding site using in silico methods.

    Science.gov (United States)

    Sebti, Yeganeh; Sardari, Soroush; Sadeghi, Hamid Mir Mohammad; Ghahremani, Mohammad Hossein; Innamorati, Giulio

    2015-01-01

    The antidiuretic effect of arginine vasopressin (AVP) is mediated by the vasopressin V2 receptor. The docking study of AVP as a ligand to the V2 receptor helps in identifying important amino acid residues that might be involved in AVP binding for predicting the lowest free energy state of the protein complex. Whereas previous researchers were not able to detect the exact site of the ligand-receptor binding, we designed the current study to identify the vasopressin V2 receptor hormone binding site using bioinformatic methods. The 3D structure of the nonapeptide hormone vasopressin was extracted from the Protein Data Bank. Since no suitable template resembling the V2 receptor was found, an ab initio approach was chosen to model the protein receptor. Using protein docking methods such as Hex protein-protein docking, the model of the V2 receptor was docked to the peptide ligand AVP to identify possible binding sites. The residues involved in the binding site are W293, W296, D297, A300, and P301. The lowest free energy state of the protein complex was predicted after mutation of the above residues. The gained energies permit us to compare the mutant forms with the native forms and help to assess critical changes such as positive and negative mutations, followed by ranking the best mutations. Based on the mutation/docking predictions, we found that some mutants, such as W293D and A300E, possess a positively inducing effect in ligand binding and some, such as A300R, present a negatively inducing effect in ligand binding.

  20. In Silico Docking, Molecular Dynamics and Binding Energy Insights into the Bolinaquinone-Clathrin Terminal Domain Binding Site

    Directory of Open Access Journals (Sweden)

    Mohammed K. Abdel-Hamid

    2014-05-01

    Full Text Available Clathrin-mediated endocytosis (CME) is a process that regulates selective internalization of important cellular cargo using clathrin-coated vesicles. Perturbation of this process has been linked to many diseases including cancer and neurodegenerative conditions. Chemical proteomics identified the marine metabolite, 2-hydroxy-5-methoxy-3-(((1S,4aS,8aS)-1,4a,5-trimethyl-1,2,3,4,4a,7,8,8a-octahydronaphthalen-2-yl)methyl)cyclohexa-2,5-diene-1,4-dione (bolinaquinone) as a clathrin inhibitor. While being an attractive medicinal chemistry target, the lack of data about bolinaquinone’s mode of binding to the clathrin enzyme represents a major limitation for its structural optimization. We have used a molecular modeling approach to rationalize the observed activity of bolinaquinone and to predict its mode of binding with the clathrin terminal domain (CTD). The applied protocol started by global rigid-protein docking followed by flexible docking, molecular dynamics and linear interaction energy calculations. The results revealed the potential of bolinaquinone to interact with various pockets within the CTD, including the clathrin-box binding site. The results also highlight the importance of electrostatic contacts over van der Waals interactions for proper binding between bolinaquinone and its possible binding sites. This study provides a novel model that has the potential to allow rapid elaboration of bolinaquinone analogues as a new class of clathrin inhibitors.

  1. CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins.

    Science.gov (United States)

    Khorshid, Mohsen; Rodak, Christoph; Zavolan, Mihaela

    2011-01-01

    The stability, localization and translation rate of mRNAs are regulated by a multitude of RNA-binding proteins (RBPs) that find their targets directly or with the help of guide RNAs. Among the experimental methods for mapping RBP binding sites, cross-linking and immunoprecipitation (CLIP) coupled with deep sequencing provides transcriptome-wide coverage as well as high resolution. However, partly due to their vast volume, the data that were so far generated in CLIP experiments have not been put in a form that enables fast and interactive exploration of binding sites. To address this need, we have developed the CLIPZ database and analysis environment. Binding site data for RBPs such as Argonaute 1-4, Insulin-like growth factor II mRNA-binding protein 1-3, TNRC6 proteins A-C, Pumilio 2, Quaking and Polypyrimidine tract binding protein can be visualized at the level of the genome and of individual transcripts. Individual users can upload their own sequence data sets while being able to limit the access to these data to specific users, and analyses of the public and private data sets can be performed interactively. CLIPZ, available at http://www.clipz.unibas.ch, aims to provide an open access repository of information for post-transcriptional regulatory elements.

  2. Discriminating between HuR and TTP binding sites using the k-spectrum kernel method

    Science.gov (United States)

    Goldberg, Debra S.; Dowell, Robin

    2017-01-01

    Background: The RNA binding proteins (RBPs) human antigen R (HuR) and Tristetraprolin (TTP) are known to exhibit competitive binding but have opposing effects on the bound messenger RNA (mRNA). How cells discriminate between the two proteins is an interesting problem. Machine learning approaches, such as support vector machines (SVMs), may be useful in the identification of discriminative features. However, this method has yet to be applied to studies of RNA binding protein motifs. Results: Applying the k-spectrum kernel to a support vector machine (SVM), we first verified the published binding sites of both HuR and TTP. Additional feature engineering highlighted the U-rich binding preference of HuR and the AU-rich binding preference of TTP. Domain adaptation along with multi-task learning was used to predict the common binding sites. Conclusion: The distinction between HuR and TTP binding appears to lie in subtle content features. HuR prefers strongly U-rich sequences whereas TTP prefers AU-rich sequences: with increasing A content, sequences are more likely to be bound only by TTP. Our model is consistent with competitive binding of the two proteins, particularly at intermediate AU-balanced sequences. This suggests that fine changes in the A/U balance within an untranslated region (UTR) can alter the binding and subsequent stability of the message. Both feature engineering and domain adaptation emphasized the extent to which these proteins recognize similar general sequence features. This work suggests that the k-spectrum kernel method could be useful when studying RNA binding proteins, and that domain adaptation techniques such as feature augmentation could be employed, particularly when examining RBPs with similar binding preferences. PMID:28333956
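
The k-spectrum kernel named above compares two sequences by the inner product of their k-mer count vectors. The sketch below shows only that kernel computation on invented RNA fragments; the SVM training, feature engineering and domain adaptation steps of the study are not reproduced, and the function names are illustrative.

```python
from collections import Counter


def spectrum(seq, k):
    """Counts of every overlapping k-mer in seq."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k):
    """k-spectrum kernel: dot product of the two k-mer count vectors."""
    counts_a, counts_b = spectrum(a, k), spectrum(b, k)
    return sum(count * counts_b[kmer] for kmer, count in counts_a.items())


# Invented fragments: one AU-rich, one U-rich.
au_rich = "AUUUA"
u_rich = "UUUUU"
print(spectrum_kernel(au_rich, u_rich, 3))   # shared 3-mers between the two
print(spectrum_kernel(au_rich, au_rich, 3))  # self-similarity
```

A matrix of such pairwise kernel values could be supplied to any SVM implementation that accepts a precomputed kernel; that step is omitted here to keep the example dependency-free.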

  3. Influence of binding energies of electrons on nuclear mass predictions

    Science.gov (United States)

    Tang, Jing; Niu, Zhong-Ming; Guo, Jian-You

    2016-07-01

    Nuclear mass contains a wealth of nuclear structure information, and has been widely employed to extract the nuclear effective interactions. The known nuclear mass is usually extracted from the experimental atomic mass by subtracting the masses of electrons and adding the binding energy of electrons in the atom. However, the binding energies of electrons are sometimes neglected in extracting the known nuclear masses. The influence of binding energies of electrons on nuclear mass predictions is carefully investigated in this work. If the binding energies of electrons are directly subtracted from the theoretical mass predictions, the rms deviations of nuclear mass predictions with respect to the known data are increased by about 200 keV for nuclei with Z, N ⩾ 8. Furthermore, by using the Coulomb energies between protons to absorb the binding energies of electrons, their influence on the rms deviations is significantly reduced to only about 10 keV for nuclei with Z, N ⩾ 8. However, the binding energies of electrons are still important for the heavy nuclei, about 150 keV for nuclei around Z = 100 and up to about 500 keV for nuclei around Z = 120. Therefore, it is necessary to consider the binding energies of electrons to reliably predict the masses of heavy nuclei at an accuracy of hundreds of keV. Supported by National Natural Science Foundation of China (11205004)
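
The conversion described above (nuclear mass = atomic mass minus electron masses plus total electron binding energy) can be sketched as follows. The abstract does not state which expression it uses for the electron binding energy; the power-law fit below is an assumption, taken from the empirical parameterization commonly quoted alongside atomic mass evaluations, and the function names are illustrative.

```python
ELECTRON_MASS_MEV = 0.51099895  # electron rest mass in MeV


def electron_binding_energy_mev(z):
    """Approximate total electron binding energy of a neutral atom, in MeV.

    Assumed empirical fit (not taken from the abstract):
    B_e(Z) = 14.4381 * Z**2.39 + 1.55468e-6 * Z**5.35, in eV.
    """
    return (14.4381 * z ** 2.39 + 1.55468e-6 * z ** 5.35) * 1e-6


def nuclear_mass_mev(atomic_mass_mev, z):
    """Nuclear mass from atomic mass: subtract Z electron masses, add B_e(Z)."""
    return atomic_mass_mev - z * ELECTRON_MASS_MEV + electron_binding_energy_mev(z)


for z in (8, 50, 100, 120):
    print(z, round(electron_binding_energy_mev(z) * 1e3, 1), "keV")
```

Under this assumed fit the total electron binding energy grows steeply with Z, which is in line with the abstract's point that the correction matters most for heavy nuclei.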

  4. Bifunctional avidin with covalently modifiable ligand binding site.

    Directory of Open Access Journals (Sweden)

    Jenni Leppiniemi

    Full Text Available The extensive use of avidin and streptavidin in life sciences originates from the extraordinarily tight biotin-binding affinity of these tetrameric proteins. Numerous studies have been performed to modify the biotin-binding affinity of (strept)avidin to improve the existing applications. Even so, (strept)avidin greatly favours its natural ligand, biotin. Here we engineered the biotin-binding pocket of avidin with a single point mutation S16C and thus introduced a chemically active thiol group, which could be covalently coupled with thiol-reactive molecules. This approach was applied to the previously reported bivalent dual chain avidin by modifying one binding site while preserving the other one intact. Maleimide was then coupled to the modified binding site, resulting in a decrease in biotin affinity. Furthermore, we showed that this thiol could be covalently coupled to other maleimide derivatives, for instance fluorescent labels, allowing intratetrameric FRET. The bifunctional avidins described here provide improved and novel tools for applications such as the biofunctionalization of surfaces.

  5. PhyloScan: identification of transcription factor binding sites using cross-species evidence

    Directory of Open Access Journals (Sweden)

    Newberg Lee A

    2007-01-01

    Full Text Available Abstract. Background: When transcription factor binding sites are known for a particular transcription factor, it is possible to construct a motif model that can be used to scan sequences for additional sites. However, few statistically significant sites are revealed when a transcription factor binding site motif model is used to scan a genome-scale database. Methods: We have developed a scanning algorithm, PhyloScan, which combines evidence from matching sites found in orthologous data from several related species with evidence from multiple sites within an intergenic region, to better detect regulons. The orthologous sequence data may be multiply aligned, unaligned, or a combination of aligned and unaligned. In aligned data, PhyloScan statistically accounts for the phylogenetic dependence of the species contributing data to the alignment and, in unaligned data, the evidence for sites is combined assuming phylogenetic independence of the species. The statistical significance of the gene predictions is calculated directly, without employing training sets. Results: In a test of our methodology on synthetic data modeled on seven Enterobacteriales, four Vibrionales, and three Pasteurellales species, PhyloScan produces better sensitivity and specificity than MONKEY, an advanced scanning approach that also searches a genome for transcription factor binding sites using phylogenetic information. The application of the algorithm to real sequence data from seven Enterobacteriales species identifies novel Crp and PurR transcription factor binding sites, thus providing several new potential sites for these transcription factors. These sites enable targeted experimental validation and thus further delineation of the Crp and PurR regulons in E. coli. Conclusion: Better sensitivity and specificity can be achieved through a combination of (1) using mixed alignable and non-alignable sequence data and (2) combining evidence from multiple sites within an intergenic

  6. Ceruloplasmin has two nearly identical sites that bind myeloperoxidase.

    Science.gov (United States)

    Bakhautdin, Bakytzhan; Goksoy Bakhautdin, Esen; Fox, Paul L

    2014-10-31

    Ceruloplasmin (Cp) is a copper-containing ferroxidase with potent antioxidant activity. Cp is expressed by hepatocytes and activated macrophages and is known as a physiologic inhibitor of myeloperoxidase (MPO). The enzymatic activity of MPO produces anti-microbial agents and strong prooxidants such as hypochlorous acid and has the potential to damage host tissue at sites of inflammation and infection. Thus the Cp-MPO interaction and the resulting inhibition of MPO have previously been suggested as an important control mechanism of excessive MPO activity. Our aim in this study was to identify the minimal Cp domain or peptide that interacts with MPO. We first confirmed the Cp-MPO interaction by ELISA and surface plasmon resonance (SPR). SPR analysis of the interaction yielded an affinity of 30 nM between Cp and MPO. We then designed and synthesized 87 overlapping peptides spanning the entire amino acid sequence of Cp. Each peptide was tested for binding to MPO by direct binding ELISA. Two of the 87 peptides, P18 and P76, strongly interacted with MPO. Amino acid sequence analysis of the identified peptides revealed high sequence and structural homology between them. Further analysis of the Cp crystal structure with PyMOL software revealed that both peptides represent surface-exposed sites of Cp and face nearly the same direction. To confirm our finding, we raised anti-P18 antiserum in rabbit and demonstrated that this antiserum disrupts Cp-MPO binding and rescues MPO activity. Collectively, our results confirm the Cp-MPO interaction and identify two nearly identical sites on Cp that specifically bind MPO. We propose that inhibition of MPO by Cp requires two nearly identical sites on Cp to bind homodimeric MPO simultaneously and at an angle of at least 120 degrees, which, in turn, exerts tension on MPO and results in conformational change.

  7. The magic spot: a ppGpp binding site on E. coli RNA polymerase responsible for regulation of transcription initiation.

    Science.gov (United States)

    Ross, Wilma; Vrentas, Catherine E; Sanchez-Vazquez, Patricia; Gaal, Tamas; Gourse, Richard L

    2013-05-01

    The global regulatory nucleotide ppGpp ("magic spot") regulates transcription from a large subset of Escherichia coli promoters, illustrating how small molecules can control gene expression promoter-specifically by interacting with RNA polymerase (RNAP) without binding to DNA. However, ppGpp's target site on RNAP, and therefore its mechanism of action, has remained unclear. We report here a binding site for ppGpp on E. coli RNAP, identified by crosslinking, protease mapping, and analysis of mutant RNAPs that fail to respond to ppGpp. A strain with a mutant ppGpp binding site displays properties characteristic of cells defective for ppGpp synthesis. The binding site is at an interface of two RNAP subunits, ω and β', and its position suggests an allosteric mechanism of action involving restriction of motion between two mobile RNAP modules. Identification of the binding site allows prediction of bacterial species in which ppGpp exerts its effects by targeting RNAP.

  8. Structural descriptor database: a new tool for sequence-based functional site prediction

    Directory of Open Access Journals (Sweden)

    Vasconcelos Ana

    2008-11-01

    Full Text Available Abstract. Background: The Structural Descriptor Database (SDDB) is a web-based tool that predicts the function of proteins and functional site positions based on the structural properties of related protein families. Structural alignments and functional residues of a known protein set (defined as the training set) are used to build special Hidden Markov Models (HMM) called HMM descriptors. SDDB uses previously calculated and stored HMM descriptors for predicting active sites, binding residues, and protein function. The database integrates biologically relevant data filtered from several databases such as PDB, PDBSUM, CSA and SCOP. It accepts queries in FASTA format and predicts functional residue positions, protein-ligand interactions, and protein function, based on the SCOP database. Results: To assess the SDDB performance, we used different data sets. The trypsin-like serine protease data set assessed how well SDDB predicts functional sites when curated data is available. The SCOP family data set was used to analyze SDDB performance by using training data extracted from PDBSUM (binding sites) and from CSA (active sites). The ATP-binding experiment was used to compare our approach with the most current method. For all evaluations, significant improvements were obtained with SDDB. Conclusion: SDDB performed better when reliable training data was available. SDDB worked better in predicting active sites than binding sites because the former are more conserved than the latter. Nevertheless, by using our prediction method we obtained results with precision above 70%.

  9. (3H)desipramine binding to rat brain tissue: binding to both noradrenaline uptake sites and sites not related to noradrenaline neurons

    Energy Technology Data Exchange (ETDEWEB)

    Baeckstroem, I.T.Ro.; Ross, S.B.; Marcusson, J.O.

    1989-04-01

    The pharmacological and biochemical characteristics of (3H)desipramine binding to rat brain tissue were investigated. Competition studies with noradrenaline, nisoxetine, nortriptyline, and desipramine suggested the presence of more than one (3H)desipramine binding site. Most of the noradrenaline-sensitive binding represented a high-affinity site, and this site appeared to be the same as the high-affinity site of nisoxetine-sensitive binding. The (3H)desipramine binding sites were abolished by protease treatment, a result suggesting that the binding sites are protein in nature. When specific binding was defined by 0.1 microM nisoxetine, the binding was saturable and fitted a single-site binding model with a binding affinity of approximately 1 nM. This binding fraction was abolished by lesioning of the noradrenaline neurons with the noradrenaline neurotoxin N-(2-chloroethyl)-N-ethyl-2-bromobenzylamine (DSP4). In contrast, when 10 microM nisoxetine was used to define the specific binding, the binding was not saturable over the nanomolar range, but the binding fitted a two-site binding model with KD values of 0.5 and greater than 100 nM for the high- and low-affinity components, respectively. The high-affinity site was abolished after DSP4 lesioning, whereas the low-affinity site remained. The binding capacity (Bmax) for binding defined by 0.1 microM nisoxetine varied between brain regions, with very low density in the striatum (Bmax not possible to determine), 60-90 fmol/mg of protein in cortical areas and cerebellum, and 120 fmol/mg of protein in the hypothalamus. The binding capacities of these high-affinity sites correlated significantly with the regional distribution of (3H)noradrenaline uptake but not with 5-(3H)hydroxytryptamine uptake.
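    The single-site and two-site saturation models mentioned in this abstract can be written out as a short Python sketch. The function names are hypothetical. The KD of about 1 nM and the Bmax of 120 fmol/mg come from the abstract; the low-affinity Bmax and the exact low-affinity KD in the demo are placeholder assumptions, since the abstract reports only "greater than 100 nM" and no capacity for that site.

```python
def one_site_binding(ligand_nm, bmax, kd_nm):
    """Specific binding for a single saturable site: B = Bmax * L / (KD + L)."""
    return bmax * ligand_nm / (kd_nm + ligand_nm)


def two_site_binding(ligand_nm, bmax_high, kd_high_nm, bmax_low, kd_low_nm):
    """Sum of two independent saturable sites (high and low affinity)."""
    return (one_site_binding(ligand_nm, bmax_high, kd_high_nm)
            + one_site_binding(ligand_nm, bmax_low, kd_low_nm))


if __name__ == "__main__":
    for ligand in (0.1, 0.5, 1.0, 5.0, 20.0):
        single = one_site_binding(ligand, bmax=120.0, kd_nm=1.0)
        # bmax_low=500 and kd_low_nm=100 are illustrative placeholders only.
        double = two_site_binding(ligand, 120.0, 0.5, 500.0, 100.0)
        print(f"{ligand:6.1f} nM  one-site {single:7.2f}  two-site {double:7.2f}")
```

    With a low-affinity component whose KD lies far above the ligand range, the two-site curve keeps rising across the nanomolar range, which is the non-saturable behaviour the abstract describes for binding defined by 10 microM nisoxetine.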

  10. Better estimation of protein-DNA interaction parameters improve prediction of functional sites

    Directory of Open Access Journals (Sweden)

    O'Flanagan Ruadhan A

    2008-12-01

    Full Text Available Abstract. Background: Characterizing transcription factor binding motifs is a common bioinformatics task. For transcription factors with variable binding sites, we need to get many suboptimal binding sites in our training dataset to get accurate estimates of free energy penalties for deviating from the consensus DNA sequence. One procedure to do that involves a modified SELEX (Systematic Evolution of Ligands by Exponential Enrichment) method designed to produce many such sequences. Results: We analyzed low stringency SELEX data for E. coli Catabolic Activator Protein (CAP), and we show here that appropriate quantitative analysis improves our ability to predict in vitro affinity. To obtain the large number of sequences required for this analysis we used a SELEX SAGE protocol developed by Roulet et al. The sequences obtained were subjected to bioinformatic analysis. The resulting bioinformatic model characterizes the sequence specificity of the protein more accurately than the sequence specificities predicted from previous analyses that used only the few known binding sites available in the literature. The consequences of this increase in accuracy for prediction of in vivo binding sites (and especially functional ones) in the E. coli genome are also discussed. We measured the dissociation constants of several putative CAP binding sites by EMSA (Electrophoretic Mobility Shift Assay) and compared the affinities to the bioinformatics scores provided by methods like the weight matrix method and QPMEME (Quadratic Programming Method of Energy Matrix Estimation) trained on known binding sites as well as on the new sites from SELEX SAGE data. We also checked predicted genome sites for conservation in the related species S. typhimurium. We found that bioinformatics scores based on SELEX SAGE data do better in terms of prediction of physical binding energies as well as in detecting functional sites. Conclusion: We think that training binding site detection
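    The weight matrix method named in this abstract can be illustrated with a generic log-odds position weight matrix in Python. This is a minimal sketch of the general technique, not the authors' implementation and not QPMEME; the function names, the pseudocount, the uniform background and the toy training sites are all assumptions made for illustration.

```python
import math

BASES = "ACGT"


def build_weight_matrix(sites, pseudocount=1.0, background=0.25):
    """Log-odds weight matrix (one dict per position) from aligned sites."""
    width = len(sites[0])
    if any(len(site) != width for site in sites):
        raise ValueError("all training sites must have the same length")
    matrix = []
    for pos in range(width):
        counts = {base: pseudocount for base in BASES}
        for site in sites:
            counts[site[pos]] += 1
        total = sum(counts.values())
        matrix.append({base: math.log2((counts[base] / total) / background)
                       for base in BASES})
    return matrix


def score_window(matrix, window):
    """Sum of per-position log-odds scores for one candidate site."""
    return sum(column[base] for column, base in zip(matrix, window))


def best_hit(matrix, sequence):
    """Return (score, start) of the highest-scoring window in a sequence."""
    width = len(matrix)
    return max((score_window(matrix, sequence[i:i + width]), i)
               for i in range(len(sequence) - width + 1))


if __name__ == "__main__":
    toy_sites = ["TGTGA", "TGTGA", "TGAGA", "TGTGC"]  # hypothetical sites
    pwm = build_weight_matrix(toy_sites)
    print(best_hit(pwm, "AAATGTGACC"))
```

    A larger and more varied training set, such as the suboptimal sites a low stringency SELEX experiment is designed to produce, changes only the counts that enter `build_weight_matrix`; the scoring step stays the same.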

  11. Predicting nucleic acid binding interfaces from structural models of proteins.

    Science.gov (United States)

    Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

    2012-02-01

    The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acid binding interface on structural models of proteins. Employing a combined patch approach, we show that patches extracted from an ensemble of models better predict the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure.

  12. The inhibitory binding site(s) of Zn2+ in cytochrome c oxidase.

    Science.gov (United States)

    Francia, Francesco; Giachini, Lisa; Boscherini, Federico; Venturoli, Giovanni; Capitanio, Giuseppe; Martino, Pietro Luca; Papa, Sergio

    2007-02-20

    EXAFS analysis of Zn binding site(s) in bovine-heart cytochrome c oxidase and characterization of the inhibitory effect of internal zinc on respiratory activity and proton pumping of the liposome reconstituted oxidase are presented. EXAFS identifies tetrahedral coordination site(s) for Zn(2+) with two N-histidine imidazoles, one N-histidine imidazole or N-lysine and one O-COOH (glutamate or aspartate), possibly located at the entry site of the proton conducting D pathway in the oxidase and involved in inhibition of the oxygen reduction catalysis and proton pumping by internally trapped zinc.

  13. Minimal Zn2+ Binding Site of Amyloid-β

    Science.gov (United States)

    Tsvetkov, Philipp O.; Kulikova, Alexandra A.; Golovin, Andrey V.; Tkachev, Yaroslav V.; Archakov, Alexander I.; Kozin, Sergey A.; Makarov, Alexander A.

    2010-01-01

    Zinc-induced aggregation of amyloid-β peptide (Aβ) is a hallmark molecular feature of Alzheimer's disease. Here we provide direct thermodynamic evidence that elucidates the role of the Aβ region 6–14 as the minimal Zn2+ binding site wherein the ion is coordinated by His6, Glu11, His13, and His14. With the help of isothermal titration calorimetry and quantum mechanics/molecular mechanics simulations, the region 11–14 was determined as the primary zinc recognition site and considered an important drug-target candidate to prevent Zn2+-induced aggregation of Aβ. PMID:21081056

  14. Minimal Zn(2+) binding site of amyloid-β.

    Science.gov (United States)

    Tsvetkov, Philipp O; Kulikova, Alexandra A; Golovin, Andrey V; Tkachev, Yaroslav V; Archakov, Alexander I; Kozin, Sergey A; Makarov, Alexander A

    2010-11-17

    Zinc-induced aggregation of amyloid-β peptide (Aβ) is a hallmark molecular feature of Alzheimer's disease. Here we provide direct thermodynamic evidence that elucidates the role of the Aβ region 6-14 as the minimal Zn(2+) binding site wherein the ion is coordinated by His(6), Glu(11), His(13), and His(14). With the help of isothermal titration calorimetry and quantum mechanics/molecular mechanics simulations, the region 11-14 was determined as the primary zinc recognition site and considered an important drug-target candidate to prevent Zn(2+)-induced aggregation of Aβ.

  15. Evolutionary computation for discovery of composite transcription factor binding sites

    OpenAIRE

    Fogel, Gary B.; Porto, V. William; Varga, Gabor; Dow, Ernst R.; Craven, Andrew M.; Powers, David M.; Harlow, Harry B.; Su, Eric W.; Onyia, Jude E.; Su, Chen

    2008-01-01

    Previous research demonstrated the use of evolutionary computation for the discovery of transcription factor binding sites (TFBS) in promoter regions upstream of coexpressed genes. However, it remained unclear whether or not composite TFBS elements, commonly found in higher organisms where two or more TFBSs form functional complexes, could also be identified by using this approach. Here, we present an important refinement of our previous algorithm and test the identification of composite elem...

  16. Photoaffinity labeling of the pactamycin binding site on eubacterial ribosomes

    Energy Technology Data Exchange (ETDEWEB)

    Tejedor, F.; Amils, R.; Ballesta, J.P.

    1985-07-02

    Pactamycin, an inhibitor of the initial steps of protein synthesis, has an acetophenone group in its chemical structure that makes the drug a potentially photoreactive molecule. In addition, the presence of a phenolic residue makes it easily susceptible to radioactive labeling. Through iodination, one radioactive derivative of pactamycin has been obtained with biological activities similar to the unmodified drug when tested on in vivo and cell-free systems. With the use of (125I)iodopactamycin, ribosomes of Escherichia coli have been photolabeled under conditions that preserve the activity of the particles and guarantee the specificity of the binding sites. Under these conditions, RNA is preferentially labeled when free, small ribosomal subunits are photolabeled, but proteins are the main target in the whole ribosome. This indicates that an important conformational change takes place in the binding site on association of the two subunits. The major labeled proteins are S2, S4, S18, S21, and L13. These proteins in the pactamycin binding site are probably related to the initiation step of protein synthesis.

  17. Prediction of MHC class I binding peptides, using SVMHC

    Directory of Open Access Journals (Sweden)

    Elofsson Arne

    2002-09-01

    Full Text Available Abstract. Background: T-cells are key players in regulating a specific immune response. Activation of cytotoxic T-cells requires recognition of specific peptides bound to Major Histocompatibility Complex (MHC) class I molecules. MHC-peptide complexes are potential tools for diagnosis and treatment of pathogens and cancer, as well as for the development of peptide vaccines. Only one in 100 to 200 potential binders actually binds to a certain MHC molecule; therefore, a good prediction method for MHC class I binding peptides can reduce the number of candidate binders that need to be synthesized and tested. Results: Here, we present a novel approach, SVMHC, based on support vector machines to predict the binding of peptides to MHC class I molecules. This method seems to perform slightly better than two profile based methods, SYFPEITHI and HLA_BIND. The implementation of SVMHC is quite simple and does not involve any manual steps; therefore, as more data become available it is trivial to provide prediction for more MHC types. SVMHC currently contains prediction for 26 MHC class I types from the MHCPEP database or alternatively 6 MHC class I types from the higher quality SYFPEITHI database. The prediction models for these MHC types are implemented in a public web service available at http://www.sbc.su.se/svmhc/. Conclusions: Prediction of MHC class I binding peptides using Support Vector Machines shows high performance and is easy to apply to a large number of MHC class I types. As more peptide data are put into MHC databases, SVMHC can easily be updated to give prediction for additional MHC class I types. We suggest that the number of binding peptides needed for SVM training is at least 20 sequences.

  18. Function of the PEX19-binding site of human adrenoleukodystrophy protein as targeting motif in man and yeast. PMP targeting is evolutionarily conserved.

    Science.gov (United States)

    Halbach, André; Lorenzen, Stephan; Landgraf, Christiane; Volkmer-Engert, Rudolf; Erdmann, Ralf; Rottensteiner, Hanspeter

    2005-06-01

    We predicted in human peroxisomal membrane proteins (PMPs) the binding sites for PEX19, a key player in the topogenesis of PMPs, by virtue of an algorithm developed for yeast PMPs. The best scoring PEX19-binding site was found in the adrenoleukodystrophy protein (ALDP). The identified site was indeed bound by human PEX19 and was also recognized by the orthologous yeast PEX19 protein. Likewise, both human and yeast PEX19 bound with comparable affinities to the PEX19-binding site of the yeast PMP Pex13p. Interestingly, the identified PEX19-binding site of ALDP coincided with its previously determined targeting motif. We corroborated the requirement of the ALDP PEX19-binding site for peroxisomal targeting in human fibroblasts and showed that the minimal ALDP fragment targets correctly also in yeast, again in a PEX19-binding site-dependent manner. Furthermore, the human PEX19-binding site of ALDP proved interchangeable with that of yeast Pex13p in an in vivo targeting assay. Finally, we showed in vitro that most of the predicted binding sequences of human PMPs represent true binding sites for human PEX19, indicating that human PMPs harbor common PEX19-binding sites that do resemble those of yeast. Our data clearly revealed a role for PEX19-binding sites as PMP-targeting motifs across species, thereby demonstrating the evolutionary conservation of PMP signal sequences from yeast to man.

  19. Direct GR Binding Sites Potentiate Clusters of TF Binding across the Human Genome.

    Science.gov (United States)

    Vockley, Christopher M; D'Ippolito, Anthony M; McDowell, Ian C; Majoros, William H; Safi, Alexias; Song, Lingyun; Crawford, Gregory E; Reddy, Timothy E

    2016-08-25

    The glucocorticoid receptor (GR) binds the human genome at >10,000 sites but only regulates the expression of hundreds of genes. To determine the functional effect of each site, we measured the glucocorticoid (GC) responsive activity of nearly all GR binding sites (GBSs) captured using chromatin immunoprecipitation (ChIP) in A549 cells. 13% of GBSs assayed had GC-induced activity. The responsive sites were defined by direct GR binding via a GC response element (GRE) and exclusively increased reporter-gene expression. Meanwhile, most GBSs lacked GC-induced reporter activity. The non-responsive sites had epigenetic features of steady-state enhancers and clustered around direct GBSs. Together, our data support a model in which clusters of GBSs observed with ChIP-seq reflect interactions between direct and tethered GBSs over tens of kilobases. We further show that those interactions can synergistically modulate the activity of direct GBSs and may therefore play a major role in driving gene activation in response to GCs.

  20. Identification of a chloride ion binding site in Na(+)/Cl(-)-dependent transporters.

    Science.gov (United States)

    Forrest, Lucy R; Tavoulari, Sotiria; Zhang, Yuan-Wei; Rudnick, Gary; Honig, Barry

    2007-07-31

    The recent determination of the crystal structure of the leucine transporter from Aquifex aeolicus (aaLeuT) has provided significant insights into the function of neurotransmitter:sodium symporters. Transport by aaLeuT is Cl(-) independent, whereas many neurotransmitter:sodium symporters from higher organisms depend on Cl(-) ions. However, the only Cl(-) ion identified in the aaLeuT structure interacts with nonconserved residues in extracellular loops, and thus the relevance of this binding site is unclear. Here, we use calculations of pK(A)s and homology modeling to predict the location of a functionally important Cl(-) binding site in serotonin transporter and other Cl(-)-dependent transporters. We validate our model through the site-directed mutagenesis of residues predicted to coordinate the Cl(-) ion and through the observation of sequence conservation patterns in other Cl(-)-dependent transporters. The proposed site is located midway across the membrane and is formed by residues from transmembrane helices 2, 6, and 7. It is close to the Na1 sodium binding site, thus providing an explanation for the coupling of Cl(-) and Na(+) ions during transport. Other implications of the model are also discussed.

  1. Predicting where small molecules bind at protein-protein interfaces.

    Directory of Open Access Journals (Sweden)

    Peter Walter

    Full Text Available Small molecules that bind at protein-protein interfaces may either block or stabilize protein-protein interactions in cells. Thus, some of these binding interfaces may turn into prospective targets for drug design. Here, we collected 175 pairs of protein-protein (PP) complexes and protein-ligand (PL) complexes with known three-dimensional structures for which (1) one protein from the PP complex shares at least 40% sequence identity with the protein from the PL complex, and (2) the interface regions of these proteins overlap at least partially with each other. We found that those residues of the interfaces that may bind the other protein as well as the small molecule are evolutionarily more conserved on average, have a higher tendency of being located in pockets and expose a smaller fraction of their surface area to the solvent than the remaining protein-protein interface region. Based on these findings we derived a statistical classifier that predicts patches at binding interfaces that have a higher tendency to bind small molecules. We applied this new prediction method to more than 10,000 interfaces from the Protein Data Bank. For several complexes related to apoptosis the predicted binding patches were in direct contact with co-crystallized small molecules.

  2. Calciomics: prediction and analysis of EF-hand calcium binding proteins by protein engineering

    Institute of Scientific and Technical Information of China (English)

    Yang, Jenny Jie

    2010-01-01

    Ca2+ plays a pivotal role in the physiology and biochemistry of prokaryotic and mammalian organisms. Viruses also utilize the universal Ca2+ signal to create a specific cellular environment to achieve coexistence with the host, and to propagate. In this paper we first describe our development of a grafting approach to understand site-specific Ca2+ binding properties of EF-hand proteins with a helix-loop-helix Ca2+ binding motif, then summarize our prediction and identification of EF-hand Ca2+ binding sites on a genome-wide scale in bacteria and viruses, and next report the application of the grafting approach to probe the metal binding capability of predicted EF-hand motifs within the streptococcal hemoprotein receptor (Shr) of Streptococcus pyogenes and the nonstructural protein 1 (nsP1) of Sindbis virus. When methods such as the grafting approach are developed in conjunction with prediction algorithms, we are better able to probe continuous Ca2+-binding sites that have been previously underrepresented due to the limitation of conventional methodology.

  3. Steered molecular dynamics study of inhibitor binding in the internal binding site in dehaloperoxidase-hemoglobin.

    Science.gov (United States)

    Zhang, Zhisen; Santos, Andrew P; Zhou, Qing; Liang, Lijun; Wang, Qi; Wu, Tao; Franzen, Stefan

    2016-04-01

    The binding free energy of 4-bromophenol (4-BP), an inhibitor that binds in the internal binding site in dehaloperoxidase-hemoglobin (DHP) was calculated using Molecular Dynamics (MD) methods combined with pulling or umbrella sampling. The effects of systematic changes in the pulling speed, pulling force constant and restraint force constant on the calculated potential of mean force (PMF) are presented in this study. The PMFs calculated using steered molecular dynamics (SMD) were validated by umbrella sampling (US) in the strongly restrained regime. A series of restraint force constants ranging from 1000 down to 5 kJ/(mol nm(2)) were used in SMD simulations. This range was validated using US, however noting that weaker restraints give rise to a broader sampling of configurations. This comparison was further tested by a pulling simulation conducted without any restraints, which was observed to have a value closest to the experimentally measured free energy for binding of 4-BP to DHP based on ultraviolet-visible (UV-vis) and resonance Raman spectroscopies. The protein-inhibitor system is well suited for fundamental study of free energy calculations because the DHP protein is relatively small and the inhibitor is quite rigid. Simulation configuration structures are compared to the X-ray crystallography structures of the binding site of 4-BP in the distal pocket above the heme.

  4. Predicting bioactive conformations and binding modes of macrocycles

    Science.gov (United States)

    Anighoro, Andrew; de la Vega de León, Antonio; Bajorath, Jürgen

    2016-10-01

    Macrocyclic compounds are attracting increasing interest in drug discovery. It is often thought that these large and chemically complex molecules provide promising candidates to address difficult targets and interfere with protein-protein interactions. From a computational viewpoint, these molecules are difficult to treat. For example, flexible docking of macrocyclic compounds is hindered by the limited ability of current docking approaches to optimize conformations of extended ring systems for pose prediction. Herein, we report predictions of bioactive conformations of macrocycles using conformational search and binding modes using docking. For about 70 % of the tested macrocycles, conformational ensembles generated using a specialized search technique contained accurate bioactive conformations. However, these conformations were difficult to identify on the basis of conformational energies. Moreover, docking calculations with limited ligand flexibility starting from individual low energy conformations rarely yielded highly accurate binding modes. In about 40 % of the test cases, binding modes were approximated with reasonable accuracy. However, when conformational ensembles were subjected to rigid body docking, an increase in meaningful binding mode predictions to more than 50 % of the test cases was observed. Electrostatic effects did not contribute to these predictions in a positive or negative manner. Rather, achieving shape complementarity at macrocycle-target interfaces was a decisive factor. In summary, a combined computational protocol using pre-computed conformational ensembles of macrocycles as a starting point for docking shows promise in modeling binding modes of macrocyclic compounds.

  5. Oxypred: Prediction and Classification of Oxygen-Binding Proteins

    Institute of Scientific and Technical Information of China (English)

    Muthukrishnan, S.; Garg, Aarti; Raghava, G.P.S.

    2007-01-01

    This study describes a method for predicting and classifying oxygen-binding proteins. Firstly, support vector machine (SVM) modules were developed using amino acid composition and dipeptide composition for predicting oxygen-binding proteins, and achieved maximum accuracy of 85.5% and 87.8%, respectively. Secondly, an SVM module was developed based on amino acid composition, classifying the predicted oxygen-binding proteins into six classes with accuracy of 95.8%, 97.5%, 97.5%, 96.9%, 99.4%, and 96.0% for erythrocruorin, hemerythrin, hemocyanin, hemoglobin, leghemoglobin, and myoglobin proteins, respectively. Finally, an SVM module was developed using dipeptide composition for classifying the oxygen-binding proteins, and achieved maximum accuracy of 96.1%, 98.7%, 98.7%, 85.6%, 99.6%, and 93.3% for the above six classes, respectively. All modules were trained and tested by five-fold cross validation. Based on the above approach, a web server Oxypred was developed for predicting and classifying oxygen-binding proteins (available from http://www.imtech.res.in/raghava/oxypred/).
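    The two feature encodings named in this abstract, amino acid composition and dipeptide composition, can be sketched in Python as below. The function names are hypothetical and the code shows only the standard definitions of these features (20 and 400 relative frequencies); it is not the Oxypred implementation, and the SVM training step is not shown.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def amino_acid_composition(sequence):
    """20-dimensional vector of residue frequencies."""
    sequence = sequence.upper()
    if not sequence:
        raise ValueError("sequence must not be empty")
    return [sequence.count(aa) / len(sequence) for aa in AMINO_ACIDS]


def dipeptide_composition(sequence):
    """400-dimensional vector of overlapping dipeptide frequencies."""
    sequence = sequence.upper()
    if len(sequence) < 2:
        raise ValueError("sequence must contain at least two residues")
    counts = {}
    # Slide a window of width 2 so overlapping pairs are all counted.
    for i in range(len(sequence) - 1):
        pair = sequence[i:i + 2]
        counts[pair] = counts.get(pair, 0) + 1
    total = len(sequence) - 1
    return [counts.get(a + b, 0) / total
            for a, b in product(AMINO_ACIDS, repeat=2)]


if __name__ == "__main__":
    toy = "MVLSEGEWQLVLHVWAKVEA"  # arbitrary example sequence
    print(len(amino_acid_composition(toy)), len(dipeptide_composition(toy)))
```

    Either vector has a fixed length regardless of protein length, which is what lets sequences of different sizes be fed to a classifier such as an SVM.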

  6. Molecular modelling and competition binding study of Br-noscapine and colchicine provide insight into noscapinoid-tubulin binding site.

    Science.gov (United States)

    Naik, Pradeep K; Santoshi, Seneha; Rai, Ankit; Joshi, Harish C

    2011-06-01

    We have previously discovered the tubulin-binding anti-cancer properties of noscapine and its derivatives (noscapinoids). Here, we present three lines of evidence that noscapinoids bind at or near the well studied colchicine binding site of tubulin: (1) in silico molecular docking studies of Br-noscapine and noscapine yield the highest docking score with the well characterised colchicine-binding site from the co-crystal structure; (2) the molecular mechanics-generalized Born/surface area (MM-GB/SA) scoring results ΔΔG(bind-cald) for both noscapine and Br-noscapine (3.915 and 3.025 kcal/mol) are in reasonably good agreement with our experimentally determined binding affinity (ΔΔG(bind-Expt) of 3.570 and 2.988 kcal/mol, derived from K(d) values); and (3) Br-noscapine competes with colchicine binding to tubulin. The simplest interpretation of these collective data is that Br-noscapine binds tubulin at a site overlapping with, or very close to, the colchicine-binding site of tubulin. However, we cannot rule out the formal possibility that Br-noscapine binds to a site distinct and distant from the colchicine-binding site and negatively influences colchicine binding to tubulin from there.

  7. Soybean β-glucan binding sites display maximal affinity for a heptaglucoside phytoalexin-elicitor

    Energy Technology Data Exchange (ETDEWEB)

    Cosio, E.G.; Waldmueller, T.; Frey, T.; Ebel, J. (Biologisches Institut II der Universitat Freiburg (West Germany))

    1990-05-01

    The affinity of soybean β-glucan-binding sites for a synthetic heptaglucan elicitor was tested in a ligand-competition assay against a (125)I-labeled 1,3-1,6-β-glucan preparation (avg. DP=20). Half-maximal displacement of label (IC(50)) was obtained at 9 nM heptaglucan, the highest affinity of all fractions tested to date. Displacement followed a uniform sigmoidal pattern and was complete at 1 µM, indicating access of heptaglucan to all sites available to the labeled elicitor. A mathematical model was used to predict IC(50) values according to the DP of glucan fragments obtained from fungal cell walls. The lowest IC(50) predicted by this model is 3 nM. Binding affinity of the glucans was compared with their elicitor activity in a bioassay.
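
    To illustrate how a half-maximal displacement at 9 nM relates to essentially complete displacement at 1 µM, the sketch below uses a simple one-site competition curve. The model form and the Hill slope of 1 are assumptions made for illustration; the abstract does not state which model the authors fitted, and the function name is hypothetical.

```python
def fraction_label_bound(competitor_nM, ic50_nM, hill=1.0):
    """Fraction of labeled ligand still bound at a given competitor concentration.

    Assumes a one-site sigmoidal displacement curve:
        bound = 1 / (1 + ([competitor] / IC50) ** hill)
    """
    if competitor_nM < 0 or ic50_nM <= 0:
        raise ValueError("concentrations must be non-negative and IC50 positive")
    return 1.0 / (1.0 + (competitor_nM / ic50_nM) ** hill)


# With the reported half-maximal displacement at 9 nM:
half = fraction_label_bound(9.0, 9.0)        # half of the label remains bound
at_1uM = fraction_label_bound(1000.0, 9.0)   # under 1% of the label remains
```

    Under these assumptions, less than 1% of the label remains bound at 1 µM, which is consistent with the displacement being described as complete at that concentration.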

  8. Probabilistic prediction models for aggregate quarry siting

    Science.gov (United States)

    Robinson, G.R.; Larkins, P.M.

    2007-01-01

    Weights-of-evidence (WofE) and logistic regression techniques were used in a GIS framework to predict the spatial likelihood (prospectivity) of crushed-stone aggregate quarry development. The joint conditional probability models, based on geology, transportation network, and population density variables, were defined using quarry location and time of development data for the New England States, North Carolina, and South Carolina, USA. The Quarry Operation models describe the distribution of active aggregate quarries, independent of the date of opening. The New Quarry models describe the distribution of aggregate quarries when they open. Because of the small number of new quarries developed in the study areas during the last decade, independent New Quarry models have low parameter estimate reliability. The performance of parameter estimates derived for Quarry Operation models, defined by a larger number of active quarries in the study areas, was tested and evaluated to predict the spatial likelihood of new quarry development. Population density conditions at the time of new quarry development were used to modify the population density variable in the Quarry Operation models to apply to new quarry development sites. The Quarry Operation parameters derived for the New England study area, Carolina study area, and the combined New England and Carolina study areas were all similar in magnitude and relative strength. The Quarry Operation model parameters, using the modified population density variables, were found to be a good predictor of new quarry locations. Both the aggregate industry and the land management community can use the model approach to target areas for more detailed site evaluation for quarry location. The models can be revised easily to reflect actual or anticipated changes in transportation and population features. © International Association for Mathematical Geology 2007.
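
    For readers unfamiliar with weights-of-evidence, the sketch below computes the standard positive weight, negative weight, and contrast for one binary evidence layer from cell counts. It is a generic textbook formulation, not the authors' GIS workflow; the function name, argument names, and the example counts are hypothetical.

```python
import math


def weights_of_evidence(n_total, n_sites, n_pattern, n_pattern_sites):
    """Return (W+, W-, contrast) for one binary evidence pattern.

    n_total         : number of unit cells in the study area
    n_sites         : cells containing a known site (e.g. a quarry)
    n_pattern       : cells where the evidence pattern is present
    n_pattern_sites : cells with both the pattern and a known site
    """
    p_b_given_d = n_pattern_sites / n_sites
    p_b_given_not_d = (n_pattern - n_pattern_sites) / (n_total - n_sites)
    if not (0 < p_b_given_d < 1 and 0 < p_b_given_not_d < 1):
        raise ValueError("weights are undefined when a probability is 0 or 1")
    w_plus = math.log(p_b_given_d / p_b_given_not_d)
    w_minus = math.log((1 - p_b_given_d) / (1 - p_b_given_not_d))
    return w_plus, w_minus, w_plus - w_minus


# Hypothetical counts: 1000 cells, 10 sites, pattern present in 200 cells,
# 8 of the 10 sites fall on the pattern.
w_plus, w_minus, contrast = weights_of_evidence(1000, 10, 200, 8)
```

    A positive W+ together with a negative W- indicates that the pattern is spatially associated with the sites; the contrast summarizes the strength of that association.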

  9. Molecular modeling and competition binding study of Br-noscapine and colchicine provides insight into noscapinoid-tubulin binding site

    OpenAIRE

    Naik, Pradeep K.; Santoshi, Seneha; Rai, Ankit; Joshi, Harish C.

    2011-01-01

    We have previously discovered the tubulin-binding anti-cancer properties of noscapine and its derivatives (noscapinoids). Here, we present three lines of evidence that noscapinoids bind at or near the well studied colchicine binding site of tubulin: 1) In silico molecular docking studies of Br-noscapine and noscapine yield highest docking score with the well characterised colchicine-binding site from the co-crystal structure; 2) the molecular mechanics-generalized Born/surface area (MM-GB/SA)...

  10. Characterization of Binding Sites of Eukaryotic Transcription Factors

    Institute of Scientific and Technical Information of China (English)

    Jiang Qian; Jimmy Lin; Donald J. Zack

    2006-01-01

    To explore the nature of eukaryotic transcription factor (TF) binding sites and determine how they differ from surrounding DNA sequences, we examined four features associated with DNA binding sites: G+C content, pattern complexity, palindromic structure, and Markov sequence ordering. Our analysis of the regulatory motifs obtained from the TRANSFAC database, using yeast intergenic sequences as background, revealed that these four features show variable enrichment in motif sequences. For example, motif sequences were more likely to have palindromic structure than were background sequences. In addition, these features were tightly localized to the regulatory motifs, indicating that they are a property of the motif sequences themselves and are not shared by the general promoter "environment" in which the regulatory motifs reside. By breaking down the motif sequences according to the TF classes to which they bind, more specific associations were identified. Finally, we found that some correlations, such as G+C content enrichment, were species-specific, while others, such as complexity enrichment, were universal across the species examined. The quantitative analysis provided here should increase our understanding of protein-DNA interactions and also help facilitate the discovery of regulatory motifs through bioinformatics.
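
    Two of the four features mentioned here, G+C content and palindromic structure, are simple to compute. The sketch below shows one common way to define them; the exact definitions used in the study are not given in the abstract, so the strict reverse-complement test and the function names are assumptions.

```python
# Translation table for complementing unambiguous DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def gc_content(seq):
    """Fraction of G and C bases in a DNA sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def reverse_complement(seq):
    """Reverse complement of a DNA sequence."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def is_palindromic(seq):
    """True if the sequence equals its own reverse complement (assumed definition)."""
    return seq.upper() == reverse_complement(seq)
```

    For example, the site GAATTC is palindromic under this definition and has a G+C content of one third. Comparing such values between motif sequences and a background set is one way to quantify the enrichment the abstract describes.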

  11. Identification and characterization of a glycosaminoglycan binding site on interleukin-10 via molecular simulation methods.

    Science.gov (United States)

    Gehrcke, Jan-Philip; Pisabarro, M Teresa

    2015-11-01

    The biological function of the pleiotropic cytokine interleukin-10 (IL-10), which has an essential role in inflammatory processes, is known to be affected by glycosaminoglycans (GAGs). GAGs are essential constituents of the extracellular matrix with an important role in modulating the biological function of many proteins. The molecular mechanisms governing the IL-10-GAG interaction, though, are unclear so far. In particular, detailed knowledge about GAG binding sites and recognition mode on IL-10 is lacking, despite its importance for understanding the functional consequences of IL-10-GAG interaction. In the present work, we report a GAG binding site on IL-10 identified by applying computational methods based on Coulomb potential calculations and specialized molecular dynamics simulations. The identified GAG binding site consists of several positively charged residues, which are conserved among species. Exhaustive conformational space sampling of a series of GAG ligands binding to IL-10 led to the observation of two GAG binding modes in the predicted binding site, and to the identification of IL-10 residues R104, R106, R107, and K119 as being most important for molecular GAG recognition. In silico mutation as well as single-residue energy decomposition and detailed analysis of hydrogen-bonding behavior led to the conclusion that R107 is most essential and assumes a unique role in IL-10-GAG interaction. This structural and dynamic characterization of GAG-binding to IL-10 represents an important step for further understanding the modulation of the biological function of IL-10.

  12. Site-specific fab fragment biotinylation at the conserved nucleotide binding site for enhanced Ebola detection.

    Science.gov (United States)

    Mustafaoglu, Nur; Alves, Nathan J; Bilgicer, Basar

    2015-07-01

    The nucleotide binding site (NBS) is a highly conserved region between the variable light and heavy chains at the Fab domains of all antibodies, and a small molecule that we identified, indole-3-butyric acid (IBA), binds specifically to this site. The Fab fragment, with its small size and simple production methods compared to intact antibody, is a good candidate for use in miniaturized diagnostic devices and targeted therapeutic applications. However, commonly used modification techniques are not well suited for Fab fragments as they are often more delicate than intact antibodies. Fab fragments are of particular interest for sensor surface functionalization, but immobilization results in damage to the antigen binding site and greatly reduced activity because their truncated size leaves only a small area that can bind to surfaces without impeding antigen binding. In this study, we describe an NBS-UV photocrosslinking functionalization method (UV-NBS(Biotin)) in which a Fab fragment is site-specifically biotinylated with an IBA-EG11-Biotin linker via UV energy exposure (1 J/cm(2)) without affecting its antigen binding activity. This study demonstrates successful immobilization of biotinylated Ebola detecting Fab fragment (KZ52 Fab fragment) via the UV-NBS(Biotin) method yielding 1031-fold and 2-fold better antigen detection sensitivity compared to commonly used immobilization methods: direct physical adsorption and NHS-Biotin functionalization, respectively. Utilization of the UV-NBS(Biotin) method for site-specific conjugation to Fab fragment represents a proof of concept use of Fab fragment for various diagnostic and therapeutic applications with numerous fluorescent probes, affinity molecules and peptides.

  13. Mapping the heparin-binding site on the 13-14F3 fragment of fibronectin.

    Science.gov (United States)

    Sachchidanand; Lequin, Olivier; Staunton, David; Mulloy, Barbara; Forster, Mark J; Yoshida, Keiichi; Campbell, Iain D

    2002-12-27

    Fibronectin, a multifunctional glycoprotein of the extracellular matrix, plays a major role in cell adhesion. Various studies have revealed that the human 13th and 14th fibronectin type III domains (labeled (13)F3 and (14)F3 here) contain a heparin-binding site. Mapping of the heparin-binding sites of (13-14)F3, (13)F3, and (14)F3 by NMR chemical shift perturbation, isothermal titration calorimetry, and molecular modeling shows that (13)F3 provides the dominant heparin-binding site and that the residues involved are within the first 29 amino acids of (13)F3. Predictions from earlier biochemical and modeling studies as well as the x-ray structure of (12-14)F3 were tested. It was shown that the positively charged residues that project into the solvent from the ABE face of the triple-stranded beta sheet on (13)F3 are involved in binding, but (14)F3 does not appear to contribute significantly to heparin binding.

  14. A Unitary Anesthetic Binding Site at High Resolution

    Energy Technology Data Exchange (ETDEWEB)

    Vedula, L. Sangeetha; Brannigan, Grace; Economou, Nicoleta J.; Xi, Jin; Hall, Michael A.; Liu, Renyu; Rossi, Matthew J.; Dailey, William P.; Grasty, Kimberly C.; Klein, Michael L.; Eckenhoff, Roderic G.; Loll, Patrick J.; (Drexel-MED); (UPENN)

    2009-10-21

    Propofol is the most widely used injectable general anesthetic. Its targets include ligand-gated ion channels such as the GABA(A) receptor, but such receptor-channel complexes remain challenging to study at atomic resolution. Until structural biology methods advance to the point of being able to deal with systems such as the GABA(A) receptor, it will be necessary to use more tractable surrogates to probe the molecular details of anesthetic recognition. We have previously shown that recognition of inhalational general anesthetics by the model protein apoferritin closely mirrors recognition by more complex and clinically relevant protein targets; here we show that apoferritin also binds propofol and related GABAergic anesthetics, and that the same binding site mediates recognition of both inhalational and injectable anesthetics. Apoferritin binding affinities for a series of propofol analogs were found to be strongly correlated with the ability to potentiate GABA responses at GABA(A) receptors, validating this model system for injectable anesthetics. High resolution x-ray crystal structures reveal that, despite the presence of hydrogen bond donors and acceptors, anesthetic recognition is mediated largely by van der Waals forces and the hydrophobic effect. Molecular dynamics simulations indicate that the ligands undergo considerable fluctuations about their equilibrium positions. Finally, apoferritin displays both structural and dynamic responses to anesthetic binding, which may mimic changes elicited by anesthetics in physiologic targets like ion channels.

  15. A Unitary Anesthetic-Binding Site at High Resolution

    Energy Technology Data Exchange (ETDEWEB)

    Vedula, L.; Brannigan, G; Economou, N; Xi, J; Hall, M; Liu, R; Rossi, M; Dailey, W; Grasty, K; et. al.

    2009-01-01

    Propofol is the most widely used injectable general anesthetic. Its targets include ligand-gated ion channels such as the GABA(A) receptor, but such receptor-channel complexes remain challenging to study at atomic resolution. Until structural biology methods advance to the point of being able to deal with systems such as the GABA(A) receptor, it will be necessary to use more tractable surrogates to probe the molecular details of anesthetic recognition. We have previously shown that recognition of inhalational general anesthetics by the model protein apoferritin closely mirrors recognition by more complex and clinically relevant protein targets; here we show that apoferritin also binds propofol and related GABAergic anesthetics, and that the same binding site mediates recognition of both inhalational and injectable anesthetics. Apoferritin binding affinities for a series of propofol analogs were found to be strongly correlated with the ability to potentiate GABA responses at GABA(A) receptors, validating this model system for injectable anesthetics. High resolution x-ray crystal structures reveal that, despite the presence of hydrogen bond donors and acceptors, anesthetic recognition is mediated largely by van der Waals forces and the hydrophobic effect. Molecular dynamics simulations indicate that the ligands undergo considerable fluctuations about their equilibrium positions. Finally, apoferritin displays both structural and dynamic responses to anesthetic binding, which may mimic changes elicited by anesthetics in physiologic targets like ion channels.

  16. A Unitary Anesthetic Binding Site at High Resolution

    Energy Technology Data Exchange (ETDEWEB)

    L Vedula; G Brannigan; N Economou; J Xi; M Hall; R Liu; M Rossi; W Dailey; K Grasty; et. al.

    2011-12-31

    Propofol is the most widely used injectable general anesthetic. Its targets include ligand-gated ion channels such as the GABA(A) receptor, but such receptor-channel complexes remain challenging to study at atomic resolution. Until structural biology methods advance to the point of being able to deal with systems such as the GABA(A) receptor, it will be necessary to use more tractable surrogates to probe the molecular details of anesthetic recognition. We have previously shown that recognition of inhalational general anesthetics by the model protein apoferritin closely mirrors recognition by more complex and clinically relevant protein targets; here we show that apoferritin also binds propofol and related GABAergic anesthetics, and that the same binding site mediates recognition of both inhalational and injectable anesthetics. Apoferritin binding affinities for a series of propofol analogs were found to be strongly correlated with the ability to potentiate GABA responses at GABA(A) receptors, validating this model system for injectable anesthetics. High resolution x-ray crystal structures reveal that, despite the presence of hydrogen bond donors and acceptors, anesthetic recognition is mediated largely by van der Waals forces and the hydrophobic effect. Molecular dynamics simulations indicate that the ligands undergo considerable fluctuations about their equilibrium positions. Finally, apoferritin displays both structural and dynamic responses to anesthetic binding, which may mimic changes elicited by anesthetics in physiologic targets like ion channels.

  17. Gamma-aminobutyric acid-modulated benzodiazepine binding sites in bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Lummis, S.C.R.; Johnston, G.A.R. (Univ. of Sydney, New South Wales (Australia)); Nicoletti, G. (Royal Melbourne Inst. of Tech. (Australia)); Holan, G. (CSIRO, Melbourne (Australia))

    1991-01-01

    Benzodiazepine binding sites, which were once considered to exist only in higher vertebrates, are here demonstrated in the bacterium E. coli. The bacterial [(3)H]diazepam binding sites are modulated by GABA; the modulation is dose dependent and is reduced at high concentrations. The most potent competitors of E. coli [(3)H]diazepam binding are those that are active in displacing [(3)H]benzodiazepines from vertebrate peripheral benzodiazepine binding sites. These vertebrate sites are not modulated by GABA, in contrast to vertebrate neuronal benzodiazepine binding sites. The E. coli benzodiazepine binding sites therefore differ from both classes of vertebrate benzodiazepine binding sites; however, the ligand spectrum and GABA-modulatory properties of the E. coli sites are similar to those found in insects. This intermediate type of receptor in lower species suggests that a precursor for at least one class of vertebrate benzodiazepine binding sites may have existed.

  18. Deconstructing the DGAT1 enzyme: membrane interactions at substrate binding sites.

    Directory of Open Access Journals (Sweden)

    Jose L S Lopes

    Diacylglycerol acyltransferase 1 (DGAT1) is a key enzyme in the triacylglyceride synthesis pathway. Bovine DGAT1 is an endoplasmic reticulum membrane-bound protein associated with the regulation of fat content in milk and meat. The aim of this study was to evaluate the interaction of DGAT1 peptides corresponding to putative substrate binding sites with different types of model membranes. Whilst these peptides are predicted to be located in an extramembranous loop of the membrane-bound protein, their hydrophobic substrates are membrane-bound molecules. In this study, peptides corresponding to the binding sites of the two substrates involved in the reaction were examined in the presence of model membranes in order to probe potential interactions between them that might influence the subsequent binding of the substrates. Whilst the conformation of one of the peptides changed upon binding several types of micelles regardless of their surface charge, suggesting binding to hydrophobic domains, the other peptide bound strongly to negatively-charged model membranes. This binding was accompanied by a change in conformation, and produced leakage of the liposome-entrapped dye calcein. The different hydrophobic and electrostatic interactions observed suggest the peptides may be involved in the interactions of the enzyme with membrane surfaces, facilitating access of the catalytic histidine to the triacylglycerol substrates.

  19. Detecting local ligand-binding site similarity in nonhomologous proteins by surface patch comparison.

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2012-04-01

    Functional elucidation of proteins is one of the essential tasks in biology. The function of a protein, specifically the small ligand molecules that bind to it, can be predicted by finding similar local surface regions in binding sites of known proteins. Here, we developed an alignment-free local surface comparison method for predicting a ligand molecule which binds to a query protein. The algorithm, named Patch-Surfer, represents a binding pocket as a combination of segmented surface patches, each of which is characterized by its geometrical shape, the electrostatic potential, the hydrophobicity, and the concaveness. Representing a pocket by a set of patches absorbs differences in global pocket shape while capturing local similarity of pockets. The shape and the physicochemical properties of surface patches are represented using the 3D Zernike descriptor, which is a series expansion of a mathematical 3D function. Two pockets are compared using a modified weighted bipartite matching algorithm, which matches similar patches from the two pockets. Patch-Surfer was benchmarked on three datasets, which consist in total of 390 proteins that bind to one of 21 ligands. Patch-Surfer showed superior performance to existing methods including a global pocket comparison method, Pocket-Surfer, which we have previously introduced. Particularly, as intended, the accuracy showed large improvement for flexible ligand molecules, which bind to pockets in different conformations.

  20. Computational protocol for predicting the binding affinities of zinc containing metalloprotein-ligand complexes.

    Science.gov (United States)

    Jain, Tarun; Jayaram, B

    2007-06-01

    Zinc is one of the most important metal ions found in proteins performing specific functions associated with life processes. Coordination geometry of the zinc ion in the active site of the metalloprotein-ligand complexes poses a challenge in determining ligand binding affinities accurately in structure-based drug design. We report here an all atom force field based computational protocol for estimating rapidly the binding affinities of zinc containing metalloprotein-ligand complexes, considering electrostatics, van der Waals, hydrophobicity, and loss in conformational entropy of protein side chains upon ligand binding along with a nonbonded approach to model the interactions of the zinc ion with all the other atoms of the complex. We examined the sensitivity of the binding affinity predictions to the choice of Lennard-Jones parameters, partial atomic charges, and dielectric treatments adopted for system preparation and scoring. The highest correlation obtained was R(2) = 0.77 (r = 0.88) for the predicted binding affinity against experiment on a heterogeneous dataset of 90 zinc containing metalloprotein-ligand complexes consisting of five unique protein targets. Model validation and parameter analysis studies underscore the robustness and predictive ability of the scoring function. The high correlation obtained suggests the potential applicability of the methodology in designing novel ligands for zinc-metalloproteins. The scoring function has been web enabled for free access at www.scfbio-iitd.res.in/software/drugdesign/bapplz.jsp as BAPPL-Z server (Binding Affinity Prediction of Protein-Ligand complexes containing Zinc metal ions).

  1. LIBSA--a method for the determination of ligand-binding preference to allosteric sites on receptor ensembles.

    Science.gov (United States)

    Hocker, Harrison J; Rambahal, Nandini; Gorfe, Alemayehu A

    2014-02-24

    Incorporation of receptor flexibility into computational drug discovery through the relaxed complex scheme is well suited for screening against a single binding site. In the absence of a known pocket or if there are multiple potential binding sites, it may be necessary to do docking against the entire surface of the target (global docking). However no suitable and easy-to-use tool is currently available to rank global docking results based on the preference of a ligand for a given binding site. We have developed a protocol, termed LIBSA for LIgand Binding Specificity Analysis, that analyzes multiple docked poses against a single or ensemble of receptor conformations and returns a metric for the relative binding to a specific region of interest. By using novel filtering algorithms and the signal-to-noise ratio (SNR), the relative ligand-binding frequency at different pockets can be calculated and compared quantitatively. Ligands can then be triaged by their tendency to bind to a site instead of ranking by affinity alone. The method thus facilitates screening libraries of ligand cores against a large library of receptor conformations without prior knowledge of specific pockets, which is especially useful to search for hits that selectively target a particular site. We demonstrate the utility of LIBSA by showing that it correctly identifies known ligand binding sites and predicts the relative preference of a set of related ligands for different pockets on the same receptor.

  2. Isothermal titration calorimetry and surface plasmon resonance allow quantifying substrate binding to different binding sites of Bacillus subtilis xylanase

    DEFF Research Database (Denmark)

    Cuyvers, Sven; Dornez, Emmie; Abou Hachem, Maher;

    2012-01-01

    Isothermal titration calorimetry and surface plasmon resonance were tested for their ability to study substrate binding to the active site (AS) and to the secondary binding site (SBS) of Bacillus subtilis xylanase A separately. To this end, three enzyme variants were compared. The first was a cat...

  3. PRBP: Prediction of RNA-Binding Proteins Using a Random Forest Algorithm Combined with an RNA-Binding Residue Predictor.

    Science.gov (United States)

    Ma, Xin; Guo, Jing; Xiao, Ke; Sun, Xiao

    2015-01-01

    The prediction of RNA-binding proteins is an incredibly challenging problem in computational biology. Although great progress has been made using various machine learning approaches with numerous features, the problem is still far from being solved. In this study, we attempt to predict RNA-binding proteins directly from amino acid sequences. A novel approach, PRBP, predicts RNA-binding proteins using the information of predicted RNA-binding residues in conjunction with a random forest based method. For a given protein, we first predict its RNA-binding residues and then judge whether the protein binds RNA or not based on information from that prediction. If the protein cannot be identified by the information associated with its predicted RNA-binding residues, then a novel random forest predictor is used to determine if the query protein is an RNA-binding protein. We incorporated features of evolutionary information combined with physicochemical features (EIPP) and amino acid composition features to establish the random forest predictor. Feature analysis showed that EIPP contributed the most to the prediction of RNA-binding proteins. The results also showed that the information from the RNA-binding residue prediction improved the overall performance of our RNA-binding protein prediction. It is anticipated that the PRBP method will become a useful tool for identifying RNA-binding proteins. A PRBP Web server implementation is freely available at http://www.cbi.seu.edu.cn/PRBP/.

  4. Discovery and information-theoretic characterization of transcription factor binding sites that act cooperatively

    CERN Document Server

    Clifford, Jacob

    2015-01-01

    Transcription factor binding to the surface of DNA regulatory regions is one of the primary mechanisms regulating gene expression levels. A probabilistic approach to modeling protein-DNA interactions at the sequence level is through Position Weight Matrices (PWMs), which estimate the joint probability of a DNA binding site sequence by assuming positional independence within the DNA sequence. Here we construct conditional PWMs that depend on the motif signatures in the flanking DNA sequence, by conditioning known binding site loci on the presence or absence of additional binding sites in the flanking sequence of each site's locus. Pooling known sites with similar flanking sequence patterns allows for the estimation of the conditional distribution function over the binding site sequences. We apply our model to the Dorsal transcription factor binding sites active in patterning the Dorsal-Ventral axis of Drosophila development. We find that those binding sites that cooperate with nearby Twist sites on average contain a...
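
    The positional-independence assumption behind a PWM can be sketched as below: per-position base frequencies are estimated from aligned sites, and the joint probability of a sequence is the product of its per-position probabilities. This is a generic illustration, not the author's code; the function names, the pseudocount value, and the toy sites are hypothetical.

```python
import math

BASES = "ACGT"


def build_pwm(sites, pseudocount=1.0):
    """Estimate per-position base probabilities from equal-length aligned sites."""
    if not sites:
        raise ValueError("no sites given")
    length = len(sites[0])
    if any(len(s) != length for s in sites):
        raise ValueError("all sites must have the same length")
    total = len(sites) + 4 * pseudocount
    pwm = []
    for pos in range(length):
        column = [s[pos].upper() for s in sites]
        pwm.append({b: (column.count(b) + pseudocount) / total for b in BASES})
    return pwm


def log_probability(pwm, seq):
    """Log joint probability of seq, summing independent per-position terms."""
    if len(seq) != len(pwm):
        raise ValueError("sequence length must match PWM length")
    return sum(math.log(col[base]) for col, base in zip(pwm, seq.upper()))


# Toy example with three hypothetical aligned sites.
pwm = build_pwm(["ACG", "ACG", "ACT"])
score = log_probability(pwm, "ACG")
```

    A conditional PWM in the spirit of the abstract could be approximated by calling `build_pwm` separately on the sites that have a neighboring site in their flanking sequence and on those that do not, then comparing the two matrices.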

  5. Transcription factor binding sites are highly enriched within microRNA precursor sequences

    Directory of Open Access Journals (Sweden)

    Piriyapongsa Jittima

    2011-12-01

    Background: Transcription factors are thought to regulate the transcription of microRNA genes in a manner similar to that of protein-coding genes; that is, by binding to conventional transcription factor binding site DNA sequences located in or near promoter regions that lie upstream of the microRNA genes. However, in the course of analyzing the genomics of human microRNA genes, we noticed that annotated transcription factor binding sites commonly lie within 70- to 110-nt long microRNA small hairpin precursor sequences. Results: We report that about 45% of all human small hairpin microRNA (pre-miR) sequences contain at least one predicted transcription factor binding site motif that is conserved across human, mouse and rat, and this rises to over 75% if one excludes primate-specific pre-miRs. The association is robust and has extremely strong statistical significance; it affects both intergenic and intronic pre-miRs and both isolated and clustered microRNA genes. We also confirmed and extended this finding using a separate analysis that examined all human pre-miR sequences regardless of conservation across species. Conclusions: The transcription factor binding sites localized within small hairpin microRNA precursor sequences may possibly regulate their transcription. Transcription factors may also possibly bind directly to nascent primary microRNA gene transcripts or small hairpin microRNA precursors and regulate their processing. Reviewers: This article was reviewed by Guillaume Bourque (nominated by Jerzy Jurka), Dmitri Pervouchine (nominated by Mikhail Gelfand), and Yuriy Gusev.

  6. Analysis of the Binding Sites of Porcine Sialoadhesin Receptor with PRRSV

    Directory of Open Access Journals (Sweden)

    Yibo Jiang

    2013-12-01

    Porcine reproductive and respiratory syndrome virus (PRRSV) can infect pigs and cause enormous economic losses to the pig industry worldwide. Porcine sialoadhesin (pSN) and CD163 have been identified as key viral receptors on porcine alveolar macrophages (PAM), a main target cell infected by PRRSV. In this study, the protein structures of amino acids 1–119 from the pSN and cSN (cattle sialoadhesin) N-termini (excluding the 19-amino acid signal peptide) were modeled via homology modeling based on mSN (mouse sialoadhesin) template structures using bioinformatics tools. Subsequently, pSN and cSN homology structures were superposed onto the mSN protein structure to predict the binding sites of pSN. As a validation experiment, the SN N-terminus (including the wild-type and site-directed-mutant-types of pSN and cSN) was cloned and expressed as a SN-GFP chimera protein. The binding activity between SN and PRRSV was confirmed by WB (Western blotting), FAR-WB (far Western blotting), ELISA (enzyme-linked immunosorbent assay) and immunofluorescence assay. We found that the S107 amino acid residue in the pSN N-terminal played a crucial role in forming a special cavity, as well as a hydrogen bond for enhancing PRRSV binding during PRRSV infection. S107 may be glycosylated during PRRSV infection and may also be involved in forming the cavity for binding PRRSV along with other sites, including W2, Y44, S45, R97, R105, W106 and V109. Additionally, S107 might also be important for pSN binding with PRRSV. However, the function of these binding sites must be confirmed by further studies.

  7. CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes

    Science.gov (United States)

    Hestand, Matthew S; van Galen, Michiel; Villerius, Michel P; van Ommen, Gert-Jan B; den Dunnen, Johan T; 't Hoen, Peter AC

    2008-01-01

    Background: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. Results: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites), that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC® database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. Conclusion: The program CORE_TF is accessible through a user-friendly web interface. It provides a table of over-represented transcription factor binding sites in the promoters of the user's input genes and a graphical view of evolutionarily conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites. PMID:19036135
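
    The over-representation idea described in this abstract can be illustrated with a small sketch. It is not CORE_TF's implementation: the abstract does not state which statistic the tool uses, so the one-sided hypergeometric test, the toy log-odds matrix `TOY_PWM`, the score threshold and all function names below are assumptions chosen only for illustration. GC-content matching of the random set is not modelled here.

```python
import math

# Hypothetical 3-column log-odds matrix; a real analysis would load a
# curated matrix instead of this toy one.
TOY_PWM = [
    {"A": 1.0, "C": -1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": 1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": -1.0, "G": 1.0, "T": -1.0},
]


def pwm_score(pwm, site):
    """Sum the per-position log-odds scores of `site` under `pwm`."""
    return sum(pwm[i][base] for i, base in enumerate(site))


def count_hits(pwm, seq, threshold):
    """Count windows of `seq` whose PWM score reaches `threshold`."""
    width = len(pwm)
    return sum(
        1
        for start in range(len(seq) - width + 1)
        if pwm_score(pwm, seq[start:start + width]) >= threshold
    )


def promoters_with_hit(pwm, promoters, threshold):
    """Number of promoters containing at least one predicted site."""
    return sum(1 for seq in promoters if count_hits(pwm, seq, threshold) > 0)


def hypergeom_tail(k, K, n, N):
    """P(X >= k) when drawing n items from N, of which K are successes."""
    total = math.comb(N, n)
    return sum(
        math.comb(K, i) * math.comb(N - K, n - i)
        for i in range(k, min(n, K) + 1)
    ) / total


def over_representation_p(pwm, experimental, background, threshold):
    """One-sided p-value that hits are enriched in the experimental set."""
    k = promoters_with_hit(pwm, experimental, threshold)
    b = promoters_with_hit(pwm, background, threshold)
    return hypergeom_tail(k, k + b, len(experimental),
                          len(experimental) + len(background))


if __name__ == "__main__":
    experimental = ["TTACGTT", "ACGACG", "GGACGA"]
    background = ["TTTTTT", "GGACGG", "CCCCCC", "ATATAT"]
    print(over_representation_p(TOY_PWM, experimental, background, 3.0))
```

    With realistic promoter sets the background would contain many more sequences than this toy example, and the p-values would need correction for the number of matrices tested.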

  8. Predicting peptides binding to MHC class II molecules using multi-objective evolutionary algorithms

    Directory of Open Access Journals (Sweden)

    Feng Lin

    2007-11-01

    , one for self-discovery and the other for guided-discovery by experimentally determined motifs, and thereby predicting binding peptides to I-Ag7 molecule. Our experiments show that the proposed MOEA-based algorithms are better than earlier methods in predicting binding sites not only on I-Ag7 but also on most alleles of class II MHC benchmark datasets. This shows that our methods could be applicable to find binding motifs in a wide range of alleles.

  9. MONKEY: Identifying conserved transcription-factor binding sites in multiple alignments using a binding site-specific evolutionary model

    Energy Technology Data Exchange (ETDEWEB)

    Moses, Alan M.; Chiang, Derek Y.; Pollard, Daniel A.; Iyer, Venky N.; Eisen, Michael B.

    2004-10-28

    We introduce a method (MONKEY) to identify conserved transcription-factor binding sites in multispecies alignments. MONKEY employs probabilistic models of factor specificity and binding site evolution, on which basis we compute the likelihood that putative sites are conserved and assign statistical significance to each hit. Using genomes from the genus Saccharomyces, we illustrate how the significance of real sites increases with evolutionary distance and explore the relationship between conservation and function.

  10. Prediction of DNA-binding specificity in zinc finger proteins

    Indian Academy of Sciences (India)

    Sumedha Roy; Shayoni Dutta; Kanika Khanna; Shruti Singla; Durai Sundar

    2012-07-01

    Zinc finger proteins interact through their individual fingers with three-base-pair subsites on the target DNA. The four key residue positions −1, 2, 3 and 6 on the alpha-helix of the zinc fingers have hydrogen bond interactions with the DNA. Mutating these key residues enables generation of a plethora of combinatorial possibilities that can bind to any DNA stretch of interest. Exploiting the binding specificity and affinity of the interaction between the zinc fingers and the respective DNA can help to generate engineered zinc fingers for therapeutic purposes involving genome targeting. Exploring the structure–function relationships of the existing zinc finger–DNA complexes can aid in predicting the probable zinc fingers that could bind to any target DNA. Computational tools ease the prediction of such engineered zinc fingers by effectively utilizing information from the available experimental data. A study of the literature reveals many approaches for predicting DNA-binding specificity in zinc finger proteins. However, an alternative approach that looks into the physico-chemical properties of these complexes would do away with the difficulties of designing unbiased zinc fingers with the desired affinity and specificity. We present a physico-chemical approach that exploits the relative strengths of hydrogen bonding between the target DNA and all combinatorially possible zinc fingers to select the optimal zinc finger protein candidate.

  11. Examination of the thiamin diphosphate binding site in yeast transketolase by site-directed mutagenesis.

    Science.gov (United States)

    Meshalkina, L; Nilsson, U; Wikner, C; Kostikowa, T; Schneider, G

    1997-03-01

    The role of two conserved amino acid residues in the thiamin diphosphate binding site of yeast transketolase has been analyzed by site-directed mutagenesis. Replacement of E162, which is part of a cluster of glutamic acid residues at the subunit interface, by alanine or glutamine results in mutant enzymes with most catalytic properties similar to wild-type enzyme. The two mutant enzymes show, however, significant increases in the K0.5 values for thiamin diphosphate in the absence of substrate and in the lag of the reaction progress curves. This suggests that the interaction of E162 with residue E418, and possibly E167, from the second subunit is important for formation and stabilization of the transketolase dimer. Replacement of the conserved residue D382, which is buried upon binding of thiamin diphosphate, by asparagine and alanine, results in mutant enzymes severely impaired in thiamin diphosphate binding and catalytic efficiency. The 25-80-fold increase in K0.5 for thiamin diphosphate suggests that D382 is involved in cofactor binding, probably by electrostatic compensation of the positive charge of the thiazolium ring and stabilization of a flexible loop at the active site. The decrease in catalytic activities in the D382 mutants indicates that this residue might also be important in subsequent steps in catalysis.

  12. Effects of cytosine methylation on transcription factor binding sites

    KAUST Repository

    Medvedeva, Yulia A

    2014-03-26

    Background: DNA methylation in promoters is closely linked to downstream gene repression. However, whether DNA methylation is a cause or a consequence of gene repression remains an open question. If it is a cause, then DNA methylation may affect the affinity of transcription factors (TFs) for their binding sites (TFBSs). If it is a consequence, then gene repression caused by chromatin modification may be stabilized by DNA methylation. Until now, these two possibilities have been supported only by non-systematic evidence and they have not been tested on a wide range of TFs. An average promoter methylation is usually used in studies, whereas recent results suggested that methylation of individual cytosines can also be important. Results: We found that the methylation profiles of 16.6% of cytosines and the expression profiles of neighboring transcriptional start sites (TSSs) were significantly negatively correlated. We called the CpGs corresponding to such cytosines "traffic lights". We observed a strong selection against CpG "traffic lights" within TFBSs. The negative selection was stronger for transcriptional repressors as compared with transcriptional activators or multifunctional TFs as well as for core TFBS positions as compared with flanking TFBS positions. Conclusions: Our results indicate that direct and selective methylation of certain TFBS that prevents TF binding is restricted to special cases and cannot be considered as a general regulatory mechanism of transcription.

  13. Predicting accurate absolute binding energies in aqueous solution

    DEFF Research Database (Denmark)

    Jensen, Jan Halborg

    2015-01-01

    Recent predictions of absolute binding free energies of host-guest complexes in aqueous solution using electronic structure theory have been encouraging for some systems, while other systems remain problematic. In this paper I summarize some of the many factors that could easily contribute 1-3 kcal......-represented by continuum models. While I focus on binding free energies in aqueous solution the approach also applies (with minor adjustments) to any free energy difference such as conformational or reaction free energy differences or activation free energies in any solvent....

  14. Prediction of chloride ingress and binding in cement paste

    DEFF Research Database (Denmark)

    Geiker, Mette Rica; Nielsen, Erik Pram; Herforth, Duncan

    2007-01-01

    This paper summarizes recent work on an analytical model for predicting the ingress rate of chlorides in cement-based materials. An integral part of this is a thermodynamic model for predicting the phase equilibria in hydrated Portland cement. The model’s ability to predict chloride binding...... in Portland cement pastes at any content of chloride, alkalis, sulfates and carbonate was verified experimentally and found to be equally valid when applied to other data in the literature. The thermodynamic model for predicting the phase equilibria in hydrated Portland cement was introduced into an existing...... Finite Difference Model for the ingress of chlorides into concrete which takes into account its multi-component nature. The “composite theory” was then used to predict the diffusivity of each ion based on the phase assemblage present in the hydrated Portland cement paste. Agreement was found between...

  15. Antidepressant Binding Site in a Bacterial Homologue of Neurotransmitter Transporters

    Energy Technology Data Exchange (ETDEWEB)

    Singh,S.; Yamashita, A.; Gouaux, E.

    2007-01-01

    Sodium-coupled transporters are ubiquitous pumps that harness pre-existing sodium gradients to catalyse the thermodynamically unfavourable uptake of essential nutrients, neurotransmitters and inorganic ions across the lipid bilayer. Dysfunction of these integral membrane proteins has been implicated in glucose/galactose malabsorption, congenital hypothyroidism, Bartter's syndrome, epilepsy, depression, autism and obsessive-compulsive disorder. Sodium-coupled transporters are blocked by a number of therapeutically important compounds, including diuretics, anticonvulsants and antidepressants, many of which have also become indispensable tools in biochemical experiments designed to probe antagonist binding sites and to elucidate transport mechanisms. Steady-state kinetic data have revealed that both competitive and noncompetitive modes of inhibition exist. Antagonist dissociation experiments on the serotonin transporter (SERT) have also unveiled the existence of a low-affinity allosteric site that slows the dissociation of inhibitors from a separate high-affinity site. Despite these strides, atomic-level insights into inhibitor action have remained elusive. Here we screen a panel of molecules for their ability to inhibit LeuT, a prokaryotic homologue of mammalian neurotransmitter sodium symporters, and show that the tricyclic antidepressant (TCA) clomipramine noncompetitively inhibits substrate uptake. Cocrystal structures show that clomipramine, along with two other TCAs, binds in an extracellular-facing vestibule about 11 Å above the substrate and two sodium ions, apparently stabilizing the extracellular gate in a closed conformation. Off-rate assays establish that clomipramine reduces the rate at which leucine dissociates from LeuT and reinforce our contention that this TCA inhibits LeuT by slowing substrate release. Our results represent a molecular view into noncompetitive inhibition of a sodium-coupled transporter and define principles for the

  16. DEPTH: a web server to compute depth and predict small-molecule binding cavities in proteins.

    Science.gov (United States)

    Tan, Kuan Pern; Varadarajan, Raghavan; Madhusudhan, M S

    2011-07-01

    Depth measures the extent of atom/residue burial within a protein. It correlates with properties such as protein stability, hydrogen exchange rate, protein-protein interaction hot spots, post-translational modification sites and sequence variability. Our server, DEPTH, accurately computes depth and solvent-accessible surface area (SASA) values. We show that depth can be used to predict small molecule ligand binding cavities in proteins. Often, some of the residues lining a ligand binding cavity are both deep and solvent exposed. Using the depth-SASA pair values for a residue, its likelihood to form part of a small molecule binding cavity is estimated. The parameters of the method were calibrated over a training set of 900 high-resolution X-ray crystal structures of single-domain proteins bound to small molecules (molecular weight structures. Users have the option of tuning several parameters to detect cavities of different sizes, for example, geometrically flat binding sites. The input to the server is a protein 3D structure in PDB format. The users have the option of tuning the values of four parameters associated with the computation of residue depth and the prediction of binding cavities. The computed depths, SASA and binding cavity predictions are displayed in 2D plots and mapped onto 3D representations of the protein structure using Jmol. Links are provided to download the outputs. Our server is useful for all structural analysis based on residue depth and SASA, such as guiding site-directed mutagenesis experiments and small molecule docking exercises, in the context of protein functional annotation and drug discovery.

  17. Monoclonal Anti-CD4 Antibody MT310 Binds HIV-1 gp120 Binding Site on CD4

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Tests show that the monoclonal anti-CD4 antibody (mAb) MT310 recognizes the gp120-binding site on CD4 as part of its mechanism for strongly inhibiting human immunodeficiency virus type 1 (HIV-1) infection of CD4+ T cells. In competition tests, mAb MT310 and mAb Leu3a (an anti-CD4 mAb recognizing the gp120-binding site) both inhibited gp120 binding to CD4+ T lymphocytes, while mAb MT405 did not. This result suggests that MT310, like Leu3a, recognizes the gp120-binding site on CD4. To further confirm whether MT310 recognizes the gp120-binding site on CD4, we prepared rabbit anti-idiotypic antisera (Ab2) against MT310 (Ab1). The anti-idiotypic antisera against MT310 inhibited binding of MT310 and Leu3a to human CD4+ T lymphocytes, but did not block binding of MT151 with the second domain of CD4, while rabbit anti-idiotypic antisera to MT151 could block binding of MT151 to these cells, but could not inhibit the binding of MT310 and Leu3a, further indicating that MT310 recognizes the gp120-binding site on CD4.

  18. Symmetrical 1-pyrrolidineacetamide showing anti-HIV activity through a new binding site on HIV-1 integrase

    Institute of Scientific and Technical Information of China (English)

    Li DU; Ya-xue ZHAO; Liu-meng YANG; Yong-tang ZHENG; Yun TANG; Xu SHEN; Hua-liang JIANG

    2008-01-01

    Aim: To characterize the functional and pharmacological features of a symmetrical 1-pyrrolidineacetamide, N,N'-(methylene-di-4,1-phenylene) bis-1-pyrrolidineacetamide, as a new anti-HIV compound which could competitively inhibit HIV-1 integrase (IN) binding to viral DNA. Methods: A surface plasmon resonance (SPR)-based competitive assay was employed to determine the compound's inhibitory activity, and the 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide cell assay was used to quantify the antiviral activity. The potential binding sites were predicted by molecular modeling and determined by site-directed mutagenesis and a SPR binding assay. Results: 1-pyrrolidineacetamide, N,N'-(methylene-di-4,1-phenylene) bis-1-pyrrolidineacetamide could competitively inhibit IN binding to viral DNA with a 50% inhibitory concentration (IC50) value of 7.29±0.68 μmol/L as determined by the SPR-based assay. Another antiretroviral activity assay showed that this compound inhibited HIV-1(IIIB) replication with a 50% effective concentration (EC50) value of 40.54 μmol/L in C8166 cells, and showed cytotoxicity with a cytotoxic concentration value of 173.84 μmol/L in mock-infected C8166 cells. Molecular docking predicted 3 potential residues as 1-pyrrolidineacetamide, N,N'-(methylene-di-4,1-phenylene) bis-1-pyrrolidineacetamide binding sites. The importance of 3 key amino acid residues (Lys103, Lys173, and Thr174) involved in the binding was further identified by site-directed mutagenesis and a SPR binding assay. Conclusion: The present work identified a new anti-HIV compound acting through a new IN-binding site, which is expected to supply new potential drug-binding site information for HIV-1 integrase inhibitor discovery and development.

  19. Interpretation of Ocular Melanin Drug Binding Assays. Alternatives to the Model of Multiple Classes of Independent Sites.

    Science.gov (United States)

    Manzanares, José A; Rimpelä, Anna-Kaisa; Urtti, Arto

    2016-04-04

    Melanin has a high binding affinity for a wide range of drugs. The determination of the melanin binding capacity and its binding affinity are important, e.g., in the determination of the ocular drug distribution, the prediction of drug effects in the eye, and the trans-scleral drug delivery. The binding parameters estimated from a given data set vary significantly when using different isotherms or different nonlinear fitting methods. In this work, the commonly used bi-Langmuir isotherm, which assumes two classes of independent sites, is confronted with the Sips isotherm. Direct, log-log, and Scatchard plots are used, and the interpretation of the binding curves in the latter is critically analyzed. In addition to the goodness of fit, the emphasis is placed on the physical meaning of the binding parameters. The bi-Langmuir model imposes a bimodal distribution of binding energies for the sites on the melanin granules, but the actual distribution is most likely continuous and unimodal, as assumed by the Sips isotherm. Hence, the latter describes more accurately the distribution of binding energies and also the experimental results of melanin binding to drugs and metal ions. Simulations are used to show that the existence of two classes of sites cannot be confirmed on the sole basis of the shape of the binding curve in the Scatchard plot, and that serious doubts may appear on the meaning of the binding parameters of the bi-Langmuir model. Experimental results of melanin binding to chloroquine and metoprolol are used to illustrate the importance of the choice of the binding isotherm and of the method used to evaluate the binding parameters.
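
    The two isotherms compared in this abstract can be written down in a few lines. The sketch below uses common textbook parameterizations, which are an assumption: the abstract gives no equations, and the paper may define the Sips exponent or affinity constant differently. All parameter names (`q1`, `k1`, `q_max`, `k`, `n`) are illustrative. The `scatchard_point` helper returns the coordinates used in a Scatchard plot (bound versus bound/free), which makes it easy to see that a single-class Sips curve with `n` below 1 is curved in that plot, much as a two-class model would be.

```python
def bi_langmuir(c, q1, k1, q2, k2):
    """Bound amount for two independent classes of sites (capacities q1, q2;
    affinity constants k1, k2) at free ligand concentration c."""
    return q1 * k1 * c / (1.0 + k1 * c) + q2 * k2 * c / (1.0 + k2 * c)


def sips(c, q_max, k, n):
    """Sips (Langmuir-Freundlich) isotherm: one continuous, unimodal class of
    sites; n = 1 recovers the plain Langmuir isotherm."""
    x = (k * c) ** n
    return q_max * x / (1.0 + x)


def scatchard_point(c, bound):
    """Return (bound, bound/free), the coordinates of one Scatchard-plot point."""
    return bound, bound / c


if __name__ == "__main__":
    # Arbitrary illustrative parameters, not values from the study.
    for c in (0.25, 1.0, 4.0):
        two_site = bi_langmuir(c, 0.4, 10.0, 0.6, 0.2)
        one_class = sips(c, 1.0, 1.0, 0.5)
        print(c, scatchard_point(c, two_site), scatchard_point(c, one_class))
```

    Fitting either function to measured data would additionally need a nonlinear least-squares routine, and, as the abstract notes, the estimated parameters can depend strongly on that choice.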

  20. QSAR Models for the Prediction of Plasma Protein Binding

    Directory of Open Access Journals (Sweden)

    Zeshan Amin

    2013-02-01

    Introduction: The prediction of plasma protein binding (ppb) is of paramount importance in the pharmacokinetic characterization of drugs, as it causes significant changes in volume of distribution, clearance and drug half-life. This study utilized Quantitative Structure-Activity Relationships (QSAR) for the prediction of plasma protein binding. Methods: Protein binding values for 794 compounds were collated from the literature. The data were partitioned into a training set of 662 compounds and an external validation set of 132 compounds. Physicochemical and molecular descriptors were calculated for each compound using ACD labs/logD, MOE (Chemical Computing Group) and Symyx QSAR software packages. Several data mining tools were employed for the construction of models. These included stepwise regression analysis, Classification and Regression Trees (CART), Boosted trees and Random Forest. Results: Several predictive models were identified; however, one model in particular produced significantly superior prediction accuracy for the external validation set as measured using mean absolute error and correlation coefficient. The selected model was a boosted regression tree model which had a mean absolute error of 13.25 for the training set and 14.96 for the validation set. Conclusion: Plasma protein binding can be modeled using simple regression trees or multiple linear regressions with reasonable model accuracies. These interpretable models were able to identify the governing molecular factors for a high ppb, which included hydrophobicity, van der Waals surface area parameters, and aromaticity. On the other hand, the more complicated ensemble method of boosted regression trees produced the most accurate ppb estimations for the external validation set.
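
    The two accuracy measures named in this abstract, mean absolute error and the correlation coefficient, can be computed as in the sketch below. It assumes the Pearson coefficient is meant, which the abstract does not specify, and the function names are illustrative. The numbers in the demo are made-up percentages of protein binding, not data from the study.

```python
import math


def mean_absolute_error(observed, predicted):
    """Average of |observed - predicted| over paired values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(o - p) for o, p in zip(observed, predicted)) / len(observed)


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("inputs must have equal length of at least 2")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


if __name__ == "__main__":
    observed = [90.0, 50.0, 10.0]   # hypothetical measured % bound
    predicted = [80.0, 55.0, 20.0]  # hypothetical model output
    print(mean_absolute_error(observed, predicted))
    print(pearson_r(observed, predicted))
```

    Reporting both measures on the external validation set, as the study does, guards against a model that ranks compounds well but is systematically offset, or the reverse.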

  1. Automated benchmarking of peptide-MHC class I binding predictions

    DEFF Research Database (Denmark)

    Trolle, Thomas; Metushi, Imir G.; Greenbaum, Jason

    2015-01-01

    the public access to frequent, up-to-date performance evaluations of all participating tools. To overcome potential selection bias in the data included in the IEDB, a strategy was implemented that suggests a set of peptides for which different prediction methods give divergent predictions as to their binding...... educated selections between participating tools. Of the four participating servers, NetMHCpan performed the best, followed by ANN, SMM and finally ARB. Availability and implementation: Up-to-date performance evaluations of each server can be found online at http://tools.iedb.org/auto_bench/mhci/weekly. All...

  2. Predicting interaction sites from the energetics of isolated proteins: a new approach to epitope mapping.

    Science.gov (United States)

    Scarabelli, Guido; Morra, Giulia; Colombo, Giorgio

    2010-05-19

    An increasing number of functional studies of proteins have shown that sequence and structural similarities alone may not be sufficient for reliable prediction of their interaction properties. This is particularly true for proteins recognizing specific antibodies, where the prediction of antibody-binding sites, called epitopes, has proven challenging. The antibody-binding properties of an antigen depend on its structure and related dynamics. Aiming to predict the antibody-binding regions of a protein, we investigate a new approach based on the integrated analysis of the dynamical and energetic properties of antigens, to identify nonoptimized, low-intensity energetic interaction networks in the protein structure isolated in solution. The method is based on the idea that recognition sites may correspond to localized regions with low-intensity energetic couplings with the rest of the protein, which allows them to undergo conformational changes, to be recognized by a binding partner, and to tolerate mutations with minimal energetic expense. Upon analyzing the results on isolated proteins and benchmarking against antibody complexes, it is found that the method successfully identifies binding sites located on the protein surface that are accessible to putative binding partners. The combination of dynamics and energetics can thus discriminate between epitopes and other substructures based only on physical properties. We discuss implications for vaccine design.

  3. Imputation for transcription factor binding predictions based on deep learning

    Science.gov (United States)

    Qin, Qian

    2017-01-01

    Understanding the cell-specific binding patterns of transcription factors (TFs) is fundamental to studying gene regulatory networks in biological systems, for which ChIP-seq not only provides valuable data but is also considered as the gold standard. Despite tremendous efforts from the scientific community to conduct TF ChIP-seq experiments, the available data represent only a limited percentage of ChIP-seq experiments, considering all possible combinations of TFs and cell lines. In this study, we demonstrate a method for accurately predicting cell-specific TF binding for TF-cell line combinations based on only a small fraction (4%) of the combinations using available ChIP-seq data. The proposed model, termed TFImpute, is based on a deep neural network with a multi-task learning setting to borrow information across transcription factors and cell lines. Compared with existing methods, TFImpute achieves comparable accuracy on TF-cell line combinations with ChIP-seq data; moreover, TFImpute achieves better accuracy on TF-cell line combinations without ChIP-seq data. This approach can predict cell line specific enhancer activities in K562 and HepG2 cell lines, as measured by massively parallel reporter assays, and predicts the impact of SNPs on TF binding. PMID:28234893

  4. Human immunodeficiency virus drug development assisted with AlGaN/GaN high electron mobility transistors and binding-site models

    Science.gov (United States)

    Kang, Yen-Wen; Lee, Geng-Yen; Chyi, Jen-Inn; Hsu, Chen-Pin; Hsu, You-Ren; Hsu, Chia-Hsien; Huang, Yu-Fen; Sun, Yuh-Chang; Chen, Chih-Chen; Chun Hung, Sheng; Ren, Fan; Andrew Yeh, J.; Wang, Yu-Lin

    2013-04-01

    Human immunodeficiency virus (HIV) reverse transcriptase (RT)-immobilized AlGaN/GaN high electron mobility transistors (HEMTs) and binding-site models were used to determine the dissociation constant of the HIV RT-inhibitor complex and the number of binding sites on RT for the inhibitor, Efavirenz. One binding site on the RT for the inhibitor is predicted, and the dissociation constant extracted from the binding-site model is 0.212 nM. The AlGaN/GaN HEMTs and the binding-site models are demonstrated to be good tools to assist drug development by elucidating the dissociation constants and the number of binding sites, which can largely reduce the cost and time of drug development.

  5. Leveraging cross-species transcription factor binding site patterns

    DEFF Research Database (Denmark)

    Claussnitzer, Melina; Dankel, Simon N; Klocke, Bernward;

    2014-01-01

    diabetes risk loci revealed a striking clustering of distinct homeobox TFBS. We identified the PRRX1 homeobox factor as a repressor of PPARG2 expression in adipose cells and demonstrate its adverse effect on lipid metabolism and systemic insulin sensitivity, dependent on the rs4684847 risk allele......Genome-wide association studies have revealed numerous risk loci associated with diverse diseases. However, identification of disease-causing variants within association loci remains a major challenge. Divergence in gene expression due to cis-regulatory variants in noncoding regions is central...... to disease susceptibility. We show that integrative computational analysis of phylogenetic conservation with a complexity assessment of co-occurring transcription factor binding sites (TFBS) can identify cis-regulatory variants and elucidate their mechanistic role in disease. Analysis of established type 2...

  6. Distribution of intercalative dye binding sites in chromatin.

    Science.gov (United States)

    Lurquin, P F; Seligy, V L

    1976-04-01

    Actinomycin D (AMD) and ethidium bromide (EB) were found to bind to chromatin isolated from a variety of gander tissues according to a strong and a weak process analogous to those found for deproteinized DNA. The distributions of the dye intercalation sites in chromatin and DNA were evaluated at low r-values (dye bound per nucleotide) by following the appearance of free dye released from chromatin and DNA during thermal denaturation. The AMD dissociation profiles closely resembled the DNA or chromatin-DNA denaturation profiles, whereas the EB derivative dissociation profiles indicated 3 major transitions for transcriptionally active chromatin, with the main component corresponding to the single component which characterizes DNA. The DNA-like component was greatly reduced for mature erythrocyte chromatin but could be generated by removal of histones I and V. Removal of residual non-acid-soluble proteins from dehistonized chromatin, urea treatment, or dissociation and reconstitution of chromatin favoured conversion to the DNA-like component with loss of the other two. This study indicates that more than one type of binding exists generally in chromatin.

  7. The putative effector-binding site of Leishmania mexicana pyruvate kinase studied by site-directed mutagenesis.

    Science.gov (United States)

    Hannaert, Véronique; Yernaux, Cédric; Rigden, Daniel J; Fothergill-Gilmore, Linda A; Opperdoes, Fred R; Michels, Paul A M

    2002-03-13

    The activity of pyruvate kinase of Leishmania mexicana is allosterically regulated by fructose 2,6-bisphosphate (F-2,6-P(2)), contrary to the pyruvate kinases from other eukaryotes that are usually stimulated by fructose 1,6-bisphosphate (F-1,6-P(2)). Based on the comparison of the three-dimensional structure of Saccharomyces cerevisiae pyruvate kinase crystallized with F-1,6-P(2) present at the effector site (R-state) and the L. mexicana enzyme crystallized in the T-state, two residues (Lys453 and His480) were proposed to bind the 2-phospho group of the effector. This hypothesis was tested by site-directed mutagenesis. The allosteric activation by F-2,6-P(2) appeared to be entirely abrogated in the mutated enzymes confirming our predictions.

  8. ncDNA and drift drive binding site accumulation

    Directory of Open Access Journals (Sweden)

    Ruths Troy

    2012-08-01

    Background: The amount of transcription factor binding sites (TFBS) in an organism's genome positively correlates with the complexity of the regulatory network of the organism. However, the manner by which TFBS arise and accumulate in genomes and the effects of regulatory network complexity on the organism's fitness are far from being known. The availability of TFBS data from many organisms provides an opportunity to explore these issues, particularly from an evolutionary perspective. Results: We analyzed TFBS data from five model organisms (E. coli K12, S. cerevisiae, C. elegans, D. melanogaster, A. thaliana) and found a positive correlation between the amount of non-coding DNA (ncDNA) in the organism's genome and regulatory complexity. Based on this finding, we hypothesize that the amount of ncDNA, combined with the population size, can explain the patterns of regulatory complexity across organisms. To test this hypothesis, we devised a genome-based regulatory pathway model and subjected it to the forces of evolution through population genetic simulations. The results support our hypothesis, showing that neutral evolutionary forces alone can explain TFBS patterns, and that selection on the regulatory network function does not alter this finding. Conclusions: The cis-regulome is not a clean functional network crafted by adaptive forces alone, but instead a data source filled with the noise of non-adaptive forces. From a regulatory perspective, this evolutionary noise manifests as complexity on both the binding site and pathway level, which has significant implications for many directions in microbiology, genetics, and synthetic biology.

  9. Screening Mixtures of Small Molecules for Binding to Multiple Sites on the Surface Tetanus Toxin C Fragment by Bioaffinity NMR

    Energy Technology Data Exchange (ETDEWEB)

    Cosman, M; Zeller, L; Lightstone, F C; Krishnan, V V; Balhorn, R

    2002-01-01

    The clostridial neurotoxins include the closely related tetanus (TeNT) and botulinum (BoNT) toxins. Botulinum toxin is used to treat severe muscle disorders and as a cosmetic wrinkle reducer. Large quantities of botulinum toxin have also been produced by terrorists for use as a biological weapon. Because there are no known antidotes for these toxins, they thus pose a potential threat to human health whether by an accidental overdose or by a hostile deployment. Thus, the discovery of high specificity and affinity compounds that can inhibit their binding to neural cells can be used as antidotes or in the design of chemical detectors. Using the crystal structure of the C fragment of the tetanus toxin (TetC), which is the cell recognition and cell surface binding domain, and the computational program DOCK, sets of small molecules have been predicted to bind to two different sites located on the surface of this protein. While Site-1 is common to the TeNT and BoNTs, Site-2 is unique to TeNT. Pairs of these molecules from each site can then be linked together synthetically to thereby increase the specificity and affinity for this toxin. Electrospray ionization mass spectroscopy was used to experimentally screen each compound for binding. Mixtures containing binders were further screened for activity under biologically relevant conditions using nuclear magnetic resonance (NMR) methods. The screening of mixtures of compounds offers increased efficiency and throughput as compared to testing single compounds and can also evaluate how possible structural changes induced by the binding of one ligand can influence the binding of the second ligand. In addition, competitive binding experiments with mixtures containing ligands predicted to bind the same site could identify the best binder for that site. NMR transfer nuclear Overhauser effect (trNOE) confirm that TetC binds doxorubicin but that this molecule is displaced by N-acetylneuraminic acid (sialic acid) in a mixture that

  10. Study on Synthesis and Binding Ability of a New Anion Receptor Containing NH Binding Sites

    Institute of Scientific and Technical Information of China (English)

    QIAO,Yan-Hong; LIN,Hai; LIN,Hua-Kuan

    2007-01-01

    A new colorimetric recognition receptor 1, which contains NH binding sites and selectively senses anionic guest species, has been synthesized. Compared with other halide anions, its UV/Vis absorption spectrum in dimethyl sulfoxide responded to fluoride with high selectivity, and the solution changed dramatically from colorless to yellow in the presence of TBAF (5 × 10⁻⁵ mol/L). A similar change in the UV/Vis absorption spectrum occurred when 1 was treated with AcO⁻, whereas H₂PO₄⁻ and OH⁻ caused only a small change. Receptor 1 has almost no affinity for Cl⁻, Br⁻ and I⁻. The high selectivity of receptor 1 for fluoride over the other halides is attributed to anion size and hydrogen-bonding ability, while the different binding abilities toward the triangular (AcO⁻), tetrahedral (H₂PO₄⁻) and linear (OH⁻) anions may result from their geometric configurations.

  11. L-[3H]glutamate binds to kainate-, NMDA- and AMPA-sensitive binding sites: an autoradiographic analysis

    Energy Technology Data Exchange (ETDEWEB)

    Monaghan, D.T.; Yao, D.; Cotman, C.W.

    1985-08-12

    The anatomical distribution of L-[3H]glutamate binding sites was determined in the presence of various glutamate analogues using quantitative autoradiography. The binding of L-[3H]glutamate is accounted for by the presence of 3 distinct binding sites when measured in the absence of Ca²⁺, Cl⁻ and Na⁺ ions. The anatomical distribution and pharmacological specificity of these binding sites correspond to those reported for the 3 excitatory amino acid binding sites selectively labelled by D-[3H]2-amino-5-phosphonopentanoate (D-[3H]AP5), [3H]kainate ([3H]KA) and [3H]α-amino-3-hydroxy-5-methyl-4-isoxazolepropionic acid ([3H]AMPA), which are thought to be selective ligands for the N-methyl-D-aspartate (NMDA), KA and quisqualate (QA) receptors, respectively. (Auth.). 29 refs.; 1 figure; 1 table.

  12. Development of predictive models for predicting binding affinity of endocrine disrupting chemicals to fish sex hormone-binding globulin.

    Science.gov (United States)

    Liu, Huihui; Yang, Xianhai; Yin, Cen; Wei, Mengbi; He, Xiao

    2017-02-01

    Disturbing the transport process is a crucial pathway by which endocrine disrupting chemicals (EDCs) disrupt endocrine function. However, this mechanism has received less attention than mechanisms involving hormone receptors and synthetases. Recently, we explored the interaction between EDCs and human sex hormone-binding globulin (hSHBG). In this study, interactions between EDCs and sex hormone-binding globulin of eight fish species (fSHBG) were investigated by employing classification methods and quantitative structure-activity relationships (QSAR). In the modeling, the relative binding affinity (RBA) of a chemical with 17β-estradiol binding to fSHBG was selected as the endpoint. Classification models were developed for two fish species, while QSAR models were established for the other six fish species. Statistical results indicated that the models had satisfactory goodness of fit, robustness and predictive ability, and that the applicability domain covered a large number of endogenous and exogenous steroidal and non-steroidal chemicals. Additionally, by comparing the log RBA values, it was found that the same chemical may have different affinities for fSHBG from different fish species, so species diversity should be taken into account. However, the affinity of fSHBG showed a high correlation for fishes within the same Order (i.e., Salmoniformes, Cypriniformes, Perciformes and Siluriformes), so the fSHBG binding data for one fish species could be used to extrapolate to other fish species in the same Order.

  13. Pocketome: an encyclopedia of small-molecule binding sites in 4D.

    Science.gov (United States)

    Kufareva, Irina; Ilatovskiy, Andrey V; Abagyan, Ruben

    2012-01-01

    The importance of binding site plasticity in protein-ligand interactions is well-recognized, and so are the difficulties in predicting the nature and the degree of this plasticity by computational means. To assist in understanding the flexible protein-ligand interactions, we constructed the Pocketome, an encyclopedia of about one thousand experimentally solved conformational ensembles of druggable binding sites in proteins, grouped by location and consistent chain/cofactor composition. The multiplicity of pockets within the ensembles adds an extra, fourth dimension to the Pocketome entry data. Within each ensemble, the pockets were carefully classified by the degree of their pairwise similarity and compatibility with different ligands. The core of the Pocketome is derived regularly and automatically from the current releases of the Protein Data Bank and the Uniprot Knowledgebase; this core is complemented by entries built from manually provided seed ligand locations. The Pocketome website (www.pocketome.org) allows searching for the sites of interest, analysis of conformational clusters, important residues, binding compatibility matrices and interactive visualization of the ensembles using the ActiveICM web browser plugin. The Pocketome collection can be used to build multi-conformational docking and 3D activity models as well as to design cross-docking and virtual ligand screening benchmarks.

  14. Characterization of the Binding Site of Aspartame in the Human Sweet Taste Receptor.

    Science.gov (United States)

    Maillet, Emeline L; Cui, Meng; Jiang, Peihua; Mezei, Mihaly; Hecht, Elizabeth; Quijada, Jeniffer; Margolskee, Robert F; Osman, Roman; Max, Marianna

    2015-10-01

    The sweet taste receptor, a heterodimeric G protein-coupled receptor comprised of T1R2 and T1R3, binds sugars, small molecule sweeteners, and sweet proteins to multiple binding sites. The dipeptide sweetener, aspartame binds in the Venus Flytrap Module (VFTM) of T1R2. We developed homology models of the open and closed forms of human T1R2 and human T1R3 VFTMs and their dimers and then docked aspartame into the closed form of T1R2's VFTM. To test and refine the predictions of our model, we mutated various T1R2 VFTM residues, assayed activity of the mutants and identified 11 critical residues (S40, Y103, D142, S144, S165, S168, Y215, D278, E302, D307, and R383) in and proximal to the binding pocket of the sweet taste receptor that are important for ligand recognition and activity of aspartame. Furthermore, we propose that binding is dependent on 2 water molecules situated in the ligand pocket that bridge 2 carbonyl groups of aspartame to residues D142 and L279. These results shed light on the activation mechanism and how signal transmission arising from the extracellular domain of the T1R2 monomer of the sweet receptor leads to the perception of sweet taste.

  15. Computational predictions provide insights into the biology of TAL effector target sites.

    Science.gov (United States)

    Grau, Jan; Wolf, Annett; Reschke, Maik; Bonas, Ulla; Posch, Stefan; Boch, Jens

    2013-01-01

    Transcription activator-like (TAL) effectors are injected into host plant cells by Xanthomonas bacteria to function as transcriptional activators for the benefit of the pathogen. The DNA binding domain of TAL effectors is composed of conserved amino acid repeat structures containing repeat-variable diresidues (RVDs) that determine DNA binding specificity. In this paper, we present TALgetter, a new approach for predicting TAL effector target sites based on a statistical model. In contrast to previous approaches, the parameters of TALgetter are estimated from training data computationally. We demonstrate that TALgetter successfully predicts known TAL effector target sites and often yields a greater number of predictions that are consistent with up-regulation in gene expression microarrays than an existing approach, Target Finder of the TALE-NT suite. We study the binding specificities estimated by TALgetter and confirm that different RVDs are differently important for transcriptional activation. In subsequent studies, the predictions of TALgetter indicate a previously unreported positional preference of TAL effector target sites relative to the transcription start site. In addition, several TAL effectors are predicted to bind to the TATA-box, which might constitute one general mode of transcriptional activation by TAL effectors. Scrutinizing the predicted target sites of TALgetter, we propose several novel TAL effector virulence targets in rice and sweet orange. TAL-mediated induction of the candidates is supported by gene expression microarrays. Validity of these targets is also supported by functional analogy to known TAL effector targets, by an over-representation of TAL effector targets with similar function, or by a biological function related to pathogen infection. Hence, these predicted TAL effector virulence targets are promising candidates for studying the virulence function of TAL effectors. TALgetter is implemented as part of the open-source Java library
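
    The idea of scanning a promoter for sites that match an effector's RVD sequence can be sketched in a few lines. This is not TALgetter's statistical model, whose parameters are estimated from training data; as a loudly labeled assumption, it uses a fixed lookup table of commonly cited RVD-to-base preferences, and all function names and the example sequence are hypothetical.

```python
# Simplified RVD-to-base preferences (an assumption for illustration only;
# TALgetter learns specificities from training data rather than a fixed table).
RVD_PREFERENCE = {"NI": "A", "HD": "C", "NG": "T", "NN": "GA", "NS": "ACGT"}

def score_site(rvds, site):
    """Count positions whose base is among those preferred by the aligned RVD."""
    return sum(base in RVD_PREFERENCE.get(rvd, "") for rvd, base in zip(rvds, site))

def scan(rvds, sequence):
    """Return (start, score) for every window, best score first, ties by position."""
    n = len(rvds)
    hits = [(i, score_site(rvds, sequence[i:i + n]))
            for i in range(len(sequence) - n + 1)]
    return sorted(hits, key=lambda hit: (-hit[1], hit[0]))

rvds = ["HD", "NI", "NG", "NN"]           # hypothetical four-repeat effector
ranked = scan(rvds, "TTACATGCATAG")       # made-up promoter fragment
best_start, best_score = ranked[0]
```

    A real predictor would weight RVDs differently and model positional effects, which is exactly what the abstract reports TALgetter estimating from data.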

  16. Computational predictions provide insights into the biology of TAL effector target sites.

    Directory of Open Access Journals (Sweden)

    Jan Grau

    Transcription activator-like (TAL) effectors are injected into host plant cells by Xanthomonas bacteria to function as transcriptional activators for the benefit of the pathogen. The DNA binding domain of TAL effectors is composed of conserved amino acid repeat structures containing repeat-variable diresidues (RVDs) that determine DNA binding specificity. In this paper, we present TALgetter, a new approach for predicting TAL effector target sites based on a statistical model. In contrast to previous approaches, the parameters of TALgetter are estimated from training data computationally. We demonstrate that TALgetter successfully predicts known TAL effector target sites and often yields a greater number of predictions that are consistent with up-regulation in gene expression microarrays than an existing approach, Target Finder of the TALE-NT suite. We study the binding specificities estimated by TALgetter and confirm that different RVDs are differently important for transcriptional activation. In subsequent studies, the predictions of TALgetter indicate a previously unreported positional preference of TAL effector target sites relative to the transcription start site. In addition, several TAL effectors are predicted to bind to the TATA-box, which might constitute one general mode of transcriptional activation by TAL effectors. Scrutinizing the predicted target sites of TALgetter, we propose several novel TAL effector virulence targets in rice and sweet orange. TAL-mediated induction of the candidates is supported by gene expression microarrays. Validity of these targets is also supported by functional analogy to known TAL effector targets, by an over-representation of TAL effector targets with similar function, or by a biological function related to pathogen infection. Hence, these predicted TAL effector virulence targets are promising candidates for studying the virulence function of TAL effectors. TALgetter is implemented as part of the open

  17. Quick and Simple Detection Technique to Assess the Binding of Antimicrotubule Agents to the Colchicine-Binding Site

    Directory of Open Access Journals (Sweden)

    Fortin Sébastien

    2010-04-01

    The development of antimitotics that bind to the colchicine-binding site for the treatment of cancer is rapidly expanding. Numerous antimicrotubule agents are prepared every year, and determining their binding affinity to tubulin requires purified tubulin and radiolabeled ligands. Such a procedure is costly and time-consuming and is therefore limited to the most promising candidates. Here, we report a quick and inexpensive method that requires only standard laboratory resources to assess the binding of antimicrotubule agents to the colchicine-binding site. The method is based on the ability of N,N'-ethylene-bis(iodoacetamide) (EBI) to crosslink, in living cells, the cysteine residues at positions 239 and 354 of β-tubulin, which are involved in the colchicine-binding site. The β-tubulin adduct formed by EBI is easily detectable by Western blot as a second immunoreactive band of β-tubulin that migrates faster than β-tubulin. Occupancy of the colchicine-binding site by pertinent antimitotics inhibits the formation of the EBI:β-tubulin adduct, resulting in an assay that allows the screening of new molecules targeting this binding site.

  18. Quick and Simple Detection Technique to Assess the Binding of Antimicrotubule Agents to the Colchicine-Binding Site

    Directory of Open Access Journals (Sweden)

    Moreau Emmanuel

    2010-01-01

    The development of antimitotics that bind to the colchicine-binding site for the treatment of cancer is rapidly expanding. Numerous antimicrotubule agents are prepared every year, and determining their binding affinity to tubulin requires purified tubulin and radiolabeled ligands. Such a procedure is costly and time-consuming and is therefore limited to the most promising candidates. Here, we report a quick and inexpensive method that requires only standard laboratory resources to assess the binding of antimicrotubule agents to the colchicine-binding site. The method is based on the ability of N,N'-ethylene-bis(iodoacetamide) (EBI) to crosslink, in living cells, the cysteine residues at positions 239 and 354 of β-tubulin, which are involved in the colchicine-binding site. The β-tubulin adduct formed by EBI is easily detectable by Western blot as a second immunoreactive band of β-tubulin that migrates faster than β-tubulin. Occupancy of the colchicine-binding site by pertinent antimitotics inhibits the formation of the EBI:β-tubulin adduct, resulting in an assay that allows the screening of new molecules targeting this binding site.

  19. Predicting accurate absolute binding energies in aqueous solution

    DEFF Research Database (Denmark)

    Jensen, Jan Halborg

    2015-01-01

    Recent predictions of absolute binding free energies of host-guest complexes in aqueous solution using electronic structure theory have been encouraging for some systems, while other systems remain problematic. In this paper I summarize some of the many factors that could easily contribute 1-3 kcal mol⁻¹ errors at 298 K: three-body dispersion effects, molecular symmetry, anharmonicity, spurious imaginary frequencies, insufficient conformational sampling, wrong or changing ionization states, errors in the solvation free energy of ions, and explicit solvent (and ion) effects that are not well represented by continuum models. While I focus on binding free energies in aqueous solution, the approach also applies (with minor adjustments) to any free energy difference such as conformational or reaction free energy differences or activation free energies in any solvent.
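
    The size of these errors is easier to appreciate as a fold-error in the binding constant. The sketch below is an illustration rather than part of the paper; it assumes only the standard relation ΔG = -RT ln K, and the function name is hypothetical.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal/(mol*K)

def fold_error_in_k(delta_g_error, temperature=298.0):
    """Factor by which the binding constant K is off for a given error
    in the binding free energy (kcal/mol), using dG = -RT ln K."""
    return math.exp(abs(delta_g_error) / (R_KCAL * temperature))

for err in (1.0, 3.0):
    print(f"{err:.0f} kcal/mol error -> K off by a factor of about "
          f"{fold_error_in_k(err):.0f}")
```

    At 298 K an error of 1 kcal/mol corresponds to roughly a 5-fold error in K, and 3 kcal/mol to roughly a 160-fold error, which is why the listed contributions matter in practice.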

  20. Discovery of a novel allosteric inhibitor-binding site in ERK5: comparison with the canonical kinase hinge ATP-binding site.

    Science.gov (United States)

    Chen, Hongming; Tucker, Julie; Wang, Xiaotao; Gavine, Paul R; Phillips, Chris; Augustin, Martin A; Schreiner, Patrick; Steinbacher, Stefan; Preston, Marian; Ogg, Derek

    2016-05-01

    MAP kinases act as an integration point for multiple biochemical signals and are involved in a wide variety of cellular processes such as proliferation, differentiation, regulation of transcription and development. As a member of the MAP kinase family, ERK5 (MAPK7) is involved in the downstream signalling pathways of various cell-surface receptors, including receptor tyrosine kinases and G protein-coupled receptors. In the current study, five structures of the ERK5 kinase domain co-crystallized with ERK5 inhibitors are reported. Interestingly, three of the compounds bind at a novel allosteric binding site in ERK5, while the other two bind at the typical ATP-binding site. Binding of inhibitors at the allosteric site is accompanied by displacement of the P-loop into the ATP-binding site and is shown to be ATP-competitive in an enzymatic assay of ERK5 kinase activity. Kinase selectivity data show that the most potent allosteric inhibitor exhibits superior kinase selectivity compared with the two inhibitors that bind at the canonical ATP-binding site. An analysis of these structures and comparison with both a previously published ERK5-inhibitor complex structure (PDB entry 4b99) and the structures of three other kinases (CDK2, ITK and MEK) in complex with allosteric inhibitors are presented.

  1. Binding of lipoic acid induces conformational change and appearance of a new binding site in methylglyoxal modified serum albumin.

    Science.gov (United States)

    Suji, George; Khedkar, Santosh A; Singh, Sreelekha K; Kishore, Nand; Coutinho, Evans C; Bhor, Vikrant M; Sivakami, S

    2008-06-01

    The binding of lipoic acid (LA) to methylglyoxal (MG)-modified BSA was studied using isothermal titration calorimetry in combination with enzyme kinetics and molecular modelling. The binding of LA to BSA was sequential with two sites, one with a higher binding constant and another with a comparatively lower one. In contrast, the modified protein showed three sequential binding sites, with the affinity at the high-affinity binding site reduced by a factor of 10. CD results show appreciable changes in the conformation of the modified protein as a result of binding to LA. The inhibition of the esterase-like activity of BSA by LA revealed that it binds to site II in domain III of BSA. The pH dependence of the esterase activity of native BSA indicated a catalytic group with a pKa of 7.9 ± 0.1, assigned to Tyr411, with the conjugate base stabilised by interaction with Arg410. Upon modification by MG, this pKa increased to 8.13. Complexes obtained by docking LA to BSA and to BSA in which Arg410 is modified to hydroimidazolone showed that, in the modified protein, the long hydrocarbon chain of lipoic acid sits in a cavity different from the one observed for unmodified BSA. The molecular electrostatic potential showed that the modification of Arg410 reduced the positive electrostatic potential around the protein-binding site. Thus it can be concluded that the modification of BSA by MG resulted in altered ligand binding characteristics due to changes in the internal geometry and electrostatic potential at the binding site.
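
    What the reported pKa shift means for the catalytic group can be illustrated with the Henderson-Hasselbalch relation. This is a minimal sketch, not the authors' analysis; the evaluation pH of 7.9 is an assumption chosen for illustration, and the function name is hypothetical.

```python
def fraction_deprotonated(ph, pka):
    """Henderson-Hasselbalch: fraction of the group present as its conjugate base."""
    return 1.0 / (1.0 + 10.0 ** (pka - ph))

PH = 7.9  # illustrative pH, chosen equal to the native pKa (an assumption)

native = fraction_deprotonated(PH, 7.9)      # native BSA, pKa 7.9
modified = fraction_deprotonated(PH, 8.13)   # MG-modified BSA, pKa 8.13
```

    Under this assumption half of the native group is deprotonated, while the raised pKa leaves a noticeably smaller fraction of the modified protein in the conjugate-base form at the same pH.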

  2. Binding Energy Distribution Analysis Method: Hamiltonian Replica Exchange with Torsional Flattening for Binding Mode Prediction and Binding Free Energy Estimation.

    Science.gov (United States)

    Mentes, Ahmet; Deng, Nan-Jie; Vijayan, R S K; Xia, Junchao; Gallicchio, Emilio; Levy, Ronald M

    2016-05-10

    Molecular dynamics modeling of complex biological systems is limited by finite simulation time. The simulations are often trapped close to local energy minima separated by high energy barriers. Here, we introduce Hamiltonian replica exchange (H-REMD) with torsional flattening in the Binding Energy Distribution Analysis Method (BEDAM), to reduce energy barriers along torsional degrees of freedom and accelerate sampling of intramolecular degrees of freedom relevant to protein-ligand binding. The method is tested on a standard benchmark (T4 Lysozyme/L99A/p-xylene complex) and on a library of HIV-1 integrase complexes derived from the SAMPL4 blind challenge. We applied the torsional flattening strategy to 26 of the 53 known binders to the HIV Integrase LEDGF site found to have a binding energy landscape funneled toward the crystal structure. We show that our approach samples the conformational space more efficiently than the original method without flattening when starting from a poorly docked pose with incorrect ligand dihedral angle conformations. In these unfavorable cases convergence to a binding pose within 2-3 Å from the crystallographic pose is obtained within a few nanoseconds of the Hamiltonian replica exchange simulation. We found that torsional flattening is insufficient in cases where trapping is due to factors other than torsional energy, such as the formation of incorrect intramolecular hydrogen bonds and stacking. Work is in progress to generalize the approach to handle these cases and thereby make it more widely applicable.
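
    The exchange step underlying Hamiltonian replica exchange can be sketched with a toy torsional term. This is an illustration under stated assumptions, not the BEDAM implementation: the threefold cosine potential, the barrier height, the scaling factors and the function names are all hypothetical, and only the standard Metropolis swap criterion for replicas at a common temperature is taken as given.

```python
import math

K_B = 0.0019872  # Boltzmann constant in kcal/(mol*K)

def torsion_energy(phi, scale, barrier=3.0):
    """Toy threefold torsional term (kcal/mol); scale < 1 flattens the barrier."""
    return scale * 0.5 * barrier * (1.0 + math.cos(3.0 * phi))

def swap_probability(phi_i, phi_j, scale_i, scale_j, temperature=300.0):
    """Metropolis probability of exchanging configurations between two replicas
    that share a temperature but use differently scaled Hamiltonians."""
    beta = 1.0 / (K_B * temperature)
    before = torsion_energy(phi_i, scale_i) + torsion_energy(phi_j, scale_j)
    after = torsion_energy(phi_j, scale_i) + torsion_energy(phi_i, scale_j)
    return min(1.0, math.exp(-beta * (after - before)))

# The unscaled replica sits on a barrier top (phi = 0) and the flattened replica
# in a minimum (phi = pi/3): swapping lowers the total energy, so it is accepted.
downhill = swap_probability(0.0, math.pi / 3, 1.0, 0.2)
# The reverse arrangement raises the total energy and is rarely accepted.
uphill = swap_probability(math.pi / 3, 0.0, 1.0, 0.2)
```

    The flattened replicas cross torsional barriers easily and hand favorable configurations down to the unscaled replica, which is the mechanism the abstract credits for faster convergence.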

  3. Mutated primer binding sites interacting with different tRNAs allow efficient murine leukemia virus replication

    DEFF Research Database (Denmark)

    Lund, Anders Henrik; Duch, M; Lovmand, J;

    1993-01-01

    Two Akv murine leukemia virus-based retroviral vectors with primer binding sites matching tRNA(Gln-1) and tRNA(Lys-3) were constructed. The transduction efficiency of these mutated vectors was found to be comparable to that of a vector carrying the wild-type primer binding site matching tRNA(Pro). Polymerase chain reaction amplification and sequence analysis of transduced proviruses confirmed the transfer of vectors with mutated primer binding sites and further showed that tRNA(Gln-2) may act efficiently in conjunction with the tRNA(Gln-1) primer binding site. We conclude that murine leukemia virus can replicate by using various tRNA molecules as primers and propose primer binding site-tRNA primer interactions to be of major importance for tRNA primer selection. However, efficient primer selection does not require perfect Watson-Crick base pairing at all 18 positions of the primer binding site.

  4. Peripheral benzodiazepine binding sites on striated muscles of the rat: Properties and effect of denervation

    Energy Technology Data Exchange (ETDEWEB)

    Mueller, W.E.; Ickstadt, A. (Mainz Univ. (Germany, F.R.). Pharmakologisches Inst.); Hopf, H.Ch. (Mainz Univ. (Germany, F.R.))

    1985-01-01

    In order to test the hypothesis that peripheral benzodiazepine binding sites mediate some direct effects of benzodiazepines on striated muscles, the properties of specific [3H]Ro 5-4864 binding to rat biceps and rat diaphragm homogenates were investigated. In both tissues a single population of sites was found with a K_D value of 3 nmol/l. The density of these sites in both muscles was higher than the density in rat brain, but was considerably lower than in rat kidney. Competition experiments indicate a substrate specificity of specific [3H]Ro 5-4864 binding similar to the properties already demonstrated for the specific binding of this ligand to peripheral benzodiazepine binding sites in many other tissues. The properties of these sites in the rat diaphragm are not changed after motoric denervation by phrenicectomy. It is concluded that peripheral benzodiazepine binding sites are not involved in direct effects of benzodiazepines on striated muscles.
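
    For a single population of sites, the reported dissociation constant of 3 nmol/l fixes how occupancy changes with ligand concentration. The sketch below assumes the standard one-site saturation isotherm for non-interacting sites; the function name and the example concentrations are hypothetical.

```python
def fractional_occupancy(ligand_nM, kd_nM=3.0):
    """Fraction of sites occupied at equilibrium for one class of
    non-interacting sites: L / (K_D + L)."""
    if ligand_nM < 0 or kd_nM <= 0:
        raise ValueError("concentration must be >= 0 and K_D must be > 0")
    return ligand_nM / (kd_nM + ligand_nM)

half = fractional_occupancy(3.0)    # ligand at K_D
high = fractional_occupancy(27.0)   # ligand at nine times K_D
```

    Half of the sites are occupied when the free ligand concentration equals K_D, and about 90% at nine times K_D.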

  5. Nonlinear scoring functions for similarity-based ligand docking and binding affinity prediction.

    Science.gov (United States)

    Brylinski, Michal

    2013-11-25

    A common strategy for virtual screening considers a systematic docking of a large library of organic compounds into the target sites in protein receptors with promising leads selected based on favorable intermolecular interactions. Despite a continuous progress in the modeling of protein-ligand interactions for pharmaceutical design, important challenges still remain, thus the development of novel techniques is required. In this communication, we describe eSimDock, a new approach to ligand docking and binding affinity prediction. eSimDock employs nonlinear machine learning-based scoring functions to improve the accuracy of ligand ranking and similarity-based binding pose prediction, and to increase the tolerance to structural imperfections in the target structures. In large-scale benchmarking using the Astex/CCDC data set, we show that 53.9% (67.9%) of the predicted ligand poses have RMSD of <2 Å (<3 Å). Moreover, using binding sites predicted by recently developed eFindSite, eSimDock models ligand binding poses with an RMSD of 4 Å for 50.0-39.7% of the complexes at the protein homology level limited to 80-40%. Simulations against non-native receptor structures, whose mean backbone rearrangements vary from 0.5 to 5.0 Å Cα-RMSD, show that the ratio of docking accuracy and the estimated upper bound is at a constant level of ∼0.65. Pearson correlation coefficient between experimental and predicted by eSimDock Ki values for a large data set of the crystal structures of protein-ligand complexes from BindingDB is 0.58, which decreases only to 0.46 when target structures distorted to 3.0 Å Cα-RMSD are used. Finally, two case studies demonstrate that eSimDock can be customized to specific applications as well. These encouraging results show that the performance of eSimDock is largely unaffected by the deformations of ligand binding regions, thus it represents a practical strategy for across-proteome virtual screening using protein models. eSimDock is freely
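
    The pose-accuracy figures quoted above rest on a root-mean-square deviation between predicted and reference ligand coordinates. A minimal sketch follows, assuming both poses list the same atoms in the same order and in the same receptor frame (no superposition); the function names and coordinates are hypothetical and this is not eSimDock code.

```python
import math

def pose_rmsd(predicted, reference):
    """RMSD between two poses given as equal-length lists of (x, y, z)
    atom coordinates; the result is in the units of the inputs."""
    if len(predicted) != len(reference) or not predicted:
        raise ValueError("poses must be non-empty and have the same atom count")
    squared = sum((p - r) ** 2
                  for atom_p, atom_r in zip(predicted, reference)
                  for p, r in zip(atom_p, atom_r))
    return math.sqrt(squared / len(predicted))

def success_rate(rmsds, cutoff):
    """Fraction of poses whose RMSD is strictly below the cutoff."""
    return sum(r < cutoff for r in rmsds) / len(rmsds)

reference = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
shifted = [(0.0, 0.0, 2.0), (1.0, 0.0, 2.0)]   # made-up pose, moved 2 Å along z
```

    Applying `success_rate` to a benchmark's RMSD values with cutoffs of 2 Å and 3 Å gives percentages of the kind reported for the Astex/CCDC data set.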

  6. In vitro site selection of a consensus binding site for the Drosophila melanogaster Tbx20 homolog midline.

    Directory of Open Access Journals (Sweden)

    Nima Najand

    We employed in vitro site selection to identify a consensus binding sequence for the Drosophila melanogaster Tbx20 T-box transcription factor homolog Midline. We purified a bacterially expressed T-box DNA binding domain of Midline, and used it in four rounds of precipitation and polymerase chain reaction-based amplification. We cloned and sequenced 54 random oligonucleotides selected by Midline. Electrophoretic mobility shift assays confirmed that 27 of these could bind the Midline T-box. Sequence alignment of these 27 clones suggests that Midline binds as a monomer to a consensus sequence that contains an AGGTGT core. Thus, the Midline consensus binding site we define in this study is similar to that defined for vertebrate Tbx20, but differs from a previously reported Midline binding sequence derived through site selection.
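
    Deriving a consensus from aligned selected clones amounts to a column-wise majority vote. The sketch below illustrates that step only; the three sequences are made up for the example (they are not the clones from the study) and the function name is hypothetical.

```python
from collections import Counter

def consensus(aligned):
    """Most frequent base in each column of equal-length aligned sequences."""
    if len({len(seq) for seq in aligned}) != 1:
        raise ValueError("sequences must be aligned to the same length")
    return "".join(Counter(column).most_common(1)[0][0]
                   for column in zip(*aligned))

clones = ["TAGGTGTGA", "CAGGTGTCA", "TAGGTGTGG"]   # invented aligned clones
site = consensus(clones)
```

    In practice the per-column counts would usually be kept as a position frequency matrix, since a single consensus string hides how strongly each position is constrained.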

  7. Evolutionary computation for discovery of composite transcription factor binding sites

    Science.gov (United States)

    Fogel, Gary B.; Porto, V. William; Varga, Gabor; Dow, Ernst R.; Craven, Andrew M.; Powers, David M.; Harlow, Harry B.; Su, Eric W.; Onyia, Jude E.; Su, Chen

    2008-01-01

    Previous research demonstrated the use of evolutionary computation for the discovery of transcription factor binding sites (TFBS) in promoter regions upstream of coexpressed genes. However, it remained unclear whether or not composite TFBS elements, commonly found in higher organisms where two or more TFBSs form functional complexes, could also be identified by using this approach. Here, we present an important refinement of our previous algorithm and test the identification of composite elements using NFAT/AP-1 as an example. We demonstrate that by using appropriate existing parameters such as window size, novel scoring methods such as central bonusing, and methods of self-adaptation to automatically adjust the variation operators during the evolutionary search, TFBSs of different sizes and complexity can be identified as top solutions. Some of these solutions have known experimental relationships with NFAT/AP-1. We also show that even after properly tuning the model parameters, the choice of the appropriate window size has a significant effect on algorithm performance. We believe that this improved algorithm will greatly augment TFBS discovery. PMID:18927103

  8. Evolving Transcription Factor Binding Site Models From Protein Binding Microarray Data

    KAUST Repository

    Wong, Ka-Chun

    2016-02-02

    Protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner. In this paper, we describe the PBM motif model building problem. We apply several evolutionary computation methods and compare their performance with the interior point method, demonstrating their performance advantages. In addition, given the PBM domain knowledge, we propose and describe a novel method called kmerGA which makes domain-specific assumptions to exploit PBM data properties to build more accurate models than the other models built. The effectiveness and robustness of kmerGA is supported by comprehensive performance benchmarking on more than 200 datasets, time complexity analysis, convergence analysis, parameter analysis, and case studies. To demonstrate its utility further, kmerGA is applied to two real world applications: 1) PBM rotation testing and 2) ChIP-Seq peak sequence prediction. The results support the biological relevance of the models learned by kmerGA, and thus its real world applicability.

  9. SVM-based prediction of caspase substrate cleavage sites

    Directory of Open Access Journals (Sweden)

    Ranganathan Shoba

    2006-12-01

    Background: Caspases belong to a class of cysteine proteases which function as critical effectors in apoptosis and inflammation by cleaving substrates immediately after unique sites. Prediction of such cleavage sites will complement structural and functional studies on substrate cleavage as well as discovery of new substrates. Recently, different computational methods have been developed to predict the cleavage sites of caspase substrates with varying degrees of success. As the support vector machine (SVM) algorithm has been shown to be useful in several biological classification problems, we have implemented an SVM-based method to investigate its applicability to this domain. Results: A set of unique caspase substrate cleavage sites was obtained from the literature and used for evaluating the SVM method. Datasets containing (i) the tetrapeptide cleavage sites, (ii) the tetrapeptide cleavage sites augmented by two adjacent residues, the P1' and P2' amino acids, and (iii) the tetrapeptide cleavage sites with ten additional upstream and downstream flanking sequences (where available) were tested. The SVM method achieved an accuracy ranging from 81.25% to 97.92% on independent test sets. The SVM method successfully predicted the cleavage of a novel caspase substrate and its mutants. Conclusion: This study presents an SVM approach for predicting caspase substrate cleavage sites based on the cleavage sites and the downstream and upstream flanking sequences. The method shows an improvement over existing methods and may be useful for predicting hitherto undiscovered cleavage sites.
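
    The datasets described above differ only in how much sequence around the cleavage site is passed to the classifier. The sketch below shows one way to cut such a window and turn it into a numeric vector. The one-hot encoding is an assumption (the abstract does not state the encoding used), the protein sequence is made up, the function names are hypothetical, and the SVM training step itself is omitted.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def cleavage_window(sequence, p1_index, upstream=4, downstream=2):
    """Return residues P4..P1 plus P1'..P2' around a cleavage after p1_index."""
    start = p1_index - upstream + 1
    end = p1_index + 1 + downstream
    if start < 0 or end > len(sequence):
        raise ValueError("window extends beyond the sequence")
    return sequence[start:end]

def one_hot(window):
    """Encode each residue as a 20-element indicator vector and concatenate."""
    vector = []
    for residue in window:
        vector.extend(1 if residue == aa else 0 for aa in AMINO_ACIDS)
    return vector

protein = "MKSDEVDGSA"                            # invented sequence with a DEVD motif
window = cleavage_window(protein, p1_index=6)     # tetrapeptide plus P1' and P2'
features = one_hot(window)
```

    Setting `downstream=0` reproduces the tetrapeptide-only dataset, and larger `upstream` and `downstream` values give the extended flanking windows.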

  10. Multiplicity of carbohydrate-binding sites in β-prism fold lectins: occurrence and possible evolutionary implications

    Indian Academy of Sciences (India)

    Alok Sharma; Divya Chandran; Desh D Singh; M Vijayan

    2007-09-01

    The β-prism II fold lectins of known structure, all from monocots, invariably have three carbohydrate-binding sites in each subunit/domain. Until recently, β-prism I fold lectins of known structure were all from dicots and they exhibited one carbohydrate-binding site per subunit/domain. However, the recently determined structure of the β-prism I fold lectin from banana, a monocot, has two very similar carbohydrate-binding sites. This prompted a detailed analysis of all the sequences appropriate for two-lectin folds and which carry one or more relevant carbohydrate-binding motifs. The very recent observation of a β-prism I fold lectin, griffithsin, with three binding sites in each domain further confirmed the need for such an analysis. The analysis demonstrates substantial diversity in the number of binding sites unrelated to the taxonomical position of the plant source. However, the number of binding sites and the symmetry within the sequence exhibit reasonable correlation. The distribution of the two families of β-prism fold lectins among plants and the number of binding sites in them appear to suggest that both of them arose through successive gene duplication, fusion and divergent evolution of the same primitive carbohydrate-binding motif involving a Greek key. Analysis with sequences in individual Greek keys as independent units lends further support to this conclusion. It would seem that the preponderance of three carbohydrate-binding sites per domain in monocot lectins, particularly those with the β-prism II fold, is related to the role of plant lectins in defence.

  11. Resolving the problem of trapped water in binding cavities: prediction of host-guest binding free energies in the SAMPL5 challenge by funnel metadynamics

    Science.gov (United States)

    Bhakat, Soumendranath; Söderhjelm, Pär

    2017-01-01

    The funnel metadynamics method enables rigorous calculation of the potential of mean force along an arbitrary binding path and thereby evaluation of the absolute binding free energy. A problem of such physical paths is that the mechanism characterizing the binding process is not always obvious. In particular, it might involve reorganization of the solvent in the binding site, which is not easily captured with a few geometrically defined collective variables that can be used for biasing. In this paper, we propose and test a simple method to resolve this trapped-water problem by dividing the process into an artificial host-desolvation step and an actual binding step. We show that, under certain circumstances, the contribution from the desolvation step can be calculated without introducing further statistical errors. We apply the method to the problem of predicting host-guest binding free energies in the SAMPL5 blind challenge, using two octa-acid hosts and six guest molecules. For one of the hosts, well-converged results are obtained and the prediction of relative binding free energies is the best among all the SAMPL5 submissions. For the other host, which has a narrower binding pocket, the statistical uncertainties are slightly higher; longer simulations would therefore be needed to obtain conclusive results.

  12. A computational model of the LGI1 protein suggests a common binding site for ADAM proteins.

    Directory of Open Access Journals (Sweden)

    Emanuela Leonardi

    Full Text Available Mutations of the human leucine-rich glioma inactivated (LGI1) gene encoding the epitempin protein cause autosomal dominant temporal lateral epilepsy (ADTLE), a rare familial partial epileptic syndrome. The LGI1 gene seems to have a role in the transmission of neuronal messages but the exact molecular mechanism remains unclear. In contrast to other genes involved in epileptic disorders, epitempin shows no homology with known ion channel genes but contains two domains, composed of repeated structural units, known to mediate protein-protein interactions. A three-dimensional in silico model of the two epitempin domains was built to predict the structure-function relationship and propose a functional model integrating previous experimental findings. Conserved and electrostatically charged regions of the model surface suggest a possible arrangement between the two domains and identify a possible ADAM protein binding site in the β-propeller domain and another protein binding site in the leucine-rich repeat domain. The functional model indicates that epitempin could mediate the interaction between proteins localized to different synaptic sides in a static way, by forming a dimer, or in a dynamic way, by binding proteins at different times. The model was also used to predict effects of known disease-causing missense mutations. Most of the variants are predicted to alter protein folding while several others map to functional surface regions. In agreement with experimental evidence, this suggests that non-secreted LGI1 mutants could be retained within the cell by quality control mechanisms or by altering interactions required for the secretion process.

  13. Structural Perspectives on the Evolutionary Expansion of Unique Protein-Protein Binding Sites.

    Science.gov (United States)

    Goncearenco, Alexander; Shaytan, Alexey K; Shoemaker, Benjamin A; Panchenko, Anna R

    2015-09-15

    Structures of protein complexes provide atomistic insights into protein interactions. Human proteins represent a quarter of all structures in the Protein Data Bank; however, available protein complexes cover less than 10% of the human proteome. Although it is theoretically possible to infer interactions in human proteins based on structures of homologous protein complexes, it is still unclear to what extent protein interactions and binding sites are conserved, and whether protein complexes from remotely related species can be used to infer interactions and binding sites. We considered biological units of protein complexes and clustered protein-protein binding sites into similarity groups based on their structure and sequence, which allowed us to identify unique binding sites. We showed that the growth rate of the number of unique binding sites in the Protein Data Bank was much slower than the growth rate of the number of structural complexes. Next, we investigated the evolutionary roots of unique binding sites and identified the major phyletic branches with the largest expansion in the number of novel binding sites. We found that many binding sites could be traced to the universal common ancestor of all cellular organisms, whereas relatively few binding sites emerged at the major evolutionary branching points. We analyzed the physicochemical properties of unique binding sites and found that the most ancient sites were the largest in size, involved many salt bridges, and were the most compact and least planar. In contrast, binding sites that appeared more recently in the evolution of eukaryotes were characterized by a larger fraction of polar and aromatic residues, and were less compact and more planar, possibly due to their more transient nature and roles in signaling processes.

  14. Oestradiol and testosterone binding sites in mice tibiae and their relationship with bone growth.

    Science.gov (United States)

    Lopez, A; Ventanas, J; Burgos, J

    1986-11-01

    High-affinity oestradiol and testosterone binding sites were found in tibiae cytosol from entire male and female mice of different ages. Scatchard analysis gave an estimated Kd of 2.7 × 10⁻⁹ M for oestradiol binding sites, indicating that the 3H-oestradiol binding was of high affinity. The abundance of oestradiol and testosterone binding sites in mice tibiae changes with age. It is not easy to establish a direct correlation between these changes and the values reported here on bone growth in weight and length; however, it seems possible to point to a negative relationship between bone lengthening and oestradiol binding site levels in females, as well as a positive relationship with testosterone in both sexes. The presence of oestradiol and testosterone binding sites in the epiphyses and not in the diaphyses reinforces the hypothesis that both play some role in bone growth.
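
For readers unfamiliar with the Scatchard analysis mentioned in this abstract, the following Python sketch shows the usual calculation for a single class of sites, where bound/free plotted against bound is a straight line with slope -1/Kd and x-intercept Bmax. The function name and the synthetic data are assumptions made for illustration; only the Kd of 2.7 × 10⁻⁹ M comes from the abstract, and the Bmax value is invented.

```python
def scatchard_fit(bound, free):
    """Least-squares Scatchard fit for one class of binding sites.

    Fits bound/free = (Bmax - bound) / Kd and returns (Kd, Bmax).
    `bound` and `free` are equal-length lists of ligand concentrations.
    """
    ratios = [b / f for b, f in zip(bound, free)]
    n = len(bound)
    mean_x = sum(bound) / n
    mean_y = sum(ratios) / n
    sxx = sum((x - mean_x) ** 2 for x in bound)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(bound, ratios))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope            # slope of the Scatchard line is -1/Kd
    bmax = -intercept / slope    # x-intercept of the line is Bmax
    return kd, bmax


# Synthetic, noise-free data generated from a one-site model.
KD_TRUE = 2.7e-9      # M, value quoted in the abstract
BMAX_TRUE = 1.0e-10   # M, hypothetical
free = [0.5e-9, 1e-9, 2e-9, 5e-9, 1e-8]
bound = [BMAX_TRUE * f / (KD_TRUE + f) for f in free]
kd, bmax = scatchard_fit(bound, free)
```

With real, noisy data a direct nonlinear fit of the binding isotherm is generally preferred, because the Scatchard transformation distorts the error structure.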

  15. Alignment-free ultra-high-throughput comparison of druggable protein-ligand binding sites.

    Science.gov (United States)

    Weill, Nathanaël; Rognan, Didier

    2010-01-01

    Inferring the biological function of a protein from its three-dimensional structure as well as explaining why a drug may bind to various targets is of crucial importance to modern drug discovery. Here we present a generic 4833-integer vector describing druggable protein-ligand binding sites that can be applied to any protein and any binding cavity. The fingerprint registers counts of pharmacophoric triplets from the Calpha atomic coordinates of binding-site-lining residues. Starting from a customized data set of diverse protein-ligand binding site pairs, the most appropriate metric and a similarity threshold could be defined for similar binding sites. The method (FuzCav) has been used in various scenarios: (i) screening a collection of 6000 binding sites for similarity to different queries; (ii) classifying protein families (serine endopeptidases, protein kinases) by binding site diversity; (iii) discriminating adenine-binding cavities from decoys. The fingerprint generation and comparison supports ultra-high throughput (ca. 1000 measures/s), does not require prior alignment of protein binding sites, and is able to detect local similarity among subpockets. It is thus particularly well suited to the functional annotation of novel genomic structures with low sequence identity to known X-ray templates.
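
Because the fingerprint described above is a fixed-length vector of integer counts, two binding sites can be compared position by position without any structural alignment. The Python sketch below shows one plausible similarity measure for such count vectors: the number of positions that are non-zero in both fingerprints, divided by the smaller number of non-zero positions. This particular metric and the function name are assumptions for illustration; the abstract only says that a suitable metric and threshold were derived from a reference data set.

```python
def shared_nonzero_similarity(fp_a, fp_b):
    """Similarity of two count fingerprints of equal length, in [0, 1].

    Counts the positions populated in both vectors and normalises by the
    smaller number of populated positions, so a small sub-pocket that is
    fully contained in a larger cavity still scores highly.
    """
    if len(fp_a) != len(fp_b):
        raise ValueError("fingerprints must have the same length")
    nonzero_a = sum(1 for a in fp_a if a > 0)
    nonzero_b = sum(1 for b in fp_b if b > 0)
    common = sum(1 for a, b in zip(fp_a, fp_b) if a > 0 and b > 0)
    smallest = min(nonzero_a, nonzero_b)
    return common / smallest if smallest else 0.0


# Toy five-position fingerprints; the real vectors have 4833 positions.
score = shared_nonzero_similarity([2, 0, 1, 0, 3], [1, 1, 0, 0, 4])
```

A single pass over two integer vectors is cheap, which is consistent with the high comparison throughput the abstract reports.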

  16. Identification of a second substrate-binding site in solute-sodium symporters.

    Science.gov (United States)

    Li, Zheng; Lee, Ashley S E; Bracher, Susanne; Jung, Heinrich; Paz, Aviv; Kumar, Jay P; Abramson, Jeff; Quick, Matthias; Shi, Lei

    2015-01-02

    The structure of the sodium/galactose transporter (vSGLT), a solute-sodium symporter (SSS) from Vibrio parahaemolyticus, shares a common structural fold with LeuT of the neurotransmitter-sodium symporter family. Structural alignments between LeuT and vSGLT reveal that the crystallographically identified galactose-binding site in vSGLT is located in a more extracellular location relative to the central substrate-binding site (S1) in LeuT. Our computational analyses suggest the existence of an additional galactose-binding site in vSGLT that aligns to the S1 site of LeuT. Radiolabeled galactose saturation binding experiments indicate that, like LeuT, vSGLT can simultaneously bind two substrate molecules under equilibrium conditions. Mutating key residues in the individual substrate-binding sites reduced the molar substrate-to-protein binding stoichiometry to ~1. In addition, the related and more experimentally tractable SSS member PutP (the Na⁺/proline transporter) also exhibits a binding stoichiometry of 2. Targeting residues in the proposed sites with mutations results in the reduction of the binding stoichiometry and is accompanied by severely impaired translocation of proline. Our data suggest that substrate transport by SSS members requires both substrate-binding sites, thereby implying that SSSs and neurotransmitter-sodium symporters share common mechanistic elements in substrate transport.

  17. Positive-Unlabeled Learning for Pupylation Sites Prediction

    Directory of Open Access Journals (Sweden)

    Ming Jiang

    2016-01-01

    Full Text Available Pupylation plays a key role in regulating various protein functions as a crucial posttranslational modification of prokaryotes. In order to understand the molecular mechanism of pupylation, it is important to identify pupylation substrates and sites accurately. Several computational methods have been developed to identify pupylation sites because the traditional experimental methods are time-consuming and labor-intensive. With the existing computational methods, the experimentally annotated pupylation sites are used as the positive training set and the remaining nonannotated lysine residues as the negative training set to build classifiers to predict new pupylation sites from the unknown proteins. However, the remaining nonannotated lysine residues may contain pupylation sites which have not been experimentally validated yet. Unlike previous methods, in this study, the experimentally annotated pupylation sites were used as the positive training set whereas the remaining nonannotated lysine residues were used as the unlabeled training set. A novel method named PUL-PUP was proposed to predict pupylation sites by using a positive-unlabeled learning technique. Our experimental results indicated that PUL-PUP outperforms the other methods significantly for the prediction of pupylation sites. As an application, PUL-PUP was also used to predict the most likely pupylation sites in nonannotated lysine sites.

  18. CD91 interacts with mannan-binding lectin (MBL) through the MBL-associated serine protease-binding site.

    Science.gov (United States)

    Duus, Karen; Thielens, Nicole M; Lacroix, Monique; Tacnet, Pascale; Frachet, Philippe; Holmskov, Uffe; Houen, Gunnar

    2010-12-01

    CD91 plays an important role in the scavenging of apoptotic material, possibly through binding to soluble pattern-recognition molecules. In this study, we investigated the interaction of CD91 with mannan-binding lectin (MBL), ficolins and lung surfactant proteins. Both MBL and L-ficolin were found to bind CD91. The MBL-CD91 interaction was time- and concentration-dependent and could be inhibited by known ligands of CD91. MBL-associated serine protease 3 (MASP-3) also inhibited binding between MBL and CD91, suggesting that the site of interaction is located at or near the MASP-MBL interaction site. This was confirmed by using MBL mutants deficient for MASP binding that were unable to interact with CD91. These findings demonstrate that MBL and L-ficolin interact with CD91, strongly suggesting that they have the potential to function as soluble recognition molecules for scavenging microbial and apoptotic material by CD91.

  19. CD91 interacts with mannan-binding lectin (MBL) through the MBL-associated serine protease-binding site

    DEFF Research Database (Denmark)

    Duus, Karen; Thielens, Nicole M; Lacroix, Monique;

    2010-01-01

    CD91 plays an important role in the scavenging of apoptotic material, possibly through binding to soluble pattern-recognition molecules. In this study, we investigated the interaction of CD91 with mannan-binding lectin (MBL), ficolins and lung surfactant proteins. Both MBL and L-ficolin were found...... to bind CD91. The MBL-CD91 interaction was time- and concentration-dependent and could be inhibited by known ligands of CD91. MBL-associated serine protease 3 (MASP-3) also inhibited binding between MBL and CD91, suggesting that the site of interaction is located at or near the MASP-MBL interaction site....... This was confirmed by using MBL mutants deficient for MASP binding that were unable to interact with CD91. These findings demonstrate that MBL and L-ficolin interact with CD91, strongly suggesting that they have the potential to function as soluble recognition molecules for scavenging microbial and apoptotic...

  20. Identification of clustered YY1 binding sites in Imprinting Control Regions

    Energy Technology Data Exchange (ETDEWEB)

    Kim, J D; Hinz, A; Bergmann, A; Huang, J; Ovcharenko, I; Stubbs, L; Kim, J

    2006-04-19

    Mammalian genomic imprinting is regulated by Imprinting Control Regions (ICRs) that are usually associated with tandem arrays of transcription factor binding sites. In the current study, the sequence features derived from a tandem array of YY1 binding sites of Peg3-DMR (differentially methylated region) led us to identify three additional clustered YY1 binding sites, which are also localized within the DMRs of Xist, Tsix, and Nespas. These regions have been shown to play a critical role as ICRs for the regulation of surrounding genes. These ICRs have maintained a tandem array of YY1 binding sites during mammalian evolution. The in vivo binding of YY1 to these regions is allele-specific and only to the unmethylated active alleles. Promoter/enhancer assays suggest that a tandem array of YY1 binding sites function as a potential orientation-dependent enhancer. Insulator assays revealed that the enhancer-blocking activity is detected only in the YY1 binding sites of Peg3-DMR but not in the YY1 binding sites of other DMRs. Overall, our identification of three additional clustered YY1 binding sites in imprinted domains suggests a significant role for YY1 in mammalian genomic imprinting.

  1. A deeper look into transcription regulatory code by preferred pair distance templates for transcription factor binding sites

    KAUST Repository

    Kulakovskiy, Ivan V.

    2011-08-18

    Motivation: Modern experimental methods provide substantial information on protein-DNA recognition. Studying arrangements of transcription factor binding sites (TFBSs) of interacting transcription factors (TFs) advances understanding of the transcription regulatory code. Results: We constructed binding motifs for TFs forming a complex with HIF-1α at the erythropoietin 3'-enhancer. Corresponding TFBSs were predicted in the segments around transcription start sites (TSSs) of all human genes. Using the genome-wide set of regulatory regions, we observed several strongly preferred distances between the hypoxia-responsive element (HRE) and binding sites of a particular cofactor protein. The set of preferred distances was called a preferred pair distance template (PPDT). The PPDT depended dramatically on the TF and on the orientation of its binding sites relative to the HRE. The PPDT evaluated from the genome-wide set of regulatory sequences was used to detect significant PPDT-consistent binding site pairs in regulatory regions of hypoxia-responsive genes. We believe PPDT can help to reveal the layout of eukaryotic regulatory segments. © The Author 2011. Published by Oxford University Press. All rights reserved.
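
The idea of a set of preferred distances between HRE sites and cofactor sites can be illustrated with a small Python sketch that tallies signed distances across many regulatory regions and keeps those seen at least a given number of times. This is a simplification under stated assumptions: the function names, the distance window and the fixed count threshold are all hypothetical, and the abstract does not describe the statistical criterion the authors used to call a distance "strongly preferred".

```python
from collections import Counter


def pair_distance_counts(hre_positions, cofactor_positions, max_distance=100):
    """Count signed distances (cofactor - HRE) within one regulatory region."""
    counts = Counter()
    for hre in hre_positions:
        for site in cofactor_positions:
            distance = site - hre
            if distance != 0 and abs(distance) <= max_distance:
                counts[distance] += 1
    return counts


def preferred_distances(regions, min_count=2, max_distance=100):
    """Return the sorted distances observed at least `min_count` times.

    `regions` is an iterable of (hre_positions, cofactor_positions) pairs,
    one pair per regulatory region.
    """
    total = Counter()
    for hre_positions, cofactor_positions in regions:
        total.update(
            pair_distance_counts(hre_positions, cofactor_positions, max_distance)
        )
    return sorted(d for d, count in total.items() if count >= min_count)


# Toy coordinates; three regions with predicted site positions.
regions = [([100], [112, 150]), ([40], [52]), ([10], [33])]
template = preferred_distances(regions)
```

In this toy input a spacing of 12 occurs in two regions, so it is the only distance retained. A realistic analysis would also track site orientation, which the abstract reports as strongly affecting the template.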

  2. Characterization of 6-mercaptopurine binding to bovine serum albumin and its displacement from the binding sites by quercetin and rutin

    Energy Technology Data Exchange (ETDEWEB)

    Ehteshami, Mehdi [Nutrition Research Center, School of Health and Nutrition, Tabriz University of Medical Sciences, Tabriz 51644-14766 (Iran, Islamic Republic of); Rasoulzadeh, Farzaneh [Drug Applied Research Center, Tabriz University of Medical Sciences, Tabriz 51644-14766 (Iran, Islamic Republic of); Mahboob, Soltanali [Nutrition Research Center, School of Health and Nutrition, Tabriz University of Medical Sciences, Tabriz 51644-14766 (Iran, Islamic Republic of); Rashidi, Mohammad-Reza, E-mail: rashidi@tbzmed.ac.ir [Research Center for Pharmaceutical Nanotechnology, Tabriz University of Medical Sciences, Tabriz 51644-14766 (Iran, Islamic Republic of)

    2013-03-15

    Binding of a drug to the serum albumins as major serum transport proteins can be influenced by other ligands, leading to alteration of its pharmacological properties. In the present study, binding characteristics of 6-mercaptopurine (6-MP) with bovine serum albumin (BSA) together with its displacement from its binding site by quercetin and rutin have been investigated by the spectroscopic method. According to the binding parameters, a static quenching component in an overall dynamic quenching process is operative in the interaction between 6-MP and BSA. The binding of 6-MP to BSA occurred spontaneously due to entropy-driven hydrophobic interactions. The synchronous fluorescence spectroscopy study revealed that the secondary structure of BSA is changed in the presence of 6-MP and both Tyr and Trp residues participate in the interaction between 6-MP and BSA, with the latter being more dominant. The binding constant value of 6-MP-BSA in the presence of quercetin and rutin increased. 6-MP was displaced by ibuprofen, indicating that the binding site of 6-MP on albumin is site II. Therefore, the change of the pharmacokinetic and pharmacodynamic properties of 6-MP by quercetin and rutin through alteration of binding capacity of 6-MP to the serum albumin cannot be ruled out. In addition, the displacement study showed that 6-MP is located in site II of BSA. - Highlights: ► Participation of both Tyr and particularly Trp residues in the interaction between 6-MP and BSA. ► Involvement of a static quenching component in an overall dynamic quenching process. ► Ability of quercetin and rutin to change the binding constants of 6-MP-BSA complex. ► Binding of 6-MP to BSA through entropy-driven hydrophobic interactions.

  3. Mesoscopic model and free energy landscape for protein-DNA binding sites: analysis of cyanobacterial promoters.

    Directory of Open Access Journals (Sweden)

    Rafael Tapia-Rojo

    2014-10-01

    Full Text Available The identification of protein binding sites in promoter sequences is a key problem to understand and control regulation in biochemistry and biotechnological processes. We use a computational method to analyze promoters from a given genome. Our approach is based on a physical model at the mesoscopic level of protein-DNA interaction based on the influence of DNA local conformation on the dynamics of a general particle along the chain. Following the proposed model, the joint dynamics of the protein particle and the DNA portion of interest, only characterized by its base pair sequence, is simulated. The simulation output is analyzed by generating and analyzing the Free Energy Landscape of the system. In order to prove the predictive capacity of our computational method we have analyzed nine promoters of Anabaena PCC 7120. We are able to identify the transcription starting site of each of the promoters as the most populated macrostate in the dynamics. The developed procedure also allows promoter macrostates to be characterized in terms of thermo-statistical magnitudes (free energy and entropy), with valuable biological implications. Our results agree with independent previous experimental results. Thus, our methods appear as a powerful complementary tool for identifying protein binding sites in promoter sequences.
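
The link between "most populated macrostate" and free energy follows from the Boltzmann relation F_i = -kT ln(p_i), where p_i is the fraction of simulation time spent in state i. The Python sketch below applies that relation to a list of visited positions. It is a minimal illustration, assuming each visited position is treated as its own state and energies are reported in units of kT; the function name is hypothetical and the paper's actual macrostate construction is not described in the abstract.

```python
import math
from collections import Counter


def free_energy_profile(visited_states, kT=1.0):
    """Estimate relative free energies from how often each state is visited.

    Uses F_i = -kT * ln(p_i) and shifts the profile so that the most
    populated state has F = 0. Unvisited states are simply absent.
    """
    counts = Counter(visited_states)
    total = sum(counts.values())
    energies = {
        state: -kT * math.log(count / total) for state, count in counts.items()
    }
    lowest = min(energies.values())
    return {state: energy - lowest for state, energy in energies.items()}


# Toy trajectory: positions along a promoter visited by the particle.
trajectory = [3, 3, 3, 3, 7, 7, 1, 1]
profile = free_energy_profile(trajectory)
most_populated = min(profile, key=profile.get)
```

In this toy trajectory position 3 is visited twice as often as the others, so it sits at the bottom of the profile and the other two positions lie kT·ln 2 above it.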

  4. CD4 binding site broadly neutralizing antibody selection of HIV-1 escape mutants.

    Science.gov (United States)

    Dreja, Hanna; Pade, Corinna; Chen, Lei; McKnight, Áine

    2015-07-01

    All human immunodeficiency virus type-1 (HIV-1) viruses use CD4 to enter cells. Consequently, the viral envelope CD4-binding site (CD4bs) is relatively conserved, making it a logical neutralizing antibody target. It is important to understand how CD4-binding site variation allows for escape from neutralizing antibodies. Alanine scanning mutagenesis identifies residues in antigenic sites, whereas escape mutant selection identifies viable mutants. We selected HIV-1 to escape CD4bs neutralizing mAbs b12, A12 and HJ16. Viruses that escape from A12 and b12 remained susceptible to HJ16, VRC01 and J3, whilst six different viruses that escape HJ16 remained sensitive to A12, b12 and J3. In contrast, their sensitivity to VRC01 was variable. Triple HJ16/A12/b12-resistant virus proved that HIV-1 could escape multiple broadly neutralizing monoclonal antibodies, but still retain sensitivity to VRC01 and the llama-derived J3 nanobody. This antigenic variability may reflect that occurring in circulating viruses, so studies like this can predict immunologically relevant antigenic forms of the CD4bs for inclusion in HIV-1 vaccines.

  5. A Random Forest Model for Predicting Allosteric and Functional Sites on Proteins.

    Science.gov (United States)

    Chen, Ava S-Y; Westwood, Nicholas J; Brear, Paul; Rogers, Graeme W; Mavridis, Lazaros; Mitchell, John B O

    2016-04-01

    We created a computational method to identify allosteric sites using a machine learning method trained and tested on protein structures containing bound ligand molecules. The Random Forest machine learning approach was adopted to build our three-way predictive model. Based on descriptors collated for each ligand and binding site, the classification model allows us to assign protein cavities as allosteric, regular or orthosteric, and hence to identify allosteric sites. 43 structural descriptors per complex were derived and were used to characterize individual protein-ligand binding sites belonging to the three classes, allosteric, regular and orthosteric. We carried out a separate validation on a further unseen set of protein structures containing the ligand 2-(N-cyclohexylamino) ethane sulfonic acid (CHES).

  6. Identification of Host Insulin Binding Sites on Schistosoma japonicum Insulin Receptors.

    Directory of Open Access Journals (Sweden)

    Rachel J Stephenson

    Full Text Available Schistosoma japonicum insulin receptors (SjIRs) have been identified as encouraging vaccine candidates. Interrupting or blocking the binding between host insulin and the schistosome insulin receptors (IRs) may result in reduced glucose uptake leading to starvation and stunting of worms with a reduction in egg output. To further understand how schistosomes are able to exploit host insulin for development and growth, and whether these parasites and their mammalian hosts compete for the same insulin source, we identified insulin binding sites on the SjIRs. Based on sequence analysis and the predicted antigenic structure of the primary sequences of the SjIRs, we designed nine and eleven peptide analogues from SjIR-1 and SjIR-2, respectively. Using the Octet RED system, we identified analogues derived from SjIR-1 (10) and SjIR-2 (20, 21 and 22) with insulin-binding sequences specific for S. japonicum. Nevertheless, the human insulin receptor (HIR) may compete with the SjIRs in binding human insulin in other positions which are important for HIR binding to insulin. However, no binding occurred between insulin and parasite analogues derived from SjIR-1 (2, 7 and 8) and SjIR-2 (14, 16 and 18) at the same locations as HIR sequences which have been shown to have strong insulin binding affinities. Importantly, we found two analogues (1 and 3), derived from SjIR-1, and two analogues (13 and 15) derived from SjIR-2, were responsible for the major insulin binding affinity in S. japonicum. These peptide analogues were shown to have more than 10 times (in KD value) stronger binding capacity for human insulin compared with peptides derived from the HIR in the same sequence positions. Paradoxically, analogues 1, 3, 13 and 15 do not appear to contain major antigenic determinants which resulted in poor antibody responses to native S. japonicum protein. This argues against their future development as peptide-vaccine candidates.

  7. Accurate prediction of DnaK-peptide binding via homology modelling and experimental data.

    Directory of Open Access Journals (Sweden)

    Joost Van Durme

    2009-08-01

    Full Text Available Molecular chaperones are essential elements of the protein quality control machinery that governs translocation and folding of nascent polypeptides, refolding and degradation of misfolded proteins, and activation of a wide range of client proteins. The prokaryotic heat-shock protein DnaK is the E. coli representative of the ubiquitous Hsp70 family, which specializes in the binding of exposed hydrophobic regions in unfolded polypeptides. Accurate prediction of DnaK binding sites in E. coli proteins is an essential prerequisite to understand the precise function of this chaperone and the properties of its substrate proteins. In order to map DnaK binding sites in protein sequences, we have developed an algorithm that combines sequence information from peptide binding experiments and structural parameters from homology modelling. We show that this combination significantly outperforms either single approach. The final predictor had a Matthews correlation coefficient (MCC) of 0.819 when assessed over the 144 tested peptide sequences to detect true positives and true negatives. To test the robustness of the learning set, we have conducted a simulated cross-validation, where we omit sequences from the learning sets and calculate the rate of repredicting them. This resulted in a surprisingly good MCC of 0.703. The algorithm was also able to perform equally well on a blind test set of binders and non-binders, of which there was no prior knowledge in the learning sets. The algorithm is freely available at http://limbo.vib.be.
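
The Matthews correlation coefficient quoted in this abstract is computed from the four cells of a binary confusion matrix as (TP·TN − FP·FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN)). A minimal Python version is sketched below. The example counts are hypothetical and are not the confusion matrix behind the reported values of 0.819 or 0.703, which the abstract does not give.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from confusion-matrix counts.

    Returns a value between -1 and 1. By convention 0.0 is returned when
    any row or column of the confusion matrix is empty and the denominator
    would be zero.
    """
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denominator == 0:
        return 0.0
    return (tp * tn - fp * fn) / denominator


# Hypothetical counts for a binder / non-binder classifier.
mcc = matthews_corrcoef(tp=50, tn=40, fp=5, fn=5)
```

Unlike plain accuracy, the MCC stays informative when binders and non-binders are present in very different numbers, which is one reason it is commonly reported for binding-site predictors.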

  8. Characterization of melatonin binding sites in the Harderian gland and median eminence of the rat

    Energy Technology Data Exchange (ETDEWEB)

    Lopez-Gonzalez, M.A.; Calvo, J.R.; Rubio, A.; Goberna, R.; Guerrero, J.M. (Univ. of Seville School of Medicine, Sevilla (Spain))

    1991-01-01

    The characterization of specific melatonin binding sites in the Harderian gland (HG) and median eminence (ME) of the rat was studied using [¹²⁵I]melatonin. Binding of melatonin to crude membrane preparations of both tissues was dependent on time and temperature. Thus, maximal binding was obtained at 37 °C after 30-60 min incubation. Binding was also dependent on protein concentration. The specific binding of [¹²⁵I]melatonin was saturable, exhibiting a single class of binding sites in both tissues. The dissociation constants (Kd) were 170 and 190 pM for ME and HG, respectively. The concentration of the binding sites in ME was 8 fmol/mg protein, and in the HG 4 fmol/mg protein. In competition studies, binding of [¹²⁵I]melatonin to ME or HG was inhibited by increasing concentrations of native melatonin; 50% inhibition was observed at about 702 and 422 nM for ME and HG, respectively. Additionally, the [¹²⁵I]melatonin binding to the crude membranes was not affected by the addition of different drugs such as norepinephrine, isoproterenol, phenylephrine, propranolol, or prazosin. The results confirm the presence of melatonin binding sites in median eminence and show, for the first time, the existence of melatonin binding sites in the Harderian gland.

  9. Gephyrin-binding peptides visualize postsynaptic sites and modulate neurotransmission

    DEFF Research Database (Denmark)

    Maric, Hans Michael; Hausrat, Torben Johann; Neubert, Franziska;

    2016-01-01

    γ-Aminobutyric acid type A and glycine receptors are the major mediators of fast synaptic inhibition in the human central nervous system and are established drug targets. However, all drugs targeting these receptors bind to the extracellular ligand-binding domain of the receptors, which inherently...

  10. Automatic generation of 3D motifs for classification of protein binding sites

    Directory of Open Access Journals (Sweden)

    Herzyk Pawel

    2007-08-01

    Full Text Available Abstract Background Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands. Results Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly. Conclusion A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. 
    A study of proteins binding adenine based ligands showed that

  11. A grammar inference approach for predicting kinase specific phosphorylation sites.

    Science.gov (United States)

    Datta, Sutapa; Mukhopadhyay, Subhasis

    2015-01-01

    Kinase-mediated phosphorylation is a key post-translational modification mechanism that plays an important role in regulating various cellular processes and phenotypes. Many diseases, such as cancer, are related to signaling defects associated with protein phosphorylation. Characterizing the protein kinases and their substrates enhances our ability to understand the mechanism of protein phosphorylation and extends our knowledge of signaling networks, thereby helping us to treat such diseases. Experimental methods for identifying phosphorylation sites are labour intensive and expensive. Also, the manifold increase of protein sequences in the databanks over the years necessitates the development of fast and accurate computational methods for predicting phosphorylation sites in protein sequences. To date, a number of computational methods have been proposed by various researchers for predicting phosphorylation sites, but there remains much scope for improvement. In this communication, we present a simple and novel method based on a Grammatical Inference (GI) approach to automate the prediction of kinase specific phosphorylation sites. In this regard, we have used a popular GI algorithm, Alergia, to infer Deterministic Stochastic Finite State Automata (DSFA) which equally represent the regular grammar corresponding to the phosphorylation sites. Extensive experiments on several datasets generated by us reveal that our inferred grammar successfully predicts phosphorylation sites in a kinase specific manner. It performs significantly better when compared with the other existing phosphorylation site prediction methods. We have also compared our inferred DSFA with two other GI inference algorithms. The DSFA generated by our method performs better, which indicates that our method is robust and has potential for predicting phosphorylation sites in a kinase specific manner.

  12. Shared RNA-binding sites for interacting members of the Drosophila ELAV family of neuronal proteins

    OpenAIRE

    Borgeson, Claudia D.; Samson, Marie-Laure

    2005-01-01

    The product of the Drosophila embryonic lethal abnormal visual system is a conserved protein (ELAV) necessary for normal neuronal differentiation and maintenance. It possesses three RNA-binding domains and is involved in the regulation of RNA metabolism. The long elav 3′-untranslated region (3′-UTR) is necessary for autoregulation. We used RNA-binding assays and in vitro selection to identify the ELAV best binding site in the elav 3′-UTR. This site resembles ELAV-binding sites identified prev...

  13. Evidence for a non-opioid sigma binding site in the guinea-pig myenteric plexus

    Energy Technology Data Exchange (ETDEWEB)

    Roman, F.; Pascaud, X.; Vauche, D.; Junien, J.

    1988-01-01

    The presence of a binding site for (+)-[³H]SKF 10,047 was demonstrated in a guinea-pig myenteric plexus (MYP) membrane preparation. Specific binding to this receptor was saturable, reversible, linear with protein concentration and consisted of two components, a high affinity site and a low affinity site. Morphine and naloxone at 10⁻⁴ M were unable to displace (+)-[³H]SKF 10,047 binding. Haloperidol, imipramine, ethylketocyclazocine and propranolol were among the most potent compounds to inhibit this specific binding. These results suggest the presence of a non-opioid haloperidol sensitive sigma receptor in the MYP of the guinea-pig.

  14. Mechanisms of Intentional Binding and Sensory Attenuation: The Role of Temporal Prediction, Temporal Control, Identity Prediction, and Motor Prediction

    Science.gov (United States)

    Hughes, Gethin; Desantis, Andrea; Waszak, Florian

    2013-01-01

    Sensory processing of action effects has been shown to differ from that of externally triggered stimuli, with respect both to the perceived timing of their occurrence (intentional binding) and to their intensity (sensory attenuation). These phenomena are normally attributed to forward action models, such that when action prediction is consistent…

  15. Inhibition of RNA polymerase by captan at both DNA and substrate binding sites.

    Science.gov (United States)

    Luo, G; Lewis, R A

    1992-12-01

    RNA synthesis carried out in vitro by Escherichia coli RNA polymerase was inhibited irreversibly by captan when T7 DNA was used as template. An earlier report and this one show that captan blocks the DNA binding site on the enzyme. Herein, it is also revealed that captan acts at the nucleoside triphosphate (NTP) binding site, and kinetic relationships of the action of captan at the two sites are detailed. The inhibition by captan via the DNA binding site of the enzyme was confirmed by kinetic studies and it was further shown that [14C]captan bound to the beta' subunit of RNA polymerase. This subunit contains the DNA binding site. Competitive-like inhibition by captan versus UTP led to the conclusion that captan also blocked the NTP binding site. In support of this conclusion, [14C]captan was observed to bind to the beta subunit which contains the NTP binding site. Whereas, preincubation of RNA polymerase with both DNA and NTPs prevented captan inhibition, preincubation with either DNA or NTPs alone was insufficient to protect the enzyme from the action of captan. Furthermore, the interaction of [14C]captan with the beta and beta' subunits was not prevented by a similar preincubation. Captan also bound, to a lesser extent, to the alpha and sigma subunits. Therefore, captan binding appears to involve interaction with RNA polymerase at sites in addition to those for DNA and NTP; however, this action does not inhibit the polymerase activity.

  16. Does transcription play a role in creating a condensin binding site?

    Science.gov (United States)

    Bernard, Pascal; Vanoosthuyse, Vincent

    2015-01-01

    The highly conserved condensin complex is essential for the condensation and integrity of chromosomes through cell division. Published data argue that high levels of transcription contribute to specifying some condensin-binding sites on chromosomes, but the exact role of transcription in this process remains elusive. Here we discuss our recent data addressing the role of transcription in establishing a condensin-binding site.

  17. Convolutional neural network architectures for predicting DNA–protein binding

    Science.gov (United States)

    Zeng, Haoyang; Edwards, Matthew D.; Liu, Ge; Gifford, David K.

    2016-01-01

    Motivation: Convolutional neural networks (CNN) have outperformed conventional methods in modeling the sequence specificity of DNA–protein binding. Yet inappropriate CNN architectures can yield poorer performance than simpler models. Thus an in-depth understanding of how to match CNN architecture to a given task is needed to fully harness the power of CNNs for computational biology applications. Results: We present a systematic exploration of CNN architectures for predicting DNA sequence binding using a large compendium of transcription factor datasets. We identify the best-performing architectures by varying CNN width, depth and pooling designs. We find that adding convolutional kernels to a network is important for motif-based tasks. We show the benefits of CNNs in learning rich higher-order sequence features, such as secondary motifs and local sequence context, by comparing network performance on multiple modeling tasks ranging in difficulty. We also demonstrate how careful construction of sequence benchmark datasets, using approaches that control potentially confounding effects like positional or motif strength bias, is critical in making fair comparisons between competing methods. We explore how to establish the sufficiency of training data for these learning tasks, and we have created a flexible cloud-based framework that permits the rapid exploration of alternative neural network architectures for problems in computational biology. Availability and Implementation: All the models analyzed are available at http://cnn.csail.mit.edu. Contact: gifford@mit.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27307608
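    As a rough illustration of what a single convolutional kernel does in a motif-based task, the sketch below one-hot encodes a DNA sequence, slides one kernel along it, applies a ReLU, and takes a global max pool. It is not the authors' architecture: the function names and the hand-set kernel (which rewards the made-up motif "TGA") are assumptions for illustration, and a real network would learn many such kernels from data.

```python
# Illustrative sketch only: one hand-set convolutional kernel, not a trained network.
BASES = "ACGT"

def one_hot(seq):
    """Encode a DNA string as a list of 4-element rows in A, C, G, T order."""
    return [[1.0 if base == b else 0.0 for b in BASES] for base in seq.upper()]

def conv_scan(encoded, kernel):
    """Slide a (width x 4) kernel along the sequence; return one ReLU activation per window."""
    width = len(kernel)
    scores = []
    for start in range(len(encoded) - width + 1):
        total = 0.0
        for offset in range(width):
            for channel in range(4):
                total += encoded[start + offset][channel] * kernel[offset][channel]
        scores.append(max(total, 0.0))  # ReLU: negative activations are clipped to zero
    return scores

def global_max_pool(scores):
    """Keep only the strongest activation, i.e. the best motif match anywhere in the sequence."""
    return max(scores) if scores else 0.0

# Hypothetical kernel that rewards the motif "TGA" (rows are positions, columns are A, C, G, T).
kernel = [
    [0.0, 0.0, 0.0, 1.0],  # T
    [0.0, 0.0, 1.0, 0.0],  # G
    [1.0, 0.0, 0.0, 0.0],  # A
]

if __name__ == "__main__":
    activations = conv_scan(one_hot("ACTGAC"), kernel)
    print(activations, global_max_pool(activations))
```

    Adding more kernels corresponds to widening the network in the sense discussed in the abstract; each kernel can then specialise on a different motif or on local sequence context.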

  18. CORE_TF: a user-friendly interface to identify evolutionary conserved transcription factor binding sites in sets of co-regulated genes

    Directory of Open Access Journals (Sweden)

    den Dunnen Johan T

    2008-11-01

    Background: The identification of transcription factor binding sites is difficult since they are only a small number of nucleotides in size, resulting in large numbers of false positives and false negatives in current approaches. Computational methods to reduce false positives are to look for over-representation of transcription factor binding sites in a set of similarly regulated promoters or to look for conservation in orthologous promoter alignments. Results: We have developed a novel tool, "CORE_TF" (Conserved and Over-REpresented Transcription Factor binding sites), that identifies common transcription factor binding sites in promoters of co-regulated genes. To improve upon existing binding site predictions, the tool searches for position weight matrices from the TRANSFAC® database that are over-represented in an experimental set compared to a random set of promoters and identifies cross-species conservation of the predicted transcription factor binding sites. The algorithm has been evaluated with expression and chromatin-immunoprecipitation on microarray data. We also implement and demonstrate the importance of matching the random set of promoters to the experimental promoters by GC content, which is a unique feature of our tool. Conclusion: The program CORE_TF is accessible in a user-friendly web interface at http://www.LGTC.nl/CORE_TF. It provides a table of over-represented transcription factor binding sites in the promoters of the user's input genes and a graphical view of evolutionarily conserved transcription factor binding sites. In our test data sets it successfully predicts target transcription factors and their binding sites.
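    The two ideas in this abstract, matching the background promoters to the experimental set by GC content and then testing for over-representation, can be sketched as follows. This is not the CORE_TF implementation: the greedy nearest-GC matching, the one-sided hypergeometric test, and all function names are assumptions chosen for illustration, since the abstract does not state which matching rule or statistic the tool uses.

```python
from math import comb

def gc_content(seq):
    """Fraction of G and C bases in a sequence (0.0 for an empty string)."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0

def gc_matched_background(experimental, pool):
    """Greedy matching (an assumption): for each experimental promoter,
    take the not-yet-used pool sequence whose GC content is closest."""
    remaining = list(pool)
    chosen = []
    for promoter in experimental:
        if not remaining:
            break
        target = gc_content(promoter)
        best = min(remaining, key=lambda s: abs(gc_content(s) - target))
        remaining.remove(best)
        chosen.append(best)
    return chosen

def overrepresentation_p(hits_exp, n_exp, hits_bg, n_bg):
    """One-sided hypergeometric tail P(X >= hits_exp), where X is the number of
    motif-containing promoters among n_exp drawn from the combined set."""
    total = n_exp + n_bg
    total_hits = hits_exp + hits_bg
    upper = min(n_exp, total_hits)
    tail = sum(
        comb(total_hits, k) * comb(total - total_hits, n_exp - k)
        for k in range(hits_exp, upper + 1)
    )
    return tail / comb(total, n_exp)

if __name__ == "__main__":
    background = gc_matched_background(["GGCC"], ["ATAT", "GCGC", "ATGC"])
    print(background)
    print(overrepresentation_p(hits_exp=2, n_exp=2, hits_bg=0, n_bg=2))
```

    Without GC matching, a GC-rich motif can appear over-represented simply because the experimental promoters are GC-rich, which is the confounder the matching step is meant to remove.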

  19. Label-free microscale thermophoresis discriminates sites and affinity of protein-ligand binding.

    Science.gov (United States)

    Seidel, Susanne A I; Wienken, Christoph J; Geissler, Sandra; Jerabek-Willemsen, Moran; Duhr, Stefan; Reiter, Alwin; Trauner, Dirk; Braun, Dieter; Baaske, Philipp

    2012-10-15

    Look, no label! Microscale thermophoresis makes use of the intrinsic fluorescence of proteins to quantify the binding affinities of ligands and discriminate between binding sites. This method is suitable for studying binding interactions of very small amounts of protein in solution. The binding of ligands to iGluR membrane receptors, small-molecule inhibitors to kinase p38, aptamers to thrombin, and Ca(2+) ions to synaptotagmin was quantified.

  20. BindUP: a web server for non-homology-based prediction of DNA and RNA binding proteins.

    Science.gov (United States)

    Paz, Inbal; Kligun, Efrat; Bengad, Barak; Mandel-Gutfreund, Yael

    2016-07-08

    Gene expression is a multi-step process involving many layers of regulation. The main regulators of the pathway are DNA and RNA binding proteins. While over the years a large number of DNA and RNA binding proteins have been identified and extensively studied, it is still expected that many other proteins, some with yet another known function, await discovery. Here we present a new web server, BindUP, freely accessible through the website http://bindup.technion.ac.il/, for predicting DNA and RNA binding proteins using a non-homology-based approach. Our method is based on the electrostatic features of the protein surface and other general properties of the protein. BindUP predicts nucleic acid binding function given the protein's three-dimensional structure or a structural model. Additionally, BindUP provides information on the largest electrostatic surface patches, visualized on the server. The server was tested on several datasets of DNA and RNA binding proteins, including proteins which do not possess DNA or RNA binding domains and have no similarity to known nucleic acid binding proteins, achieving very high accuracy. BindUP is applicable in either single or batch modes and can be applied for testing hundreds of proteins simultaneously in a highly efficient manner.

  1. A general pairwise interaction model provides an accurate description of in vivo transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Marc Santolini

    The identification of transcription factor binding sites (TFBSs) on genomic DNA is of crucial importance for understanding and predicting regulatory elements in gene networks. TFBS motifs are commonly described by Position Weight Matrices (PWMs), in which each DNA base pair contributes independently to the transcription factor (TF) binding. However, this description ignores correlations between nucleotides at different positions, and is generally inaccurate: analysing fly and mouse in vivo ChIPseq data, we show that in most cases the PWM model fails to reproduce the observed statistics of TFBSs. To overcome this issue, we introduce the pairwise interaction model (PIM), a generalization of the PWM model. The model is based on the principle of maximum entropy and explicitly describes pairwise correlations between nucleotides at different positions, while being otherwise as unconstrained as possible. It is mathematically equivalent to considering a TF-DNA binding energy that depends additively on each nucleotide identity at all positions in the TFBS, like the PWM model, but also additively on pairs of nucleotides. We find that the PIM significantly improves over the PWM model, and even provides an optimal description of TFBS statistics within statistical noise. The PIM generalizes previous approaches to interdependent positions: it accounts for co-variation of two or more base pairs, and predicts secondary motifs, while outperforming multiple-motif models consisting of mixtures of PWMs. We analyse the structure of pairwise interactions between nucleotides, and find that they are sparse and dominantly located between consecutive base pairs in the flanking region of TFBS. Nonetheless, interactions between pairs of non-consecutive nucleotides are found to play a significant role in the obtained accurate description of TFBS statistics.
The PIM is computationally tractable, and provides a general framework that should be useful for describing and predicting
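    The energy form described in this abstract (single-nucleotide terms as in a PWM, plus additive terms for pairs of positions) can be written as a short scoring function. The sketch below is only an illustration under stated assumptions: the parameter layout (`h` as per-position tables, `J` as a sparse dictionary keyed by position pairs), the sign convention, and the numeric values are all hypothetical, and fitting the parameters by maximum entropy is not shown.

```python
def pim_energy(seq, h, J):
    """Binding energy with single-site and pairwise terms (lower = better site).

    h: list of dicts, h[i][base] is the single-site contribution at position i.
    J: dict mapping (i, j) position pairs to dicts of {(base_i, base_j): coupling}.
    Missing pair entries count as zero, so an empty J reduces to a PWM-like score.
    """
    single = sum(h[i][base] for i, base in enumerate(seq))
    pair = sum(
        table.get((seq[i], seq[j]), 0.0)
        for (i, j), table in J.items()
    )
    return -(single + pair)

# Hypothetical parameters for a 3-bp site, chosen only to make the arithmetic easy to follow.
h = [
    {"A": 1.0, "C": 0.0, "G": 0.0, "T": 0.0},
    {"A": 0.0, "C": 0.5, "G": 0.0, "T": 0.0},
    {"A": 0.0, "C": 0.0, "G": 2.0, "T": 0.0},
]
J = {(0, 2): {("A", "G"): 0.25}}  # one coupling between non-consecutive positions

if __name__ == "__main__":
    print(pim_energy("ACG", h, {}))  # independent-site (PWM-like) score
    print(pim_energy("ACG", h, J))   # same site with the pairwise term included
```

    Storing `J` sparsely reflects the abstract's observation that the inferred pairwise interactions are sparse.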

  2. Flood Predictions Combining Regional and Single Site Hydrometric Information

    Directory of Open Access Journals (Sweden)

    Campos–Aranda

    2010-07-01

    Initially, the statistical benefit of flood predictions obtained by combining reliable regional data and scarce site hydrometric data is pointed out. Then the mathematical equations for combining the means and standard deviations of the logarithms of regional and site data are presented, as well as the expressions needed for the desired predictions, based on Student's t distribution. Next, two numerical applications are described: the first is based on the Carrizal hydrometric station on the Santiago River in Nayarit, and the second makes use of five water gauging stations on the Tempoal River in Veracruz. Finally, a conclusion is formulated pointing out the simplicity of the method and the accuracy of its predictions.

  3. Evaluation of the Significance of Starch Surface Binding Sites on Human Pancreatic α-Amylase.

    Science.gov (United States)

    Zhang, Xiaohua; Caner, Sami; Kwan, Emily; Li, Chunmin; Brayer, Gary D; Withers, Stephen G

    2016-11-01

    Starch provides the major source of caloric intake in many diets. Cleavage of starch into malto-oligosaccharides in the gut is catalyzed by pancreatic α-amylase. These oligosaccharides are then further cleaved by gut wall α-glucosidases to release glucose, which is absorbed into the bloodstream. Potential surface binding sites for starch on the pancreatic amylase, distinct from the active site of the amylase, have been identified through X-ray crystallographic analyses. The role of these sites in the degradation of both starch granules and soluble starch was probed by the generation of a series of surface variants modified at each site to disrupt binding. Kinetic analysis of the binding and/or cleavage of substrates ranging from simple maltotriosides to soluble starch and insoluble starch granules has allowed evaluation of the potential role of each such surface site. In this way, two key surface binding sites, on the same face as the active site, are identified. One site, containing a pair of aromatic residues, is responsible for attachment to starch granules, while a second site featuring a tryptophan residue around which a malto-oligosaccharide wraps is shown to heavily influence soluble starch binding and hydrolysis. These studies provide insights into the mechanisms by which enzymes tackle the degradation of largely insoluble polymers and also present some new approaches to the interrogation of the binding sites involved.

  4. Characterization of the binding sites for dicarboxylic acids on bovine serum albumin.

    Science.gov (United States)

    Tonsgard, J H; Meredith, S C

    1991-06-15

    Dicarboxylic acids are prominent features of several diseases, including Reye's syndrome and inborn errors of mitochondrial and peroxisomal fatty acid oxidation. Moreover, dicarboxylic acids are potentially toxic to cellular processes. Previous studies [Tonsgard, Mendelson & Meredith (1988) J. Clin. Invest. 82, 1567-1573] demonstrated that long-chain dicarboxylic acids have a single high-affinity binding site and between one and three lower-affinity sites on albumin. Medium-chain-length dicarboxylic acids have a single low-affinity site. We further characterized dicarboxylic acid binding to albumin in order to understand the potential effects of drugs and other ligands on dicarboxylic acid binding and toxicity. Progesterone and oleate competitively inhibit octadecanedioic acid binding to the single high-affinity site. Octanoate inhibits binding to the low-affinity sites. Dansylated probes for subdomain 2AB inhibit dodecanedioic acid binding whereas probes for subdomain 3AB do not. In contrast, low concentrations of octadecanedioic acid inhibit the binding of dansylated probes to subdomain 3AB and 2AB. L-Tryptophan, which binds in subdomain 3AB, inhibits hexadecanedioic acid binding but has no effect on dodecanedioic acid. Bilirubin and acetylsalicylic acid, which bind in subdomain 2AB, inhibit the binding of medium-chain and long-chain dicarboxylic acids. Our results suggest that long-chain dicarboxylic acids bind in subdomains 2C, 3AB and 2AB. The single low-affinity binding site for medium-chain dicarboxylic acids is in subdomain 2AB. These studies suggest that dicarboxylic acids are likely to be unbound in disease states and may be potentially toxic.

  5. Functional identification and characterization of sodium binding sites in Na symporters.

    Science.gov (United States)

    Loo, Donald D F; Jiang, Xuan; Gorraitz, Edurne; Hirayama, Bruce A; Wright, Ernest M

    2013-11-19

    Sodium cotransporters from several different gene families belong to the leucine transporter (LeuT) structural family. Although the identification of Na(+) in binding sites is beyond the resolution of the structures, two Na(+) binding sites (Na1 and Na2) have been proposed in LeuT. Na2 is conserved in the LeuT family but Na1 is not. A biophysical method has been used to measure sodium dissociation constants (Kd) of wild-type and mutant human sodium glucose cotransport (hSGLT1) proteins to identify the Na(+) binding sites in hSGLT1. The Na1 site is formed by residues in the sugar binding pocket, and their mutation influences sodium binding to Na1 but not to Na2. For the canonical Na2 site formed by two -OH side chains, S392 and S393, and three backbone carbonyls, mutation of S392 to cysteine increased the sodium Kd by sixfold. This was accompanied by a dramatic reduction in the apparent sugar and phlorizin affinities. We suggest that mutation of S392 in the Na2 site produces a structural rearrangement of the sugar binding pocket to disrupt both the binding of the second Na(+) and the binding of sugar. In contrast, the S393 mutations produce no significant changes in sodium, sugar, and phlorizin affinities. We conclude that the Na2 site is conserved in hSGLT1, the side chain of S392 and the backbone carbonyl of S393 are important in the first Na(+) binding, and that Na(+) binding to Na2 promotes binding to Na1 and also sugar binding.

  6. [Type-I and -II estradiol binding sites in the endometrium during blastocyst implantation].

    Science.gov (United States)

    Bernal, A; Calzada, L; Hicks, J J; Velázquez, A

    1989-04-01

    The properties of type I and of occupied and unoccupied type II cytosolic estrogen binding sites in the rat endometrium were analyzed on day five of pregnancy; the samples studied corresponded to blastocyst-receptive endometrium (implantation sites), nonreceptive endometrium and ovariectomized uterine horn endometrium, all from the same pregnant rats. The occupied type II binding sites were analyzed by exchange assays. The dissociation constants obtained from experiments carried out at 4 or 25 °C are similar for each binding site across the three endometrium samples; the binding capacities (femtomoles/mg protein) of the type I and type II sites, and the ratio between occupied (by endogenous estradiol) and unoccupied type II sites, seem to be characteristic of each of the three endometria analyzed.

  7. Variable context Markov chains for HIV protease cleavage site prediction.

    Science.gov (United States)

    Oğul, Hasan

    2009-06-01

    Deciphering HIV protease specificity and developing computational tools for detecting its cleavage sites in a polypeptide chain are highly desirable for designing efficient and specific chemical inhibitors to prevent acquired immunodeficiency syndrome. In this study, we developed a generative model based on a generalization of variable order Markov chains (VOMC) for peptide sequences and adapted the model for prediction of their cleavability by certain proteases. The new method, called variable context Markov chains (VCMC), attempts to identify the context equivalence based on the evolutionary similarities between individual amino acids. It was applied to the HIV-1 protease cleavage site prediction problem and shown to outperform existing methods in terms of prediction accuracy on a common dataset. In general, the method is a promising tool for predicting the cleavage sites of all proteases, and its use is encouraged for any kind of peptide classification problem as well.

  8. Mapping the heparin-binding site of the osteoinductive protein NELL1 by site-directed mutagenesis.

    Science.gov (United States)

    Takahashi, Kaneyoshi; Imai, Arisa; Iijima, Masumi; Yoshimoto, Nobuo; Maturana, Andrés D; Kuroda, Shun'ichi; Niimi, Tomoaki

    2015-12-21

    Neural epidermal growth factor-like (NEL)-like 1 (NELL1) is a secretory osteogenic protein comprising an N-terminal thrombospondin-1-like (TSPN) domain, four von Willebrand factor type C domains, and six epidermal growth factor-like repeats. NELL1 shows heparin-binding activity; however, the biological significance remains to be explored. In this report, we demonstrate that NELL1 binds to cell surface proteoglycans through its TSPN domain. Major heparin-binding sites were identified on the three-dimensional structural model of the TSPN domain of NELL1. Mutant analysis of the heparin-binding sites indicated that the heparin-binding activity of the TSPN domain is involved in interaction of NELL1 with cell surface proteoglycans.

  9. On the specificity of heparin/heparan sulfate binding to proteins. Anion-binding sites on antithrombin and thrombin are fundamentally different.

    Directory of Open Access Journals (Sweden)

    Philip D Mosier

    BACKGROUND: The antithrombin-heparin/heparan sulfate (H/HS) and thrombin-H/HS interactions are recognized as prototypic specific and non-specific glycosaminoglycan (GAG)-protein interactions, respectively. The fundamental structural basis for the origin of specificity, or lack thereof, in these interactions remains unclear. The availability of multiple co-crystal structures facilitates a structural analysis that challenges the long-held belief that the GAG binding sites in antithrombin and thrombin are essentially similar with high solvent exposure and shallow surface characteristics. METHODOLOGY: Analyses of solvent accessibility and exposed surface areas, gyrational mobility, symmetry, cavity shape/size, conserved water molecules and crystallographic parameters were performed for 12 X-ray structures, which include 12 thrombin and 16 antithrombin chains. Novel calculations are described for gyrational mobility and prediction of water loci and conservation. RESULTS: The solvent accessibilities and gyrational mobilities of arginines and lysines in the binding sites of the two proteins reveal sharp contrasts. The distribution of positive charges shows considerable asymmetry in antithrombin, but substantial symmetry for thrombin. Cavity analyses suggest the presence of a reasonably sized bifurcated cavity in antithrombin that facilitates a firm 'hand-shake' with H/HS, but with thrombin, a weaker 'high-five'. Tightly bound water molecules were predicted to be localized in the pentasaccharide binding pocket of antithrombin, but absent in thrombin. Together, these differences in the binding sites explain the major H/HS recognition characteristics of the two prototypic proteins, thus affording an explanation of the specificity of binding. This provides a foundation for understanding specificity of interaction at an atomic level, which will greatly aid the design of natural or synthetic H/HS sequences that target proteins in a specific manner.

  10. GTRD: a database of transcription factor binding sites identified by ChIP-seq experiments

    Science.gov (United States)

    Yevshin, Ivan; Sharipov, Ruslan; Valeev, Tagir; Kel, Alexander; Kolpakov, Fedor

    2017-01-01

    GTRD—Gene Transcription Regulation Database (http://gtrd.biouml.org)—is a database of transcription factor binding sites (TFBSs) identified by ChIP-seq experiments for human and mouse. Raw ChIP-seq data were obtained from ENCODE and SRA and uniformly processed: (i) reads were aligned using Bowtie2; (ii) ChIP-seq peaks were called using peak callers MACS, SISSRs, GEM and PICS; (iii) peaks for the same factor and peak callers, but different experiment conditions (cell line, treatment, etc.), were merged into clusters; (iv) such clusters for different peak callers were merged into metaclusters that were considered as non-redundant sets of TFBSs. In addition to information on location in genome, the sets contain structured information about cell lines and experimental conditions extracted from descriptions of corresponding ChIP-seq experiments. A web interface to access GTRD was developed using the BioUML platform. It provides: (i) browsing and displaying information; (ii) advanced search possibilities, e.g. search of TFBSs near the specified gene or search of all genes potentially regulated by a specified transcription factor; (iii) integrated genome browser that provides visualization of the GTRD data: read alignments, peaks, clusters, metaclusters and information about gene structures from the Ensembl database and binding sites predicted using position weight matrices from the HOCOMOCO database. PMID:27924024
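    Steps (iii) and (iv) of the processing pipeline both amount to merging genomic intervals into non-redundant clusters. A minimal sketch of that operation is given below. It assumes that peaks are merged whenever they overlap or touch on the same chromosome and that coordinates are half-open; the abstract does not specify GTRD's actual merging rule, so both choices and the function name are illustrative assumptions.

```python
def merge_peaks(peaks):
    """Merge overlapping or touching (chrom, start, end) peaks into clusters.

    Coordinates are treated as half-open intervals [start, end); this and the
    overlap criterion are assumptions, not GTRD's documented behaviour.
    """
    clusters = []
    for chrom, start, end in sorted(peaks):
        if clusters and clusters[-1][0] == chrom and start <= clusters[-1][2]:
            # Extend the current cluster instead of starting a new one.
            last = clusters[-1]
            clusters[-1] = (chrom, last[1], max(last[2], end))
        else:
            clusters.append((chrom, start, end))
    return clusters

if __name__ == "__main__":
    # Hypothetical peaks for one factor from different experiments or peak callers.
    peaks = [
        ("chr1", 100, 200),
        ("chr1", 150, 250),
        ("chr1", 300, 400),
        ("chr2", 120, 180),
    ]
    for cluster in merge_peaks(peaks):
        print(cluster)
```

    Applying the same function first within each peak caller and then across peak callers would mirror the cluster and metacluster stages described above.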

  11. Binding site turnover produces pervasive quantitative changes in transcription factor binding between closely related Drosophila species.

    Directory of Open Access Journals (Sweden)

    Robert K Bradley

    2010-03-01

    Full Text Available Changes in gene expression play an important role in evolution, yet the molecular mechanisms underlying regulatory evolution are poorly understood. Here we compare genome-wide binding of the six transcription factors that initiate segmentation along the anterior-posterior axis in embryos of two closely related species: Drosophila melanogaster and Drosophila yakuba. Where we observe binding by a factor in one species, we almost always observe binding by that factor to the orthologous sequence in the other species. Levels of binding, however, vary considerably. The magnitude and direction of the interspecies differences in binding levels of all six factors are strongly correlated, suggesting a role for chromatin or other factor-independent forces in mediating the divergence of transcription factor binding. Nonetheless, factor-specific quantitative variation in binding is common, and we show that it is driven to a large extent by the gain and loss of cognate recognition sequences for the given factor. We find only a weak correlation between binding variation and regulatory function. These data provide the first genome-wide picture of how modest levels of sequence divergence between highly morphologically similar species affect a system of coordinately acting transcription factors during animal development, and highlight the dominant role of quantitative variation in transcription factor binding over short evolutionary distances.

  12. DO-RIP-seq to quantify RNA binding sites transcriptome-wide.

    Science.gov (United States)

    Nicholson, Cindo O; Friedersdorf, Matthew B; Bisogno, Laura S; Keene, Jack D

    2016-11-10

    Post-transcriptional processes orchestrate gene expression through dynamic protein-RNA interactions. These interactions occur at specific sites determined by RNA sequence, secondary structure, or nucleotide modifications. Methods have been developed either to quantify binding of whole transcripts or to identify the binding sites, but there is none proven to quantify binding at both the whole transcript and binding site levels. Here we describe digestion optimized RNA immunoprecipitation with deep sequencing (DO-RIP-seq) as a method that quantitates at the whole transcript target (RIP-Seq-Like or RSL) level and at the binding site level (BSL) using continuous metrics. DO-RIP-seq methodology was developed using the RBP HuR/ELAVL1 as a test case (Nicholson et al., 2016). DO-RIP-seq employs treatment of cell lysates with a nuclease under optimized conditions to yield partially digested RNA fragments bound by RNA binding proteins, followed by immunoprecipitations that capture the digested RNA-protein complexes and assess non-specific or background interactions. Analyses of sequenced cDNA libraries made from the bound RNA fragments yielded two types of enrichment scores; one for RSL binding events and the other for BSL events (Nicholson et al., 2016). These analyses plus the extensive read coverage of DO-RIP-seq allows seamless integration of binding site and whole transcript information. Therefore, DO-RIP-seq is useful for quantifying RBP binding events that are regulated during dynamic biological processes.

  13. Mutational analysis of the high-affinity zinc binding site validates a refined human dopamine transporter homology model.

    Directory of Open Access Journals (Sweden)

    Thomas Stockner

    The high-resolution crystal structure of the leucine transporter (LeuT) is frequently used as a template for homology models of the dopamine transporter (DAT). Although similar in structure, DAT differs considerably from LeuT in a number of ways: (i) when compared to LeuT, DAT has very long intracellular amino and carboxyl termini; (ii) LeuT and DAT share a rather low overall sequence identity (22%) and (iii) the extracellular loop 2 (EL2) of DAT is substantially longer than that of LeuT. Extracellular zinc binds to DAT and restricts the transporter's movement through the conformational cycle, thereby resulting in a decrease in substrate uptake. Residue H293 in EL2 participates in zinc binding and must be modelled correctly to allow for a full understanding of its effects. We exploited the high-affinity zinc binding site endogenously present in DAT to create a model of the complete transmembrane domain of DAT. The zinc binding site provided a DAT-specific molecular ruler for calibration of the model. Our DAT model places EL2 at the transporter lipid interface in the vicinity of the zinc binding site. Based on the model, D206 was predicted to represent a fourth co-ordinating residue, in addition to the three previously described zinc binding residues H193, H375 and E396. This prediction was confirmed by mutagenesis: substitution of D206 by lysine and cysteine affected the inhibitory potency of zinc and the maximum inhibition exerted by zinc, respectively. Conversely, the structural changes observed in the model allowed for rationalizing the zinc-dependent regulation of DAT: upon binding, zinc stabilizes the outward-facing state, because its first coordination shell can only be completed in this conformation. Thus, the model provides a validated solution to the long extracellular loop and may be useful to address other aspects of the transport cycle.

  14. Mutational analysis of the high-affinity zinc binding site validates a refined human dopamine transporter homology model.

    Science.gov (United States)

    Stockner, Thomas; Montgomery, Therese R; Kudlacek, Oliver; Weissensteiner, Rene; Ecker, Gerhard F; Freissmuth, Michael; Sitte, Harald H

    2013-01-01

    The high-resolution crystal structure of the leucine transporter (LeuT) is frequently used as a template for homology models of the dopamine transporter (DAT). Although similar in structure, DAT differs considerably from LeuT in a number of ways: (i) when compared to LeuT, DAT has very long intracellular amino and carboxyl termini; (ii) LeuT and DAT share a rather low overall sequence identity (22%) and (iii) the extracellular loop 2 (EL2) of DAT is substantially longer than that of LeuT. Extracellular zinc binds to DAT and restricts the transporter's movement through the conformational cycle, thereby resulting in a decrease in substrate uptake. Residue H293 in EL2 participates in zinc binding and must be modelled correctly to allow for a full understanding of its effects. We exploited the high-affinity zinc binding site endogenously present in DAT to create a model of the complete transmembrane domain of DAT. The zinc binding site provided a DAT-specific molecular ruler for calibration of the model. Our DAT model places EL2 at the transporter lipid interface in the vicinity of the zinc binding site. Based on the model, D206 was predicted to represent a fourth co-ordinating residue, in addition to the three previously described zinc binding residues H193, H375 and E396. This prediction was confirmed by mutagenesis: substitution of D206 by lysine and cysteine affected the inhibitory potency of zinc and the maximum inhibition exerted by zinc, respectively. Conversely, the structural changes observed in the model allowed for rationalizing the zinc-dependent regulation of DAT: upon binding, zinc stabilizes the outward-facing state, because its first coordination shell can only be completed in this conformation. Thus, the model provides a validated solution to the long extracellular loop and may be useful to address other aspects of the transport cycle.

  15. Multiple ³H-oxytocin binding sites in rat myometrial plasma membranes

    Energy Technology Data Exchange (ETDEWEB)

    Crankshaw, D.; Gaspar, V.; Pliska, V. (McMaster Univ., Hamilton, Ontario, (Canada))

    1990-01-01

    The affinity spectrum method has been used to analyse binding isotherms for ³H-oxytocin to rat myometrial plasma membranes. Three populations of binding sites with dissociation constants (Kd) of 0.6-1.5 x 10⁻⁹, 0.4-1.0 x 10⁻⁷ and 7 x 10⁻⁶ mol/l were identified and their existence verified by cluster analysis based on similarities between Kd, binding capacity and Hill coefficient. When experimental values were compared to theoretical curves constructed using the estimated binding parameters, good fits were obtained. Binding parameters obtained by this method were not influenced by the presence of GTPγS (guanosine-5'-O-3-thiotriphosphate) in the incubation medium. The binding parameters agree reasonably well with those found in uterine cells; they support the existence of a medium affinity site and may allow for an explanation of some of the discrepancies between binding and response in this system.
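
    To illustrate what a binding isotherm with three site populations looks like, the following minimal sketch sums independent one-site (Langmuir) terms. It is not the affinity spectrum method itself. The Kd values are mid-range picks from the abstract, the binding capacities (Bmax) are hypothetical, and a Hill coefficient of 1 is assumed for every population.

```python
# Sketch: total binding as a sum of independent one-site (Langmuir) terms.
# Kd values are taken from the ranges in the abstract; Bmax values are
# hypothetical, and each population is assumed to have a Hill coefficient of 1.

def bound(free_ligand, sites):
    """Return the sum of Bmax * L / (Kd + L) over all site populations."""
    return sum(bmax * free_ligand / (kd + free_ligand) for kd, bmax in sites)

# (Kd in mol/l, Bmax in arbitrary units)
SITES = [
    (1.0e-9, 1.0),   # high-affinity population
    (0.7e-7, 5.0),   # medium-affinity population
    (7.0e-6, 20.0),  # low-affinity population
]

if __name__ == "__main__":
    for conc in (1e-10, 1e-9, 1e-7, 1e-5):
        print(f"{conc:.0e} mol/l -> bound = {bound(conc, SITES):.3f}")
```

    At a free ligand concentration equal to a population's Kd, that population is half saturated, which is why sites with well-separated Kd values show up as distinct steps in the isotherm.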

  16. Six independent fucose-binding sites in the crystal structure of Aspergillus oryzae lectin.

    Science.gov (United States)

    Makyio, Hisayoshi; Shimabukuro, Junpei; Suzuki, Tatsuya; Imamura, Akihiro; Ishida, Hideharu; Kiso, Makoto; Ando, Hiromune; Kato, Ryuichi

    2016-08-26

    The crystal structure of AOL (a fucose-specific lectin of Aspergillus oryzae) has been solved by SAD (single-wavelength anomalous diffraction) and MAD (multi-wavelength anomalous diffraction) phasing of seleno-fucosides. The overall structure is a six-bladed β-propeller similar to that of other fucose-specific lectins. The fucose moieties of the seleno-fucosides are located in six fucose-binding sites. Although the Arg and Glu/Gln residues bound to the fucose moiety are common to all fucose-binding sites, the amino-acid residues involved in fucose binding at each site are not identical. The varying peak heights of the seleniums in the electron density map suggest that each fucose-binding site has a different carbohydrate binding affinity.

  17. Isocitrate binding at two functionally distinct sites in yeast NAD+-specific isocitrate dehydrogenase.

    Science.gov (United States)

    Lin, An-Ping; McAlister-Henn, Lee

    2002-06-21

    Yeast NAD(+)-specific isocitrate dehydrogenase (IDH) is an octamer containing two types of homologous subunits. Ligand-binding analyses were conducted to examine effects of residue changes in putative catalytic and regulatory isocitrate-binding sites respectively contained in IDH2 and IDH1 subunits. Replacement of homologous serine residues in either subunit site, S98A in IDH2 or S92A in IDH1, was found to reduce by half the total number of holoenzyme isocitrate-binding sites, confirming a correlation between detrimental effects on isocitrate binding and respective kinetic defects in catalysis and allosteric activation by AMP. Replacement of both serine residues eliminates isocitrate binding and measurable catalytic activity. The putative isocitrate-binding sites of IDH1 and IDH2 contain five identical and four nonidentical residues. Reciprocal replacement of the four nonidentical residues in either or both subunits (A108R, F136Y, T241D, and N245D in IDH1 and/or R114A, Y142F, D248T, and D252N in IDH2) was found to be permissive for isocitrate binding. This provides further evidence for two types of binding sites in IDH, although the authentic residues have been shown to be necessary for normal kinetic contributions. Finally, the mutant enzymes with residue replacements in the IDH1 site were found to be unable to bind AMP, suggesting that allosteric activation is dependent both upon binding of isocitrate at the IDH1 site and upon the changes in the enzyme normally elicited by this binding.

  18. Selective prediction of interaction sites in protein structures with THEMATICS

    Directory of Open Access Journals (Sweden)

    Murga Leonel F

    2007-04-01

    Background: Methods are now available for the prediction of interaction sites in protein 3D structures. While many of these methods report high success rates for site prediction, often these predictions are not very selective and have low precision. Precision in site prediction is addressed using Theoretical Microscopic Titration Curves (THEMATICS), a simple computational method for the identification of active sites in enzymes. Recall and precision are measured and compared with other methods for the prediction of catalytic sites. Results: Using a test set of 169 enzymes from the original Catalytic Residue Dataset (CatRes), it is shown that THEMATICS can deliver precise, localised site predictions. Furthermore, adjustment of the cut-off criteria can improve the recall rates for catalytic residues with only a small sacrifice in precision. Recall rates for CatRes/CSA annotated catalytic residues are 41.1%, 50.4%, and 54.2% for Z score cut-off values of 1.00, 0.99, and 0.98, respectively. The corresponding precision rates are 19.4%, 17.9%, and 16.4%. The success rate for catalytic sites is higher, with correct or partially correct predictions for 77.5%, 85.8%, and 88.2% of the enzymes in the test set, corresponding to the same respective Z score cut-offs, if only the CatRes annotations are used as the reference set. Incorporation of additional literature annotations into the reference set gives total success rates of 89.9%, 92.9%, and 94.1%, again for corresponding cut-off values of 1.00, 0.99, and 0.98. False positive rates for a 75-protein test set are 1.95%, 2.60%, and 3.12% for Z score cut-offs of 1.00, 0.99, and 0.98, respectively. Conclusion: With a preferred cut-off value of 0.99, THEMATICS achieves a high success rate of interaction site prediction, about 86% correct or partially correct using CatRes/CSA annotations only and about 93% with an expanded reference set. Success rates for catalytic residue prediction are similar to those of...
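
    The recall and precision figures quoted above follow the usual definitions: precision = TP / (TP + FP) and recall = TP / (TP + FN), counted per residue. The sketch below shows that arithmetic only. It does not implement THEMATICS, and the residue labels are hypothetical.

```python
# Sketch: per-residue precision and recall for a site prediction.
# The residue labels below are hypothetical and only illustrate the arithmetic.

def precision_recall(predicted, annotated):
    """Return (precision, recall) for predicted vs. annotated residue sets."""
    predicted, annotated = set(predicted), set(annotated)
    true_positives = len(predicted & annotated)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(annotated) if annotated else 0.0
    return precision, recall

if __name__ == "__main__":
    predicted = {"H57", "D102", "S195", "G193", "K60"}
    annotated = {"H57", "D102", "S195", "G196"}
    p, r = precision_recall(predicted, annotated)
    print(f"precision = {p:.2f}, recall = {r:.2f}")
```

    Relaxing a score cut-off typically enlarges the predicted set, which is how recall can rise while precision falls, as in the figures reported above.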

  19. Statistical Mechanics of Transcription-Factor Binding Site Discovery Using Hidden Markov Models.

    Science.gov (United States)

    Mehta, Pankaj; Schwab, David J; Sengupta, Anirvan M

    2011-04-01

    Hidden Markov Models (HMMs) are a commonly used tool for inference of transcription factor (TF) binding sites from DNA sequence data. We exploit the mathematical equivalence between HMMs for TF binding and the "inverse" statistical mechanics of hard rods in a one-dimensional disordered potential to investigate learning in HMMs. We derive analytic expressions for the Fisher information, a commonly employed measure of confidence in learned parameters, in the biologically relevant limit where the density of binding sites is low. We then use techniques from statistical mechanics to derive a scaling principle relating the specificity (binding energy) of a TF to the minimum amount of training data necessary to learn it.

  20. Interaction of Palmitic Acid with Metoprolol Succinate at the Binding Sites of Bovine Serum Albumin

    OpenAIRE

    Mashiur Rahman; Farzana Prianka; Mohammad Shohel; Md. Abdul Mazid

    2014-01-01

    Purpose: The aim of this study was to characterize the binding profile of metoprolol succinate and to identify the interaction of palmitic acid with metoprolol succinate at its binding site on albumin. Methods: The binding of metoprolol succinate to bovine serum albumin (BSA) was studied by the equilibrium dialysis (ED) method at 27°C and pH 7.4 to gain insight into the binding chemistry of the drug to BSA in the presence and absence of palmitic acid. The study was carried out using ranitidine as site-1 a...

  1. Defining the plasticity of transcription factor binding sites by Deconstructing DNA consensus sequences: the PhoP-binding sites among gamma/enterobacteria.

    Directory of Open Access Journals (Sweden)

    Oscar Harari

    Transcriptional regulators recognize specific DNA sequences. Because these sequences are embedded in the background of genomic DNA, it is hard to identify the key cis-regulatory elements that determine disparate patterns of gene expression. The detection of the intra- and inter-species differences among these sequences is crucial for understanding the molecular basis of both differential gene expression and evolution. Here, we address this problem by investigating the target promoters controlled by the DNA-binding PhoP protein, which governs virulence and Mg²⁺ homeostasis in several bacterial species. PhoP is particularly interesting; it is highly conserved in different gamma/enterobacteria, regulating not only ancestral genes but also governing the expression of dozens of horizontally acquired genes that differ from species to species. Our approach consists of decomposing the DNA binding site sequences for a given regulator into families of motifs (termed submotifs) using a machine learning method inspired by the "Divide & Conquer" strategy. By partitioning a motif into sub-patterns, computational advantages for classification were produced, resulting in the discovery of new members of a regulon, and alleviating the problem of distinguishing functional sites in chromatin immunoprecipitation and DNA microarray genome-wide analysis. Moreover, we found that certain partitions were useful in revealing biological properties of binding site sequences, including modular gains and losses of PhoP binding sites through evolutionary turnover events, as well as conservation in distant species. The high conservation of PhoP submotifs within gamma/enterobacteria, as well as the regulatory protein that recognizes them, suggests that the major cause of divergence between related species is not due to the binding sites, as was previously suggested for other regulators. Instead, the divergence may be attributed to the fast evolution of orthologous target...

  2. Using Carbohydrate Interaction Assays to Reveal Novel Binding Sites in Carbohydrate Active Enzymes

    DEFF Research Database (Denmark)

    Cockburn, Darrell; Wilkens, Casper; Dilokpimol, Adiphol

    2016-01-01

    Carbohydrate active enzymes often contain auxiliary binding sites located either on independent domains termed carbohydrate binding modules (CBMs) or as so-called surface binding sites (SBSs) on the catalytic module at a certain distance from the active site. The SBSs are usually critical...... for the activity of their cognate enzyme, though they are not readily detected in the sequence of a protein, but normally require a crystal structure of a complex for their identification. A variety of methods, including affinity electrophoresis (AE), insoluble polysaccharide pulldown (IPP) and surface plasmon...... sites, but also for identifying new ones, even without structural data available. We further verify the chosen assays discriminate between known SBS/CBM containing enzymes and negative controls. Altogether 35 enzymes are screened for the presence of SBSs or CBMs and several novel binding sites...

  3. Xaa-Arg-Gly triplets in the collagen triple helix are dominant binding sites for the molecular chaperone HSP47.

    Science.gov (United States)

    Koide, Takaki; Takahara, Yoshifumi; Asada, Shinichi; Nagata, Kazuhiro

    2002-02-22

    HSP47 is an essential procollagen-specific molecular chaperone that resides in the endoplasmic reticulum of procollagen-producing cells. Recent advances have revealed that HSP47 recognizes the (Pro-Pro-Gly)(n) sequence but not (Pro-Hyp-Gly)(n) and that HSP47 recognizes the triple-helical conformation. In this study, to better understand the substrate recognition by HSP47, we synthesized various collagen model peptides and examined their interaction with HSP47 in vitro. We found that the Pro-Arg-Gly triplet forms an HSP47-binding site. The HSP47 binding was observed only when Arg residues were incorporated in the Yaa positions of the Xaa-Yaa-Gly triplets. Amino acids in the Xaa position did not largely affect the interaction. The recognition of the Arg residue by HSP47 was specific to its side-chain structure because replacement of the Arg residue by other basic amino acids decreased the affinity to HSP47. The significance of Arg residues in HSP47 binding was further confirmed by using residue-specific chemical modification of types I and III collagen. Our results demonstrate that Xaa-Arg-Gly sequences in the triple-helical procollagen molecule are dominant binding sites for HSP47 and enable us to predict HSP47-binding sites in homotrimeric procollagen molecules.
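
    The closing statement, that Xaa-Arg-Gly sequences allow HSP47-binding sites to be predicted, can be illustrated with a simple sequence scan. The sketch below is a frame-agnostic search for any residue followed by Arg-Gly in one-letter code. The test sequence is hypothetical (with "O" used for hydroxyproline by assumption), and the scan does not check triple-helical conformation, which the abstract states is also required for recognition.

```python
import re

# Sketch: list candidate Xaa-Arg-Gly triplets in a one-letter-code sequence.
# Assumption: a plain pattern scan; triple-helical context is not checked.
TRIPLET = re.compile(r"(?=([A-Z]RG))")  # lookahead so overlapping hits are kept

def find_arg_triplets(sequence):
    """Return (0-based start, triplet) for every Xaa-Arg-Gly occurrence."""
    return [(m.start(), m.group(1)) for m in TRIPLET.finditer(sequence.upper())]

if __name__ == "__main__":
    # Hypothetical collagen-like peptide; "O" stands for hydroxyproline here.
    peptide = "GPOGPRGPOGARGFOG"
    for start, triplet in find_arg_triplets(peptide):
        print(start, triplet)
```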

  4. Predicting web site audience demographics for web advertising targeting using multi-web site clickstream data

    OpenAIRE

    Bock, K W; Van den Poel, D.; Manigart, S.

    2009-01-01

    Several recent studies have explored the virtues of behavioral targeting and personalization for online advertising. In this paper, we add to this literature by proposing a cost-effective methodology for the prediction of demographic web site visitor profiles that can be used for web advertising targeting purposes. The methodology involves the transformation of web site visitors’ clickstream patterns to a set of features and the training of Random Forest classifiers that generate predictions ...

  5. Partial enterectomy decreases somatostatin-binding sites in residual intestine of rabbits.

    Science.gov (United States)

    Colas, B; Bodegas, G; Sanz, M; Prieto, J C; Arilla, E

    1988-05-01

    1. Three weeks after partial enterectomy in the rabbit there was an increased somatostatin concentration and a decreased number of somatostatin-binding sites (without changes in the corresponding affinity values) in the cytosol of the residual intestinal tissue, except in the terminal ileum and the colon. 2. Five weeks after surgery both the somatostatin concentration and the number of somatostatin-binding sites returned towards control values. 3. These results suggest that an increase in bowel somatostatin content could lead to down-regulation of somatostatin-binding sites in the intestinal mucosa.

  6. Brominated lipids identify lipid binding sites on the surface of the reaction center from Rhodobacter sphaeroides.

    Science.gov (United States)

    Roszak, Aleksander W; Gardiner, Alastair T; Isaacs, Neil W; Cogdell, Richard J

    2007-03-20

    This study describes the use of brominated phospholipids to distinguish between lipid and detergent binding sites on the surface of a typical alpha-helical membrane protein. Reaction centers isolated from Rhodobacter sphaeroides were cocrystallized with added brominated phospholipids. X-ray structural analysis of these crystals has revealed the presence of two lipid binding sites from the characteristic strong X-ray scattering from the bromine atoms. These results demonstrate the usefulness of this approach to mapping lipid binding sites at the surface of membrane proteins.

  7. An Experimentally Based Computer Search Identifies Unstructured Membrane-binding Sites in Proteins

    Science.gov (United States)

    Brzeska, Hanna; Guag, Jake; Remmert, Kirsten; Chacko, Susan; Korn, Edward D.

    2010-01-01

    Programs exist for searching protein sequences for potential membrane-penetrating segments (hydrophobic regions) and for lipid-binding sites with highly defined tertiary structures, such as PH, FERM, C2, ENTH, and other domains. However, a rapidly growing number of membrane-associated proteins (including cytoskeletal proteins, kinases, GTP-binding proteins, and their effectors) bind lipids through less structured regions. Here, we describe the development and testing of a simple computer search program that identifies unstructured potential membrane-binding sites. Initially, we found that both basic and hydrophobic amino acids, irrespective of sequence, contribute to the binding to acidic phospholipid vesicles of synthetic peptides that correspond to the putative membrane-binding domains of Acanthamoeba class I myosins. Based on these results, we modified a hydrophobicity scale giving Arg- and Lys-positive, rather than negative, values. Using this basic and hydrophobic scale with a standard search algorithm, we successfully identified previously determined unstructured membrane-binding sites in all 16 proteins tested. Importantly, basic and hydrophobic searches identified previously unknown potential membrane-binding sites in class I myosins, PAKs and CARMIL (capping protein, Arp2/3, myosin I linker; a membrane-associated cytoskeletal scaffold protein), and synthetic peptides and protein domains containing these newly identified sites bound to acidic phospholipids in vitro. PMID:20018884
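
    The search described above, a scale that scores both basic and hydrophobic residues positively combined with a standard window scan, can be sketched as follows. The scale values, window length and threshold are hypothetical placeholders, not the parameters used by the authors.

```python
# Sketch: sliding-window scan with a toy "basic and hydrophobic" scale.
# All scale values, the window length and the threshold are hypothetical.
HYDROPHOBIC = set("AILMFVWY")
BASIC = set("KR")
ACIDIC = set("DE")

def residue_score(residue):
    """Score Arg/Lys and hydrophobic residues positively, acidic ones negatively."""
    if residue in HYDROPHOBIC or residue in BASIC:
        return 1.0
    if residue in ACIDIC:
        return -1.0
    return 0.0

def window_scores(sequence, window=5):
    """Return the mean residue score of every window along the sequence."""
    values = [residue_score(r) for r in sequence]
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]

def candidate_windows(sequence, window=5, threshold=0.7):
    """Return (0-based start, segment) for windows scoring at or above threshold."""
    return [(i, sequence[i:i + window])
            for i, score in enumerate(window_scores(sequence, window))
            if score >= threshold]

if __name__ == "__main__":
    print(candidate_windows("DEDEGKKLLRKDEDE"))
```

    Because the scan depends only on composition within the window, not on residue order, it matches the observation that basic and hydrophobic residues contribute irrespective of sequence.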

  8. Genome-wide identification of transcription start sites, promoters and transcription factor binding sites in E. coli.

    Directory of Open Access Journals (Sweden)

    Alfredo Mendoza-Vargas

    Despite almost 40 years of molecular genetics research in Escherichia coli, a major fraction of its Transcription Start Sites (TSSs) is still unknown, therefore limiting our understanding of the regulatory circuits that control gene expression in this model organism. RegulonDB (http://regulondb.ccg.unam.mx/) is aimed at integrating the genetic regulatory network of E. coli K12 as an entirely bioinformatic project up till now. In this work, we extended its aims by generating experimental data at a genome scale on TSSs, promoters and regulatory regions. We implemented a modified 5' RACE protocol and an unbiased High Throughput Pyrosequencing Strategy (HTPS) that allowed us to map more than 1700 TSSs with high precision. From this collection, about 230 corresponded to previously reported TSSs, which helped us to benchmark both our methodologies and the accuracy of the previous mapping experiments. The other ca. 1500 TSSs mapped belong to about 1000 different genes, many of them with no assigned function. We identified promoter sequences and type of sigma factors that control the expression of about 80% of these genes. As expected, the housekeeping sigma(70) was the most common type of promoter, followed by sigma(38). The majority of the putative TSSs were located between 20 and 40 nucleotides from the translational start site. Putative regulatory binding sites for transcription factors were detected upstream of many TSSs. For a few transcripts, riboswitches and small RNAs were found. Several genes also had additional TSSs within the coding region. Unexpectedly, the HTPS experiments revealed extensive antisense transcription, probably for regulatory functions. The new information in RegulonDB, now with more than 2400 experimentally determined TSSs, strengthens the accuracy of promoter prediction, operon structure, and regulatory networks and provides valuable new information that will facilitate the understanding from a global perspective the complex and...

  9. Divergent evolution of human p53 binding sites: cell cycle versus apoptosis.

    Directory of Open Access Journals (Sweden)

    Monica M Horvath

    2007-07-01

    The p53 tumor suppressor is a sequence-specific pleiotropic transcription factor that coordinates cellular responses to DNA damage and stress, initiating cell-cycle arrest or triggering apoptosis. Although the human p53 binding site sequence (or response element [RE]) is well characterized, some genes have consensus-poor REs that are nevertheless both necessary and sufficient for transactivation by p53. Identification of new functional gene regulatory elements under these conditions is problematic, and evolutionary conservation is often employed. We evaluated the comparative genomics approach for assessing evolutionary conservation of putative binding sites by examining conservation of 83 experimentally validated human p53 REs against mouse, rat, rabbit, and dog genomes and detected pronounced conservation differences among p53 REs and p53-regulated pathways. Bona fide NRF2 (nuclear factor [erythroid-derived 2]-like 2 nuclear factor) and NFkappaB (nuclear factor of kappa light chain gene enhancer in B cells) binding sites, which direct oxidative stress and innate immunity responses, were used as controls, and both exhibited high interspecific conservation. Surprisingly, the average p53 RE was not significantly more conserved than background genomic sequence, and p53 REs in apoptosis genes as a group showed very little conservation. The common bioinformatics practice of filtering RE predictions by 80% rodent sequence identity would not only give a false positive rate of approximately 19%, but miss up to 57% of true p53 REs. Examination of interspecific DNA base substitutions as a function of position in the p53 consensus sequence reveals an unexpected excess of diversity in apoptosis-regulating REs versus cell-cycle controlling REs (rodent comparisons: p < 1.0e-12). While some p53 REs show relatively high levels of conservation, REs in many genes such as BAX, FAS, PCNA, CASP6, SIVA1, and P53AIP1 show little if any homology to rodent sequences. This...

  10. Ligand-induced conformational changes: Improved predictions of ligand binding conformations and affinities

    DEFF Research Database (Denmark)

    Frimurer, T.M.; Peters, Günther H.J.; Iversen, L.F.

    2003-01-01

    A computational docking strategy using multiple conformations of the target protein is discussed and evaluated. A series of low molecular weight, competitive, nonpeptide protein tyrosine phosphatase inhibitors are considered for which the x-ray crystallographic structures in complex with protein...... tyrosine phosphatase 1 B (PTP1B) are known. To obtain a quantitative measure of the impact of conformational changes induced by the inhibitors, these were docked to the active site region of various structures of PTP1B using the docking program FlexX. Firstly, the inhibitors were docked to a PTP1B crystal...... predicted binding energy and a correct docking mode. Thirdly, to improve the predictability of the docking procedure in the general case, where only a single target protein structure is known, we evaluate an approach which takes possible protein side-chain conformational changes into account. Here, side...

  11. Europium ion as a probe for binding sites to carrageenans

    Energy Technology Data Exchange (ETDEWEB)

    Ramos, Ana P.; Goncalves, Rogeria R.; Serra, Osvaldo A. [Departamento de Quimica, Faculdade de Filosofia, Ciencias e Letras de Ribeirao Preto, Universidade de Sao Paulo, Ribeirao Preto, Sao Paulo 14040-901 (Brazil); Zaniquelli, Maria Elisabete D. [Departamento de Quimica, Faculdade de Filosofia, Ciencias e Letras de Ribeirao Preto, Universidade de Sao Paulo, Ribeirao Preto, Sao Paulo 14040-901 (Brazil)], E-mail: medzaniquelli@ffclrp.usp.br; Wong, Kenneth [Laboratorio de Fisico-Quimica, Centro de Pesquisas de Paulinia, Rhodia Brasil, Paulinia, Sao Paulo (Brazil)

    2007-12-15

    Carrageenans, sulfated polysaccharides extracted from red algae, present a coil-helix transition and helix aggregation dependence on the type and concentration of counterions. In this study, we focus attention on a mixed valence counterion system: Eu³⁺/Na⁺ or K⁺ with different gel-forming carrageenans: kappa, iota, and kappa-2. Stationary and time-dependent luminescence proved to be a suitable tool to probe ion binding to both the negatively charged sulfate group and the hydroxyl groups present in the biopolymer. For lower europium ion concentrations, a single longer decay emission lifetime was detected, which was attributed to the binding of europium ion to the carrageenan sulfate groups. An additional decay ascribed to europium binding to hydroxyl groups was observed above a threshold concentration, and this decay was dependent on the carrageenan charge density. Symmetry of the europium ion microenvironment was estimated by the ratio between the intensities of its emission bands, which has been shown to depend on the concentration of europium ions and on the specificity of the monovalent counterion bound to the carrageenan.

  12. Protein-binding RNA aptamers affect molecular interactions distantly from their binding sites

    DEFF Research Database (Denmark)

    Dupont, Daniel M; Thuesen, Cathrine K; Bøtkjær, Kenneth A;

    2015-01-01

    Nucleic acid aptamer selection is a powerful strategy for the development of regulatory agents for molecular intervention. Accordingly, aptamers have proven their diligence in the intervention with serine protease activities, which play important roles in physiology and pathophysiology. Nonetheless...... potential, both binding to the serine protease urokinase-type plasminogen activator (uPA). We determine the subsequent impact of aptamer binding on the well-established molecular interactions (plasmin, PAI-1, uPAR, and LRP-1A) controlling uPA activities. One of the aptamers (upanap-126) binds to the area...... around the C-terminal α-helix in pro-uPA, while the other aptamer (upanap-12) binds to both the β-hairpin of the growth factor domain and the kringle domain of uPA. Based on the mapping studies, combined with data from small-angle X-ray scattering analysis, we construct a model for the upanap-12:pro...

  13. Roles of multiple surface sites, long substrate binding clefts, and carbohydrate binding modules in the action of amylolytic enzymes on polysaccharide substrates

    DEFF Research Database (Denmark)

    Nielsen, Morten Munch; Seo, E.S.; Dilokpimol, Adiphol

    2008-01-01

    with a characteristic subsite binding energy profile around the catalytic site. Furthermore, several amylolytic enzymes that facilitate attack on the natural substrate, i.e. the endosperm starch granules, have secondary sugar binding sites either situated on the surface of the protein domain or structural unit...... that contains the catalytic site or belonging to a separate starch binding domain. The role of surface sites in the function of barley alpha-amylase 1 has been investigated by using mutational analysis in conjunction with carbohydrate binding analyses and crystallography. The ability to bind starch depends...

  14. In silico engineering and optimization of Transcription Activator-Like Effectors and their derivatives for improved DNA binding predictions.

    KAUST Repository

    Piatek, Marek J.

    2015-12-01

    Transcription Activator-Like Effectors (TALEs) can be used as adaptable DNA-binding modules to create site-specific chimeric nucleases or synthetic transcriptional regulators. The central repeat domain mediates specific DNA binding via hypervariable repeat di-residues (RVDs). This DNA-binding domain can be engineered to bind preferentially to any user-selected DNA sequence. Therefore, TALEs and their derivatives have become indispensable molecular tools in site-specific manipulation of genes and genomes. This thesis revolves around two problems: in silico design and improved binding site prediction of TALEs. In the first part, a study is shown where TALEs are successfully designed in silico and validated in the laboratory to yield the anticipated effects on selected genes. Software is developed to accompany the process of designing and prediction of binding sites. I expanded the functionality of the software to be used as a more generic set of tools for design, target and off-target searching. Part two contributes a method and associated toolkit developed to allow users to design in silico optimized synthetic TALEs with user-defined specificities for various experimental purposes. This method is based on a mutual relationship of three consecutive tandem repeats in the DNA-binding domain. This approach revealed positional and compositional bias behind the binding of TALEs to DNA. In conclusion, I developed methods, approaches, and software to enhance the functionality of synthetic TALEs, which should improve understanding of TALE biology and will further advance genome-engineering applications in various organisms and cell types.

  15. The designed protein M(II)-Gly-Lys-His-Fos(138-211) specifically cleaves the AP-1 binding site containing DNA.

    Science.gov (United States)

    Harford, C; Narindrasorasak, S; Sarkar, B

    1996-04-09

    A new specific DNA cleavage protein, Gly-Lys-His-Fos(138-211), was designed, expressed, and characterized. The DNA-binding component of the design uses the basic and leucine zipper regions of the leucine zipper Fos, which are represented by Fos(138-211). The DNA cleavage moiety was provided by the design of the amino-terminal Cu(II)-, Ni(II)-binding site GKH at the amino terminus of Fos(138-211). Binding of Cu(II) or Ni(II) by the protein activates its cleavage ability. The GKH motif was predicted to form a specific amino-terminal Cu(II)-, Ni(II)-binding motif as previously defined [Predki, P. F., Harford, C., Brar, P., & Sarkar, B. (1992) Biochem. J. 287, 211-215]. This prediction was verified as the tripeptide, GKH, and the expressed protein, GKH-Fos(138-211), were both shown to be capable of binding Cu(II) and Ni(II). The designed protein upon heterodimerization with Jun(248-334) was shown to bind to and cleave several forms of DNA which contained an AP-1 binding site. The cleavage was shown to be specific. This design demonstrates the versatility of the amino-terminal Cu(II)-, Ni(II)-binding motif and the variety of motifs which can be generated. The site of cleavage by GKH-Fos(138-211) on DNA provides further information regarding the bending of DNA upon binding to Fos-Jun heterodimers.

  16. Three-dimensional binding sites volume assessment during cardiac pacing lead extraction

    Directory of Open Access Journals (Sweden)

    Bich Lien Nguyen

    2015-07-01

    Conclusions: Real-time 3D binding sites assessment is feasible and improves transvenous lead extraction outcomes. Its role as a complementary information requires extensive validation, and might be beneficial for a tailored strategy.

  17. Probing and mapping the binding sites on streptavidin imprinted polymer surface

    Energy Technology Data Exchange (ETDEWEB)

    Duman, Memed, E-mail: memi@hacettepe.edu.tr

    2014-10-01

    Molecular imprinting is an effective technique for preparing recognition sites which act as synthetic receptors on polymeric surfaces. Herein, we synthesized MIP surfaces with specific binding sites for streptavidin and characterized them at nanoscale by using two different atomic force microscopy (AFM) techniques. While single molecule force spectroscopy (SMFS) reveals the unbinding kinetics between the streptavidin molecule and binding sites, simultaneous topography and recognition imaging (TREC) was employed, for the first time, to directly map the binding sites on streptavidin imprinted polymers. A streptavidin-modified AFM cantilever showed specific unbinding events with an unbinding force around 300 pN, and the binding probability was calculated as 35.2% at a given loading rate. In order to prove the specificity of the interaction, free streptavidin molecules were added to the AFM liquid cell and the binding probability decreased significantly to 7.6%. Moreover, the recognition maps show that the smallest recognition site has a diameter of around 21 nm, which corresponds to a single streptavidin molecule binding site. We believe that the potential of combining SMFS and TREC opens new possibilities for the characterization of MIP surfaces with single molecule resolution under physiological conditions. - Graphical abstract: Simultaneous Topography and RECognition (TREC) imaging is a novel characterization technique to reveal binding sites on molecularly imprinted polymer surfaces with single molecule resolution under physiological conditions. - Highlights: • Highly specific streptavidin printed polymer surfaces were synthesized. • Unbinding kinetic rate of single streptavidin molecule was studied by SMFS. • The distribution of binding pockets was revealed for the first time by TREC imaging. • TREC showed that the binding pockets formed nano-domains on MIP surface. • SMFS and TREC are powerful AFM techniques for characterization of MIP surfaces.

  18. A GIS approach for predicting prehistoric site locations.

    Energy Technology Data Exchange (ETDEWEB)

    Kuiper, J. A.; Wescott, K. L.

    1999-08-04

    Use of geographic information system (GIS)-based predictive mapping to locate areas of high potential for prehistoric archaeological sites is becoming increasingly popular among archaeologists. Knowledge of the environmental variables influencing activities of original inhabitants is used to produce GIS layers representing the spatial distribution of those variables. The GIS layers are then analyzed to identify locations where combinations of environmental variables match patterns observed at known prehistoric sites. Presented are the results of a study to locate high-potential areas for prehistoric sites in a largely unsurveyed area of 39,000 acres in the Upper Chesapeake Bay region, including details of the analysis process. The project used environmental data from over 500 known sites in other parts of the region and the results corresponded well with known sites in the study area.

  19. MHC class I epitope binding prediction trained on small data sets

    DEFF Research Database (Denmark)

    Lundegaard, Claus; Nielsen, Morten; Lamberth, K.;

    2004-01-01

    prediction methods exist only for alleles where the binding pattern has been deduced from peptide motifs. Using empirical knowledge of important anchor positions within the binding peptides dramatically reduces the number of peptides needed for reliable predictions. We here present a general method...... for predicting peptides binding to specific MHC class I alleles. The method combines advanced automatic scoring matrix generation with empirical position specific differential anchor weighting. The method leads to predictions with a comparable or higher accuracy than other established prediction servers, even...

  20. A new protein binding pocket similarity measure based on comparison of clouds of atoms in 3D: application to ligand prediction

    Directory of Open Access Journals (Sweden)

    Zaslavskiy Mikhail

    2010-02-01

    Background: Predicting which molecules can bind to a given binding site of a protein with known 3D structure is important to decipher the protein function, and useful in drug design. A classical assumption in structural biology is that proteins with similar 3D structures have related molecular functions, and therefore may bind similar ligands. However, proteins that do not display any overall sequence or structure similarity may also bind similar ligands if they contain similar binding sites. Quantitatively assessing the similarity between binding sites may therefore be useful to propose new ligands for a given pocket, based on those known for similar pockets. Results: We propose a new method to quantify the similarity between binding pockets, and explore its relevance for ligand prediction. We represent each pocket by a cloud of atoms, and assess the similarity between two pockets by aligning their atoms in the 3D space and comparing the resulting configurations with a convolution kernel. Pocket alignment and comparison is possible even when the corresponding proteins share no sequence or overall structure similarities. In order to predict ligands for a given target pocket, we compare it to an ensemble of pockets with known ligands to identify the most similar pockets. We discuss two criteria to evaluate the performance of a binding pocket similarity measure in the context of ligand prediction, namely, area under ROC curve (AUC) scores and classification based scores. We show that the latter is better suited to evaluate the methods with respect to ligand prediction, and demonstrate the relevance of our new binding site similarity compared to existing similarity measures. Conclusions: This study demonstrates the relevance of the proposed method to identify ligands binding to known binding pockets. We also provide a new benchmark for future work in this field. The new method and the benchmark are available at http://cbio.ensmp.fr/paris/.
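
    The comparison step described in this abstract (two atom clouds scored with a convolution kernel) can be illustrated with a minimal Python sketch. It assumes the two clouds are already aligned, uses a plain Gaussian kernel with an arbitrary width `sigma`, and normalises the score to [0, 1]; the function names, the kernel choice, and the coordinates are illustrative assumptions, not the authors' published method.

```python
import math

def cloud_kernel(cloud_a, cloud_b, sigma=1.0):
    """Sum a Gaussian term over every atom pair of two 3D point clouds.

    Assumption: the clouds are already superimposed; the alignment step
    mentioned in the abstract is not reproduced here.
    """
    total = 0.0
    for a in cloud_a:
        for b in cloud_b:
            total += math.exp(-math.dist(a, b) ** 2 / (2.0 * sigma ** 2))
    return total

def pocket_similarity(cloud_a, cloud_b, sigma=1.0):
    """Normalised kernel score: 1.0 for identical clouds, lower otherwise."""
    k_ab = cloud_kernel(cloud_a, cloud_b, sigma)
    k_aa = cloud_kernel(cloud_a, cloud_a, sigma)
    k_bb = cloud_kernel(cloud_b, cloud_b, sigma)
    return k_ab / math.sqrt(k_aa * k_bb)

# Hypothetical atom coordinates (in angstroms) for two small pockets.
pocket = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0), (0.0, 1.5, 0.0)]
shifted = [(x + 5.0, y, z) for (x, y, z) in pocket]

print(pocket_similarity(pocket, pocket))
print(pocket_similarity(pocket, shifted))
```

    Ranking an ensemble of pockets with known ligands by this score against a target pocket would then suggest candidate ligands, in the spirit of the procedure the abstract outlines.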

  1. Arabidopsis AtADF1 is Functionally Affected by Mutations on Actin Binding Sites

    Institute of Scientific and Technical Information of China (English)

    Chun-Hai Dong; Wei-Ping Tang; Jia-Yao Liu

    2013-01-01

    The plant actin depolymerizing factor (ADF) binds to both monomeric and filamentous actin, and is directly involved in the depolymerization of actin filaments. To better understand the actin binding sites of the Arabidopsis thaliana L. AtADF1, we generated mutants of AtADF1 and investigated their functions in vitro and in vivo. Analysis of mutants harboring amino acid substitutions revealed that charged residues (Arg98 and Lys100) located at the α-helix 3 and forming an actin binding site together with the N-terminus are essential for both G- and F-actin binding. The basic residues on the β-strand 5 (K82/A) and the α-helix 4 (R135/A, R137/A) form another actin binding site that is important for F-actin binding. Using transient expression of CFP-tagged AtADF1 mutant proteins in onion (Allium cepa) peel epidermal cells and transgenic Arabidopsis thaliana L. plants overexpressing these mutants, we analyzed how these mutant proteins regulate actin organization and affect seedling growth. Our results show that the ADF mutants with a lower affinity for actin filament binding can still be functional, unless the affinity for actin monomers is also affected. The G-actin binding activity of the ADF plays an essential role in actin binding, depolymerization of actin polymers, and therefore in the control of actin organization.

  2. Ligand-binding sites in human serum amyloid P component

    DEFF Research Database (Denmark)

    Heegaard, N.H.H.; Heegaard, Peter M. H.; Roepstorff, P.;

    1996-01-01

    Amyloid P component (AP) is a naturally occurring glycoprotein that is found in serum and basement membranes. AP is also a component of all types of amyloid, including that found in individuals who suffer from Alzheimer's disease and Down's syndrome. Because AP has been found to bind strongly...... of 25 μM, while the IC50 of AP-(27-38)-peptide and AP-(33-38)-peptide are 10 μM and 2 μM, respectively. The understanding of the structure and function of active AP peptides will be useful for development of amyloid-targeted diagnostics and therapeutics....

  3. Radiolabelling of phoneutria nigriventer spider toxin (Tx1): a tool to study its binding site

    Energy Technology Data Exchange (ETDEWEB)

    Santos, Raquel Gouvea dos [Centro de Desenvolvimento da Tecnologia Nuclear (CDTN), Belo Horizonte, MG (Brazil); Diniz, Carlos Roberto; Nascimento, Marta Cordeiro [FUNED, Belo Horizonte, MG (Brazil); Lima, Maria Elena de [Minas Gerais Univ., Belo Horizonte, MG (Brazil). Dept. de Bioquimica e Imunologia

    1996-07-01

    The neurotoxin Tx1, isolated from the venom of the South American spider Phoneutria nigriventer, produces tail elevation and spastic paralysis of posterior limbs after intracerebral ventricular injection in mice. Tx1 also produces ileum contraction in bioassay. We have investigated the binding of radioiodinated Tx1 (¹²⁵I-Tx1) on the preparation of myenteric plexus-longitudinal muscle membrane from guinea pig ileum (MPLM) as a tool to characterize the interaction of this neurotoxin with its site. The neurotoxin Tx1 was radioiodinated with Na¹²⁵I by the lactoperoxidase method. ¹²⁵I-Tx1 specifically binds to a single class of noninteracting binding sites of high affinity (Kd = 3.5 × 10⁻¹⁰ M) and low capacity (1.2 pmol/mg protein). The specific binding increased in parallel with the protein concentration. In competition experiments the ligands of ionic channels used (sodium, potassium and calcium) did not affect the binding of ¹²⁵I-Tx1 to MPLM, nor did the cholinergic ligands (hemicholinium-3, hexamethonium, d-tubocurarine and atropine). Another neurotoxin (Tx2-6, one of the isoforms of Tx2 pool) decreased toxin with MPLM and showed that toxin has a specific and saturable binding site in guinea pig ileum and this binding site appears to be related to the Tx2 site. (author)
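
    A single class of noninteracting sites, as reported above, is conventionally described by the one-site saturation isotherm B = Bmax·[L] / (Kd + [L]). The Python sketch below plugs in the Kd and capacity quoted in the abstract; the free-ligand concentrations are hypothetical, and the function names are illustrative only.

```python
# Values quoted in the abstract.
KD_MOLAR = 3.5e-10           # dissociation constant, M
BMAX_PMOL_PER_MG = 1.2       # binding capacity, pmol/mg protein

def specific_binding(free_ligand_molar, kd=KD_MOLAR, bmax=BMAX_PMOL_PER_MG):
    """One-site saturation isotherm: bound ligand in pmol/mg protein."""
    return bmax * free_ligand_molar / (kd + free_ligand_molar)

def fractional_occupancy(free_ligand_molar, kd=KD_MOLAR):
    """Fraction of sites occupied at a given free ligand concentration."""
    return free_ligand_molar / (kd + free_ligand_molar)

# Hypothetical free-ligand concentrations spanning the Kd.
for conc in (3.5e-11, 3.5e-10, 3.5e-9):
    print(f"{conc:.1e} M -> {specific_binding(conc):.3f} pmol/mg "
          f"({fractional_occupancy(conc):.0%} occupied)")
```

    At a free ligand concentration equal to Kd, half of the sites are occupied, which is why a sub-nanomolar Kd is read as high affinity.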

  4. Identifying ligand binding sites and poses using GPU-accelerated Hamiltonian replica exchange molecular dynamics.

    Science.gov (United States)

    Wang, Kai; Chodera, John D; Yang, Yanzhi; Shirts, Michael R

    2013-12-01

    We present a method to identify small molecule ligand binding sites and poses within a given protein crystal structure using GPU-accelerated Hamiltonian replica exchange molecular dynamics simulations. The Hamiltonians used vary from the physical end state of protein interacting with the ligand to an unphysical end state where the ligand does not interact with the protein. As replicas explore the space of Hamiltonians interpolating between these states, the ligand can rapidly escape local minima and explore potential binding sites. Geometric restraints keep the ligands from leaving the vicinity of the protein and an alchemical pathway designed to increase phase space overlap between intermediates ensures good mixing. Because of the rigorous statistical mechanical nature of the Hamiltonian exchange framework, we can also extract binding free energy estimates for all putative binding sites. We present results of this methodology applied to the T4 lysozyme L99A model system for three known ligands and one non-binder as a control, using an implicit solvent. We find that our methodology identifies known crystallographic binding sites consistently and accurately for the small number of ligands considered here and gives free energies consistent with experiment. We are also able to analyze the contribution of individual binding sites to the overall binding affinity. Our methodology points to near term potential applications in early-stage structure-guided drug discovery.
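
    The exchange step at the core of Hamiltonian replica exchange can be sketched with the standard Metropolis criterion for swapping configurations between two replicas. The Python below is a generic illustration, not the authors' GPU implementation: it assumes the reduced (dimensionless) potential energies of each configuration under each Hamiltonian have already been computed, and all function and parameter names are hypothetical.

```python
import math
import random

def swap_acceptance(u_i_xi, u_j_xj, u_i_xj, u_j_xi):
    """Metropolis probability of swapping configurations x_i and x_j.

    u_a_xb is the reduced potential of configuration x_b evaluated
    under Hamiltonian a.
    """
    delta = (u_i_xj + u_j_xi) - (u_i_xi + u_j_xj)
    if delta <= 0.0:
        return 1.0
    return math.exp(-delta)

def attempt_swap(u_i_xi, u_j_xj, u_i_xj, u_j_xi, rng=random.random):
    """Return True if the proposed swap is accepted."""
    return rng() < swap_acceptance(u_i_xi, u_j_xj, u_i_xj, u_j_xi)

# Hypothetical reduced energies for one proposed exchange.
print(swap_acceptance(u_i_xi=-10.0, u_j_xj=-2.0, u_i_xj=-9.0, u_j_xi=-1.5))
```

    The acceptance probability depends on how much the energy distributions of neighbouring Hamiltonians overlap, which is consistent with the abstract's emphasis on an alchemical pathway that increases phase space overlap between intermediates.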

  5. Prediction of TF target sites based on atomistic models of protein-DNA complexes

    Directory of Open Access Journals (Sweden)

    Collado-Vides Julio

    2008-10-01

    Background: The specific recognition of genomic cis-regulatory elements by transcription factors (TFs) plays an essential role in the regulation of coordinated gene expression. Studying the mechanisms determining binding specificity in protein-DNA interactions is thus an important goal. Most current approaches for modeling TF specific recognition rely on the knowledge of large sets of cognate target sites and consider only the information contained in their primary sequence. Results: Here we describe a structure-based methodology for predicting sequence motifs starting from the coordinates of a TF-DNA complex. Our algorithm combines information regarding the direct and indirect readout of DNA into an atomistic statistical model, which is used to estimate the interaction potential. We first measure the ability of our method to correctly estimate the binding specificities of eight prokaryotic and eukaryotic TFs that belong to different structural superfamilies. Secondly, the method is applied to two homology models, finding that sampling of interface side-chain rotamers remarkably improves the results. Thirdly, the algorithm is compared with a reference structural method based on contact counts, obtaining comparable predictions for the experimental complexes and more accurate sequence motifs for the homology models. Conclusion: Our results demonstrate that atomic-detail structural information can be feasibly used to predict TF binding sites. The computational method presented here is universal and might be applied to other systems involving protein-DNA recognition.

  6. Quantitative determination of angiotensin II binding sites in rat brain and pituitary gland by autoradiography

    Energy Technology Data Exchange (ETDEWEB)

    Israel, A.; Correa, F.M.A.; Niwa, M.; Saavedra, J.M. (National Inst. of Mental Health, Bethesda, MD (USA))

    1984-11-26

    Rat brain and pituitary angiotensin II (AII) binding sites were quantitated by incubation of tissue sections with ¹²⁵I-(Sar¹) AII, Ultrofilm radioautography, computerized densitometry, and comparison with ¹²⁵I-standards at appropriate film exposure times. The highest number of AII binding sites was found in anterior pituitary and the circumventricular organs, organon subfornicalis and organon vasculosum laminae terminalis.

  7. Guanine Nucleotides Modulate Cell Surface cAMP-Binding Sites in Membranes from Dictyostelium discoideum

    NARCIS (Netherlands)

    Haastert, Peter J.M. van

    1984-01-01

    D. discoideum contains kinetically distinguishable cell surface cAMP binding sites. One class, S, is slowly dissociating and has high affinity for cAMP (Kd = 15 nM, t½ = 15 s). A second class is fast dissociating (t½ about 1 s) and is composed of high affinity binding sites H (Kd ≈ 60 nM), and low a

  8. Medial Temporal Lobe Activity Predicts Successful Relational Memory Binding

    Science.gov (United States)

    Hannula, Deborah E.; Ranganath, Charan

    2009-01-01

    Previous neuropsychological findings have implicated medial temporal lobe (MTL) structures in retaining object-location relations over the course of short delays, but MTL effects have not always been reported in neuroimaging investigations with similar short-term memory requirements. Here, we used event-related functional magnetic resonance imaging to test the hypothesis that the hippocampus and related MTL structures support accurate retention of relational memory representations, even across short delays. On every trial, four objects were presented, each in one of nine possible locations of a three-dimensional grid. Participants were to mentally rotate the grid and then maintain the rotated representation in anticipation of a test stimulus: a rendering of the grid, rotated 90° from the original viewpoint. The test stimulus was either a “match” display, in which object-location relations were intact, or a “mismatch” display, in which one object occupied a new, previously unfilled location (mismatch position), or two objects had swapped locations (mismatch swap). Encoding phase activation in anterior and posterior regions of the left hippocampus, and in bilateral perirhinal cortex, predicted subsequent accuracy on the short-term memory decision, as did bilateral posterior hippocampal activity after the test stimulus. Notably, activation in these posterior hippocampal regions was also sensitive to the degree to which object-location bindings were preserved in the test stimulus; activation was greatest for match displays, followed by mismatch-position displays, and finally mismatch-swap displays. These results indicate that the hippocampus and related MTL structures contribute to successful encoding and retrieval of relational information in visual short-term memory. PMID:18171929

  9. Late Pregnancy Thyroid-Binding Globulin Predicts Perinatal Depression

    Science.gov (United States)

    Pedersen, Cort; Leserman, Jane; Garcia, Nacire; Stansbury, Melissa; Meltzer-Brody, Samantha; Johnson, Jacqueline

    2016-01-01

    Previously we found that late pregnancy total and free thyroxine (TT4, FT4) concentrations were negatively related to greater pre and/or postpartum depressive symptoms. In a much larger cohort, the current study examined whether these thyroid indices measured earlier in the third trimester (31-33 weeks) predict subsequent perinatal depression and anxiety ratings as well as syndromal depression. Thyroid-binding globulin (TBG) concentrations increase markedly during pregnancy and may be an index of sensitivity to elevated estrogen levels. TBG was examined in this study because prior findings suggest that postpartum depression is related to sensitivity to mood destabilization by elevated sex hormone concentrations during pregnancy. Our cohort was 199 euthyroid women recruited from a public health obstetrics clinic (63.8% Hispanic, 21.6% Black). After screening and blood draws for hormone measures at pregnancy weeks 31-33, subjects were evaluated during home visits at pregnancy weeks 35-36 as well as postpartum weeks 6 and 12. Evaluations included psychiatric interviews for current and life-time DSM-IV psychiatric history (M.I.N.I.-Plus), subject self-ratings and interviewer ratings for depression and anxiety (Edinburgh Postnatal Depression Scale, Montgomery-Ǻsberg Depression Rating Scale; Spielberger State-Trait Anxiety Inventory, Hamilton Anxiety Inventory), as well as a standardized interview to obtain life-time trauma history. Numerous covariates were included in all regression analyses. Trauma and major depression history were robustly significant predictors of depression and anxiety ratings over the study period when these variables were analyzed individually or in a combined model including FT4 or TBG (pdepression and anxiety ratings (pdepression history, were significant individual predictors of syndromal depression during the study period (pdepression history, FT4 and TBG generally were not significantly predictive of depression or anxiety ratings, and FT4

  10. Quantitative analysis of EGR proteins binding to DNA: assessing additivity in both the binding site and the protein

    Directory of Open Access Journals (Sweden)

    Stormo Gary D

    2005-07-01

    Background: Recognition codes for protein-DNA interactions typically assume that the interacting positions contribute additively to the binding energy. While this is known to not be precisely true, an additive model over the DNA positions can be a good approximation, at least for some proteins. Much less information is available about whether the protein positions contribute additively to the interaction. Results: Using EGR zinc finger proteins, we measure the binding affinity of six different variants of the protein to each of six different variants of the consensus binding site. Both the protein and binding site variants include single and double mutations that allow us to assess how well additive models can account for the data. For each protein and DNA alone we find that additive models are good approximations, but over the combined set of data there are context effects that limit their accuracy. However, a small modification to the purely additive model, with only three additional parameters, improves the fit significantly. Conclusion: The additive model holds very well for every DNA site and every protein included in this study, but clear context dependence in the interactions was detected. A simple modification to the independent model provides a better fit to the complete data.
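
    The additivity question raised here can be made concrete with a small worked example: under a purely additive model, the binding free-energy change of a double mutant equals the sum of the changes of the two single mutants. The Python sketch below converts dissociation constants to ΔΔG values and reports the deviation from additivity. The Kd values are invented for illustration, and the function names are hypothetical; none of this reproduces the paper's fitted model.

```python
import math

GAS_CONSTANT = 1.987e-3   # kcal / (mol K)
TEMPERATURE = 298.15      # K, assumed room temperature

def ddg(kd_variant, kd_reference):
    """Binding free-energy change relative to the reference, in kcal/mol."""
    return GAS_CONSTANT * TEMPERATURE * math.log(kd_variant / kd_reference)

def additivity_deviation(kd_ref, kd_single_a, kd_single_b, kd_double):
    """Observed double-mutant ddG minus the additive prediction.

    A value near zero means the two positions contribute independently.
    """
    predicted = ddg(kd_single_a, kd_ref) + ddg(kd_single_b, kd_ref)
    observed = ddg(kd_double, kd_ref)
    return observed - predicted

# Hypothetical Kd values in nM: consensus site, two single mutants, double mutant.
print(additivity_deviation(kd_ref=1.0, kd_single_a=4.0, kd_single_b=5.0, kd_double=20.0))
print(additivity_deviation(kd_ref=1.0, kd_single_a=4.0, kd_single_b=5.0, kd_double=60.0))
```

    The first call is exactly additive in this toy data set; the second shows a positive deviation, the kind of context effect the abstract reports over the combined protein and DNA variants.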

  11. Comprehensive prediction of chromosome dimer resolution sites in bacterial genomes

    Directory of Open Access Journals (Sweden)

    Arakawa Kazuharu

    2011-01-01

    Background: During the replication process of bacteria with circular chromosomes, an odd number of homologous recombination events results in concatenated dimer chromosomes that cannot be partitioned into daughter cells. However, many bacteria harbor a conserved dimer resolution machinery consisting of one or two tyrosine recombinases, XerC and XerD, and their 28-bp target site, dif. Results: To study the evolution of the dif/XerCD system and its relationship with replication termination, we report the comprehensive prediction of dif sequences in silico using a phylogenetic prediction approach based on iterated hidden Markov modeling. Using this method, dif sites were identified in 641 organisms among 16 phyla, with a 97.64% identification rate for single-chromosome strains. The dif sequence positions were shown to be strongly correlated with the GC skew shift-point that is induced by replicational mutation/selection pressures, but the difference in the positions of the predicted dif sites and the GC skew shift-points did not correlate with the degree of replicational mutation/selection pressures. Conclusions: The sequence of dif sites is widely conserved among many bacterial phyla, and they can be computationally identified using our method. The lack of correlation between dif position and the degree of GC skew suggests that replication termination does not occur strictly at dif sites.
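
    The GC skew shift-point mentioned above is commonly located from a cumulative skew curve, which adds 1 for every G and subtracts 1 for every C along the sequence; the extrema of that curve mark where the skew changes sign. The Python sketch below shows this generic calculation on a toy sequence. It is an assumption-laden illustration (linear rather than circular sequence, per-base rather than windowed skew) and is not the procedure used in the study.

```python
def cumulative_gc_skew(sequence):
    """Running total of (+1 per G, -1 per C) along the sequence."""
    total = 0
    curve = []
    for base in sequence.upper():
        if base == "G":
            total += 1
        elif base == "C":
            total -= 1
        curve.append(total)
    return curve

def skew_shift_points(sequence):
    """Indices of the minimum and maximum of the cumulative skew curve."""
    curve = cumulative_gc_skew(sequence)
    index_min = min(range(len(curve)), key=curve.__getitem__)
    index_max = max(range(len(curve)), key=curve.__getitem__)
    return index_min, index_max

# Toy sequence: C-rich, then G-rich, then C-rich again.
toy = "CCCCGGGGGGGGCCCC"
print(cumulative_gc_skew(toy))
print(skew_shift_points(toy))
```

    Comparing such shift-point coordinates with predicted dif positions is the kind of positional comparison the abstract describes.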

  12. Cycloxaprid insecticide: nicotinic acetylcholine receptor binding site and metabolism.

    Science.gov (United States)

    Shao, Xusheng; Swenson, Tami L; Casida, John E

    2013-08-21

    Cycloxaprid (CYC) is a novel neonicotinoid prepared from the (nitromethylene)imidazole (NMI) analogue of imidacloprid. In this study we consider whether CYC is active per se or only as a proinsecticide for NMI. The IC50 values (nM) for displacing [³H]NMI binding are 43-49 for CYC and 2.3-3.2 for NMI in house fly and honeybee head membranes and 302 and 7.2, respectively, in mouse brain membranes, potency relationships interpreted as partial conversion of some CYC to NMI under the assay conditions. The 6-8-fold difference in toxicity of injected CYC and NMI to house flies is consistent with their relative potencies as in vivo nicotinic acetylcholine receptor (nAChR) inhibitors in brain measured with [³H]NMI binding assays. CYC metabolism in mice largely involves cytochrome P450 pathways without NMI as a major intermediate. Metabolites of CYC tentatively assigned are five monohydroxy derivatives and one each of dihydroxy, nitroso, and amino modifications. CYC appears to be a proinsecticide, serving as a slow-release reservoir for NMI with selective activity for insect versus mammalian nAChRs.

  13. Theoretical estimates of exposure timescales of protein binding sites on DNA regulated by nucleosome kinetics.

    Science.gov (United States)

    Parmar, Jyotsana J; Das, Dibyendu; Padinhateeri, Ranjith

    2016-02-29

    It is being increasingly realized that nucleosome organization on DNA crucially regulates DNA-protein interactions and the resulting gene expression. While the spatial character of the nucleosome positioning on DNA has been experimentally and theoretically studied extensively, the temporal character is poorly understood. Accounting for ATPase activity and DNA-sequence effects on nucleosome kinetics, we develop a theoretical method to estimate the time of continuous exposure of binding sites of non-histone proteins (e.g. transcription factors and TATA binding proteins) along any genome. Applying the method to Saccharomyces cerevisiae, we show that the exposure timescales are determined by cooperative dynamics of multiple nucleosomes, and their behavior is often different from expectations based on static nucleosome occupancy. Examining exposure times in the promoters of GAL1 and PHO5, we show that our theoretical predictions are consistent with known experiments. We apply our method genome-wide and discover huge gene-to-gene variability of mean exposure times of TATA boxes and patches adjacent to TSS (+1 nucleosome region); the resulting timescale distributions have non-exponential tails.

  14. Characterization of the internal calcium(II) binding sites in dissolved insulin hexamer using europium(III) fluorescence.

    Science.gov (United States)

    Alameda, G K; Evelhoch, J L; Sudmeier, J L; Birge, R R

    1985-03-26

    The fluorescence of Eu(III) is used to study the nature of the Ca(II) binding sites in the central cavity of the two-zinc(II) insulin hexamer. The dependence of the Eu(III) fluorescence lifetime upon Eu(III) stoichiometry indicates that there are three identical Eu(III) binding sites present in the two-zinc(II) insulin hexamer in solution. Addition of excess Ca(II) causes a decrease in the Eu(III) fluorescence intensity, confirming that Ca(II) competes for the observed Eu(III) sites. The solvent dependence of the Eu(III) fluorescence lifetime (H2O vs. D2O) indicates that four OH groups are coordinated to each Eu(III) in the hexamer. Substitution of Co(II) for Zn(II) causes a decrease in the Eu(III) fluorescence lifetime. Calculations based on Förster energy-transfer theory predict that the Co(II) [or Zn(II) in vivo] and Eu(III) [or Ca(II) in vivo] binding sites are separated by 9.6 ± 0.5 Å. Variation of the metal stoichiometries indicates that all three Eu(III) [or Ca(II) in vivo] sites are equidistant from the Zn(II) sites. We conclude that these sites are identical with the three central Zn(II) sites present in insulin hexamer crystals soaked in excess Zn(II) [Emdin, S. O., Dodson, G., Cutfield, J. M., & Cutfield, S. M. (1980) Diabetologia 19, 174-182] and suggest that these central sites are occupied by Ca(II) in vivo.
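
    The distance estimate above follows the usual Förster route: the donor lifetime with and without the acceptor gives a transfer efficiency E = 1 - τ_DA/τ_D, and the donor-acceptor separation is r = R0·(1/E - 1)^(1/6). The Python sketch below shows that arithmetic. The lifetimes and the Förster radius R0 are hypothetical placeholders, since the abstract does not report them, and the function names are illustrative.

```python
def transfer_efficiency(tau_donor_acceptor, tau_donor):
    """Energy-transfer efficiency from donor lifetimes with and without acceptor."""
    return 1.0 - tau_donor_acceptor / tau_donor

def donor_acceptor_distance(forster_radius, efficiency):
    """Separation r from E = 1 / (1 + (r / R0)**6), in the units of R0."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return forster_radius * (1.0 / efficiency - 1.0) ** (1.0 / 6.0)

# Hypothetical inputs: lifetimes in ms, Förster radius in angstroms.
efficiency = transfer_efficiency(tau_donor_acceptor=0.20, tau_donor=0.30)
print(efficiency)
print(donor_acceptor_distance(forster_radius=9.0, efficiency=efficiency))
```

    Because the distance depends on the sixth root of the efficiency term, moderate uncertainty in the measured lifetimes translates into a comparatively small uncertainty in r.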

  15. PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

    Directory of Open Access Journals (Sweden)

    Adeel Malik

    2010-01-01

    Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins) as well as noncovalently (protein-carbohydrate complexes) is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by de novo tools that can predict carbohydrate-binding sites for proteins in the absence of the structure of an experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i) the Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from the Protein Data Bank (PDB), (ii) the Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii) the CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.

  16. Probing substrate binding to Metallo-β-Lactamase L1 from Stenotrophomonas maltophilia by using site-directed mutagenesis

    Directory of Open Access Journals (Sweden)

    Yates Robert B

    2002-02-01

    Background: The metallo-β-lactamases are Zn(II)-containing enzymes that hydrolyze the β-lactam bond in penicillins, cephalosporins, and carbapenems and are involved in bacterial antibiotic resistance. There are at least 20 distinct organisms that produce a metallo-β-lactamase, and these enzymes have been extensively studied using X-ray crystallographic, computational, kinetic, and inhibition studies; however, much is still unknown about how substrates bind and the catalytic mechanism. In an effort to probe substrate binding to metallo-β-lactamase L1 from Stenotrophomonas maltophilia, nine site-directed mutants of L1 were prepared and characterized using metal analyses, CD spectroscopy, and pre-steady state and steady state kinetics. Results: Site-directed mutations were generated of amino acids previously predicted to be important in substrate binding. Steady-state kinetic studies using the mutant enzymes and 9 different substrates demonstrated varying Km and kcat values for the different enzymes and substrates and that no direct correlation between Km and the effect of the mutation on substrate binding could be drawn. Stopped-flow fluorescence studies using nitrocefin as the substrate showed that only the S224D and Y228A mutants exhibited weaker nitrocefin binding. Conclusions: The data presented herein indicate that Ser224, Ile164, Phe158, Tyr228, and Asn233 are not essential for tight binding of substrate to metallo-β-lactamase L1. The results in this work also show that Km values are not reliable for showing substrate binding, and there is no correlation between substrate binding and the amount of reaction intermediate formed during the reaction. This work represents the first experimental testing of one of the computational models of the metallo-β-lactamases.

  17. Predictive model of cationic surfactant binding to humic substances

    NARCIS (Netherlands)

    Ishiguro, M.; Koopal, L.K.

    2011-01-01

    The humic substances (HS) have a high reactivity with other components in the natural environment. An important factor for the reactivity of HS is their negative charge. Cationic surfactants bind strongly to HS by electrostatic and specific interaction. Therefore, a surfactant binding model is devel

  18. Identifying functional transcription factor binding sites in yeast by considering their positional preference in the promoters.

    Directory of Open Access Journals (Sweden)

    Fu-Jou Lai

    Transcription factor binding site (TFBS) identification plays an important role in deciphering gene regulatory codes. With comprehensive knowledge of TFBSs, one can understand molecular mechanisms of gene regulation. In recent decades, various computational approaches have been proposed to predict TFBSs in the genome. The TFBS dataset of a TF generated by each algorithm is a ranked list of predicted TFBSs of that TF, where top ranked TFBSs are statistically significant ones. However, whether these statistically significant TFBSs are functional (i.e. biologically relevant) is still unknown. Here we develop a post-processor, called the functional propensity calculator (FPC), to assign a functional propensity to each TFBS in the existing computationally predicted TFBS datasets. It is known that functional TFBSs reveal strong positional preference towards the transcriptional start site (TSS). This motivates us to take TFBS position relative to the TSS as the key idea in building our FPC. Based on our calculated functional propensities, the TFBSs of a TF in the original TFBS dataset could be reordered, where top ranked TFBSs are now the ones with high functional propensities. To validate the biological significance of our results, we perform three published statistical tests to assess the enrichment of Gene Ontology (GO) terms, the enrichment of physical protein-protein interactions, and the tendency of being co-expressed. The top ranked TFBSs in our reordered TFBS dataset outperform the top ranked TFBSs in the original TFBS dataset, justifying the effectiveness of our post-processor in extracting functional TFBSs from the original TFBS dataset. More importantly, assigning functional propensities to putative TFBSs enables biologists to easily identify which TFBSs in the promoter of interest are likely to be biologically relevant and are good candidates for further detailed experimental investigation. The FPC is implemented as a web tool at http://santiago.ee.ncku.edu.tw/FPC/.

  19. The role of CTCF binding sites in the 3' immunoglobulin heavy chain regulatory region.

    Science.gov (United States)

    Birshtein, Barbara K

    2012-01-01

    The immunoglobulin heavy chain locus undergoes a series of DNA rearrangements and modifications to achieve the construction and expression of individual antibody heavy chain genes in B cells. These events affect variable regions, through VDJ joining and subsequent somatic hypermutation, and constant regions through class switch recombination (CSR). Levels of IgH expression are also regulated during B cell development, resulting in high levels of secreted antibodies from fully differentiated plasma cells. Regulation of these events has been attributed primarily to two cis-elements that work from long distances on their target sequences, i.e., an ∼1 kb intronic enhancer, Eμ, located between the V region segments and the most 5' constant region gene, Cμ; and an ∼40 kb 3' regulatory region (3' RR) that is located downstream of the most 3' C(H) gene, Cα. The 3' RR is a candidate for an "end" of B cell-specific regulation of the Igh locus. The 3' RR contains several B cell-specific enhancers associated with DNase I hypersensitive sites (hs1-4), which are essential for CSR and for high levels of IgH expression in plasma cells. Downstream of this enhancer-containing region is a region of high-density CTCF binding sites, which extends through hs5, 6, and 7 and further downstream. CTCF, with its enhancer-blocking activities, has been associated with all mammalian insulators and implicated in multiple chromosomal interactions. Here we address the 3' RR CTCF-binding region as a potential insulator of the Igh locus, an independent regulatory element and a predicted modulator of the activity of 3' RR enhancers. Using chromosome conformation capture technology, chromatin immunoprecipitation, and genetic approaches, we have found that the 3' RR with its CTCF-binding region interacts with target sequences in the V(H), Eμ, and C(H) regions through DNA looping as regulated by protein binding. This region impacts on B cell-specific Igh processes at different stages of B cell

  20. In vitro and in vivo characterisation of [3H]ANSTO-14 binding to the sigma 1 binding sites.

    Science.gov (United States)

    Nguyen, V H; Mardon, K; Kassiou, M; Christie, M D

    1999-02-01

    N-(4-phenylbutyl)-3-hydroxy-4-azahexacyclo[5.4.1.0(2,6).0(3,10).0(5,9).0(8,11)]dodecane (ANSTO-14) showed the highest activity for the sigma 1 site (Ki = 9.4 nM) and 19-fold sigma 1/sigma 2 selectivity. The present study showed that [3H]ANSTO-14 binds to a single high-affinity site in guinea pig brain membranes with an equilibrium Ki of 8.0 +/- 0.3 nM, in good agreement with the kinetic studies (Kd = 13.3 +/- 5.4 nM, n = 4), and a Bmax of 3,199 +/- 105 fmol/mg protein (n = 4). The in vivo biodistribution of [3H]ANSTO-14 showed a high uptake in the diencephalon. Pretreatment of rats with sigma ligands including (+)-pentazocine (sigma 1), ANSTO-14 (sigma 1), and DTG (sigma 1 and sigma 2) did not significantly reduce radiotracer uptake in the brain, but did in the spleen. A labelled metabolite was found in the liver and brain. Due to its insensitivity to sigma ligands, the accumulation of [3H]ANSTO-14 in the brain indicates high nonspecific binding. Therefore, [3H]ANSTO-14 is a suitable ligand for labelling sigma 1 sites in vitro but is not suitable for brain imaging of sigma binding sites in vivo.

  1. The binding sites for cocaine and dopamine in the dopamine transporter overlap

    DEFF Research Database (Denmark)

    Beuming, Thijs; Kniazeff, Julie; Bergmann, Marianne L

    2008-01-01

    Cocaine is a widely abused substance with psychostimulant effects that are attributed to inhibition of the dopamine transporter (DAT). We present molecular models for DAT binding of cocaine and cocaine analogs constructed from the high-resolution structure of the bacterial transporter homolog Leu......T. Our models suggest that the binding site for cocaine and cocaine analogs is deeply buried between transmembrane segments 1, 3, 6 and 8, and overlaps with the binding sites for the substrates dopamine and amphetamine, as well as for benztropine-like DAT inhibitors. We validated our models by detailed...... inhibition of dopamine transport by cocaine....

  2. Quantitative predictions of binding free energy changes in drug-resistant influenza neuraminidase.

    Directory of Open Access Journals (Sweden)

    Daniel R Ripoll

    Quantitatively predicting changes in drug sensitivity associated with residue mutations is a major challenge in structural biology. By expanding the limits of free energy calculations, we successfully identified mutations in influenza neuraminidase (NA) that confer drug resistance to two antiviral drugs, zanamivir and oseltamivir. We augmented molecular dynamics (MD) with Hamiltonian Replica Exchange and calculated binding free energy changes for H274Y, N294S, and Y252H mutants. Based on experimental data, our calculations achieved high accuracy and precision compared with results from established computational methods. Analysis of 15 μs of aggregated MD trajectories provided insights into the molecular mechanisms underlying drug resistance that are at odds with current interpretations of the crystallographic data. Contrary to the notion that resistance is caused by mutant-induced changes in hydrophobicity of the binding pocket, our simulations showed that drug resistance mutations in NA led to subtle rearrangements in the protein structure and its dynamics that together alter the active-site electrostatic environment and modulate inhibitor binding. Importantly, different mutations confer resistance through different conformational changes, suggesting that a generalized mechanism for NA drug resistance is unlikely.

  3. Surface binding sites in amylase have distinct roles in recognition of starch structure motifs and degradation

    DEFF Research Database (Denmark)

    Cockburn, Darrell; Nielsen, Morten M.; Christiansen, Camilla

    2015-01-01

    Carbohydrate converting enzymes often possess extra substrate binding regions that enhance their activity. These can be found either on separate domains termed carbohydrate binding modules or as so-called surface binding sites (SBSs) situated on the catalytic domain. SBSs are common in starch...... to soluble polysaccharides and oligosaccharides with α-1,6 linkages, suggesting that branch points are key structural elements in recognition by SBS2. Mutation at both SBS1 and SBS2 eliminated binding to all starch granule types tested. Taken together, the findings indicate that the two SBSs act in concert...

  4. Differential Modulation of Annexin I Binding Sites on Monocytes and Neutrophils

    Directory of Open Access Journals (Sweden)

    H. S. Euzger

    1999-01-01

    Specific binding sites for the anti-inflammatory protein annexin I have been detected on the surface of human monocytes and polymorphonuclear leukocytes (PMN). These binding sites are proteinaceous in nature and are sensitive to cleavage by the proteolytic enzymes trypsin, collagenase, elastase and cathepsin G. When monocytes and PMN were isolated independently from peripheral blood, only the monocytes exhibited constitutive annexin I binding. However, PMN acquired the capacity to bind annexin I following co-culture with monocytes. PMN incubation with sodium azide, but not protease inhibitors, partially blocked this process. A similar increase in annexin I binding capacity was also detected in PMN following adhesion to endothelial monolayers. We propose that a juxtacrine activation rather than a cleavage-mediated transfer is involved in this process. Removal of annexin I binding sites from monocytes with elastase rendered monocytes functionally insensitive to full length annexin I or to the annexin I-derived pharmacophore, peptide Ac2-26, assessed as suppression of the respiratory burst. These data indicate that the annexin I binding site on phagocytic cells may have an important function in the feedback control of the inflammatory response and their loss through cleavage could potentiate such responses.

  5. A Disease-Causing Variant in PCNA Disrupts a Promiscuous Protein Binding Site.

    Science.gov (United States)

    Duffy, Caroline M; Hilbert, Brendan J; Kelch, Brian A

    2016-03-27

    The eukaryotic DNA polymerase sliding clamp, proliferating cell nuclear antigen or PCNA, is a ring-shaped protein complex that surrounds DNA to act as a sliding platform for increasing processivity of cellular replicases and for coordinating various cellular pathways with DNA replication. A single point mutation, Ser228Ile, in the human PCNA gene was recently identified to cause a disease whose symptoms resemble those of DNA damage and repair disorders. The mutation lies near the binding site for most PCNA-interacting proteins. However, the structural consequences of the S228I mutation are unknown. Here, we describe the structure of the disease-causing variant, which reveals a large conformational change that dramatically transforms the binding pocket for PCNA client proteins. We show that the mutation markedly alters the binding energetics for some client proteins, while another, p21(CIP1), is only mildly affected. Structures of the disease variant bound to peptides derived from two PCNA partner proteins reveal that the binding pocket can adjust conformation to accommodate some ligands, indicating that the binding site is dynamic and pliable. Our work has implications for the plasticity of the binding site in PCNA and reveals how a disease mutation selectively alters interactions to a promiscuous binding site that is critical for DNA metabolism.

  6. Putative cholesterol-binding sites in human immunodeficiency virus (HIV) coreceptors CXCR4 and CCR5.

    Science.gov (United States)

    Zhukovsky, Mikhail A; Lee, Po-Hsien; Ott, Albrecht; Helms, Volkhard

    2013-04-01

    Using molecular docking, we identified a cholesterol-binding site in the groove between transmembrane helices 1 and 7 near the inner membrane-water interface of the G protein-coupled receptor CXCR4, a coreceptor for HIV entry into cells. In this docking pose, the amino group of lysine K67 establishes a hydrogen bond with the hydroxyl group of cholesterol, whereas tyrosine Y302 stacks with cholesterol by its aromatic side chain, and a number of residues form hydrophobic contacts with cholesterol. Sequence alignment showed that a similar putative cholesterol-binding site is also present in CCR5, another HIV coreceptor. We suggest that the interaction of cholesterol with these putative cholesterol-binding sites in CXCR4 and CCR5 is responsible for the presence of these receptors in lipid rafts, for the effect of cholesterol on their conformational stability and function, and for the role that cell cholesterol plays in the cell entry of HIV strains that use these membrane proteins as coreceptors. We propose that mutations of residues that are involved in cholesterol binding will make CXCR4 and CCR5 insensitive to membrane cholesterol content. Cholesterol-binding sites in HIV coreceptors are potential targets for steroid drugs that bind to CXCR4 and CCR5 with higher binding affinity than cholesterol, but do not stabilize the native conformation of these proteins.

  7. Syntax compensates for poor binding sites to encode tissue specificity of developmental enhancers.

    Science.gov (United States)

    Farley, Emma K; Olson, Katrina M; Zhang, Wei; Rokhsar, Daniel S; Levine, Michael S

    2016-06-07

    Transcriptional enhancers are short segments of DNA that switch genes on and off in response to a variety of intrinsic and extrinsic signals. Despite the discovery of the first enhancer more than 30 years ago, the relationship between primary DNA sequence and enhancer activity remains obscure. In particular, the importance of "syntax" (the order, orientation, and spacing of binding sites) is unclear. A high-throughput screen identified synthetic notochord enhancers that are activated by the combination of ZicL and ETS transcription factors in Ciona embryos. Manipulation of these enhancers elucidated a "regulatory code" of sequence and syntax features for notochord-specific expression. This code enabled in silico discovery of bona fide notochord enhancers, including those containing low-affinity binding sites that would be excluded by standard motif identification methods. One of the newly identified enhancers maps upstream of the known enhancer that regulates Brachyury (Ci-Bra), a key determinant of notochord specification. This newly identified Ci-Bra shadow enhancer contains binding sites with very low affinity, but optimal syntax, and therefore mediates surprisingly strong expression in the notochord. Weak binding sites are compensated by optimal syntax, whereas enhancers containing high-affinity binding sites possess suboptimal syntax. We suggest this balance has obscured the importance of regulatory syntax, as noncanonical binding motifs are typically disregarded by enhancer detection methods. As a result, enhancers with low binding affinities but optimal syntax may be a vastly underappreciated feature of the regulatory genome.

  8. Detection of cell type and marker specificity of nuclear binding sites for anionic carbohydrate ligands.

    Science.gov (United States)

    Chovanec, M; Smetana, K; Purkrábková, T; Holíková, Z; Dvoránková, B; André, S; Pytlík, R; Hozák, P; Plzák, J; Sedo, A; Vacík, J; Gabius, H

    2004-01-01

    The emerging functionality of glycosaminoglycan chains engenders interest in localizing specific binding sites using cytochemical tools. We investigated nuclear binding of labeled heparin, heparan sulfate, a sulfated fucan, chondroitin sulfate, and hyaluronic acid in epidermal keratinocytes, bone marrow stromal cells, 3T3 fibroblasts and glioma cells using chemically prepared biotinylated probes. Binding of the markers was cell-type specific and influenced by extraction of histones, but was not markedly affected by degree of proliferation, differentiation or malignancy. Cell uptake of labeled heparin and other selected probes and their transport into the nucleus also was monitored. Differences between keratinocytes and bone marrow stromal cells were found. Preincubation of permeabilized bone marrow stromal cells with label-free heparin reduced the binding of carrier-immobilized hydrocortisone to its nuclear receptors. Thus, these tools enabled binding sites for glycosaminoglycans to be monitored in routine assays.

  9. Effect of cysteamine on cytosolic somatostatin binding sites in rabbit duodenal mucosa.

    Science.gov (United States)

    Gonzalez-Guijarro, L; Lopez-Ruiz, M P; Bodegas, G; Prieto, J C; Arilla, E

    1987-04-01

    Administration of cysteamine in rabbits elicited a rapid depletion of both duodenal mucosa and plasma somatostatin. A significant reduction was observed within 5 min, returning toward control values by 150 min. The depletion of somatostatin was associated with an increase in the binding capacity and a decrease in the affinity of both high- and low-affinity binding sites present in cytosol of duodenal mucosa. Incubation of cytosolic fraction from control rabbits with 1 mM cysteamine did not modify somatostatin binding. Furthermore, addition of cysteamine at the time of binding assay did not affect the integrity of 125I-Tyr11-somatostatin. It is concluded that in vivo administration of cysteamine to rabbits depletes both duodenal mucosa and plasma somatostatin and leads to up-regulation of duodenal somatostatin binding sites.

  10. Oligomycin frames a common drug-binding site in the ATP synthase

    Energy Technology Data Exchange (ETDEWEB)

    Symersky, Jindrich; Osowski, Daniel; Walters, D. Eric; Mueller, David M. (Rosalind)

    2015-12-01

    We report the high-resolution (1.9 Å) crystal structure of oligomycin bound to the subunit c10 ring of the yeast mitochondrial ATP synthase. Oligomycin binds to the surface of the c10 ring making contact with two neighboring molecules at a position that explains the inhibitory effect on ATP synthesis. The carboxyl side chain of Glu59, which is essential for proton translocation, forms an H-bond with oligomycin via a bridging water molecule but is otherwise shielded from the aqueous environment. The remaining contacts between oligomycin and subunit c are primarily hydrophobic. The amino acid residues that form the oligomycin-binding site are 100% conserved between human and yeast but are widely different from those in bacterial homologs, thus explaining the differential sensitivity to oligomycin. Prior genetics studies suggest that the oligomycin-binding site overlaps with the binding site of other antibiotics, including those effective against Mycobacterium tuberculosis, and thereby frames a common 'drug-binding site.' We anticipate that this drug-binding site will serve as an effective target for new antibiotics developed by rational design.

  11. Exploring the free-energy landscape of carbohydrate-protein complexes: development and validation of scoring functions considering the binding-site topology

    Science.gov (United States)

    Eid, Sameh; Saleh, Noureldin; Zalewski, Adam; Vedani, Angelo

    2014-12-01

    Carbohydrates play a key role in a variety of physiological and pathological processes and, hence, represent a rich source for the development of novel therapeutic agents. Being able to predict binding mode and binding affinity is an essential, yet lacking, aspect of the structure-based design of carbohydrate-based ligands. We assembled a diverse data set comprising 273 carbohydrate-protein crystal structures with known binding affinity and evaluated the prediction accuracy of a large collection of well-established scoring and free-energy functions, as well as combinations thereof. Unfortunately, the tested functions were not capable of reproducing binding affinities in the studied complexes. To simplify the complex free-energy surface of carbohydrate-protein systems, we classified the studied proteins according to the topology and solvent exposure of the carbohydrate-binding site into five distinct categories. A free-energy model based on the proposed classification scheme reproduced binding affinities in the carbohydrate data set with an r² of 0.71 and root-mean-squared-error of 1.25 kcal/mol (N = 236). The improvement in model performance underlines the significance of the differences in the local micro-environments of carbohydrate-binding sites and demonstrates the usefulness of calibrating free-energy functions individually according to binding-site topology and solvent exposure.
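
    The two accuracy figures quoted above can be computed as in the following minimal sketch. It assumes that r² means the squared Pearson correlation between experimental and predicted affinities (the abstract does not say which definition was used), and the affinity values are invented purely for illustration.

```python
import math

def r_squared(experimental, predicted):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(experimental)
    mean_e = sum(experimental) / n
    mean_p = sum(predicted) / n
    cov = sum((e - mean_e) * (p - mean_p) for e, p in zip(experimental, predicted))
    var_e = sum((e - mean_e) ** 2 for e in experimental)
    var_p = sum((p - mean_p) ** 2 for p in predicted)
    return cov ** 2 / (var_e * var_p)

def rmse(experimental, predicted):
    """Root-mean-squared error, in the units of the inputs (e.g. kcal/mol)."""
    n = len(experimental)
    return math.sqrt(sum((p - e) ** 2 for e, p in zip(experimental, predicted)) / n)

# Hypothetical binding free energies in kcal/mol (not data from the study).
experimental = [-5.0, -6.0, -7.0, -8.0]
predicted = [-4.0, -5.0, -6.0, -7.0]

print(r_squared(experimental, predicted))
print(rmse(experimental, predicted))
```

    In this toy case the predictions are perfectly correlated with experiment yet uniformly offset by 1 kcal/mol, which is why a correlation measure and an error measure are usually reported together.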

  12. DNA deformability changes of single base pair mutants within CDE binding sites in S. cerevisiae centromere DNA correlate with measured chromosomal loss rates and CDE binding site symmetries

    Directory of Open Access Journals (Sweden)

    Marx Kenneth A

    2006-03-01

    Background: The centromeres in yeast (S. cerevisiae) are organized by short DNA sequences (125 bp) on each chromosome consisting of 2 conserved elements: CDEI and CDEIII spaced by a CDEII region. CDEI and CDEIII are critical sequence specific protein binding sites necessary for correct centromere formation and, following assembly with proteins, are positioned near each other on a specialized nucleosome. Hegemann et al. (BioEssays 1993, 15: 451–460) reported single base DNA mutants within the critical CDEI and CDEIII binding sites on the centromere of chromosome 6 and quantitated centromere loss of function, which they measured as loss rates for the different chromosome 6 mutants during cell division. Olson et al. (Proc Natl Acad Sci USA 1998, 95: 11163–11168) reported the use of protein-DNA crystallography data to produce a DNA dinucleotide protein deformability energetic scale (PD-scale) that describes local DNA deformability by sequence specific binding proteins. We have used the PD-scale to investigate the DNA sequence dependence of the yeast chromosome 6 mutants' loss rate data. Each single base mutant changes 2 PD-scale values at that changed base position relative to the wild type. In this study, we have utilized these mutants to demonstrate a correlation between the change in DNA deformability of the CDEI and CDEIII core sites and the overall experimentally measured chromosome loss rates of the chromosome 6 mutants. Results: In the CDEI and CDEIII core binding regions an increase in the magnitude of change in deformability of chromosome 6 single base mutants with respect to the wild type correlates to an increase in the measured chromosome loss rate. These correlations were found to be significant relative to 10^5 Monte Carlo randomizations of the dinucleotide PD-scale applied to the same calculation. A net loss of deformability also tends to increase the loss rate. Binding site position specific, 4 data-point correlations were also
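
    The statement that each single base mutant changes 2 dinucleotide values can be illustrated with a short sketch. The sequence, the mutation, and above all the scale values below are placeholders chosen for illustration; they are not the published PD-scale, and the function names are hypothetical.

```python
from itertools import product

# Placeholder scale: arbitrary numbers 0..15, NOT the published PD-scale values.
PLACEHOLDER_SCALE = {
    a + b: float(i) for i, (a, b) in enumerate(product("ACGT", repeat=2))
}

def dinucleotide_steps(seq):
    """Return the overlapping dinucleotide steps of a DNA sequence."""
    return [seq[i:i + 2] for i in range(len(seq) - 1)]

def deformability_change(wild_type, position, new_base, scale):
    """Find the steps altered by one substitution and sum their scale changes.

    `position` is 0-based. Returns (indices of changed steps, summed change).
    """
    mutant = wild_type[:position] + new_base + wild_type[position + 1:]
    wt_steps = dinucleotide_steps(wild_type)
    mut_steps = dinucleotide_steps(mutant)
    changed = [i for i, (w, m) in enumerate(zip(wt_steps, mut_steps)) if w != m]
    delta = sum(scale[mut_steps[i]] - scale[wt_steps[i]] for i in changed)
    return changed, delta

# Hypothetical 8-bp site with an A-to-G substitution at 0-based position 3.
changed, delta = deformability_change("ATCACGTG", 3, "G", PLACEHOLDER_SCALE)
print(changed, delta)
```

    An interior substitution alters exactly the two steps that contain the mutated base; a substitution at either end of the sequence alters only one.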

  13. Cooperativity between calmodulin-binding sites in Kv7.2 channels.

    Science.gov (United States)

    Alaimo, Alessandro; Alberdi, Araitz; Gomis-Perez, Carolina; Fernández-Orth, Juncal; Gómez-Posada, Juan Camilo; Areso, Pilar; Villarroel, Alvaro

    2013-01-01

    Among the multiple roles assigned to calmodulin (CaM), controlling the surface expression of Kv7.2 channels by binding to two discontinuous sites is a unique property of this Ca(2+) binding protein. Mutations that interfere with CaM binding or the sequestering of CaM prevent this M-channel component from exiting the endoplasmic reticulum (ER), which reduces M-current density in hippocampal neurons, enhancing excitability and offering a rational mechanism to explain some forms of benign familial neonatal convulsions (BFNC). Previously, we identified a mutation (S511D) that impedes CaM binding while allowing the channel to exit the ER, hinting that CaM binding may not be strictly required for Kv7.2 channel trafficking to the plasma membrane. Alternatively, this interaction with CaM might escape detection and, indeed, we now show that the S511D mutant contains functional CaM-binding sites that are not detected by classical biochemical techniques. Surface expression and function is rescued by CaM, suggesting that free CaM in HEK293 cells is limiting and reinforcing the hypothesis that CaM binding is required for ER exit. Within the CaM-binding domain formed by two sites (helix A and helix B), we show that CaM binds to helix B with higher apparent affinity than helix A, both in the presence and absence of Ca(2+), and that the two sites cooperate. Hence, CaM can bridge two binding domains, anchoring helix A of one subunit to helix B of another subunit, in this way influencing the function of Kv7.2 channels.

  14. A quantitative model for the in vivo assessment of drug binding sites with positron emission tomography

    Energy Technology Data Exchange (ETDEWEB)

    Mintun, M.A.; Raichle, M.E.; Kilbourn, M.R.; Wooten, G.F.; Welch, M.J.

    1984-03-01

    We propose an in vivo method for use with positron emission tomography (PET) that results in a quantitative characterization of neuroleptic binding sites using radiolabeled spiperone. The data are analyzed using a mathematical model that describes transport, nonspecific binding, and specific binding in the brain. The model demonstrates that the receptor quantities Bmax (i.e., the number of binding sites) and KD⁻¹ (i.e., the binding affinity) are not separably ascertainable with tracer methodology in human subjects. We have, therefore, introduced a new term, the binding potential, equivalent to the product Bmax·KD⁻¹, which reflects the capacity of a given tissue, or region of a tissue, for ligand-binding site interaction. The procedure for obtaining these measurements is illustrated with data from sequential PET scans of baboons after intravenous injection of carrier-added [18F]spiperone. From these data we estimate the brain tissue nonspecific binding of spiperone to be in the range of 94.2 to 95.3%, and the regional brain spiperone permeability (measured as the permeability-surface area product) to be in the range of 0.025 to 0.036 cm³/(s × ml). The binding potential of the striatum ranged from 17.4 to 21.6; these in vivo estimates compare favorably to in vitro values in the literature. To our knowledge this represents the first direct evidence that PET can be used to characterize quantitatively, locally and in vivo, drug binding sites in brain. The ability to make such measurements with PET should permit the detailed investigation of diseases thought to result from disorders of receptor function.
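
    The binding potential defined above is simply the ratio of Bmax to KD. The sketch below, with invented input values and a hypothetical function name, shows why the two quantities cannot be separated from this product alone: scaling both by the same factor leaves the binding potential unchanged.

```python
def binding_potential(bmax, kd):
    """Binding potential = Bmax * (1 / KD).

    Both arguments must be in the same concentration units, so the
    result is dimensionless.
    """
    if kd <= 0:
        raise ValueError("KD must be positive")
    return bmax / kd

# Hypothetical values in nM, chosen only for illustration.
bp_a = binding_potential(bmax=20.0, kd=1.0)
bp_b = binding_potential(bmax=40.0, kd=2.0)  # twice the sites, half the affinity

print(bp_a, bp_b)
```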

  15. An effective approach for identification of in vivo protein-DNA binding sites from paired-end ChIP-Seq data

    Directory of Open Access Journals (Sweden)

    Wilson Zoe A

    2010-02-01

    Background: ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used for identification of protein-DNA interactions in vivo in the genome. However, maximizing the effectiveness of data analysis of such sequences requires the development of new algorithms that are able to accurately predict DNA-protein binding sites. Results: Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated by paired-end Solexa ChIP-Seq technology. In this paper we used ChIP-Seq data from the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS), which is expressed within the anther during pollen development; the results show that SIPeS has better resolution for binding site identification compared to two existing ChIP-Seq peak detection algorithms, Cisgenome and MACS. Conclusions: When compared to Cisgenome and MACS, SIPeS shows better resolution for binding site discovery. Moreover, SIPeS is designed to calculate the mappable genome length accurately with the fragment length based on the paired-end reads. Dynamic baselines are also employed to effectively discriminate closely adjacent binding sites, for effective binding sites discovery, which is of particular value when working with high-density genomes.

  16. Gains and Losses of Transcription Factor Binding Sites in Saccharomyces cerevisiae and Saccharomyces paradoxus.

    Science.gov (United States)

    Schaefke, Bernhard; Wang, Tzi-Yuan; Wang, Chuen-Yi; Li, Wen-Hsiung

    2015-07-27

    Gene expression evolution occurs through changes in cis- or trans-regulatory elements or both. Interactions between transcription factors (TFs) and their binding sites (TFBSs) constitute one of the most important points where these two regulatory components intersect. In this study, we investigated the evolution of TFBSs in the promoter regions of different Saccharomyces strains and species. We divided the promoter of a gene into the proximal region and the distal region, which are defined, respectively, as the 200-bp region upstream of the transcription starting site and as the 200-bp region upstream of the proximal region. We found that the predicted TFBSs in the proximal promoter regions tend to be evolutionarily more conserved than those in the distal promoter regions. Additionally, Saccharomyces cerevisiae strains used in the fermentation of alcoholic drinks have experienced more TFBS losses than gains compared with strains from other environments (wild strains, laboratory strains, and clinical strains). We also showed that differences in TFBSs correlate with the cis component of gene expression evolution between species (comparing S. cerevisiae and its sister species Saccharomyces paradoxus) and within species (comparing two closely related S. cerevisiae strains).
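
    The proximal and distal promoter definitions above translate into simple coordinate arithmetic. The sketch below assumes 0-based, half-open genomic intervals and a transcription start site (TSS) given as the coordinate of the first transcribed base; these conventions and the function name are illustrative assumptions, not details taken from the study.

```python
def promoter_regions(tss, strand, width=200):
    """Return (proximal, distal) promoter intervals as half-open (start, end).

    Proximal: the `width` bp immediately upstream of the TSS.
    Distal:   the `width` bp immediately upstream of the proximal region.
    Coordinates are clamped at 0 so regions never run off the sequence start.
    """
    if strand == "+":
        # Upstream means lower genomic coordinates.
        proximal = (max(0, tss - width), max(0, tss))
        distal = (max(0, tss - 2 * width), max(0, tss - width))
    elif strand == "-":
        # Upstream means higher genomic coordinates.
        proximal = (tss + 1, tss + 1 + width)
        distal = (tss + 1 + width, tss + 1 + 2 * width)
    else:
        raise ValueError("strand must be '+' or '-'")
    return proximal, distal

print(promoter_regions(1000, "+"))
print(promoter_regions(1000, "-"))
```

    Predicted binding sites could then be assigned to either region by testing whether their coordinates fall inside the corresponding interval.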

  17. An Insertion Mutation That Distorts Antibody Binding Site Architecture Enhances Function of a Human Antibody

    Energy Technology Data Exchange (ETDEWEB)

    Krause, Jens C.; Ekiert, Damian C.; Tumpey, Terrence M.; Smith, Patricia B.; Wilson, Ian A.; Crowe, Jr., James E. (Vanderbilt); (Scripps); (CDC)

    2011-09-02

    The structural and functional significance of somatic insertions and deletions in antibody chains is unclear. Here, we demonstrate that a naturally occurring three-amino-acid insertion within the influenza virus-specific human monoclonal antibody 2D1 heavy-chain variable region reconfigures the antibody-combining site and contributes to its high potency against the 1918 and 2009 pandemic H1N1 influenza viruses. The insertion arose through a series of events, including a somatic point mutation in a predicted hot-spot motif, introduction of a new hot-spot motif, a molecular duplication due to polymerase slippage, a deletion due to misalignment, and additional somatic point mutations. Atomic resolution structures of the wild-type antibody and a variant in which the insertion was removed revealed that the three-amino-acid insertion near the base of heavy-chain complementarity-determining region (CDR) H2 resulted in a bulge in that loop. This enlarged CDR H2 loop impinges on adjacent regions, causing distortion of the CDR H1 architecture and its displacement away from the antigen-combining site. Removal of the insertion restores the canonical structure of CDR H1 and CDR H2, but binding, neutralization activity, and in vivo activity were reduced markedly because of steric conflict of CDR H1 with the hemagglutinin antigen.

  18. Ivermectin binding sites in human and invertebrate Cys-loop receptors

    DEFF Research Database (Denmark)

    Lynagh, Timothy Peter; Lynch, Joseph W

    2012-01-01

    Ivermectin is a gold standard antiparasitic drug that has been used successfully to treat billions of humans, livestock and pets. Until recently, the binding site on its Cys-loop receptor target had been a mystery. Recent protein crystal structures, site-directed mutagenesis data and molecular mo...... for a wide variety of human neurological disorders....

  19. Endogenously generated plasmin at the vascular wall injury site amplifies lysine binding site-dependent plasminogen accumulation in microthrombi.

    Directory of Open Access Journals (Sweden)

    Tomasz Brzoska

    The fibrinolytic system plays a pivotal role in the regulation of hemostasis; however, it remains unclear how and when the system is triggered to induce thrombolysis. Using intra-vital confocal fluorescence microscopy, we investigated the process of plasminogen binding to laser-induced platelet-rich microthrombi generated in the mesenteric vein of transgenic mice expressing green fluorescent protein (GFP). The accumulation of GFP-expressing platelets as well as exogenously infused Alexa Fluor 568-labeled Glu-plasminogen (Glu-plg) on the injured vessel wall was assessed by measuring the increase in the corresponding fluorescence intensities. Glu-plg accumulated in a time-dependent manner in the center of the microthrombus, where phosphatidylserine is exposed on platelet surfaces and fibrin formation takes place. The rates of binding of Glu-plg in the presence of ε-aminocaproic acid and carboxypeptidase B, as well as the rates of binding of mini-plasminogen lacking kringle domains 1-4 and lysine binding sites, were significantly lower than that of Glu-plg alone, suggesting that the binding was dependent on lysine binding sites. Furthermore, aprotinin significantly suppressed the accumulation of Glu-plg, suggesting that endogenously generated plasmin activity is a prerequisite for the accumulation. In spite of the endogenous generation of plasmin and accumulation of Glu-plg in the center of microthrombi, the microthrombi did not change in size during the 2-hour observation period. When human tissue plasminogen activator was administered intravenously, Glu-plg further accumulated and the microthrombi were lysed. Glu-plg appeared to accumulate in the center of microthrombi in the early phase of microthrombus formation, and plasmin activity and lysine binding sites were required for this accumulation.

  20. HMMpTM: improving transmembrane protein topology prediction using phosphorylation and glycosylation site prediction.

    Science.gov (United States)

    Tsaousis, Georgios N; Bagos, Pantelis G; Hamodrakas, Stavros J

    2014-02-01

    During the last two decades a large number of computational methods have been developed for predicting transmembrane protein topology. Current predictors rely on topogenic signals in the protein sequence, such as the distribution of positively charged residues in extra-membrane loops and the existence of N-terminal signals. However, phosphorylation and glycosylation are post-translational modifications (PTMs) that occur in a compartment-specific manner and therefore the presence of a phosphorylation or glycosylation site in a transmembrane protein provides topological information. We examine the combination of phosphorylation and glycosylation site prediction with transmembrane protein topology prediction. We report the development of a Hidden Markov Model based method, capable of predicting the topology of transmembrane proteins and the existence of kinase specific phosphorylation and N/O-linked glycosylation sites along the protein sequence. Our method integrates a novel feature in transmembrane protein topology prediction, which results in improved performance for topology prediction and reliable prediction of phosphorylation and glycosylation sites. The method is freely available at http://bioinformatics.biol.uoa.gr/HMMpTM.

  1. Exploring the role of water in molecular recognition: predicting protein ligandability using a combinatorial search of surface hydration sites

    Science.gov (United States)

    Vukovic, Sinisa; Brennan, Paul E.; Huggins, David J.

    2016-09-01

    The interaction between any two biological molecules must compete with their interaction with water molecules. This makes water the most important molecule in medicine, as it controls the interactions of every therapeutic with its target. A small molecule binding to a protein is able to recognize a unique binding site on a protein by displacing bound water molecules from specific hydration sites. Quantifying the interactions of these water molecules allows us to estimate the potential of the protein to bind a small molecule. This is referred to as ligandability. In the study, we describe a method to predict ligandability by performing a search of all possible combinations of hydration sites on protein surfaces. We predict ligandability as the summed binding free energy for each of the constituent hydration sites, computed using inhomogeneous fluid solvation theory. We compared the predicted ligandability with the maximum observed binding affinity for 20 proteins in the human bromodomain family. Based on this comparison, it was determined that effective inhibitors have been developed for the majority of bromodomains, in the range from 10 to 100 nM. However, we predict that more potent inhibitors can be developed for the bromodomains BPTF and BRD7 with relative ease, but that further efforts to develop inhibitors for ATAD2 will be extremely challenging. We have also made predictions for the 14 bromodomains with no reported small molecule Kd values by isothermal titration calorimetry. The calculations predict that PBRM1(1) will be a challenging target, while others such as TAF1L(2), PBRM1(4) and TAF1(2), should be highly ligandable. As an outcome of this work, we assembled a database of experimental maximal Kd that can serve as a community resource assisting medicinal chemistry efforts focused on BRDs. Effective prediction of ligandability would be a very useful tool in the drug discovery process.
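
    The combinatorial search described above can be sketched as an exhaustive enumeration of hydration-site subsets scored by their summed free energy. Everything specific here is an assumption made for illustration: the per-site free energies are invented rather than computed with inhomogeneous fluid solvation theory, the distance cutoff stands in for whatever criterion keeps a combination ligand-sized, and more negative sums are taken to mean higher ligandability.

```python
from itertools import combinations
from math import dist

def best_site_combination(sites, k, max_span):
    """Exhaustively search k-site combinations for the lowest summed free energy.

    sites:    list of dicts with "name", "xyz" (coordinates) and "dg"
              (hypothetical per-site free energy contribution).
    max_span: combinations containing any pair of sites farther apart than
              this are skipped (an assumed compactness criterion).
    Returns (summed free energy, site names), or None if nothing qualifies.
    """
    best = None
    for combo in combinations(sites, k):
        too_spread = any(
            dist(a["xyz"], b["xyz"]) > max_span for a, b in combinations(combo, 2)
        )
        if too_spread:
            continue
        total = sum(site["dg"] for site in combo)
        if best is None or total < best[0]:
            best = (total, tuple(site["name"] for site in combo))
    return best

# Invented hydration sites; coordinates and energies are placeholders.
sites = [
    {"name": "w1", "xyz": (0.0, 0.0, 0.0), "dg": -1.0},
    {"name": "w2", "xyz": (3.0, 0.0, 0.0), "dg": -2.5},
    {"name": "w3", "xyz": (0.0, 4.0, 0.0), "dg": -0.5},
    {"name": "w4", "xyz": (30.0, 0.0, 0.0), "dg": -4.0},
]

print(best_site_combination(sites, k=2, max_span=6.0))
```

    Site w4 has the most favourable individual energy but lies too far from the others to share a pocket with them, so the search selects w1 and w2. Exhaustive enumeration grows combinatorially with the number of sites, so a practical implementation would need pruning.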

  2. Activation of phenylalanine hydroxylase by phenylalanine does not require binding in the active site.

    Science.gov (United States)

    Roberts, Kenneth M; Khan, Crystal A; Hinck, Cynthia S; Fitzpatrick, Paul F

    2014-12-16

    Phenylalanine hydroxylase (PheH), a liver enzyme that catalyzes the hydroxylation of excess phenylalanine in the diet to tyrosine, is activated by phenylalanine. The lack of activity at low levels of phenylalanine has been attributed to the N-terminus of the protein's regulatory domain acting as an inhibitory peptide by blocking substrate access to the active site. The location of the site at which phenylalanine binds to activate the enzyme is unknown, and both the active site in the catalytic domain and a separate site in the N-terminal regulatory domain have been proposed. Binding of catecholamines to the active-site iron was used to probe the accessibility of the active site. Removal of the regulatory domain increases the rate constants for association of several catecholamines with the wild-type enzyme by ∼2-fold. Binding of phenylalanine in the active site is effectively abolished by mutating the active-site residue Arg270 to lysine. The k(cat)/K(phe) value is down 10⁴ for the mutant enzyme, and the K(m) value for phenylalanine for the mutant enzyme is >0.5 M. Incubation of the R270K enzyme with phenylalanine also results in a 2-fold increase in the rate constants for catecholamine binding. The change in the tryptophan fluorescence emission spectrum seen in the wild-type enzyme upon activation by phenylalanine is also seen with the R270K mutant enzyme in the presence of phenylalanine. Both results establish that activation of PheH by phenylalanine does not require binding of the amino acid in the active site. This is consistent with a separate allosteric site, likely in the regulatory domain.

  3. Characterization of two different melatonin binding sites in peripheral tissues of the teleost Tinca tinca.

    Science.gov (United States)

    López Patiño, M A; Guijarro, A I; Alonso-Gómez, A L; Delgado, M J

    2012-01-01

    The aim of the present study was to localize and characterize 2-iodo-melatonin ([(125)I]Mel) binding sites in peripheral tissues of the teleost Tinca tinca. A wide distribution of [(125)I]Mel binding sites in peripheral locations of the tench is found, with highest densities being measured in the heart, gills and kidney, and low density of [(125)I]Mel binding sites in gastrointestinal tract, spleen, liver and gonads. Saturation, kinetics, and pharmacological approaches revealed the presence of, at least, two different [(125)I]Mel binding sites in the tench peripheral tissues. The unique characterized subtype in the heart fulfils all the criteria for a canonical melatonin receptor belonging to MT(1) family (the binding is saturable, reversible, and inhibited by GTP analogs), and gives support for the presence of a functional melatonin receptor in the heart of the tench. In contrast, kinetic and pharmacological studies in the kidney revealed the preponderance of a melatonin binding site belonging to the MT(3)-like receptor subtype. Moreover, the decrease of specific binding in both, heart and kidney membranes, and the decrease of affinity in the kidney, produced by the addition of a non-hydrolysable GTP analog, and sodium cations suggest the presence of G(i/o)-proteins (that mediate inhibition of cAMP formation) coupled to such melatonin binding sites. Our results also point to different G(i/o)-proteins involved in the underlying mechanism of melatonin binding sites activation in the kidney. Additionally, the kinetics of [(125)I]Mel binding in kidney membrane preparations is a highly thermosensitive process, being necessary to perform the assays at 4 °C since the equilibrium was not reached at 25 °C assay temperature. The time needed to complete association of [(125)I]Mel at such low temperature is only 15s, whereas 100s is required to displace [(125)I]Mel specific binding by the unlabeled melatonin in kidney membranes. Present results support previous reports on

  4. Impact of disruption of secondary binding site S2 on dopamine transporter function.

    Science.gov (United States)

    Zhen, Juan; Reith, Maarten E A

    2016-09-01

    The structures of the leucine transporter, Drosophila dopamine transporter, and human serotonin transporter show a secondary binding site (designated S2) for drugs and substrate in the extracellular vestibule toward the membrane exterior in relation to the primary substrate recognition site (S1). The present experiments are aimed at disrupting S2 by mutating Asp476 and Ile159 to Ala. Both mutants displayed a profound decrease in [3H]DA uptake compared with wild-type associated with a reduced turnover rate kcat. This was not caused by a conformational bias as the mutants responded to Zn(2+) (10 μM) similarly to WT. The dopamine transporters with either the D476A or I159A mutation both displayed a higher Ki for dopamine for the inhibition of [3H](-)-2-β-carbomethoxy-3-β-(4-fluorophenyl)tropane binding than did the WT transporter, in accordance with an allosteric interaction between the S1 and S2 sites. The results provide evidence in favor of a general applicability of the two-site allosteric model of the Javitch/Weinstein group from LeuT to dopamine transporter and possibly other monoamine transporters. X-ray structures of transporters closely related to the dopamine (DA) transporter show a secondary binding site S2 in the extracellular vestibule proximal to the primary binding site S1 which is closely linked to one of the Na(+) binding sites. This work examines the relationship between S2 and S1 sites. We found that S2 site impairment severely reduced DA transport and allosterically reduced S1 site affinity for the cocaine analog [3H]CFT. Our results are the first to lend direct support for the application of the two-site allosteric model, advanced for bacterial LeuT, to the human DA transporter. The model states that, after binding of the first DA molecule (DA1) to the primary S1 site (along with Na(+)), binding of a second DA (DA2) to the S2 site triggers, through an allosteric interaction, the release of DA1 and Na(+) into the cytoplasm.

  5. The roles of histidine residues at the starch-binding site in streptococcal-binding activities of human salivary amylase.

    Science.gov (United States)

    Tseng, C C; Miyamoto, M; Ramalingam, K; Hemavathy, K C; Levine, M J; Ramasubbu, N

    1999-02-01

    Human salivary alpha-amylase participates in the initial digestion of starch and may be involved in the colonization of viridans streptococci in the mouth. To elucidate the role of histidine residues located near the starch-binding site on the streptococcal-binding activity, the wild type and three histidine mutants, H52A, H299A and H305A, were constructed and expressed in a baculovirus system. While His52 is located near the non-reducing end of the starch-binding pocket (subsite S3/S4), the residues His299 and His305 are located near the subsites S1/S1'. For the wild type, the cDNA encoding the leader and secreted sequences of human salivary amylase was amplified by polymerase chain reaction from a human submandibular salivary-gland cDNA library, and subcloned into the baculovirus shuttle vector pVL1392 downstream of the polyhedrin promoter. Oligonucleotide-based, site-directed mutagenesis was used to generate the mutants expressed in the baculovirus system. Replacing His52, His299 or His305 with an Ala residue did not alter the bacterial-binding activity significantly, but these mutants did show differences in their catalytic activities. The mutant H52A showed negligible reduction in enzymatic activity compared to that of wild type for the hydrolysis of starch and oligosaccharides. In contrast, the H299A and H305A mutants showed a 12 to 13-fold reduction (90-92%) in starch-hydrolysing activity. In addition, the k(cat) for the hydrolysis of oligosaccharides by H299A decreased by as much as 11-fold for maltoheptaoside. This reduction was even higher (40-fold) for the hydrolysis of p-nitrophenyl maltoside, with a significant change in K(M). The mutant H305A, however, exhibited a reduction in k(cat) only, with no changes in the K(M) for the hydrolysis of oligosaccharides. The reduction in the k(cat) for the H305A mutant was almost 93% for maltoheptaoside hydrolysis. The pH activity profile for the H305A mutant was also significantly different from that of the wild type

  6. SMAP-WS: a parallel web service for structural proteome-wide ligand-binding site comparison.

    Science.gov (United States)

    Ren, Jingyuan; Xie, Lei; Li, Wilfred W; Bourne, Philip E

    2010-07-01

    The proteome-wide characterization and analysis of protein ligand-binding sites and their interactions with ligands can provide pivotal information in understanding the structure, function and evolution of proteins and for designing safe and efficient therapeutics. The SMAP web service (SMAP-WS) meets this need through parallel computations designed for 3D ligand-binding site comparison and similarity searching on a structural proteome scale. SMAP-WS implements a shape descriptor (the Geometric Potential) that characterizes both local and global topological properties of the protein structure and which can be used to predict the likely ligand-binding pocket [Xie,L. and Bourne,P.E. (2007) A robust and efficient algorithm for the shape description of protein structures and its application in predicting ligand-binding sites. BMC bioinformatics, 8 (Suppl. 4.), S9.]. Subsequently a sequence order independent profile-profile alignment (SOIPPA) algorithm is used to detect and align similar pockets thereby finding protein functional and evolutionary relationships across fold space [Xie, L. and Bourne, P.E. (2008) Detecting evolutionary relationships across existing fold space, using sequence order-independent profile-profile alignments. Proc. Natl Acad. Sci. USA, 105, 5441-5446]. An extreme value distribution model estimates the statistical significance of the match [Xie, L., Xie, L. and Bourne, P.E. (2009) A unified statistical model to support local sequence order independent similarity searching for ligand-binding sites and its application to genome-based drug discovery. Bioinformatics, 25, i305-i312.]. These algorithms have been extensively benchmarked and shown to outperform most existing algorithms. Moreover, several predictions resulting from SMAP-WS have been validated experimentally. Thus far SMAP-WS has been applied to predict drug side effects, and to repurpose existing drugs for new indications. SMAP-WS provides both a user-friendly web interface and

  7. Purification of high affinity benzodiazepine receptor binding site fragments from rat brain

    Energy Technology Data Exchange (ETDEWEB)

    Klotz, K.L.

    1984-01-01

    In the central nervous system, benzodiazepine recognition sites occur on neuronal cell surfaces as one member of a multireceptor complex, including recognition sites for benzodiazepines, gamma aminobutyric acid (GABA), barbiturates and a chloride ionophore. During photoaffinity labelling, the benzodiazepine agonist, [3H]flunitrazepam, is irreversibly bound to central benzodiazepine high affinity recognition sites in the presence of ultraviolet light. In these studies a [3H]flunitrazepam radiolabel was used to track the isolation and purification of high affinity agonist binding site fragments from membrane-bound benzodiazepine receptor in rat brain. The authors present a method for limited proteolysis of [3H]flunitrazepam photoaffinity labeled rat brain membranes, generating photolabeled benzodiazepine receptor fragments containing the agonist binding site. Using trypsin, chymotrypsin A4, or a combination of these two proteases, they have demonstrated the extent and time course for partial digestion of benzodiazepine receptor, yielding photolabeled receptor binding site fragments. These photolabeled receptor fragments have been further purified on the basis of size, using ultrafiltration, gel permeation chromatography, and sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE), as well as on the basis of hydrophobicity, using a high performance liquid chromatography (HPLC) precolumn, several HPLC elution schemes, and two different HPLC column types. Using these procedures, they have purified three photolabeled benzodiazepine receptor fragments containing the agonist binding site which appear to have a molecular weight of less than 2000 daltons each.

  8. Structure and binding efficiency relations of QB site inhibitors of photosynthetic reaction centres.

    Science.gov (United States)

    Husu, Ivan; Magyar, Melinda; Szabó, Tibor; Fiser, Béla; Gómez-Bengoa, Enrique; Nagy, László

    2015-04-01

    Many herbicides employed in agriculture and also some antibiotics bind to a specific site of the reaction centre protein (RC), blocking the photosynthetic electron transport. Crystal structures showed that all these compounds bind at the secondary ubiquinone (QB) site, albeit at slightly different places. Different herbicide molecules have different binding affinities (evaluated as inhibition constants, KI, and binding enthalpy values, ΔHbind). The action of inhibitors depends on the following parameters: (i) herbicide molecular structure; (ii) interactions between herbicide and quinone binding site; (iii) protein environment. In our investigations KI and ΔHbind were determined for several inhibitors. Bound herbicide structures were optimized and their intramolecular charge distributions were calculated. Experimental and calculated data were compared to those available from databank crystal structures. We can state that the herbicide inhibition efficiency depends on steric and electronic factors, i.e. the geometry of binding with the protein and the molecular charge distribution, respectively. Apolar bulky groups on the N-7 atom of the inhibitor molecule (like t-butyl in terbutryn) are preferable for establishing stronger interactions with the QB site, while such substituents are not recommended on N-8. The N-4,7,8 nitrogen atoms maintain a larger electron density so that more effective H-bonds are formed between the inhibitor and the surrounding amino acids of the protein.

  9. Membrane androgen binding sites are preferentially expressed in human prostate carcinoma cells

    Directory of Open Access Journals (Sweden)

    Delakas Dimitrios

    2003-01-01

    Background: Prostate cancer is one of the most frequent malignancies in males. Nevertheless, to date, there is no specific routine diagnostic marker to be used in clinical practice. Recently, the identification of a membrane testosterone binding site involved in the remodeling of actin cytoskeleton structures and PSA secretion on LNCaP human prostate cancer cells has been reported. We have investigated whether this membrane testosterone binding component could be of value for the identification of prostate cancer. Methods: Using a non-internalizable testosterone-BSA-FITC analog, proven to bind on membrane sites only in LNCaP cells, we have investigated the expression of membrane testosterone binding sites in a series of prostate carcinomas (n = 14), morphologically normal epithelia taken from areas of the surgical specimens far from the location of the carcinomas (n = 8), and benign prostate hyperplasia (BPH) epithelia (n = 10). Isolated epithelial cells were studied by flow cytometry, and touching preparations, after 10-min incubation. In addition, routine histological slides were assayed by confocal laser microscopy. Results: We show that membrane testosterone binding sites are preferentially expressed in prostate carcinoma cells, while BPH and non-malignant epithelial cells show low or absent binding. Conclusions: Our results indicate that membrane testosterone receptors might be of use for the rapid routine identification of prostate cancer, representing a new diagnostic marker of the disease.

  10. High-affinity cannabinoid binding site in brain: A possible marijuana receptor

    Energy Technology Data Exchange (ETDEWEB)

    Nye, J.S.

    1988-01-01

    The mechanism by which Δ9-tetrahydrocannabinol (Δ9-THC), the major psychoactive component of marijuana or hashish, produces its potent psychological and physiological effects is unknown. To find receptor binding sites for THC, we designed a water-soluble analog for use as a radioligand. 5′-Trimethylammonium-Δ8-THC (TMA) is a positively charged analog of Δ8-THC modified on the 5′ carbon, a portion of the molecule not important for its psychoactivity. We have studied the binding of [3H]-5′-trimethylammonium-Δ8-THC ([3H]TMA) to rat neuronal membranes. [3H]TMA binds saturably and reversibly to brain membranes with high affinity to apparently one class of sites. Highest binding site density occurs in brain, but several peripheral organs also display specific binding. Detergent solubilizes the sites without affecting their pharmacological properties. Molecular sieve chromatography reveals a bimodal peak of [3H]TMA binding activity of approximately 60,000 daltons apparent molecular weight.

  11. Autoradiographic demonstration of oxytocin-binding sites in the macula densa

    Energy Technology Data Exchange (ETDEWEB)

    Stoeckel, M.E.; Freund-Mercier, M.J. (Centre National de la Recherche Scientifique, Strasbourg (France))

    1989-08-01

    Specific oxytocin (OT)-binding sites were localized in the rat kidney with use of a selective 125I-labeled OT antagonist (125I-OTA). High concentrations of OT binding sites were detected on the juxtaglomerular apparatus with use of the conventional film autoradiographic technique. No labeling occurred on other renal structures. The cellular localization of the OT binding sites within the juxtaglomerular apparatus was studied in light microscope autoradiography, on semithin sections from paraformaldehyde-fixed kidney slices incubated in the presence of 125I-OTA. These preparations revealed selective labeling of the macula densa, mainly concentrated at the basal pole of the cells. Control experiments showed first that 125I-OTA binding characteristics were not noticeably altered by prior paraformaldehyde fixation of the kidneys and second that autoradiographic detection of the binding sites was not impaired by histological treatments following binding procedures. In view of the role of the macula densa in the tubuloglomerular feedback, the putative OT receptors of this structure might mediate the stimulatory effect of OT on glomerular filtration.

  12. Predicting sulfotyrosine sites using the random forest algorithm with significantly improved prediction accuracy

    Directory of Open Access Journals (Sweden)

    Yang Zheng

    2009-10-01

    Background: Tyrosine sulfation is one of the most important posttranslational modifications. Due to its relevance to various disease developments, tyrosine sulfation has become the target for drug design. In order to facilitate efficient drug design, accurate prediction of sulfotyrosine sites is desirable. A predictor published seven years ago has been very successful with claimed prediction accuracy of 98%. However, it has a particularly low sensitivity when predicting sulfotyrosine sites in some newly sequenced proteins. Results: A new approach has been developed for predicting sulfotyrosine sites using the random forest algorithm after a careful evaluation of seven machine learning algorithms. Peptides are formed by consecutive residues symmetrically flanking tyrosine sites. They are then encoded using an amino acid hydrophobicity scale. This new approach has increased the sensitivity by 22%, the specificity by 3%, and the total prediction accuracy by 10% compared with the previous predictor using the same blind data. Meanwhile, both negative and positive predictive powers have been increased by 9%. In addition, the random forest model has an excellent feature for ranking the residues flanking tyrosine sites, hence providing more information for further investigating the tyrosine sulfation mechanism. A web tool has been implemented at http://ecsb.ex.ac.uk/sulfotyrosine for public use. Conclusion: The random forest algorithm is able to deliver a better model compared with the Hidden Markov Model, the support vector machine, artificial neural networks, and others for predicting sulfotyrosine sites. The success shows that the random forest algorithm together with an amino acid hydrophobicity scale encoding can be a good candidate for peptide classification.
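
    The peptide construction and encoding step in this abstract can be sketched in Python as follows. This is an illustration under stated assumptions rather than the authors' code: the abstract does not name the hydrophobicity scale, so the Kyte-Doolittle hydropathy values are used as a stand-in, and the flank width, the padding character `X` and the function names are hypothetical.

```python
# Kyte-Doolittle hydropathy values. The abstract only says "an amino acid
# hydrophobicity scale", so picking this particular scale is an assumption.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def tyrosine_windows(sequence, flank=4, pad="X"):
    """Yield (position, peptide) for every tyrosine in `sequence`.

    Each peptide holds `flank` residues on both sides of the tyrosine, so the
    tyrosine sits in the middle. Sequence ends are padded with `pad` so that
    all peptides have the same length (the padding rule is an assumption).
    """
    padded = pad * flank + sequence + pad * flank
    for index, residue in enumerate(sequence):
        if residue == "Y":
            yield index, padded[index:index + 2 * flank + 1]


def encode(peptide, scale=KYTE_DOOLITTLE, unknown=0.0):
    """Map a peptide to a list of hydrophobicity values, one per residue."""
    return [scale.get(residue, unknown) for residue in peptide]


if __name__ == "__main__":
    for position, peptide in tyrosine_windows("MAYDEDYGS", flank=2):
        print(position, peptide, encode(peptide))
```

    The resulting fixed-length numeric vectors are the kind of input a random forest classifier can be trained on; the training step itself is left out here because the abstract gives no details about it.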

  13. Interaction of Palmitic Acid with Metoprolol Succinate at the Binding Sites of Bovine Serum Albumin

    Directory of Open Access Journals (Sweden)

    Mashiur Rahman

    2014-12-01

    Purpose: The aim of this study was to characterize the binding profile of metoprolol succinate and to examine its interaction with palmitic acid at its binding sites on albumin. Methods: The binding of metoprolol succinate to bovine serum albumin (BSA) was studied by the equilibrium dialysis (ED) method at 27°C and pH 7.4, in order to gain insight into the binding chemistry of the drug to BSA in the presence and absence of palmitic acid. The study was carried out using ranitidine as a site-1-specific probe and diazepam as a site-2-specific probe. Results: Different analyses of the binding of metoprolol succinate to bovine serum albumin suggested two sets of association constants: a high-affinity association constant (k1 = 11.0 × 10^5 M^-1) with low capacity (n1 = 2) and a low-affinity association constant (k2 = 4.0 × 10^5 M^-1) with high capacity (n2 = 8) at pH 7.4 and 27°C. During concurrent administration of palmitic acid and metoprolol succinate in the presence or absence of ranitidine or diazepam, palmitic acid displaced metoprolol succinate from its binding site on BSA, resulting in reduced binding of metoprolol succinate to BSA. The free fraction of metoprolol succinate increased from 26.27% to 55.08% as the palmitic acid concentration was raised from 0 × 10^-5 M to 16 × 10^-5 M. In the presence of ranitidine and diazepam, palmitic acid further increased the free fraction of metoprolol succinate from 33.05% to 66.95% and from 40.68% to 72.88%, respectively. Conclusion: These data provide evidence of an interaction at higher concentrations of palmitic acid at the binding sites on BSA, which might change the pharmacokinetic properties of metoprolol succinate.
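
    To make the two sets of association constants concrete, the following Python sketch evaluates a standard model of two independent classes of binding sites using the k and n values quoted in the abstract. The abstract does not state which binding model the authors fitted, so the model itself and the function name are assumptions.

```python
# Association constants (1/M) and capacities (sites per BSA molecule) as
# quoted in the abstract.
K1, N1 = 11.0e5, 2   # high affinity, low capacity
K2, N2 = 4.0e5, 8    # low affinity, high capacity


def bound_per_protein(free_drug_molar, classes=((N1, K1), (N2, K2))):
    """Moles of drug bound per mole of protein.

    Assumes independent, non-interacting site classes, each contributing
    n * k * [D] / (1 + k * [D]). This model is an assumption made here for
    illustration; the abstract does not name the one used in the study.
    """
    return sum(
        n * k * free_drug_molar / (1.0 + k * free_drug_molar)
        for n, k in classes
    )


if __name__ == "__main__":
    for free in (1e-7, 1e-6, 1e-5, 1e-4):
        print(f"{free:.0e} M free drug -> {bound_per_protein(free):.2f} bound per BSA")
```

    Under this model the bound amount saturates at n1 + n2 = 10 molecules per albumin, and the high-affinity class is half-occupied when the free drug concentration equals 1/k1.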

  14. High-throughput prediction of RNA, DNA and protein binding regions mediated by intrinsic disorder.

    Science.gov (United States)

    Peng, Zhenling; Kurgan, Lukasz

    2015-10-15

    Intrinsically disordered proteins and regions (IDPs and IDRs) lack stable 3D structure under physiological conditions in vitro, are common in eukaryotes, and facilitate interactions with RNA, DNA and proteins. Current methods for prediction of IDPs and IDRs do not provide insights into their functions, except for a handful of methods that address predictions of protein-binding regions. We report a first-of-its-kind computational method, DisoRDPbind, for high-throughput prediction of RNA, DNA and protein binding residues located in IDRs from protein sequences. DisoRDPbind is implemented using a runtime-efficient multi-layered design that utilizes information extracted from physicochemical properties of amino acids, sequence complexity, putative secondary structure and disorder, and sequence alignment. Empirical tests demonstrate that it provides accurate predictions that are competitive with other predictors of disorder-mediated protein binding regions and complementary to the methods that predict RNA- and DNA-binding residues annotated based on crystal structures. Application in Homo sapiens, Mus musculus, Caenorhabditis elegans and Drosophila melanogaster proteomes reveals that RNA- and DNA-binding proteins predicted by DisoRDPbind complement and overlap with the corresponding known binding proteins collected from several sources. Also, the number of the putative protein-binding regions predicted with DisoRDPbind correlates with the promiscuity of proteins in the corresponding protein-protein interaction networks. Webserver: http://biomine.ece.ualberta.ca/DisoRDPbind/.

  15. Functional identification of catalytic metal ion binding sites within RNA.

    Directory of Open Access Journals (Sweden)

    James L Hougland

    2005-09-01

    The viability of living systems depends inextricably on enzymes that catalyze phosphoryl transfer reactions. For many enzymes in this class, including several ribozymes, divalent metal ions serve as obligate cofactors. Understanding how metal ions mediate catalysis requires elucidation of metal ion interactions with both the enzyme and the substrate(s). In the Tetrahymena group I intron, previous work using atomic mutagenesis and quantitative analysis of metal ion rescue behavior identified three metal ions (MA, MB, and MC) that make five interactions with the ribozyme substrates in the reaction's transition state. Here, we combine substrate atomic mutagenesis with site-specific phosphorothioate substitutions in the ribozyme backbone to develop a powerful, general strategy for defining the ligands of catalytic metal ions within RNA. In applying this strategy to the Tetrahymena group I intron, we have identified the pro-SP phosphoryl oxygen at nucleotide C262 as a ribozyme ligand for MC. Our findings establish a direct connection between the ribozyme core and the functionally defined model of the chemical transition state, thereby extending the known set of transition-state interactions and providing information critical for the application of the recent group I intron crystallographic structures to the understanding of catalysis.

  16. Pipeline for Efficient Mapping of Transcription Factor Binding Sites and Comparison of Their Models

    KAUST Repository

    Ba alawi, Wail

    2011-06-01

    The control of genes in every living organism is based on the activities of transcription factor (TF) proteins. These TFs interact with DNA by binding to TF binding sites (TFBSs) and in that way create conditions for the genes to activate. Of the approximately 1500 TFs in human, TFBSs have been experimentally derived for fewer than 300 TFs, and generally only in limited portions of the genome. To associate TFs with the genes they control, we need to know whether a TF has the potential to interact with the control region of the gene. For this we need models of TFBS families. The existing models are not sufficiently accurate or they are too complex for use by ordinary biologists. To remove some of the deficiencies of these models, in this study we developed a pipeline through which we achieved the following: (1) through a comparative analysis of performance, we identified the best models with optimized thresholds among the four different types of models of TFBS families; (2) using the best models, we mapped TFBSs to the human genome in an efficient way. The study shows that a new scoring function used with TFBS models based on the position weight matrix of dinucleotides with remote dependency results in better accuracy than the other three types of TFBS models. The speed of mapping has been improved by developing parallelized code, which shows a significant speed-up of 4x when going from 1 CPU to 8 CPUs. To verify whether the predicted TFBSs are more accurate than what can be expected with the conventional models, we identified the most frequent pairs of TFBSs (for TFs E4F1 and ATF6) that appeared close to each other (within a distance of 200 nucleotides) over the human genome. We show, unexpectedly, that the genes closest to the multiple pairs of E4F1/ATF6 binding sites have a co-expression of over 90%. This indirectly supports our hypothesis that the TFBS models we use are more accurate and also suggests that the E4F1/ATF6 pair is exerting the
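
    The pair search mentioned in this abstract (binding sites of two TFs lying within 200 nucleotides of each other) can be sketched in Python as below. The function name, the representation of sites as single integer coordinates on one chromosome, and the example positions are all hypothetical; the abstract does not describe how the pipeline stores or compares sites.

```python
from bisect import bisect_left, bisect_right


def close_pairs(sites_a, sites_b, max_distance=200):
    """Return all (a, b) pairs with |a - b| <= max_distance.

    `sites_a` and `sites_b` are predicted binding-site positions for two TFs
    on the same chromosome. Sorting `sites_b` once and using binary search
    avoids comparing every site of one TF against every site of the other.
    """
    sorted_b = sorted(sites_b)
    pairs = []
    for a in sorted(sites_a):
        low = bisect_left(sorted_b, a - max_distance)
        high = bisect_right(sorted_b, a + max_distance)
        pairs.extend((a, b) for b in sorted_b[low:high])
    return pairs


if __name__ == "__main__":
    # Made-up coordinates, for illustration only.
    e4f1_sites = [100, 1000]
    atf6_sites = [250, 301, 900, 5000]
    print(close_pairs(e4f1_sites, atf6_sites))
```

    With the made-up positions above, the site at 301 is excluded because it lies 201 nucleotides from the nearest site of the other TF.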

  17. Localization of the Substrate-binding Site in the Homodimeric Mannitol Transporter, EIImtl, of Escherichia coli

    Science.gov (United States)

    Opačić, Milena; Vos, Erwin P. P.; Hesp, Ben H.; Broos, Jaap

    2010-01-01

    The mannitol transporter from Escherichia coli, EIImtl, belongs to a class of membrane proteins coupling the transport of substrates with their chemical modification. EIImtl is functional as a homodimer, and it harbors one high affinity mannitol-binding site in the membrane-embedded C domain (IICmtl). To localize this binding site, 19 single Trp-containing mutants of EIImtl were biosynthetically labeled with 5-fluorotryptophan (5-FTrp) and mixed with azi-mannitol, a substrate analog acting as a Förster resonance energy transfer (FRET) acceptor. Typically, for mutants showing FRET, only one 5-FTrp was involved, whereas the 5-FTrp from the other monomer was too distant. This proves that the mannitol-binding site is asymmetrically positioned in dimeric IICmtl. Combined with the available two-dimensional projection maps of IICmtl, it is concluded that a second resting binding site is present in this transporter. Active transport of mannitol only takes place when EIImtl becomes phosphorylated at Cys384 in the cytoplasmic B domain. Stably phosphorylated EIImtl mutants were constructed, and FRET experiments showed that the position of mannitol in IICmtl remains the same. We conclude that during the transport cycle, the phosphorylated B domain has to move to the mannitol-binding site, located in the middle of the membrane, to phosphorylate mannitol. PMID:20522557

  18. Localization of the substrate-binding site in the homodimeric mannitol transporter, EIImtl, of Escherichia coli.

    Science.gov (United States)

    Opacić, Milena; Vos, Erwin P P; Hesp, Ben H; Broos, Jaap

    2010-08-13

    The mannitol transporter from Escherichia coli, EII(mtl), belongs to a class of membrane proteins coupling the transport of substrates with their chemical modification. EII(mtl) is functional as a homodimer, and it harbors one high affinity mannitol-binding site in the membrane-embedded C domain (IIC(mtl)). To localize this binding site, 19 single Trp-containing mutants of EII(mtl) were biosynthetically labeled with 5-fluorotryptophan (5-FTrp) and mixed with azi-mannitol, a substrate analog acting as a Förster resonance energy transfer (FRET) acceptor. Typically, for mutants showing FRET, only one 5-FTrp was involved, whereas the 5-FTrp from the other monomer was too distant. This proves that the mannitol-binding site is asymmetrically positioned in dimeric IIC(mtl). Combined with the available two-dimensional projection maps of IIC(mtl), it is concluded that a second resting binding site is present in this transporter. Active transport of mannitol only takes place when EII(mtl) becomes phosphorylated at Cys(384) in the cytoplasmic B domain. Stably phosphorylated EII(mtl) mutants were constructed, and FRET experiments showed that the position of mannitol in IIC(mtl) remains the same. We conclude that during the transport cycle, the phosphorylated B domain has to move to the mannitol-binding site, located in the middle of the membrane, to phosphorylate mannitol.

  19. Pharmacophore screening of the protein data bank for specific binding site chemistry.

    Science.gov (United States)

    Campagna-Slater, Valérie; Arrowsmith, Andrew G; Zhao, Yong; Schapira, Matthieu

    2010-03-22

    A simple computational approach was developed to screen the Protein Data Bank (PDB) for putative pockets possessing a specific binding site chemistry and geometry. The method employs two commonly used 3D screening technologies, namely identification of cavities in protein structures and pharmacophore screening of chemical libraries. For each protein structure, a pocket finding algorithm is used to extract potential binding sites containing the correct types of residues, which are then stored in a large SDF-formatted virtual library; pharmacophore filters describing the desired binding site chemistry and geometry are then applied to screen this virtual library and identify pockets matching the specified structural chemistry. As an example, this approach was used to screen all human protein structures in the PDB and identify sites having chemistry similar to that of known methyl-lysine binding domains that recognize chromatin methylation marks. The selected genes include known readers of the histone code as well as novel binding pockets that may be involved in epigenetic signaling. Putative allosteric sites were identified on the structures of TP53BP1, L3MBTL3, CHEK1, KDM4A, and CREBBP.

  20. Gene and translation initiation site prediction in metagenomic sequences

    Energy Technology Data Exchange (ETDEWEB)

    Hyatt, Philip Douglas [ORNL]; LoCascio, Philip F [ORNL]; Hauser, Loren John [ORNL]; Uberbacher, Edward C [ORNL]

    2012-01-01

    Gene prediction in metagenomic sequences remains a difficult problem. Current sequencing technologies do not achieve sufficient coverage to assemble the individual genomes in a typical sample; consequently, sequencing runs produce a large number of short sequences whose exact origin is unknown. Since these sequences are usually smaller than the average length of a gene, algorithms must make predictions based on very little data. We present MetaProdigal, a metagenomic version of the gene prediction program Prodigal, that can identify genes in short, anonymous coding sequences with a high degree of accuracy. The novel value of the method consists of enhanced translation initiation site identification, ability to identify sequences that use alternate genetic codes and confidence values for each gene call. We compare the results of MetaProdigal with other methods and conclude with a discussion of future improvements.

  1. Binding Sites of miR-1273 Family on the mRNA of Target Genes

    Directory of Open Access Journals (Sweden)

    Anatoly Ivashchenko

    2014-01-01

    This study examined binding sites of 2,578 miRNAs in the mRNAs of 12,175 human genes using the MirTarget program. It found that the miRNAs of the miR-1273 family have between 33 and 1,074 mRNA target genes, with a free hybridization energy of 90% or more of its maximum value. The miR-1273 family consists of miR-1273a, miR-1273c, miR-1273d, miR-1273e, miR-1273f, miR-1273g-3p, miR-1273g-5p, miR-1273h-3p, and miR-1273h-5p. Unique miRNAs (miR-1273e, miR-1273f, and miR-1273g-3p) have more than 400 target genes. We established 99 mRNA nucleotide sequences that contain arranged binding sites for the miR-1273 family. High conservation of each miRNA binding site in the mRNA of the target genes was found. The arranged binding sites of the miR-1273 family are located in the 5′UTR, CDS, or 3′UTR of many mRNAs. Five repeating sites containing some of the miR-1273 family’s binding sites were found in the 3′UTR of several target genes. The oligonucleotide sequences of miR-1273 binding sites located in CDSs code for homologous amino acid sequences in the proteins of target genes. The biological role of unique miRNAs was also discussed.

  2. Kinetic and physiological effects of alterations in homologous isocitrate-binding sites of yeast NAD(+)-specific isocitrate dehydrogenase.

    Science.gov (United States)

    Lin, A P; McCammon, M T; McAlister-Henn, L

    2001-11-27

    Yeast NAD(+)-specific isocitrate dehydrogenase is an allosterically regulated octameric enzyme composed of four each of two homologous but nonidentical subunits designated IDH1 and IDH2. Models based on the crystallographic structure of Escherichia coli isocitrate dehydrogenase suggest that both yeast subunits contain isocitrate-binding sites. Identities in nine residue positions are predicted for the IDH2 site whereas four of the nine positions differ between the IDH1 and bacterial enzyme sites. Thus, we speculate that the IDH2 site is catalytic and that the IDH1 site may bind but not catalytically alter isocitrate. This was examined by kinetic analyses of enzymes with independent and concerted replacement of residues in each yeast IDH subunit site with the residues that differ in the other subunit site. Mutant enzymes were expressed in a yeast strain containing disrupted IDH1 and IDH2 loci and affinity-purified for kinetic analyses. The primary effects of various residue replacements in IDH2 were reductions of 30->300-fold in V(max) values, consistent with the catalytic function of this subunit. In contrast, replacement of all four residues in IDH1 produced a 17-fold reduction in V(max) under the same assay conditions, suggesting that the IDH1 site is not the primary catalytic site. However, single or multiple residue replacements in IDH1 uniformly increased half-saturation concentrations for isocitrate, implying that isocitrate can be bound at this site. Both subunits appear to contribute to cooperativity with respect to isocitrate, but AMP activation is lost only with residue replacements in IDH1. Overall, results are consistent with isocitrate binding by IDH2 for catalysis and with isocitrate binding by IDH1 being a prerequisite for allosteric activation by AMP. The effects of residue substitutions on enzyme function in vivo were assessed by analysis of various growth phenotypes. Results indicate a positive correlation between the level of IDH catalytic

  3. Computational identification of developmental enhancers: conservation and function of transcription factor binding-site clusters in Drosophila melanogaster and Drosophila pseudoobscura

    Energy Technology Data Exchange (ETDEWEB)

    Berman, Benjamin P.; Pfeiffer, Barret D.; Laverty, Todd R.; Salzberg, Steven L.; Rubin, Gerald M.; Eisen, Michael B.; Celniker, Susan E.

    2004-08-06

    The identification of sequences that control transcription in metazoans is a major goal of genome analysis. In a previous study, we demonstrated that searching for clusters of predicted transcription factor binding sites could discover active regulatory sequences, and identified 37 regions of the Drosophila melanogaster genome with high densities of predicted binding sites for five transcription factors involved in anterior-posterior embryonic patterning. Nine of these clusters overlapped known enhancers. Here, we report the results of in vivo functional analysis of 27 remaining clusters. We generated transgenic flies carrying each cluster attached to a basal promoter and reporter gene, and assayed embryos for reporter gene expression. Six clusters are enhancers of adjacent genes: giant, fushi tarazu, odd-skipped, nubbin, squeeze and pdm2; three drive expression in patterns unrelated to those of neighboring genes; the remaining 18 do not appear to have enhancer activity. We used the Drosophila pseudoobscura genome to compare patterns of evolution in and around the 15 positive and 18 false-positive predictions. Although conservation of primary sequence cannot distinguish true from false positives, conservation of binding-site clustering accurately discriminates functional binding-site clusters from those with no function. We incorporated conservation of binding-site clustering into a new genome-wide enhancer screen, and predict several hundred new regulatory sequences, including 85 adjacent to genes with embryonic patterns. Measuring conservation of sequence features closely linked to function--such as binding-site clustering--makes better use of comparative sequence data than commonly used methods that examine only sequence identity.
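
    The cluster search described in this abstract can be illustrated with a minimal sketch. It assumes that predicted binding-site positions are already available as integer coordinates; the function name `find_site_clusters`, the window size, and the minimum site count are hypothetical choices for illustration, not the parameters used by the authors.

```python
from bisect import bisect_left


def find_site_clusters(positions, window, min_sites):
    """Return merged (start, end) spans where at least `min_sites`
    predicted sites fall inside a window of `window` bases.

    Illustrative only: the window is anchored at each site in turn,
    and overlapping qualifying spans are merged.
    """
    pos = sorted(positions)
    clusters = []
    for i, start in enumerate(pos):
        # Index of the first site at or beyond the end of the window.
        j = bisect_left(pos, start + window)
        if j - i >= min_sites:
            end = pos[j - 1]
            if clusters and start <= clusters[-1][1]:
                # Overlaps the previous cluster, so extend it.
                prev_start, prev_end = clusters[-1]
                clusters[-1] = (prev_start, max(prev_end, end))
            else:
                clusters.append((start, end))
    return clusters


# Toy coordinates (hypothetical), two dense regions and two isolated sites.
sites = [100, 150, 220, 300, 5000, 9000, 9050, 9100]
print(find_site_clusters(sites, window=300, min_sites=3))
```

    Conservation of clustering, as opposed to conservation of sequence, could then be checked by running the same search on the orthologous region of a second genome and comparing the resulting spans.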

  4. Effects of sodium on cell surface and intracellular 3H-naloxone binding sites

    Energy Technology Data Exchange (ETDEWEB)

    Pollack, A.E.; Wooten, G.F.

    1987-07-27

    The binding of the opiate antagonist 3H-naloxone was examined in rat whole brain homogenates and in crude subcellular fractions of these homogenates (nuclear, synaptosomal, and mitochondrial fractions) using buffers that approximated intra- (low sodium concentration) and extracellular (high sodium concentration) fluids. Saturation studies showed a two-fold decrease in the dissociation constant (Kd) in all subcellular fractions examined in extracellular buffer compared to intracellular buffer. In contrast, there was no significant effect of the buffers on the Bmax. Thus, 3H-naloxone did not distinguish between binding sites present on cell surface and intracellular tissues in these two buffers. These results show that the sodium effect on opiate antagonist binding is probably not a function of altered selection of intra- and extracellular binding sites. 17 references, 2 tables.

  5. Prenatal exposure to methylmercury alters development of adrenergic receptor binding sites in peripheral sympathetic target tissues

    Energy Technology Data Exchange (ETDEWEB)

    Slotkin, T.A.; Orband, L.; Cowdery, T.; Kavlock, R.J.; Bartolome, J.

    1987-01-01

    In order to assess the impact of prenatal exposure to methylmercury on sympathetic neurotransmission, effects on the development of adrenergic receptor binding sites in peripheral tissues were evaluated. In the liver, methylmercury produced a dose-dependent increase in alpha1-, alpha2-, and beta-receptor binding of radioligands throughout the first 5 weeks of postnatal life. Similarly, renal alpha-receptor subtypes showed increased binding capabilities, but binding to beta-receptor sites was reduced. At least some of the changes in receptors appear to be of functional significance, as physiological reactivity to adrenergic stimulation is altered in the same directions in these two tissues. The actions of methylmercury displayed tissue specificity in that the same receptor populations were largely unaffected in other tissues (lung, heart). These results suggest that methylmercury exposure in utero alters adrenergic responses through targeted effects on postsynaptic receptor populations in specific tissues.

  6. Discovering structural motifs using a structural alphabet: Application to magnesium-binding sites

    Directory of Open Access Journals (Sweden)

    Lim Carmay

    2007-03-01

    Background: For many metalloproteins, sequence motifs characteristic of metal-binding sites have not been found or are so short that they would not be expected to be metal-specific. Striking examples of such metalloproteins are those containing Mg2+, one of the most versatile metal cofactors in cellular biochemistry. Even when Mg2+-proteins share insufficient sequence homology to identify Mg2+-specific sequence motifs, they may still share similarity in the Mg2+-binding site structure. However, no structural motifs characteristic of Mg2+-binding sites have been reported. Thus, our aims are (i) to develop a general method for discovering structural patterns/motifs characteristic of ligand-binding sites, given the 3D protein structures, and (ii) to apply it to Mg2+-proteins sharing [...] Mg2+-structural motifs are identified as recurring structural patterns. Results: The structural alphabet-based motif discovery method has revealed the structural preference of Mg2+-binding sites for certain local/secondary structures: compared to all residues in the Mg2+-proteins, both first and second-shell Mg2+-ligands prefer loops to helices. Even when the Mg2+-proteins share no significant sequence homology, some of them share a similar Mg2+-binding site structure: 4 Mg2+-structural motifs, comprising 21% of the binding sites, were found. In particular, one of the Mg2+-structural motifs found maps to a specific functional group, namely, hydrolases. Furthermore, 2 of the motifs were not found in non-metalloproteins or in Ca2+-binding proteins. The structural motifs discovered thus capture some essential biochemical and/or evolutionary properties, and hence may be useful for discovering proteins where Mg2+ plays an important biological role. Conclusion: The structural motif discovery method presented herein is general and can be applied to any set of proteins with known 3D structures. This new method is timely considering the increasing number of structures for

  7. Prediction of the binding affinities of peptides to class II MHC using a regularized thermodynamic model

    Directory of Open Access Journals (Sweden)

    Mittelmann Hans D

    2010-01-01

    Background: The binding of peptide fragments of extracellular peptides to class II MHC is a crucial event in the adaptive immune response. Each MHC allotype generally binds a distinct subset of peptides and the enormous number of possible peptide epitopes prevents their complete experimental characterization. Computational methods can utilize the limited experimental data to predict the binding affinities of peptides to class II MHC. Results: We have developed the Regularized Thermodynamic Average, or RTA, method for predicting the affinities of peptides binding to class II MHC. RTA accounts for all possible peptide binding conformations using a thermodynamic average and includes a parameter constraint for regularization to improve accuracy on novel data. RTA was shown to achieve higher accuracy, as measured by AUC, than SMM-align on the same data for all 17 MHC allotypes examined. RTA also gave the highest accuracy on all but three allotypes when compared with results from 9 different prediction methods applied to the same data. In addition, the method correctly predicted the peptide binding register of 17 out of 18 peptide-MHC complexes. Finally, we found that suboptimal peptide binding registers, which are often ignored in other prediction methods, made significant contributions of at least 50% of the total binding energy for approximately 20% of the peptides. Conclusions: The RTA method accurately predicts peptide binding affinities to class II MHC and accounts for multiple peptide binding registers while reducing overfitting through regularization. The method has potential applications in vaccine design and in understanding autoimmune disorders. A web server implementing the RTA prediction method is available at http://bordnerlab.org/RTA/.
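
    The idea of averaging over all binding registers, rather than keeping only the best one, can be sketched as follows. This is a generic Boltzmann-weighted free energy, not the published RTA formula or its regularization; the function names, the toy energy matrix, the 3-residue core (class II MHC cores are usually modelled with 9 residues), and the value of kT are all assumptions made for illustration.

```python
import math


def register_energies(peptide, energy_matrix):
    """Energy of each binding register (sliding window over the peptide).

    `energy_matrix[i][aa]` is a hypothetical per-position, per-residue
    energy; residues missing from the matrix contribute 0.0.
    """
    core_len = len(energy_matrix)
    return [
        sum(energy_matrix[i].get(aa, 0.0)
            for i, aa in enumerate(peptide[s:s + core_len]))
        for s in range(len(peptide) - core_len + 1)
    ]


def ensemble_free_energy(energies, kT):
    """-kT * ln(sum_r exp(-E_r / kT)), computed stably around the minimum."""
    e_min = min(energies)
    return e_min - kT * math.log(
        sum(math.exp(-(e - e_min) / kT) for e in energies))


def register_weights(energies, kT):
    """Boltzmann weight of each register; weights sum to 1."""
    e_min = min(energies)
    factors = [math.exp(-(e - e_min) / kT) for e in energies]
    total = sum(factors)
    return [f / total for f in factors]


# Toy 3-position matrix with made-up energies (kcal/mol).
toy_matrix = [{"L": -1.0, "F": -0.8}, {"A": -0.2}, {"K": -0.9, "L": -0.5}]
energies = register_energies("LAKFAL", toy_matrix)
print(energies)
print(ensemble_free_energy(energies, kT=0.6))
print(register_weights(energies, kT=0.6))
```

    In a sketch like this, a suboptimal register with an energy close to the minimum receives a substantial weight, which is the kind of contribution the abstract reports for roughly 20% of the peptides.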

  8. The Adenovirus Type 3 Dodecahedron's RGD Loop Comprises an HSPG Binding Site That Influences Integrin Binding

    Directory of Open Access Journals (Sweden)

    E. Gout

    2010-01-01

    Human type 3 adenovirus dodecahedron (a virus-like particle made of twelve penton bases) features the ability to enter cells through Heparan Sulphate Proteoglycans (HSPGs) and integrins interaction and is used as a versatile vector to deliver DNA or proteins. Cryo-EM reconstruction of the pseudoviral particle with Heparan Sulphate (HS) oligosaccharide shows an extradensity on the RGD loop. A set of mutants was designed to study the respective roles of the RGD sequence (RGE mutant) and of a basic sequence located just downstream. Results showed that the RGE mutant binding to the HS-deficient CHO-2241 cells was abolished and, unexpectedly, mutation of the basic sequence (KQKR to AQAS) dramatically decreased integrin recognition by the viral pseudoparticle. This basic sequence is thus involved in integrin docking, showing a close interplay between HSPGs and integrin receptors.

  9. Zinc-induced oligomerization of zinc α2 glycoprotein reveals multiple fatty acid-binding sites.

    Science.gov (United States)

    Zahid, Henna; Miah, Layeque; Lau, Andy M; Brochard, Lea; Hati, Debolina; Bui, Tam T T; Drake, Alex F; Gor, Jayesh; Perkins, Stephen J; McDermott, Lindsay C

    2016-01-01

    Zinc α2 glycoprotein (ZAG) is an adipokine with a class I MHC protein fold and is associated with obesity and diabetes. Although its intrinsic ligand remains unknown, ZAG binds the dansylated C11 fatty acid 11-(dansylamino)undecanoic acid (DAUDA) in the groove between the α1 and α2 domains. The surface of ZAG has approximately 15 weak zinc-binding sites deemed responsible for precipitation from human plasma. In the present study the functional significance of these metal sites was investigated. Analytical ultracentrifugation (AUC) and CD showed that zinc, but not other divalent metals, causes ZAG to oligomerize in solution. Thus ZAG dimers and trimers were observed in the presence of 1 and 2 mM zinc. Molecular modelling of X-ray scattering curves and sedimentation coefficients indicated a progressive stacking of ZAG monomers, suggesting that the ZAG groove may be occluded in these. Using fluorescence-detected sedimentation velocity, these ZAG-zinc oligomers were again observed in the presence of the fluorescent boron dipyrromethene fatty acid C16-BODIPY (4,4-difluoro-5,7-dimethyl-4-bora-3a,4a-diaza-s-indacene-3-hexadecanoic acid). Fluorescence spectroscopy confirmed that ZAG binds C16-BODIPY. ZAG binding to C16-BODIPY, but not to DAUDA, was reduced by increased zinc concentrations. We conclude that the lipid-binding groove in ZAG contains at least two distinct fatty acid-binding sites for DAUDA and C16-BODIPY, similar to the multiple lipid binding seen in the structurally related immune protein CD1c. In addition, because high concentrations of zinc occur in the pancreas, the perturbation of these multiple lipid-binding sites by zinc may be significant in Type 2 diabetes where dysregulation of ZAG and zinc homoeostasis occurs.

  10. Predicting copper-, iron- and zinc-binding proteins in pathogenic species of the Paracoccidioides genus

    Directory of Open Access Journals (Sweden)

    Gabriel B Tristao

    2015-01-01

    Approximately one-third of all proteins have been estimated to contain at least one metal cofactor, and these proteins are referred to as metalloproteins. These represent one of the most diverse classes of proteins, containing metal ions that bind to specific sites to perform catalytic, regulatory and structural functions. Bioinformatic tools have been developed to predict metalloproteins encoded by an organism based only on its genome sequence. Their function and the type of metal bound can also be predicted via a bioinformatics approach. The Paracoccidioides complex includes thermodimorphic pathogenic fungi that are found as saprobic mycelia in the environment and as yeast, the parasitic form, in host tissues. They are the etiologic agents of paracoccidioidomycosis, a prevalent systemic mycosis in Latin America. Many metalloproteins are important for the virulence of several pathogenic microorganisms. Accordingly, the present work aimed to predict the copper, iron and zinc proteins encoded by the genomes of three phylogenetic species of Paracoccidioides (Pb01, Pb03 and Pb18). The metalloproteins were identified using bioinformatics approaches based on structure, annotation and domains. Cu-, Fe- and Zn-binding proteins represent 7% of the total proteins encoded by Paracoccidioides spp. genomes. Zinc proteins were the most abundant metalloproteins, representing 5.7% of the fungus proteome, whereas copper and iron proteins represent 0.3% and 1.2%, respectively. Functional classification revealed that metalloproteins are related to many cellular processes. Furthermore, it was observed that many of these metalloproteins serve as virulence factors in the biology of the fungus. Thus, it is concluded that the Cu, Fe and Zn metalloproteomes of the Paracoccidioides spp. are of the utmost importance for the biology and virulence of these particular human pathogens.

  11. The plasminogen binding site of the C-type lectin tetranectin is located in the carbohydrate recognition domain, and binding is sensitive to both calcium and lysine

    DEFF Research Database (Denmark)

    Graversen, Jonas Heilskov; Lorentsen, R H; Jacobsen, C

    1998-01-01

    Tetranectin, a homotrimeric protein belonging to the family of C-type lectins and structurally highly related to corresponding regions of the mannose-binding proteins, is known specifically to bind the plasminogen kringle 4 protein domain, an interaction sensitive to lysine. Surface plasmon...... resonance and isothermal calorimetry binding analyses using single-residue and deletion mutant tetranectin derivatives produced in Escherichia coli showed that the kringle 4 binding site resides in the carbohydrate recognition domain and includes residues of the putative carbohydrate binding site...

  12. Purification, molecular cloning, and expression of the mammalian sigma1-binding site.

    Science.gov (United States)

    Hanner, M; Moebius, F F; Flandorfer, A; Knaus, H G; Striessnig, J; Kempner, E; Glossmann, H

    1996-07-23

    Sigma-ligands comprise several chemically unrelated drugs such as haloperidol, pentazocine, and ditolylguanidine, which bind to a family of low molecular mass proteins in the endoplasmic reticulum. These so-called sigma-receptors are believed to mediate various pharmacological effects of sigma-ligands by as yet unknown mechanisms. Based on their opposite enantioselectivity for benzomorphans and different molecular masses, two subtypes are differentiated. We purified the sigma1-binding site as a single 30-kDa protein from guinea pig liver employing the benzomorphan(+)[3H]pentazocine and the arylazide (-)[3H]azidopamil as specific probes. The purified (+)[3H]pentazocine-binding protein retained its high affinity for haloperidol, pentazocine, and ditolylguanidine. Partial amino acid sequence obtained after trypsinolysis revealed no homology to known proteins. Radiation inactivation of the pentazocine-labeled sigma1-binding site yielded a molecular mass of 24 +/- 2 kDa. The corresponding cDNA was cloned using degenerate oligonucleotides and cDNA library screening. Its open reading frame encoded a 25.3-kDa protein with at least one putative transmembrane segment. The protein expressed in yeast cells transformed with the cDNA showed the pharmacological characteristics of the brain and liver sigma1-binding site. The deduced amino acid sequence was structurally unrelated to known mammalian proteins but it shared homology with fungal proteins involved in sterol synthesis. Northern blots showed high densities of the sigma1-binding site mRNA in sterol-producing tissues. This is also in agreement with the known ability of sigma1-binding sites to interact with steroids, such as progesterone.

  13. Deep sequencing of MYC DNA-binding sites in Burkitt lymphoma.

    Directory of Open Access Journals (Sweden)

    Volkhard Seitz

    BACKGROUND: MYC is a key transcription factor involved in central cellular processes such as regulation of the cell cycle, histone acetylation and ribosomal biogenesis. It is overexpressed in the majority of human tumors including aggressive B-cell lymphoma. Burkitt lymphoma (BL) in particular is a prime example of MYC overexpression due to a chromosomal translocation involving the c-MYC gene. However, no genome-wide analysis of MYC-binding sites by chromatin immunoprecipitation (ChIP) followed by next generation sequencing (ChIP-Seq) has been conducted in BL so far. METHODOLOGY/PRINCIPAL FINDINGS: ChIP-Seq was performed on 5 BL cell lines with a MYC-specific antibody giving rise to 7,054 MYC-binding sites after bioinformatics analysis of a total of approx. 19 million sequence reads. In line with previous findings, binding sites accumulate in gene sets known to be involved in the cell cycle, ribosomal biogenesis, histone acetyltransferase and methyltransferase complexes demonstrating a regulatory role of MYC in these processes. Unexpectedly, MYC-binding sites also accumulate in many B-cell relevant genes. To assess the functional consequences of MYC binding, the ChIP-Seq data were supplemented with siRNA-mediated knock-downs of MYC in BL cell lines followed by gene expression profiling. Interestingly, amongst others, genes involved in the B-cell function were up-regulated in response to MYC silencing. CONCLUSION/SIGNIFICANCE: The 7,054 MYC-binding sites identified by our ChIP-Seq approach greatly extend the knowledge regarding MYC binding in BL and shed further light on the enormous complexity of the MYC regulatory network. In particular, our observations that (i) many B-cell relevant genes are targeted by MYC and (ii) MYC down-regulation leads to an up-regulation of B-cell genes highlight an interesting aspect of BL biology.

  14. An AP1 binding site upstream of the kappa immunoglobulin intron enhancer binds inducible factors and contributes to expression.

    Science.gov (United States)

    Schanke, J T; Marcuzzi, A; Podzorski, R P; Van Ness, B

    1994-01-01

    Expression of the kappa immunoglobulin light chain gene requires developmental- and tissue-specific regulation by trans-acting factors which interact with two distinct enhancer elements. A new protein-DNA interaction has been identified upstream of the intron enhancer, within the matrix-associated region of the J-C intron. The binding activity is greatly inducible in pre-B cells by bacterial lipopolysaccharide and interleukin-1 but specific complexes are found at all stages of B cell development tested. The footprinted binding site is homologous to the consensus AP1 motif. The protein components of this complex are specifically competed by an AP1 consensus motif and were shown by supershift to include c-Jun and c-Fos, suggesting that this binding site is an AP1 motif and that the Jun and Fos families of transcription factors play a role in the regulation of the kappa light chain gene. Mutation of the AP1 motif in the context of the intron enhancer was shown to decrease enhancer-mediated activation of the promoter in both pre-B cells induced with LPS and constitutive expression in mature B cells. PMID: 7816634

  15. Towards the identification of the allosteric Phe-binding site in phenylalanine hydroxylase.

    Science.gov (United States)

    Carluccio, Carla; Fraternali, Franca; Salvatore, Francesco; Fornili, Arianna; Zagari, Adriana

    2016-01-01

    The enzyme phenylalanine hydroxylase (PAH) is defective in the inherited disorder phenylketonuria. PAH, a tetrameric enzyme, is highly regulated and displays positive cooperativity for its substrate, Phe. Whether Phe binds to an allosteric site is a matter of debate, despite several studies worldwide. To address this issue, we generated a dimeric model for Phe-PAH interactions, by performing molecular docking combined with molecular dynamics simulations on human and rat wild-type sequences and also on a human G46S mutant. Our results suggest that the allosteric Phe-binding site lies at the dimeric interface between the regulatory and the catalytic domains of two adjacent subunits. The structural and dynamical features of the site were characterized in depth and described. Interestingly, our findings provide evidence for lower allosteric Phe-binding ability of the G46S mutant than the human wild-type enzyme. This also explains the disease-causing nature of this mutant.

  16. BiPPred: Combined sequence- and structure-based prediction of peptide binding to the Hsp70 chaperone BiP.

    Science.gov (United States)

    Schneider, Markus; Rosam, Mathias; Glaser, Manuel; Patronov, Atanas; Shah, Harpreet; Back, Katrin Christiane; Daake, Marina Angelika; Buchner, Johannes; Antes, Iris

    2016-10-01

    Substrate binding to Hsp70 chaperones is involved in many biological processes, and the identification of potential substrates is important for a comprehensive understanding of these events. We present a multi-scale pipeline for an accurate, yet efficient prediction of peptides binding to the Hsp70 chaperone BiP by combining sequence-based prediction with molecular docking and MMPBSA calculations. First, we measured the binding of 15mer peptides from known substrate proteins of BiP by peptide array (PA) experiments and performed an accuracy assessment of the PA data by fluorescence anisotropy studies. Several sequence-based prediction models were fitted using this and other peptide binding data. A structure-based position-specific scoring matrix (SB-PSSM) derived solely from structural modeling data forms the core of all models. The matrix elements are based on a combination of binding energy estimations, molecular dynamics simulations, and analysis of the BiP binding site, which led to new insights into the peptide binding specificities of the chaperone. Using this SB-PSSM, peptide binders could be predicted with high selectivity even without training of the model on experimental data. Additional training further increased the prediction accuracies. Subsequent molecular docking (DynaDock) and MMGBSA/MMPBSA-based binding affinity estimations for predicted binders allowed the identification of the correct binding mode of the peptides as well as the calculation of nearly quantitative binding affinities. The general concept behind the developed multi-scale pipeline can readily be applied to other protein-peptide complexes with linearly bound peptides, for which sufficient experimental binding data for the training of classical sequence-based prediction models is not available. Proteins 2016; 84:1390-1407. © 2016 Wiley Periodicals, Inc.
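
    Scoring peptides with a position-specific scoring matrix, the core of the sequence-based step described above, can be sketched in a few lines. The matrix values, the 3-residue window, the penalty for residues missing from the matrix, and the function names are hypothetical; the actual SB-PSSM and its window length are not given in this abstract.

```python
def score_window(window, pssm, missing=-0.5):
    """Sum the per-position scores of one peptide window.

    `pssm[i][aa]` is the score of residue `aa` at position `i`;
    residues absent from the toy matrix get the `missing` penalty.
    """
    return sum(pssm[i].get(aa, missing) for i, aa in enumerate(window))


def best_binding_window(peptide, pssm, missing=-0.5):
    """Slide the matrix along the peptide; return (start, window, score)
    for the highest-scoring window."""
    width = len(pssm)
    candidates = [
        (start, peptide[start:start + width],
         score_window(peptide[start:start + width], pssm, missing))
        for start in range(len(peptide) - width + 1)
    ]
    return max(candidates, key=lambda c: c[2])


# Toy matrix with made-up scores; a real PSSM covers all 20 residues.
toy_pssm = [
    {"L": 1.0, "A": 0.2},
    {"V": 0.8, "A": 0.1},
    {"F": 1.2, "A": 0.0},
]
start, window, score = best_binding_window("AALVFA", toy_pssm)
print(start, window, round(score, 2))
```

    Peptides whose best window exceeds a chosen cutoff would then be passed on to the more expensive docking and binding-affinity steps.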

  17. Interaction of triprolidine hydrochloride with serum albumins: thermodynamic and binding characteristics, and influence of site probes.

    Science.gov (United States)

    Sandhya, B; Hegde, Ashwini H; Kalanur, Shankara S; Katrahalli, Umesha; Seetharamappa, J

    2011-04-01

    The interaction of triprolidine hydrochloride (TRP) with serum albumins, viz. bovine serum albumin (BSA) and human serum albumin (HSA), has been studied by spectroscopic methods. The experimental results revealed a static quenching mechanism in the interaction of TRP with protein. The number of binding sites, close to unity for both TRP-BSA and TRP-HSA, indicated the presence of a single class of binding site for the drug in protein. The binding constant values of TRP-BSA and TRP-HSA were observed to be (4.75 ± 0.018) × 10^3 and (2.42 ± 0.024) × 10^4 M^-1 at 294 K, respectively. Thermodynamic parameters indicated that hydrogen bonds and van der Waals forces played the major role in the binding of TRP to proteins. The distance of separation between the serum albumin and TRP was obtained from Förster's theory of non-radiative energy transfer. The metal ions, viz. K(+), Ca(2+), Co(2+), Cu(2+), Ni(2+), Mn(2+) and Zn(2+), were found to influence the binding of the drug to protein. Displacement experiments indicated the binding of TRP to Sudlow's site I on both BSA and HSA. The CD, 3D fluorescence spectra and FT-IR spectral results revealed the changes in the secondary structure of protein upon interaction with TRP.
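
    A binding constant and a number of binding sites are commonly extracted from fluorescence quenching data with the double-logarithm relation log[(F0 - F)/F] = log K + n log[Q]. The abstract does not state which equation was used, so the sketch below is an assumption about the general approach, and the data are synthetic values generated inside the example, not measurements from the study.

```python
import math


def fit_double_log(f0, quencher_conc, fluorescence):
    """Least-squares fit of log10((F0 - F)/F) = log10(K) + n*log10([Q]).

    Returns (K, n): the apparent binding constant and the number of
    binding sites. Pure-Python linear regression, no external packages.
    """
    xs = [math.log10(q) for q in quencher_conc]
    ys = [math.log10((f0 - f) / f) for f in fluorescence]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    intercept = mean_y - slope * mean_x
    return 10 ** intercept, slope


# Synthetic, noise-free data generated from assumed K and n.
true_k, true_n, f0 = 1.0e4, 1.0, 100.0
conc = [1e-5, 2e-5, 4e-5, 8e-5]                       # quencher, mol/L
signal = [f0 / (1 + true_k * q ** true_n) for q in conc]

k_fit, n_fit = fit_double_log(f0, conc, signal)
print(k_fit, n_fit)
```

    With real data the fitted slope is rarely exactly 1; values close to unity are usually read as a single class of binding site, which is how the abstract interprets its result.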

  18. Marked reduction in the number of platelet-tritiated imipramine binding sites in geriatric depression

    Energy Technology Data Exchange (ETDEWEB)

    Nemeroff, C.B.; Knight, D.L.; Krishnan, R.R.; Slotkin, T.A.; Bissette, G.; Melville, M.L.; Blazer, D.G.

    1988-10-01

    The number (Bmax) and affinity (Kd) of platelet-tritiated imipramine binding sites were determined in young and middle-aged controls 50 years of age and younger (n = 25), elderly normal controls over 60 years of age (n = 18), patients who fulfilled DSM-III criteria for major depression who were under 50 years of age (n = 29), patients who fulfilled DSM-III criteria for major depression who were 60 years of age and older (n = 19), and patients who fulfilled both DSM-III criteria for primary degenerative dementia and National Institute of Neurological and Communicative Disorders and Stroke-Alzheimer's Disease and Related Disorders Association criteria for probable Alzheimer's disease (n = 13). Both groups of depressed patients (under 50 and over 60 years of age) exhibited significant reductions (a 42% decrease) in the number of platelet-tritiated imipramine binding sites with no change in affinity, when compared with their age-matched controls. There was little overlap in Bmax values between the elderly depressed patients and their controls. The patients with probable Alzheimer's disease showed no alteration in platelet-tritiated imipramine binding. There was no statistically significant relationship between postdexamethasone plasma cortisol concentrations and tritiated imipramine binding. These results indicate that platelet-tritiated imipramine binding may have potential utility as a diagnostic adjunct in geriatric depression, and moreover that the reduction in the number of platelet-tritiated imipramine binding sites is not due to hypercortisolemia.

  19. Assessment of Mars Exploration Rover Landing Site Predictions

    Science.gov (United States)

    Golombek, M. P.

    2005-05-01

    Comprehensive analyses of remote sensing data during the 3-year effort to select the Mars Exploration Rover landing sites at Gusev crater and Meridiani Planum correctly predicted the safe and trafficable surfaces explored by the two rovers. Gusev crater was predicted to be a relatively low relief surface that was comparably dusty, but less rocky than the Viking landing sites. Available data for Meridiani Planum indicated a very flat plain composed of basaltic sand to granules and hematite that would look completely unlike any of the existing landing sites with a dark, low albedo surface, little dust and very few rocks. Orbital thermal inertia measurements of 315 J m^-2 s^-0.5 K^-1 at Gusev suggested surfaces dominated by duricrust to cemented soil-like materials or cohesionless sand or granules, which is consistent with observed soil characteristics and measured thermal inertias from the surface. THEMIS thermal inertias along the traverse at Gusev vary from 285 at the landing site to 330 around Bonneville rim and show systematic variations that can be related to the observed increase in rock abundance (5-30%). Meridiani has an orbital bulk inertia of ~200, similar to measured surface inertias that correspond to observed surfaces dominated by 0.2 mm sand size particles. Rock abundance derived from orbital thermal differencing techniques suggested that Meridiani Planum would have very low rock abundance, consistent with the rock free plain traversed by Opportunity. Spirit landed in an 8% orbital rock abundance pixel, consistent with the measured 7% of the surface covered by rocks >0.04 m diameter at the landing site, which is representative of the plains away from craters. The orbital albedo of the Spirit traverse varies from 0.19 to 0.30, consistent with surface measurements in and out of dust devil tracks. Opportunity is the first landing in a low albedo portion of Mars as seen from orbit, which is consistent with the dark, dust-free surface and measured albedos. 

  20. Bisphenol A binds to the local anesthetic receptor site to block the human cardiac sodium channel.

    Directory of Open Access Journals (Sweden)

    Andrias O O'Reilly

    Bisphenol A (BPA) has attracted considerable public attention as it leaches from plastic used in food containers, is detectable in human fluids, and has been linked by recent epidemiologic studies with diseases including cardiovascular disorders. As heart toxicity may derive from modified cardiac electrophysiology, we investigated the interaction between BPA and hNav1.5, the predominant voltage-gated sodium channel subtype expressed in the human heart. Electrophysiology studies of heterologously expressed hNav1.5 determined that BPA blocks the channel with a Kd of 25.4±1.3 µM. By comparing the effects of BPA and the local anesthetic mexiletine on wild type hNav1.5 and the F1760A mutant, we demonstrate that both compounds share an overlapping binding site. With a key binding determinant thus identified, a homology model of hNav1.5 was generated based on the recently reported crystal structure of the bacterial voltage-gated sodium channel NavAb. Docking predictions position both ligands in a cavity delimited by F1760 and contiguous with the DIII-IV pore fenestration. Steered molecular dynamics simulations used to assess routes of ligand ingress indicate that the DIII-IV pore fenestration is a viable access pathway. Therefore, BPA block of the human heart sodium channel involves the local anesthetic receptor, and both BPA and mexiletine may enter the closed-state pore via membrane-located side fenestrations.

  1. Predicting N-terminal myristoylation sites in plant proteins

    Directory of Open Access Journals (Sweden)

    Podell Sheila

    2004-06-01

    Background: N-terminal myristoylation plays a vital role in membrane targeting and signal transduction in plant responses to environmental stress. Although N-myristoyltransferase enzymatic function is conserved across plant, animal, and fungal kingdoms, exact substrate specificities vary, making it difficult to predict protein myristoylation accurately within specific taxonomic groups. Results: A new method for predicting N-terminal myristoylation sites specifically in plants has been developed and statistically tested for sensitivity, specificity, and robustness. Compared to previously available methods, the new model is both more sensitive in detecting known positives, and more selective in avoiding false positives. Scores of myristoylated and non-myristoylated proteins are more widely separated than with other methods, greatly reducing ambiguity and the number of sequences giving intermediate, uninformative results. The prediction model is available at http://plantsp.sdsc.edu/myrist.html. Conclusion: Superior performance of the new model is due to the selection of a plant-specific training set, covering 266 unique sequence examples from 40 different species, the use of a probability-based hidden Markov model to obtain predictive scores, and a threshold cutoff value chosen to provide maximum positive-negative discrimination. The new model has been used to predict 589 plant proteins likely to contain N-terminal myristoylation signals, and to analyze the functional families in which these proteins occur.

  2. Functional diversification of paralogous transcription factors via divergence in DNA binding site motif and in expression.

    Directory of Open Access Journals (Sweden)

    Larry N Singh

    BACKGROUND: Gene duplication is a major driver of evolutionary innovation as it allows for an organism to elaborate its existing biological functions via specialization or diversification of initially redundant gene paralogs. Gene function can diversify in several ways. Transcription factor gene paralogs in particular, can diversify either by changes in their tissue-specific expression pattern or by changes in the DNA binding site motif recognized by their protein product, which in turn alters their gene targets. The relationship between these two modes of functional diversification of transcription factor paralogs has not been previously investigated, and is essential for understanding adaptive evolution of transcription factor gene families. FINDINGS: Based on a large set of human paralogous transcription factor pairs, we show that when the DNA binding site motifs of transcription factor paralogs are similar, the expressions of the genes that encode the paralogs have diverged, so in general, at most one of the paralogs is highly expressed in a tissue. Moreover, paralogs with diverged DNA binding site motifs tend to be diverged in their function. Conversely, two paralogs that are highly expressed in a tissue tend to have dissimilar DNA binding site motifs. We have also found that in general, within a paralogous family, tissue-specific decrease in gene expression is more frequent than what is expected by chance. CONCLUSIONS: While previous investigations of paralogous gene diversification have only considered coding sequence divergence, by explicitly quantifying divergence in DNA binding site motif, our work presents a new paradigm for investigating functional diversification. Consistent with evolutionary expectation, our quantitative analysis suggests that paralogous transcription factors have survived extinction in part, either through diversification of their DNA binding site motifs or through alterations in their tissue-specific expression

  3. Disruption of NAD+ binding site in glyceraldehyde 3-phosphate dehydrogenase affects its intranuclear interactions

    Institute of Scientific and Technical Information of China (English)

    Manali Phadke; Natalia Krynetskaia; Anurag Mishra; Carlos Barrero; Salim Merali; Scott A Gothe; Evgeny Krynetskiy

    2015-01-01

    AIM: To characterize phosphorylation of human glyceraldehyde 3-phosphate dehydrogenase (GAPDH), and mobility of GAPDH in cancer cells treated with chemotherapeutic agents. METHODS: We used proteomics analysis to detect and characterize phosphorylation sites within human GAPDH. Site-specific mutagenesis and alanine scanning were then performed to evaluate the functional significance of phosphorylation sites in the GAPDH polypeptide chain. Enzymatic properties of mutated GAPDH variants were assessed using kinetic studies. Intranuclear dynamics parameters (diffusion coefficient and the immobile fraction) were estimated using fluorescence recovery after photobleaching (FRAP) experiments and confocal microscopy. Molecular modeling experiments were performed to estimate the effects of mutations on NAD+ cofactor binding. RESULTS: Using MALDI-TOF analysis, we identified novel phosphorylation sites within the NAD+ binding center of GAPDH at Y94, S98, and T99. Using a polyclonal antibody specific to a phospho-T99-containing peptide within GAPDH, we demonstrated accumulation of phospho-T99-GAPDH in the nuclear fractions of A549, HCT116, and SW48 cancer cells after cytotoxic stress. We performed site-mutagenesis, and estimated enzymatic properties, intranuclear distribution, and intranuclear mobility of GAPDH mutated variants. Site-mutagenesis at positions S98 and T99 in the NAD+ binding center reduced enzymatic activity of GAPDH due to decreased affinity to NAD+ (Km = 741 ± 257 μmol/L in T99I vs 57 ± 11.1 μmol/L in wild type GAPDH). Molecular modeling experiments revealed the effect of mutations on NAD+ binding with GAPDH. FRAP analysis showed that mutations in the NAD+ binding center of GAPDH abrogated its intranuclear interactions. CONCLUSION: Our results suggest an important functional role of phosphorylated amino acids in the NAD+ binding center in GAPDH interactions with its intranuclear partners.
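
    To put the two reported Km values in perspective, the sketch below computes the fraction of maximal velocity each variant would reach at a given NAD+ concentration. It assumes simple Michaelis-Menten kinetics with respect to NAD+, which the abstract does not state, and the 100 μmol/L NAD+ concentration is a hypothetical value chosen only for illustration.

```python
def fractional_velocity(substrate_um: float, km_um: float) -> float:
    """Return v/Vmax for a Michaelis-Menten enzyme: S / (Km + S)."""
    return substrate_um / (km_um + substrate_um)


# Km values quoted in the abstract (μmol/L).
KM_WILD_TYPE = 57.0
KM_T99I = 741.0

# Hypothetical NAD+ concentration (μmol/L), not taken from the abstract.
NAD_CONCENTRATION = 100.0

wild_type_fraction = fractional_velocity(NAD_CONCENTRATION, KM_WILD_TYPE)
t99i_fraction = fractional_velocity(NAD_CONCENTRATION, KM_T99I)

print(f"wild type: {wild_type_fraction:.2f} of Vmax")
print(f"T99I:      {t99i_fraction:.2f} of Vmax")
```

    Under these assumptions the higher Km of the T99I variant translates into a much smaller fraction of Vmax at the same cofactor concentration, which is the sense in which decreased affinity reduces activity.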

  4. Surface binding sites in amylase have distinct roles in recognition of starch structure motifs and degradation.

    Science.gov (United States)

    Cockburn, Darrell; Nielsen, Morten M; Christiansen, Camilla; Andersen, Joakim M; Rannes, Julie B; Blennow, Andreas; Svensson, Birte

    2015-04-01

    Carbohydrate converting enzymes often possess extra substrate binding regions that enhance their activity. These can be found either on separate domains termed carbohydrate binding modules or as so-called surface binding sites (SBSs) situated on the catalytic domain. SBSs are common in starch degrading enzymes and critically important for their function. The affinity towards a variety of starch granules as well as soluble poly- and oligosaccharides of barley α-amylase 1 (AMY1) wild-type and mutants of two SBSs (SBS1 and SBS2) was investigated using Langmuir binding analysis, confocal laser scanning microscopy, affinity gel electrophoresis and surface plasmon resonance to unravel functional roles of the SBSs. SBS1 was critical for binding to different starch types as Kd increased by 7-62-fold or was not measurable upon mutation. By contrast SBS2 was particularly important for binding to soluble polysaccharides and oligosaccharides with α-1,6 linkages, suggesting that branch points are key structural elements in recognition by SBS2. Mutation at both SBS1 and SBS2 eliminated binding to all starch granule types tested. Taken together, the findings indicate that the two SBSs act in concert to localize AMY1 to the starch granule surface and that SBS2 works synergistically with the active site in the degradation of amylopectin.

  5. Autoradiographic distribution of ¹²⁵I-galanin binding sites in the rat central nervous system

    Energy Technology Data Exchange (ETDEWEB)

    Skofitsch, G.; Sills, M.A.; Jacobowitz, D.M.

    1986-11-01

    Galanin (GAL) binding sites in coronal sections of the rat brain were demonstrated using autoradiographic methods. Scatchard analysis of ¹²⁵I-GAL binding to slide-mounted tissue sections revealed saturable binding to a single class of receptors with a Kd of approximately 0.2 nM. ¹²⁵I-GAL binding sites were demonstrated throughout the rat central nervous system. Dense binding was observed in the following areas: prefrontal cortex, the anterior nuclei of the olfactory bulb, several nuclei of the amygdaloid complex, the dorsal septal area, dorsal bed nucleus of the stria terminalis, the ventral pallidum, the internal medullary laminae of the thalamus, medial pretectal nucleus, nucleus of the medial optic tract, borderline area of the caudal spinal trigeminal nucleus adjacent to the spinal trigeminal tract, the substantia gelatinosa and the superficial layers of the dorsal spinal cord. Moderate binding was observed in the piriform, periamygdaloid, entorhinal, insular cortex and the subiculum, the nucleus accumbens, medial forebrain bundle, anterior hypothalamic, ventromedial, dorsal premamillary, lateral and periventricular thalamic nuclei, the subzona incerta, Forel's field H1 and H2, periventricular gray matter, medial and superficial gray strata of the superior colliculus, dorsal parts of the central gray, peripeduncular area, the interpeduncular nucleus, substantia nigra zona compacta, ventral tegmental area, the dorsal and ventral parabrachial and parvocellular reticular nuclei. The preponderance of GAL-binding in somatosensory as well as in limbic areas suggests a possible involvement of GAL in a variety of brain functions.
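
    For readers unfamiliar with Scatchard analysis, the following is a minimal sketch of the idea for a single class of sites: plotting bound/free against bound gives a straight line with slope -1/Kd and x-intercept Bmax. The data are synthetic and noise-free, generated from the Kd of 0.2 nM quoted above; the Bmax value, the free-ligand concentrations, and the function name are hypothetical and do not come from the study.

```python
def scatchard_fit(free, bound):
    """Fit a line to (bound, bound/free) and return (Kd, Bmax).

    For one class of sites: bound/free = Bmax/Kd - bound/Kd,
    so slope = -1/Kd and the x-intercept equals Bmax.
    """
    xs = list(bound)
    ys = [b / f for b, f in zip(bound, free)]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    kd = -1.0 / slope
    bmax = -intercept / slope
    return kd, bmax


KD_TRUE = 0.2      # nM, the value quoted in the abstract
BMAX_TRUE = 100.0  # arbitrary units, hypothetical

# Hypothetical free-ligand concentrations (nM) and the bound amounts
# predicted by a single-site saturation isotherm.
free = [0.05, 0.1, 0.2, 0.4, 0.8, 1.6]
bound = [BMAX_TRUE * f / (KD_TRUE + f) for f in free]

kd_est, bmax_est = scatchard_fit(free, bound)
print(f"Kd = {kd_est:.3f} nM, Bmax = {bmax_est:.1f}")
```

    With real, noisy data a direct nonlinear fit of the saturation curve is often preferred, because the Scatchard transformation distorts the error structure; the linear form is shown here only because it is the analysis the abstract names.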

  6. Role of DNA binding sites and slow unbinding kinetics in titration-based oscillators.

    Science.gov (United States)

    Karapetyan, Sargis; Buchler, Nicolas E

    2015-12-01

    Genetic oscillators, such as circadian clocks, are constantly perturbed by molecular noise arising from the small number of molecules involved in gene regulation. One of the strongest sources of stochasticity is the binary noise that arises from the binding of a regulatory protein to a promoter in the chromosomal DNA. In this study, we focus on two minimal oscillators based on activator titration and repressor titration to understand the key parameters that are important for oscillations and for overcoming binary noise. We show that the rate of unbinding from the DNA, despite traditionally being considered a fast parameter, needs to be slow to broaden the space of oscillatory solutions. The addition of multiple, independent DNA binding sites further expands the oscillatory parameter space for the repressor-titration oscillator and lengthens the period of both oscillators. This effect is a combination of increased effective delay of the unbinding kinetics due to multiple binding sites and increased promoter ultrasensitivity that is specific for repression. We then use stochastic simulation to show that multiple binding sites increase the coherence of oscillations by mitigating the binary noise. Slow values of DNA unbinding rate are also effective in alleviating molecular noise due to the increased distance from the bifurcation point. Our work demonstrates how the number of DNA binding sites and slow unbinding kinetics, which are often omitted in biophysical models of gene circuits, can have a significant impact on the temporal and stochastic dynamics of genetic oscillators.

  7. Role of DNA binding sites and slow unbinding kinetics in titration-based oscillators

    Science.gov (United States)

    Karapetyan, Sargis; Buchler, Nicolas E.

    2015-12-01

    Genetic oscillators, such as circadian clocks, are constantly perturbed by molecular noise arising from the small number of molecules involved in gene regulation. One of the strongest sources of stochasticity is the binary noise that arises from the binding of a regulatory protein to a promoter in the chromosomal DNA. In this study, we focus on two minimal oscillators based on activator titration and repressor titration to understand the key parameters that are important for oscillations and for overcoming binary noise. We show that the rate of unbinding from the DNA, despite traditionally being considered a fast parameter, needs to be slow to broaden the space of oscillatory solutions. The addition of multiple, independent DNA binding sites further expands the oscillatory parameter space for the repressor-titration oscillator and lengthens the period of both oscillators. This effect is a combination of increased effective delay of the unbinding kinetics due to multiple binding sites and increased promoter ultrasensitivity that is specific for repression. We then use stochastic simulation to show that multiple binding sites increase the coherence of oscillations by mitigating the binary noise. Slow values of DNA unbinding rate are also effective in alleviating molecular noise due to the increased distance from the bifurcation point. Our work demonstrates how the number of DNA binding sites and slow unbinding kinetics, which are often omitted in biophysical models of gene circuits, can have a significant impact on the temporal and stochastic dynamics of genetic oscillators.

  8. Cortisol decreases 2-[¹²⁵I]iodomelatonin binding sites in the duck thymus

    Energy Technology Data Exchange (ETDEWEB)

    Poon, A.M.S.; Liu, Z.M.; Tang, F.; Pang, S.F. (Univ. of Hong Kong (China))

    1994-03-01

    The immunosuppressive effect of chronic glucocorticoid treatment on 2-[¹²⁵I]iodomelatonin binding in the duck thymus was studied. Two-week-old ducks were injected intraperitoneally with either 1 mg of cortisol per day (experimental group) or an equivalent volume of vehicle (control group) in the middle of the light period for seven days. 2-[¹²⁵I]iodomelatonin binding assays were performed on thymic membranes. Cortisol injection reduced the body weight gain, size of the bursa of Fabricius and absolute weights of the primary lymphoid organs but had no effect on the spleen weights. The relative weights of the spleen were increased while those of the primary lymphoid organs were unchanged. The density of the thymus 2-[¹²⁵I]iodomelatonin binding sites was decreased while the affinity was not affected. The modulation of the thymic 2-[¹²⁵I]iodomelatonin binding sites by changes in the immune status of the duck suggests that these binding sites represent physiologically relevant melatonin receptors and that melatonin exerts its action on the lymphoid tissues directly. The authors' findings support the hypothesis that the thymus is the target site for the immunomodulatory interactions between the pineal melatonin and the adrenal steroids. A possible inhibitory influence of adrenal steroids on the immuno-enhancing effect of melatonin is also suggested. 34 refs., 3 tabs.

  9. MetWAMer: eukaryotic translation initiation site prediction

    Directory of Open Access Journals (Sweden)

    Brendel Volker

    2008-09-01

    Background: Translation initiation site (TIS) identification is an important aspect of the gene annotation process, requisite for the accurate delineation of protein sequences from transcript data. We have developed the MetWAMer package for TIS prediction in eukaryotic open reading frames of non-viral origin. MetWAMer can be used as a stand-alone, third-party tool for post-processing gene structure annotations generated by external computational programs and/or pipelines, or directly integrated into gene structure prediction software implementations. Results: MetWAMer currently implements five distinct methods for TIS prediction, the most accurate of which is a routine that combines weighted, signal-based translation initiation site scores and the contrast in coding potential of sequences flanking TISs using a perceptron. Also, our program implements clustering capabilities through use of the k-medoids algorithm, thereby enabling cluster-specific TIS parameter utilization. In practice, our static weight array matrix-based indexing method for parameter set lookup can be used with good results in data sets exhibiting moderate levels of 5'-complete coverage. Conclusion: We demonstrate that improvements in statistically-based models for TIS prediction can be achieved by taking the class of each potential start-methionine into account pending certain testing conditions, and that our perceptron-based model is suitable for the TIS identification task. MetWAMer represents a well-documented, extensible, and freely available software system that can be readily re-trained for differing target applications and/or extended with existing and novel TIS prediction methods, to support further research efforts in this area.
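
    The idea of combining two scores with a perceptron can be illustrated with a small sketch. This is not MetWAMer's implementation: the two input features stand in for a signal-based TIS score and a coding-potential contrast, and the training values, learning rate, and function names are all hypothetical.

```python
def train_perceptron(samples, labels, epochs=20, learning_rate=0.1):
    """Train a two-input perceptron with the classic error-driven update rule."""
    weights = [0.0, 0.0]
    bias = 0.0
    for _ in range(epochs):
        for (signal_score, coding_contrast), label in zip(samples, labels):
            activation = weights[0] * signal_score + weights[1] * coding_contrast + bias
            predicted = 1 if activation > 0 else 0
            error = label - predicted  # 0 when the prediction is already correct
            weights[0] += learning_rate * error * signal_score
            weights[1] += learning_rate * error * coding_contrast
            bias += learning_rate * error
    return weights, bias


def predict(weights, bias, features):
    """Return 1 (candidate TIS accepted) or 0 (rejected)."""
    activation = weights[0] * features[0] + weights[1] * features[1] + bias
    return 1 if activation > 0 else 0


# Hypothetical toy data: (signal-based score, coding-potential contrast).
samples = [(2.0, 1.0), (1.0, 2.0), (2.0, 2.0), (-1.0, -1.0), (-2.0, -1.0), (-1.0, -2.0)]
labels = [1, 1, 1, 0, 0, 0]  # 1 = true start site, 0 = decoy

weights, bias = train_perceptron(samples, labels)
predictions = [predict(weights, bias, s) for s in samples]
print(predictions)
```

    A perceptron of this kind only separates classes that are linearly separable in the chosen features, which is one reason a real predictor would need carefully designed scores and a held-out test set.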

  10. Does distant homology with Evf reveal a lipid binding site in Bacillus thuringiensis cytolytic toxins?

    Science.gov (United States)

    Rigden, Daniel J

    2009-05-19

    The Cry and Cyt classes of insecticidal toxins derived from the sporulating bacterium Bacillus thuringiensis are valuable substitutes for synthetic pesticides in agricultural contexts. Crystal structures and many biochemical data have provided insights into their molecular mechanisms, generally thought to involve oligomerization and pore formation, but have not localised the site on Cyt toxins responsible for selective binding of phospholipids containing unsaturated fatty acids. Here, distant homology between the structure of Cyt toxins and Erwinia virulence factor (Evf) is demonstrated which, along with sequence conservation analysis, allows a putative lipid binding site to be localised in the toxins.

  11. Discovery and mapping of an intracellular antagonist binding site at the chemokine receptor CCR2

    DEFF Research Database (Denmark)

    Zweemer, Annelien J M; Bunnik, Julia; Veenhuizen, Margo;

    2014-01-01

    The chemokine receptor CCR2 is a G protein-coupled receptor that is involved in many diseases characterized by chronic inflammation, and therefore a large variety of CCR2 small molecule antagonists has been developed. On the basis of their chemical structures these antagonists can roughly be divided into two groups with most likely two topographically distinct binding sites. The aim of the current study was to identify the binding site of one such group of ligands, exemplified by three allosteric antagonists, CCR2-RA-[R], JNJ-27141491, and SD-24. We first used a chimeric CCR2/CCR5 receptor...

  12. Zinc-induced oligomerization of zinc α2 glycoprotein reveals multiple fatty acid-binding sites

    OpenAIRE

    Zahid, Henna; Miah, Layeque; Lau, Andy; Brochard, Lea; Hati, Debolina; Bui, T. T.; Drake, A. F.; Gor, Jayesh; Perkins, Stephen J.; McDermott, Lindsay C.

    2016-01-01

    Zinc α2 glycoprotein (ZAG) is an adipokine with a class I MHC protein fold and is associated with obesity and diabetes. Although its intrinsic ligand remains unknown, ZAG binds the dansylated C11 fatty acid 11-(dansylamino)undecanoic acid (DAUDA) in the groove between the α1 and α2 domains. The surface of ZAG has approximately 15 weak zinc-binding sites deemed responsible for precipitation from human plasma. In the present study the functional significance of these metal sites was investigate...

  13. Determination of the binding sites for oxaliplatin on insulin using mass spectrometry-based approaches

    DEFF Research Database (Denmark)

    Møller, Charlotte; Sprenger, Richard R; Stürup, Stefan;

    2011-01-01

    ...and fragmentation of the intact insulin-oxaliplatin adduct using nano-electrospray ionisation quadrupole time-of-flight mass spectrometry (nESI-Q-ToF-MS), the major binding site was assigned to histidine5 on the insulin B chain. In order to simplify the interpretation of the mass spectrum, the disulphide bridges were reduced. This led to the additional identification of cysteine6 on the A chain as a binding site along with histidine5 on the B chain. Digestion of insulin-oxaliplatin with endoproteinase Glu-C (GluC) followed by reduction led to the formation of five peptides with Pt(dach) attached...

  14. Rational design of a protein that binds integrin αvβ3 outside the ligand binding site

    Science.gov (United States)

    Turaga, Ravi Chakra; Yin, Lu; Yang, Jenny J.; Lee, Hsiauwei; Ivanov, Ivaylo; Yan, Chunli; Yang, Hua; Grossniklaus, Hans E.; Wang, Siming; Ma, Cheng; Sun, Li; Liu, Zhi-Ren

    2016-01-01

    Integrin αvβ3 expression is altered in various diseases and has been proposed as a drug target. Here we use a rational design approach to develop a therapeutic protein, which we call ProAgio, that binds to integrin αvβ3 outside the classical ligand-binding site. We show ProAgio induces apoptosis of integrin αvβ3-expressing cells by recruiting and activating caspase 8 to the cytoplasmic domain of integrin αvβ3. ProAgio also has anti-angiogenic activity and strongly inhibits growth of tumour xenografts, but does not affect the established vasculature. Toxicity analyses demonstrate that ProAgio is not toxic to mice. Our study reports a new integrin-targeting agent with a unique mechanism of action, and provides a template for the development of integrin-targeting therapeutics. PMID:27241473

  15. Rational design of a protein that binds integrin αvβ3 outside the ligand binding site.

    Science.gov (United States)

    Turaga, Ravi Chakra; Yin, Lu; Yang, Jenny J; Lee, Hsiauwei; Ivanov, Ivaylo; Yan, Chunli; Yang, Hua; Grossniklaus, Hans E; Wang, Siming; Ma, Cheng; Sun, Li; Liu, Zhi-Ren

    2016-05-31

    Integrin αvβ3 expression is altered in various diseases and has been proposed as a drug target. Here we use a rational design approach to develop a therapeutic protein, which we call ProAgio, that binds to integrin αvβ3 outside the classical ligand-binding site. We show ProAgio induces apoptosis of integrin αvβ3-expressing cells by recruiting and activating caspase 8 to the cytoplasmic domain of integrin αvβ3. ProAgio also has anti-angiogenic activity and strongly inhibits growth of tumour xenografts, but does not affect the established vasculature. Toxicity analyses demonstrate that ProAgio is not toxic to mice. Our study reports a new integrin-targeting agent with a unique mechanism of action, and provides a template for the development of integrin-targeting therapeutics.

  16. Kinetic studies show that Ca2+ and Tb3+ have different binding preferences toward the four Ca2+-binding sites of calmodulin.

    Science.gov (United States)

    Wang, C L; Leavis, P C; Gergely, J

    1984-12-18

    The stepwise addition of Tb3+ to calmodulin yields a large tyrosine-sensitized Tb3+ luminescence enhancement as the third and fourth ions bind to the protein [Wang, C.-L. A., Aquaron, R. R., Leavis, P. C., & Gergely, J. (1982) Eur. J. Biochem. 124, 7-12]. Since the only tyrosine residues in calmodulin are located within binding sites III and IV, these results suggest that Tb3+ binds first to sites I and II. Recent NMR studies have provided evidence that Ca2+, on the other hand, binds preferentially to sites III and IV. Kinetic studies using a stopped-flow apparatus also show that the preferential binding of Ca2+ and lanthanide ions is different. Upon rapid mixing of 2Ca-calmodulin with two Tb3+ ions, there was a small and rapid tyrosine fluorescence change, but no Tb3+ luminescence was observed, indicating that Tb3+ binds to sites I and II but not sites III and IV. When two Tb3+ ions are mixed with 2Dy-calmodulin, Tb3+ luminescence rises rapidly as Tb3+ binds to the empty sites III and IV, followed by a more gradual decrease (k = 0.4 s-1) as the ions redistribute themselves over the four sites. These results indicate that (i) both Tb3+ and Dy3+ prefer binding to sites I and II of calmodulin and (ii) the binding of Tb3+ to calmodulin is not impeded by the presence of two Ca2+ ions initially bound to the protein. Thus, the Ca2+ and lanthanide ions must exhibit opposite preferences for the four sites of calmodulin: sites III and IV are the high-affinity sites for Ca2+, whereas Tb3+ and Dy3+ prefer sites I and II.

  17. Evolution of allosteric citrate binding sites on 6-phosphofructo-1-kinase.

    Directory of Open Access Journals (Sweden)

    Aleksandra Usenik

    As an important part of metabolism, metabolic flux through the glycolytic pathway is tightly regulated. The most complex control is exerted on the 6-phosphofructo-1-kinase (PFK1) level; this control overrules the regulatory role of other allosteric enzymes. Among other effectors, citrate has been reported to play a vital role in the suppression of this enzyme's activity. In eukaryotes, amino acid residues forming the allosteric binding site for citrate are found on both the N- and the C-terminal regions of the enzyme. These sites have evolved from the phosphoenolpyruvate/ADP binding site of bacterial PFK1 due to the processes of duplication and tandem fusion of the prokaryotic ancestor gene followed by the divergence of the catalytic and effector binding sites. Stricter inhibition of the PFK1 enzyme was needed during the evolution of multi-cellular organisms, and the most stringent control of PFK1 by citrate occurs in vertebrates. By substituting a single amino acid (K557R or K617A) as a component of the allosteric binding site in the C-terminal region of human muscle type PFK-M with a residue found in the corresponding site of a fungal enzyme, the inhibitory effect of citrate was attenuated. Moreover, the proteins carrying these single mutations enabled growth of E. coli transformants encoding mutated human PFK-M in a glucose-containing medium that did not support the growth of E. coli transformed with native human PFK-M. Substitution of another residue at the citrate-binding site (D591V) of human PFK-M resulted in the complete loss of activity. Detailed analyses revealed that the mutated PFK-M subunits formed dimers but were unable to associate into the active tetrameric holoenzyme. These results suggest that stricter control over glycolytic flux developed in metazoans, whose somatic cells are largely characterized by slow proliferation.

  18. Prediction on the binding domain between human interleukin-6 and its receptor

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Based on the spatial conformations of human interleukin-6 (hIL-6) derived from nuclear magnetic resonance analysis and human interleukin-6 receptor (hIL-6R) modeled with homology modeling method using human growth hormone receptor as template, the interaction between hIL-6 and its receptor (hIL-6R) is studied with docking program according to the surface electrostatic potential analysis and spatial conformation complement. The stable region structure composed of hIL-6 and hIL-6R is obtained on the basis of molecular mechanism optimization and molecular dynamics simulation. The binding domain between hIL-6 and hIL-6R is predicted theoretically. Furthermore, the especial binding sites that influence the interaction between hIL-6 and hIL-6R are confirmed. The results lay a theoretical foundation for confirming the active regions of hIL-6 and designing novel antagonist with computer-guided techniques.

  19. Prediction on the binding domain between human interleukin-6 and its receptor

    Institute of Scientific and Technical Information of China (English)

    冯健男; 任蕴芳; 沈倍奋

    2000-01-01

    Based on the spatial conformations of human interleukin-6 (hIL-6) derived from nuclear magnetic resonance analysis and human interleukin-6 receptor (hIL-6R) modeled with homology modeling method using human growth hormone receptor as template, the interaction between hIL-6 and its receptor (hIL-6R) is studied with docking program according to the surface electrostatic potential analysis and spatial conformation complement. The stable region structure composed of hIL-6 and hIL-6R is obtained on the basis of molecular mechanism optimization and molecular dynamics simulation. The binding domain between hIL-6 and hIL-6R is predicted theoretically. Furthermore, the especial binding sites that influence the interaction between hIL-6 and hIL-6R are confirmed. The results lay a theoretical foundation for confirming the active regions of hIL-6 and designing novel antagonist with computer-guided techniques.

  20. Germline V-genes sculpt the binding site of a family of antibodies neutralizing human cytomegalovirus

    Energy Technology Data Exchange (ETDEWEB)

    Thomson, Christy A.; Bryson, Steve; McLean, Gary R.; Creagh, A. Louise; Pai, Emil F.; Schrader, John W. (Toronto); (UBC)

    2008-10-17

    Immunoglobulin genes are generated somatically through specialized mechanisms resulting in a vast repertoire of antigen-binding sites. Despite the stochastic nature of these processes, the V-genes that encode most of the antigen-combining site are under positive evolutionary selection, raising the possibility that V-genes have been selected to encode key structural features of binding sites of protective antibodies against certain pathogens. Human, neutralizing antibodies to human cytomegalovirus that bind the AD-2S1 epitope on its gB envelope protein repeatedly use a pair of well-conserved, germline V-genes IGHV3-30 and IGKV3-11. Here, we present crystallographic, kinetic and thermodynamic analyses of the binding site of such an antibody and that of its primary immunoglobulin ancestor. These show that these germline V-genes encode key side chain contacts with the viral antigen and thereby dictate key structural features of the hypermutated, high-affinity neutralizing antibody. V-genes may thus encode an innate, protective immunological memory that targets vulnerable, invariant sites on multiple pathogens.

  1. Annealing to sequences within the primer binding site loop promotes an HIV-1 RNA conformation favoring RNA dimerization and packaging

    OpenAIRE

    Seif, Elias; Niu, Meijuan; Kleiman, Lawrence

    2013-01-01

    Experiments are presented which suggest that the binding of the primer tRNA to the primer binding site of the HIV-1 5′ UTR is involved in the dimerization of the genome, as part of the packaging process.

  2. Comparative Analysis of Regulatory Motif Discovery Tools for Transcription Factor Binding Sites

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    In the post-genomic era, identification of specific regulatory motifs or transcription factor binding sites (TFBSs) in non-coding DNA sequences, which is essential to elucidate transcriptional regulatory networks, has emerged as an obstacle that frustrates many researchers. Consequently, numerous motif discovery tools and associated databases have been applied to this problem. However, these existing methods, based on different computational algorithms, show diverse motif prediction efficiency in non-coding DNA sequences. Therefore, understanding the similarities and differences of the computational algorithms and enriching the motif discovery literature are important for users who must choose the most appropriate of the tools available online. Moreover, credible criteria for assessing motif discovery tools, and guidance for researchers choosing the best tool for their own projects, are still lacking. Thus integration of the related resources might be a good approach to improve the accuracy of the application. Recent studies integrate regulatory motif discovery tools with experimental methods to offer a complementary approach for researchers, and also provide a much-needed model for current research on transcriptional regulatory networks. Here we present a comparative analysis of regulatory motif discovery tools for TFBSs.

  3. rVISTA 2.0: Evolutionary Analysis of Transcription Factor Binding Sites

    Energy Technology Data Exchange (ETDEWEB)

    Loots, G G; Ovcharenko, I

    2004-01-28

    Identifying and characterizing the patterns of DNA cis-regulatory modules represents a challenge that has the potential to reveal the regulatory language the genome uses to dictate transcriptional dynamics. Several studies have demonstrated that regulatory modules are under positive selection and therefore are often conserved between related species. Using this evolutionary principle we have created a comparative tool, rVISTA, for analyzing the regulatory potential of noncoding sequences. The rVISTA tool combines transcription factor binding site (TFBS) predictions, sequence comparisons and cluster analysis to identify noncoding DNA regions that are highly conserved and present in a specific configuration within an alignment. Here we present the newly developed version 2.0 of the rVISTA tool that can process alignments generated by both zPicture and PipMaker alignment programs or use pre-computed pairwise alignments of seven vertebrate genomes available from the ECR Browser. The rVISTA web server is closely interconnected with the TRANSFAC database, allowing users to either search for matrices present in the TRANSFAC library collection or search for user-defined consensus sequences. rVISTA tool is publicly available at http://rvista.dcode.org/.
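The search for user-defined consensus sequences mentioned above can be illustrated with a minimal sketch. This is not rVISTA's implementation: the function names, the example sequences and the simple conservation filter (a hit must start at the same column in two gap-aligned sequences) are all assumptions made for illustration. Hits that span alignment gaps are simply missed by this approach.

```python
import re

# IUPAC nucleotide codes mapped to regular-expression character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "B": "[CGT]", "D": "[AGT]",
    "H": "[ACT]", "V": "[ACG]", "N": "[ACGT]",
}


def find_consensus(sequence, consensus):
    """Return (start, matched_text) for every match, overlapping ones included."""
    pattern = "".join(IUPAC[ch] for ch in consensus.upper())
    # A lookahead lets the regex engine report overlapping matches.
    regex = re.compile("(?=(" + pattern + "))")
    return [(m.start(), m.group(1)) for m in regex.finditer(sequence.upper())]


def conserved_hits(aligned_a, aligned_b, consensus):
    """Alignment columns where both aligned sequences carry a consensus match.

    Hypothetical filter: both inputs are rows of one pairwise alignment,
    so equal indices refer to the same alignment column.
    """
    starts_a = {start for start, _ in find_consensus(aligned_a, consensus)}
    starts_b = {start for start, _ in find_consensus(aligned_b, consensus)}
    return sorted(starts_a & starts_b)
```

For example, `conserved_hits("AATTTCGCGCAA", "CCTTTGGCGCTT", "TTTSSCGC")` reports column 2, because both toy sequences match the consensus there. A matrix-based search would replace the regular expression with a position weight matrix score and a threshold.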

  4. Dual Binding Site and Selective Acetylcholinesterase Inhibitors Derived from Integrated Pharmacophore Models and Sequential Virtual Screening

    Directory of Open Access Journals (Sweden)

    Shikhar Gupta

    2014-01-01

    In this study, we employed an in silico methodology combining double pharmacophore-based screening, molecular docking, and ADME/T filtering to identify dual binding site acetylcholinesterase inhibitors that preferentially inhibit acetylcholinesterase while also inhibiting butyrylcholinesterase, but to a lesser extent. 3D pharmacophore models of AChE and BuChE enzyme inhibitors were developed from xanthostigmine derivatives with HypoGen and validated using a test set and Fischer’s randomization technique. The best acetylcholinesterase and butyrylcholinesterase inhibitor pharmacophore hypotheses, Hypo1_A and Hypo1_B, with high correlation coefficients of 0.96 and 0.94, respectively, were used as 3D queries for screening the ZINC database. The screened hits were then subjected to ADME/T and molecular docking studies to prioritise the compounds. Finally, 18 compounds were identified as potential leads against the AChE enzyme, showing good predicted activities and promising ADME/T properties.

  5. A specific binding site recognizing a fragment of angiotensin II in bovine adrenal cortex membranes.

    Science.gov (United States)

    Bernier, S G; Fournier, A; Guillemette, G

    1994-12-12

    We have characterized a specific binding site for angiotensin IV in bovine adrenal cortex membranes. Pseudo-equilibrium studies at 37 °C for 2 h have shown that this binding site recognizes angiotensin IV with a high affinity (Kd = 0.24 ± 0.03 nM). The binding site is saturable and relatively abundant (maximal binding capacity around 0.5 pmol/mg protein). Non-equilibrium kinetic analyses at 37 °C revealed a calculated kinetic Kd of 47 pM. The binding site is pharmacologically distinct from the classic angiotensin receptors AT1 or AT2. Competitive binding studies with bovine adrenal cortex membranes demonstrated the following rank order of effectiveness: angiotensin IV (Val-Tyr-Ile-His-Pro-Phe) = angiotensin II-(3-7) (Val-Tyr-Ile-His-Pro) > angiotensin III (Arg-Val-Tyr-Ile-His-Pro-Phe) ≥ angiotensin II-(4-7) (Tyr-Ile-His-Pro) > angiotensin II (Asp-Arg-Val-Tyr-Ile-His-Pro-Phe) > angiotensin II-(1-6) (Asp-Arg-Val-Tyr-Ile-His) > angiotensin II-(4-8) (Tyr-Ile-His-Pro-Phe) >>> angiotensin II-(3-6) (Val-Tyr-Ile-His), angiotensin II-(4-6) (Tyr-Ile-His), L-158,809 (5,7-dimethyl-2-ethyl-3-[(2'(1-H-tetrazol-5-yl)[1,1'-biphenyl]-4-yl)methyl]-3-H-imidazo[4,5-beta]pyridine H2O) and PD 123319 (1-[4-(dimethylamino)3-methylphenyl]methyl-5-(diphenylacetyl)4,5,6,7-tetrahydro-1H-imidazo[4,5-c]pyridine-6-carboxylic acid). The divalent cations Mg2+ and Ca2+ were shown to diminish the binding of 125I-angiotensin IV to bovine adrenal cortex membranes. (ABSTRACT TRUNCATED AT 250 WORDS)
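To make the reported saturation parameters concrete, the sketch below evaluates a simple one-site binding isotherm, B = Bmax · L / (Kd + L), with the Kd (0.24 nM) and approximate Bmax (0.5 pmol/mg protein) quoted in the abstract as defaults. The one-site model is an assumption made here for illustration (the abstract describes pseudo-equilibrium conditions), and the function name is hypothetical.

```python
def specific_binding(ligand_nM, kd_nM=0.24, bmax_pmol_per_mg=0.5):
    """Specific binding (pmol/mg protein) for a one-site model.

    B = Bmax * L / (Kd + L), with free ligand L and Kd both in nM.
    Defaults are the values quoted in the abstract; the model itself
    is an illustrative assumption.
    """
    if ligand_nM < 0:
        raise ValueError("ligand concentration must be non-negative")
    return bmax_pmol_per_mg * ligand_nM / (kd_nM + ligand_nM)


def fractional_occupancy(ligand_nM, kd_nM=0.24):
    """Fraction of sites occupied at a given free ligand concentration."""
    return specific_binding(ligand_nM, kd_nM, bmax_pmol_per_mg=1.0)
```

Under this model, half of the sites are occupied when the free ligand concentration equals Kd, so `fractional_occupancy(0.24)` is 0.5 and occupancy approaches 1 as the concentration rises well above Kd.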

  6. Identification of Calcium binding sites on calsequestrin 1 and its implications to polymerization

    Science.gov (United States)

    Kumar, Amit; Chakravarty, Harapriya; Bal, Naresh C.; Balaraju, Tuniki; Jena, Nivedita; Misra, Gauri; Bal, Chandralata; Pieroni, Enrico; Periasamy, Muthu; Sharon, Ashoke

    2013-01-01

    Biophysical studies have shown that each molecule of calsequestrin 1 (CASQ1) can bind about 70–80 Ca2+ ions. However, the nature of Ca2+-binding sites has not yet been fully characterized. In this study, we employed in-silico approaches to identify the Ca2+ binding sites and to understand the molecular basis of CASQ1-Ca2+ recognition. We built the protein model by extracting the atomic coordinates for the back-to-back dimeric unit from the recently solved hexameric CASQ1 structure (PDB id: 3UOM) and adding the missing C-terminal residues (aa350–364). Using this model we performed extensive 30 ns molecular dynamics simulations exposed to wide range of Ca2+ concentrations ([Ca2+]). Our results show that the Ca2+-binding sites on CASQ1 differ both in affinity and geometry. The high affinity Ca2+-binding sites share a similar geometry and interestingly, majority of them were found to be induced by increased [Ca2+]. We also found that the system undergoes maximal Ca2+-binding to the CAS (consecutive aspartate stretch at the C-terminus) before the rest of the CASQ1 surface becomes saturated. Simulated data shows that the CASQ1 back-to-back stacking is progressively stabilized by emergence of an increasing number of hydrophobic interactions with increasing [Ca2+]. Further, this study shows that the CAS domain assumes a compact structure with increase in Ca2+ binding, which suggests that the CAS domain might function as a Ca2+-sensor that may be a novel structural motif to sense metal. We propose the term “Dn-motif” for the CAS domain. PMID:23629537

  7. Localizing Carbohydrate Binding Sites in Proteins Using Hydrogen/Deuterium Exchange Mass Spectrometry

    Science.gov (United States)

    Zhang, Jingjing; Kitova, Elena N.; Li, Jun; Eugenio, Luiz; Ng, Kenneth; Klassen, John S.

    2016-01-01

    The application of hydrogen/deuterium exchange mass spectrometry (HDX-MS) to localize ligand binding sites in carbohydrate-binding proteins is described. Proteins from three bacterial toxins, the B subunit homopentamers of Cholera toxin and Shiga toxin type 1 and a fragment of Clostridium difficile toxin A, and their interactions with native carbohydrate receptors, GM1 pentasaccharides (β-Gal-(1→3)-β-GalNAc-(1→4)[α-Neu5Ac-(2→3)]-β-Gal-(1→4)-Glc), Pk trisaccharide (α-Gal-(1→4)-β-Gal-(1→4)-Glc) and CD-grease (α-Gal-(1→3)-β-Gal-(1→4)-β-GlcNAcO(CH2)8CO2CH3), respectively, served as model systems for this study. Comparison of the differences in deuterium uptake for peptic peptides produced in the absence and presence of ligand revealed regions of the proteins that are protected against deuterium exchange upon ligand binding. Notably, protected regions generally coincide with the carbohydrate binding sites identified by X-ray crystallography. However, ligand binding can also result in increased deuterium exchange in other parts of the protein, presumably through allosteric effects. Overall, the results of this study suggest that HDX-MS can serve as a useful tool for localizing the ligand binding sites in carbohydrate-binding proteins. However, a detailed interpretation of the changes in deuterium exchange upon ligand binding can be challenging because of the presence of ligand-induced changes in protein structure and dynamics.

  8. Localizing Carbohydrate Binding Sites in Proteins Using Hydrogen/Deuterium Exchange Mass Spectrometry.

    Science.gov (United States)

    Zhang, Jingjing; Kitova, Elena N; Li, Jun; Eugenio, Luiz; Ng, Kenneth; Klassen, John S

    2016-01-01

    The application of hydrogen/deuterium exchange mass spectrometry (HDX-MS) to localize ligand binding sites in carbohydrate-binding proteins is described. Proteins from three bacterial toxins, the B subunit homopentamers of Cholera toxin and Shiga toxin type 1 and a fragment of Clostridium difficile toxin A, and their interactions with native carbohydrate receptors, GM1 pentasaccharides (β-Gal-(1→3)-β-GalNAc-(1→4)[α-Neu5Ac-(2→3)]-β-Gal-(1→4)-Glc), Pk trisaccharide (α-Gal-(1→4)-β-Gal-(1→4)-Glc) and CD-grease (α-Gal-(1→3)-β-Gal-(1→4)-β-GlcNAcO(CH2)8CO2CH3), respectively, served as model systems for this study. Comparison of the differences in deuterium uptake for peptic peptides produced in the absence and presence of ligand revealed regions of the proteins that are protected against deuterium exchange upon ligand binding. Notably, protected regions generally coincide with the carbohydrate binding sites identified by X-ray crystallography. However, ligand binding can also result in increased deuterium exchange in other parts of the protein, presumably through allosteric effects. Overall, the results of this study suggest that HDX-MS can serve as a useful tool for localizing the ligand binding sites in carbohydrate-binding proteins. However, a detailed interpretation of the changes in deuterium exchange upon ligand binding can be challenging because of the presence of ligand-induced changes in protein structure and dynamics.

  9. Binding isotope effects as a tool for distinguishing hydrophobic and hydrophilic binding sites of HIV-1 RT.

    Science.gov (United States)

    Krzemińska, Agnieszka; Paneth, Piotr; Moliner, Vicent; Świderek, Katarzyna

    2015-01-22

    The current treatment for HIV-1 infected patients consists of a cocktail of inhibitors, in an attempt to improve the potency of the drugs by adding the possible effects of each supplied compound. In this contribution, nine different inhibitors of HIV-1 RT, one of the three key proteins responsible for the virus replication, have been selected to develop and test a computational protocol that gives deep insight into the inhibitors' binding mechanism. The interaction between the inhibitors and the protein has been quantified by computing binding free energies through FEP calculations, while a more detailed characterization of the kind of inhibitor-protein interactions is based on frequency analysis of the ligands in the initial and final states, i.e. in solution and bound to the protein. QM/MM calculations of heavy-atom (13C, 15N, and 18O) binding isotope effects (BIE) have been used to identify the binding sites of the different inhibitors. Specific interactions between the isotopically labeled atoms of the inhibitors and polar residues and magnesium cations in the hydrophilic pocket of the protein are responsible for the frequency shifts that can be detected when comparing the IR spectra of the compounds in solution and in the protein. In contrast, changes in vdW interactions from solution to the final state, when the ligand interacts with residues of the hydrophobic cavity, do not appear to influence the frequency modes, and so no BIE are observed. Our results suggest that a proper computational protocol can be a valuable tool, which in turn can be used to increase the efficiency of anti-AIDS drugs.

  10. A reexamination of information theory-based methods for DNA-binding site identification

    Directory of Open Access Journals (Sweden)

    O'Neill Michael C

    2009-02-01

    Background: Searching for transcription factor binding sites in genome sequences is still an open problem in bioinformatics. Despite substantial progress, search methods based on information theory remain a standard in the field, even though the full validity of their underlying assumptions has only been tested in artificial settings. Here we use newly available data on transcription factors from different bacterial genomes to make a more thorough assessment of information theory-based search methods. Results: Our results reveal that conventional benchmarking against artificial sequence data leads frequently to overestimation of search efficiency. In addition, we find that sequence information by itself is often inadequate and therefore must be complemented by other cues, such as curvature, in real genomes. Furthermore, results on skewed genomes show that methods integrating skew information, such as Relative Entropy, are not effective because their assumptions may not hold in real genomes. The evidence suggests that binding sites tend to evolve towards genomic skew, rather than against it, and to maintain their information content through increased conservation. Based on these results, we identify several misconceptions on information theory as applied to binding sites, such as negative entropy, and we propose a revised paradigm to explain the observed results. Conclusion: We conclude that, among information theory-based methods, the most unassuming search methods perform, on average, better than any other alternatives, since heuristic corrections to these methods are prone to fail when working on real data. A reexamination of information content in binding sites reveals that information content is a compound measure of search and binding affinity requirements, a fact that has important repercussions for our understanding of binding site evolution.

  11. A reexamination of information theory-based methods for DNA-binding site identification

    Science.gov (United States)

    Erill, Ivan; O'Neill, Michael C

    2009-01-01

    Background: Searching for transcription factor binding sites in genome sequences is still an open problem in bioinformatics. Despite substantial progress, search methods based on information theory remain a standard in the field, even though the full validity of their underlying assumptions has only been tested in artificial settings. Here we use newly available data on transcription factors from different bacterial genomes to make a more thorough assessment of information theory-based search methods. Results: Our results reveal that conventional benchmarking against artificial sequence data leads frequently to overestimation of search efficiency. In addition, we find that sequence information by itself is often inadequate and therefore must be complemented by other cues, such as curvature, in real genomes. Furthermore, results on skewed genomes show that methods integrating skew information, such as Relative Entropy, are not effective because their assumptions may not hold in real genomes. The evidence suggests that binding sites tend to evolve towards genomic skew, rather than against it, and to maintain their information content through increased conservation. Based on these results, we identify several misconceptions on information theory as applied to binding sites, such as negative entropy, and we propose a revised paradigm to explain the observed results. Conclusion: We conclude that, among information theory-based methods, the most unassuming search methods perform, on average, better than any other alternatives, since heuristic corrections to these methods are prone to fail when working on real data. A reexamination of information content in binding sites reveals that information content is a compound measure of search and binding affinity requirements, a fact that has important repercussions for our understanding of binding site evolution. PMID:19210776
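As a concrete illustration of the information content discussed above, the sketch below computes the per-position information (in bits) of a set of aligned DNA binding sites as log2(4) minus the Shannon entropy of each column. It assumes a uniform background and applies no small-sample correction, so it corresponds to the simplest, "unassuming" variant rather than to skew-aware measures such as Relative Entropy. The function names and the example sites are hypothetical.

```python
import math
from collections import Counter


def column_information(column, alphabet="ACGT"):
    """Information content (bits) of one alignment column.

    Computed as log2(|alphabet|) - H(column), assuming a uniform
    background distribution and no small-sample correction.
    """
    counts = Counter(column)
    total = len(column)
    entropy = 0.0
    for base in alphabet:
        p = counts.get(base, 0) / total
        if p > 0:
            entropy -= p * math.log2(p)
    return math.log2(len(alphabet)) - entropy


def site_information(sites):
    """Per-position information for equal-length aligned binding sites."""
    if len({len(s) for s in sites}) != 1:
        raise ValueError("all sites must have the same length")
    return [column_information("".join(col)) for col in zip(*sites)]


# Hypothetical aligned sites, for illustration only.
example_sites = ["TATAAT", "TATGAT", "TACAAT", "TATAAT"]
per_position = site_information(example_sites)
total_bits = sum(per_position)
```

A fully conserved column contributes the maximum of 2 bits, while a column with equal frequencies of all four bases contributes 0. Summing over positions gives the total information content of the motif.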

  12. Number of active transcription factor binding sites is essential for the Hes7 oscillator

    Directory of Open Access Journals (Sweden)

    de Angelis Martin

    2006-02-01

    Background: It is commonly accepted that embryonic segmentation of vertebrates is regulated by a segmentation clock, which is induced by the cycling genes Hes1 and Hes7. Their products form dimers that bind to the regulatory regions and thereby repress the transcription of their own encoding genes. An increase of the half-life of Hes7 protein causes irregular somite formation. This was shown in recent experiments by Hirata et al. In the same work, numerical simulations from a delay differential equations model, originally invented by Lewis, gave additional support. For a longer half-life of the Hes7 protein, these simulations exhibited strongly damped oscillations whose amplitudes were severely attenuated after a few periods. In these simulations, the Hill coefficient, a crucial model parameter, was set to 2, indicating that Hes7 has only one binding site in its promoter. On the other hand, Bessho et al. established three regulatory elements in the promoter region. Results: We show that, with the same half-life, the delay system is highly sensitive to changes in the Hill coefficient. A small increase changes the qualitative behaviour of the solutions drastically. There is sustained oscillation and hence the model can no longer explain the disruption of the segmentation clock. On the other hand, the Hill coefficient is correlated with the number of active binding sites, and with the way in which dimers bind to them. In this paper, we adopt response functions in order to estimate Hill coefficients for a variable number of active binding sites. It turns out that three active transcription factor binding sites increase the Hill coefficient by at least 20% as compared to one single active site. Conclusion: Our findings lead to the following crucial dichotomy: either Hirata's model is correct for the Hes7 oscillator, in which case at most two binding sites are active in its promoter region; or at least three binding sites are active, in which
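The role of the Hill coefficient in a delayed auto-repression loop can be explored with a minimal sketch. The code below integrates a generic two-variable delay model (mRNA and protein, with Hill-type repression of transcription by delayed protein) using a fixed-step Euler scheme. The equations are a common textbook form of delayed negative feedback and are assumed here for illustration; they are not claimed to reproduce the exact model or parameters of Lewis or Hirata et al. All parameter values and function names are placeholders.

```python
def hill_repression(protein, p0, n_hill):
    """Fraction of maximal transcription left at a given repressor level."""
    return 1.0 / (1.0 + (protein / p0) ** n_hill)


def simulate(n_hill, t_end=600.0, dt=0.05,
             a=4.5, b=0.23, c=0.23, k=33.0, p0=40.0,
             tau_m=12.0, tau_p=2.8):
    """Euler integration of a delayed auto-repression loop.

    dm/dt = k * hill(p(t - tau_m)) - c * m(t)
    dp/dt = a * m(t - tau_p)       - b * p(t)

    All rate constants and delays are illustrative placeholders.
    The history before t = 0 is taken to be zero.
    """
    steps = int(round(t_end / dt))
    lag_m = int(round(tau_m / dt))
    lag_p = int(round(tau_p / dt))
    m = [0.0] * (steps + 1)
    p = [0.0] * (steps + 1)
    for i in range(steps):
        p_delayed = p[i - lag_m] if i >= lag_m else 0.0
        m_delayed = m[i - lag_p] if i >= lag_p else 0.0
        dm = k * hill_repression(p_delayed, p0, n_hill) - c * m[i]
        dp = a * m_delayed - b * p[i]
        m[i + 1] = m[i] + dt * dm
        p[i + 1] = p[i] + dt * dp
    return m, p


def late_amplitude(series, fraction=0.25):
    """Peak-to-trough range over the final part of a trajectory."""
    tail = series[int(len(series) * (1.0 - fraction)):]
    return max(tail) - min(tail)
```

Comparing `late_amplitude(simulate(2.0)[1])` with the value obtained for a slightly larger `n_hill` is one way to probe, for a chosen parameter set, whether oscillations decay or persist. A dedicated delay-equation solver would be preferable to fixed-step Euler for quantitative work.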

  13. Interaction of Bacillus thuringiensis Cry1 and Vip3A proteins with Spodoptera frugiperda midgut binding sites

    OpenAIRE

    Sena, J.A.D. [UNESP; Hernández Rodríguez, Carmen Sara; Ferré Manzanero, Juan

    2009-01-01

    Vip3Aa, Vip3Af, Cry1Ab, and Cry1Fa were tested for their toxicities and binding interactions. Vip3A proteins were more toxic than Cry1 proteins. Binding assays showed independent specific binding sites for Cry1 and Vip3A proteins. Cry1Ab and Cry1Fa competed for the same binding sites, whereas Vip3Aa competed for those of Vip3Af.

  14. Interaction of Bacillus thuringiensis Cry1 and Vip3A proteins with Spodoptera frugiperda midgut binding sites.

    Science.gov (United States)

    Sena, Janete A D; Hernández-Rodríguez, Carmen Sara; Ferré, Juan

    2009-04-01

    Vip3Aa, Vip3Af, Cry1Ab, and Cry1Fa were tested for their toxicities and binding interactions. Vip3A proteins were more toxic than Cry1 proteins. Binding assays showed independent specific binding sites for Cry1 and Vip3A proteins. Cry1Ab and Cry1Fa competed for the same binding sites, whereas Vip3Aa competed for those of Vip3Af.

  15. Thermodynamics of Calcium binding to the Calmodulin N-terminal domain to evaluate site-specific affinity constants and cooperativity.

    Science.gov (United States)

    Beccia, Maria Rosa; Sauge-Merle, Sandrine; Lemaire, David; Brémond, Nicolas; Pardoux, Romain; Blangy, Stéphanie; Guilbaud, Philippe; Berthomieu, Catherine

    2015-07-01

    Calmodulin (CaM) is an essential Ca(II)-dependent regulator of cell physiology. To understand its interaction with Ca(II) at a molecular level, it is essential to examine Ca(II) binding at each site of the protein, even if it is challenging to estimate the site-specific binding properties of the interdependent CaM-binding sites. In this study, we evaluated the site-specific Ca(II)-binding affinity of sites I and II of the N-terminal domain by combining site-directed mutagenesis and spectrofluorimetry. The mutations had very low impact on the protein structure and stability. We used these binding constants to evaluate the inter-site cooperativity energy and compared it with its lower limit value usually reported in the literature. We found that site I affinity for Ca(II) was 1.5 times that of site II and that cooperativity induced an approximately tenfold higher affinity for the second Ca(II)-binding event, as compared to the first one. We further showed that insertion of a tryptophan at position 7 of site II binding loop significantly increased site II affinity for Ca(II) and the intra-domain cooperativity. ΔH and ΔS parameters were studied by isothermal titration calorimetry for Ca(II) binding to site I, site II and to the entire N-terminal domain. They showed that calcium binding is mainly entropy driven for the first and second binding events. These findings provide molecular information on the structure-affinity relationship of the individual sites of the CaM N-terminal domain and new perspectives for the optimization of metal ion binding by mutating the EF-hand loops sequences.
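The reported tenfold increase in affinity for the second Ca(II)-binding event can be converted into a cooperativity free energy with the standard relation ΔG = −RT ln(K2/K1). The sketch below does this conversion. The temperature of 298.15 K is an assumption (the abstract does not state it), statistical factors for the two sites are ignored, and the function name is hypothetical.

```python
import math

GAS_CONSTANT = 8.314462618  # J/(mol*K)


def cooperativity_energy_kJ(affinity_ratio, temperature_K=298.15):
    """Free energy (kJ/mol) corresponding to an affinity ratio K2/K1.

    Uses dG = -R * T * ln(K2 / K1). A negative value means the second
    binding event is favoured. The default temperature is an assumption.
    """
    if affinity_ratio <= 0:
        raise ValueError("affinity ratio must be positive")
    return -GAS_CONSTANT * temperature_K * math.log(affinity_ratio) / 1000.0


# Approximately tenfold higher affinity for the second binding event.
coop = cooperativity_energy_kJ(10.0)
```

At the assumed temperature, a tenfold ratio corresponds to roughly −5.7 kJ/mol, and a ratio of 1 (no cooperativity) gives zero.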

  16. Recognition of AT-Rich DNA Binding Sites by the MogR Repressor

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Aimee; Higgins, Darren E.; Panne, Daniel; (Harvard-Med); (EMBL)

    2009-07-22

    The MogR transcriptional repressor of the intracellular pathogen Listeria monocytogenes recognizes AT-rich binding sites in promoters of flagellar genes to downregulate flagellar gene expression during infection. We describe here the 1.8 Å resolution crystal structure of MogR bound to the recognition sequence 5' ATTTTTTAAAAAAAT 3' present within the flaA promoter region. Our structure shows that MogR binds as a dimer. Each half-site is recognized in the major groove by a helix-turn-helix motif and in the minor groove by a loop from the symmetry-related molecule, resulting in a 'crossover' binding mode. This oversampling through minor groove interactions is important for specificity. The MogR binding site has structural features of A-tract DNA and is bent by approximately 52 degrees away from the dimer. The structure explains how MogR achieves binding specificity in the AT-rich genome of L. monocytogenes and explains the evolutionary conservation of A-tract sequence elements within promoter regions of MogR-regulated flagellar genes.

  17. Mapping cocaine binding sites in human and baboon brain in vivo.

    Science.gov (United States)

    Fowler, J S; Volkow, N D; Wolf, A P; Dewey, S L; Schlyer, D J; Macgregor, R R; Hitzemann, R; Logan, J; Bendriem, B; Gatley, S J

    1989-01-01

    The first direct measurements of cocaine binding in the brain of normal human volunteers and baboons have been made by using positron emission tomography (PET) and tracer doses of [N-11C-methyl]-(-)-cocaine ([11C]cocaine). Cocaine's binding and release from brain are rapid with the highest regional uptake of carbon-11 occurring in the corpus striatum at 4-10 minutes after intravenous injection of labeled cocaine. This was followed by a clearance to half the peak value at about 25 minutes with the overall time course paralleling the previously documented time course of the euphoria experienced after intravenous cocaine administration. Blockade of the dopamine reuptake sites with nomifensine reduced the striatal but not the cerebellar uptake of [11C]cocaine in baboons indicating that cocaine binding is associated with the dopamine reuptake site in the corpus striatum. A comparison of labeled metabolites of cocaine in human and baboon plasma showed that while cocaine is rapidly metabolized in both species, the profile of labeled metabolites is different, with baboon plasma containing significant amounts of labeled carbon dioxide, and human plasma containing no significant labeled carbon dioxide. These studies demonstrate the feasibility of using [11C]cocaine and PET to map binding sites for cocaine in human brain, to monitor its kinetics, and to characterize its binding mechanism by using appropriate pharmacological challenges.

  18. Global identification of hnRNP A1 binding sites for SSO-based splicing modulation

    DEFF Research Database (Denmark)

    Bruun, Gitte H; Doktor, Thomas K; Borch-Jensen, Jonas;

    2016-01-01

    for this deregulation by blocking other SREs with splice-switching oligonucleotides (SSOs). However, the location and sequence of most SREs are not well known. RESULTS: Here, we used individual-nucleotide resolution crosslinking immunoprecipitation (iCLIP) to establish an in vivo binding map for the key splicing...... regulatory factor hnRNP A1 and to generate an hnRNP A1 consensus binding motif. We find that hnRNP A1 binding in proximal introns may be important for repressing exons. We show that inclusion of the alternative cassette exon 3 in SKA2 can be significantly increased by SSO-based treatment which blocks an iCLIP......-identified hnRNP A1 binding site immediately downstream of the 5' splice site. Because pseudoexons are well suited as models for constitutive exons which have been inactivated by pathogenic mutations in SREs, we used a pseudoexon in MTRR as a model and showed that an iCLIP-identified hnRNP A1 binding site...

  19. Characterization of two heparan sulphate-binding sites in the mycobacterial adhesin Hlp

    Directory of Open Access Journals (Sweden)

    Previato Jose O

    2008-05-01

    Background: The histone-like Hlp protein is emerging as a key component in mycobacterial pathogenesis, being involved in the initial events of host colonization by interacting with laminin and glycosaminoglycans (GAGs). In the present study, nuclear magnetic resonance (NMR) was used to map the binding site(s) of Hlp to heparan sulfate and identify the nature of the amino acid residues directly involved in this interaction. Results: The capacity of a panel of 30-mer synthetic peptides covering the full length of Hlp to bind to heparin/heparan sulfate was analyzed by solid phase assays, NMR, and affinity chromatography. An additional active region between the residues Gly46 and Ala60 was defined at the N-terminal domain of Hlp, expanding the previously defined heparin-binding site between Thr31 and Phe50. Additionally, the C-terminus, rich in Lys residues, was confirmed as another heparan sulfate binding region. The amino acids in Hlp identified as mediators in the interaction with heparan sulfate were Arg, Val, Ile, Lys, Phe, and Thr. Conclusion: Our data indicate that Hlp interacts with heparan sulfate through two distinct regions of the protein. Both heparan sulfate-binding regions defined here are preserved in all mycobacterial Hlp homologues that have been sequenced, suggesting important but possibly divergent roles for this surface-exposed protein in both pathogenic and saprophytic species.

  20. Quantitative distribution of angiotensin II binding sites in rat brain by autoradiography

    Energy Technology Data Exchange (ETDEWEB)

    Saavedra, J.M.; Israel, A.; Plunkett, L.M.; Kurihara, M.; Shigematsu, K.; Correa, F.M.

    1986-07-01

    Angiotensin II binding sites were localized and quantified in individual brain nuclei from single rats by incubation of tissue sections with 1 nM 125I-(Sar1)-angiotensin II, (3H)-Ultrofilm autoradiography, computerized microdensitometry and comparison with 125I-standards. High angiotensin II binding was present in the circumventricular organs (organon vasculosum laminae terminalis, organon subfornicalis and area postrema), in selected hypothalamic nuclei (nuclei suprachiasmatis, periventricularis and paraventricularis) and in the nucleus tractus olfactorii lateralis, the nucleus preopticus medianus, the dorsal motor nucleus of the vagus and the nucleus tractus solitarii. High affinity (KA from 0.3 to 1.5 × 10^9 M^-1) angiotensin II binding sites were demonstrated in the organon subfornicalis, the nucleus tractus solitarii and the area postrema after incubation of consecutive sections from single rat brains with 125I-(Sar1)-angiotensin II in concentrations from 100 pM to 5 nM. These results demonstrate and characterize brain binding sites for angiotensin II of variable high affinity binding both inside and outside the blood-brain barrier.

  1. The Human p73 Promoter: Characterization and Identification of Functional E2F Binding Sites

    Directory of Open Access Journals (Sweden)

    Ratnam S. Seelan

    2002-01-01

    p73, a member of the p53 family, is overexpressed in many cancers. To understand the mechanism(s) underlying this overexpression, we have undertaken a detailed characterization of the human p73 promoter. The promoter is strongly activated in cells expressing exogenous E2F1 and suppressed by exogenous Rb. At least three functional E2F binding sites, located immediately upstream of exon 1 (at -284, -155 and -132), mediate this induction. 5' serially deleted promoter constructs and constructs harboring mutated E2F sites were analyzed for their response to exogenously expressed E2F1 or Rb to establish functionality of these sites. Authenticity of E2F sites was further confirmed by electrophoretic mobility shift assay (EMSA) using E2F1/DP1 heterodimers synthesized in vitro, followed by competition assays with unlabeled wild-type or mutant oligonucleotides and supershift analysis using anti-E2F1 antibodies. In vivo binding of E2F1 to the p73 promoter was demonstrated using nuclear extracts prepared from E2F1-inducible Saos2 cells. The region conferring the highest promoter activity was found to reside between -113 and -217 of the p73 gene. Two of the three functional E2F sites (at -155 and -132) reside within this region. Our results suggest that regulation of p73 expression is primarily mediated through binding of E2F1 to target sites at -155 and -132.

  2. Recognition of anesthetic barbiturates by a protein binding site: a high resolution structural analysis.

    Directory of Open Access Journals (Sweden)

    Simon Oakley

    Barbiturates potentiate GABA actions at the GABA(A) receptor and act as central nervous system depressants that can induce effects ranging from sedation to general anesthesia. No structural information has been available about how barbiturates are recognized by their protein targets. For this reason, we tested whether these drugs were able to bind specifically to horse spleen apoferritin, a model protein that has previously been shown to bind many anesthetic agents with affinities that are closely correlated with anesthetic potency. Thiopental, pentobarbital, and phenobarbital were all found to bind to apoferritin with affinities ranging from 10-500 µM, approximately matching the concentrations required to produce anesthetic and GABAergic responses. X-ray crystal structures were determined for the complexes of apoferritin with thiopental and pentobarbital at resolutions of 1.9 and 2.0 Å, respectively. These structures reveal that the barbiturates bind to a cavity in the apoferritin shell that also binds haloalkanes, halogenated ethers, and propofol. Unlike these other general anesthetics, however, which rely entirely upon van der Waals interactions and the hydrophobic effect for recognition, the barbiturates are recognized in the apoferritin site using a mixture of both polar and nonpolar interactions. These results suggest that any protein binding site that is able to recognize and respond to the chemically and structurally diverse set of compounds used as general anesthetics is likely to include a versatile mixture of both polar and hydrophobic elements.

  3. Novel Prostate Specific Antigen plastic antibody designed with charged binding sites for an improved protein binding and its application in a biosensor of potentiometric transduction

    OpenAIRE

    Rebelo, Tânia S. C. R.; Santos, C.; Costa-Rodrigues, J.; Fernandes, M. H.; Noronha, João P. C.; Sales, M. Goreti F.

    2014-01-01

    This work shows that the synthesis of protein plastic antibodies tailored with selected charged monomers around the binding site enhances protein binding. These charged receptor sites are placed over a neutral polymeric matrix, thus inducing a suitable orientation of the protein for reception at its site. This is confirmed by preparing control materials with neutral monomers and also with non-imprinted template. This concept has been applied here to Prostate Specific Antigen (PSA), the protein of choice...

  4. Auto-FACE: an NMR based binding site mapping program for fast chemical exchange protein-ligand systems.

    Directory of Open Access Journals (Sweden)

    Janarthanan Krishnamoorthy

    BACKGROUND: Nuclear Magnetic Resonance (NMR) spectroscopy offers a variety of experiments to study protein-ligand interactions at atomic resolution. Among these experiments, the 15N Heteronuclear Single Quantum Correlation (HSQC) experiment is simple, less time consuming and highly informative in mapping the binding site of the ligand. The interpretation of 15N HSQC becomes ambiguous when the chemical shift perturbations are caused by non-specific interactions like allosteric changes and local structural rearrangement. In such cases, detailed chemical exchange analysis based on chemical shift perturbation will assist in locating the binding site accurately. METHODOLOGY/PRINCIPAL FINDINGS: We have automated the mapping of binding sites for fast chemical exchange systems using information obtained from 15N HSQC spectra of protein serially titrated with ligand of increasing concentrations. The automated program Auto-FACE (Auto-FAst Chemical Exchange analyzer) determines the parameters, e.g. rate of change of perturbation, binding equilibrium constant and magnitude of chemical shift perturbation, to map the binding site residues. Interestingly, the rate of change of perturbation at lower ligand concentration is highly sensitive in differentiating the binding site residues from the non-binding site residues. To validate this program, the interaction between the protein hBcl(XL) and the ligand BH3I-1 was studied. Residues in the hydrophobic BH3 binding groove of hBcl(XL) were easily identified to be crucial for interaction with BH3I-1 from other residues that also exhibited perturbation. The geometrically averaged equilibrium constant (3.0 × 10^4) calculated for the residues present at the identified binding site is consistent with the values obtained by other techniques like isothermal calorimetry and fluorescence polarization assays (12.8 × 10^4). Adjacent to the primary site, an additional binding site was identified which had an affinity of 3.8 times weaker
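The titration analysis described in this record can be illustrated with a minimal Python sketch. It is not the Auto-FACE implementation: it assumes a simple 1:1 binding model in fast exchange, where the observed chemical shift perturbation is the bound fraction times a maximal perturbation, and it assumes per-residue constants are combined with a geometric mean as the abstract states. The function names, parameter names and example values are hypothetical.

```python
import math


def fraction_bound(p_total, l_total, kd):
    """Bound fraction of protein for 1:1 binding (exact quadratic solution).

    p_total, l_total and kd must share the same concentration unit.
    """
    b = p_total + l_total + kd
    complex_conc = (b - math.sqrt(b * b - 4.0 * p_total * l_total)) / 2.0
    return complex_conc / p_total


def observed_shift(p_total, l_total, kd, delta_max):
    """Population-weighted shift perturbation under fast exchange (assumed model)."""
    return delta_max * fraction_bound(p_total, l_total, kd)


def geometric_mean(values):
    """Geometric mean of positive per-residue equilibrium constants."""
    return math.exp(sum(math.log(v) for v in values) / len(values))


if __name__ == "__main__":
    # Hypothetical titration: 0.1 mM protein, Kd = 30 uM, delta_max = 0.2 ppm.
    for ligand in (0.0, 5e-5, 1e-4, 5e-4):
        print(ligand, observed_shift(1e-4, ligand, 3e-5, 0.2))
```

A geometric rather than arithmetic mean is the natural choice when constants span orders of magnitude, because it averages the corresponding free energies (which scale with the logarithm of the constant).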

  5. Use of (113)Cd NMR to probe the native metal binding sites in metalloproteins: an overview.

    Science.gov (United States)

    Armitage, Ian M; Drakenberg, Torbjörn; Reilly, Brian

    2013-01-01

    Our laboratories have actively published in this area for several years and the objective of this chapter is to present as comprehensive an overview as possible. Following a brief review of the basic principles associated with (113)Cd NMR methods, we will present the results from a thorough literature search for (113)Cd chemical shifts from metalloproteins. The updated (113)Cd chemical shift figure in this chapter will further illustrate the excellent correlation of the (113)Cd chemical shift with the nature of the coordinating ligands (N, O, S) and coordination number/geometry, reaffirming how this method can be used not only to identify the nature of the protein ligands in uncharacterized cases but also the dynamics at the metal binding site. Specific examples will be drawn from studies on alkaline phosphatase, Ca(2+) binding proteins, and metallothioneins. In the case of Escherichia coli alkaline phosphatase, a dimeric zinc metalloenzyme where a total of six metal ions (three per monomer) are involved directly or indirectly in providing the enzyme with maximal catalytic activity and structural stability, (113)Cd NMR, in conjunction with (13)C and (31)P NMR methods, was instrumental in separating out the function of each class of metal binding sites. Perhaps most importantly, these studies revealed the chemical basis for negative cooperativity that had been reported for this enzyme under metal deficient conditions. Also noteworthy was the fact that these NMR studies preceded the availability of the X-ray crystal structure. In the case of the calcium binding proteins, we will focus on two proteins: calbindin D(9k) and calmodulin. For calbindin D(9k) and its mutants, (113)Cd NMR has been useful both to follow actual changes in the metal binding sites and the cooperativity in the metal binding. Ligand binding to calmodulin has been studied extensively with (113)Cd NMR showing that the metal binding sites are not directly involved in the ligand binding. The (113)Cd

  6. Discovery and Characterization of a Cell-Permeable, Small-Molecule c-Abl Kinase Activator that Binds to the Myristoyl Binding Site

    Energy Technology Data Exchange (ETDEWEB)

    Yang, Jingsong; Campobasso, Nino; Biju, Mangatt P.; Fisher, Kelly; Pan, Xiao-Qing; Cottom, Josh; Galbraith, Sarah; Ho, Thau; Zhang, Hong; Hong, Xuan; Ward, Paris; Hofmann, Glenn; Siegfried, Brett; Zappacosta, Francesca; Washio, Yoshiaki; Cao, Ping; Qu, Junya; Bertrand, Sophie; Wang, Da-Yuan; Head, Martha S.; Li, Hu; Moores, Sheri; Lai, Zhihong; Johanson, Kyung; Burton, George; Erickson-Miller, Connie; Simpson, Graham; Tummino, Peter; Copeland, Robert A.; Oliff, Allen (GSKPA)

    2014-10-02

    c-Abl kinase activity is regulated by a unique mechanism involving the formation of an autoinhibited conformation in which the N-terminal myristoyl group binds intramolecularly to the myristoyl binding site on the kinase domain and induces the bending of the αI helix that creates a docking surface for the SH2 domain. Here, we report a small-molecule c-Abl activator, DPH, that displays potent enzymatic and cellular activity in stimulating c-Abl activation. Structural analyses indicate that DPH binds to the myristoyl binding site and prevents the formation of the bent conformation of the αI helix through steric hindrance, a mode of action distinct from the previously identified allosteric c-Abl inhibitor, GNF-2, that also binds to the myristoyl binding site. DPH represents the first cell-permeable, small-molecule tool compound for c-Abl activation.

  7. Multiple ETS family proteins regulate PF4 gene expression by binding to the same ETS binding site.

    Directory of Open Access Journals (Sweden)

    Yoshiaki Okada

    In previous studies on the mechanism underlying megakaryocyte-specific gene expression, several ETS motifs were found in each megakaryocyte-specific gene promoter. Although these studies suggested that several ETS family proteins regulate megakaryocyte-specific gene expression, only a few ETS family proteins have been identified. Platelet factor 4 (PF4) is a megakaryocyte-specific gene and its promoter includes multiple ETS motifs. We had previously shown that ETS-1 binds to an ETS motif in the PF4 promoter. However, the functions of the other ETS motifs are still unclear. The goal of this study was to investigate a novel functional ETS motif in the PF4 promoter and identify proteins binding to the motif. In electrophoretic mobility shift assays and a chromatin immunoprecipitation assay, FLI-1, ELF-1, and GABP bound to the -51 ETS site. Expression of FLI-1, ELF-1, and GABP activated the PF4 promoter in HepG2 cells. Mutation of the -51 ETS site attenuated FLI-1-, ELF-1-, and GABP-mediated transactivation of the promoter. siRNA analysis demonstrated that FLI-1, ELF-1, and GABP regulate PF4 gene expression in HEL cells. Among these three proteins, only FLI-1 synergistically activated the promoter with GATA-1. In addition, only FLI-1 expression was increased during megakaryocytic differentiation. Finally, the importance of the -51 ETS site for the activation of the PF4 promoter during physiological megakaryocytic differentiation was confirmed by a novel reporter gene assay using an in vitro ES cell differentiation system. Together, these data suggest that FLI-1, ELF-1, and GABP regulate PF4 gene expression through the -51 ETS site in megakaryocytes and implicate the differentiation stage-specific regulation of PF4 gene expression by multiple ETS factors.

  8. Covalent binding of the organophosphorus agent FP-biotin to tyrosine in eight proteins that have no active site serine

    OpenAIRE

    Grigoryan, Hasmik; Li, Bin; Anderson, Erica K.; Xue, Weihua; Nachon, Florian; Lockridge, Oksana; Schopfer, Lawrence M.

    2009-01-01

    Organophosphorus esters (OP) are known to bind covalently to the active site serine of enzymes in the serine hydrolase family. It was a surprise to find that proteins with no active site serine are also covalently modified by OP. The binding site in albumin, transferrin, and tubulin was identified as tyrosine. The goal of the present work was to determine whether binding to tyrosine is a general phenomenon. Fourteen proteins were treated with a biotin-tagged organophosphorus agent called FP-b...

  9. Prediction of allosteric sites and mediating interactions through bond-to-bond propensities

    Science.gov (United States)

    Amor, B. R. C.; Schaub, M. T.; Yaliraki, S. N.; Barahona, M.

    2016-08-01

    Allostery is a fundamental mechanism of biological regulation, in which binding of a molecule at a distant location affects the active site of a protein. Allosteric sites provide targets to fine-tune protein activity, yet we lack computational methodologies to predict them. Here we present an efficient graph-theoretical framework to reveal allosteric interactions (atoms and communication pathways strongly coupled to the active site) without a priori information of their location. Using an atomistic graph with energy-weighted covalent and weak bonds, we define a bond-to-bond propensity quantifying the non-local effect of instantaneous bond fluctuations propagating through the protein. Significant interactions are then identified using quantile regression. We exemplify our method with three biologically important proteins: caspase-1, CheY, and h-Ras, correctly predicting key allosteric interactions, whose significance is additionally confirmed against a reference set of 100 proteins. The almost-linear scaling of our method renders it suitable for high-throughput searches for candidate allosteric sites.

  10. SPEER-SERVER: a web server for prediction of protein specificity determining sites

    Science.gov (United States)

    Chakraborty, Abhijit; Mandloi, Sapan; Lanczycki, Christopher J.; Panchenko, Anna R.; Chakrabarti, Saikat

    2012-01-01

    Sites that show specific conservation patterns within subsets of proteins in a protein family are likely to be involved in the development of functional specificity. These sites, generally termed specificity determining sites (SDS), might play a crucial role in binding to a specific substrate or proteins. Identification of SDS through experimental techniques is a slow, difficult and tedious job. Hence, it is very important to develop efficient computational methods that can more expediently identify SDS. Herein, we present Specificity prediction using amino acids’ Properties, Entropy and Evolution Rate (SPEER)-SERVER, a web server that predicts SDS by analyzing quantitative measures of the conservation patterns of protein sites based on their physico-chemical properties and the heterogeneity of evolutionary changes between and within the protein subfamilies. This web server provides an improved representation of results, adds useful input and output options and integrates a wide range of analysis and data visualization tools when compared with the original standalone version of the SPEER algorithm. Extensive benchmarking finds that SPEER-SERVER exhibits sensitivity and precision performance that, on average, meets or exceeds that of other currently available methods. SPEER-SERVER is available at http://www.hpppi.iicb.res.in/ss/. PMID:22689646
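The idea of a site that is conserved within subfamilies yet differs between them can be made concrete with a small Python sketch. This is not the SPEER algorithm, which also uses physico-chemical properties and evolutionary rates; it only assumes a simple score, the Shannon entropy of an alignment column pooled over all sequences minus the mean entropy within each subfamily. The function names and the toy alignment are hypothetical.

```python
import math
from collections import Counter


def column_entropy(residues):
    """Shannon entropy (bits) of the residue distribution in one alignment column."""
    counts = Counter(residues)
    total = len(residues)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())


def specificity_score(subfamilies, column):
    """Pooled-column entropy minus the unweighted mean within-subfamily entropy.

    subfamilies: list of subfamilies, each a list of equal-length aligned sequences.
    A high score means the column varies between subfamilies but not within them.
    """
    pooled = [seq[column] for fam in subfamilies for seq in fam]
    within = [column_entropy([seq[column] for seq in fam]) for fam in subfamilies]
    return column_entropy(pooled) - sum(within) / len(within)


if __name__ == "__main__":
    # Toy alignment: column 1 is K in one subfamily and R in the other.
    families = [["AKL", "AKL"], ["ARL", "ARL"]]
    for col in range(3):
        print(col, specificity_score(families, col))
```

In the toy alignment only the middle column scores above zero, since it is the only position that separates the two subfamilies while staying invariant inside each.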

  11. The role of DNA binding sites and slow unbinding kinetics in titration-based oscillators

    CERN Document Server

    Karapetyan, Sargis

    2015-01-01

    Genetic oscillators, such as circadian clocks, are constantly perturbed by molecular noise arising from the small number of molecules involved in gene regulation. One of the strongest sources of stochasticity is the binary noise that arises from the binding of a regulatory protein to a promoter in the chromosomal DNA. In this study, we focus on two minimal oscillators based on activator titration and repressor titration to understand the key parameters that are important for oscillations and for overcoming binary noise. We show that the rate of unbinding from the DNA, despite traditionally being considered a fast parameter, needs to be slow to broaden the space of oscillatory solutions. The addition of multiple, independent DNA binding sites further expands the oscillatory parameter space for the repressor-titration oscillator and lengthens the period of both oscillators. This effect is a combination of increased effective delay of the unbinding kinetics due to multiple binding sites and increased promoter ul...

  12. Replication and pathogenicity of primer binding site mutants of SL3-3 murine leukemia viruses

    DEFF Research Database (Denmark)

    Lund, Anders Henrik; Schmidt, J; Luz, A;

    1999-01-01

    Retroviral reverse transcription is primed by a cellular tRNA molecule annealed to an 18-bp primer binding site sequence. The sequence of the primer binding site coincides with that of a negatively acting cis element that mediates transcriptional silencing of murine leukemia virus (MLV) in undifferentiated embryonic cells. In this study we test whether SL3-3 MLV can replicate stably using tRNA primers other than the cognate tRNAPro and analyze the effect of altering the primer binding site sequence to match the 3' end of tRNA1Gln, tRNA3Lys, or tRNA1,2Arg in a mouse pathogenicity model. Contrary to findings from cell culture studies of primer binding site-modified human immunodeficiency virus type 1 and avian retroviruses, our findings were that SL3-3 MLV may stably and efficiently replicate with tRNA primers other than tRNAPro. Although lymphoma induction of the SL3-3 Lys3 mutant was significantly...

  13. Control of ion selectivity in LeuT: two Na+ binding sites with two different mechanisms.

    Science.gov (United States)

    Noskov, Sergei Y; Roux, Benoît

    2008-03-28

    The x-ray structure of LeuT, a bacterial homologue of Na(+)/Cl(-)-dependent neurotransmitter transporters, provides a great opportunity to better understand the molecular basis of monovalent cation selectivity in ion-coupled transporters. LeuT possesses two ion binding sites, NA1 and NA2, which are highly selective for Na(+). Extensive all-atom free-energy molecular dynamics simulations of LeuT embedded in an explicit membrane are performed at different temperatures and various occupancy states of the binding sites to dissect the molecular mechanism of ion selectivity. The results show that the two binding sites display robust selectivity for Na(+) over K(+) or Li(+), the competing ions of most similar radii. Of particular interest, the mechanism primarily responsible for selectivity for each of the two binding sites appears to be different. In NA1, selectivity for Na(+) over K(+) arises predominantly from the strong electrostatic field arising from the negatively charged carboxylate group of the leucine substrate coordinating the ion directly. In NA2, which comprises only neutral ligands, selectivity for Na(+) is enforced by the local structural restraints arising from the hydrogen-bonding network and the covalent connectivity of the polypeptide chain surrounding the ion according to a "snug-fit" mechanism.

  14. Localization of the substrate binding site in the homodimeric mannitol transporter, EIImtl, of Escherichia coli

    NARCIS (Netherlands)

    Opacic, Milena; Vos, Erwin P. P.; Hesp, Ben H.; Broos, Jaap

    2010-01-01

    The mannitol transporter from Escherichia coli, EIImtl, belongs to a class of membrane proteins coupling the transport of substrates with their chemical modification. EIImtl is functional as a homodimer, and it harbors one high affinity mannitol-binding site in the membrane-embedded C domain (IICmtl

  15. Studies on ATP-diphosphohydrolase nucleotide-binding sites by intrinsic fluorescence

    Directory of Open Access Journals (Sweden)

    A.M. Kettlun

    2000-07-01

    Potato apyrase, a soluble ATP-diphosphohydrolase, was purified to homogeneity from several clonal varieties of Solanum tuberosum. Depending on the source of the enzyme, differences in kinetic and physicochemical properties have been described, which cannot be explained by the amino acid residues present in the active site. In order to understand the different kinetic behavior of the Pimpernel (ATPase/ADPase = 10) and Desirée (ATPase/ADPase = 1) isoenzymes, the nucleotide-binding site of these apyrases was explored using the intrinsic fluorescence of tryptophan. The intrinsic fluorescence of the two apyrases was slightly different. The maximum emission wavelengths of the Desirée and Pimpernel enzymes were 336 and 340 nm, respectively, suggesting small differences in the microenvironment of Trp residues. The Pimpernel enzyme emitted more fluorescence than the Desirée apyrase at the same concentration although both enzymes have the same number of Trp residues. The binding of the nonhydrolyzable substrate analogs decreased the fluorescence emission of both apyrases, indicating the presence of conformational changes in the neighborhood of Trp residues. Experiments with quenchers of different polarities, such as acrylamide, Cs+ and I-, indicated the existence of differences in the nucleotide-binding site, as further shown by quenching experiments in the presence of nonhydrolyzable substrate analogs. Differences in the nucleotide-binding site may explain, at least in part, the kinetic differences of the Pimpernel and Desirée isoapyrases.
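Quenching experiments of the kind mentioned in this record are commonly summarized with the Stern-Volmer relation, F0/F = 1 + Ksv[Q]. The abstract does not say how the authors analyzed their data, so the Python sketch below is only an assumed illustration: it estimates Ksv as the least-squares slope of F0/F - 1 against quencher concentration, with the line forced through the origin. The function name and the synthetic data are hypothetical.

```python
def stern_volmer_constant(quencher_conc, f0, f_values):
    """Least-squares slope of (F0/F - 1) versus [Q], forced through the origin.

    quencher_conc: quencher concentrations (at least one must be nonzero)
    f0: fluorescence intensity without quencher
    f_values: intensities measured at each quencher concentration
    """
    num = sum(q * (f0 / f - 1.0) for q, f in zip(quencher_conc, f_values))
    den = sum(q * q for q in quencher_conc)
    return num / den


if __name__ == "__main__":
    # Synthetic data generated with Ksv = 5 per molar (illustrative only).
    conc = [0.1, 0.2, 0.4]
    f0 = 100.0
    intensities = [f0 / (1.0 + 5.0 * q) for q in conc]
    print(stern_volmer_constant(conc, f0, intensities))
```

Comparing the fitted constant for a neutral quencher such as acrylamide with those for charged quenchers such as Cs+ and I- is one way to describe how exposed, and how charged, the environment around the tryptophan residues is.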