WorldWideScience

Sample records for adaptor-associated clathrin-box motifs

  1. The Motif Tracking Algorithm

    CERN Document Server

    Wilson, William; Aickelin, Uwe; 10.1007/s11633.008.0032.0

    2010-01-01

    The search for patterns or motifs in data represents a problem area of key interest to finance and economic researchers. In this paper we introduce the Motif Tracking Algorithm, a novel immune inspired pattern identification tool that is able to identify unknown motifs of a non specified length which repeat within time series data. The power of the algorithm comes from the fact that it uses a small number of parameters with minimal assumptions regarding the data being examined or the underlying motifs. Our interest lies in applying the algorithm to financial time series data to identify unknown patterns that exist. The algorithm is tested using three separate data sets. Particular suitability to financial data is shown by applying it to oil price data. In all cases the algorithm identifies the presence of a motif population in a fast and efficient manner due to the utilisation of an intuitive symbolic representation. The resulting population of motifs is shown to have considerable potential value for other ap...

  2. The Motif Tracking Algorithm

    Institute of Scientific and Technical Information of China (English)

    2008-01-01

    The search for patterns or motifs in data represents a problem area of key interest to finance and economic researchers. In this paper, we introduce the motif tracking algorithm (MTA), a novel immune inspired (IS) pattern identification tool that is able to identify unknown motifs of a non specified length which repeat within time series data. The power of the algorithm comes from the fact that it uses a small number of parameters with minimal assumptions regarding the data being examined or the underlying motifs. Our interest lies in applying the algorithm to financial time series data to identify unknown patterns that exist. The algorithm is tested using three separate data sets. Particular suitability to financial data is shown by applying it to oil price data. In all cases, the algorithm identifies the presence of a motif population in a fast and efficient manner due to the utilization of an intuitive symbolic representation.The resulting population of motifs is shown to have considerable potential value for other applications such as forecasting and algorithm seeding.

  3. Visibility graph motifs

    CERN Document Server

    Iacovacci, Jacopo

    2015-01-01

    Visibility algorithms transform time series into graphs and encode dynamical information in their topology, paving the way for graph-theoretical time series analysis as well as building a bridge between nonlinear dynamics and network science. In this work we introduce and study the concept of visibility graph motifs, smaller substructures that appear with characteristic frequencies. We develop a theory to compute in an exact way the motif profiles associated to general classes of deterministic and stochastic dynamics. We find that this simple property is indeed a highly informative and computationally efficient feature capable to distinguish among different dynamics and robust against noise contamination. We finally confirm that it can be used in practice to perform unsupervised learning, by extracting motif profiles from experimental heart-rate series and being able, accordingly, to disentangle meditative from other relaxation states. Applications of this general theory include the automatic classification a...

  4. The MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole;

    2010-01-01

    In vertebrates, the onset of cellular immune reactions is controlled by presentation of peptides in complex with major histocompatibility complex (MHC) molecules to T cell receptors. In humans, MHCs are called human leukocyte antigens (HLAs). Different MHC molecules present different subsets of...... peptides, and knowledge of their binding specificities is important for understanding differences in the immune response between individuals. Algorithms predicting which peptides bind a given MHC molecule have recently been developed with high prediction accuracy. The utility of these algorithms is...... binding motif for each MHC molecule is predicted using state-of-the-art, pan-specific peptide-MHC binding-prediction methods, and is visualized as a sequence logo, in a format that allows for a comprehensive interpretation of binding motif anchor positions and amino acid preferences....

  5. Mining protein sequences for motifs.

    Science.gov (United States)

    Narasimhan, Giri; Bu, Changsong; Gao, Yuan; Wang, Xuning; Xu, Ning; Mathee, Kalai

    2002-01-01

    We use methods from Data Mining and Knowledge Discovery to design an algorithm for detecting motifs in protein sequences. The algorithm assumes that a motif is constituted by the presence of a "good" combination of residues in appropriate locations of the motif. The algorithm attempts to compile such good combinations into a "pattern dictionary" by processing an aligned training set of protein sequences. The dictionary is subsequently used to detect motifs in new protein sequences. Statistical significance of the detection results are ensured by statistically determining the various parameters of the algorithm. Based on this approach, we have implemented a program called GYM. The Helix-Turn-Helix motif was used as a model system on which to test our program. The program was also extended to detect Homeodomain motifs. The detection results for the two motifs compare favorably with existing programs. In addition, the GYM program provides a lot of useful information about a given protein sequence. PMID:12487759

  6. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole;

    2008-01-01

    In vertebrates, the major histocompatibility complex (MHC) presents peptides to the immune system. In humans, MHCs are called human leukocyte antigens (HLAs), and some of the loci encoding them are the most polymorphic in the human genome. Different MHC molecules present different subsets of....... Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif...

  7. Sequential visibility-graph motifs

    Science.gov (United States)

    Iacovacci, Jacopo; Lacasa, Lucas

    2016-04-01

    Visibility algorithms transform time series into graphs and encode dynamical information in their topology, paving the way for graph-theoretical time series analysis as well as building a bridge between nonlinear dynamics and network science. In this work we introduce and study the concept of sequential visibility-graph motifs, smaller substructures of n consecutive nodes that appear with characteristic frequencies. We develop a theory to compute in an exact way the motif profiles associated with general classes of deterministic and stochastic dynamics. We find that this simple property is indeed a highly informative and computationally efficient feature capable of distinguishing among different dynamics and robust against noise contamination. We finally confirm that it can be used in practice to perform unsupervised learning, by extracting motif profiles from experimental heart-rate series and being able, accordingly, to disentangle meditative from other relaxation states. Applications of this general theory include the automatic classification and description of physical, biological, and financial time series.

  8. Main: SEF1MOTIF [PLACE

    Lifescience Database Archive (English)

    Full Text Available inding motif; sequence found in 5'-upstream region (-640; -765) of soybean beta-conglicinin (7S globulin) ge...ne; W=A/T; SOYBEAN; STORAGE PROTEIN; 7S; GLOBULIN; BETA-CONGLICININ; seed; soybean (Glycine max) ATATTTAWW ...

  9. Reference: TCA1MOTIF [PLACE

    Lifescience Database Archive (English)

    Full Text Available TCA1MOTIF Goldsbrough AP, Albrecht H, Stratford R Salicylic acid-inducible binding ...of a tobacco nuclear protein to a 10 bp sequence which is highly conserved amongst stress-inducible genes. Plant J 3:563-571 (1993) PubMed: 8220463; ...

  10. MODIS: an audio motif discovery software

    OpenAIRE

    Catanese, Laurence; Souviraà-Labastie, Nathan; Qu, Bingqing; Campion, Sébastien; Gravier, Guillaume; Vincent, Emmanuel; Bimbot, Frédéric

    2013-01-01

    International audience MODIS is a free speech and audio motif discovery software developed at IRISA Rennes. Motif discovery is the task of discovering and collecting occurrences of repeating patterns in the absence of prior knowledge, or training material. MODIS is based on a generic approach to mine repeating audio sequences, with tolerance to motif variability. The algorithm implementation allows to process large audio streams at a reasonable speed where motif discovery often requires hu...

  11. Structural Motifs of Gold Nanoparticles.

    Science.gov (United States)

    Cleveland, C. L.; Luedtke, W. D.; Landman, Uzi

    1996-03-01

    Through an extensive search, involving energy minimization using embedded atom potentials, we found(R.L. Whetten et al./), submitted to Nature (1995). that the energetically optimal sequence for AuN clusters (30 motif, and variants thereof. These predictions for bare gold particles, and for particles coated by sef-assembled thiol monolayers, are discussed in light of recent experiments on the preparation and characterization (including mass spectrometry, electron microscopy, and X-ray diffraction) of nanocrystalline gold molecules (see Ref. 2).

  12. Main: TCA1MOTIF [PLACE

    Lifescience Database Archive (English)

    Full Text Available TCA1MOTIF S000159 17-May-1998 (last modified) kehi TCA-1 (tobacco nuclear protein 1...) binding site; Related to salicylic acid-inducible expression of many genes; Found in barley beta-1,3-gluca...nase and over 30 different plant genes which are known to be induced by one or more forms of stress; A similar sequence (TCA... et al., 1997); SA; salicylic acid; stress; TCA-1; barley (Hordeum vulgare); tobacco (Nicotiana tabacum); TCATCTTCTT ...

  13. Comprehensive discovery of DNA motifs in 349 human cells and tissues reveals new features of motifs.

    Science.gov (United States)

    Zheng, Yiyu; Li, Xiaoman; Hu, Haiyan

    2015-01-01

    Comprehensive motif discovery under experimental conditions is critical for the global understanding of gene regulation. To generate a nearly complete list of human DNA motifs under given conditions, we employed a novel approach to de novo discover significant co-occurring DNA motifs in 349 human DNase I hypersensitive site datasets. We predicted 845 to 1325 motifs in each dataset, for a total of 2684 non-redundant motifs. These 2684 motifs contained 54.02 to 75.95% of the known motifs in seven large collections including TRANSFAC. In each dataset, we also discovered 43 663 to 2 013 288 motif modules, groups of motifs with their binding sites co-occurring in a significant number of short DNA regions. Compared with known interacting transcription factors in eight resources, the predicted motif modules on average included 84.23% of known interacting motifs. We further showed new features of the predicted motifs, such as motifs enriched in proximal regions rarely overlapped with motifs enriched in distal regions, motifs enriched in 5' distal regions were often enriched in 3' distal regions, etc. Finally, we observed that the 2684 predicted motifs classified the cell or tissue types of the datasets with an accuracy of 81.29%. The resources generated in this study are available at http://server.cs.ucf.edu/predrem/.

  14. Motif Detection Inspired by Immune Memory

    CERN Document Server

    Wilson, William; Aickelin, Uwe

    2010-01-01

    The search for patterns or motifs in data represents an area of key interest to many researchers. In this paper we present the Motif Tracking Algorithm, a novel immune inspired pattern identification tool that is able to identify variable length unknown motifs which repeat within time series data. The algorithm searches from a completely neutral perspective that is independent of the data being analysed and the underlying motifs. In this paper we test the flexibility of the motif tracking algorithm by applying it to the search for patterns in two industrial data sets. The algorithm is able to identify a population of motifs successfully in both cases, and the value of these motifs is discussed.

  15. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Full Text Available Abstract Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.

  16. An Algorithm for Motif Discovery with Iteration on Lengths of Motifs.

    Science.gov (United States)

    Fan, Yetian; Wu, Wei; Yang, Jie; Yang, Wenyu; Liu, Rongrong

    2015-01-01

    Analysis of DNA sequence motifs is becoming increasingly important in the study of gene regulation, and the identification of motif in DNA sequences is a complex problem in computational biology. Motif discovery has attracted the attention of more and more researchers, and varieties of algorithms have been proposed. Most existing motif discovery algorithms fix the motif's length as one of the input parameters. In this paper, a novel method is proposed to identify the optimal length of the motif and the optimal motif with that length, through an iteration process on increasing length numbers. For each fixed length, a modified genetic algorithm (GA) is used for finding the optimal motif with that length. Three operators are used in the modified GA: Mutation that is similar to the one used in usual GA but is modified to avoid local optimum in our case, and Addition and Deletion that are proposed by us for the problem. A criterion is given for singling out the optimal length in the increasing motif's lengths. We call this method AMDILM (an algorithm for motif discovery with iteration on lengths of motifs). The experiments on simulated data and real biological data show that AMDILM can accurately identify the optimal motif length. Meanwhile, the optimal motifs discovered by AMDILM are consistent with the real ones and are similar with the motifs obtained by the three well-known methods: Gibbs Sampler, MEME and Weeder. PMID:26357084

  17. rMotifGen: random motif generator for DNA and protein sequences

    Directory of Open Access Journals (Sweden)

    Hardin C Timothy

    2007-08-01

    Full Text Available Abstract Background Detection of short, subtle conserved motif regions within a set of related DNA or amino acid sequences can lead to discoveries about important regulatory domains such as transcription factor and DNA binding sites as well as conserved protein domains. In order to help assess motif detection algorithms on motifs with varying properties and levels of conservation, we have developed a computational tool, rMotifGen, with the sole purpose of generating a number of random DNA or protein sequences containing short sequence motifs. Each motif consensus can be user-defined, randomly generated, or created from a position-specific scoring matrix (PSSM. Insertions and mutations within these motifs are created according to user-defined parameters and substitution matrices. The resulting sequences can be helpful in mutational simulations and in testing the limits of motif detection algorithms. Results Two implementations of rMotifGen have been created, one providing a graphical user interface (GUI for random motif construction, and the other serving as a command line interface. The second implementation has the added advantages of platform independence and being able to be called in a batch mode. rMotifGen was used to construct sample sets of sequences containing DNA motifs and amino acid motifs that were then tested against the Gibbs sampler and MEME packages. Conclusion rMotifGen provides an efficient and convenient method for creating random DNA or amino acid sequences with a variable number of motifs, where the instance of each motif can be incorporated using a position-specific scoring matrix (PSSM or by creating an instance mutated from its corresponding consensus using an evolutionary model based on substitution matrices. rMotifGen is freely available at: http://bioinformatics.louisville.edu/brg/rMotifGen/.

  18. Biological network motif detection: principles and practice.

    Science.gov (United States)

    Wong, Elisabeth; Baur, Brittany; Quader, Saad; Huang, Chun-Hsi

    2012-03-01

    Network motifs are statistically overrepresented sub-structures (sub-graphs) in a network, and have been recognized as 'the simple building blocks of complex networks'. Study of biological network motifs may reveal answers to many important biological questions. The main difficulty in detecting larger network motifs in biological networks lies in the facts that the number of possible sub-graphs increases exponentially with the network or motif size (node counts, in general), and that no known polynomial-time algorithm exists in deciding if two graphs are topologically equivalent. This article discusses the biological significance of network motifs, the motivation behind solving the motif-finding problem, and strategies to solve the various aspects of this problem. A simple classification scheme is designed to analyze the strengths and weaknesses of several existing algorithms. Experimental results derived from a few comparative studies in the literature are discussed, with conclusions that lead to future research directions. PMID:22396487

  19. PairMotif: A New Pattern-Driven Algorithm for Planted (l, d) DNA Motif Search

    OpenAIRE

    Qiang Yu; Hongwei Huo; Yipu Zhang; Hongzhi Guo

    2012-01-01

    Motif search is a fundamental problem in bioinformatics with an important application in locating transcription factor binding sites (TFBSs) in DNA sequences. The exact algorithms can report all (l, d) motifs and find the best one under a specific objective function. However, it is still a challenging task to identify weak motifs, since either a large amount of memory or execution time is required by current exact algorithms. A new exact algorithm, PairMotif, is proposed for planted (l, d) mo...

  20. Assessment of composite motif discovery methods

    Directory of Open Access Journals (Sweden)

    Johansen Jostein

    2008-02-01

    Full Text Available Abstract Background Computational discovery of regulatory elements is an important area of bioinformatics research and more than a hundred motif discovery methods have been published. Traditionally, most of these methods have addressed the problem of single motif discovery – discovering binding motifs for individual transcription factors. In higher organisms, however, transcription factors usually act in combination with nearby bound factors to induce specific regulatory behaviours. Hence, recent focus has shifted from single motifs to the discovery of sets of motifs bound by multiple cooperating transcription factors, so called composite motifs or cis-regulatory modules. Given the large number and diversity of methods available, independent assessment of methods becomes important. Although there have been several benchmark studies of single motif discovery, no similar studies have previously been conducted concerning composite motif discovery. Results We have developed a benchmarking framework for composite motif discovery and used it to evaluate the performance of eight published module discovery tools. Benchmark datasets were constructed based on real genomic sequences containing experimentally verified regulatory modules, and the module discovery programs were asked to predict both the locations of these modules and to specify the single motifs involved. To aid the programs in their search, we provided position weight matrices corresponding to the binding motifs of the transcription factors involved. In addition, selections of decoy matrices were mixed with the genuine matrices on one dataset to test the response of programs to varying levels of noise. Conclusion Although some of the methods tested tended to score somewhat better than others overall, there were still large variations between individual datasets and no single method performed consistently better than the rest in all situations. The variation in performance on individual

  1. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  2. Helix-packing motifs in membrane proteins.

    Science.gov (United States)

    Walters, R F S; DeGrado, W F

    2006-09-12

    The fold of a helical membrane protein is largely determined by interactions between membrane-imbedded helices. To elucidate recurring helix-helix interaction motifs, we dissected the crystallographic structures of membrane proteins into a library of interacting helical pairs. The pairs were clustered according to their three-dimensional similarity (rmsd universe of common transmembrane helix-pairing motifs is relatively simple. The largest cluster, which comprises 29% of the library members, consists of an antiparallel motif with left-handed packing angles, and it is frequently stabilized by packing of small side chains occurring every seven residues in the sequence. Right-handed parallel and antiparallel structures show a similar tendency to segregate small residues to the helix-helix interface but spaced at four-residue intervals. Position-specific sequence propensities were derived for the most populated motifs. These structural and sequential motifs should be quite useful for the design and structural prediction of membrane proteins.

  3. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network

  4. MotifLab: a tools and data integration workbench for motif discovery and regulatory sequence analysis

    Directory of Open Access Journals (Sweden)

    Klepper Kjetil

    2013-01-01

    Full Text Available Abstract Background Traditional methods for computational motif discovery often suffer from poor performance. In particular, methods that search for sequence matches to known binding motifs tend to predict many non-functional binding sites because they fail to take into consideration the biological state of the cell. In recent years, genome-wide studies have generated a lot of data that has the potential to improve our ability to identify functional motifs and binding sites, such as information about chromatin accessibility and epigenetic states in different cell types. However, it is not always trivial to make use of this data in combination with existing motif discovery tools, especially for researchers who are not skilled in bioinformatics programming. Results Here we present MotifLab, a general workbench for analysing regulatory sequence regions and discovering transcription factor binding sites and cis-regulatory modules. MotifLab supports comprehensive motif discovery and analysis by allowing users to integrate several popular motif discovery tools as well as different kinds of additional information, including phylogenetic conservation, epigenetic marks, DNase hypersensitive sites, ChIP-Seq data, positional binding preferences of transcription factors, transcription factor interactions and gene expression. MotifLab offers several data-processing operations that can be used to create, manipulate and analyse data objects, and complete analysis workflows can be constructed and automatically executed within MotifLab, including graphical presentation of the results. Conclusions We have developed MotifLab as a flexible workbench for motif analysis in a genomic context. The flexibility and effectiveness of this workbench has been demonstrated on selected test cases, in particular two previously published benchmark data sets for single motifs and modules, and a realistic example of genes responding to treatment with forskolin. MotifLab is freely

  5. Detecting Motifs in System Call Sequences

    CERN Document Server

    Wilson, William O; Aickelin, Uwe

    2010-01-01

    The search for patterns or motifs in data represents an area of key interest to many researchers. In this paper we present the Motif Tracking Algorithm, a novel immune inspired pattern identification tool that is able to identify unknown motifs which repeat within time series data. The power of the algorithm is derived from its use of a small number of parameters with minimal assumptions. The algorithm searches from a completely neutral perspective that is independent of the data being analysed, and the underlying motifs. In this paper the motif tracking algorithm is applied to the search for patterns within sequences of low level system calls between the Linux kernel and the operating system's user space. The MTA is able to compress data found in large system call data sets to a limited number of motifs which summarise that data. The motifs provide a resource from which a profile of executed processes can be built. The potential for these profiles and new implications for security research are highlighted. A...

  6. Automated motif discovery from glycan array data.

    Science.gov (United States)

    Cholleti, Sharath R; Agravat, Sanjay; Morris, Tim; Saltz, Joel H; Song, Xuezheng; Cummings, Richard D; Smith, David F

    2012-10-01

    Assessing interactions of a glycan-binding protein (GBP) or lectin with glycans on a microarray generates large datasets, making it difficult to identify a glycan structural motif or determinant associated with the highest apparent binding strength of the GBP. We have developed a computational method, termed GlycanMotifMiner, that uses the relative binding of a GBP with glycans within a glycan microarray to automatically reveal the glycan structural motifs recognized by a GBP. We implemented the software with a web-based graphical interface for users to explore and visualize the discovered motifs. The utility of GlycanMotifMiner was determined using five plant lectins, SNA, HPA, PNA, Con A, and UEA-I. Data from the analyses of the lectins at different protein concentrations were processed to rank the glycans based on their relative binding strengths. The motifs, defined as glycan substructures that exist in a large number of the bound glycans and few non-bound glycans, were then discovered by our algorithm and displayed in a web-based graphical user interface ( http://glycanmotifminer.emory.edu ). The information is used in defining the glycan-binding specificity of GBPs. The results were compared to the known glycan specificities of these lectins generated by manual methods. A more complex analysis was also carried out using glycan microarray data obtained for a recombinant form of human galectin-8. Results for all of these lectins show that GlycanMotifMiner identified the major motifs known in the literature along with some unexpected novel binding motifs. PMID:22877213

  7. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.;

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed. © 2004 Elsevier B.V. All rights reserved....

  8. MotifMiner: A Table Driven Greedy Algorithm for DNA Motif Mining

    Science.gov (United States)

    Seeja, K. R.; Alam, M. A.; Jain, S. K.

    DNA motif discovery is a much explored problem in functional genomics. This paper describes a table driven greedy algorithm for discovering regulatory motifs in the promoter sequences of co-expressed genes. The proposed algorithm searches both DNA strands for the common patterns or motifs. The inputs to the algorithm are set of promoter sequences, the motif length and minimum Information Content. The algorithm generates subsequences of given length from the shortest input promoter sequence. It stores these subsequences and their reverse complements in a table. Then it searches the remaining sequences for good matches of these subsequences. The Information Content score is used to measure the goodness of the motifs. The algorithm has been tested with synthetic data and real data. The results are found promising. The algorithm could discover meaningful motifs from the muscle specific regulatory sequences.

  9. Detecting seeded motifs in DNA sequences

    OpenAIRE

    Pizzi, Cinzia; Bortoluzzi, Stefania; Bisognin, Andrea; Coppe, Alessandro; Danieli, Gian Antonio

    2005-01-01

    The problem of detecting DNA motifs with functional relevance in real biological sequences is difficult due to a number of biological, statistical and computational issues and also because of the lack of knowledge about the structure of searched patterns. Many algorithms are implemented in fully automated processes, which are often based upon a guess of input parameters from the user at the very first step. In this paper, we present a novel method for the detection of seeded DNA motifs, compo...

  10. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data

    OpenAIRE

    Ngoc Tam L. Tran; Huang, Chun-Hsi

    2014-01-01

    Abstract ChIP-Seq (chromatin immunoprecipitation sequencing) has provided the advantage for finding motifs as ChIP-Seq experiments narrow down the motif finding to binding site locations. Recent motif finding tools facilitate the motif detection by providing user-friendly Web interface. In this work, we reviewed nine motif finding Web tools that are capable for detecting binding site motifs in ChIP-Seq data. We showed each motif finding Web tool has its own advantages for detecting motifs tha...

  11. Detecting seeded motifs in DNA sequences.

    Science.gov (United States)

    Pizzi, Cinzia; Bortoluzzi, Stefania; Bisognin, Andrea; Coppe, Alessandro; Danieli, Gian Antonio

    2005-01-01

    The problem of detecting DNA motifs with functional relevance in real biological sequences is difficult due to a number of biological, statistical and computational issues and also because of the lack of knowledge about the structure of searched patterns. Many algorithms are implemented in fully automated processes, which are often based upon a guess of input parameters from the user at the very first step. In this paper, we present a novel method for the detection of seeded DNA motifs, composed by regions with a different extent of variability. The method is based on a multi-step approach, which was implemented in a motif searching web tool (MOST). Overrepresented exact patterns are extracted from input sequences and clustered to produce motifs core regions, which are then extended and scored to generate seeded motifs. The combination of automated pattern discovery algorithms and different display tools for the evaluation and selection of results at several analysis steps can potentially lead to much more meaningful results than complete automation can produce. Experimental results on different yeast and human real datasets proved the methodology to be a promising solution for finding seeded motifs. MOST web tool is freely available at http://telethon.bio.unipd.it/bioinfo/MOST. PMID:16141193

  12. Chaotic motifs in gene regulatory networks.

    Science.gov (United States)

    Zhang, Zhaoyang; Ye, Weiming; Qian, Yu; Zheng, Zhigang; Huang, Xuhui; Hu, Gang

    2012-01-01

    Chaos should occur often in gene regulatory networks (GRNs) which have been widely described by nonlinear coupled ordinary differential equations, if their dimensions are no less than 3. It is therefore puzzling that chaos has never been reported in GRNs in nature and is also extremely rare in models of GRNs. On the other hand, the topic of motifs has attracted great attention in studying biological networks, and network motifs are suggested to be elementary building blocks that carry out some key functions in the network. In this paper, chaotic motifs (subnetworks with chaos) in GRNs are systematically investigated. The conclusion is that: (i) chaos can only appear through competitions between different oscillatory modes with rivaling intensities. Conditions required for chaotic GRNs are found to be very strict, which make chaotic GRNs extremely rare. (ii) Chaotic motifs are explored as the simplest few-node structures capable of producing chaos, and serve as the intrinsic source of chaos of random few-node GRNs. Several optimal motifs causing chaos with atypically high probability are figured out. (iii) Moreover, we discovered that a number of special oscillators can never produce chaos. These structures bring some advantages on rhythmic functions and may help us understand the robustness of diverse biological rhythms. (iv) The methods of dominant phase-advanced driving (DPAD) and DPAD time fraction are proposed to quantitatively identify chaotic motifs and to explain the origin of chaotic behaviors in GRNs.

  13. A novel motif identified in dependence receptors.

    Directory of Open Access Journals (Sweden)

    Gabriel del Rio

    Full Text Available Programmed cell death signaling is a critical feature of development, cellular turnover, oncogenesis, and neurodegeneration, among other processes. Such signaling may be transduced via specific receptors, either following ligand binding-to death receptors-or following the withdrawal of trophic ligands-from dependence receptors. Although dependence receptors display functional similarities, no common structural domains have been identified. Therefore, we employed the Multiple Expectation Maximization for Motif Elicitation and the Motif Alignment and Search Tool software programs to identify a novel transmembrane motif, dubbed dependence-associated receptor transmembrane (DART motif, that is common to all described dependence receptors. Of 3,465 human transmembrane proteins, 25 (0.7% display the DART motif. The predicted secondary structure features an alpha helical structure, with an unusually high percentage of valine residues. At least four of the proteins undergo regulated intramembrane proteolysis. To date, we have not identified a function for this putative domain. We speculate that the DART motif may be involved in protein processing, interaction with other proteins or lipids, or homomultimerization.

  14. Detecting seeded motifs in DNA sequences

    Science.gov (United States)

    Pizzi, Cinzia; Bortoluzzi, Stefania; Bisognin, Andrea; Coppe, Alessandro; Danieli, Gian Antonio

    2005-01-01

    The problem of detecting DNA motifs with functional relevance in real biological sequences is difficult due to a number of biological, statistical and computational issues and also because of the lack of knowledge about the structure of searched patterns. Many algorithms are implemented in fully automated processes, which are often based upon a guess of input parameters from the user at the very first step. In this paper, we present a novel method for the detection of seeded DNA motifs, composed by regions with a different extent of variability. The method is based on a multi-step approach, which was implemented in a motif searching web tool (MOST). Overrepresented exact patterns are extracted from input sequences and clustered to produce motifs core regions, which are then extended and scored to generate seeded motifs. The combination of automated pattern discovery algorithms and different display tools for the evaluation and selection of results at several analysis steps can potentially lead to much more meaningful results than complete automation can produce. Experimental results on different yeast and human real datasets proved the methodology to be a promising solution for finding seeded motifs. MOST web tool is freely available at . PMID:16141193

  15. 3matrix and 3motif: a protein structure visualization system for conserved sequence motifs

    OpenAIRE

    Bennett, Steven P.; Lu, Lin; Brutlag, Douglas L.

    2003-01-01

    Computational methods such as sequence alignment and motif construction are useful in grouping related proteins into families, as well as helping to annotate new proteins of unknown function. These methods identify conserved amino acids in protein sequences, but cannot determine the specific functional or structural roles of conserved amino acids without additional study. In this work, we present 3matrix (http://3matrix.stanford.edu) and 3motif (http://3motif.stanford.edu), a web-based sequen...

  16. WebMOTIFS: automated discovery, filtering and scoring of DNA sequence motifs using multiple programs and Bayesian approaches.

    Science.gov (United States)

    Romer, Katherine A; Kayombya, Guy-Richard; Fraenkel, Ernest

    2007-07-01

    WebMOTIFS provides a web interface that facilitates the discovery and analysis of DNA-sequence motifs. Several studies have shown that the accuracy of motif discovery can be significantly improved by using multiple de novo motif discovery programs and using randomized control calculations to identify the most significant motifs or by using Bayesian approaches. WebMOTIFS makes it easy to apply these strategies. Using a single submission form, users can run several motif discovery programs and score, cluster and visualize the results. In addition, the Bayesian motif discovery program THEME can be used to determine the class of transcription factors that is most likely to regulate a set of sequences. Input can be provided as a list of gene or probe identifiers. Used with the default settings, WebMOTIFS accurately identifies biologically relevant motifs from diverse data in several species. WebMOTIFS is freely available at http://fraenkel.mit.edu/webmotifs.

  17. MOTIFATOR : detection and characterization of regulatory motifs using prokaryote transcriptome data

    NARCIS (Netherlands)

    Blom, Evert-Jan; Roerdink, Jos B.T.M.; Kuipers, Oscar P.; Hijum, Sacha A.F.T. van

    2009-01-01

    Unraveling regulatory mechanisms (e.g. identification of motifs in cis-regulatory regions) remains a major challenge in the analysis of transcriptome experiments. Existing applications identify putative motifs from gene lists obtained at rather arbitrary cutoff and require additional manual processi

  18. Sublinear Time Motif Discovery from Multiple Sequences

    Directory of Open Access Journals (Sweden)

    Yunhui Fu

    2013-10-01

    Full Text Available In this paper, a natural probabilistic model for motif discovery has been used to experimentally test the quality of motif discovery programs. In this model, there are k background sequences, and each character in a background sequence is a random character from an alphabet, Σ. A motif G = g1g2 ... gm is a string of m characters. In each background sequence is implanted a probabilistically-generated approximate copy of G. For a probabilistically-generated approximate copy b1b2 ... bm of G, every character, bi, is probabilistically generated, such that the probability for bi ≠ gi is at most α. We develop two new randomized algorithms and one new deterministic algorithm. They make advancements in the following aspects: (1 The algorithms are much faster than those before. Our algorithms can even run in sublinear time. (2 They can handle any motif pattern. (3 The restriction for the alphabet size is a lower bound of four. This gives them potential applications in practical problems, since gene sequences have an alphabet size of four. (4 All algorithms have rigorous proofs about their performances. The methods developed in this paper have been used in the software implementation. We observed some encouraging results that show improved performance for motif detection compared with other software.

  19. Functional characterization of variations on regulatory motifs.

    Directory of Open Access Journals (Sweden)

    Michal Lapidot

    2008-03-01

    Full Text Available Transcription factors (TFs regulate gene expression through specific interactions with short promoter elements. The same regulatory protein may recognize a variety of related sequences. Moreover, once they are detected it is hard to predict whether highly similar sequence motifs will be recognized by the same TF and regulate similar gene expression patterns, or serve as binding sites for distinct regulatory factors. We developed computational measures to assess the functional implications of variations on regulatory motifs and to compare the functions of related sites. We have developed computational means for estimating the functional outcome of substituting a single position within a binding site and applied them to a collection of putative regulatory motifs. We predict the effects of nucleotide variations within motifs on gene expression patterns. In cases where such predictions could be compared to suitable published experimental evidence, we found very good agreement. We further accumulated statistics from multiple substitutions across various binding sites in an attempt to deduce general properties that characterize nucleotide substitutions that are more likely to alter expression. We found that substitutions involving Adenine are more likely to retain the expression pattern and that substitutions involving Guanine are more likely to alter expression compared to the rest of the substitutions. Our results should facilitate the prediction of the expression outcomes of binding site variations. One typical important implication is expected to be the ability to predict the phenotypic effect of variation in regulatory motifs in promoters.

  20. SMOTIF: efficient structured pattern and profile motif search

    OpenAIRE

    Zaki Mohammed J; Zhang Yongqiang

    2006-01-01

    Abstract Background A structured motif allows variable length gaps between several components, where each component is a simple motif, which allows either no gaps or only fixed length gaps. The motif can either be represented as a pattern or a profile (also called positional weight matrix). We propose an efficient algorithm, called SMOTIF, to solve the structured motif search problem, i.e., given one or more sequences and a structured motif, SMOTIF searches the sequences for all occurrences o...

  1. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Full Text Available Armadillo (ARM repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  2. Sequential motif profile of natural visibility graphs

    CERN Document Server

    Iacovacci, Jacopo

    2016-01-01

    The concept of sequential visibility graph motifs -subgraphs appearing with characteristic frequencies in the visibility graphs associated to time series- has been advanced recently along with a theoretical framework to compute analytically the motif profiles associated to Horizontal Visibility Graphs (HVGs). Here we develop a theory to compute the profile of sequential visibility graph motifs in the context of Natural Visibility Graphs (VGs). This theory gives exact results for deterministic aperiodic processes with a smooth invariant density or stochastic processes that fulfil the Markov property and have a continuous marginal distribution. The framework also allows for a linear time numerical estimation in the case of empirical time series. A comparison between the HVG and the VG case (including evaluation of their robustness for short series polluted with measurement noise) is also presented.

  3. Motifs and structural blocks retrieval by GHT

    Science.gov (United States)

    Cantoni, Virginio; Ferone, Alessio; Petrosino, Alfredo; Polat, Ozlem

    2014-06-01

    The structure of a protein gives more insight on the protein function than its amino acid sequence. Protein structure analysis and comparison are important for understanding the evolutionary relationships among proteins, predicting protein functions, and predicting protein folding. Proteins are formed by two basic regular 3D structural patterns, called Secondary Structures (SSs): helices and sheets. A structural motif is a compact 3D protein block referring to a small specific combination of secondary structural elements, which appears in a variety of molecules. In this paper we compare a few approaches for motif retrieval based on the Generalized Hough Transform (GHT). A primary technique is to adopt the single SS as structural primitives; alternatives are to adopt a SSs pair as primitive structural element, or a SSs triplet, and so on up-to an entire motif. The richer the primitive, the higher the time for pre-analysis and search, and the simpler the inspection process on the parameter space for analyzing the peaks. Performance comparisons, in terms of precision and computation time, are here presented considering the retrieval of motifs composed by three to five SSs for more than 15 million searches. The approach can be easily applied to the retrieval of greater blocks, up to protein domains, or even entire proteins.

  4. Highly scalable Ab initio genomic motif identification

    KAUST Repository

    Marchand, Benoît

    2011-01-01

    We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motiffinding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.

  5. The Motif of Meeting in Digital Education

    Science.gov (United States)

    Sheail, Philippa

    2015-01-01

    This article draws on theoretical work which considers the composition of meetings, in order to think about the form of the meeting in digital environments for higher education. To explore the motif of meeting, I undertake a "compositional interpretation" (Rose, 2012) of the default interface offered by "Collaborate", an…

  6. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  7. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  8. Using SCOPE to identify potential regulatory motifs in coregulated genes.

    Science.gov (United States)

    Martyanov, Viktor; Gross, Robert H

    2011-05-31

    SCOPE is an ensemble motif finder that uses three component algorithms in parallel to identify potential regulatory motifs by over-representation and motif position preference. Each component algorithm is optimized to find a different kind of motif. By taking the best of these three approaches, SCOPE performs better than any single algorithm, even in the presence of noisy data. In this article, we utilize a web version of SCOPE to examine genes that are involved in telomere maintenance. SCOPE has been incorporated into at least two other motif finding programs and has been used in other studies. The three algorithms that comprise SCOPE are BEAM, which finds non-degenerate motifs (ACCGGT), PRISM, which finds degenerate motifs (ASCGWT), and SPACER, which finds longer bipartite motifs (ACCnnnnnnnnGGT). These three algorithms have been optimized to find their corresponding type of motif. Together, they allow SCOPE to perform extremely well. Once a gene set has been analyzed and candidate motifs identified, SCOPE can look for other genes that contain the motif which, when added to the original set, will improve the motif score. This can occur through over-representation or motif position preference. Working with partial gene sets that have biologically verified transcription factor binding sites, SCOPE was able to identify most of the rest of the genes also regulated by the given transcription factor. Output from SCOPE shows candidate motifs, their significance, and other information both as a table and as a graphical motif map. FAQs and video tutorials are available at the SCOPE web site which also includes a "Sample Search" button that allows the user to perform a trial run. Scope has a very friendly user interface that enables novice users to access the algorithm's full power without having to become an expert in the bioinformatics of motif finding. As input, SCOPE can take a list of genes, or FASTA sequences. These can be entered in browser text fields, or read from

  9. Anticipated synchronization in neuronal network motifs

    Science.gov (United States)

    Matias, F. S.; Gollo, L. L.; Carelli, P. V.; Copelli, M.; Mirasso, C. R.

    2013-01-01

    Two identical dynamical systems coupled unidirectionally (in a so called master-slave configuration) exhibit anticipated synchronization (AS) if the one which receives the coupling (the slave) also receives a negative delayed self-feedback. In oscillatory neuronal systems AS is characterized by a phase-locking with negative time delay τ between the spikes of the master and of the slave (slave fires before the master), while in the usual delayed synchronization (DS) regime τ is positive (slave fires after the master). A 3-neuron motif in which the slave self-feedback is replaced by a feedback loop mediated by an interneuron can exhibits both AS and DS regimes. Here we show that AS is robust in the presence of noise in a 3 Hodgkin-Huxley type neuronal motif. We also show that AS is stable for large values of τ in a chain of connected slaves-interneurons.

  10. Locomotif - a graphical programming system for RNA motif search

    OpenAIRE

    Reeder, Janina

    2006-01-01

    In this thesis, I am presenting the results of my work in designing, implementing and installing a software environment for RNA motif searches: Locomotif. It includes a visual editor for motif definition, translation of the motif structure to XML code and client-server interactions, and further, translation of the XML code to ADP and compilation to C.

  11. MENGUNGKAP SEJARAH DAN MOTIF BATIK SEMARANGAN

    Directory of Open Access Journals (Sweden)

    Dewi Yuliati

    2011-10-01

    Full Text Available Batik Semarang was born in line with the needs of the people of Hyderabad of the material with a new motif or style tailored to the taste, intention, and creativity of the craftsmen. Batik is a combination of several countries influence developing in Indonesian culture. Based on its shape, Batik designs can be divided into two major groups, namely geometric and non-Geometric. The development of Semarangan batik was due to the fact that certain motif of batik can only be worn by certain people, not for all group of people. Batik semarangan craftments are found in coastal regions. It displays the design composing of ornaments plucked from marine environment. Indonesian Batik develops not only to display a blending of court Batik designs with the coastal Batik technique, but also to incorporate other ornaments which come from many various ethnic groups in Indonesia.   Key words: batik, history, ornaments, marine environment, designs   Batik Semarang lahirkan sejalan dengan kebutuhan dari orang-orang dari Hyderabad akan bahan dengan motif atau gaya baru yang berdasarkan pada rasa, niat, dan kreatifitas dari pembuatnya. Batik merupakan perpaduan dari pengaruh beberapa negara yang berkembang dalam budaya Indonesia. Ditinjau dari desainnya, desain batik dapat dibagi menjadi dua kelompok utama, yakni geometrik dan nongeometrik. Pengembangan yang dilakukan terhadap batik semarangan disebabkan adanya beberapa motif batik yang hanya digunakan oleh kalangan tertentu, dan tidak boleh untuk kalangan umum. Pengrajin batik Semarangan berkembang di kawasan pesisir. Ia menampilkan desain yang terdiri atas berbagai ornamen yang menunjukkan ciri khas kemaritiman. Batik ini dikembangakan tidak hanya menampilkan desain batik khas pesisiran, tetapi juga memasukkan berbagai ornament dari beragam kelompok etnis di Indonesia.   Kata kunci: batik, sejarah, ragam hias, lingkungan pesisir, desain  

  12. Multilayer motif analysis of brain networks

    OpenAIRE

    Battiston, Federico; Nicosia, Vincenzo; Chavez, Mario; Latora, Vito

    2016-01-01

    In the last decade network science has shed new light on the anatomical connectivity and on correlations in the activity of different areas of the human brain. The study of brain networks has made possible in fact to detect the central areas of a neural system, and to identify its building blocks by looking at overabundant small subgraphs, known as motifs. However, network analysis of the brain has so far mainly focused on structural and functional networks as separate entities. The recently ...

  13. Multilayer motif analysis of brain networks

    CERN Document Server

    Battiston, Federico; Chavez, Mario; Latora, Vito

    2016-01-01

    In the last decade network science has shed new light on the anatomical connectivity and on correlations in the activity of different areas of the human brain. The study of brain networks has made possible in fact to detect the central areas of a neural system, and to identify its building blocks by looking at overabundant small subgraphs, known as motifs. However, network analysis of the brain has so far mainly focused on structural and functional networks as separate entities. The recently developed mathematical framework of multi-layer networks allows to perform a multiplex analysis of the human brain where the structural and functional layers are considered at the same time. In this work we describe how to classify subgraphs in multiplex networks, and we extend motif analysis to networks with many layers. We then extract multi-layer motifs in brain networks of healthy subjects by considering networks with two layers, respectively obtained from diffusion and functional magnetic resonance imaging. Results i...

  14. Discovering sequence motifs with arbitrary insertions and deletions.

    Directory of Open Access Journals (Sweden)

    Martin C Frith

    2008-04-01

    Full Text Available BIOLOGY IS ENCODED IN MOLECULAR SEQUENCES: deciphering this encoding remains a grand scientific challenge. Functional regions of DNA, RNA, and protein sequences often exhibit characteristic but subtle motifs; thus, computational discovery of motifs in sequences is a fundamental and much-studied problem. However, most current algorithms do not allow for insertions or deletions (indels within motifs, and the few that do have other limitations. We present a method, GLAM2 (Gapped Local Alignment of Motifs, for discovering motifs allowing indels in a fully general manner, and a companion method GLAM2SCAN for searching sequence databases using such motifs. glam2 is a generalization of the gapless Gibbs sampling algorithm. It re-discovers variable-width protein motifs from the PROSITE database significantly more accurately than the alternative methods PRATT and SAM-T2K. Furthermore, it usefully refines protein motifs from the ELM database: in some cases, the refined motifs make orders of magnitude fewer overpredictions than the original ELM regular expressions. GLAM2 performs respectably on the BAliBASE multiple alignment benchmark, and may be superior to leading multiple alignment methods for "motif-like" alignments with N- and C-terminal extensions. Finally, we demonstrate the use of GLAM2 to discover protein kinase substrate motifs and a gapped DNA motif for the LIM-only transcriptional regulatory complex: using GLAM2SCAN, we identify promising targets for the latter. GLAM2 is especially promising for short protein motifs, and it should improve our ability to identify the protein cleavage sites, interaction sites, post-translational modification attachment sites, etc., that underlie much of biology. It may be equally useful for arbitrarily gapped motifs in DNA and RNA, although fewer examples of such motifs are known at present. GLAM2 is public domain software, available for download at http://bioinformatics.org.au/glam2.

  15. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  16. ET-Motif: Solving the Exact (l, d)-Planted Motif Problem Using Error Tree Structure.

    Science.gov (United States)

    Al-Okaily, Anas; Huang, Chun-Hsi

    2016-07-01

    Motif finding is an important and a challenging problem in many biological applications such as discovering promoters, enhancers, locus control regions, transcription factors, and more. The (l, d)-planted motif search, PMS, is one of several variations of the problem. In this problem, there are n given sequences over alphabets of size [Formula: see text], each of length m, and two given integers l and d. The problem is to find a motif m of length l, where in each sequence there is at least an l-mer at a Hamming distance of [Formula: see text] of m. In this article, we propose ET-Motif, an algorithm that can solve the PMS problem in [Formula: see text] time and [Formula: see text] space. The time bound can be further reduced by a factor of m with [Formula: see text] space. In case the suffix tree that is built for the input sequences is balanced, the problem can be solved in [Formula: see text] time and [Formula: see text] space. Similarly, the time bound can be reduced by a factor of m using [Formula: see text] space. Moreover, the variations of the problem, namely the edit distance PMS and edited PMS (Quorum), can be solved using ET-Motif with simple modifications but upper bands of space and time. For edit distance PMS, the time and space bounds will be increased by [Formula: see text], while for edited PMS the increase will be of [Formula: see text] in the time bound. PMID:27152692

  17. Dynamics of network motifs in genetic regulatory networks

    Institute of Scientific and Technical Information of China (English)

    Li Ying; Liu Zeng-Rong; Zhang Jian-Bao

    2007-01-01

    Network motifs hold a very important status in genetic regulatory networks. This paper aims to analyse the dynamical property of the network motifs in genetic regulatory networks. The main result we obtained is that the dynamical property of a single motif is very simple with only an asymptotically stable equilibrium point, but the combination of several motifs can make more complicated dynamical properties emerge such as limit cycles. The above-mentioned result shows that network motif is a stable substructure in genetic regulatory networks while their combinations make the genetic regulatory network more complicated.

  18. The EH1 motif in metazoan transcription factors

    Directory of Open Access Journals (Sweden)

    Copley Richard R

    2005-11-01

    Full Text Available Abstract Background The Engrailed Homology 1 (EH1 motif is a small region, believed to have evolved convergently in homeobox and forkhead containing proteins, that interacts with the Drosophila protein groucho (C. elegans unc-37, Human Transducin-like Enhancers of Split. The small size of the motif makes its reliable identification by computational means difficult. I have systematically searched the predicted proteomes of Drosophila, C. elegans and human for further instances of the motif. Results Using motif identification methods and database searching techniques, I delimit which homeobox and forkhead domain containing proteins also have likely EH1 motifs. I show that despite low database search scores, there is a significant association of the motif with transcription factor function. I further show that likely EH1 motifs are found in combination with T-Box, Zinc Finger and Doublesex domains as well as discussing other plausible candidate associations. I identify strong candidate EH1 motifs in basal metazoan phyla. Conclusion Candidate EH1 motifs exist in combination with a variety of transcription factor domains, suggesting that these proteins have repressor functions. The distribution of the EH1 motif is suggestive of convergent evolution, although in many cases, the motif has been conserved throughout bilaterian orthologs. Groucho mediated repression was established prior to the evolution of bilateria.

  19. CLIMP: Clustering Motifs via Maximal Cliques with Parallel Computing Design.

    Science.gov (United States)

    Zhang, Shaoqiang; Chen, Yong

    2016-01-01

    A set of conserved binding sites recognized by a transcription factor is called a motif, which can be found by many applications of comparative genomics for identifying over-represented segments. Moreover, when numerous putative motifs are predicted from a collection of genome-wide data, their similarity data can be represented as a large graph, where these motifs are connected to one another. However, an efficient clustering algorithm is desired for clustering the motifs that belong to the same groups and separating the motifs that belong to different groups, or even deleting an amount of spurious ones. In this work, a new motif clustering algorithm, CLIMP, is proposed by using maximal cliques and sped up by parallelizing its program. When a synthetic motif dataset from the database JASPAR, a set of putative motifs from a phylogenetic foot-printing dataset, and a set of putative motifs from a ChIP dataset are used to compare the performances of CLIMP and two other high-performance algorithms, the results demonstrate that CLIMP mostly outperforms the two algorithms on the three datasets for motif clustering, so that it can be a useful complement of the clustering procedures in some genome-wide motif prediction pipelines. CLIMP is available at http://sqzhang.cn/climp.html. PMID:27487245

  20. CLIMP: Clustering Motifs via Maximal Cliques with Parallel Computing Design.

    Science.gov (United States)

    Zhang, Shaoqiang; Chen, Yong

    2016-01-01

    A set of conserved binding sites recognized by a transcription factor is called a motif, which can be found by many applications of comparative genomics for identifying over-represented segments. Moreover, when numerous putative motifs are predicted from a collection of genome-wide data, their similarity data can be represented as a large graph, where these motifs are connected to one another. However, an efficient clustering algorithm is desired for clustering the motifs that belong to the same groups and separating the motifs that belong to different groups, or even deleting an amount of spurious ones. In this work, a new motif clustering algorithm, CLIMP, is proposed by using maximal cliques and sped up by parallelizing its program. When a synthetic motif dataset from the database JASPAR, a set of putative motifs from a phylogenetic foot-printing dataset, and a set of putative motifs from a ChIP dataset are used to compare the performances of CLIMP and two other high-performance algorithms, the results demonstrate that CLIMP mostly outperforms the two algorithms on the three datasets for motif clustering, so that it can be a useful complement of the clustering procedures in some genome-wide motif prediction pipelines. CLIMP is available at http://sqzhang.cn/climp.html.

  1. No tradeoff between versatility and robustness in gene circuit motifs

    Science.gov (United States)

    Payne, Joshua L.

    2016-05-01

    Circuit motifs are small directed subgraphs that appear in real-world networks significantly more often than in randomized networks. In the Boolean model of gene circuits, most motifs are realized by multiple circuit genotypes. Each of a motif's constituent circuit genotypes may have one or more functions, which are embodied in the expression patterns the circuit forms in response to specific initial conditions. Recent enumeration of a space of nearly 17 million three-gene circuit genotypes revealed that all circuit motifs have more than one function, with the number of functions per motif ranging from 12 to nearly 30,000. This indicates that some motifs are more functionally versatile than others. However, the individual circuit genotypes that constitute each motif are less robust to mutation if they have many functions, hinting that functionally versatile motifs may be less robust to mutation than motifs with few functions. Here, I explore the relationship between versatility and robustness in circuit motifs, demonstrating that functionally versatile motifs are robust to mutation despite the inherent tradeoff between versatility and robustness at the level of an individual circuit genotype.

  2. AISMOTIF-An Artificial Immune System for DNA Motif Discovery

    Directory of Open Access Journals (Sweden)

    Seeja K R

    2011-03-01

    Full Text Available Discovery of transcription factor binding sites is a much explored and still exploring area of research in functional genomics. Many computational tools have been developed for finding motifs and each of them has their own advantages as well as disadvantages. Most of these algorithms need prior knowledge about the data to construct background models. However there is not a single technique that can be considered as best for finding regulatory motifs. This paper proposes an artificial immune system based algorithm for finding the transcription factor binding sites or motifs and two new weighted scores for motif evaluation. The algorithm is enumerative, but sufficient pruning of the pattern search space has been incorporated using immune system concepts. The performance of AISMOTIF has been evaluated by comparing it with eight state of art composite motif discovery algorithms and found that AISMOTIF predicts known motifs as well as new motifs from the benchmark dataset without any prior knowledge about the data.

  3. AISMOTIF-An Artificial Immune System for DNA Motif Discovery

    CERN Document Server

    Seeja, K R

    2011-01-01

    Discovery of transcription factor binding sites is a much explored and still exploring area of research in functional genomics. Many computational tools have been developed for finding motifs and each of them has their own advantages as well as disadvantages. Most of these algorithms need prior knowledge about the data to construct background models. However there is not a single technique that can be considered as best for finding regulatory motifs. This paper proposes an artificial immune system based algorithm for finding the transcription factor binding sites or motifs and two new weighted scores for motif evaluation. The algorithm is enumerative, but sufficient pruning of the pattern search space has been incorporated using immune system concepts. The performance of AISMOTIF has been evaluated by comparing it with eight state of art composite motif discovery algorithms and found that AISMOTIF predicts known motifs as well as new motifs from the benchmark dataset without any prior knowledge about the data...

  4. Chaotic motif sampler: detecting motifs from biological sequences by using chaotic neurodynamics

    Science.gov (United States)

    Matsuura, Takafumi; Ikeguchi, Tohru

    Identification of a region in biological sequences, motif extraction problem (MEP) is solved in bioinformatics. However, the MEP is an NP-hard problem. Therefore, it is almost impossible to obtain an optimal solution within a reasonable time frame. To find near optimal solutions for NP-hard combinatorial optimization problems such as traveling salesman problems, quadratic assignment problems, and vehicle routing problems, chaotic search, which is one of the deterministic approaches, has been proposed and exhibits better performance than stochastic approaches. In this paper, we propose a new alignment method that employs chaotic dynamics to solve the MEPs. It is called the Chaotic Motif Sampler. We show that the performance of the Chaotic Motif Sampler is considerably better than that of the conventional methods such as the Gibbs Site Sampler and the Neighborhood Optimization for Multiple Alignment Discovery.

  5. Assessing the Exceptionality of Coloured Motifs in Networks

    Directory of Open Access Journals (Sweden)

    Lacroix Vincent

    2009-01-01

    Full Text Available Various methods have been recently employed to characterise the structure of biological networks. In particular, the concept of network motif and the related one of coloured motif have proven useful to model the notion of a functional/evolutionary building block. However, algorithms that enumerate all the motifs of a network may produce a very large output, and methods to decide which motifs should be selected for downstream analysis are needed. A widely used method is to assess if the motif is exceptional, that is, over- or under-represented with respect to a null hypothesis. Much effort has been put in the last thirty years to derive -values for the frequencies of topological motifs, that is, fixed subgraphs. They rely either on (compound Poisson and Gaussian approximations for the motif count distribution in Erdös-Rényi random graphs or on simulations in other models. We focus on a different definition of graph motifs that corresponds to coloured motifs. A coloured motif is a connected subgraph with fixed vertex colours but unspecified topology. Our work is the first analytical attempt to assess the exceptionality of coloured motifs in networks without any simulation. We first establish analytical formulae for the mean and the variance of the count of a coloured motif in an Erdös-Rényi random graph model. Using simulations under this model, we further show that a Pólya-Aeppli distribution better approximates the distribution of the motif count compared to Gaussian or Poisson distributions. The Pólya-Aeppli distribution, and more generally the compound Poisson distributions, are indeed well designed to model counts of clumping events. Altogether, these results enable to derive a -value for a coloured motif, without spending time on simulations.

  6. Acidic/IQ Motif Regulator of Calmodulin*

    OpenAIRE

    Putkey, John A.; Waxham, M. Neal; Gaertner, Tara R.; Brewer, Kari J.; Goldsmith, Michael; Kubota, Yoshihisa; Kleerekoper, Quinn K.

    2007-01-01

    The small IQ motif proteins PEP-19 (62 amino acids) and RC3 (78 amino acids) greatly accelerate the rates of Ca2+ binding to sites III and IV in the C-domain of calmodulin (CaM). We show here that PEP-19 decreases the degree of cooperativity of Ca2+ binding to sites III and IV, and we present a model showing that this could increase Ca2+ binding rate constants. Comparative sequence analysis showed that residues 28 to 58 from PEP-19 are conserved in other proteins. This region includes the IQ ...

  7. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  8. A combinatorial optimization approach for diverse motif finding applications

    Directory of Open Access Journals (Sweden)

    Singh Mona

    2006-08-01

    Full Text Available Abstract Background Discovering approximately repeated patterns, or motifs, in biological sequences is an important and widely-studied problem in computational molecular biology. Most frequently, motif finding applications arise when identifying shared regulatory signals within DNA sequences or shared functional and structural elements within protein sequences. Due to the diversity of contexts in which motif finding is applied, several variations of the problem are commonly studied. Results We introduce a versatile combinatorial optimization framework for motif finding that couples graph pruning techniques with a novel integer linear programming formulation. Our approach is flexible and robust enough to model several variants of the motif finding problem, including those incorporating substitution matrices and phylogenetic distances. Additionally, we give an approach for determining statistical significance of uncovered motifs. In testing on numerous DNA and protein datasets, we demonstrate that our approach typically identifies statistically significant motifs corresponding to either known motifs or other motifs of high conservation. Moreover, in most cases, our approach finds provably optimal solutions to the underlying optimization problem. Conclusion Our results demonstrate that a combined graph theoretic and mathematical programming approach can be the basis for effective and powerful techniques for diverse motif finding applications.

  9. Protein functional-group 3D motif and its applications

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Representing and recognizing protein active sites sequence motif (1D motif) and structural motif (3D motif) is an important topic for predicting and designing protein function. Prevalent methods for extracting and searching 3D motif always consider residue as the minimal unit, which have limited sensitivity. Here we present a new spatial representation of protein active sites, called "functional-group 3D motif ", based on the fact that the functional groups inside a residue contribute mostly to its function. Relevant algorithm and computer program are developed, which could be widely used in the function prediction and the study of structural-function relationship of proteins. As a test, we defined a functional-group 3D motif of the catalytic triad and oxyanion hole with the structure of porcine trypsin (PDB code: 1mct) as the template. With our motif-searching program, we successfully found similar sub-structures in trypsins, subtilisins and a/b hydrolases, which show distinct folds but share similar catalytic mechanism. Moreover, this motif can be used to elucidate the structural basis of other proteins with variant catalytic triads by comparing it to those proteins. Finally, we scanned this motif against a non-redundant protein structure database to find its matches, and the results demonstrated the potential application of functional group 3D motif in function prediction. Above all, compared with the other 3D-motif representations on residues, the functional group 3D motif achieves better representation of protein active region, which is more sensitive for protein function prediction.

  10. The network motif architecture of dominance hierarchies.

    Science.gov (United States)

    Shizuka, Daizaburo; McDonald, David B

    2015-04-01

    The widespread existence of dominance hierarchies has been a central puzzle in social evolution, yet we lack a framework for synthesizing the vast empirical data on hierarchy structure in animal groups. We applied network motif analysis to compare the structures of dominance networks from data published over the past 80 years. Overall patterns of dominance relations, including some aspects of non-interactions, were strikingly similar across disparate group types. For example, nearly all groups exhibited high frequencies of transitive triads, whereas cycles were very rare. Moreover, pass-along triads were rare, and double-dominant triads were common in most groups. These patterns did not vary in any systematic way across taxa, study settings (captive or wild) or group size. Two factors significantly affected network motif structure: the proportion of dyads that were observed to interact and the interaction rates of the top-ranked individuals. Thus, study design (i.e. how many interactions were observed) and the behaviour of key individuals in the group could explain much of the variations we see in social hierarchies across animals. Our findings confirm the ubiquity of dominance hierarchies across all animal systems, and demonstrate that network analysis provides new avenues for comparative analyses of social hierarchies. PMID:25762649

  11. The network motif architecture of dominance hierarchies.

    Science.gov (United States)

    Shizuka, Daizaburo; McDonald, David B

    2015-04-01

    The widespread existence of dominance hierarchies has been a central puzzle in social evolution, yet we lack a framework for synthesizing the vast empirical data on hierarchy structure in animal groups. We applied network motif analysis to compare the structures of dominance networks from data published over the past 80 years. Overall patterns of dominance relations, including some aspects of non-interactions, were strikingly similar across disparate group types. For example, nearly all groups exhibited high frequencies of transitive triads, whereas cycles were very rare. Moreover, pass-along triads were rare, and double-dominant triads were common in most groups. These patterns did not vary in any systematic way across taxa, study settings (captive or wild) or group size. Two factors significantly affected network motif structure: the proportion of dyads that were observed to interact and the interaction rates of the top-ranked individuals. Thus, study design (i.e. how many interactions were observed) and the behaviour of key individuals in the group could explain much of the variations we see in social hierarchies across animals. Our findings confirm the ubiquity of dominance hierarchies across all animal systems, and demonstrate that network analysis provides new avenues for comparative analyses of social hierarchies.

  12. A Gibbs sampler for motif detection in phylogenetically close sequences

    Science.gov (United States)

    Siddharthan, Rahul; van Nimwegen, Erik; Siggia, Eric

    2004-03-01

    Genes are regulated by transcription factors that bind to DNA upstream of genes and recognize short conserved ``motifs'' in a random intergenic ``background''. Motif-finders such as the Gibbs sampler compare the probability of these short sequences being represented by ``weight matrices'' to the probability of their arising from the background ``null model'', and explore this space (analogous to a free-energy landscape). But closely related species may show conservation not because of functional sites but simply because they have not had sufficient time to diverge, so conventional methods will fail. We introduce a new Gibbs sampler algorithm that accounts for common ancestry when searching for motifs, while requiring minimal ``prior'' assumptions on the number and types of motifs, assessing the significance of detected motifs by ``tracking'' clusters that stay together. We apply this scheme to motif detection in sporulation-cycle genes in the yeast S. cerevisiae, using recent sequences of other closely-related Saccharomyces species.

  13. MADMX: A Novel Strategy for Maximal Dense Motif Extraction

    CERN Document Server

    Grossi, Roberto; Pisanti, Nadia; Pucci, Geppino; Upfal, Eli; Vandin, Fabio

    2010-01-01

    We develop, analyze and experiment with a new tool, called MADMX, which extracts frequent motifs, possibly including don't care characters, from biological sequences. We introduce density, a simple and flexible measure for bounding the number of don't cares in a motif, defined as the ratio of solid (i.e., different from don't care) characters to the total length of the motif. By extracting only maximal dense motifs, MADMX reduces the output size and improves performance, while enhancing the quality of the discoveries. The efficiency of our approach relies on a newly defined combining operation, dubbed fusion, which allows for the construction of maximal dense motifs in a bottom-up fashion, while avoiding the generation of nonmaximal ones. We provide experimental evidence of the efficiency and the quality of the motifs returned by MADMX

  14. Triadic motifs in the dependence networks of virtual societies

    CERN Document Server

    Xie, Wen-Jie; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-01-01

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (${\\rm{M}}_9$) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks...

  15. An Affinity Propagation-Based DNA Motif Discovery Algorithm.

    Science.gov (United States)

    Sun, Chunxiao; Huo, Hongwei; Yu, Qiang; Guo, Haitao; Sun, Zhigang

    2015-01-01

    The planted (l, d) motif search (PMS) is one of the fundamental problems in bioinformatics, which plays an important role in locating transcription factor binding sites (TFBSs) in DNA sequences. Nowadays, identifying weak motifs and reducing the effect of local optimum are still important but challenging tasks for motif discovery. To solve the tasks, we propose a new algorithm, APMotif, which first applies the Affinity Propagation (AP) clustering in DNA sequences to produce informative and good candidate motifs and then employs Expectation Maximization (EM) refinement to obtain the optimal motifs from the candidate motifs. Experimental results both on simulated data sets and real biological data sets show that APMotif usually outperforms four other widely used algorithms in terms of high prediction accuracy.

  16. An Affinity Propagation-Based DNA Motif Discovery Algorithm

    Directory of Open Access Journals (Sweden)

    Chunxiao Sun

    2015-01-01

    Full Text Available The planted (l,d motif search (PMS is one of the fundamental problems in bioinformatics, which plays an important role in locating transcription factor binding sites (TFBSs in DNA sequences. Nowadays, identifying weak motifs and reducing the effect of local optimum are still important but challenging tasks for motif discovery. To solve the tasks, we propose a new algorithm, APMotif, which first applies the Affinity Propagation (AP clustering in DNA sequences to produce informative and good candidate motifs and then employs Expectation Maximization (EM refinement to obtain the optimal motifs from the candidate motifs. Experimental results both on simulated data sets and real biological data sets show that APMotif usually outperforms four other widely used algorithms in terms of high prediction accuracy.

  17. Probabilistic models for semisupervised discriminative motif discovery in DNA sequences.

    Science.gov (United States)

    Kim, Jong Kyoung; Choi, Seungjin

    2011-01-01

    Methods for discriminative motif discovery in DNA sequences identify transcription factor binding sites (TFBSs), searching only for patterns that differentiate two sets (positive and negative sets) of sequences. On one hand, discriminative methods increase the sensitivity and specificity of motif discovery, compared to generative models. On the other hand, generative models can easily exploit unlabeled sequences to better detect functional motifs when labeled training samples are limited. In this paper, we develop a hybrid generative/discriminative model which enables us to make use of unlabeled sequences in the framework of discriminative motif discovery, leading to semisupervised discriminative motif discovery. Numerical experiments on yeast ChIP-chip data for discovering DNA motifs demonstrate that the best performance is obtained between the purely-generative and the purely-discriminative and the semisupervised learning improves the performance when labeled sequences are limited.

  18. RNA motif search with data-driven element ordering

    OpenAIRE

    Rampasek, L; Jimenez, RM; Luptak, A; Vinar, T; Brejova, B

    2016-01-01

    Background In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. Results We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, wh...

  19. Detecting DNA regulatory motifs by incorporating positional trendsin information content

    Energy Technology Data Exchange (ETDEWEB)

    Kechris, Katherina J.; van Zwet, Erik; Bickel, Peter J.; Eisen,Michael B.

    2004-05-04

    On the basis of the observation that conserved positions in transcription factor binding sites are often clustered together, we propose a simple extension to the model-based motif discovery methods. We assign position-specific prior distributions to the frequency parameters of the model, penalizing deviations from a specified conservation profile. Examples with both simulated and real data show that this extension helps discover motifs as the data become noisier or when there is a competing false motif.

  20. Sequence motif discovery with computational genome-wide analysis

    OpenAIRE

    Akashi, Hirofumi; Aoki, Fumio; Toyota, Minoru; Maruyama, Reo; Sasaki, Yasushi; Mita, Hiroaki; Tokura, Hajime; Imai, Kohzoh; Tatsumi, Haruyuki

    2006-01-01

    As a result of the human genome project and advancements in DNA sequencing technology, we can utilize a huge amount of nucleotide sequence data and can search DNA sequence motifs in whole human genome. However, searching motifs with the naked eye is an enormous task and searching throughout the whole genome is absolutely impossible. Therefore, we have developed a computational genome-wide analyzing system for detecting DNA sequence motifs with biological significance. We used a multi-parallel...

  1. A Comparative Study of Bases for Motif Inference

    OpenAIRE

    Pisanti, Nadia; Crochemore, Maxime; Grossi, Roberto; Sagot, Marie-France

    2005-01-01

    International audience Motif inference is at the heart of several time-demanding computational tasks, such as in molecular biology, data mining and identification of structured motifs in sequences, and in data compression, to name a few. In this scenario, a motif is a pattern that appears repeated at least a certain number of times (the quorum), to be of interest. The pattern can be approximated in that some of its characters can be left unspecified (the don't cares). Motif inference is not ...

  2. STEME: a robust, accurate motif finder for large data sets.

    Science.gov (United States)

    Reid, John E; Wernisch, Lorenz

    2014-01-01

    Motif finding is a difficult problem that has been studied for over 20 years. Some older popular motif finders are not suitable for analysis of the large data sets generated by next-generation sequencing. We recently published an efficient approximation (STEME) to the EM algorithm that is at the core of many motif finders such as MEME. This approximation allows the EM algorithm to be applied to large data sets. In this work we describe several efficient extensions to STEME that are based on the MEME algorithm. Together with the original STEME EM approximation, these extensions make STEME a fully-fledged motif finder with similar properties to MEME. We discuss the difficulty of objectively comparing motif finders. We show that STEME performs comparably to existing prominent discriminative motif finders, DREME and Trawler, on 13 sets of transcription factor binding data in mouse ES cells. We demonstrate the ability of STEME to find long degenerate motifs which these discriminative motif finders do not find. As part of our method, we extend an earlier method due to Nagarajan et al. for the efficient calculation of motif E-values. STEME's source code is available under an open source license and STEME is available via a web interface. PMID:24625410

  3. STEME: a robust, accurate motif finder for large data sets.

    Directory of Open Access Journals (Sweden)

    John E Reid

    Full Text Available Motif finding is a difficult problem that has been studied for over 20 years. Some older popular motif finders are not suitable for analysis of the large data sets generated by next-generation sequencing. We recently published an efficient approximation (STEME to the EM algorithm that is at the core of many motif finders such as MEME. This approximation allows the EM algorithm to be applied to large data sets. In this work we describe several efficient extensions to STEME that are based on the MEME algorithm. Together with the original STEME EM approximation, these extensions make STEME a fully-fledged motif finder with similar properties to MEME. We discuss the difficulty of objectively comparing motif finders. We show that STEME performs comparably to existing prominent discriminative motif finders, DREME and Trawler, on 13 sets of transcription factor binding data in mouse ES cells. We demonstrate the ability of STEME to find long degenerate motifs which these discriminative motif finders do not find. As part of our method, we extend an earlier method due to Nagarajan et al. for the efficient calculation of motif E-values. STEME's source code is available under an open source license and STEME is available via a web interface.

  4. Modeling Network Evolution Using Graph Motifs

    CERN Document Server

    Conway, Drew

    2011-01-01

    Network structures are extremely important to the study of political science. Much of the data in its subfields are naturally represented as networks. This includes trade, diplomatic and conflict relationships. The social structure of several organization is also of interest to many researchers, such as the affiliations of legislators or the relationships among terrorist. A key aspect of studying social networks is understanding the evolutionary dynamics and the mechanism by which these structures grow and change over time. While current methods are well suited to describe static features of networks, they are less capable of specifying models of change and simulating network evolution. In the following paper I present a new method for modeling network growth and evolution. This method relies on graph motifs to generate simulated network data with particular structural characteristic. This technique departs notably from current methods both in form and function. Rather than a closed-form model, or stochastic ...

  5. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Full Text Available Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs and 'functional' (partial subgraphs. Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.

  6. A structure filter for the Eukaryotic Linear Motif Resource

    Directory of Open Access Journals (Sweden)

    Gemünd Christine

    2009-10-01

    Full Text Available Abstract Background Many proteins are highly modular, being assembled from globular domains and segments of natively disordered polypeptides. Linear motifs, short sequence modules functioning independently of protein tertiary structure, are most abundant in natively disordered polypeptides but are also found in accessible parts of globular domains, such as exposed loops. The prediction of novel occurrences of known linear motifs attempts the difficult task of distinguishing functional matches from stochastically occurring non-functional matches. Although functionality can only be confirmed experimentally, confidence in a putative motif is increased if a motif exhibits attributes associated with functional instances such as occurrence in the correct taxonomic range, cellular compartment, conservation in homologues and accessibility to interacting partners. Several tools now use these attributes to classify putative motifs based on confidence of functionality. Results Current methods assessing motif accessibility do not consider much of the information available, either predicting accessibility from primary sequence or regarding any motif occurring in a globular region as low confidence. We present a method considering accessibility and secondary structural context derived from experimentally solved protein structures to rectify this situation. Putatively functional motif occurrences are mapped onto a representative domain, given that a high quality reference SCOP domain structure is available for the protein itself or a close relative. Candidate motifs can then be scored for solvent-accessibility and secondary structure context. The scores are calibrated on a benchmark set of experimentally verified motif instances compared with a set of random matches. A combined score yields 3-fold enrichment for functional motifs assigned to high confidence classifications and 2.5-fold enrichment for random motifs assigned to low confidence classifications

  7. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole;

    2010-01-01

    In vertebrates, the onset of cellular immune reactions is controlled by presentation of peptides in complex with major histocompatibility complex (MHC) molecules to T cell receptors. In humans, MHCs are called human leukocyte antigens (HLAs). Different MHC molecules present different subsets of...... peptides, and knowledge of their binding specificities is important for understanding differences in the immune response between individuals. Algorithms predicting which peptides bind a given MHC molecule have recently been developed with high prediction accuracy. The utility of these algorithms is...... binding motif for each MHC molecule is predicted using state-of-the-art, pan-specific peptide-MHC binding-prediction methods, and is visualized as a sequence logo, in a format that allows for a comprehensive interpretation of binding motif anchor positions and amino acid preferences....

  8. SLIDER: Mining correlated motifs in protein-protein interaction networks

    NARCIS (Netherlands)

    Boyen, P.; Dijk, van A.D.J.; Ham, van R.C.H.J.; Neven, F.

    2009-01-01

    Abstract—Correlated motif mining (CMM) is the problem to find overrepresented pairs of patterns, called motif pairs, in interacting protein sequences. Algorithmic solutions for CMM thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a

  9. BlockLogo: Visualization of peptide and sequence motif conservation

    DEFF Research Database (Denmark)

    Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian;

    2013-01-01

    BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...

  10. Aztec, Incan and Mayan Motifs...Lead to Distinctive Designs.

    Science.gov (United States)

    Shields, Joanne

    2001-01-01

    Describes an art project for seventh-grade students in which they choose motifs based on Incan, Aztec, and Mayan Indian materials to incorporate into two-dimensional designs. Explains that the activity objective is to create a unified, balanced and pleasing composition using a minimum of three motifs. (CMK)

  11. The phenomenon of astral motifs on late mediaeval tombstones

    Science.gov (United States)

    Mijatović, V.; Ninković, S.; Vemić, D.

    2003-10-01

    The authors study astral motifs present on some mediaeval tombstones found in present-day Serbia and Montenegro and in the neighbouring countries (especially in Bosnia and Herzegovina). The authors discern some important astral motifs, explain them and present a short review concerning their frequency.

  12. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  13. The effect of orthology and coregulation on detecting regulatory motifs.

    Directory of Open Access Journals (Sweden)

    Valerie Storms

    Full Text Available BACKGROUND: Computational de novo discovery of transcription factor binding sites is still a challenging problem. The growing number of sequenced genomes allows integrating orthology evidence with coregulation information when searching for motifs. Moreover, the more advanced motif detection algorithms explicitly model the phylogenetic relatedness between the orthologous input sequences and thus should be well adapted towards using orthologous information. In this study, we evaluated the conditions under which complementing coregulation with orthologous information improves motif detection for the class of probabilistic motif detection algorithms with an explicit evolutionary model. METHODOLOGY: We designed datasets (real and synthetic covering different degrees of coregulation and orthologous information to test how well Phylogibbs and Phylogenetic sampler, as representatives of the motif detection algorithms with evolutionary model performed as compared to MEME, a more classical motif detection algorithm that treats orthologs independently. RESULTS AND CONCLUSIONS: Under certain conditions detecting motifs in the combined coregulation-orthology space is indeed more efficient than using each space separately, but this is not always the case. Moreover, the difference in success rate between the advanced algorithms and MEME is still marginal. The success rate of motif detection depends on the complex interplay between the added information and the specificities of the applied algorithms. Insights in this relation provide information useful to both developers and users. All benchmark datasets are available at http://homes.esat.kuleuven.be/~kmarchal/Supplementary_Storms_Valerie_PlosONE.

  14. MotifCombinator: a web-based tool to search for combinations of cis-regulatory motifs

    Directory of Open Access Journals (Sweden)

    Tsunoda Tatsuhiko

    2007-03-01

    Full Text Available Abstract Background A combination of multiple types of transcription factors and cis-regulatory elements is often required for gene expression in eukaryotes, and the combinatorial regulation confers specific gene expression to tissues or environments. To reveal the combinatorial regulation, computational methods are developed that efficiently infer combinations of cis-regulatory motifs that are important for gene expression as measured by DNA microarrays. One promising type of computational method is to utilize regression analysis between expression levels and scores of motifs in input sequences. This type takes full advantage of information on expression levels because it does not require that the expression level of each gene be dichotomized according to whether or not it reaches a certain threshold level. However, there is no web-based tool that employs regression methods to systematically search for motif combinations and that practically handles combinations of more than two or three motifs. Results We here introduced MotifCombinator, an online tool with a user-friendly interface, to systematically search for combinations composed of any number of motifs based on regression methods. The tool utilizes well-known regression methods (the multivariate linear regression, the multivariate adaptive regression spline or MARS, and the multivariate logistic regression method for this purpose, and uses the genetic algorithm to search for combinations composed of any desired number of motifs. The visualization systems in this tool help users to intuitively grasp the process of the combination search, and the backup system allows users to easily stop and restart calculations that are expected to require large computational time. This tool also provides preparatory steps needed for systematic combination search – i.e., selecting single motifs to constitute combinations and cutting out redundant similar motifs based on clustering analysis. Conclusion

  15. An algorithm for motif-based network design

    CERN Document Server

    Mäki-Marttunen, Tuomo

    2016-01-01

    A determinant property of the structure of a biological network is the distribution of local connectivity patterns, i.e., network motifs. In this work, a method for creating directed, unweighted networks while promoting a certain combination of motifs is presented. This motif-based network algorithm starts with an empty graph and randomly connects the nodes by advancing or discouraging the formation of chosen motifs. The in- or out-degree distribution of the generated networks can be explicitly chosen. The algorithm is shown to perform well in producing networks with high occurrences of the targeted motifs, both ones consisting of 3 nodes as well as ones consisting of 4 nodes. Moreover, the algorithm can also be tuned to bring about global network characteristics found in many natural networks, such as small-worldness and modularity.

  16. Dynamic Motifs of Strategies in Prisoner's Dilemma Games

    CERN Document Server

    Kim, Young Jin; Jeong, Seon-Young; Son, Seung-Woo

    2014-01-01

    We investigate the win-lose relations between strategies of iterated prisoner's dilemma games by using a directed network concept to display the replicator dynamics results. In the giant strongly-connected component of the win/lose network, we find win-lose circulations similar to rock-paper-scissors and analyze the fixed point and its stability. Applying the network motif concept, we introduce dynamic motifs, which describe the population dynamics relations among the three strategies. Through exact enumeration, we find 22 dynamic motifs and display their phase portraits. Visualization using directed networks and motif analysis is a useful method to make complex dynamic behavior simple in order to understand it more intuitively. Dynamic motifs can be building blocks for dynamic behavior among strategies when they are applied to other types of games.

  17. Discovering multiple realistic TFBS motifs based on a generalized model

    Directory of Open Access Journals (Sweden)

    Leung Kwong-Sak

    2009-10-01

    Full Text Available Abstract Background Identification of transcription factor binding sites (TFBSs is a central problem in Bioinformatics on gene regulation. de novo motif discovery serves as a promising way to predict and better understand TFBSs for biological verifications. Real TFBSs of a motif may vary in their widths and their conservation degrees within a certain range. Deciding a single motif width by existing models may be biased and misleading. Additionally, multiple, possibly overlapping, candidate motifs are desired and necessary for biological verification in practice. However, current techniques either prohibit overlapping TFBSs or lack explicit control of different motifs. Results We propose a new generalized model to tackle the motif widths by considering and evaluating a width range of interest simultaneously, which should better address the width uncertainty. Moreover, a meta-convergence framework for genetic algorithms (GAs, is proposed to provide multiple overlapping optimal motifs simultaneously in an effective and flexible way. Users can easily specify the difference amongst expected motif kinds via similarity test. Incorporating Genetic Algorithm with Local Filtering (GALF for searching, the new GALF-G (G for generalized algorithm is proposed based on the generalized model and meta-convergence framework. Conclusion GALF-G was tested extensively on over 970 synthetic, real and benchmark datasets, and is usually better than the state-of-the-art methods. The range model shows an increase in sensitivity compared with the single-width ones, while providing competitive precisions on the E. coli benchmark. Effectiveness can be maintained even using a very small population, exhibiting very competitive efficiency. In discovering multiple overlapping motifs in a real liver-specific dataset, GALF-G outperforms MEME by up to 73% in overall F-scores. GALF-G also helps to discover an additional motif which has probably not been annotated in the dataset

  18. De Novo Regulatory Motif Discovery Identifies Significant Motifs in Promoters of Five Classes of Plant Dehydrin Genes.

    Science.gov (United States)

    Zolotarov, Yevgen; Strömvik, Martina

    2015-01-01

    Plants accumulate dehydrins in response to osmotic stresses. Dehydrins are divided into five different classes, which are thought to be regulated in different manners. To better understand differences in transcriptional regulation of the five dehydrin classes, de novo motif discovery was performed on 350 dehydrin promoter sequences from a total of 51 plant genomes. Overrepresented motifs were identified in the promoters of five dehydrin classes. The Kn dehydrin promoters contain motifs linked with meristem specific expression, as well as motifs linked with cold/dehydration and abscisic acid response. KS dehydrin promoters contain a motif with a GATA core. SKn and YnSKn dehydrin promoters contain motifs that match elements connected with cold/dehydration, abscisic acid and light response. YnKn dehydrin promoters contain motifs that match abscisic acid and light response elements, but not cold/dehydration response elements. Conserved promoter motifs are present in the dehydrin classes and across different plant lineages, indicating that dehydrin gene regulation is likely also conserved.

  19. Computational analyses of synergism in small molecular network motifs.

    Directory of Open Access Journals (Sweden)

    Yili Zhang

    2014-03-01

    Full Text Available Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically to alter the responses of the motifs to stimuli. Synergism (or antagonism was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.

  20. Triadic motifs in the dependence networks of virtual societies

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-01

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  1. Profile-based short linear protein motif discovery

    Directory of Open Access Journals (Sweden)

    Haslam Niall J

    2012-05-01

    Full Text Available Abstract Background Short linear protein motifs are attracting increasing attention as functionally independent sites, typically 3–10 amino acids in length that are enriched in disordered regions of proteins. Multiple methods have recently been proposed to discover over-represented motifs within a set of proteins based on simple regular expressions. Here, we extend these approaches to profile-based methods, which provide a richer motif representation. Results The profile motif discovery method MEME performed relatively poorly for motifs in disordered regions of proteins. However, when we applied evolutionary weighting to account for redundancy amongst homologous proteins, and masked out poorly conserved regions of disordered proteins, the performance of MEME is equivalent to that of regular expression methods. However, the two approaches returned different subsets within both a benchmark dataset, and a more realistic discovery dataset. Conclusions Profile-based motif discovery methods complement regular expression based methods. Whilst profile-based methods are computationally more intensive, they are likely to discover motifs currently overlooked by regular expression methods.

  2. A speedup technique for (l, d-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS, (l, d-motif search (or Planted Motif Search (PMS, and Edit-distance-based Motif Search (EMS. In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm identifies the motifs always and an approximate algorithm may fail to identify some or all of the motifs. The exact version of PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speedup PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very

  3. Evaluating deterministic motif significance measures in protein databases

    Directory of Open Access Journals (Sweden)

    Azevedo Paulo J

    2007-12-01

    Full Text Available Abstract Background Assessing the outcome of motif mining algorithms is an essential task, as the number of reported motifs can be very large. Significance measures play a central role in automatically ranking those motifs, and therefore alleviating the analysis work. Spotting the most interesting and relevant motifs is then dependent on the choice of the right measures. The combined use of several measures may provide more robust results. However caution has to be taken in order to avoid spurious evaluations. Results From the set of conducted experiments, it was verified that several of the selected significance measures show a very similar behavior in a wide range of situations therefore providing redundant information. Some measures have proved to be more appropriate to rank highly conserved motifs, while others are more appropriate for weakly conserved ones. Support appears as a very important feature to be considered for correct motif ranking. We observed that not all the measures are suitable for situations with poorly balanced class information, like for instance, when positive data is significantly less than negative data. Finally, a visualization scheme was proposed that, when several measures are applied, enables an easy identification of high scoring motifs. Conclusion In this work we have surveyed and categorized 14 significance measures for pattern evaluation. Their ability to rank three types of deterministic motifs was evaluated. Measures were applied in different testing conditions, where relations were identified. This study provides some pertinent insights on the choice of the right set of significance measures for the evaluation of deterministic motifs extracted from protein databases.

  4. Identification of protein superfamily from structure- based sequence motif

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    The structure-based sequence motif of the distant proteins in evolution, protein tyrosine phosphatases (PTP) Ⅰ and Ⅱ superfamilies, as an example, has been defined by the structural comparison, structure-based sequence alignment and analyses on substitution patterns of residues in common sequence conserved regions. And the phosphatases Ⅰ and Ⅱ can be correctly identified together by the structure-based PTP sequence motif from SWISS-PROT and TrEBML databases. The results show that the correct rates of identification are over 98%. This is the first time to identify PTP Ⅰ and Ⅱ together by this motif.

  5. A comprehensive search for recombinogenic motifs in the human genome.

    Directory of Open Access Journals (Sweden)

    Henry R Johnston

    Full Text Available The patterns of male and female recombination vary greatly on a macro scale. A unique motif in each gender, triggering a double strand break at its location, much in the way Chi sites operate in E. coli, could logically explain this difference. As such, we have undertaken a comprehensive search of all small motifs in an attempt to identify one or more that match to the available data. In the end, we conclude that no such motifs appear to exist in the human genome.

  6. A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

    OpenAIRE

    Minli Xu; Zhengchang Su

    2010-01-01

    BACKGROUND: Transcription factor binding site (TFBS) motifs can be accurately represented by position frequency matrices (PFM) or other equivalent forms. We often need to compare TFBS motifs using their PFMs in order to search for similar motifs in a motif database, or cluster motifs according to their binding preference. The majority of current methods for motif comparison involve a similarity metric for column-to-column comparison and a method to find the optimal position alignment between ...

  7. ROMANIAN FOLKLORE MOTIFS IN FASHION DESIGN

    Directory of Open Access Journals (Sweden)

    MOCENCO Alexandra

    2014-05-01

    Full Text Available The traditional Romanian costume such as the entire popular art (architecture, woodcarvins, pottery etc. was born and lasted in our country since ancient times. Closely related to human existence, the traditional costume reflected over the years as reflected nowadays, the mentality and artistic conception of the people. Today the traditional Romanian costume became an inspiration source to the wholesale fashion production industry designers, both Romanian and international. Although the contemporary designers are working in accordance with a vision, using a wide area of styles, methods and current technology, they usually return to traditional techniques and ethnic folklore motifs, which converts and resize them, integrating them in their contemporary space. Adrian Oianu is a very appreciated Romanian designer who launched two collections inspired by his native’s country traditional costumes: “Suflecata pan’ la brau” (“Turned up ‘til the belt” and “Bucurie” (“Joy”. Dorin Negrau had as inspiration for his “Lost” collection the traditional costume from the Bihor region. Yves Saint Laurent had a collection inspired by the Romanian traditional flax blouses called “La blouse roumaine”. The paper presents the traditional Romanian values throw fashion collections. The research activity will create innovative concepts to support the garment industry in order to develop their own brand and to bring the design activities in Romania at an international level. The research was conducted during the initial stage of a project, financed through national founds, consisting in a documentary study on ethnographic characteristics of the popular costume from different regions of the country.

  8. Targeting functional motifs of a protein family

    Science.gov (United States)

    Bhadola, Pradeep; Deo, Nivedita

    2016-10-01

    The structural organization of a protein family is investigated by devising a method based on the random matrix theory (RMT), which uses the physiochemical properties of the amino acid with multiple sequence alignment. A graphical method to represent protein sequences using physiochemical properties is devised that gives a fast, easy, and informative way of comparing the evolutionary distances between protein sequences. A correlation matrix associated with each property is calculated, where the noise reduction and information filtering is done using RMT involving an ensemble of Wishart matrices. The analysis of the eigenvalue statistics of the correlation matrix for the β -lactamase family shows the universal features as observed in the Gaussian orthogonal ensemble (GOE). The property-based approach captures the short- as well as the long-range correlation (approximately following GOE) between the eigenvalues, whereas the previous approach (treating amino acids as characters) gives the usual short-range correlations, while the long-range correlations are the same as that of an uncorrelated series. The distribution of the eigenvector components for the eigenvalues outside the bulk (RMT bound) deviates significantly from RMT observations and contains important information about the system. The information content of each eigenvector of the correlation matrix is quantified by introducing an entropic estimate, which shows that for the β -lactamase family the smallest eigenvectors (low eigenmodes) are highly localized as well as informative. These small eigenvectors when processed gives clusters involving positions that have well-defined biological and structural importance matching with experiments. The approach is crucial for the recognition of structural motifs as shown in β -lactamase (and other families) and selectively identifies the important positions for targets to deactivate (activate) the enzymatic actions.

  9. Early illness recognition using frequent motif discovery.

    Science.gov (United States)

    Hajihashemi, Zahra; Popescu, Mihail

    2015-08-01

    Living alone in their own residence, older adults are at risk for late assessment of physical or cognitive changes due to many factors such as their impression that such changes are simply a normal part of aging or their reluctance to admit to a problem. This paper describes an early illness recognition framework using sensor network technology to identify the health trajectory of older adults reflected in patterns of day-today activities. Describing the behavior of older adults could help clinicians to identify those at the greatest risk for functional decline and adverse events. The proposed framework, denoted as Abnormal Frequent Activity Pattern (AFAP), is based on the identification of known past abnormal frequent activities in current sensor data. More specifically, AFAP declares a day abnormal when past frequent abnormal behavior patterns, not found during normal days, are discovered in the current activity data. While AFAP requires the labeling of past days as normal/abnormal, it doesn't need specific activity identification. Frequent activity patterns (FAP) are found using MEME, a bioinformatics motif detection algorithm. To validate our approach, we used data obtained from TigerPlace, an aging in place community situated in Columbia, MO, where apartments are equipped with sensor networks (motion, bed and depth sensors). A retrospective multiple case study (N=3) design was used to quantify the in-home older adult's daily routines, over a period of two weeks. Within-person variability of routine activities may be used as a new predictor in the study of health trajectories of older adults. PMID:26737096

  10. An autoinhibited conformation of LGN reveals a distinct interaction mode between GoLoco motifs and TPR motifs.

    Science.gov (United States)

    Pan, Zhu; Zhu, Jinwei; Shang, Yuan; Wei, Zhiyi; Jia, Min; Xia, Caihao; Wen, Wenyu; Wang, Wenning; Zhang, Mingjie

    2013-06-01

    LGN plays essential roles in asymmetric cell divisions via its N-terminal TPR-motif-mediated binding to mInsc and NuMA. This scaffolding activity requires the release of the autoinhibited conformation of LGN by binding of Gα(i) to its C-terminal GoLoco (GL) motifs. The interaction between the GL and TPR motifs of LGN represents a distinct GL/target binding mode with an unknown mechanism. Here, we show that two consecutive GL motifs of LGN form a minimal TPR-motif-binding unit. GL12 and GL34 bind to TPR0-3 and TPR4-7, respectively. The crystal structure of a truncated LGN reveals that GL34 forms a pair of parallel α helices and binds to the concave surface of TPR4-7, thereby preventing LGN from binding to other targets. Importantly, the GLs bind to TPR motifs with a mode distinct from that observed in the GL/Gα(i)·GDP complexes. Our results also indicate that multiple and orphan GL motif proteins likely respond to G proteins with distinct mechanisms.

  11. Local graph alignment and motif search in biological networks

    Science.gov (United States)

    Berg, Johannes; Lässig, Michael

    2004-10-01

    Interaction networks are of central importance in postgenomic molecular biology, with increasing amounts of data becoming available by high-throughput methods. Examples are gene regulatory networks or protein interaction maps. The main challenge in the analysis of these data is to read off biological functions from the topology of the network. Topological motifs, i.e., patterns occurring repeatedly at different positions in the network, have recently been identified as basic modules of molecular information processing. In this article, we discuss motifs derived from families of mutually similar but not necessarily identical patterns. We establish a statistical model for the occurrence of such motifs, from which we derive a scoring function for their statistical significance. Based on this scoring function, we develop a search algorithm for topological motifs called graph alignment, a procedure with some analogies to sequence alignment. The algorithm is applied to the gene regulation network of Escherichia coli.

  12. Review article: The mountain motif in the plot of Matthew

    Directory of Open Access Journals (Sweden)

    Gert J. Volschenk

    2010-02-01

    Full Text Available This article reviewed T.L. Donaldson’s book, Jesus on the mountain: A study in Matthean theology, published in 1985 by JSOT Press, Sheffield, and focused on the mountain motif in the structure and plot of the Gospel of Matthew, in addition to the work of Donaldson on the mountain motif as a literary motif and as theological symbol. The mountain is a primary theological setting for Jesus’ ministry and thus is an important setting, serving as one of the literary devices by which Matthew structured and progressed his narrative. The Zion theological and eschatological significance and Second Temple Judaism serve as the historical and theological background for the mountain motif. The last mountain setting (Mt 28:16–20 is the culmination of the three theological themes in the plot of Matthew, namely Christology, ecclesiology and salvation history.

  13. Automatic Network Fingerprinting through Single-Node Motifs

    CERN Document Server

    Echtermeyer, Christoph; Rodrigues, Francisco A; Kaiser, Marcus; 10.1371/journal.pone.0015765

    2011-01-01

    Complex networks have been characterised by their specific connectivity patterns (network motifs), but their building blocks can also be identified and described by node-motifs---a combination of local network features. One technique to identify single node-motifs has been presented by Costa et al. (L. D. F. Costa, F. A. Rodrigues, C. C. Hilgetag, and M. Kaiser, Europhys. Lett., 87, 1, 2009). Here, we first suggest improvements to the method including how its parameters can be determined automatically. Such automatic routines make high-throughput studies of many networks feasible. Second, the new routines are validated in different network-series. Third, we provide an example of how the method can be used to analyse network time-series. In conclusion, we provide a robust method for systematically discovering and classifying characteristic nodes of a network. In contrast to classical motif analysis, our approach can identify individual components (here: nodes) that are specific to a network. Such special nodes...

  14. Robust and Adaptive MicroRNA-Mediated Incoherent Feedforward Motifs

    Science.gov (United States)

    Xu, Feng-Dan; Liu, Zeng-Rong; Zhang, Zhi-Yong; Shen, Jian-Wei

    2009-02-01

    We integrate transcriptional and post-transcriptional regulation into microRNA-mediated incoherent feedforward motifs and analyse their dynamical behaviour and functions. The analysis show that the behaviour of the system is almost uninfluenced by the varying input in certain ranges and by introducing of delay and noise. The results indicate that microRNA-mediated incoherent feedforward motifs greatly enhance the robustness of gene regulation.

  15. Robust and Adaptive MicroRNA-Mediated Incoherent Feedforward Motifs

    Institute of Scientific and Technical Information of China (English)

    XU Feng-Dan; LIU Zeng-Rong; ZHANG Zhi-Yong; SHEN Jian-Wei

    2009-01-01

    We integrate transcriptional and post-transcriptional regulation into microRNA-mediated incoherent feedforward motifs and analyse their dynamical behaviour and functions. The analysis show that the behaviour of the system is almost uninfluenced by the varying input in certain ranges and by introducing of delay and noise. The results indicate that microRNA-mediated incoherent feedforward motifs greatly enhance the robustness of gene regulation.

  16. Mining Tertiary Structural Motifs for Assessment of Designability

    OpenAIRE

    Zhang, Jian; Grigoryan, Gevorg

    2013-01-01

    The observation of a limited secondary-structural alphabet in native proteins, with significant sequence preferences, has profoundly influenced the fields of protein design and structure prediction (Simons et al., 1997; Verschueren et al., 2011). In the era of structural genomics, as the size of the structural dataset continues to grow rapidly, it is becoming possible to extend this analysis to tertiary structural motifs and their sequences. For a hypothetical tertiary motif, the rate of its ...

  17. Temporal Analysis of Motif Mixtures using Dirichlet Processes

    OpenAIRE

    Emonet, Rémi; Varadarajan, J.; Odobez, Jean-Marc

    2014-01-01

    International audience In this paper, we present a new model for unsupervised discovery of recurrent temporal patterns (or motifs) in time series (or documents). The model is designed to handle the difficult case of multivariate time series obtained from a mixture of activities, that is, our observations are caused by the superposition of multiple phenomena occurring concurrently and with no synchronization. The model uses nonparametric Bayesian methods to describe both the motifs and thei...

  18. Triplex-induced recombination and repair in the pyrimidine motif

    OpenAIRE

    Kalish, Jennifer M.; Seidman, Michael M.; Weeks, Daniel L.; Glazer, Peter M.

    2005-01-01

    Triplex-forming oligonucleotides (TFOs) bind DNA in a sequence-specific manner at polypurine/polypyrimidine sites and mediate targeted genome modification. Triplexes are formed by either pyrimidine TFOs, which bind parallel to the purine strand of the duplex (pyrimidine, parallel motif), or purine TFOs, which bind in an anti-parallel orientation (purine, anti-parallel motif). Both purine and pyrimidine TFOs, when linked to psoralen, have been shown to direct psoralen adduct formation in cells...

  19. Cross-Disciplinary Detection and Analysis of Network Motifs

    OpenAIRE

    Ngoc Tam L. Tran; Luke DeLuccia; McDonald, Aidan F; Chun-Hsi Huang

    2015-01-01

    The detection of network motifs has recently become an important part of network analysis across all disciplines. In this work, we detected and analyzed network motifs from undirected and directed networks of several different disciplines, including biological network, social network, ecological network, as well as other networks such as airlines, power grid, and co-purchase of political books networks. Our analysis revealed that undirected networks are similar at the basic three and four nod...

  20. The Origin of Motif Families in Food Webs

    OpenAIRE

    Klaise, Janis; Johnson, Samuel

    2016-01-01

    Food webs have been found to exhibit remarkable motif profiles, patterns in the relative prevalences of all possible three-species sub-graphs, and this has been related to ecosystem properties such as stability and robustness. Analysing 46 food webs of various kinds, we find that most food webs fall into one of two distinct motif families. The separation between the families is well predicted by a global measure of hierarchical order in directed networks - trophic coherence. We find that trop...

  1. Motif depletion in bacteriophages infecting hosts with CRISPR systems

    OpenAIRE

    Kupczok, Anne; Bollback, Jonathan P

    2014-01-01

    Background CRISPR is a microbial immune system likely to be involved in host-parasite coevolution. It functions using target sequences encoded by the bacterial genome, which interfere with invading nucleic acids using a homology-dependent system. The system also requires protospacer associated motifs (PAMs), short motifs close to the target sequence that are required for interference in CRISPR types I and II. Here, we investigate whether PAMs are depleted in phage genomes due to selection pre...

  2. Transcriptional Network growing Models using Motif-based Preferential Attachment

    Directory of Open Access Journals (Sweden)

    Ahmed Farouk Abdelzaher

    2015-10-01

    Full Text Available Understanding relationships between architectural properties of gene-regulatory networks (GRNs has been one of the major goals in systems biology and bioinformatics, as it can provide insights into, e.g., disease dynamics and drug development. Such GRNs are characterized by their scale-free degree distributions and existence of network motifs--i.e., small-node subgraphs that occur more abundantly in GRNs than expected from chance alone. Because these transcriptional modules represent ``building blocks'' of complex networks and exhibit a wide range of functional and dynamical properties, they may contribute to the remarkable robustness and dynamical stability associated with the whole of GRNs. Here we developed network-construction models to better understand this relationship, which produce randomized GRNs by using transcriptional motifs as the fundamental growth unit in contrast to other methods that construct similar networks on a node-by-node basis. Because this model produces networks with a prescribed lower bound on the number of choice transcriptional motifs (e.g., downlinks, feed-forward loops, its fidelity to the motif distributions observed in model organisms represents an improvement over existing methods, which we validated by contrasting their resultant motif and degree distributions against existing network-growth models and data from the model organism of the bacterium Escherichia coli. These models may therefore serve as novel testbeds for further elucidating relationships between the topology of transcriptional motifs and network-wide dynamical properties.

  3. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology. PMID:26886735

  4. An experimental test of a fundamental food web motif.

    Science.gov (United States)

    Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

    2010-06-01

    Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities. PMID:20129988

  5. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Full Text Available Abstract Background We consider the problem of identifying motifs, recurring or conserved patterns, in the biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1 improves search efficiency compared to existing algorithms, and (2 scales well with the size of alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA we observed reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobin families. Conclusions Our algorithm reduces computational complexity of the current motif finding algorithms and demonstrate strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  6. Discovering motifs in ranked lists of DNA sequences.

    Directory of Open Access Journals (Sweden)

    Eran Eden

    2007-03-01

    Full Text Available Computational methods for discovery of sequence elements that are enriched in a target set compared with a background set are fundamental in molecular biology research. One example is the discovery of transcription factor binding motifs that are inferred from ChIP-chip (chromatin immuno-precipitation on a microarray measurements. Several major challenges in sequence motif discovery still require consideration: (i the need for a principled approach to partitioning the data into target and background sets; (ii the lack of rigorous models and of an exact p-value for measuring motif enrichment; (iii the need for an appropriate framework for accounting for motif multiplicity; (iv the tendency, in many of the existing methods, to report presumably significant motifs even when applied to randomly generated data. In this paper we present a statistical framework for discovering enriched sequence elements in ranked lists that resolves these four issues. We demonstrate the implementation of this framework in a software application, termed DRIM (discovery of rank imbalanced motifs, which identifies sequence motifs in lists of ranked DNA sequences. We applied DRIM to ChIP-chip and CpG methylation data and obtained the following results. (i Identification of 50 novel putative transcription factor (TF binding sites in yeast ChIP-chip data. The biological function of some of them was further investigated to gain new insights on transcription regulation networks in yeast. For example, our discoveries enable the elucidation of the network of the TF ARO80. Another finding concerns a systematic TF binding enhancement to sequences containing CA repeats. (ii Discovery of novel motifs in human cancer CpG methylation data. Remarkably, most of these motifs are similar to DNA sequence elements bound by the Polycomb complex that promotes histone methylation. Our findings thus support a model in which histone methylation and CpG methylation are mechanistically linked

  7. Fitting a mixture model by expectation maximization to discover motifs in biopolymers

    Energy Technology Data Exchange (ETDEWEB)

    Bailey, T.L.; Elkan, C. [Univ. of California, La Jolla, CA (United States)

    1994-12-31

    The algorithm described in this paper discovers one or more motifs in a collection of DNA or protein sequences by using the technique of expectation maximization to fit a two-component finite mixture model to the set of sequences. Multiple motifs are found by fitting a mixture model to the data, probabilistically erasing the occurrences of the motif thus found, and repeating the process to find successive motifs. The algorithm requires only a set of unaligned sequences and a number specifying the width of the motifs as input. It returns a model of each motif and a threshold which together can be used as a Bayes-optimal classifier for searching for occurrences of the motif in other databases. The algorithm estimates how many times each motif occurs in each sequence in the dataset and outputs an alignment of the occurrences of the motif. The algorithm is capable of discovering several different motifs with differing numbers of occurrences in a single dataset.

  8. Modeling Small Noncanonical RNA Motifs with the Rosetta FARFAR Server.

    Science.gov (United States)

    Yesselman, Joseph D; Das, Rhiju

    2016-01-01

    Noncanonical RNA motifs help define the vast complexity of RNA structure and function, and in many cases, these loops and junctions are on the order of only ten nucleotides in size. Unfortunately, despite their small size, there is no reliable method to determine the ensemble of lowest energy structures of junctions and loops at atomic accuracy. This chapter outlines straightforward protocols using a webserver for Rosetta Fragment Assembly of RNA with Full Atom Refinement (FARFAR) ( http://rosie.rosettacommons.org/rna_denovo/submit ) to model the 3D structure of small noncanonical RNA motifs for use in visualizing motifs and for further refinement or filtering with experimental data such as NMR chemical shifts. PMID:27665600

  9. PMS6MC: A Multicore Algorithm for Motif Discovery

    Directory of Open Access Journals (Sweden)

    Shibdas Bandyopadhyay

    2013-11-01

    Full Text Available We develop an efficient multicore algorithm, PMS6MC, for the (l; d-motif discovery problem in which we are to find all strings of length l that appear in every string of a given set of strings with at most d mismatches. PMS6MC is based on PMS6, which is currently the fastest single-core algorithm for motif discovery in large instances. The speedup, relative to PMS6, attained by our multicore algorithm ranges from a high of 6.62 for the (17,6 challenging instances to a low of 2.75 for the (13,4 challenging instances on an Intel 6-core system. We estimate that PMS6MC is 2 to 4 times faster than other parallel algorithms for motif search on large instances.

  10. Motifs in Triadic Random Graphs based on Steiner Triple Systems

    CERN Document Server

    Winkler, Marco

    2013-01-01

    Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade the overabundance of certain sub-network patterns, so called motifs, has attracted high attention. It has been hypothesized, these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graphs (ERGMs) to define novel models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obst...

  11. How pathogens use linear motifs to perturb host cell networks

    KAUST Repository

    Via, Allegra

    2015-01-01

    Molecular mimicry is one of the powerful stratagems that pathogens employ to colonise their hosts and take advantage of host cell functions to guarantee their replication and dissemination. In particular, several viruses have evolved the ability to interact with host cell components through protein short linear motifs (SLiMs) that mimic host SLiMs, thus facilitating their internalisation and the manipulation of a wide range of cellular networks. Here we present convincing evidence from the literature that motif mimicry also represents an effective, widespread hijacking strategy in prokaryotic and eukaryotic parasites. Further insights into host motif mimicry would be of great help in the elucidation of the molecular mechanisms behind host cell invasion and the development of anti-infective therapeutic strategies.

  12. Selection against spurious promoter motifs correlates withtranslational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the {sigma}{sup 70} subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.

  13. Some results on more flexible versions of Graph Motif

    CERN Document Server

    Rizzi, Romeo

    2012-01-01

    The problems studied in this paper originate from Graph Motif, a problem introduced in 2006 in the context of biological networks. Informally speaking, it consists in deciding if a multiset of colors occurs in a connected subgraph of a vertex-colored graph. Due to the high rate of noise in the biological data, more flexible definitions of the problem have been outlined. We present in this paper two inapproximability results for two different optimization variants of Graph Motif. We also study another definition of the problem, when the connectivity constraint is replaced by modularity. While the problem stays NP-complete, it allows algorithms in FPT for biologically relevant parameterizations.

  14. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  15. How curved membranes recruit amphipathic helices and protein anchoring motifs

    DEFF Research Database (Denmark)

    Hatzakis, Nikos; Bhatia, Vikram Kjøller; Larsen, Jannik;

    2009-01-01

    Lipids and several specialized proteins are thought to be able to sense the curvature of membranes (MC). Here we used quantitative fluorescence microscopy to measure curvature-selective binding of amphipathic motifs on single liposomes 50-700 nm in diameter. Our results revealed that sensing is p...

  16. Motifs in triadic random graphs based on Steiner triple systems

    Science.gov (United States)

    Winkler, Marco; Reichardt, Jörg

    2013-08-01

    Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade, the overabundance of certain subnetwork patterns, i.e., the so-called motifs, has attracted much attention. It has been hypothesized that these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information, is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graph models (ERGMs) to define models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obstacle, we use Steiner triple systems (STSs). These are partitions of sets of nodes into pair-disjoint triads, which thus can be specified independently. Combining the concepts of ERGMs and STSs, we suggest generative models capable of generating ensembles of networks with nontrivial triadic Z-score profiles. Further, we discover inevitable correlations between the abundance of triad patterns, which occur solely for statistical reasons and need to be taken into account when discussing the functional implications of motif statistics. Moreover, we calculate the degree distributions of our triadic random graphs analytically.

  17. Themes or Motifs? Aiming for Coherence through Interdisciplinary Outlines.

    Science.gov (United States)

    Barton, Keith C.; Smith Lynne A.

    2000-01-01

    Describes how "motif-units" undermine the potential benefits of integrated thematic instruction. Suggests replacing the term "thematic unit" with the concept of "interdisciplinary outline," which focus on meaningful content, authentic activities, students' needs, teacher mediation, and a variety of resources. Shows how one fourth-grade teacher…

  18. Linear motif atlas for phosphorylation-dependent signaling

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Jensen, LJ; Diella, F;

    2008-01-01

    Systematic and quantitative analysis of protein phosphorylation is revealing dynamic regulatory networks underlying cellular responses to environmental cues. However, matching these sites to the kinases that phosphorylate them and the phosphorylation-dependent binding domains that may subsequently...... sequence models of linear motifs. The atlas is available as a community resource (http://netphorest.info)....

  19. Variable structure motifs for transcription factor binding sites

    Directory of Open Access Journals (Sweden)

    Wernisch Lorenz

    2010-01-01

    Full Text Available Abstract Background Classically, models of DNA-transcription factor binding sites (TFBSs have been based on relatively few known instances and have treated them as sites of fixed length using position weight matrices (PWMs. Various extensions to this model have been proposed, most of which take account of dependencies between the bases in the binding sites. However, some transcription factors are known to exhibit some flexibility and bind to DNA in more than one possible physical configuration. In some cases this variation is known to affect the function of binding sites. With the increasing volume of ChIP-seq data available it is now possible to investigate models that incorporate this flexibility. Previous work on variable length models has been constrained by: a focus on specific zinc finger proteins in yeast using restrictive models; a reliance on hand-crafted models for just one transcription factor at a time; and a lack of evaluation on realistically sized data sets. Results We re-analysed binding sites from the TRANSFAC database and found motivating examples where our new variable length model provides a better fit. We analysed several ChIP-seq data sets with a novel motif search algorithm and compared the results to one of the best standard PWM finders and a recently developed alternative method for finding motifs of variable structure. All the methods performed comparably in held-out cross validation tests. Known motifs of variable structure were recovered for p53, Stat5a and Stat5b. In addition our method recovered a novel generalised version of an existing PWM for Sp1 that allows for variable length binding. This motif improved classification performance. Conclusions We have presented a new gapped PWM model for variable length DNA binding sites that is not too restrictive nor over-parameterised. Our comparison with existing tools shows that on average it does not have better predictive accuracy than existing methods. However, it does

  20. Predicting conserved protein motifs with Sub-HMMs

    Directory of Open Access Journals (Sweden)

    Girke Thomas

    2010-04-01

    Full Text Available Abstract Background Profile HMMs (hidden Markov models provide effective methods for modeling the conserved regions of protein families. A limitation of the resulting domain models is the difficulty to pinpoint their much shorter functional sub-features, such as catalytically relevant sequence motifs in enzymes or ligand binding signatures of receptor proteins. Results To identify these conserved motifs efficiently, we propose a method for extracting the most information-rich regions in protein families from their profile HMMs. The method was used here to predict a comprehensive set of sub-HMMs from the Pfam domain database. Cross-validations with the PROSITE and CSA databases confirmed the efficiency of the method in predicting most of the known functionally relevant motifs and residues. At the same time, 46,768 novel conserved regions could be predicted. The data set also allowed us to link at least 461 Pfam domains of known and unknown function by their common sub-HMMs. Finally, the sub-HMM method showed very promising results as an alternative search method for identifying proteins that share only short sequence similarities. Conclusions Sub-HMMs extend the application spectrum of profile HMMs to motif discovery. Their most interesting utility is the identification of the functionally relevant residues in proteins of known and unknown function. Additionally, sub-HMMs can be used for highly localized sequence similarity searches that focus on shorter conserved features rather than entire domains or global similarities. The motif data generated by this study is a valuable knowledge resource for characterizing protein functions in the future.

  1. Composite motifs integrating multiple protein structures increase sensitivity for function prediction.

    Science.gov (United States)

    Chen, Brian Y; Bryant, Drew H; Cruess, Amanda E; Bylund, Joseph H; Fofanov, Viacheslav Y; Kristensen, David M; Kimmel, Marek; Lichtarge, Olivier; Kavraki, Lydia E

    2007-01-01

    The study of disease often hinges on the biological function of proteins, but determining protein function is a difficult experimental process. To minimize duplicated effort, algorithms for function prediction seek characteristics indicative of possible protein function. One approach is to identify substructural matches of geometric and chemical similarity between motifs representing known active sites and target protein structures with unknown function. In earlier work, statistically significant matches of certain effective motifs have identified functionally related active sites. Effective motifs must be carefully designed to maintain similarity to functionally related sites (sensitivity) and avoid incidental similarities to functionally unrelated protein geometry (specificity). Existing motif design techniques use the geometry of a single protein structure. Poor selection of this structure can limit motif effectiveness if the selected functional site lacks similarity to functionally related sites. To address this problem, this paper presents composite motifs, which combine structures of functionally related active sites to potentially increase sensitivity. Our experimentation compares the effectiveness of composite motifs with simple motifs designed from single protein structures. On six distinct families of functionally related proteins, leave-one-out testing showed that composite motifs had sensitivity comparable to the most sensitive of all simple motifs and specificity comparable to the average simple motif. On our data set, we observed that composite motifs simultaneously capture variations in active site conformation, diminish the problem of selecting motif structures, and enable the fusion of protein structures from diverse data sources. PMID:17951837

  2. FPGA implementation of motifs-based neuronal network and synchronization analysis

    Science.gov (United States)

    Deng, Bin; Zhu, Zechen; Yang, Shuangming; Wei, Xile; Wang, Jiang; Yu, Haitao

    2016-06-01

    Motifs in complex networks play a crucial role in determining the brain functions. In this paper, 13 kinds of motifs are implemented with Field Programmable Gate Array (FPGA) to investigate the relationships between the networks properties and motifs properties. We use discretization method and pipelined architecture to construct various motifs with Hindmarsh-Rose (HR) neuron as the node model. We also build a small-world network based on these motifs and conduct the synchronization analysis of motifs as well as the constructed network. We find that the synchronization properties of motif determine that of motif-based small-world network, which demonstrates effectiveness of our proposed hardware simulation platform. By imitation of some vital nuclei in the brain to generate normal discharges, our proposed FPGA-based artificial neuronal networks have the potential to replace the injured nuclei to complete the brain function in the treatment of Parkinson's disease and epilepsy.

  3. A Bioinformatics Approach for Detecting Repetitive Nested Motifs using Pattern Matching

    Science.gov (United States)

    Romero, José R.; Carballido, Jessica A.; Garbus, Ingrid; Echenique, Viviana C.; Ponzoni, Ignacio

    2016-01-01

    The identification of nested motifs in genomic sequences is a complex computational problem. The detection of these patterns is important to allow the discovery of transposable element (TE) insertions, incomplete reverse transcripts, deletions, and/or mutations. In this study, a de novo strategy for detecting patterns that represent nested motifs was designed based on exhaustive searches for pairs of motifs and combinatorial pattern analysis. These patterns can be grouped into three categories, motifs within other motifs, motifs flanked by other motifs, and motifs of large size. The methodology used in this study, applied to genomic sequences from the plant species Aegilops tauschii and Oryza sativa, revealed that it is possible to identify putative nested TEs by detecting these three types of patterns. The results were validated through BLAST alignments, which revealed the efficacy and usefulness of the new method, which is called Mamushka. PMID:27812277

  4. Functional characterization of transcription factor motifs using cross-species comparison across large evolutionary distances

    OpenAIRE

    Jaebum Kim; Ryan Cunningham; Brian James; Stefan Wyder; Gibson, Joshua D.; Oliver Niehuis; Zdobnov, Evgeny M.; Hugh M Robertson; Robinson, Gene E.; Werren, John H; Saurabh Sinha

    2010-01-01

    We address the problem of finding statistically significant associations between cis-regulatory motifs and functional gene sets, in order to understand the biological roles of transcription factors. We develop a computational framework for this task, whose features include a new statistical score for motif scanning, the use of different scores for predicting targets of different motifs, and new ways to deal with redundancies among significant motif-function associations. This framework is app...

  5. Selection against spurious promoter motifs correlates with translational efficiency across bacteria

    OpenAIRE

    Froula, Jeffrey L.; M. Pilar Francino

    2008-01-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the sigma(70) subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory s...

  6. RNAMotifScanX: a graph alignment approach for RNA structural motif identification

    OpenAIRE

    Zhong, Cuncong; Zhang, Shaojie

    2015-01-01

    RNA structural motifs are recurrent three-dimensional (3D) components found in the RNA architecture. These RNA structural motifs play important structural or functional roles and usually exhibit highly conserved 3D geometries and base-interaction patterns. Analysis of the RNA 3D structures and elucidation of their molecular functions heavily rely on efficient and accurate identification of these motifs. However, efficient RNA structural motif search tools are lacking due to the high complexit...

  7. SPIC: A novel similarity metric for comparing transcription factor binding site motifs based on information contents

    OpenAIRE

    Zhang, Shaoqiang; Zhou, Xiguo; Du, Chuanbin; Su, Zhengchang

    2013-01-01

    Background Discovering transcription factor binding sites (TFBS) is one of primary challenges to decipher complex gene regulatory networks encrypted in a genome. A set of short DNA sequences identified by a transcription factor (TF) is known as a motif, which can be expressed accurately in matrix form such as a position-specific scoring matrix (PSSM) and a position frequency matrix. Very frequently, we need to query a motif in a database of motifs by seeking its similar motifs, merge similar ...

  8. A new motif for inhibitors of geranylgeranyl diphosphate synthase.

    Science.gov (United States)

    Foust, Benjamin J; Allen, Cheryl; Holstein, Sarah A; Wiemer, David F

    2016-08-15

    The enzyme geranylgeranyl diphosphate synthase (GGDPS) is believed to receive the substrate farnesyl diphosphate through one lipophilic channel and release the product geranylgeranyl diphosphate through another. Bisphosphonates with two isoprenoid chains positioned on the α-carbon have proven to be effective inhibitors of this enzyme. Now a new motif has been prepared with one isoprenoid chain on the α-carbon, a second included as a phosphonate ester, and the potential for a third at the α-carbon. The pivaloyloxymethyl prodrugs of several compounds based on this motif have been prepared and the resulting compounds have been tested for their ability to disrupt protein geranylgeranylation and induce cytotoxicity in myeloma cells. The initial biological studies reveal activity consistent with GGDPS inhibition, and demonstrate a structure-function relationship which is dependent on the nature of the alkyl group at the α-carbon. PMID:27338660

  9. Discovering sequence motifs in quantitative and qualitative pepetide data

    DEFF Research Database (Denmark)

    Andreatta, Massimo

    and interpret such data. The first paper in this thesis presents a new, publicly available method based on artificial neural networks that allows custom analysis of quantitative peptide data. The online NNAlign web-server provides a simple yet powerful tool for the discovery of sequence motifs in large...... of interactions in a single experiment, with virtually unlimited choice of potential targets and variants of these targets. However, the amount and complexity of data produced by high-throughput techniques poses serious challenges to researchers of limited bioinformatics expertise who need to analyze...... with the presence of multiple motifs, due to the experimental setup or the actual poly-specificity of the receptor, in peptide data. A new algorithm, based on Gibbs sampling, identifies multiple specificities by performing two tasks simultaneously: alignment and clustering of peptide data. The method, available...

  10. SLIDER: A Generic Metaheuristic for the Discovery of Correlated Motifs in Protein-Protein Interaction Networks

    NARCIS (Netherlands)

    Boyen, P.; Dyck, van D.; Neven, F.; Ham, van R.C.H.J.; Dijk, van A.D.J.

    2011-01-01

    Correlated motif mining (CMM) is the problem of finding overrepresented pairs of patterns, called motifs, in sequences of interacting proteins. Algorithmic solutions for CMM thereby provide a computational method for predicting binding sites for protein interaction. In this paper, we adopt a motif-d

  11. Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs

    LENUS (Irish Health Repository)

    Casey, Fergal

    2011-08-22

    Abstract Background Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.

  12. Graph animals, subgraph sampling, and motif search in large networks

    Science.gov (United States)

    Baskerville, Kim; Grassberger, Peter; Paczuski, Maya

    2007-09-01

    We generalize a sampling algorithm for lattice animals (connected clusters on a regular lattice) to a Monte Carlo algorithm for “graph animals,” i.e., connected subgraphs in arbitrary networks. As with the algorithm in [N. Kashtan , Bioinformatics 20, 1746 (2004)], it provides a weighted sample, but the computation of the weights is much faster (linear in the size of subgraphs, instead of superexponential). This allows subgraphs with up to ten or more nodes to be sampled with very high statistics, from arbitrarily large networks. Using this together with a heuristic algorithm for rapidly classifying isomorphic graphs, we present results for two protein interaction networks obtained using the tandem affinity purification (TAP) method: one of Escherichia coli with 230 nodes and 695 links, and one for yeast (Saccharomyces cerevisiae) with roughly ten times more nodes and links. We find in both cases that most connected subgraphs are strong motifs ( Z scores >10 ) or antimotifs ( Z scores motifs in E. coli being (nearly) bipartite graphs and having many pairs of nodes that connect to the same neighbors, while dominant motifs in yeast tend towards completeness or contain large cliques. We also explore a number of methods that do not rely on measurements of Z scores or comparisons with null models. For instance, we discuss the influence of specific complexes like the 26S proteasome in yeast, where a small number of complexes dominate the k cores with large k and have a decisive effect on the strongest motifs with 6-8 nodes. We also present Zipf plots of counts versus rank. They show broad distributions that are not power laws, in contrast to the case when disconnected subgraphs are included.

  13. Exon silencing by UAGG motifs in response to neuronal excitation.

    Directory of Open Access Journals (Sweden)

    Ping An

    2007-02-01

    Full Text Available Alternative pre-mRNA splicing plays fundamental roles in neurons by generating functional diversity in proteins associated with the communication and connectivity of the synapse. The CI cassette of the NMDA R1 receptor is one of a variety of exons that show an increase in exon skipping in response to cell excitation, but the molecular nature of this splicing responsiveness is not yet understood. Here we investigate the molecular basis for the induced changes in splicing of the CI cassette exon in primary rat cortical cultures in response to KCl-induced depolarization using an expression assay with a tight neuron-specific readout. In this system, exon silencing in response to neuronal excitation was mediated by multiple UAGG-type silencing motifs, and transfer of the motifs to a constitutive exon conferred a similar responsiveness by gain of function. Biochemical analysis of protein binding to UAGG motifs in extracts prepared from treated and mock-treated cortical cultures showed an increase in nuclear hnRNP A1-RNA binding activity in parallel with excitation. Evidence for the role of the NMDA receptor and calcium signaling in the induced splicing response was shown by the use of specific antagonists, as well as cell-permeable inhibitors of signaling pathways. Finally, a wider role for exon-skipping responsiveness is shown to involve additional exons with UAGG-related silencing motifs, and transcripts involved in synaptic functions. These results suggest that, at the post-transcriptional level, excitable exons such as the CI cassette may be involved in strategies by which neurons mount adaptive responses to hyperstimulation.

  14. The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) family

    OpenAIRE

    Kelwick, Richard; Desanlis, Ines; Wheeler, Grant N.; Edwards, Dylan R

    2015-01-01

    The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) enzymes are secreted, multi-domain matrix-associated zinc metalloendopeptidases that have diverse roles in tissue morphogenesis and patho-physiological remodeling, in inflammation and in vascular biology. The human family includes 19 members that can be sub-grouped on the basis of their known substrates, namely the aggrecanases or proteoglycanases (ADAMTS1, 4, 5, 8, 9, 15 and 20), the procollagen N-propeptidases (ADAM...

  15. Defense-Inducing Volatiles: In Search of the Active Motif

    OpenAIRE

    Heil, Martin; Lion, Ulrich; Boland, Wilhelm

    2008-01-01

    Herbivore-induced volatile organic compounds (VOCs) are widely appreciated as an indirect defense mechanism since carnivorous arthropods use VOCs as cues for host localization and then attack herbivores. Another function of VOCs is plant–plant signaling. That VOCs elicit defensive responses in neighboring plants has been reported from various species, and different compounds have been found to be active. In order to search for a structural motif that characterizes active VOCs, we used lima be...

  16. A combinatorial code for splicing silencing: UAGG and GGGG motifs.

    Directory of Open Access Journals (Sweden)

    Kyoungha Han

    2005-05-01

    Full Text Available Alternative pre-mRNA splicing is widely used to regulate gene expression by tuning the levels of tissue-specific mRNA isoforms. Few regulatory mechanisms are understood at the level of combinatorial control despite numerous sequences, distinct from splice sites, that have been shown to play roles in splicing enhancement or silencing. Here we use molecular approaches to identify a ternary combination of exonic UAGG and 5'-splice-site-proximal GGGG motifs that functions cooperatively to silence the brain-region-specific CI cassette exon (exon 19 of the glutamate NMDA R1 receptor (GRIN1 transcript. Disruption of three components of the motif pattern converted the CI cassette into a constitutive exon, while predominant skipping was conferred when the same components were introduced, de novo, into a heterologous constitutive exon. Predominant exon silencing was directed by the motif pattern in the presence of six competing exonic splicing enhancers, and this effect was retained after systematically repositioning the two exonic UAGGs within the CI cassette. In this system, hnRNP A1 was shown to mediate silencing while hnRNP H antagonized silencing. Genome-wide computational analysis combined with RT-PCR testing showed that a class of skipped human and mouse exons can be identified by searches that preserve the sequence and spatial configuration of the UAGG and GGGG motifs. This analysis suggests that the multi-component silencing code may play an important role in the tissue-specific regulation of the CI cassette exon, and that it may serve more generally as a molecular language to allow for intricate adjustments and the coordination of splicing patterns from different genes.

  17. A motif-independent metric for DNA sequence specificity

    OpenAIRE

    Pinello Luca; Lo Bosco Giosuè; Hanlon Bret; Yuan Guo-Cheng

    2011-01-01

    Abstract Background Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. Results We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM)...

  18. Tricksters Trot to America: Areal Distribution of Folklore Motifs

    OpenAIRE

    Yuri Berezkin

    2010-01-01

    The folklore Trickster is usually considered a universally known combination of features intrinsic to human nature. However, there are strong anomalies in the areal distribution of such a figure. Sub-Saharan Africa, North America (except for the Arctic), Northeast Asia and South American Chaco not only are the preferred zones of tricksters’ activity but also share some peculiar trickster motifs unknown in most of the other regions. The range of animals which play the role of tricksters is als...

  19. Event Networks and the Identification of Crime Pattern Motifs.

    Directory of Open Access Journals (Sweden)

    Toby Davies

    Full Text Available In this paper we demonstrate the use of network analysis to characterise patterns of clustering in spatio-temporal events. Such clustering is of both theoretical and practical importance in the study of crime, and forms the basis for a number of preventative strategies. However, existing analytical methods show only that clustering is present in data, while offering little insight into the nature of the patterns present. Here, we show how the classification of pairs of events as close in space and time can be used to define a network, thereby generalising previous approaches. The application of graph-theoretic techniques to these networks can then offer significantly deeper insight into the structure of the data than previously possible. In particular, we focus on the identification of network motifs, which have clear interpretation in terms of spatio-temporal behaviour. Statistical analysis is complicated by the nature of the underlying data, and we provide a method by which appropriate randomised graphs can be generated. Two datasets are used as case studies: maritime piracy at the global scale, and residential burglary in an urban area. In both cases, the same significant 3-vertex motif is found; this result suggests that incidents tend to occur not just in pairs, but in fact in larger groups within a restricted spatio-temporal domain. In the 4-vertex case, different motifs are found to be significant in each case, suggesting that this technique is capable of discriminating between clustering patterns at a finer granularity than previously possible.

  20. MAR characteristic motifs mediate episomal vector in CHO cells.

    Science.gov (United States)

    Lin, Yan; Li, Zhaoxi; Wang, Tianyun; Wang, Xiaoyin; Wang, Li; Dong, Weihua; Jing, Changqin; Yang, Xianjun

    2015-04-01

    An ideal gene therapy vector should enable persistent transgene expression without limitations in safety and reproducibility. Recent researches' insight into the ability of chromosomal matrix attachment regions (MARs) to mediate episomal maintenance of genetic elements allowed the development of a circular episomal vector. Although a MAR-mediated engineered vector has been developed, little is known on which motifs of MAR confer this function during interaction with the host genome. Here, we report an artificially synthesized DNA fragment containing only characteristic motif sequences that served as an alternative to human beta-interferon matrix attachment region sequence. The potential of the vector to mediate gene transfer in CHO cells was investigated. The short synthetic MAR motifs were found to mediate episomal vector at a low copy number for many generations without integration into the host genome. Higher transgene expression was maintained for at least 4 months. In addition, MAR was maintained episomally and conferred sustained EGFP expression even in nonselective CHO cells. All the results demonstrated that MAR characteristic sequence-based vector can function as stable episomes in CHO cells, supporting long-term and effective transgene expression.

  1. Motif structure and cooperation in real-world complex networks

    Science.gov (United States)

    Salehi, Mostafa; Rabiee, Hamid R.; Jalili, Mahdi

    2010-12-01

    Networks of dynamical nodes serve as generic models for real-world systems in many branches of science ranging from mathematics to physics, technology, sociology and biology. Collective behavior of agents interacting over complex networks is important in many applications. The cooperation between selfish individuals is one of the most interesting collective phenomena. In this paper we address the interplay between the motifs’ cooperation properties and their abundance in a number of real-world networks including yeast protein-protein interaction, human brain, protein structure, email communication, dolphins’ social interaction, Zachary karate club and Net-science coauthorship networks. First, the amount of cooperativity for all possible undirected subgraphs with three to six nodes is calculated. To this end, the evolutionary dynamics of the Prisoner’s Dilemma game is considered and the cooperativity of each subgraph is calculated as the percentage of cooperating agents at the end of the simulation time. Then, the three- to six-node motifs are extracted for each network. The significance of the abundance of a motif, represented by a Z-value, is obtained by comparing them with some properly randomized versions of the original network. We found that there is always a group of motifs showing a significant inverse correlation between their cooperativity amount and Z-value, i.e. the more the Z-value the less the amount of cooperativity. This suggests that networks composed of well-structured units do not have good cooperativity properties.

  2. Insertion of tetracysteine motifs into dopamine transporter extracellular domains.

    Directory of Open Access Journals (Sweden)

    Deanna M Navaroli

    Full Text Available The neuronal dopamine transporter (DAT is a major determinant of extracellular dopamine (DA levels and is the primary target for a variety of addictive and therapeutic psychoactive drugs. DAT is acutely regulated by protein kinase C (PKC activation and amphetamine exposure, both of which modulate DAT surface expression by endocytic trafficking. In order to use live imaging approaches to study DAT endocytosis, methods are needed to exclusively label the DAT surface pool. The use of membrane impermeant, sulfonated biarsenic dyes holds potential as one such approach, and requires introduction of an extracellular tetracysteine motif (tetraCys; CCPGCC to facilitate dye binding. In the current study, we took advantage of intrinsic proline-glycine (Pro-Gly dipeptides encoded in predicted DAT extracellular domains to introduce tetraCys motifs into DAT extracellular loops 2, 3, and 4. [(3H]DA uptake studies, surface biotinylation and fluorescence microscopy in PC12 cells indicate that tetraCys insertion into the DAT second extracellular loop results in a functional transporter that maintains PKC-mediated downregulation. Introduction of tetraCys into extracellular loops 3 and 4 yielded DATs with severely compromised function that failed to mature and traffic to the cell surface. This is the first demonstration of successful introduction of a tetracysteine motif into a DAT extracellular domain, and may hold promise for use of biarsenic dyes in live DAT imaging studies.

  3. Interlinking motifs and entropy landscapes of statistically interacting particles

    Directory of Open Access Journals (Sweden)

    P. Lu

    2012-03-01

    Full Text Available The s=1/2 Ising chain with uniform nearest-neighbor and next-nearest-neighbor coupling is used to construct a system of floating particles characterized by motifs of up to six consecutive local spins. The spin couplings cause the assembly of particles which, in turn, remain free of interaction energies even at high density. All microstates are configurations of particles from one of three different sets, excited from pseudo-vacua associated with ground states of periodicities one, two, and four. The motifs of particles and elements of pseudo-vacuum interlink in two shared site variables. The statistical interaction between particles is encoded in a generalized Pauli principle, describing how the placement of one particle modifies the options for placing further particles. In the statistical mechanical analysis arbitrary energies can be assigned to all particle species. The entropy is a function of the particle populations. The statistical interaction specifications are transparently built into that expression. The energies and structures of the particles alone govern the ordering at low temperature. Under special circumstances the particles can be replaced by more fundamental particles with shorter motifs that interlink in only one shared site variable. Structures emerge from interactions on two levels: particles with shapes from coupled spins and long-range ordering tendencies from statistically interacting particles with shapes.

  4. TOPDOM: database of conservatively located domains and motifs in proteins

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E.

    2016-01-01

    Summary: The TOPDOM database—originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins—has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. Availability and implementation: TOPDOM database is available at http://topdom.enzim.hu. The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. Contact: tusnady.gabor@ttk.mta.hu. Supplementary information: Supplementary data are available at Bioinformatics online. PMID:27153630

  5. A Monte Carlo-based framework enhances the discovery and interpretation of regulatory sequence motifs

    Directory of Open Access Journals (Sweden)

    Seitzer Phillip

    2012-11-01

    Full Text Available Abstract Background Discovery of functionally significant short, statistically overrepresented subsequence patterns (motifs in a set of sequences is a challenging problem in bioinformatics. Oftentimes, not all sequences in the set contain a motif. These non-motif-containing sequences complicate the algorithmic discovery of motifs. Filtering the non-motif-containing sequences from the larger set of sequences while simultaneously determining the identity of the motif is, therefore, desirable and a non-trivial problem in motif discovery research. Results We describe MotifCatcher, a framework that extends the sensitivity of existing motif-finding tools by employing random sampling to effectively remove non-motif-containing sequences from the motif search. We developed two implementations of our algorithm; each built around a commonly used motif-finding tool, and applied our algorithm to three diverse chromatin immunoprecipitation (ChIP data sets. In each case, the motif finder with the MotifCatcher extension demonstrated improved sensitivity over the motif finder alone. Our approach organizes candidate functionally significant discovered motifs into a tree, which allowed us to make additional insights. In all cases, we were able to support our findings with experimental work from the literature. Conclusions Our framework demonstrates that additional processing at the sequence entry level can significantly improve the performance of existing motif-finding tools. For each biological data set tested, we were able to propose novel biological hypotheses supported by experimental work from the literature. Specifically, in Escherichia coli, we suggested binding site motifs for 6 non-traditional LexA protein binding sites; in Saccharomyces cerevisiae, we hypothesize 2 disparate mechanisms for novel binding sites of the Cse4p protein; and in Halobacterium sp. NRC-1, we discoverd subtle differences in a general transcription factor (GTF binding site motif

  6. A novel alignment-free method for comparing transcription factor binding site motifs.

    Directory of Open Access Journals (Sweden)

    Minli Xu

    Full Text Available BACKGROUND: Transcription factor binding site (TFBS motifs can be accurately represented by position frequency matrices (PFM or other equivalent forms. We often need to compare TFBS motifs using their PFMs in order to search for similar motifs in a motif database, or cluster motifs according to their binding preference. The majority of current methods for motif comparison involve a similarity metric for column-to-column comparison and a method to find the optimal position alignment between the two compared motifs. In some applications, alignment-free methods might be preferred; however, few such methods with high accuracy have been described. METHODOLOGY/PRINCIPAL FINDINGS: Here we describe a novel alignment-free method for quantifying the similarity of motifs using their PFMs by converting PFMs into k-mer vectors. The motifs could then be compared by measuring the similarity among their corresponding k-mer vectors. CONCLUSIONS/SIGNIFICANCE: We demonstrate that our method in general achieves similar performance or outperforms the existing methods for clustering motifs according to their binding preference and identifying similar motifs of transcription factors of the same family.

  7. A Novel Alignment-Free Method for Comparing Transcription Factor Binding Site Motifs

    Science.gov (United States)

    Xu, Minli; Su, Zhengchang

    2010-01-01

    Background Transcription factor binding site (TFBS) motifs can be accurately represented by position frequency matrices (PFM) or other equivalent forms. We often need to compare TFBS motifs using their PFMs in order to search for similar motifs in a motif database, or cluster motifs according to their binding preference. The majority of current methods for motif comparison involve a similarity metric for column-to-column comparison and a method to find the optimal position alignment between the two compared motifs. In some applications, alignment-free methods might be preferred; however, few such methods with high accuracy have been described. Methodology/Principal Findings Here we describe a novel alignment-free method for quantifying the similarity of motifs using their PFMs by converting PFMs into k-mer vectors. The motifs could then be compared by measuring the similarity among their corresponding k-mer vectors. Conclusions/Significance We demonstrate that our method in general achieves similar performance or outperforms the existing methods for clustering motifs according to their binding preference and identifying similar motifs of transcription factors of the same family. PMID:20098703

  8. Motif-based analysis of large nucleotide data sets using MEME-ChIP.

    Science.gov (United States)

    Ma, Wenxiu; Noble, William S; Bailey, Timothy L

    2014-01-01

    MEME-ChIP is a web-based tool for analyzing motifs in large DNA or RNA data sets. It can analyze peak regions identified by ChIP-seq, cross-linking sites identified by CLIP-seq and related assays, as well as sets of genomic regions selected using other criteria. MEME-ChIP performs de novo motif discovery, motif enrichment analysis, motif location analysis and motif clustering, providing a comprehensive picture of the DNA or RNA motifs that are enriched in the input sequences. MEME-ChIP performs two complementary types of de novo motif discovery: weight matrix-based discovery for high accuracy; and word-based discovery for high sensitivity. Motif enrichment analysis using DNA or RNA motifs from human, mouse, worm, fly and other model organisms provides even greater sensitivity. MEME-ChIP's interactive HTML output groups and aligns significant motifs to ease interpretation. This protocol takes less than 3 h, and it provides motif discovery approaches that are distinct and complementary to other online methods. PMID:24853928

  9. Leucine-based receptor sorting motifs are dependent on the spacing relative to the plasma membrane

    DEFF Research Database (Denmark)

    Geisler, C; Dietrich, J; Nielsen, B L;

    1998-01-01

    amino acid, is constitutively active. In this study, we have investigated how the spacing relative to the plasma membrane affects the function of both types of leucine-based motifs. For phosphorylation-dependent leucine-based motifs, a minimal spacing of 7 residues between the plasma membrane and the...... phospho-acceptor was required for phosphorylation and thereby activation of the motifs. For constitutively active leucine-based motifs, a minimal spacing of 6 residues between the plasma membrane and the acidic residue was required for optimal activity of the motifs. In addition, we found that the acidic......Many integral membrane proteins contain leucine-based motifs within their cytoplasmic domains that mediate internalization and intracellular sorting. Two types of leucine-based motifs have been identified. One type is dependent on phosphorylation, whereas the other type, which includes an acidic...

  10. Discriminative motif discovery in DNA and protein sequences using the DEME algorithm

    Directory of Open Access Journals (Sweden)

    Bailey Timothy L

    2007-10-01

    Full Text Available Abstract Background Motif discovery aims to detect short, highly conserved patterns in a collection of unaligned DNA or protein sequences. Discriminative motif finding algorithms aim to increase the sensitivity and selectivity of motif discovery by utilizing a second set of sequences, and searching only for patterns that can differentiate the two sets of sequences. Potential applications of discriminative motif discovery include discovering transcription factor binding site motifs in ChIP-chip data and finding protein motifs involved in thermal stability using sets of orthologous proteins from thermophilic and mesophilic organisms. Results We describe DEME, a discriminative motif discovery algorithm for use with protein and DNA sequences. Input to DEME is two sets of sequences; a "positive" set and a "negative" set. DEME represents motifs using a probabilistic model, and uses a novel combination of global and local search to find the motif that optimally discriminates between the two sets of sequences. DEME is unique among discriminative motif finders in that it uses an informative Bayesian prior on protein motif columns, allowing it to incorporate prior knowledge of residue characteristics. We also introduce four, synthetic, discriminative motif discovery problems that are designed for evaluating discriminative motif finders in various biologically motivated contexts. We test DEME using these synthetic problems and on two biological problems: finding yeast transcription factor binding motifs in ChIP-chip data, and finding motifs that discriminate between groups of thermophilic and mesophilic orthologous proteins. Conclusion Using artificial data, we show that DEME is more effective than a non-discriminative approach when there are "decoy" motifs or when a variant of the motif is present in the "negative" sequences. With real data, we show that DEME is as good, but not better than non-discriminative algorithms at discovering yeast transcription

  11. DNA nanotechnology based on i-motif structures.

    Science.gov (United States)

    Dong, Yuanchen; Yang, Zhongqiang; Liu, Dongsheng

    2014-06-17

    CONSPECTUS: Most biological processes happen at the nanometer scale, and understanding the energy transformations and material transportation mechanisms within living organisms has proved challenging. To better understand the secrets of life, researchers have investigated artificial molecular motors and devices over the past decade because such systems can mimic certain biological processes. DNA nanotechnology based on i-motif structures is one system that has played an important role in these investigations. In this Account, we summarize recent advances in functional DNA nanotechnology based on i-motif structures. The i-motif is a DNA quadruplex that occurs as four stretches of cytosine repeat sequences form C·CH(+) base pairs, and their stabilization requires slightly acidic conditions. This unique property has produced the first DNA molecular motor driven by pH changes. The motor is reliable, and studies show that it is capable of millisecond running speeds, comparable to the speed of natural protein motors. With careful design, the output of these types of motors was combined to drive micrometer-sized cantilevers bend. Using established DNA nanostructure assembly and functionalization methods, researchers can easily integrate the motor within other DNA assembled structures and functional units, producing DNA molecular devices with new functions such as suprahydrophobic/suprahydrophilic smart surfaces that switch, intelligent nanopores triggered by pH changes, molecular logic gates, and DNA nanosprings. Recently, researchers have produced motors driven by light and electricity, which have allowed DNA motors to be integrated within silicon-based nanodevices. Moreover, some devices based on i-motif structures have proven useful for investigating processes within living cells. The pH-responsiveness of the i-motif structure also provides a way to control the stepwise assembly of DNA nanostructures. In addition, because of the stability of the i-motif, this

  12. DNA nanotechnology based on i-motif structures.

    Science.gov (United States)

    Dong, Yuanchen; Yang, Zhongqiang; Liu, Dongsheng

    2014-06-17

    CONSPECTUS: Most biological processes happen at the nanometer scale, and understanding the energy transformations and material transportation mechanisms within living organisms has proved challenging. To better understand the secrets of life, researchers have investigated artificial molecular motors and devices over the past decade because such systems can mimic certain biological processes. DNA nanotechnology based on i-motif structures is one system that has played an important role in these investigations. In this Account, we summarize recent advances in functional DNA nanotechnology based on i-motif structures. The i-motif is a DNA quadruplex that occurs as four stretches of cytosine repeat sequences form C·CH(+) base pairs, and their stabilization requires slightly acidic conditions. This unique property has produced the first DNA molecular motor driven by pH changes. The motor is reliable, and studies show that it is capable of millisecond running speeds, comparable to the speed of natural protein motors. With careful design, the output of these types of motors was combined to drive micrometer-sized cantilevers bend. Using established DNA nanostructure assembly and functionalization methods, researchers can easily integrate the motor within other DNA assembled structures and functional units, producing DNA molecular devices with new functions such as suprahydrophobic/suprahydrophilic smart surfaces that switch, intelligent nanopores triggered by pH changes, molecular logic gates, and DNA nanosprings. Recently, researchers have produced motors driven by light and electricity, which have allowed DNA motors to be integrated within silicon-based nanodevices. Moreover, some devices based on i-motif structures have proven useful for investigating processes within living cells. The pH-responsiveness of the i-motif structure also provides a way to control the stepwise assembly of DNA nanostructures. In addition, because of the stability of the i-motif, this

  13. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  14. Identification of imine reductase-specific sequence motifs.

    Science.gov (United States)

    Fademrecht, Silvia; Scheller, Philipp N; Nestl, Bettina M; Hauer, Bernhard; Pleiss, Jürgen

    2016-05-01

    Chiral amines are valuable building blocks for the production of a variety of pharmaceuticals, agrochemicals and other specialty chemicals. Only recently, imine reductases (IREDs) were discovered which catalyze the stereoselective reduction of imines to chiral amines. Although several IREDs were biochemically characterized in the last few years, knowledge of the reaction mechanism and the molecular basis of substrate specificity and stereoselectivity is limited. To gain further insights into the sequence-function relationships, the Imine Reductase Engineering Database (www.IRED.BioCatNet.de) was established and a systematic analysis of 530 putative IREDs was performed. A standard numbering scheme based on R-IRED-Sk was introduced to facilitate the identification and communication of structurally equivalent positions in different proteins. A conservation analysis revealed a highly conserved cofactor binding region and a predominantly hydrophobic substrate binding cleft. Two IRED-specific motifs were identified, the cofactor binding motif GLGxMGx5 [ATS]x4 Gx4 [VIL]WNR[TS]x2 [KR] and the active site motif Gx[DE]x[GDA]x[APS]x3 {K}x[ASL]x[LMVIAG]. Our results indicate a preference toward NADPH for all IREDs and explain why, despite their sequence similarity to β-hydroxyacid dehydrogenases (β-HADs), no conversion of β-hydroxyacids has been observed. Superfamily-specific conservations were investigated to explore the molecular basis of their stereopreference. Based on our analysis and previous experimental results on IRED mutants, an exclusive role of standard position 187 for stereoselectivity is excluded. Alternatively, two standard positions 139 and 194 were identified which are superfamily-specifically conserved and differ in R- and S-selective enzymes. Proteins 2016; 84:600-610. © 2016 Wiley Periodicals, Inc. PMID:26857686

  15. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  16. Tricksters Trot to America: Areal Distribution of Folklore Motifs

    Directory of Open Access Journals (Sweden)

    Yuri Berezkin

    2010-12-01

    Full Text Available The folklore Trickster is usually considered a universally known combination of features intrinsic to human nature. However, there are strong anomalies in the areal distribution of such a figure. Sub-Saharan Africa, North America (except for the Arctic, Northeast Asia and South American Chaco not only are the preferred zones of tricksters’ activity but also share some peculiar trickster motifs unknown in most of the other regions. The range of animals which play the role of tricksters is also restricted and not always easily explained, E.g. the Hare and Spider, known in both Africa and North America, are neither “mediators” between life and death (suggested by C. Lévi-Strauss for Coyote nor “really tricky” (“materialistic” hypothesis of M. Harris. The set of trickster motifs and the zoo- or anthropomorphic impersonations of the Trickster are independentvariables. The same episodes are easily linked to different tricksters while every trickster usually attracts episodes characteristic of a particular region. Though the original emergence of Trickster as a mental construct can indeed be rooted in human psychology (and where else?, the distribution of tricksters in folklore is discretionary and depends of many uncertain, i.e. chance, factors. The wide spread or lack of tricksters in certain cultural areas hardly reflect any fundamental differences in the psychology of inhabitants of these regions. The study of trickster motifs, just as of any other folklore motifs, helps us reconstruct possible historic links between populations. The African – North American links remain enigmatic (independent emergence is possible but slight historicallinks cannot be completely excluded but the parallels between (Western and Northeast Siberian – North American tricksters are almost certainly due to former cultural ties across Northern Asia. Another interesting case is the proliferation of tricksters with different zoomorphic and other identities

  17. Present status of quinoxaline motifs: excellent pathfinders in therapeutic medicine.

    Science.gov (United States)

    Ajani, Olayinka Oyewale

    2014-10-01

    Quinoxalines belong to a class of excellent heterocyclic scaffolds owing to their wide biological properties and diverse therapeutic applications in medicinal research. They are complementary in shapes and charges to numerous biomolecules they interact with, thereby resulting in increased binding affinity. The pharmacokinetic properties of drugs bearing quinoxaline cores have shown them to be relatively easy to administer either as intramuscular solutions, oral capsules or rectal suppositories. This work deals with recent advances in the synthesis and pharmacological diversities of quinoxaline motifs which might pave ways for novel drugs development.

  18. Nucleic Acid i-Motif Structures in Analytical Chemistry.

    Science.gov (United States)

    Alba, Joan Josep; Sadurní, Anna; Gargallo, Raimundo

    2016-09-01

    Under the appropriate experimental conditions of pH and temperature, cytosine-rich segments in DNA or RNA sequences may produce a characteristic folded structure known as an i-motif. Besides its potential role in vivo, which is still under investigation, this structure has attracted increasing interest in other fields due to its sharp, fast and reversible pH-driven conformational changes. This "on/off" switch at molecular level is being used in nanotechnology and analytical chemistry to develop nanomachines and sensors, respectively. This paper presents a review of the latest applications of this structure in the field of chemical analysis.

  19. Real Time Motif Classification from Database Using Intelligent Algorithms

    Directory of Open Access Journals (Sweden)

    Paresh Kotak

    2012-12-01

    Full Text Available The amount of raw data being accumulated in the databases is increasing at an inconceivable rate.However, these data-rich databases are poor in providing substantial information. This is where datamining comes into picture. Specifically, data mining is "the process of extracting or mining informationfrom large amount of data". Motif classification has been an active area of research in data mining. Itconsists of assigning a data instance to one of the predefined classes/groups based upon the knowledgegained from previously seen (classified data.

  20. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-25

    Leucine-aspartic acid (LD) motifs are short helical protein-protein interaction motifs involved in cell motility, survival and communication. LD motif interactions are also implicated in cancer metastasis and are targeted by several viruses. LD motifs are notoriously difficult to detect because sequence pattern searches lead to an excessively high number of false positives. Hence, despite 20 years of research, only six LD motif–containing proteins are known in humans, three of which are close homologues of the paxillin family. To enable the proteome-wide discovery of LD motifs, we developed LD Motif Finder (LDMF), a web tool based on machine learning that combines sequence information with structural predictions to detect LD motifs with high accuracy. LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  1. Automated protein motif generation in the structure-based protein function prediction tool ProMOL.

    Science.gov (United States)

    Osipovitch, Mikhail; Lambrecht, Mitchell; Baker, Cameron; Madha, Shariq; Mills, Jeffrey L; Craig, Paul A; Bernstein, Herbert J

    2015-12-01

    ProMOL, a plugin for the PyMOL molecular graphics system, is a structure-based protein function prediction tool. ProMOL includes a set of routines for building motif templates that are used for screening query structures for enzyme active sites. Previously, each motif template was generated manually and required supervision in the optimization of parameters for sensitivity and selectivity. We developed an algorithm and workflow for the automation of motif building and testing routines in ProMOL. The algorithm uses a set of empirically derived parameters for optimization and requires little user intervention. The automated motif generation algorithm was first tested in a performance comparison with a set of manually generated motifs based on identical active sites from the same 112 PDB entries. The two sets of motifs were equally effective in identifying alignments with homologs and in rejecting alignments with unrelated structures. A second set of 296 active site motifs were generated automatically, based on Catalytic Site Atlas entries with literature citations, as an expansion of the library of existing manually generated motif templates. The new motif templates exhibited comparable performance to the existing ones in terms of hit rates against native structures, homologs with the same EC and Pfam designations, and randomly selected unrelated structures with a different EC designation at the first EC digit, as well as in terms of RMSD values obtained from local structural alignments of motifs and query structures. This research is supported by NIH grant GM078077. PMID:26573864

  2. Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information

    Directory of Open Access Journals (Sweden)

    James Douglas Engel

    2007-12-01

    Full Text Available Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to discovery of novel motifs (including transcription factor sites with previously uncharacterized function in gene expression. Coupled with the complexity underlying tissue-specific gene expression, there are several motifs that are putatively responsible for expression in a certain cell type. This has important implications in understanding fundamental biological processes such as development and disease progression. In this work, we present an approach to the identification of motifs (not necessarily transcription factor sites and examine its application to some questions in current bioinformatics research. These motifs are seen to discriminate tissue-specific gene promoter or regulatory regions from those that are not tissue-specific. There are two main contributions of this work. Firstly, we propose the use of directed information for such classification constrained motif discovery, and then use the selected features with a support vector machine (SVM classifier to find the tissue specificity of any sequence of interest. Such analysis yields several novel interesting motifs that merit further experimental characterization. Furthermore, this approach leads to a principled framework for the prospective examination of any chosen motif to be discriminatory motif for a group of coexpressed/coregulated genes, thereby integrating sequence and expression perspectives. We hypothesize that the discovery of these motifs would enable the large-scale investigation for the tissue-specific regulatory role of any conserved sequence element identified from genome-wide studies.

  3. Motif Discovery in Tissue-Specific Regulatory Sequences Using Directed Information

    Directory of Open Access Journals (Sweden)

    States David

    2007-01-01

    Full Text Available Motif discovery for the identification of functional regulatory elements underlying gene expression is a challenging problem. Sequence inspection often leads to discovery of novel motifs (including transcription factor sites with previously uncharacterized function in gene expression. Coupled with the complexity underlying tissue-specific gene expression, there are several motifs that are putatively responsible for expression in a certain cell type. This has important implications in understanding fundamental biological processes such as development and disease progression. In this work, we present an approach to the identification of motifs (not necessarily transcription factor sites and examine its application to some questions in current bioinformatics research. These motifs are seen to discriminate tissue-specific gene promoter or regulatory regions from those that are not tissue-specific. There are two main contributions of this work. Firstly, we propose the use of directed information for such classification constrained motif discovery, and then use the selected features with a support vector machine (SVM classifier to find the tissue specificity of any sequence of interest. Such analysis yields several novel interesting motifs that merit further experimental characterization. Furthermore, this approach leads to a principled framework for the prospective examination of any chosen motif to be discriminatory motif for a group of coexpressed/coregulated genes, thereby integrating sequence and expression perspectives. We hypothesize that the discovery of these motifs would enable the large-scale investigation for the tissue-specific regulatory role of any conserved sequence element identified from genome-wide studies.

  4. Transduction motif analysis of gastric cancer based on a human signaling network

    Energy Technology Data Exchange (ETDEWEB)

    Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Fuzhou General Hospital of Nanjing Command, Department of Gastroenterology, Fuzhou, China, Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)

    2014-04-04

    To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed by CytoScape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used for the mining of gastric cancer-related motifs in a network with three vertices. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was configured to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.

  5. A motif for reversible nitric oxide interactions in metalloenzymes.

    Science.gov (United States)

    Zhang, Shiyu; Melzer, Marie M; Sen, S Nermin; Çelebi-Ölçüm, Nihan; Warren, Timothy H

    2016-07-01

    Nitric oxide (NO) participates in numerous biological processes, such as signalling in the respiratory system and vasodilation in the cardiovascular system. Many metal-mediated processes involve direct reaction of NO to form a metal-nitrosyl (M-NO), as occurs at the Fe(2+) centres of soluble guanylate cyclase or cytochrome c oxidase. However, some copper electron-transfer proteins that bear a type 1 Cu site (His2Cu-Cys) reversibly bind NO by an unknown motif. Here, we use model complexes of type 1 Cu sites based on tris(pyrazolyl)borate copper thiolates [Cu(II)]-SR to unravel the factors involved in NO reactivity. Addition of NO provides the fully characterized S-nitrosothiol adduct [Cu(I)](κ(1)-N(O)SR), which reversibly loses NO on purging with an inert gas. Computational analysis outlines a low-barrier pathway for the capture and release of NO. These findings suggest a new motif for reversible binding of NO at bioinorganic metal centres that can interconvert NO and RSNO molecular signals at copper sites. PMID:27325092

  6. Graph animals, subgraph sampling and motif search in large networks

    CERN Document Server

    Baskerville, Kim; Paczuski, Maya

    2007-01-01

    We generalize a sampling algorithm for lattice animals (connected clusters on a regular lattice) to a Monte Carlo algorithm for `graph animals', i.e. connected subgraphs in arbitrary networks. As with the algorithm in [N. Kashtan et al., Bioinformatics 20, 1746 (2004)], it provides a weighted sample, but the computation of the weights is much faster (linear in the size of subgraphs, instead of super-exponential). This allows subgraphs with up to ten or more nodes to be sampled with very high statistics, from arbitrarily large networks. Using this together with a heuristic algorithm for rapidly classifying isomorphic graphs, we present results for two protein interaction networks obtained using the TAP high throughput method: one of Escherichia coli with 230 nodes and 695 links, and one for yeast (Saccharomyces cerevisiae) with roughly ten times more nodes and links. We find in both cases that most connected subgraphs are strong motifs (Z-scores >10) or anti-motifs (Z-scores <-10) when the null model is the...

  7. The Origin of Motif Families in Food Webs

    CERN Document Server

    Klaise, Janis

    2016-01-01

    Food webs have been found to exhibit remarkable motif profiles, patterns in the relative prevalences of all possible three-species sub-graphs, and this has been related to ecosystem properties such as stability and robustness. Analysing 46 food webs of various kinds, we find that most food webs fall into one of two distinct motif families. The separation between the families is well predicted by a global measure of hierarchical order in directed networks - trophic coherence. We find that trophic coherence is also a good predictor for the extent of omnivory, defined as the tendency of species to feed on multiple trophic levels. We compare our results to a network assembly model that admits tunable trophic coherence via a single free parameter. The model is able to generate food webs in either of the two families by varying this parameter, and correctly classifies almost all the food webs in our database. This establishes a link between global order and local preying patterns in food webs.

  8. The discodermolide hairpin structure flows from conformationally stable modular motifs.

    Science.gov (United States)

    Jogalekar, Ashutosh S; Kriel, Frederik H; Shi, Qi; Cornett, Ben; Cicero, Daniel; Snyder, James P

    2010-01-14

    (+)-Discodermolide (DDM), a polyketide macrolide from marine sponge, is a potent microtubule assembly promoter. Reported solid-state, solution, and protein-bound DDM conformations reveal the unusual result that a common hairpin conformational motif exists in all three microenvironments. No other flexible microtubule binding agent exhibits such constancy of conformation. In the present study, we combine force-field conformational searches with NMR deconvolution in different solvents to compare DDM conformers with those observed in other environments. While several conformational families are perceived, the hairpin form dominates. The stability of this motif is dictated primarily by steric factors arising from repeated modular segments in DDM composed of the C(Me)-CHX-C(Me) fragment. Furthermore, docking protocols were utilized to probe the DDM binding mode in beta-tubulin. A previously suggested pose is substantiated (Pose-1), while an alternative (Pose-2) has been identified. SAR analysis for DDM analogues differentiates the two poses and suggests that Pose-2 is better able to accommodate the biodata.

  9. A motif-independent metric for DNA sequence specificity

    Directory of Open Access Journals (Sweden)

    Pinello Luca

    2011-10-01

    Full Text Available Abstract Background Genome-wide mapping of protein-DNA interactions has been widely used to investigate biological functions of the genome. An important question is to what extent such interactions are regulated at the DNA sequence level. However, current investigation is hampered by the lack of computational methods for systematic evaluating sequence specificity. Results We present a simple, unbiased quantitative measure for DNA sequence specificity called the Motif Independent Measure (MIM. By analyzing both simulated and real experimental data, we found that the MIM measure can be used to detect sequence specificity independent of presence of transcription factor (TF binding motifs. We also found that the level of specificity associated with H3K4me1 target sequences is highly cell-type specific and highest in embryonic stem (ES cells. We predicted H3K4me1 target sequences by using the N- score model and found that the prediction accuracy is indeed high in ES cells.The software to compute the MIM is freely available at: https://github.com/lucapinello/mim. Conclusions Our method provides a unified framework for quantifying DNA sequence specificity and serves as a guide for development of sequence-based prediction models.

  10. Over-represented localized sequence motifs in ribosomal protein gene promoters of basal metazoans.

    Science.gov (United States)

    Perina, Drago; Korolija, Marina; Roller, Maša; Harcet, Matija; Jeličić, Branka; Mikoč, Andreja; Cetković, Helena

    2011-07-01

    Equimolecular presence of ribosomal proteins (RPs) in the cell is needed for ribosome assembly and is achieved by synchronized expression of ribosomal protein genes (RPGs) with promoters of similar strengths. Over-represented motifs of RPG promoter regions are identified as targets for specific transcription factors. Unlike RPs, those motifs are not conserved between mammals, drosophila, and yeast. We analyzed RPGs proximal promoter regions of three basal metazoans with sequenced genomes: sponge, cnidarian, and placozoan and found common features, such as 5'-terminal oligopyrimidine tracts and TATA-boxes. Furthermore, we identified over-represented motifs, some of which displayed the highest similarity to motifs abundant in human RPG promoters and not present in Drosophila or yeast. Our results indicate that humans over-represented motifs, as well as corresponding domains of transcription factors, were established very early in metazoan evolution. The fast evolving nature of RPGs regulatory network leads to formation of other, lineage specific, over-represented motifs. PMID:21457775

  11. Sequence Length Limits for Controlling False Positives in Discovering Nucleotide Sequence Motifs

    Institute of Scientific and Technical Information of China (English)

    CHEN Lei; QiAN Zi-liang

    2008-01-01

    In the study of motif discovery, especially the transcription factor DNA binding sites discovery, a too long input sequence would return non-informative motifs rather than those biological functional motifs. This paper gave theoretical analyses and computational experiments to suggest the length limits of the input sequence. When the sequence length exceeds a certain critical point, the probability of discovering the motif decreases sharply. The work not only gave an explanation on the unsatisfying results of the existed motif discovery problems that the input sequence length might be too long and exceed the point, but also provided an estimation of input sequence length we should accept to get more meaningful and reliable results in motif discovery.

  12. Vampirism today : the change of the vampire motif from the gothic novel to today's fantasy literature

    OpenAIRE

    2009-01-01

    This thesis examins the change of the vampire motif throughout time. How have vampires and their clichés changed and why? Starting with a brief examination of the 'classical' litarary vampire, I mainly focus on contemporary fantasy literature by discussing recent works of vampire fiction. The adaptation of the vampire motif in role-playing games will as well be discussed as the effects the vampire film had on the motif.

  13. Belief-propagation algorithm and the Ising model on networks with arbitrary distributions of motifs

    OpenAIRE

    Yoon, S; Goltsev, A. V.; Dorogovtsev, S. N.; Mendes, J. F. F.

    2011-01-01

    We generalize the belief-propagation algorithm to sparse random networks with arbitrary distributions of motifs (triangles, loops, etc.). Each vertex in these networks belongs to a given set of motifs (generalization of the configuration model). These networks can be treated as sparse uncorrelated hypergraphs in which hyperedges represent motifs. Here a hypergraph is a generalization of a graph, where a hyperedge can connect any number of vertices. These uncorrelated hypergraphs are tree-like...

  14. An analysis of the positional distribution of DNA motifs in promoter regions and its biological relevance

    OpenAIRE

    Vinga Susana; Casimiro Ana C; Freitas Ana T; Oliveira Arlindo L

    2008-01-01

    Abstract Background Motif finding algorithms have developed in their ability to use computationally efficient methods to detect patterns in biological sequences. However the posterior classification of the output still suffers from some limitations, which makes it difficult to assess the biological significance of the motifs found. Previous work has highlighted the existence of positional bias of motifs in the DNA sequences, which might indicate not only that the pattern is important, but als...

  15. NestedMICA as an ab initio protein motif discovery tool

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2008-01-01

    Full Text Available Abstract Background Discovering overrepresented patterns in amino acid sequences is an important step in protein functional element identification. We adapted and extended NestedMICA, an ab initio motif finder originally developed for finding transcription binding site motifs, to find short protein signals, and compared its performance with another popular protein motif finder, MEME. NestedMICA, an open source protein motif discovery tool written in Java, is driven by a Monte Carlo technique called Nested Sampling. It uses multi-class sequence background models to represent different "uninteresting" parts of sequences that do not contain motifs of interest. In order to assess NestedMICA as a protein motif finder, we have tested it on synthetic datasets produced by spiking instances of known motifs into a randomly selected set of protein sequences. NestedMICA was also tested using a biologically-authentic test set, where we evaluated its performance with respect to varying sequence length. Results Generally NestedMICA recovered most of the short (3–9 amino acid long test protein motifs spiked into a test set of sequences at different frequencies. We showed that it can be used to find multiple motifs at the same time, too. In all the assessment experiments we carried out, its overall motif discovery performance was better than that of MEME. Conclusion NestedMICA proved itself to be a robust and sensitive ab initio protein motif finder, even for relatively short motifs that exist in only a small fraction of sequences. Availability NestedMICA is available under the Lesser GPL open-source license from: http://www.sanger.ac.uk/Software/analysis/nmica/

  16. A motif extraction algorithm based on hashing and modulo-4 arithmetic.

    Science.gov (United States)

    Sheng, Huitao; Mehrotra, Kishan; Mohan, Chilukuri; Raina, Ramesh

    2008-01-01

    We develop an algorithm to identify cis-elements in promoter regions of coregulated genes. This algorithm searches for subsequences of desired length whose frequency of occurrence is relatively high, while accounting for slightly perturbed variants using hash table and modulo arithmetic. Motifs are evaluated using profile matrices and higher-order Markov background model. Simulation results show that our algorithm discovers more motifs present in the test sequences, when compared with two well-known motif-discovery tools (MDScan and AlignACE). The algorithm produces very promising results on real data set; the output of the algorithm contained many known motifs.

  17. Mapping network motif tunability and robustness in the design of synthetic signaling circuits.

    Directory of Open Access Journals (Sweden)

    Sergio Iadevaia

    Full Text Available Cellular networks are highly dynamic in their function, yet evolutionarily conserved in their core network motifs or topologies. Understanding functional tunability and robustness of network motifs to small perturbations in function and structure is vital to our ability to synthesize controllable circuits. In establishing core sets of network motifs, we selected topologies that are overrepresented in mammalian networks, including the linear, feedback, feed-forward, and bifan circuits. Static and dynamic tunability of network motifs were defined as the motif ability to respectively attain steady-state or transient outputs in response to pre-defined input stimuli. Detailed computational analysis suggested that static tunability is insensitive to the circuit topology, since all of the motifs displayed similar ability to attain predefined steady-state outputs in response to constant inputs. Dynamic tunability, in contrast, was tightly dependent on circuit topology, with some motifs performing superiorly in achieving observed time-course outputs. Finally, we mapped dynamic tunability onto motif topologies to determine robustness of motif structures to changes in topology and identify design principles for the rational assembly of robust synthetic networks.

  18. Comparative genomic analysis of upstream miRNA regulatory motifs in Caenorhabditis.

    Science.gov (United States)

    Jovelin, Richard; Krizus, Aldis; Taghizada, Bakhtiyar; Gray, Jeremy C; Phillips, Patrick C; Claycomb, Julie M; Cutter, Asher D

    2016-07-01

    MicroRNAs (miRNAs) comprise a class of short noncoding RNA molecules that play diverse developmental and physiological roles by controlling mRNA abundance and protein output of the vast majority of transcripts. Despite the importance of miRNAs in regulating gene function, we still lack a complete understanding of how miRNAs themselves are transcriptionally regulated. To fill this gap, we predicted regulatory sequences by searching for abundant short motifs located upstream of miRNAs in eight species of Caenorhabditis nematodes. We identified three conserved motifs across the Caenorhabditis phylogeny that show clear signatures of purifying selection from comparative genomics, patterns of nucleotide changes in motifs of orthologous miRNAs, and correlation between motif incidence and miRNA expression. We then validated our predictions with transgenic green fluorescent protein reporters and site-directed mutagenesis for a subset of motifs located in an enhancer region upstream of let-7 We demonstrate that a CT-dinucleotide motif is sufficient for proper expression of GFP in the seam cells of adult C. elegans, and that two other motifs play incremental roles in combination with the CT-rich motif. Thus, functional tests of sequence motifs identified through analysis of molecular evolutionary signatures provide a powerful path for efficiently characterizing the transcriptional regulation of miRNA genes. PMID:27140965

  19. Motif-Driven Design of Protein-Protein Interfaces.

    Science.gov (United States)

    Silva, Daniel-Adriano; Correia, Bruno E; Procko, Erik

    2016-01-01

    Protein-protein interfaces regulate many critical processes for cellular function. The ability to accurately control and regulate these molecular interactions is of major interest for biomedical and synthetic biology applications, as well as to address fundamental biological questions. In recent years, computational protein design has emerged as a tool for designing novel protein-protein interactions with functional relevance. Although attractive, these computational tools carry a steep learning curve. In order to make some of these methods more accessible, we present detailed descriptions and examples of ROSETTA computational protocols for the design of functional protein binders using seeded protein interface design. In these protocols, a motif of known structure that interacts with the target site is grafted into a scaffold protein, followed by design of the surrounding interaction surface. PMID:27094298

  20. Appearance of the bulk motif in Al clusters

    Science.gov (United States)

    Sun, Jiao; Lu, Wen-Cai; Li, Ze-Sheng; Wang, C. Z.; Ho, K. M.

    2008-07-01

    We have performed an unbiased search for the lowest-energy structures of medium-sized aluminum clusters Aln (n=19-26) using a genetic algorithm (GA) coupled with a tight-binding interatomic potential. Structural candidates obtained from our GA search were further optimized using density functional theory. It is found that the double icosahedron is not the most stable structure for Al19 but serves as the core for Al20 and Al21. The lowest-energy structures of Aln are found to undergo a transition to an aluminum bulk motif above Al23. In particular, the lowest-energy structure of Al26 is almost a fragment of the bulk face-centered-cubic crystal except for the stacking fault at the bottom layer. Anion clusters were also studied.

  1. A cooperative fast annealing coevolutionary algorithm for protein motif extraction

    Institute of Scientific and Technical Information of China (English)

    CHEN Chao; TIAN YuanXin; ZOU XiaoYong; CAI PeiXiang; MO JinYuan

    2007-01-01

    By integrating the cooperative approach with the fast annealing coevolutionary algorithm (FAEA), a so-called cooperative fast annealing coevolutionary algorithm (CFACA) is presented in this paper for the purpose of solving high-dimensional problems. After the partition of the search space in CFACA, each smaller one is then searched by a separate FAEA. The fitness function is evaluated by combining sub-solutions found by each of the FAEAs. It demonstrates that the CFACA outperforms the FAEA in the domain of function optimization, especially in terms of convergence rate. The current algorithm is also applied to a real optimization problem of protein motif extraction. And a satisfactory result has been obtained with the accuracy of prediction achieving 67.0%, which is in agreement with the result in the PROSITE database.

  2. Sequential dynamics in the motif of excitatory coupled elements

    Science.gov (United States)

    Korotkov, Alexander G.; Kazakov, Alexey O.; Osipov, Grigory V.

    2015-11-01

    In this article a new model of motif (small ensemble) of neuron-like elements is proposed. It is built with the use of the generalized Lotka-Volterra model with excitatory couplings. The main motivation for this work comes from the problems of neuroscience where excitatory couplings are proved to be the predominant type of interaction between neurons of the brain. In this paper it is shown that there are two modes depending on the type of coupling between the elements: the mode with a stable heteroclinic cycle and the mode with a stable limit cycle. Our second goal is to examine the chaotic dynamics of the generalized three-dimensional Lotka-Volterra model.

  3. The sword motif 'n Matthew 10:34

    Directory of Open Access Journals (Sweden)

    David C. Sim

    2000-01-01

    Full Text Available 'n Mathew 10:34 Jesus uters a very dificult saying. He claims that he has not come to bring peace, but a sword. The form of this saying does not trace back to the historical Jesus; it is the product of Matthew's redaction of a Q passage which is found 'n a more original form 'n Luke 12:51. What did the evangelist mean when he wrote that Jesus brought a sword? 'n the Hebrew scriptures the sword was acommon symbol for the judgement and punishment of God, and 'n later times it represented a number of themes associated with the eschaton. It is argued 'n this study that Mathew, who was fully immersed 'n the apocalyptic-eschatological traditions of his day, probably used the sword motif 'n Matthew 10:34 to symbolise anumber of important eschatological events.

  4. Study on online community user motif using web usage mining

    Science.gov (United States)

    Alphy, Meera; Sharma, Ajay

    2016-04-01

    The Web usage mining is the application of data mining, which is used to extract useful information from the online community. The World Wide Web contains at least 4.73 billion pages according to Indexed Web and it contains at least 228.52 million pages according Dutch Indexed web on 6th august 2015, Thursday. It’s difficult to get needed data from these billions of web pages in World Wide Web. Here is the importance of web usage mining. Personalizing the search engine helps the web user to identify the most used data in an easy way. It reduces the time consumption; automatic site search and automatic restore the useful sites. This study represents the old techniques to latest techniques used in pattern discovery and analysis in web usage mining from 1996 to 2015. Analyzing user motif helps in the improvement of business, e-commerce, personalisation and improvement of websites.

  5. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  6. The EDLL motif: a potent plant transcriptional activation domain from AP2/ERF transcription factors.

    Science.gov (United States)

    Tiwari, Shiv B; Belachew, Alemu; Ma, Siu Fong; Young, Melinda; Ade, Jules; Shen, Yu; Marion, Colleen M; Holtan, Hans E; Bailey, Adina; Stone, Jeffrey K; Edwards, Leslie; Wallace, Andreah D; Canales, Roger D; Adam, Luc; Ratcliffe, Oliver J; Repetti, Peter P

    2012-06-01

    In plants, the ERF/EREBP family of transcriptional regulators plays a key role in adaptation to various biotic and abiotic stresses. These proteins contain a conserved AP2 DNA-binding domain and several uncharacterized motifs. Here, we describe a short motif, termed 'EDLL', that is present in AtERF98/TDR1 and other clade members from the same AP2 sub-family. We show that the EDLL motif, which has a unique arrangement of acidic amino acids and hydrophobic leucines, functions as a strong activation domain. The motif is transferable to other proteins, and is active at both proximal and distal positions of target promoters. As such, the EDLL motif is able to partly overcome the repression conferred by the AtHB2 transcription factor, which contains an ERF-associated amphiphilic repression (EAR) motif. We further examined the activation potential of EDLL by analysis of the regulation of flowering time by NF-Y (nuclear factor Y) proteins. Genetic evidence indicates that NF-Y protein complexes potentiate the action of CONSTANS in regulation of flowering in Arabidopsis; we show that the transcriptional activation function of CONSTANS can be substituted by direct fusion of the EDLL activation motif to NF-YB subunits. The EDLL motif represents a potent plant activation domain that can be used as a tool to confer transcriptional activation potential to heterologous DNA-binding proteins.

  7. MOMFER: A Search Engine of Thompson's Motif-Index of Folk Literature

    NARCIS (Netherlands)

    Karsdorp, F.; Meulen, M. van der; Meder, Th.; Bosch, A.P.J. van den

    2015-01-01

    More than fifty years after the first edition of Thompson's seminal Motif-Indexof Folk Literature, we present an online search engine tailored to fully disclose the index digitally. This search engine, called MOMFER, greatly enhances the searchability of the Motif-Index and provides exciting new way

  8. GOmotif: A web server for investigating the biological role of protein sequence motifs

    Directory of Open Access Journals (Sweden)

    He Runtao

    2011-09-01

    Full Text Available Abstract Background Many proteins contain conserved sequence patterns (motifs that contribute to their functionality. The process of experimentally identifying and validating novel protein motifs can be difficult, expensive, and time consuming. A means for helping to identify in advance the possible function of a novel motif is important to test hypotheses concerning the biological relevance of these motifs, thus reducing experimental trial-and-error. Results GOmotif accepts PROSITE and regular expression formatted motifs as input and searches a Gene Ontology annotated protein database using motif search tools. The search returns the set of proteins containing matching motifs and their associated Gene Ontology terms. These results are presented as: 1 a hierarchical, navigable tree separated into the three Gene Ontology biological domains - biological process, cellular component, and molecular function; 2 corresponding pie charts indicating raw and statistically adjusted distributions of the results, and 3 an interactive graphical network view depicting the location of the results in the Gene Ontology. Conclusions GOmotif is a web-based tool designed to assist researchers in investigating the biological role of novel protein motifs. GOmotif can be freely accessed at http://www.gomotif.ca

  9. Genome adaptations of a tripartite motif protein for retroviral defense in cattle and sheep

    Science.gov (United States)

    Tripartite motif (TRIM) genes encode proteins composed of RING, B-box, and coiled coil motif domains. Primate TRIM5' has been shown to be a primary determinant of retroviral host cell range restriction in primates. TRIM5 restriction was originally thought to be a primate-specific defense mechanism...

  10. Dynamic consequences of mutating the typical HPGG motif of apocytochrome b5 revealed by computer simulation

    Institute of Scientific and Technical Information of China (English)

    Ying Wu Lin; Tian Lei Ying; Li Fu Liao

    2009-01-01

    Apecytochrome b5 with a typical heme-binding motif of HPGC,and its variants with mutated motifs,GPGG,GPGH,HVGG,and HPGP,have been subjected to molecular dynamics simulation.Comparison of the dynamic consequences has revealed the crucial role of HPGG in assembling the heme group of cytochrome b5 and in modulating protein structure,property and function.

  11. Distinct recognition modes of FXXLF and LXXLL motifs by the androgen receptor.

    NARCIS (Netherlands)

    H.J. Dubbink (Erik Jan); R. Hersmus (Remko); C.S. Verma (Chandra); H.A.G.M. van der Korput (Hetty); C.A. Berrevoets (Cor); J. van Tol (Judith); A.C.J. Ziel-van der Made (Angelique); A.O. Brinkmann (Albert); A.C. Pike (Ashley); J. Trapman (Jan)

    2004-01-01

    textabstractAmong nuclear receptors, the androgen receptor (AR) is unique in that its ligand-binding domain (LBD) interacts with the FXXLF motif in the N-terminal domain, resembling coactivator LXXLL motifs. We compared AR- and estrogen receptor alpha-LBD interactions of the wild-t

  12. An Efficient Exact Algorithm for the Motif Stem Search Problem over Large Alphabets.

    Science.gov (United States)

    Yu, Qiang; Huo, Hongwei; Vitter, Jeffrey Scott; Huan, Jun; Nekrich, Yakov

    2015-01-01

    In recent years, there has been an increasing interest in planted (l, d) motif search (PMS) with applications to discovering significant segments in biological sequences. However, there has been little discussion about PMS over large alphabets. This paper focuses on motif stem search (MSS), which is recently introduced to search motifs on large-alphabet inputs. A motif stem is an l-length string with some wildcards. The goal of the MSS problem is to find a set of stems that represents a superset of all (l , d) motifs present in the input sequences, and the superset is expected to be as small as possible. The three main contributions of this paper are as follows: (1) We build motif stem representation more precisely by using regular expressions. (2) We give a method for generating all possible motif stems without redundant wildcards. (3) We propose an efficient exact algorithm, called StemFinder, for solving the MSS problem. Compared with the previous MSS algorithms, StemFinder runs much faster and reports fewer stems which represent a smaller superset of all (l, d) motifs. StemFinder is freely available at http://sites.google.com/site/feqond/stemfinder. PMID:26357225

  13. A Fast Cluster Motif Finding Algorithm for ChIP-Seq Data Sets.

    Science.gov (United States)

    Zhang, Yipu; Wang, Ping

    2015-01-01

    New high-throughput technique ChIP-seq, coupling chromatin immunoprecipitation experiment with high-throughput sequencing technologies, has extended the identification of binding locations of a transcription factor to the genome-wide regions. However, the most existing motif discovery algorithms are time-consuming and limited to identify binding motifs in ChIP-seq data which normally has the significant characteristics of large scale data. In order to improve the efficiency, we propose a fast cluster motif finding algorithm, named as FCmotif, to identify the (l,  d) motifs in large scale ChIP-seq data set. It is inspired by the emerging substrings mining strategy to find the enriched substrings and then searching the neighborhood instances to construct PWM and cluster motifs in different length. FCmotif is not following the OOPS model constraint and can find long motifs. The effectiveness of proposed algorithm has been proved by experiments on the ChIP-seq data sets from mouse ES cells. The whole detection of the real binding motifs and processing of the full size data of several megabytes finished in a few minutes. The experimental results show that FCmotif has advantageous to deal with the (l,  d) motif finding in the ChIP-seq data; meanwhile it also demonstrates better performance than other current widely-used algorithms such as MEME, Weeder, ChIPMunk, and DREME. PMID:26236718

  14. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  15. cWords - systematic microRNA regulatory motif discovery from mRNA expression data

    DEFF Research Database (Denmark)

    Rasmussen, Simon Horskjær; Jacobsen, Anders; Krogh, Anders

    2013-01-01

    -transcriptional regulation by small RNAs is mediated through partial complementary binding to messenger RNAs leaving nucleotide signatures or motifs throughout the entire transcriptome. Computational methods for discovery and analysis of sequence motifs in high-throughput mRNA expression profiling experiments are becoming...... increasingly important tools for the identification of post-transcriptional regulatory motifs and the inference of the regulators and their targets. RESULTS:cWords is a method designed for regulatory motif discovery in differential case-control mRNA expression datasets. We have improved the algorithms...... and statistical methods of cWords, resulting in at least a factor 100 speed gain over the previous implementation. On a benchmark dataset of 19 microRNA (miRNA) perturbation experiments cWords showed equal or better performance than two comparable methods, miReduce and Sylamer. We have developed rigorous motif...

  16. MOTIFSIM: A web tool for detecting similarity in multiple DNA motif datasets.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2015-07-01

    Currently, there are a number of motif detection tools available that possess unique functionality. These tools often report different motifs, and therefore use of multiple tools is generally advised since common motifs reported by multiple tools are more likely to be biologically significant. However, results produced by these different tools need to be compared and existing similarity detection tools only allow comparison between two data sets. Here, we describe a motif similarity detection tool (MOTIFSIM) possessing a web-based, user-friendly interface that is capable of detecting similarity from multiple DNA motif data sets concurrently. Results can either be viewed online or downloaded. Users may also download and run MOTIFSIM as a command-line tool in stand-alone mode. The web tool, along with its command-line version, user manuals, and source codes, are freely available at http://biogrid-head.engr.uconn.edu/motifsim/. PMID:26156781

  17. RSAT::Plants: Motif Discovery Within Clusters of Upstream Sequences in Plant Genomes.

    Science.gov (United States)

    Contreras-Moreira, Bruno; Castro-Mondragon, Jaime A; Rioualen, Claire; Cantalapiedra, Carlos P; van Helden, Jacques

    2016-01-01

    The plant-dedicated mirror of the Regulatory Sequence Analysis Tools (RSAT, http://plants.rsat.eu ) offers specialized options for researchers dealing with plant transcriptional regulation. The website contains whole-sequenced genomes from species regularly updated from Ensembl Plants and other sources (currently 40), and supports an array of tasks frequently required for the analysis of regulatory sequences, such as retrieving upstream sequences, motif discovery, motif comparison, and pattern matching. RSAT::Plants also integrates the footprintDB collection of DNA motifs. This protocol explains step-by-step how to discover DNA motifs in regulatory regions of clusters of co-expressed genes in plants. It also explains how to empirically control the significance of the result, and how to associate the discovered motifs with putative binding factors. PMID:27557774

  18. Selection for the G4 DNA motif at the 5' end of human genes.

    Science.gov (United States)

    Eddy, Johanna; Maizels, Nancy

    2009-04-01

    Formation of G4 DNA may occur in the course of replication and transcription, and contribute to genomic instability. We have quantitated abundance of G4 motifs and potential for G4 DNA formation of the nontemplate strand of 5' exons and introns of transcripts of human genes. We find that, for all human genes, G4 motifs are enriched in 5' regions of transcripts relative to downstream regions; and in 5' regulatory regions relative to coding regions. Notably, although tumor suppressor genes are depleted and proto-oncogenes enriched in G4 motifs, abundance of G4 motifs in the 5' regions of transcripts of genes in these categories does not differ. These results support the hypothesis that G4 motifs are under selection in the human genome. They further show that for tumor suppressor genes and proto-oncogenes, independent selection determines potential for G4 DNA formation of 5' regulatory regions of transcripts and downstream coding regions.

  19. The Land of the Dead – International Motifs in the Oldest Work of Japanese Literature

    Directory of Open Access Journals (Sweden)

    Danijela Vasić

    2010-02-01

    Full Text Available Il existe dans le Kojiki (712, la plus ancienne œuvre littéraire du Japon, une abondance de motifs que l’on peut retrouver dans les cultures de nombreux peuples dans le monde entier. Cet article traite des motifs internationaux tissés dans deux mythes du premier tome, formant une image poétique du Pays des morts, la partie souterraine d’une structure cosmique tripartite. Sont abordés, entre autres, le motif largement connu de Perséphone, le motif orphique ou encore le motif de la fuite du Pays des morts.In the Kojiki (712, the oldest literary work of Japan, there is a plethora of motifs which could be found in the cultures of many peoples all over the world. This paper deals with the international motifs interwoven in two myths from the first volume, forming a poetic picture of the Land of the Dead, the underworld part of the trichotomic cosmic structure. Among other things, we find the widely known Persephone motif, the Orphic motif or the motif of the successful escape from the Land of the Dead.En Kojiki (712, la obra literaria más antigua de Japón, abundan motivos que pueden encontrarse en numerosas culturas de todo el mundo. Este artículo analiza los motivos internacionales entretejidos en dos mitos del primer volumen, los cuales forman una imagen poética del País de los Muertos, la sección subterránea de una estructura cósmica tripartita. Se abordan, entre otros, el famoso motivo de Perséfone, el motivo órfico de la huída exitosa del País de los Muertos.

  20. Bioinformatics Study of Cancer-Related Mutations within p53 Phosphorylation Site Motifs

    Directory of Open Access Journals (Sweden)

    Xiaona Ji

    2014-07-01

    Full Text Available p53 protein has about thirty phosphorylation sites located at the N- and C-termini and in the core domain. The phosphorylation sites are relatively less mutated than other residues in p53. To understand why and how p53 phosphorylation sites are rarely mutated in human cancer, using a bioinformatics approaches, we examined the phosphorylation site and its nearby flanking residues, focusing on the consensus phosphorylation motif pattern, amino-acid correlations within the phosphorylation motifs, the propensity of structural disorder of the phosphorylation motifs, and cancer mutations observed within the phosphorylation motifs. Many p53 phosphorylation sites are targets for several kinases. The phosphorylation sites match 17 consensus sequence motifs out of the 29 classified. In addition to proline, which is common in kinase specificity-determining sites, we found high propensity of acidic residues to be adjacent to phosphorylation sites. Analysis of human cancer mutations in the phosphorylation motifs revealed that motifs with adjacent acidic residues generally have fewer mutations, in contrast to phosphorylation sites near proline residues. p53 phosphorylation motifs are mostly disordered. However, human cancer mutations within phosphorylation motifs tend to decrease the disorder propensity. Our results suggest that combination of acidic residues Asp and Glu with phosphorylation sites provide charge redundancy which may safe guard against loss-of-function mutations, and that the natively disordered nature of p53 phosphorylation motifs may help reduce mutational damage. Our results further suggest that engineering acidic amino acids adjacent to potential phosphorylation sites could be a p53 gene therapy strategy.

  1. One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

    Science.gov (United States)

    Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

    2014-12-01

    G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs.

  2. The bridge: suggestions about the meaning of a pictorial motif

    Directory of Open Access Journals (Sweden)

    Omar Calabrese

    2011-12-01

    Full Text Available Developing research begun at the Warburg Institute in 1983, this paper reflects on the construction of meaning in a work of art, through the analysis of the bridge’s function in painting. It tries to reply to some objections the author received there from Gombrich, about the chance of finding a stable content in the configuration of the bridge. Hence, the study reconsiders the concept of ‘motif’ applied to this structure. In a semiotic perspective a motif is partially independent as regards to a single textual organization, because it has a mobile and migrant feature. However, it is also partially flexible as it depends upon the same organization. The inquiry shows that bridge’s internal structure corresponds to the category of a ‘junction’, with two opposite items, ‘conjunction’ and ‘disjunction’. The development of this theoretical object can be carried out also by figures that are not ‘bridges’, in the natural sense of the word. Furthermore, its meaning does not depend upon the number of examples we can find but only upon their relevance for constructing a ‘grammar of cases’. Differently from the traditional iconographical approach, but also from panofskian iconology, the analysis moves not only towards the simple or complex content of a figure but also towards its description.

  3. Network motifs that stabilize the hybrid epithelial/mesenchymal phenotype

    Science.gov (United States)

    Jolly, Mohit Kumar; Jia, Dongya; Tripathi, Satyendra; Hanash, Samir; Mani, Sendurai; Ben-Jacob, Eshel; Levine, Herbert

    Epithelial to Mesenchymal Transition (EMT) and its reverse - MET - are hallmarks of cancer metastasis. While transitioning between E and M phenotypes, cells can also attain a hybrid epithelial/mesenchymal (E/M) phenotype that enables collective cell migration as a cluster of Circulating Tumor Cells (CTCs). These clusters can form 50-times more tumors than individually migrating CTCs, underlining their importance in metastasis. However, this hybrid E/M phenotype has been hypothesized to be only a transient one that is attained en route EMT. Here, via mathematically modeling, we identify certain `phenotypic stability factors' that couple with the core three-way decision-making circuit (miR-200/ZEB) and can maintain or stabilize the hybrid E/M phenotype. Further, we show experimentally that this phenotype can be maintained stably at a single-cell level, and knockdown of these factors impairs collective cell migration. We also show that these factors enable the association of hybrid E/M with high stemness or tumor-initiating potential. Finally, based on these factors, we deduce specific network motifs that can maintain the E/M phenotype. Our framework can be used to elucidate the effect of other players in regulating cellular plasticity during metastasis. This work was supported by NSF PHY-1427654 (Center for Theoretical Biological Physics) and the CPRIT Scholar in Cancer Research of the State of Texas at Rice University.

  4. Ab initio coordination chemistry for nickel chelation motifs.

    Science.gov (United States)

    Sudan, R Jesu Jaya; Kumari, J Lesitha Jeeva; Sudandiradoss, C

    2015-01-01

    Chelation therapy is one of the most appreciated methods in the treatment of metal induced disease predisposition. Coordination chemistry provides a way to understand metal association in biological structures. In this work we have implemented coordination chemistry to study nickel coordination due to its high impact in industrial usage and thereby health consequences. This paper reports the analysis of nickel coordination from a large dataset of nickel bound structures and sequences. Coordination patterns predicted from the structures are reported in terms of donors, chelate length, coordination number, chelate geometry, structural fold and architecture. The analysis revealed histidine as the most favored residue in nickel coordination. The most common chelates identified were histidine based namely HHH, HDH, HEH and HH spaced at specific intervals. Though a maximum coordination number of 8 was observed, the presence of a single protein donor was noted to be mandatory in nickel coordination. The coordination pattern did not reveal any specific fold, nevertheless we report preferable residue spacing for specific structural architecture. In contrast, the analysis of nickel binding proteins from bacterial and archeal species revealed no common coordination patterns. Nickel binding sequence motifs were noted to be organism specific and protein class specific. As a result we identified about 13 signatures derived from 13 classes of nickel binding proteins. The specifications on nickel coordination presented in this paper will prove beneficial for developing better chelation strategies.

  5. Complexe de Poids, Dualit\\'e et Motifs de Beilinson

    CERN Document Server

    Hébert, David

    2010-01-01

    In the article [GS96], Gillet and Soul\\'e define a weight complex on the category of Voevodsky motives over a field of characteristic 0. In [Bon07], Bondarko generalizes this construction for any f-category with a bounded weight structure, as is the case for Beilinson motives (following Cisinski-D\\'eglise ; [CD09]). The first purpose of this note is to generalize [GS96, thm. 2] in the world of Beilinson motives. This done, we will naturally be led to define the motivic Euler characteristic dual to that considered by Bondarko in [Bon10]. This fact will motivate the second line of this note : proving that the duality operation exchanges the weight as is the case for t-structure ([BBD, 5.1.14.(iii)]). ----- Dans l'article [GS96], Gillet et Soul\\'e d\\'efinissent un complexe de poids sur la cat\\'egorie des motifs de Voevodsky d\\'efinie sur un corps de caract\\'eristique 0. Dans [Bon07], Bondarko g\\'en\\'eralise cette construction pour toute f-cat\\'egorie munie d'une structure de poids born\\'ee, comme c'est le cas po...

  6. Sulfur-induced structural motifs on copper and gold surfaces

    Science.gov (United States)

    Walen, Holly

    The interaction of sulfur with copper and gold surfaces plays a fundamental role in important phenomena that include coarsening of surface nanostructures, and self-assembly of alkanethiols. Here, we identify and analyze unique sulfur-induced structural motifs observed on the low-index surfaces of these two metals. We seek out these structures in an effort to better understand the fundamental interactions between these metals and sulfur that lends to the stability and favorability of metal-sulfur complexes vs. chemisorbed atomic sulfur. We choose very specific conditions: very low temperature (5 K), and very low sulfur coverage (≤ 0.1 monolayer). In this region of temperature-coverage space, which has not been examined previously for these adsorbate-metal systems, the effects of individual interactions between metals and sulfur are most apparent and can be assessed extensively with the aid of theory and modeling. Furthermore, at this temperature diffusion is minimal and relatively-mobile species can be isolated, and at low coverage the structures observed are not consumed by an extended reconstruction. The primary experimental technique is scanning tunneling microscopy (STM). The experimental observations presented here---made under identical conditions---together with extensive DFT analyses, allow comparisons and insights into factors that favor the existence of metal-sulfur complexes, vs. chemisorbed atomic sulfur, on metal terraces. We believe this data will be instrumental in better understanding the complex phenomena occurring between the surfaces of coinage metals and sulfur.

  7. VAMP subfamilies identified by specific R-SNARE motifs.

    Science.gov (United States)

    Rossi, Valeria; Picco, Raffaella; Vacca, Marcella; D'Esposito, Maurizio; D'Urso, Michele; Galli, Thierry; Filippini, Francesco

    2004-05-01

    In eukaryotes, interactions among the alpha-helical coiled-coil domains (CCDs) of soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNAREs) play a pivotal role in mediating the fusion among vesicles and target membranes. Surface residues of such CCDs are major candidates to regulate the specificity of membrane fusion, as they may alter local charge at the interaction layers and surface of the fusion complex, possibly modulating its formation and/or the binding of non-SNARE regulatory factors. Based on alternate patterns in surface residues, we have identified two motifs which group vesicular SNAREs in two novel subfamilies: RG-SNAREs and RD-SNAREs. The RG-SNARE CCD is common to all members of the widely conserved family of long VAMPs or longins and to yeast and non-neuronal VAMPs, possibly mediating "basic" fusion mechanisms; instead, only synaptobrevins from Bilateria share an RD-SNARE CCD, which is likely to mediate interactions to specific, yet unknown, regulatory factors and/or be the landmark of rapid fusion reactions like that mediating the release of neurotransmitters.

  8. Tyrosine motifs are required for prestin basolateral membrane targeting

    Directory of Open Access Journals (Sweden)

    Yifan Zhang

    2015-01-01

    Full Text Available Prestin is targeted to the lateral wall of outer hair cells (OHCs where its electromotility is critical for cochlear amplification. Using MDCK cells as a model system for polarized epithelial sorting, we demonstrate that prestin uses tyrosine residues, in a YXXΦ motif, to target the basolateral surface. Both Y520 and Y667 are important for basolateral targeting of prestin. Mutation of these residues to glutamine or alanine resulted in retention within the Golgi and delayed egress from the Golgi in Y667Q. Basolateral targeting is restored upon mutation to phenylalanine suggesting the importance of a phenol ring in the tyrosine side chain. We also demonstrate that prestin targeting to the basolateral surface is dependent on AP1B (μ1B, and that prestin uses transferrin containing early endosomes in its passage from the Golgi to the basolateral plasma membrane. The presence of AP1B (μ1B in OHCs, and parallels between prestin targeting to the basolateral surface of OHCs and polarized epithelial cells suggest that outer hair cells resemble polarized epithelia rather than neurons in this important phenotypic measure.

  9. Ab initio coordination chemistry for nickel chelation motifs.

    Directory of Open Access Journals (Sweden)

    R Jesu Jaya Sudan

    Full Text Available Chelation therapy is one of the most appreciated methods in the treatment of metal induced disease predisposition. Coordination chemistry provides a way to understand metal association in biological structures. In this work we have implemented coordination chemistry to study nickel coordination due to its high impact in industrial usage and thereby health consequences. This paper reports the analysis of nickel coordination from a large dataset of nickel bound structures and sequences. Coordination patterns predicted from the structures are reported in terms of donors, chelate length, coordination number, chelate geometry, structural fold and architecture. The analysis revealed histidine as the most favored residue in nickel coordination. The most common chelates identified were histidine based namely HHH, HDH, HEH and HH spaced at specific intervals. Though a maximum coordination number of 8 was observed, the presence of a single protein donor was noted to be mandatory in nickel coordination. The coordination pattern did not reveal any specific fold, nevertheless we report preferable residue spacing for specific structural architecture. In contrast, the analysis of nickel binding proteins from bacterial and archeal species revealed no common coordination patterns. Nickel binding sequence motifs were noted to be organism specific and protein class specific. As a result we identified about 13 signatures derived from 13 classes of nickel binding proteins. The specifications on nickel coordination presented in this paper will prove beneficial for developing better chelation strategies.

  10. FR3D: finding local and composite recurrent structural motifs in RNA 3D structures.

    Science.gov (United States)

    Sarver, Michael; Zirbel, Craig L; Stombaugh, Jesse; Mokdad, Ali; Leontis, Neocles B

    2008-01-01

    New methods are described for finding recurrent three-dimensional (3D) motifs in RNA atomic-resolution structures. Recurrent RNA 3D motifs are sets of RNA nucleotides with similar spatial arrangements. They can be local or composite. Local motifs comprise nucleotides that occur in the same hairpin or internal loop. Composite motifs comprise nucleotides belonging to three or more different RNA strand segments or molecules. We use a base-centered approach to construct efficient, yet exhaustive search procedures using geometric, symbolic, or mixed representations of RNA structure that we implement in a suite of MATLAB programs, "Find RNA 3D" (FR3D). The first modules of FR3D preprocess structure files to classify base-pair and -stacking interactions. Each base is represented geometrically by the position of its glycosidic nitrogen in 3D space and by the rotation matrix that describes its orientation with respect to a common frame. Base-pairing and base-stacking interactions are calculated from the base geometries and are represented symbolically according to the Leontis/Westhof basepairing classification, extended to include base-stacking. These data are stored and used to organize motif searches. For geometric searches, the user supplies the 3D structure of a query motif which FR3D uses to find and score geometrically similar candidate motifs, without regard to the sequential position of their nucleotides in the RNA chain or the identity of their bases. To score and rank candidate motifs, FR3D calculates a geometric discrepancy by rigidly rotating candidates to align optimally with the query motif and then comparing the relative orientations of the corresponding bases in the query and candidate motifs. Given the growing size of the RNA structure database, it is impossible to explicitly compute the discrepancy for all conceivable candidate motifs, even for motifs with less than ten nucleotides. The screening algorithm that we describe finds all candidate motifs whose

  11. RNA 3D Structural Motifs: Definition, Identification, Annotation, and Database Searching

    Science.gov (United States)

    Nasalean, Lorena; Stombaugh, Jesse; Zirbel, Craig L.; Leontis, Neocles B.

    Structured RNA molecules resemble proteins in the hierarchical organization of their global structures, folding and broad range of functions. Structured RNAs are composed of recurrent modular motifs that play specific functional roles. Some motifs direct the folding of the RNA or stabilize the folded structure through tertiary interactions. Others bind ligands or proteins or catalyze chemical reactions. Therefore, it is desirable, starting from the RNA sequence, to be able to predict the locations of recurrent motifs in RNA molecules. Conversely, the potential occurrence of one or more known 3D RNA motifs may indicate that a genomic sequence codes for a structured RNA molecule. To identify known RNA structural motifs in new RNA sequences, precise structure-based definitions are needed that specify the core nucleotides of each motif and their conserved interactions. By comparing instances of each recurrent motif and applying base pair isosteriCity relations, one can identify neutral mutations that preserve its structure and function in the contexts in which it occurs.

  12. Network motifs in integrated cellular networks of transcription-regulation and protein-protein interaction

    Science.gov (United States)

    Yeger-Lotem, Esti; Sattath, Shmuel; Kashtan, Nadav; Itzkovitz, Shalev; Milo, Ron; Pinter, Ron Y.; Alon, Uri; Margalit, Hanah

    2004-04-01

    Genes and proteins generate molecular circuitry that enables the cell to process information and respond to stimuli. A major challenge is to identify characteristic patterns in this network of interactions that may shed light on basic cellular mechanisms. Previous studies have analyzed aspects of this network, concentrating on either transcription-regulation or protein-protein interactions. Here we search for composite network motifs: characteristic network patterns consisting of both transcription-regulation and protein-protein interactions that recur significantly more often than in random networks. To this end we developed algorithms for detecting motifs in networks with two or more types of interactions and applied them to an integrated data set of protein-protein interactions and transcription regulation in Saccharomyces cerevisiae. We found a two-protein mixed-feedback loop motif, five types of three-protein motifs exhibiting coregulation and complex formation, and many motifs involving four proteins. Virtually all four-protein motifs consisted of combinations of smaller motifs. This study presents a basic framework for detecting the building blocks of networks with multiple types of interactions.

  13. Trend Motif: A Graph Mining Approach for Analysis of Dynamic Complex Networks

    Energy Technology Data Exchange (ETDEWEB)

    Jin, R; McCallen, S; Almaas, E

    2007-05-28

    Complex networks have been used successfully in scientific disciplines ranging from sociology to microbiology to describe systems of interacting units. Until recently, studies of complex networks have mainly focused on their network topology. However, in many real world applications, the edges and vertices have associated attributes that are frequently represented as vertex or edge weights. Furthermore, these weights are often not static, instead changing with time and forming a time series. Hence, to fully understand the dynamics of the complex network, we have to consider both network topology and related time series data. In this work, we propose a motif mining approach to identify trend motifs for such purposes. Simply stated, a trend motif describes a recurring subgraph where each of its vertices or edges displays similar dynamics over a userdefined period. Given this, each trend motif occurrence can help reveal significant events in a complex system; frequent trend motifs may aid in uncovering dynamic rules of change for the system, and the distribution of trend motifs may characterize the global dynamics of the system. Here, we have developed efficient mining algorithms to extract trend motifs. Our experimental validation using three disparate empirical datasets, ranging from the stock market, world trade, to a protein interaction network, has demonstrated the efficiency and effectiveness of our approach.

  14. A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

    Directory of Open Access Journals (Sweden)

    Asita Elengoe

    2015-01-01

    Full Text Available Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD of heat shock 70 kDa protein (PDB: 1HJO with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD simulation. Human DNA binding domain of p53 motif (SCMGGMNR retrieved from UniProt (UniProtKB: P04637 was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.

  15. MODA: an efficient algorithm for network motif discovery in biological networks.

    Science.gov (United States)

    Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

    2009-10-01

    In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/ PMID:20154426

  16. MODA: an efficient algorithm for network motif discovery in biological networks.

    Science.gov (United States)

    Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

    2009-10-01

    In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/

  17. Recurrent motifs as resonant attractor states in the narrative field: a testable model of archetype.

    Science.gov (United States)

    Goodwyn, Erik

    2013-06-01

    At the most basic level, archetypes represented Jung's attempt to explain the phenomenon of recurrent myths and folktale motifs (Jung 1956, 1959, para. 99). But the archetype remains controversial as an explanation of recurrent motifs, as the existence of recurrent motifs does not prove that archetypes exist. Thus, the challenge for contemporary archetype theory is not merely to demonstrate that recurrent motifs exist, since that is not disputed, but to demonstrate that archetypes exist and cause recurrent motifs. The present paper proposes a new model which is unlike others in that it postulates how the archetype creates resonant motifs. This model necessarily clarifies and adapts some of Jung's seminal ideas on archetype in order to provide a working framework grounded in contemporary practice and methodologies. For the first time, a model of archetype is proposed that can be validated on empirical, rather than theoretical grounds. This is achieved by linking the archetype to the hard data of recurrent motifs rather than academic trends in other fields.

  18. Recurrent motifs as resonant attractor states in the narrative field: a testable model of archetype.

    Science.gov (United States)

    Goodwyn, Erik

    2013-06-01

    At the most basic level, archetypes represented Jung's attempt to explain the phenomenon of recurrent myths and folktale motifs (Jung 1956, 1959, para. 99). But the archetype remains controversial as an explanation of recurrent motifs, as the existence of recurrent motifs does not prove that archetypes exist. Thus, the challenge for contemporary archetype theory is not merely to demonstrate that recurrent motifs exist, since that is not disputed, but to demonstrate that archetypes exist and cause recurrent motifs. The present paper proposes a new model which is unlike others in that it postulates how the archetype creates resonant motifs. This model necessarily clarifies and adapts some of Jung's seminal ideas on archetype in order to provide a working framework grounded in contemporary practice and methodologies. For the first time, a model of archetype is proposed that can be validated on empirical, rather than theoretical grounds. This is achieved by linking the archetype to the hard data of recurrent motifs rather than academic trends in other fields. PMID:23750942

  19. Fission yeast hotspot sequence motifs are also active in budding yeast.

    Directory of Open Access Journals (Sweden)

    Walter W Steiner

    Full Text Available In most organisms, including humans, meiotic recombination occurs preferentially at a limited number of sites in the genome known as hotspots. There has been substantial progress recently in elucidating the factors determining the location of meiotic recombination hotspots, and it is becoming clear that simple sequence motifs play a significant role. In S. pombe, there are at least five unique sequence motifs that have been shown to produce hotspots of recombination, and it is likely that there are more. In S. cerevisiae, simple sequence motifs have also been shown to produce hotspots or show significant correlations with hotspots. Some of the hotspot motifs in both yeasts are known or suspected to bind transcription factors (TFs, which are required for the activity of those hotspots. Here we show that four of the five hotspot motifs identified in S. pombe also create hotspots in the distantly related budding yeast S. cerevisiae. For one of these hotspots, M26 (also called CRE, we identify TFs, Cst6 and Sko1, that activate and inhibit the hotspot, respectively. In addition, two of the hotspot motifs show significant correlations with naturally occurring hotspots. The conservation of these hotspots between the distantly related fission and budding yeasts suggests that these sequence motifs, and others yet to be discovered, may function widely as hotspots in many diverse organisms.

  20. Comparative Analysis of Regulatory Motif Discovery Tools for Transcription Factor Binding Sites

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    In the post-genomic era, identification of specific regulatory motifs or transcription factor binding sites (TFBSs) in non-coding DNA sequences, which is essential to elucidate transcriptional regulatory networks, has emerged as an obstacle that frustrates many researchers. Consequently, numerous motif discovery tools and correlated databases have been applied to solving this problem. However, these existing methods, based on different computational algorithms, show diverse motif prediction efficiency in non-coding DNA sequences. Therefore, understanding the similarities and differences of computational algorithms and enriching the motif discovery literatures are important for users to choose the most appropriate one among the online available tools. Moreover, there still lacks credible criterion to assess motif discovery tools and instructions for researchers to choose the best according to their own projects. Thus integration of the related resources might be a good approach to improve accuracy of the application. Recent studies integrate regulatory motif discovery tools with experimental methods to offer a complementary approach for researchers, and also provide a much-needed model for current researches on transcriptional regulatory networks. Here we present a comparative analysis of regulatory motif discovery tools for TFBSs.

  1. Identification of a putative nuclear export signal motif in human NANOG homeobox domain

    International Nuclear Information System (INIS)

    Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif (125MQELSNILNL134) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant with both NLSs deleted and only the putative NES motif contained (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD only containing either one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. Furthermore, CRM1-specific nuclear export inhibitor LMB blocked the hNANOG potent NES-mediated export, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in CRM1-mediated nuclear export pathway.

  2. Minimal motif peptide structure of metzincin clan zinc peptidases in micelles.

    Science.gov (United States)

    Onoda, Akira; Suzuki, Takako; Ishizuka, Hiroaki; Sugiyama, Rumiko; Ariyasu, Shinya; Yamamura, Takeshi

    2009-12-01

    It is well known that the functions of metalloproteins generally originate from their metal-binding motifs. However, the intrinsic nature of individual motifs remains unknown, particularly the details about metal-binding effects on the folding of motifs; the converse is also unknown, although there is no doubt that the motif is the core of the reactivity for each metalloprotein. In this study, we focused our attention on the zinc-binding motif of the metzincin clan family, HEXXHXXGXXH; this family contains the general zinc-binding sequence His-Glu-Xaa-Xaa-His (HEXXH) and the extended GXXH region. We adopted the motif sequence of stromelysin-1 and investigated the folding properties of the Trp-labeled peptides WAHEIAHSLGLFHA (STR-W1), AWHEIAHSLGLFHA (STR-W2), AHEIAHSLGWFHA (STR-W11), and AHEIAHSLGLFHWA (STR-W14) in the presence and absence of zinc ions in hydrophobic micellar environments by circular dichroism (CD) measurements. We accessed successful incorporation of these zinc peptides into micelles using quenching of Trp fluorescence. Results of CD studies indicated that two of the Trp-incorporated peptides, STR-W1 and STR-W14, exhibited helical folding in the hydrophobic region of cetyltrimethylammonium chloride micelle. The NMR structural analysis of the apo STR-W14 revealed that the conformation in the C-terminus GXXH region significantly differred between the apo state in the micelle and the reported Zn-bound state of stromelysin-1 in crystal structures. The structural analyses of the qualitative Zn-binding properties of this motif peptide provide an interesting Zn-binding mechanism: the minimum consensus motif in the metzincin clan, a basic zinc-binding motif with an extended GXXH region, has the potential to serve as a preorganized Zn binding scaffold in a hydrophobic environment.

  3. Miz-1 activates gene expression via a novel consensus DNA binding motif.

    Science.gov (United States)

    Barrilleaux, Bonnie L; Burow, Dana; Lockwood, Sarah H; Yu, Abigail; Segal, David J; Knoepfler, Paul S

    2014-01-01

    The transcription factor Miz-1 can either activate or repress gene expression in concert with binding partners including the Myc oncoprotein. The genomic binding of Miz-1 includes both core promoters and more distal sites, but the preferred DNA binding motif of Miz-1 has been unclear. We used a high-throughput in vitro technique, Bind-n-Seq, to identify two Miz-1 consensus DNA binding motif sequences--ATCGGTAATC and ATCGAT (Mizm1 and Mizm2)--bound by full-length Miz-1 and its zinc finger domain, respectively. We validated these sequences directly as high affinity Miz-1 binding motifs. Competition assays using mutant probes indicated that the binding affinity of Miz-1 for Mizm1 and Mizm2 is highly sequence-specific. Miz-1 strongly activates gene expression through the motifs in a Myc-independent manner. MEME-ChIP analysis of Miz-1 ChIP-seq data in two different cell types reveals a long motif with a central core sequence highly similar to the Mizm1 motif identified by Bind-n-Seq, validating the in vivo relevance of the findings. Miz-1 ChIP-seq peaks containing the long motif are predominantly located outside of proximal promoter regions, in contrast to peaks without the motif, which are highly concentrated within 1.5 kb of the nearest transcription start site. Overall, our results indicate that Miz-1 may be directed in vivo to the novel motif sequences we have identified, where it can recruit its specific binding partners to control gene expression and ultimately regulate cell fate. PMID:24983942

  4. Miz-1 activates gene expression via a novel consensus DNA binding motif.

    Directory of Open Access Journals (Sweden)

    Bonnie L Barrilleaux

    Full Text Available The transcription factor Miz-1 can either activate or repress gene expression in concert with binding partners including the Myc oncoprotein. The genomic binding of Miz-1 includes both core promoters and more distal sites, but the preferred DNA binding motif of Miz-1 has been unclear. We used a high-throughput in vitro technique, Bind-n-Seq, to identify two Miz-1 consensus DNA binding motif sequences--ATCGGTAATC and ATCGAT (Mizm1 and Mizm2--bound by full-length Miz-1 and its zinc finger domain, respectively. We validated these sequences directly as high affinity Miz-1 binding motifs. Competition assays using mutant probes indicated that the binding affinity of Miz-1 for Mizm1 and Mizm2 is highly sequence-specific. Miz-1 strongly activates gene expression through the motifs in a Myc-independent manner. MEME-ChIP analysis of Miz-1 ChIP-seq data in two different cell types reveals a long motif with a central core sequence highly similar to the Mizm1 motif identified by Bind-n-Seq, validating the in vivo relevance of the findings. Miz-1 ChIP-seq peaks containing the long motif are predominantly located outside of proximal promoter regions, in contrast to peaks without the motif, which are highly concentrated within 1.5 kb of the nearest transcription start site. Overall, our results indicate that Miz-1 may be directed in vivo to the novel motif sequences we have identified, where it can recruit its specific binding partners to control gene expression and ultimately regulate cell fate.

  5. Salt-bridge Swapping in the EXXERFXYY Motif of Proton Coupled Oligopeptide Transporters

    DEFF Research Database (Denmark)

    Aduri, Nanda G; Prabhala, Bala K; Ernst, Heidi A;

    2015-01-01

    to as E1XXE2R), located on Helix I, in interactions with the proton. In this study we investigated the intracellular substrate accumulation by motif variants with all possible combinations of glutamate residues changed to glutamine and arginine changed to a tyrosine; the latter being a natural variant......-motif salt bridge, i.e. R-E2 to R-E1, which is consistent with previous structural studies. Molecular dynamics simulations of the motif variants E1XXE2R and E1XXQ2R support this mechanism. The simulations showed that upon changing conformation, arginine pushes Helix V, through interactions with the highly...

  6. Analysis of the characteristic sequence of intein and revi-sion of its motifs

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    Since the first intein (Sce VMA) was found in Saccharomydes cerevisiae ATPases gene in 1990, more and more inteins were identified. It is necessary to analyze the new inteins to understand the sequence charateristics of inteins. By searching protein and nucleic acid database systematically, 101 inteins were found, of which 69 inteins contain homing endonuclease motifs. We only analyze the 69 inteins since most inteins are the classic inteins with homing endonuclease motifs. We found that the distribution of these inteins is particular among species and protein. By multiple sequence alignment, some new sequence characteristics were found and the motifs described previously were revised.

  7. FTZ-Factor1 and Fushi tarazu interact via conserved nuclear receptor and coactivator motifs

    OpenAIRE

    Schwartz, Carol J.E.; Sampson, Heidi M.; Hlousek, Daniela; Percival-Smith, Anthony; Copeland, John W.R.; Simmonds, Andrew J.; Krause, Henry M.

    2001-01-01

    To activate transcription, most nuclear receptor proteins require coactivators that bind to their ligand-binding domains (LBDs). The Drosophila FTZ-Factor1 (FTZ-F1) protein is a conserved member of the nuclear receptor superfamily, but was previously thought to lack an AF2 motif, a motif that is required for ligand and coactivator binding. Here we show that FTZ-F1 does have an AF2 motif and that it is required to bind a coactivator, the homeodomain-containing protein Fushi tarazu (FTZ). We al...

  8. Identifying Function, Agent, and Setting Motifs in Some Early Spanish "libros de caballerías"

    OpenAIRE

    NEUMAYER, KRISTIN

    2012-01-01

    The essay presents the methodology of a doctoral thesis (2008, University of Wisconsin-Madison) which classifies plot motifs in some sixteenth-century Castilian books of chivalry. Therein, two critical approaches to the texts are noted: motif studies, which analyze narrative components, and structural studies, which examine whole plotlines. Based on V. Propp’s Morphology of the Folktale, the motif is defined as a unit of plot structure. Propp’s thirty-one functions and seven tale-roles are th...

  9. Evaluation of subgraph searching algorithms for detecting network motifs in biological networks

    Institute of Scientific and Technical Information of China (English)

    Jialu HU; Lin GAO; Guimin QIN

    2009-01-01

    Despite several algorithms for searching sub-graphs in motif detection presented in the literature, no ef-fort has been done for characterizing their performance till now. This paper presents a methodology to evaluate the performance of three algorithms: edge sampling algorithm (ESA), enumerate subgraphs (ESU) and randomly enumer-ate subgraphs (RAND-ESU). A series of experiments are performed to test sampling speed and sampling quality. The results show that RAND-ESU is more efficient and has less computational cost than other algorithms for large-size mo-tif detection, and ESU has its own advantage in small-size motif detection.

  10. Stochastic Resonance in Neuronal Network Motifs with Ornstein-Uhlenbeck Colored Noise

    Directory of Open Access Journals (Sweden)

    Xuyang Lou

    2014-01-01

    Full Text Available We consider here the effect of the Ornstein-Uhlenbeck colored noise on the stochastic resonance of the feed-forward-loop (FFL network motif. The FFL motif is modeled through the FitzHugh-Nagumo neuron model as well as the chemical coupling. Our results show that the noise intensity and the correlation time of the noise process serve as the control parameters, which have great impacts on the stochastic dynamics of the FFL motif. We find that, with a proper choice of noise intensities and the correlation time of the noise process, the signal-to-noise ratio (SNR can display more than one peak.

  11. The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) family.

    Science.gov (United States)

    Kelwick, Richard; Desanlis, Ines; Wheeler, Grant N; Edwards, Dylan R

    2015-01-01

    The ADAMTS (A Disintegrin and Metalloproteinase with Thrombospondin motifs) enzymes are secreted, multi-domain matrix-associated zinc metalloendopeptidases that have diverse roles in tissue morphogenesis and patho-physiological remodeling, in inflammation and in vascular biology. The human family includes 19 members that can be sub-grouped on the basis of their known substrates, namely the aggrecanases or proteoglycanases (ADAMTS1, 4, 5, 8, 9, 15 and 20), the procollagen N-propeptidases (ADAMTS2, 3 and 14), the cartilage oligomeric matrix protein-cleaving enzymes (ADAMTS7 and 12), the von-Willebrand Factor proteinase (ADAMTS13) and a group of orphan enzymes (ADAMTS6, 10, 16, 17, 18 and 19). Control of the structure and function of the extracellular matrix (ECM) is a central theme of the biology of the ADAMTS, as exemplified by the actions of the procollagen-N-propeptidases in collagen fibril assembly and of the aggrecanases in the cleavage or modification of ECM proteoglycans. Defects in certain family members give rise to inherited genetic disorders, while the aberrant expression or function of others is associated with arthritis, cancer and cardiovascular disease. In particular, ADAMTS4 and 5 have emerged as therapeutic targets in arthritis. Multiple ADAMTSs from different sub-groupings exert either positive or negative effects on tumorigenesis and metastasis, with both metalloproteinase-dependent and -independent actions known to occur. The basic ADAMTS structure comprises a metalloproteinase catalytic domain and a carboxy-terminal ancillary domain, the latter determining substrate specificity and the localization of the protease and its interaction partners; ancillary domains probably also have independent biological functions. Focusing primarily on the aggrecanases and proteoglycanases, this review provides a perspective on the evolution of the ADAMTS family, their links with developmental and disease mechanisms, and key questions for the future. PMID:26025392

  12. Elongated polyproline motifs facilitate enamel evolution through matrix subunit compaction.

    Directory of Open Access Journals (Sweden)

    Tianquan Jin

    2009-12-01

    Full Text Available Vertebrate body designs rely on hydroxyapatite as the principal mineral component of relatively light-weight, articulated endoskeletons and sophisticated tooth-bearing jaws, facilitating rapid movement and efficient predation. Biological mineralization and skeletal growth are frequently accomplished through proteins containing polyproline repeat elements. Through their well-defined yet mobile and flexible structure polyproline-rich proteins control mineral shape and contribute many other biological functions including Alzheimer's amyloid aggregation and prolamine plant storage. In the present study we have hypothesized that polyproline repeat proteins exert their control over biological events such as mineral growth, plaque aggregation, or viscous adhesion by altering the length of their central repeat domain, resulting in dramatic changes in supramolecular assembly dimensions. In order to test our hypothesis, we have used the vertebrate mineralization protein amelogenin as an exemplar and determined the biological effect of the four-fold increased polyproline tandem repeat length in the amphibian/mammalian transition. To study the effect of polyproline repeat length on matrix assembly, protein structure, and apatite crystal growth, we have measured supramolecular assembly dimensions in various vertebrates using atomic force microscopy, tested the effect of protein assemblies on crystal growth by electron microscopy, generated a transgenic mouse model to examine the effect of an abbreviated polyproline sequence on crystal growth, and determined the structure of polyproline repeat elements using 3D NMR. Our study shows that an increase in PXX/PXQ tandem repeat motif length results (i in a compaction of protein matrix subunit dimensions, (ii reduced conformational variability, (iii an increase in polyproline II helices, and (iv promotion of apatite crystal length. Together, these findings establish a direct relationship between polyproline tandem

  13. The GTP binding motif: variations on a theme.

    Science.gov (United States)

    Kjeldgaard, M; Nyborg, J; Clark, B F

    1996-10-01

    GTP binding proteins (G-proteins) have wide-ranging functions in biology, being involved in cell proliferation, signal transduction, protein synthesis, and protein targeting. Common to their functioning is that they are active in the GTP-bound form and inactive in the GDP-bound form. The protein synthesis elongation factor EF-Tu was the first G-protein whose nucleotide binding domain was solved structurally by X-ray crystallography to yield a structural definition of the GDP-bound form, but a still increasing number of new structures of G-proteins are appearing in the literature, in both GDP and GTP bound forms. A common structural core for nucleotide binding is present in all these structures, and this core has long been known to include common consensus sequence elements involved in binding of the nucleotide. Nevertheless, subtle changes in the common sequences reflect functional differences. Therefore, it becomes increasingly important to focus on how these differences are reflected in the structures, and how these structural differences are related to function. The aim of this review is to describe to what extent this structural motif for GDP/GTP binding is common to other known structures of this class of proteins. We first describe the common structural core of the G-proteins. Next, examples are based on information available on the Ras protein superfamily, the targeting protein ARF, elongation factors EF-Tu and EF-G, and the heterotrimeric G-proteins. Finally, we discuss the important structures of complexes between GTP binding proteins and their substrates that have appeared in the literature recently.

  14. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS; this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  15. Rice bZIP protein, REB, interacts with GCN4 motif in promoter of Waxy gene

    Institute of Scientific and Technical Information of China (English)

    CHENG; Shijun; (程世军); WANG; Zongyang(王宗阳); HONG; Mengmin(洪孟民)

    2002-01-01

    A bifactorial endosperm box (EB), which contains an endosperm motif (EM) and a GCN4 motif, was found in rice Wx promoter. EB was found in 5′ upstream region of many seed storage protein genes accounting for these genes expression exclusive in endosperm among various cereals. Many reports demonstrated that the bZIP transcription activators isolated from wheat, barley and maize, etc. regulate the gene expression through binding to the GCN4 motif. In this research, we showed that GCN4 sequence could be recognized by nuclear proteins extracted from immature rice seeds. Furthermore, a rice bZIP protein, REB was isolated by using PCR method and REB fusion protein was expressed in E. coli. The results of gel shift analysis showed that REB could recognize and bind to the GCN4 motif in the Wx gene in addition to binding to the target sequence in the promoter of α-globulin.

  16. Strategic Lean Organizational Design: Towards Lean World-Small World Configurations through Discrete Dynamic Organizational Motifs

    Directory of Open Access Journals (Sweden)

    Javier Villalba-Diez

    2016-01-01

    Full Text Available Organizations face strong international competition in the global market arena in achieving strategic goals such as high quality of product or service at lower cost while increasing their ability to respond quickly to requirements of the market. These challenges concern strategically designing organizations that can meet global challenges and specialize locally to meet performance constraints. After introducing the concept of organizational functional and structural motifs as small organizational building block, our findings suggest the hypothesis that a strategic organizational design (SOD approach to meet these challenges involves maximizing the number and diversity of functional motifs, while minimizing the repertoire of structural motifs. By detecting characteristic structural motifs, we provide organizational leaders with specific Lean SOD solutions with which to meet local and global challenges simultaneously. As a matter of application, we show the implementation of such an SOD approach in nine US hospitals that form one large health care holding.

  17. Correlating overrepresented upstream motifs to gene expression a computational approach to regulatory element discovery in eukaryotes

    CERN Document Server

    Caselle, M; Provero, P

    2002-01-01

    Gene regulation in eukaryotes is mainly effected through transcription factors binding to rather short recognition motifs generally located upstream of the coding region. We present a novel computational method to identify regulatory elements in the upstream region of eukaryotic genes. The genes are grouped in sets sharing an overrepresented short motif in their upstream sequence. For each set, the average expression level from a microarray experiment is determined: If this level is significantly higher or lower than the average taken over the whole genome, then the overerpresented motif shared by the genes in the set is likely to play a role in their regulation. The method was tested by applying it to the genome of Saccharomyces cerevisiae, using the publicly available results of a DNA microarray experiment, in which expression levels for virtually all the genes were measured during the diauxic shift from fermentation to respiration. Several known motifs were correctly identified, and a new candidate regulat...

  18. Effect of the DEF motif on phosphorylation of peptide substrates by ERK

    OpenAIRE

    Fernandes, Neychelle; Allbritton, Nancy L.

    2009-01-01

    MAP kinase ERK maintains specificity by binding to docking sites such as the DEF domain or D domain. It was previously shown that appending peptides derived from D domains to a substrate peptide increased apparent efficiency of peptide phosphorylation while preserving its apparent specificity for ERK. Here we determine the effect of the DEF motif on efficiency and specificity of peptide phosphorylation by ERK. The DEF motif modulated the apparent affinity of the peptide for ERK while the subs...

  19. A Nucleotide Binding Motif in Hepatitis C Virus (HCV) NS4B Mediates HCV RNA Replication

    OpenAIRE

    Einav, Shirit; Elazar, Menashe; Danieli, Tsafi; Glenn, Jeffrey S.

    2004-01-01

    Hepatitis C virus (HCV) is a major cause of viral hepatitis. There is no effective therapy for most patients. We have identified a nucleotide binding motif (NBM) in one of the virus's nonstructural proteins, NS4B. This structural motif binds and hydrolyzes GTP and is conserved across HCV isolates. Genetically disrupting the NBM impairs GTP binding and hydrolysis and dramatically inhibits HCV RNA replication. These results have exciting implications for the HCV life cycle and novel antiviral s...

  20. An artificial intelligence approach to motif discovery in protein sequences: application to steriod dehydrogenases.

    Science.gov (United States)

    Bailey, T L; Baker, M E; Elkan, C P

    1997-05-01

    MEME (Multiple Expectation-maximization for Motif Elicitation) is a unique new software tool that uses artificial intelligence techniques to discover motifs shared by a set of protein sequences in a fully automated manner. This paper is the first detailed study of the use of MEME to analyse a large, biologically relevant set of sequences, and to evaluate the sensitivity and accuracy of MEME in identifying structurally important motifs. For this purpose, we chose the short-chain alcohol dehydrogenase superfamily because it is large and phylogenetically diverse, providing a test of how well MEME can work on sequences with low amino acid similarity. Moreover, this dataset contains enzymes of biological importance, and because several enzymes have known X-ray crystallographic structures, we can test the usefulness of MEME for structural analysis. The first six motifs from MEME map onto structurally important alpha-helices and beta-strands on Streptomyces hydrogenans 20beta-hydroxysteroid dehydrogenase. We also describe MAST (Motif Alignment Search Tool), which conveniently uses output from MEME for searching databases such as SWISS-PROT and Genpept. MAST provides statistical measures that permit a rigorous evaluation of the significance of database searches with individual motifs or groups of motifs. A database search of Genpept90 by MAST with the log-odds matrix of the first six motifs obtained from MEME yields a bimodal output, demonstrating the selectivity of MAST. We show for the first time, using primary sequence analysis, that bacterial sugar epimerases are homologs of short-chain dehydrogenases. MEME and MAST will be increasingly useful as genome sequencing provides large datasets of phylogenetically divergent sequences of biomedical interest. PMID:9366496

  1. The Phe-Phe Motif for Peptide Self-Assembly in Nanomedicine

    OpenAIRE

    Silvia Marchesan; Vargiu, Attilio V.; Katie E. Styan

    2015-01-01

    Since its discovery, the Phe-Phe motif has gained in popularity as a minimalist building block to drive the self-assembly of short peptides and their analogues into nanostructures and hydrogels. Molecules based on the Phe-Phe motif have found a range of applications in nanomedicine, from drug delivery and biomaterials to new therapeutic paradigms. Here we discuss the various production methods for this class of compounds, and the characterization, nanomorphologies, and application of their se...

  2. The Verrucomicrobia LexA-binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

    Directory of Open Access Journals (Sweden)

    Ivan Erill

    2016-07-01

    Full Text Available The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  3. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  4. Decorative motifs in the interior of the town house of the 19th century in Macedonia

    OpenAIRE

    Namicev, Petar; Namiceva, Ekaterina

    2015-01-01

    An integral part of the decoration of the house in Macedonia in the 19th century is, the application of certain stylized motifs in shaping the interior. Based upon the specific material (wood, plaster) gets a certain typology of decorative elements, with partial or full use of the wood or plaster in their representation in the interior. According to the style of decorative motifs include geometric processing, vegetable and zoomorphic decoration. Vegetabe and geometric decoration representing ...

  5. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls. PMID:27489856

  6. Waddling Random Walk: Fast and Accurate Sampling of Motif Statistics in Large Graphs

    OpenAIRE

    Han, Guyue; Sethu, Harish

    2016-01-01

    The relative frequency of small subgraphs within a large graph, such as one representing an online social network, is of high interest to sociologists, computer scientists and marketeers alike. However, the computation of these network motif statistics via naive enumeration is infeasible for either its prohibitive computational costs or access restrictions on the full graph data. Methods to estimate the motif statistics based on random walks by sampling only a small fraction of the subgraphs ...

  7. A Nucleotide Binding Motif in Hepatitis C Virus (HCV) NS4B Mediates HCV RNA Replication

    Science.gov (United States)

    Einav, Shirit; Elazar, Menashe; Danieli, Tsafi; Glenn, Jeffrey S.

    2004-01-01

    Hepatitis C virus (HCV) is a major cause of viral hepatitis. There is no effective therapy for most patients. We have identified a nucleotide binding motif (NBM) in one of the virus's nonstructural proteins, NS4B. This structural motif binds and hydrolyzes GTP and is conserved across HCV isolates. Genetically disrupting the NBM impairs GTP binding and hydrolysis and dramatically inhibits HCV RNA replication. These results have exciting implications for the HCV life cycle and novel antiviral strategies. PMID:15452248

  8. CytoKavosh: a cytoscape plug-in for finding network motifs in large biological networks.

    Science.gov (United States)

    Masoudi-Nejad, Ali; Ansariola, Mitra; Kashani, Zahra Razaghi Moghadam; Salehzadeh-Yazdi, Ali; Khakabimamaghani, Sahand

    2012-01-01

    Network motifs are small connected sub-graphs that have recently gathered much attention to discover structural behaviors of large and complex networks. Finding motifs with any size is one of the most important problems in complex and large networks. It needs fast and reliable algorithms and tools for achieving this purpose. CytoKavosh is one of the best choices for finding motifs with any given size in any complex network. It relies on a fast algorithm, Kavosh, which makes it faster than other existing tools. Kavosh algorithm applies some well known algorithmic features and includes tricky aspects, which make it an efficient algorithm in this field. CytoKavosh is a Cytoscape plug-in which supports us in finding motifs of given size in a network that is formerly loaded into the Cytoscape work-space (directed or undirected). High performance of CytoKavosh is achieved by dynamically linking highly optimized functions of Kavosh's C++ to the Cytoscape Java program, which makes this plug-in suitable for analyzing large biological networks. Some significant attributes of CytoKavosh is efficiency in time usage and memory and having no limitation related to the implementation in motif size. CytoKavosh is implemented in a visual environment Cytoscape that is convenient for the users to interact and create visual options to analyze the structural behavior of a network. This plug-in can work on any given network and is very simple to use and generates graphical results of discovered motifs with any required details. There is no specific Cytoscape plug-in, specific for finding the network motifs, based on original concept. So, we have introduced for the first time, CytoKavosh as the first plug-in, and we hope that this plug-in can be improved to cover other options to make it the best motif-analyzing tool.

  9. CytoKavosh: a cytoscape plug-in for finding network motifs in large biological networks.

    Directory of Open Access Journals (Sweden)

    Ali Masoudi-Nejad

    Full Text Available Network motifs are small connected sub-graphs that have recently gathered much attention to discover structural behaviors of large and complex networks. Finding motifs with any size is one of the most important problems in complex and large networks. It needs fast and reliable algorithms and tools for achieving this purpose. CytoKavosh is one of the best choices for finding motifs with any given size in any complex network. It relies on a fast algorithm, Kavosh, which makes it faster than other existing tools. Kavosh algorithm applies some well known algorithmic features and includes tricky aspects, which make it an efficient algorithm in this field. CytoKavosh is a Cytoscape plug-in which supports us in finding motifs of given size in a network that is formerly loaded into the Cytoscape work-space (directed or undirected. High performance of CytoKavosh is achieved by dynamically linking highly optimized functions of Kavosh's C++ to the Cytoscape Java program, which makes this plug-in suitable for analyzing large biological networks. Some significant attributes of CytoKavosh is efficiency in time usage and memory and having no limitation related to the implementation in motif size. CytoKavosh is implemented in a visual environment Cytoscape that is convenient for the users to interact and create visual options to analyze the structural behavior of a network. This plug-in can work on any given network and is very simple to use and generates graphical results of discovered motifs with any required details. There is no specific Cytoscape plug-in, specific for finding the network motifs, based on original concept. So, we have introduced for the first time, CytoKavosh as the first plug-in, and we hope that this plug-in can be improved to cover other options to make it the best motif-analyzing tool.

  10. Creation of Hybrid Nanorods From Sequences of Natural Trimeric Fibrous Proteins Using the Fibritin Trimerization Motif

    Science.gov (United States)

    Papanikolopoulou, Katerina; van Raaij, Mark J.; Mitraki, Anna

    Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, β-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple β-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.

  11. Pipeline for the Analysis of ChIP-seq Data and New Motif Ranking Procedure

    KAUST Repository

    Ashoor, Haitham

    2011-06-01

    This thesis presents a computational methodology for ab-initio identification of transcription factor binding sites based on ChIP-seq data. This method consists of three main steps, namely ChIP-seq data processing, motif discovery and models selection. A novel method for ranking the models of motifs identified in this process is proposed. This method combines multiple factors in order to rank the provided candidate motifs. It combines the model coverage of the ChIP-seq fragments that contain motifs from which that model is built, the suitable background data made up of shuffled ChIP-seq fragments, and the p-value that resulted from evaluating the model on actual and background data. Two ChIP-seq datasets retrieved from ENCODE project are used to evaluate and demonstrate the ability of the method to predict correct TFBSs with high precision. The first dataset relates to neuron-restrictive silencer factor, NRSF, while the second one corresponds to growth-associated binding protein, GABP. The pipeline system shows high precision prediction for both datasets, as in both cases the top ranked motif closely resembles the known motifs for the respective transcription factors.

  12. Functional characterization of transcription factor motifs using cross-species comparison across large evolutionary distances.

    Science.gov (United States)

    Kim, Jaebum; Cunningham, Ryan; James, Brian; Wyder, Stefan; Gibson, Joshua D; Niehuis, Oliver; Zdobnov, Evgeny M; Robertson, Hugh M; Robinson, Gene E; Werren, John H; Sinha, Saurabh

    2010-01-01

    We address the problem of finding statistically significant associations between cis-regulatory motifs and functional gene sets, in order to understand the biological roles of transcription factors. We develop a computational framework for this task, whose features include a new statistical score for motif scanning, the use of different scores for predicting targets of different motifs, and new ways to deal with redundancies among significant motif-function associations. This framework is applied to the recently sequenced genome of the jewel wasp, Nasonia vitripennis, making use of the existing knowledge of motifs and gene annotations in another insect genome, that of the fruitfly. The framework uses cross-species comparison to improve the specificity of its predictions, and does so without relying upon non-coding sequence alignment. It is therefore well suited for comparative genomics across large evolutionary divergences, where existing alignment-based methods are not applicable. We also apply the framework to find motifs associated with socially regulated gene sets in the honeybee, Apis mellifera, using comparisons with Nasonia, a solitary species, to identify honeybee-specific associations. PMID:20126523

  13. Functional characterization of transcription factor motifs using cross-species comparison across large evolutionary distances.

    Directory of Open Access Journals (Sweden)

    Jaebum Kim

    2010-01-01

    Full Text Available We address the problem of finding statistically significant associations between cis-regulatory motifs and functional gene sets, in order to understand the biological roles of transcription factors. We develop a computational framework for this task, whose features include a new statistical score for motif scanning, the use of different scores for predicting targets of different motifs, and new ways to deal with redundancies among significant motif-function associations. This framework is applied to the recently sequenced genome of the jewel wasp, Nasonia vitripennis, making use of the existing knowledge of motifs and gene annotations in another insect genome, that of the fruitfly. The framework uses cross-species comparison to improve the specificity of its predictions, and does so without relying upon non-coding sequence alignment. It is therefore well suited for comparative genomics across large evolutionary divergences, where existing alignment-based methods are not applicable. We also apply the framework to find motifs associated with socially regulated gene sets in the honeybee, Apis mellifera, using comparisons with Nasonia, a solitary species, to identify honeybee-specific associations.

  14. Discriminative motif discovery via simulated evolution and random under-sampling.

    Directory of Open Access Journals (Sweden)

    Tao Song

    Full Text Available Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  15. Identification of disease-specific motifs in the antibody specificity repertoire via next-generation sequencing.

    Science.gov (United States)

    Pantazes, Robert J; Reifert, Jack; Bozekowski, Joel; Ibsen, Kelly N; Murray, Joseph A; Daugherty, Patrick S

    2016-01-01

    Disease-specific antibodies can serve as highly effective biomarkers but have been identified for only a relatively small number of autoimmune diseases. A method was developed to identify disease-specific binding motifs through integration of bacterial display peptide library screening, next-generation sequencing (NGS) and computational analysis. Antibody specificity repertoires were determined by identifying bound peptide library members for each specimen using cell sorting and performing NGS. A computational algorithm, termed Identifying Motifs Using Next- generation sequencing Experiments (IMUNE), was developed and applied to discover disease- and healthy control-specific motifs. IMUNE performs comprehensive pattern searches, identifies patterns statistically enriched in the disease or control groups and clusters the patterns to generate motifs. Using celiac disease sera as a discovery set, IMUNE identified a consensus motif (QPEQPF[PS]E) with high diagnostic sensitivity and specificity in a validation sera set, in addition to novel motifs. Peptide display and sequencing (Display-Seq) coupled with IMUNE analysis may thus be useful to characterize antibody repertoires and identify disease-specific antibody epitopes and biomarkers. PMID:27481573

  16. Identification of disease-specific motifs in the antibody specificity repertoire via next-generation sequencing

    Science.gov (United States)

    Pantazes, Robert J.; Reifert, Jack; Bozekowski, Joel; Ibsen, Kelly N.; Murray, Joseph A.; Daugherty, Patrick S.

    2016-01-01

    Disease-specific antibodies can serve as highly effective biomarkers but have been identified for only a relatively small number of autoimmune diseases. A method was developed to identify disease-specific binding motifs through integration of bacterial display peptide library screening, next-generation sequencing (NGS) and computational analysis. Antibody specificity repertoires were determined by identifying bound peptide library members for each specimen using cell sorting and performing NGS. A computational algorithm, termed Identifying Motifs Using Next- generation sequencing Experiments (IMUNE), was developed and applied to discover disease- and healthy control-specific motifs. IMUNE performs comprehensive pattern searches, identifies patterns statistically enriched in the disease or control groups and clusters the patterns to generate motifs. Using celiac disease sera as a discovery set, IMUNE identified a consensus motif (QPEQPF[PS]E) with high diagnostic sensitivity and specificity in a validation sera set, in addition to novel motifs. Peptide display and sequencing (Display-Seq) coupled with IMUNE analysis may thus be useful to characterize antibody repertoires and identify disease-specific antibody epitopes and biomarkers. PMID:27481573

  17. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Directory of Open Access Journals (Sweden)

    Pooya Zandevakili

    Full Text Available Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  18. A robust elicitation algorithm for discovering DNA motifs using fuzzy self-organizing maps.

    Science.gov (United States)

    Wang, Dianhui; Tapan, Sarwar

    2013-10-01

    It is important to identify DNA motifs in promoter regions to understand the mechanism of gene regulation. Computational approaches for finding DNA motifs are well recognized as useful tools to biologists, which greatly help in saving experimental time and cost in wet laboratories. Self-organizing maps (SOMs), as a powerful clustering tool, have demonstrated good potential for problem solving. However, the current SOM-based motif discovery algorithms unfairly treat data samples lying around the cluster boundaries by assigning them to one of the nodes, which may result in unreliable system performance. This paper aims to develop a robust framework for discovering DNA motifs, where fuzzy SOMs, with an integration of fuzzy c-means membership functions and a standard batch-learning scheme, are employed to extract putative motifs with varying length in a recursive manner. Experimental results on eight real datasets show that our proposed algorithm outperforms the other searching tools such as SOMBRERO, SOMEA, MEME, AlignACE, and WEEDER in terms of the F-measure and algorithm reliability. It is observed that a remarkable 24.6% improvement can be achieved compared to the state-of-the-art SOMBRERO. Furthermore, our algorithm can produce a 20% and 6.6% improvement over SOMBRERO and SOMEA, respectively, in finding multiple motifs on five artificial datasets. PMID:24808603

  19. LRRCE: a leucine-rich repeat cysteine capping motif unique to the chordate lineage

    Directory of Open Access Journals (Sweden)

    Bishop Paul N

    2008-12-01

    Full Text Available Abstract Background The small leucine-rich repeat proteins and proteoglycans (SLRPs form an important family of regulatory molecules that participate in many essential functions. They typically control the correct assembly of collagen fibrils, regulate mineral deposition in bone, and modulate the activity of potent cellular growth factors through many signalling cascades. SLRPs belong to the group of extracellular leucine-rich repeat proteins that are flanked at both ends by disulphide-bonded caps that protect the hydrophobic core of the terminal repeats. A capping motif specific to SLRPs has been recently described in the crystal structures of the core proteins of decorin and biglycan. This motif, designated as LRRCE, differs in both sequence and structure from other, more widespread leucine-rich capping motifs. To investigate if the LRRCE motif is a common structural feature found in other leucine-rich repeat proteins, we have defined characteristic sequence patterns and used them in genome-wide searches. Results The LRRCE motif is a structural element exclusive to the main group of SLRPs. It appears to have evolved during early chordate evolution and is not found in protein sequences from non-chordate genomes. Our search has expanded the family of SLRPs to include new predicted protein sequences, mainly in fishes but with intriguing putative orthologs in mammals. The chromosomal locations of the newly predicted SLRP genes would support the large-scale genome or gene duplications that are thought to have occurred during vertebrate evolution. From this expanded list we describe a new class of SLRP sequences that could be representative of an ancestral SLRP gene. Conclusion Given its exclusivity the LRRCE motif is a useful annotation tool for the identification and classification of new SLRP sequences in genome databases. The expanded list of members of the SLRP family offers interesting insights into early vertebrate evolution and suggests an

  20. Lipid motif of a bacterial antigen mediates immune responses via TLR2 signaling.

    Directory of Open Access Journals (Sweden)

    Amit A Lugade

    Full Text Available The cross-talk between the innate and the adaptive immune system is facilitated by the initial interaction of antigen with dendritic cells. As DCs express a large array of TLRs, evidence has accumulated that engagement of these molecules contributes to the activation of adaptive immunity. We have evaluated the immunostimulatory role of the highly-conserved outer membrane lipoprotein P6 from non-typeable Haemophilus influenzae (NTHI to determine whether the presence of the lipid motif plays a critical role on its immunogenicity. We undertook a systematic analysis of the role that the lipid motif plays in the activation of DCs and the subsequent stimulation of antigen-specific T and B cells. To facilitate our studies, recombinant P6 protein that lacked the lipid motif was generated. Mice immunized with non-lipidated rP6 were unable to elicit high titers of anti-P6 Ig. Expression of the lipid motif on P6 was also required for proliferation and cytokine secretion by antigen-specific T cells. Upregulation of T cell costimulatory molecules was abrogated in DCs exposed to non-lipidated rP6 and in TLR2(-/- DCs exposed to native P6, thereby resulting in diminished adaptive immune responses. Absence of either the lipid motif on the antigen or TLR2 expression resulted in diminished cytokine production from stimulated DCs. Collectively, our data suggest that the lipid motif of the lipoprotein antigen is essential for triggering TLR2 signaling and effective stimulation of APCs. Our studies establish the pivotal role of a bacterial lipid motif on activating both innate and adaptive immune responses to an otherwise poorly immunogenic protein antigen.

  1. The Geometry of Plasticity-Induced Sensitization in Isoinhibitory Rate Motifs.

    Science.gov (United States)

    Kumar, Gautam; Ching, ShiNung

    2016-09-01

    A well-known phenomenon in sensory perception is desensitization, wherein behavioral responses to persistent stimuli become attenuated over time. In this letter, our focus is on studying mechanisms through which desensitization may be mediated at the network level and, specifically, how sensitivity changes arise as a function of long-term plasticity. Our principal object of study is a generic isoinhibitory motif: a small excitatory-inhibitory network with recurrent inhibition. Such a motif is of interest due to its overrepresentation in laminar sensory network architectures. Here, we introduce a sensitivity analysis derived from control theory in which we characterize the fixed-energy reachable set of the motif. This set describes the regions of the phase-space that are more easily (in terms of stimulus energy) accessed, thus providing a holistic assessment of sensitivity. We specifically focus on how the geometry of this set changes due to repetitive application of a persistent stimulus. We find that for certain motif dynamics, this geometry contracts along the stimulus orientation while expanding in orthogonal directions. In other words, the motif not only desensitizes to the persistent input, but heightens its responsiveness (sensitizes) to those that are orthogonal. We develop a perturbation analysis that links this sensitization to both plasticity-induced changes in synaptic weights and the intrinsic dynamics of the network, highlighting that the effect is not purely due to weight-dependent disinhibition. Instead, this effect depends on the relative neuronal time constants and the consequent stimulus-induced drift that arises in the motif phase-space. For tightly distributed (but random) parameter ranges, sensitization is quite generic and manifests in larger recurrent E-I networks within which the motif is embedded. PMID:27391684

  2. Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes

    Directory of Open Access Journals (Sweden)

    Kistler Corby

    2010-03-01

    Full Text Available Abstract Background Fusarium graminearum (Fg, a major fungal pathogen of cultivated cereals, is responsible for billions of dollars in agriculture losses. There is a growing interest in understanding the transcriptional regulation of this organism, especially the regulation of genes underlying its pathogenicity. The generation of whole genome sequence assemblies for Fg and three closely related Fusarium species provides a unique opportunity for such a study. Results Applying comparative genomics approaches, we developed a computational pipeline to systematically discover evolutionarily conserved regulatory motifs in the promoter, downstream and the intronic regions of Fg genes, based on the multiple alignments of sequenced Fusarium genomes. Using this method, we discovered 73 candidate regulatory motifs in the promoter regions. Nearly 30% of these motifs are highly enriched in promoter regions of Fg genes that are associated with a specific functional category. Through comparison to Saccharomyces cerevisiae (Sc and Schizosaccharomyces pombe (Sp, we observed conservation of transcription factors (TFs, their binding sites and the target genes regulated by these TFs related to pathways known to respond to stress conditions or phosphate metabolism. In addition, this study revealed 69 and 39 conserved motifs in the downstream regions and the intronic regions, respectively, of Fg genes. The top intronic motif is the splice donor site. For the downstream regions, we noticed an intriguing absence of the mammalian and Sc poly-adenylation signals among the list of conserved motifs. Conclusion This study provides the first comprehensive list of candidate regulatory motifs in Fg, and underscores the power of comparative genomics in revealing functional elements among related genomes. The conservation of regulatory pathways among the Fusarium genomes and the two yeast species reveals their functional significance, and provides new insights in their

  3. Identification of Biomarker and Co-Regulatory Motifs in Lung Adenocarcinoma Based on Differential Interactions.

    Directory of Open Access Journals (Sweden)

    Ning Zhao

    Full Text Available Changes in intermolecular interactions (differential interactions may influence the progression of cancer. Specific genes and their regulatory networks may be more closely associated with cancer when taking their transcriptional and post-transcriptional levels and dynamic and static interactions into account simultaneously. In this paper, a differential interaction analysis was performed to detect lung adenocarcinoma-related genes. Furthermore, a miRNA-TF (transcription factor synergistic regulation network was constructed to identify three kinds of co-regulated motifs, namely, triplet, crosstalk and joint. Not only were the known cancer-related miRNAs and TFs (let-7, miR-15a, miR-17, TP53, ETS1, and so on were detected in the motifs, but also the miR-15, let-7 and miR-17 families showed a tendency to regulate the triplet, crosstalk and joint motifs, respectively. Moreover, several biological functions (i.e., cell cycle, signaling pathways and hemopoiesis associated with the three motifs were found to be frequently targeted by the drugs for lung adenocarcinoma. Specifically, the two 4-node motifs (crosstalk and joint based on co-expression and interaction had a closer relationship to lung adenocarcinoma, and so further research was performed on them. A 10-gene biomarker (UBC, SRC, SP1, MYC, STAT3, JUN, NR3C1, RB1, GRB2 and MAPK1 was selected from the joint motif, and a survival analysis indicated its significant association with survival. Among the ten genes, JUN, NR3C1 and GRB2 are our newly detected candidate lung adenocarcinoma-related genes. The genes, regulators and regulatory motifs detected in this work will provide potential drug targets and new strategies for individual therapy.

  4. APOCALYPTIC MOTIFS IN THE CYCLE OF STORIES BY M.A. BULGAKOV «NOTES OF A YOUNG DOCTOR»

    Directory of Open Access Journals (Sweden)

    Evgeniy Igorevich Erokhov

    2015-10-01

    Full Text Available The motif analysis of a cycle of stories by M.A. Bulgakov «Notes of a Young Doctor» from the point of view of their apocalyptic problematics was first performed in this article. To identify apocalyptic motifs the method of motif analysis, developed by B.M. Gasparov, was used which will also help to prove the interpenetration of motifs in the cycle of stories. The result of the research work is the identification of apocalyptic motifs which are manifested in the experiences of the main character and the events taking place around him and passing through the prism of physician’s perception of the world. Our identified motifs show that the stories in the cycle are united not only thematically and with the help of the image of the main character, but with the help of the motifs which reflect interpenetration of apocalyptic motifs in the stories of one cycle. There are the following apocalyptic motifs in the cycle of stories by Bulgakov: diseases, darkness (as part of the landscape, resurrection from the dead and beast. They all belong to the biblical type which is allocated on the basis of the associative bond of these motifs with the biblical texts.

  5. How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2014-05-29

    LD motifs (leucine-aspartic acidmotifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.

  6. Disparate requirements for the Walker A and B ATPase motifs ofhuman RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  7. V1R promoters are well conserved and exhibit common putative regulatory motifs

    Directory of Open Access Journals (Sweden)

    Lane Robert P

    2007-07-01

    Full Text Available Abstract Background The mouse vomeronasal organ (VNO processes chemosensory information, including pheromone signals that influence reproductive behaviors. The sensory neurons of the VNO express two types of chemosensory receptors, V1R and V2R. There are ~165 V1R genes in the mouse genome that have been classified into ~12 divergent subfamilies. Each sensory neuron of the apical compartment of the VNO transcribes only one of the repertoire of V1R genes. A model for mutually exclusive V1R transcription in these cells has been proposed in which each V1R gene might compete stochastically for a single transcriptional complex. This model predicts that the large repertoire of divergent V1R genes in the mouse genome contains common regulatory elements. In this study, we have characterized V1R promoter regions by comparative genomics and by mapping transcription start sites. Results We find that transcription is initiated from ~1 kb promoter regions that are well conserved within V1R subfamilies. While cross-subfamily homology is not evident by traditional methods, we developed a heuristic motif-searching tool, LogoAlign, and applied this tool to identify motifs shared within the promoters of all V1R genes. Our motif-searching tool exhibits rapid convergence to a relatively small number of non-redundant solutions (97% convergence. We also find that the best motifs contain significantly more information than those identified in controls, and that these motifs are more likely to be found in the immediate vicinity of transcription start sites than elsewhere in gene blocks. The best motifs occur near transcription start sites of ~90% of all V1R genes and across all of the divergent subfamilies. Therefore, these motifs are candidate binding sites for transcription factors involved in V1R co-regulation. Conclusion Our analyses show that V1R subfamilies have broad and well conserved promoter regions from which transcription is initiated. Results from a new

  8. DXD Motif-Dependent and -Independent Effects of the Chlamydia trachomatis Cytotoxin CT166

    Directory of Open Access Journals (Sweden)

    Miriam Bothe

    2015-02-01

    Full Text Available The Gram-negative, intracellular bacterium Chlamydia trachomatis causes acute and chronic urogenital tract infection, potentially leading to infertility and ectopic pregnancy. The only partially characterized cytotoxin CT166 of serovar D exhibits a DXD motif, which is important for the enzymatic activity of many bacterial and mammalian type A glycosyltransferases, leading to the hypothesis that CT166 possess glycosyltransferase activity. CT166-expressing HeLa cells exhibit actin reorganization, including cell rounding, which has been attributed to the inhibition of the Rho-GTPases Rac/Cdc42. Exploiting the glycosylation-sensitive Ras(27H5 antibody, we here show that CT166 induces an epitope change in Ras, resulting in inhibited ERK and PI3K signaling and delayed cell cycle progression. Consistent with the hypothesis that these effects strictly depend on the DXD motif, CT166 with the mutated DXD motif causes neither Ras-ERK inhibition nor delayed cell cycle progression. In contrast, CT166 with the mutated DXD motif is still capable of inhibiting cell migration, suggesting that CT166 with the mutated DXD motif cannot be regarded as inactive in any case. Taken together, CT166 affects various fundamental cellular processes, strongly suggesting its importance for the intracellular survival of chlamydia.

  9. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal Matoq Saeed

    2011-11-15

    Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. The Author(s) 2011. Published by Oxford University Press. All rights reserved.

  10. Network motif identification and structure detection with exponential random graph models

    Directory of Open Access Journals (Sweden)

    Munni Begum

    2014-12-01

    Full Text Available Local regulatory motifs are identified in the transcription regulatory network of the most studied model organism Escherichia coli (E. coli through graphical models. Network motifs are small structures in a network that appear more frequently than expected by chance alone. We apply social network methodologies such as p* models, also known as Exponential Random Graph Models (ERGMs, to identify statistically significant network motifs. In particular, we generate directed graphical models that can be applied to study interaction networks in a broad range of databases. The Markov Chain Monte Carlo (MCMC computational algorithms are implemented to obtain the estimates of model parameters to the corresponding network statistics. A variety of ERGMs are fitted to identify statistically significant network motifs in transcription regulatory networks of E. coli. A total of nine ERGMs are fitted to study the transcription factor - transcription factor interactions and eleven ERGMs are fitted for the transcription factor-operon interactions. For both of these interaction networks, arc (a directed edge in a directed network and k-istar (or incoming star structures, for values of k between 2 and 10, are found to be statistically significant local structures or network motifs. The goodness of fit statistics are provided to determine the quality of these models.

  11. EEVD motif of heat shock cognate protein 70 contributes to bacterial uptake by trophoblast giant cells

    Directory of Open Access Journals (Sweden)

    Kim Suk

    2009-12-01

    Full Text Available Abstract Background The uptake of abortion-inducing pathogens by trophoblast giant (TG cells is a key event in infectious abortion. However, little is known about phagocytic functions of TG cells against the pathogens. Here we show that heat shock cognate protein 70 (Hsc70 contributes to bacterial uptake by TG cells and the EEVD motif of Hsc70 plays an important role in this. Methods Brucella abortus and Listeria monocytogenes were used as the bacterial antigen in this study. Recombinant proteins containing tetratricopeptide repeat (TPR domains were constructed and confirmation of the binding capacity to Hsc70 was assessed by ELISA. The recombinant TPR proteins were used for investigation of the effect of TPR proteins on bacterial uptake by TG cells and on pregnancy in mice. Results The monoclonal antibody that inhibits bacterial uptake by TG cells reacted with the EEVD motif of Hsc70. Bacterial TPR proteins bound to the C-terminal of Hsc70 through its EEVD motif and this binding inhibited bacterial uptake by TG cells. Infectious abortion was also prevented by blocking the EEVD motif of Hsc70. Conclusions Our results demonstrate that surface located Hsc70 on TG cells mediates the uptake of pathogenic bacteria and proteins containing the TPR domain inhibit the function of Hsc70 by binding to its EEVD motif. These molecules may be useful in the development of methods for preventing infectious abortion.

  12. Importance of NPA motifs in the expression and function of water channel aquaporin-1

    Institute of Scientific and Technical Information of China (English)

    JIANG Yong; MA TongHui

    2007-01-01

    The asparagine-proline-alanine sequences (NPA motifs) are highly conserved in aquaporin water channel family. Crystallographic studies of AQP1 structure demonstrated that the two NPA motifs are in the narrow central constriction of the channel, serving to bind water molecules for selective and efficient water passage. To investigate the importance of the two NPA motifs in the structure, function and biogenesis of aquaporin water channels, we generated AQP1 mutations with NPA1 deletion, NPA2 deletion and NPA1,2 double deletion. The coding sequences of the three mutated cDNAs were subcloned into the mammalian expression vector pcDNA3.1 to form expression plasmids. We established stably transfected CHO cell lines expressing these AQP1 mutants. Immunofluorescence indicated that all the three mutated AQP1 proteins are expressed normally on the plasma membrane of stably transfected CHO cells, suggesting that deletion of NPA motifs does not influence the expression and intracellular processing of AQP1. Functional analysis demonstrated that NPA1 or NPA2 deletion reduced AQP1 water permeability by 49.6% and 46.7%, respectively, while NPA1,2 double deletion had little effect on AQP1 water permeability. These results provide evidence that NPA motifs are important for water per-meation but not essential for the expression, intracellular processing and the basic structure of AQP1 water channel.

  13. Mechano-chemical selections of two competitive unfolding pathways of a single DNA i-motif

    International Nuclear Information System (INIS)

    The DNA i-motif is a quadruplex structure formed in tandem cytosine-rich sequences in slightly acidic conditions. Besides being considered as a building block of DNA nano-devices, it may also play potential roles in regulating chromosome stability and gene transcriptions. The stability of i-motif is crucial for these functions. In this work, we investigated the mechanical stability of a single i-motif formed in the human telomeric sequence 5'-(CCCTAA)3CCC, which revealed a novel pH and loading rate-dependent bimodal unfolding force distribution. Although the cause of the bimodal unfolding force species is not clear, we proposed a phenomenological model involving a direct unfolding favored at lower loading rate or higher pH value, which is subject to competition with another unfolding pathway through a mechanically stable intermediate state whose nature is yet to be determined. Overall, the unique mechano—chemical responses of i-motif-provide a new perspective to its stability, which may be useful to guide designing new i-motif-based DNA mechanical nano-devices

  14. qPMS9: An Efficient Algorithm for Quorum Planted Motif Search

    Science.gov (United States)

    Nicolae, Marius; Rajasekaran, Sanguthevar

    2015-01-01

    Discovering patterns in biological sequences is a crucial problem. For example, the identification of patterns in DNA sequences has resulted in the determination of open reading frames, identification of gene promoter elements, intron/exon splicing sites, and SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have led to domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, discovery of short functional motifs, etc. In this paper we focus on the identification of an important class of patterns, namely, motifs. We study the (l, d) motif search problem or Planted Motif Search (PMS). PMS receives as input n strings and two integers l and d. It returns all sequences M of length l that occur in each input string, where each occurrence differs from M in at most d positions. Another formulation is quorum PMS (qPMS), where the motif appears in at least q% of the strings. We introduce qPMS9, a parallel exact qPMS algorithm that offers significant runtime improvements on DNA and protein datasets. qPMS9 solves the challenging DNA (l, d)-instances (28, 12) and (30, 13). The source code is available at https://code.google.com/p/qpms9/.

  15. A Simple Decision Rule for Recognition of Poly(A) Tail Signal Motifs in Human Genome

    KAUST Repository

    Eisha, Hassan Abou

    2015-05-12

    Background is the numerous attempts were made to predict motifs in genomic sequences that correspond to poly (A) tail signals. Vast portion of this effort has been directed to a plethora of nonlinear classification methods. Even when such approaches yield good discriminant results, identifying dominant features of regulatory mechanisms nevertheless remains a challenge. In this work, we look at decision rules that may help identifying such features. Findings are we present a simple decision rule for classification of candidate poly (A) tail signal motifs in human genomic sequence obtained by evaluating features during the construction of gradient boosted trees. We found that values of a single feature based on the frequency of adenine in the genomic sequence surrounding candidate signal and the number of consecutive adenine molecules in a well-defined region immediately following the motif displays good discriminative potential in classification of poly (A) tail motifs for samples covered by the rule. Conclusions is the resulting simple rule can be used as an efficient filter in construction of more complex poly(A) tail motifs classification algorithms.

  16. Designing synthetic RNAs to determine the relevance of structural motifs in picornavirus IRES elements

    Science.gov (United States)

    Fernandez-Chamorro, Javier; Lozano, Gloria; Garcia-Martin, Juan Antonio; Ramajo, Jorge; Dotu, Ivan; Clote, Peter; Martinez-Salas, Encarnacion

    2016-04-01

    The function of Internal Ribosome Entry Site (IRES) elements is intimately linked to their RNA structure. Viral IRES elements are organized in modular domains consisting of one or more stem-loops that harbor conserved RNA motifs critical for internal initiation of translation. A conserved motif is the pyrimidine-tract located upstream of the functional initiation codon in type I and II picornavirus IRES. By computationally designing synthetic RNAs to fold into a structure that sequesters the polypyrimidine tract in a hairpin, we establish a correlation between predicted inaccessibility of the pyrimidine tract and IRES activity, as determined in both in vitro and in vivo systems. Our data supports the hypothesis that structural sequestration of the pyrimidine-tract within a stable hairpin inactivates IRES activity, since the stronger the stability of the hairpin the higher the inhibition of protein synthesis. Destabilization of the stem-loop immediately upstream of the pyrimidine-tract also decreases IRES activity. Our work introduces a hybrid computational/experimental method to determine the importance of structural motifs for biological function. Specifically, we show the feasibility of using the software RNAiFold to design synthetic RNAs with particular sequence and structural motifs that permit subsequent experimental determination of the importance of such motifs for biological function.

  17. Identification of helix capping and {beta}-turn motifs from NMR chemical shifts

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang; Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2012-03-15

    We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and {sup 13}C{sup {beta}} chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of {beta}-turns: I, II, I Prime , II Prime and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and {beta}-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7-0.9 for the Matthews correlation coefficient of its predictions far exceed those attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures.

  18. CHEM-PATH-TRACKER: An automated tool to analyze chemical motifs in molecular structures.

    Science.gov (United States)

    Ribeiro, João V; Cerqueira, N M F S A; Fernandes, Pedro A; Ramos, Maria J

    2014-07-01

    In this article, we propose a method for locating functionally relevant chemical motifs in protein structures. The chemical motifs can be a small group of residues or structure protein fragments with highly conserved properties that have important biological functions. However, the detection of chemical motifs is rather difficult because they often consist of a set of amino acid residues separated by long, variable regions, and they only come together to form a functional group when the protein is folded into its three-dimensional structure. Furthermore, the assemblage of these residues is often dependent on non-covalent interactions among the constituent amino acids that are difficult to detect or visualize. To simplify the analysis of these chemical motifs and give access to a generalized use for all users, we developed chem-path-tracker. This software is a VMD plug-in that allows the user to highlight and reveal potential chemical motifs requiring only a few selections. The analysis is based on atoms/residues pair distances applying a modified version of Dijkstra's algorithm, and it makes possible to monitor the distances of a large pathway, even during a molecular dynamics simulation. This tool turned out to be very useful, fast, and user-friendly in the performed tests. The chem-path-tracker package is distributed as an independent platform and can be found at http://www.fc.up.pt/PortoBioComp/database/doku.php?id=chem-path-tracker. PMID:24775806

  19. Through the Portal: Viking Motifs Incorporated in the Romanesque Style in Telemark, Norway

    Directory of Open Access Journals (Sweden)

    Kristine Ødeby

    2013-09-01

    Full Text Available This paper presents the results of an analysis of motifs identified on six carved wooden Romanesque portal panels from the Norwegian county of Telemark. The findings suggest that animal motifs in the Late Viking style survived long into the Late Medieval period and were reused on these medieval portals. Stylistically, late expressions of Viking animal art do not differ a great deal from those of the subsequent Romanesque style. However, their symbolical differences are considered to be significant. The motifs themselves, and the issue of whether the Romanesque style adopted motifs from pre-Christian art, have attracted less attention. The motif portraying Sigurd slaying the dragon is considered in depth. It will be suggested that Sigurd, serving as a mediator between the old and the new beliefs when he appeared in late Viking contexts, was given a new role when portrayed in Christian art. Metaphor and liminality are a central part of this paper, and the theories of Alfred Gell and Margrete Andås suggest that the portal itself affects those who pass through it, and that the iconography is meaningful from a liminal perspective.

  20. Correlating novel variable and conserved motifs in the Hemagglutinin protein with significant biological functions

    Directory of Open Access Journals (Sweden)

    Werner Mark

    2008-08-01

    Full Text Available Abstract Background Variations in the influenza Hemagglutinin protein contributes to antigenic drift resulting in decreased efficiency of seasonal influenza vaccines and escape from host immune response. We performed an in silico study to determine characteristics of novel variable and conserved motifs in the Hemagglutinin protein from previously reported H3N2 strains isolated from Hong Kong from 1968–1999 to predict viral motifs involved in significant biological functions. Results 14 MEME blocks were generated and comparative analysis of the MEME blocks identified blocks 1, 2, 3 and 7 to correlate with several biological functions. Analysis of the different Hemagglutinin sequences elucidated that the single block 7 has the highest frequency of amino acid substitution and the highest number of co-mutating pairs. MEME 2 showed intermediate variability and MEME 1 was the most conserved. Interestingly, MEME blocks 2 and 7 had the highest incidence of potential post-translational modifications sites including phosphorylation sites, ASN glycosylation motifs and N-myristylation sites. Similarly, these 2 blocks overlap with previously identified antigenic sites and receptor binding sites. Conclusion Our study identifies motifs in the Hemagglutinin protein with different amino acid substitution frequencies over a 31 years period, and derives relevant functional characteristics by correlation of these motifs with potential post-translational modifications sites, antigenic and receptor binding sites.

  1. Effects of rate-limiting steps in transcription initiation on genetic filter motifs.

    Science.gov (United States)

    Häkkinen, Antti; Tran, Huy; Yli-Harja, Olli; Ribeiro, Andre S

    2013-01-01

    The behavior of genetic motifs is determined not only by the gene-gene interactions, but also by the expression patterns of the constituent genes. Live single-molecule measurements have provided evidence that transcription initiation is a sequential process, whose kinetics plays a key role in the dynamics of mRNA and protein numbers. The extent to which it affects the behavior of cellular motifs is unknown. Here, we examine how the kinetics of transcription initiation affects the behavior of motifs performing filtering in amplitude and frequency domain. We find that the performance of each filter is degraded as transcript levels are lowered. This effect can be reduced by having a transcription process with more steps. In addition, we show that the kinetics of the stepwise transcription initiation process affects features such as filter cutoffs. These results constitute an assessment of the range of behaviors of genetic motifs as a function of the kinetics of transcription initiation, and thus will aid in tuning of synthetic motifs to attain specific characteristics without affecting their protein products.

  2. The position of the Gly-xxx-Gly motif in transmembrane segments modulates dimer affinity.

    Science.gov (United States)

    Johnson, Rachel M; Rath, Arianna; Deber, Charles M

    2006-12-01

    Although the intrinsic low solubility of membrane proteins presents challenges to their high-resolution structure determination, insight into the amino acid sequence features and forces that stabilize their folds has been provided through study of sequence-dependent helix-helix interactions between single transmembrane (TM) helices. While the stability of helix-helix partnerships mediated by the Gly-xxx-Gly (GG4) motif is known to be generally modulated by distal interfacial residues, it has not been established whether the position of this motif, with respect to the ends of a given TM segment, affects dimer affinity. Here we examine the relationship between motif position and affinity in the homodimers of 2 single-spanning membrane protein TM sequences: glycophorin A (GpA) and bacteriophage M13 coat protein (MCP). Using the TOXCAT assay for dimer affinity on a series of GpA and MCP TM segments that have been modified with either 4 Leu residues at each end or with 8 Leu residues at the N-terminal end, we show that in each protein, centrally located GG4 motifs are capable of stronger helix-helix interactions than those proximal to TM helix ends, even when surrounding interfacial residues are maintained. The relative importance of GG4 motifs in stabilizing helix-helix interactions therefore must be considered not only in its specific residue context but also in terms of the location of the interactive surface relative to the N and C termini of alpha-helical TM segments.

  3. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming;

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...... relevant anti-HER2 antibodies. In the present study, we present a novel computational approach as an auxiliary tool for identification of novel HER2 epitopes. We hypothesized that the structurally and linearly evolutionarily conserved motifs of the extracellular domain of HER2 (ECD HER2) contain potential...... initial hypothesis. Considering that structurally and linearly conserved motifs can provide functional specific configurations, we propose that by comparing the two types of conserved motifs, additional druggable epitopes/targets in the ECD HER2 protein can be identified, which can be further modified...

  4. Conserved motifs II to VI of DNA helicase II from Escherichia coli are all required for biological activity.

    OpenAIRE

    Zhang, G.; Deng, E; Baugh, L R; Hamilton, C. M.; Maples, V F; Kushner, S R

    1997-01-01

    There are seven conserved motifs (IA, IB, and II to VI) in DNA helicase II of Escherichia coli that have high homology among a large family of proteins involved in DNA metabolism. To address the functional importance of motifs II to VI, we employed site-directed mutagenesis to replace the charged amino acid residues in each motif with alanines. Cells carrying these mutant alleles exhibited higher UV and methyl methanesulfonate sensitivity, increased rates of spontaneous mutagenesis, and eleva...

  5. Discovery of sequence motifs related to coexpression of genes using evolutionary computation

    Science.gov (United States)

    Fogel, Gary B.; Weekes, Dana G.; Varga, Gabor; Dow, Ernst R.; Harlow, Harry B.; Onyia, Jude E.; Su, Chen

    2004-01-01

    Transcription factors are key regulatory elements that control gene expression. Recognition of transcription factor binding site (TFBS) motifs in the upstream region of coexpressed genes is therefore critical towards a true understanding of the regulations of gene expression. The task of discovering eukaryotic TFBSs remains a challenging problem. Here, we demonstrate that evolutionary computation can be used to search for TFBSs in upstream regions of genes known to be coexpressed. Evolutionary computation was used to search for TFBSs of genes regulated by octamer-binding factor and nuclear factor kappa B. The discovered binding sites included experimentally determined known binding motifs as well as lists of putative, previously unknown TFBSs. We believe that this method to search nucleotide sequence information efficiently for similar motifs will be useful for discovering TFBSs that affect gene regulation. PMID:15266008

  6. How motif environment influences transcription factor search dynamics: Finding a needle in a haystack.

    Science.gov (United States)

    Dror, Iris; Rohs, Remo; Mandel-Gutfreund, Yael

    2016-07-01

    Transcription factors (TFs) have to find their binding sites, which are distributed throughout the genome. Facilitated diffusion is currently the most widely accepted model for this search process. Based on this model the TF alternates between one-dimensional sliding along the DNA, and three-dimensional bulk diffusion. In this view, the non-specific associations between the proteins and the DNA play a major role in the search dynamics. However, little is known about how the DNA properties around the motif contribute to the search. Accumulating evidence showing that TF binding sites are embedded within a unique environment, specific to each TF, leads to the hypothesis that the search process is facilitated by favorable DNA features that help to improve the search efficiency. Here, we review the field and present the hypothesis that TF-DNA recognition is dictated not only by the motif, but is also influenced by the environment in which the motif resides. PMID:27192961

  7. Improved Exact Enumerative Algorithms for the Planted (l, d)-Motif Search Problem.

    Science.gov (United States)

    Tanaka, Shunji

    2014-01-01

    In this paper efficient exact algorithms are proposed for the planted ( l, d)-motif search problem. This problem is to find all motifs of length l that are planted in each input string with at most d mismatches. The "quorum" version of this problem is also treated in this paper to find motifs planted not in all input strings but in at least q input strings. The proposed algorithms are based on the previous algorithms called qPMSPruneI and qPMS7 that traverse a search tree starting from a l-length substring of an input string. To improve these previous algorithms, several techniques are introduced, which contribute to reducing the computation time for the traversal. In computational experiments, it will be shown that the proposed algorithms outperform the previous algorithms.

  8. Negative in vitro selection identifies the rRNA recognition motif for ErmE methyltransferase

    DEFF Research Database (Denmark)

    Nielsen, A K; Douthwaite, S; Vester, B

    1999-01-01

    Erm methyltransferases modify bacterial 23S ribosomal RNA at adenosine 2058 (A2058, Escherichia coli numbering) conferring resistance to macrolide, lincosamide, and streptogramin B (MLS) antibiotics. The motif that is recognized by Erm methyltransferases is contained within helix 73 of 23S r......RNA and the adjacent single-stranded region around A2058. An RNA transcript of 72 nt that displays this motif functions as an efficient substrate for the ErmE methyltransferase. Pools of degenerate RNAs were formed by doping 34-nt positions that extend over and beyond the putative Erm recognition motif within the 72......-mer RNA. The RNAs were passed through a series of rounds of methylation with ErmE. After each round, RNAs were selected that had partially or completely lost their ability to be methylated. After several rounds of methylation/selection, 187 subclones were analyzed. Forty-three of the subclones...

  9. Feedback through graph motifs relates structure and function in complex networks

    CERN Document Server

    Hu, Yu; Cain, Nicholas; Mihalas, Stefan; Kutz, J Nathan; Shea-Brown, Eric

    2016-01-01

    How does the connectivity of a network system combine with the behavior of its individual components to determine its collective function? We approach this question by relating the internal network feedback to the statistical prevalence of connectivity motifs, a set of surprisingly simple and local statistics on the network topology. The resulting motif description provides a reduced order model of the network input-output dynamics and it relates the overall network function to feedback control theory. For example, this new formulation dramatically simplifies the classic Erdos-Renyi graph, reducing the overall graph behavior to a simple proportional feedback wrapped around the dynamics of a single node. Higher-order motifs systematically provide further layers and types of feedback to regulate the network response. Thus, the local connectivity shapes temporal and spectral processing by the network as a whole, and we show how this enables robust, yet tunable, functionality such as extending the time constant w...

  10. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    Full Text Available To know the map between transcription factors (TFs and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%-far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  11. Structural relationships in the lysozyme superfamily: significant evidence for glycoside hydrolase signature motifs.

    Directory of Open Access Journals (Sweden)

    Alexandre Wohlkönig

    Full Text Available BACKGROUND: Chitin is a polysaccharide that forms the hard, outer shell of arthropods and the cell walls of fungi and some algae. Peptidoglycan is a polymer of sugars and amino acids constituting the cell walls of most bacteria. Enzymes that are able to hydrolyze these cell membrane polymers generally play important roles for protecting plants and animals against infection with insects and pathogens. A particular group of such glycoside hydrolase enzymes share some common features in their three-dimensional structure and in their molecular mechanism, forming the lysozyme superfamily. RESULTS: Besides having a similar fold, all known catalytic domains of glycoside hydrolase proteins of lysozyme superfamily (families and subfamilies GH19, GH22, GH23, GH24 and GH46 share in common two structural elements: the central helix of the all-α domain, which invariably contains the catalytic glutamate residue acting as general-acid catalyst, and a β-hairpin pointed towards the substrate binding cleft. The invariant β-hairpin structure is interestingly found to display the highest amino acid conservation in aligned sequences of a given family, thereby allowing to define signature motifs for each GH family. Most of such signature motifs are found to have promising performances for searching sequence databases. Our structural analysis further indicates that the GH motifs participate in enzymatic catalysis essentially by containing the catalytic water positioning residue of inverting mechanism. CONCLUSIONS: The seven families and subfamilies of the lysozyme superfamily all have in common a β-hairpin structure which displays a family-specific sequence motif. These GH β-hairpin motifs contain potentially important residues for the catalytic activity, thereby suggesting the participation of the GH motif to catalysis and also revealing a common catalytic scheme utilized by enzymes of the lysozyme superfamily.

  12. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.

  13. Motif-guided sparse decomposition of gene expression data for regulatory module identification

    Directory of Open Access Journals (Sweden)

    Hoffman Eric P

    2011-03-01

    Full Text Available Abstract Background Genes work coordinately as gene modules or gene networks. Various computational approaches have been proposed to find gene modules based on gene expression data; for example, gene clustering is a popular method for grouping genes with similar gene expression patterns. However, traditional gene clustering often yields unsatisfactory results for regulatory module identification because the resulting gene clusters are co-expressed but not necessarily co-regulated. Results We propose a novel approach, motif-guided sparse decomposition (mSD, to identify gene regulatory modules by integrating gene expression data and DNA sequence motif information. The mSD approach is implemented as a two-step algorithm comprising estimates of (1 transcription factor activity and (2 the strength of the predicted gene regulation event(s. Specifically, a motif-guided clustering method is first developed to estimate the transcription factor activity of a gene module; sparse component analysis is then applied to estimate the regulation strength, and so predict the target genes of the transcription factors. The mSD approach was first tested for its improved performance in finding regulatory modules using simulated and real yeast data, revealing functionally distinct gene modules enriched with biologically validated transcription factors. We then demonstrated the efficacy of the mSD approach on breast cancer cell line data and uncovered several important gene regulatory modules related to endocrine therapy of breast cancer. Conclusion We have developed a new integrated strategy, namely motif-guided sparse decomposition (mSD of gene expression data, for regulatory module identification. The mSD method features a novel motif-guided clustering method for transcription factor activity estimation by finding a balance between co-regulation and co-expression. The mSD method further utilizes a sparse decomposition method for regulation strength estimation. The

  14. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  15. Mycobacterial PE_PGRS Proteins Contain Calcium-Binding Motifs with Parallel β-roll Folds

    Institute of Scientific and Technical Information of China (English)

    Nandita; Bachhawat; Balvinder; Singh

    2007-01-01

    The PE_PGRS family of proteins unique to mycobacteria is demonstrated to con- rain multiple calcium-binding and glycine-rich sequence motifs GGXGXD/NXUX. This sequence repeat constitutes a calcium-binding parallel/3-roll or parallel β-helix structure and is found in RTX toxins secreted by many Gram-negative bacteria. It is predicted that the highly homologous PE_PGRS proteins containing multiple copies of the nona-peptide motif could fold into similar calcium-binding structures. The implication of the predicted calcium-binding property of PE_PGRS proteins in the Ught of macrophage-pathogen interaction and pathogenesis is presented.

  16. Requirement for asparagine in the aquaporin NPA sequence signature motifs for cation exclusion

    DEFF Research Database (Denmark)

    Wree, Dorothea; Wu, Binghua; Zeuthen, Thomas;

    2011-01-01

    Two highly conserved NPA motifs are a hallmark of the aquaporin (AQP) family. The NPA triplets form N-terminal helix capping structures with the Asn side chains located in the centre of the water or solute-conducting channel, and are considered to play an important role in AQP selectivity. Although...... electrophysiology, we found that an analogous mammalian AQP1 N76S mutant excluded protons and potassium ions, but leaked sodium ions, providing an argument for the overwhelming prevalence of Asn over other amino acids. We conclude that, at the first position in the NPA motifs, only Asn provides efficient helix cap...

  17. BetaSearch: a new method for querying β-residue motifs

    Directory of Open Access Journals (Sweden)

    Ho Hui

    2012-07-01

    Full Text Available Abstract Background Searching for structural motifs across known protein structures can be useful for identifying unrelated proteins with similar function and characterising secondary structures such as β-sheets. This is infeasible using conventional sequence alignment because linear protein sequences do not contain spatial information. β-residue motifs are β-sheet substructures that can be represented as graphs and queried using existing graph indexing methods, however, these approaches are designed for general graphs that do not incorporate the inherent structural constraints of β-sheets and require computationally-expensive filtering and verification procedures. 3D substructure search methods, on the other hand, allow β-residue motifs to be queried in a three-dimensional context but at significant computational costs. Findings We developed a new method for querying β-residue motifs, called BetaSearch, which leverages the natural planar constraints of β-sheets by indexing them as 2D matrices, thus avoiding much of the computational complexities involved with structural and graph querying. BetaSearch exhibits faster filtering, verification, and overall query time than existing graph indexing approaches whilst producing comparable index sizes. Compared to 3D substructure search methods, BetaSearch achieves 33 and 240 times speedups over index-based and pairwise alignment-based approaches, respectively. Furthermore, we have presented case-studies to demonstrate its capability of motif matching in sequentially dissimilar proteins and described a method for using BetaSearch to predict β-strand pairing. Conclusions We have demonstrated that BetaSearch is a fast method for querying substructure motifs. The improvements in speed over existing approaches make it useful for efficiently performing high-volume exploratory querying of possible protein substructural motifs or conformations. BetaSearch was used to identify a nearly identical

  18. The Phe-Phe Motif for Peptide Self-Assembly in Nanomedicine.

    Science.gov (United States)

    Marchesan, Silvia; Vargiu, Attilio V; Styan, Katie E

    2015-01-01

    Since its discovery, the Phe-Phe motif has gained in popularity as a minimalist building block to drive the self-assembly of short peptides and their analogues into nanostructures and hydrogels. Molecules based on the Phe-Phe motif have found a range of applications in nanomedicine, from drug delivery and biomaterials to new therapeutic paradigms. Here we discuss the various production methods for this class of compounds, and the characterization, nanomorphologies, and application of their self-assembled nanostructures. We include the most recent findings on their remarkable properties, which hold substantial promise for the creation of the next generation nanomedicines. PMID:26540034

  19. The IQ Motif is Crucial for Ca(v)1.1 Function

    OpenAIRE

    Katarina Stroffekova

    2011-01-01

    Ca2+-dependent modulation via calmodulin, with consensus CaM-binding IQ motif playing a key role, has been documented for most high-voltage-activated Ca2+ channels. The skeletal muscle Cav1.1 also exhibits Ca2+-/CaM-dependent modulation. Here, whole-cell Ca2+ current, Ca2+ transient, and maximal, immobilization-resistant charge movement (Q max) recordings were obtained from cultured mouse myotubes, to test a role of IQ motif in function of Cav1.1. The effect of introducing mutation (IQ to AA)...

  20. The Phe-Phe Motif for Peptide Self-Assembly in Nanomedicine

    Directory of Open Access Journals (Sweden)

    Silvia Marchesan

    2015-11-01

    Full Text Available Since its discovery, the Phe-Phe motif has gained in popularity as a minimalist building block to drive the self-assembly of short peptides and their analogues into nanostructures and hydrogels. Molecules based on the Phe-Phe motif have found a range of applications in nanomedicine, from drug delivery and biomaterials to new therapeutic paradigms. Here we discuss the various production methods for this class of compounds, and the characterization, nanomorphologies, and application of their self-assembled nanostructures. We include the most recent findings on their remarkable properties, which hold substantial promise for the creation of the next generation nanomedicines.

  1. Pyrimidone-based series of glucokinase activators with alternative donor-acceptor motif.

    Science.gov (United States)

    Filipski, Kevin J; Guzman-Perez, Angel; Bian, Jianwei; Perreault, Christian; Aspnes, Gary E; Didiuk, Mary T; Dow, Robert L; Hank, Richard F; Jones, Christopher S; Maguire, Robert J; Tu, Meihua; Zeng, Dongxiang; Liu, Shenping; Knafels, John D; Litchfield, John; Atkinson, Karen; Derksen, David R; Bourbonais, Francis; Gajiwala, Ketan S; Hickey, Michael; Johnson, Theodore O; Humphries, Paul S; Pfefferkorn, Jeffrey A

    2013-08-15

    Glucokinase activators are a class of experimental agents under investigation as a therapy for Type 2 diabetes mellitus. An X-ray crystal structure of a modestly potent agent revealed the potential to substitute the common heterocyclic amide donor-acceptor motif for a pyridone moiety. We have successfully demonstrated that both pyridone and pyrimidone heterocycles can be used as a potent donor-acceptor substituent. Several sub-micromolar analogs that possess the desired partial activator profile were synthesized and characterized. Unfortunately, the most potent activators suffered from sub-optimal pharmacokinetic properties. Nonetheless, these donor-acceptor motifs may find utility in other glucokinase activator series or beyond.

  2. Characterization of transcription factors binding to-120 GATA motif of rat βbminy-globin promoter

    OpenAIRE

    Pavlović Sonja T.; Mitrović Tatjana; Karan-Đurašević Teodora; Nikčević Gordana T.

    2005-01-01

    The aim of this study was to elucidate the regulation of rat adult βbminy-globin gene transcription. We used DNasel foot printing, gel mobility shift and super shift assays to characterize transcription factors involved in this regulation. In this study we analyzed GATA motif at-120 bp in the distal promoter of βbminy-globin gene. Footprint analysis revealed the binding of nuclear factors from MEL cells to the GATA motif. By using gel mobility shift assay two protein complexes were detected. ...

  3. Alanine substitutions of noncysteine residues in the cysteine-stabilized αβ motif

    OpenAIRE

    Yang, Ying-Fang; Cheng, Kuo-Chang; Tsai, Ping-Hsing; Liu, Chung-Cheng; Lee, Tian-Ren; Ping-Chiang Lyu

    2009-01-01

    The protein scaffold is a peptide framework with a high tolerance of residue modifications. The cysteine-stabilized αβ motif (CSαβ) consists of an α-helix and an antiparallel triple-stranded β-sheet connected by two disulfide bridges. Proteins containing this motif share low sequence identity but high structural similarity and has been suggested as a good scaffold for protein engineering. The Vigna radiate defensin 1 (VrD1), a plant defensin, serves here as a model protein to probe the amino ...

  4. Introducing tetraCys motifs at two different sites results in a functional dopamine transporter

    DEFF Research Database (Denmark)

    Orun, Oya; Rasmussen, S; Gether, U

    2009-01-01

    We have introduced tetracysteine motifs into different positions of the dopamine transporter (DAT) for specific FlAsH labeling. Two of the constructs expressed at the cell surface and were functional as determined by [3H] dopamine uptake experiments. The N-terminally modified transporter showed...... uptake levels comparable to the wild-type DAT, while the construct with tetracysteine motif at position 511 displayed an uptake level about 1/3 of its wild-type counterpart. In addition, these two transporter constructs were visualized on the cell surface following labeling with a fluorescent cocaine...

  5. Finding Common Sequence and Structure Motifs in a set of RNA sequences

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Heyer, Laurie J.; Stormo, Gary D.

    1997-01-01

    We present a computational scheme to search for the most common motif, composed of a combination of sequence and structure constraints, among a collection of RNA sequences. The method uses a simplified version of the Sankoff algorithm for simultaneous folding and alignment of RNA sequences...

  6. Amphipathic motifs in BAR domains are essential for membrane curvature sensing

    DEFF Research Database (Denmark)

    Bhatia, Vikram K; Madsen, Kenneth L; Bolinger, Pierre-Yves;

    2009-01-01

    BAR (Bin/Amphiphysin/Rvs) domains and amphipathic alpha-helices (AHs) are believed to be sensors of membrane curvature thus facilitating the assembly of protein complexes on curved membranes. Here, we used quantitative fluorescence microscopy to compare the binding of both motifs on single nanosi...

  7. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed

    2011-07-20

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length of the motifs and their variable degree of conservation makes their identification hard since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).

  8. Analysis of tetra- and hepta-nucleotides motifs promoting -1 ribosomal frameshifting in Escherichia coli.

    Science.gov (United States)

    Sharma, Virag; Prère, Marie-Françoise; Canal, Isabelle; Firth, Andrew E; Atkins, John F; Baranov, Pavel V; Fayet, Olivier

    2014-06-01

    Programmed ribosomal -1 frameshifting is a non-standard decoding process occurring when ribosomes encounter a signal embedded in the mRNA of certain eukaryotic and prokaryotic genes. This signal has a mandatory component, the frameshift motif: it is either a Z_ZZN tetramer or a X_XXZ_ZZN heptamer (where ZZZ and XXX are three identical nucleotides) allowing cognate or near-cognate repairing to the -1 frame of the A site or A and P sites tRNAs. Depending on the signal, the frameshifting frequency can vary over a wide range, from less than 1% to more than 50%. The present study combines experimental and bioinformatics approaches to carry out (i) a systematic analysis of the frameshift propensity of all possible motifs (16 Z_ZZN tetramers and 64 X_XXZ_ZZN heptamers) in Escherichia coli and (ii) the identification of genes potentially using this mode of expression amongst 36 Enterobacteriaceae genomes. While motif efficiency varies widely, a major distinctive rule of bacterial -1 frameshifting is that the most efficient motifs are those allowing cognate re-pairing of the A site tRNA from ZZN to ZZZ. The outcome of the genomic search is a set of 69 gene clusters, 59 of which constitute new candidates for functional utilization of -1 frameshifting. PMID:24875478

  9. Use of a Probabilistic Motif Search to Identify Histidine Phosphotransfer Domain-Containing Proteins.

    Science.gov (United States)

    Surujon, Defne; Ratner, David I

    2016-01-01

    The wealth of newly obtained proteomic information affords researchers the possibility of searching for proteins of a given structure or function. Here we describe a general method for the detection of a protein domain of interest in any species for which a complete proteome exists. In particular, we apply this approach to identify histidine phosphotransfer (HPt) domain-containing proteins across a range of eukaryotic species. From the sequences of known HPt domains, we created an amino acid occurrence matrix which we then used to define a conserved, probabilistic motif. Examination of various organisms either known to contain (plant and fungal species) or believed to lack (mammals) HPt domains established criteria by which new HPt candidates were identified and ranked. Search results using a probabilistic motif matrix compare favorably with data to be found in several commonly used protein structure/function databases: our method identified all known HPt proteins in the Arabidopsis thaliana proteome, confirmed the absence of such motifs in mice and humans, and suggests new candidate HPts in several organisms. Moreover, probabilistic motif searching can be applied more generally, in a manner both readily customized and computationally compact, to other protein domains; this utility is demonstrated by our identification of histones in a range of eukaryotic organisms. PMID:26751210

  10. Finding the most significant common sequence and structure motifs in a set of RNA sequences

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Heyer, L.J.; Stormo, G.D.

    1997-01-01

    We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints, In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections, The first part utilizes a simplified...

  11. ATtRACT-a database of RNA-binding proteins and associated motifs.

    Science.gov (United States)

    Giudice, Girolamo; Sánchez-Cabo, Fátima; Torroja, Carlos; Lara-Pezzi, Enrique

    2016-01-01

    RNA-binding proteins (RBPs) play a crucial role in key cellular processes, including RNA transport, splicing, polyadenylation and stability. Understanding the interaction between RBPs and RNA is key to improve our knowledge of RNA processing, localization and regulation in a global manner. Despite advances in recent years, a unified non-redundant resource that includes information on experimentally validated motifs, RBPs and integrated tools to exploit this information is lacking. Here, we developed a database named ATtRACT (available athttp://attract.cnic.es) that compiles information on 370 RBPs and 1583 RBP consensus binding motifs, 192 of which are not present in any other database. To populate ATtRACT we (i) extracted and hand-curated experimentally validated data from CISBP-RNA, SpliceAid-F, RBPDB databases, (ii) integrated and updated the unavailable ASD database and (iii) extracted information from Protein-RNA complexes present in Protein Data Bank database through computational analyses. ATtRACT provides also efficient algorithms to search a specific motif and scan one or more RNA sequences at a time. It also allows discoveringde novomotifs enriched in a set of related sequences and compare them with the motifs included in the database.Database URL:http:// attract. cnic. es. PMID:27055826

  12. Use of a Probabilistic Motif Search to Identify Histidine Phosphotransfer Domain-Containing Proteins.

    Directory of Open Access Journals (Sweden)

    Defne Surujon

    Full Text Available The wealth of newly obtained proteomic information affords researchers the possibility of searching for proteins of a given structure or function. Here we describe a general method for the detection of a protein domain of interest in any species for which a complete proteome exists. In particular, we apply this approach to identify histidine phosphotransfer (HPt domain-containing proteins across a range of eukaryotic species. From the sequences of known HPt domains, we created an amino acid occurrence matrix which we then used to define a conserved, probabilistic motif. Examination of various organisms either known to contain (plant and fungal species or believed to lack (mammals HPt domains established criteria by which new HPt candidates were identified and ranked. Search results using a probabilistic motif matrix compare favorably with data to be found in several commonly used protein structure/function databases: our method identified all known HPt proteins in the Arabidopsis thaliana proteome, confirmed the absence of such motifs in mice and humans, and suggests new candidate HPts in several organisms. Moreover, probabilistic motif searching can be applied more generally, in a manner both readily customized and computationally compact, to other protein domains; this utility is demonstrated by our identification of histones in a range of eukaryotic organisms.

  13. Nuclear localization of the dehydrin OpsDHN1 is determined by histidine-rich motif.

    Science.gov (United States)

    Hernández-Sánchez, Itzell E; Maruri-López, Israel; Ferrando, Alejandro; Carbonell, Juan; Graether, Steffen P; Jiménez-Bremont, Juan F

    2015-01-01

    The cactus OpsDHN1 dehydrin belongs to a large family of disordered and highly hydrophilic proteins known as Late Embryogenesis Abundant (LEA) proteins, which accumulate during the late stages of embryogenesis and in response to abiotic stresses. Herein, we present the in vivo OpsDHN1 subcellular localization by N-terminal GFP translational fusion; our results revealed a cytoplasmic and nuclear localization of the GFP::OpsDHN1 protein in Nicotiana benthamiana epidermal cells. In addition, dimer assembly of OpsDHN1 in planta using a Bimolecular Fluorescence Complementation (BiFC) approach was demonstrated. In order to understand the in vivo role of the histidine-rich motif, the OpsDHN1-ΔHis version was produced and assayed for its subcellular localization and dimer capability by GFP fusion and BiFC assays, respectively. We found that deletion of the OpsDHN1 histidine-rich motif restricted its localization to cytoplasm, but did not affect dimer formation. In addition, the deletion of the S-segment in the OpsDHN1 protein affected its nuclear localization. Our data suggest that the deletion of histidine-rich motif and S-segment show similar effects, preventing OpsDHN1 from getting into the nucleus. Based on these results, the histidine-rich motif is proposed as a targeting element for OpsDHN1 nuclear localization. PMID:26442018

  14. A conserved upstream motif orchestrates autonomous, germline-enriched expression of Caenorhabditis elegans piRNAs.

    Directory of Open Access Journals (Sweden)

    Allison C Billi

    Full Text Available Piwi-interacting RNAs (piRNAs fulfill a critical, conserved role in defending the genome against foreign genetic elements. In many organisms, piRNAs appear to be derived from processing of a long, polycistronic RNA precursor. Here, we establish that each Caenorhabditis elegans piRNA represents a tiny, autonomous transcriptional unit. Remarkably, the minimal C. elegans piRNA cassette requires only a 21 nucleotide (nt piRNA sequence and an ∼50 nt upstream motif with limited genomic context for expression. Combining computational analyses with a novel, in vivo transgenic system, we demonstrate that this upstream motif is necessary for independent expression of a germline-enriched, Piwi-dependent piRNA. We further show that a single nucleotide position within this motif directs differential germline enrichment. Accordingly, over 70% of C. elegans piRNAs are selectively expressed in male or female germline, and comparison of the genes they target suggests that these two populations have evolved independently. Together, our results indicate that C. elegans piRNA upstream motifs act as independent promoters to specify which sequences are expressed as piRNAs, how abundantly they are expressed, and in what germline. As the genome encodes well over 15,000 unique piRNA sequences, our study reveals that the number of transcriptional units encoding piRNAs rivals the number of mRNA coding genes in the C. elegans genome.

  15. Modulation of i-motif thermodynamic stability by the introduction of UNA (unlocked nucleic acid) monomers

    DEFF Research Database (Denmark)

    Pasternak, Anna; Wengel, Jesper

    2011-01-01

    The influence of acyclic RNA derivatives, UNA (unlocked nucleic acid) monomers, on i-DNA thermodynamic stability has been investigated. The 22 nt human telomeric fragment was chosen as the model sequence for stability studies. UNA monomers modulate i-motif stability in a position-depending manner...

  16. Nuclear localization of the dehydrin OpsDHN1 is determined by histidine-rich motif

    Directory of Open Access Journals (Sweden)

    Itzell Euridice Hernández-Sánchez

    2015-09-01

    Full Text Available The cactus OpsDHN1 dehydrin belongs to a large family of disordered and highly hydrophilic proteins known as Late Embryogenesis Abundant (LEA proteins, which accumulate during the late stages of embryogenesis and in response to abiotic stresses. Herein, we present the in vivo OpsDHN1 subcellular localization by N-terminal GFP translational fusion; our results revealed a cytoplasmic and nuclear localization of the GFP::OpsDHN1 protein in Nicotiana benthamiana epidermal cells. In addition, dimer assembly of OpsDHN1 in planta using a Bimolecular Fluorescence Complementation (BiFC approach was demonstrated. In order to understand the in vivo role of the histidine-rich motif, the OpsDHN1-ΔHis version was produced and assayed for its subcellular localization and dimer capability by GFP fusion and BiFC assays, respectively. We found that deletion of the OpsDHN1 histidine-rich motif restricted its localization to cytoplasm, but did not affect dimer formation. In addition, the deletion of the S-segment in the OpsDHN1 protein affected its nuclear localization. Our data suggest that the deletion of histidine-rich motif and S-segment show similar effects, preventing OpsDHN1 from getting into the nucleus. Based on these results, the histidine rich motif is proposed as a targeting element for OpsDHN1 nuclear localization.

  17. Position Weight Matrix, Gibbs Sampler, and the Associated Significance Tests in Motif Characterization and Prediction

    Directory of Open Access Journals (Sweden)

    Xuhua Xia

    2012-01-01

    Full Text Available Position weight matrix (PWM is not only one of the most widely used bioinformatic methods, but also a key component in more advanced computational algorithms (e.g., Gibbs sampler for characterizing and discovering motifs in nucleotide or amino acid sequences. However, few generally applicable statistical tests are available for evaluating the significance of site patterns, PWM, and PWM scores (PWMS of putative motifs. Statistical significance tests of the PWM output, that is, site-specific frequencies, PWM itself, and PWMS, are in disparate sources and have never been collected in a single paper, with the consequence that many implementations of PWM do not include any significance test. Here I review PWM-based methods used in motif characterization and prediction (including a detailed illustration of the Gibbs sampler for de novo motif discovery, present statistical and probabilistic rationales behind statistical significance tests relevant to PWM, and illustrate their application with real data. The multiple comparison problem associated with the test of site-specific frequencies is best handled by false discovery rate methods. The test of PWM, due to the use of pseudocounts, is best done by resampling methods. The test of individual PWMS for each sequence segment should be based on the extreme value distribution.

  18. Thermal Stability of Modified i-Motif Oligonucleotides with Naphthalimide Intercalating Nucleic Acids

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed Ali; Pedersen, Erik B.; Khaireldin, Nahid Y.

    2016-01-01

    naphthalimide (1H-benzo[de]isoquinoline-1,3(2H)-dione) as the intercalating nucleic acid. The stabilities of i-motif structures with inserted naphthalimide intercalating nucleotides were studied using UV melting temperatures (Tm) and circular dichroism spectra at different pH values and conditions (crowding and...

  19. ATtRACT-a database of RNA-binding proteins and associated motifs.

    Science.gov (United States)

    Giudice, Girolamo; Sánchez-Cabo, Fátima; Torroja, Carlos; Lara-Pezzi, Enrique

    2016-01-01

    RNA-binding proteins (RBPs) play a crucial role in key cellular processes, including RNA transport, splicing, polyadenylation and stability. Understanding the interaction between RBPs and RNA is key to improve our knowledge of RNA processing, localization and regulation in a global manner. Despite advances in recent years, a unified non-redundant resource that includes information on experimentally validated motifs, RBPs and integrated tools to exploit this information is lacking. Here, we developed a database named ATtRACT (available athttp://attract.cnic.es) that compiles information on 370 RBPs and 1583 RBP consensus binding motifs, 192 of which are not present in any other database. To populate ATtRACT we (i) extracted and hand-curated experimentally validated data from CISBP-RNA, SpliceAid-F, RBPDB databases, (ii) integrated and updated the unavailable ASD database and (iii) extracted information from Protein-RNA complexes present in Protein Data Bank database through computational analyses. ATtRACT provides also efficient algorithms to search a specific motif and scan one or more RNA sequences at a time. It also allows discoveringde novomotifs enriched in a set of related sequences and compare them with the motifs included in the database.Database URL:http:// attract. cnic. es.

  20. Temporal motifs reveal collaboration patterns in online task-oriented networks

    Science.gov (United States)

    Xuan, Qi; Fang, Huiting; Fu, Chenbo; Filkov, Vladimir

    2015-05-01

    Real networks feature layers of interactions and complexity. In them, different types of nodes can interact with each other via a variety of events. Examples of this complexity are task-oriented social networks (TOSNs), where teams of people share tasks towards creating a quality artifact, such as academic research papers or software development in commercial or open source environments. Accomplishing those tasks involves both work, e.g., writing the papers or code, and communication, to discuss and coordinate. Taking into account the different types of activities and how they alternate over time can result in much more precise understanding of the TOSNs behaviors and outcomes. That calls for modeling techniques that can accommodate both node and link heterogeneity as well as temporal change. In this paper, we report on methodology for finding temporal motifs in TOSNs, limited to a system of two people and an artifact. We apply the methods to publicly available data of TOSNs from 31 Open Source Software projects. We find that these temporal motifs are enriched in the observed data. When applied to software development outcome, temporal motifs reveal a distinct dependency between collaboration and communication in the code writing process. Moreover, we show that models based on temporal motifs can be used to more precisely relate both individual developer centrality and team cohesion to programmer productivity than models based on aggregated TOSNs.

  1. Functional analysis of the putative integrin recognition motif on adeno-associated virus 9.

    Science.gov (United States)

    Shen, Shen; Berry, Garrett E; Castellanos Rivera, Ruth M; Cheung, Roland Y; Troupes, Andrew N; Brown, Sarah M; Kafri, Tal; Asokan, Aravind

    2015-01-16

    Adeno-associated viruses (AAVs) display a highly conserved NGR motif on the capsid surface. Earlier studies have established this tripeptide motif as being essential for integrin-mediated uptake of recombinant AAV serotype 2 (AAV2) in cultured cells. However, functional attributes of this putative integrin recognition motif in other recombinant AAV serotypes displaying systemic transduction in vivo remain unknown. In this study, we dissect the biology of an integrin domain capsid mutant derived from the human isolate AAV9 in mice. The AAV9/NGA mutant shows decreased systemic transduction in mice. This defective phenotype was accompanied by rapid clearance of mutant virions from the blood circulation and nonspecific sequestration by the spleen. Transient vascular hyperpermeability, induced by histamine coinjection, exacerbated AAV9/NGA uptake by the spleen but not the liver. However, such treatment did not affect AAV9 virions, suggesting a potential entry/post-entry defect for the mutant in different tissues. Further characterization revealed modestly decreased cell surface binding but a more pronounced defect in the cellular entry of mutant virions. These findings were corroborated by the observation that blocking multiple integrins adversely affected recombinant AAV9 transduction in different cell types, albeit with variable efficiencies. From a structural perspective, we observed that the integrin recognition motif is located in close proximity to the galactose binding footprint on AAV9 capsids and postulate that this feature could influence cell surface attachment, cellular uptake at the tissue level, and systemic clearance by the reticuloendothelial system. PMID:25404742

  2. Functional importance of motif I of pseudouridine synthases: mutagenesis of aligned lysine and proline residues.

    Science.gov (United States)

    Spedaliere, C J; Hamilton, C S; Mueller, E G

    2000-08-01

    On the basis of sequence alignments, the pseudouridine synthases were grouped into four families that share no statistically significant global sequence similarity, though some common sequence motifs were discovered [Koonin, E. V. (1996) Nucleic Acids. Res. 24, 2411-2415; Gustafsson, C., Reid, R., Greene, P. J., and Santi, D. V. (1996) Nucleic Acids Res. 24, 3756-3762]. We have investigated the functional significance of these alignments by substituting the nearly invariant lysine and proline residues in Motif I of RluA and TruB, pseudouridine synthases belonging to different families. Contrary to our expectations, the altered enzymes display only very mild kinetic impairment. Substitution of the aligned lysine and proline residues does, however, reduce structural stability, consistent with a temperature sensitive phenotype that results from substitution of the cognate proline residue in Cbf5p, a yeast homologue of TruB [Zerbarjadian, Y., King, T., Fournier, M. J., Clarke, L., and Carbon, J. (1999) Mol. Cell. Biol. 19, 7461-7472]. Together, our data support a functional role for Motif I, as predicted by sequence alignments, though the effect of substituting the highly conserved residues was milder than we anticipated. By extrapolation, our findings also support the assignment of pseudouridine synthase function to certain physiologically important eukaryotic proteins that contain Motif I, including the human protein dyskerin, alteration of which leads to the disease dyskeratosis congenita.

  3. The MRE11 GAR motif regulates DNA double-strand break processing and ATR activation

    Institute of Scientific and Technical Information of China (English)

    Zhenbao Yu; Gillian Vogel; Yan Coulombe; Danielle Dubeau; Elizabeth Spehalski; Josée Hébert; David O Ferguson; Jean Yves Masson; Stéphane Richard

    2012-01-01

    The MRE11/RAD50/NBS1 complex is the primary sensor rapidly recruited to DNA double-strand breaks (DSBs).MRE11 is known to be arginine methylated by PRMT1 within its glycine-arginine-rich (GAR) motif.In this study,we report a mouse knock-in allele of Mre11 that substitutes the arginines with lysines in the GAR motif and generates the MRE11RK protein devoid of methylated arginines.The Mre11RK/RK mice were hypersensitive to γ-irradiation (IR) and the cells from these mice displayed cell cycle checkpoint defects and chromosome instability.Moreover,the Mre11RK/RK MEFs exhibited ATR/CHK1 signaling defects and impairment in the recruitment of RPA and RAD51 to the damaged sites.The MRKRN complex formed and localized to the sites of DNA damage and normally activated the ATM pathway in response to IR.The MRKRN complex exhibited exonuclease and DNA-binding defects in vitro responsible for the impaired DNA end resection and ATR activation observed in vivo in response to IR.Our findings provide genetic evidence for the critical role of the MRE11 GAR motif in DSB repair,and demonstrate a mechanistic link between post-translational modifications at the MRE11 GAR motif and DSB processing,as well as the ATR/CHK1 checkpoint signaling.

  4. Bioinformatics analysis of biomarkers and transcriptional factor motifs in Down syndrome

    Directory of Open Access Journals (Sweden)

    X.D. Kong

    2014-10-01

    Full Text Available In this study, biomarkers and transcriptional factor motifs were identified in order to investigate the etiology and phenotypic severity of Down syndrome. GSE 1281, GSE 1611, and GSE 5390 were downloaded from the gene expression ominibus (GEO. A robust multiarray analysis (RMA algorithm was applied to detect differentially expressed genes (DEGs. In order to screen for biological pathways and to interrogate the Kyoto Encyclopedia of Genes and Genomes (KEGG pathway database, the database for annotation, visualization, and integrated discovery (DAVID was used to carry out a gene ontology (GO function enrichment for DEGs. Finally, a transcriptional regulatory network was constructed, and a hypergeometric distribution test was applied to select for significantly enriched transcriptional factor motifs. CBR1, DYRK1A, HMGN1, ITSN1, RCAN1, SON, TMEM50B, and TTC3 were each up-regulated two-fold in Down syndrome samples compared to normal samples; of these, SON and TTC3 were newly reported. CBR1, DYRK1A, HMGN1, ITSN1, RCAN1, SON, TMEM50B, and TTC3 were located on human chromosome 21 (mouse chromosome 16. The DEGs were significantly enriched in macromolecular complex subunit organization and focal adhesion pathways. Eleven significantly enriched transcription factor motifs (PAX5, EGR1, XBP1, SREBP1, OLF1, MZF1, NFY, NFKAPPAB, MYCMAX, NFE2, and RP58 were identified. The DEGs and transcription factor motifs identified in our study provide biomarkers for the understanding of Down syndrome pathogenesis and progression.

  5. The WSXWS motif in cytokine receptors is a molecular switch involved in receptor activation

    DEFF Research Database (Denmark)

    Dagil, Robert; Knudsen, Maiken J.; Olsen, Johan Gotthardt;

    2012-01-01

    The prolactin receptor (PRLR) is activated by binding of prolactin in a 2:1 complex, but the activation mechanism is poorly understood. PRLR has a conserved WSXWS motif generic to cytokine class I receptors. We have determined the nuclear magnetic resonance solution structure of the membrane...

  6. Anion–arene adducts: C–H hydrogen bonding, anion– interaction, and carbon bonding motifs

    OpenAIRE

    Hay, Benjamin P.; Bryantsev, Vyacheslav S.

    2008-01-01

    This article summarizes experimental and theoretical evidence for the existence of four distinct binding modes for complexes of anions with charge-neutral arenes. These include C–H hydrogen bonding and three motifs involving the arene– system—the noncovalent anion– interaction, weakly covalent interaction, and strongly covalent interaction.

  7. The IQ Motif is Crucial for Ca v 1.1 Function

    Directory of Open Access Journals (Sweden)

    Katarina Stroffekova

    2011-01-01

    Full Text Available Ca2+-dependent modulation via calmodulin, with consensus CaM-binding IQ motif playing a key role, has been documented for most high-voltage-activated Ca2+ channels. The skeletal muscle Cav1.1 also exhibits Ca2+-/CaM-dependent modulation. Here, whole-cell Ca2+ current, Ca2+ transient, and maximal, immobilization-resistant charge movement (Qmax recordings were obtained from cultured mouse myotubes, to test a role of IQ motif in function of Cav1.1. The effect of introducing mutation (IQ to AA of IQ motif into Cav1.1 was examined. In dysgenic myotubes expressing YFP-Cav1.1AA, neither Ca2+ currents nor evoked Ca2+ transients were detectable. The loss of Ca2+ current and excitation-contraction coupling did not appear to be a consequence of defective trafficking to the sarcolemma. The Qmax in dysgenic myotubes expressing YFP-Cav1.1AA was similar to that of normal myotubes. These findings suggest that the IQ motif of the Cav1.1 may be an unrecognized site of structural and functional coupling between DHPR and RyR.

  8. STUDYING THE INFLUENCE OF THE PYRENE INTERCALATOR TINA ON THE STABILITY OF DNA i-MOTIFS

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed A.; Pedersen, Erik Bjerregaard; Khaireldin, Nahid A.

    2012-01-01

    -motif structures was studied by using UV melting temperature measurements and circular dichroism spectra at different pH values under noncrowding and crowding conditions (20% poly(ethylene glycol)). When TINA ((R)-3-((4-(1-pyrenylethynyl)benzyl)oxy) propane-1,2-diol) is inserted, the oligonucleotides could form...

  9. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

    Science.gov (United States)

    Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

    2011-07-01

    The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.

  10. SiO2 nanoparticles modified CPE as a biosensor for determination of i-motif DNA/Tamoxifen interaction.

    Science.gov (United States)

    Heydari, Elham; Raoof, Jahan Bakhsh; Ojani, Reza; Bagheryan, Zahra

    2016-08-01

    Cytosine-rich DNA sequences can form a highly ordered structure known as i-motif in slightly acidic solutions. The stability of the folded i-motif structure is a good strategy to inhibit the telomerase reaction in cancer cells. The electrochemical biosensor was prepared by modifying carbon paste electrode with SiO2 nanoparticles to investigate drugs which can stabilize this structure. Tamoxifen (Tam), an antiestrogen hormonal agent for treatment of breast cancer, was chosen as the model ligand and its interaction with i-motif structure was examined. The interaction between i-motif DNA and Tam was studied in PBS buffer and [Fe(CN)6](3-) through the cyclic voltammetry and square wave voltammetry methods. The oxidation peak of Tam, due to the i-motif DNA/Tam interaction, was observed after i-motif immobilized on the surface of the electrode. The i-motif formation was investigated by circular dichroism spectroscopy and the results showed that this structure can certainly be made with pH around 4.5, but its stability reduced by going to the more alkaline pH. The selectivity which was studied in the presence of complementary strand demonstrated that i-motif structure could be stabilized in acidic pH even in the presence of its complementary strand. PMID:27151665

  11. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

    Science.gov (United States)

    Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide) represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred since neighbor alleles are more easily separated and identified from each other, which render the interpretation of electropherograms and the true alleles more reliable. In the present work, with the purpose of characterizing a set of core SSR markers with long-core motifs for well fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially-chosen long-core motif SSR markers covering all the 15 linkage groups of tea plant genome. A set of 6 SSR markers were conclusively selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10(-5), which was quite low, confirmed the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, for the sake of quickly discriminating the clonal tea cultivars, a cultivar identification diagram (CID) was subsequently established using these core markers, which fully reflected the identification process and provided the immediate information about which SSR markers were needed to identify a cultivar chosen among the tested ones. The results suggested that long-core motif SSR markers used in the investigation contributed to the accurate and efficient identification of the clonal tea cultivars and enabled the protection of intellectual property.

  12. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

    Science.gov (United States)

    Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide) represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred since neighbor alleles are more easily separated and identified from each other, which render the interpretation of electropherograms and the true alleles more reliable. In the present work, with the purpose of characterizing a set of core SSR markers with long-core motifs for well fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially-chosen long-core motif SSR markers covering all the 15 linkage groups of tea plant genome. A set of 6 SSR markers were conclusively selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10(-5), which was quite low, confirmed the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, for the sake of quickly discriminating the clonal tea cultivars, a cultivar identification diagram (CID) was subsequently established using these core markers, which fully reflected the identification process and provided the immediate information about which SSR markers were needed to identify a cultivar chosen among the tested ones. The results suggested that long-core motif SSR markers used in the investigation contributed to the accurate and efficient identification of the clonal tea cultivars and enabled the protection of intellectual property. PMID:27504250

  13. GRISOTTO: A greedy approach to improve combinatorial algorithms for motif discovery with prior knowledge

    Directory of Open Access Journals (Sweden)

    Oliveira Arlindo L

    2011-04-01

    Full Text Available Abstract Background Position-specific priors (PSP have been used with success to boost EM and Gibbs sampler-based motif discovery algorithms. PSP information has been computed from different sources, including orthologous conservation, DNA duplex stability, and nucleosome positioning. The use of prior information has not yet been used in the context of combinatorial algorithms. Moreover, priors have been used only independently, and the gain of combining priors from different sources has not yet been studied. Results We extend RISOTTO, a combinatorial algorithm for motif discovery, by post-processing its output with a greedy procedure that uses prior information. PSP's from different sources are combined into a scoring criterion that guides the greedy search procedure. The resulting method, called GRISOTTO, was evaluated over 156 yeast TF ChIP-chip sequence-sets commonly used to benchmark prior-based motif discovery algorithms. Results show that GRISOTTO is at least as accurate as other twelve state-of-the-art approaches for the same task, even without combining priors. Furthermore, by considering combined priors, GRISOTTO is considerably more accurate than the state-of-the-art approaches for the same task. We also show that PSP's improve GRISOTTO ability to retrieve motifs from mouse ChiP-seq data, indicating that the proposed algorithm can be applied to data from a different technology and for a higher eukaryote. Conclusions The conclusions of this work are twofold. First, post-processing the output of combinatorial algorithms by incorporating prior information leads to a very efficient and effective motif discovery method. Second, combining priors from different sources is even more beneficial than considering them separately.

  14. qPMS7: a fast algorithm for finding (ℓ, d-motifs in DNA and protein sequences.

    Directory of Open Access Journals (Sweden)

    Hieu Dinh

    Full Text Available Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motifs discovery is an important problem in biology. For example, it is useful in the detection of transcription factor binding sites and transcriptional regulatory elements that are very crucial in understanding gene function, human disease, drug design, etc. Many versions of the motif search problem have been proposed in the literature. One such is the (ℓ, d-motif search (or Planted Motif Search (PMS. A generalized version of the PMS problem, namely, Quorum Planted Motif Search (qPMS, is shown to accurately model motifs in real data. However, solving the qPMS problem is an extremely difficult task because a special case of it, the PMS Problem, is already NP-hard, which means that any algorithm solving it can be expected to take exponential time in the worse case scenario. In this paper, we propose a novel algorithm named qPMS7 that tackles the qPMS problem on real data as well as challenging instances. Experimental results show that our Algorithm qPMS7 is on an average 5 times faster than the state-of-art algorithm. The executable program of Algorithm qPMS7 is freely available on the web at http://pms.engr.uconn.edu/downloads/qPMS7.zip. Our online motif discovery tools that use Algorithm qPMS7 are freely available at http://pms.engr.uconn.edu or http://motifsearch.com.

  15. Identification of SNP-containing regulatory motifs in the myelodysplastic syndromes model using SNP arrays and gene expression arrays

    Institute of Scientific and Technical Information of China (English)

    Jing Fan; Jennifer G.Dy; Chung-Che Chang; Xiaobo Zhou

    2013-01-01

    Myelodysplastic syndromes have increased in frequency and incidence in the American population,but patient prognosis has not significantly improved over the last decade.Such improvements could be realized if biomarkers for accurate diagnosis and prognostic stratification were successfully identified.In this study,we propose a method that associates two state-of-the-art array technologies-single nucleotide polymorphism (SNP) array and gene expression array-with gene motifs considered transcription factor-binding sites (TFBS).We are particularly interested in SNP-containing motifs introduced by genetic variation and mutation as TFBS.The potential regulation of SNP-containing motifs affects only when certain mutations occur.These motifs can be identified from a group of co-expressed genes with copy number variation.Then,we used a sliding window to identify motif candidates near SNPs on gene sequences.The candidates were filtered by coarse thresholding and fine statistical testing.Using the regression-based LARS-EN algorithm and a level-wise sequence combination procedure,we identified 28 SNP-containing motifs as candidate TFBS.We confirmed 21 of the 28 motifs with ChIP-chip fragments in the TRANSFAC database.Another six motifs were validated by TRANSFAC via searching binding fragments on coregulated genes.The identified motifs and their location genes can be considered potential biomarkers for myelodysplastic syndromes.Thus,our proposed method,a novel strategy for associating two data categories,is capable of integrating information from different sources to identify reliable candidate regulatory SNP-containing motifs introduced by genetic variation and mutation.

  16. A dinucleotide motif in oligonucleotides shows potent immunomodulatory activity and overrides species-specific recognition observed with CpG motif

    OpenAIRE

    Kandimalla, Ekambar R; Bhagat, Lakshmi; Zhu, Fu-Gang; Yu, Dong; Cong, Yan-Ping; Wang, Daqing; Tang, Jimmy X.; Tang, Jin-Yan; Knetter, Cathrine F.; Lien, Egil; Agrawal, Sudhir

    2003-01-01

    Bacterial and synthetic DNAs containing CpG dinucleotides in specific sequence contexts activate the vertebrate immune system through Toll-like receptor 9 (TLR9). In the present study, we used a synthetic nucleoside with a bicyclic heterobase [1-(2′-deoxy-β-d-ribofuranosyl)-2-oxo-7-deaza-8-methyl-purine; R] to replace the C in CpG, resulting in an RpG dinucleotide. The RpG dinucleotide was incorporated in mouse- and human-specific motifs in oligodeoxynucleotides (oligos) and 3′-3-linked oligo...

  17. Assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data.

    Directory of Open Access Journals (Sweden)

    Yaron Orenstein

    Full Text Available The new technology of protein binding microarrays (PBMs allows simultaneous measurement of the binding intensities of a transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key computational challenge is inferring the binding motif from these data. We present a systematic comparison of four methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.

  18. A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities.

    Directory of Open Access Journals (Sweden)

    Marta Martínez-Bonet

    Full Text Available To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121-137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection.

  19. Functional Analysis of Semi-conserved Transit Peptide Motifs and Mechanistic Implications in Precursor Targeting and Recognition.

    Science.gov (United States)

    Holbrook, Kristen; Subramanian, Chitra; Chotewutmontri, Prakitchai; Reddick, L Evan; Wright, Sarah; Zhang, Huixia; Moncrief, Lily; Bruce, Barry D

    2016-09-01

    Over 95% of plastid proteins are nuclear-encoded as their precursors containing an N-terminal extension known as the transit peptide (TP). Although highly variable, TPs direct the precursors through a conserved, posttranslational mechanism involving translocons in the outer (TOC) and inner envelope (TOC). The organelle import specificity is mediated by one or more components of the Toc complex. However, the high TP diversity creates a paradox on how the sequences can be specifically recognized. An emerging model of TP design is that they contain multiple loosely conserved motifs that are recognized at different steps in the targeting and transport process. Bioinformatics has demonstrated that many TPs contain semi-conserved physicochemical motifs, termed FGLK. In order to characterize FGLK motifs in TP recognition and import, we have analyzed two well-studied TPs from the precursor of RuBisCO small subunit (SStp) and ferredoxin (Fdtp). Both SStp and Fdtp contain two FGLK motifs. Analysis of large set mutations (∼85) in these two motifs using in vitro, in organello, and in vivo approaches support a model in which the FGLK domains mediate interaction with TOC34 and possibly other TOC components. In vivo import analysis suggests that multiple FGLK motifs are functionally redundant. Furthermore, we discuss how FGLK motifs are required for efficient precursor protein import and how these elements may permit a convergent function of this highly variable class of targeting sequences. PMID:27378725

  20. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick eSobetzko

    2016-03-01

    Full Text Available Short DNA motifs are involved in a multitude of functions such as for example chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool DistAMo (Distribution Analysis of DNA Motifs. The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i genes beside the replication origin are enriched in GATCs, (ii genome-wide GATC distribution follows a distinct pattern and (iii genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.

  1. Super-transient scaling in time-delay autonomous Boolean network motifs

    Science.gov (United States)

    D'Huys, Otti; Lohmann, Johannes; Haynes, Nicholas D.; Gauthier, Daniel J.

    2016-09-01

    Autonomous Boolean networks are commonly used to model the dynamics of gene regulatory networks and allow for the prediction of stable dynamical attractors. However, most models do not account for time delays along the network links and noise, which are crucial features of real biological systems. Concentrating on two paradigmatic motifs, the toggle switch and the repressilator, we develop an experimental testbed that explicitly includes both inter-node time delays and noise using digital logic elements on field-programmable gate arrays. We observe transients that last millions to billions of characteristic time scales and scale exponentially with the amount of time delays between nodes, a phenomenon known as super-transient scaling. We develop a hybrid model that includes time delays along network links and allows for stochastic variation in the delays. Using this model, we explain the observed super-transient scaling of both motifs and recreate the experimentally measured transient distributions.

  2. Exploration of tetrahedral structures in silicate cathodes using a motif-network scheme

    Science.gov (United States)

    Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; Nguyen, Manh Cuong; Wang, Cai-Zhuang; Lin, Zijing; Zhu, Zi-Zhong; Ho, Kai-Ming

    2015-10-01

    Using a motif-network search scheme, we studied the tetrahedral structures of the dilithium/disodium transition metal orthosilicates A2MSiO4 with A = Li or Na and M = Mn, Fe or Co. In addition to finding all previously reported structures, we discovered many other different tetrahedral-network-based crystal structures which are highly degenerate in energy. These structures can be classified into structures with 1D, 2D and 3D M-Si-O frameworks. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs.

  3. Fast motif-network scheme for extensive exploration of complex crystal structures in silicate cathodes

    CERN Document Server

    Zhao, Xin; Lv, Xiaobao; Nguyen, Manh Cuong; Wang, Cai-Zhuang; Lin, Zijing; Zhu, Zi-Zhong; Ho, Kai-Ming

    2015-01-01

    A motif-network search scheme is proposed to study the crystal structures of the dilithium/disodium transition metal orthosilicates A2MSiO4. Using this fast and efficient method, the structures of all six combinations with A = Li or Na and M = Mn, Fe or Co were extensively explored. In addition to finding all previously reported structures, we discovered many other different crystal structures which are highly degenerate in energy. These tetrahedral-network-based structures can be classified into 1D, 2D and 3D types based on M-Si-O frames. A clear trend of the structural preference in different systems was revealed and possible indicators that affect the structure stabilities were introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs.

  4. Fast motif-network scheme for extensive exploration of complex crystal structures in silicate cathodes

    Science.gov (United States)

    Ho, Kai-Ming; Zhao, Xin; Wu, Shunqing; Lv, Xiaobao; Nguyen, Manh Cuong; Wang, Cai-Zhuang; Lin, Zijing; Zhu, Zi-Zhong

    2015-03-01

    A motif-network search scheme is proposed to study the crystal structures of the dilithium/disodium transition metal orthosilicates A2MSiO4. Using this fast and efficient method, the structures of all six combinations with A = Li or Na and M = Mn, Fe or Co were extensively explored in this work. In addition to finding all previously reported experimental structures, we discover many other different crystal structures which are highly degenerate in energy. These tetrahedral-network-based structures can be classified into 1D, 2D and 3D types. A clear trend of the structural preference in different systems is revealed and possible indicators that affect the structure stabilities are introduced. For the case of Na systems which have been much less investigated in the literature relative to the Li systems, we predicted their ground state structures and found evidence for the existence of new structural motifs.

  5. Structure and mechanical characterization of DNA i-motif nanowires by molecular dynamics simulation

    CERN Document Server

    Singh, Raghvendra Pratap; Cleri, Fabrizio

    2013-01-01

    We studied the structure and mechanical properties of DNA i-motif nanowires by means of molecular dynamics computer simulations. We built up to 230 nm long nanowires, based on a repeated TC5 sequence from crystallographic data, fully relaxed and equilibrated in water. The unusual stacked C*C+ stacked structure, formed by four ssDNA strands arranged in an intercalated tetramer, is here fully characterized both statically and dynamically. By applying stretching, compression and bending deformation with the steered molecular dynamics and umbrella sampling methods, we extract the apparent Young's and bending moduli of the nanowire, as wel as estimates for the tensile strength and persistence length. According to our results, the i-motif nanowire shares similarities with structural proteins, as far as its tensile stiffness, but is closer to nucleic acids and flexible proteins, as far as its bending rigidity is concerned. Furthermore, thanks to its very thin cross section, the apparent tensile toughness is close to...

  6. Native characterization of nucleic acid motif thermodynamics via non-covalent catalysis.

    Science.gov (United States)

    Wang, Chunyan; Bae, Jin H; Zhang, David Yu

    2016-01-01

    DNA hybridization thermodynamics is critical for accurate design of oligonucleotides for biotechnology and nanotechnology applications, but parameters currently in use are inaccurately extrapolated based on limited quantitative understanding of thermal behaviours. Here, we present a method to measure the ΔG° of DNA motifs at temperatures and buffer conditions of interest, with significantly better accuracy (6- to 14-fold lower s.e.) than prior methods. The equilibrium constant of a reaction with thermodynamics closely approximating that of a desired motif is numerically calculated from directly observed reactant and product equilibrium concentrations; a DNA catalyst is designed to accelerate equilibration. We measured the ΔG° of terminal fluorophores, single-nucleotide dangles and multinucleotide dangles, in temperatures ranging from 10 to 45 °C.

  7. A Review of Protein-DNA Binding Motif using Association Rule Mining

    Directory of Open Access Journals (Sweden)

    Virendra Kumar Tripathi,

    2013-04-01

    Full Text Available Thesurvival of gene regulation and lifemechanisms is pre-request of finding unknownpattern oftranscription factor binding sites. Thediscovery motif of gene regulation inbioinformaticsis challenging jobs for getting relation betweentranscription factors and transcription factorbinding sites. The increasing size and length ofstring pattern of motif is issued a problem related tomodeling and optimization of gene selectionprocess. In this paper we give a survey of protein-DNA binding using association rule mining.Association rule mining well knowndata miningtechnique for pattern analysis. The capability ofnegative and positive pattern generation help fullfordiscoveringof new pattern in DNA bindingbioinformatics data. The other data miningapproach such as clustering and classification alsoapplied the process of gene selection grouping forknown and unknown pattern. But faced a problemof valid string of DNA data, the rule miningprinciple find a better relation between transcriptionfactors and transcription factor binding sites.

  8. Stabilizing Motifs in Autonomous Boolean Networks and the Yeast Cell Cycle Oscillator

    Science.gov (United States)

    Sevim, Volkan; Gong, Xinwei; Socolar, Joshua

    2009-03-01

    Synchronously updated Boolean networks are widely used to model gene regulation. Some properties of these model networks are known to be artifacts of the clocking in the update scheme. Autonomous updating is a less artificial scheme that allows one to introduce small timing perturbations and study stability of the attractors. We argue that the stabilization of a limit cycle in an autonomous Boolean network requires a combination of motifs such as feed-forward loops and auto-repressive links that can correct small fluctuations in the timing of switching events. A recently published model of the transcriptional cell-cycle oscillator in yeast contains the motifs necessary for stability under autonomous updating [1]. [1] D. A. Orlando, et al. Nature (London), 4530 (7197):0 944--947, 2008.

  9. Crystal structure and functional characterization of a light-driven chloride pump having an NTQ motif.

    Science.gov (United States)

    Kim, Kuglae; Kwon, Soon-Kyeong; Jun, Sung-Hoon; Cha, Jeong Seok; Kim, Hoyoung; Lee, Weontae; Kim, Jihyun F; Cho, Hyun-Soo

    2016-01-01

    A novel light-driven chloride-pumping rhodopsin (ClR) containing an 'NTQ motif' in its putative ion conduction pathway has been discovered and functionally characterized in a genomic analysis study of a marine bacterium. Here we report the crystal structure of ClR from the flavobacterium Nonlabens marinus S1-08(T) determined under two conditions at 2.0 and 1.56 Å resolutions. The structures reveal two chloride-binding sites, one around the protonated Schiff base and the other on a cytoplasmic loop. We identify a '3 omega motif' formed by three non-consecutive aromatic amino acids that is correlated with the B-C loop orientation. Detailed ClR structural analyses with functional studies in E. coli reveal the chloride ion transduction pathway. Our results help understand the molecular mechanism and physiological role of ClR and provide a structural basis for optogenetic applications. PMID:27554809

  10. Organelle RNA recognition motif-containing (ORRM) proteins are plastid and mitochondrial editing factors in Arabidopsis.

    Science.gov (United States)

    Shi, Xiaowen; Bentolila, Stephane; Hanson, Maureen R

    2016-05-01

    Post-transcriptional C-to-U RNA editing occurs at specific sites in plastid and plant mitochondrial transcripts. Members of the Arabidopsis pentatricopeptide repeat (PPR) motif-containing protein family and RNA-editing factor Interacting Protein (RIP, also known as MORF) family have been characterized as essential components of the RNA editing apparatus. Recent studies reveal that several organelle-targeted RNA recognition motif (RRM)-containing proteins are involved in either plastid or mitochondrial RNA editing. ORRM1 (Organelle RRM protein 1) is essential for plastid editing, whereas ORRM2, ORRM3 and ORRM4 are involved in mitochondrial RNA editing. The RRM domain of ORRM1, ORRM3 and ORRM4 is required for editing activity, whereas the auxiliary RIP and Glycine-Rich (GR) domains mediate the ORRM proteins' interactions with other editing factors. The identification of the ORRM proteins as RNA editing factors further expands our knowledge of the composition of the editosome. PMID:27082488

  11. Thymoproteasomes produce unique peptide motifs for positive selection of CD8(+) T cells.

    Science.gov (United States)

    Sasaki, Katsuhiro; Takada, Kensuke; Ohte, Yuki; Kondo, Hiroyuki; Sorimachi, Hiroyuki; Tanaka, Keiji; Takahama, Yousuke; Murata, Shigeo

    2015-01-01

    Positive selection in the thymus provides low-affinity T-cell receptor (TCR) engagement to support the development of potentially useful self-major histocompatibility complex class I (MHC-I)-restricted T cells. Optimal positive selection of CD8(+) T cells requires cortical thymic epithelial cells that express β5t-containing thymoproteasomes (tCPs). However, how tCPs govern positive selection is unclear. Here we show that the tCPs produce unique cleavage motifs in digested peptides and in MHC-I-associated peptides. Interestingly, MHC-I-associated peptides carrying these tCP-dependent motifs are enriched with low-affinity TCR ligands that efficiently induce the positive selection of functionally competent CD8(+) T cells in antigen-specific TCR-transgenic models. These results suggest that tCPs contribute to the positive selection of CD8(+) T cells by preferentially producing low-affinity TCR ligand peptides.

  12. Spectral Barcoding of Quantum Dots: Deciphering Structural Motifs from the Excitonic Spectra

    International Nuclear Information System (INIS)

    Self-assembled semiconductor quantum dots (QDs) show in high-resolution single-dot spectra a multitude of sharp lines, resembling a barcode, due to various neutral and charged exciton complexes. Here we propose the 'spectral barcoding' method that deciphers structural motifs of dots by using such barcode as input to an artificial-intelligence learning system. Thus, we invert the common practice of deducing spectra from structure by deducing structure from spectra. This approach (i) lays the foundation for building a much needed structure-spectra understanding for large nanostructures and (ii) can guide future design of desired optical features of QDs by controlling during growth only those structural motifs that decide given optical features.

  13. Target motifs affecting natural immunity by a constitutive CRISPR-Cas system in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Cristóbal Almendros

    Full Text Available Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR and CRISPR associated (cas genes conform the CRISPR-Cas systems of various bacteria and archaea and produce degradation of invading nucleic acids containing sequences (protospacers that are complementary to repeat intervening spacers. It has been demonstrated that the base sequence identity of a protospacer with the cognate spacer and the presence of a protospacer adjacent motif (PAM influence CRISPR-mediated interference efficiency. By using an original transformation assay with plasmids targeted by a resident spacer here we show that natural CRISPR-mediated immunity against invading DNA occurs in wild type Escherichia coli. Unexpectedly, the strongest activity is observed with protospacer adjoining nucleotides (interference motifs that differ from the PAM both in sequence and location. Hence, our results document for the first time native CRISPR activity in E. coli and demonstrate that positions next to the PAM in invading DNA influence their recognition and degradation by these prokaryotic immune systems.

  14. De Novo Discovery of Structured ncRNA Motifs in Genomic Sequences

    DEFF Research Database (Denmark)

    Ruzzo, Walter L; Gorodkin, Jan

    2014-01-01

    De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphas...... on an approach based on the CMfinder CMfinder program as a case study. Applications to genomic screens for novel de novo structured ncRNA ncRNA s, including structured RNA elements in untranslated portions of protein-coding genes, are presented.......De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphasis...

  15. The nitrogen responsive transcriptome in potato (Solanum tuberosum L.) reveals significant gene regulatory motifs.

    Science.gov (United States)

    Gálvez, José Héctor; Tai, Helen H; Lagüe, Martin; Zebarth, Bernie J; Strömvik, Martina V

    2016-01-01

    Nitrogen (N) is the most important nutrient for the growth of potato (Solanum tuberosum L.). Foliar gene expression in potato plants with and without N supplementation at 180 kg N ha(-1) was compared at mid-season. Genes with consistent differences in foliar expression due to N supplementation over three cultivars and two developmental time points were examined. In total, thirty genes were found to be over-expressed and nine genes were found to be under-expressed with supplemented N. Functional relationships between over-expressed genes were found. The main metabolic pathway represented among differentially expressed genes was amino acid metabolism. The 1000 bp upstream flanking regions of the differentially expressed genes were analysed and nine overrepresented motifs were found using three motif discovery algorithms (Seeder, Weeder and MEME). These results point to coordinated gene regulation at the transcriptional level controlling steady state potato responses to N sufficiency. PMID:27193058

  16. Discovery of widespread GTP-binding motifs in genomic DNA and RNA.

    Science.gov (United States)

    Curtis, Edward A; Liu, David R

    2013-04-18

    Biological RNAs that bind small molecules have been implicated in a variety of regulatory and catalytic processes. Inspired by these examples, we used in vitro selection to search a pool of genome-encoded RNA fragments for naturally occurring GTP aptamers. Several aptamer classes were identified, including one (the "G motif") with a G-quadruplex structure. Further analysis revealed that most RNA and DNA G-quadruplexes bind GTP. The G motif is abundant in eukaryotes, and the human genome contains ~75,000 examples with dissociation constants comparable to the GTP concentration of a eukaryotic cell (~300 μM). G-quadruplexes play roles in diverse cellular processes, and our findings raise the possibility that GTP may play a role in the function of these elements. Consistent with this possibility, the sequence requirements of several classes of regulatory G-quadruplexes parallel those of GTP binding.

  17. Rapid Identification of Protein Kinase Phosphorylation Site Motifs Using Combinatorial Peptide Libraries.

    Science.gov (United States)

    Miller, Chad J; Turk, Benjamin E

    2016-01-01

    Eukaryotic protein kinases phosphorylate substrates at serine, threonine, and tyrosine residues that fall within the context of short sequence motifs. Knowing the phosphorylation site motif for a protein kinase facilitates designing substrates for kinase assays and mapping phosphorylation sites in protein substrates. Here, we describe an arrayed peptide library protocol for rapidly determining kinase phosphorylation consensus sequences. This method uses a set of peptide mixtures in which each of the 20 amino acid residues is systematically substituted at nine positions surrounding a central site of phosphorylation. Peptide mixtures are arrayed in multiwell plates and analyzed by radiolabel assay with the kinase of interest. The preferred sequence is determined from the relative rate of phosphorylation of each peptide in the array. Consensus peptides based on these sequences typically serve as efficient and specific kinase substrates for high-throughput screening or incorporation into biosensors.

  18. Research data supporting "Extending motifs in lithiocuprate chemistry: unexpected structural diversity in thiocyanate complexes"

    OpenAIRE

    Peel, Andrew J.; Hedidi, Madani; Ghenia, Bentabed-Ababsa; Thierry, Roisnel; Mongin, Florence; Wheatley, Andrew E. H.

    2015-01-01

    Supporting data for the publication "Extending motifs in lithiocuprate chemistry: unexpected structural diversity in thiocyanate complexes". Single crystal X-ray diffraction structures and NMR spectroscopic data for complexes 8-11 in the paper. All four complexes were made at Cambridge in 2015 and were characterized in house in 2015. NMR spectra for compounds 15 and 16. These were made and characterized in Rennes in 2015.

  19. Discovery of sequence motifs related to coexpression of genes using evolutionary computation

    OpenAIRE

    Fogel, Gary B.; Weekes, Dana G.; Varga, Gabor; Dow, Ernst R.; Harlow, Harry B.; Onyia, Jude E.; Su, Chen

    2004-01-01

    Transcription factors are key regulatory elements that control gene expression. Recognition of transcription factor binding site (TFBS) motifs in the upstream region of coexpressed genes is therefore critical towards a true understanding of the regulations of gene expression. The task of discovering eukaryotic TFBSs remains a challenging problem. Here, we demonstrate that evolutionary computation can be used to search for TFBSs in upstream regions of genes known to be coexpressed. Evolutionar...

  20. Matching of structural motifs using hashing on residue labels and geometric filtering for protein function prediction.

    Science.gov (United States)

    Moll, Mark; Kavraki, Lydia E

    2008-01-01

    There is an increasing number of proteins with known structure but unknown function. Determining their function would have a significant impact on understanding diseases and designing new therapeutics. However, experimental protein function determination is expensive and very time-consuming. Computational methods can facilitate function determination by identifying proteins that have high structural and chemical similarity. Our focus is on methods that determine binding site similarity. Although several such methods exist, it still remains a challenging problem to quickly find all functionally-related matches for structural motifs in large data sets with high specificity. In this context, a structural motif is a set of 3D points annotated with physicochemical information that characterize a molecular function. We propose a new method called LabelHash that creates hash tables of n-tuples of residues for a set of targets. Using these hash tables, we can quickly look up partial matches to a motif and expand those matches to complete matches. We show that by applying only very mild geometric constraints we can find statistically significant matches with extremely high specificity in very large data sets and for very general structural motifs. We demonstrate that our method requires a reasonable amount of storage when employing a simple geometric filter and further improves on the specificity of our previous work while maintaining very high sensitivity. Our algorithm is evaluated on 20 homolog classes and a non-redundant version of the Protein Data Bank as our background data set. We use cluster analysis to analyze why certain classes of homologs are more difficult to classify than others. The LabelHash algorithm is implemented on a web server at http://kavrakilab.org/labelhash/.

  1. An evolutionary analysis of flightin reveals a conserved motif unique and widespread in Pancrustacea.

    Science.gov (United States)

    Soto-Adames, Felipe N; Alvarez-Ortiz, Pedro; Vigoreaux, Jim O

    2014-01-01

    Flightin is a thick filament protein that in Drosophila melanogaster is uniquely expressed in the asynchronous, indirect flight muscles (IFM). Flightin is required for the structure and function of the IFM and is indispensable for flight in Drosophila. Given the importance of flight acquisition in the evolutionary history of insects, here we study the phylogeny and distribution of flightin. Flightin was identified in 69 species of hexapods in classes Collembola (springtails), Protura, Diplura, and insect orders Thysanura (silverfish), Dictyoptera (roaches), Orthoptera (grasshoppers), Pthiraptera (lice), Hemiptera (true bugs), Coleoptera (beetles), Neuroptera (green lacewing), Hymenoptera (bees, ants, and wasps), Lepidoptera (moths), and Diptera (flies and mosquitoes). Flightin was also found in 14 species of crustaceans in orders Anostraca (water flea), Cladocera (brine shrimp), Isopoda (pill bugs), Amphipoda (scuds, sideswimmers), and Decapoda (lobsters, crabs, and shrimps). Flightin was not identified in representatives of chelicerates, myriapods, or any species outside Pancrustacea (Tetraconata, sensu Dohle). Alignment of amino acid sequences revealed a conserved region of 52 amino acids, referred herein as WYR, that is bound by strictly conserved tryptophan (W) and arginine (R) and an intervening sequence with a high content of tyrosines (Y). This motif has no homologs in GenBank or PROSITE and is unique to flightin and paraflightin, a putative flightin paralog identified in decapods. A third motif of unclear affinities to pancrustacean WYR was observed in chelicerates. Phylogenetic analysis of amino acid sequences of the conserved motif suggests that paraflightin originated before the divergence of amphipods, isopods, and decapods. We conclude that flightin originated de novo in the ancestor of Pancrustacea > 500 MYA, well before the divergence of insects (~400 MYA) and the origin of flight (~325 MYA), and that its IFM-specific function in Drosophila is a more

  2. Reusable amine-based structural motifs for green house gas (CO2) fixation.

    Science.gov (United States)

    Dalapati, Sasanka; Jana, Sankar; Saha, Rajat; Alam, Md Akhtarul; Guchhait, Nikhil

    2012-07-01

    A series of compounds with an amine based structural motif (ASM) have been synthesized for efficient atmospheric CO(2) fixation. The H-bonded ASM-bicarbonate complexes were formed with an in situ generated HCO(3)(-) ion. The complexes have been characterized by IR, (13)C NMR, and X-ray single-crystal structural analysis. ASM-bicarbonate salts have been converted to pure ASMs in quantitative yield under mild conditions for recycling processes.

  3. Molecularly Defined Nanostructures Based on a Novel AAA-DDD Triple Hydrogen-Bonding Motif.

    Science.gov (United States)

    Papmeyer, Marcus; Vuilleumier, Clément A; Pavan, Giovanni M; Zhurov, Konstantin O; Severin, Kay

    2016-01-26

    A facile and flexible method for the synthesis of a new AAA-DDD triple hydrogen-bonding motif is described. Polytopic supramolecular building blocks with precisely oriented AAA and DDD groups are thus accessible in few steps. These building blocks were used for the assembly of large macrocycles featuring four AAA-DDD interactions and a macrobicyclic complex with a total of six AAA-DDD interactions.

  4. Coregulator control of androgen receptor action by a novel nuclear receptor-binding motif.

    Science.gov (United States)

    Jehle, Katja; Cato, Laura; Neeb, Antje; Muhle-Goll, Claudia; Jung, Nicole; Smith, Emmanuel W; Buzon, Victor; Carbó, Laia R; Estébanez-Perpiñá, Eva; Schmitz, Katja; Fruk, Ljiljana; Luy, Burkhard; Chen, Yu; Cox, Marc B; Bräse, Stefan; Brown, Myles; Cato, Andrew C B

    2014-03-28

    The androgen receptor (AR) is a ligand-activated transcription factor that is essential for prostate cancer development. It is activated by androgens through its ligand-binding domain (LBD), which consists predominantly of 11 α-helices. Upon ligand binding, the last helix is reorganized to an agonist conformation termed activator function-2 (AF-2) for coactivator binding. Several coactivators bind to the AF-2 pocket through conserved LXXLL or FXXLF sequences to enhance the activity of the receptor. Recently, a small compound-binding surface adjacent to AF-2 has been identified as an allosteric modulator of the AF-2 activity and is termed binding function-3 (BF-3). However, the role of BF-3 in vivo is currently unknown, and little is understood about what proteins can bind to it. Here we demonstrate that a duplicated GARRPR motif at the N terminus of the cochaperone Bag-1L functions through the BF-3 pocket. These findings are supported by the fact that a selective BF-3 inhibitor or mutations within the BF-3 pocket abolish the interaction between the GARRPR motif(s) and the BF-3. Conversely, amino acid exchanges in the two GARRPR motifs of Bag-1L can impair the interaction between Bag-1L and AR without altering the ability of Bag-1L to bind to chromatin. Furthermore, the mutant Bag-1L increases androgen-dependent activation of a subset of AR targets in a genome-wide transcriptome analysis, demonstrating a repressive function of the GARRPR/BF-3 interaction. We have therefore identified GARRPR as a novel BF-3 regulatory sequence important for fine-tuning the activity of the AR.

  5. Phosphorylation of PTEN at STT motif is associated with DNA damage response

    Energy Technology Data Exchange (ETDEWEB)

    Misra, Sandip; Mukherjee, Ananda; Karmakar, Parimal, E-mail: pkarmakar_28@yahoo.co.in

    2014-12-15

    Highlights: • Phosphorylation PTEN at the C-terminal STT motif is necessary for DNA repair. • DNA damage induces phosphorylation of STT motif of PTEN. • Phospho-PTEN translocates to nucleus after DNA damage. • Phospho-PTEN forms nuclear foci after DNA damage which co localized with γH2AX. - Abstract: Phosphatase and tensin homolog deleted on chromosome Ten (PTEN), a tumor suppressor protein participates in multiple cellular activities including DNA repair. In this work we found a relationship between phosphorylation of carboxy (C)-terminal STT motif of PTEN and DNA damage response. Ectopic expression of C-terminal phospho-mutants of PTEN, in PTEN deficient human glioblastoma cells, U87MG, resulted in reduced viability and DNA repair after etoposide induced DNA damage compared to cells expressing wild type PTEN. Also, after etoposide treatment phosphorylation of PTEN increased at C-terminal serine 380 and threonine 382/383 residues in PTEN positive HEK293T cells and wild type PTEN transfected U87MG cells. One-step further, DNA damage induced phosphorylation of PTEN was confirmed by immunoprecipitation of total PTEN from cellular extract followed by immunobloting with phospho-specific PTEN antibodies. Additionally, phospho-PTEN translocated to nucleus after etoposide treatment as revealed by indirect immunolabeling. Further, phosphorylation dependent nuclear foci formation of PTEN was observed after ionizing radiation or etoposide treatment which colocalized with γH2AX. Additionally, etoposide induced γH2AX, Mre11 and Ku70 foci persisted for a longer period of times in U87MG cells after ectopic expression of PTEN C-terminal phospho-mutant constructs compared to wild type PTEN expressing cells. Thus, our findings strongly suggest that DNA damage induced phosphorylation of C-terminal STT motif of PTEN is necessary for DNA repair.

  6. Systematic discovery of linear binding motifs targeting an ancient protein interaction surface on MAP kinases.

    Science.gov (United States)

    Zeke, András; Bastys, Tomas; Alexa, Anita; Garai, Ágnes; Mészáros, Bálint; Kirsch, Klára; Dosztányi, Zsuzsanna; Kalinina, Olga V; Reményi, Attila

    2015-11-01

    Mitogen-activated protein kinases (MAPK) are broadly used regulators of cellular signaling. However, how these enzymes can be involved in such a broad spectrum of physiological functions is not understood. Systematic discovery of MAPK networks both experimentally and in silico has been hindered because MAPKs bind to other proteins with low affinity and mostly in less-characterized disordered regions. We used a structurally consistent model on kinase-docking motif interactions to facilitate the discovery of short functional sites in the structurally flexible and functionally under-explored part of the human proteome and applied experimental tools specifically tailored to detect low-affinity protein-protein interactions for their validation in vitro and in cell-based assays. The combined computational and experimental approach enabled the identification of many novel MAPK-docking motifs that were elusive for other large-scale protein-protein interaction screens. The analysis produced an extensive list of independently evolved linear binding motifs from a functionally diverse set of proteins. These all target, with characteristic binding specificity, an ancient protein interaction surface on evolutionarily related but physiologically clearly distinct three MAPKs (JNK, ERK, and p38). This inventory of human protein kinase binding sites was compared with that of other organisms to examine how kinase-mediated partnerships evolved over time. The analysis suggests that most human MAPK-binding motifs are surprisingly new evolutionarily inventions and newly found links highlight (previously hidden) roles of MAPKs. We propose that short MAPK-binding stretches are created in disordered protein segments through a variety of ways and they represent a major resource for ancient signaling enzymes to acquire new regulatory roles. PMID:26538579

  7. A novel fibronectin binding motif in MSCRAMMs targets F3 modules.

    Directory of Open Access Journals (Sweden)

    Sabitha Prabhakaran

    Full Text Available BACKGROUND: BBK32 is a surface expressed lipoprotein and fibronectin (Fn-binding microbial surface component recognizing adhesive matrix molecule (MSCRAMM of Borrelia burgdorferi, the causative agent of Lyme disease. Previous studies from our group showed that BBK32 is a virulence factor in experimental Lyme disease and located the Fn-binding region to residues 21-205 of the lipoprotein. METHODOLOGY/PRINCIPAL FINDINGS: Studies aimed at identifying interacting sites between BBK32 and Fn revealed an interaction between the MSCRAMM and the Fn F3 modules. Further analysis of this interaction showed that BBK32 can cause the aggregation of human plasma Fn in a similar concentration-dependent manner to that of anastellin, the superfibronectin (sFn inducing agent. The resulting Fn aggregates are conformationally distinct from plasma Fn as indicated by a change in available thermolysin cleavage sites. Recombinant BBK32 and anastellin affect the structure of Fn matrices formed by cultured fibroblasts and inhibit endothelial cell proliferation similarly. Within BBK32, we have located the sFn-forming activity to a region between residues 160 and 175 which contains two sequence motifs that are also found in anastellin. Synthetic peptides mimicking these motifs induce Fn aggregation, whereas a peptide with a scrambled sequence motif was inactive, suggesting that these motifs represent the sFn-inducing sequence. CONCLUSIONS/SIGNIFICANCE: We conclude that BBK32 induces the formation of Fn aggregates that are indistinguishable from those formed by anastellin. The results of this study provide evidence for how bacteria can target host proteins to manipulate host cell activities.

  8. Identification of high-efficiency 3′GG gRNA motifs in indexed FASTA files with ngg2

    OpenAIRE

    Roberson, Elisha D.O.

    2015-01-01

    CRISPR/Cas9 is emerging as one of the most-used methods of genome modification in organisms ranging from bacteria to human cells. However, the efficiency of editing varies tremendously site-to-site. A recent report identified a novel motif, called the 3′GG motif, which substantially increases the efficiency of editing at all sites tested in C. elegans. Furthermore, they highlighted that previously published gRNAs with high editing efficiency also had this motif. I designed a python command-li...

  9. Variation in number of cagA EPIYA-C phosphorylation motifs between cultured Helicobacter pylori and biopsy strain DNA

    OpenAIRE

    Karlsson, Anneli; Ryberg, Anna; Nosouhi Dehnoei, Marjan; Borch, Kurt; Monstein, Hans-Jürg

    2012-01-01

    The Helicobacter pylori cagA gene encodes a cytotoxin which is activated by phosphorylation after entering the host epithelial cell. Phosphorylation occurs on specific tyrosine residues within EPIYA motifs in the variable 3'-region. Four different cagA EPIYA motifs have been defined according to the surrounding amino acid sequence; EPIYA-A, -B, -C and -D. Commonly, EPIYA-A and -B are followed by one or more EPIYA-C or -D motif. Due to observed discrepancies in cagA genotypes in cultured H. py...

  10. Fouille de données pour la stylistique : cas des motifs séquentiels émergents

    OpenAIRE

    Quiniou, Solen; Cellier, Peggy; Charnois, Thierry; Legallois, Dominique

    2012-01-01

    Dans cet article, nous présentons une étude sur l'utilisation de méthodes de fouille de données pour l'analyse stylistique - d'un point de vue linguistique - en considérant des motifs séquentiels émergents. Nous montrons tout d'abord que la fouille de motifs séquentiels de mots en utilisant la contrainte gap permet d'obtenir de nouveaux patrons linguistiques pertinents par rapport aux patrons construits à partir de n-grammes. Nous étudions ensuite l'utilisation de motifs séquentiels d'itemset...

  11. Motif analysis for small-number effects in chemical reaction dynamics

    Science.gov (United States)

    Saito, Nen; Sughiyama, Yuki; Kaneko, Kunihiko

    2016-09-01

    The number of molecules involved in a cell or subcellular structure is sometimes rather small. In this situation, ordinary macroscopic-level fluctuations can be overwhelmed by non-negligible large fluctuations, which results in drastic changes in chemical-reaction dynamics and statistics compared to those observed under a macroscopic system (i.e., with a large number of molecules). In order to understand how salient changes emerge from fluctuations in molecular number, we here quantitatively define small-number effect by focusing on a "mesoscopic" level, in which the concentration distribution is distinguishable both from micro- and macroscopic ones and propose a criterion for determining whether or not such an effect can emerge in a given chemical reaction network. Using the proposed criterion, we systematically derive a list of motifs of chemical reaction networks that can show small-number effects, which includes motifs showing emergence of the power law and the bimodal distribution observable in a mesoscopic regime with respect to molecule number. The list of motifs provided herein is helpful in the search for candidates of biochemical reactions with a small-number effect for possible biological functions, as well as for designing a reaction system whose behavior can change drastically depending on molecule number, rather than concentration.

  12. Identification and characterization of a leucine-rich repeat kinase 2 (LRRK2 consensus phosphorylation motif.

    Directory of Open Access Journals (Sweden)

    Pooja P Pungaliya

    Full Text Available Mutations in LRRK2 (leucine-rich repeat kinase 2 have been identified as major genetic determinants of Parkinson's disease (PD. The most prevalent mutation, G2019S, increases LRRK2's kinase activity, therefore understanding the sites and substrates that LRRK2 phosphorylates is critical to understanding its role in disease aetiology. Since the physiological substrates of this kinase are unknown, we set out to reveal potential targets of LRRK2 G2019S by identifying its favored phosphorylation motif. A non-biased screen of an oriented peptide library elucidated F/Y-x-T-x-R/K as the core dependent substrate sequence. Bioinformatic analysis of the consensus phosphorylation motif identified several novel candidate substrates that potentially function in neuronal pathophysiology. Peptides corresponding to the most PD relevant proteins were efficiently phosphorylated by LRRK2 in vitro. Interestingly, the phosphomotif was also identified within LRRK2 itself. Autophosphorylation was detected by mass spectrometry and biochemical means at the only F-x-T-x-R site (Thr 1410 within LRRK2. The relevance of this site was assessed by measuring effects of mutations on autophosphorylation, kinase activity, GTP binding, GTP hydrolysis, and LRRK2 multimerization. These studies indicate that modification of Thr1410 subtly regulates GTP hydrolysis by LRRK2, but with minimal effects on other parameters measured. Together the identification of LRRK2's phosphorylation consensus motif, and the functional consequences of its phosphorylation, provide insights into downstream LRRK2-signaling pathways.

  13. Prediction and experimental characterization of nsSNPs altering human PDZ-binding motifs.

    Directory of Open Access Journals (Sweden)

    David Gfeller

    Full Text Available Single nucleotide polymorphisms (SNPs are a major contributor to genetic and phenotypic variation within populations. Non-synonymous SNPs (nsSNPs modify the sequence of proteins and can affect their folding or binding properties. Experimental analysis of all nsSNPs is currently unfeasible and therefore computational predictions of the molecular effect of nsSNPs are helpful to guide experimental investigations. While some nsSNPs can be accurately characterized, for instance if they fall into strongly conserved or well annotated regions, the molecular consequences of many others are more challenging to predict. In particular, nsSNPs affecting less structured, and often less conserved regions, are difficult to characterize. Binding sites that mediate protein-protein or other protein interactions are an important class of functional sites on proteins and can be used to help interpret nsSNPs. Binding sites targeted by the PDZ modular peptide recognition domain have recently been characterized. Here we use this data to show that it is possible to computationally identify nsSNPs in PDZ binding motifs that modify or prevent binding to the proteins containing the motifs. We confirm these predictions by experimentally validating a selected subset with ELISA. Our work also highlights the importance of better characterizing linear motifs in proteins as many of these can be affected by genetic variations.

  14. The Mytholotical Motif of Entering the Underworld in Julio Cortázar's Novel Rayuela (Hopscotch

    Directory of Open Access Journals (Sweden)

    Agata Šega

    2015-04-01

    Full Text Available Twentieth-century literature frequently made use of classical mythology, and in Hispano-American literature especially Jorge Luis Borges and Octavio Paz come to mind in the regard, while Julio Cortázar also deserves mention. This paper aims to analyse from this perspective a few scenes from his novel Rayuela (Hopscotch. It will attempt to uncover the hidden meaning of seemingly quotidian events in the novel which, in addition to the direct and the superficial, contain an even deeper symbolic and archetypical meaning. Of primary interest are the motifs, actions, and characters in the novel which evoke the mythological theme of entering the underworld. This motif, which is closely linked with the motif of rising from the dead, is repeated in many classical myths and often appears in both older and contemporary literature. Relying on Carl Gustav Jung's theory, according to which mythological content represents innate and inherited forms of the human mind, the paper highlights those symbolic representations in Cortázar that are linked to mythological material and which are shown in a banal and trivial form in various chapters of the novel Hopscotch, especially in chapters 36 and 54. This is no coincidence, as it is precisely in these two places that the main protagonist, Cortázar's seeker, enters an initiation phase for development of his personality and with that commences the long journey to the other side which is in fact a Jungian journey to himself, to his own essence.

  15. LDSS-P: an advanced algorithm to extract functional short motifs associated with coordinated gene expression

    Science.gov (United States)

    Ichida, Hiroyuki; Long, Sharon R.

    2016-01-01

    Identifying functional elements in promoter sequences is a major goal in computational and experimental genome biology. Here, we describe an algorithm, Local Distribution of Short Sequences for Prokaryotes (LDSS-P), to identify conserved short motifs located at specific positions in the promoters of co-expressed prokaryotic genes. As a test case, we applied this algorithm to a symbiotic nitrogen-fixing bacterium, Sinorhizobium meliloti. The LDSS-P profiles that overlap with the 5′ section of the extracytoplasmic function RNA polymerase sigma factor RpoE2 consensus sequences displayed a sharp peak between -34 and -32 from TSS positions. The corresponding genes overlap significantly with RpoE2 targets identified from previous experiments. We further identified several groups of genes that are co-regulated with characterized marker genes. Our data indicate that in S. meliloti, and possibly in other Rhizobiaceae species, the master cell cycle regulator CtrA may recognize an expanded motif (AACCAT), which is positionally shifted from the previously reported CtrA consensus sequence in Caulobacter crescentus. Bacterial one-hybrid experiments showed that base substitution in the expanded motif either increase or decrease the binding by CtrA. These results show the effectiveness of LDSS-P as a method to delineate functional promoter elements. PMID:27190233

  16. Maximum-likelihood density modification using pattern recognition of structural motifs

    International Nuclear Information System (INIS)

    A likelihood-based density-modification method is extended to include pattern recognition of structural motifs. The likelihood-based approach to density modification [Terwilliger (2000 ▶), Acta Cryst. D56, 965–972] is extended to include the recognition of patterns of electron density. Once a region of electron density in a map is recognized as corresponding to a known structural element, the likelihood of the map is reformulated to include a term that reflects how closely the map agrees with the expected density for that structural element. This likelihood is combined with other aspects of the likelihood of the map, including the presence of a flat solvent region and the electron-density distribution in the protein region. This likelihood-based pattern-recognition approach was tested using the recognition of helical segments in a largely helical protein. The pattern-recognition method yields a substantial phase improvement over both conventional and likelihood-based solvent-flattening and histogram-matching methods. The method can potentially be used to recognize any common structural motif and incorporate prior knowledge about that motif into density modification

  17. “The Birds of Clay”: An Apocryphal Motif in Folklore Legends

    Directory of Open Access Journals (Sweden)

    Olga V. Belova

    2015-08-01

    Full Text Available The article describes the adaptation of the apocryphal Gospels motif—the revival of clay birds by Jesus—in the folk traditions of Eastern and Western Slavs. The texts of folk legends demonstrate not only the active inclusion of apocryphal motifs in oral narratives, but they also incorporate the motifs’ biblical contexts and they emphasize themes that are close to everyday life and that reflect local history. The folklore texts analyzed here are from different regions of the Slavic world (Russia, Ukraine, Belarus, and Poland; they allow us to conclude that the oral tradition has retained, with great stability, these fragments from medieval sources up to the present day. Moreover, it is interesting to note the different interpretations of the same motif in monuments of Christian and Jewish literature (apocryphal Gospels and the pamphlet Toledot Yeshu.The fairly large group of folk legends with apocryphal motifs, occurring in different Slavic traditions from the 19th to the 21st centuries, thus testifies not only to the continued relevance of the biblical plots for oral culture, but also to the importance of the Apocrypha for the broadcasting and preservation of biblical stories in the folk tradition.

  18. Host-targeting-motif Harbored Secretary Proteins in Genome of Plant Pathogenic Fungus Botrytis cinerea

    Institute of Scientific and Technical Information of China (English)

    Zhang Yue; Chen Zi-niu; Su Yuan; Yu Lei

    2012-01-01

    According to our previous study, saprophytic fungi Botrytis cinerea contained 579 predicted secretary proteins. Among them, we found that 122 of these proteins contained the highly conserved pathogenic-related host-targeting-motif RxLx within 100 residues adjacent to the signal peptide cleavage site. According to PEDNAT and COG of the GenBank database, the functions of this motif containing proteins included metabolism modification and cell secretion. We blasted them in GenBank and found 47.54% had highly conserved homologues in other species, among them 74.1% had putative functional domains. This suggests these proteins are presumably ancient and vertically transmitted within the species. Many of these domains belonged to proteins which played roles in the pathogenic process of other kinds of pathogens and some had already been proved to be pathogenic secretary proteins of Botrytis cinerea. So we postulated that proteins contained host-targeting-motif RxLx were candidates participating in the pathogenesis of Botrytis cinerea.

  19. Metal-binding and redox properties of substituted linear and cyclic ATCUN motifs.

    Science.gov (United States)

    Neupane, Kosh P; Aldous, Amanda R; Kritzer, Joshua A

    2014-10-01

    The amino-terminal copper and nickel binding (ATCUN) motif is a short peptide sequence found in human serum albumin and other proteins. Synthetic ATCUN-metal complexes have been used to oxidatively cleave proteins and DNA, cross-link proteins, and damage cancer cells. The ATCUN motif consists of a tripeptide that coordinates Cu(II) and Ni(II) ions in a square planar geometry, anchored by chelation sites at the N-terminal amine, histidine imidazole and two backbone amides. Many studies have shown that the histidine is required for tight binding and square planar geometry. Previously, we showed that macrocyclization of the ATCUN motif can lead to high-affinity binding with altered metal ion selectivity and enhanced Cu(II)/Cu(III) redox cycling (Inorg. Chem. 2013, 52, 2729-2735). In this work, we synthesize and characterize several linear and cyclic ATCUN variants to explore how substitutions at the histidine alter the metal-binding and catalytic properties. UV-visible spectroscopy, EPR spectroscopy and mass spectrometry indicate that cyclization can promote the formation of ATCUN-like complexes even in the absence of imidazole. We also report several novel ATCUN-like complexes and quantify their redox properties. These findings further demonstrate the effects of conformational constraints on short, metal-binding peptides, and also provide novel redox-active metallopeptides suitable for testing as catalysts for stereoselective or regioselective oxidation reactions.

  20. Nucleotide binding database NBDB – a collection of sequence motifs with specific protein-ligand interactions

    Science.gov (United States)

    Zheng, Zejun; Goncearenco, Alexander; Berezovsky, Igor N.

    2016-01-01

    NBDB database describes protein motifs, elementary functional loops (EFLs) that are involved in binding of nucleotide-containing ligands and other biologically relevant cofactors/coenzymes, including ATP, AMP, ATP, GMP, GDP, GTP, CTP, PAP, PPS, FMN, FAD(H), NAD(H), NADP, cAMP, cGMP, c-di-AMP and c-di-GMP, ThPP, THD, F-420, ACO, CoA, PLP and SAM. The database is freely available online at http://nbdb.bii.a-star.edu.sg. In total, NBDB contains data on 249 motifs that work in interactions with 24 ligands. Sequence profiles of EFL motifs were derived de novo from nonredundant Uniprot proteome sequences. Conserved amino acid residues in the profiles interact specifically with distinct chemical parts of nucleotide-containing ligands, such as nitrogenous bases, phosphate groups, ribose, nicotinamide, and flavin moieties. Each EFL profile in the database is characterized by a pattern of corresponding ligand–protein interactions found in crystallized ligand–protein complexes. NBDB database helps to explore the determinants of nucleotide and cofactor binding in different protein folds and families. NBDB can also detect fragments that match to profiles of particular EFLs in the protein sequence provided by user. Comprehensive information on sequence, structures, and interactions of EFLs with ligands provides a foundation for experimental and computational efforts on design of required protein functions. PMID:26507856

  1. Nucleotide binding database NBDB--a collection of sequence motifs with specific protein-ligand interactions.

    Science.gov (United States)

    Zheng, Zejun; Goncearenco, Alexander; Berezovsky, Igor N

    2016-01-01

    NBDB database describes protein motifs, elementary functional loops (EFLs) that are involved in binding of nucleotide-containing ligands and other biologically relevant cofactors/coenzymes, including ATP, AMP, ATP, GMP, GDP, GTP, CTP, PAP, PPS, FMN, FAD(H), NAD(H), NADP, cAMP, cGMP, c-di-AMP and c-di-GMP, ThPP, THD, F-420, ACO, CoA, PLP and SAM. The database is freely available online at http://nbdb.bii.a-star.edu.sg. In total, NBDB contains data on 249 motifs that work in interactions with 24 ligands. Sequence profiles of EFL motifs were derived de novo from nonredundant Uniprot proteome sequences. Conserved amino acid residues in the profiles interact specifically with distinct chemical parts of nucleotide-containing ligands, such as nitrogenous bases, phosphate groups, ribose, nicotinamide, and flavin moieties. Each EFL profile in the database is characterized by a pattern of corresponding ligand-protein interactions found in crystallized ligand-protein complexes. NBDB database helps to explore the determinants of nucleotide and cofactor binding in different protein folds and families. NBDB can also detect fragments that match to profiles of particular EFLs in the protein sequence provided by user. Comprehensive information on sequence, structures, and interactions of EFLs with ligands provides a foundation for experimental and computational efforts on design of required protein functions. PMID:26507856

  2. Conformational preference of 'CαNN' short peptide motif towards recognition of anions.

    Directory of Open Access Journals (Sweden)

    Tridip Sheet

    Full Text Available Among several 'anion binding motifs', the recently described 'C(αNN' motif occurring in the loop regions preceding a helix, is conserved through evolution both in sequence and its conformation. To establish the significance of the conserved sequence and their intrinsic affinity for anions, a series of peptides containing the naturally occurring 'C(αNN' motif at the N-terminus of a designed helix, have been modeled and studied in a context free system using computational techniques. Appearance of a single interacting site with negative binding free-energy for both the sulfate and phosphate ions, as evidenced in docking experiments, establishes that the 'C(αNN' segment has an intrinsic affinity for anions. Molecular Dynamics (MD simulation studies reveal that interaction with anion triggers a conformational switch from non-helical to helical state at the 'C(αNN' segment, which extends the length of the anchoring-helix by one turn at the N-terminus. Computational experiments substantiate the significance of sequence/structural context and justify the conserved nature of the 'C(αNN' sequence for anion recognition through "local" interaction.

  3. Relative edge energy in the stability of transition metal nanoclusters of different motifs.

    Science.gov (United States)

    Zhao, X J; Xue, X L; Guo, Z X; Li, S F

    2016-07-01

    When a structure is reduced to a nanometer scale, the proportion of the edge atoms increases significantly, which can play a crucial role in determining both their geometric and electronic properties, as demonstrated by the recently established generalized Wulff construction principle [S. F. Li, et al., Phys. Rev. Lett., 2013, 111, 115501]. Consequently, it is of great interest to clarify quantitatively the role of the edge atoms that dominate the motifs of these nanostructures. In principle, establishing an effective method valid for determining the absolute value of the surface energy and particularly the edge energy for a given nanostructure is expected to resolve such a problem. However, hitherto, it is difficult to obtain the absolute edge energy of transition metal clusters, particularly when their sizes approach the nanometer regime. In this paper, taking Ru nanoclusters as a prototypical example, our first-principles calculations introduce the concept of relative edge energy (REE), reflecting the net edge atom effect over the surface (facet) atom effect, which is fairly powerful to quasi-quantitatively estimate the critical size at which the crossover occurs between different configurations of a given motif, such as from an icosahedron to an fcc nanocrystal. By contrast, the bulk effect should be re-considered to rationalize the power of the REE in predicting the relative stability of larger nanostructures between different motifs, such as fcc-like and hcp-like nanocrystals. PMID:27296770

  4. Coordination of platinum therapeutic agents to met-rich motifs of human copper transport protein1.

    Science.gov (United States)

    Crider, Sarah E; Holbrook, Robert J; Franz, Katherine J

    2010-01-01

    Platinum therapeutic agents are widely used in the treatment of several forms of cancer. Various mechanisms for the transport of the drugs have been proposed including passive diffusion across the cellular membrane and active transport via proteins. The copper transport protein Ctr1 is responsible for high affinity copper uptake but has also been implicated in the transport of cisplatin into cells. Human hCtr1 contains two methionine-rich Mets motifs on its extracellular N-terminus that are potential platinum-binding sites: the first one encompasses residues 7-14 with amino acid sequence Met-Gly-Met-Ser-Tyr-Met-Asp-Ser and the second one spans residues 39-46 with sequence Met-Met-Met-Met-Pro-Met-Thr-Phe. In these studies, we use liquid chromatography and mass spectrometry to compare the binding interactions between cisplatin, carboplatin and oxaliplatin with synthetic peptides corresponding to hCtr1 Mets motifs. The interactions of cisplatin and carboplatin with Met-rich motifs that contain three or more methionines result in removal of the carrier ligands of both platinum complexes. In contrast, oxaliplatin retains its cyclohexyldiamine ligand upon platinum coordination to the peptide.

  5. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun

    2015-06-11

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  6. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray.

    Science.gov (United States)

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

    2016-01-01

    Transcription factor binding sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, protein binding microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8∼10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build TFBS (also known as DNA motif) models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement if choosing di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  7. PH motifs in PAR1&2 endow breast cancer growth.

    Science.gov (United States)

    Kancharla, A; Maoz, M; Jaber, M; Agranovich, D; Peretz, T; Grisaru-Granovsky, S; Uziely, B; Bar-Shavit, R

    2015-01-01

    Although emerging roles of protease-activated receptor1&2 (PAR1&2) in cancer are recognized, their underlying signalling events are poorly understood. Here we show signal-binding motifs in PAR1&2 that are critical for breast cancer growth. This occurs via the association of the pleckstrin homology (PH) domain with Akt/PKB as a key signalling event of PARs. Other PH-domain signal-proteins such as Etk/Bmx and Vav3 also associate with PAR1 and PAR2 through their PH domains. PAR1 and PAR2 bind with priority to Etk/Bmx. A point mutation in PAR2, H349A, but not in R352A, abrogates PH-protein association and is sufficient to markedly reduce PAR2-instigated breast tumour growth in vivo and placental extravillous trophoblast (EVT) invasion in vitro. Similarly, the PAR1 mutant hPar1-7A, which is unable to bind the PH domain, reduces mammary tumours and EVT invasion, endowing these motifs with physiological significance and underscoring the importance of these previously unknown PAR1 and PAR2 PH-domain-binding motifs in both pathological and physiological invasion processes. PMID:26600192

  8. Design of hyperthermophilic lipase chimeras by key motif-directed recombination.

    Science.gov (United States)

    Zhou, Xiaoli; Gao, Le; Yang, Guangyu; Liu, Donglai; Bai, Aixi; Li, Binchun; Deng, Zixin; Feng, Yan

    2015-02-01

    Recombination of diverse natural evolved domains within a superfamily offers greater opportunity for enzyme function leaps. How to recombine protein modules from distant parents with less disruption in cross-interfaces is a challenging issue. Here, we identified the existence of a key motif, the sequence VVSVN(D)YR, within a structural motif ψ loop in the α/β-hydrolase fold superfamily, by using a MEME server and the PROMOTIF program. To obtain thermostable lipase-like enzymes, two chimeras were engineered at the key motif regions through recombination of domains from a mesophilic lipase and a hyperthermophilic esterase/peptidase with amino acid identity less than 21 %. The chimeras retained the desirable substrate preference of their mesophilic parent and exhibited more than 100-fold increased thermostability at 50 °C. Through site-directed mutation, we further improved activity of the chimera by 4.6-fold. The recombination strategy presented here enables the creation of novel catalysts. PMID:25530200

  9. [A primary study of evolution of hepatitis B virus based on motif discovery].

    Science.gov (United States)

    Ma, Lei; Yi, Qing-Qing; Zhang, Qi; He, Jian-Feng

    2014-01-01

    Hepatitis B is a serious infectious disease worldwide, and hepatitis B virus (HBV) is the direct cause of this disease. In recent years, as an essential part of its evolutionary process, HBV mutation has been extensively studied domestically and globally. However, the study on the conserved sequences in HBV sequences is still in its infancy. In this study, we applied multiple EM for motif elicitation (MEME) algorithm to discover HBV motif and proposed a new metric, conservative index (CI), to carry out phylogenetic analysis based on HBV sequences. Then, the constructed phylogenetic tree was subjected to reliability assessment. The results demonstrated that the new metric CI combined with the MEME algorithm can effectively help to discover motifs in HBV sequences and construct a phylogenetic tree based on them and to analyze the evolutionary relationship between HBV sequences; in addition, the possible ancestral sequences of samples may be obtained by conservative analysis. The proposed method is valuable for the exploratory study on large HBV sequence data sets. PMID:24772892

  10. Automatic generation of 3D motifs for classification of protein binding sites

    Directory of Open Access Journals (Sweden)

    Herzyk Pawel

    2007-08-01

    Full Text Available Abstract Background Since many of the new protein structures delivered by high-throughput processes do not have any known function, there is a need for structure-based prediction of protein function. Protein 3D structures can be clustered according to their fold or secondary structures to produce classes of some functional significance. A recent alternative has been to detect specific 3D motifs which are often associated to active sites. Unfortunately, there are very few known 3D motifs, which are usually the result of a manual process, compared to the number of sequential motifs already known. In this paper, we report a method to automatically generate 3D motifs of protein structure binding sites based on consensus atom positions and evaluate it on a set of adenine based ligands. Results Our new approach was validated by generating automatically 3D patterns for the main adenine based ligands, i.e. AMP, ADP and ATP. Out of the 18 detected patterns, only one, the ADP4 pattern, is not associated with well defined structural patterns. Moreover, most of the patterns could be classified as binding site 3D motifs. Literature research revealed that the ADP4 pattern actually corresponds to structural features which show complex evolutionary links between ligases and transferases. Therefore, all of the generated patterns prove to be meaningful. Each pattern was used to query all PDB proteins which bind either purine based or guanine based ligands, in order to evaluate the classification and annotation properties of the pattern. Overall, our 3D patterns matched 31% of proteins with adenine based ligands and 95.5% of them were classified correctly. Conclusion A new metric has been introduced allowing the classification of proteins according to the similarity of atomic environment of binding sites, and a methodology has been developed to automatically produce 3D patterns from that classification. A study of proteins binding adenine based ligands showed that

  11. A phosphoserine/threonine-binding pocket in AGC kinases and PDK1 mediates activation by hydrophobic motif phosphorylation

    DEFF Research Database (Denmark)

    Frödin, Morten; Antal, Torben L; Dümmler, Bettina A;

    2002-01-01

    The growth factor-activated AGC protein kinases RSK, S6K, PKB, MSK and SGK are activated by serine/threonine phosphorylation in the activation loop and in the hydrophobic motif, C-terminal to the kinase domain. In some of these kinases, phosphorylation of the hydrophobic motif creates a specific...... docking site that recruits and activates PDK1, which then phosphorylates the activation loop. Here, we discover a pocket in the kinase domain of PDK1 that recognizes the phosphoserine/phosphothreonine in the hydrophobic motif by identifying two oppositely positioned arginine and lysine residues that bind...... the phosphate. Moreover, we demonstrate that RSK2, S6K1, PKBalpha, MSK1 and SGK1 contain a similar phosphate-binding pocket, which they use for intramolecular interaction with their own phosphorylated hydrophobic motif. Molecular modelling and experimental data provide evidence for a common activation mechanism...

  12. Pivotal Role of the C-terminal DW-motif in Mediating Inhibition of Pyruvate Dehydrogenase Kinase 2 by Dichloroacetate*

    OpenAIRE

    Li, Jun; Kato, Masato; Chuang, David T.

    2009-01-01

    The mitochondrial pyruvate dehydrogenase complex (PDC) is down-regulated by phosphorylation catalyzed by pyruvate dehydrogenase kinase (PDK) isoforms 1–4. Overexpression of PDK isoforms and therefore reduced PDC activity prevails in cancer and diabetes. In the present study, we investigated the role of the invariant C-terminal DW-motif in inhibition of human PDK2 by dichloroacetate (DCA). Substitutions were made in the DW-motif (Asp-382 and Trp-383) and its interacting residues (Tyr-145 and A...

  13. Block of Brain Sodium Channels by Peptide Mimetics of the Isoleucine, Phenylalanine, and Methionine (IFM) Motif from the Inactivation Gate

    OpenAIRE

    Eaholtz, Galen; Colvin, Anita; Leonard, Daniele; Taylor, Charles(8 Cherryl House, Seymour Gardens, Sutton Coldfield, West Midlands, B74 4ST, U.K.); Catterall, William A.

    1999-01-01

    Inactivation of sodium channels is thought to be mediated by an inactivation gate formed by the intracellular loop connecting domains III and IV. A hydrophobic motif containing the amino acid sequence isoleucine, phenylalanine, and methionine (IFM) is required for the inactivation process. Peptides containing the IFM motif, when applied to the cytoplasmic side of these channels, produce two types of block: fast block, which resembles the inactivation process, and slow, use-dependent block sti...

  14. The special neuraminidase stalk-motif responsible for increased virulence and pathogenesis of H5N1 influenza A virus.

    Directory of Open Access Journals (Sweden)

    Hongbo Zhou

    Full Text Available The variation of highly pathogenic avian influenza H5N1 virus results in gradually increased virulence in poultry, and human cases continue to accumulate. The neuraminidase (NA stalk region of influenza virus varies considerably and may associate with its virulence. The NA stalk region of all N1 subtype influenza A viruses can be divided into six different stalk-motifs, H5N1/2004-like (NA-wt, WSN-like, H5N1/97-like, PR/8-like, H7N1/99-like and H5N1/96-like. The NA-wt is a special NA stalk-motif which was first observed in H5N1 influenza virus in 2000, with a 20-amino acid deletion in the 49(th to 68(th positions of the stalk region. Here we show that there is a gradual increase of the special NA stalk-motif in H5N1 isolates from 2000 to 2007, and notably, the special stalk-motif is observed in all 173 H5N1 human isolates from 2004 to 2007. The recombinant H5N1 virus with the special stalk-motif possesses the highest virulence and pathogenicity in chicken and mice, while the recombinant viruses with the other stalk-motifs display attenuated phenotype. This indicates that the special stalk-motif has contributed to the high virulence and pathogenicity of H5N1 isolates since 2000. The gradually increasing emergence of the special NA stalk-motif in H5N1 isolates, especially in human isolates, deserves attention by all.

  15. Divergent evolution of a beta/alpha-barrel subclass: detection of numerous phosphate-binding sites by motif search.

    OpenAIRE

    Bork, P.; Gellerich, J.; Groth, H.; Hooft, R.; Martin, F.

    1995-01-01

    Study of the most conserved region in many beta/alpha-barrels, the phosphate-binding site, revealed a sequence motif in a few beta/alpha-barrels with known tertiary structure, namely glycolate oxidase (GOX), cytochrome b2 (Cyb2), tryptophan synthase alpha subunit (TrpA), and the indoleglycerolphosphate synthase (TrpC). Database searches identified this motif in numerous other enzyme families: (1) IMP dehydrogenase (IMPDH) and GMP reductase (GuaC); (2) phosphoribosylformimino-5-aminoimidazol c...

  16. Interaction of Individual Structural Domains of hnRNP LL with the BCL2 Promoter i-Motif DNA.

    Science.gov (United States)

    Roy, Basab; Talukder, Poulami; Kang, Hyun-Jin; Tsuen, Shujian S; Alam, Mohammad P; Hurley, Laurence H; Hecht, Sidney M

    2016-08-31

    The recently discovered role of the BCL2 (B-cell lymphoma 2 gene) promoter i-motif DNA in modulation of gene expression via interaction with the ribonucleoprotein hnRNP L-like (hnRNP LL) has prompted a more detailed study of the nature of this protein-DNA interaction. The RNA recognition motifs (RRMs) of hnRNP LL were expressed individually, and both RRM1 and RRM2 were found to bind efficiently to the BCL2 i-motif DNA, as well as being critical for transcriptional activation, whereas RRM3-4 bound only weakly to this DNA. Binding was followed by unfolding of the DNA as monitored by changes in the CD spectrum. Mutational analysis of the i-motif DNA revealed that binding involved primarily the lateral loops of the i-motif. The kinetics of binding of the DNA with RRM1 was explored by recording CD spectra at predetermined times following admixture of the protein and DNA. The change in molar ellipticity was readily apparent after 30 s and largely complete within 1 min. A more detailed view of protein-DNA interaction was obtained by introducing the fluorescence donor 6-CNTrp in RRM1 at position 137, and the acceptor 4-aminobenzo[g]quinazoline-2-one (Cf) in lieu of cytidine22 in the i-motif DNA. The course of binding of the two species was monitored by FRET, which reflected a steady increase in energy transfer over a period of several minutes. The FRET signal could be diminished by the further addition of (unlabeled) RRM2, no doubt reflecting competition for binding to the i-motif DNA. These experiments using the individual RRM domains from hnRNP LL confirm the role of this transcription factor in activation of BCL2 transcription via the i-motif in the promoter element. PMID:27483029

  17. Transient α-helices in the disordered RPEL motifs of the serum response factor coactivator MKL1

    Science.gov (United States)

    Mizuguchi, Mineyuki; Fuju, Takahiro; Obita, Takayuki; Ishikawa, Mitsuru; Tsuda, Masaaki; Tabuchi, Akiko

    2014-06-01

    The megakaryoblastic leukemia 1 (MKL1) protein functions as a transcriptional coactivator of the serum response factor. MKL1 has three RPEL motifs (RPEL1, RPEL2, and RPEL3) in its N-terminal region. MKL1 binds to monomeric G-actin through RPEL motifs, and the dissociation of MKL1 from G-actin promotes the translocation of MKL1 to the nucleus. Although structural data are available for RPEL motifs of MKL1 in complex with G-actin, the structural characteristics of RPEL motifs in the free state have been poorly defined. Here we characterized the structures of free RPEL motifs using NMR and CD spectroscopy. NMR and CD measurements showed that free RPEL motifs are largely unstructured in solution. However, NMR analysis identified transient α-helices in the regions where helices α1 and α2 are induced upon binding to G-actin. Proline mutagenesis showed that the transient α-helices are locally formed without helix-helix interactions. The helix content is higher in the order of RPEL1, RPEL2, and RPEL3. The amount of preformed structure may correlate with the binding affinity between the intrinsically disordered protein and its target molecule.

  18. Export of malaria proteins requires co-translational processing of the PEXEL motif independent of phosphatidylinositol-3-phosphate binding.

    Science.gov (United States)

    Boddey, Justin A; O'Neill, Matthew T; Lopaticki, Sash; Carvalho, Teresa G; Hodder, Anthony N; Nebl, Thomas; Wawra, Stephan; van West, Pieter; Ebrahimzadeh, Zeinab; Richard, Dave; Flemming, Sven; Spielmann, Tobias; Przyborski, Jude; Babon, Jeff J; Cowman, Alan F

    2016-01-01

    Plasmodium falciparum exports proteins into erythrocytes using the Plasmodium export element (PEXEL) motif, which is cleaved in the endoplasmic reticulum (ER) by plasmepsin V (PMV). A recent study reported that phosphatidylinositol-3-phosphate (PI(3)P) concentrated in the ER binds to PEXEL motifs and is required for export independent of PMV, and that PEXEL motifs are functionally interchangeable with RxLR motifs of oomycete effectors. Here we show that the PEXEL does not bind PI(3)P, and that this lipid is not concentrated in the ER. We find that RxLR motifs cannot mediate export in P. falciparum. Parasites expressing a mutated version of KAHRP, with the PEXEL motif repositioned near the signal sequence, prevented PMV cleavage. This mutant possessed the putative PI(3)P-binding residues but is not exported. Reinstatement of PEXEL to its original location restores processing by PMV and export. These results challenge the PI(3)P hypothesis and provide evidence that PEXEL position is conserved for co-translational processing and export. PMID:26832821

  19. Searching for motifs in the behaviour of larval Drosophila melanogaster and Caenorhabditis elegans reveals continuity between behavioural states.

    Science.gov (United States)

    Szigeti, Balázs; Deogade, Ajinkya; Webb, Barbara

    2015-12-01

    We present a novel method for the unsupervised discovery of behavioural motifs in larval Drosophila melanogaster and Caenorhabditis elegans. A motif is defined as a particular sequence of postures that recurs frequently. The animal's changing posture is represented by an eigenshape time series, and we look for motifs in this time series. To find motifs, the eigenshape time series is segmented, and the segments clustered using spline regression. Unlike previous approaches, our method can classify sequences of unequal duration as the same motif. The behavioural motifs are used as the basis of a probabilistic behavioural annotator, the eigenshape annotator (ESA). Probabilistic annotation avoids rigid threshold values and allows classification uncertainty to be quantified. We apply eigenshape annotation to both larval Drosophila and C. elegans and produce a good match to hand annotation of behavioural states. However, we find many behavioural events cannot be unambiguously classified. By comparing the results with ESA of an artificial agent's behaviour, we argue that the ambiguity is due to greater continuity between behavioural states than is generally assumed for these organisms.

  20. Structural basis for the recognition of two consecutive mutually interacting DPF motifs by the SGIP1 μ homology domain

    Science.gov (United States)

    Shimada, Atsushi; Yamaguchi, Atsuko; Kohda, Daisuke

    2016-01-01

    FCHo1, FCHo2, and SGIP1 are key regulators of clathrin-mediated endocytosis. Their μ homology domains (μHDs) interact with the C-terminal region of an endocytic scaffold protein, Eps15, containing fifteen Asp-Pro-Phe (DPF) motifs. Here, we show that the high-affinity μHD-binding site in Eps15 is a region encompassing six consecutive DPF motifs, while the minimal μHD-binding unit is two consecutive DPF motifs. We present the crystal structures of the SGIP1 μHD in complex with peptides containing two DPF motifs. The peptides bind to a novel ligand-binding site of the μHD, which is distinct from those of other distantly related μHD-containing proteins. The two DPF motifs, which adopt three-dimensional structures stabilized by sequence-specific intramotif and intermotif interactions, are extensively recognized by the μHD and are both required for binding. Thus, consecutive and singly scattered DPF motifs play distinct roles in μHD binding.

  1. GABPα Binding to Overlapping ETS and CRE DNA Motifs Is Enhanced by CREB1: Custom DNA Microarrays.

    Science.gov (United States)

    He, Ximiao; Syed, Khund Sayeed; Tillo, Desiree; Mann, Ishminder; Weirauch, Matthew T; Vinson, Charles

    2015-07-16

    To achieve proper spatiotemporal control of gene expression, transcription factors cooperatively assemble onto specific DNA sequences. The ETS domain protein monomer of GABPα and the B-ZIP domain protein dimer of CREB1 cooperatively bind DNA only when the ETS ((C)/GCGGAA GT: ) and CRE ( GT: GACGTCAC) motifs overlap precisely, producing the ETS↔CRE motif ((C)/GCGGAA GT: GACGTCAC). We designed a Protein Binding Microarray (PBM) with 60-bp DNAs containing four identical sectors, each with 177,440 features that explore the cooperative interactions between GABPα and CREB1 upon binding the ETS↔CRE motif. The DNA sequences include all 15-mers of the form (C)/GCGGA--CG-, the ETS↔CRE motif, and all single nucleotide polymorphisms (SNPs), and occurrences in the human and mouse genomes. CREB1 enhanced GABPα binding to the canonical ETS↔CRE motif CCGGAAGT two-fold, and up to 23-fold for several SNPs at the beginning and end of the ETS motif, which is suggestive of two separate and distinct allosteric mechanisms of cooperative binding. We show that the ETS-CRE array data can be used to identify regions likely cooperatively bound by GABPα and CREB1 in vivo, and demonstrate their ability to identify human genetic variants that might inhibit cooperative binding.

  2. Relative edge energy in the stability of transition metal nanoclusters of different motifs

    Science.gov (United States)

    Zhao, X. J.; Xue, X. L.; Guo, Z. X.; Li, S. F.

    2016-06-01

    When a structure is reduced to a nanometer scale, the proportion of the lowly-coordinated edge atoms increases significantly, which can play a crucial role in determining both their geometric and electronic properties, as demonstrated by the recently established generalized Wulff construction principle [S. F. Li, et al., Phys. Rev. Lett., 2013, 111, 115501]. Consequently, it is of great interest to clarify quantitatively the role of the edge atoms that dominate the motifs of these nanostructures. In principle, establishing an effective method valid for determining the absolute value of the surface energy and particularly the edge energy for a given nanostructure is expected to resolve such a problem. However, hitherto, it is difficult to obtain the absolute edge energy of transition metal clusters, particularly when their sizes approach the nanometer regime. In this paper, taking Ru nanoclusters as a prototypical example, our first-principles calculations introduce the concept of relative edge energy (REE), reflecting the net edge atom effect over the surface (facet) atom effect, which is fairly powerful to quasi-quantitatively estimate the critical size at which the crossover occurs between different configurations of a given motif, such as from an icosahedron to an fcc nanocrystal. By contrast, the bulk effect should be re-considered to rationalize the power of the REE in predicting the relative stability of larger nanostructures between different motifs, such as fcc-like and hcp-like nanocrystals.When a structure is reduced to a nanometer scale, the proportion of the lowly-coordinated edge atoms increases significantly, which can play a crucial role in determining both their geometric and electronic properties, as demonstrated by the recently established generalized Wulff construction principle [S. F. Li, et al., Phys. Rev. Lett., 2013, 111, 115501]. Consequently, it is of great interest to clarify quantitatively the role of the edge atoms that dominate the

  3. Powdery mildew fungal effector candidates share N-terminal Y/F/WxC-motif

    Directory of Open Access Journals (Sweden)

    Emmersen Jeppe

    2010-05-01

    Full Text Available Abstract Background Powdery mildew and rust fungi are widespread, serious pathogens that depend on developing haustoria in the living plant cells. Haustoria are separated from the host cytoplasm by a plant cell-derived extrahaustorial membrane. They secrete effector proteins, some of which are subsequently transferred across this membrane to the plant cell to suppress defense. Results In a cDNA library from barley epidermis containing powdery mildew haustoria, two-thirds of the sequenced ESTs were fungal and represented ~3,000 genes. Many of the most highly expressed genes encoded small proteins with N-terminal signal peptides. While these proteins are novel and poorly related, they do share a three-amino acid motif, which we named "Y/F/WxC", in the N-terminal of the mature proteins. The first amino acid of this motif is aromatic: tyrosine, phenylalanine or tryptophan, and the last is always cysteine. In total, we identified 107 such proteins, for which the ESTs represent 19% of the fungal clones in our library, suggesting fundamental roles in haustoria function. While overall sequence similarity between the powdery mildew Y/F/WxC-proteins is low, they do have a highly similar exon-intron structure, suggesting they have a common origin. Interestingly, searches of public fungal genome and EST databases revealed that haustoria-producing rust fungi also encode large numbers of novel, short proteins with signal peptides and the Y/F/WxC-motif. No significant numbers of such proteins were identified from genome and EST sequences from either fungi which do not produce haustoria or from haustoria-producing Oomycetes. Conclusion In total, we identified 107, 178 and 57 such Y/F/WxC-proteins from the barley powdery mildew, the wheat stem rust and the wheat leaf rust fungi, respectively. All together, our findings suggest the Y/F/WxC-proteins to be a new class of effectors from haustoria-producing pathogenic fungi.

  4. A self-assembling peptide RADA16-I integrated with spider fibroin uncrystalline motifs.

    Science.gov (United States)

    Sun, Lijuan; Zhao, Xiaojun

    2012-01-01

    Mechanical strength of nanofiber scaffolds formed by the self-assembling peptide RADA16-I or its derivatives is not very good and limits their application. To address this problem, we inserted spidroin uncrystalline motifs, which confer incomparable elasticity and hydrophobicity to spider silk GGAGGS or GPGGY, into the C-terminus of RADA16-I to newly design two peptides: R3 (n-RADARADARADARADA-GGAGGS-c) and R4 (n-RADARADARADARADA-GPGGY-c), and then observed the effect of these motifs on biophysical properties of the peptide. Atomic force microscopy, transmitting electron microscopy, and circular dichroism spectroscopy confirm that R3 and R4 display β-sheet structure and self-assemble into long nanofibers. Compared with R3, the β-sheet structure and nanofibers formed by R4 are more stable; they change to random coil and unordered aggregation at higher temperature. Rheology measurements indicate that novel peptides form hydrogel when induced by DMEM, and the storage modulus of R3 and R4 hydrogel is 0.5 times and 3 times higher than that of RADA16-I, respectively. Furthermore, R4 hydrogel remarkably promotes growth of liver cell L02 and liver cancer cell SMCC7721 compared with 2D culture, determined by MTT assay. Novel peptides still have potential as hydrophobic drug carriers; they can stabilize pyrene microcrystals in aqueous solution and deliver this into a lipophilic environment, identified by fluorescence emission spectra. Altogether, the spider fibroin motif GPGGY most effectively enhances mechanical strength and hydrophobicity of the peptide. This study provides a new method in the design of nanobiomaterials and helps us to understand the role of the amino acid sequence in nanofiber formation.

  5. The vitronectin RGD motif regulates TGF-β-induced alveolar epithelial cell apoptosis.

    Science.gov (United States)

    Wheaton, Amanda K; Velikoff, Miranda; Agarwal, Manisha; Loo, Tiffany T; Horowitz, Jeffrey C; Sisson, Thomas H; Kim, Kevin K

    2016-06-01

    Transforming growth factor-β (TGF-β) is a critical driver of acute lung injury and fibrosis. Injury leads to activation of TGF-β, which regulates changes in the cellular and matrix makeup of the lung during the repair and fibrosis phase. TGF-β can also initiate alveolar epithelial cell (AEC) apoptosis. Injury leads to destruction of the laminin-rich basement membrane, which is replaced by a provisional matrix composed of arginine-glycine-aspartate (RGD) motif-containing plasma matrix proteins, including vitronectin and fibronectin. To determine the role of specific matrix proteins on TGF-β-induced apoptosis, we studied primary AECs cultured on different matrix conditions and utilized mice with deletion of vitronectin (Vtn(-/-)) or mice in which the vitronectin RGD motif is mutated to nonintegrin-binding arginine-glycine-glutamate (RGE) (Vtn(RGE/RGE)). We found that AECs cultured on fibronectin and vitronectin or in wild-type mouse serum are resistant to TGF-β-induced apoptosis. In contrast, AECs cultured on laminin or in serum from Vtn(-/-) or Vtn(RGE/RGE) mice undergo robust TGF-β-induced apoptosis. Plasminogen activator inhibitor-1 (PAI-1) sensitizes AECs to greater apoptosis by disrupting AEC engagement to vitronectin. Inhibition of integrin-associated signaling proteins augments AEC apoptosis. Mice with transgenic deletion of PAI-1 have less apoptosis after bleomycin, but deletion of vitronectin or disruption of the vitronectin RGD motif reverses this protection, suggesting that the proapoptotic function of PAI-1 is mediated through vitronectin inhibition. Collectively, these data suggest that integrin-matrix signaling is an important regulator of TGF-β-mediated AEC apoptosis and that PAI-1 functions as a natural regulator of this interaction. PMID:27106291

  6. Sequence motifs in MADS transcription factors responsible for specificity and diversification of protein-protein interaction.

    Directory of Open Access Journals (Sweden)

    Aalt D J van Dijk

    Full Text Available Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various performed bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and

  7. Identification of novel conserved functional motifs across most Influenza A viral strains

    Directory of Open Access Journals (Sweden)

    El-Azab Iman

    2011-01-01

    Full Text Available Abstract Background Influenza A virus poses a continuous threat to global public health. Design of novel universal drugs and vaccine requires a careful analysis of different strains of Influenza A viral genome from diverse hosts and subtypes. We performed a systematic in silico analysis of Influenza A viral segments of all available Influenza A viral strains and subtypes and grouped them based on host, subtype, and years isolated, and through multiple sequence alignments we extrapolated conserved regions, motifs, and accessible regions for functional mapping and annotation. Results Across all species and strains 87 highly conserved regions (conservation percentage > = 90% and 19 functional motifs (conservation percentage = 100% were found in PB2, PB1, PA, NP, M, and NS segments. The conservation percentage of these segments ranged between 94 - 98% in human strains (the most conserved, 85 - 93% in swine strains (the most variable, and 91 - 94% in avian strains. The most conserved segment was different in each host (PB1 for human strains, NS for avian strains, and M for swine strains. Target accessibility prediction yielded 324 accessible regions, with a single stranded probability > 0.5, of which 78 coincided with conserved regions. Some of the interesting annotations in these regions included sites for protein-protein interactions, the RNA binding groove, and the proton ion channel. Conclusions The influenza virus has evolved to adapt to its host through variations in the GC content and conservation percentage of the conserved regions. Nineteen universal conserved functional motifs were discovered, of which some were accessible regions with interesting biological functions. These regions will serve as a foundation for universal drug targets as well as universal vaccine design.

  8. Gapped alignment of protein sequence motifs through Monte Carlo optimization of a hidden Markov model

    Directory of Open Access Journals (Sweden)

    Liu Jun S

    2004-10-01

    Full Text Available Abstract Background Certain protein families are highly conserved across distantly related organisms and belong to large and functionally diverse superfamilies. The patterns of conservation present in these protein sequences presumably are due to selective constraints maintaining important but unknown structural mechanisms with some constraints specific to each family and others shared by a larger subset or by the entire superfamily. To exploit these patterns as a source of functional information, we recently devised a statistically based approach called contrast hierarchical alignment and interaction network (CHAIN analysis, which infers the strengths of various categories of selective constraints from co-conserved patterns in a multiple alignment. The power of this approach strongly depends on the quality of the multiple alignments, which thus motivated development of theoretical concepts and strategies to improve alignment of conserved motifs within large sets of distantly related sequences. Results Here we describe a hidden Markov model (HMM, an algebraic system, and Markov chain Monte Carlo (MCMC sampling strategies for alignment of multiple sequence motifs. The MCMC sampling strategies are useful both for alignment optimization and for adjusting position specific background amino acid frequencies for alignment uncertainties. Associated statistical formulations provide an objective measure of alignment quality as well as automatic gap penalty optimization. Improved alignments obtained in this way are compared with PSI-BLAST based alignments within the context of CHAIN analysis of three protein families: Giα subunits, prolyl oligopeptidases, and transitional endoplasmic reticulum (p97 AAA+ ATPases. Conclusion While not entirely replacing PSI-BLAST based alignments, which likewise may be optimized for CHAIN analysis using this approach, these motif-based methods often more accurately align very distantly related sequences and thus can

  9. Cancer bioinformatics: detection of chromatin states,SNP-containing motifs, and functional enrichment modules

    Institute of Scientific and Technical Information of China (English)

    Xiaobo Zhou

    2013-01-01

    In this editorial preface,I briefly review cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field:detection of chromatin states; detection of SNP-containing motifs and association with transcription factor-binding sites; improvements in functional enrichment modules; and gene association studies on aging and cancer.We expect this issue to provide bioinformatics scientists,cancer biologists,and clinical doctors with a better understanding of how cancer bioinformatics can be used to identify candidate biomarkers and targets and to conduct functional analysis.

  10. Motif-Synchronization: A new method for analysis of dynamic brain networks with EEG

    Science.gov (United States)

    Rosário, R. S.; Cardoso, P. T.; Muñoz, M. A.; Montoya, P.; Miranda, J. G. V.

    2015-12-01

    The major aim of this work was to propose a new association method known as Motif-Synchronization. This method was developed to provide information about the synchronization degree and direction between two nodes of a network by counting the number of occurrences of some patterns between any two time series. The second objective of this work was to present a new methodology for the analysis of dynamic brain networks, by combining the Time-Varying Graph (TVG) method with a directional association method. We further applied the new algorithms to a set of human electroencephalogram (EEG) signals to perform a dynamic analysis of the brain functional networks (BFN).

  11. A self-assembling peptide RADA16-I integrated with spider fibroin uncrystalline motifs

    Directory of Open Access Journals (Sweden)

    Sun L

    2012-02-01

    Full Text Available Lijuan Sun1,2, Xiaojun Zhao1,31West China Hospital Laboratory of Nanomedicine and Institute for Nanobiomedical Technology and Membrane Biology, Sichuan University, Chengdu 610041, Sichuan, China; 2Dept of Oral and Maxillofacial Surgery, Sun Yat-sen Memorial Hospital, Sun Yat-sen University, Guangzhou 510120, Guangdong, China; 3Center for Biomedical Engineering NE47-378, Massachusetts Institute of Technology, Cambridge, MA 02139-4307, USAAbstract: Mechanical strength of nanofiber scaffolds formed by the self-assembling peptide RADA16-I or its derivatives is not very good and limits their application. To address this problem, we inserted spidroin uncrystalline motifs, which confer incomparable elasticity and hydrophobicity to spider silk GGAGGS or GPGGY, into the C-terminus of RADA16-I to newly design two peptides: R3 (n-RADARADARADARADA-GGAGGS-c and R4 (n-RADARADARADARADA-GPGGY-c, and then observed the effect of these motifs on biophysical properties of the peptide. Atomic force microscopy, transmitting electron microscopy, and circular dichroism spectroscopy confirm that R3 and R4 display ß-sheet structure and self-assemble into long nanofibers. Compared with R3, the ß-sheet structure and nanofibers formed by R4 are more stable; they change to random coil and unordered aggregation at higher temperature. Rheology measurements indicate that novel peptides form hydrogel when induced by DMEM, and the storage modulus of R3 and R4 hydrogel is 0.5 times and 3 times higher than that of RADA16-I, respectively. Furthermore, R4 hydrogel remarkably promotes growth of liver cell L02 and liver cancer cell SMCC7721 compared with 2D culture, determined by MTT assay. Novel peptides still have potential as hydrophobic drug carriers; they can stabilize pyrene microcrystals in aqueous solution and deliver this into a lipophilic environment, identified by fluorescence emission spectra. Altogether, the spider fibroin motif GPGGY most effectively enhances mechanical

  12. A Parallel-Displaced Directly Linked 21-Carba-23-Thiaporphyrin Dimer Incorporating a Dihydrofulvalene Motif.

    Science.gov (United States)

    Berlicka, Anna; Białek, Michał J; Latos-Grażyński, Lechosław

    2016-09-01

    In the search of porphyrin arrays with a unique geometry, the efficient synthesis of a directly linked 21-carba-23-thiaporphyrin dimer with the distinctive dihydrofulvalene bridging motif has been developed. This compound acquires an uncommon parallel-displaced arrangement of two carbaporphyrin planes. The dimer undergoes an acid-triggered cleavage to create of the asymmetric carbathiaporphyrin-carbathiachlorin dyad or 2,3-dihalo-21-carba-23-thiachlorin depending on choice of acid. A formation of a reactive carbocation intermediate is postulated to account for mechanism of cleavage. PMID:27530897

  13. Evaluation of the Ottoman Royal Dream With Tree Motif in Terms of Science of Dream Interpretation

    OpenAIRE

    Çetin , Halil

    2012-01-01

    The main goal of royal dreams is to indicate the legitimacy of a dynasty/ruler. As a “manifest of foundation” of the state various royal dreams which were narrated about the foundation of the Ottoman State have also such aims. The meaning of dream depends on the symbols of dream closely. However the meanings of symbols in the dreams have been not evaluated yet. As the most widely narrated royal dream by the early cronicles The Dream With Motif of Tree Emerging from the Navel is analysed by th...

  14. Condensation-Driven Assembly of Boron-Containing Bis(Heteroaryl) Motifs Using a Linchpin Approach.

    Science.gov (United States)

    Adachi, Shinya; Liew, Sean K; Lee, C Frank; Lough, Alan; He, Zhi; St Denis, Jeffrey D; Poda, Gennady; Yudin, Andrei K

    2015-11-20

    Herein, we describe the bromomethyl acyl boronate linchpin--an enabling reagent for the condensation-driven assembly of novel bis(heteroaryl) motifs. This building block is readily accessible from commercially available starting materials. A variety of 2-amino- and 2-methylpyridines were reacted with MIDA-protected bromomethyl acylboronate to afford 2-boryl imidazo[1,2-a]pyridine and 2-boryl indolizine derivatives, respectively, in excellent yields. Subsequent condensation with hydroxyamidines and hydrazonamides converted the intermediate heterocycles into novel boron-containing bis(heteroaryl) units characterized by high thermal stability.

  15. Constructing a taxonomy of fine-grained human movement and activity motifs through social media

    CERN Document Server

    Frank, Morgan R; Mitchell, Lewis; Bagrow, James P; Dodds, Peter Sheridan; Danforth, Christopher M

    2014-01-01

    Profiting from the emergence of web-scale social data sets, numerous recent studies have systematically explored human mobility patterns over large populations and large time scales. Relatively little attention, however, has been paid to mobility and activity over smaller time-scales, such as a day. Here, we use Twitter to identify people's frequently visited locations along with their likely activities as a function of time of day and day of week, capitalizing on both the content and geolocation of messages. We subsequently characterize people's transition pattern motifs and demonstrate that spatial information is encoded in word choice.

  16. Frequency patterns of T-cell exposed motifs in immunoglobulin heavy chain peptides presented by MHCs

    Directory of Open Access Journals (Sweden)

    Robert D. Bremel

    2014-10-01

    Full Text Available Immunoglobulins are highly diverse protein sequences that are processed and presented to T-cells by B-cells and other antigen presenting cells. We examined a large dataset of immunoglobulin heavy chain variable regions (IGHV to assess the diversity of T-cell exposed motifs (TCEM. TCEM comprise those amino acids in a MHC-bound peptide which face outwards, surrounded by the MHC histotope, and which engage the T-cell receptor. Within IGHV there is a distinct pattern of predicted MHC class II binding and a very high frequency of re-use of the TCEMs. The re-use frequency indicates that only a limited number of different cognate T-cells are required to engage many different clonal B-cells. The amino acids in each outward-facing TCEM are intercalated with the amino acids of inward-facing MHC groove-exposed motifs (GEM. Different GEM may have differing, allele-specific, MHC binding affinities. The intercalation of TCEM and GEM in a peptide allows for a vast combinatorial repertoire of epitopes, each eliciting a different response. Outcome of T-cell receptor binding is determined by overall signal strength, which is a function of the number of responding T-cells and the duration of engagement. Hence, the frequency of T-cell exposed motif re-use appears to be an important determinant of whether a T-cell response is stimulatory or suppressive. The frequency distribution of TCEMs implies that somatic hypermutation is followed by clonal expansion that develop along repeated pathways. The observations of TCEM and GEM derived from immunoglobulins suggest a relatively simple, yet powerful, mechanism to correlate T-cell polyspecificity, through re-use of TCEMs, with a very high degree of specificity achieved by combination with a diversity of GEMs. The frequency profile of TCEMs also points to an economical mechanism for maintaining T-cell memory, recall, and self-discrimination based on an endogenously generated profile of motifs.

  17. Type 2 diabetes mellitus: phylogenetic motifs for predicting protein functional sites

    Indian Academy of Sciences (India)

    Ashok Sharma; Tanuja Rastogi; Meenakshi Bhartiya; A K Shasany; S P S Khanuja

    2007-08-01

    Diabetes mellitus, commonly referred to as diabetes, is a medical condition associated with abnormally high levels of glucose (or sugar) in the blood. Keeping this view, we demonstrate the phylogenetic motifs (PMs) identification in type 2 diabetes mellitus very likely corresponding to protein functional sites. In this article, we have identified PMs for all the candidate genes for type 2 diabetes mellitus. Glycine 310 remains conserved for glucokinase and potassium channel KCNJ11. Isoleucine 137 was conserved for insulin receptor and regulatory subunit of a phosphorylating enzyme. Whereas residues valine, leucine, methionine were highly conserved for insulin receptor. Occurrence of proline was very high for calpain 10 gene and glucose transporter

  18. Cancer bioinformatics: detection of chromatin states, SNP-containing motifs, and functional enrichment modules

    Directory of Open Access Journals (Sweden)

    Xiaobo Zhou

    2013-04-01

    Full Text Available In this editorial preface, I briefly review cancer bioinformatics and introduce the four articles in this special issue highlighting important applications of the field: detection of chromatin states; detection of SNP-containing motifs and association with transcription factor-binding sites; improvements in functional enrichment modules; and gene association studies on aging and cancer. We expect this issue to provide bioinformatics scientists, cancer biologists, and clinical doctors with a better understanding of how cancer bioinformatics can be used to identify candidate biomarkers and targets and to conduct functional analysis.

  19. GNG Motifs Can Replace a GGG Stretch during G-Quadruplex Formation in a Context Dependent Manner.

    Science.gov (United States)

    Das, Kohal; Srivastava, Mrinal; Raghavan, Sathees C

    2016-01-01

    G-quadruplexes are one of the most commonly studied non-B DNA structures. Generally, these structures are formed using a minimum of 4, three guanine tracts, with connecting loops ranging from one to seven. Recent studies have reported deviation from this general convention. One such deviation is the involvement of bulges in the guanine tracts. In this study, guanines along with bulges, also referred to as GNG motifs have been extensively studied using recently reported HOX11 breakpoint fragile region I as a model template. By strategic mutagenesis approach we show that the contribution from continuous G-tracts may be dispensible during G-quadruplex formation when such motifs are flanked by GNGs. Importantly, the positioning and number of GNG/GNGNG can also influence the formation of G-quadruplexes. Further, we assessed three genomic regions from HIF1 alpha, VEGF and SHOX gene for G-quadruplex formation using GNG motifs. We show that HIF1 alpha sequence harbouring GNG motifs can fold into intramolecular G-quadruplex. In contrast, GNG motifs in mutant VEGF sequence could not participate in structure formation, suggesting that the usage of GNG is context dependent. Importantly, we show that when two continuous stretches of guanines are flanked by two independent GNG motifs in a naturally occurring sequence (SHOX), it can fold into an intramolecular G-quadruplex. Finally, we show the specific binding of G-quadruplex binding protein, Nucleolin and G-quadruplex antibody, BG4 to SHOX G-quadruplex. Overall, our study provides novel insights into the role of GNG motifs in G-quadruplex structure formation which may have both physiological and pathological implications. PMID:27414642

  20. MOTIF OF LIGHT IN “AUTUMN LIGHT” BY B. ZAYTSEV

    Directory of Open Access Journals (Sweden)

    Marina V. Zavarkina

    2015-01-01

    Full Text Available B. Zaytsev is one of the most signifi cant fi gures of Russian emigration, the emigrant of the fi rst wave, whose prose has left a noticeable trace in Russian literature. One of the favorite genres of the writer was the genre of the story. The subject of this article is the story “Autumn Light”, which was fi rst published in 1919. The story belongs to the transitional period in the writer’s creative process, when his style changed signifi cantly. Light symbolism occupies an important place in B. Zaytsev’s works. So does it in the story “Autumn Light”, that is stated in the title of the story. The article concludes that in the story the motif of light is semantically close to the image of the Moon, which is genetically traced both to romantic aesthetics, and to Russian religious and philosophical traditions. In this story this motive is a symbol of the perfect world, passing into the past. This motif is also included in the structure of the hero’s image. Sunlight in the artistic space of the text does not reach the ground, it is surrounded by damp haze. The hero’s thoughts are hazed too and he is full of doubts. The fragility of life and temporality of existence are emphasized in the story by the moonlight, ghostly and lifeless.

  1. Dynamics of simple gene-network motifs subject to extrinsic fluctuations

    Science.gov (United States)

    Roberts, Elijah; Be'er, Shay; Bohrer, Chris; Sharma, Rati; Assaf, Michael

    2015-12-01

    Cellular processes do not follow deterministic rules; even in identical environments genetically identical cells can make random choices leading to different phenotypes. This randomness originates from fluctuations present in the biomolecular interaction networks. Most previous work has been focused on the intrinsic noise (IN) of these networks. Yet, especially for high-copy-number biomolecules, extrinsic or environmental noise (EN) has been experimentally shown to dominate the variation. Here, we develop an analytical formalism that allows for calculation of the effect of EN on gene-expression motifs. We introduce a method for modeling bounded EN as an auxiliary species in the master equation. The method is fully generic and is not limited to systems with small EN magnitudes. We focus our study on motifs that can be viewed as the building blocks of genetic switches: a nonregulated gene, a self-inhibiting gene, and a self-promoting gene. The role of the EN properties (magnitude, correlation time, and distribution) on the statistics of interest are systematically investigated, and the effect of fluctuations in different reaction rates is compared. Due to its analytical nature, our formalism can be used to quantify the effect of EN on the dynamics of biochemical networks and can also be used to improve the interpretation of data from single-cell gene-expression experiments.

  2. Detecting the bipartite World Trade Web evolution across 2007: a motifs-based analysis

    CERN Document Server

    Saracco, Fabio; Gabrielli, Andrea; Squartini, Tiziano

    2015-01-01

    In the present paper we employ the theoretical tools developed in network theory, in order to shed light on the response of world wide trade to the financial crisis of 2007. In particular, we have explored the evolution of the bipartite country-product World Trade Web across the years 1995-2010, monitoring the behaviour of the system both before and after 2007. Remarkably, our results indicate that, from 2003 on, the abundances of a recently-defined class of bipartite motifs assume values progressively closer to the ones predicted by a null model which preserves only basic features of the observed structure, completely randomizing the rest. In other words, as 2007 approaches the World Trade Web becomes more and more compatible with the picture of a bipartite network where correlations between countries and products are progressively lost. Moreover, the trends characterizing the z-scores of the considered family of motifs suggest that the most evident modification in the structure of the world trade network ca...

  3. Beyond consensus: statistical free energies reveal hidden interactions in the design of a TPR motif.

    Science.gov (United States)

    Magliery, Thomas J; Regan, Lynne

    2004-10-22

    Consensus design methods have been used successfully to engineer proteins with a particular fold, and moreover to engineer thermostable exemplars of particular folds. Here, we consider how a statistical free energy approach can expand upon current methods of phylogenetic design. As an example, we have analyzed the tetratricopeptide repeat (TPR) motif, using multiple sequence alignment to identify the significance of each position in the TPR. The results provide information above and beyond that revealed by consensus design alone, especially at poorly conserved positions. A particularly striking finding is that certain residues, which TPR-peptide co-crystal structures show are in direct contact with the ligand, display a marked hypervariability. This suggests a novel means of identifying ligand-binding sites, and also implies that TPRs generally function as ligand-binding domains. Using perturbation analysis (or statistical coupling analysis), we examined site-site interactions within the TPR motif. Correlated occurrences of amino acid residues at poorly conserved positions explain how TPRs achieve their near-neutral surface charge distributions, and why a TPR designed from straight consensus has an unusually high net charge. Networks of interacting sites revealed that TPRs fall into two unrecognized families with distinct sets of interactions related to the identity of position 7 (Leu or Lys/Arg). Statistical free energy analysis provides a more complete description of "What makes a TPR a TPR?" than consensus alone, and it suggests general approaches to extend and improve the phylogenetic design of proteins.

  4. CyanoLyase: a database of phycobilin lyase sequences, motifs and functions.

    Science.gov (United States)

    Bretaudeau, Anthony; Coste, François; Humily, Florian; Garczarek, Laurence; Le Corguillé, Gildas; Six, Christophe; Ratin, Morgane; Collin, Olivier; Schluchter, Wendy M; Partensky, Frédéric

    2013-01-01

    CyanoLyase (http://cyanolyase.genouest.org/) is a manually curated sequence and motif database of phycobilin lyases and related proteins. These enzymes catalyze the covalent ligation of chromophores (phycobilins) to specific binding sites of phycobiliproteins (PBPs). The latter constitute the building bricks of phycobilisomes, the major light-harvesting systems of cyanobacteria and red algae. Phycobilin lyases sequences are poorly annotated in public databases. Sequences included in CyanoLyase were retrieved from all available genomes of these organisms and a few others by similarity searches using biochemically characterized enzyme sequences and then classified into 3 clans and 32 families. Amino acid motifs were computed for each family using Protomata learner. CyanoLyase also includes BLAST and a novel pattern matching tool (Protomatch) that allow users to rapidly retrieve and annotate lyases from any new genome. In addition, it provides phylogenetic analyses of all phycobilin lyases families, describes their function, their presence/absence in all genomes of the database (phyletic profiles) and predicts the chromophorylation of PBPs in each strain. The site also includes a thorough bibliography about phycobilin lyases and genomes included in the database. This resource should be useful to scientists and companies interested in natural or artificial PBPs, which have a number of biotechnological applications, notably as fluorescent markers.

  5. A common minimal motif for the ligands of HLA-B*27 class I molecules.

    Directory of Open Access Journals (Sweden)

    Alejandro Barriga

    Full Text Available CD8(+ T cells identify and kill infected cells through the specific recognition of short viral antigens bound to human major histocompatibility complex (HLA class I molecules. The colossal number of polymorphisms in HLA molecules makes it essential to characterize the antigen-presenting properties common to large HLA families or supertypes. In this context, the HLA-B*27 family comprising at least 100 different alleles, some of them widely distributed in the human population, is involved in the cellular immune response against pathogens and also associated to autoimmune spondyloarthritis being thus a relevant target of study. To this end, HLA binding assays performed using nine HLA-B*2705-restricted ligands endogenously processed and presented in virus-infected cells revealed a common minimal peptide motif for efficient binding to the HLA-B*27 family. The motif was independently confirmed using four unrelated peptides. This experimental approach, which could be easily transferred to other HLA class I families and supertypes, has implications for the validation of new bioinformatics tools in the functional clustering of HLA molecules, for the identification of antiviral cytotoxic T lymphocyte responses, and for future vaccine development.

  6. An isoprenylation and palmitoylation motif promotes intraluminal vesicle delivery of proteins in cells from distant species.

    Directory of Open Access Journals (Sweden)

    Clara L Oeste

    Full Text Available The C-terminal ends of small GTPases contain hypervariable sequences which may be posttranslationally modified by defined lipid moieties. The diverse structural motifs generated direct proteins towards specific cellular membranes or organelles. However, knowledge on the factors that determine these selective associations is limited. Here we show, using advanced microscopy, that the isoprenylation and palmitoylation motif of human RhoB (-CINCCKVL targets chimeric proteins to intraluminal vesicles of endolysosomes in human cells, displaying preferential co-localization with components of the late endocytic pathway. Moreover, this distribution is conserved in distant species, including cells from amphibians, insects and fungi. Blocking lipidic modifications results in accumulation of CINCCKVL chimeras in the cytosol, from where they can reach endolysosomes upon release of this block. Remarkably, CINCCKVL constructs are sorted to intraluminal vesicles in a cholesterol-dependent process. In the lower species, neither the C-terminal sequence of RhoB, nor the endosomal distribution of its homologs are conserved; in spite of this, CINCCKVL constructs also reach endolysosomes in Xenopus laevis and insect cells. Strikingly, this behavior is prominent in the filamentous ascomycete fungus Aspergillus nidulans, in which GFP-CINCCKVL is sorted into endosomes and vacuoles in a lipidation-dependent manner and allows monitoring endosomal movement in live fungi. In summary, the isoprenylated and palmitoylated CINCCKVL sequence constitutes a specific structure which delineates an endolysosomal sorting strategy operative in phylogenetically diverse organisms.

  7. A Review of Protein-DNA Binding Motif using Association Rule Mining

    Directory of Open Access Journals (Sweden)

    Virendra Kumar Tripathi

    2013-03-01

    Full Text Available The survival of gene regulation and life mechanisms is pre-request of finding unknown pattern of transcription factor binding sites. The discovery motif of gene regulation in bioinformatics is challenging jobs for getting relation between transcription factors and transcription factor binding sites. The increasing size and length of string pattern of motif is issued a problem related to modeling and optimization of gene selection process. In this paper we give a survey of protein-DNA binding using association rule mining. Association rule mining well known data mining technique for pattern analysis. The capability of negative and positive pattern generation help full for discovering of new pattern in DNA binding bioinformatics data. The other data mining approach such as clustering and classification also applied the process of gene selection grouping for known and unknown pattern. But faced a problem of valid string of DNA data, the rule mining principle find a better relation between transcription factors and transcription factor binding sites.

  8. Mining Natural-Products Screening Data for Target-Class Chemical Motifs.

    Science.gov (United States)

    Coma, Isabel; Bandyopadhyay, Deepak; Diez, Emilio; Ruiz, Emilio Alvarez; de los Frailes, Maria Teresa; Colmenarejo, Gonzalo

    2014-06-01

    In this article, we describe two complementary data-mining approaches used to characterize the GlaxoSmithKline (GSK) natural-products set (NPS) based on information from the high-throughput screening (HTS) databases. Both methods rely on the aggregation and analysis of a large set of single-shot screening data for a number of biological assays, with the goal to reveal natural-product chemical motifs. One of them is an established method based on the data-driven clustering of compounds using a wide range of descriptors,(1)whereas the other method partitions and hierarchically clusters the data to identify chemical cores.(2,3)Both methods successfully find structural scaffolds that significantly hit different groups of discrete drug targets, compared with their relative frequency of demonstrating inhibitory activity in a large number of screens. We describe how these methods can be applied to unveil hidden information in large single-shot HTS data sets. Applied prospectively, this type of information could contribute to the design of new chemical templates for drug-target classes and guide synthetic efforts for lead optimization of tractable hits that are based on natural-product chemical motifs. Relevant findings for 7TM receptors (7TMRs), ion channels, class-7 transferases (protein kinases), hydrolases, and oxidoreductases will be discussed. PMID:24518065

  9. Flow Motifs Reveal Limitations of the Static Framework to Represent Human interactions

    CERN Document Server

    Rocha, Luis Enrique Correa

    2013-01-01

    Networks are commonly used to define underlying interaction structures where infections, information, or other quantities may spread. Although the standard approach has been to aggregate all links into a static structure, some studies suggest that the time order in which the links are established may alter the dynamics of spreading. In this paper, we study the impact of the time ordering in the limits of flow on various empirical temporal networks. By using a random walk dynamics, we estimate the flow on links and convert the original undirected network (temporal and static) into a directed flow network. We then introduce the concept of flow motifs and quantify the divergence in the representativity of motifs when using the temporal and static frameworks. We find that the regularity of contacts and persistence of vertices (common in email communication and face-to-face interactions) result on little differences in the limits of flow for both frameworks. On the other hand, in the case of communication within a...

  10. Trans-Regulation of RNA-Binding Protein Motifs by MicroRNA

    Directory of Open Access Journals (Sweden)

    Scott eTenenbaum

    2014-04-01

    Full Text Available The wide array of vital functions that RNA performs is dependent on its ability to dynamically fold into different structures in response to intracellular and extracellular changes. RNA-binding proteins regulate much of this activity by targeting specific RNA structures or motifs. One of these structures, the 3-way RNA junction, is characteristically found in ribosomal RNA and results from the RNA folding in cis, to produce three separate helices that meet around a central unpaired region. Here we demonstrate that 3-way junctions can also form in trans as a result of the binding of microRNAs in an unconventional manner with mRNA by splinting two non-contiguous regions together. This may be used to reinforce the base of a stem-loop motif being targeted by an RNA-binding protein. Trans interactions between non-coding RNA and mRNA may be used to control the post-transcriptional regulatory code and suggests a possible role for some of the recently described transcripts of unknown function expressed from the human genome.

  11. Genetic analysis of Escherichia coli RadA: functional motifs and genetic interactions.

    Science.gov (United States)

    Cooper, Deani L; Boyle, Daniel C; Lovett, Susan T

    2015-03-01

    The RadA/Sms protein is a RecA-related protein found universally in eubacteria and plants, implicated in processing of recombination intermediates. Here we show that the putative Zn finger, Walker A motif, KNRXG motif and Lon protease homology domain of the Escherichia coli RadA protein are required for DNA damage survival. RadA is unlikely to possess protease activity as the putative active site serine is not required. Mutants in RadA have strong synergistic phenotypes with those in the branch migration protein RecG. Sensitivity of radA recG mutants to azidothymidine (AZT) can be rescued by blocking recombination with recA or recF mutations or by overexpression of RuvAB, suggesting that lethal recombination intermediates accumulate in the absence of RadA and RecG. Synthetic genetic interactions for survival to AZT or ciprofloxacin exposure were observed between RadA and known or putative helicases including DinG, Lhr, PriA, Rep, RuvAB, UvrD, YejH and YoaA. These represent the first affected phenotypes reported for Lhr, YejH and YoaA. The specificity of these effects sheds new light on the role of these proteins in DNA damage avoidance and repair and implicates a role in replication gap processing for DinG and YoaA and a role in double-strand break repair for YejH. PMID:25484163

  12. Yeast one-hybrid gγ recruitment system for identification of protein lipidation motifs.

    Science.gov (United States)

    Fukuda, Nobuo; Doi, Motomichi; Honda, Shinya

    2013-01-01

    Fatty acids and isoprenoids can be covalently attached to a variety of proteins. These lipid modifications regulate protein structure, localization and function. Here, we describe a yeast one-hybrid approach based on the Gγ recruitment system that is useful for identifying sequence motifs those influence lipid modification to recruit proteins to the plasma membrane. Our approach facilitates the isolation of yeast cells expressing lipid-modified proteins via a simple and easy growth selection assay utilizing G-protein signaling that induces diploid formation. In the current study, we selected the N-terminal sequence of Gα subunits as a model case to investigate dual lipid modification, i.e., myristoylation and palmitoylation, a modification that is widely conserved from yeast to higher eukaryotes. Our results suggest that both lipid modifications are required for restoration of G-protein signaling. Although we could not differentiate between myristoylation and palmitoylation, N-terminal position 7 and 8 play some critical role. Moreover, we tested the preference for specific amino-acid residues at position 7 and 8 using library-based screening. This new approach will be useful to explore protein-lipid associations and to determine the corresponding sequence motifs.

  13. Yeast one-hybrid gγ recruitment system for identification of protein lipidation motifs.

    Directory of Open Access Journals (Sweden)

    Nobuo Fukuda

    Full Text Available Fatty acids and isoprenoids can be covalently attached to a variety of proteins. These lipid modifications regulate protein structure, localization and function. Here, we describe a yeast one-hybrid approach based on the Gγ recruitment system that is useful for identifying sequence motifs those influence lipid modification to recruit proteins to the plasma membrane. Our approach facilitates the isolation of yeast cells expressing lipid-modified proteins via a simple and easy growth selection assay utilizing G-protein signaling that induces diploid formation. In the current study, we selected the N-terminal sequence of Gα subunits as a model case to investigate dual lipid modification, i.e., myristoylation and palmitoylation, a modification that is widely conserved from yeast to higher eukaryotes. Our results suggest that both lipid modifications are required for restoration of G-protein signaling. Although we could not differentiate between myristoylation and palmitoylation, N-terminal position 7 and 8 play some critical role. Moreover, we tested the preference for specific amino-acid residues at position 7 and 8 using library-based screening. This new approach will be useful to explore protein-lipid associations and to determine the corresponding sequence motifs.

  14. Using oriented peptide array libraries to evaluate methylarginine-specific antibodies and arginine methyltransferase substrate motifs

    Science.gov (United States)

    Gayatri, Sitaram; Cowles, Martis W.; Vemulapalli, Vidyasiri; Cheng, Donghang; Sun, Zu-Wen; Bedford, Mark T.

    2016-01-01

    Signal transduction in response to stimuli relies on the generation of cascades of posttranslational modifications that promote protein-protein interactions and facilitate the assembly of distinct signaling complexes. Arginine methylation is one such modification, which is catalyzed by a family of nine protein arginine methyltransferases, or PRMTs. Elucidating the substrate specificity of each PRMT will promote a better understanding of which signaling networks these enzymes contribute to. Although many PRMT substrates have been identified, and their methylation sites mapped, the optimal target motif for each of the nine PRMTs has not been systematically addressed. Here we describe the use of Oriented Peptide Array Libraries (OPALs) to methodically dissect the preferred methylation motifs for three of these enzymes – PRMT1, CARM1 and PRMT9. In parallel, we show that an OPAL platform with a fixed methylarginine residue can be used to validate the methyl-specific and sequence-specific properties of antibodies that have been generated against different PRMT substrates, and can also be used to confirm the pan nature of some methylarginine-specific antibodies. PMID:27338245

  15. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

    Directory of Open Access Journals (Sweden)

    Yin Wang

    2016-01-01

    Full Text Available Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  16. Triadic motifs and dyadic self-organization in the World Trade Network

    CERN Document Server

    Squartini, Tiziano

    2012-01-01

    In self-organizing networks, topology and dynamics coevolve in a continuous feedback, without exogenous driving. The World Trade Network (WTN) is one of the few empirically well documented examples of self-organizing networks: its topology strongly depends on the GDP of world countries, which in turn depends on the structure of trade. Therefore, understanding which are the key topological properties of the WTN that deviate from randomness provides direct empirical information about the structural effects of self-organization. Here, using an analytical pattern-detection method that we have recently proposed, we study the occurrence of triadic "motifs" (subgraphs of three vertices) in the WTN between 1950 and 2000. We find that, unlike other properties, motifs are not explained by only the in- and out-degree sequences. By contrast, they are completely explained if also the numbers of reciprocal edges are taken into account. This implies that the self-organization process underlying the evolution of the WTN is a...

  17. Model-based Comparative Prediction of Transcription-Factor Binding Motifs in Anabolic Responses in Bone

    Institute of Scientific and Technical Information of China (English)

    Andy; B.; Chen; Kazunori; Hamamura; Guohua; Wang; Weirong; Xing; Subburaman; Mohan; Hiroki; Yokota; Yunlong; Liu

    2007-01-01

    Understanding the regulatory mechanism that controls the alteration of global gene expression patterns continues to be a challenging task in computational biology. We previously developed an ant algorithm, a biologically-inspired computational technique for microarray data, and predicted putative transcription-factor binding motifs (TFBMs) through mimicking interactive behaviors of natural ants. Here we extended the algorithm into a set of web-based software, Ant Modeler, and applied it to investigate the transcriptional mechanism underlying bone formation. Mechanical loading and administration of bone morphogenic proteins (BMPs) are two known treatments to strengthen bone. We addressed a question: Is there any TFBM that stimulates both "anabolic responses of mechanical loading" and "BMP-mediated osteogenic signaling"? Although there is no significant overlap among genes in the two responses, a comparative model-based analysis suggests that the two independent osteogenic processes employ common TFBMs, such as a stress responsive element and a motif for peroxisome proliferator-activated recep- tor (PPAR). The post-modeling in vitro analysis using mouse osteoblast cells sup- ported involvements of the predicted TFBMs such as PPAR, Ikaros 3, and LMO2 in response to mechanical loading. Taken together, the results would be useful to derive a set of testable hypotheses and examine the role of specific regulators in complex transcriptional control of bone formation.

  18. A unique SUMO-2-interacting motif within LANA is essential for KSHV latency.

    Directory of Open Access Journals (Sweden)

    Qiliang Cai

    Full Text Available Kaposi's sarcoma-associated herpesvirus (KSHV stabilizes hypoxia-inducible factor α (HIF-1α during latent infection, and HIF-1α reactivates lytic replication under hypoxic stress. However, the mechanism utilized by KSHV to block lytic reactivation with the accumulation of HIF-1α in latency remains unclear. Here, we report that LANA encoded by KSHV contains a unique SUMO-interacting motif (LANA(SIM which is specific for interaction with SUMO-2 and facilitates LANA SUMOylation at lysine 1140. Proteomic and co-immunoprecipitation analysis further reveal that the SUMO-2 modified transcription repressor KAP1 is a critical factor recruited by LANA(SIM. Deletion of LANA(SIM led to functional loss of both LANA-mediated viral episome maintenance and lytic gene silencing. Moreover, hypoxia reduced KAP1 SUMOylation and resulted in dissociation of both KAP1 and Sin3A repressors from LANA(SIM-associated complex. Therefore, the LANA(SIM motif plays an essential role in KSHV latency and is a potential drug target against KSHV-associated cancers.

  19. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    Science.gov (United States)

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data. PMID:27057545

  20. Recurrent Motifs: The Fiction of Time in José Manuel Caballero Bonald’s Poetry

    Directory of Open Access Journals (Sweden)

    Olga Guadalupe Mella

    2012-02-01

    Full Text Available Time, memory and forgetfulness are recurring motifs in the work of the poets of the generation of the 50’s or generación del medio siglo. This article focuses on José Manuel Caballero Bonald’s personal revision of the concept of time, one of the most universal poetic themes. The poet takes his rhetoric from the Baroque tradition to re-visit ideas of time and provide a reinterpretation that contrasts sharply with that of contemporary poets while laying down a clear thread of intertextuality with the literary culture of the past. For his meditation on time, the poet's legacy results in a unique amalgam of Borges’ motifs and the Baroque worldview. From Quevedo’s intertext, the author’s new Baroque style blends poetic culteranismo and conceptismo with lexical precision to result in a lyrical universe that reconciles opposites, a poetic writing more nourished by knowledge than by experience, and a form of expression strongly embedded in the formal and conceptual systems of the Baroque.

  1. Stringency of the 2-His-1-Asp active-site motif in prolyl 4-hydroxylase.

    Directory of Open Access Journals (Sweden)

    Kelly L Gorres

    Full Text Available The non-heme iron(II dioxygenase family of enzymes contain a common 2-His-1-carboxylate iron-binding motif. These enzymes catalyze a wide variety of oxidative reactions, such as the hydroxylation of aliphatic C-H bonds. Prolyl 4-hydroxylase (P4H is an alpha-ketoglutarate-dependent iron(II dioxygenase that catalyzes the post-translational hydroxylation of proline residues in protocollagen strands, stabilizing the ensuing triple helix. Human P4H residues His412, Asp414, and His483 have been identified as an iron-coordinating 2-His-1-carboxylate motif. Enzymes that catalyze oxidative halogenation do so by a mechanism similar to that of P4H. These halogenases retain the active-site histidine residues, but the carboxylate ligand is replaced with a halide ion. We replaced Asp414 of P4H with alanine (to mimic the active site of a halogenase and with glycine. These substitutions do not, however, convert P4H into a halogenase. Moreover, the hydroxylase activity of D414A P4H cannot be rescued with small molecules. In addition, rearranging the two His and one Asp residues in the active site eliminates hydroxylase activity. Our results demonstrate a high stringency for the iron-binding residues in the P4H active site. We conclude that P4H, which catalyzes an especially demanding chemical transformation, is recalcitrant to change.

  2. Extraction of Motif Patterns from Protein Sequences using SVD with Rough K-Means Algorithm

    Directory of Open Access Journals (Sweden)

    E.Elayaraja

    2012-11-01

    Full Text Available Discovering protein sequence motif information is one of the most crucial tasks in bioinformatics research. In this work, we try to obtain protein recurring patterns which are universally conserved across protein family boundaries. In order to generate higher quality protein sequence motif information from Protein Sequence Culling Server (PISCES dataset, we tried several different advanced clustering algorithms, such as hierarchical clustering, Self-Organizing Maps (SOM etc. However, since the dataset itself contains more than 6, 60,000 segments where each segment contains 180 dimensions, any clustering algorithm required more than O(n complexity is not applicable. Therefore, the very first step of our research is trying to reduce segments. The results suggest that the Singular Value Decomposition (SVD computing technique is more suits for reducing segments. After that the reduced segments are followed by applying Rough K-Means clustering algorithm. Our experiments indicate that the Rough K-Means algorithm satisfactorily increases the percentage of sequence segments belonging to clusters with high structural similarity than K-Means. The experimental results suggest that the SVD with Rough K-Means algorithm may be applied to other areas of bioinformatics research in order to explore the underlying relationships between data samples more effectively.

  3. An Intrinsically Disordered Motif Mediates Diverse Actions of Monomeric C-reactive Protein.

    Science.gov (United States)

    Li, Hai-Yun; Wang, Jing; Meng, Fan; Jia, Zhe-Kun; Su, Yang; Bai, Qi-Feng; Lv, Ling-Ling; Ma, Fu-Rong; Potempa, Lawrence A; Yan, Yong-Bin; Ji, Shang-Rong; Wu, Yi

    2016-04-15

    Most proinflammatory actions of C-reactive protein (CRP) are only expressed following dissociation of its native pentameric assembly into monomeric form (mCRP). However, little is known about what underlies the greatly enhanced activities of mCRP. Here we show that a single sequence motif, i.e. cholesterol binding sequence (CBS; a.a. 35-47), is responsible for mediating the interactions of mCRP with diverse ligands. The binding of mCRP to lipoprotein component ApoB, to complement component C1q, to extracellular matrix components fibronectin and collagen, to blood coagulation component fibrinogen, and to membrane lipid component cholesterol, are all found to be markedly inhibited by the synthetic CBS peptide but not by other CRP sequences tested. Likewise, mutating CBS in mCRP also greatly impairs these interactions. Functional experiments further reveal that CBS peptide significantly reduces the effects of mCRP on activation of endothelial cells in vitro and on acute induction of IL-6 in mice. The potency and specificity of CBS are critically determined by the N-terminal residues Cys-36, Leu-37, and His-38; while the versatility of CBS appears to originate from its intrinsically disordered conformation polymorphism. Together, these data unexpectedly identify CBS as the major recognition site of mCRP and suggest that this motif may be exploited to tune the proinflammatory actions of mCRP.

  4. The JAMM motif of human deubiquitinase Poh1 is essential for cell viability.

    Science.gov (United States)

    Gallery, Melissa; Blank, Jonathan L; Lin, Yinghui; Gutierrez, Juan A; Pulido, Jacqueline C; Rappoli, David; Badola, Sunita; Rolfe, Mark; Macbeth, Kyle J

    2007-01-01

    Poh1 deubiquitinase activity is required for proteolytic processing of polyubiquitinated substrates by the 26S proteasome, linking deubiquitination to complete substrate degradation. Poh1 RNA interference (RNAi) in HeLa cells resulted in a reduction in cell viability and an increase in polyubiquitinated protein levels, supporting the link between Poh1 and the ubiquitin proteasome pathway. To more specifically test for any requirement of the zinc metalloproteinase motif of Poh1 to support cell viability and proteasome function, we developed a RNAi complementation strategy. Effects on cell viability and proteasome activity were assessed in cells with RNAi of endogenous Poh1 and induced expression of wild-type Poh1 or a mutant form of Poh1, in which two conserved histidines of the proposed catalytic site were replaced with alanines. We show that an intact zinc metalloproteinase motif is essential for cell viability and 26S proteasome function. As a required enzymatic component of the proteasome, Poh1 is an intriguing therapeutic drug target for cancer.

  5. Synthetic protein scaffolds based on peptide motifs and cognate adaptor domains for improving metabolic productivity

    Directory of Open Access Journals (Sweden)

    Anselm H.C. Horn

    2015-11-01

    Full Text Available The efficiency of many cellular processes relies on the defined interaction among different proteins within the same metabolic or signaling pathway. Consequently, a spatial colocalization of functionally interacting proteins has frequently emerged during evolution. This concept has been adapted within the synthetic biology community for the purpose of creating artificial scaffolds. A recent advancement of this concept is the use of peptide motifs and their cognate adaptor domains. SH2, SH3, GBD, and PDZ domains have been used most often in research studies to date. The approach has been successfully applied to the synthesis of a variety of target molecules including catechin, D-glucaric acid, H2, hydrochinone, resveratrol, butyrate, gamma-aminobutyric acid, and mevalonate. Increased production levels of up to 77-fold have been observed compared to non-scaffolded systems. A recent extension of this concept is the creation of a covalent linkage between peptide motifs and adaptor domains, which leads to a more stable association of the scaffolded systems and thus bears the potential to further enhance metabolic productivity.

  6. Crystal structure and functional characterization of a light-driven chloride pump having an NTQ motif

    Science.gov (United States)

    Kim, Kuglae; Kwon, Soon-Kyeong; Jun, Sung-Hoon; Cha, Jeong Seok; Kim, Hoyoung; Lee, Weontae; Kim, Jihyun F.; Cho, Hyun-Soo

    2016-01-01

    A novel light-driven chloride-pumping rhodopsin (ClR) containing an ‘NTQ motif' in its putative ion conduction pathway has been discovered and functionally characterized in a genomic analysis study of a marine bacterium. Here we report the crystal structure of ClR from the flavobacterium Nonlabens marinus S1-08T determined under two conditions at 2.0 and 1.56 Å resolutions. The structures reveal two chloride-binding sites, one around the protonated Schiff base and the other on a cytoplasmic loop. We identify a ‘3 omega motif' formed by three non-consecutive aromatic amino acids that is correlated with the B–C loop orientation. Detailed ClR structural analyses with functional studies in E. coli reveal the chloride ion transduction pathway. Our results help understand the molecular mechanism and physiological role of ClR and provide a structural basis for optogenetic applications. PMID:27554809

  7. IQ Motif-Containing G (Iqcg) Is Required for Mouse Spermiogenesis

    Science.gov (United States)

    Harris, Tanya P.; Schimenti, Kerry J.; Munroe, Robert J.; Schimenti, John C.

    2013-01-01

    Spermiogenesis in mammals is the process by which the newly formed products of meiosis, haploid spermatids, undergo a dramatic morphological transformation from round cells into flagellated spermatozoa. The underlying genetic control of spermiogenesis is complicated and not well-characterized. We have used forward genetic screens in mice to illuminate the mechanisms of spermatozoon development. Here, we report that the oligoasthenoteratospermia in a male-specific infertility mutant (esgd12d) is attributable to disruption of a gene called Iqcg (IQ motif-containing G). The causality of the mutation was confirmed with a targeted null allele. Loss of Iqcg disrupts spermiogenesis such that tail formation either occurs incompletely or breaks apart from the sperm heads. Orthologs are present in diverse species as distant as hemichordates, mollusks, and green algae. Consistent with a conserved role in flagellar formation and/or function, the orthologous Chlamydomonas protein is present in that organism’s flagella. Because IQ motif-containing genes typically regulate calmodulin (CaM), which in turn can impact the actin cytoskeleton, these findings suggest a potential role for localized calcium signaling in sperm flagellum morphogenesis. PMID:24362311

  8. Sequence-dependent stability test of a left-handed β-helix motif.

    Science.gov (United States)

    Hayre, Natha R; Singh, Rajiv R P; Cox, Daniel L

    2012-03-21

    The left-handed β-helix (LHBH) is an intriguing, rare structural pattern in polypeptides that has been implicated in the formation of amyloid aggregates. We used accurate all-atom replica-exchange molecular dynamics (REMD) simulations to study the relative stability of diverse sequences in the LHBH conformation. Ensemble-average coordinates from REMD served as a scoring criterion to identify sequences and threadings optimally suited to the LHBH, as in a fold recognition paradigm. We examined the repeatability of our REMD simulations, finding that single simulations can be reliable to a quantifiable extent. We find expected behavior for the positive and negative control cases of a native LHBH and intrinsically disordered sequences, respectively. Polyglutamine and a designed hexapeptide repeat show remarkable affinity for the LHBH motif. A structural model for misfolded murine prion protein was also considered, and showed intermediate stability under the given conditions. Our technique is found to be an effective probe of LHBH stability, and promises to be scalable to broader studies of this and potentially other novel or rare motifs. The superstable character of the designed hexapeptide repeat suggests theoretical and experimental follow-ups.

  9. Genome-wide comparison of ferritin family from Archaea, Bacteria, Eukarya, and Viruses: its distribution, characteristic motif, and phylogenetic relationship

    Science.gov (United States)

    Bai, Lina; Xie, Ting; Hu, Qingqing; Deng, Changyan; Zheng, Rong; Chen, Wanping

    2015-10-01

    Ferritins are highly conserved proteins that are widely distributed in various species from archaea to humans. The ubiquitous characteristic of these proteins reflects the pivotal contribution of ferritins to the safe storage and timely delivery of iron to achieve iron homeostasis. This study investigated the ferritin genes in 248 genomes from various species, including viruses, archaea, bacteria, and eukarya. The distribution comparison suggests that mammals and eudicots possess abundant ferritin genes, whereas fungi contain very few ferritin genes. Archaea and bacteria show considerable numbers of ferritin genes. Generally, prokaryotes possess three types of ferritin (the typical ferritin, bacterioferritin, and DNA-binding protein from starved cell), whereas eukaryotes have various subunit types of ferritin, thereby indicating the individuation of the ferritin family during evolution. The characteristic motif analysis of ferritins suggested that all key residues specifying the unique structural motifs of ferritin are highly conserved across three domains of life. Meanwhile, the characteristic motifs were also distinguishable between ferritin groups, especially phytoferritins, which show a plant-specific motif. The phylogenetic analyses show that ferritins within the same subfamily or subunits are generally clustered together. The phylogenetic relationships among ferritin members suggest that both gene duplication and horizontal transfer contribute to the wide variety of ferritins, and their possible evolutionary scenario was also proposed. The results contribute to a better understanding of the distribution, characteristic motif, and evolutionary relationship of the ferritin family.

  10. Conserved function of the lysine-based KXD/E motif in Golgi retention for endomembrane proteins among different organisms.

    Science.gov (United States)

    Woo, Cheuk Hang; Gao, Caiji; Yu, Ping; Tu, Linna; Meng, Zhaoyue; Banfield, David K; Yao, Xiaoqiang; Jiang, Liwen

    2015-11-15

    We recently identified a new COPI-interacting KXD/E motif in the C-terminal cytosolic tail (CT) of Arabidopsis endomembrane protein 12 (AtEMP12) as being a crucial Golgi retention mechanism for AtEMP12. This KXD/E motif is conserved in CTs of all EMPs found in plants, yeast, and humans and is also present in hundreds of other membrane proteins. Here, by cloning selective EMP isoforms from plants, yeast, and mammals, we study the localizations of EMPs in different expression systems, since there are contradictory reports on the localizations of EMPs. We show that the N-terminal and C-terminal GFP-tagged EMP fusions are localized to Golgi and post-Golgi compartments, respectively, in plant, yeast, and mammalian cells. In vitro pull-down assay further proves the interaction of the KXD/E motif with COPI coatomer in yeast. COPI loss of function in yeast and plants causes mislocalization of EMPs or KXD/E motif-containing proteins to vacuole. Ultrastructural studies further show that RNA interference (RNAi) knockdown of coatomer expression in transgenic Arabidopsis plants causes severe morphological changes in the Golgi. Taken together, our results demonstrate that N-terminal GFP fusions reflect the real localization of EMPs, and KXD/E is a conserved motif in COPI interaction and Golgi retention in eukaryotes. PMID:26378254

  11. Specificity of the chromodomain Y chromosome family of chromodomains for lysine-methylated ARK(S/T) motifs.

    Science.gov (United States)

    Fischle, Wolfgang; Franz, Henriette; Jacobs, Steven A; Allis, C David; Khorasanizadeh, Sepideh

    2008-07-11

    Previous studies have shown two homologous chromodomain modules in the HP1 and Polycomb proteins exhibit discriminatory binding to related methyllysine residues (embedded in ARKS motifs) of the histone H3 tail. Methylated ARK(S/T) motifs have recently been identified in other chromatin factors (e.g. linker histone H1.4 and lysine methyltransferase G9a). These are thought to function as peripheral docking sites for the HP1 chromodomain. In vertebrates, HP1-like chromodomains are also present in the chromodomain Y chromosome (CDY) family of proteins adjacent to a putative catalytic motif. The human genome encodes three CDY family proteins, CDY, CDYL, and CDYL2. These have putative functions ranging from establishment of histone H4 acetylation during spermiogenesis to regulation of transcription co-repressor complexes. To delineate the biochemical functions of the CDY family chromodomains, we analyzed their specificity of methyllysine recognition. We detected substantial differences among these factors. The CDY chromodomain exhibits discriminatory binding to lysine-methylated ARK(S/T) motifs, whereas the CDYL2 chromodomain binds with comparable strength to multiple ARK(S/T) motifs. Interestingly, subtle amino acid changes in the CDYL chromodomain prohibit such binding interactions in vitro and in vivo. However, point mutations can rescue binding. In support of the in vitro binding properties of the chromodomains, the full-length CDY family proteins exhibit substantial variability in chromatin localization. Our studies underscore the significance of subtle sequence differences in a conserved signaling module for diverse epigenetic regulatory pathways.

  12. A Significant Regulatory Mutation Burden at a High-Affinity Position of the CTCF Motif in Gastrointestinal Cancers.

    Science.gov (United States)

    Umer, Husen M; Cavalli, Marco; Dabrowski, Michal J; Diamanti, Klev; Kruczyk, Marcin; Pan, Gang; Komorowski, Jan; Wadelius, Claes

    2016-09-01

    Somatic mutations drive cancer and there are established ways to study those in coding sequences. It has been shown that some regulatory mutations are over-represented in cancer. We develop a new strategy to find putative regulatory mutations based on experimentally established motifs for transcription factors (TFs). In total, we find 1,552 candidate regulatory mutations predicted to significantly reduce binding affinity of many TFs in hepatocellular carcinoma and affecting binding of CTCF also in esophagus, gastric, and pancreatic cancers. Near mutated motifs, there is a significant enrichment of (1) genes mutated in cancer, (2) tumor-suppressor genes, (3) genes in KEGG cancer pathways, and (4) sets of genes previously associated to cancer. Experimental and functional validations support the findings. The strategy can be applied to identify regulatory mutations in any cell type with established TF motifs and will aid identifications of genes contributing to cancer. PMID:27174533

  13. Crystal Structures of IAPP Amyloidogenic Segments Reveal a Novel Packing Motif of Out-of-Register Beta Sheets.

    Science.gov (United States)

    Soriaga, Angela B; Sangwan, Smriti; Macdonald, Ramsay; Sawaya, Michael R; Eisenberg, David

    2016-07-01

    Structural studies of amyloidogenic segments by X-ray crystallography have revealed a novel packing motif, consisting of out-of-register β sheets, which may constitute one of the toxic species in aggregation related diseases. Here we sought to determine the presence of such a motif in islet amyloid polypeptide (IAPP), whose amyloidogenic properties are associated with type 2 diabetes. We determined four new crystal structures of segments within IAPP, all forming steric zippers. Most interestingly, one of the segments in the fibril core of IAPP forms an out-of-register steric zipper. Analysis of this structure reveals several commonalities with previously solved out-of-register fibrils. Our results provide additional evidence of out-of-register β sheets as a common structural motif in amyloid aggregates. PMID:26629790

  14. Novel porphyrin-daunomycin hybrids: Synthesis and preferential binding to G-quadruplexes over i-motif

    Science.gov (United States)

    Zhao, Ping; Jin, Shu-fang; Lu, Jia-Zheng; Lv, Jun-liang; Wu, Gong-qing; Chen, Pan-Pan; Tan, Cai-Lian; Chen, Dian-Wen

    2015-02-01

    Encouraged by the enormous importance attributed to the structure and function of human telomeric DNA, herein we focused our attention on the interaction of a serious of newly prepared porphyrin-daunomycin (Por-DNR) hybrids with the guanine-rich single-strand oligomer (G4) and the complementary cytosine-rich strand (i-motif). Various spectral methods such as absorption and fluorescence titration, surface-enhanced Raman and circular dichroism spectrum were integrated in the experiment and it was found that these Por-DNR hybrids could serve as prominent molecules to recognize G4 and i-motif. What is more, interesting results were obtained that the hybrids with longer flexible links are more favorable in binding with both G4 and i-motif than the hybrid with shorter linkage. These Por-DNR hybrids may help to develop new ideas in the research of human telomeric DNA with small molecules.

  15. First principles structures and circular dichroism spectra for the close-packed and the 7/2 motif of collagen

    CERN Document Server

    Jalkanen, Karl J; Knapp-Mohammady, Michaela; Bohr, Jakob

    2012-01-01

    The recently proposed close-packed motif for collagen is investigated using first principles semi-empirical wave function theory and Kohn-Sham density functional theory. Under these refinements the close-packed motif is shown to be stable. For the case of the 7/2 motif a similar stability exists. The electronic circular dichroism of the close-packed model has a significant negative bias and a large signal. An interesting feature of the close-packed structure is the existence of a central channel. Simulations show that, if hydrogen atoms are placed in the cavity, a chain of molecular hydrogens is formed suggesting a possible biological function for molecular hydrogen.

  16. Translational Control of Host Gene Expression by a Cys-Motif Protein Encoded in a Bracovirus.

    Science.gov (United States)

    Kim, Eunseong; Kim, Yonggyun

    2016-01-01

    Translational control is a strategy that various viruses use to manipulate their hosts to suppress acute antiviral response. Polydnaviruses, a group of insect double-stranded DNA viruses symbiotic to some endoparasitoid wasps, are divided into two genera: ichnovirus (IV) and bracovirus (BV). In IV, some Cys-motif genes are known as host translation-inhibitory factors (HTIF). The genome of endoparasitoid wasp Cotesia plutellae contains a Cys-motif gene (Cp-TSP13) homologous to an HTIF known as teratocyte-secretory protein 14 (TSP14) of Microplitis croceipes. Cp-TSP13 consists of 129 amino acid residues with a predicted molecular weight of 13.987 kDa and pI value of 7.928. Genomic DNA region encoding its open reading frame has three introns. Cp-TSP13 possesses six conserved cysteine residues as other Cys-motif genes functioning as HTIF. Cp-TSP13 was expressed in Plutella xylostella larvae parasitized by C. plutellae. C. plutellae bracovirus (CpBV) was purified and injected into non-parasitized P. xylostella that expressed Cp-TSP13. Cp-TSP13 was cloned into a eukaryotic expression vector and used to infect Sf9 cells to transiently express Cp-TSP13. The synthesized Cp-TSP13 protein was detected in culture broth. An overlaying experiment showed that the purified Cp-TSP13 entered hemocytes. It was localized in the cytosol. Recombinant Cp-TSP13 significantly inhibited protein synthesis of secretory proteins when it was added to in vitro cultured fat body. In addition, the recombinant Cp-TSP13 directly inhibited the translation of fat body mRNAs in in vitro translation assay using rabbit reticulocyte lysate. Moreover, the recombinant Cp-TSP13 significantly suppressed cellular immune responses by inhibiting hemocyte-spreading behavior. It also exhibited significant insecticidal activities by both injection and feeding routes. These results indicate that Cp-TSP13 is a viral HTIF. PMID:27598941

  17. An Algorithm for Finding Conserved Secondary Structure Motifs in Unaligned RNA Sequences

    Institute of Scientific and Technical Information of China (English)

    Giulio Pavesi; Giancarlo Mauri; Graziano Pesole

    2004-01-01

    Several experiments and observations have revealed the fact that small local distinct structural features in RNA molecules are correlated with their biological function, for example, in post-transcriptional regulation of gene expression. Thus, finding similar structural features in a set of RNA sequences known to play the same biological function could provide substantial information concerning which parts of the sequences are responsible for the function itself. Unfortunately, finding common structural elements in RNA molecules is a very challenging task, even if limited to secondary structure. The main difficulty lies in the fact that in nearly all the cases the structure of the molecules is unknown, has to be somehow predicted, and that sequences with little or no similarity can fold into similar structures. Although they differ in some details, the approaches proposed so far are usually based on the preliminary alignment of the sequences and attempt to predict common structures (either local or global, or for some selected regions) for the aligned sequences. These methods give good results when sequence and structure similarity are very high, but function less well when similarity is limited to small and local elements, like single stem-loop motifs. Instead of aligning the sequences, the algorithm we present directly searches for regions of the sequences that can fold into similar structures, where the degree of similarity can be defined by the user. Any information concerning sequence similarity in the motifs can be used either as a search constraint, or a posteriori, by post-processing the output. The search for the regions sharing structural similarity is implemented with the affix tree, a novel text-indexing structure that significantly accelerates the search for patterns having a symmetric layout, such as those forming stem-loop structures. Tests based on experimentally known structures have shown that the algorithm is able to identify functional motifs in

  18. An approach to evaluate the topological significance of motifs and other patterns in regulatory networks

    Directory of Open Access Journals (Sweden)

    Wingender Edgar

    2009-05-01

    Full Text Available Abstract Background The identification of network motifs as statistically over-represented topological patterns has become one of the most promising topics in the analysis of complex networks. The main focus is commonly made on how they operate by means of their internal organization. Yet, their contribution to a network's global architecture is poorly understood. However, this requires switching from the abstract view of a topological pattern to the level of its instances. Here, we show how a recently proposed metric, the pairwise disconnectivity index, can be adapted to survey if and which kind of topological patterns and their instances are most important for sustaining the connectivity within a network. Results The pairwise disconnectivity index of a pattern instance quantifies the dependency of the pairwise connections between vertices in a network on the presence of this pattern instance. Thereby, it particularly considers how the coherence between the unique constituents of a pattern instance relates to the rest of a network. We have applied the method exemplarily to the analysis of 3-vertex topological pattern instances in the transcription networks of a bacteria (E. coli, a unicellular eukaryote (S. cerevisiae and higher eukaryotes (human, mouse, rat. We found that in these networks only very few pattern instances break lots of the pairwise connections between vertices upon the removal of an instance. Among them network motifs do not prevail. Rather, those patterns that are shared by the three networks exhibit a conspicuously enhanced pairwise disconnectivity index. Additionally, these are often located in close vicinity to each other or are even overlapping, since only a small number of genes are repeatedly present in most of them. Moreover, evidence has gathered that the importance of these pattern instances is due to synergistic rather than merely additive effects between their constituents. Conclusion A new method has been proposed

  19. Conserved retinoblastoma protein-binding motif in human cytomegalovirus UL97 kinase minimally impacts viral replication but affects susceptibility to maribavir

    Directory of Open Access Journals (Sweden)

    Chou Sunwen

    2009-01-01

    Full Text Available Abstract The UL97 kinase has been shown to phosphorylate and inactivate the retinoblastoma protein (Rb and has three consensus Rb-binding motifs that might contribute to this activity. Recombinant viruses containing mutations in the Rb-binding motifs generally replicated well in human foreskin fibroblasts with only a slight delay in replication kinetics. Their susceptibility to the specific UL97 kinase inhibitor, maribavir, was also examined. Mutation of the amino terminal motif, which is involved in the inactivation of Rb, also renders the virus hypersensitive to the drug and suggests that the motif may play a role in its mechanism of action.

  20. Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

    Directory of Open Access Journals (Sweden)

    David Karlin

    Full Text Available Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16aa, several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains that could be detected simply by comparing orthologous proteins.

  1. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2 predicts novel potential therapeutic epitopes.

    Directory of Open Access Journals (Sweden)

    Xiaohong Deng

    Full Text Available Overexpression of human epidermal growth factor receptor 2 (HER2 is associated with tumor aggressiveness and poor prognosis in breast cancer. With the availability of therapeutic antibodies against HER2, great strides have been made in the clinical management of HER2 overexpressing breast cancer. However, de novo and acquired resistance to these antibodies presents a serious limitation to successful HER2 targeting treatment. The identification of novel epitopes of HER2 that can be used for functional/region-specific blockade could represent a central step in the development of new clinically relevant anti-HER2 antibodies. In the present study, we present a novel computational approach as an auxiliary tool for identification of novel HER2 epitopes. We hypothesized that the structurally and linearly evolutionarily conserved motifs of the extracellular domain of HER2 (ECD HER2 contain potential druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our initial hypothesis. Considering that structurally and linearly conserved motifs can provide functional specific configurations, we propose that by comparing the two types of conserved motifs, additional druggable epitopes/targets in the ECD HER2 protein can be identified, which can be further modified for potential therapeutic application. Thus, this novel computational process for predicting or searching for potential epitopes or key target sites may contribute to epitope-based vaccine and function-selected drug design, especially when x-ray crystal structure protein data is not available.

  2. An unusual helix turn helix motif in the catalytic core of HIV-1 integrase binds viral DNA and LEDGF.

    Directory of Open Access Journals (Sweden)

    Hayate Merad

    Full Text Available BACKGROUND: Integrase (IN of the type 1 human immunodeficiency virus (HIV-1 catalyzes the integration of viral DNA into host cellular DNA. We identified a bi-helix motif (residues 149-186 in the crystal structure of the catalytic core (CC of the IN-Phe185Lys variant that consists of the alpha(4 and alpha(5 helices connected by a 3 to 5-residue turn. The motif is embedded in a large array of interactions that stabilize the monomer and the dimer. PRINCIPAL FINDINGS: We describe the conformational and binding properties of the corresponding synthetic peptide. This displays features of the protein motif structure thanks to the mutual intramolecular interactions of the alpha(4 and alpha(5 helices that maintain the fold. The main properties are the binding to: 1- the processing-attachment site at the LTR (long terminal repeat ends of virus DNA with a K(d (dissociation constant in the sub-micromolar range; 2- the whole IN enzyme; and 3- the IN binding domain (IBD but not the IBD-Asp366Asn variant of LEDGF (lens epidermal derived growth factor lacking the essential Asp366 residue. In our motif, in contrast to the conventional HTH (helix-turn-helix, it is the N terminal helix (alpha(4 which has the role of DNA recognition helix, while the C terminal helix (alpha(5 would rather contribute to the motif stabilization by interactions with the alpha(4 helix. CONCLUSION: The motif, termed HTHi (i, for inverted emerges as a central piece of the IN structure and function. It could therefore represent an attractive target in the search for inhibitors working at the DNA-IN, IN-IN and IN-LEDGF interfaces.

  3. G-quadruplex forming structural motifs in the genome of Deinococcus radiodurans and their regulatory roles in promoter functions.

    Science.gov (United States)

    Kota, Swathi; Dhamodharan, V; Pradeepkumar, P I; Misra, Hari S

    2015-11-01

    Deinococcus radiodurans displays compromised radioresistance in the presence of guanine quadruplex (G4)-binding drugs (G4 drugs). Genome-wide scanning showed islands of guanine runs (G-motif) in the upstream regions of coding sequences as well as in the structural regions of many genes, indicating a role for G4 DNA in the regulation of genome functions in this bacterium. G-motifs present upstream to some of the DNA damage-responsive genes like lexA, pprI, recF, recQ, mutL and radA were synthesized, and the formation of G4 DNA structures was probed in vitro. The G-motifs present at the 67th position upstream to recQ and at the 121st position upstream to mutL produced parallel and mixed G4 DNA structures, respectively. Expression of β-galactosidase under recQ and mutL promoters containing respective G-motifs was inhibited by G4 drugs under normal growth conditions in D. radiodurans. However, when such cells were exposed to γ radiation, mutL promoter activity was stimulated while recQ promoter activity was inhibited in the presence of G4 drugs. Deletion of the G-motif from the recQ promoter could relax it from G4 drug repression. D. radiodurans cells treated with G4 drug showed reduction in recQ expression and γ radiation resistance, indicating an involvement of G4 DNA in the radioresistance of this bacterium. These results suggest that G-motifs from D. radiodurans genome form different types of G4 DNA structures at least in vitro, and the recQ and mutL promoters seem to be differentially regulated at the levels of G4 DNA structures.

  4. Crystal structure of bacterial cell-surface alginate-binding protein with an M75 peptidase motif

    Energy Technology Data Exchange (ETDEWEB)

    Maruyama, Yukie; Ochiai, Akihito [Laboratory of Basic and Applied Molecular Biotechnology, Graduate School of Agriculture, Kyoto University, Uji, Kyoto 611-0011 (Japan); Mikami, Bunzo [Laboratory of Applied Structural Biology, Graduate School of Agriculture, Kyoto University, Uji, Kyoto 611-0011 (Japan); Hashimoto, Wataru [Laboratory of Basic and Applied Molecular Biotechnology, Graduate School of Agriculture, Kyoto University, Uji, Kyoto 611-0011 (Japan); Murata, Kousaku, E-mail: kmurata@kais.kyoto-u.ac.jp [Laboratory of Basic and Applied Molecular Biotechnology, Graduate School of Agriculture, Kyoto University, Uji, Kyoto 611-0011 (Japan)

    2011-02-18

    Research highlights: {yields} Bacterial alginate-binding Algp7 is similar to component EfeO of Fe{sup 2+} transporter. {yields} We determined the crystal structure of Algp7 with a metal-binding motif. {yields} Algp7 consists of two helical bundles formed through duplication of a single bundle. {yields} A deep cleft involved in alginate binding locates around the metal-binding site. {yields} Algp7 may function as a Fe{sup 2+}-chelated alginate-binding protein. -- Abstract: A gram-negative Sphingomonas sp. A1 directly incorporates alginate polysaccharide into the cytoplasm via the cell-surface pit and ABC transporter. A cell-surface alginate-binding protein, Algp7, functions as a concentrator of the polysaccharide in the pit. Based on the primary structure and genetic organization in the bacterial genome, Algp7 was found to be homologous to an M75 peptidase motif-containing EfeO, a component of a ferrous ion transporter. Despite the presence of an M75 peptidase motif with high similarity, the Algp7 protein purified from recombinant Escherichia coli cells was inert on insulin B chain and N-benzoyl-Phe-Val-Arg-p-nitroanilide, both of which are substrates for a typical M75 peptidase, imelysin, from Pseudomonas aeruginosa. The X-ray crystallographic structure of Algp7 was determined at 2.10 A resolution by single-wavelength anomalous diffraction. Although a metal-binding motif, HxxE, conserved in zinc ion-dependent M75 peptidases is also found in Algp7, the crystal structure of Algp7 contains no metal even at the motif. The protein consists of two structurally similar up-and-down helical bundles as the basic scaffold. A deep cleft between the bundles is sufficiently large to accommodate macromolecules such as alginate polysaccharide. This is the first structural report on a bacterial cell-surface alginate-binding protein with an M75 peptidase motif.

  5. Characterizing the binding motifs of 11 common human HLA‐DP and HLA‐DQ molecules using NNAlign

    DEFF Research Database (Denmark)

    Andreatta, Massimo; Nielsen, Morten

    2012-01-01

    Compared with HLA‐DR molecules, the specificities of HLA‐DP and HLA‐DQ molecules have only been studied to a limited extent. The description of the binding motifs has been mostly anecdotal and does not provide a quantitative measure of the importance of each position in the binding core and the......‐based method NNAlign, we characterized the binding specificities of five HLA‐DP and six HLA‐DQ among the most frequent in the human population. The identified binding motifs showed an overall concurrence with earlier studies but revealed subtle differences. The DP molecules revealed a large overlap in the...

  6. The Unique-5 and -6 Motifs of ZO-1 Regulate Tight Junction Strand Localization and Scaffolding Properties

    OpenAIRE

    Fanning, Alan S.; Little, Brent P.; Rahner, Christoph; Utepbergenov, Darkhan; Walther, Zenta; James M Anderson

    2007-01-01

    The proper cellular location and sealing of tight junctions is assumed to depend on scaffolding properties of ZO-1, a member of the MAGUK protein family. ZO-1 contains a conserved SH3-GUK module that is separated by a variable region (unique-5), which in other MAGUKs has proven regulatory functions. To identify motifs in ZO-1 critical for its putative scaffolding functions, we focused on the SH3-GUK module including unique-5 (U5) and unique-6 (U6), a motif immediately C-terminal of the GUK do...

  7. Homology modeling studies of yeast Mitogen-Activated Protein Kinases (MAPKS): structural motifs as a basis for specificity.

    Science.gov (United States)

    Smith, D L; Nilar, S H

    2010-06-01

    Mitogen-activated protein kinases (MAPKs) are key components of cellular signal transduction. It is the objective of this communication to demonstrate that insight into protein-protein interactions in the Common Docking motif of yeast mitogen-activated protein kinases can be obtained based on homology models. Homology models for four yeast MAPKs, FUS3, KSS1, HOG1 and MPK1 were built based on the X-ray structures of active and inactive rat ERK2. The structural motifs required for the basis of specificity were rationalized based on these structures. PMID:19995338

  8. Temporal motifs reveal homophily, gender-specific patterns and group talk in mobile communication networks

    CERN Document Server

    Kovanen, Lauri; Kertész, János; Saramäki, Jari

    2013-01-01

    Electronic communication records provide detailed information about temporal aspects of human interaction. Previous studies have shown that individuals' communication patterns have complex temporal structure, and that this structure has system-wide effects. In this paper we use mobile phone records to show that interaction patterns involving multiple individuals have non-trivial temporal structure that cannot be deduced from a network presentation where only interaction frequencies are taken into account. We apply a recently introduced method, temporal motifs, to identify interaction patterns in a temporal network where nodes have additional attributes such as gender and age. We then develop a null model that allows identifying differences between various types of nodes so that these differences are independent of the network based on interaction frequencies. We find gender-related differences in communication patters, and show the existence of temporal homophily, the tendency of similar individuals to partic...

  9. THE MOTIF OF ASPIRATION FOR PEACE IN YAKOV POLONSKY'S LYRIC POETRY

    Directory of Open Access Journals (Sweden)

    Garicheva E. A.

    2008-11-01

    Full Text Available The article examines semantics of "peace" as one of the main motifs of Yakov Polonsky's poetry. Peace is the birth of faith in a man's soul, lowly acceptance of God's grace, striving for the balance between the earthly and the heavenly, renewal of wholeness of the Soul, transfiguration. It is asserted that a person's transfiguration is possible if the human heart keeps faith in the ideal and ability to sympathize. Intonation and the melodies of the poems reveal the person's mental and spiritual pursuit of peace (eupathy. The article discovers the meaning of metaphors, which become symbols in Polonsky's poems, and the symbolism of color, which is associated with icon painting.

  10. Dysprosium-carboxylate nanomeshes with tunable cavity size and assembly motif through ionic interactions.

    Science.gov (United States)

    Cirera, B; Đorđević, L; Otero, R; Gallego, J M; Bonifazi, D; Miranda, R; Ecija, D

    2016-09-28

    We report the design of dysprosium directed metallo-supramolecular architectures on a pristine Cu(111) surface. By an appropriate selection of the ditopic molecular linkers equipped with terminal carboxylic groups (TPA, PDA and TDA species), we create reticular and mononuclear metal-organic nanomeshes of tunable internodal distance, which are stabilized by eight-fold DyO interactions. A thermal annealing treatment for the reticular Dy:TDA architecture gives rise to an unprecedented quasi-hexagonal nanostructure based on dinuclear Dy clusters, exhibiting a unique six-fold DyO bonding motif. All metallo-supramolecular architectures are stable at room temperature. Our results open new avenues for the engineering of supramolecular architectures on surfaces incorporating f-block elements forming thermally robust nanoarchitectures through ionic bonds. PMID:27560774

  11. The Reovirus Sigmal Aspartic Acid Sandwich: A Trimerization Motif Poised for Conformational Change

    Energy Technology Data Exchange (ETDEWEB)

    Schelling,P.; Guglielml, K.; Kirchner, E.; Paetzold, b.; Dermody, T.; Stehle, T.

    2007-01-01

    Reovirus attachment protein {sigma}1 mediates engagement of receptors on the surface of target cells and undergoes dramatic conformational rearrangements during viral disassembly in the endocytic pathway. The {sigma}1 protein is a filamentous, trimeric molecule with a globular {beta}-barrel head domain. An unusual cluster of aspartic acid residues sandwiched between hydrophobic tyrosines is located at the {sigma}1 subunit interface. A 1.75 {angstrom} structure of the {sigma}1 head domain now reveals two water molecules at the subunit interface that are held strictly in position and interact with neighboring residues. Structural and biochemical analyses of mutants affecting the aspartic acid sandwich indicate that these residues and the corresponding chelated water molecules act as a plug to block the free flow of solvent and stabilize the trimer. This arrangement of residues at the {sigma}1 head trimer interface illustrates a new protein design motif that may confer conformational mobility during cell entry.

  12. The reovirus sigma1 aspartic acid sandwich: a trimerization motif poised for conformational change.

    Science.gov (United States)

    Schelling, Pierre; Guglielmi, Kristen M; Kirchner, Eva; Paetzold, Bernhard; Dermody, Terence S; Stehle, Thilo

    2007-04-13

    Reovirus attachment protein sigma1 mediates engagement of receptors on the surface of target cells and undergoes dramatic conformational rearrangements during viral disassembly in the endocytic pathway. The sigma1 protein is a filamentous, trimeric molecule with a globular beta-barrel head domain. An unusual cluster of aspartic acid residues sandwiched between hydrophobic tyrosines is located at the sigma1 subunit interface. A 1.75-A structure of the sigma1 head domain now reveals two water molecules at the subunit interface that are held strictly in position and interact with neighboring residues. Structural and biochemical analyses of mutants affecting the aspartic acid sandwich indicate that these residues and the corresponding chelated water molecules act as a plug to block the free flow of solvent and stabilize the trimer. This arrangement of residues at the sigma1 head trimer interface illustrates a new protein design motif that may confer conformational mobility during cell entry.

  13. Constitutional Dynamics of Metal-Organic Motifs on a Au(111) Surface.

    Science.gov (United States)

    Kong, Huihui; Zhang, Chi; Xie, Lei; Wang, Likun; Xu, Wei

    2016-06-13

    Constitutional dynamic chemistry (CDC), including both dynamic covalent chemistry and dynamic noncovalent chemistry, relies on reversible formation and breakage of bonds to achieve continuous changes in constitution by reorganization of components. In this regard, CDC is considered to be an efficient and appealing strategy for selective fabrication of surface nanostructures by virtue of dynamic diversity. Although constitutional dynamics of monolayered structures has been recently demonstrated at liquid/solid interfaces, most of molecular reorganization/reaction processes were thought to be irreversible under ultrahigh vacuum (UHV) conditions where CDC is therefore a challenge to be achieved. Here, we have successfully constructed a system that presents constitutional dynamics on a solid surface based on dynamic coordination chemistry, in which selective formation of metal-organic motifs is achieved under UHV conditions. The key to making this reversible switching successful is the molecule-substrate interaction as revealed by DFT calculations. PMID:27144822

  14. Identification of Ubiquinol Binding Motifs at the Qo-Site of the Cytochrome bc1 Complex

    DEFF Research Database (Denmark)

    Barragan, Angela M.; Crofts, Antony R.; Schulten, Klaus;

    2015-01-01

    Enzymes of the bc1 complex family power the biosphere through their central role in respiration and photosynthesis. These enzymes couple the oxidation of quinol molecules by cytochrome c to the transfer of protons across the membrane, to generate a proton-motive force that drives ATP synthesis. Key...... for the function of the bc1 complex is the initial redox process that involves a bifurcated electron transfer in which the two electrons from a quinol substrate are passed to different electron acceptors in the bc1 complex. The electron transfer is coupled to proton transfer. The overall mechanism of...... quinol oxidation by the bc1 complex is well enough characterized to allow exploration at the atomistic level, but details are still highly controversial. The controversy stems from the uncertain binding motifs of quinol at the so-called Qo active site of the bc1 complex. Here we employ a combination of...

  15. Noise transmission and delay-induced stochasticoscillations in biochemical network motifs

    Institute of Scientific and Technical Information of China (English)

    Liu Sheng-Jun; Wang Qi; Liu Bo; Yan Shi-Wei; Fumihiko Sakata

    2011-01-01

    With the aid of stochastic delayed-feedback differential equations,we derive an analytic expression for the power spectra of reacting molecules included in a generic biological network motif that is incorporated with a feedback mechanism and time delays in gene regulation.We systematically analyse the effects of time delays,the feedback mechanism,and biological stochasticity on the power spectra.It has been clarified that the time delays together with the feedback mechanism can induce stochastic oscillations at the molecular level and invalidate the noise addition rule for a modular description of the noise propagator.Delay-induced stochastic resonance can be expected,which is related to the stability loss of the reaction systems and Hopf bifurcation occurring for solutions of the corresponding deterministic reaction equations.Through the analysis of the power spectrum,a new approach is proposed to estimate the oscillation period.

  16. Oxadiazoles as privileged motifs for promising anticancer leads: recent advances and future prospects.

    Science.gov (United States)

    Khan, Imtiaz; Ibrar, Aliya; Abbas, Naeem

    2014-01-01

    Taking into account the rising trend of the incidence of cancers of various organs, effective therapies are urgently needed to control human malignancies. The rapid emergence of hundreds of new agents that modulate an ever-growing list of cancer-specific molecular targets offers tremendous hope for cancer patients. However, almost all of the chemotherapy drugs currently on the market cause serious side effects. Based on these facts, the design of new chemical entities as anticancer agents requires the simulation of a suitable bioactive pharmacophore. The pharmacophore not only should have the required potency but must also be safer on normal cell lines than on tumor cells. In this perspective, oxadiazole scaffolds with well-defined anticancer activity profile have fueled intense academic and industrial research in recent years. This paper is intended to highlight the recent advances along with current developments as well as future outlooks for the design of novel and efficacious anticancer agents based on oxadiazole motifs.

  17. Illumina MiSeq sequencing disfavours a sequence motif in the GFP reporter gene.

    Science.gov (United States)

    Van den Hoecke, Silvie; Verhelst, Judith; Saelens, Xavier

    2016-01-01

    Green fluorescent protein (GFP) is one of the most used reporter genes. We have used next-generation sequencing (NGS) to analyse the genetic diversity of a recombinant influenza A virus that expresses GFP and found a remarkable coverage dip in the GFP coding sequence. This coverage dip was present when virus-derived RT-PCR product or the parental plasmid DNA was used as starting material for NGS and regardless of whether Nextera XT transposase or Covaris shearing was used for DNA fragmentation. Therefore, the sequence coverage dip in the GFP coding sequence was not the result of emerging GFP mutant viruses or a bias introduced by Nextera XT fragmentation. Instead, we found that the Illumina MiSeq sequencing method disfavours the 'CCCGCC' motif in the GFP coding sequence. PMID:27193250

  18. Detection of motifs in anomalies from nuclear power plant data using data mining techniques

    International Nuclear Information System (INIS)

    Anomaly detection deals with the discovery of abnormal behaviour from the given data. In the recent times, there has been great research interest towards anomaly detection using data mining techniques. The reason being that in many real world applications, extraction of abnormalities is much more important than detection and analysis of normal behaviour. This is specifically significant in those applications wherein timely maintenance of anomalies is costly and very crucial to the application. In certain cases, it is also possible that there exist some pattern in the anomalies. In the present work, the focus is on detection of patterns in anomalies from Nuclear Power Plant (NPP) data. Further, an analysis has been done to identify the different types of patterns from the NPP data. These different types of patterns have been denoted as 'motifs' to signify the repetitive nature of various types of patterns in anomalies. Such analysis has been done for predictive maintenance in nuclear power plants. (author)

  19. The Human Papillomavirus E6 PDZ Binding Motif: From Life Cycle to Malignancy

    Directory of Open Access Journals (Sweden)

    Ketaki Ganti

    2015-07-01

    Full Text Available Cancer-causing HPV E6 oncoproteins are characterized by the presence of a PDZ binding motif (PBM at their extreme carboxy terminus. It was long thought that this region of E6 had a sole function to confer interaction with a defined set of cellular substrates. However, more recent studies have shown that the E6 PBM has a complex pattern of regulation, whereby phosphorylation within the PBM can regulate interaction with two classes of cellular proteins: those containing PDZ domains and the members of the 14-3-3 family of proteins. In this review, we explore the roles that the PBM and its ligands play in the virus life cycle, and subsequently how these can inadvertently contribute towards the development of malignancy. We also explore how subtle alterations in cellular signal transduction pathways might result in aberrant E6 phosphorylation, which in turn might contribute towards disease progression.

  20. Small circular DNA molecules act as rigid motifs to build DNA nanotubes.

    Science.gov (United States)

    Zheng, Hongning; Xiao, Minyu; Yan, Qin; Ma, Yinzhou; Xiao, Shou-Jun

    2014-07-23

    Small circular DNA molecules with designed lengths, for example 64 and 96 nucleotides (nt), after hybridization with a few 32-nt staple strands respectively, can act as rigid motifs for the construction of DNA nanotubes with excellent uniformity in ring diameter. Unlike most native DNA nanotubes, which consist of longitudinal double helices, nanotubes assembled from circular DNAs are constructed from lateral double helices. Of the five types of DNA nanotubes designed here, four are built by alternating two different rings of the same ring size, while one is composed of all the same 96-nt rings. Nanotubes constructed from the same 96-nt rings are 10-100 times shorter than those constructed from two different 96-nt rings, because there are fewer hinge joints on the rings.