WorldWideScience

Sample records for tfbs motifs based

  1. Nencki Genomics Database--Ensembl funcgen enhanced with intersections, user data and genome-wide TFBS motifs.

    Science.gov (United States)

    Krystkowiak, Izabella; Lenart, Jakub; Debski, Konrad; Kuterba, Piotr; Petas, Michal; Kaminska, Bozena; Dabrowski, Michal

    2013-01-01

    We present the Nencki Genomics Database, which extends the functionality of Ensembl Regulatory Build (funcgen) for the three species: human, mouse and rat. The key enhancements over Ensembl funcgen include the following: (i) a user can add private data, analyze them alongside the public data and manage access rights; (ii) inside the database, we provide efficient procedures for computing intersections between regulatory features and for mapping them to the genes. To Ensembl funcgen-derived data, which include data from ENCODE, we add information on conserved non-coding (putative regulatory) sequences, and on genome-wide occurrence of transcription factor binding site motifs from the current versions of two major motif libraries, namely, Jaspar and Transfac. The intersections and mapping to the genes are pre-computed for the public data, and the result of any procedure run on the data added by the users is stored back into the database, thus incrementally increasing the body of pre-computed data. As the Ensembl funcgen schema for the rat is currently not populated, our database is the first database of regulatory features for this frequently used laboratory animal. The database is accessible without registration using the mysql client: mysql -h database.nencki-genomics.org -u public. Registration is required only to add or access private data. A WSDL webservice provides access to the database from any SOAP client, including the Taverna Workbench with a graphical user interface.

  2. Direct AUC optimization of regulatory motifs.

    Science.gov (United States)

    Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

    2017-07-15

    The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may also be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequences specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  3. Motifs in triadic random graphs based on Steiner triple systems

    Science.gov (United States)

    Winkler, Marco; Reichardt, Jörg

    2013-08-01

    Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade, the overabundance of certain subnetwork patterns, i.e., the so-called motifs, has attracted much attention. It has been hypothesized that these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information, is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graph models (ERGMs) to define models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obstacle, we use Steiner triple systems (STSs). These are partitions of sets of nodes into pair-disjoint triads, which thus can be specified independently. Combining the concepts of ERGMs and STSs, we suggest generative models capable of generating ensembles of networks with nontrivial triadic Z-score profiles. Further, we discover inevitable correlations between the abundance of triad patterns, which occur solely for statistical reasons and need to be taken into account when discussing the functional implications of motif statistics. Moreover, we calculate the degree distributions of our triadic random graphs analytically.

  4. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  5. Leucine-based receptor sorting motifs are dependent on the spacing relative to the plasma membrane

    DEFF Research Database (Denmark)

    Geisler, C; Dietrich, J; Nielsen, B L

    1998-01-01

    Many integral membrane proteins contain leucine-based motifs within their cytoplasmic domains that mediate internalization and intracellular sorting. Two types of leucine-based motifs have been identified. One type is dependent on phosphorylation, whereas the other type, which includes an acidic...... amino acid, is constitutively active. In this study, we have investigated how the spacing relative to the plasma membrane affects the function of both types of leucine-based motifs. For phosphorylation-dependent leucine-based motifs, a minimal spacing of 7 residues between the plasma membrane...... and the phospho-acceptor was required for phosphorylation and thereby activation of the motifs. For constitutively active leucine-based motifs, a minimal spacing of 6 residues between the plasma membrane and the acidic residue was required for optimal activity of the motifs. In addition, we found that the acidic...

  6. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  7. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    ... machine (SVM) and its application in microarray experiment of Kashin-Beck disease. ... speed and amount of the corresponding mRNA in gene replication process. ... and revealed that some motifs may be related to the immune reactions.

  8. Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

    Science.gov (United States)

    Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

    2012-01-01

    Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235

  9. Codon based co-occurrence network motifs in human mitochondria

    Directory of Open Access Journals (Sweden)

    Pramod Shinde

    2017-10-01

    Full Text Available The nucleotide polymorphism in human mitochondrial genome (mtDNA tolled by codon position bias plays an indispensable role in human population dispersion and expansion. Herein, we constructed genome-wide nucleotide co-occurrence networks using a massive data consisting of five different geographical regions and around 3000 samples for each region. We developed a powerful network model to describe complex mitochondrial evolutionary patterns between codon and non-codon positions. It was interesting to report a different evolution of Asian genomes than those of the rest which is divulged by network motifs. We found evidence that mtDNA undergoes substantial amounts of adaptive evolution, a finding which was supported by a number of previous studies. The dominance of higher order motifs indicated the importance of long-range nucleotide co-occurrence in genomic diversity. Most notably, codon motifs apparently underpinned the preferences among codon positions for co-evolution which is probably highly biased during the origin of the genetic code. Our analyses manifested that codon position co-evolution is very well conserved across human sub-populations and independently maintained within human sub-populations implying the selective role of evolutionary processes on codon position co-evolution. Ergo, this study provided a framework to investigate cooperative genomic interactions which are critical in underlying complex mitochondrial evolution.

  10. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    Science.gov (United States)

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to accumulation of information about a huge amount of DNA sequences. There are a lot of web services which are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. An enormous motif diversity makes their finding challenging. In order to avoid the difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of the motif discovery programs dramatically declines with the query set size increase. This leads to the fact that only a fraction of top "peak" ChIP-Seq segments can be analyzed or the area of analysis should be narrowed. Thus, the motif discovery in massive datasets remains a challenging issue. Argo_Compute Unified Device Architecture (CUDA) web service is designed to process the massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on the high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed the motifs which correspond to known transcription factor binding sites.

  11. Discovery and validation of information theory-based transcription factor and cofactor binding site motifs.

    Science.gov (United States)

    Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K

    2017-03-17

    Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. Efficient sequential and parallel algorithms for finding edit distance based motifs.

    Science.gov (United States)

    Pal, Soumitra; Xiao, Peng; Rajasekaran, Sanguthevar

    2016-08-18

    Motif search is an important step in extracting meaningful patterns from biological data. The general problem of motif search is intractable and there is a pressing need to develop efficient, exact and approximation algorithms to solve this problem. In this paper, we present several novel, exact, sequential and parallel algorithms for solving the (l,d) Edit-distance-based Motif Search (EMS) problem: given two integers l,d and n biological strings, find all strings of length l that appear in each input string with atmost d errors of types substitution, insertion and deletion. One popular technique to solve the problem is to explore for each input string the set of all possible l-mers that belong to the d-neighborhood of any substring of the input string and output those which are common for all input strings. We introduce a novel and provably efficient neighborhood exploration technique. We show that it is enough to consider the candidates in neighborhood which are at a distance exactly d. We compactly represent these candidate motifs using wildcard characters and efficiently explore them with very few repetitions. Our sequential algorithm uses a trie based data structure to efficiently store and sort the candidate motifs. Our parallel algorithm in a multi-core shared memory setting uses arrays for storing and a novel modification of radix-sort for sorting the candidate motifs. The algorithms for EMS are customarily evaluated on several challenging instances such as (8,1), (12,2), (16,3), (20,4), and so on. The best previously known algorithm, EMS1, is sequential and in estimated 3 days solves up to instance (16,3). Our sequential algorithms are more than 20 times faster on (16,3). On other hard instances such as (9,2), (11,3), (13,4), our algorithms are much faster. Our parallel algorithm has more than 600 % scaling performance while using 16 threads. Our algorithms have pushed up the state-of-the-art of EMS solvers and we believe that the techniques introduced in

  13. Identify Beta-Hairpin Motifs with Quadratic Discriminant Algorithm Based on the Chemical Shifts.

    Directory of Open Access Journals (Sweden)

    Feng YongE

    Full Text Available Successful prediction of the beta-hairpin motif will be helpful for understanding the of the fold recognition. Some algorithms have been proposed for the prediction of beta-hairpin motifs. However, the parameters used by these methods were primarily based on the amino acid sequences. Here, we proposed a novel model for predicting beta-hairpin structure based on the chemical shift. Firstly, we analyzed the statistical distribution of chemical shifts of six nuclei in not beta-hairpin and beta-hairpin motifs. Secondly, we used these chemical shifts as features combined with three algorithms to predict beta-hairpin structure. Finally, we achieved the best prediction, namely sensitivity of 92%, the specificity of 94% with 0.85 of Mathew's correlation coefficient using quadratic discriminant analysis algorithm, which is clearly superior to the same method for the prediction of beta-hairpin structure from 20 amino acid compositions in the three-fold cross-validation. Our finding showed that the chemical shift is an effective parameter for beta-hairpin prediction, suggesting the quadratic discriminant analysis is a powerful algorithm for the prediction of beta-hairpin.

  14. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick eSobetzko

    2016-03-01

    Full Text Available Short DNA motifs are involved in a multitude of functions such as for example chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool DistAMo (Distribution Analysis of DNA Motifs. The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i genes beside the replication origin are enriched in GATCs, (ii genome-wide GATC distribution follows a distinct pattern and (iii genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.

  15. Transduction motif analysis of gastric cancer based on a human signaling network

    Energy Technology Data Exchange (ETDEWEB)

    Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Fuzhou General Hospital of Nanjing Command, Department of Gastroenterology, Fuzhou, China, Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)

    2014-04-04

    To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed by CytoScape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used for the mining of gastric cancer-related motifs in a network with three vertices. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was configured to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.

  16. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    Science.gov (United States)

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  17. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

    Directory of Open Access Journals (Sweden)

    Yin Wang

    2016-01-01

    Full Text Available Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  18. Novel peptide-based platform for the dual presentation of biologically active peptide motifs on biomaterials.

    Science.gov (United States)

    Mas-Moruno, Carlos; Fraioli, Roberta; Albericio, Fernando; Manero, José María; Gil, F Javier

    2014-05-14

    Biofunctionalization of metallic materials with cell adhesive molecules derived from the extracellular matrix is a feasible approach to improve cell-material interactions and enhance the biointegration of implant materials (e.g., osseointegration of bone implants). However, classical biomimetic strategies may prove insufficient to elicit complex and multiple biological signals required in the processes of tissue regeneration. Thus, newer strategies are focusing on installing multifunctionality on biomaterials. In this work, we introduce a novel peptide-based divalent platform with the capacity to simultaneously present distinct bioactive peptide motifs in a chemically controlled fashion. As a proof of concept, the integrin-binding sequences RGD and PHSRN were selected and introduced in the platform. The biofunctionalization of titanium with this platform showed a positive trend towards increased numbers of cell attachment, and statistically higher values of spreading and proliferation of osteoblast-like cells compared to control noncoated samples. Moreover, it displayed statistically comparable or improved cell responses compared to samples coated with the single peptides or with an equimolar mixture of the two motifs. Osteoblast-like cells produced higher levels of alkaline phosphatase on surfaces functionalized with the platform than on control titanium; however, these values were not statistically significant. This study demonstrates that these peptidic structures are versatile tools to convey multiple biofunctionality to biomaterials in a chemically defined manner.

  19. Physical-chemical property based sequence motifs and methods regarding same

    Science.gov (United States)

    Braun, Werner [Friendswood, TX; Mathura, Venkatarajan S [Sarasota, FL; Schein, Catherine H [Friendswood, TX

    2008-09-09

    A data analysis system, program, and/or method, e.g., a data mining/data exploration method, using physical-chemical property motifs. For example, a sequence database may be searched for identifying segments thereof having physical-chemical properties similar to the physical-chemical property motifs.

  20. Sequence-specific high mobility group box factors recognize 10-12-base pair minor groove motifs

    DEFF Research Database (Denmark)

    van Beest, M; Dooijes, D; van De Wetering, M

    2000-01-01

    Sequence-specific high mobility group (HMG) box factors bind and bend DNA via interactions in the minor groove. Three-dimensional NMR analyses have provided the structural basis for this interaction. The cognate HMG domain DNA motif is generally believed to span 6-8 bases. However, alignment...

  1. Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins

    Science.gov (United States)

    de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin

    2016-01-01

    Abstract Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381

  2. URS DataBase: universe of RNA structures and their motifs.

    Science.gov (United States)

    Baulin, Eugene; Yacovlev, Victor; Khachko, Denis; Spirin, Sergei; Roytberg, Mikhail

    2016-01-01

    The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing user to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA-protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification.Database URL: http://server3.lpm.org.ru/urs/. © The Author(s) 2016. Published by Oxford University Press.

  3. Fast social-like learning of complex behaviors based on motor motifs

    Science.gov (United States)

    Calvo Tapia, Carlos; Tyukin, Ivan Y.; Makarov, Valeri A.

    2018-05-01

    Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n -1 )! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n -1 ) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.

  4. Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.

    Science.gov (United States)

    Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M

    1997-01-01

    RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620

  5. Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.

    Science.gov (United States)

    Zhao, Xiaoyan; Sze, Sing-Hoi

    2011-05-01

    One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2(l)) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif ) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.

  6. Design of character-based DNA barcode motif for species identification: A computational approach and its validation in fishes.

    Science.gov (United States)

    Chakraborty, Mohua; Dhar, Bishal; Ghosh, Sankar Kumar

    2017-11-01

    The DNA barcodes are generally interpreted using distance-based and character-based methods. The former uses clustering of comparable groups, based on the relative genetic distance, while the latter is based on the presence or absence of discrete nucleotide substitutions. The distance-based approach has a limitation in defining a universal species boundary across the taxa as the rate of mtDNA evolution is not constant throughout the taxa. However, character-based approach more accurately defines this using a unique set of nucleotide characters. The character-based analysis of full-length barcode has some inherent limitations, like sequencing of the full-length barcode, use of a sparse-data matrix and lack of a uniform diagnostic position for each group. A short continuous stretch of a fragment can be used to resolve the limitations. Here, we observe that a 154-bp fragment, from the transversion-rich domain of 1367 COI barcode sequences can successfully delimit species in the three most diverse orders of freshwater fishes. This fragment is used to design species-specific barcode motifs for 109 species by the character-based method, which successfully identifies the correct species using a pattern-matching program. The motifs also correctly identify geographically isolated population of the Cypriniformes species. Further, this region is validated as a species-specific mini-barcode for freshwater fishes by successful PCR amplification and sequencing of the motif (154 bp) using the designed primers. We anticipate that use of such motifs will enhance the diagnostic power of DNA barcode, and the mini-barcode approach will greatly benefit the field-based system of rapid species identification. © 2017 John Wiley & Sons Ltd.

  7. Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2015-06-01

    , design motifs creation and the embodiment of batik. From the creation of this art successfully created into 6 (six motif, namely: (1 Motif Uwoh Kopi; (2 Motif Godhong Kopi; (3 Motif Ceplok Kakao; (4 Motif Kakao Raja; (5 Motif Kakao Biru; and (6 Motif Wiji Mukti. Based on the results of the “Aesthetics assessment taste" has been noticed that the most widely preferred motif is a Uwoh Kopi motif and Kakao Raja motif. Keywords: Motif Uwoh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti

  8. Plasmodium vivax antigen discovery based on alpha-helical coiled coil protein motif

    DEFF Research Database (Denmark)

    Céspedes, Nora; Habel, Catherine; Lopez-Perez, Mary

    2014-01-01

    Protein α-helical coiled coil structures that elicit antibody responses, which block critical functions of medically important microorganisms, represent a means for vaccine development. By using bioinformatics algorithms, a total of 50 antigens with α-helical coiled coil motifs orthologous to Pla...

  9. RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.

    Science.gov (United States)

    Pan, Xiaoyong; Shen, Hong-Bin

    2017-02-28

    RNAs play key roles in cells through the interactions with proteins known as the RNA-binding proteins (RBP) and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence, structure, their domain specific features and formats have posed significant computational challenges. One of current difficulties is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into the meaningful binding motifs is a topic worth of further investigation. In viewing of these challenges, we propose a deep learning-based framework (iDeep) by using a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol is featured by transforming the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6

  10. A systems wide mass spectrometric based linear motif screen to identify dominant in-vivo interacting proteins for the ubiquitin ligase MDM2.

    Science.gov (United States)

    Nicholson, Judith; Scherl, Alex; Way, Luke; Blackburn, Elizabeth A; Walkinshaw, Malcolm D; Ball, Kathryn L; Hupp, Ted R

    2014-06-01

    Linear motifs mediate protein-protein interactions (PPI) that allow expansion of a target protein interactome at a systems level. This study uses a proteomics approach and linear motif sub-stratifications to expand on PPIs of MDM2. MDM2 is a multi-functional protein with over one hundred known binding partners not stratified by hierarchy or function. A new linear motif based on a MDM2 interaction consensus is used to select novel MDM2 interactors based on Nutlin-3 responsiveness in a cell-based proteomics screen. MDM2 binds a subset of peptide motifs corresponding to real proteins with a range of allosteric responses to MDM2 ligands. We validate cyclophilin B as a novel protein with a consensus MDM2 binding motif that is stabilised by Nutlin-3 in vivo, thus identifying one of the few known interactors of MDM2 that is stabilised by Nutlin-3. These data invoke two modes of peptide binding at the MDM2 N-terminus that rely on a consensus core motif to control the equilibrium between MDM2 binding proteins. This approach stratifies MDM2 interacting proteins based on the linear motif feature and provides a new biomarker assay to define clinically relevant Nutlin-3 responsive MDM2 interactors. Copyright © 2014 Elsevier Inc. All rights reserved.

  11. T cell receptor zeta allows stable expression of receptors containing the CD3gamma leucine-based receptor-sorting motif

    DEFF Research Database (Denmark)

    Dietrich, J; Geisler, C

    1998-01-01

    The leucine-based motif in the T cell receptor (TCR) subunit CD3gamma constitutes a strong internalization signal. In fully assembled TCR this motif is inactive unless phosphorylated. In contrast, the motif is constitutively active in CD4/CD3gamma and Tac/CD3gamma chimeras independently of phosph......The leucine-based motif in the T cell receptor (TCR) subunit CD3gamma constitutes a strong internalization signal. In fully assembled TCR this motif is inactive unless phosphorylated. In contrast, the motif is constitutively active in CD4/CD3gamma and Tac/CD3gamma chimeras independently...... of phosphorylation and leads to rapid internalization and sorting of these chimeras to lysosomal degradation. Because the TCRzeta chain rescues incomplete TCR complexes from lysosomal degradation and allows stable surface expression of fully assembled TCR, we addressed the question whether TCRzeta has the potential...... to mask the CD3gamma leucine-based motif. By studying CD4/CD3gamma and CD16/CD3gamma chimeras, we found that CD16/CD3gamma chimeras associated with TCRzeta. The CD16/CD3gamma-TCRzeta complexes were stably expressed at the cell surface and had a low spontaneous internalization rate, indicating...

  12. MicroRNA genes preferentially expressed in dendritic cells contain sites for conserved transcription factor binding motifs in their promoters

    Directory of Open Access Journals (Sweden)

    Huynen Martijn A

    2011-06-01

    Full Text Available Abstract Background MicroRNAs (miRNAs play a fundamental role in the regulation of gene expression by translational repression or target mRNA degradation. Regulatory elements in miRNA promoters are less well studied, but may reveal a link between their expression and a specific cell type. Results To explore this link in myeloid cells, miRNA expression profiles were generated from monocytes and dendritic cells (DCs. Differences in miRNA expression among monocytes, DCs and their stimulated progeny were observed. Furthermore, putative promoter regions of miRNAs that are significantly up-regulated in DCs were screened for Transcription Factor Binding Sites (TFBSs based on TFBS motif matching score, the degree to which those TFBSs are over-represented in the promoters of the up-regulated miRNAs, and the extent of conservation of the TFBSs in mammals. Conclusions Analysis of evolutionarily conserved TFBSs in DC promoters revealed preferential clustering of sites within 500 bp upstream of the precursor miRNAs and that many mRNAs of cognate TFs of the conserved TFBSs were indeed expressed in the DCs. Taken together, our data provide evidence that selected miRNAs expressed in DCs have evolutionarily conserved TFBSs relevant to DC biology in their promoters.

  13. Synthetic protein scaffolds based on peptide motifs and cognate adaptor domains for improving metabolic productivity

    Directory of Open Access Journals (Sweden)

    Anselm H.C. Horn

    2015-11-01

    Full Text Available The efficiency of many cellular processes relies on the defined interaction among different proteins within the same metabolic or signaling pathway. Consequently, a spatial colocalization of functionally interacting proteins has frequently emerged during evolution. This concept has been adapted within the synthetic biology community for the purpose of creating artificial scaffolds. A recent advancement of this concept is the use of peptide motifs and their cognate adaptor domains. SH2, SH3, GBD, and PDZ domains have been used most often in research studies to date. The approach has been successfully applied to the synthesis of a variety of target molecules including catechin, D-glucaric acid, H2, hydrochinone, resveratrol, butyrate, gamma-aminobutyric acid, and mevalonate. Increased production levels of up to 77-fold have been observed compared to non-scaffolded systems. A recent extension of this concept is the creation of a covalent linkage between peptide motifs and adaptor domains, which leads to a more stable association of the scaffolded systems and thus bears the potential to further enhance metabolic productivity.

  14. Fine-tuning of T-cell development by the CD3γ di-leucine-based TCR-sorting motif

    DEFF Research Database (Denmark)

    Lauritsen, Jens Peter Holst; Boding, Lasse; Buus, Terkild B

    2015-01-01

    The CD3γ di-leucine-based (diL) receptor-sorting motif plays a central role in TCR down-regulation and in clonal expansion of virus-specific T cells. However, the role of the CD3γ diL motif in T-cell development is not known. In this study, we show that protein kinase C-induced TCR down-regulatio......The CD3γ di-leucine-based (diL) receptor-sorting motif plays a central role in TCR down-regulation and in clonal expansion of virus-specific T cells. However, the role of the CD3γ diL motif in T-cell development is not known. In this study, we show that protein kinase C-induced TCR down...

  15. A Parzen window-based approach for the detection of locally enriched transcription factor binding sites.

    Science.gov (United States)

    Vandenbon, Alexis; Kumagai, Yutaro; Teraguchi, Shunsuke; Amada, Karlou Mar; Akira, Shizuo; Standley, Daron M

    2013-01-21

    Identification of cis- and trans-acting factors regulating gene expression remains an important problem in biology. Bioinformatics analyses of regulatory regions are hampered by several difficulties. One is that binding sites for regulatory proteins are often not significantly over-represented in the set of DNA sequences of interest, because of high levels of false positive predictions, and because of positional restrictions on functional binding sites with regard to the transcription start site. We have developed a novel method for the detection of regulatory motifs based on their local over-representation in sets of regulatory regions. The method makes use of a Parzen window-based approach for scoring local enrichment, and during evaluation of significance it takes into account GC content of sequences. We show that the accuracy of our method compares favourably to that of other methods, and that our method is capable of detecting not only generally over-represented regulatory motifs, but also locally over-represented motifs that are often missed by standard motif detection approaches. Using a number of examples we illustrate the validity of our approach and suggest applications, such as the analysis of weaker binding sites. Our approach can be used to suggest testable hypotheses for wet-lab experiments. It has potential for future analyses, such as the prediction of weaker binding sites. An online application of our approach, called LocaMo Finder (Local Motif Finder), is available at http://sysimm.ifrec.osaka-u.ac.jp/tfbs/locamo/.

  16. Design of potent inhibitors of human RAD51 recombinase based on BRC motifs of BRCA2 protein: modeling and experimental validation of a chimera peptide.

    KAUST Repository

    Nomme, Julian; Renodon-Corniè re, Axelle; Asanomi, Yuya; Sakaguchi, Kazuyasu; Stasiak, Alicja Z; Stasiak, Andrzej; Norden, Bengt; Tran, Vinh; Takahashi, Masayuki

    2010-01-01

    We have previously shown that a 28-amino acid peptide derived from the BRC4 motif of BRCA2 tumor suppressor inhibits selectively human RAD51 recombinase (HsRad51). With the aim of designing better inhibitors for cancer treatment, we combined an in silico docking approach with in vitro biochemical testing to construct a highly efficient chimera peptide from eight existing human BRC motifs. We built a molecular model of all BRC motifs complexed with HsRad51 based on the crystal structure of the BRC4 motif-HsRad51 complex, computed the interaction energy of each residue in each BRC motif, and selected the best amino acid residue at each binding position. This analysis enabled us to propose four amino acid substitutions in the BRC4 motif. Three of these increased the inhibitory effect in vitro, and this effect was found to be additive. We thus obtained a peptide that is about 10 times more efficient in inhibiting HsRad51-ssDNA complex formation than the original peptide.

  17. Design of potent inhibitors of human RAD51 recombinase based on BRC motifs of BRCA2 protein: modeling and experimental validation of a chimera peptide.

    KAUST Repository

    Nomme, Julian

    2010-08-01

    We have previously shown that a 28-amino acid peptide derived from the BRC4 motif of BRCA2 tumor suppressor inhibits selectively human RAD51 recombinase (HsRad51). With the aim of designing better inhibitors for cancer treatment, we combined an in silico docking approach with in vitro biochemical testing to construct a highly efficient chimera peptide from eight existing human BRC motifs. We built a molecular model of all BRC motifs complexed with HsRad51 based on the crystal structure of the BRC4 motif-HsRad51 complex, computed the interaction energy of each residue in each BRC motif, and selected the best amino acid residue at each binding position. This analysis enabled us to propose four amino acid substitutions in the BRC4 motif. Three of these increased the inhibitory effect in vitro, and this effect was found to be additive. We thus obtained a peptide that is about 10 times more efficient in inhibiting HsRad51-ssDNA complex formation than the original peptide.

  18. Plasmodium vivax antigen discovery based on alpha-helical coiled coil protein motif.

    Directory of Open Access Journals (Sweden)

    Nora Céspedes

    Full Text Available Protein α-helical coiled coil structures that elicit antibody responses, which block critical functions of medically important microorganisms, represent a means for vaccine development. By using bioinformatics algorithms, a total of 50 antigens with α-helical coiled coil motifs orthologous to Plasmodium falciparum were identified in the P. vivax genome. The peptides identified in silico were chemically synthesized; circular dichroism studies indicated partial or high α-helical content. Antigenicity was evaluated using human sera samples from malaria-endemic areas of Colombia and Papua New Guinea. Eight of these fragments were selected and used to assess immunogenicity in BALB/c mice. ELISA assays indicated strong reactivity of serum samples from individuals residing in malaria-endemic regions and sera of immunized mice, with the α-helical coiled coil structures. In addition, ex vivo production of IFN-γ by murine mononuclear cells confirmed the immunogenicity of these structures and the presence of T-cell epitopes in the peptide sequences. Moreover, sera of mice immunized with four of the eight antigens recognized native proteins on blood-stage P. vivax parasites, and antigenic cross-reactivity with three of the peptides was observed when reacted with both the P. falciparum orthologous fragments and whole parasites. Results here point to the α-helical coiled coil peptides as possible P. vivax malaria vaccine candidates as were observed for P. falciparum. Fragments selected here warrant further study in humans and non-human primate models to assess their protective efficacy as single components or assembled as hybrid linear epitopes.

  19. TCR comodulation of nonengaged TCR takes place by a protein kinase C and CD3 gamma di-leucine-based motif-dependent mechanism

    DEFF Research Database (Denmark)

    Bonefeld, Charlotte Menné; Rasmussen, B. A.; Lauritsen, J P

    2003-01-01

    of comodulation. Like internalization of engaged TCR, comodulation was dependent on protein tyrosine kinase activity. Finally, we found that in contrast to internalization of engaged TCR, comodulation was highly dependent on protein kinase C activity and the CD3 gamma di-leucine-based motif. Based...

  20. Common motifs in the response of cereal primary metabolism to fungal pathogens are not based on similar transcriptional reprogramming

    Directory of Open Access Journals (Sweden)

    Lars Matthias Voll

    2011-08-01

    Full Text Available During compatible interactions with their host plants, biotrophic plant pathogens subvert host metabolism to ensure the sustained provision of nutrient assimilates by the colonized host cells. To investigate, whether common motifs can be revealed in the response of primary carbon and nitrogen metabolism towards colonization with biotrophic fungi in cereal leaves, we have conducted a combined metabolome and transcriptome study of three quite divergent pathosystems, the barley powdery mildew fungus (Blumeria graminis f.sp. hordei, the corn smut fungus Ustilago maydis and the maize anthracnose fungus Colletotrichum graminicola, the latter being a hemibiotroph that only exhibits an initial biotrophic phase during its establishment.Based on the analysis of 42 water-soluble metabolites, we were able to separate early biotrophic from late biotrophic interactions by hierarchical cluster analysis and principal component analysis, irrespective of the plant host. Interestingly, the corresponding transcriptome dataset could not discriminate between these stages of biotrophy, irrespective, of whether transcript data for genes of central metabolism or the entire transcriptome dataset was used. Strong differences in the transcriptional regulation of photosynthesis, glycolysis, the TCA cycle, lipid biosynthesis, and cell wall metabolism were observed between the pathosystems. Increased contents of Gln, Asn, and glucose as well as diminished contents of PEP and 3-PGA were common to early post-penetration stages of all interactions. On the transcriptional level, genes of the TCA cycle, nucleotide energy metabolism and amino acid biosynthesis exhibited consistent trends among the compared biotrophic interactions, identifying the requirement for metabolic energy and the rearrangement of amino acid pools as common transcriptional motifs during early biotrophy. Both metabolome and transcript data were employed to generate models of leaf primary metabolism during

  1. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  2. MotifMark: Finding regulatory motifs in DNA sequences.

    Science.gov (United States)

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

    2017-07-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.

  3. Masking of the CD3 gamma di-leucine-based motif by zeta is required for efficient T-cell receptor expression

    DEFF Research Database (Denmark)

    Lauritsen, Jens Peter H; Bonefeld, Charlotte Menné; von Essen, Marina

    2004-01-01

    containing the di-leucine-based endocytosis motif of the TCR subunit CD3 gamma have indicated that the zeta chain can mask this motif. In this study, we show that successive truncations of the cytoplasmic tail of zeta led to reduced surface expression levels of completely assembled TCR complexes. The reduced...... TCR expression levels were caused by an increase in the TCR endocytic rate constant in combination with an unaffected exocytic rate constant. Furthermore, the TCR degradation rate constant was increased in cells with truncated zeta. Introduction of a CD3 gamma chain with a disrupted di-leucine...

  4. The conserved dileucine- and tyrosine-based motifs in MLV and MPMV envelope glycoproteins are both important to regulate a common Env intracellular trafficking

    Directory of Open Access Journals (Sweden)

    Lopez-Vergès Sandra

    2006-09-01

    Full Text Available Abstract Background Retrovirus particles emerge from the assembly of two structural protein components, Gag that is translated as a soluble protein in the cytoplasm of the host cells, and Env, a type I transmembrane protein. Because both components are translated in different intracellular compartments, elucidating the mechanisms of retrovirus assembly thus requires the study of their intracellular trafficking. Results We used a CD25 (Tac chimera-based approach to study the trafficking of Moloney murine leukemia virus and Mason-Pfizer monkey virus Env proteins. We found that the cytoplasmic tails (CTs of both Env conserved two major signals that control a complex intracellular trafficking. A dileucine-based motif controls the sorting of the chimeras from the trans-Golgi network (TGN toward endosomal compartments. Env proteins then follow a retrograde transport to the TGN due to the action of a tyrosine-based motif. Mutation of either motif induces the mis-localization of the chimeric proteins and both motifs are found to mediate interactions of the viral CTs with clathrin adaptors. Conclusion This data reveals the unexpected complexity of the intracellular trafficking of retrovirus Env proteins that cycle between the TGN and endosomes. Given that Gag proteins hijack endosomal host proteins, our work suggests that the endosomal pathway may be used by retroviruses to ensure proper encountering of viral structural Gag and Env proteins in cells, an essential step of virus assembly.

  5. Non-Watson Crick base pairs might stabilize RNA structural motifs in ...

    Indian Academy of Sciences (India)

    Watson Crick base pairs, internal loops and pseudoknots have been the highlighting feature of recent structural determination of RNAs. The recent crystal structure of group-I introns has demonstrated that these might constitute RNA structural ...

  6. Salt-bridging effects on short amphiphilic helical structure and introducing sequence-based short beta-turn motifs.

    Science.gov (United States)

    Guarracino, Danielle A; Gentile, Kayla; Grossman, Alec; Li, Evan; Refai, Nader; Mohnot, Joy; King, Daniel

    2018-02-01

    Determining the minimal sequence necessary to induce protein folding is beneficial in understanding the role of protein-protein interactions in biological systems, as their three-dimensional structures often dictate their activity. Proteins are generally comprised of discrete secondary structures, from α-helices to β-turns and larger β-sheets, each of which is influenced by its primary structure. Manipulating the sequence of short, moderately helical peptides can help elucidate the influences on folding. We created two new scaffolds based on a modestly helical eight-residue peptide, PT3, we previously published. Using circular dichroism (CD) spectroscopy and changing the possible salt-bridging residues to new combinations of Lys, Arg, Glu, and Asp, we found that our most helical improvements came from the Arg-Glu combination, whereas the Lys-Asp was not significantly different from the Lys-Glu of the parent scaffold, PT3. The marked 3 10 -helical contributions in PT3 were lessened in the Arg-Glu-containing peptide with the beginning of cooperative unfolding seen through a thermal denaturation. However, a unique and unexpected signature was seen for the denaturation of the Lys-Asp peptide which could help elucidate the stages of folding between the 3 10 and α-helix. In addition, we developed a short six-residue peptide with β-turn/sheet CD signature, again to help study minimal sequences needed for folding. Overall, the results indicate that improvements made to short peptide scaffolds by fine-tuning the salt-bridging residues can enhance scaffold structure. Likewise, with the results from the new, short β-turn motif, these can help impact future peptidomimetic designs in creating biologically useful, short, structured β-sheet-forming peptides.

  7. NNAlign: A Web-Based Prediction Method Allowing Non-Expert End-User Discovery of Sequence Motifs in Quantitative Peptide Data

    DEFF Research Database (Denmark)

    Andreatta, Massimo; Schafer-Nielsen, Claus; Lund, Ole

    2011-01-01

    Recent advances in high-throughput technologies have made it possible to generate both gene and protein sequence data at an unprecedented rate and scale thereby enabling entirely new "omics"-based approaches towards the analysis of complex biological processes. However, the amount and complexity...... to interpret large data sets. We have recently developed a method, NNAlign, which is generally applicable to any biological problem where quantitative peptide data is available. This method efficiently identifies underlying sequence patterns by simultaneously aligning peptide sequences and identifying motifs...... associated with quantitative readouts. Here, we provide a web-based implementation of NNAlign allowing non-expert end-users to submit their data (optionally adjusting method parameters), and in return receive a trained method (including a visual representation of the identified motif) that subsequently can...

  8. CompariMotif: quick and easy comparisons of sequence motifs.

    Science.gov (United States)

    Edwards, Richard J; Davey, Norman E; Shields, Denis C

    2008-05-15

    CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/

  9. Network motif-based identification of transcription factor-target gene relationships by integrating multi-source biological data

    Directory of Open Access Journals (Sweden)

    de los Reyes Benildo G

    2008-04-01

    Full Text Available Abstract Background Integrating data from multiple global assays and curated databases is essential to understand the spatio-temporal interactions within cells. Different experiments measure cellular processes at various widths and depths, while databases contain biological information based on established facts or published data. Integrating these complementary datasets helps infer a mutually consistent transcriptional regulatory network (TRN with strong similarity to the structure of the underlying genetic regulatory modules. Decomposing the TRN into a small set of recurring regulatory patterns, called network motifs (NM, facilitates the inference. Identifying NMs defined by specific transcription factors (TF establishes the framework structure of a TRN and allows the inference of TF-target gene relationship. This paper introduces a computational framework for utilizing data from multiple sources to infer TF-target gene relationships on the basis of NMs. The data include time course gene expression profiles, genome-wide location analysis data, binding sequence data, and gene ontology (GO information. Results The proposed computational framework was tested using gene expression data associated with cell cycle progression in yeast. Among 800 cell cycle related genes, 85 were identified as candidate TFs and classified into four previously defined NMs. The NMs for a subset of TFs are obtained from literature. Support vector machine (SVM classifiers were used to estimate NMs for the remaining TFs. The potential downstream target genes for the TFs were clustered into 34 biologically significant groups. The relationships between TFs and potential target gene clusters were examined by training recurrent neural networks whose topologies mimic the NMs to which the TFs are classified. The identified relationships between TFs and gene clusters were evaluated using the following biological validation and statistical analyses: (1 Gene set enrichment

  10. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    Science.gov (United States)

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  11. A regenerated electrochemical biosensor for label-free detection of glucose and urea based on conformational switch of i-motif oligonucleotide probe

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Zhong Feng; Chen, Dong Mei [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China); Lei, Jing Lei [School of Chemistry and Chemical Engineering, Chongqing University, Chongqing 400044 (China); Luo, Hong Qun, E-mail: luohq@swu.edu.cn [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China); Li, Nian Bing, E-mail: linb@swu.edu.cn [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China)

    2015-10-15

    Improving the reproducibility of electrochemical signal remains a great challenge over the past decades. In this work, i-motif oligonucleotide probe-based electrochemical DNA (E-DNA) sensor is introduced for the first time as a regenerated sensing platform, which enhances the reproducibility of electrochemical signal, for label-free detection of glucose and urea. The addition of glucose or urea is able to activate glucose oxidase-catalyzed or urease-catalyzed reaction, inducing or destroying the formation of i-motif oligonucleotide probe. The conformational switch of oligonucleotide probe can be recorded by electrochemical impedance spectroscopy. Thus, the difference of electron transfer resistance is utilized for the quantitative determination of glucose and urea. We further demonstrate that the E-DNA sensor exhibits high selectivity, excellent stability, and remarkable regenerated ability. The human serum analysis indicates that this simple and regenerated strategy holds promising potential in future biosensing applications. - Highlights: • Conformational switch of i-motif is used for the detection of glucose and urea. • The sensor can be regenerated. • The proposed method is successfully applied in real sample assay. • Our method is label-free and inexpensive.

  12. Motif discovery in ranked lists of sequences

    DEFF Research Database (Denmark)

    Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

    2016-01-01

    Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore...... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery....... a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs...

  13. The CD3 gamma leucine-based receptor-sorting motif is required for efficient ligand-mediated TCR down-regulation

    DEFF Research Database (Denmark)

    von Essen, Marina; Menné, Charlotte; Nielsen, Bodil L

    2002-01-01

    . The other pathway is dependent on protein kinase C (PKC)-mediated activation of the CD3 gamma di-leucine-based receptor-sorting motif. Previous studies have failed to demonstrate a connection between ligand- and PKC-induced TCR down-regulation. Thus, although an apparent paradox, the dogma has been...... that ligand- and PKC-induced TCR down-regulations are not interrelated. By analyses of a newly developed CD3 gamma-negative T cell variant, freshly isolated and PHA-activated PBMC, and a mouse T cell line, we challenged this dogma and demonstrate in this work that PKC activation and the CD3 gamma di...

  14. Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

    Science.gov (United States)

    Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

    2012-01-01

    To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.

  15. A microstructural analysis of isoprenol ether-based polycarboxylates and the impact of structural motifs on the dispersing effectiveness

    International Nuclear Information System (INIS)

    Plank, Johann; Li, Huiqun; Ilg, Manuel; Pickelmann, Julia; Eisenreich, Wolfgang; Yao, Yan; Wang, Ziming

    2016-01-01

    Generally, polycarboxylate superplasticizers (PCEs) are synthesized via aqueous free radical copolymerization. The conditions during copolymerization such as relative reactivity and feeding mode and ratio of monomers can cause different monomer sequences in the final product. In this study, the sequence of monomers in PCE polymers synthesized from acrylic acid and isoprenyloxy polyethylene glycol (IPEG) macromonomer was characterized by 13 C nuclear magnetic resonance (NMR) spectroscopy. Three different triads of monomer sequences (EAE, AAE and AAA; E = ether, A = acid monomer) were detected. It was found that IPEG PCEs predominantly contain the structural motifs of AAE and EAE, and less of AAA. Higher additions of acrylic acid do not incorporate into the structure of PCE, but convert to HMW polyacrylate as by-product instead. A PCE with optimal dispersing effectiveness was achieved at high contents of IPEG macromonomer, a molecular weight (M w ) around 40,000 Da and narrow molecular weight distribution.

  16. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

    2008-01-01

    . Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...

  17. [Personal motif in art].

    Science.gov (United States)

    Gerevich, József

    2015-01-01

    One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.

  18. A Tyrosine-Based Trafficking Motif of the Tegument Protein pUL71 Is Crucial for Human Cytomegalovirus Secondary Envelopment.

    Science.gov (United States)

    Dietz, Andrea N; Villinger, Clarissa; Becker, Stefan; Frick, Manfred; von Einem, Jens

    2018-01-01

    The human cytomegalovirus (HCMV) tegument protein pUL71 is required for efficient secondary envelopment and accumulates at the Golgi compartment-derived viral assembly complex (vAC) during infection. Analysis of various C-terminally truncated pUL71 proteins fused to enhanced green fluorescent protein (eGFP) identified amino acids 23 to 34 as important determinants for its Golgi complex localization. Sequence analysis and mutational verification revealed the presence of an N-terminal tyrosine-based trafficking motif (YXXΦ) in pUL71. This led us to hypothesize a requirement of the YXXΦ motif for the function of pUL71 in infection. Mutation of both the tyrosine residue and the entire YXXΦ motif resulted in an altered distribution of mutant pUL71 at the plasma membrane and in the cytoplasm during infection. Both YXXΦ mutant viruses exhibited similarly decreased focal growth and reduced virus yields in supernatants. Ultrastructurally, mutant-virus-infected cells exhibited impaired secondary envelopment manifested by accumulations of capsids undergoing an envelopment process. Additionally, clusters of capsid accumulations surrounding the vAC were observed, similar to the ultrastructural phenotype of a UL71-deficient mutant. The importance of endocytosis and thus the YXXΦ motif for targeting pUL71 to the Golgi complex was further demonstrated when clathrin-mediated endocytosis was inhibited either by coexpression of the C-terminal part of cellular AP180 (AP180-C) or by treatment with methyl-β-cyclodextrin. Both conditions resulted in a plasma membrane accumulation of pUL71. Altogether, these data reveal the presence of a functional N-terminal endocytosis motif that is an important determinant for intracellular localization of pUL71 and that is furthermore required for the function of pUL71 during secondary envelopment of HCMV capsids at the vAC. IMPORTANCE Human cytomegalovirus (HCMV) is the leading cause of birth defects among congenital virus infections and can

  19. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  20. Structure-Based Analysis of Toxoplasma gondii Profilin: A Parasite-Specific Motif Is Required for Recognition by Toll-Like Receptor 11

    Energy Technology Data Exchange (ETDEWEB)

    K Kucera; A Koblansky; L Saunders; K Frederick; E De La Cruz; S Ghosh; Y Modis

    2011-12-31

    Profilins promote actin polymerization by exchanging ADP for ATP on monomeric actin and delivering ATP-actin to growing filament barbed ends. Apicomplexan protozoa such as Toxoplasma gondii invade host cells using an actin-dependent gliding motility. Toll-like receptor (TLR) 11 generates an innate immune response upon sensing T. gondii profilin (TgPRF). The crystal structure of TgPRF reveals a parasite-specific surface motif consisting of an acidic loop, followed by a long {beta}-hairpin. A series of structure-based profilin mutants show that TLR11 recognition of the acidic loop is responsible for most of the interleukin (IL)-12 secretion response to TgPRF in peritoneal macrophages. Deletion of both the acidic loop and the {beta}-hairpin completely abrogates IL-12 secretion. Insertion of the T. gondii acidic loop and {beta}-hairpin into yeast profilin is sufficient to generate TLR11-dependent signaling. Substitution of the acidic loop in TgPRF with the homologous loop from the apicomplexan parasite Cryptosporidium parvum does not affect TLR11-dependent IL-12 secretion, while substitution with the acidic loop from Plasmodium falciparum results in reduced but significant IL-12 secretion. We conclude that the parasite-specific motif in TgPRF is the key molecular pattern recognized by TLR11. Unlike other profilins, TgPRF slows nucleotide exchange on monomeric rabbit actin and binds rabbit actin weakly. The putative TgPRF actin-binding surface includes the {beta}-hairpin and diverges widely from the actin-binding surfaces of vertebrate profilins.

  1. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  2. Analisis Unsur Matematika pada Motif Sulam Usus

    Directory of Open Access Journals (Sweden)

    Fredi Ganda Putra

    2017-12-01

    Full Text Available Based on interviews with researchers sources said that the beginning of the intestine embroidery is an art of genuine crafts. Called the intestine embroidery because this technique is a technique of combining a strand of cloth resembling the intestine formed according to the pattern by means of embroidered using a thread. Intestinal embroidery techniques were originally used to create a cover of the women's customary wardrobe of Lampung or often referred to as bebe. But not many people in Lampung, especially people who live in Lampung are still many who do not know and recognize the intestine embroidery because most only know tapis only characteristic of Lampung, besides that there are other cultural results that is embroidered intestine. There are still many who do not know that the intestine motif there is a knowledge of mathematics. The researcher's problem formulation is whether there are mathematical elements contained in the intestine embroidery motif based on the concept of geometry. The purpose of this study is to determine whether there are elements of mathematics contained in the intestine motif based on the concept of geometry. Subjects in this study consisted of 4 people obtained by purposive sampling technique. From the results of data analysis conducted by using descriptive analysis and discussion as follows: (1 Intestinal embroidery motif contains the meaning of mathematics and culture or often called Etnomatematika. On the meaning of culture there is a link between the embroidery intestine with a culture that has been there before as the existence of cultural linkage between Hindu belief Buddhism and there are similarities of motifs and decorative patterns contained in the motif embroidery intestine with ornamental variety in Indonesia. (2 The relationship between the intestine with mathematical motifs there are elements of mathematics such as geometry elements in the form of geometry of dimension one and dimension two, and the

  3. An unusual hybrid fluoride featuring a [V7F27]6- chain motif based on a pyrochlore-like building unit

    International Nuclear Information System (INIS)

    Aldous, David W.; Slawin, Alexandra M.Z.; Lightfoot, Philip

    2008-01-01

    A new hybrid vanadium (III) fluoride [C 4 H 12 N 2 ] 3 [V 7 F 27 ] has been synthesised solvothermally. The crystal structure (trigonal, R3-bar c; a=17.367(2) A, c=19.604(2) A) reveals an unusual and novel chain motif consisting of pyrochlore-like heptameric units of corner-sharing octahedra, which are further linked into linear chains of alternating triple and single octahedral groups. The chains are separated by hydrogen-bonded piperazinium moieties. Magnetic susceptibility data show moderate antiferromagnetic interactions but no long-range order above 2 K, consistent with pronounced one-dimensional character, as well as frustration arising within the triangular units of magnetic ions in the chains. - Graphical abstract: A unique chain-structure vanadium(III) fluoride [C 4 H 12 N 2 ] 3 [V 7 F 27 ], based on a pyrochlore-like building unit, has been prepared solvothermally. Despite antiferromagnetic interactions, no long-range magnetic order occurs above 2 K, suggesting possible frustration

  4. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  5. RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach

    DEFF Research Database (Denmark)

    Pan, Xiaoyong; Shen, Hong Bin

    2017-01-01

    , their domain specific features and formats have posed significant computational challenges. One of current difficulties is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains...... space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can...... be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6%. Besides the overall enhanced prediction performance, the convolutional neural network module embedded in i...

  6. DNA motif alignment by evolving a population of Markov chains.

    Science.gov (United States)

    Bi, Chengpeng

    2009-01-30

    Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

  7. RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps

    Science.gov (United States)

    Drory Retwitzer, Matan; Polishchuk, Maya; Churkin, Elena; Kifer, Ilona; Yakhini, Zohar; Barash, Danny

    2015-01-01

    Searching for RNA sequence-structure patterns is becoming an essential tool for RNA practitioners. Novel discoveries of regulatory non-coding RNAs in targeted organisms and the motivation to find them across a wide range of organisms have prompted the use of computational RNA pattern matching as an enhancement to sequence similarity. State-of-the-art programs differ by the flexibility of patterns allowed as queries and by their simplicity of use. In particular—no existing method is available as a user-friendly web server. A general program that searches for RNA sequence-structure patterns is RNA Structator. However, it is not available as a web server and does not provide the option to allow flexible gap pattern representation with an upper bound of the gap length being specified at any position in the sequence. Here, we introduce RNAPattMatch, a web-based application that is user friendly and makes sequence/structure RNA queries accessible to practitioners of various background and proficiency. It also extends RNA Structator and allows a more flexible variable gaps representation, in addition to analysis of results using energy minimization methods. RNAPattMatch service is available at http://www.cs.bgu.ac.il/rnapattmatch. A standalone version of the search tool is also available to download at the site. PMID:25940619

  8. Genome-wide targeted prediction of ABA responsive genes in rice based on over-represented cis-motif in co-expressed genes.

    Science.gov (United States)

    Lenka, Sangram K; Lohia, Bikash; Kumar, Abhay; Chinnusamy, Viswanathan; Bansal, Kailash C

    2009-02-01

    Abscisic acid (ABA), the popular plant stress hormone, plays a key role in regulation of sub-set of stress responsive genes. These genes respond to ABA through specific transcription factors which bind to cis-regulatory elements present in their promoters. We discovered the ABA Responsive Element (ABRE) core (ACGT) containing CGMCACGTGB motif as over-represented motif among the promoters of ABA responsive co-expressed genes in rice. Targeted gene prediction strategy using this motif led to the identification of 402 protein coding genes potentially regulated by ABA-dependent molecular genetic network. RT-PCR analysis of arbitrarily chosen 45 genes from the predicted 402 genes confirmed 80% accuracy of our prediction. Plant Gene Ontology (GO) analysis of ABA responsive genes showed enrichment of signal transduction and stress related genes among diverse functional categories.

  9. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  10. Identification of a novel calcium binding motif based on the detection of sequence insertions in the animal peroxidase domain of bacterial proteins.

    Directory of Open Access Journals (Sweden)

    Saray Santamaría-Hernando

    Full Text Available Proteins of the animal heme peroxidase (ANP superfamily differ greatly in size since they have either one or two catalytic domains that match profile PS50292. The orf PP_2561 of Pseudomonas putida KT2440 that we have called PepA encodes a two-domain ANP. The alignment of these domains with those of PepA homologues revealed a variable number of insertions with the consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D. This motif has also been detected in the structure of pseudopilin (pdb 3G20, where it was found to be involved in Ca(2+ coordination although a sequence analysis did not reveal the presence of any known calcium binding motifs in this protein. Isothermal titration calorimetry revealed that a peptide containing this consensus motif bound specifically calcium ions with affinities ranging between 33-79 µM depending on the pH. Microcalorimetric titrations of the purified N-terminal ANP-like domain of PepA revealed Ca(2+ binding with a K(D of 12 µM and stoichiometry of 1.25 calcium ions per protein monomer. This domain exhibited peroxidase activity after its reconstitution with heme. These data led to the definition of a novel calcium binding motif that we have termed PERCAL and which was abundantly present in animal peroxidase-like domains of bacterial proteins. Bacterial heme peroxidases thus possess two different types of calcium binding motifs, namely PERCAL and the related hemolysin type calcium binding motif, with the latter being located outside the catalytic domains and in their C-terminal end. A phylogenetic tree of ANP-like catalytic domains of bacterial proteins with PERCAL motifs, including single domain peroxidases, was divided into two major clusters, representing domains with and without PERCAL motif containing insertions. We have verified that the recently reported classification of bacterial heme peroxidases in two families (cd09819 and cd09821 is unrelated to these insertions. Sequences matching PERCAL were detected in all kingdoms of

  11. Identification of a novel calcium binding motif based on the detection of sequence insertions in the animal peroxidase domain of bacterial proteins.

    Science.gov (United States)

    Santamaría-Hernando, Saray; Krell, Tino; Ramos-González, María-Isabel

    2012-01-01

    Proteins of the animal heme peroxidase (ANP) superfamily differ greatly in size since they have either one or two catalytic domains that match profile PS50292. The orf PP_2561 of Pseudomonas putida KT2440 that we have called PepA encodes a two-domain ANP. The alignment of these domains with those of PepA homologues revealed a variable number of insertions with the consensus G-x-D-G-x-x-[GN]-[TN]-x-D-D. This motif has also been detected in the structure of pseudopilin (pdb 3G20), where it was found to be involved in Ca(2+) coordination although a sequence analysis did not reveal the presence of any known calcium binding motifs in this protein. Isothermal titration calorimetry revealed that a peptide containing this consensus motif bound specifically calcium ions with affinities ranging between 33-79 µM depending on the pH. Microcalorimetric titrations of the purified N-terminal ANP-like domain of PepA revealed Ca(2+) binding with a K(D) of 12 µM and stoichiometry of 1.25 calcium ions per protein monomer. This domain exhibited peroxidase activity after its reconstitution with heme. These data led to the definition of a novel calcium binding motif that we have termed PERCAL and which was abundantly present in animal peroxidase-like domains of bacterial proteins. Bacterial heme peroxidases thus possess two different types of calcium binding motifs, namely PERCAL and the related hemolysin type calcium binding motif, with the latter being located outside the catalytic domains and in their C-terminal end. A phylogenetic tree of ANP-like catalytic domains of bacterial proteins with PERCAL motifs, including single domain peroxidases, was divided into two major clusters, representing domains with and without PERCAL motif containing insertions. We have verified that the recently reported classification of bacterial heme peroxidases in two families (cd09819 and cd09821) is unrelated to these insertions. Sequences matching PERCAL were detected in all kingdoms of life.

  12. Epitope-based vaccines with the Anaplasma marginale MSP1a functional motif induce a balanced humoral and cellular immune response in mice.

    Directory of Open Access Journals (Sweden)

    Paula S Santos

    Full Text Available Bovine anaplasmosis is a hemoparasitic disease that causes considerable economic loss to the dairy and beef industries. Cattle immunized with the Anaplasma marginale MSP1 outer membrane protein complex presents a protective humoral immune response; however, its efficacy is variable. Immunodominant epitopes seem to be a key-limiting factor for the adaptive immunity. We have successfully demonstrated that critical motifs of the MSP1a functional epitope are essential for antibody recognition of infected animal sera, but its protective immunity is yet to be tested. We have evaluated two synthetic vaccine formulations against A. marginale, using epitope-based approach in mice. Mice infection with bovine anaplasmosis was demonstrated by qPCR analysis of erythrocytes after 15-day exposure. A proof-of-concept was obtained in this murine model, in which peptides conjugated to bovine serum albumin were used for immunization in three 15-day intervals by intraperitoneal injections before challenging with live bacteria. Blood samples were analyzed for the presence of specific IgG2a and IgG1 antibodies, as well as for the rickettsemia analysis. A panel containing the cytokines' transcriptional profile for innate and adaptive immune responses was carried out through qPCR. Immunized BALB/c mice challenged with A. marginale presented stable body weight, reduced number of infected erythrocytes, and no mortality; and among control groups mortality rates ranged from 15% to 29%. Additionally, vaccines have significantly induced higher IgG2a than IgG1 response, followed by increased expression of pro-inflammatory cytokines. This is a successful demonstration of epitope-based vaccines, and protection against anaplasmosis may be associated with elicitation of effector functions of humoral and cellular immune responses in murine model.

  13. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  14. New scoring schema for finding motifs in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Nowzari-Dalini Abbas

    2009-03-01

    Full Text Available Abstract Background Pattern discovery in DNA sequences is one of the most fundamental problems in molecular biology with important applications in finding regulatory signals and transcription factor binding sites. An important task in this problem is to search (or predict known binding sites in a new DNA sequence. For this reason, all subsequences of the given DNA sequence are scored based on an scoring function and the prediction is done by selecting the best score. By assuming no dependency between binding site base positions, most of the available tools for known binding site prediction are designed. Recently Tomovic and Oakeley investigated the statistical basis for either a claim of dependence or independence, to determine whether such a claim is generally true, and they presented a scoring function for binding site prediction based on the dependency between binding site base positions. Our primary objective is to investigate the scoring functions which can be used in known binding site prediction based on the assumption of dependency or independency in binding site base positions. Results We propose a new scoring function based on the dependency between all positions in biding site base positions. This scoring function uses joint information content and mutual information as a measure of dependency between positions in transcription factor binding site. Our method for modeling dependencies is simply an extension of position independency methods. We evaluate our new scoring function on the real data sets extracted from JASPAR and TRANSFAC data bases, and compare the obtained results with two other well known scoring functions. Conclusion The results demonstrate that the new approach improves known binding site discovery and show that the joint information content and mutual information provide a better and more general criterion to investigate the relationships between positions in the TFBS. Our scoring function is formulated by simple

  15. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-01-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  16. DNA motif elucidation using belief propagation.

    Science.gov (United States)

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-09-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8∼10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/∼wkc/kmerHMM.

  17. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  18. Identification of sequence motifs significantly associated with antisense activity

    Directory of Open Access Journals (Sweden)

    Peek Andrew S

    2007-06-01

    Full Text Available Abstract Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic

  19. HOCOMOCO: a comprehensive collection of human transcription factor binding sites models

    Science.gov (United States)

    Kulakovskiy, Ivan V.; Medvedeva, Yulia A.; Schaefer, Ulf; Kasianov, Artem S.; Vorontsov, Ilya E.; Bajic, Vladimir B.; Makeev, Vsevolod J.

    2013-01-01

    Transcription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source. PMID:23175603

  20. HOCOMOCO: A comprehensive collection of human transcription factor binding sites models

    KAUST Repository

    Kulakovskiy, Ivan V.; Medvedeva, Yulia A.; Schaefer, Ulf; Kasianov, Artem S.; Vorontsov, Ilya E.; Bajic, Vladimir B.; Makeev, Vsevolod J.

    2012-01-01

    Transcription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/ hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source. The Author(s) 2012.

  1. HOCOMOCO: A comprehensive collection of human transcription factor binding sites models

    KAUST Repository

    Kulakovskiy, Ivan V.

    2012-11-21

    Transcription factor (TF) binding site (TFBS) models are crucial for computational reconstruction of transcription regulatory networks. In existing repositories, a TF often has several models (also called binding profiles or motifs), obtained from different experimental data. Having a single TFBS model for a TF is more pragmatic for practical applications. We show that integration of TFBS data from various types of experiments into a single model typically results in the improved model quality probably due to partial correction of source specific technique bias. We present the Homo sapiens comprehensive model collection (HOCOMOCO, http://autosome.ru/HOCOMOCO/, http://cbrc.kaust.edu.sa/ hocomoco/) containing carefully hand-curated TFBS models constructed by integration of binding sequences obtained by both low- and high-throughput methods. To construct position weight matrices to represent these TFBS models, we used ChIPMunk software in four computational modes, including newly developed periodic positional prior mode associated with DNA helix pitch. We selected only one TFBS model per TF, unless there was a clear experimental evidence for two rather distinct TFBS models. We assigned a quality rating to each model. HOCOMOCO contains 426 systematically curated TFBS models for 401 human TFs, where 172 models are based on more than one data source. The Author(s) 2012.

  2. Motif statistics and spike correlations in neuronal networks

    International Nuclear Information System (INIS)

    Hu, Yu; Shea-Brown, Eric; Trousdale, James; Josić, Krešimir

    2013-01-01

    Motifs are patterns of subgraphs of complex networks. We studied the impact of such patterns of connectivity on the level of correlated, or synchronized, spiking activity among pairs of cells in a recurrent network of integrate and fire neurons. For a range of network architectures, we find that the pairwise correlation coefficients, averaged across the network, can be closely approximated using only three statistics of network connectivity. These are the overall network connection probability and the frequencies of two second order motifs: diverging motifs, in which one cell provides input to two others, and chain motifs, in which two cells are connected via a third intermediary cell. Specifically, the prevalence of diverging and chain motifs tends to increase correlation. Our method is based on linear response theory, which enables us to express spiking statistics using linear algebra, and a resumming technique, which extrapolates from second order motifs to predict the overall effect of coupling on network correlation. Our motif-based results seek to isolate the effect of network architecture perturbatively from a known network state. (paper)

  3. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

    Science.gov (United States)

    Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

    2011-06-20

    One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  5. A speedup technique for (l, d-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS, (l, d-motif search (or Planted Motif Search (PMS, and Edit-distance-based Motif Search (EMS. In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm identifies the motifs always and an approximate algorithm may fail to identify some or all of the motifs. The exact version of PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speedup PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very

  6. Annotating RNA motifs in sequences and alignments.

    Science.gov (United States)

    Gardner, Paul P; Eldai, Hisham

    2015-01-01

    RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of a RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Motif signatures of transcribed enhancers

    KAUST Repository

    Kleftogiannis, Dimitrios

    2017-09-14

    In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancers genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow discriminating successfully a) TrEn from random controls, proxy of non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.

  8. Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

    Directory of Open Access Journals (Sweden)

    Launey Thomas

    2011-06-01

    Full Text Available Abstract Background The interactions between PDZ (PSD-95, Dlg, ZO-1 domains and PDZ-binding motifs play central roles in signal transductions within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C- terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins, at genome-level. Results Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V or type-II (x-x-V-x-I/V PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode. We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.

  9. PISMA: A Visual Representation of Motif Distribution in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Rogelio Alcántara-Silva

    2017-03-01

    Full Text Available Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .

  10. Trypanosoma cruzi I genotypes in different geographic regions and transmission cycles based on a microsatellite motif of the intergenic spacer of spliced leader genes✯

    Science.gov (United States)

    Cura, Carolina I.; Mejía-Jaramillo, Ana M.; Duffy, Tomás; Burgos, Juan M.; Rodriguero, Marcela; Cardinal, Marta V.; Kjos, Sonia; Gurgel-Gonçalves, Rodrigo; Blanchet, Denis; De Pablos, Luis M.; Tomasini, Nicolás; Silva, Alex Da; Russomando, Graciela; Cuba Cuba, Cesar A.; Aznar, Christine; Abate, Teresa; Levin, Mariano J.; Osuna, Antonio; Gürtler, Ricardo E.; Diosque, Patricio; Solari, Aldo; Triana-Chávez, Omar; Schijman, Alejandro G.

    2011-01-01

    The intergenic region of spliced-leader (SL-IR) genes from 105 Trypanosoma cruzi I (Tc I) infected biological samples, culture isolates and stocks from 11 endemic countries, from Argentina to the USA were characterised, allowing identification of 76 genotypes with 54 polymorphic sites from 123 aligned sequences. On the basis of the microsatellite motif proposed by Herrera et al. (2007) to define four haplotypes in Colombia, we could classify these genotypes into four distinct Tc I SL-IR groups, three corresponding to the former haplotypes Ia (11 genotypes), Ib (11 genotypes) and Id (35 genotypes); and one novel group, Ie (19 genotypes). Genotypes harboring the Tc Ic motif were not detected in our study. Tc Ia was associated with domestic cycles in southern and northern South America and sylvatic cycles in Central and North America. Tc Ib was found in all transmission cycles from Colombia. Tc Id was identified in all transmission cycles from Argentina and Colombia, including Chagas cardiomyopathy patients, sylvatic Brazilian samples and human cases from French Guiana, Panama and Venezuela. Tc Ie gathered five samples from domestic Triatoma infestans from northern Argentina, nine samples from wild Mepraia spinolai and Mepraia gajardoi and two chagasic patients from Chile and one from a Bolivian patient with chagasic reactivation. Mixed infections by Tc Ia + Tc Id, Tc Ia + Tc Ie and Tc Id + Tc Ie were detected in vector faeces and isolates from human and vector samples. In addition, Tc Ia and Tc Id were identified in different tissues from a heart transplanted Chagas cardiomyopathy patient with reactivation, denoting histotropism. Trypanosoma cruzi I SL-IR genotypes from parasites infecting Triatoma gerstaeckeri and Didelphis virginiana from USA, T. infestans from Paraguay, Rhodnius nasutus and Rhodnius neglectus from Brazil and M. spinolai and M. gajardoi from Chile are to our knowledge described for the first time. PMID:20670628

  11. CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures

    Directory of Open Access Journals (Sweden)

    Hamed Bostan

    2012-01-01

    Full Text Available Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.

  12. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-25

    Leucine-aspartic acid (LD) motifs are short helical protein-protein interaction motifs involved in cell motility, survival and communication. LD motif interactions are also implicated in cancer metastasis and are targeted by several viruses. LD motifs are notoriously difficult to detect because sequence pattern searches lead to an excessively high number of false positives. Hence, despite 20 years of research, only six LD motif–containing proteins are known in humans, three of which are close homologues of the paxillin family. To enable the proteome-wide discovery of LD motifs, we developed LD Motif Finder (LDMF), a web tool based on machine learning that combines sequence information with structural predictions to detect LD motifs with high accuracy. LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  13. Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences

    Directory of Open Access Journals (Sweden)

    O. V. Galzitskaya

    2015-01-01

    Full Text Available How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion that is an occurrence of simple motifs which can be imprints of evolution history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing the motifs from the first group (one of the two amino acids occurs once at the terminal position made about 20%; in the case of motifs from the second (one of two amino acids occurs one time within the pattern and third (the two amino acids occur randomly groups, 30% and 50%, respectively. For bacterial proteomes, this relationship was 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62 is higher than between fungi and plants (0.54. Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: the Diplomonadida kingdoms are more close to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are more close to each other than to other kingdoms of Eukaryota.

  14. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Full Text Available Abstract Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.

  15. Detection of Cytosolic Shigella flexneri via a C-Terminal Triple-Arginine Motif of GBP1 Inhibits Actin-Based Motility

    Directory of Open Access Journals (Sweden)

    Anthony S. Piro

    2017-12-01

    Full Text Available Dynamin-like guanylate binding proteins (GBPs are gamma interferon (IFN-γ-inducible host defense proteins that can associate with cytosol-invading bacterial pathogens. Mouse GBPs promote the lytic destruction of targeted bacteria in the host cell cytosol, but the antimicrobial function of human GBPs and the mechanism by which these proteins associate with cytosolic bacteria are poorly understood. Here, we demonstrate that human GBP1 is unique among the seven human GBP paralogs in its ability to associate with at least two cytosolic Gram-negative bacteria, Burkholderia thailandensis and Shigella flexneri. Rough lipopolysaccharide (LPS mutants of S. flexneri colocalize with GBP1 less frequently than wild-type S. flexneri does, suggesting that host recognition of O antigen promotes GBP1 targeting to Gram-negative bacteria. The targeting of GBP1 to cytosolic bacteria, via a unique triple-arginine motif present in its C terminus, promotes the corecruitment of four additional GBP paralogs (GBP2, GBP3, GBP4, and GBP6. GBP1-decorated Shigella organisms replicate but fail to form actin tails, leading to their intracellular aggregation. Consequentially, the wild type but not the triple-arginine GBP1 mutant restricts S. flexneri cell-to-cell spread. Furthermore, human-adapted S. flexneri, through the action of one its secreted effectors, IpaH9.8, is more resistant to GBP1 targeting than the non-human-adapted bacillus B. thailandensis. These studies reveal that human GBP1 uniquely functions as an intracellular “glue trap,” inhibiting the cytosolic movement of normally actin-propelled Gram-negative bacteria. In response to this powerful human defense program, S. flexneri has evolved an effective counterdefense to restrict GBP1 recruitment.

  16. Anion induced conformational preference of Cα NN motif residues in functional proteins.

    Science.gov (United States)

    Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb

    2017-12-01

    Among different ligand binding motifs, anion binding C α NN motif consisting of peptide backbone atoms of three consecutive residues are observed to be important for recognition of free anions, like sulphate or biphosphate and participate in different key functions. Here we study the interaction of sulphate and biphosphate with C α NN motif present in different proteins. Instead of total protein, a peptide fragment has been studied keeping C α NN motif flanked in between other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.

  17. Genome-wide prediction and functional validation of promoter motifs regulating gene expression in spore and infection stages of Phytophthora infestans.

    Directory of Open Access Journals (Sweden)

    Sourav Roy

    2013-03-01

    Full Text Available Most eukaryotic pathogens have complex life cycles in which gene expression networks orchestrate the formation of cells specialized for dissemination or host colonization. In the oomycete Phytophthora infestans, the potato late blight pathogen, major shifts in mRNA profiles during developmental transitions were identified using microarrays. We used those data with search algorithms to discover about 100 motifs that are over-represented in promoters of genes up-regulated in hyphae, sporangia, sporangia undergoing zoosporogenesis, swimming zoospores, or germinated cysts forming appressoria (infection structures. Most of the putative stage-specific transcription factor binding sites (TFBSs thus identified had features typical of TFBSs such as position or orientation bias, palindromy, and conservation in related species. Each of six motifs tested in P. infestans transformants using the GUS reporter gene conferred the expected stage-specific expression pattern, and several were shown to bind nuclear proteins in gel-shift assays. Motifs linked to the appressoria-forming stage, including a functionally validated TFBS, were over-represented in promoters of genes encoding effectors and other pathogenesis-related proteins. To understand how promoter and genome architecture influence expression, we also mapped transcription patterns to the P. infestans genome assembly. Adjacent genes were not typically induced in the same stage, including genes transcribed in opposite directions from small intergenic regions, but co-regulated gene pairs occurred more than expected by random chance. These data help illuminate the processes regulating development and pathogenesis, and will enable future attempts to purify the cognate transcription factors.

  18. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....

  19. Low-dimensional morphospace of topological motifs in human fMRI brain networks

    Directory of Open Access Journals (Sweden)

    Sarah E. Morgan

    2018-06-01

    Full Text Available We present a low-dimensional morphospace of fMRI brain networks, where axes are defined in a data-driven manner based on the network motifs. The morphospace allows us to identify the key variations in healthy fMRI networks in terms of their underlying motifs, and we observe that two principal components (PCs can account for 97% of the motif variability. The first PC of the motif distribution is correlated with efficiency and inversely correlated with transitivity. Hence this axis approximately conforms to the well-known economical small-world trade-off between integration and segregation in brain networks. Finally, we show that the economical clustering generative model proposed by Vértes et al. (2012 can approximately reproduce the motif morphospace of the real fMRI brain networks, in contrast to other generative models. Overall, the motif morphospace provides a powerful way to visualize the relationships between network properties and to investigate generative or constraining factors in the formation of complex human brain functional networks. Motifs have been described as the building blocks of complex networks. Meanwhile, a morphospace allows networks to be placed in a common space and can reveal the relationships between different network properties and elucidate the driving forces behind network topology. We combine the concepts of motifs and morphospaces to create the first motif morphospace of fMRI brain networks. Crucially, the morphospace axes are defined by the motifs, in a data-driven manner. We observe strong correlations between the networks’ positions in morphospace and their global topological properties, suggesting that motif morphospaces are a powerful way to capture the topology of networks in a low-dimensional space and to compare generative models of brain networks. Motif morphospaces could also be used to study other complex networks’ topologies.

  20. An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

    Science.gov (United States)

    Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

    2016-08-09

    Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance

  1. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

    Directory of Open Access Journals (Sweden)

    Martin Juliette

    2011-06-01

    Full Text Available Abstract Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet, which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i ubiquitous motifs, shared by several superfamilies and (ii superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  2. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2014-02-20

    ChIP-Seq (chromatin immunoprecipitation sequencing) has provided the advantage for finding motifs as ChIP-Seq experiments narrow down the motif finding to binding site locations. Recent motif finding tools facilitate the motif detection by providing user-friendly Web interface. In this work, we reviewed nine motif finding Web tools that are capable for detecting binding site motifs in ChIP-Seq data. We showed each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommended the users to use multiple motif finding Web tools that implement different algorithms for obtaining significant motifs, overlapping resemble motifs, and non-overlapping motifs. Finally, we provided our suggestions for future development of motif finding Web tool that better assists researchers for finding motifs in ChIP-Seq data.

  3. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  4. Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

    Science.gov (United States)

    Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

    2018-01-01

    Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…

  5. A proposed vestigial translation initiation motif in VP1 of hepatitis A virus.

    Science.gov (United States)

    Kang, Jeong-Ah; Funkhouser, Ann W

    2002-07-01

    The internal ribosome entry site (IRES) of picornaviruses has a 3' polypyrimidine tract (PPT) 16-24 bases upstream of an AUG triplet (PPT/AUG motif). This motif is critical in determining the efficiency of cap-independent translation. HAV has a conserved PPT/AUG motif consisting of a nine base sequence (AGGUUUUUC) 23 bases upstream of the preferred AUG start codon. This HAV-specific PPT/AUG motif is repeated and conserved in VP1 of HAV, but not of other picornaviruses. We proposed that the PPT/AUG motif in the open reading frame initiated translation and/or had an impact on the life cycle of the virus. In vitro translation of mutant bicistronic mRNAs and growth in cell culture of mutant viruses provided no evidence that the VP1 PPT/AUG motif had any impact on either translation or growth. HAV differs from other picornaviruses in its inefficient growth in cell culture. Since the HAV-specific PPT/AUG motif is found in only 1 in 300,000 reported viral sequences outside the hepatovirus genus, this motif may be a vestigial translation initiation element and may have played a role in determining the unusual phenotype of HAV.

  6. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2011-01-01

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network

  7. MotifNet: a web-server for network motif analysis.

    Science.gov (United States)

    Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

    2017-06-15

    Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  8. Mobile Technology for Improved Family Planning (MOTIF): the development of a mobile phone-based (mHealth) intervention to support post-abortion family planning (PAFP) in Cambodia.

    Science.gov (United States)

    Smith, Chris; Vannak, Uk; Sokhey, Ly; Ngo, Thoai D; Gold, Judy; Free, Caroline

    2016-01-05

    The objective of this paper is to outline the formative research process used to develop the MOTIF mobile phone-based (mHealth) intervention to support post-abortion family planning in Cambodia. The formative research process involved literature reviews, interviews and focus group discussions with clients, and consultation with clinicians and organisations implementing mHealth activities in Cambodia. This process led to the development of a conceptual framework and the intervention. Key findings from the formative research included identification of the main reasons for non-use of contraception and patterns of mobile phone use in Cambodia. We drew on components of existing interventions and behaviour change theory to develop a conceptual framework. A multi-faceted voice-based intervention was designed to address health concerns and other key determinants of contraception use. Formative research was essential in order to develop an appropriate mHealth intervention to support post-abortion contraception in Cambodia. Each component of the formative research contributed to the final intervention design.

  9. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2018-03-11

    Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core, and reveals the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.

  10. Hunting Motifs in Situla Art

    Directory of Open Access Journals (Sweden)

    Andrej Preložnik

    2013-07-01

    Full Text Available Situla art developed as an echo of the toreutic style which had spread from the Near East through the Phoenicians, Greeks and Etruscans as far as the Veneti, Raeti, Histri, and their eastern neighbours in the region of Dolenjska (Lower Carniola. An Early Iron Age phenomenon (c. 600—300 BC, it rep- resents the major and most arresting form of the contemporary visual arts in an area stretching from the foot of the Apennines in the south to the Drava and Sava rivers in the east. Indeed, individual pieces have found their way across the Alpine passes and all the way north to the Danube. In the world and art of the situlae, a prominent role is accorded to ani- mals. They are displayed in numerous representations of human activities on artefacts crafted in the classic situla style – that is, between the late 6th  and early 5th centuries BC – as passive participants (e.g. in pageants or in harness or as an active element of the situla narrative. The most typical example of the latter is the hunting scene. Today we know at least four objects decorat- ed exclusively with hunting themes, and a number of situlae and other larger vessels where hunting scenes are embedded in composite narratives. All this suggests a popularity unparallelled by any other genre. Clearly recognisable are various hunting techniques and weapons, each associated with a particu- lar type of game (Fig. 1. The chase of a stag with javelin, horse and hound is depicted on the long- familiar and repeatedly published fibula of Zagorje (Fig. 2. It displays a hound mauling the stag’s back and a hunter on horseback pursuing a hind, her neck already pierced by the javelin. To judge by the (so far unnoticed shaft end un- der the stag’s muzzle, the hunter would have been brandishing a second jave- lin as well, like the warrior of the Vače fibula or the rider of the Nesactium situla, presumably himself a hunter. Many parallels to his motif are known from Greece, Etruria, and

  11. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed; Mansour, Essam; Kalnis, Panos

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern

  12. Deciphering functional glycosaminoglycan motifs in development.

    Science.gov (United States)

    Townley, Robert A; Bülow, Hannes E

    2018-03-23

    Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  13. Bayesian centroid estimation for motif discovery.

    Science.gov (United States)

    Carvalho, Luis

    2013-01-01

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  14. Bayesian centroid estimation for motif discovery.

    Directory of Open Access Journals (Sweden)

    Luis Carvalho

    Full Text Available Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  15. Sequence-based Screening for Rare Enzymes: New Insights into the World of AMDases Reveal a Conserved Motif and 58 Novel Enzymes Clustering in Eight Distinct Families.

    Directory of Open Access Journals (Sweden)

    Janine Maimanakos

    2016-08-01

    Full Text Available Arylmalonate-Decarboxylases (AMDases, EC 4.1.1.76 are very rare and mostly underexplored enzymes. Currently only four known and biochemically characterized representatives exist. However, their ability to decarboxylate α-disubstituted malonic acid derivatives to optically pure products without cofactors makes them attractive and promising candidates for the use as biocatalysts in industrial processes. Until now, AMDases could not be separated from other members of the aspartate/glutamate racemase superfamily based on their gene sequences. Within this work, a search algorithm was developed that enables a reliable prediction of AMDase activity for potential candidates. Based on specific sequence patterns and screening methods 58 novel AMDase candidate genes could be identified in this work. Thereby, AMDases with the conserved sequence pattern of Bordetella bronchiseptica’s prototype appeared to be limited to the classes of Alpha-, Beta- and Gammaproteobacteria. Amino acid homologies and comparison of gene surrounding sequences enabled the classification of eight enzyme clusters. Particularly striking is the accumulation of genes coding for different transporters of the TTT family, TRAP transporters and ABC transporters as well as genes coding for mandelate racemases/muconate lactonizing enzymes that might be involved in substrate uptake or degradation of AMDase products. Further, three novel AMDases were characterized which showed a high enantiomeric excess (>99% of the (R-enantiomer of flurbiprofen. These are the recombinant AmdA and AmdV from Variovorax sp. strains HH01 and HH02, originated from soil, and AmdP from Polymorphum gilvum found by a data base search. Altogether our findings give new insights into the class of AMDases and reveal many previously unknown enzyme candidates with high potential for bioindustrial processes.

  16. Sequence alignment reveals possible MAPK docking motifs on HIV proteins.

    Directory of Open Access Journals (Sweden)

    Perry Evans

    Full Text Available Over the course of HIV infection, virus replication is facilitated by the phosphorylation of HIV proteins by human ERK1 and ERK2 mitogen-activated protein kinases (MAPKs. MAPKs are known to phosphorylate their substrates by first binding with them at a docking site. Docking site interactions could be viable drug targets because the sequences guiding them are more specific than phosphorylation consensus sites. In this study we use multiple bioinformatics tools to discover candidate MAPK docking site motifs on HIV proteins known to be phosphorylated by MAPKs, and we discuss the possibility of targeting docking sites with drugs. Using sequence alignments of HIV proteins of different subtypes, we show that MAPK docking patterns previously described for human proteins appear on the HIV matrix, Tat, and Vif proteins in a strain dependent manner, but are absent from HIV Rev and appear on all HIV Nef strains. We revise the regular expressions of previously annotated MAPK docking patterns in order to provide a subtype independent motif that annotates all HIV proteins. One revision is based on a documented human variant of one of the substrate docking motifs, and the other reduces the number of required basic amino acids in the standard docking motifs from two to one. The proposed patterns are shown to be consistent with in silico docking between ERK1 and the HIV matrix protein. The motif usage on HIV proteins is sufficiently different from human proteins in amino acid sequence similarity to allow for HIV specific targeting using small-molecule drugs.

  17. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  18. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  19. A combined statistical model for multiple motifs search

    International Nuclear Information System (INIS)

    Gao Lifeng; Liu Xin; Guan Shan

    2008-01-01

    Transcription factor binding sites (TFBS) play key roles in genebior 6.8 wavelet expression and regulation. They are short sequence segments with definite structure and can be recognized by the corresponding transcription factors correctly. From the viewpoint of statistics, the candidates of TFBS should be quite different from the segments that are randomly combined together by nucleotide. This paper proposes a combined statistical model for finding over-represented short sequence segments in different kinds of data set. While the over-represented short sequence segment is described by position weight matrix, the nucleotide distribution at most sites of the segment should be far from the background nucleotide distribution. The central idea of this approach is to search for such kind of signals. This algorithm is tested on 3 data sets, including binding sites data set of cyclic AMP receptor protein in E.coli, PlantProm DB which is a non-redundant collection of proximal promoter sequences from different species, collection of the intergenic sequences of the whole genome of E.Coli. Even though the complexity of these three data sets is quite different, the results show that this model is rather general and sensible. (general)

  20. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

    Science.gov (United States)

    König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

    2013-01-01

    G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141

  1. Perception Enhancement using Visual Attributes in Sequence Motif Visualization

    OpenAIRE

    Oon, Yin; Lee, Nung; Kok, Wei

    2016-01-01

    Sequence logo is a well-accepted scientific method to visualize the conservation characteristics of biological sequence motifs. Previous studies found that using sequence logo graphical representation for scientific evidence reports or arguments could seriously cause biases and misinterpretation by users. This study investigates on the visual attributes performance of a sequence logo in helping users to perceive and interpret the information based on preattentive theories and Gestalt principl...

  2. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole

    2010-01-01

    is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...

  3. A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

    Directory of Open Access Journals (Sweden)

    Asita Elengoe

    2015-01-01

    Full Text Available Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD of heat shock 70 kDa protein (PDB: 1HJO with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD simulation. Human DNA binding domain of p53 motif (SCMGGMNR retrieved from UniProt (UniProtKB: P04637 was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.

  4. Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

    Science.gov (United States)

    Soufari, Heddy

    2017-01-01

    Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515

  5. WildSpan: mining structured motifs from protein sequences

    Directory of Open Access Journals (Sweden)

    Chen Chien-Yu

    2011-03-01

    Full Text Available Abstract Background Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions that incorporates several pruning strategies to largely reduce the mining cost. Results WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode

  6. Factoring local sequence composition in motif significance analysis.

    Science.gov (United States)

    Ng, Patrick; Keich, Uri

    2008-01-01

    We recently introduced a biologically realistic and reliable significance analysis of the output of a popular class of motif finders. In this paper we further improve our significance analysis by incorporating local base composition information. Relying on realistic biological data simulation, as well as on FDR analysis applied to real data, we show that our method is significantly better than the increasingly popular practice of using the normal approximation to estimate the significance of a finder's output. Finally we turn to leveraging our reliable significance analysis to improve the actual motif finding task. Specifically, endowing a variant of the Gibbs Sampler with our improved significance analysis we demonstrate that de novo finders can perform better than has been perceived. Significantly, our new variant outperforms all the finders reviewed in a recently published comprehensive analysis of the Harbison genome-wide binding location data. Interestingly, many of these finders incorporate additional information such as nucleosome positioning and the significance of binding data.

  7. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Full Text Available Armadillo (ARM repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  8. Targeting functional motifs of a protein family

    Science.gov (United States)

    Bhadola, Pradeep; Deo, Nivedita

    2016-10-01

    The structural organization of a protein family is investigated by devising a method based on the random matrix theory (RMT), which uses the physiochemical properties of the amino acid with multiple sequence alignment. A graphical method to represent protein sequences using physiochemical properties is devised that gives a fast, easy, and informative way of comparing the evolutionary distances between protein sequences. A correlation matrix associated with each property is calculated, where the noise reduction and information filtering is done using RMT involving an ensemble of Wishart matrices. The analysis of the eigenvalue statistics of the correlation matrix for the β -lactamase family shows the universal features as observed in the Gaussian orthogonal ensemble (GOE). The property-based approach captures the short- as well as the long-range correlation (approximately following GOE) between the eigenvalues, whereas the previous approach (treating amino acids as characters) gives the usual short-range correlations, while the long-range correlations are the same as that of an uncorrelated series. The distribution of the eigenvector components for the eigenvalues outside the bulk (RMT bound) deviates significantly from RMT observations and contains important information about the system. The information content of each eigenvector of the correlation matrix is quantified by introducing an entropic estimate, which shows that for the β -lactamase family the smallest eigenvectors (low eigenmodes) are highly localized as well as informative. These small eigenvectors when processed gives clusters involving positions that have well-defined biological and structural importance matching with experiments. The approach is crucial for the recognition of structural motifs as shown in β -lactamase (and other families) and selectively identifies the important positions for targets to deactivate (activate) the enzymatic actions.

  9. iFORM: Incorporating Find Occurrence of Regulatory Motifs.

    Science.gov (United States)

    Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

    2016-01-01

    Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.

  10. Highly scalable Ab initio genomic motif identification

    KAUST Repository

    Marchand, Benoit; Bajic, Vladimir B.; Kaushik, Dinesh

    2011-01-01

    We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motiffinding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.

  11. oPOSSUM-3: advanced analysis of regulatory motif over-representation across genes or ChIP-Seq datasets.

    Science.gov (United States)

    Kwon, Andrew T; Arenillas, David J; Worsley Hunt, Rebecca; Wasserman, Wyeth W

    2012-09-01

    oPOSSUM-3 is a web-accessible software system for identification of over-represented transcription factor binding sites (TFBS) and TFBS families in either DNA sequences of co-expressed genes or sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of co-regulated genes and published ChIP-Seq data demonstrates the capacity for oPOSSUM-3 to identify mediating transcription factors (TF) for co-regulated genes or co-recovered sequences. oPOSSUM-3 is available at http://opossum.cisreg.ca.

  12. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  13. The Verrucomicrobia LexA-binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

    Directory of Open Access Journals (Sweden)

    Ivan Erill

    2016-07-01

    Full Text Available The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  14. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  15. CombiMotif: A new algorithm for network motifs discovery in protein-protein interaction networks

    Science.gov (United States)

    Luo, Jiawei; Li, Guanghui; Song, Dan; Liang, Cheng

    2014-12-01

    Discovering motifs in protein-protein interaction networks is becoming a current major challenge in computational biology, since the distribution of the number of network motifs can reveal significant systemic differences among species. However, this task can be computationally expensive because of the involvement of graph isomorphic detection. In this paper, we present a new algorithm (CombiMotif) that incorporates combinatorial techniques to count non-induced occurrences of subgraph topologies in the form of trees. The efficiency of our algorithm is demonstrated by comparing the obtained results with the current state-of-the art subgraph counting algorithms. We also show major differences between unicellular and multicellular organisms. The datasets and source code of CombiMotif are freely available upon request.

  16. An Analysis of Multi-type Relational Interactions in FMA Using Graph Motifs with Disjointness Constraints

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation. PMID:23304382

  17. An analysis of multi-type relational interactions in FMA using graph motifs with disjointness constraints.

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation.

  18. Pipeline for the Analysis of ChIP-seq Data and New Motif Ranking Procedure

    KAUST Repository

    Ashoor, Haitham

    2011-06-01

    This thesis presents a computational methodology for ab-initio identification of transcription factor binding sites based on ChIP-seq data. This method consists of three main steps, namely ChIP-seq data processing, motif discovery and models selection. A novel method for ranking the models of motifs identified in this process is proposed. This method combines multiple factors in order to rank the provided candidate motifs. It combines the model coverage of the ChIP-seq fragments that contain motifs from which that model is built, the suitable background data made up of shuffled ChIP-seq fragments, and the p-value that resulted from evaluating the model on actual and background data. Two ChIP-seq datasets retrieved from ENCODE project are used to evaluate and demonstrate the ability of the method to predict correct TFBSs with high precision. The first dataset relates to neuron-restrictive silencer factor, NRSF, while the second one corresponds to growth-associated binding protein, GABP. The pipeline system shows high precision prediction for both datasets, as in both cases the top ranked motif closely resembles the known motifs for the respective transcription factors.

  19. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Science.gov (United States)

    Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui

    2012-01-01

    Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  20. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Directory of Open Access Journals (Sweden)

    Pooya Zandevakili

    Full Text Available Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  1. Discovery of cell-type specific DNA motif grammar in cis-regulatory elements using random Forest.

    Science.gov (United States)

    Wang, Xin; Lin, Peijie; Ho, Joshua W K

    2018-01-19

    It has been observed that many transcription factors (TFs) can bind to different genomic loci depending on the cell type in which a TF is expressed in, even though the individual TF usually binds to the same core motif in different cell types. How a TF can bind to the genome in such a highly cell-type specific manner, is a critical research question. One hypothesis is that a TF requires co-binding of different TFs in different cell types. If this is the case, it may be possible to observe different combinations of TF motifs - a motif grammar - located at the TF binding sites in different cell types. In this study, we develop a bioinformatics method to systematically identify DNA motifs in TF binding sites across multiple cell types based on published ChIP-seq data, and address two questions: (1) can we build a machine learning classifier to predict cell-type specificity based on motif combinations alone, and (2) can we extract meaningful cell-type specific motif grammars from this classifier model. We present a Random Forest (RF) based approach to build a multi-class classifier to predict the cell-type specificity of a TF binding site given its motif content. We applied this RF classifier to two published ChIP-seq datasets of TF (TCF7L2 and MAX) across multiple cell types. Using cross-validation, we show that motif combinations alone are indeed predictive of cell types. Furthermore, we present a rule mining approach to extract the most discriminatory rules in the RF classifier, thus allowing us to discover the underlying cell-type specific motif grammar. Our bioinformatics analysis supports the hypothesis that combinatorial TF motif patterns are cell-type specific.

  2. Alanine substitutions in the GXXXG motif alter C99 cleavage by γ-secretase but not its dimerization.

    Science.gov (United States)

    Higashide, Hidekazu; Ishihara, Seiko; Nobuhara, Mika; Ihara, Yasuo; Funamoto, Satoru

    2017-03-01

    The amyloid β (Aβ) protein is a major component of senile plaques, one of the neuropathological hallmarks of Alzheimer's disease. Amyloidogenic processing of amyloid precursor protein (APP) by β- and γ-secretases leads to production of Aβ. APP contains tandem triple repeats of the GXXXG motif in its extracellular juxtamembrane and transmembrane regions. It is reported that the GXXXG motif is related to protein-protein interactions, but it remains controversial whether the GXXXG motif in APP is involved in substrate dimerization and whether dimerization affects γ-secretase-dependent cleavage. Therefore, the relationship between the GXXXG motifs, substrate dimerization, and γ-secretase-dependent cleavage sites remains unclear. Here, we applied blue native poly acrylamide gel electrophoresis to examine the effect of alanine substitutions within the GXXXG motifs of APP carboxyl terminal fragment (C99) on its dimerization and Aβ production. Surprisingly, alanine substitutions in the motif failed to alter C99 dimerization in detergent soluble state. Cell-based and solubilized γ-secretase assays demonstrated that increasing alanine substitutions in the motif tended to decrease long Aβ species such as Aβ42 and Aβ43 and to increase in short Aβ species concomitantly. Our data suggest that the GXXXG motif is crucial for Aβ production, but not for C99 dimerization. © 2016 International Society for Neurochemistry.

  3. Dynamic motifs in socio-economic networks

    Science.gov (United States)

    Zhang, Xin; Shao, Shuai; Stanley, H. Eugene; Havlin, Shlomo

    2014-12-01

    Socio-economic networks are of central importance in economic life. We develop a method of identifying and studying motifs in socio-economic networks by focusing on “dynamic motifs,” i.e., evolutionary connection patterns that, because of “node acquaintances” in the network, occur much more frequently than random patterns. We examine two evolving bi-partite networks: i) the world-wide commercial ship chartering market and ii) the ship build-to-order market. We find similar dynamic motifs in both bipartite networks, even though they describe different economic activities. We also find that “influence” and “persistence” are strong factors in the interaction behavior of organizations. When two companies are doing business with the same customer, it is highly probable that another customer who currently only has business relationship with one of these two companies, will become customer of the second in the future. This is the effect of influence. Persistence means that companies with close business ties to customers tend to maintain their relationships over a long period of time.

  4. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  5. CONTEMPORARY USAGE OF TRADITIONAL TURKISH MOTIFS IN PRODUCT DESIGNS

    Directory of Open Access Journals (Sweden)

    Tulay Gumuser

    2012-12-01

    Full Text Available The aim of this study is to identify the traditional Turkish motifs and its relations among present industrial designs. Traditional Turkish motifs played a very important role in 16th century onwards. The arts of the Ottoman Empire were used because of their symbolic meanings and unique styles. When we examine these motifs we encounter; Tiger Stripe, Three Spot (Çintemani, Rumi, Hatayi, Penç, Cloud, Crescent, Star, Crown, Hyacinth, Tulip and Carnation motifs. Nowadays, Turkish designers have begun to use these traditional Turkish motifs in their designs so as to create differences and awareness in the world design. The examples of these industrial designs, using the Turkish motifs, have survived and have Ottoman heritage and historical value. In this study, the Turkish motifs will be examined along with their focus on contemporary Turkish industrial designs used today.

  6. Memetic algorithms for de novo motif-finding in biomedical sequences.

    Science.gov (United States)

    Bi, Chengpeng

    2012-09-01

    The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies in the algorithm design are employed that are to not only efficiently explore the multiple sequence local alignment space, but also effectively uncover the molecular signals. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few of other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. In the simulation, it shows that MaMotif is the most time-efficient algorithm compared with others, that is, it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm is compared favorably or superior to other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary micro

  7. Aplikasi Ornamen Khas Maluku untuk Pengembangan Desain Motif Batik

    Directory of Open Access Journals (Sweden)

    Masiswo Masiswo

    2016-04-01

    Full Text Available ABSTRAKMaluku memiliki banyak ragam hias budaya warisan nilai leluhur berupa ornamen etnis yang merupakan kesenian dan keterampilan kerajinan. Hasil warisan tersebut sampai saat ini masih lestari hidup serta dapat dinikmati sebagai konsumsi rohani yang memuaskan manusia. Berkaitan dengan keberlangsungan nilai-nilai tradisi etnis yang berwujud pada ornamen-ornamen daerah Maluku, maka dikembangkan untuk kebutuhan manusia berupa motif batik pada kain. Pengembangan ornamen ini lebih menekankan pada representasi akan bentuk-bentuk ornamen yang diterapkan pada kerajinan batik berupa motif khas Maluku. Pengembangan alternatif desain motif batik dibuat tiga variasi yang bersumber dari ornamen khas Maluku dibuat prototipe produknya dan diuji ketahanan luntur warnanya. Hasil uji ketahanan luntur warna terhadap gosokan basah dari tiga prototipe produk berpredikat baik sekali terdapat pada “Motif Siwa” dan predikat baik pada motif “Siwa Talang” dan motif “Matahari Siwa Talang”.Kata kunci: desain, Maluku, motif batik, ornamenABSTRACTMaluku has much decorative ancestral cultural heritage value in the form of ornament ethnic arts and crafts skills. The result of the legacy is still sustainable living can be enjoyed as well as satisfying spiritual human consumption.Related to the sustainability of traditional values in the form of ethnic ornaments Maluku, it was developed for human needs in the form of batik cloth . The development of these ornaments will be more emphasis on the representation forms of ornamentation that is applied to a batik motif Maluku. Development of alternative design motif made three variations. The development of three alternative design motifs derived from the Maluku ornaments made and tested a prototype product color fastness. The test results of color fastness to wet rubbing of the three prototypes are excellent products predicated on the "Motif Siwa" and a good rating on the motif "Siwa Talang" and motif "Matahari Siwa

  8. Identity and functions of CxxC-derived motifs.

    Science.gov (United States)

    Fomenko, Dmitri E; Gladyshev, Vadim N

    2003-09-30

    Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.

  9. The regulation of ER export and Golgi retention of ST3Gal5 (GM3/GM4 synthase) and B4GalNAcT1 (GM2/GD2/GA2 synthase) by arginine/lysine-based motif adjacent to the transmembrane domain.

    Science.gov (United States)

    Uemura, Satoshi; Shishido, Fumi; Kashimura, Madoka; Inokuchi, Jin-ichi

    2015-12-01

    In the Golgi maturation model, the Golgi cisternae dynamically mature along a secretory pathway. In this dynamic process, glycosyltransferases are transported from the endoplasmic reticulum (ER) to the Golgi apparatus where they remain and function. The precise mechanism behind this maturation process remains unclear. We investigated two glycosyltransferases, ST3Gal5 (ST3G5) and B4GalNAcT1 (B4GN1), involved in ganglioside synthesis and examined their signal sequences for ER export and Golgi retention. Reports have suggested that the [R/K](X)[R/K] motif functions as an ER exporting signal; however, this signal sequence is insufficient in stably expressed, full-length ST3G5. Through further analysis, we have clarified that the (2)R(3)R(X)(5) (9)K(X)(3) (13)K sequence in ST3G5 is essential for ER export. We have named the sequence the R/K-based motif. On the other hand, for ER export of B4GN1, the homodimer formation in addition to the R/K-based motif is required for ER export suggesting the importance of unidentified lumenal side interaction. We found that ST3G5 R2A/R3A and K9A/K13A mutants localized not only in Golgi apparatus but also in endosomes. Furthermore, the amounts of mature type asparagine-linked (N)-glycans in ST3G5 R2A/R3A and K9A/K13A mutants were decreased compared with those in wild-type proteins, and the stability of the mutants was lower. These results suggest that the R/K-based motif is necessary for the Golgi retention of ST3G5 and that the retention is involved in the maturation of N-glycans and in stability. Thus, several basic amino acids located on the cytoplasmic tail of ST3G5 play important roles in both ER export and Golgi retention. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  10. Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes

    Directory of Open Access Journals (Sweden)

    Kistler Corby

    2010-03-01

    Full Text Available Abstract Background Fusarium graminearum (Fg, a major fungal pathogen of cultivated cereals, is responsible for billions of dollars in agriculture losses. There is a growing interest in understanding the transcriptional regulation of this organism, especially the regulation of genes underlying its pathogenicity. The generation of whole genome sequence assemblies for Fg and three closely related Fusarium species provides a unique opportunity for such a study. Results Applying comparative genomics approaches, we developed a computational pipeline to systematically discover evolutionarily conserved regulatory motifs in the promoter, downstream and the intronic regions of Fg genes, based on the multiple alignments of sequenced Fusarium genomes. Using this method, we discovered 73 candidate regulatory motifs in the promoter regions. Nearly 30% of these motifs are highly enriched in promoter regions of Fg genes that are associated with a specific functional category. Through comparison to Saccharomyces cerevisiae (Sc and Schizosaccharomyces pombe (Sp, we observed conservation of transcription factors (TFs, their binding sites and the target genes regulated by these TFs related to pathways known to respond to stress conditions or phosphate metabolism. In addition, this study revealed 69 and 39 conserved motifs in the downstream regions and the intronic regions, respectively, of Fg genes. The top intronic motif is the splice donor site. For the downstream regions, we noticed an intriguing absence of the mammalian and Sc poly-adenylation signals among the list of conserved motifs. Conclusion This study provides the first comprehensive list of candidate regulatory motifs in Fg, and underscores the power of comparative genomics in revealing functional elements among related genomes. The conservation of regulatory pathways among the Fusarium genomes and the two yeast species reveals their functional significance, and provides new insights in their

  11. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.

    2011-11-15

    Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. The Author(s) 2011. Published by Oxford University Press. All rights reserved.

  12. UKIRAN KERAWANG ACEH GAYO SEBAGAI INSPIRASI PENCIPTAAN MOTIF BATIK KHAS GAYO

    Directory of Open Access Journals (Sweden)

    Irfa ina Rohana Salma

    2016-12-01

    Full Text Available ABSTRAK Industri batik mulai berkembang di Gayo, tetapi belum memiliki motif batik khas daerah. Oleh karena itu perlu diciptakan motif batik khas Gayo, dengan mengambil inspirasi dari ukiran yang terdapat pada rumah tradisional yang biasa disebut ukiran kerawang Gayo. Tujuan penciptaan seni ini adalah untuk menciptakan motif batik yang memiliki ciri khas Gayo. Metode yang digunakan yaitu eksplorasi ide, perancangan, dan perwujudan menjadi motif batik. Dalam kegiatan ini telah diciptakan enam motif batik khas Gayo yaitu: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif Gayo Lurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. Hasil uji kesukaan terhadap motif kepada lima puluh responden menunjukkan bahwa Motif Ceplok Gayo paling banyak dipilih oleh responden yaitu sebesar 19%, sedangkan Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo Lurus 15% dan Motif Gayo Tegak 14%. Rata-rata motif yang dihasilkan mendapatkan apresiasi yang baik dari responden, sehingga semua motif layak diproduksi sebagai batik khas Gayo.Kata kunci: batik Gayo, Motif Ceplok Gayo, Motif Parang Gayo.ABSTRACTBatik industry began to develop in Gayo, but have not had a typical batik motif itself. Therefore, it is necessary to create batik motifs of Gayo, by taking inspiration from the carvings found in traditional houses commonly called kerawang Gayo. The purpose of this art is to create motifs those have a Gayo characteristic. The method used are the idea exploration, design, and motifs embodiment. In this activity has created six Gayo batik motifs, namely: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif GayoLurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. The test results fondness of the motives to fifty respondents indicated that the Motif Ceplok Gayo most preferred by respondents ie 19%, while Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo

  13. A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Tony Håndstad

    Full Text Available BACKGROUND: Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial. RESULTS: Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods. CONCLUSIONS: Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites.

  14. Network motif frequency vectors reveal evolving metabolic network organisation.

    Science.gov (United States)

    Pearcy, Nicole; Crofts, Jonathan J; Chuzhanova, Nadia

    2015-01-01

    At the systems level many organisms of interest may be described by their patterns of interaction, and as such, are perhaps best characterised via network or graph models. Metabolic networks, in particular, are fundamental to the proper functioning of many important biological processes, and thus, have been widely studied over the past decade or so. Such investigations have revealed a number of shared topological features, such as a short characteristic path-length, large clustering coefficient and hierarchical modular structure. However, the extent to which evolutionary and functional properties of metabolism manifest via this underlying network architecture remains unclear. In this paper, we employ a novel graph embedding technique, based upon low-order network motifs, to compare metabolic network structure for 383 bacterial species categorised according to a number of biological features. In particular, we introduce a new global significance score which enables us to quantify important evolutionary relationships that exist between organisms and their physical environments. Using this new approach, we demonstrate a number of significant correlations between environmental factors, such as growth conditions and habitat variability, and network motif structure, providing evidence that organism adaptability leads to increased complexities in the resultant metabolic networks.

  15. Genetic interaction motif finding by expectation maximization – a novel statistical model for inferring gene modules from synthetic lethality

    Directory of Open Access Journals (Sweden)

    Ye Ping

    2005-12-01

    Full Text Available Abstract Background Synthetic lethality experiments identify pairs of genes with complementary function. More direct functional associations (for example greater probability of membership in a single protein complex may be inferred between genes that share synthetic lethal interaction partners than genes that are directly synthetic lethal. Probabilistic algorithms that identify gene modules based on motif discovery are highly appropriate for the analysis of synthetic lethal genetic interaction data and have great potential in integrative analysis of heterogeneous datasets. Results We have developed Genetic Interaction Motif Finding (GIMF, an algorithm for unsupervised motif discovery from synthetic lethal interaction data. Interaction motifs are characterized by position weight matrices and optimized through expectation maximization. Given a seed gene, GIMF performs a nonlinear transform on the input genetic interaction data and automatically assigns genes to the motif or non-motif category. We demonstrate the capacity to extract known and novel pathways for Saccharomyces cerevisiae (budding yeast. Annotations suggested for several uncharacterized genes are supported by recent experimental evidence. GIMF is efficient in computation, requires no training and automatically down-weights promiscuous genes with high degrees. Conclusion GIMF effectively identifies pathways from synthetic lethality data with several unique features. It is mostly suitable for building gene modules around seed genes. Optimal choice of one single model parameter allows construction of gene networks with different levels of confidence. The impact of hub genes the generic probabilistic framework of GIMF may be used to group other types of biological entities such as proteins based on stochastic motifs. Analysis of the strongest motifs discovered by the algorithm indicates that synthetic lethal interactions are depleted between genes within a motif, suggesting that synthetic

  16. Structural motifs of pre-nucleation clusters.

    Science.gov (United States)

    Zhang, Y; Türkmen, I R; Wassermann, B; Erko, A; Rühl, E

    2013-10-07

    Structural motifs of pre-nucleation clusters prepared in single, optically levitated supersaturated aqueous aerosol microparticles containing CaBr2 as a model system are reported. Cluster formation is identified by means of X-ray absorption in the Br K-edge regime. The salt concentration beyond the saturation point is varied by controlling the humidity in the ambient atmosphere surrounding the 15-30 μm microdroplets. This leads to the formation of metastable supersaturated liquid particles. Distinct spectral shifts in near-edge spectra as a function of salt concentration are observed, in which the energy position of the Br K-edge is red-shifted by up to 7.1 ± 0.4 eV if the dilute solution is compared to the solid. The K-edge positions of supersaturated solutions are found between these limits. The changes in electronic structure are rationalized in terms of the formation of pre-nucleation clusters. This assumption is verified by spectral simulations using first-principle density functional theory and molecular dynamics calculations, in which structural motifs are considered, explaining the experimental results. These consist of solvated CaBr2 moieties, rather than building blocks forming calcium bromide hexahydrates, the crystal system that is formed by drying aqueous CaBr2 solutions.

  17. POWRS: position-sensitive motif discovery.

    Directory of Open Access Journals (Sweden)

    Ian W Davis

    Full Text Available Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm "POWRS" (POsition-sensitive WoRd Set for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties.BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.

  18. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Science.gov (United States)

    Oliveira, Graziele Pereira; Andrade, Ana Cláudia dos Santos Pereira; Rodrigues, Rodrigo Araújo Lima; Arantes, Thalita Souza; Boratto, Paulo Victor Miranda; Silva, Ludmila Karen dos Santos; Dornas, Fábio Pio; Trindade, Giliane de Souza; Drumond, Betânia Paiva; La Scola, Bernard; Kroon, Erna Geessien; Abrahão, Jônatas Santos

    2017-01-01

    For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations. PMID:28117683

  19. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Directory of Open Access Journals (Sweden)

    Graziele Pereira Oliveira

    2017-01-01

    Full Text Available For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV, raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’ that could be evolved gradually by nucleotides’ gain and loss and point mutations.

  20. A Simple Decision Rule for Recognition of Poly(A) Tail Signal Motifs in Human Genome

    KAUST Repository

    AbouEisha, Hassan M.

    2015-05-12

    Background is the numerous attempts were made to predict motifs in genomic sequences that correspond to poly (A) tail signals. Vast portion of this effort has been directed to a plethora of nonlinear classification methods. Even when such approaches yield good discriminant results, identifying dominant features of regulatory mechanisms nevertheless remains a challenge. In this work, we look at decision rules that may help identifying such features. Findings are we present a simple decision rule for classification of candidate poly (A) tail signal motifs in human genomic sequence obtained by evaluating features during the construction of gradient boosted trees. We found that values of a single feature based on the frequency of adenine in the genomic sequence surrounding candidate signal and the number of consecutive adenine molecules in a well-defined region immediately following the motif displays good discriminative potential in classification of poly (A) tail motifs for samples covered by the rule. Conclusions is the resulting simple rule can be used as an efficient filter in construction of more complex poly(A) tail motifs classification algorithms.

  1. Parole, Sintagmatik, dan Paradigmatik Motif Batik Mega Mendung

    Directory of Open Access Journals (Sweden)

    Rudi - Nababan

    2012-04-01

    Full Text Available ABSTRACT   Discussing traditional batik is related a lot to the organization system of fine arts element ac- companying it, either the pattern of the motif or the technique of the making. In this case, the motif of Mega Mendung Cirebon certainly has patterns and rules which are traditionally different from the other motifs in other areas. Through  semiotics analysis especially with Saussure and Pierce concept, it can be traced that batik with Cirebon motif, in this case Mega Mendung motif, has parole and langue system, as unique fine arts language in batik, and structure of visual syntagmatic and paradigmatic. In the context of batik motif as fine arts language, it is surely related to sign system as symbol and icon.       Keywords: visual semiotic, Cirebon’s batik.

  2. Structure-based design synthesis of functionalized 3-(5-(s-phenyl)-4H-pyrazol-3-yl)-2H-chromen-2-one motifs and indigenous plant extracts and their antimalarial potential

    Science.gov (United States)

    Olayinka, Ajani; Grace, Olasehinde; Titilope, Dokunmu; Ruth, Diji-Geske; Olabode, Onileere; John, Openibo; Oreoluwa, Oluseye; Tochukwu, Chileke; Ezekiel, Adebiyi

    2018-04-01

    Resistance of the malaria parasite to conventional therapeutic agents calls for increased efforts in antimalarial drug discovery. Current efforts should be targeted at developing safe and affordable new agents to counter the spread of malaria parasites that are resistant to existing therapy. In this study, toxicological and in vivo antiplasmodial properties of 3-(5-(s-phenyl)-4H-pyrazol-3-yl)-42H-chromen-2, Mangifera indica and Tithonia diversifolia in swiss albino mice models, Musmusculus were investigated. 2H-Chromen-2-one also known as coumarin is highly privileged oxygen-containing heterocyclic entity which are present in plant kingdom as secondary metabolites. The maceration technique of crude drug extraction was employed using cold water extraction. Toxicological analysis was carried out using Lorke's method for acute toxicity testing while the chemosuppressive activity was carried out using Peter's four day test on early infection. We also report the synthesis of functionalized 3-(5-(s-phenyl)-4H-pyrazol-3-yl)-2H-chromen-2-one motifs via microwave assisted synthetic approach and isolation of indigenous plant extract in order to investigate their antimalarial efficacy. The condensation reaction of 3-acetylcoumarin with various benzaldehyde derivatives resulted in the formation of 3-[3-acryloyl]-2H-chromen-2-one which was subsequently reaction the hydrazine hydrate via microwave assisted hydrazinolysis to afford the targeted 3-(5-(s-phenyl)-4H-pyrazol-3-yl)-2H-chromen-2-one motifs. The chemical structures were confirmed by analytical data and spectroscopic means such as FT-IR, UV, 1H NMR, 13C NMR and DEPT-135. The microwave assisted reaction was remarkably successful and gave targeted 3-(5-(s-phenyl)-4H-pyrazol-3-yl)-2H-chromen-2-one motifs in higher yields at lesser reaction time compared to conventional heating method. The LD50 of the aqueous extracts of the leaves and stem bark Mangifera indica was established to be ± 707.11 mg/kg b.w., p.o. (body weight

  3. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Directory of Open Access Journals (Sweden)

    Jie Zhu

    Full Text Available DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  4. Accurate Quantification of microRNA via Single Strand Displacement Reaction on DNA Origami Motif

    Science.gov (United States)

    Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs. PMID:23990889

  5. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.

  6. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Science.gov (United States)

    Zhu, Jie; Feng, Xiaolu; Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  7. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Full Text Available Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs and 'functional' (partial subgraphs. Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.

  8. Rekayasa Pengembangan Desain Motif Batik Khas Melayu

    Directory of Open Access Journals (Sweden)

    Eustasia Sri Murwati

    2016-04-01

    Full Text Available ABSTRAKPengembangan desain batik melalui rancang bangun perekayasaan desain menurut ragam hias Melayu meliputi pengembangan motif dan proses, termasuk pemilihan komposisi warna. Proses yang sering dilakukan yaitu proses celup, penghilangan lilin dan celup warna tumpangan atau proses colet, celup, penghilangan lilin atau celup kemudian penghilangan lilin yang disebut Batik Kelengan. Setiap pulau di Indonesia mempunyai ciri khas budaya dan kesenian yang dikenal dengan corak/ragam hias khas daerah, juga ornamen yang diminati oleh masyarakat dari daerah tersebut atau dari daerah lain. Kondisi demikian mendorong pertumbuhan industri kerajinan yang memanfaatkan unsur–unsur seni. Adapun motif yang diperoleh adalah: Ayam Berlaga, Bungo Matahari, Kuntum Bersanding, Lancang Kuning, Encong Kerinci, Durian Pecah, Bungo Bintang, Bungo Pauh Kecil, Riang-riang, Bungo Nagaro. Pengembangan desain tersebut dipilih 3 produk terbaik yang dinilai oleh 5 penilai yang ahli di bidang desain batik, yaitu motif Durian Pecah, Ayam Berlaga, dan Bungo Matahari. Rancang bangun diversifikasi desain dengan memanfaatkan unsur–unsur seni dan ketrampilan etnis Melayu yaitu pemilihan ragam hias dan motif batik Melayu untuk diterapkan ke bahan sandang dengan komposisi warna yang menarik, sehingga produk memenuhi selera konsumen. Memperbaiki keberagaman batik dengan meningkatkan desain produk antara lain menuangkan ragam hias Melayu ke dalam proses batik yang menggunakan berbagai macam warna sehingga komposisi warna memadai. Diperoleh hasil produk batik dengan ragam hias Melayu yang berkualitas dan komposisi warna yang sesuai dengan karakter ragam hias Melayu. Rancang bangun desain produk untuk mendapatkan formulasi desain serta kelayakan prosesnya dengan penekanan pada teknologi akrab lingkungan dilaksanakan dengan alternatif pendekatan yaitu penciptaan desain bentuk baru.Kata kunci: desain, batik, rancang bangun, ragam hias, MelayuABSTRACTDevelopment of batik design through

  9. Transnationalism as a motif in family stories.

    Science.gov (United States)

    Stone, Elizabeth; Gomez, Erica; Hotzoglou, Despina; Lipnitsky, Jane Y

    2005-12-01

    Family stories have long been recognized as a vehicle for assessing components of a family's emotional and social life, including the degree to which an immigrant family has been willing to assimilate. Transnationalism, defined as living in one or more cultures and maintaining connections to both, is now increasingly common. A qualitative study of family stories in the family of those who appear completely "American" suggests that an affiliation with one's home country is nevertheless detectable in the stories via motifs such as (1) positively connotated home remedies, (2) continuing denigration of home country "enemies," (3) extensive knowledge of the home country history and politics, (4) praise of endogamy and negative assessment of exogamy, (5) superiority of home country to America, and (6) beauty of home country. Furthermore, an awareness of which model--assimilationist or transnational--governs a family's experience may help clarify a clinician's understanding of a family's strengths, vulnerabilities, and mode of framing their cultural experiences.

  10. Motif decomposition of the phosphotyrosine proteome reveals a new N-terminal binding motif for SHIP2

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Hanke, S.; Hinsby, A. M.

    2008-01-01

    set of 481 unique phosphotyrosine (Tyr(P)) peptides by sequence similarity to known ligands of the Src homology 2 (SH2) and the phosphotyrosine binding (PTB) domains. From 20 clusters we extracted 16 known and four new interaction motifs. Using quantitative mass spectrometry we pulled down Tyr......(P)-specific binding partners for peptides corresponding to the extracted motifs. We confirmed numerous previously known interaction motifs and found 15 new interactions mediated by phosphosites not previously known to bind SH2 or PTB. Remarkably, a novel hydrophobic N-terminal motif ((L/V/I)(L/V/I)pY) was identified...

  11. Specificity and affinity motifs for Grb2 SH2-ligand interactions

    NARCIS (Netherlands)

    Kessels, Helmut W. H. G.; Ward, Alister C.; Schumacher, Ton N. M.

    2002-01-01

    Protein-protein interactions are often mediated by the recognition of short continuous amino acid stretches on target proteins by specific binding domains. Affinity-based selection strategies have successfully been used to define recognition motifs for a large series of such protein domains.

  12. A cell-surface-anchored ratiometric i-motif sensor for extracellular pH detection.

    Science.gov (United States)

    Ying, Le; Xie, Nuli; Yang, Yanjing; Yang, Xiaohai; Zhou, Qifeng; Yin, Bincheng; Huang, Jin; Wang, Kemin

    2016-06-14

    A FRET-based sensor is anchored on the cell surface through streptavidin-biotin interactions. Due to the excellent properties of the pH-sensitive i-motif structure, the sensor can detect extracellular pH with high sensitivity and excellent reversibility.

  13. STUDYING THE INFLUENCE OF THE PYRENE INTERCALATOR TINA ON THE STABILITY OF DNA i-MOTIFS

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed A.; Pedersen, Erik Bjerregaard; Khaireldin, Nahid A.

    2012-01-01

    Certain cytosine-rich (C-rich) DNA sequences can fold into secondary structures as four-stranded i-motifs with hemiprotonated base pairs. Here we synthesized C-rich TINA-intercalating oligonucleotides by inserting a nonnucleotide pyrene moiety between two C-rich regions. The stability of their i-...

  14. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.; Rangkuti, Farania; Schramm, Michael C.; Jankovic, Boris R.; Kamau, Allan; Chowdhary, Rajesh; Archer, John A.C.; Bajic, Vladimir B.

    2011-01-01

    . These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity

  15. The limits of de novo DNA motif discovery.

    Directory of Open Access Journals (Sweden)

    David Simcha

    Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery-searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of

  16. Phylogenetic analysis, based on EPIYA repeats in the cagA gene of Indian Helicobacter pylori, and the implications of sequence variation in tyrosine phosphorylation motifs on determining the clinical outcome

    Directory of Open Access Journals (Sweden)

    Santosh K. Tiwari

    2011-01-01

    Full Text Available The population of India harbors one of the world's most highly diverse gene pools, owing to the influx of successive waves of immigrants over regular periods in time. Several phylogenetic studies involving mitochondrial DNA and Y chromosomal variation have demonstrated Europeans to have been the first settlers in India. Nevertheless, certain controversy exists, due to the support given to the thesis that colonization was by the Austro-Asiatic group, prior to the Europeans. Thus, the aim was to investigate pre-historic colonization of India by anatomically modern humans, using conserved stretches of five amino acid (EPIYA sequences in the cagA gene of Helicobacter pylori. Simultaneously, the existence of a pathogenic relationship of tyrosine phosphorylation motifs (TPMs, in 32 H. pylori strains isolated from subjects with several forms of gastric diseases, was also explored. High resolution sequence analysis of the above described genes was performed. The nucleotide sequences obtained were translated into amino acids using MEGA (version 4.0 software for EPIYA. An MJ-Network was constructed for obtaining TPM haplotypes by using NETWORK (version 4.5 software. The findings of the study suggest that Indian H. pylori strains share a common ancestry with Europeans. No specific association of haplotypes with the outcome of disease was revealed through additional network analysis of TPMs.

  17. Clustering and Candidate Motif Detection in Exosomal miRNAs by Application of Machine Learning Algorithms.

    Science.gov (United States)

    Gaur, Pallavi; Chaturvedi, Anoop

    2017-07-22

    The clustering pattern and motifs give immense information about any biological data. An application of machine learning algorithms for clustering and candidate motif detection in miRNAs derived from exosomes is depicted in this paper. Recent progress in the field of exosome research and more particularly regarding exosomal miRNAs has led much bioinformatic-based research to come into existence. The information on clustering pattern and candidate motifs in miRNAs of exosomal origin would help in analyzing existing, as well as newly discovered miRNAs within exosomes. Along with obtaining clustering pattern and candidate motifs in exosomal miRNAs, this work also elaborates the usefulness of the machine learning algorithms that can be efficiently used and executed on various programming languages/platforms. Data were clustered and sequence candidate motifs were detected successfully. The results were compared and validated with some available web tools such as 'BLASTN' and 'MEME suite'. The machine learning algorithms for aforementioned objectives were applied successfully. This work elaborated utility of machine learning algorithms and language platforms to achieve the tasks of clustering and candidate motif detection in exosomal miRNAs. With the information on mentioned objectives, deeper insight would be gained for analyses of newly discovered miRNAs in exosomes which are considered to be circulating biomarkers. In addition, the execution of machine learning algorithms on various language platforms gives more flexibility to users to try multiple iterations according to their requirements. This approach can be applied to other biological data-mining tasks as well.

  18. Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

    Science.gov (United States)

    Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

    2017-11-01

    Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  19. Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

    Directory of Open Access Journals (Sweden)

    Nils E. R. Zimmermann

    2017-11-01

    Full Text Available Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP database (61,422 compounds for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  20. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  1. RNA recognition motif (RRM)-containing proteins in Bombyx mori

    African Journals Online (AJOL)

    STORAGESEVER

    2009-03-20

    Mar 20, 2009 ... Recognition Motif (RRM), sometimes referred to as. RNP1, is one of the first identified domains for RNA interaction. RRM is very common ..... Apart from the RRM motif, eIF3-S9 has a Trp-Asp. (WD) repeat domain, Poly (A) ...

  2. BlockLogo: Visualization of peptide and sequence motif conservation

    DEFF Research Database (Denmark)

    Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

    2013-01-01

    BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...

  3. Fingerprint motifs of phytases | Fan | African Journal of Biotechnology

    African Journals Online (AJOL)

    Among the total of potential 173 phytases gained in 11 plant genomes through MAST, PAPhys are the major phytases, and HAPhys are the minor, and other phytase groups are not found in planta. Keywords: Phytase, fingerprint motif, multiple EM for motif elicitation (MEME), MAST African Journal of Biotechnology Vol.

  4. Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.

    Science.gov (United States)

    Pierstorff, Nora; Bergman, Casey M; Wiehe, Thomas

    2006-12-01

    Predicting cis-regulatory modules (CRMs) in higher eukaryotes is a challenging computational task. Commonly used methods to predict CRMs based on the signal of transcription factor binding sites (TFBS) are limited by prior information about transcription factor specificity. More general methods that bypass the reliance on TFBS models are needed for comprehensive CRM prediction. We have developed a method to predict CRMs called CisPlusFinder that identifies high density regions of perfect local ungapped sequences (PLUSs) based on multiple species conservation. By assuming that PLUSs contain core TFBS motifs that are locally overrepresented, the method attempts to capture the expected features of CRM structure and evolution. Applied to a benchmark dataset of CRMs involved in early Drosophila development, CisPlusFinder predicts more annotated CRMs than all other methods tested. Using the REDfly database, we find that some 'false positive' predictions in the benchmark dataset correspond to recently annotated CRMs. Our work demonstrates that CRM prediction methods that combine comparative genomic data with statistical properties of DNA may achieve reasonable performance when applied genome-wide in the absence of an a priori set of known TFBS motifs. The program CisPlusFinder can be downloaded at http://jakob.genetik.uni-koeln.de/bioinformatik/people/nora/nora.html. All software is licensed under the Lesser GNU Public License (LGPL).

  5. Comparison of the local binding motifs in the imidazolium-based ionic liquids [EMIM][BF{sub 4}] and [EMMIM][BF{sub 4}] through cryogenic ion vibrational predissociation spectroscopy: Unraveling the roles of anharmonicity and intermolecular interactions

    Energy Technology Data Exchange (ETDEWEB)

    Fournier, Joseph A.; Wolke, Conrad T.; Johnson, Christopher J.; Johnson, Mark A., E-mail: mark.johnson@yale.edu, E-mail: mccoy@chemistry.ohio-state.edu [Sterling Chemistry Laboratory, Yale University, New Haven, Connecticut 06520 (United States); McCoy, Anne B., E-mail: mark.johnson@yale.edu, E-mail: mccoy@chemistry.ohio-state.edu [Department of Chemistry and Biochemistry, The Ohio State University, Columbus, Ohio 43210 (United States)

    2015-02-14

    We clarify the role of the critical imidazolium C{sub (2)}H position (the central C between N atoms in the heterocycle) in the assembly motif of the [EMIM][BF{sub 4}] ionic liquid by analyzing the vibrational spectra of the bare EMIM{sup +} ion as well as that of the cationic [EMIM]{sub 2}[BF{sub 4}]{sup +} (EMIM{sup +} = 1-ethyl-3-methylimidazolium, C{sub 6}H{sub 11}N{sub 2}{sup +}) cluster. Vibrational spectra of the cold, mass-selected ions are obtained using cryogenic ion vibrational predissociation of weakly bound D{sub 2} molecules formed in a 10 K ion trap. The C{sub (2)}H behavior is isolated by following the evolution of key vibrational features when the C{sub (2)} hydrogen, the proposed binding location of the anion to the imidazolium ring, is replaced by either deuterium or a methyl group (i.e., in the EMMIM{sup +} analogue). Strong features in the ring CH stretching region of the bare ion are traced to Fermi resonances with overtones of lower frequency modes. Upon incorporation into the EMIM{sup +} ⋅ ⋅ ⋅ BF{sub 4}{sup −} ⋅ ⋅ ⋅ EMIM{sup +} ternary complex, the C{sub (2)}H oscillator strength is dramatically increased, accounting for the much more complicated patterns derived from the EMIM{sup +} ring CH stretches in the light isotopomer, which are strongly suppressed in the deuterated analogue. Further changes in the spectra that occur when the C{sub (2)}H is replaced by a methyl group are consistent with BF{sub 4}{sup −} attachment directly to the imidazolium ring in an arrangement that maximizes the electrostatic interaction between the molecular ions.

  6. Temporal motifs reveal collaboration patterns in online task-oriented networks

    Science.gov (United States)

    Xuan, Qi; Fang, Huiting; Fu, Chenbo; Filkov, Vladimir

    2015-05-01

    Real networks feature layers of interactions and complexity. In them, different types of nodes can interact with each other via a variety of events. Examples of this complexity are task-oriented social networks (TOSNs), where teams of people share tasks towards creating a quality artifact, such as academic research papers or software development in commercial or open source environments. Accomplishing those tasks involves both work, e.g., writing the papers or code, and communication, to discuss and coordinate. Taking into account the different types of activities and how they alternate over time can result in much more precise understanding of the TOSNs behaviors and outcomes. That calls for modeling techniques that can accommodate both node and link heterogeneity as well as temporal change. In this paper, we report on methodology for finding temporal motifs in TOSNs, limited to a system of two people and an artifact. We apply the methods to publicly available data of TOSNs from 31 Open Source Software projects. We find that these temporal motifs are enriched in the observed data. When applied to software development outcome, temporal motifs reveal a distinct dependency between collaboration and communication in the code writing process. Moreover, we show that models based on temporal motifs can be used to more precisely relate both individual developer centrality and team cohesion to programmer productivity than models based on aggregated TOSNs.

  7. Identification of group specific motifs in Beta-lactamase family of proteins

    Directory of Open Access Journals (Sweden)

    Saxena Akansha

    2009-12-01

    Full Text Available Abstract Background Beta-lactamases are one of the most serious threats to public health. In order to combat this threat we need to study the molecular and functional diversity of these enzymes and identify signatures specific to these enzymes. These signatures will enable us to develop inhibitors and diagnostic probes specific to lactamases. The existing classification of beta-lactamases was developed nearly 30 years ago when few lactamases were available. DLact database contain more than 2000 beta-lactamase, which can be used to study the molecular diversity and to identify signatures specific to this family. Methods A set of 2020 beta-lactamase proteins available in the DLact database http://59.160.102.202/DLact were classified using graph-based clustering of Best Bi-Directional Hits. Non-redundant (> 90 percent identical protein sequences from each group were aligned using T-Coffee and annotated using information available in literature. Motifs specific to each group were predicted using PRATT program. Results The graph-based classification of beta-lactamase proteins resulted in the formation of six groups (Four major groups containing 191, 726, 774 and 73 proteins while two minor groups containing 50 and 8 proteins. Based on the information available in literature, we found that each of the four major groups correspond to the four classes proposed by Ambler. The two minor groups were novel and do not contain molecular signatures of beta-lactamase proteins reported in literature. The group-specific motifs showed high sensitivity (> 70% and very high specificity (> 90%. The motifs from three groups (corresponding to class A, C and D had a high level of conservation at DNA as well as protein level whereas the motifs from the fourth group (corresponding to class B showed conservation at only protein level. Conclusion The graph-based classification of beta-lactamase proteins corresponds with the classification proposed by Ambler, thus there is

  8. Transcription factor binding sites prediction based on modified nucleosomes.

    Directory of Open Access Journals (Sweden)

    Mohammad Talebzadeh

    Full Text Available In computational methods, position weight matrices (PWMs are commonly applied for transcription factor binding site (TFBS prediction. Although these matrices are more accurate than simple consensus sequences to predict actual binding sites, they usually produce a large number of false positive (FP predictions and so are impoverished sources of information. Several studies have employed additional sources of information such as sequence conservation or the vicinity to transcription start sites to distinguish true binding regions from random ones. Recently, the spatial distribution of modified nucleosomes has been shown to be associated with different promoter architectures. These aligned patterns can facilitate DNA accessibility for transcription factors. We hypothesize that using data from these aligned and periodic patterns can improve the performance of binding region prediction. In this study, we propose two effective features, "modified nucleosomes neighboring" and "modified nucleosomes occupancy", to decrease FP in binding site discovery. Based on these features, we designed a logistic regression classifier which estimates the probability of a region as a TFBS. Our model learned each feature based on Sp1 binding sites on Chromosome 1 and was tested on the other chromosomes in human CD4+T cells. In this work, we investigated 21 histone modifications and found that only 8 out of 21 marks are strongly correlated with transcription factor binding regions. To prove that these features are not specific to Sp1, we combined the logistic regression classifier with the PWM, and created a new model to search TFBSs on the genome. We tested the model using transcription factors MAZ, PU.1 and ELF1 and compared the results to those using only the PWM. The results show that our model can predict Transcription factor binding regions more successfully. The relative simplicity of the model and capability of integrating other features make it a superior method

  9. Mitochondrial and Y chromosome haplotype motifs as diagnostic markers of Jewish ancestry: a reconsideration.

    Directory of Open Access Journals (Sweden)

    Sergio eTofanelli

    2014-11-01

    Full Text Available Several authors have proposed haplotype motifs based on site variants at the mitochondrial genome (mtDNA and the non-recombining portion of the Y chromosome (NRY to trace the genealogies of Jewish people. Here, we analyzed their main approaches and test the feasibility of adopting motifs as ancestry markers through construction of a large database of mtDNA and NRY haplotypes from public genetic genealogical repositories. We verified the reliability of Jewish ancestry prediction based on the Cohen and Levite Modal Haplotypes in their classical 6 STR marker format or in the extended 12 STR format, as well as four founder mtDNA lineages (HVS-I segments accounting for about 40% of the current population of Ashkenazi Jews. For this purpose we compared haplotype composition in individuals of self-reported Jewish ancestry with the rest of European, African or Middle Eastern samples, to test for non-random association of ethno-geographic groups and haplotypes. Overall, NRY and mtDNA based motifs, previously reported to differentiate between groups, were found to be more represented in Jewish compared to non-Jewish groups. However, this seems to stem from common ancestors of Jewish lineages being rather recent respect to ancestors of non-Jewish lineages with the same haplotype signatures. Moreover, the polyphyly of haplotypes which contain the proposed motifs and the misuse of constant mutation rates heavily affected previous attempts to correctly dating the origin of common ancestries. Accordingly, our results stress the limitations of using the above haplotype motifs as reliable Jewish ancestry predictors and show its inadequacy for forensic or genealogical purposes.

  10. Computational analyses of synergism in small molecular network motifs.

    Directory of Open Access Journals (Sweden)

    Yili Zhang

    2014-03-01

    Full Text Available Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically to alter the responses of the motifs to stimuli. Synergism (or antagonism was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.

  11. Triadic motifs in the dependence networks of virtual societies

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-01

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  12. RNA motif search with data-driven element ordering.

    Science.gov (United States)

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  13. Triadic motifs in the dependence networks of virtual societies.

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-10

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  14. Mutational analysis of the RecJ exonuclease of Escherichia coli: identification of phosphoesterase motifs.

    Science.gov (United States)

    Sutera, V A; Han, E S; Rajman, L A; Lovett, S T

    1999-10-01

    The recJ gene, identified in Escherichia coli, encodes a Mg(+2)-dependent 5'-to-3' exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding proteins similar to RecJ are found in some eubacteria, including Bacillus and Helicobacter, and in the archaea. Among this divergent set of sequences, seven conserved motifs emerge. We demonstrate here that amino acids within six of these motifs are essential for both the biochemical and genetic functions of E. coli RecJ. These motifs may define interactions with Mg(2+) ions or substrate DNA. A large family of proteins more distantly related to RecJ is present in archaea, eubacteria, and eukaryotes, including a hypothetical protein in the MgPa adhesin operon of Mycoplasma, a domain of putative polyA polymerases in Synechocystis and Aquifex, PRUNE of Drosophila, and an exopolyphosphatase (PPX1) of Saccharomyces cereviseae. Because these six RecJ motifs are shared between exonucleases and exopolyphosphatases, they may constitute an ancient phosphoesterase domain now found in all kingdoms of life.

  15. The MARVEL transmembrane motif of occludin mediates oligomerization and targeting to the basolateral surface in epithelia.

    Science.gov (United States)

    Yaffe, Yakey; Shepshelovitch, Jeanne; Nevo-Yassaf, Inbar; Yeheskel, Adva; Shmerling, Hedva; Kwiatek, Joanna M; Gaus, Katharina; Pasmanik-Chor, Metsada; Hirschberg, Koret

    2012-08-01

    Occludin (Ocln), a MARVEL-motif-containing protein, is found in all tight junctions. MARVEL motifs are comprised of four transmembrane helices associated with the localization to or formation of diverse membrane subdomains by interacting with the proximal lipid environment. The functions of the Ocln MARVEL motif are unknown. Bioinformatics sequence- and structure-based analyses demonstrated that the MARVEL domain of Ocln family proteins has distinct evolutionarily conserved sequence features that are consistent with its basolateral membrane localization. Live-cell microscopy, fluorescence resonance energy transfer (FRET) and bimolecular fluorescence complementation (BiFC) were used to analyze the intracellular distribution and self-association of fluorescent-protein-tagged full-length human Ocln or the Ocln MARVEL motif excluding the cytosolic C- and N-termini (amino acids 60-269, FP-MARVEL-Ocln). FP-MARVEL-Ocln efficiently arrived at the plasma membrane (PM) and was sorted to the basolateral PM in filter-grown polarized MDCK cells. A series of conserved aromatic amino acids within the MARVEL domain were found to be associated with Ocln dimerization using BiFC. FP-MARVEL-Ocln inhibited membrane pore growth during Triton-X-100-induced solubilization and was shown to increase the membrane-ordered state using Laurdan, a lipid dye. These data demonstrate that the Ocln MARVEL domain mediates self-association and correct sorting to the basolateral membrane.

  16. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed

    2011-07-20

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length of the motifs and their variable degree of conservation makes their identification hard since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).

  17. Nuclear localization of the dehydrin OpsDHN1 is determined by histidine-rich motif

    Science.gov (United States)

    Hernández-Sánchez, Itzell E.; Maruri-López, Israel; Ferrando, Alejandro; Carbonell, Juan; Graether, Steffen P.; Jiménez-Bremont, Juan F.

    2015-01-01

    The cactus OpsDHN1 dehydrin belongs to a large family of disordered and highly hydrophilic proteins known as Late Embryogenesis Abundant (LEA) proteins, which accumulate during the late stages of embryogenesis and in response to abiotic stresses. Herein, we present the in vivo OpsDHN1 subcellular localization by N-terminal GFP translational fusion; our results revealed a cytoplasmic and nuclear localization of the GFP::OpsDHN1 protein in Nicotiana benthamiana epidermal cells. In addition, dimer assembly of OpsDHN1 in planta using a Bimolecular Fluorescence Complementation (BiFC) approach was demonstrated. In order to understand the in vivo role of the histidine-rich motif, the OpsDHN1-ΔHis version was produced and assayed for its subcellular localization and dimer capability by GFP fusion and BiFC assays, respectively. We found that deletion of the OpsDHN1 histidine-rich motif restricted its localization to cytoplasm, but did not affect dimer formation. In addition, the deletion of the S-segment in the OpsDHN1 protein affected its nuclear localization. Our data suggest that the deletion of histidine-rich motif and S-segment show similar effects, preventing OpsDHN1 from getting into the nucleus. Based on these results, the histidine-rich motif is proposed as a targeting element for OpsDHN1 nuclear localization. PMID:26442018

  18. Nuclear localization of the dehydrin OpsDHN1 is determined by histidine-rich motif

    Directory of Open Access Journals (Sweden)

    Itzell Euridice Hernández-Sánchez

    2015-09-01

    Full Text Available The cactus OpsDHN1 dehydrin belongs to a large family of disordered and highly hydrophilic proteins known as Late Embryogenesis Abundant (LEA proteins, which accumulate during the late stages of embryogenesis and in response to abiotic stresses. Herein, we present the in vivo OpsDHN1 subcellular localization by N-terminal GFP translational fusion; our results revealed a cytoplasmic and nuclear localization of the GFP::OpsDHN1 protein in Nicotiana benthamiana epidermal cells. In addition, dimer assembly of OpsDHN1 in planta using a Bimolecular Fluorescence Complementation (BiFC approach was demonstrated. In order to understand the in vivo role of the histidine-rich motif, the OpsDHN1-ΔHis version was produced and assayed for its subcellular localization and dimer capability by GFP fusion and BiFC assays, respectively. We found that deletion of the OpsDHN1 histidine-rich motif restricted its localization to cytoplasm, but did not affect dimer formation. In addition, the deletion of the S-segment in the OpsDHN1 protein affected its nuclear localization. Our data suggest that the deletion of histidine-rich motif and S-segment show similar effects, preventing OpsDHN1 from getting into the nucleus. Based on these results, the histidine rich motif is proposed as a targeting element for OpsDHN1 nuclear localization.

  19. Rtt107/Esc4 binds silent chromatin and DNA repair proteins using different BRCT motifs

    Directory of Open Access Journals (Sweden)

    Jockusch Rebecca A

    2006-11-01

    Full Text Available Abstract Background By screening a plasmid library for proteins that could cause silencing when targeted to the HMR locus in Saccharomyces cerevisiae, we previously reported the identification of Rtt107/Esc4 based on its ability to establish silent chromatin. In this study we aimed to determine the mechanism of Rtt107/Esc4 targeted silencing and also learn more about its biological functions. Results Targeted silencing by Rtt107/Esc4 was dependent on the SIR genes, which encode obligatory structural and enzymatic components of yeast silent chromatin. Based on its sequence, Rtt107/Esc4 was predicted to contain six BRCT motifs. This motif, originally identified in the human breast tumor suppressor gene BRCA1, is a protein interaction domain. The targeted silencing activity of Rtt107/Esc4 resided within the C-terminal two BRCT motifs, and this region of the protein bound to Sir3 in two-hybrid tests. Deletion of RTT107/ESC4 caused sensitivity to the DNA damaging agent MMS as well as to hydroxyurea. A two-hybrid screen showed that the N-terminal BRCT motifs of Rtt107/Esc4 bound to Slx4, a protein previously shown to be involved in DNA repair and required for viability in a strain lacking the DNA helicase Sgs1. Like SLX genes, RTT107ESC4 interacted genetically with SGS1; esc4Δ sgs1Δ mutants were viable, but exhibited a slow-growth phenotype and also a synergistic DNA repair defect. Conclusion Rtt107/Esc4 binds to the silencing protein Sir3 and the DNA repair protein Slx4 via different BRCT motifs, thus providing a bridge linking silent chromatin to DNA repair enzymes.

  20. ROMANIAN FOLKLORE MOTIFS IN FASHION DESIGN

    Directory of Open Access Journals (Sweden)

    MOCENCO Alexandra

    2014-05-01

    Full Text Available The traditional Romanian costume such as the entire popular art (architecture, woodcarvins, pottery etc. was born and lasted in our country since ancient times. Closely related to human existence, the traditional costume reflected over the years as reflected nowadays, the mentality and artistic conception of the people. Today the traditional Romanian costume became an inspiration source to the wholesale fashion production industry designers, both Romanian and international. Although the contemporary designers are working in accordance with a vision, using a wide area of styles, methods and current technology, they usually return to traditional techniques and ethnic folklore motifs, which converts and resize them, integrating them in their contemporary space. Adrian Oianu is a very appreciated Romanian designer who launched two collections inspired by his native’s country traditional costumes: “Suflecata pan’ la brau” (“Turned up ‘til the belt” and “Bucurie” (“Joy”. Dorin Negrau had as inspiration for his “Lost” collection the traditional costume from the Bihor region. Yves Saint Laurent had a collection inspired by the Romanian traditional flax blouses called “La blouse roumaine”. The paper presents the traditional Romanian values throw fashion collections. The research activity will create innovative concepts to support the garment industry in order to develop their own brand and to bring the design activities in Romania at an international level. The research was conducted during the initial stage of a project, financed through national founds, consisting in a documentary study on ethnographic characteristics of the popular costume from different regions of the country.

  1. Review article: The mountain motif in the plot of Matthew

    Directory of Open Access Journals (Sweden)

    Gert J. Volschenk

    2010-09-01

    Full Text Available This article reviewed T.L. Donaldson’s book, Jesus on the mountain: A study in Matthean theology, published in 1985 by JSOT Press, Sheffield, and focused on the mountain motif in the structure and plot of the Gospel of Matthew, in addition to the work of Donaldson on the mountain motif as a literary motif and as theological symbol. The mountain is a primary theological setting for Jesus’ ministry and thus is an important setting, serving as one of the literary devices by which Matthew structured and progressed his narrative. The Zion theological and eschatological significance and Second Temple Judaism serve as the historical and theological background for the mountain motif. The last mountain setting (Mt 28:16–20 is the culmination of the three theological themes in the plot of Matthew, namely Christology, ecclesiology and salvation history.

  2. Methods and statistics for combining motif match scores.

    Science.gov (United States)

    Bailey, T L; Gribskov, M

    1998-01-01

    Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http:/(/)www.sdsc.edu/MEME.

  3. CD3 gamma contains a phosphoserine-dependent di-leucine motif involved in down-regulation of the T cell receptor

    DEFF Research Database (Denmark)

    Dietrich, J; Hou, X; Wegener, A M

    1994-01-01

    -regulation of the TCR. Furthermore, analysis of a series of CD3 gamma truncation mutants indicated that in addition to S126 phosphorylation a motif C-terminal of S126 was required for TCR down-regulation. Point mutation analyses confirmed this observation and demonstrated that a membrane-proximal di-leucine motif (L131......, indicating that the TCR was down-regulated by endocytosis via clathrin coated pits. Based on the present results and previously published observations on intracellular receptor sorting, a general model for intracellular sorting of receptors containing di-leucine- or tyrosine-based motifs is proposed....

  4. BEAM web server: a tool for structural RNA motif discovery.

    Science.gov (United States)

    Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2018-03-15

    RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable to handle tens of thousands of RNA in input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated to a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure in a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.

  5. Characterizing Motif Dynamics of Electric Brain Activity Using Symbolic Analysis

    Directory of Open Access Journals (Sweden)

    Massimiliano Zanin

    2014-10-01

    Full Text Available Motifs are small recurring circuits of interactions which constitute the backbone of networked systems. Characterizing motif dynamics is therefore key to understanding the functioning of such systems. Here we propose a method to define and quantify the temporal variability and time scales of electroencephalogram (EEG motifs of resting brain activity. Given a triplet of EEG sensors, links between them are calculated by means of linear correlation; each pattern of links (i.e., each motif is then associated to a symbol, and its appearance frequency is analyzed by means of Shannon entropy. Our results show that each motif becomes observable with different coupling thresholds and evolves at its own time scale, with fronto-temporal sensors emerging at high thresholds and changing at fast time scales, and parietal ones at low thresholds and changing at slower rates. Finally, while motif dynamics differed across individuals, for each subject, it showed robustness across experimental conditions, indicating that it could represent an individual dynamical signature.

  6. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Full Text Available Abstract Background We consider the problem of identifying motifs, recurring or conserved patterns, in the biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1 improves search efficiency compared to existing algorithms, and (2 scales well with the size of alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA we observed reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobin families. Conclusions Our algorithm reduces computational complexity of the current motif finding algorithms and demonstrate strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  7. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.

  8. An experimental test of a fundamental food web motif.

    Science.gov (United States)

    Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

    2010-06-07

    Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.

  9. CAGEd-oPOSSUM: motif enrichment analysis from CAGE-derived TSSs.

    Science.gov (United States)

    Arenillas, David J; Forrest, Alistair R R; Kawaji, Hideya; Lassmann, Timo; Wasserman, Wyeth W; Mathelier, Anthony

    2016-09-15

    With the emergence of large-scale Cap Analysis of Gene Expression (CAGE) datasets from individual labs and the FANTOM consortium, one can now analyze the cis-regulatory regions associated with gene transcription at an unprecedented level of refinement. By coupling transcription factor binding site (TFBS) enrichment analysis with CAGE-derived genomic regions, CAGEd-oPOSSUM can identify TFs that act as key regulators of genes involved in specific mammalian cell and tissue types. The webtool allows for the analysis of CAGE-derived transcription start sites (TSSs) either provided by the user or selected from ∼1300 mammalian samples from the FANTOM5 project with pre-computed TFBS predicted with JASPAR TF binding profiles. The tool helps power insights into the regulation of genes through the study of the specific usage of TSSs within specific cell types and/or under specific conditions. The CAGEd-oPOSUM web tool is implemented in Perl, MySQL and Apache and is available at http://cagedop.cmmt.ubc.ca/CAGEd_oPOSSUM CONTACTS: anthony.mathelier@ncmm.uio.no or wyeth@cmmt.ubc.ca Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  10. Verification of the MOTIF code version 3.0

    International Nuclear Information System (INIS)

    Chan, T.; Guvanasen, V.; Nakka, B.W.; Reid, J.A.K.; Scheier, N.W.; Stanchell, F.W.

    1996-12-01

    As part of the Canadian Nuclear Fuel Waste Management Program (CNFWMP), AECL has developed a three-dimensional finite-element code, MOTIF (Model Of Transport In Fractured/ porous media), for detailed modelling of groundwater flow, heat transport and solute transport in a fractured rock mass. The code solves the transient and steady-state equations of groundwater flow, solute (including one-species radionuclide) transport, and heat transport in variably saturated fractured/porous media. The initial development was completed in 1985 (Guvanasen 1985) and version 3.0 was completed in 1986. This version is documented in detail in Guvanasen and Chan (in preparation). This report describes a series of fourteen verification cases which has been used to test the numerical solution techniques and coding of MOTIF, as well as demonstrate some of the MOTIF analysis capabilities. For each case the MOTIF solution has been compared with a corresponding analytical or independently developed alternate numerical solution. Several of the verification cases were included in Level 1 of the International Hydrologic Code Intercomparison Project (HYDROCOIN). The MOTIF results for these cases were also described in the HYDROCOIN Secretariat's compilation and comparison of results submitted by the various project teams (Swedish Nuclear Power Inspectorate 1988). It is evident from the graphical comparisons presented that the MOTIF solutions for the fourteen verification cases are generally in excellent agreement with known analytical or numerical solutions obtained from independent sources. This series of verification studies has established the ability of the MOTIF finite-element code to accurately model the groundwater flow and solute and heat transport phenomena for which it is intended. (author). 20 refs., 14 tabs., 32 figs

  11. Mechanisms of zero-lag synchronization in cortical motifs.

    Directory of Open Access Journals (Sweden)

    Leonardo L Gollo

    2014-04-01

    Full Text Available Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: Of these, the phenomenon of "dynamical relaying"--a mechanism that relies on a specific network motif--has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and comparison to dynamics on other motifs is lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair--a "resonance pair"--plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying from those that do not (such as the common driving triad. Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain.

  12. Crammed signaling motifs in the T-cell receptor.

    Science.gov (United States)

    Borroto, Aldo; Abia, David; Alarcón, Balbino

    2014-09-01

    Although the T cell antigen receptor (TCR) is long known to contain multiple signaling subunits (CD3γ, CD3δ, CD3ɛ and CD3ζ), their role in signal transduction is still not well understood. The presence of at least one immunoreceptor tyrosine-based activation motif (ITAM) in each CD3 subunit has led to the idea that the multiplication of such elements essentially serves to amplify signals. However, the evolutionary conservation of non-ITAM sequences suggests that each CD3 subunit is likely to have specific non-redundant roles at some stage of development or in mature T cell function. The CD3ɛ subunit is paradigmatic because in a relatively short cytoplasmic sequence (∼55 amino acids) it contains several docking sites for proteins involved in intracellular trafficking and signaling, proteins whose relevance in T cell activation is slowly starting to be revealed. In this review we will summarize our current knowledge on the signaling effectors that bind directly to the TCR and we will propose a hierarchy in their response to TCR triggering. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. DNA mutation motifs in the genes associated with inherited diseases.

    Directory of Open Access Journals (Sweden)

    Michal Růžička

    Full Text Available Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs rarely associated with mutations (coldspots and frequently associated with mutations (hotspots exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.

  14. Complete motif analysis of sequence requirements for translation initiation at non-AUG start codons.

    Science.gov (United States)

    Diaz de Arce, Alexander J; Noderer, William L; Wang, Clifford L

    2018-01-25

    The initiation of mRNA translation from start codons other than AUG was previously believed to be rare and of relatively low impact. More recently, evidence has suggested that as much as half of all translation initiation utilizes non-AUG start codons, codons that deviate from AUG by a single base. Furthermore, non-AUG start codons have been shown to be involved in regulation of expression and disease etiology. Yet the ability to gauge expression based on the sequence of a translation initiation site (start codon and its flanking bases) has been limited. Here we have performed a comprehensive analysis of translation initiation sites that utilize non-AUG start codons. By combining genetic-reporter, cell-sorting, and high-throughput sequencing technologies, we have analyzed the expression associated with all possible variants of the -4 to +4 positions of non-AUG translation initiation site motifs. This complete motif analysis revealed that 1) with the right sequence context, certain non-AUG start codons can generate expression comparable to that of AUG start codons, 2) sequence context affects each non-AUG start codon differently, and 3) initiation at non-AUG start codons is highly sensitive to changes in the flanking sequences. Complete motif analysis has the potential to be a key tool for experimental and diagnostic genomics. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Binding properties of SUMO-interacting motifs (SIMs) in yeast.

    Science.gov (United States)

    Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

    2015-03-01

    Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins is known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are yet poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrary, the stability of the parallel binding mode is more dependent on the sequence features of the SIM motif like the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.

  16. RegRNA: an integrated web server for identifying regulatory RNA motifs and elements

    OpenAIRE

    Huang, Hsi-Yuan; Chien, Chia-Hung; Jen, Kuan-Hua; Huang, Hsien-Da

    2006-01-01

    Numerous regulatory structural motifs have been identified as playing essential roles in transcriptional and post-transcriptional regulation of gene expression. RegRNA is an integrated web server for identifying the homologs of regulatory RNA motifs and elements against an input mRNA sequence. Both sequence homologs and structural homologs of regulatory RNA motifs can be recognized. The regulatory RNA motifs supported in RegRNA are categorized into several classes: (i) motifs in mRNA 5′-untra...

  17. Target motifs affecting natural immunity by a constitutive CRISPR-Cas system in Escherichia coli.

    Directory of Open Access Journals (Sweden)

    Cristóbal Almendros

    Full Text Available Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR and CRISPR associated (cas genes conform the CRISPR-Cas systems of various bacteria and archaea and produce degradation of invading nucleic acids containing sequences (protospacers that are complementary to repeat intervening spacers. It has been demonstrated that the base sequence identity of a protospacer with the cognate spacer and the presence of a protospacer adjacent motif (PAM influence CRISPR-mediated interference efficiency. By using an original transformation assay with plasmids targeted by a resident spacer here we show that natural CRISPR-mediated immunity against invading DNA occurs in wild type Escherichia coli. Unexpectedly, the strongest activity is observed with protospacer adjoining nucleotides (interference motifs that differ from the PAM both in sequence and location. Hence, our results document for the first time native CRISPR activity in E. coli and demonstrate that positions next to the PAM in invading DNA influence their recognition and degradation by these prokaryotic immune systems.

  18. De Novo Discovery of Structured ncRNA Motifs in Genomic Sequences

    DEFF Research Database (Denmark)

    Ruzzo, Walter L; Gorodkin, Jan

    2014-01-01

    De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphas...... on an approach based on the CMfinder CMfinder program as a case study. Applications to genomic screens for novel de novo structured ncRNA ncRNA s, including structured RNA elements in untranslated portions of protein-coding genes, are presented.......De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphasis...

  19. A Study on the Motif Pattern of Dark-Cloud Cover in the Securities

    Directory of Open Access Journals (Sweden)

    Long Jing

    2017-01-01

    Full Text Available Morphological analysis is the analysis and mining of the graphics formed of the securities price changes. Investors need to forecast the trend of future before buying and selling points, which can avoid great loss. Therefore, the analysis of motif pattern of K-line in the form of futures investment technology analysis is very significant. Based on the thoughts of short-term trend clustering, this paper proposes a method of detecting the motif pattern of Dark-Cloud Cover in stock time series by analysing stock historic data and K-line shape, in order to predict the stock market trends. And we prove the effectiveness and practicality of the method by a series of experimental analysis.

  20. Native characterization of nucleic acid motif thermodynamics via non-covalent catalysis

    Science.gov (United States)

    Wang, Chunyan; Bae, Jin H.; Zhang, David Yu

    2016-01-01

    DNA hybridization thermodynamics is critical for accurate design of oligonucleotides for biotechnology and nanotechnology applications, but parameters currently in use are inaccurately extrapolated based on limited quantitative understanding of thermal behaviours. Here, we present a method to measure the ΔG° of DNA motifs at temperatures and buffer conditions of interest, with significantly better accuracy (6- to 14-fold lower s.e.) than prior methods. The equilibrium constant of a reaction with thermodynamics closely approximating that of a desired motif is numerically calculated from directly observed reactant and product equilibrium concentrations; a DNA catalyst is designed to accelerate equilibration. We measured the ΔG° of terminal fluorophores, single-nucleotide dangles and multinucleotide dangles, in temperatures ranging from 10 to 45 °C. PMID:26782977

  1. Human telomeric DNA: G-quadruplex, i-motif and Watson–Crick double helix

    Science.gov (United States)

    Phan, Anh Tuân; Mergny, Jean-Louis

    2002-01-01

    Human telomeric DNA composed of (TTAGGG/CCCTAA)n repeats may form a classical Watson–Crick double helix. Each individual strand is also prone to quadruplex formation: the G-rich strand may adopt a G-quadruplex conformation involving G-quartets whereas the C-rich strand may fold into an i-motif based on intercalated C·C+ base pairs. Using an equimolar mixture of the telomeric oligonucleotides d[AGGG(TTAGGG)3] and d[(CCCTAA)3CCCT], we defined which structures existed and which would be the predominant species under a variety of experimental conditions. Under near-physiological conditions of pH, temperature and salt concentration, telomeric DNA was predominantly in a double-helix form. However, at lower pH values or higher temperatures, the G-quadruplex and/or the i-motif efficiently competed with the duplex. We also present kinetic and thermodynamic data for duplex association and for G-quadruplex/i-motif unfolding. PMID:12409451

  2. Identification of a novel immunoreceptor tyrosine-based activation motif-containing molecule, STAM2, by mass spectrometry and its involvement in growth factor and cytokine receptor signaling pathways

    DEFF Research Database (Denmark)

    Pandey, A; Fernandez, M M; Steen, H

    2000-01-01

    In an effort to clone novel tyrosine-phosphorylated substrates of the epidermal growth factor receptor, we have initiated an approach coupling affinity purification using anti-phosphotyrosine antibodies to mass spectrometry-based identification. Here, we report the identification of a signaling m...

  3. How pathogens use linear motifs to perturb host cell networks

    KAUST Repository

    Via, Allegra; Uyar, Bora; Brun, Christine; Zanzoni, Andreas

    2015-01-01

    Molecular mimicry is one of the powerful stratagems that pathogens employ to colonise their hosts and take advantage of host cell functions to guarantee their replication and dissemination. In particular, several viruses have evolved the ability to interact with host cell components through protein short linear motifs (SLiMs) that mimic host SLiMs, thus facilitating their internalisation and the manipulation of a wide range of cellular networks. Here we present convincing evidence from the literature that motif mimicry also represents an effective, widespread hijacking strategy in prokaryotic and eukaryotic parasites. Further insights into host motif mimicry would be of great help in the elucidation of the molecular mechanisms behind host cell invasion and the development of anti-infective therapeutic strategies.

  4. Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

    Science.gov (United States)

    Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

    2008-02-15

    KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database-implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.

  5. Selection against spurious promoter motifs correlates withtranslational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the {sigma}{sup 70} subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.

  6. Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

    Directory of Open Access Journals (Sweden)

    Ahmad A. Malik

    2017-05-01

    Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine. The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid, suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity.

  7. BayesMD: flexible biological modeling for motif discovery

    DEFF Research Database (Denmark)

    Tang, Man-Hung Eric; Krogh, Anders; Winther, Ole

    2008-01-01

    We present BayesMD, a Bayesian Motif Discovery model with several new features. Three different types of biological a priori knowledge are built into the framework in a modular fashion. A mixture of Dirichlets is used as prior over nucleotide probabilities in binding sites. It is trained on trans......We present BayesMD, a Bayesian Motif Discovery model with several new features. Three different types of biological a priori knowledge are built into the framework in a modular fashion. A mixture of Dirichlets is used as prior over nucleotide probabilities in binding sites. It is trained...

  8. The tomato DWD motif-containing protein DDI1 interacts with the CUL4–DDB1-based ubiquitin ligase and plays a pivotal role in abiotic stress responses

    International Nuclear Information System (INIS)

    Miao, Min; Zhu, Yunye; Qiao, Maiju; Tang, Xiaofeng; Zhao, Wei; Xiao, Fangming; Liu, Yongsheng

    2014-01-01

    Highlights: • We identify DDI1 as a DAMAGED DNA BINDING PROTEIN1 (DDB1)-interacting protein. • DDI1 interacts with the CUL4–DDB1-based ubiquitin ligase in the nucleus. • DDI1 plays a positive role in regulating abiotic stress response in tomato. - Abstract: CULLIN4(CUL4)–DAMAGED DNA BINDING PROTEIN1 (DDB1)-based ubiquitin ligase plays significant roles in multiple physiological processes via ubiquitination-mediated degradation of relevant target proteins. The DDB1–CUL4-associated factor (DCAF) acts as substrate receptor in the CUL4–DDB1 ubiquitin ligase complex and determines substrate specificity. In this study, we identified a tomato (Solanum lycopersicum) DDB1-interacting (DDI1) protein as a DCAF protein involved in response to abiotic stresses, including UV radiation, high salinity and osmotic stress. Co-immunoprecipitation and bimolecular fluorescence complementation assay indicated that DDI1 associates with CUL4–DDB1 in the nucleus. Quantitative RT-PCR analysis indicated the DDI1 gene is induced by salt, mannitol and UV-C treatment. Moreover, transgenic tomato plants with overexpression or knockdown of the DDI1 gene exhibited enhanced or attenuated tolerance to salt/mannitol/UV-C, respectively. Thus, our data suggest that DDI1 functions as a substrate receptor of the CUL4–DDB1 ubiquitin ligase, positively regulating abiotic stress response in tomato

  9. The tomato DWD motif-containing protein DDI1 interacts with the CUL4–DDB1-based ubiquitin ligase and plays a pivotal role in abiotic stress responses

    Energy Technology Data Exchange (ETDEWEB)

    Miao, Min [Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064 (China); School of Biotechnology and Food Engineering, Hefei University of Technology, Hefei 230009 (China); Department of Plant, Soil and Entomological Sciences, University of Idaho, Moscow, ID 83844-2339 (United States); Zhu, Yunye [School of Biotechnology and Food Engineering, Hefei University of Technology, Hefei 230009 (China); Qiao, Maiju [Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064 (China); Tang, Xiaofeng [Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064 (China); School of Biotechnology and Food Engineering, Hefei University of Technology, Hefei 230009 (China); Zhao, Wei [School of Biotechnology and Food Engineering, Hefei University of Technology, Hefei 230009 (China); Xiao, Fangming [Department of Plant, Soil and Entomological Sciences, University of Idaho, Moscow, ID 83844-2339 (United States); Liu, Yongsheng, E-mail: liuyongsheng1122@hfut.edu.cn [Ministry of Education Key Laboratory for Bio-resource and Eco-environment, College of Life Science, State Key Laboratory of Hydraulics and Mountain River Engineering, Sichuan University, Chengdu 610064 (China); School of Biotechnology and Food Engineering, Hefei University of Technology, Hefei 230009 (China)

    2014-08-08

    Highlights: • We identify DDI1 as a DAMAGED DNA BINDING PROTEIN1 (DDB1)-interacting protein. • DDI1 interacts with the CUL4–DDB1-based ubiquitin ligase in the nucleus. • DDI1 plays a positive role in regulating abiotic stress response in tomato. - Abstract: CULLIN4(CUL4)–DAMAGED DNA BINDING PROTEIN1 (DDB1)-based ubiquitin ligase plays significant roles in multiple physiological processes via ubiquitination-mediated degradation of relevant target proteins. The DDB1–CUL4-associated factor (DCAF) acts as substrate receptor in the CUL4–DDB1 ubiquitin ligase complex and determines substrate specificity. In this study, we identified a tomato (Solanum lycopersicum) DDB1-interacting (DDI1) protein as a DCAF protein involved in response to abiotic stresses, including UV radiation, high salinity and osmotic stress. Co-immunoprecipitation and bimolecular fluorescence complementation assay indicated that DDI1 associates with CUL4–DDB1 in the nucleus. Quantitative RT-PCR analysis indicated the DDI1 gene is induced by salt, mannitol and UV-C treatment. Moreover, transgenic tomato plants with overexpression or knockdown of the DDI1 gene exhibited enhanced or attenuated tolerance to salt/mannitol/UV-C, respectively. Thus, our data suggest that DDI1 functions as a substrate receptor of the CUL4–DDB1 ubiquitin ligase, positively regulating abiotic stress response in tomato.

  10. iELM—a web server to explore short linear motif-mediated interactions

    Science.gov (United States)

    Weatheritt, Robert J.; Jehl, Peter; Dinkel, Holger; Gibson, Toby J.

    2012-01-01

    The recent expansion in our knowledge of protein–protein interactions (PPIs) has allowed the annotation and prediction of hundreds of thousands of interactions. However, the function of many of these interactions remains elusive. The interactions of Eukaryotic Linear Motif (iELM) web server provides a resource for predicting the function and positional interface for a subset of interactions mediated by short linear motifs (SLiMs). The iELM prediction algorithm is based on the annotated SLiM classes from the Eukaryotic Linear Motif (ELM) resource and allows users to explore both annotated and user-generated PPI networks for SLiM-mediated interactions. By incorporating the annotated information from the ELM resource, iELM provides functional details of PPIs. This can be used in proteomic analysis, for example, to infer whether an interaction promotes complex formation or degradation. Furthermore, details of the molecular interface of the SLiM-mediated interactions are also predicted. This information is displayed in a fully searchable table, as well as graphically with the modular architecture of the participating proteins extracted from the UniProt and Phospho.ELM resources. A network figure is also presented to aid the interpretation of results. The iELM server supports single protein queries as well as large-scale proteomic submissions and is freely available at http://i.elm.eu.org. PMID:22638578

  11. Faster exact Markovian probability functions for motif occurrences: a DFA-only approach.

    Science.gov (United States)

    Ribeca, Paolo; Raineri, Emanuele

    2008-12-15

    The computation of the statistical properties of motif occurrences has an obviously relevant application: patterns that are significantly over- or under-represented in genomes or proteins are interesting candidates for biological roles. However, the problem is computationally hard; as a result, virtually all the existing motif finders use fast but approximate scoring functions, in spite of the fact that they have been shown to produce systematically incorrect results. A few interesting exact approaches are known, but they are very slow and hence not practical in the case of realistic sequences. We give an exact solution, solely based on deterministic finite-state automata (DFA), to the problem of finding the whole relevant part of the probability distribution function of a simple-word motif in a homogeneous (biological) sequence. Out of that, the z-value can always be computed, while the P-value can be obtained either when it is not too extreme with respect to the number of floating-point digits available in the implementation, or when the number of pattern occurrences is moderately low. In particular, the time complexity of the algorithms for Markov models of moderate order (0 manage to obtain an algorithm which is both easily interpretable and efficient. This approach can be used for exact statistical studies of very long genomes and protein sequences, as we illustrate with some examples on the scale of the human genome.

  12. Characterizing the binding motifs of 11 common human HLA‐DP and HLA‐DQ molecules using NNAlign

    DEFF Research Database (Denmark)

    Andreatta, Massimo; Nielsen, Morten

    2012-01-01

    based method NNAlign, we characterized the binding specificities of five HLA‐DP and six HLA‐DQ among the most frequent in the human population. The identified binding motifs showed an overall concurrence with earlier studies but revealed subtle differences. The DP molecules revealed a large overlap...

  13. SSTRAP: A computational model for genomic motif discovery ...

    African Journals Online (AJOL)

    Computational methods can potentially provide high-quality prediction of biological molecules such as DNA binding sites and Transcription factors and therefore reduce the time needed for experimental verification and challenges associated with experimental methods. These biological molecules or motifs have significant ...

  14. Identification of a Baeyer-Villiger monooxygenase sequence motif

    NARCIS (Netherlands)

    Fraaije, MW; Kamerbeek, NM; van Berkel, WJH; Janssen, DB; Kamerbeek, Nanne M.; Berkel, Willem J.H. van

    2002-01-01

    Baeyer-Villiger monooxygenases (BVMOs) form a distinct class of flavoproteins that catalyze the insertion of an oxygen atom in a C-C bond using dioxygen and NAD(P)H. Using newly characterized BVMO sequences, we have uncovered a BVMO-identifying sequence motif: FXGXXXRXXXW(P/D). Studies with

  15. Linear motif atlas for phosphorylation-dependent signaling

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Jensen, LJ; Diella, F

    2008-01-01

    bind to them remains a challenge. NetPhorest is an atlas of consensus sequence motifs that covers 179 kinases and 104 phosphorylation-dependent binding domains [Src homology 2 (SH2), phosphotyrosine binding (PTB), BRCA1 C-terminal (BRCT), WW, and 14-3-3]. The atlas reveals new aspects of signaling...

  16. [Cover motifs of the Tidsskrift. A 14-year cavalcade].

    Science.gov (United States)

    Nylenna, M

    1998-12-10

    In 1985 the Journal of the Norwegian Medical Association changed its cover policy, moving the table of contents inside the Journal and introducing cover illustrations. This article provides an analysis of all cover illustrations published over this 14-year period, 420 covers in all. There is a great variation in cover motifs and designs and a development towards more general motifs. The initial emphasis on historical and medical aspects is now less pronounced, while the use of works of art and nature motifs has increased, and the cover now more often has a direct bearing on the specific contents of the issue. Professor of medical history Oivind Larsen has photographed two thirds of the covers and contributed 95% of the inside essay-style reflections on the cover motif. Over the years, he has expanded the role of the historian of medicine disseminating knowledge to include that of the raconteur with a personal tone of voice. The Journal's covers are now one of its most characteristic features, emblematic of the Journal's ambition of standing for quality and timelessness vis-à-vis the news media, and of its aim of bridging the gap between medicine and the humanities.

  17. Perspektif Psikologi Humanistik Abraham Maslow dalam Meninjau Motif Pelaku Pembunuhan

    OpenAIRE

    Nurwatie, Azrina; Fauzia, Rahmi; Akbar, Sukma Noor

    2014-01-01

    Fokus penelitian ini diarahkan pada motif pelaku pembunuhan dengan meninjaunya melalui perspektif psikologi humanistik Abraham Maslow. Subyek dalam penelitian ini berjumlah dua orang narapidana yang berada di Lapas Kelas IIA Anak Martapura dengan kasus pembunuhan. Metode penelitian yang digunakan dalam penelitian ini adalah metode penelitian kualitatif. Teknik pengumpulan data melalui wawancara, observasi, dokumentasi,dan pemeriksaan psikologis (tes grafis). Berdasarkan hasil analisis data da...

  18. Genetic analysis of beta1 integrin "activation motifs" in mice

    DEFF Research Database (Denmark)

    Czuchra, Aleksandra; Meyer, Hannelore; Legate, Kyle R

    2006-01-01

    -null phenotype in vivo. Surprisingly, neither the substitution of the tyrosines with phenylalanine nor the aspartic acid with alanine resulted in an obvious defect. These data suggest that the NPXY motifs of the beta1 integrin tail are essential for beta1 integrin function, whereas tyrosine phosphorylation...

  19. Insights into the motif preference of APOBEC3 enzymes.

    Directory of Open Access Journals (Sweden)

    Diako Ebrahimi

    Full Text Available We used a multivariate data analysis approach to identify motifs associated with HIV hypermutation by different APOBEC3 enzymes. The analysis showed that APOBEC3G targets G mainly within GG, TG, TGG, GGG, TGGG and also GGGT. The G nucleotides flanked by a C at the 3' end (in +1 and +2 positions were indicated as disfavoured targets by APOBEC3G. The G nucleotides within GGGG were found to be targeted at a frequency much less than what is expected. We found that the infrequent G-to-A mutation within GGGG is not limited to the inaccessibility, to APOBEC3, of poly Gs in the central and 3'polypurine tracts (PPTs which remain double stranded during the HIV reverse transcription. GGGG motifs outside the PPTs were also disfavoured. The motifs GGAG and GAGG were also found to be disfavoured targets for APOBEC3. The motif-dependent mutation of G within the HIV genome by members of the APOBEC3 family other than APOBEC3G was limited to GA→AA changes. The results did not show evidence of other types of context dependent G-to-A changes in the HIV genome.

  20. An effective approach for annotation of protein families with low sequence similarity and conserved motifs: identifying GDSL hydrolases across the plant kingdom.

    Science.gov (United States)

    Vujaklija, Ivan; Bielen, Ana; Paradžik, Tina; Biđin, Siniša; Goldstein, Pavle; Vujaklija, Dušica

    2016-02-18

    The massive accumulation of protein sequences arising from the rapid development of high-throughput sequencing, coupled with automatic annotation, results in high levels of incorrect annotations. In this study, we describe an approach to decrease annotation errors of protein families characterized by low overall sequence similarity. The GDSL lipolytic family comprises proteins with multifunctional properties and high potential for pharmaceutical and industrial applications. The number of proteins assigned to this family has increased rapidly over the last few years. In particular, the natural abundance of GDSL enzymes reported recently in plants indicates that they could be a good source of novel GDSL enzymes. We noticed that a significant proportion of annotated sequences lack specific GDSL motif(s) or catalytic residue(s). Here, we applied motif-based sequence analyses to identify enzymes possessing conserved GDSL motifs in selected proteomes across the plant kingdom. Motif-based HMM scanning (Viterbi decoding-VD and posterior decoding-PD) and the here described PD/VD protocol were successfully applied on 12 selected plant proteomes to identify sequences with GDSL motifs. A significant number of identified GDSL sequences were novel. Moreover, our scanning approach successfully detected protein sequences lacking at least one of the essential motifs (171/820) annotated by Pfam profile search (PfamA) as GDSL. Based on these analyses we provide a curated list of GDSL enzymes from the selected plants. CLANS clustering and phylogenetic analysis helped us to gain a better insight into the evolutionary relationship of all identified GDSL sequences. Three novel GDSL subfamilies as well as unreported variations in GDSL motifs were discovered in this study. In addition, analyses of selected proteomes showed a remarkable expansion of GDSL enzymes in the lycophyte, Selaginella moellendorffii. Finally, we provide a general motif-HMM scanner which is easily accessible through

  1. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    Science.gov (United States)

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  2. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    Administrator

    2011-10-19

    Oct 19, 2011 ... ... gene expression values of controls and i x i y. 1 i y = 1 i y = −. 1. 2. { , ,..., , } i i i im i g. x x. x y. = 1. 2. 1. 2. , ,..., ,. , ,..., k i i i im. x x x. x x x x x. = =.

  3. Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

    Science.gov (United States)

    Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

    2015-06-01

    Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  4. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir; Alazmi, Meshari; Naser, Rayan Mohammad Mahmoud; Huser, Franceline; Momin, Afaque Ahmad Imtiyaz; Walkiewicz, Katarzyna Wiktoria; Canlas, Christian; Huser, Raphaë l; Ali, Amal J.; Merzaban, Jasmeen; Bajic, Vladimir B.; Gao, Xin; Arold, Stefan T.

    2018-01-01

    and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter

  5. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-01

    LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  6. Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs

    LENUS (Irish Health Repository)

    Casey, Fergal

    2011-08-22

    Abstract Background Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.

  7. Gene regulatory and signaling networks exhibit distinct topological distributions of motifs

    Science.gov (United States)

    Ferreira, Gustavo Rodrigues; Nakaya, Helder Imoto; Costa, Luciano da Fontoura

    2018-04-01

    The biological processes of cellular decision making and differentiation involve a plethora of signaling pathways and gene regulatory circuits. These networks in turn exhibit a multitude of motifs playing crucial parts in regulating network activity. Here we compare the topological placement of motifs in gene regulatory and signaling networks and observe that it suggests different evolutionary strategies in motif distribution for distinct cellular subnetworks.

  8. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    Science.gov (United States)

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with tandem CYR motifs, has endo- and exonuclease activities against abasic site and other types of base damage. PALF accumulates rapidly at single-strand breaks in a poly(ADP-ribose) polymerase 1 (PARP1)-dependent manner in human cells. Indeed, PALF interacts directly with PARP1 and is required for its activation and for cellular resistance to methyl-methane sulfonate. PALF also interacts directly with KU86, LIGASEIV and phosphorylated XRCC4 proteins and possesses endo/exonuclease activity at protruding DNA ends. Various treatments that produce double-strand breaks induce formation of PALF foci, which fully coincide with γH2AX foci. Thus, PALF and the CYR motif may play important roles in DNA repair of higher eukaryotes. PMID:17396150

  9. Motif distributions in phase-space networks for characterizing experimental two-phase flow patterns with chaotic features.

    Science.gov (United States)

    Gao, Zhong-Ke; Jin, Ning-De; Wang, Wen-Xu; Lai, Ying-Cheng

    2010-07-01

    The dynamics of two-phase flows have been a challenging problem in nonlinear dynamics and fluid mechanics. We propose a method to characterize and distinguish patterns from inclined water-oil flow experiments based on the concept of network motifs that have found great usage in network science and systems biology. In particular, we construct from measured time series phase-space complex networks and then calculate the distribution of a set of distinct network motifs. To gain insight, we first test the approach using time series from classical chaotic systems and find a universal feature: motif distributions from different chaotic systems are generally highly heterogeneous. Our main finding is that the distributions from experimental two-phase flows tend to be heterogeneous as well, suggesting the underlying chaotic nature of the flow patterns. Calculation of the maximal Lyapunov exponent provides further support for this. Motif distributions can thus be a feasible tool to understand the dynamics of realistic two-phase flow patterns.

  10. Core signalling motif displaying multistability through multi-state enzymes

    DEFF Research Database (Denmark)

    Feng, Song; Saez Cornellana, Meritxell; Wiuf, Carsten Henrik

    2016-01-01

    Bistability, and more generally multistability, is a key system dynamics feature enabling decision-making and memory in cells. Deciphering the molecular determinants of multistability is thus crucial for a better understanding of cellular pathways and their (re)engineering in synthetic biology....... Here, we show that a key motif found predominantly in eukaryotic signalling systems, namely a futile signalling cycle, can display bistability when featuring a two-state kinase. We provide necessary and sufficient mathematical conditions on the kinetic parameters of this motif that guarantee...... the existence of multiple steady states. These conditions foster the intuition that bistability arises as a consequence of competition between the two states of the kinase. Extending from this result, we find that increasing the number of kinase states linearly translates into an increase in the number...

  11. The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search

    DEFF Research Database (Denmark)

    Havgaard, Jakob Hull; Lyngsø, Rune B.; Gorodkin, Jan

    2005-01-01

    FOLDALIGN is a Sankoff-based algorithm for making structural alignments of RNA sequences. Here, we present a web server for making pairwise alignments between two RNA sequences, using the recently updated version of FOLDALIGN. The server can be used to scan two sequences for a common structural RNA...... motif of limited size, or the entire sequences can be aligned locally or globally. The web server offers a graphical interface, which makes it simple to make alignments and manually browse the results. the web server can be accessed at http://foldalign.kvl.dk...

  12. Neoanalysis, Orality, and Intertextuality: An Examination of Homeric Motif Transference

    Directory of Open Access Journals (Sweden)

    Jonathan Burgess

    2006-03-01

    Full Text Available In Homeric studies scholars have speculated on the influence of (non-surviving preHomeric material on the Iliad. This article expands this line of argument from an oralist perspective, with reference to modern intertextual theory. It concludes that preHomeric and nonHomeric motifs from oral traditions were transferred into the epic poem, creating an intertextually allusive poetics that would have been recognizable to an early Greek audience informed of mythological traditions.

  13. Motif Subscriber Menonton Channel YouTube Raditya Dika

    OpenAIRE

    Mellyaningsih, Adinda

    2016-01-01

    Penelitian ini dilakukan untuk mengetahui motif para subscriber dalam menonton channelYouTube Raditya Dika. Raditya Dika merupakan YouTuber Indonesia dengan jumlah subscriber terbanyak dan merupakan orang pertama di Indonesia yang mendapatkan penghargaan Certifies Award oleh YouTube. Peneliti menggunakan teori Uses and Gratification dengan empat indikator, yaitu hiburan dan relaksasi, hubungan antar pribadi, mencari informasi, dan persahabatan. Metode dalam penelitian ini adalah online survei...

  14. Exon silencing by UAGG motifs in response to neuronal excitation.

    Directory of Open Access Journals (Sweden)

    Ping An

    2007-02-01

    Full Text Available Alternative pre-mRNA splicing plays fundamental roles in neurons by generating functional diversity in proteins associated with the communication and connectivity of the synapse. The CI cassette of the NMDA R1 receptor is one of a variety of exons that show an increase in exon skipping in response to cell excitation, but the molecular nature of this splicing responsiveness is not yet understood. Here we investigate the molecular basis for the induced changes in splicing of the CI cassette exon in primary rat cortical cultures in response to KCl-induced depolarization using an expression assay with a tight neuron-specific readout. In this system, exon silencing in response to neuronal excitation was mediated by multiple UAGG-type silencing motifs, and transfer of the motifs to a constitutive exon conferred a similar responsiveness by gain of function. Biochemical analysis of protein binding to UAGG motifs in extracts prepared from treated and mock-treated cortical cultures showed an increase in nuclear hnRNP A1-RNA binding activity in parallel with excitation. Evidence for the role of the NMDA receptor and calcium signaling in the induced splicing response was shown by the use of specific antagonists, as well as cell-permeable inhibitors of signaling pathways. Finally, a wider role for exon-skipping responsiveness is shown to involve additional exons with UAGG-related silencing motifs, and transcripts involved in synaptic functions. These results suggest that, at the post-transcriptional level, excitable exons such as the CI cassette may be involved in strategies by which neurons mount adaptive responses to hyperstimulation.

  15. Multiple TPR motifs characterize the Fanconi anemia FANCG protein.

    Science.gov (United States)

    Blom, Eric; van de Vrugt, Henri J; de Vries, Yne; de Winter, Johan P; Arwert, Fré; Joenje, Hans

    2004-01-05

    The genome protection pathway that is defective in patients with Fanconi anemia (FA) is controlled by at least eight genes, including BRCA2. A key step in the pathway involves the monoubiquitylation of FANCD2, which critically depends on a multi-subunit nuclear 'core complex' of at least six FANC proteins (FANCA, -C, -E, -F, -G, and -L). Except for FANCL, which has WD40 repeats and a RING finger domain, no significant domain structure has so far been recognized in any of the core complex proteins. By using a homology search strategy comparing the human FANCG protein sequence with its ortholog sequences in Oryzias latipes (Japanese rice fish) and Danio rerio (zebrafish) we identified at least seven tetratricopeptide repeat motifs (TPRs) covering a major part of this protein. TPRs are degenerate 34-amino acid repeat motifs which function as scaffolds mediating protein-protein interactions, often found in multiprotein complexes. In four out of five TPR motifs tested (TPR1, -2, -5, and -6), targeted missense mutagenesis disrupting the motifs at the critical position 8 of each TPR caused complete or partial loss of FANCG function. Loss of function was evident from failure of the mutant proteins to complement the cellular FA phenotype in FA-G lymphoblasts, which was correlated with loss of binding to FANCA. Although the TPR4 mutant fully complemented the cells, it showed a reduced interaction with FANCA, suggesting that this TPR may also be of functional importance. The recognition of FANCG as a typical TPR protein predicts this protein to play a key role in the assembly and/or stabilization of the nuclear FA protein core complex.

  16. Efficient sequential and parallel algorithms for planted motif search.

    Science.gov (United States)

    Nicolae, Marius; Rajasekaran, Sanguthevar

    2014-01-31

    Motif searching is an important step in the detection of rare events occurring in a set of DNA or protein sequences. One formulation of the problem is known as (l,d)-motif search or Planted Motif Search (PMS). In PMS we are given two integers l and d and n biological sequences. We want to find all sequences of length l that appear in each of the input sequences with at most d mismatches. The PMS problem is NP-complete. PMS algorithms are typically evaluated on certain instances considered challenging. Despite ample research in the area, a considerable performance gap exists because many state of the art algorithms have large runtimes even for moderately challenging instances. This paper presents a fast exact parallel PMS algorithm called PMS8. PMS8 is the first algorithm to solve the challenging (l,d) instances (25,10) and (26,11). PMS8 is also efficient on instances with larger l and d such as (50,21). We include a comparison of PMS8 with several state of the art algorithms on multiple problem instances. This paper also presents necessary and sufficient conditions for 3 l-mers to have a common d-neighbor. The program is freely available at http://engr.uconn.edu/~man09004/PMS8/. We present PMS8, an efficient exact algorithm for Planted Motif Search. PMS8 introduces novel ideas for generating common neighborhoods. We have also implemented a parallel version for this algorithm. PMS8 can solve instances not solved by any previous algorithms.

  17. Aplikasi Ornamen Khas Maluku untuk Pengembangan Desain Motif Batik

    OpenAIRE

    Masiswo Masiswo; Vivin Atika

    2016-01-01

    ABSTRAKMaluku memiliki banyak ragam hias budaya warisan nilai leluhur berupa ornamen etnis yang merupakan kesenian dan keterampilan kerajinan. Hasil warisan tersebut sampai saat ini masih lestari hidup serta dapat dinikmati sebagai konsumsi rohani yang memuaskan manusia. Berkaitan dengan keberlangsungan nilai-nilai tradisi etnis yang berwujud pada ornamen-ornamen daerah Maluku, maka dikembangkan untuk kebutuhan manusia berupa motif batik pada kain. Pengembangan ornamen ini lebih menekankan pa...

  18. ROMANIAN TRADITIONAL MOTIF ELEMENT OF MODERNITY IN CLOTHING

    Directory of Open Access Journals (Sweden)

    ŞUTEU Marius Darius

    2017-05-01

    Full Text Available In this paper are presented the phases for improving from an aesthetic point of view a clothing item, the T-shirt for women using software design patterns, computerised graphics and textile different modern technologies including: industrial embroidery, digital printing, sublimation. In the first phase a documentation was prepared in the University of Oradea and traditional motif was selected from a collection comprising a number of Romanian traditional motifs from different parts of the country and were reintepreted and stylized whilst preserving the symbolism and color range specified to the area. For the styling phase was used CorelDraw vector graphics program that allows changing the shape, size and color of the drawings without affecting the identity of the pattern. The embroidery was done using BERNINA Embroidery Software Designer Plus Software. This software allows you to export the model to any domestic or industrial embroidery machine regardless of brand. Finally we observed the resistance of the printed and embroided model to various: elasticity, resistance to abrasion and a sensory analysis on the preservation of color. After testing we noticed the imprint resistance applied to the fabric, resulting in a quality that makes possible to keep the Romanian traditional motif from generation to generation.

  19. Insertion of tetracysteine motifs into dopamine transporter extracellular domains.

    Directory of Open Access Journals (Sweden)

    Deanna M Navaroli

    Full Text Available The neuronal dopamine transporter (DAT is a major determinant of extracellular dopamine (DA levels and is the primary target for a variety of addictive and therapeutic psychoactive drugs. DAT is acutely regulated by protein kinase C (PKC activation and amphetamine exposure, both of which modulate DAT surface expression by endocytic trafficking. In order to use live imaging approaches to study DAT endocytosis, methods are needed to exclusively label the DAT surface pool. The use of membrane impermeant, sulfonated biarsenic dyes holds potential as one such approach, and requires introduction of an extracellular tetracysteine motif (tetraCys; CCPGCC to facilitate dye binding. In the current study, we took advantage of intrinsic proline-glycine (Pro-Gly dipeptides encoded in predicted DAT extracellular domains to introduce tetraCys motifs into DAT extracellular loops 2, 3, and 4. [(3H]DA uptake studies, surface biotinylation and fluorescence microscopy in PC12 cells indicate that tetraCys insertion into the DAT second extracellular loop results in a functional transporter that maintains PKC-mediated downregulation. Introduction of tetraCys into extracellular loops 3 and 4 yielded DATs with severely compromised function that failed to mature and traffic to the cell surface. This is the first demonstration of successful introduction of a tetracysteine motif into a DAT extracellular domain, and may hold promise for use of biarsenic dyes in live DAT imaging studies.

  20. Organofluorine chemistry: synthesis and conformation of vicinal fluoromethylene motifs.

    Science.gov (United States)

    O'Hagan, David

    2012-04-20

    The C-F bond is the most polar bond in organic chemistry, and thus the bond has a relatively large dipole moment with a significant -ve charge density on the fluorine atom and correspondingly a +ve charge density on carbon. The electrostatic nature of the bond renders it the strongest one in organic chemistry. However, the fluorine atom itself is nonpolarizable, and thus, despite the charge localization on fluorine, it is a poor hydrogen-bonding acceptor. These properties of the C-F bond make it attractive in the design of nonviscous but polar organic compounds, with a polarity limited to influencing the intramolecular nature of the molecule and less so intermolecular interactions with the immediate environment. In this Perspective, the synthesis of aliphatic chains carrying multivicinal fluoromethylene motifs is described. It emerges that the dipoles of adjacent C-F bonds orientate relative to each other, and thus, individual diastereoisomers display different backbone carbon chain conformations. These conformational preferences recognize the influence of the well-known gauche effect associated with 1,2-difluoroethane but extend to considering 1,3-fluorine-fluorine dipolar repulsions. The synthesis of carbon chains carrying two, three, four, five, and six vicinal fluoromethylene motifs is described, with an emphasis on our own research contributions. These motifs obey almost predictable conformational behavior, and they emerge as candidates for inclusion in the design of performance organic molecules. © 2012 American Chemical Society

  1. THE MOTIF OF THE PRODIGAL SON IN IVAN TURGENEV'S NOVELS

    Directory of Open Access Journals (Sweden)

    Valentina Ivanovna Gabdullina

    2013-11-01

    Full Text Available The author questions the perception of Ivan Turgenev as a “non- Christian writer” and studies the problem of the prodigal son motif functioning in a series of his novels. In his novels, Turgenev pictured different phases of the archetypal story, originating from the Gospel parable of the prodigal son. In the novel Rudin he depicted the phase of spiritual wanderings of the hero who had lost touch with his native land — Russia. In his next novels (Home of the Gentry, Fathers and Sons and Smoke, after leading his hero in circles and sending him back to his paternal home, Turgenev reconstructs the model of human behavior, represented in the parable, thereby recognizing the immutability of the idea formalized in the Gospel. The motif of the return to Russian land gets its completion in Turgenev's last novel Virgin Soil, in which the author paradoxically connects the Westernist idea with the Gospel imperative. Solomin, the son of a deacon, sent by his wise father out to Europe “to get education”, studies in England, masters the European knowledge and returns back “to his native land” to establish his own business in inland Russia. Thus, a series of Turgenev's novels, in which he portrayed different phases of social life, are interlinked with the motif of the prodigal son, who is represented by novels' main characters.

  2. The city as a motif in Slovene youth literature

    Directory of Open Access Journals (Sweden)

    Milena Mileva Blažić

    2003-01-01

    Full Text Available The article presents the city as motif of Slovenian youth literature in four different periods, beginning in the first period of original Slovenian youth literature in the second half of the 19th century, second period in the first half of the 20th century, third period in the second half of the 20th century and after 1950, when significant books were produced in the field of short modern stories, emphasising on picture books and realistic narrative prose, and the fourth period after 1990. A discernable shift can be observed in the thirties of the 20th century, during the times of socialist realism. The most significant change occurred after 1960, when massive migration from rural to urban environments caused by industrialisation began. The motif of urban environment especially marked modern realistic narrative, coined problematic narrative after 1990, with its focus on issues of growing up in such environments. The city as motif or theme doesn’t appear only in realistic narrative, but since the early 20th century also in fantastic narrative, thus it dichotomically presents the image of real world in Slovenian youth realistic narrative.

  3. TOPDOM: database of conservatively located domains and motifs in proteins.

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E

    2016-09-01

    The TOPDOM database-originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins-has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. TOPDOM database is available at http://topdom.enzim.hu The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. tusnady.gabor@ttk.mta.hu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  4. Mapping of Minimal Motifs of B-Cell Epitopes on Human Zona Pellucida Glycoprotein-3

    Directory of Open Access Journals (Sweden)

    Wan-Xiang Xu

    2012-01-01

    Full Text Available The human zona pellucida glycoprotein-3 (hZP3 by virtue of its critical role during fertilization has been proposed as a promising candidate antigen to develop a contraceptive vaccine. In this direction, it is imperative to map minimal motifs of the B cell epitopes (BCEs so as to avoid ZP-specific oophoritogenic T cell epitopes (TCEs in the ZP3-based immunogens. In this study, based on known results of mapping marmoset and bonnet monkey ZP3 (mstZP3 and bmZP3, two predictable epitopes23–30  and  301–320 on hZP3 were first confirmed and five minimal motifs within four epitopes on hZP3 were defined using serum to recombinant hZP3a22–176 or hZP3b177–348 as well as a biosynthetic peptide strategy. These defined minimal motifs were QPLWLL23–28 for hZP323–30, MQVTDD103–108 for hZP393–110, EENW178–181 for hZP3172–190, as well as SNSWF306–310 and EGP313–315 for hZP3301–320, respectively. Furthermore, the antigenicity of two peptides for hZP3172–187 and hZP3301–315 and specificity of the antibody response to these peptides were also evaluated, which produced high-titer antibodies in immunized animals that were capable of reacting to ZP on human oocytes, r-hZP3b177–348 protein, as well as r-hZP3172–190, r-hZP3303–310, and r-hZP3313–320 epitope peptides fused with truncated GST188 protein.

  5. Interaction of Cu+ with cytosine and formation of i-motif-like C-M+-C complexes: alkali versus coinage metals

    NARCIS (Netherlands)

    Gao, J.; Berden, G.; Rodgers, M.T.; Oomens, J.

    2016-01-01

    The Watson-Crick structure of DNA is among the most well-known molecular structures of our time. However, alternative base-pairing motifs are also known to occur, often depending on base sequence, pH, or the presence of cations. Pairing of cytosine (C) bases induced by the sharing of a single proton

  6. Neighboring phosphoSer-Pro motifs in the undefined domain of IRAK1 impart bivalent advantage for Pin1 binding.

    Science.gov (United States)

    Rogals, Monique J; Greenwood, Alexander I; Kwon, Jeahoo; Lu, Kun Ping; Nicholson, Linda K

    2016-12-01

    The peptidyl prolyl isomerase Pin1 has two domains that are considered to be its binding (WW) and catalytic (PPIase) domains, both of which interact with phosphorylated Ser/Thr-Pro motifs. This shared specificity might influence substrate selection, as many known Pin1 substrates have multiple sequentially close phosphoSer/Thr-Pro motifs, including the protein interleukin-1 receptor-associated kinase-1 (IRAK1). The IRAK1 undefined domain (UD) contains two sets of such neighboring motifs (Ser131/Ser144 and Ser163/Ser173), suggesting possible bivalent interactions with Pin1. Using a series of NMR titrations with 15N-labeled full-length Pin1 (Pin1-FL), PPIase, or WW domain and phosphopeptides representing the Ser131/Ser144 and Ser163/Ser173 regions of IRAK1-UD, bivalent interactions were investigated. Binding studies using singly phosphorylated peptides showed that individual motifs displayed weak affinities (> 100 μm) for Pin1-FL and each isolated domain. Analysis of dually phosphorylated peptides binding to Pin1-FL showed that inclusion of bivalent states was necessary to fit the data. The resulting complex model and fitted parameters were applied to predict the impact of bivalent states at low micromolar concentrations, demonstrating significant affinity enhancement for both dually phosphorylated peptides (3.5 and 24 μm for peptides based on the Ser131/Ser144 and Ser163/Ser173 regions, respectively). The complementary technique biolayer interferometry confirmed the predicted affinity enhancement for a representative set of singly and dually phosphorylated Ser131/Ser144 peptides at low micromolar concentrations, validating model predictions. These studies provide novel insights regarding the complexity of interactions between Pin1 and activated IRAK1, and more broadly suggest that phosphorylation of neighboring Ser/Thr-Pro motifs in proteins might provide competitive advantage at cellular concentrations for engaging with Pin1. © 2016 Federation of European

  7. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

    Science.gov (United States)

    Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

    2018-03-01

    Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.

  8. Assessment of clusters of transcription factor binding sites in relationship to human promoter, CpG islands and gene expression

    Directory of Open Access Journals (Sweden)

    Sakaki Yoshiyuki

    2004-02-01

    Full Text Available Abstract Background Gene expression is regulated mainly by transcription factors (TFs that interact with regulatory cis-elements on DNA sequences. To identify functional regulatory elements, computer searching can predict TF binding sites (TFBS using position weight matrices (PWMs that represent positional base frequencies of collected experimentally determined TFBS. A disadvantage of this approach is the large output of results for genomic DNA. One strategy to identify genuine TFBS is to utilize local concentrations of predicted TFBS. It is unclear whether there is a general tendency for TFBS to cluster at promoter regions, although this is the case for certain TFBS. Also unclear is the identification of TFs that have TFBS concentrated in promoters and to what level this occurs. This study hopes to answer some of these questions. Results We developed the cluster score measure to evaluate the correlation between predicted TFBS clusters and promoter sequences for each PWM. Non-promoter sequences were used as a control. Using the cluster score, we identified a PWM group called PWM-PCP, in which TFBS clusters positively correlate with promoters, and another PWM group called PWM-NCP, in which TFBS clusters negatively correlate with promoters. The PWM-PCP group comprises 47% of the 199 vertebrate PWMs, while the PWM-NCP group occupied 11 percent. After reducing the effect of CpG islands (CGI against the clusters using partial correlation coefficients among three properties (promoter, CGI and predicted TFBS cluster, we identified two PWM groups including those strongly correlated with CGI and those not correlated with CGI. Conclusion Not all PWMs predict TFBS correlated with human promoter sequences. Two main PWM groups were identified: (1 those that show TFBS clustered in promoters associated with CGI, and (2 those that show TFBS clustered in promoters independent of CGI. Assessment of PWM matches will allow more positive interpretation of TFBS in

  9. Identification of novel conserved functional motifs across most Influenza A viral strains

    Directory of Open Access Journals (Sweden)

    El-Azab Iman

    2011-01-01

    Full Text Available Abstract Background Influenza A virus poses a continuous threat to global public health. Design of novel universal drugs and vaccine requires a careful analysis of different strains of Influenza A viral genome from diverse hosts and subtypes. We performed a systematic in silico analysis of Influenza A viral segments of all available Influenza A viral strains and subtypes and grouped them based on host, subtype, and years isolated, and through multiple sequence alignments we extrapolated conserved regions, motifs, and accessible regions for functional mapping and annotation. Results Across all species and strains 87 highly conserved regions (conservation percentage > = 90% and 19 functional motifs (conservation percentage = 100% were found in PB2, PB1, PA, NP, M, and NS segments. The conservation percentage of these segments ranged between 94 - 98% in human strains (the most conserved, 85 - 93% in swine strains (the most variable, and 91 - 94% in avian strains. The most conserved segment was different in each host (PB1 for human strains, NS for avian strains, and M for swine strains. Target accessibility prediction yielded 324 accessible regions, with a single stranded probability > 0.5, of which 78 coincided with conserved regions. Some of the interesting annotations in these regions included sites for protein-protein interactions, the RNA binding groove, and the proton ion channel. Conclusions The influenza virus has evolved to adapt to its host through variations in the GC content and conservation percentage of the conserved regions. Nineteen universal conserved functional motifs were discovered, of which some were accessible regions with interesting biological functions. These regions will serve as a foundation for universal drug targets as well as universal vaccine design.

  10. Pierced Lasso Bundles are a new class of knot-like motifs.

    Directory of Open Access Journals (Sweden)

    Ellinor Haglund

    2014-06-01

    Full Text Available A four-helix bundle is a well-characterized motif often used as a target for designed pharmaceutical therapeutics and nutritional supplements. Recently, we discovered a new structural complexity within this motif created by a disulphide bridge in the long-chain helical bundle cytokine leptin. When oxidized, leptin contains a disulphide bridge creating a covalent-loop through which part of the polypeptide chain is threaded (as seen in knotted proteins. We explored whether other proteins contain a similar intriguing knot-like structure as in leptin and discovered 11 structurally homologous proteins in the PDB. We call this new helical family class the Pierced Lasso Bundle (PLB and the knot-like threaded structural motif a Pierced Lasso (PL. In the current study, we use structure-based simulation to investigate the threading/folding mechanisms for all the PLBs along with three unthreaded homologs as the covalent loop (or lasso in leptin is important in folding dynamics and activity. We find that the presence of a small covalent loop leads to a mechanism where structural elements slipknot to thread through the covalent loop. Larger loops use a piercing mechanism where the free terminal plugs through the covalent loop. Remarkably, the position of the loop as well as its size influences the native state dynamics, which can impact receptor binding and biological activity. This previously unrecognized complexity of knot-like proteins within the helical bundle family comprises a completely new class within the knot family, and the hidden complexity we unraveled in the PLBs is expected to be found in other protein structures outside the four-helix bundles. The insights gained here provide critical new elements for future investigation of this emerging class of proteins, where function and the energetic landscape can be controlled by hidden topology, and should be take into account in ab initio predictions of newly identified protein targets.

  11. Canonical Bcl-2 motifs of the Na+/K+ pump revealed by the BH3 mimetic chelerythrine: early signal transducers of apoptosis?

    Science.gov (United States)

    Lauf, Peter K; Heiny, Judith; Meller, Jarek; Lepera, Michael A; Koikov, Leonid; Alter, Gerald M; Brown, Thomas L; Adragna, Norma C

    2013-01-01

    Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss) in human lens epithelial cells [LECs]. K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. CET inhibited NKP and NKCC by >90% (IC50 values ~35 and ~15 μM, respectively) without significant KCC activity change, and stimulated K+ loss by ~35% at 10-30 μM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN) was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet unrecognized signal transducers in the initial phases of apoptosis. CET

  12. Canonical Bcl-2 Motifs of the Na+/K+ Pump Revealed by the BH3 Mimetic Chelerythrine: Early Signal Transducers of Apoptosis?

    Directory of Open Access Journals (Sweden)

    Peter K. Lauf

    2013-02-01

    Full Text Available Background/Aims: Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss in human lens epithelial cells [LECs]. Methods: K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. Results: CET inhibited NKP and NKCC by >90% (IC50 values ∼35 and ∼15 µM, respectively without significant KCC activity change, and stimulated K+ loss by ∼35% at 10-30 µM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Conclusion: Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet

  13. Frequency patterns of T-cell exposed motifs in immunoglobulin heavy chain peptides presented by MHCs

    Directory of Open Access Journals (Sweden)

    Robert D. Bremel

    2014-10-01

    Full Text Available Immunoglobulins are highly diverse protein sequences that are processed and presented to T-cells by B-cells and other antigen presenting cells. We examined a large dataset of immunoglobulin heavy chain variable regions (IGHV to assess the diversity of T-cell exposed motifs (TCEM. TCEM comprise those amino acids in a MHC-bound peptide which face outwards, surrounded by the MHC histotope, and which engage the T-cell receptor. Within IGHV there is a distinct pattern of predicted MHC class II binding and a very high frequency of re-use of the TCEMs. The re-use frequency indicates that only a limited number of different cognate T-cells are required to engage many different clonal B-cells. The amino acids in each outward-facing TCEM are intercalated with the amino acids of inward-facing MHC groove-exposed motifs (GEM. Different GEM may have differing, allele-specific, MHC binding affinities. The intercalation of TCEM and GEM in a peptide allows for a vast combinatorial repertoire of epitopes, each eliciting a different response. Outcome of T-cell receptor binding is determined by overall signal strength, which is a function of the number of responding T-cells and the duration of engagement. Hence, the frequency of T-cell exposed motif re-use appears to be an important determinant of whether a T-cell response is stimulatory or suppressive. The frequency distribution of TCEMs implies that somatic hypermutation is followed by clonal expansion that develop along repeated pathways. The observations of TCEM and GEM derived from immunoglobulins suggest a relatively simple, yet powerful, mechanism to correlate T-cell polyspecificity, through re-use of TCEMs, with a very high degree of specificity achieved by combination with a diversity of GEMs. The frequency profile of TCEMs also points to an economical mechanism for maintaining T-cell memory, recall, and self-discrimination based on an endogenously generated profile of motifs.

  14. Analysis of a conserved RGE/RGD motif in HCV E2 in mediating entry

    Directory of Open Access Journals (Sweden)

    Rong Lijun

    2009-01-01

    Full Text Available Abstract Background Hepatitis C virus (HCV encodes two transmembrane glycoproteins E1 and E2 which form a heterodimer. E1 is believed to mediate fusion while E2 has been shown to bind cellular receptors. It is clear that HCV uses a multi-receptor complex to gain entry into susceptible cells, however key elements of this complex remain elusive. In this study, the role of a highly conserved RGE/RGD motif of HCV E2 glycoprotein in viral entry was examined. The effect of each substitution mutation in this motif was tested by challenging susceptible cell lines with mutant HCV E1E2 pseudotyped viruses generated using a lentiviral system (HCVpp. In addition to assaying infectivity, producer cell expression and HCVpp incorporation of HCV E2 proteins, CD81 binding profiles, and conformation of mutants were examined. Results Based on these characteristics, mutants either displayed wt characteristics (high infectivity [≥ 90% of wt HCVpp], CD81 binding, E1E2 expression, and incorporation into viral particles and proper conformation or very low infectivity (≤ 20% of wt HCVpp. Only amino acid substitutions of the 3rd position (D or E resulted in wt characteristics as long as the negative charge was maintained or a neutral alanine was introduced. A change in charge to a positive lysine, disrupted HCVpp infectivity at this position. Conclusion Although most amino acid substitutions within this conserved motif displayed greatly reduced HCVpp infectivity, they retained soluble CD81 binding, proper E2 conformation, and incorporation into HCVpp. Our results suggest that although RGE/D is a well-defined integrin binding motif, in this case the role of these three hyperconserved amino acids does not appear to be integrin binding. As the extent of conservation of this region extends well beyond these three amino acids, we speculate that this region may play an important role in the structure of HCV E2 or in mediating the interaction with other factor(s during

  15. Biomimetic trapping cocktail to screen reactive metabolites: use of an amino acid and DNA motif mixture as light/heavy isotope pairs differing in mass shift.

    Science.gov (United States)

    Hosaka, Shuto; Honda, Takuto; Lee, Seon Hwa; Oe, Tomoyuki

    2018-06-01

    Candidate drugs that can be metabolically transformed into reactive electrophilic products, such as epoxides, quinones, and nitroso compounds, are of special concern because subsequent covalent binding to bio-macromolecules can cause adverse drug reactions, such as allergic reactions, hepatotoxicity, and genotoxicity. Several strategies have been reported for screening reactive metabolites, such as a covalent binding assay with radioisotope-labeled drugs and a trapping method followed by LC-MS/MS analyses. Of these, a trapping method using glutathione is the most common, especially at the early stage of drug development. However, the cysteine of glutathione is not the only nucleophilic site in vivo; lysine, histidine, arginine, and DNA bases are also nucleophilic. Indeed, the glutathione trapping method tends to overlook several types of reactive metabolites, such as aldehydes, acylglucuronides, and nitroso compounds. Here, we introduce an alternate way for screening reactive metabolites as follows: A mixture of the light and heavy isotopes of simplified amino acid motifs and a DNA motif is used as a biomimetic trapping cocktail. This mixture consists of [ 2 H 0 ]/[ 2 H 3 ]-1-methylguanidine (arginine motif, Δ 3 Da), [ 2 H 0 ]/[ 2 H 4 ]-2-mercaptoethanol (cysteine motif, Δ 4 Da), [ 2 H 0 ]/[ 2 H 5 ]-4-methylimidazole (histidine motif, Δ 5 Da), [ 2 H 0 ]/[ 2 H 9 ]-n-butylamine (lysine motif, Δ 9 Da), and [ 13 C 0 , 15 N 0 ]/[ 13 C 1 , 15 N 2 ]-2'-deoxyguanosine (DNA motif, Δ 3 Da). Mass tag triggered data-dependent acquisition is used to find the characteristic doublet peaks, followed by specific identification of the light isotope peak using MS/MS. Forty-two model drugs were examined using an in vitro microsome experiment to validate the strategy. Graphical abstract Biomimetic trapping cocktail to screen reactive metabolites.

  16. Role of NH2-terminal hydrophobic motif in the subcellular localization of ATP-binding cassette protein subfamily D: Common features in eukaryotic organisms

    International Nuclear Information System (INIS)

    Lee, Asaka; Asahina, Kota; Okamoto, Takumi; Kawaguchi, Kosuke; Kostsin, Dzmitry G.; Kashiwayama, Yoshinori; Takanashi, Kojiro; Yazaki, Kazufumi; Imanaka, Tsuneo; Morita, Masashi

    2014-01-01

    Highlights: • ABCD proteins classifies based on with or without NH 2 -terminal hydrophobic segment. • The ABCD proteins with the segment are targeted peroxisomes. • The ABCD proteins without the segment are targeted to the endoplasmic reticulum. • The role of the segment in organelle targeting is conserved in eukaryotic organisms. - Abstract: In mammals, four ATP-binding cassette (ABC) proteins belonging to subfamily D have been identified. ABCD1–3 possesses the NH 2 -terminal hydrophobic region and are targeted to peroxisomes, while ABCD4 lacking the region is targeted to the endoplasmic reticulum (ER). Based on hydropathy plot analysis, we found that several eukaryotes have ABCD protein homologs lacking the NH 2 -terminal hydrophobic segment (H0 motif). To investigate whether the role of the NH 2 -terminal H0 motif in subcellular localization is conserved across species, we expressed ABCD proteins from several species (metazoan, plant and fungi) in fusion with GFP in CHO cells and examined their subcellular localization. ABCD proteins possessing the NH 2 -terminal H0 motif were localized to peroxisomes, while ABCD proteins lacking this region lost this capacity. In addition, the deletion of the NH 2 -terminal H0 motif of ABCD protein resulted in their localization to the ER. These results suggest that the role of the NH 2 -terminal H0 motif in organelle targeting is widely conserved in living organisms

  17. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  18. Indonesian Traditional Toys and the Development of Batik Motifs

    Directory of Open Access Journals (Sweden)

    Bagus Indrayana

    2016-06-01

    Full Text Available There is a wide array of traditional toys in Indonesia. In the past, traditional toys played an important role for skill and creativity development of children. Today, the position of traditional toys in the society is displaced by toys from large-scale manufacturers. Given the critical role of traditional toys for children’s motoric and social development, there is a need to develop media that can be used to promote these traditional products and strengthen their position in the public. We propose to use Batik as a way to effectively disseminate and promote traditional toys to the general public. Apart from this, using traditional toys to create new Batik motifs can have an economic value for the producers of Batik, promote Indonesian products and enrich the Indonesian Batik. This study aims to explore the variety of traditional toys, mainly from Klaten and Magelang, in the Central Java province of Indonesia, and use them as the basis for the development of Batik motif creation. This study used Trilogi Keseimbangan (or Harmony Trilogy aesthetic theory analytical approach that explains the creation of craft consists of the following phases: exploration, design, and materialization. The creation method in this study adopts Tiga Tahap Enam Langkah (Three Phases, Six Steps method offered in the theory. The finding in the field found that the traditional toys material used in Klaten and Magelang, mostly made from waste wood, plywood, and zinc. The manufacturing process is done manually by two or three craftsmen using a simple technology. The traditional toys are designed by the artisans mostly, although there may be designs from the clients. In addition, we also found that the traditional toys have never been used as a Batik motif. The traditional toys Batik motif presented in this work is researcher’s design. For the purposes of this study, we first research the variety of traditional toys available in the market today in Indonesia. We look

  19. Dimensionality of social networks using motifs and eigenvalues.

    Directory of Open Access Journals (Sweden)

    Anthony Bonato

    Full Text Available We consider the dimensionality of social networks, and develop experiments aimed at predicting that dimension. We find that a social network model with nodes and links sampled from an m-dimensional metric space with power-law distributed influence regions best fits samples from real-world networks when m scales logarithmically with the number of nodes of the network. This supports a logarithmic dimension hypothesis, and we provide evidence with two different social networks, Facebook and LinkedIn. Further, we employ two different methods for confirming the hypothesis: the first uses the distribution of motif counts, and the second exploits the eigenvalue distribution.

  20. Gentamicin binds to the megalin receptor as a competitive inhibitor using the common ligand binding motif of complement type repeats

    DEFF Research Database (Denmark)

    Dagil, Robert; O'Shea, Charlotte; Nykjær, Anders

    2013-01-01

    megalin and investigated its interaction with gentamicin. Using NMR titration data in HADDOCK, we have generated a three-dimensional model describing the complex between megalin and gentamicin. Gentamicin binds to megalin with low affinity and exploits the common ligand binding motif previously described...... to megalin is highly similar to gentamicin binding to calreticulin. We discuss the impact of this novel insight for the future structure-based design of gentamicin antagonists....

  1. RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

    Science.gov (United States)

    Castro-Mondragon, Jaime Abraham; Jaeger, Sébastien; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

    2017-07-27

    Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. The ARTT motif and a unified structural understanding of substraterecognition in ADP ribosylating bacterial toxins and eukaryotic ADPribosyltransferases

    Energy Technology Data Exchange (ETDEWEB)

    Han, S.; Tainer, J.A.

    2001-08-01

    ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT

  3. Fibrinogen Motif Discriminates Platelet and Cell Capture in Peptide-Modified Gold Micropore Arrays.

    Science.gov (United States)

    Adamson, Kellie; Spain, Elaine; Prendergast, Una; Moran, Niamh; Forster, Robert J; Keyes, Tia E

    2018-01-16

    Human blood platelets and SK-N-AS neuroblastoma cancer-cell capture at spontaneously adsorbed monolayers of fibrinogen-binding motifs, GRGDS (generic integrin adhesion), HHLGGAKQAGDV (exclusive to platelet integrin α IIb β 3 ), or octanethiol (adhesion inhibitor) at planar gold and ordered 1.6 μm diameter spherical cap gold cavity arrays were compared. In all cases, arginine/glycine/aspartic acid (RGD) promoted capture, whereas alkanethiol monolayers inhibited adhesion. Conversely only platelets adhered to alanine/glycine/aspartic acid (AGD)-modified surfaces, indicating that the AGD motif is recognized preferentially by the platelet-specific integrin, α IIb β 3 . Microstructuring of the surface effectively eliminated nonspecific platelet/cell adsorption and dramatically enhanced capture compared to RGD/AGD-modified planar surfaces. In all cases, adhesion was reversible. Platelets and cells underwent morphological change on capture, the extent of which depended on the topography of the underlying substrate. This work demonstrates that both the nature of the modified interface and its underlying topography influence the capture of cancer cells and platelets. These insights may be useful in developing cell-based cancer diagnostics as well as in identifying strategies for the disruption of platelet cloaks around circulating tumor cells.

  4. Romanian traditional motif - element of modernity in clothing

    Science.gov (United States)

    Doble, L.; Stan, O.; Suteu, M. D.; Albu, A.; Bohm, G.; Tsatsarou-Michalaki, A.; Gialinou, E.

    2017-10-01

    In this paper are presented the phases for improving from an aesthetic point of view a clothing item, the jacket respectively, with a straight cut for women using software design patterns, computerised graphics and textile different modern technologies including: industrial embroidery, digital printing, sublimation. In the first phase a documentation was prepared in the Ethnographic Museum of Transylvania from Cluj Napoca where more traditional motifs were selected specific to Transylvania etnographic region and were reintepreted and stylized whilst preserving the symbolism and color range specified to the area. For the styling phase was used CorelDraw vector graphics program that allows changing the shape, size and color of the drawings without affecting the identity of the pattern. In the patterns design phase Gemini CAD software was used and for the modeling and model development Optitex software was used. The part for garnishing the model was performed using Embrodery machine software reproducing the stylized motif identically. In order to obtain a significantly improved aesthetic look and an added artistic value the pattern chosen for the jacket was done using a combination of modern textile technologies. This has allowed the realization of a particular texture on the surface of the designed product, demonstrating that traditional patterns can be reintepreted in modern clothing

  5. THE MOTIF OF THE SECOND COMING IN RUSSIAN FANTASTIC FICTION

    Directory of Open Access Journals (Sweden)

    Tatyana I. Khoruzhenko

    2017-06-01

    Full Text Available The motif of the Second Coming of Christ takes a special place in Russian fantastic fiction at the turn of the millennium. In the recent decades allusions to the Gospel topic appears in increasing frequency in the genre of fantasy. The aim of the given article was to analyze the peculiarities of the depiction of the subject of Advent in Russian fantastic fiction. As the basis for the research the novels of Y. Voznesenskaya, N. Perumov, V. Khlumov, S. Lukyanenko and T. Ustimenko are of particular interest. The Advent motif appears in the story line of each of the novels in question. Though, the attitude of the authors to the image of the Savior and his second coming to the world fluctuates: from a respectful expectation (Y. Voznesenskaya, T. Ustimenko, S. Lukyanenko to the depiction of the Savior as a monster (N. Perumov. The possibility of an ambivalent interpretation of the Savior is the eloquent evidence of desacralization of this image. The profaning of the sacred is one of the tendencies of the modern popular culture. The genre of fantastic fiction, as a product of mass culture, has caught this trend quite precisely.

  6. Regulation of amyloid precursor protein processing by its KFERQ motif.

    Science.gov (United States)

    Park, Ji-Seon; Kim, Dong-Hou; Yoon, Seung-Yong

    2016-06-01

    Understanding of trafficking, processing, and degradation mechanisms of amyloid precursor protein (APP) is important because APP can be processed to produce β-amyloid (Aβ), a key pathogenic molecule in Alzheimer's disease (AD). Here, we found that APP contains KFERQ motif at its C-terminus, a consensus sequence for chaperone-mediated autophagy (CMA) or microautophagy which are another types of autophagy for degradation of pathogenic molecules in neurodegenerative diseases. Deletion of KFERQ in APP increased C-terminal fragments (CTFs) and secreted N-terminal fragments of APP and kept it away from lysosomes. KFERQ deletion did not abolish the interaction of APP or its cleaved products with heat shock cognate protein 70 (Hsc70), a protein necessary for CMA or microautophagy. These findings suggest that KFERQ motif is important for normal processing and degradation of APP to preclude the accumulation of APP-CTFs although it may not be important for CMA or microautophagy. [BMB Reports 2016; 49(6): 337-342].

  7. The heptanucleotide motif GAGACGC is a key component of a cis-acting promoter element that is critical for SnSAG1 expression in Sarcocystis neurona.

    Science.gov (United States)

    Gaji, Rajshekhar Y; Howe, Daniel K

    2009-07-01

    The apicomplexan parasite Sarcocystis neurona undergoes a complex process of intracellular development, during which many genes are temporally regulated. The described study was undertaken to begin identifying the basic promoter elements that control gene expression in S. neurona. Sequence analysis of the 5'-flanking region of five S. neurona genes revealed a conserved heptanucleotide motif GAGACGC that is similar to the WGAGACG motif described upstream of multiple genes in Toxoplasma gondii. The promoter region for the major surface antigen gene SnSAG1, which contains three heptanucleotide motifs within 135 bases of the transcription start site, was dissected by functional analysis using a dual luciferase reporter assay. These analyses revealed that a minimal promoter fragment containing all three motifs was sufficient to drive reporter molecule expression, with the presence and orientation of the 5'-most heptanucleotide motif being absolutely critical for promoter function. Further studies should help to identify additional sequence elements important for promoter function and for controlling gene expression during intracellular development by this apicomplexan pathogen.

  8. A CACGTG motif of the Antirrhinum majus chalcone synthase promoter is recognized by an evolutionarily conserved nuclear protein

    International Nuclear Information System (INIS)

    Staiger, D.; Kaulen, H.; Schell, J.

    1989-01-01

    In the chalcone synthase gene of Antirrhinum majus (snapdragon), 150 base pairs of the 5' flanking region contain cis-acting signals for UV light-induced expression. A nuclear factor, designated CG-1, specifically recognizes a hexameric motif with internal dyad symmetry, CACGTG, located within this light-responsive sequence. Binding of CG-1 is influenced by C-methylation of the CpG dinucleotide in the recognition sequence. CG-1 is a factor found in a variety of dicotyledonous plant species including Nicotiana tabacum, A. majus, Petunia hybrida, Arabidopsis thaliana, and Glycine max. CACGTG motifs contained within trans-acting factor recognition sites in various other plant promoters can interact with CG-1. In addition, the binding site of the human adenovirus major late transcription factor USF can compete for CG-1 binding to the chalcone synthase promoter. This suggests an evolutionary conservation of trans-acting factor recognition sites involved in divergent mechanisms of gene control. (author)

  9. Systematic analysis of phosphotyrosine antibodies recognizing single phosphorylated EPIYA-motifs in CagA of Western-type Helicobacter pylori strains.

    Directory of Open Access Journals (Sweden)

    Judith Lind

    Full Text Available The clinical outcome of Helicobacter pylori infections is determined by multiple host-pathogen interactions that may develop to chronic gastritis, and sometimes peptic ulcers or gastric cancer. Highly virulent strains encode a type IV secretion system (T4SS that delivers the effector protein CagA into gastric epithelial cells. Translocated CagA undergoes tyrosine phosphorylation at EPIYA-sequence motifs, called A, B and C in Western-type strains, by members of the oncogenic Src and Abl host kinases. Phosphorylated EPIYA-motifs mediate interactions of CagA with host signaling factors--in particular various SH2-domain containing human proteins--thereby hijacking multiple downstream signaling cascades. Observations of tyrosine-phosphorylated CagA are mainly based on the use of commercial phosphotyrosine antibodies, which originally were selected to detect phosphotyrosines in mammalian proteins. Systematic studies of phosphorylated EPIYA-motif detection by the different antibodies would be very useful, but are not yet available. To address this issue, we synthesized phospho- and non-phosphopeptides representing each predominant Western CagA EPIYA-motif, and determined the recognition patterns of seven different phosphotyrosine antibodies in Western blots, and also performed infection studies with diverse representative Western H. pylori strains. Our results show that a total of 9-11 amino acids containing the phosphorylated EPIYA-motifs are necessary and sufficient for specific detection by these antibodies, but revealed great variability in sequence recognition. Three of the antibodies recognized phosphorylated EPIYA-motifs A, B and C similarly well; whereas preferential binding to phosphorylated motif A and motifs A and C was found with two and one antibodies, respectively, and the seventh anti-phosphotyrosine antibody did not recognize any phosphorylated EPIYA-motif. Controls showed that none of the antibodies recognized the corresponding non

  10. I-Ad-binding peptides derived from unrelated protein antigens share a common structural motif

    DEFF Research Database (Denmark)

    Sette, A; Buus, S; Colon, S

    1988-01-01

    on the I-Ad binding of the immunogenic peptide OVA 323-339. The results obtained demonstrated the very permissive nature of Ag-Ia interaction. We also showed that unrelated peptides that are good I-Ad binders share a common structural motif and speculated that recognition of such motifs could represent...... that I-Ad molecules recognize a large library of Ag by virtue of common structural motifs present in peptides derived from phylogenetically unrelated proteins....

  11. Lucky Motifs in Chinese Folk Art: Interpreting Paper-cut from Chinese Shaanxi

    OpenAIRE

    Xuxiao WANG

    2013-01-01

    Paper-cut is not simply a form of traditional Chinese folk art. Lucky motifs developed in paper-cut certainly acquired profound cultural connotations. As paper-cut is a time-honoured skill across the nation, interpreting those motifs requires cultural receptiveness and anthropological sensitivity. The author of this article analyzes examples of paper-cut from Northern Shaanxi, China, to identify the cohesive motifs and explore the auspiciousness of the specific concepts of Fu, Lu, Shou, Xi. T...

  12. The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.

    Science.gov (United States)

    Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko

    2013-07-01

    AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.

  13. Synthesis of most polyene natural product motifs using just twelve building blocks and one coupling reaction

    Science.gov (United States)

    Woerly, Eric M.; Roy, Jahnabi; Burke, Martin D.

    2014-01-01

    The inherent modularity of polypeptides, oligonucleotides, and oligosaccharides has been harnessed to achieve generalized building block-based synthesis platforms. Importantly, like these other targets, most small molecule natural products are biosynthesized via iterative coupling of bifunctional building blocks. This suggests that many small molecules also possess inherent modularity commensurate with systematic building block-based construction. Supporting this hypothesis, here we report that the polyene motifs found in >75% of all known polyene natural products can be synthesized using just 12 building blocks and one coupling reaction. Using the same general retrosynthetic algorithm and reaction conditions, this platform enabled the synthesis of a wide range of polyene frameworks covering all of this natural product chemical space, and first total syntheses of the polyene natural products asnipyrone B, physarigin A, and neurosporaxanthin β-D-glucopyranoside. Collectively, these results suggest the potential for a more generalized approach for making small molecules in the laboratory. PMID:24848233

  14. Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

    Science.gov (United States)

    Gade, Chandrasekhar Reddy; Sharma, Nagendra K

    2017-12-15

    This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNAi-motif structure with DNA cytosine pentamer (dC 5 ) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  15. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    Full Text Available A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C 3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X , using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X , in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level, and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R . We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions. This property is true for all cardinalities of X motifs (from 4 to 20 and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non- X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  16. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first

  17. Purification and functional motifs of the recombinant ATPase of orf virus.

    Science.gov (United States)

    Lin, Fong-Yuan; Chan, Kun-Wei; Wang, Chi-Young; Wong, Min-Liang; Hsu, Wei-Li

    2011-10-01

    Our previous study showed that the recombinant ATPase encoded by the A32L gene of orf virus displayed ATP hydrolysis activity as predicted from its amino acids sequence. This viral ATPase contains four known functional motifs (motifs I-IV) and a novel AYDG motif; they are essential for ATP hydrolysis reaction by binding ATP and magnesium ions. The motifs I and II correspond with the Walker A and B motifs of the typical ATPase, respectively. To examine the biochemical roles of these five conserved motifs, recombinant ATPases of five deletion mutants derived from the Taiping strain were expressed and purified. Their ATPase functions were assayed and compared with those of two wild type strains, Taiping and Nantou isolated in Taiwan. Our results showed that deletions at motifs I-III or IV exhibited lower activity than that of the wild type. Interestingly, deletion of AYDG motif decreased the ATPase activity more significantly than those of motifs I-IV deletions. Divalent ions such as magnesium and calcium were essential for ATPase activity. Moreover, our recombinant proteins of orf virus also demonstrated GTPase activity, though weaker than the original ATPase activity. Copyright © 2011 Elsevier Inc. All rights reserved.

  18. Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.

    Science.gov (United States)

    Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J

    2017-05-30

    We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.

  19. Sulfur-induced structural motifs on copper and gold surfaces

    Energy Technology Data Exchange (ETDEWEB)

    Walen, Holly [Iowa State Univ., Ames, IA (United States)

    2016-01-01

    The interaction of sulfur with copper and gold surfaces plays a fundamental role in important phenomena that include coarsening of surface nanostructures, and self-assembly of alkanethiols. Here, we identify and analyze unique sulfur-induced structural motifs observed on the low-index surfaces of these two metals. We seek out these structures in an effort to better understand the fundamental interactions between these metals and sulfur that lends to the stability and favorability of metal-sulfur complexes vs. chemisorbed atomic sulfur. The experimental observations presented here—made under identical conditions—together with extensive DFT analyses, allow comparisons and insights into factors that favor the existence of metal-sulfur complexes, vs. chemisorbed atomic sulfur, on metal terraces. We believe this data will be instrumental in better understanding the complex phenomena occurring between the surfaces of coinage metals and sulfur.

  20. Sequential dynamics in the motif of excitatory coupled elements

    Science.gov (United States)

    Korotkov, Alexander G.; Kazakov, Alexey O.; Osipov, Grigory V.

    2015-11-01

    In this article a new model of motif (small ensemble) of neuron-like elements is proposed. It is built with the use of the generalized Lotka-Volterra model with excitatory couplings. The main motivation for this work comes from the problems of neuroscience where excitatory couplings are proved to be the predominant type of interaction between neurons of the brain. In this paper it is shown that there are two modes depending on the type of coupling between the elements: the mode with a stable heteroclinic cycle and the mode with a stable limit cycle. Our second goal is to examine the chaotic dynamics of the generalized three-dimensional Lotka-Volterra model.

  1. Study on online community user motif using web usage mining

    Science.gov (United States)

    Alphy, Meera; Sharma, Ajay

    2016-04-01

    The Web usage mining is the application of data mining, which is used to extract useful information from the online community. The World Wide Web contains at least 4.73 billion pages according to Indexed Web and it contains at least 228.52 million pages according Dutch Indexed web on 6th august 2015, Thursday. It’s difficult to get needed data from these billions of web pages in World Wide Web. Here is the importance of web usage mining. Personalizing the search engine helps the web user to identify the most used data in an easy way. It reduces the time consumption; automatic site search and automatic restore the useful sites. This study represents the old techniques to latest techniques used in pattern discovery and analysis in web usage mining from 1996 to 2015. Analyzing user motif helps in the improvement of business, e-commerce, personalisation and improvement of websites.

  2. μXRF analysis of decoration motifs on Majolica pottery

    International Nuclear Information System (INIS)

    Padilla Lavarez, Roman; Van Espen, Pierr M.; Janssens, K; Schalm, O.

    2001-01-01

    μXRF analysis of decoration motifs on Majolica pottery in fragments corresponding to several Majolica types was carried out using an spectrometer comprising a low power Mo X-ray tube and a elliptic-shape concentration lens with a 60 um spot. Both surface scanning and spot measurements were carried a out, allowing the qualitative identification of the inorganic pigments used for the surface painting decoration and the quantitative analysis of the main glaze composition. The absence of interference signal arising from the excitation on the underlying paste when analysing thin-lead glazing was evaluated, allowing ensuring the suitable of the analytical procedures. A distinction was found between different types of majolica by the composition of the lead tin glaze enamel and by the presence of other elements in the blue, black and orange decoration

  3. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Science.gov (United States)

    Grimm, Guido W.; Renner, Susanne S.; Stamatakis, Alexandros; Hemleben, Vera

    2007-01-01

    The multi-copy internal transcribed spacer (ITS) region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML) and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation) instead of the full (partly redundant) original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994) 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly. PMID:19455198

  4. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Directory of Open Access Journals (Sweden)

    Guido W. Grimm

    2006-01-01

    Full Text Available The multi-copy internal transcribed spacer (ITS region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation instead of the full (partly redundant original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly.

  5. Myosin-1A Targets to Microvilli Using Multiple Membrane Binding Motifs in the Tail Homology 1 (TH1) Domain*

    Science.gov (United States)

    Mazerik, Jessica N.; Tyska, Matthew J.

    2012-01-01

    One of the most abundant components of the enterocyte brush border is the actin-based monomeric motor, myosin-1a (Myo1a). Within brush border microvilli, Myo1a carries out a number of critical functions at the interface between membrane and actin cytoskeleton. Proper physiological function of Myo1a depends on its ability to bind to microvillar membrane, an interaction mediated by a C-terminal tail homology 1 (TH1) domain. However, little is known about the mechanistic details of the Myo1a-TH1/membrane interaction. Structure-function analysis of Myo1a-TH1 targeting in epithelial cells revealed that an N-terminal motif conserved among class I myosins and a C-terminal motif unique to Myo1a-TH1 are both required for steady state microvillar enrichment. Purified Myo1a bound to liposomes composed of phosphatidylserine and phosphoinositol 4,5-bisphosphate, with moderate affinity in a charge-dependent manner. Additionally, peptides of the N- and C-terminal regions required for targeting were able to compete with Myo1a for binding to highly charged liposomes in vitro. Single molecule total internal reflection fluorescence microscopy showed that these motifs are also necessary for slowing the membrane detachment rate in cells. Finally, Myo1a-TH1 co-localized with both lactadherin-C2 (a phosphatidylserine-binding protein) and PLCδ1-PH (a phosphoinositol 4,5-bisphosphate-binding protein) in microvilli, but only lactaderin-C2 expression reduced brush border targeting of Myo1a-TH1. Together, our results suggest that Myo1a targeting to microvilli is driven by membrane binding potential that is distributed throughout TH1 rather than localized to a single motif. These data highlight the diversity of mechanisms that enable different class I myosins to target membranes in distinct biological contexts. PMID:22367206

  6. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d

    Directory of Open Access Journals (Sweden)

    Moffatt Barbara A

    2010-08-01

    Full Text Available Abstract Background Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB for coplanar aromatic motifs similar to those found in known glycan-binding proteins. Results The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192 in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Conclusions Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  7. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d.

    Science.gov (United States)

    Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J

    2010-08-03

    Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  8. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin

    2015-01-01

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  9. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun

    2015-09-27

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  10. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  11. Counting of oligomers in sequences generated by markov chains for DNA motif discovery.

    Science.gov (United States)

    Shan, Gao; Zheng, Wei-Mou

    2009-02-01

    By means of the technique of the imbedded Markov chain, an efficient algorithm is proposed to exactly calculate first, second moments of word counts and the probability for a word to occur at least once in random texts generated by a Markov chain. A generating function is introduced directly from the imbedded Markov chain to derive asymptotic approximations for the problem. Two Z-scores, one based on the number of sequences with hits and the other on the total number of word hits in a set of sequences, are examined for discovery of motifs on a set of promoter sequences extracted from A. thaliana genome. Source code is available at http://www.itp.ac.cn/zheng/oligo.c.

  12. Novel Strategy for Discrimination of Transcription Factor Binding Motifs Employing Mathematical Neural Network

    Science.gov (United States)

    Sugimoto, Asuka; Sumi, Takuya; Kang, Jiyoung; Tateno, Masaru

    2017-07-01

    Recognition in biological macromolecular systems, such as DNA-protein recognition, is one of the most crucial problems to solve toward understanding the fundamental mechanisms of various biological processes. Since specific base sequences of genome DNA are discriminated by proteins, such as transcription factors (TFs), finding TF binding motifs (TFBMs) in whole genome DNA sequences is currently a central issue in interdisciplinary biophysical and information sciences. In the present study, a novel strategy to create a discriminant function for discrimination of TFBMs by constituting mathematical neural networks (NNs) is proposed, together with a method to determine the boundary of signals (TFBMs) and noise in the NN-score (output) space. This analysis also leads to the mathematical limitation of discrimination in the recognition of features representing TFBMs, in an information geometrical manifold. Thus, the present strategy enables the identification of the whole space of TFBMs, right up to the noise boundary.

  13. Using workstation GUIs in HEP, X-Windows, Motif and the Nirvana project

    International Nuclear Information System (INIS)

    Edel, M.; Kryiakopoulos, J.; Lebrun, P.; Ren, B.; Kallenbach, J.; Iourcha, K.

    1992-10-01

    We present four small, high quality, Motif based tools for high energy physicists and discuss some of the less obvious work that is necessary to fully take advantage of graphical user interfaces (GUIs). Histo-Scope and NPlot are interactive data display programs. Histo-Scope is for viewing data as it is collected in running analysis or data acquisition programs, NPlot, for plotting data from text files. Visajet is an interface for the ISAJET event simulation program. It gives users a fast way to configure ISAJET and displays ISAJET events in a three dimensional phase space display that users can rotate and manipulate with the mouse. NEdit is a GUI style plain text editor

  14. Synthesis of most polyene natural product motifs using just 12 building blocks and one coupling reaction.

    Science.gov (United States)

    Woerly, Eric M; Roy, Jahnabi; Burke, Martin D

    2014-06-01

    The inherent modularity of polypeptides, oligonucleotides and oligosaccharides has been harnessed to achieve generalized synthesis platforms. Importantly, like these other targets, most small-molecule natural products are biosynthesized via iterative coupling of bifunctional building blocks. This suggests that many small molecules also possess inherent modularity commensurate with systematic building block-based construction. Supporting this hypothesis, here we report that the polyene motifs found in >75% of all known polyene natural products can be synthesized using just 12 building blocks and one coupling reaction. Using the same general retrosynthetic algorithm and reaction conditions, this platform enabled both the synthesis of a wide range of polyene frameworks that covered all of this natural-product chemical space and the first total syntheses of the polyene natural products asnipyrone B, physarigin A and neurosporaxanthin b-D-glucopyranoside. Collectively, these results suggest the potential for a more generalized approach to making small molecules in the laboratory.

  15. Principal component analysis for predicting transcription-factor binding motifs from array-derived data

    Directory of Open Access Journals (Sweden)

    Vincenti Matthew P

    2005-11-01

    Full Text Available Abstract Background The responses to interleukin 1 (IL-1 in human chondrocytes constitute a complex regulatory mechanism, where multiple transcription factors interact combinatorially to transcription-factor binding motifs (TFBMs. In order to select a critical set of TFBMs from genomic DNA information and an array-derived data, an efficient algorithm to solve a combinatorial optimization problem is required. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful to predict TFBMs at nearly no computational cost and evaluate varying modelling conditions. Singular value decomposition (SVD is a powerful method to derive primary components of a given matrix. Applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs. Results The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC-3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3' were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC-BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPARγ, STAF, ROAZ, and NFκB, and their significance was evaluated numerically using Monte Carlo simulation and genetic algorithm. Conclusion The described SVD-based prediction is an analytical method to provide a set of potential TFBMs involved in transcriptional regulation. The results would be useful to evaluate analytically a contribution of individual DNA sequences.

  16. Secbase: database module to retrieve secondary structure elements with ligand binding motifs.

    Science.gov (United States)

    Koch, Oliver; Cole, Jason; Block, Peter; Klebe, Gerhard

    2009-10-01

    Secbase is presented as a novel extension module of Relibase. It integrates the information about secondary structure elements into the retrieval facilities of Relibase. The data are accessible via the extended Relibase user interface, and integrated retrieval queries can be addressed using an extended version of Reliscript. The primary information about alpha-helices and beta-sheets is used as provided by the PDB. Furthermore, a uniform classification of all turn families, based on recent clustering methods, and a new helix assignment that is based on this turn classification has been included. Algorithms to analyze the geometric features of helices and beta-strands were also implemented. To demonstrate the performance of the Secbase implementation, some application examples are given. They provide new insights into the involvement of secondary structure elements in ligand binding. A survey of water molecules detected next to the N-terminus of helices is analyzed to show their involvement in ligand binding. Additionally, the parallel oriented NH groups at the alpha-helix N-termini provide special binding motifs to bind particular ligand functional groups with two adjacent oxygen atoms, e.g., as found in negatively charged carboxylate or phosphate groups, respectively. The present study also shows that the specific structure of the first turn of alpha-helices provides a suitable explanation for stabilizing charged structures. The magnitude of the overall helix macrodipole seems to have no or only a minor influence on binding. Furthermore, an overview of the involvement of secondary structure elements with the recognition of some important endogenous ligands such as cofactors shows some distinct preference for particular binding motifs and amino acids.

  17. Wayward Warriors: The Viking Motif in Swedish and English Children's Literature

    Science.gov (United States)

    Sundmark, Björn

    2014-01-01

    In this article the Viking motif in children's literature is explored--from its roots in (adult) nationalist and antiquarian discourse, over pedagogical and historical texts for children, to the eventual diversification (or dissolution) of the motif into different genres and forms. The focus is on Swedish Viking narratives, but points of…

  18. MOMFER: A Search Engine of Thompson's Motif-Index of Folk Literature

    NARCIS (Netherlands)

    Karsdorp, F.B.; van der Meulen, Marten; Meder, Theo; van den Bosch, Antal

    2015-01-01

    More than fifty years after the first edition of Thompson's seminal Motif-Indexof Folk Literature, we present an online search engine tailored to fully disclose the index digitally. This search engine, called MOMFER, greatly enhances the searchability of the Motif-Index and provides exciting new

  19. Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network

    Directory of Open Access Journals (Sweden)

    Barabási Albert-László

    2004-01-01

    Full Text Available Abstract Background Transcriptional regulation of cellular functions is carried out through a complex network of interactions among transcription factors and the promoter regions of genes and operons regulated by them.To better understand the system-level function of such networks simplification of their architecture was previously achieved by identifying the motifs present in the network, which are small, overrepresented, topologically distinct regulatory interaction patterns (subgraphs. However, the interaction of such motifs with each other, and their form of integration into the full network has not been previously examined. Results By studying the transcriptional regulatory network of the bacterium, Escherichia coli, we demonstrate that the two previously identified motif types in the network (i.e., feed-forward loops and bi-fan motifs do not exist in isolation, but rather aggregate into homologous motif clusters that largely overlap with known biological functions. Moreover, these clusters further coalesce into a supercluster, thus establishing distinct topological hierarchies that show global statistical properties similar to the whole network. Targeted removal of motif links disintegrates the network into small, isolated clusters, while random disruptions of equal number of links do not cause such an effect. Conclusion Individual motifs aggregate into homologous motif clusters and a supercluster forming the backbone of the E. coli transcriptional regulatory network and play a central role in defining its global topological organization.

  20. High affinity recognition of a Phytophthora protein by Arabidopsis via an RGD motif

    NARCIS (Netherlands)

    Senchou, V.; Weide, R.L.; Carrasco, A.; Bouyssou, H.; Pont-Lezica, R.; Govers, F.; Canut, H.

    2004-01-01

    The RGD tripeptide sequence, a cell adhesion motif present in several extracellular matrix proteins of mammalians, is involved in numerous plant processes. In plant-pathogen interactions, the RGD motif is believed to reduce plant defence responses by disrupting adhesions between the cell wall and

  1. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  2. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  3. Mechanism for activation of the growth factor-activated AGC kinases by turn motif phosphorylation

    DEFF Research Database (Denmark)

    Hauge, Camilla; Antal, Torben L; Hirschberg, Daniel

    2007-01-01

    investigated the role of the third, so-called turn motif phosphate, also located in the tail, in the AGC kinases PKB, S6K, RSK, MSK, PRK and PKC. We report cooperative action of the HM phosphate and the turn motif phosphate, because it binds a phosphoSer/Thr-binding site above the glycine-rich loop within...

  4. Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

    Science.gov (United States)

    Kinjo, Akira R.; Nakamura, Haruki

    2012-01-01

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478

  5. Thermal Stability of Modified i-Motif Oligonucleotides with Naphthalimide Intercalating Nucleic Acids

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed Ali; Pedersen, Erik B.; Khaireldin, Nahid Y.

    2016-01-01

    In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion of naphtha......In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion...... of naphthalimide (1H-benzo[de]isoquinoline-1,3(2H)-dione) as the intercalating nucleic acid. The stabilities of i-motif structures with inserted naphthalimide intercalating nucleotides were studied using UV melting temperatures (Tm) and circular dichroism spectra at different pH values and conditions (crowding...

  6. I-motif DNA structures are formed in the nuclei of human cells

    Science.gov (United States)

    Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

    2018-06-01

    Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.

  7. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    Science.gov (United States)

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. A Repeating Sulfated Galactan Motif Resuscitates Dormant Micrococcus luteus Bacteria.

    Science.gov (United States)

    Böttcher, Thomas; Szamosvári, Dávid; Clardy, Jon

    2018-07-01

    Only a small fraction of bacteria can autonomously initiate growth on agar plates. Nongrowing bacteria typically enter a metabolically inactive dormant state and require specific chemical trigger factors or signals to exit this state and to resume growth. Micrococcus luteus has become a model organism for this important yet poorly understood phenomenon. Only a few resuscitation signals have been described to date, and all of them are produced endogenously by bacterial species. We report the discovery of a novel type of resuscitation signal that allows M. luteus to grow on agar but not agarose plates. Fractionation of the agar polysaccharide complex and sulfation of agarose allowed us to identify the signal as highly sulfated saccharides found in agar or carrageenans. Purification of hydrolyzed κ-carrageenan ultimately led to the identification of the signal as a small fragment of a large linear polysaccharide, i.e., an oligosaccharide of five or more sugars with a repeating disaccharide motif containing d-galactose-4-sulfate (G4S) 1,4-linked to 3,6-anhydro-α-d-galactose (DA), G4S-(DA-G4S) n ≥2 IMPORTANCE Most environmental bacteria cannot initiate growth on agar plates, but they can flourish on the same plates once growth is initiated. While there are a number of names for and manifestations of this phenomenon, the underlying cause appears to be the requirement for a molecular signal indicating safe growing conditions. Micrococcus luteus has become a model organism for studying this growth initiation process, often called resuscitation, because of its apparent connection with the persistent or dormant form of Mycobacterium tuberculosis , an important human pathogen. In this report, we identify a highly sulfated saccharide from agar or carrageenans that robustly resuscitates dormant M. luteus on agarose plates. We identified and characterized the signal as a small repeating disaccharide motif. Our results indicate that signals inherent in or absent from the

  9. Combining flagelliform and dragline spider silk motifs to produce tunable synthetic biopolymer fibers.

    Science.gov (United States)

    Teulé, Florence; Addison, Bennett; Cooper, Alyssa R; Ayon, Joel; Henning, Robert W; Benmore, Chris J; Holland, Gregory P; Yarger, Jeffery L; Lewis, Randolph V

    2012-06-01

    The two Flag/MaSp 2 silk proteins produced recombinantly were based on the basic consensus repeat of the dragline silk spidroin 2 protein (MaSp 2) from the Nephila clavipes orb weaving spider. However, the proline-containing pentapeptides juxtaposed to the polyalanine segments resembled those found in the flagelliform silk protein (Flag) composing the web spiral: (GPGGX(1) GPGGX(2))(2) with X(1) /X(2) = A/A or Y/S. Fibers were formed from protein films in aqueous solutions or extruded from resolubilized protein dopes in organic conditions when the Flag motif was (GPGGX(1) GPGGX(2))(2) with X(1) /X(2) = Y/S or A/A, respectively. Post-fiber processing involved similar drawing ratios (2-2.5×) before or after water-treatment. Structural (ssNMR and XRD) and morphological (SEM) changes in the fibers were compared to the mechanical properties of the fibers at each step. Nuclear magnetic resonance indicated that the fraction of β-sheet nanocrystals in the polyalanine regions formed upon extrusion, increased during stretching, and was maximized after water-treatment. X-ray diffraction showed that nanocrystallite orientation parallel to the fiber axis increased the ultimate strength and initial stiffness of the fibers. Water furthered nanocrystal orientation and three-dimensional growth while plasticizing the amorphous regions, thus producing tougher fibers due to increased extensibility. These fibers were highly hygroscopic and had similar internal network organization, thus similar range of mechanical properties that depended on their diameters. The overall structure of the consensus repeat of the silk-like protein dictated the mechanical properties of the fibers while protein molecular weight limited these same properties. Subtle structural motif re-design impacted protein self-assembly mechanisms and requirements for fiber formation. Copyright © 2011 Wiley Periodicals, Inc.

  10. MicroRNA categorization using sequence motifs and k-mers.

    Science.gov (United States)

    Yousef, Malik; Khalifa, Waleed; Acar, İlhan Erkin; Allmer, Jens

    2017-03-14

    Post-transcriptional gene dysregulation can be a hallmark of diseases like cancer and microRNAs (miRNAs) play a key role in the modulation of translation efficiency. Known pre-miRNAs are listed in miRBase, and they have been discovered in a variety of organisms ranging from viruses and microbes to eukaryotic organisms. The computational detection of pre-miRNAs is of great interest, and such approaches usually employ machine learning to discriminate between miRNAs and other sequences. Many features have been proposed describing pre-miRNAs, and we have previously introduced the use of sequence motifs and k-mers as useful ones. There have been reports of xeno-miRNAs detected via next generation sequencing. However, they may be contaminations and to aid that important decision-making process, we aimed to establish a means to differentiate pre-miRNAs from different species. To achieve distinction into species, we used one species' pre-miRNAs as the positive and another species' pre-miRNAs as the negative training and test data for the establishment of machine learned models based on sequence motifs and k-mers as features. This approach resulted in higher accuracy values between distantly related species while species with closer relation produced lower accuracy values. We were able to differentiate among species with increasing success when the evolutionary distance increases. This conclusion is supported by previous reports of fast evolutionary changes in miRNAs since even in relatively closely related species a fairly good discrimination was possible.

  11. One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

    Science.gov (United States)

    Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

    2014-12-01

    G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.

  12. Perkembangan Motif Sineas Film Indie dalam Menghadapi Industri Film Mainstream

    Directory of Open Access Journals (Sweden)

    Yoppy Ardiyono

    2016-03-01

    Full Text Available The research aims to review to review determine the effect and its impact raised by motive - a motive the ada in the hearts period travel time history of film short against cinematographer-filmmaker as principal especially filmmakers left path (indie. The used platform theory research hearts singer adopts from theory commodification media vincent mosco. Singer helped shift theory understanding the motive filmmakers working hearts differences fundamental basis of political pressure economic happens under with demands regime. The method used is descriptive qualitative research methods. Data collection techniques through observation of the environment of an independent film live and in-depth interviews with speakers including mr. Yang prayer orangutan direct contact 'with realm of research. Coupled with study to review the literature references adding insight research. And that was concluded change appears motif among indie film cinematographer it is true the situation is closely linked to the mainstream industry, konstilasi politics, and the orientation of capitalism. Necessary their one thing is clear and systematic regulation from the government to the future movement of currents sidestream (indie more with good operates professionally arranged, the air so that the contribution of indie cinema film land for progress can feels good to yourself indie filmmakers as well as those of its main industries.

  13. ARCHETYPES AND MYTHOLOGICAL MOTIFS: JOHN UPDIKE’S LEGACY REVISITED

    Directory of Open Access Journals (Sweden)

    Loreta Ulvydienė

    2018-04-01

    Full Text Available John Updike is widely considered to be one of the greatest, one of the most popular and sometimes most controversial writers concerned with the American small town and middle-class materialism. A lot of literary critics and researchers observe that Updike’s finest work came from his exploration of ordinary America and from his use of elegant prose, rich with metaphor, to portray the public and private feelings of Americans, their daily rounds of life. In addition, discussing Updike’s individual works a lot of literary critics and researchers have observed the writer’s attempts to re-write myth in “the mythical age”1 of the twentieth century. Naturally enough, as the return to myth is assumed to be a certain feature of the Modernist movement, half a century later since Updike’s famous novel Centaur was penned, it is indispensable to re-examine the writer’s fictional intentions in the usage of myth. More importantly, it is needful to determine whether we can see the mythic elements and realistic details as a continuum or as the contrasted opposites in his so called “historical chronicles”. Updike’s novels and stories are filled with mythological motifs and character archetypes. Thus, the study aims at revisiting John Updike’s creation considering mythological elements and archetypal images of his heroes alongside with heroic masculinity, war, terrorism and American perfectionism.

  14. Dystroglycan versatility in cell adhesion: a tale of multiple motifs

    Directory of Open Access Journals (Sweden)

    Winder Steve J

    2010-02-01

    Full Text Available Abstract Dystroglycan is a ubiquitously expressed heterodimeric adhesion receptor. The extracellular α-subunit makes connections with a number of laminin G domain ligands including laminins, agrin and perlecan in the extracellular matrix and the transmembrane β-subunit makes connections to the actin filament network via cytoskeletal linkers including dystrophin, utrophin, ezrin and plectin, depending on context. Originally discovered as part of the dystrophin glycoprotein complex of skeletal muscle, dystroglycan is an important adhesion molecule and signalling scaffold in a multitude of cell types and tissues and is involved in several diseases. Dystroglycan has emerged as a multifunctional adhesion platform with many interacting partners associating with its short unstructured cytoplasmic domain. Two particular hotspots are the cytoplasmic juxtamembrane region and at the very carboxy terminus of dystroglycan. Regions which between them have several overlapping functions: in the juxtamembrane region; a nuclear localisation signal, ezrin/radixin/moesin protein, rapsyn and ERK MAP Kinase binding function, and at the C terminus a regulatory tyrosine governing WW, SH2 and SH3 domain interactions. We will discuss the binding partners for these motifs and how their interactions and regulation can modulate the involvement of dystroglycan in a range of different adhesion structures and functions depending on context. Thus dystroglycan presents as a multifunctional scaffold involved in adhesion and adhesion-mediated signalling with its functions under exquisite spatio-temporal regulation.

  15. MODA: an efficient algorithm for network motif discovery in biological networks.

    Science.gov (United States)

    Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

    2009-10-01

    In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/

  16. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...

  17. Identification of a putative nuclear export signal motif in human NANOG homeobox domain

    International Nuclear Information System (INIS)

    Park, Sung-Won; Do, Hyun-Jin; Huh, Sun-Hyung; Sung, Boreum; Uhm, Sang-Jun; Song, Hyuk; Kim, Nam-Hyung; Kim, Jae-Hwan

    2012-01-01

    Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif ( 125 MQELSNILNL 134 ) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant with both NLSs deleted and only the putative NES motif contained (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD only containing either one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. Furthermore, CRM1-specific nuclear export inhibitor LMB blocked the hNANOG potent NES-mediated export, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in CRM1-mediated nuclear export pathway.

  18. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA.

    Science.gov (United States)

    Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

    2016-02-02

    The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.

  19. Conserved amino acid motifs from the novel Piv/MooV family of transposases and site-specific recombinases are required for catalysis of DNA inversion by Piv.

    Science.gov (United States)

    Tobiason, D M; Buchner, J M; Thiel, W H; Gernert, K M; Karls, A C

    2001-02-01

    Piv, a site-specific invertase from Moraxella lacunata, exhibits amino acid homology with the transposases of the IS110/IS492 family of insertion elements. The functions of conserved amino acid motifs that define this novel family of both transposases and site-specific recombinases (Piv/MooV family) were examined by mutagenesis of fully conserved amino acids within each motif in Piv. All Piv mutants altered in conserved residues were defective for in vivo inversion of the M. lacunata invertible DNA segment, but competent for in vivo binding to Piv DNA recognition sequences. Although the primary amino acid sequences of the Piv/MooV recombinases do not contain a conserved DDE motif, which defines the retroviral integrase/transposase (IN/Tnps) family, the predicted secondary structural elements of Piv align well with those of the IN/Tnps for which crystal structures have been determined. Molecular modelling of Piv based on these alignments predicts that E59, conserved as either E or D in the Piv/MooV family, forms a catalytic pocket with the conserved D9 and D101 residues. Analysis of Piv E59G confirms a role for E59 in catalysis of inversion. These results suggest that Piv and the related IS110/IS492 transposases mediate DNA recombination by a common mechanism involving a catalytic DED or DDD motif.

  20. Salt-bridge Swapping in the EXXERFXYY Motif of Proton Coupled Oligopeptide Transporters

    DEFF Research Database (Denmark)

    Aduri, Nanda G; Prabhala, Bala K; Ernst, Heidi A

    2015-01-01

    to as E1XXE2R), located on Helix I, in interactions with the proton. In this study we investigated the intracellular substrate accumulation by motif variants with all possible combinations of glutamate residues changed to glutamine and arginine changed to a tyrosine; the latter being a natural variant......-motif salt bridge, i.e. R-E2 to R-E1, which is consistent with previous structural studies. Molecular dynamics simulations of the motif variants E1XXE2R and E1XXQ2R support this mechanism. The simulations showed that upon changing conformation, arginine pushes Helix V, through interactions with the highly...

  1. Stochastic Resonance in Neuronal Network Motifs with Ornstein-Uhlenbeck Colored Noise

    Directory of Open Access Journals (Sweden)

    Xuyang Lou

    2014-01-01

    Full Text Available We consider here the effect of the Ornstein-Uhlenbeck colored noise on the stochastic resonance of the feed-forward-loop (FFL network motif. The FFL motif is modeled through the FitzHugh-Nagumo neuron model as well as the chemical coupling. Our results show that the noise intensity and the correlation time of the noise process serve as the control parameters, which have great impacts on the stochastic dynamics of the FFL motif. We find that, with a proper choice of noise intensities and the correlation time of the noise process, the signal-to-noise ratio (SNR can display more than one peak.

  2. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS; this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  3. An intracellular motif of GLUT4 regulates fusion of GLUT4-containing vesicles.

    Science.gov (United States)

    Heyward, Catherine A; Pettitt, Trevor R; Leney, Sophie E; Welsh, Gavin I; Tavaré, Jeremy M; Wakelam, Michael J O

    2008-05-20

    Insulin stimulates glucose uptake by adipocytes through increasing translocation of the glucose transporter GLUT4 from an intracellular compartment to the plasma membrane. Fusion of GLUT4-containing vesicles at the cell surface is thought to involve phospholipase D activity, generating the signalling lipid phosphatidic acid, although the mechanism of action is not yet clear. Here we report the identification of a putative phosphatidic acid-binding motif in a GLUT4 intracellular loop. Mutation of this motif causes a decrease in the insulin-induced exposure of GLUT4 at the cell surface of 3T3-L1 adipocytes via an effect on vesicle fusion. The potential phosphatidic acid-binding motif identified in this study is unique to GLUT4 among the sugar transporters, therefore this motif may provide a unique mechanism for regulating insulin-induced translocation by phospholipase D signalling.

  4. An intracellular motif of GLUT4 regulates fusion of GLUT4-containing vesicles

    Directory of Open Access Journals (Sweden)

    Welsh Gavin I

    2008-05-01

    Full Text Available Abstract Background Insulin stimulates glucose uptake by adipocytes through increasing translocation of the glucose transporter GLUT4 from an intracellular compartment to the plasma membrane. Fusion of GLUT4-containing vesicles at the cell surface is thought to involve phospholipase D activity, generating the signalling lipid phosphatidic acid, although the mechanism of action is not yet clear. Results Here we report the identification of a putative phosphatidic acid-binding motif in a GLUT4 intracellular loop. Mutation of this motif causes a decrease in the insulin-induced exposure of GLUT4 at the cell surface of 3T3-L1 adipocytes via an effect on vesicle fusion. Conclusion The potential phosphatidic acid-binding motif identified in this study is unique to GLUT4 among the sugar transporters, therefore this motif may provide a unique mechanism for regulating insulin-induced translocation by phospholipase D signalling.

  5. A Simple Decision Rule for Recognition of Poly(A) Tail Signal Motifs in Human Genome

    KAUST Repository

    AbouEisha, Hassan M.; Chikalov, Igor; Moshkov, Mikhail; Jankovic, Boris R.

    2015-01-01

    Background is the numerous attempts were made to predict motifs in genomic sequences that correspond to poly (A) tail signals. Vast portion of this effort has been directed to a plethora of nonlinear classification methods. Even when such approaches

  6. Lucky Motifs in Chinese Folk Art: Interpreting Paper-cut from Chinese Shaanxi

    Directory of Open Access Journals (Sweden)

    Xuxiao WANG

    2013-11-01

    Full Text Available Paper-cut is not simply a form of traditional Chinese folk art. Lucky motifs developed in paper-cut certainly acquired profound cultural connotations. As paper-cut is a time-honoured skill across the nation, interpreting those motifs requires cultural receptiveness and anthropological sensitivity. The author of this article analyzes examples of paper-cut from Northern Shaanxi, China, to identify the cohesive motifs and explore the auspiciousness of the specific concepts of Fu, Lu, Shou, Xi. The paper-cut of Northern Shaanxi is an ideal representative of the craft as a whole because of the relative stability of this region in history, in terms of both art and culture. Furthermore, its straightforward style provides a clear demonstration of motifs regarding folk understanding of expectations for life.

  7. Design of Fashion Accessories Using Akwa-Ocha Motifs and Symbols

    African Journals Online (AJOL)

    Nneka Umera-Okeke

    Nkpopu: holes. 16. Osikapa na ... accessories anchors in both social semiotics and archetypal theories. Social semiotics theory as ... the two earrings incorporate the Onwa (moon) motif in spherical shape and in black colour. They are held ...

  8. On the origin of distribution patterns of motifs in biological networks

    Directory of Open Access Journals (Sweden)

    Lesk Arthur M

    2008-08-01

    Full Text Available Abstract Background Inventories of small subgraphs in biological networks have identified commonly-recurring patterns, called motifs. The inference that these motifs have been selected for function rests on the idea that their occurrences are significantly more frequent than random. Results Our analysis of several large biological networks suggests, in contrast, that the frequencies of appearance of common subgraphs are similar in natural and corresponding random networks. Conclusion Indeed, certain topological features of biological networks give rise naturally to the common appearance of the motifs. We therefore question whether frequencies of occurrences are reasonable evidence that the structures of motifs have been selected for their functional contribution to the operation of networks.

  9. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed; Briganti, Leonardo; Tramontano, Anna; Via, Allegra

    2011-01-01

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length

  10. FTZ-Factor1 and Fushi tarazu interact via conserved nuclear receptor and coactivator motifs

    Science.gov (United States)

    Schwartz, Carol J.E.; Sampson, Heidi M.; Hlousek, Daniela; Percival-Smith, Anthony; Copeland, John W.R.; Simmonds, Andrew J.; Krause, Henry M.

    2001-01-01

    To activate transcription, most nuclear receptor proteins require coactivators that bind to their ligand-binding domains (LBDs). The Drosophila FTZ-Factor1 (FTZ-F1) protein is a conserved member of the nuclear receptor superfamily, but was previously thought to lack an AF2 motif, a motif that is required for ligand and coactivator binding. Here we show that FTZ-F1 does have an AF2 motif and that it is required to bind a coactivator, the homeodomain-containing protein Fushi tarazu (FTZ). We also show that FTZ contains an AF2-interacting nuclear receptor box, the first to be found in a homeodomain protein. Both interaction motifs are shown to be necessary for physical interactions in vitro and for functional interactions in developing embryos. These unexpected findings have important implications for the conserved homologs of the two proteins. PMID:11157757

  11. Discriminative Motif Discovery via Simulated Evolution and Random Under-Sampling

    OpenAIRE

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the sta...

  12. A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction.

    Science.gov (United States)

    Guo, Yuchun; Tian, Kevin; Zeng, Haoyang; Guo, Xiaoyun; Gifford, David Kenneth

    2018-04-13

    The representation and discovery of transcription factor (TF) sequence binding specificities is critical for understanding gene regulatory networks and interpreting the impact of disease-associated noncoding genetic variants. We present a novel TF binding motif representation, the k -mer set memory (KSM), which consists of a set of aligned k -mers that are overrepresented at TF binding sites, and a new method called KMAC for de novo discovery of KSMs. We find that KSMs more accurately predict in vivo binding sites than position weight matrix (PWM) models and other more complex motif models across a large set of ChIP-seq experiments. Furthermore, KSMs outperform PWMs and more complex motif models in predicting in vitro binding sites. KMAC also identifies correct motifs in more experiments than five state-of-the-art motif discovery methods. In addition, KSM-derived features outperform both PWM and deep learning model derived sequence features in predicting differential regulatory activities of expression quantitative trait loci (eQTL) alleles. Finally, we have applied KMAC to 1600 ENCODE TF ChIP-seq data sets and created a public resource of KSM and PWM motifs. We expect that the KSM representation and KMAC method will be valuable in characterizing TF binding specificities and in interpreting the effects of noncoding genetic variations. © 2018 Guo et al.; Published by Cold Spring Harbor Laboratory Press.

  13. Creation of Hybrid Nanorods From Sequences of Natural Trimeric Fibrous Proteins Using the Fibritin Trimerization Motif

    Science.gov (United States)

    Papanikolopoulou, Katerina; van Raaij, Mark J.; Mitraki, Anna

    Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, β-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple β-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.

  14. Motif formation and industry specific topologies in the Japanese business firm network

    Science.gov (United States)

    Maluck, Julian; Donner, Reik V.; Takayasu, Hideki; Takayasu, Misako

    2017-05-01

    Motifs and roles are basic quantities for the characterization of interactions among 3-node subsets in complex networks. In this work, we investigate how the distribution of 3-node motifs can be influenced by modifying the rules of an evolving network model while keeping the statistics of simpler network characteristics, such as the link density and the degree distribution, invariant. We exemplify this problem for the special case of the Japanese Business Firm Network, where a well-studied and relatively simple yet realistic evolving network model is available, and compare the resulting motif distribution in the real-world and simulated networks. To better approximate the motif distribution of the real-world network in the model, we introduce both subgraph dependent and global additional rules. We find that a specific rule that allows only for the merging process between nodes with similar link directionality patterns reduces the observed excess of densely connected motifs with bidirectional links. Our study improves the mechanistic understanding of motif formation in evolving network models to better describe the characteristic features of real-world networks with a scale-free topology.

  15. Short Arginine Motifs Drive Protein Stickiness in the Escherichia coli Cytoplasm.

    Science.gov (United States)

    Kyne, Ciara; Crowley, Peter B

    2017-09-19

    Although essential to numerous biotech applications, knowledge of molecular recognition by arginine-rich motifs in live cells remains limited. 1 H, 15 N HSQC and 19 F NMR spectroscopies were used to investigate the effects of C-terminal -GR n (n = 1-5) motifs on GB1 interactions in Escherichia coli cells and cell extracts. While the "biologically inert" GB1 yields high-quality in-cell spectra, the -GR n fusions with n = 4 or 5 were undetectable. This result suggests that a tetra-arginine motif is sufficient to drive interactions between a test protein and macromolecules in the E. coli cytoplasm. The inclusion of a 12 residue flexible linker between GB1 and the -GR 5 motif did not improve detection of the "inert" domain. In contrast, all of the constructs were detectable in cell lysates and extracts, suggesting that the arginine-mediated complexes were weak. Together these data reveal the significance of weak interactions between short arginine-rich motifs and the E. coli cytoplasm and demonstrate the potential of such motifs to modify protein interactions in living cells. These interactions must be considered in the design of (in vivo) nanoscale assemblies that rely on arginine-rich sequences.

  16. Discriminative motif discovery via simulated evolution and random under-sampling.

    Directory of Open Access Journals (Sweden)

    Tao Song

    Full Text Available Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  17. Discriminative motif discovery via simulated evolution and random under-sampling.

    Science.gov (United States)

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  18. Improved i-motif thermal stability by insertion of anthraquinone monomers

    DEFF Research Database (Denmark)

    Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

    2017-01-01

    In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...

  19. Peptide-binding motifs of two common equine class I MHC molecules in Thoroughbred horses.

    Science.gov (United States)

    Bergmann, Tobias; Lindvall, Mikaela; Moore, Erin; Moore, Eugene; Sidney, John; Miller, Donald; Tallmadge, Rebecca L; Myers, Paisley T; Malaker, Stacy A; Shabanowitz, Jeffrey; Osterrieder, Nikolaus; Peters, Bjoern; Hunt, Donald F; Antczak, Douglas F; Sette, Alessandro

    2017-05-01

    Quantitative peptide-binding motifs of MHC class I alleles provide a valuable tool to efficiently identify putative T cell epitopes. Detailed information on equine MHC class I alleles is still very limited, and to date, only a single equine MHC class I allele, Eqca-1*00101 (ELA-A3 haplotype), has been characterized. The present study extends the number of characterized ELA class I specificities in two additional haplotypes found commonly in the Thoroughbred breed. Accordingly, we here report quantitative binding motifs for the ELA-A2 allele Eqca-16*00101 and the ELA-A9 allele Eqca-1*00201. Utilizing analyses of endogenously bound and eluted ligands and the screening of positional scanning combinatorial libraries, detailed and quantitative peptide-binding motifs were derived for both alleles. Eqca-16*00101 preferentially binds peptides with aliphatic/hydrophobic residues in position 2 and at the C-terminus, and Eqca-1*00201 has a preference for peptides with arginine in position 2 and hydrophobic/aliphatic residues at the C-terminus. Interestingly, the Eqca-16*00101 motif resembles that of the human HLA A02-supertype, while the Eqca-1*00201 motif resembles that of the HLA B27-supertype and two macaque class I alleles. It is expected that the identified motifs will facilitate the selection of candidate epitopes for the study of immune responses in horses.

  20. Computational analysis and prediction of the binding motif and protein interacting partners of the Abl SH3 domain.

    Directory of Open Access Journals (Sweden)

    Tingjun Hou

    2006-01-01

    Full Text Available Protein-protein interactions, particularly weak and transient ones, are often mediated by peptide recognition domains, such as Src Homology 2 and 3 (SH2 and SH3 domains, which bind to specific sequence and structural motifs. It is important but challenging to determine the binding specificity of these domains accurately and to predict their physiological interacting partners. In this study, the interactions between 35 peptide ligands (15 binders and 20 non-binders and the Abl SH3 domain were analyzed using molecular dynamics simulation and the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. The calculated binding free energies correlated well with the rank order of the binding peptides and clearly distinguished binders from non-binders. Free energy component analysis revealed that the van der Waals interactions dictate the binding strength of peptides, whereas the binding specificity is determined by the electrostatic interaction and the polar contribution of desolvation. The binding motif of the Abl SH3 domain was then determined by a virtual mutagenesis method, which mutates the residue at each position of the template peptide relative to all other 19 amino acids and calculates the binding free energy difference between the template and the mutated peptides using the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. A single position mutation free energy profile was thus established and used as a scoring matrix to search peptides recognized by the Abl SH3 domain in the human genome. Our approach successfully picked ten out of 13 experimentally determined binding partners of the Abl SH3 domain among the top 600 candidates from the 218,540 decapeptides with the PXXP motif in the SWISS-PROT database. We expect that this physical-principle based method can be applied to other protein domains as well.

  1. Molecular Detection, Phylogenetic Analysis, and Identification of Transcription Motifs in Feline Leukemia Virus from Naturally Infected Cats in Malaysia

    Directory of Open Access Journals (Sweden)

    Faruku Bande

    2014-01-01

    Full Text Available A nested PCR assay was used to determine the viral RNA and proviral DNA status of naturally infected cats. Selected samples that were FeLV-positive by PCR were subjected to sequencing, phylogenetic analysis, and motifs search. Of the 39 samples that were positive for FeLV p27 antigen, 87.2% (34/39 were confirmed positive with nested PCR. FeLV proviral DNA was detected in 38 (97.3% of p27-antigen negative samples. Malaysian FeLV isolates are found to be highly similar with a homology of 91% to 100%. Phylogenetic analysis revealed that Malaysian FeLV isolates divided into two clusters, with a majority (86.2% sharing similarity with FeLV-K01803 and fewer isolates (13.8% with FeLV-GM1 strain. Different enhancer motifs including NF-GMa, Krox-20/WT1I-del2, BAF1, AP-2, TBP, TFIIF-beta, TRF, and TFIID are found to occur either in single, duplicate, triplicate, or sets of 5 in different positions within the U3-LTR-gag region. The present result confirms the occurrence of FeLV viral RNA and provirus DNA in naturally infected cats. Malaysian FeLV isolates are highly similar, and a majority of them are closely related to a UK isolate. This study provides the first molecular based information on FeLV in Malaysia. Additionally, different enhancer motifs likely associated with FeLV related pathogenesis have been identified.

  2. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    Full Text Available The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36, Mystique, Enigma (LMP-1, Enigma homologue (ENH, ZASP (Cypher, Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2. As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM. This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  3. Computational study of the fibril organization of polyglutamine repeats reveals a common motif identified in beta-helices.

    Science.gov (United States)

    Zanuy, David; Gunasekaran, Kannan; Lesk, Arthur M; Nussinov, Ruth

    2006-04-21

    The formation of fibril aggregates by long polyglutamine sequences is assumed to play a major role in neurodegenerative diseases such as Huntington. Here, we model peptides rich in glutamine, through a series of molecular dynamics simulations. Starting from a rigid nanotube-like conformation, we have obtained a new conformational template that shares structural features of a tubular helix and of a beta-helix conformational organization. Our new model can be described as a super-helical arrangement of flat beta-sheet segments linked by planar turns or bends. Interestingly, our comprehensive analysis of the Protein Data Bank reveals that this is a common motif in beta-helices (termed beta-bend), although it has not been identified so far. The motif is based on the alternation of beta-sheet and helical conformation as the protein sequence is followed from the N to the C termini (beta-alpha(R)-beta-polyPro-beta). We further identify this motif in the ssNMR structure of the protofibril of the amyloidogenic peptide Abeta(1-40). The recurrence of the beta-bend suggests a general mode of connecting long parallel beta-sheet segments that would allow the growth of partially ordered fibril structures. The design allows the peptide backbone to change direction with a minimal loss of main chain hydrogen bonds. The identification of a coherent organization beyond that of the beta-sheet segments in different folds rich in parallel beta-sheets suggests a higher degree of ordered structure in protein fibrils, in agreement with their low solubility and dense molecular packing.

  4. Structural and Functional Motifs in Influenza Virus RNAs

    Directory of Open Access Journals (Sweden)

    Damien Ferhadian

    2018-03-01

    have now been validated experimentally and their role in the viral life cycle demonstrated. This review aims to compile the structural motifs found in the different RNA classes (vRNA, cRNA, and vmRNA of influenza viruses and their function in the viral replication cycle.

  5. Prediction of host - pathogen protein interactions between Mycobacterium tuberculosis and Homo sapiens using sequence motifs.

    Science.gov (United States)

    Huo, Tong; Liu, Wei; Guo, Yu; Yang, Cheng; Lin, Jianping; Rao, Zihe

    2015-03-26

    Emergence of multiple drug resistant strains of M. tuberculosis (MDR-TB) threatens to derail global efforts aimed at reigning in the pathogen. Co-infections of M. tuberculosis with HIV are difficult to treat. To counter these new challenges, it is essential to study the interactions between M. tuberculosis and the host to learn how these bacteria cause disease. We report a systematic flow to predict the host pathogen interactions (HPIs) between M. tuberculosis and Homo sapiens based on sequence motifs. First, protein sequences were used as initial input for identifying the HPIs by 'interolog' method. HPIs were further filtered by prediction of domain-domain interactions (DDIs). Functional annotations of protein and publicly available experimental results were applied to filter the remaining HPIs. Using such a strategy, 118 pairs of HPIs were identified, which involve 43 proteins from M. tuberculosis and 48 proteins from Homo sapiens. A biological interaction network between M. tuberculosis and Homo sapiens was then constructed using the predicted inter- and intra-species interactions based on the 118 pairs of HPIs. Finally, a web accessible database named PATH (Protein interactions of M. tuberculosis and Human) was constructed to store these predicted interactions and proteins. This interaction network will facilitate the research on host-pathogen protein-protein interactions, and may throw light on how M. tuberculosis interacts with its host.

  6. Recurring polyhedral motifs in the amorphous indium gallium zinc oxide network

    International Nuclear Information System (INIS)

    Divya; Deepak; Prasad, Rajendra

    2017-01-01

    The coordination polyhedra around the cations are the building blocks of ionic solids. For amorphous InGaZn oxide (a-IGZO), these coordination polyhedra are identified to investigate properties that depend on short range interactions. Therefore, in this first principles based study, a large number (10) of samples of a-IGZO were prepared by ab initio melt-and-quench molecular dynamics, so that several distinct samples of the amorphous landscape are obtained corresponding to local minima in energy. Based on a method of comparing bond angles between metal and oxygen atoms, the identified polyhedra were matched to the polyhedral motifs present in the related crystalline systems, such as, InGaZnO_4, In_2O_3, Ga_2O_3, and ZnO. Consequently, we find, the a-IGZO primarily consists of the following polyhedra: a tetrahedron from space group 199 and an octahedron from space group 206 of In_2O_3; a tetrahedron from space group 12 and an octahedron from space group 167 of Ga_2O_3; a tetrahedron from space group 186 of ZnO; zinc and gallium trigonal bipyramids from c-IGZO; and one zinc fourfold, one zinc fivefold, and one indium fivefold coordination polyhedra that occur only in the amorphous phase. Thus, we were able to reduce the description of structure from 360 to 10 groups of polyhedra. (copyright 2017 WILEY-VCH Verlag GmbH and Co. KGaA, Weinheim)

  7. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination.

    Science.gov (United States)

    Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David

    2006-01-01

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  8. How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2014-05-29

    LD motifs (leucine-aspartic acidmotifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.

  9. Disparate requirements for the Walker A and B ATPase motifs ofhuman RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  10. APOCALYPTIC MOTIFS IN THE CYCLE OF STORIES BY M.A. BULGAKOV «NOTES OF A YOUNG DOCTOR»

    Directory of Open Access Journals (Sweden)

    Evgeniy Igorevich Erokhov

    2015-10-01

    Full Text Available The motif analysis of a cycle of stories by M.A. Bulgakov «Notes of a Young Doctor» from the point of view of their apocalyptic problematics was first performed in this article. To identify apocalyptic motifs the method of motif analysis, developed by B.M. Gasparov, was used which will also help to prove the interpenetration of motifs in the cycle of stories. The result of the research work is the identification of apocalyptic motifs which are manifested in the experiences of the main character and the events taking place around him and passing through the prism of physician’s perception of the world. Our identified motifs show that the stories in the cycle are united not only thematically and with the help of the image of the main character, but with the help of the motifs which reflect interpenetration of apocalyptic motifs in the stories of one cycle. There are the following apocalyptic motifs in the cycle of stories by Bulgakov: diseases, darkness (as part of the landscape, resurrection from the dead and beast. They all belong to the biblical type which is allocated on the basis of the associative bond of these motifs with the biblical texts.

  11. How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

    KAUST Repository

    Alam, Tanvir; Alazmi, Meshari; Gao, Xin; Arold, Stefan T.

    2014-01-01

    LD motifs (leucine-aspartic acidmotifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.

  12. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    Science.gov (United States)

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome

  13. Glycomic Analysis of Life Stages of the Human Parasite Schistosoma mansoni Reveals Developmental Expression Profiles of Functional and Antigenic Glycan Motifs.

    Science.gov (United States)

    Smit, Cornelis H; van Diepen, Angela; Nguyen, D Linh; Wuhrer, Manfred; Hoffmann, Karl F; Deelder, André M; Hokke, Cornelis H

    2015-07-01

    Glycans present on glycoproteins and glycolipids of the major human parasite Schistosoma mansoni induce innate as well as adaptive immune responses in the host. To be able to study the molecular characteristics of schistosome infections it is therefore required to determine the expression profiles of glycans and antigenic glycan-motifs during a range of critical stages of the complex schistosome lifecycle. We performed a longitudinal profiling study covering schistosome glycosylation throughout worm- and egg-development using a mass spectrometry-based glycomics approach. Our study revealed that during worm development N-glycans with Galβ1-4(Fucα1-3)GlcNAc (LeX) and core-xylose motifs were rapidly lost after cercariae to schistosomula transformation, whereas GalNAcβ1-4GlcNAc (LDN)-motifs gradually became abundant and predominated in adult worms. LeX-motifs were present on glycolipids up to 2 weeks of schistosomula development, whereas glycolipids with mono- and multifucosylated LDN-motifs remained present up to the adult worm stage. In contrast, expression of complex O-glycans diminished to undetectable levels within days after transformation. During egg development, a rich diversity of N-glycans with fucosylated motifs was expressed, but with α3-core fucose and a high degree of multifucosylated antennae only in mature eggs and miracidia. N-glycan antennae were exclusively LDN-based in miracidia. O-glycans in the mature eggs were also diverse and contained LeX- and multifucosylated LDN, but none of these were associated with miracidia in which we detected only the Galβ1-3(Galβ1-6)GalNAc core glycan. Immature eggs also exhibited short O-glycan core structures only, suggesting that complex fucosylated O-glycans of schistosome eggs are derived primarily from glycoproteins produced by the subshell envelope in the developed egg. Lipid glycans with multifucosylated GlcNAc repeats were present throughout egg development, but with the longer highly fucosylated

  14. ANALYSIS OF STABILITY OF TRINUCLEOTIDE TTC MOTIFS IN COMMON FLAX PLANTED IN THE CHERNOBYL AREA

    Directory of Open Access Journals (Sweden)

    Veronika Lancíková

    2015-02-01

    Full Text Available Flax (Linum usitatissimum L. is one of the oldest domesticated plants — it was cultivated as early as in ancient Egypt and Samaria 10,000 years ago to serve as a source of fiber and oil, whence it later spread around the world. Compared with other plants, the flax genome consists of a high number of repetitive sequences, middle repetitive sequences and small repetitive sequences of nucleotides. The aim of the study was to analyze the stability of the existing trinucleotides motifs of microsatellite DNA of the flax genome (genotype Kyivskyi, growing in the Chernobyl conditions. The Chernobyl area is the most extensive “natural” laboratory suitable for the study of radiation effects. Over the last 20 years, the researches collected important knowledge about the effects of low and high radiation doses on the DNA isolated from the plant material growing on the remediated fields near Chernobyl and the plant material from fields contaminated by radioactive cesium 137Cs and strontium 90Sr. Using eight pairs of microsatellite primers, we successfully amplified the samples from the remediated fields. For each primer in the control samples and remediated samples, we detected 1 to 3 fragments per locus, each in size up to 120 to 250 base pairs. The applied microsatellite primers confirmed the monomorphic condition of microsatellite loci.

  15. Characterization of hydrogen bonding motifs in proteins: hydrogen elimination monitoring by ultraviolet photodissociation mass spectrometry.

    Science.gov (United States)

    Morrison, Lindsay J; Chai, Wenrui; Rosenberg, Jake A; Henkelman, Graeme; Brodbelt, Jennifer S

    2017-08-02

    Determination of structure and folding of certain classes of proteins remains intractable by conventional structural characterization strategies and has spurred the development of alternative methodologies. Mass spectrometry-based approaches have a unique capacity to differentiate protein heterogeneity due to the ability to discriminate populations, whether minor or major, featuring modifications or complexation with non-covalent ligands on the basis of m/z. Cleavage of the peptide backbone can be further utilized to obtain residue-specific structural information. Here, hydrogen elimination monitoring (HEM) upon ultraviolet photodissociation (UVPD) of proteins transferred to the gas phase via nativespray ionization is introduced as an innovative approach to deduce backbone hydrogen bonding patterns. Using well-characterized peptides and a series of proteins, prediction of the engagement of the amide carbonyl oxygen of the protein backbone in hydrogen bonding using UVPD-HEM is demonstrated to show significant agreement with the hydrogen-bonding motifs derived from molecular dynamics simulations and X-ray crystal structures.

  16. oPOSSUM: integrated tools for analysis of regulatory motif over-representation

    Science.gov (United States)

    Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.

    2007-01-01

    The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675

  17. Identification of important nodes in directed biological networks: a network motif approach.

    Directory of Open Access Journals (Sweden)

    Pei Wang

    Full Text Available Identification of important nodes in complex networks has attracted an increasing attention over the last decade. Various measures have been proposed to characterize the importance of nodes in complex networks, such as the degree, betweenness and PageRank. Different measures consider different aspects of complex networks. Although there are numerous results reported on undirected complex networks, few results have been reported on directed biological networks. Based on network motifs and principal component analysis (PCA, this paper aims at introducing a new measure to characterize node importance in directed biological networks. Investigations on five real-world biological networks indicate that the proposed method can robustly identify actually important nodes in different networks, such as finding command interneurons, global regulators and non-hub but evolutionary conserved actually important nodes in biological networks. Receiver Operating Characteristic (ROC curves for the five networks indicate remarkable prediction accuracy of the proposed measure. The proposed index provides an alternative complex network metric. Potential implications of the related investigations include identifying network control and regulation targets, biological networks modeling and analysis, as well as networked medicine.

  18. Structures of Metalloporphyrin-Oligomer Multianions: Cofacial versus Coplanar Motifs as Resolved by Ion Mobility Spectrometry.

    Science.gov (United States)

    Brendle, Katrina; Schwarz, Ulrike; Jäger, Patrick; Weis, Patrick; Kappes, Manfred

    2016-11-03

    We have combined ion mobility mass spectrometry with quantum chemical calculations to investigate the gas-phase structures of multiply negatively charged oligomers of meso-tetra(4-sulfonatophenyl)metalloporphyrins comprising the divalent metal centers Zn II , Cu II , and Pd II . Sets of candidate structures were obtained by geometry optimizations based on calculations at both the semiempirical PM7 and density functional theory (DFT) levels. The corresponding theoretical cross sections were calculated with the projection approximation and also with the trajectory method. By comparing these collision cross sections with the respective experimental values we were able to assign oligomer structures up to the tetramer. In most cases the cross sections of the lowest energy isomers predicted by theory were found to agree with the measurements to within the experimental uncertainty (2%). Specifically, we find that for a given oligomer size the structures are independent of the metal center but depend strongly on the charge state. Oligomers in low charge states with a correspondingly larger number of sodium counterions tend to form stacked, cofacial structures reminiscent of H-aggregate motifs observed in solution. By contrast, in higher charge states, the stack opens to form coplanar structures.

  19. Design of polymer motifs for nucleic acid recognition and assembly stabilization

    Science.gov (United States)

    Zhou, Zhun

    This dissertation describes the synthesis and assembly of bio-functional polymers and the applications of these polymers to drug encapsulation, delivery, and multivalent biomimetic macromolecular recognition between synthetic polymer and nucleic acids. The main content is divided into three parts: (1) polyacidic domains as strongly stabilizing design elements for aqueous phase polyacrylate diblock assembly; (2) small molecule/polymer recognition triggered macromolecular assembly and drug encapsulation; (3) trizaine derivatized polymer as a novel class of "bifacial polymer nucleic acid" (bPoNA) and applications of bPoNA to nanoparticle loading of DNA/RNA, silencing delivery as well as control of aptamer function. Through the studies in part (1) and part (2), it was demonstrated that well-designed polymer motifs are not only able to enhance assemblies driven by non-specific hydrophobic effect, but are also able to direct assemblies based on specific recognitions. In part (3) of this dissertation, this concept was further extended by the design of polyacrylate polymers that are capable of discrete and robust hybridization with nucleic acids. This surprising finding demonstrated both fundamental and practical applications. Overall, these studies provided insights into the rational design elements for improving the bio-functions of synthetic polymers, and significantly expanded the scope of biological applications in which polymers synthesized via controlled radical polymerization may play a role.

  20. Statistically validated mobile communication networks: the evolution of motifs in European and Chinese data

    International Nuclear Information System (INIS)

    Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing; Palchykov, Vasyl; Kaski, Kimmo; Kertész, János; Miccichè, Salvatore; Tumminello, Michele; N Mantegna, Rosario

    2014-01-01

    Big data open up unprecedented opportunities for investigating complex systems, including society. In particular, communication data serve as major sources for computational social sciences, but they have to be cleaned and filtered as they may contain spurious information due to recording errors as well as interactions, like commercial and marketing activities, not directly related to the social network. The network constructed from communication data can only be considered as a proxy for the network of social relationships. Here we apply a systematic method, based on multiple-hypothesis testing, to statistically validate the links and then construct the corresponding Bonferroni network, generalized to the directed case. We study two large datasets of mobile phone records, one from Europe and the other from China. For both datasets we compare the raw data networks with the corresponding Bonferroni networks and point out significant differences in the structures and in the basic network measures. We show evidence that the Bonferroni network provides a better proxy for the network of social interactions than the original one. Using the filtered networks, we investigated the statistics and temporal evolution of small directed 3-motifs and concluded that closed communication triads have a formation time scale, which is quite fast and typically intraday. We also find that open communication triads preferentially evolve into other open triads with a higher fraction of reciprocated calls. These stylized facts were observed for both datasets. (paper)

  1. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze; Gehring, Christoph A; Irving, Helen R.

    2015-01-01

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  2. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze

    2015-06-09

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  3. Statistically validated mobile communication networks: the evolution of motifs in European and Chinese data

    Science.gov (United States)

    Li, Ming-Xia; Palchykov, Vasyl; Jiang, Zhi-Qiang; Kaski, Kimmo; Kertész, János; Miccichè, Salvatore; Tumminello, Michele; Zhou, Wei-Xing; Mantegna, Rosario N.

    2014-08-01

    Big data open up unprecedented opportunities for investigating complex systems, including society. In particular, communication data serve as major sources for computational social sciences, but they have to be cleaned and filtered as they may contain spurious information due to recording errors as well as interactions, like commercial and marketing activities, not directly related to the social network. The network constructed from communication data can only be considered as a proxy for the network of social relationships. Here we apply a systematic method, based on multiple-hypothesis testing, to statistically validate the links and then construct the corresponding Bonferroni network, generalized to the directed case. We study two large datasets of mobile phone records, one from Europe and the other from China. For both datasets we compare the raw data networks with the corresponding Bonferroni networks and point out significant differences in the structures and in the basic network measures. We show evidence that the Bonferroni network provides a better proxy for the network of social interactions than the original one. Using the filtered networks, we investigated the statistics and temporal evolution of small directed 3-motifs and concluded that closed communication triads have a formation time scale, which is quite fast and typically intraday. We also find that open communication triads preferentially evolve into other open triads with a higher fraction of reciprocated calls. These stylized facts were observed for both datasets.

  4. Through the Portal: Viking Motifs Incorporated in the Romanesque Style in Telemark, Norway

    Directory of Open Access Journals (Sweden)

    Kristine Ødeby

    2013-09-01

    Full Text Available This paper presents the results of an analysis of motifs identified on six carved wooden Romanesque portal panels from the Norwegian county of Telemark. The findings suggest that animal motifs in the Late Viking style survived long into the Late Medieval period and were reused on these medieval portals. Stylistically, late expressions of Viking animal art do not differ a great deal from those of the subsequent Romanesque style. However, their symbolical differences are considered to be significant. The motifs themselves, and the issue of whether the Romanesque style adopted motifs from pre-Christian art, have attracted less attention. The motif portraying Sigurd slaying the dragon is considered in depth. It will be suggested that Sigurd, serving as a mediator between the old and the new beliefs when he appeared in late Viking contexts, was given a new role when portrayed in Christian art. Metaphor and liminality are a central part of this paper, and the theories of Alfred Gell and Margrete Andås suggest that the portal itself affects those who pass through it, and that the iconography is meaningful from a liminal perspective.

  5. Spatiotemporal network motif reveals the biological traits of developmental gene regulatory networks in Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Kim Man-Sun

    2012-05-01

    Full Text Available Abstract Background Network motifs provided a “conceptual tool” for understanding the functional principles of biological networks, but such motifs have primarily been used to consider static network structures. Static networks, however, cannot be used to reveal time- and region-specific traits of biological systems. To overcome this limitation, we proposed the concept of a “spatiotemporal network motif,” a spatiotemporal sequence of network motifs of sub-networks which are active only at specific time points and body parts. Results On the basis of this concept, we analyzed the developmental gene regulatory network of the Drosophila melanogaster embryo. We identified spatiotemporal network motifs and investigated their distribution pattern in time and space. As a result, we found how key developmental processes are temporally and spatially regulated by the gene network. In particular, we found that nested feedback loops appeared frequently throughout the entire developmental process. From mathematical simulations, we found that mutual inhibition in the nested feedback loops contributes to the formation of spatial expression patterns. Conclusions Taken together, the proposed concept and the simulations can be used to unravel the design principle of developmental gene regulatory networks.

  6. Do motifs reflect evolved function?--No convergent evolution of genetic regulatory network subgraph topologies.

    Science.gov (United States)

    Knabe, Johannes F; Nehaniv, Chrystopher L; Schilstra, Maria J

    2008-01-01

    Methods that analyse the topological structure of networks have recently become quite popular. Whether motifs (subgraph patterns that occur more often than in randomized networks) have specific functions as elementary computational circuits has been cause for debate. As the question is difficult to resolve with currently available biological data, we approach the issue using networks that abstractly model natural genetic regulatory networks (GRNs) which are evolved to show dynamical behaviors. Specifically one group of networks was evolved to be capable of exhibiting two different behaviors ("differentiation") in contrast to a group with a single target behavior. In both groups we find motif distribution differences within the groups to be larger than differences between them, indicating that evolutionary niches (target functions) do not necessarily mold network structure uniquely. These results show that variability operators can have a stronger influence on network topologies than selection pressures, especially when many topologies can create similar dynamics. Moreover, analysis of motif functional relevance by lesioning did not suggest that motifs were of greater importance to the functioning of the network than arbitrary subgraph patterns. Only when drastically restricting network size, so that one motif corresponds to a whole functionally evolved network, was preference for particular connection patterns found. This suggests that in non-restricted, bigger networks, entanglement with the rest of the network hinders topological subgraph analysis.

  7. The position of the Gly-xxx-Gly motif in transmembrane segments modulates dimer affinity.

    Science.gov (United States)

    Johnson, Rachel M; Rath, Arianna; Deber, Charles M

    2006-12-01

    Although the intrinsic low solubility of membrane proteins presents challenges to their high-resolution structure determination, insight into the amino acid sequence features and forces that stabilize their folds has been provided through study of sequence-dependent helix-helix interactions between single transmembrane (TM) helices. While the stability of helix-helix partnerships mediated by the Gly-xxx-Gly (GG4) motif is known to be generally modulated by distal interfacial residues, it has not been established whether the position of this motif, with respect to the ends of a given TM segment, affects dimer affinity. Here we examine the relationship between motif position and affinity in the homodimers of 2 single-spanning membrane protein TM sequences: glycophorin A (GpA) and bacteriophage M13 coat protein (MCP). Using the TOXCAT assay for dimer affinity on a series of GpA and MCP TM segments that have been modified with either 4 Leu residues at each end or with 8 Leu residues at the N-terminal end, we show that in each protein, centrally located GG4 motifs are capable of stronger helix-helix interactions than those proximal to TM helix ends, even when surrounding interfacial residues are maintained. The relative importance of GG4 motifs in stabilizing helix-helix interactions therefore must be considered not only in its specific residue context but also in terms of the location of the interactive surface relative to the N and C termini of alpha-helical TM segments.

  8. Identification of helix capping and {beta}-turn motifs from NMR chemical shifts

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang; Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2012-03-15

    We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and {sup 13}C{sup {beta}} chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of {beta}-turns: I, II, I Prime , II Prime and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and {beta}-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7-0.9 for the Matthews correlation coefficient of its predictions far exceed those attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures.

  9. EEVD motif of heat shock cognate protein 70 contributes to bacterial uptake by trophoblast giant cells

    Directory of Open Access Journals (Sweden)

    Kim Suk

    2009-12-01

    Full Text Available Abstract Background The uptake of abortion-inducing pathogens by trophoblast giant (TG cells is a key event in infectious abortion. However, little is known about phagocytic functions of TG cells against the pathogens. Here we show that heat shock cognate protein 70 (Hsc70 contributes to bacterial uptake by TG cells and the EEVD motif of Hsc70 plays an important role in this. Methods Brucella abortus and Listeria monocytogenes were used as the bacterial antigen in this study. Recombinant proteins containing tetratricopeptide repeat (TPR domains were constructed and confirmation of the binding capacity to Hsc70 was assessed by ELISA. The recombinant TPR proteins were used for investigation of the effect of TPR proteins on bacterial uptake by TG cells and on pregnancy in mice. Results The monoclonal antibody that inhibits bacterial uptake by TG cells reacted with the EEVD motif of Hsc70. Bacterial TPR proteins bound to the C-terminal of Hsc70 through its EEVD motif and this binding inhibited bacterial uptake by TG cells. Infectious abortion was also prevented by blocking the EEVD motif of Hsc70. Conclusions Our results demonstrate that surface located Hsc70 on TG cells mediates the uptake of pathogenic bacteria and proteins containing the TPR domain inhibit the function of Hsc70 by binding to its EEVD motif. These molecules may be useful in the development of methods for preventing infectious abortion.

  10. Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.

    Science.gov (United States)

    Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D

    2003-08-15

    DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.

  11. OSR1 regulates a subset of inward rectifier potassium channels via a binding motif variant.

    Science.gov (United States)

    Taylor, Clinton A; An, Sung-Wan; Kankanamalage, Sachith Gallolu; Stippec, Steve; Earnest, Svetlana; Trivedi, Ashesh T; Yang, Jonathan Zijiang; Mirzaei, Hamid; Huang, Chou-Long; Cobb, Melanie H

    2018-04-10

    The with-no-lysine (K) (WNK) signaling pathway to STE20/SPS1-related proline- and alanine-rich kinase (SPAK) and oxidative stress-responsive 1 (OSR1) kinase is an important mediator of cell volume and ion transport. SPAK and OSR1 associate with upstream kinases WNK 1-4, substrates, and other proteins through their C-terminal domains which interact with linear R-F-x-V/I sequence motifs. In this study we find that SPAK and OSR1 also interact with similar affinity with a motif variant, R-x-F-x-V/I. Eight of 16 human inward rectifier K + channels have an R-x-F-x-V motif. We demonstrate that two of these channels, Kir2.1 and Kir2.3, are activated by OSR1, while Kir4.1, which does not contain the motif, is not sensitive to changes in OSR1 or WNK activity. Mutation of the motif prevents activation of Kir2.3 by OSR1. Both siRNA knockdown of OSR1 and chemical inhibition of WNK activity disrupt NaCl-induced plasma membrane localization of Kir2.3. Our results suggest a mechanism by which WNK-OSR1 enhance Kir2.1 and Kir2.3 channel activity by increasing their plasma membrane localization. Regulation of members of the inward rectifier K + channel family adds functional and mechanistic insight into the physiological impact of the WNK pathway.

  12. DXD Motif-Dependent and -Independent Effects of the Chlamydia trachomatis Cytotoxin CT166

    Directory of Open Access Journals (Sweden)

    Miriam Bothe

    2015-02-01

    Full Text Available The Gram-negative, intracellular bacterium Chlamydia trachomatis causes acute and chronic urogenital tract infection, potentially leading to infertility and ectopic pregnancy. The only partially characterized cytotoxin CT166 of serovar D exhibits a DXD motif, which is important for the enzymatic activity of many bacterial and mammalian type A glycosyltransferases, leading to the hypothesis that CT166 possess glycosyltransferase activity. CT166-expressing HeLa cells exhibit actin reorganization, including cell rounding, which has been attributed to the inhibition of the Rho-GTPases Rac/Cdc42. Exploiting the glycosylation-sensitive Ras(27H5 antibody, we here show that CT166 induces an epitope change in Ras, resulting in inhibited ERK and PI3K signaling and delayed cell cycle progression. Consistent with the hypothesis that these effects strictly depend on the DXD motif, CT166 with the mutated DXD motif causes neither Ras-ERK inhibition nor delayed cell cycle progression. In contrast, CT166 with the mutated DXD motif is still capable of inhibiting cell migration, suggesting that CT166 with the mutated DXD motif cannot be regarded as inactive in any case. Taken together, CT166 affects various fundamental cellular processes, strongly suggesting its importance for the intracellular survival of chlamydia.

  13. Identification of helix capping and β-turn motifs from NMR chemical shifts

    International Nuclear Information System (INIS)

    Shen Yang; Bax, Ad

    2012-01-01

    We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and 13 C β chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of β-turns: I, II, I′, II′ and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and β-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7–0.9 for the Matthews correlation coefficient of its predictions far exceed those attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures.

  14. The 3,7-diazabicyclo[3.3.1]nonane scaffold for subtype selective nicotinic acetylcholine receptor ligands. Part 2: carboxamide derivatives with different spacer motifs.

    Science.gov (United States)

    Eibl, Christoph; Munoz, Lenka; Tomassoli, Isabelle; Stokes, Clare; Papke, Roger L; Gündisch, Daniela

    2013-12-01

    3,7-Diazabicyclo[3.3.1]nonane (bispidine) based nicotinic acetylcholine receptor (nAChR) ligands have been synthesized and evaluated for nAChRs interaction. Diverse spacer motifs were incorporated between the hydrogen bond acceptor (HBA) part and a variety of substituted (hetero)aryl moieties. Bispidine carboxamides bearing spacer motifs often showed high affinity in the low nanomolar range and selectivity for the α4β2(∗) nAChR. Compounds 15, 25, and 47 with Ki values of about 1 nM displayed the highest affinities for α4β2(∗) nAChR. All evaluated compounds are partial agonists or antagonists at α4β2(∗), with reduced or no effects on α3β4(∗) with the exception of compound 15 (agonist), and reduced or no effect at α7 and muscle subtypes. Copyright © 2013 Elsevier Ltd. All rights reserved.

  15. Molecular dynamics simulations of electrostatics and hydration distributions around RNA and DNA motifs

    Science.gov (United States)

    Marlowe, Ashley E.; Singh, Abhishek; Semichaevsky, Andrey V.; Yingling, Yaroslava G.

    2009-03-01

    Nucleic acid nanoparticles can self-assembly through the formation of complementary loop-loop interactions or stem-stem interactions. Presence and concentration of ions can significantly affect the self-assembly process and the stability of the nanostructure. In this presentation we use explicit molecular dynamics simulations to examine the variations in cationic distributions and hydration environment around DNA and RNA helices and loop-loop interactions. Our simulations show that the potassium and sodium ionic distributions are different around RNA and DNA motifs which could be indicative of ion mediated relative stability of loop-loop complexes. Moreover in RNA loop-loop motifs ions are consistently present and exchanged through a distinct electronegative channel. We will also show how we used the specific RNA loop-loop motif to design a RNA hexagonal nanoparticle.

  16. PDL1 Signals through Conserved Sequence Motifs to Overcome Interferon-Mediated Cytotoxicity

    Directory of Open Access Journals (Sweden)

    Maria Gato-Cañas

    2017-08-01

    Full Text Available PDL1 blockade produces remarkable clinical responses, thought to occur by T cell reactivation through prevention of PDL1-PD1 T cell inhibitory interactions. Here, we find that PDL1 cell-intrinsic signaling protects cancer cells from interferon (IFN cytotoxicity and accelerates tumor progression. PDL1 inhibited IFN signal transduction through a conserved class of sequence motifs that mediate crosstalk with IFN signaling. Abrogation of PDL1 expression or antibody-mediated PDL1 blockade strongly sensitized cancer cells to IFN cytotoxicity through a STAT3/caspase-7-dependent pathway. Moreover, somatic mutations found in human carcinomas within these PDL1 sequence motifs disrupted motif regulation, resulting in PDL1 molecules with enhanced protective activities from type I and type II IFN cytotoxicity. Overall, our results reveal a mode of action of PDL1 in cancer cells as a first line of defense against IFN cytotoxicity.

  17. Discovery of a Regulatory Motif for Human Satellite DNA Transcription in Response to BATF2 Overexpression.

    Science.gov (United States)

    Bai, Xuejia; Huang, Wenqiu; Zhang, Chenguang; Niu, Jing; Ding, Wei

    2016-03-01

    One of the basic leucine zipper transcription factors, BATF2, has been found to suppress cancer growth and migration. However, little is known about the genes downstream of BATF2. HeLa cells were stably transfected with BATF2, then chromatin immunoprecipitation-sequencing was employed to identify the DNA motifs responsive to BATF2. Comprehensive bioinformatics analyses indicated that the most significant motif discovered as TTCCATT[CT]GATTCCATTC[AG]AT was primarily distributed among the chromosome centromere regions and mostly within human type II satellite DNA. Such motifs were able to prime the transcription of type II satellite DNA in a directional and asymmetrical manner. Consistently, satellite II transcription was up-regulated in BATF2-overexpressing cells. The present study provides insight into understanding the role of BATF2 in tumours and the importance of satellite DNA in the maintenance of genomic stability. Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  18. A Woman Voice in an Epic: Tracing Gendered Motifs in Anne Vabarna's Peko

    Directory of Open Access Journals (Sweden)

    Andreas Kalkun

    2008-12-01

    Full Text Available In the article the gendered motifs found in Anne Vabarna’s Seto epic Peko are analysed. Besides the narrative telling of the life of the male hero, the motives regarding eating, refusing to eat or offering food, and the aspect of the female body or its control deserve to be noticed. These scenes do not communicate the main plot, they are often related to minor characters of the epic and slow down the narrative, but at the same time they clearly carry artistic purpose and meaning. I consider these motifs, present in the liminal parts of the epic, to be the dominant symbols of the epic where the author’s feminine world is being exposed. Observing these motifs of Peko in the context of Seto religious worldview, the life of Anne Vabarna and the social position of Seto women, the symbols become eloquent and informative.

  19. The human Ago2 MC region does not contain an eIF4E-like mRNA cap binding motif

    Directory of Open Access Journals (Sweden)

    Grishin Nick V

    2009-01-01

    Full Text Available Abstract Background Argonaute (Ago proteins interact with small regulatory RNAs to mediate gene regulatory pathways. A recent report by Kiriakidou et al. 1 describes an MC sequence region identified in Ago2 that displays similarity to the cap-binding motif in translation initiation factor 4E (eIF4E. In a cap-bound eIF4E structure, two important aromatic residues of the motif stack on either side of a 7-methylguanosine 5'-triphosphate (m7Gppp base. The corresponding Ago2 aromatic residues (F450 and F505 were hypothesized to perform the same cap-binding function. However, the detected similarity between the MC sequence and the eIF4E cap-binding motif was questionable. Results A number of sequence-based and structure-based bioinformatics methods reveal the reported similarity between the Ago2 MC sequence region and the eIF4E cap-binding motif to be spurious. Alternatively, the MC sequence region is confidently assigned to the N-terminus of the Ago piwi module, within the mid domain of experimentally determined prokaryotic Ago structures. Confident mapping of the Ago2 MC sequence region to the piwi mid domain results in a homology-based structure model that positions the identified aromatic residues over 20 Å apart, with one of the aromatic side chains (F450 contributing instead to the hydrophobic core of the domain. Conclusion Correct functional prediction based on weak sequence similarity requires substantial evolutionary and structural support. The evolutionary context of the Ago mid domain suggested by multiple sequence alignment is limited to a conserved hydrophobicity profile required for the fold and a motif following the MC region that binds guide RNA. Mapping of the MC sequence to the mid domain structure reveals Ago2 aromatics that are incompatible with eIF4E-like mRNA cap-binding, yet display some limited local structure similarities that cause the chance sequence match to eIF4E. Reviewers This article was reviewed by Arcady Mushegian

  20. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    Full Text Available To know the map between transcription factors (TFs and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%-far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  1. A Conserved Metal Binding Motif in the Bacillus subtilis Competence Protein ComFA Enhances Transformation.

    Science.gov (United States)

    Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M

    2017-08-01

    Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.

  2. Memfasilitasi Penalaran Geometri Transformasi Siswa Melalui Eksplorasi Motif Melayu dengan Bantuan Grid

    Directory of Open Access Journals (Sweden)

    Febrian Febrian

    2017-10-01

    Full Text Available Geometri transformasi merupakan pengetahuan yang krusial dalam geometri yang dapat membangun banyak kemampuan lainnya seperti penalaran matematis. Oleh karena itu, geometri transformasi disarankan untuk diberikan pada pebelajar mulai dari usia dini. Penelitian terdahulu menunjukkan bahwa anak-anak memiliki sense untuk melihat karakteristik kedinamisan pada benda, oleh karena itu memfasilitasi pembelajaran yang dapat memanfaatkan sense ini menjadi sangat penting untuk membangun pemahaman geometri transformasi. Penelitian design research ini bertujuan untuk memfasilitasi siswa sekolah dasar untuk dapat mengembangkan pengetahuan awal mereka mengenai komposisi transformasi. Subjek penelitian adalah siswa kelas IV Sekolah Dasar Negeri 001 Toapaya, Kabupaten Bintan, Kepulauan Riau. Pendekatan pembelajaran yang digunakan adalah PMRI dengan konteks motif melayu itik pulang petang dengan bantuan grid. Hasil menunjukkan bahwa setting pembelajaran dapat memfasilitasi penalaran geometri transformasi melalui kegiatan eksplorasi motif dengan bantuan grid. Kata Kunci: komposisi transformasi, penalaran, motif melayu, grid, PMRI Transformation geometry is a crucial knowledge in geometry that can emerge many skills especially mathematical reasoning. Therefore, transformation geometry is suggested to be taught to children especially the young learners. Existing research implies that children have particular sense to see dynamic characteristic of an object or others. On the behalf of this statement, facilitating students in learning process that makes use of this students sense becomes important to undertake to help develop students reasoning of transformation geometry. The subtopic being highlighted is the composition of transformation. This design research aims to facilitate this situation. The subject of the research is fourth graders of the State Elementary School of 001 at Toapaya, Kabupaten Bintan, Kepulauan Riau. The learning approach used was PMRI by using

  3. A Novel Dual-cre Motif Enables Two-Way Autoregulation of CcpA in Clostridium acetobutylicum.

    Science.gov (United States)

    Zhang, Lu; Liu, Yanqiang; Yang, Yunpeng; Jiang, Weihong; Gu, Yang

    2018-04-15

    The master regulator CcpA (catabolite control protein A) manages a large and complex regulatory network that is essential for cellular physiology and metabolism in Gram-positive bacteria. Although CcpA can affect the expression of target genes by binding to a cis -acting catabolite-responsive element ( cre ), whether and how the expression of CcpA is regulated remain poorly explored. Here, we report a novel dual- cre motif that is employed by the CcpA in Clostridium acetobutylicum , a typical solventogenic Clostridium species, for autoregulation. Two cre sites are involved in CcpA autoregulation, and they reside in the promoter and coding regions of CcpA. In this dual- cre motif, cre P , in the promoter region, positively regulates ccpA transcription, whereas cre ORF , in the coding region, negatively regulates this transcription, thus enabling two-way autoregulation of CcpA. Although CcpA bound cre P more strongly than cre ORF in vitro , the in vivo assay showed that cre ORF -based repression dominates CcpA autoregulation during the entire fermentation. Finally, a synonymous mutation of cre ORF was made within the coding region, achieving an increased intracellular CcpA expression and improved cellular performance. This study provides new insights into the regulatory role of CcpA in C. acetobutylicum and, moreover, contributes a new engineering strategy for this industrial strain. IMPORTANCE CcpA is known to be a key transcription factor in Gram-positive bacteria. However, it is still unclear whether and how the intracellular CcpA level is regulated, which may be essential for maintaining normal cell physiology and metabolism. We discovered here that CcpA employs a dual- cre motif to autoregulate, enabling dynamic control of its own expression level during the entire fermentation process. This finding answers the questions above and fills a void in our understanding of the regulatory network of CcpA. Interference in CcpA autoregulation leads to improved cellular

  4. Examples of the Motif of the Shrew in European Literature and Film

    OpenAIRE

    Vasvári, Louise O.

    2001-01-01

    In her article "Examples of the Motif of the Shrew in European Literature and Film" Louise O. Vasvári presents the shrew-taming story as a masterplot of both Eastern and Western folklore and literature concerned with establishing the appropriate power dynamic between a married couple. Vasvári firts reviews the comparative groundwork of the story she has documented in her earlier studies of the topic. In addition to tracing the bundle of motifs that make up the shrew story from medieval Arabic...

  5. The conjugal-bed motif in the Alcestis Barcinonensis: two notes

    Directory of Open Access Journals (Sweden)

    Rosario Moreno Soldevila

    2011-06-01

    Full Text Available This paper focuses on the centrality occupied by the conjugal-bed motif in the anonymous poem known as Alcestis Barcinonensis, in the light of which two new interpretations of lines 21-22 and 83-85 are provided. In the first passage, beato … toro should be read as a subtle allusion to marital love, one of the central themes of the poem; in the second, uestigia alludes to a well-known literary motif related to the bed of love, thus providing a more accurate interpretation of the post mortem fidelity which Alcestis demands from her husband.

  6. Motif trie: An efficient text index for pattern discovery with don't cares

    DEFF Research Database (Denmark)

    Grossi, Roberto; Menconi, Giulia; Pisanti, Nadia

    2017-01-01

    We introduce the motif trie data structure, which has applications in pattern matching and discovery in genomic analysis, plagiarism detection, data mining, intrusion detection, spam fighting and time series analysis, to name a few. Here the extraction of recurring patterns in sequential and text......We introduce the motif trie data structure, which has applications in pattern matching and discovery in genomic analysis, plagiarism detection, data mining, intrusion detection, spam fighting and time series analysis, to name a few. Here the extraction of recurring patterns in sequential...

  7. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo; Jankovic, Boris R.; Bajic, Vladimir B.; Song, Le; Gao, Xin

    2013-01-01

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  8. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  9. Recurring polyhedral motifs in the amorphous indium gallium zinc oxide network

    Energy Technology Data Exchange (ETDEWEB)

    Divya; Deepak [Department of Materials Science and Engineering, Indian Institute of Technology, Kanpur (India); National Center for Flexible Electronics, Indian Institute of Technology, Kanpur (India); Prasad, Rajendra [Department of Physics, Indian Institute of Technology, Kanpur (India)

    2017-02-15

    The coordination polyhedra around the cations are the building blocks of ionic solids. For amorphous InGaZn oxide (a-IGZO), these coordination polyhedra are identified to investigate properties that depend on short range interactions. Therefore, in this first principles based study, a large number (10) of samples of a-IGZO were prepared by ab initio melt-and-quench molecular dynamics, so that several distinct samples of the amorphous landscape are obtained corresponding to local minima in energy. Based on a method of comparing bond angles between metal and oxygen atoms, the identified polyhedra were matched to the polyhedral motifs present in the related crystalline systems, such as, InGaZnO{sub 4}, In{sub 2}O{sub 3}, Ga{sub 2}O{sub 3}, and ZnO. Consequently, we find, the a-IGZO primarily consists of the following polyhedra: a tetrahedron from space group 199 and an octahedron from space group 206 of In{sub 2}O{sub 3}; a tetrahedron from space group 12 and an octahedron from space group 167 of Ga{sub 2}O{sub 3}; a tetrahedron from space group 186 of ZnO; zinc and gallium trigonal bipyramids from c-IGZO; and one zinc fourfold, one zinc fivefold, and one indium fivefold coordination polyhedra that occur only in the amorphous phase. Thus, we were able to reduce the description of structure from 360 to 10 groups of polyhedra. (copyright 2017 WILEY-VCH Verlag GmbH and Co. KGaA, Weinheim)

  10. Circuit motifs for contrast-adaptive differentiation in early sensory systems: the role of presynaptic inhibition and short-term plasticity.

    Science.gov (United States)

    Zhang, Danke; Wu, Si; Rasch, Malte J

    2015-01-01

    In natural signals, such as the luminance value across of a visual scene, abrupt changes in intensity value are often more relevant to an organism than intensity values at other positions and times. Thus to reduce redundancy, sensory systems are specialized to detect the times and amplitudes of informative abrupt changes in the input stream rather than coding the intensity values at all times. In theory, a system that responds transiently to fast changes is called a differentiator. In principle, several different neural circuit mechanisms exist that are capable of responding transiently to abrupt input changes. However, it is unclear which circuit would be best suited for early sensory systems, where the dynamic range of the natural input signals can be very wide. We here compare the properties of different simple neural circuit motifs for implementing signal differentiation. We found that a circuit motif based on presynaptic inhibition (PI) is unique in a sense that the vesicle resources in the presynaptic site can be stably maintained over a wide range of stimulus intensities, making PI a biophysically plausible mechanism to implement a differentiator with a very wide dynamical range. Moreover, by additionally considering short-term plasticity (STP), differentiation becomes contrast adaptive in the PI-circuit but not in other potential neural circuit motifs. Numerical simulations show that the behavior of the adaptive PI-circuit is consistent with experimental observations suggesting that adaptive presynaptic inhibition might be a good candidate neural mechanism to achieve differentiation in early sensory systems.

  11. Analysis of alkaptonuria (AKU) mutations and polymorphisms reveals that the CCC sequence motif is a mutational hot spot in the homogentisate 1,2 dioxygenase gene (HGO).

    Science.gov (United States)

    Beltrán-Valero de Bernabé, D; Jimenez, F J; Aquaron, R; Rodríguez de Córdoba, S

    1999-01-01

    We recently showed that alkaptonuria (AKU) is caused by loss-of-function mutations in the homogentisate 1,2 dioxygenase gene (HGO). Herein we describe haplotype and mutational analyses of HGO in seven new AKU pedigrees. These analyses identified two novel single-nucleotide polymorphisms (INV4+31A-->G and INV11+18A-->G) and six novel AKU mutations (INV1-1G-->A, W60G, Y62C, A122D, P230T, and D291E), which further illustrates the remarkable allelic heterogeneity found in AKU. Reexamination of all 29 mutations and polymorphisms thus far described in HGO shows that these nucleotide changes are not randomly distributed; the CCC sequence motif and its inverted complement, GGG, are preferentially mutated. These analyses also demonstrated that the nucleotide substitutions in HGO do not involve CpG dinucleotides, which illustrates important differences between HGO and other genes for the occurrence of mutation at specific short-sequence motifs. Because the CCC sequence motifs comprise a significant proportion (34.5%) of all mutated bases that have been observed in HGO, we conclude that the CCC triplet is a mutational hot spot in HGO. PMID:10205262

  12. ATP-binding motifs play key roles in Krp1p, kinesin-related protein 1, function for bi-polar growth control in fission yeast

    International Nuclear Information System (INIS)

    Rhee, Dong Keun; Cho, Bon A; Kim, Hyong Bai

    2005-01-01

    Kinesin is a microtubule-based motor protein with various functions related to the cell growth and division. It has been reported that Krp1p, kinesin-related protein 1, which belongs to the kinesin heavy chain superfamily, localizes on microtubules and may play an important role in cytokinesis. However, the function of Krp1p has not been fully elucidated. In this study, we overexpressed an intact form and three different mutant forms of Krp1p in fission yeast constructed by site-directed mutagenesis in two ATP-binding motifs or by truncation of the leucine zipper-like motif (LZiP). We observed hyper-extended microtubules and the aberrant nuclear shape in Krp1p-overexpressed fission yeast. As a functional consequence, a point mutation of ATP-binding domain 1 (G89E) in Krp1p reversed the effect of Krp1p overexpression in fission yeast, whereas the specific mutation in ATP-binding domain 2 (G238E) resulted in the altered cell polarity. Additionally, truncation of the leucine zipper-like domain (LZiP) at the C-terminal of Krp1p showed a normal nuclear division. Taken together, we suggest that krp1p is involved in regulation of cell-polarized growth through ATP-binding motifs in fission yeast

  13. Use of a Yeast tRNase Killer Toxin to Diagnose Kti12 Motifs Required for tRNA Modification by Elongator.

    Science.gov (United States)

    Mehlgarten, Constance; Prochaska, Heike; Hammermeister, Alexander; Abdel-Fattah, Wael; Wagner, Melanie; Krutyhołowa, Rościsław; Jun, Sang Eun; Kim, Gyung-Tae; Glatt, Sebastian; Breunig, Karin D; Stark, Michael J R; Schaffrath, Raffael

    2017-09-05

    Saccharomyces cerevisiae cells are killed by zymocin, a tRNase ribotoxin complex from Kluyveromyces lactis , which cleaves anticodons and inhibits protein synthesis. Zymocin's action requires specific chemical modification of uridine bases in the anticodon wobble position (U34) by the Elongator complex (Elp1-Elp6). Hence, loss of anticodon modification in mutants lacking Elongator or related KTI ( K. lactis Toxin Insensitive) genes protects against tRNA cleavage and confers resistance to the toxin. Here, we show that zymocin can be used as a tool to genetically analyse KTI12 , a gene previously shown to code for an Elongator partner protein. From a kti12 mutant pool of zymocin survivors, we identify motifs in Kti12 that are functionally directly coupled to Elongator activity. In addition, shared requirement of U34 modifications for nonsense and missense tRNA suppression ( SUP4 ; SOE1 ) strongly suggests that Kti12 and Elongator cooperate to assure proper tRNA functioning. We show that the Kti12 motifs are conserved in plant ortholog DRL1/ELO4 from Arabidopsis thaliana and seem to be involved in binding of cofactors (e.g., nucleotides, calmodulin). Elongator interaction defects triggered by mutations in these motifs correlate with phenotypes typical for loss of U34 modification. Thus, tRNA modification by Elongator appears to require physical contact with Kti12, and our preliminary data suggest that metabolic signals may affect proper communication between them.

  14. A single amino-acid change in a highly conserved motif of gp41 elicits HIV-1 neutralization and protects against CD4 depletion.

    Science.gov (United States)

    Petitdemange, Caroline; Achour, Abla; Dispinseri, Stefania; Malet, Isabelle; Sennepin, Alexis; Ho Tsong Fang, Raphaël; Crouzet, Joël; Marcelin, Anne-Geneviève; Calvez, Vincent; Scarlatti, Gabriella; Debré, Patrice; Vieillard, Vincent

    2013-09-01

    The induction of neutralizing antibodies against conserved regions of the human immunodeficiency virus type 1 (HIV-1) envelope protein is a major goal of vaccine strategies. We previously identified 3S, a critical conserved motif of gp41 that induces the NKp44L ligand of an activating NK receptor. In vivo, anti-3S antibodies protect against the natural killer (NK) cell-mediated CD4 depletion that occurs without efficient viral neutralization. Specific substitutions within the 3S peptide motif were prepared by directed mutagenesis. Virus production was monitored by measuring the p24 production. Neutralization assays were performed with immune-purified antibodies from immunized mice and a cohort of HIV-infected patients. Expression of NKp44L on CD4(+) T cells and degranulation assay on activating NK cells were both performed by flow cytometry. Here, we show that specific substitutions in the 3S motif reduce viral infection without affecting gp41 production, while decreasing both its capacity to induce NKp44L expression on CD4(+) T cells and its sensitivity to autologous NK cells. Generation of antibodies in mice against the W614 specific position in the 3S motif elicited a capacity to neutralize cross-clade viruses, notable in its magnitude, breadth, and durability. Antibodies against this 3S variant were also detected in sera from some HIV-1-infected patients, demonstrating both neutralization activity and protection against CD4 depletion. These findings suggest that a specific substitution in a 3S-based immunogen might allow the generation of specific antibodies, providing a foundation for a rational vaccine that combine a capacity to neutralize HIV-1 and to protect CD4(+) T cells.

  15. Loop Entropy Assists Tertiary Order: Loopy Stabilization of Stacking Motifs

    Directory of Open Access Journals (Sweden)

    Daniel P. Aalberts

    2011-11-01

    Full Text Available The free energy of an RNA fold is a combination of favorable base pairing and stacking interactions competing with entropic costs of forming loops. Here we show how loop entropy, surprisingly, can promote tertiary order. A general formula for the free energy of forming multibranch and other RNA loops is derived with a polymer-physics based theory. We also derive a formula for the free energy of coaxial stacking in the context of a loop. Simulations support the analytic formulas. The effects of stacking of unpaired bases are also studied with simulations.

  16. The primary structure of L37--a rat ribosomal protein with a zinc finger-like motif.

    Science.gov (United States)

    Chan, Y L; Paz, V; Olvera, J; Wool, I G

    1993-04-30

    The amino acid sequence of the rat 60S ribosomal subunit protein L37 was deduced from the sequence of nucleotides in a recombinant cDNA. Ribosomal protein L37 has 96 amino acids, the NH2-terminal methionine is removed after translation of the mRNA, and has a molecular weight of 10,939. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 13 or 14 copies of the L37 gene. The mRNA for the protein is about 500 nucleotides in length. Rat L37 is related to Saccharomyces cerevisiae ribosomal protein YL35 and to Caenorhabditis elegans L37. We have identified in the data base a DNA sequence that encodes the chicken homolog of rat L37.

  17. GPS 2.1: enhanced prediction of kinase-specific phosphorylation sites with an algorithm of motif length selection.

    Science.gov (United States)

    Xue, Yu; Liu, Zexian; Cao, Jun; Ma, Qian; Gao, Xinjiao; Wang, Qingqi; Jin, Changjiang; Zhou, Yanhong; Wen, Longping; Ren, Jian

    2011-03-01

    As the most important post-translational modification of proteins, phosphorylation plays essential roles in all aspects of biological processes. Besides experimental approaches, computational prediction of phosphorylated proteins with their kinase-specific phosphorylation sites has also emerged as a popular strategy, for its low-cost, fast-speed and convenience. In this work, we developed a kinase-specific phosphorylation sites predictor of GPS 2.1 (Group-based Prediction System), with a novel but simple approach of motif length selection (MLS). By this approach, the robustness of the prediction system was greatly improved. All algorithms in GPS old versions were also reserved and integrated in GPS 2.1. The online service and local packages of GPS 2.1 were implemented in JAVA 1.5 (J2SE 5.0) and freely available for academic researches at: http://gps.biocuckoo.org.

  18. Comprehensive Identification of Glycated Peptides and Their Glycation Motifs in Plasma and Erythrocytes of Control and Diabetic Subjects

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Qibin; Monroe, Matthew E.; Schepmoes, Athena A.; Clauss, Therese RW; Gritsenko, Marina A.; Meng, Da; Petyuk, Vladislav A.; Smith, Richard D.; Metz, Thomas O.

    2011-07-01

    Non-enzymatic glycation of proteins is implicated in diabetes mellitus and its related complications. In this report, we extend our previous development and refinement of proteomics-based methods for the analysis of non-enzymatically glycated proteins to comprehensively identify glycated proteins in normal and diabetic human plasma and erythrocytes. Using immunodepletion, enrichment, and fractionation strategies, we identified 7749 unique glycated peptides, corresponding to 3742 unique glycated proteins. Semi-quantitative comparisons revealed a number of proteins with glycation levels significantly increased in diabetes relative to control samples and that erythrocyte proteins are more extensively glycated than plasma proteins. A glycation motif analysis revealed amino acids that are favored more than others in the protein primary structures in the vicinity of the glycation sites in both sample types. The glycated peptides and corresponding proteins reported here provide a foundation for the potential identification of novel markers for diabetes, glycemia, or diabetic complications.

  19. Topological side-chain classification of beta-turns: ideal motifs for peptidomimetic development.

    Science.gov (United States)

    Tran, Tran Trung; McKie, Jim; Meutermans, Wim D F; Bourne, Gregory T; Andrews, Peter R; Smythe, Mark L

    2005-08-01

    Beta-turns are important topological motifs for biological recognition of proteins and peptides. Organic molecules that sample the side chain positions of beta-turns have shown broad binding capacity to multiple different receptors, for example benzodiazepines. Beta-turns have traditionally been classified into various types based on the backbone dihedral angles (phi2, psi2, phi3 and psi3). Indeed, 57-68% of beta-turns are currently classified into 8 different backbone families (Type I, Type II, Type I', Type II', Type VIII, Type VIa1, Type VIa2 and Type VIb and Type IV which represents unclassified beta-turns). Although this classification of beta-turns has been useful, the resulting beta-turn types are not ideal for the design of beta-turn mimetics as they do not reflect topological features of the recognition elements, the side chains. To overcome this, we have extracted beta-turns from a data set of non-homologous and high-resolution protein crystal structures. The side chain positions, as defined by C(alpha)-C(beta) vectors, of these turns have been clustered using the kth nearest neighbor clustering and filtered nearest centroid sorting algorithms. Nine clusters were obtained that cluster 90% of the data, and the average intra-cluster RMSD of the four C(alpha)-C(beta) vectors is 0.36. The nine clusters therefore represent the topology of the side chain scaffold architecture of the vast majority of beta-turns. The mean structures of the nine clusters are useful for the development of beta-turn mimetics and as biological descriptors for focusing combinatorial chemistry towards biologically relevant topological space.

  20. The regulatory G4 motif of the Kirsten ras (KRAS) gene is sensitive to guanine oxidation

    DEFF Research Database (Denmark)

    Cogoi, Susanna; Ferino, Annalisa; Miglietta, Giulia

    2018-01-01

    KRAS is one of the most mutated genes in human cancer. It is controlled by a G4 motif located upstream of the transcription start site. In this paper, we demonstrate that 8-oxoguanine (8-oxoG), being more abundant in G4 than in non-G4 regions, is a new player in the regulation of this oncogene. W...

  1. Functional motifs responsible for human metapneumovirus M2-2-mediated innate immune evasion.

    Science.gov (United States)

    Chen, Yu; Deng, Xiaoling; Deng, Junfang; Zhou, Jiehua; Ren, Yuping; Liu, Shengxuan; Prusak, Deborah J; Wood, Thomas G; Bao, Xiaoyong

    2016-12-01

    Human metapneumovirus (hMPV) is a major cause of lower respiratory infection in young children. Repeated infections occur throughout life, but its immune evasion mechanisms are largely unknown. We recently found that hMPV M2-2 protein elicits immune evasion by targeting mitochondrial antiviral-signaling protein (MAVS), an antiviral signaling molecule. However, the molecular mechanisms underlying such inhibition are not known. Our mutagenesis studies revealed that PDZ-binding motifs, 29-DEMI-32 and 39-KEALSDGI-46, located in an immune inhibitory region of M2-2, are responsible for M2-2-mediated immune evasion. We also found both motifs prevent TRAF5 and TRAF6, the MAVS downstream adaptors, to be recruited to MAVS, while the motif 39-KEALSDGI-46 also blocks TRAF3 migrating to MAVS. In parallel, these TRAFs are important in activating transcription factors NF-kB and/or IRF-3 by hMPV. Our findings collectively demonstrate that M2-2 uses its PDZ motifs to launch the hMPV immune evasion through blocking the interaction of MAVS and its downstream TRAFs. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. Finding the most significant common sequence and structure motifs in a set of RNA sequences

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Heyer, L.J.; Stormo, G.D.

    1997-01-01

    We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints, In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections, The first part utilizes a simplified...

  3. “The Birds of Clay”: An Apocryphal Motif in Folklore Legends

    Directory of Open Access Journals (Sweden)

    Olga V. Belova

    2015-08-01

    The fairly large group of folk legends with apocryphal motifs, occurring in different Slavic traditions from the 19th to the 21st centuries, thus testifies not only to the continued relevance of the biblical plots for oral culture, but also to the importance of the Apocrypha for the broadcasting and preservation of biblical stories in the folk tradition.

  4. The NTP-binding motif in cowpea mosaic virus B polyprotein is essential for viral replication

    NARCIS (Netherlands)

    Peters, S A; Verver, J; Nollen, E A; van Lent, J W; Wellink, J; van Kammen, A

    1994-01-01

    We have assessed the functional importance of the NTP-binding motif (NTBM) in the cowpea mosaic virus (CPMV) B-RNA-encoded 58K domain by changing two conserved amino acids within the consensus A and B sites (GKSRTGK500S and MDD545, respectively). Both Lys-500 to Thr and Asp-545 to Pro substitutions

  5. Dipeptide frequency/bias analysis identifies conserved sites of nonrandomness shared by cysteine-rich motifs.

    Science.gov (United States)

    Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N

    2001-08-15

    This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.

  6. Promoter motifs required for c-mpl gene expression induced by thrombopoietin in CMK cells.

    Science.gov (United States)

    Sunohara, Masataka; Sato, Iwao; Morikawa, Shigeru

    2017-11-30

    Thrombopoietin (TPO) and its receptor, c-Mpl, are the central regulators of megakaryocyte development and platelet production and are also crucial to regulate megakaryocytopoiesis. TPO remarkably elevated c-mpl promoter activity, while the protein kinase C (PKC) inhibitors, GF109203, H7 and Calphostin C, clearly reduced the steady level of its promoter activity.  In the present study, motifs crucial for c-mpl promoter activity induced by TPO treatment have been analyzed using a human megakaryoblastic cell line, CMK. Destruction of the -107Sp1 and the -57Sp1 sites in the c-mpl promoter enhancer region resulted in decrease of the promoter activity by 53.1% and 64.4%, respectively, and destruction of -69Ets and -28Ets elements dramatically decreased the promoter activity by 96.4% and 87.8%, respectively, while mutation of -77GATA moderately reduced the activity by 31.4%. The result was in agreement with our previous report that showed the crucial motifs in the c-mpl promoter for the promoter activity induced by PMA-treatment. This indicates that TPO-induced activation of the c-mpl promoter activity is fully modulated by transcription through a PKC-dependent pathway and the two Sp1 and two Ets motifs are crucial for the activation of the c-mpl promoter activity rather than a GATA motif in the c-mpl promoter of CMK cells.

  7. The WSXWS motif in cytokine receptors is a molecular switch involved in receptor activation

    DEFF Research Database (Denmark)

    Dagil, Robert; Knudsen, Maiken J.; Olsen, Johan Gotthardt

    2012-01-01

    The prolactin receptor (PRLR) is activated by binding of prolactin in a 2:1 complex, but the activation mechanism is poorly understood. PRLR has a conserved WSXWS motif generic to cytokine class I receptors. We have determined the nuclear magnetic resonance solution structure of the membrane...

  8. Monitoring lysin motif-ligand interactions via tryptophan analog fluorescence spectroscopy

    NARCIS (Netherlands)

    Petrovic, Dejan M.; Leenhouts, Kees; van Roosmalen, Maarten L.; KleinJan, Fenneke; Broos, Jaap

    2012-01-01

    The lysin motif (LysM) is a peptidoglycan binding protein domain found in a wide range of prokaryotes and eukaryotes. Various techniques have been used to study the LysM-ligand interaction, but a sensitive spectroscopic method to directly monitor this interaction has not been reported. Here a

  9. The nature of hydrogen bonding in R-2(2)(8) crystal motifs - a computational exploration

    Czech Academy of Sciences Publication Activity Database

    Deepa, Palanisamy; Solomon, R. V.; Vedha, S. A.; Kolandaivel, P.; Venuvanalingam, P.

    2014-01-01

    Roč. 112, č. 24 (2014), s. 3195-3205 ISSN 0026-8976 Institutional support: RVO:61388963 Keywords : NCI plot * hydrogen bonds * R-2(2)(8) motif * organic crystals * NBO * QTAIM analysis Subject RIV: CF - Physical ; Theoretical Chemistry Impact factor: 1.720, year: 2014

  10. Modulation of i-motif thermodynamic stability by the introduction of UNA (unlocked nucleic acid) monomers

    DEFF Research Database (Denmark)

    Pasternak, Anna; Wengel, Jesper

    2011-01-01

    The influence of acyclic RNA derivatives, UNA (unlocked nucleic acid) monomers, on i-DNA thermodynamic stability has been investigated. The 22 nt human telomeric fragment was chosen as the model sequence for stability studies. UNA monomers modulate i-motif stability in a position-depending manner...

  11. A single thiazole orange molecule forms an exciplex in a DNA i-motif.

    Science.gov (United States)

    Xu, Baochang; Wu, Xiangyang; Yeow, Edwin K L; Shao, Fangwei

    2014-06-18

    A fluorescent exciplex of thiazole orange (TO) is formed in a single-dye conjugated DNA i-motif. The exciplex fluorescence exhibits a large Stokes shift, high quantum yield, robust response to pH oscillation and little structural disturbance to the DNA quadruplex, which can be used to monitor the folding of high-order DNA structures.

  12. Cave acoustics in prehistory: Exploring the association of Palaeolithic visual motifs and acoustic response.

    Science.gov (United States)

    Fazenda, Bruno; Scarre, Chris; Till, Rupert; Pasalodos, Raquel Jiménez; Guerra, Manuel Rojo; Tejedor, Cristina; Peredo, Roberto Ontañón; Watson, Aaron; Wyatt, Simon; Benito, Carlos García; Drinkall, Helen; Foulds, Frederick

    2017-09-01

    During the 1980 s, acoustic studies of Upper Palaeolithic imagery in French caves-using the technology then available-suggested a relationship between acoustic response and the location of visual motifs. This paper presents an investigation, using modern acoustic measurement techniques, into such relationships within the caves of La Garma, Las Chimeneas, La Pasiega, El Castillo, and Tito Bustillo in Northern Spain. It addresses methodological issues concerning acoustic measurement at enclosed archaeological sites and outlines a general framework for extraction of acoustic features that may be used to support archaeological hypotheses. The analysis explores possible associations between the position of visual motifs (which may be up to 40 000 yrs old) and localized acoustic responses. Results suggest that motifs, in general, and lines and dots, in particular, are statistically more likely to be found in places where reverberation is moderate and where the low frequency acoustic response has evidence of resonant behavior. The work presented suggests that an association of the location of Palaeolithic motifs with acoustic features is a statistically weak but tenable hypothesis, and that an appreciation of sound could have influenced behavior among Palaeolithic societies of this region.

  13. Emergence of Nonlinear Optical Activity by Incorporation of a Linker Carrying the p-Nitroaniline Motif in MIL-53 Frameworks

    NARCIS (Netherlands)

    Markey, Karen; Krüger, Martin; Seidler, Tomasz; Reinsch, Helge; Verbiest, Thierry; De Vos, Dirk E.; Champagne, Benoît; Stock, Norbert; van der Veen, M.A.

    2017-01-01

    p-Nitroaniline presents the typical motif of a second-order nonlinear optically (NLO) active molecule. However, because of its crystallization in an antiparallel and hence centrosymmetric structure, the NLO activity is lost. In this contribution, the p-nitroaniline motif was built successfully

  14. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

    Science.gov (United States)

    Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

    2011-07-01

    The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.

  15. Evidence for the additions of clustered interacting nodes during the evolution of protein interaction networks from network motifs

    Directory of Open Access Journals (Sweden)

    Guo Hao

    2011-05-01

    Full Text Available Abstract Background High-throughput screens have revealed large-scale protein interaction networks defining most cellular functions. How the proteins were added to the protein interaction network during its growth is a basic and important issue. Network motifs represent the simplest building blocks of cellular machines and are of biological significance. Results Here we study the evolution of protein interaction networks from the perspective of network motifs. We find that in current protein interaction networks, proteins of the same age class tend to form motifs and such co-origins of motif constituents are affected by their topologies and biological functions. Further, we find that the proteins within motifs whose constituents are of the same age class tend to be densely interconnected, co-evolve and share the same biological functions, and these motifs tend to be within protein complexes. Conclusions Our findings provide novel evidence for the hypothesis of the additions of clustered interacting nodes and point out network motifs, especially the motifs with the dense topology and specific function may play important roles during this process. Our results suggest functional constraints may be the underlying driving force for such additions of clustered interacting nodes.

  16. Insights into the evolution and diversification of the AT-hook Motif Nuclear Localized gene family in land plants.

    Science.gov (United States)

    Zhao, Jianfei; Favero, David S; Qiu, Jiwen; Roalson, Eric H; Neff, Michael M

    2014-10-14

    Members of the ancient land-plant-specific transcription factor AT-Hook Motif Nuclear Localized (AHL) gene family regulate various biological processes. However, the relationships among the AHL genes, as well as their evolutionary history, still remain unexplored. We analyzed over 500 AHL genes from 19 land plant species, ranging from the early diverging Physcomitrella patens and Selaginella to a variety of monocot and dicot flowering plants. We classified the AHL proteins into three types (Type-I/-II/-III) based on the number and composition of their functional domains, the AT-hook motif(s) and PPC domain. We further inferred their phylogenies via Bayesian inference analysis and predicted gene gain/loss events throughout their diversification. Our analyses suggested that the AHL gene family emerged in embryophytes and further evolved into two distinct clades, with Type-I AHLs forming one clade (Clade-A), and the other two types together diversifying in another (Clade-B). The two AHL clades likely diverged before the separation of Physcomitrella patens from the vascular plant lineage. In angiosperms, Clade-A AHLs expanded into 5 subfamilies; while, the ones in Clade-B expanded into 4 subfamilies. Examination of their expression patterns suggests that the AHLs within each clade share similar expression patterns with each other; however, AHLs in one monophyletic clade exhibit distinct expression patterns from the ones in the other clade. Over-expression of a Glycine max AHL PPC domain in Arabidopsis thaliana recapitulates the phenotype observed when over-expressing its Arabidopsis thaliana counterpart. This result suggests that the AHL genes from different land plant species may share conserved functions in regulating plant growth and development. Our study further suggests that such functional conservation may be due to conserved physical interactions among the PPC domains of AHL proteins. Our analyses reveal a possible evolutionary scenario for the AHL gene family

  17. LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

    Science.gov (United States)

    Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

    2014-02-17

    As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of

  18. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

    Science.gov (United States)

    Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide) represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred since neighbor alleles are more easily separated and identified from each other, which render the interpretation of electropherograms and the true alleles more reliable. In the present work, with the purpose of characterizing a set of core SSR markers with long-core motifs for well fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially-chosen long-core motif SSR markers covering all the 15 linkage groups of tea plant genome. A set of 6 SSR markers were conclusively selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10(-5), which was quite low, confirmed the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, for the sake of quickly discriminating the clonal tea cultivars, a cultivar identification diagram (CID) was subsequently established using these core markers, which fully reflected the identification process and provided the immediate information about which SSR markers were needed to identify a cultivar chosen among the tested ones. The results suggested that long-core motif SSR markers used in the investigation contributed to the accurate and efficient identification of the clonal tea cultivars and enabled the protection of intellectual property.

  19. Photonic crystal borax competitive binding carbohydrate sensing motif.

    Science.gov (United States)

    Cui, Qingzhou; Ward Muscatello, Michelle M; Asher, Sanford A

    2009-05-01

    We developed a photonic crystal sensing method for diol containing species such as carbohydrates based on a poly(vinyl alcohol) (PVA) hydrogel containing an embedded crystalline colloidal array (CCA). The polymerized CCA (PCCA) diffracts visible light. We show that in the presence of borax the diffraction wavelength shifts as the concentration of glucose changes. The diffraction shifts result from the competitive binding of glucose to borate, which reduces the concentration of borate bound to the PVA diols.

  20. Chemical Transformation Motifs --- Modelling Pathways as Integer Hyperflows

    DEFF Research Database (Denmark)

    Andersen, Jakob L.; Flamm, Christoph; Merkle, Daniel

    2018-01-01

    analysis are discussed in detail. To demonstrate the applicability of the mathematical framework to real-life problems we first explore the design space of possible non-oxidative glycolysis pathways and show that recent manually designed pathways can be further optimised. We then use a model of sugar...... chemistry to investigate pathways in the autocatalytic formose process. A graph transformation-based approach is used to automatically generate the reaction networks of interest....

  1. qPMS7: a fast algorithm for finding (ℓ, d-motifs in DNA and protein sequences.

    Directory of Open Access Journals (Sweden)

    Hieu Dinh

    Full Text Available Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motifs discovery is an important problem in biology. For example, it is useful in the detection of transcription factor binding sites and transcriptional regulatory elements that are very crucial in understanding gene function, human disease, drug design, etc. Many versions of the motif search problem have been proposed in the literature. One such is the (ℓ, d-motif search (or Planted Motif Search (PMS. A generalized version of the PMS problem, namely, Quorum Planted Motif Search (qPMS, is shown to accurately model motifs in real data. However, solving the qPMS problem is an extremely difficult task because a special case of it, the PMS Problem, is already NP-hard, which means that any algorithm solving it can be expected to take exponential time in the worse case scenario. In this paper, we propose a novel algorithm named qPMS7 that tackles the qPMS problem on real data as well as challenging instances. Experimental results show that our Algorithm qPMS7 is on an average 5 times faster than the state-of-art algorithm. The executable program of Algorithm qPMS7 is freely available on the web at http://pms.engr.uconn.edu/downloads/qPMS7.zip. Our online motif discovery tools that use Algorithm qPMS7 are freely available at http://pms.engr.uconn.edu or http://motifsearch.com.

  2. Optimizations of siRNA design for the activation of gene transcription by targeting the TATA-box motif.

    Directory of Open Access Journals (Sweden)

    Miaomiao Fan

    Full Text Available Small interfering RNAs (siRNAs are widely used to repress gene expression by targeting mRNAs. Some reports reveal that siRNAs can also activate or inhibit gene expression through targeting the gene promoters. Our group has found that microRNAs (miRNAs could activate gene transcription via interaction with the TATA-box motif in gene promoters. To investigate whether siRNA targeting the same region could upregulate the promoter activity, we test the activating efficiency of siRNAs targeting the TATA-box motif of 16 genes and perform a systematic analysis to identify the common features of the functional siRNAs for effective activation of gene promoters. Further, we try various modifications to improve the activating efficiency of siRNAs and find that it is quite useful to design the promoter-targeting activating siRNA by following several rules such as (a complementary to the TATA-box-centered region; (b UA usage at the first two bases of the antisense strand; (c twenty-three nucleotides (nts in length; (d 2'-O-Methyl (2'-OMe modification at the 3' terminus of the antisense strand; (e avoiding mismatches at the 3' end of the antisense strand. The optimized activating siRNAs potently enhance the expression of interleukin-2 (IL-2 gene in human and mouse primary CD4+ T cells with a long-time effect. Taken together, our study provides a guideline for rational design the promoter-targeting siRNA to sequence-specifically enhance gene expression.

  3. Using hexamers to predict cis-regulatory motifs in Drosophila

    Directory of Open Access Journals (Sweden)

    Kibler Dennis

    2005-10-01

    Full Text Available Abstract Background Cis-regulatory modules (CRMs are short stretches of DNA that help regulate gene expression in higher eukaryotes. They have been found up to 1 megabase away from the genes they regulate and can be located upstream, downstream, and even within their target genes. Due to the difficulty of finding CRMs using biological and computational techniques, even well-studied regulatory systems may contain CRMs that have not yet been discovered. Results We present a simple, efficient method (HexDiff based only on hexamer frequencies of known CRMs and non-CRM sequence to predict novel CRMs in regulatory systems. On a data set of 16 gap and pair-rule genes containing 52 known CRMs, predictions made by HexDiff had a higher correlation with the known CRMs than several existing CRM prediction algorithms: Ahab, Cluster Buster, MSCAN, MCAST, and LWF. After combining the results of the different algorithms, 10 putative CRMs were identified and are strong candidates for future study. The hexamers used by HexDiff to distinguish between CRMs and non-CRM sequence were also analyzed and were shown to be enriched in regulatory elements. Conclusion HexDiff provides an efficient and effective means for finding new CRMs based on known CRMs, rather than known binding sites.

  4. Radiation and desiccation response motif mediates radiation induced gene expression in D. radiodurans

    International Nuclear Information System (INIS)

    Anaganti, Narasimha; Basu, Bhakti; Apte, Shree Kumar

    2015-01-01

    Deinococcus radiodurans is an extremophile that withstands lethal doses of several DNA damaging agents such as gamma irradiation, UV rays, desiccation and chemical mutagens. The organism responds to DNA damage by inducing expression of several DNA repair genes. At least 25 radiation inducible gene promoters harbour a 17 bp palindromic sequence known as radiation and desiccation response motif (RDRM) implicated in gamma radiation inducible gene expression. However, mechanistic details of gamma radiation-responsive up-regulation in gene expression remain enigmatic. The promoters of highly radiation induced genes ddrB (DR0070), gyrB (DR0906), gyrA (DR1913), a hypothetical gene (DR1143) and recA (DR2338) from D. radiodurans were cloned in a green fluorescence protein (GFP)-based promoter probe shuttle vector pKG and their promoter activity was assessed in both E. coli as well as in D. radiodurans. The gyrA, gyrB and DR1143 gene promoters were active in E. coli although ddrB and recA promoters showed very weak activity. In D. radiodurans, all the five promoters were induced several fold following 6 kGy gamma irradiation. Highest induction was observed for ddrB promoter (25 fold), followed by DR1143 promoter (15 fold). The induction in the activity of gyrB, gyrA and recA promoters was 5, 3 and 2 fold, respectively. To assess the role of RDRM, the 17 bp palindromic sequence was deleted from these promoters. The promoters devoid of RDRM sequence displayed increase in the basal expression activity, but the radiation-responsive induction in promoter activity was completely lost. The substitution of two conserved bases of RDRM sequence yielded decreased radiation induction of PDR0070 promoter. Deletion of 5 bases from 5'-end of PDR0070 RDRM increased basal promoter activity, but radiation induction was completely abolished. Replacement of RDRM with non specific sequence of PDR0070 resulted in loss of basal expression and radiation induction. The results demonstrate that

  5. Linker-mediated assembly of gold nanoparticles into multimeric motifs

    Energy Technology Data Exchange (ETDEWEB)

    Sikora, Mateusz; Cieplak, Marek [Institute of Physics, Polish Academy of Sciences, Aleja Lotnikow 32/46, 02-668 Warsaw (Poland); Szymczak, Piotr [Institute of Theoretical Physics, Faculty of Physics, University of Warsaw, ulica Hoza 69, 00-681 Warsaw (Poland); Thompson, Damien, E-mail: mc@ifpan.edu.pl [Tyndall National Institute, Lee Maltings, University College Cork, Cork (Ireland)

    2011-11-04

    We present a theoretical description of linker-mediated self-assembly of gold nanoparticles (Au-NP). Using mesoscale simulations with a coarse-grained model for the Au NPs and dirhenium-based linker molecules, we investigate the conditions under which large clusters can grow and construct a phase diagram that identifies favorable growth conditions in terms of floating and bound linker concentrations. The findings can be considered as generic, as we expect other NP-linker systems to behave in a qualitatively similar way. In particular, we also discuss the case of antibody-functionalised Au NPs connected by the C-reactive proteins (CRPs). We extract some general rules for NP linking that may aid the production of size- and shape-specific NP clusters for technology applications.

  6. Protein–Phospholipid Interaction Motifs: A Focus on Phosphatidic Acid

    Directory of Open Access Journals (Sweden)

    Emeline Tanguy

    2018-04-01

    Full Text Available Cellular membranes are composed of thousands of different lipids usually maintained within a narrow range of concentrations. In addition to their well-known structural and metabolic roles, signaling functions for many lipids have also emerged over the last two decades. The latter largely depend on the ability of particular classes of lipids to interact specifically with a great variety of proteins and to regulate their localization and activity. Among these lipids, phosphatidic acid (PA plays a unique role in a large repertoire of cellular activities, most likely in relation to its unique biophysical properties. However, until recently, only incomplete information was available to model the interaction between PA and its protein partners. The development of new liposome-based assays as well as molecular dynamic simulation are now providing novel information. We will review the different factors that have shown to modulate the capacity of PA to interact with specific domains in target proteins.

  7. PERANCANGAN FONT TAPYCH BERBASIS KARAKTER VISUAL MOTIF TAPIS LAMPUNG

    Directory of Open Access Journals (Sweden)

    Wantoro Wantoro

    2017-02-01

    Full Text Available Abstrak Tapis merupakan kain khas asal Lampung yang terbuat dari tenunan benang kapas berwarna emas dan perak. Sebagai produk budaya, tapis memiliki visualisasi yang khas. Berdasarkan observasi, pemanfaatan tapis yang memiliki representasi nilai dan nilai estetika tinggi masih terbatas pada media dan waktu tertentu. Hal ini tentu saja menimbulkan kekhawatiran tentang pelestarian dan pengembangan tapis karena keterbatasan pemanfaatannya. Contoh pemanfaatan tapis yang dapat dilakukan adalah upaya eksplorasi visual kedalam bentuk produk budaya lain yang lekat dengan keseharian masyarakat. Salah satu produk budaya yang dapat dirancang adalah huruf. Huruf merupakan media komunikasi tulisan yang sebenarnya telah lama ada dan berkembang. Di Indonesia, komunikasi tulisan mayoritas  menggunakan huruf latin. Penggunaan huruf latin di Indonesia dapat dilihat mulai dari cara manual hingga sudah dilakukan secara digital. Huruf digital atau disebut dengan font saat ini banyak digunakan pada perangkat-perangkat digital/elektronik. Perkembangan desain font juga terus berkembang karena kebutuhan akan penyampaian pesan maupun kesan estetis pada suatu karya juga berkembang. Hal inilah yang kemudian menginspirasi lahirnya kemungkinan-kemungkinan bentuk visual baru font khususnya yang berbasis huruf Latin. Tapis, dengan kekayaan estetik pada visualnya dapat dijadikan sebagai inspirasi dalam menciptakan kemungkinan bentuk baru dari sebuah font yang berbasis pada bentuk huruf latin. Perpaduan visual antara tapis dan karakter huruf latin dapat menghasilkan karakteristik yang khas. Kata Kunci: Adaptasi, Font, Perancangan, Tapis, Visual Abstract Tapis from Lampung is a unique fabric made from cotton yarn woven gold and silver. As cultural products, Tapis have a typical visualization. Based on observations, the use of Tapis which has a value of representation and a high aesthetic value is still limited to the media and a certain time. This raises concerns about the

  8. Assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data.

    Directory of Open Access Journals (Sweden)

    Yaron Orenstein

    Full Text Available The new technology of protein binding microarrays (PBMs allows simultaneous measurement of the binding intensities of a transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key computational challenge is inferring the binding motif from these data. We present a systematic comparison of four methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.

  9. A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities.

    Directory of Open Access Journals (Sweden)

    Marta Martínez-Bonet

    Full Text Available To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121-137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection.

  10. Motif Naga pada Hinggi Sumba Timur Sebuah Metamorfosa Estetik

    Directory of Open Access Journals (Sweden)

    Biranul Anas

    2007-03-01

    Full Text Available Hinggi, a traditional ikat cloth, is one of East Sumba’s foremost material culture. As a component of the East Sumbanese traditional man’s dress, hinggi is visually impressive. Its patterns display attractive motives, decorative and symbolic, that depict religious conviction, power, wealth, status, honor and gender prestige, based on local and foreign sources that comply to traditional standards of quality. Intrinsically it indicates hinggi’s central role in social and ritual context, making it a highly prized item among the East Sumbanese, a matter that has been taken place since ancient days, and the international audience as well. Throughout its long history, the making of the hinggi came in contact with outside powers i.e. through trade, colonialism, education, and industrialization, including today’s international tourism. From a cloth that functioned to serve the internal culture of its producers, hinggi became a commodity for outsiders that consequently affected its aesthetical aspects. Hinggi’s patterning went through changes towards new forms adjusting to external demand, especially for the past 30 years, resulting in salient differences between hinggis made for sale and those made for internal use. Significant visual changes are that of the introduction of secular designs depicting local cultural themes such as ceremonies and mock battles, conveyed in realistic expressions with  smooth lines in a coinciding asymmetric configuration within a narrative set up, replacing traditional spiritual iconography which is relatively stylistic, abstract and stiff, in a diametrical symmetric setting with no expository relation. New hinggi designs principally sprung up from three main themes i.e. the papanggang, a spectacular burial ceremony carried out for personalities of high social status; the pasola, a ceremonial (in the past real battle between opposing parties; and the palai ngandi, a folk tale about the groom’s kidnapping of

  11. Amphipathic motifs in BAR domains are essential for membrane curvature sensing

    DEFF Research Database (Denmark)

    Bhatia, Vikram K; Madsen, Kenneth L; Bolinger, Pierre-Yves

    2009-01-01

    BAR (Bin/Amphiphysin/Rvs) domains and amphipathic alpha-helices (AHs) are believed to be sensors of membrane curvature thus facilitating the assembly of protein complexes on curved membranes. Here, we used quantitative fluorescence microscopy to compare the binding of both motifs on single...... nanosized liposomes of different diameters and therefore membrane curvature. Characterization of members of the three BAR domain families showed surprisingly that the crescent-shaped BAR dimer with its positively charged concave face is not able to sense membrane curvature. Mutagenesis on BAR domains showed...... that membrane curvature sensing critically depends on the N-terminal AH and furthermore that BAR domains sense membrane curvature through hydrophobic insertion in lipid packing defects and not through electrostatics. Consequently, amphipathic motifs, such as AHs, that are often associated with BAR domains...

  12. Super-transient scaling in time-delay autonomous Boolean network motifs

    Energy Technology Data Exchange (ETDEWEB)

    D' Huys, Otti, E-mail: otti.dhuys@phy.duke.edu; Haynes, Nicholas D. [Department of Physics, Duke University, Durham, North Carolina 27708 (United States); Lohmann, Johannes [Department of Physics, Duke University, Durham, North Carolina 27708 (United States); Institut für Theoretische Physik, Technische Universität Berlin, Hardenbergstraße 36, 10623 Berlin (Germany); Gauthier, Daniel J. [Department of Physics, Duke University, Durham, North Carolina 27708 (United States); Department of Physics, The Ohio State University, Columbus, Ohio 43210 (United States)

    2016-09-15

    Autonomous Boolean networks are commonly used to model the dynamics of gene regulatory networks and allow for the prediction of stable dynamical attractors. However, most models do not account for time delays along the network links and noise, which are crucial features of real biological systems. Concentrating on two paradigmatic motifs, the toggle switch and the repressilator, we develop an experimental testbed that explicitly includes both inter-node time delays and noise using digital logic elements on field-programmable gate arrays. We observe transients that last millions to billions of characteristic time scales and scale exponentially with the amount of time delays between nodes, a phenomenon known as super-transient scaling. We develop a hybrid model that includes time delays along network links and allows for stochastic variation in the delays. Using this model, we explain the observed super-transient scaling of both motifs and recreate the experimentally measured transient distributions.

  13. Homeric Motifs in Cavafy’s Poem »Priam’s Night Journey«

    Directory of Open Access Journals (Sweden)

    Dragica Fabjan Andritsakos

    2016-12-01

    Full Text Available The paper discusses Homeric motifs in ‘Priam’s Night Journey’, a poem by the contemporary Greek poet Constantine P. Cavafy and, more precisely, one of the ten poems composed by Cavafy on mythological themes. The discussion begins by comparing Cavafy’s treatment of a motif from the Iliad, Canto 24 – Priam’s journey to Achilles – with its ancient counterpart. The question of Cavafy’s sources is addressed as well: does the poet draw on the Ancient Greek original or on the Modern Greek translation? The second part of the article analyses in detail those passages which closely lean on the ancient epic, and concludes by illustrating Cavafy’s departure from the myth, which emerges most radically in the close of the poem.

  14. Spectral Barcoding of Quantum Dots: Deciphering Structural Motifs from the Excitonic Spectra

    International Nuclear Information System (INIS)

    Mlinar, V.; Zunger, A.

    2009-01-01

    Self-assembled semiconductor quantum dots (QDs) show in high-resolution single-dot spectra a multitude of sharp lines, resembling a barcode, due to various neutral and charged exciton complexes. Here we propose the 'spectral barcoding' method that deciphers structural motifs of dots by using such barcode as input to an artificial-intelligence learning system. Thus, we invert the common practice of deducing spectra from structure by deducing structure from spectra. This approach (i) lays the foundation for building a much needed structure-spectra understanding for large nanostructures and (ii) can guide future design of desired optical features of QDs by controlling during growth only those structural motifs that decide given optical features.

  15. Characterization of a Smad motif similar to Drosophila mad in the mouse Msx 1 promoter.

    Science.gov (United States)

    Alvarez Martinez, Cristina E; Binato, Renata; Gonzalez, Sayonara; Pereira, Monica; Robert, Benoit; Abdelhay, Eliana

    2002-03-01

    Mouse Msx 1 gene, orthologous of the Drosophila msh, is involved in several developmental processes. BMP family members are major proteins in the regulation of Msx 1 expression. BMP signaling activates Smad 1/5/8 proteins, which associate to Smad 4 before translocating to the nucleus. Analysis of Msx 1 promoter revealed the presence of three elements similar to the consensus established for Mad, the Smad 1 Drosophila counterpart. Notably, such an element was identified in an enhancer important for Msx 1 regulation. Gel shift analysis demonstrated that proteins from 13.5 dpc embryo associate to this enhancer. Remarkably, supershift assays showed that Smad proteins are present in the complex. Purified Smad 1 and 4 also bind to this fragment. We demonstrate that functional binding sites in this enhancer are confined to the Mad motif and flanking region. Our data suggest that this Mad motif may be functional in response to BMP signaling. ©2002 Elsevier Science (USA).

  16. Pyrene functionalized molecular beacon with pH-sensitive i-motif in a loop.

    Science.gov (United States)

    Dembska, Anna; Juskowiak, Bernard

    2015-01-01

    In this work, we present a spectral characterization of pH-sensitive system, which combines the i-motif properties with the spatially sensitive fluorescence signal of pyrene molecules attached to hairpin ends. The excimer production (fluorescence max. ∼480 nm) by pyrene labels at the ends of the molecular beacon is driven by pH-dependent i-motif formation in the loop. To illustrate the performance and reversible work of our systems, we performed the experiments with repeatedly pH cycling between pH values of 7.5±0.3 and 6.5±0.3. The sensor gives analytical response in excimer-monomer switching mode in narrow pH range (1.5 pH units) and exhibits high pH resolution (0.1 pH unit). Copyright © 2015 Elsevier B.V. All rights reserved.

  17. An essential GT motif in the lamin A promoter mediates activation by CREB-binding protein

    International Nuclear Information System (INIS)

    Janaki Ramaiah, M.; Parnaik, Veena K.

    2006-01-01

    Lamin A is an important component of nuclear architecture in mammalian cells. Mutations in the human lamin A gene lead to highly degenerative disorders that affect specific tissues. In studies directed towards understanding the mode of regulation of the lamin A promoter, we have identified an essential GT motif at -55 position by reporter gene assays and mutational analysis. Binding of this sequence to Sp transcription factors has been observed in electrophoretic mobility shift assays and by chromatin immunoprecipitation studies. Further functional analysis by co-expression of recombinant proteins and ChIP assays has shown an important regulatory role for CREB-binding protein in promoter activation, which is mediated by the GT motif

  18. A Common Structural Motif in the Binding of Virulence Factors to Bacterial Secretion Chaperones

    International Nuclear Information System (INIS)

    Lilic, M.; Vujanac, M.; Stebbins, C.

    2006-01-01

    Salmonella invasion protein A (SipA) is translocated into host cells by a type III secretion system (T3SS) and comprises two regions: one domain binds its cognate type III secretion chaperone, InvB, in the bacterium to facilitate translocation, while a second domain functions in the host cell, contributing to bacterial uptake by polymerizing actin. We present here the crystal structures of the SipA chaperone binding domain (CBD) alone and in complex with InvB. The SipA CBD is found to consist of a nonglobular polypeptide as well as a large globular domain, both of which are necessary for binding to InvB. We also identify a structural motif that may direct virulence factors to their cognate chaperones in a diverse range of pathogenic bacteria. Disruption of this structural motif leads to a destabilization of several chaperone-substrate complexes from different species, as well as an impairment of secretion in Salmonella

  19. Requirement for asparagine in the aquaporin NPA sequence signature motifs for cation exclusion

    DEFF Research Database (Denmark)

    Wree, Dorothea; Wu, Binghua; Zeuthen, Thomas

    2011-01-01

    Two highly conserved NPA motifs are a hallmark of the aquaporin (AQP) family. The NPA triplets form N-terminal helix capping structures with the Asn side chains located in the centre of the water or solute-conducting channel, and are considered to play an important role in AQP selectivity. Although...... interchangeable at both NPA sites without affecting protein expression or water, glycerol and methylamine permeability. However, other mutations in the NPA region led to reduced permeability (S186C and S186D), to nonfunctional channels (N64D), or even to lack of protein expression (S186A and S186T). Using...... electrophysiology, we found that an analogous mammalian AQP1 N76S mutant excluded protons and potassium ions, but leaked sodium ions, providing an argument for the overwhelming prevalence of Asn over other amino acids. We conclude that, at the first position in the NPA motifs, only Asn provides efficient helix cap...

  20. The nitrogen responsive transcriptome in potato (Solanum tuberosum L.) reveals significant gene regulatory motifs.

    Science.gov (United States)

    Gálvez, José Héctor; Tai, Helen H; Lagüe, Martin; Zebarth, Bernie J; Strömvik, Martina V

    2016-05-19

    Nitrogen (N) is the most important nutrient for the growth of potato (Solanum tuberosum L.). Foliar gene expression in potato plants with and without N supplementation at 180 kg N ha(-1) was compared at mid-season. Genes with consistent differences in foliar expression due to N supplementation over three cultivars and two developmental time points were examined. In total, thirty genes were found to be over-expressed and nine genes were found to be under-expressed with supplemented N. Functional relationships between over-expressed genes were found. The main metabolic pathway represented among differentially expressed genes was amino acid metabolism. The 1000 bp upstream flanking regions of the differentially expressed genes were analysed and nine overrepresented motifs were found using three motif discovery algorithms (Seeder, Weeder and MEME). These results point to coordinated gene regulation at the transcriptional level controlling steady state potato responses to N sufficiency.

  1. A novel fibronectin binding motif in MSCRAMMs targets F3 modules.

    Directory of Open Access Journals (Sweden)

    Sabitha Prabhakaran

    Full Text Available BBK32 is a surface expressed lipoprotein and fibronectin (Fn-binding microbial surface component recognizing adhesive matrix molecule (MSCRAMM of Borrelia burgdorferi, the causative agent of Lyme disease. Previous studies from our group showed that BBK32 is a virulence factor in experimental Lyme disease and located the Fn-binding region to residues 21-205 of the lipoprotein.Studies aimed at identifying interacting sites between BBK32 and Fn revealed an interaction between the MSCRAMM and the Fn F3 modules. Further analysis of this interaction showed that BBK32 can cause the aggregation of human plasma Fn in a similar concentration-dependent manner to that of anastellin, the superfibronectin (sFn inducing agent. The resulting Fn aggregates are conformationally distinct from plasma Fn as indicated by a change in available thermolysin cleavage sites. Recombinant BBK32 and anastellin affect the structure of Fn matrices formed by cultured fibroblasts and inhibit endothelial cell proliferation similarly. Within BBK32, we have located the sFn-forming activity to a region between residues 160 and 175 which contains two sequence motifs that are also found in anastellin. Synthetic peptides mimicking these motifs induce Fn aggregation, whereas a peptide with a scrambled sequence motif was inactive, suggesting that these motifs represent the sFn-inducing sequence.We conclude that BBK32 induces the formation of Fn aggregates that are indistinguishable from those formed by anastellin. The results of this study provide evidence for how bacteria can target host proteins to manipulate host cell activities.

  2. SiteBinder: an improved approach for comparing multiple protein structural motifs.

    Science.gov (United States)

    Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

    2012-02-27

    There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.

  3. Directed connectivity of brain default networks in resting state using GCA and motif.

    Science.gov (United States)

    Jiao, Zhuqing; Wang, Huan; Ma, Kai; Zou, Ling; Xiang, Jianbo

    2017-06-01

    Nowadays, there is a lot of interest in assessing functional interactions between key brain regions. In this paper, Granger causality analysis (GCA) and motif structure are adopted to study directed connectivity of brain default mode networks (DMNs) in resting state. Firstly, the time series of functional magnetic resonance imaging (fMRI) data in resting state were extracted, and the causal relationship values of the nodes representing related brain regions are analyzed in time domain to construct a default network. Then, the network structures were searched from the default networks of controls and patients to determine the fixed connection mode in the networks. The important degree of motif structures in directed connectivity of default networks was judged according to p-value and Z-score. Both node degree and average distance were used to analyze the effect degree an information transfer rate of brain regions in motifs and default networks, and efficiency of the network. Finally, activity and functional connectivity strength of the default brain regions are researched according to the change of energy distributions between the normals and the patients' brain regions. Experimental results demonstrate that, both normal subjects and stroke patients have some corresponding fixed connection mode of three nodes, and the efficiency and power spectrum of the patient's default network is somewhat lower than that of the normal person. In particular, the Right Posterior Cingulate Gyrus (PCG.R) has a larger change in functional connectivity and its activity. The research results verify the feasibility of the application of GCA and motif structure to study the functional connectivity of default networks in resting state.

  4. Identification and characterization of a selenoprotein family containing a diselenide bond in a redox motif

    OpenAIRE

    Shchedrina, Valentina A.; Novoselov, Sergey V.; Malinouski, Mikalai Yu.; Gladyshev, Vadim N.

    2007-01-01

    Selenocysteine (Sec, U) insertion into proteins is directed by translational recoding of specific UGA codons located upstream of a stem-loop structure known as Sec insertion sequence (SECIS) element. Selenoproteins with known functions are oxidoreductases containing a single redox-active Sec in their active sites. In this work, we identified a family of selenoproteins, designated SelL, containing two Sec separated by two other residues to form a UxxU motif. SelL proteins show an unusual occur...

  5. 'On Earth as it is in Heaven...' The heavenly sanctuary motif in ...

    African Journals Online (AJOL)

    ... archetype] by Philo) and σκια [shadow] – is studied within the context of Hebrews 8:1–5. The purpose of this investigation is to explore the possible Graeco-Jewish background(s) of the 'heavenly sanctuary' motif in Hebrews 8:5, the presence of its key terminology and some of its intertextual occurrences in, amongst others ...

  6. Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

    Science.gov (United States)

    Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

    2018-01-10

    Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from a cosmic database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing

  7. Feedback loops and reciprocal regulation: recurring motifs in the systems biology of the cell cycle

    OpenAIRE

    Ferrell, James E.

    2013-01-01

    The study of eukaryotic cell cycle regulation over the last several decades has led to a remarkably detailed understanding of the complex regulatory system that drives this fundamental process. This allows us to now look for recurring motifs in the regulatory system. Among these are negative feedback loops, which underpin checkpoints and generate cell cycle oscillations; positive feedback loops, which promote oscillations and make cell cycle transitions switch-like and unidirectional; and rec...

  8. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    International Nuclear Information System (INIS)

    Westberg, Johan A.; Jiang, Ji; Andersson, Leif C.

    2011-01-01

    Highlights: → Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. → Central iron atom of heme and cysteine-114 of STC1 are essential for binding. → STC1 binds Fe 2+ and Fe 3+ heme. → STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys 114 as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H 2 O 2 induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  9. Crystallization and preliminary X-ray diffraction analysis of motif N from Saccharomyces cerevisiae Dbf4

    International Nuclear Information System (INIS)

    Matthews, Lindsay A.; Duong, Andrew; Prasad, Ajai A.; Duncker, Bernard P.; Guarné, Alba

    2009-01-01

    To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. The Cdc7–Dbf4 complex plays an instrumental role in the initiation of DNA replication and is a target of replication-checkpoint responses in Saccharomyces cerevisiae. Cdc7 is a conserved serine/threonine kinase whose activity depends on association with its regulatory subunit, Dbf4. A conserved sequence near the N-terminus of Dbf4 (motif N) is necessary for the interaction of Cdc7–Dbf4 with the checkpoint kinase Rad53. To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. A complete native data set was collected at 100 K from crystals that diffracted X-rays to 2.75 Å resolution and structure determination is currently under way

  10. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    Energy Technology Data Exchange (ETDEWEB)

    Westberg, Johan A., E-mail: johan.westberg@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Jiang, Ji, E-mail: ji.jiang@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Andersson, Leif C., E-mail: leif.andersson@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland)

    2011-06-03

    Highlights: {yields} Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. {yields} Central iron atom of heme and cysteine-114 of STC1 are essential for binding. {yields} STC1 binds Fe{sup 2+} and Fe{sup 3+} heme. {yields} STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys{sup 114} as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H{sub 2}O{sub 2} induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  11. Double-hydrophobic elastin-like polypeptides with added functional motifs: Self-assembly and cytocompatibility.

    Science.gov (United States)

    Le, Duc H T; Tsutsui, Yoko; Sugawara-Narutaki, Ayae; Yukawa, Hiroshi; Baba, Yoshinobu; Ohtsuki, Chikara

    2017-09-01

    We have recently developed a novel double-hydrophobic elastin-like triblock polypeptide called GPG, designed after the uneven distribution of two different hydrophobic domains found in elastin, an extracellular matrix protein providing elasticity and resilience to tissues. Upon temperature trigger, GPG undergoes a sequential self-assembling process to form flexible beaded nanofibers with high homogeneity and excellent dispersibility in water. Given that GPG might be a potential elastin-mimetic material, we sought to explore the biological activities of this block polypeptide. Besides GPG, several functionalized derivatives were also constructed by fusing functional motifs such as KAAK or KAAKGRGDS at the C-terminal of GPG. Although the added motifs affected the kinetics of fiber formation and β-sheet contents, all three GPGs assembled into beaded nanofibers at the physiological temperature. The resulting GPG nanofibers preserved their beaded structures in cell culture medium; therefore, they were coated on polystyrene substrates to study their cytocompatibility toward mouse embryonic fibroblasts, NIH-3T3. Among the three polypeptides, GPG having the cell-binding motif GRGDS derived from fibronectin showed excellent cell adhesion and cell proliferation properties compared to other conventional materials, suggesting its promising applications as extracellular matrices for mammalian cells. © 2017 Wiley Periodicals, Inc. J Biomed Mater Res Part A: 105A: 2475-2484, 2017. © 2017 Wiley Periodicals, Inc.

  12. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

    2015-01-01

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  13. The LINKS motif zippers trans-acyltransferase polyketide synthase assembly lines into a biosynthetic megacomplex.

    Science.gov (United States)

    Gay, Darren C; Wagner, Drew T; Meinke, Jessica L; Zogzas, Charles E; Gay, Glen R; Keatinge-Clay, Adrian T

    2016-03-01

    Polyketides such as the clinically-valuable antibacterial agent mupirocin are constructed by architecturally-sophisticated assembly lines known as trans-acyltransferase polyketide synthases. Organelle-sized megacomplexes composed of several copies of trans-acyltransferase polyketide synthase assembly lines have been observed by others through transmission electron microscopy to be located at the Bacillus subtilis plasma membrane, where the synthesis and export of the antibacterial polyketide bacillaene takes place. In this work we analyze ten crystal structures of trans-acyltransferase polyketide synthases ketosynthase domains, seven of which are reported here for the first time, to characterize a motif capable of zippering assembly lines into a megacomplex. While each of the three-helix LINKS (Laterally-INteracting Ketosynthase Sequence) motifs is observed to similarly dock with a spatially-reversed copy of itself through hydrophobic and ionic interactions, the amino acid sequences of this motif are not conserved. Such a code is appropriate for mediating homotypic contacts between assembly lines to ensure the ordered self-assembly of a noncovalent, yet tightly-knit, enzymatic network. LINKS-mediated lateral interactions would also have the effect of bolstering the vertical association of the polypeptides that comprise a polyketide synthase assembly line. Copyright © 2015 Elsevier Inc. All rights reserved.

  14. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun

    2015-06-11

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  15. Dual hydrogen-bonding motifs in complexes formed between tropolone and formic acid

    Science.gov (United States)

    Nemchick, Deacon J.; Cohen, Michael K.; Vaccaro, Patrick H.

    2016-11-01

    The near-ultraviolet π*←π absorption system of weakly bound complexes formed between tropolone (TrOH) and formic acid (FA) under cryogenic free-jet expansion conditions has been interrogated by exploiting a variety of fluorescence-based laser-spectroscopic probes, with synergistic quantum-chemical calculations built upon diverse model chemistries being enlisted to unravel the structural and dynamical properties of the pertinent ground [X˜ 1A'] and excited [A˜ 1A'(" separators="π*π )] electronic states. For binary TrOH ṡ FA adducts, the presence of dual hydrogen-bond linkages gives rise to three low-lying isomers designated (in relative energy order) as INT, EXT1, and EXT2 depending on whether docking of the FA ligand to the TrOH substrate takes place internal or external to the five-membered reaction cleft of tropolone. While the symmetric double-minimum topography predicted for the INT potential surface mediates an intermolecular double proton-transfer event, the EXT1 and EXT2 structures are interconverted by an asymmetric single proton-transfer process that is TrOH-centric in nature. The A ˜ -X ˜ origin of TrOH ṡ FA at ν˜ 00=27 484 .45 cm-1 is displaced by δ ν˜ 00=+466 .76 cm-1 with respect to the analogous feature for bare tropolone and displays a hybrid type - a/b rotational contour that reflects the configuration of binding. A comprehensive analysis of vibrational landscapes supported by the optically connected X˜ 1A' and A˜ 1A'(" separators="π*π ) manifolds, including the characteristic isotopic shifts incurred by partial deuteration of the labile TrOH and FA protons, has been performed leading to the uniform assignment of numerous intermolecular (viz., modulating hydrogen-bond linkages) and intramolecular (viz., localized on monomer subunits) degrees of freedom. The holistic interpretation of all experimental and computational findings affords compelling evidence that an external-binding motif (attributed to EXT1), rather than the

  16. Characterization of the Canine MHC Class I DLA-88*50101 Peptide Binding Motif as a Prerequisite for Canine T Cell Immunotherapy.

    Directory of Open Access Journals (Sweden)

    Sharon M Barth

    Full Text Available There are limitations in pre-clinical settings using mice as a basis for clinical development in humans. In cancer, similarities exist between humans and dogs; thus, the dog patient can be a link in the transition from laboratory research on mouse models to clinical trials in humans. Knowledge of the peptides presented on MHC molecules is fundamental for the development of highly specific T cell-based immunotherapies. This information is available for human MHC molecules but is absent for the canine MHC. In the present study, we characterized the binding motif of dog leukocyte antigen (DLA class I allele DLA-88*50101, using human C1R and K562 transfected cells expressing the DLA-88*50101 heavy chain. MHC class I immunoaffinity-purification revealed 3720 DLA-88*50101 derived peptides, which enabled the determination of major anchor positions. The characterized binding motif of DLA-88*50101 was similar to HLA-A*02:01. Peptide binding analyses on HLA-A*02:01 and DLA-88*50101 via flow cytometry showed weak binding of DLA-88*50101 derived peptides to HLA-A*02:01, and vice versa. Our results present for the first time a detailed peptide binding motif of the canine MHC class I allelic product DLA-88*50101. These data support the goal of establishing dogs as a suitable animal model for the evaluation and development of T cell-based cancer immunotherapies, benefiting both dog and human patients.

  17. QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

    Science.gov (United States)

    Dhapola, Parashar; Chowdhury, Shantanu

    2016-01-01

    DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890

  18. A sialoreceptor binding motif in the Mycoplasma synoviae adhesin VlhA.

    Directory of Open Access Journals (Sweden)

    Meghan May

    Full Text Available Mycoplasma synoviae depends on its adhesin VlhA to mediate cytadherence to sialylated host cell receptors. Allelic variants of VlhA arise through recombination between an assemblage of promoterless vlhA pseudogenes and a single transcription promoter site, creating lineages of M. synoviae that each express a different vlhA allele. The predicted full-length VlhA sequences adjacent to the promoter of nine lineages of M. synoviae varying in avidity of cytadherence were aligned with that of the reference strain MS53 and with a 60-a.a. hemagglutinating VlhA C-terminal fragment from a Tunisian lineage of strain WVU1853(T. Seven different sequence variants of an imperfectly conserved, single-copy, 12-a.a. candidate cytadherence motif were evident amid the flanking variable residues of the 11 total sequences examined. The motif was predicted to adopt a short hairpin structure in a low-complexity region near the C-terminus of VlhA. Biotinylated synthetic oligopeptides representing four selected variants of the 12-a.a. motif, with the whole synthesized 60-a.a. fragment as a positive control, differed (P<0.01 in the extent they bound to chicken erythrocyte membranes. All bound to a greater extent (P<0.01 than scrambled or irrelevant VlhA domain negative control peptides did. Experimentally introduced branched-chain amino acid (BCAA substitutions Val3Ile and Leu7Ile did not significantly alter binding, whereas fold-destabilizing substitutions Thr4Gly and Ala9Gly tended to reduce it (P<0.05. Binding was also reduced to background levels (P<0.01 when the peptides were exposed to desialylated membranes, or were pre-saturated with free sialic acid before exposure to untreated membranes. From this evidence we conclude that the motif P-X-(BCAA-X-F-X-(BCAA-X-A-K-X-G binds sialic acid and likely mediates VlhA-dependent M. synoviae attachment to host cells. This conserved mechanism retains the potential for fine-scale rheostasis in binding avidity, which could be a

  19. Solution NMR characterization of Sgf73(1-104) indicates that Zn ion is required to stabilize zinc finger motif

    International Nuclear Information System (INIS)

    Lai, Chaohua; Wu, Minhao; Li, Pan; Shi, Chaowei; Tian, Changlin; Zang, Jianye

    2010-01-01

    Zinc finger motif contains a zinc ion coordinated by several conserved amino acid residues. Yeast Sgf73 protein was identified as a component of SAGA (Spt/Ada/Gcn5 acetyltransferase) multi-subunit complex and Sgf73 protein was known to contain two zinc finger motifs. Sgf73(1-104), containing the first zinc finger motif, was necessary to modulate the deubiquitinase activity of SAGA complex. Here, Sgf73(1-104) was over-expressed using bacterial expression system and purified for solution NMR (nuclear magnetic resonance) structural studies. Secondary structure and site-specific relaxation analysis of Sgf73(1-104) were achieved after solution NMR backbone assignment. Solution NMR and circular dichroism analysis of Sgf73(1-104) after zinc ion removal using chelation reagent EDTA (ethylene-diamine-tetraacetic acid) demonstrated that zinc ion was required to maintain stable conformation of the zinc finger motif.

  20. Interaction of Cu(+) with cytosine and formation of i-motif-like C-M(+)-C complexes: alkali versus coinage metals.

    Science.gov (United States)

    Gao, Juehan; Berden, Giel; Rodgers, M T; Oomens, Jos

    2016-03-14

    The Watson-Crick structure of DNA is among the most well-known molecular structures of our time. However, alternative base-pairing motifs are also known to occur, often depending on base sequence, pH, or the presence of cations. Pairing of cytosine (C) bases induced by the sharing of a single proton (C-H(+)-C) may give rise to the so-called i-motif, which occurs primarily in expanded trinucleotide repeats and the telomeric region of DNA, particularly at low pH. At physiological pH, silver cations were recently found to stabilize C dimers in a C-Ag(+)-C structure analogous to the hemiprotonated C-dimer. Here we use infrared ion spectroscopy in combination with density functional theory calculations at the B3LYP/6-311G+(2df,2p) level to show that copper in the 1+ oxidation state induces an analogous formation of C-Cu(+)-C structures. In contrast to protons and these transition metal ions, alkali metal ions induce a different dimer structure, where each ligand coordinates the alkali metal ion in a bidentate fashion in which the N3 and O2 atoms of both cytosine ligands coordinate to the metal ion, sacrificing hydrogen-bonding interactions between the ligands for improved chelation of the metal cation.

  1. The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

    Directory of Open Access Journals (Sweden)

    Hao Ding

    Full Text Available Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI. The Q motif, consisting of nine amino acids (GFXXPXPIQ with an invariant glutamine (Q residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11 gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.

  2. Results of de-novo and Motif activity analyses - FANTOM5 | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data List Contact us FANTOM... JASPAR) Data file File name: Motifs File URL: ftp://ftp.biosciencedbc.jp/archive/fantom5/datafiles/phase1.3...tabase Database Description Download License Update History of This Database Site Policy | Contact Us Results of de-novo and Motif activity analyses - FANTOM5 | LSDB Archive ...

  3. The special neuraminidase stalk-motif responsible for increased virulence and pathogenesis of H5N1 influenza A virus.

    Directory of Open Access Journals (Sweden)

    Hongbo Zhou

    Full Text Available The variation of highly pathogenic avian influenza H5N1 virus results in gradually increased virulence in poultry, and human cases continue to accumulate. The neuraminidase (NA stalk region of influenza virus varies considerably and may associate with its virulence. The NA stalk region of all N1 subtype influenza A viruses can be divided into six different stalk-motifs, H5N1/2004-like (NA-wt, WSN-like, H5N1/97-like, PR/8-like, H7N1/99-like and H5N1/96-like. The NA-wt is a special NA stalk-motif which was first observed in H5N1 influenza virus in 2000, with a 20-amino acid deletion in the 49(th to 68(th positions of the stalk region. Here we show that there is a gradual increase of the special NA stalk-motif in H5N1 isolates from 2000 to 2007, and notably, the special stalk-motif is observed in all 173 H5N1 human isolates from 2004 to 2007. The recombinant H5N1 virus with the special stalk-motif possesses the highest virulence and pathogenicity in chicken and mice, while the recombinant viruses with the other stalk-motifs display attenuated phenotype. This indicates that the special stalk-motif has contributed to the high virulence and pathogenicity of H5N1 isolates since 2000. The gradually increasing emergence of the special NA stalk-motif in H5N1 isolates, especially in human isolates, deserves attention by all.

  4. An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

    OpenAIRE

    Chang, Tzu-Hao; Huang, Hsi-Yuan; Hsu, Justin Bo-Kai; Weng, Shun-Long; Horng, Jorng-Tzong; Huang, Hsien-Da

    2013-01-01

    Background Functional RNA molecules participate in numerous biological processes, ranging from gene regulation to protein synthesis. Analysis of functional RNA motifs and elements in RNA sequences can obtain usef