short rna motif: Topics by WorldWideScience.org

Sample records for short rna motif

Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

Science.gov (United States)

Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

2013-01-01

The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545
RegRNA: an integrated web server for identifying regulatory RNA motifs and elements

OpenAIRE

Huang, Hsi-Yuan; Chien, Chia-Hung; Jen, Kuan-Hua; Huang, Hsien-Da

2006-01-01

Numerous regulatory structural motifs have been identified as playing essential roles in transcriptional and post-transcriptional regulation of gene expression. RegRNA is an integrated web server for identifying the homologs of regulatory RNA motifs and elements against an input mRNA sequence. Both sequence homologs and structural homologs of regulatory RNA motifs can be recognized. The regulatory RNA motifs supported in RegRNA are categorized into several classes: (i) motifs in mRNA 5′-untra...
RNA motif search with data-driven element ordering.

Science.gov (United States)

Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

2016-05-18

In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .
Annotating RNA motifs in sequences and alignments.

Science.gov (United States)

Gardner, Paul P; Eldai, Hisham

2015-01-01

RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of an RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
BEAM web server: a tool for structural RNA motif discovery.

Science.gov (United States)

Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

2018-03-15

RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable of handling tens of thousands of RNAs as input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated with a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure into a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. Contact: marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.
An enhanced computational platform for investigating the roles of regulatory RNA and for identifying functional RNA motifs

OpenAIRE

Chang, Tzu-Hao; Huang, Hsi-Yuan; Hsu, Justin Bo-Kai; Weng, Shun-Long; Horng, Jorng-Tzong; Huang, Hsien-Da

2013-01-01

Background Functional RNA molecules participate in numerous biological processes, ranging from gene regulation to protein synthesis. Analysis of functional RNA motifs and elements in RNA sequences can obtain useful information for deciphering RNA regulatory mechanisms. Our previous work, RegRNA, is widely used in the identification of regulatory motifs, and this work extends it by incorporating more comprehensive and updated data sources and analytical approaches into a new platform. Methods ...
RNA recognition motif (RRM)-containing proteins in Bombyx mori

African Journals Online (AJOL)

2009-03-20

Mar 20, 2009 ... Recognition Motif (RRM), sometimes referred to as. RNP1, is one of the first identified domains for RNA interaction. RRM is very common ..... Apart from the RRM motif, eIF3-S9 has a Trp-Asp. (WD) repeat domain, Poly (A) ...
Molecular dynamics simulations of electrostatics and hydration distributions around RNA and DNA motifs

Science.gov (United States)

Marlowe, Ashley E.; Singh, Abhishek; Semichaevsky, Andrey V.; Yingling, Yaroslava G.

2009-03-01

Nucleic acid nanoparticles can self-assemble through the formation of complementary loop-loop interactions or stem-stem interactions. The presence and concentration of ions can significantly affect the self-assembly process and the stability of the nanostructure. In this presentation we use explicit molecular dynamics simulations to examine the variations in cationic distributions and hydration environment around DNA and RNA helices and loop-loop interactions. Our simulations show that the potassium and sodium ionic distributions are different around RNA and DNA motifs, which could be indicative of ion-mediated relative stability of loop-loop complexes. Moreover, in RNA loop-loop motifs ions are consistently present and exchanged through a distinct electronegative channel. We will also show how we used the specific RNA loop-loop motif to design an RNA hexagonal nanoparticle.
Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

Science.gov (United States)

Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

2013-09-02

In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels. An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome
Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.

Science.gov (United States)

Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J

2017-05-30

We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.
De Novo Discovery of Structured ncRNA Motifs in Genomic Sequences

DEFF Research Database (Denmark)

Ruzzo, Walter L; Gorodkin, Jan

2014-01-01

De novo discovery of "motifs" capturing the commonalities among related noncoding structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphasis on an approach based on the CMfinder program as a case study. Applications to genomic screens for novel structured ncRNAs, including structured RNA elements in untranslated portions of protein-coding genes, are presented.
Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

Directory of Open Access Journals (Sweden)

Jie Zhu

DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.
Accurate Quantification of microRNA via Single Strand Displacement Reaction on DNA Origami Motif

Science.gov (United States)

Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

2013-01-01

DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs. PMID:23990889
Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

Science.gov (United States)

Zhu, Jie; Feng, Xiaolu; Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

2013-01-01

DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.
Structure of the central RNA recognition motif of human TIA-1 at 1.95 Å resolution

International Nuclear Information System (INIS)

Kumar, Amit O.; Swenson, Matthew C.; Benning, Matthew M.; Kielkopf, Clara L.

2008-01-01

T-cell-restricted intracellular antigen-1 (TIA-1) regulates alternative pre-mRNA splicing in the nucleus, and mRNA translation in the cytoplasm, by recognizing uridine-rich sequences of RNAs. As a step towards understanding RNA recognition by this regulatory factor, the X-ray structure of the central RNA recognition motif (RRM2) of human TIA-1 is presented at 1.95 Å resolution. Comparison with structurally homologous RRM-RNA complexes identifies residues at the RNA interfaces that are conserved in TIA-1-RRM2. The versatile capability of RNP motifs to interact with either proteins or RNA is reinforced by symmetry-related protein-protein interactions mediated by the RNP motifs of TIA-1-RRM2. Importantly, the TIA-1-RRM2 structure reveals the locations of mutations responsible for inhibiting nuclear import. In contrast with previous assumptions, the mutated residues are buried within the hydrophobic interior of the domain, where they would be likely to destabilize the RRM fold rather than directly inhibit RNA binding.
cWords - systematic microRNA regulatory motif discovery from mRNA expression data

DEFF Research Database (Denmark)

Rasmussen, Simon Horskjær; Jacobsen, Anders; Krogh, Anders

2013-01-01

and statistical methods of cWords, resulting in at least a factor 100 speed gain over the previous implementation. On a benchmark dataset of 19 microRNA (miRNA) perturbation experiments cWords showed equal or better performance than two comparable methods, miReduce and Sylamer. We have developed rigorous motif...... that demonstrate comparable or better performance than other existing methods. Rich visualization of results promotes intuitive and efficient interpretation of data. cWords is available as a stand-alone Open Source program at Github https://github.com/simras/cWords and as a web-service at: http...
Short sequence motifs, overrepresented in mammalian conserved non-coding sequences

Energy Technology Data Exchange (ETDEWEB)

Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

2007-02-21

Background: A substantial fraction of non-coding DNA sequences of multicellular eukaryotes is under selective constraint. In particular, ~5 percent of the human genome consists of conserved non-coding sequences (CNSs). CNSs differ from other genomic sequences in their nucleotide composition and must play important functional roles, which mostly remain obscure. Results: We investigated relative abundances of short sequence motifs in all human CNSs present in the human/mouse whole-genome alignments vs. three background sets of sequences: (i) weakly conserved or unconserved non-coding sequences (non-CNSs); (ii) near-promoter sequences (located between nucleotides -500 and -1500, relative to a start of transcription); and (iii) random sequences with the same nucleotide composition as that of CNSs. When compared to non-CNSs and near-promoter sequences, CNSs possess an excess of AT-rich motifs, often containing runs of identical nucleotides. In contrast, when compared to random sequences, CNSs contain an excess of GC-rich motifs which, however, lack CpG dinucleotides. Thus, abundance of short sequence motifs in human CNSs, taken as a whole, is mostly determined by their overall compositional properties and not by overrepresentation of any specific short motifs. These properties are: (i) high AT-content of CNSs, (ii) a tendency, probably due to context-dependent mutation, of A's and T's to clump, (iii) presence of short GC-rich regions, and (iv) avoidance of CpG contexts, due to their hypermutability. Only a small number of short motifs, overrepresented in all human CNSs are similar to binding sites of transcription factors from the FOX family. Conclusion: Human CNSs as a whole appear to be too broad a class of sequences to possess strong footprints of any short sequence-specific functions. Such footprints should be studied at the level of functional subclasses of CNSs, such as those which flank genes with a particular pattern of expression. Overall properties of CNSs are affected by
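The comparison of short-motif abundances between CNSs and a background set can be illustrated with a small k-mer counting sketch. This is not the authors' pipeline: the function names, the pseudocount, and the toy sequences are assumptions made for illustration, and the score is a plain log2 ratio of smoothed k-mer frequencies.

```python
from collections import Counter
from math import log2

def kmer_counts(seqs, k):
    """Count every overlapping k-mer made only of A, C, G, T."""
    counts = Counter()
    for s in seqs:
        s = s.upper()
        for i in range(len(s) - k + 1):
            kmer = s[i:i + k]
            if set(kmer) <= set("ACGT"):
                counts[kmer] += 1
    return counts

def log2_enrichment(foreground, background, k, pseudo=1.0):
    """log2(foreground frequency / background frequency) per k-mer.

    `pseudo` is an illustrative pseudocount that avoids division by zero
    for k-mers seen in only one of the two sets.
    """
    fg, bg = kmer_counts(foreground, k), kmer_counts(background, k)
    kmers = set(fg) | set(bg)
    fg_total = sum(fg.values()) + pseudo * len(kmers)
    bg_total = sum(bg.values()) + pseudo * len(kmers)
    return {
        kmer: log2(((fg[kmer] + pseudo) / fg_total) / ((bg[kmer] + pseudo) / bg_total))
        for kmer in kmers
    }

# Toy data only: an AT-run "CNS-like" sequence against a mixed background.
scores = log2_enrichment(["AAAAAA"], ["ACGTAC"], k=2)
```

A positive score marks a k-mer that is more frequent in the foreground than in the background. As the abstract stresses, such scores largely reflect overall base composition, so the choice of background set changes which motifs appear enriched.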
Motif III in superfamily 2 "helicases" helps convert the binding energy of ATP into a high-affinity RNA binding site in the yeast DEAD-box protein Ded1.

Science.gov (United States)

Banroques, Josette; Doère, Monique; Dreyfus, Marc; Linder, Patrick; Tanner, N Kyle

2010-03-05

Motif III in the putative helicases of superfamily 2 is highly conserved in both its sequence and its structural context. It typically consists of the sequence alcohol-alanine-alcohol (S/T-A-S/T). Historically, it was thought to link ATPase activity with a "helicase" strand displacement activity that disrupts RNA or DNA duplexes. DEAD-box proteins constitute the largest family of superfamily 2; they are RNA-dependent ATPases and ATP-dependent RNA binding proteins that, in some cases, are able to disrupt short RNA duplexes. We made mutations of motif III (S-A-T) in the yeast DEAD-box protein Ded1 and analyzed in vivo phenotypes and in vitro properties. Moreover, we made a tertiary model of Ded1 based on the solved structure of Vasa. We used Ded1 because it has relatively high ATPase and RNA binding activities; it is able to displace moderately stable duplexes at a large excess of substrate. We find that the alanine and the threonine in the second and third positions of motif III are more important than the serine, but that mutations of all three residues have strong phenotypes. We purified the wild-type and various mutants expressed in Escherichia coli. We found that motif III mutations affect the RNA-dependent hydrolysis of ATP (k(cat)), but not the affinity for ATP (K(m)). Moreover, mutations alter and reduce the affinity for single-stranded RNA and subsequently reduce the ability to disrupt duplexes. We obtained intragenic suppressors of the S-A-C mutant that compensate for the mutation by enhancing the affinity for ATP and RNA. We conclude that motif III and the binding energy of gamma-PO(4) of ATP are used to coordinate motifs I, II, and VI and the two RecA-like domains to create a high-affinity single-stranded RNA binding site. It also may help activate the beta,gamma-phosphoanhydride bond of ATP. (c) 2009 Elsevier Ltd. All rights reserved.
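The motif III consensus given above (S/T-A-S/T) maps directly onto a regular expression. The sketch below only illustrates the pattern: the function name and the toy sequence are hypothetical, and the sequence is not the real Ded1 sequence.

```python
import re

# Motif III as described in the abstract: alcohol-alanine-alcohol (S/T-A-S/T).
# The lookahead reports overlapping matches as well.
MOTIF_III = re.compile(r"(?=([ST]A[ST]))")

def find_motif_iii(protein: str) -> list[tuple[int, str]]:
    """Return (0-based position, tripeptide) for every S/T-A-S/T occurrence."""
    return [(m.start(), m.group(1)) for m in MOTIF_III.finditer(protein.upper())]

# Hypothetical toy sequence, chosen so that two hits overlap.
hits = find_motif_iii("MKSATLLGTASAT")
```

A three-residue pattern will match by chance in most proteins, so hits of this kind are only meaningful together with the conserved structural context the abstract describes.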
Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins

Science.gov (United States)

de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin

2016-01-01

Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381
Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

Energy Technology Data Exchange (ETDEWEB)

Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Waleń, Tomasz [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); University of Warsaw, Banacha 2, 02-097 Warsaw (Poland); Piątkowski, Paweł; Potrzebowski, Wojciech [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Bujnicki, Janusz M. [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Adam Mickiewicz University, Umultowska 89, 61-614 Poznan (Poland)

2015-03-01

A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.

Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

International Nuclear Information System (INIS)

Chojnowski, Grzegorz; Waleń, Tomasz; Piątkowski, Paweł; Potrzebowski, Wojciech; Bujnicki, Janusz M.

2015-01-01

A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx
Short Arginine Motifs Drive Protein Stickiness in the Escherichia coli Cytoplasm.

Science.gov (United States)

Kyne, Ciara; Crowley, Peter B

2017-09-19

Although essential to numerous biotech applications, knowledge of molecular recognition by arginine-rich motifs in live cells remains limited. 1H, 15N HSQC and 19F NMR spectroscopies were used to investigate the effects of C-terminal -GRn (n = 1-5) motifs on GB1 interactions in Escherichia coli cells and cell extracts. While the "biologically inert" GB1 yields high-quality in-cell spectra, the -GRn fusions with n = 4 or 5 were undetectable. This result suggests that a tetra-arginine motif is sufficient to drive interactions between a test protein and macromolecules in the E. coli cytoplasm. The inclusion of a 12 residue flexible linker between GB1 and the -GR5 motif did not improve detection of the "inert" domain. In contrast, all of the constructs were detectable in cell lysates and extracts, suggesting that the arginine-mediated complexes were weak. Together these data reveal the significance of weak interactions between short arginine-rich motifs and the E. coli cytoplasm and demonstrate the potential of such motifs to modify protein interactions in living cells. These interactions must be considered in the design of (in vivo) nanoscale assemblies that rely on arginine-rich sequences.
Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

Science.gov (United States)

Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

2013-12-01

AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.
Identification of high-efficiency 3′GG gRNA motifs in indexed FASTA files with ngg2

Directory of Open Access Journals (Sweden)

Elisha D. Roberson

2015-11-01

CRISPR/Cas9 is emerging as one of the most-used methods of genome modification in organisms ranging from bacteria to human cells. However, the efficiency of editing varies tremendously site-to-site. A recent report identified a novel motif, called the 3′GG motif, which substantially increases the efficiency of editing at all sites tested in C. elegans. Furthermore, they highlighted that previously published gRNAs with high editing efficiency also had this motif. I designed a Python command-line tool, ngg2, to identify 3′GG gRNA sites from indexed FASTA files. As a proof-of-concept, I screened for these motifs in six model genomes: Saccharomyces cerevisiae, Caenorhabditis elegans, Drosophila melanogaster, Danio rerio, Mus musculus, and Homo sapiens. I also scanned the genomes of pig (Sus scrofa) and African elephant (Loxodonta africana) to demonstrate the utility in non-model organisms. I identified more than 60 million single match 3′GG motifs in these genomes. Greater than 61% of all protein coding genes in the reference genomes had at least one unique 3′GG gRNA site overlapping an exon. In particular, more than 96% of mouse and 93% of human protein coding genes have at least one unique, overlapping 3′GG gRNA. These identified sites can be used as a starting point in gRNA selection, and the ngg2 tool provides an important ability to identify 3′GG editing sites in any species with an available genome sequence.
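The kind of scan described in this abstract can be illustrated with a minimal Python sketch. It is not the ngg2 implementation: it assumes a site is a 20-nt protospacer whose last two bases are GG, followed directly by an NGG PAM, and it works on a plain string rather than an indexed FASTA file. The names `SITE`, `reverse_complement`, and `find_3gg_sites` are hypothetical and chosen only for illustration; the actual tool's matching rules and output may differ.

```python
import re

# Assumption: a 3'GG site is 18 arbitrary bases + GG (the protospacer),
# immediately followed by an NGG PAM. The lookahead lets overlapping
# candidate sites be reported.
SITE = re.compile(r"(?=([ACGT]{18}GG)([ACGT]GG))")

# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def find_3gg_sites(seq):
    """List (strand, start, protospacer, pam) tuples for candidate sites.

    For the '-' strand, `start` is an offset into the reverse-complemented
    sequence, not into the original sequence.
    """
    seq = seq.upper()
    hits = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for m in SITE.finditer(s):
            hits.append((strand, m.start(), m.group(1), m.group(2)))
    return hits


if __name__ == "__main__":
    demo = "A" * 18 + "GG" + "TGG"
    for hit in find_3gg_sites(demo):
        print(hit)
```

A genome-scale scan would additionally need to stream sequence from the FASTA index and check each protospacer for uniqueness, which this sketch leaves out.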
Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.

Science.gov (United States)

Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M

1997-01-01

RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620
The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search

DEFF Research Database (Denmark)

Havgaard, Jakob Hull; Lyngsø, Rune B.; Gorodkin, Jan

2005-01-01

FOLDALIGN is a Sankoff-based algorithm for making structural alignments of RNA sequences. Here, we present a web server for making pairwise alignments between two RNA sequences, using the recently updated version of FOLDALIGN. The server can be used to scan two sequences for a common structural RNA...... motif of limited size, or the entire sequences can be aligned locally or globally. The web server offers a graphical interface, which makes it simple to make alignments and manually browse the results. The web server can be accessed at http://foldalign.kvl.dk...
Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

KAUST Repository

Sayadi, Ahmed

2011-07-20

The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length of the motifs and their variable degree of conservation makes their identification hard since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).
URS DataBase: universe of RNA structures and their motifs.

Science.gov (United States)

Baulin, Eugene; Yacovlev, Victor; Khachko, Denis; Spirin, Sergei; Roytberg, Mikhail

2016-01-01

The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing users to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA-protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in the case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification. Database URL: http://server3.lpm.org.ru/urs/. © The Author(s) 2016. Published by Oxford University Press.
Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

KAUST Repository

Sayadi, Ahmed; Briganti, Leonardo; Tramontano, Anna; Via, Allegra

2011-01-01

The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length
A Conserved Target Site in HIV-1 Gag RNA is Accessible to Inhibition by Both an HDV Ribozyme and a Short Hairpin RNA

Directory of Open Access Journals (Sweden)

Robert J Scarborough

2014-01-01

Antisense-based molecules targeting HIV-1 RNA have the potential to be used as part of gene or drug therapy to treat HIV-1 infection. In this study, HIV-1 RNA was screened to identify more conserved and accessible target sites for ribozymes based on the hepatitis delta virus motif. Using a quantitative screen for effects on HIV-1 production, we identified a ribozyme targeting a highly conserved site in the Gag coding sequence with improved inhibitory potential compared to our previously described candidates targeting the overlapping Tat/Rev coding sequence. We also demonstrate that this target site is highly accessible to short hairpin directed RNA interference, suggesting that it may be available for the binding of antisense RNAs with different modes of action. We provide evidence that this target site is structurally conserved in diverse viral strains and that it is sufficiently different from the human transcriptome to limit off-target effects from antisense therapies. We also show that the modified hepatitis delta virus ribozyme is more sensitive to a mismatch in its target site compared to the short hairpin RNA. Overall, our results validate the potential of a new target site in HIV-1 RNA to be used for the development of antisense therapies.
RNA Binding of T-cell Intracellular Antigen-1 (TIA-1) C-terminal RNA Recognition Motif Is Modified by pH Conditions*

Science.gov (United States)

Cruz-Gallardo, Isabel; Aroca, Ángeles; Persson, Cecilia; Karlsson, B. Göran; Díaz-Moreno, Irene

2013-01-01

T-cell intracellular antigen-1 (TIA-1) is a DNA/RNA-binding protein that regulates critical events in cell physiology by the regulation of pre-mRNA splicing and mRNA translation. TIA-1 is composed of three RNA recognition motifs (RRMs) and a glutamine-rich domain and binds to uridine-rich RNA sequences through its C-terminal RRM2 and RRM3 domains. Here, we show that RNA binding mediated by either isolated RRM3 or the RRM23 construct is controlled by slight environmental pH changes due to the protonation/deprotonation of TIA-1 RRM3 histidine residues. The auxiliary role of the C-terminal RRM3 domain in TIA-1 RNA recognition is poorly understood, and this work provides insight into its binding mechanisms. PMID:23902765
Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA.

Science.gov (United States)

Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

2016-02-02

The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.
Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

Science.gov (United States)

Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

2016-03-14

RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Finding the most significant common sequence and structure motifs in a set of RNA sequences

DEFF Research Database (Denmark)

Gorodkin, Jan; Heyer, L.J.; Stormo, G.D.

1997-01-01

We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints. In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections. The first part utilizes a simplified...
Structural insight into RNA recognition motifs: versatile molecular Lego building blocks for biological systems.

Science.gov (United States)

Muto, Yutaka; Yokoyama, Shigeyuki

2012-01-01

'RNA recognition motifs (RRMs)' are common domain-folds composed of 80-90 amino-acid residues in eukaryotes, and have been identified in many cellular proteins. At first they were known as RNA binding domains. Through discoveries over the past 20 years, however, the RRMs have been shown to exhibit versatile molecular recognition activities and to behave as molecular Lego building blocks to construct biological systems. Novel RNA/protein recognition modes by RRMs are being identified, and more information about the molecular recognition by RRMs is becoming available. These RNA/protein recognition modes are strongly correlated with their biological significance. In this review, we would like to survey the recent progress on these versatile molecular recognition modules. Copyright © 2012 John Wiley & Sons, Ltd.
miRNA Enriched in Human Neuroblast Nuclei Bind the MAZ Transcription Factor and Their Precursors Contain the MAZ Consensus Motif.

Science.gov (United States)

Goldie, Belinda J; Fitzsimmons, Chantel; Weidenhofer, Judith; Atkins, Joshua R; Wang, Dan O; Cairns, Murray J

2017-01-01

While the cytoplasmic function of microRNA (miRNA) as post-transcriptional regulators of mRNA has been the subject of significant research effort, their activity in the nucleus is less well characterized. Here we use a human neuronal cell model to show that some mature miRNA are preferentially enriched in the nucleus. These molecules were predominantly primate-specific and contained a sequence motif with homology to the consensus MAZ transcription factor binding element. Precursor miRNA containing this motif were shown to have affinity for MAZ protein in nuclear extract. We then used Ago1/2 RIP-Seq to explore nuclear miRNA-associated mRNA targets. Interestingly, the genes for Ago2-associated transcripts were also significantly enriched with MAZ binding sites and neural function, whereas Ago1-transcripts were associated with general metabolic processes and localized with SC35 spliceosomes. These findings suggest the MAZ transcription factor is associated with miRNA in the nucleus and may influence the regulation of neuronal development through Ago2-associated miRNA induced silencing complexes. The MAZ transcription factor may therefore be important for organizing higher order integration of transcriptional and post-transcriptional processes in primate neurons.
Design of a Bioactive Small Molecule that Targets the Myotonic Dystrophy Type 1 RNA Via an RNA Motif-Ligand Database & Chemical Similarity Searching

Science.gov (United States)

Parkesh, Raman; Childs-Disney, Jessica L.; Nakamori, Masayuki; Kumar, Amit; Wang, Eric; Wang, Thomas; Hoskins, Jason; Tran, Tuan; Housman, David; Thornton, Charles A.; Disney, Matthew D.

2012-01-01

Myotonic dystrophy type 1 (DM1) is a triplet repeating disorder caused by expanded CTG repeats in the 3′ untranslated region of the dystrophia myotonica protein kinase (DMPK) gene. The transcribed repeats fold into an RNA hairpin with multiple copies of a 5′CUG/3′GUC motif that binds the RNA splicing regulator muscleblind-like 1 protein (MBNL1). Sequestration of MBNL1 by expanded r(CUG) repeats causes splicing defects in a subset of pre-mRNAs including the insulin receptor, the muscle-specific chloride ion channel, Sarco(endo)plasmic reticulum Ca2+ ATPase 1 (Serca1/Atp2a1), and cardiac troponin T (cTNT). Based on these observations, the development of small molecule ligands that target specifically expanded DM1 repeats could serve as therapeutics. In the present study, computational screening was employed to improve the efficacy of pentamidine and Hoechst 33258 ligands that have been shown previously to target the DM1 triplet repeat. A series of inhibitors of the RNA-protein complex with low micromolar IC50’s, which are >20-fold more potent than the query compounds, were identified. Importantly, a bis-benzimidazole identified from the Hoechst query improves DM1-associated pre-mRNA splicing defects in cell and mouse models of DM1 (when dosed with 1 mM and 100 mg/kg, respectively). Since Hoechst 33258 was identified as a DM1 binder through analysis of an RNA motif-ligand database, these studies suggest that lead ligands targeting RNA with improved biological activity can be identified by using a synergistic approach that combines analysis of known RNA-ligand interactions with virtual screening. PMID:22300544
miRNA Enriched in Human Neuroblast Nuclei Bind the MAZ Transcription Factor and Their Precursors Contain the MAZ Consensus Motif

Directory of Open Access Journals (Sweden)

Belinda J. Goldie

2017-08-01

While the cytoplasmic function of microRNA (miRNA) as post-transcriptional regulators of mRNA has been the subject of significant research effort, their activity in the nucleus is less well characterized. Here we use a human neuronal cell model to show that some mature miRNA are preferentially enriched in the nucleus. These molecules were predominantly primate-specific and contained a sequence motif with homology to the consensus MAZ transcription factor binding element. Precursor miRNA containing this motif were shown to have affinity for MAZ protein in nuclear extract. We then used Ago1/2 RIP-Seq to explore nuclear miRNA-associated mRNA targets. Interestingly, the genes for Ago2-associated transcripts were also significantly enriched with MAZ binding sites and neural function, whereas Ago1-transcripts were associated with general metabolic processes and localized with SC35 spliceosomes. These findings suggest the MAZ transcription factor is associated with miRNA in the nucleus and may influence the regulation of neuronal development through Ago2-associated miRNA induced silencing complexes. The MAZ transcription factor may therefore be important for organizing higher order integration of transcriptional and post-transcriptional processes in primate neurons.
Three RNA recognition motifs participate in RNA recognition and structural organization by the pro-apoptotic factor TIA-1

Science.gov (United States)

Bauer, William J.; Heath, Jason; Jenkins, Jermaine L.; Kielkopf, Clara L.

2012-01-01

T-cell intracellular antigen-1 (TIA-1) regulates developmental and stress-responsive pathways through distinct activities at the levels of alternative pre-mRNA splicing and mRNA translation. The TIA-1 polypeptide contains three RNA recognition motifs (RRMs). The central RRM2 and C-terminal RRM3 associate with cellular mRNAs. The N-terminal RRM1 enhances interactions of a C-terminal Q-rich domain of TIA-1 with the U1-C splicing factor, despite linear separation of the domains in the TIA-1 sequence. Given the expanded functional repertoire of the RRM family, it was unknown whether TIA-1 RRM1 contributes to RNA binding as well as documented protein interactions. To address this question, we used isothermal titration calorimetry and small-angle X-ray scattering (SAXS) to dissect the roles of the TIA-1 RRMs in RNA recognition. Notably, the fas RNA exhibited two binding sites with indistinguishable affinities for TIA-1. Analyses of TIA-1 variants established that RRM1 was dispensable for binding AU-rich fas sites, yet all three RRMs were required to bind a polyU RNA with high affinity. SAXS analyses demonstrated a 'V' shape for a TIA-1 construct comprising the three RRMs, and revealed that its dimensions became more compact in the RNA-bound state. The sequence-selective involvement of TIA-1 RRM1 in RNA recognition suggests a possible role for RNA sequences in regulating the distinct functions of TIA-1. Further implications for U1-C recruitment by the adjacent TIA-1 binding sites of the fas pre-mRNA and the bent TIA-1 shape, which organizes the N- and C-termini on the same side of the protein, are discussed. PMID:22154808
Use of a Yeast tRNase Killer Toxin to Diagnose Kti12 Motifs Required for tRNA Modification by Elongator.

Science.gov (United States)

Mehlgarten, Constance; Prochaska, Heike; Hammermeister, Alexander; Abdel-Fattah, Wael; Wagner, Melanie; Krutyhołowa, Rościsław; Jun, Sang Eun; Kim, Gyung-Tae; Glatt, Sebastian; Breunig, Karin D; Stark, Michael J R; Schaffrath, Raffael

2017-09-05

Saccharomyces cerevisiae cells are killed by zymocin, a tRNase ribotoxin complex from Kluyveromyces lactis, which cleaves anticodons and inhibits protein synthesis. Zymocin's action requires specific chemical modification of uridine bases in the anticodon wobble position (U34) by the Elongator complex (Elp1-Elp6). Hence, loss of anticodon modification in mutants lacking Elongator or related KTI (K. lactis Toxin Insensitive) genes protects against tRNA cleavage and confers resistance to the toxin. Here, we show that zymocin can be used as a tool to genetically analyse KTI12, a gene previously shown to code for an Elongator partner protein. From a kti12 mutant pool of zymocin survivors, we identify motifs in Kti12 that are functionally directly coupled to Elongator activity. In addition, shared requirement of U34 modifications for nonsense and missense tRNA suppression (SUP4; SOE1) strongly suggests that Kti12 and Elongator cooperate to assure proper tRNA functioning. We show that the Kti12 motifs are conserved in plant ortholog DRL1/ELO4 from Arabidopsis thaliana and seem to be involved in binding of cofactors (e.g., nucleotides, calmodulin). Elongator interaction defects triggered by mutations in these motifs correlate with phenotypes typical for loss of U34 modification. Thus, tRNA modification by Elongator appears to require physical contact with Kti12, and our preliminary data suggest that metabolic signals may affect proper communication between them.

Short RNA guides cleavage by eukaryotic RNase III.

Directory of Open Access Journals (Sweden)

Bruno Lamontagne

In eukaryotes, short RNAs guide a variety of enzymatic activities that range from RNA editing to translation repression. It is hypothesized that pre-existing proteins evolved to bind and use guide RNA during evolution. However, the capacity of modern proteins to adopt new RNA guides has never been demonstrated. Here we show that Rnt1p, the yeast orthologue of the bacterial dsRNA-specific RNase III, can bind short RNA transcripts and use them as guides for sequence-specific cleavage. Target cleavage occurred at a constant distance from the Rnt1p binding site, leaving the guide RNA intact for subsequent cleavage. Our results indicate that RNase III may trigger sequence-specific RNA degradation independent of the RNAi machinery, and they open the road for a new generation of precise RNA silencing tools that do not trigger a dsRNA-mediated immune response.
Induction of cell death by tospoviral protein NSs and the motif critical for cell death does not control RNA silencing suppression activity.

Science.gov (United States)

Singh, Ajeet; Permar, Vipin; Jain, R K; Goswami, Suneha; Kumar, Ranjeet Ranjan; Canto, Tomas; Palukaitis, Peter; Praveen, Shelly

2017-08-01

Groundnut bud necrosis virus induces necrotic symptoms in different hosts. Previous studies showed reactive oxygen species-mediated programmed cell death (PCD) resulted in necrotic symptoms. Transgenic expression of viral protein NSs mimics viral symptoms. Here, we showed a role for NSs in influencing oxidative burst in the cell, by analyzing H2O2 accumulation, activities of antioxidant enzymes and expression levels of vacuolar processing enzymes, H2O2-responsive microRNA 319a.2 plus its possible target metacaspase-8. The role of NSs in PCD was shown using two NSs mutants: one in the Trp/GH3 motif (a homologue of pro-apoptotic domain) (NSs S189R) and the other in a non-Trp/GH3 motif (NSs L172R). Tobacco rattle virus (TRV) expressing NSs S189R enhanced the PCD response, but not TRV-NSs L172R, while RNA silencing suppression activity was lost in TRV-NSs L172R, but not in TRV-NSs S189R. Therefore, we propose dual roles of NSs in RNA silencing suppression and induction of cell death, controlled by different motifs. Copyright © 2017 Elsevier Inc. All rights reserved.
Flow Cytometry-Assisted Cloning of Specific Sequence Motifs from Complex 16S rRNA Gene Libraries

DEFF Research Database (Denmark)

Nielsen, Jeppe Lund; Schramm, Andreas; Bernhard, Anne E.

2004-01-01

A flow cytometry method was developed for rapid screening and recovery of cloned DNA containing common sequence motifs. This approach, termed fluorescence-activated cell sorting-assisted cloning, was used to recover sequences affiliated with a unique lineage within the Bacteroidetes not abundant in a clone library of environmental 16S rRNA genes. ... Authors and affiliations as given in the record: Jeppe L. Nielsen (1), Andreas Schramm (1,2), Anne E. Bernhard (1), Gerrit J. van den Engh (3), and David A. Stahl (1); (1) Department of Civil and Environmental Engineering, University of Washington, and (3) Institute for Systems Biology, Seattle, Washington; (2) Department of Ecological Microbiology, University of Bayreuth, Bayreuth, Germany.
iELM—a web server to explore short linear motif-mediated interactions

Science.gov (United States)

Weatheritt, Robert J.; Jehl, Peter; Dinkel, Holger; Gibson, Toby J.

2012-01-01

The recent expansion in our knowledge of protein–protein interactions (PPIs) has allowed the annotation and prediction of hundreds of thousands of interactions. However, the function of many of these interactions remains elusive. The interactions of Eukaryotic Linear Motif (iELM) web server provides a resource for predicting the function and positional interface for a subset of interactions mediated by short linear motifs (SLiMs). The iELM prediction algorithm is based on the annotated SLiM classes from the Eukaryotic Linear Motif (ELM) resource and allows users to explore both annotated and user-generated PPI networks for SLiM-mediated interactions. By incorporating the annotated information from the ELM resource, iELM provides functional details of PPIs. This can be used in proteomic analysis, for example, to infer whether an interaction promotes complex formation or degradation. Furthermore, details of the molecular interface of the SLiM-mediated interactions are also predicted. This information is displayed in a fully searchable table, as well as graphically with the modular architecture of the participating proteins extracted from the UniProt and Phospho.ELM resources. A network figure is also presented to aid the interpretation of results. The iELM server supports single protein queries as well as large-scale proteomic submissions and is freely available at http://i.elm.eu.org. PMID:22638578
MotifMark: Finding regulatory motifs in DNA sequences.

Science.gov (United States)

Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

2017-07-01

The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.
RNA-protein binding motifs mining with a new hybrid deep learning based cross-domain knowledge integration approach.

Science.gov (United States)

Pan, Xiaoyong; Shen, Hong-Bin

2017-02-28

RNAs play key roles in cells through the interactions with proteins known as the RNA-binding proteins (RBP) and their binding motifs enable crucial understanding of the post-transcriptional regulation of RNAs. How the RBPs correctly recognize the target RNAs and why they bind specific positions is still far from clear. Machine learning-based algorithms are widely acknowledged to be capable of speeding up this process. Although many automatic tools have been developed to predict the RNA-protein binding sites from the rapidly growing multi-resource data, e.g. sequence, structure, their domain specific features and formats have posed significant computational challenges. One of the current difficulties is that the cross-source shared common knowledge is at a higher abstraction level beyond the observed data, resulting in a low efficiency of direct integration of observed data across domains. The other difficulty is how to interpret the prediction results. Existing approaches tend to terminate after outputting the potential discrete binding sites on the sequences, but how to assemble them into meaningful binding motifs is a topic worthy of further investigation. In view of these challenges, we propose a deep learning-based framework (iDeep) by using a novel hybrid convolutional neural network and deep belief network to predict the RBP interaction sites and motifs on RNAs. This new protocol is characterized by transforming the original observed data into a high-level abstraction feature space using multiple layers of learning blocks, where the shared representations across different domains are integrated. To validate our iDeep method, we performed experiments on 31 large-scale CLIP-seq datasets, and our results show that by integrating multiple sources of data, the average AUC can be improved by 8% compared to the best single-source-based predictor; and through cross-domain knowledge integration at an abstraction level, it outperforms the state-of-the-art predictors by 6
The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.

Science.gov (United States)

Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko

2013-07-01

AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.
BayesMotif: de novo protein sorting motif discovery from impure datasets.

Science.gov (United States)

Hu, Jianjun; Zhang, Fan

2010-01-18

Protein sorting is the process by which newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals consists of amino acid sub-sequences usually located at the N-termini or C-termini of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals are needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier-based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motif in which a highly conserved anchor is present along with less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence datasets. They also show that the false positive removal procedure can help to identify true motifs even when only 20% of the input sequences contain true motif instances. We proposed BayesMotif, a novel Bayesian classification-based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of
Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

Science.gov (United States)

Soufari, Heddy

2017-01-01

Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515
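As a rough illustration of the sequence preference described in this abstract, the following Python sketch scans an RNA string for pairs of GCAC motifs separated by more than 6 nt. The function name `find_gcac_pairs`, the example sequence, and the choice to measure the spacer as the number of nucleotides between the end of one motif and the start of the next are assumptions made for illustration; they are not taken from the paper.

```python
import re

def find_gcac_pairs(rna, min_spacer=7):
    """Return (start1, start2, spacer) for every pair of GCAC motifs
    whose intervening spacer is at least min_spacer nucleotides.

    min_spacer=7 encodes ">6 nt"; the spacer definition is an assumption.
    """
    rna = rna.upper().replace("T", "U")
    # Zero-width lookahead so that every GCAC start position is reported.
    starts = [m.start() for m in re.finditer(r"(?=GCAC)", rna)]
    pairs = []
    for i, s1 in enumerate(starts):
        for s2 in starts[i + 1:]:
            spacer = s2 - (s1 + 4)  # nucleotides between the two motifs
            if spacer >= min_spacer:
                pairs.append((s1, s2, spacer))
    return pairs

# Synthetic example: two GCAC motifs separated by eight U residues.
print(find_gcac_pairs("AAGCACUUUUUUUUGCACAA"))  # [(2, 14, 8)]
```

Positions are 0-based. A scan like this only flags candidate sites; it says nothing about binding affinity or structural context.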
Short Hairpin RNA (shRNA): Design, Delivery, and Assessment of Gene Knockdown

Science.gov (United States)

Moore, Chris B.; Guthrie, Elizabeth H.; Huang, Max Tze-Han; Taxman, Debra J.

2013-01-01

Shortly after the cellular mechanism of RNA interference (RNAi) was first described, scientists began using this powerful technique to study gene function. This included designing better methods for the successful delivery of small interfering RNAs (siRNAs) and short hairpin RNAs (shRNAs) into mammalian cells. While the simplest method for RNAi is the cytosolic delivery of siRNA oligonucleotides, this technique is limited to cells capable of transfection and is primarily utilized during transient in vitro studies. The introduction of shRNA into mammalian cells through infection with viral vectors allows for stable integration of shRNA and long-term knockdown of the targeted gene; however, several challenges exist with the implementation of this technology. Here we describe some well-tested protocols which should increase the chances of successful design, delivery, and assessment of gene knockdown by shRNA. We provide suggestions for designing shRNA targets and controls, a protocol for sequencing through the secondary structure of the shRNA hairpin structure, and protocols for packaging and delivery of shRNA lentiviral particles. Using real-time PCR and functional assays we demonstrate the successful knockdown of ASC, an inflammatory adaptor molecule. These studies demonstrate the practicality of including two shRNAs with different efficacies of knockdown to provide an additional level of control and to verify dose dependency of functional effects. Along with the methods described here, as new techniques and algorithms are designed in the future, shRNA is likely to include further promising application and continue to be a critical component of gene discovery. PMID:20387148
Optimizations of siRNA design for the activation of gene transcription by targeting the TATA-box motif.

Directory of Open Access Journals (Sweden)

Miaomiao Fan

Small interfering RNAs (siRNAs) are widely used to repress gene expression by targeting mRNAs. Some reports reveal that siRNAs can also activate or inhibit gene expression through targeting the gene promoters. Our group has found that microRNAs (miRNAs) could activate gene transcription via interaction with the TATA-box motif in gene promoters. To investigate whether siRNA targeting the same region could upregulate the promoter activity, we test the activating efficiency of siRNAs targeting the TATA-box motif of 16 genes and perform a systematic analysis to identify the common features of the functional siRNAs for effective activation of gene promoters. Further, we try various modifications to improve the activating efficiency of siRNAs and find that it is quite useful to design the promoter-targeting activating siRNA by following several rules such as (a) complementary to the TATA-box-centered region; (b) UA usage at the first two bases of the antisense strand; (c) twenty-three nucleotides (nts) in length; (d) 2'-O-Methyl (2'-OMe) modification at the 3' terminus of the antisense strand; (e) avoiding mismatches at the 3' end of the antisense strand. The optimized activating siRNAs potently enhance the expression of interleukin-2 (IL-2) gene in human and mouse primary CD4+ T cells with a long-time effect. Taken together, our study provides a guideline for the rational design of promoter-targeting siRNAs to sequence-specifically enhance gene expression.
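The sequence-level design rules listed in this abstract, (b), (c), and (e), can be expressed as simple checks. The Python sketch below is a minimal illustration under stated assumptions: the function names, the synthetic target sequence, the convention that the antisense strand should be the reverse complement of a supplied 5'-to-3' target region, and the `tail` window used to define "the 3' end" are all hypothetical and do not come from the paper. Rules (a) and (d) concern target choice and chemical modification and are not modelled.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

def reverse_complement(rna):
    """Reverse complement of an RNA string (A, C, G, U only)."""
    return "".join(COMPLEMENT[base] for base in reversed(rna))

def check_antisense(antisense, target, tail=5):
    """Check an antisense strand (5'->3') against rules (b), (c), (e).

    target: region (5'->3') the antisense strand should pair with.
    tail:   hypothetical number of 3'-terminal positions in which
            mismatches are disallowed (the abstract gives no number).
    """
    antisense = antisense.upper().replace("T", "U")
    target = target.upper().replace("T", "U")
    ideal = reverse_complement(target)
    mismatches = [i for i, (a, b) in enumerate(zip(antisense, ideal)) if a != b]
    return {
        "starts_with_UA": antisense.startswith("UA"),        # rule (b)
        "length_23": len(antisense) == 23,                    # rule (c)
        "no_3prime_mismatch": all(i < len(antisense) - tail   # rule (e)
                                  for i in mismatches),
    }

# Synthetic 23-nt target centred on a TATA-like stretch (not a real promoter).
target = "GCGCGGGCUAUAAAAGGCGCGUA"
antisense = reverse_complement(target)
print(antisense, check_antisense(antisense, target))
```

A candidate that fails any check would simply be discarded or redesigned; whether passing candidates actually activate transcription has to be determined experimentally.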
The Regulatory Factor ZFHX3 Modifies Circadian Function in SCN via an AT Motif-Driven Axis

Science.gov (United States)

Parsons, Michael J.; Brancaccio, Marco; Sethi, Siddharth; Maywood, Elizabeth S.; Satija, Rahul; Edwards, Jessica K.; Jagannath, Aarti; Couch, Yvonne; Finelli, Mattéa J.; Smyllie, Nicola J.; Esapa, Christopher; Butler, Rachel; Barnard, Alun R.; Chesham, Johanna E.; Saito, Shoko; Joynson, Greg; Wells, Sara; Foster, Russell G.; Oliver, Peter L.; Simon, Michelle M.; Mallon, Ann-Marie; Hastings, Michael H.; Nolan, Patrick M.

2015-01-01

Summary: We identified a dominant missense mutation in the SCN transcription factor Zfhx3, termed short circuit (Zfhx3Sci), which accelerates circadian locomotor rhythms in mice. ZFHX3 regulates transcription via direct interaction with predicted AT motifs in target genes. The mutant protein has a decreased ability to activate consensus AT motifs in vitro. Using RNA sequencing, we found minimal effects on core clock genes in Zfhx3Sci/+ SCN, whereas the expression of neuropeptides critical for SCN intercellular signaling was significantly disturbed. Moreover, mutant ZFHX3 had a decreased ability to activate AT motifs in the promoters of these neuropeptide genes. Lentiviral transduction of SCN slices showed that the ZFHX3-mediated activation of AT motifs is circadian, with decreased amplitude and robustness of these oscillations in Zfhx3Sci/+ SCN slices. In conclusion, by cloning Zfhx3Sci, we have uncovered a circadian transcriptional axis that determines the period and robustness of behavioral and SCN molecular rhythms. PMID:26232227
Motif discovery in ranked lists of sequences

DEFF Research Database (Denmark)

Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

2016-01-01

Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery.
Salt-bridging effects on short amphiphilic helical structure and introducing sequence-based short beta-turn motifs.

Science.gov (United States)

Guarracino, Danielle A; Gentile, Kayla; Grossman, Alec; Li, Evan; Refai, Nader; Mohnot, Joy; King, Daniel

2018-02-01

Determining the minimal sequence necessary to induce protein folding is beneficial in understanding the role of protein-protein interactions in biological systems, as their three-dimensional structures often dictate their activity. Proteins are generally comprised of discrete secondary structures, from α-helices to β-turns and larger β-sheets, each of which is influenced by its primary structure. Manipulating the sequence of short, moderately helical peptides can help elucidate the influences on folding. We created two new scaffolds based on a modestly helical eight-residue peptide, PT3, we previously published. Using circular dichroism (CD) spectroscopy and changing the possible salt-bridging residues to new combinations of Lys, Arg, Glu, and Asp, we found that our most helical improvements came from the Arg-Glu combination, whereas the Lys-Asp was not significantly different from the Lys-Glu of the parent scaffold, PT3. The marked 3₁₀-helical contributions in PT3 were lessened in the Arg-Glu-containing peptide with the beginning of cooperative unfolding seen through a thermal denaturation. However, a unique and unexpected signature was seen for the denaturation of the Lys-Asp peptide which could help elucidate the stages of folding between the 3₁₀-helix and the α-helix. In addition, we developed a short six-residue peptide with β-turn/sheet CD signature, again to help study minimal sequences needed for folding. Overall, the results indicate that improvements made to short peptide scaffolds by fine-tuning the salt-bridging residues can enhance scaffold structure. Likewise, the results from the new, short β-turn motif can help inform future peptidomimetic designs for creating biologically useful, short, structured β-sheet-forming peptides.
The human Ago2 MC region does not contain an eIF4E-like mRNA cap binding motif

Directory of Open Access Journals (Sweden)

Grishin Nick V

2009-01-01

Background: Argonaute (Ago) proteins interact with small regulatory RNAs to mediate gene regulatory pathways. A recent report by Kiriakidou et al. [1] describes an MC sequence region identified in Ago2 that displays similarity to the cap-binding motif in translation initiation factor 4E (eIF4E). In a cap-bound eIF4E structure, two important aromatic residues of the motif stack on either side of a 7-methylguanosine 5'-triphosphate (m7Gppp) base. The corresponding Ago2 aromatic residues (F450 and F505) were hypothesized to perform the same cap-binding function. However, the detected similarity between the MC sequence and the eIF4E cap-binding motif was questionable. Results: A number of sequence-based and structure-based bioinformatics methods reveal the reported similarity between the Ago2 MC sequence region and the eIF4E cap-binding motif to be spurious. Alternatively, the MC sequence region is confidently assigned to the N-terminus of the Ago piwi module, within the mid domain of experimentally determined prokaryotic Ago structures. Confident mapping of the Ago2 MC sequence region to the piwi mid domain results in a homology-based structure model that positions the identified aromatic residues over 20 Å apart, with one of the aromatic side chains (F450) contributing instead to the hydrophobic core of the domain. Conclusion: Correct functional prediction based on weak sequence similarity requires substantial evolutionary and structural support. The evolutionary context of the Ago mid domain suggested by multiple sequence alignment is limited to a conserved hydrophobicity profile required for the fold and a motif following the MC region that binds guide RNA. Mapping of the MC sequence to the mid domain structure reveals Ago2 aromatics that are incompatible with eIF4E-like mRNA cap-binding, yet display some limited local structure similarities that cause the chance sequence match to eIF4E. Reviewers: This article was reviewed by Arcady Mushegian
Amino acid sequence motifs essential for P0-mediated suppression of RNA silencing in an isolate of potato leafroll virus from Inner Mongolia.

Science.gov (United States)

Zhuo, Tao; Li, Yuan-Yuan; Xiang, Hai-Ying; Wu, Zhan-Yu; Wang, Xian-Bin; Wang, Ying; Zhang, Yong-Liang; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

2014-06-01

Polerovirus P0 suppressors of host gene silencing contain a consensus F-box-like motif with Leu/Pro (L/P) requirements for suppressor activity. The Inner Mongolian Potato leafroll virus (PLRV) P0 protein (P0(PL-IM)) has an unusual F-box-like motif that contains a Trp/Gly (W/G) sequence and an additional GW/WG-like motif (G139/W140/G141) that is lacking in other P0 proteins. We used Agrobacterium infiltration-mediated RNA silencing assays to establish that P0(PL-IM) has a strong suppressor activity. Mutagenesis experiments demonstrated that the P0(PL-IM) F-box-like motif encompasses amino acids 76-LPRHLHYECLEWGLLCGTHP-95, and that the suppressor activity is abolished by L76A, W87A, or G88A substitution. The suppressor activity is also weakened substantially by mutations within the G139/W140/G141 region and is eliminated by a mutation (F220R) in a C-terminal conserved sequence of P0(PL-IM). As has been observed with other P0 proteins, P0(PL-IM) suppression is correlated with reduced accumulation of the host AGO1-silencing complex protein. However, P0(PL-IM) fails to bind SKP1, which functions in a proteasome pathway that may be involved in AGO1 degradation. These results suggest that P0(PL-IM) may suppress RNA silencing by using an alternative pathway to target AGO1 for degradation. Our results help improve our understanding of the molecular mechanisms involved in PLRV infection.
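The residue numbering in this abstract can be checked mechanically. The Python sketch below treats the F-box-like motif as a contiguous 20-residue string spanning positions 76 to 95 (an assumption: any whitespace inside the sequence is taken to be an extraction artifact) and applies the three point substitutions named in the abstract. The helper names are hypothetical.

```python
FRAGMENT = "LPRHLHYECLEWGLLCGTHP"  # residues 76-95, read as contiguous (assumption)
FIRST_RESIDUE = 76

def residue_at(position):
    """Return the one-letter residue at a 1-based protein position."""
    index = position - FIRST_RESIDUE
    if not 0 <= index < len(FRAGMENT):
        raise ValueError(f"position {position} is outside residues 76-95")
    return FRAGMENT[index]

def apply_substitution(substitution):
    """Apply a substitution such as 'W87A' and return the mutated fragment."""
    wild_type, position, mutant = substitution[0], int(substitution[1:-1]), substitution[-1]
    if residue_at(position) != wild_type:
        raise ValueError(f"{substitution}: residue {position} is "
                         f"{residue_at(position)}, not {wild_type}")
    index = position - FIRST_RESIDUE
    return FRAGMENT[:index] + mutant + FRAGMENT[index + 1:]

for sub in ("L76A", "W87A", "G88A"):
    print(sub, apply_substitution(sub))
```

All three substitutions are consistent with the 76-95 numbering: position 76 is L, 87 is W, and 88 is G.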
Memetic algorithms for de novo motif-finding in biomedical sequences.

Science.gov (United States)

Bi, Chengpeng

2012-09-01

The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies are employed in the algorithm design, not only to efficiently explore the multiple sequence local alignment space, but also to effectively uncover the molecular signals. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. The simulation shows that MaMotif is the most time-efficient of the algorithms compared: it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm compares favorably with, or is superior to, other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary microRNA
DNA interrogation by the CRISPR RNA-guided endonuclease Cas9.

Science.gov (United States)

Sternberg, Samuel H; Redding, Sy; Jinek, Martin; Greene, Eric C; Doudna, Jennifer A

2014-03-06

The clustered regularly interspaced short palindromic repeats (CRISPR)-associated enzyme Cas9 is an RNA-guided endonuclease that uses RNA-DNA base-pairing to target foreign DNA in bacteria. Cas9-guide RNA complexes are also effective genome engineering agents in animals and plants. Here we use single-molecule and bulk biochemical experiments to determine how Cas9-RNA interrogates DNA to find specific cleavage sites. We show that both binding and cleavage of DNA by Cas9-RNA require recognition of a short trinucleotide protospacer adjacent motif (PAM). Non-target DNA binding affinity scales with PAM density, and sequences fully complementary to the guide RNA but lacking a nearby PAM are ignored by Cas9-RNA. Competition assays provide evidence that DNA strand separation and RNA-DNA heteroduplex formation initiate at the PAM and proceed directionally towards the distal end of the target sequence. Furthermore, PAM interactions trigger Cas9 catalytic activity. These results reveal how Cas9 uses PAM recognition to quickly identify potential target sites while scanning large DNA molecules, and to regulate scission of double-stranded DNA.
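The PAM dependence described in this abstract can be mimicked with a simple scan. The Python sketch below assumes the PAM is 5'-NGG-3' (the sequence commonly cited for Streptococcus pyogenes Cas9; the abstract itself does not name it) located immediately 3' of a 20-nt protospacer on the scanned strand. Both function names are hypothetical, only one strand is scanned, and mismatches are not tolerated, so this is far simpler than a real guide-design tool.

```python
import re

def find_pam_sites(dna, guide_len=20):
    """Return (protospacer_start, protospacer, pam) for every NGG PAM that
    has guide_len nucleotides upstream of it on the given strand."""
    dna = dna.upper()
    sites = []
    # Lookahead so that overlapping PAMs (e.g. within GGG) are all reported.
    for match in re.finditer(r"(?=([ACGT]GG))", dna):
        pam_start = match.start()
        if pam_start >= guide_len:
            start = pam_start - guide_len
            sites.append((start, dna[start:pam_start], match.group(1)))
    return sites

def has_targetable_match(dna, guide):
    """True if dna contains a protospacer identical to guide next to a PAM.
    A perfect match with no adjacent PAM is ignored, as in the abstract."""
    guide = guide.upper().replace("U", "T")
    return any(proto == guide for _, proto, _ in find_pam_sites(dna, len(guide)))

guide = "ACGUACGUACGUACGUACGU"           # synthetic 20-nt guide sequence
with_pam = "ACGTACGTACGTACGTACGT" + "TGG"
without_pam = "ACGTACGTACGTACGTACGT" + "TTT"
print(has_targetable_match(with_pam, guide))     # True
print(has_targetable_match(without_pam, guide))  # False
```

Scanning for PAMs first and comparing the adjacent sequence second loosely mirrors the search order the abstract reports, in which PAM recognition precedes RNA-DNA heteroduplex formation.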
Identification of amino acid residues in protein SRP72 required for binding to a kinked 5e motif of the human signal recognition particle RNA.

Science.gov (United States)

Iakhiaeva, Elena; Iakhiaev, Alexei; Zwieb, Christian

2010-11-13

Human cells depend critically on the signal recognition particle (SRP) for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK) at positions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed.
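Motifs written with X as a wildcard, such as PDPXRWLPXXER, translate directly into regular expressions. The Python sketch below locates both motifs named in this abstract and reports their positions in protein numbering. The 41-residue test fragment is synthetic: only the two motif segments follow the abstract, and the filler residues and the concrete residues substituted for X are invented so that the numbering (545 to 585) can be exercised. It is not the real SRP72 sequence, and the function names are hypothetical.

```python
import re

def motif_to_regex(motif):
    """Translate a motif in which X means 'any residue' into a regex."""
    return "".join("." if c == "X" else re.escape(c) for c in motif.upper())

def find_motif(protein, motif, first_residue_number=1):
    """Return (start, end, matched_text) in 1-based protein numbering."""
    pattern = re.compile(f"(?=({motif_to_regex(motif)}))")
    hits = []
    for match in pattern.finditer(protein.upper()):
        start = match.start() + first_residue_number
        hits.append((start, start + len(motif) - 1, match.group(1)))
    return hits

# Synthetic stand-in for residues 545-585 (filler residues are invented).
fragment = "GSGSGSG" + "KKKKKKKKGK" + "SASASASASA" + "PDPARWLPSTER" + "GS"

print(find_motif(fragment, "KKKKKKKKGK", first_residue_number=545))
print(find_motif(fragment, "PDPXRWLPXXER", first_residue_number=545))
```

With the motif starting at position 572, the tryptophan in PDPXRWLPXXER falls at 572 + 5 = 577, which agrees with the invariant W577 mentioned in the abstract.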

The conservation pattern of short linear motifs is highly correlated with the function of interacting protein domains

Directory of Open Access Journals (Sweden)

Wang Yiguo

2008-10-01

Background: Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs). Accurate prediction of SLiMs has been difficult because they are short (often ... Results: Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extracellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion: The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. By combining scoring matrices derived from peptide libraries with conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.
Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

Science.gov (United States)

Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

2012-01-01

Discovering a compound that inhibits multiple proteins (i.e. polypharmacological targets) is a new paradigm for treating complex diseases (e.g. cancers and diabetes). In general, polypharmacological proteins often share similar local binding environments and motifs. With the exponential growth in the number of protein structures, finding similar structural binding motifs (pharma-motifs) has become an urgent task for drug discovery (e.g. side effects and new uses for old drugs) and for understanding protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize binding motifs by searching against a protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs, which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% of interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% of the polypharmacological targets of each protein-ligand complex are annotated with the same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play key roles in protein functions. SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservation of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and for drug discovery.
Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.

Science.gov (United States)

Jiang, F; Kumar, R A; Jones, R A; Patel, D J

1996-07-11

The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G·G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.
Characterization and identification of microRNA core promoters in four model species.

Directory of Open Access Journals (Sweden)

Xuefeng Zhou

2007-03-01

MicroRNAs are short, noncoding RNAs that play important roles in post-transcriptional gene regulation. Although many functions of microRNAs in plants and animals have been revealed in recent years, the transcriptional mechanism of microRNA genes is not well-understood. To elucidate the transcriptional regulation of microRNA genes, we study and characterize, in a genome scale, the promoters of intergenic microRNA genes in Caenorhabditis elegans, Homo sapiens, Arabidopsis thaliana, and Oryza sativa. We show that most known microRNA genes in these four species have the same type of promoters as protein-coding genes have. To further characterize the promoters of microRNA genes, we developed a novel promoter prediction method, called common query voting (CoVote), which is more effective than available promoter prediction methods. Using this new method, we identify putative core promoters of most known microRNA genes in the four model species. Moreover, we characterize the promoters of microRNA genes in these four species. We discover many significant, characteristic sequence motifs in these core promoters, several of which match or resemble the known cis-acting elements for transcription initiation. Among these motifs, some are conserved across different species while some are specific to microRNA genes of individual species.
RNA versatility, flexibility, and thermostability for practice in RNA nanotechnology and biomedical applications.

Science.gov (United States)

Haque, Farzin; Pi, Fengmei; Zhao, Zhengyi; Gu, Shanqing; Hu, Haibo; Yu, Hang; Guo, Peixuan

2018-01-01

In recent years, RNA has attracted widespread attention as a unique biomaterial with distinct biophysical properties for designing sophisticated architectures in the nanometer scale. RNA is much more versatile in structure and function with higher thermodynamic stability compared to its nucleic acid counterpart DNA. Larger RNA molecules can be viewed as a modular structure built from a combination of many 'Lego' building blocks connected via different linker sequences. By exploiting the diversity of RNA motifs and flexibility of structure, varieties of RNA architectures can be fabricated with precise control of shape, size, and stoichiometry. Many structural motifs have been discovered and characterized over the years and the crystal structures of many of these motifs are available for nanoparticle construction. For example, using the flexibility and versatility of RNA structure, RNA triangles, squares, pentagons, and hexagons can be constructed from phi29 pRNA three-way-junction (3WJ) building block. This review will focus on 2D RNA triangles, squares, and hexamers; 3D and 4D structures built from basic RNA building blocks; and their prospective applications in vivo as imaging or therapeutic agents via specific delivery and targeting. Methods for intracellular cloning and expression of RNA molecules and the in vivo assembly of RNA nanoparticles will also be reviewed. WIREs RNA 2018, 9:e1452. doi: 10.1002/wrna.1452 This article is categorized under: RNA Methods > RNA Nanotechnology RNA Structure and Dynamics > RNA Structure, Dynamics and Chemistry RNA in Disease and Development > RNA in Disease Regulatory RNAs/RNAi/Riboswitches > Regulatory RNAs. © 2017 Wiley Periodicals, Inc.
Viroids: from genotype to phenotype just relying on RNA sequence and structural motifs

Directory of Open Access Journals (Sweden)

Ricardo Flores

2012-06-01

As a consequence of two unique physical properties, small size and circularity, viroid RNAs do not code for proteins and thus depend on RNA sequence/structural motifs for interacting with host proteins that mediate their invasion, replication, spread, and circumvention of defensive barriers. Viroid genomes fold up on themselves adopting collapsed secondary structures wherein stretches of nucleotides stabilized by Watson-Crick pairs are flanked by apparently unstructured loops. However, compelling data show that they are instead stabilized by alternative non-canonical pairs and that specific loops in the rod-like secondary structure, characteristic of Potato spindle tuber viroid and most other members of the family Pospiviroidae, are critical for replication and systemic trafficking. In contrast, rather than folding into a rod-like secondary structure, most members of the family Avsunviroidae adopt multibranched conformations occasionally stabilized by kissing loop interactions critical for viroid viability in vivo. Besides these most stable secondary structures, viroid RNAs alternatively adopt during replication transient metastable conformations containing elements of local higher-order structure, prominent among which are the hammerhead ribozymes catalyzing a key replicative step in the family Avsunviroidae, and certain conserved hairpins that also mediate replication steps in the family Pospiviroidae. Therefore, different RNA structures (either global or local) determine different functions, thus highlighting the need for in-depth structural studies on viroid RNAs.
Identification of amino acid residues in protein SRP72 required for binding to a kinked 5e motif of the human signal recognition particle RNA

Directory of Open Access Journals (Sweden)

Zwieb Christian

2010-11-01

Background: Human cells depend critically on the signal recognition particle (SRP) for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. Results: We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK) at positions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. Conclusions: The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed.
A speedup technique for (l, d)-motif finding algorithms

Directory of Open Access Journals (Sweden)

Dinh Hieu

2011-03-01

Background: The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results: Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS), (l, d)-motif search (or Planted Motif Search, PMS), and Edit-distance-based Motif Search (EMS). In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm always identifies the motifs, whereas an approximate algorithm may fail to identify some or all of the motifs. The exact version of the PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speed up PMS algorithms. Conclusions: We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very
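For orientation, the (l, d)-motif (PMS) problem described in this record can be stated as a few lines of brute-force Python. This is only a minimal sketch of the problem definition, not the speedup technique of the paper; the function names are hypothetical. It enumerates every l-mer over the alphabet, so its cost grows as 4^l for DNA, which illustrates why exact algorithms are exponential in the underlying parameters.

```python
from itertools import product


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(motif, seq, d):
    """True if some window of seq is within Hamming distance d of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def planted_motif_search(seqs, l, d, alphabet="ACGT"):
    """Return every l-mer found in each input sequence with at most d mismatches.

    Brute force: tries all len(alphabet)**l candidate motifs.
    """
    found = []
    for letters in product(alphabet, repeat=l):
        motif = "".join(letters)
        if all(occurs_within(motif, s, d) for s in seqs):
            found.append(motif)
    return found
```

With `seqs = ["AACGTT", "GGACGA", "TACGCC"]`, `l = 3` and `d = 0`, the only 3-mer shared by all three sequences is `ACG`.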
Alfalfa dwarf cytorhabdovirus P protein is a local and systemic RNA silencing suppressor which inhibits programmed RISC activity and prevents transitive amplification of RNA silencing.

Science.gov (United States)

Bejerman, Nicolás; Mann, Krin S; Dietzgen, Ralf G

2016-09-15

Plants employ RNA silencing as an innate defense mechanism against viruses. As a counter-defense, plant viruses have evolved to express RNA silencing suppressor proteins (RSS), which target one or more steps of the silencing pathway. In this study, we show that the phosphoprotein (P) encoded by the negative-sense RNA virus alfalfa dwarf virus (ADV), a species of the genus Cytorhabdovirus, family Rhabdoviridae, is a suppressor of RNA silencing. ADV P has a relatively weak local RSS activity, and does not prevent siRNA accumulation. On the other hand, ADV P strongly suppresses systemic RNA silencing, but does not interfere with the short-distance spread of silencing, which is consistent with its lack of inhibition of siRNA accumulation. The mechanism of suppression appears to involve ADV P binding to RNA-induced silencing complex proteins AGO1 and AGO4 as shown in protein-protein interaction assays when ectopically expressed. In planta, we demonstrate that ADV P likely functions by inhibiting miRNA-guided AGO1 cleavage and prevents transitive amplification by repressing the production of secondary siRNAs. As recently described for lettuce necrotic yellows cytorhabdovirus P, but in contrast to other viral RSS known to disrupt AGO activity, ADV P sequence does not contain any recognizable GW/WG or F-box motifs, which suggests that cytorhabdovirus P proteins may use alternative motifs to bind to AGO proteins. Crown Copyright © 2016. Published by Elsevier B.V. All rights reserved.
RNA polymerase II mediated transcription from the polymerase III promoters in short hairpin RNA expression vector

International Nuclear Information System (INIS)

Rumi, Mohammad; Ishihara, Shunji; Aziz, Monowar; Kazumori, Hideaki; Ishimura, Norihisa; Yuki, Takafumi; Kadota, Chikara; Kadowaki, Yasunori; Kinoshita, Yoshikazu

2006-01-01

RNA polymerase III promoters of human ribonuclease P RNA component H1, human U6, and mouse U6 small nuclear RNA genes are commonly used in short hairpin RNA (shRNA) expression vectors due to their precise initiation and termination sites. During transient transfection of shRNA vectors, we observed that H1 or U6 promoters also express longer transcripts, long enough to express several reporter genes including firefly luciferase, green fluorescent protein EGFP, and red fluorescent protein JRed. Expression of such longer transcripts was augmented by upstream RNA polymerase II enhancers and completely inhibited by downstream polyA signal sequences. Moreover, the transcription of firefly luciferase from the human H1 promoter was sensitive to the RNA polymerase II inhibitor α-amanitin. Our findings suggest that commonly used polymerase III promoters in shRNA vectors are also prone to RNA polymerase II mediated transcription, which may have negative impacts on their targeted use.
A Viral RNA Structural Element Alters Host Recognition of Nonself RNA

Energy Technology Data Exchange (ETDEWEB)

Hyde, J. L.; Gardner, C. L.; Kimura, T.; White, J. P.; Liu, G.; Trobaugh, D. W.; Huang, C.; Tonelli, M.; Paessler, S.; Takeda, K.; Klimstra, W. B.; Amarasinghe, G. K.; Diamond, M. S.

2014-01-30

Although interferon (IFN) signaling induces genes that limit viral infection, many pathogenic viruses overcome this host response. As an example, 2'-O methylation of the 5' cap of viral RNA subverts mammalian antiviral responses by evading restriction of Ifit1, an IFN-stimulated gene that regulates protein synthesis. However, alphaviruses replicate efficiently in cells expressing Ifit1 even though their genomic RNA has a 5' cap lacking 2'-O methylation. We show that pathogenic alphaviruses use secondary structural motifs within the 5' untranslated region (UTR) of their RNA to alter Ifit1 binding and function. Mutations within the 5'-UTR affecting RNA structural elements enabled restriction by or antagonism of Ifit1 in vitro and in vivo. These results identify an evasion mechanism by which viruses use RNA structural motifs to avoid immune restriction.
Motifs of Madness, Indifference, and Cannibalism as Symbols of a Depraved Society in Lu Xun's Short Stories

Directory of Open Access Journals (Sweden)

Tina Ilgo

2011-07-01

This article analyzes two short stories by Lu Xun from his collection Outcry, which came into being at the culmination of the Chinese spiritual rebirth between 1918 and 1922. In “Diary of a Madman” and “The True Story of A Q” the author expresses his conviction that the existing system’s depravity produces “cannibalism,” causes a gradual decline in humanity, and exposes the main defects of human character. The impossibility of destroying the “iron house,” or people’s incapacity to change their “cannibalistic” nature, causes the loss of hope on the side of the “madmen”. It forces them to give up their insightful knowledge and adapt to the majority. With the repetition of motifs such as “madness,” “indifference,” and “cannibalism,” which constantly recur in Lu Xun’s short stories, the author expressed his vision of traditional Chinese society and his pessimism about the future. At the same time these motifs reflect the author’s state of mind and his everlasting journey between hope and despair, “madness” and “indifference,” and tradition and modernity. If the stories are read in the context of twentieth-century China they can be understood as a direct criticism of the established Chinese society, whose values and norms derive from Confucianism, but they also contain deep symbolic meaning that renders them timeless.
CompariMotif: quick and easy comparisons of sequence motifs.

Science.gov (United States)

Edwards, Richard J; Davey, Norman E; Shields, Denis C

2008-05-15

CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/
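The kinds of relationships this record lists (exact matches, variants of degenerate motifs, overlapping motifs) can be illustrated with a small Python sketch. It is not CompariMotif's algorithm and does not reproduce its information-content scoring; it assumes fixed-length motifs written with plain residues, `.` wildcards and `[...]` classes only, and the function names and relationship labels are hypothetical.

```python
# The 20 standard amino acids; "." in a motif is treated as any of these.
AMINO = set("ACDEFGHIKLMNPQRSTVWY")


def parse(motif):
    """Split a simple motif such as 'R.[ST]P' into per-position residue sets."""
    positions, i = [], 0
    while i < len(motif):
        if motif[i] == "[":
            j = motif.index("]", i)
            positions.append(set(motif[i + 1:j]))
            i = j + 1
        elif motif[i] == ".":
            positions.append(set(AMINO))
            i += 1
        else:
            positions.append({motif[i]})
            i += 1
    return positions


def relationship(a, b):
    """Describe how two equal-length motifs relate, position by position."""
    pa, pb = parse(a), parse(b)
    if len(pa) != len(pb):
        return "different length"
    if pa == pb:
        return "exact match"
    if all(x <= y for x, y in zip(pa, pb)):
        return "a is a specific variant of b"
    if all(y <= x for x, y in zip(pa, pb)):
        return "b is a specific variant of a"
    if all(x & y for x, y in zip(pa, pb)):
        return "overlapping"
    return "no match"
```

For example, `relationship("RSP", "R[ST]P")` reports that the first motif is a specific variant of the degenerate second one.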
MicroRNA genes preferentially expressed in dendritic cells contain sites for conserved transcription factor binding motifs in their promoters

Directory of Open Access Journals (Sweden)

Huynen Martijn A

2011-06-01

Background: MicroRNAs (miRNAs) play a fundamental role in the regulation of gene expression by translational repression or target mRNA degradation. Regulatory elements in miRNA promoters are less well studied, but may reveal a link between their expression and a specific cell type. Results: To explore this link in myeloid cells, miRNA expression profiles were generated from monocytes and dendritic cells (DCs). Differences in miRNA expression among monocytes, DCs and their stimulated progeny were observed. Furthermore, putative promoter regions of miRNAs that are significantly up-regulated in DCs were screened for Transcription Factor Binding Sites (TFBSs) based on TFBS motif matching score, the degree to which those TFBSs are over-represented in the promoters of the up-regulated miRNAs, and the extent of conservation of the TFBSs in mammals. Conclusions: Analysis of evolutionarily conserved TFBSs in DC promoters revealed preferential clustering of sites within 500 bp upstream of the precursor miRNAs and that many mRNAs of cognate TFs of the conserved TFBSs were indeed expressed in the DCs. Taken together, our data provide evidence that selected miRNAs expressed in DCs have evolutionarily conserved TFBSs relevant to DC biology in their promoters.
A mutation in the glutamate-rich region of RNA-binding motif protein 20 causes dilated cardiomyopathy through missplicing of titin and impaired Frank-Starling mechanism

DEFF Research Database (Denmark)

Beqqali, Abdelaziz; Bollen, I. A. E.; Rasmussen, T. B.

2016-01-01

Mutations in the RS-domain of RNA-binding motif protein 20 (RBM20) have recently been identified to segregate with aggressive forms of familial dilated cardiomyopathy (DCM). Loss of RBM20 in rats results in missplicing of the sarcomeric gene titin (TTN). The functional and physiological consequences of RBM20 mutations outside the mutational hotspot of RBM20 have not been explored to date. In this study, we investigated the pathomechanism of DCM caused by a novel RBM20 mutation in human cardiomyocytes. We identified a family with DCM carrying a mutation (RBM20(E913K/+)) in a glutamate... to the early onset, and malignant course of DCM caused by RBM20 mutations. Altogether, our results demonstrate that heterozygous loss of RBM20 suffices to profoundly impair myocyte biomechanics by its disturbance of TTN splicing.
Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

Science.gov (United States)

Szymanski, Maciej; Karlowski, Wojciech M

2016-01-01

In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.
Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

KAUST Repository

Kalkatawi, Manal M.

2011-11-15

Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of an easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of the human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize the 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. © The Author(s) 2011. Published by Oxford University Press. All rights reserved.
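Before any model of the kind described here can score a site, candidate hexamers and their surrounding sequence have to be located in the genomic DNA. The sketch below shows only that candidate-extraction step, not the published neural-network or random-forest predictors. The record does not list its 12 motif variants, so the two-entry hexamer list (the canonical AATAAA and its common variant ATTAAA) and the function name are assumptions made for illustration.

```python
import re

# Assumed signal list: the record's 12 variants are not enumerated in the text,
# so only the two best-known hexamers are used here.
POLYA_HEXAMERS = ("AATAAA", "ATTAAA")

# Lookahead so that overlapping hits are all reported.
_PATTERN = re.compile("(?=(" + "|".join(POLYA_HEXAMERS) + "))")


def candidate_polya_sites(dna, flank=10):
    """Return (0-based position, hexamer, flanking context) for each candidate."""
    dna = dna.upper()
    hits = []
    for match in _PATTERN.finditer(dna):
        start = match.start()
        context = dna[max(0, start - flank):start + 6 + flank]
        hits.append((start, match.group(1), context))
    return hits
```

The returned context window is what a downstream classifier would turn into features; a bare hexamer match on its own is only a candidate.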
Exploring RNA structure by integrative molecular modelling

DEFF Research Database (Denmark)

Masquida, Benoît; Beckert, Bertrand; Jossinet, Fabrice

2010-01-01

RNA molecular modelling is adequate to rapidly tackle the structure of RNA molecules. With new structured RNAs constituting a central class of cellular regulators discovered every year, the need for swift and reliable modelling methods is more crucial than ever. The pragmatic method based on interactive all-atom molecular modelling relies on the observation that specific structural motifs are recurrently found in RNA sequences. Once identified by a combination of comparative sequence analysis and biochemical data, the motifs composing the secondary structure of a given RNA can be extruded...
Suppression of HPV-16 late L1 5′-splice site SD3632 by binding of hnRNP D proteins and hnRNP A2/B1 to upstream AUAGUA RNA motifs

Science.gov (United States)

Li, Xiaoze; Johansson, Cecilia; Glahder, Jacob; Mossberg, Ann-Kristin; Schwartz, Stefan

2013-01-01

Human papillomavirus type 16 (HPV-16) 5′-splice site SD3632 is used exclusively to produce late L1 mRNAs. We identified a 34-nt splicing inhibitory element located immediately upstream of HPV-16 late 5′-splice site SD3632. Two AUAGUA motifs located in these 34 nt inhibited SD3632. Two nucleotide substitutions in each of the HPV-16 specific AUAGUA motifs alleviated splicing inhibition and induced late L1 mRNA production from episomal forms of the HPV-16 genome in primary human keratinocytes. The AUAGUA motifs bind specifically not only to the heterogeneous nuclear RNP (hnRNP) D family of RNA-binding proteins including hnRNP D/AUF, hnRNP DL and hnRNP AB but also to hnRNP A2/B1. Knock-down of these proteins induced HPV-16 late L1 mRNA expression, and overexpression of hnRNP A2/B1, hnRNP AB, hnRNP DL and the two hnRNP D isoforms hnRNP D37 and hnRNP D40 further suppressed L1 mRNA expression. This inhibition may allow HPV-16 to hide from the immune system and establish long-term persistent infections with enhanced risk at progressing to cancer. There is an inverse correlation between expression of hnRNP D proteins and hnRNP A2/B1 and HPV-16 L1 production in the cervical epithelium, as well as in cervical cancer, supporting the conclusion that hnRNP D proteins and A2/B1 inhibit HPV-16 L1 mRNA production. PMID:24013563
DNA regulatory motif selection based on support vector machine ...

African Journals Online (AJOL)

... machine (SVM) and its application in microarray experiment of Kashin-Beck disease. ... speed and amount of the corresponding mRNA in gene replication process. ... and revealed that some motifs may be related to the immune reactions.

CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs.

Science.gov (United States)

Gilbert, N; Labuda, D

1999-03-16

A 65-bp "core" sequence is dispersed in hundreds of thousands of copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3' ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome.
Two Novel Motifs of Watermelon Silver Mottle Virus NSs Protein Are Responsible for RNA Silencing Suppression and Pathogenicity.

Science.gov (United States)

Huang, Chung-Hao; Hsiao, Weng-Rong; Huang, Ching-Wen; Chen, Kuan-Chun; Lin, Shih-Shun; Chen, Tsung-Chi; Raja, Joseph A J; Wu, Hui-Wen; Yeh, Shyi-Dong

2015-01-01

The NSs protein of Watermelon silver mottle virus (WSMoV) is the RNA silencing suppressor and pathogenicity determinant. In this study, serial deletion and point-mutation mutagenesis of conserved regions (CR) of NSs protein were performed, and the silencing suppression function was analyzed through agroinfiltration in Nicotiana benthamiana plants. We found that two amino acid (aa) residues, H113 and Y398, are novel functional residues for RNA silencing suppression. Our further analyses demonstrated that H113 at the common epitope (CE) ((109)KFTMHNQ(117)), which is highly conserved in Asia type tospoviruses, and the benzene ring of Y398 at the C-terminal β-sheet motif ((397)IYFL(400)) affect NSs mRNA stability and protein stability, respectively, and are thus critical for NSs RNA silencing suppression. Additionally, protein expression of the six other deletion mutants (ΔCR1-ΔCR6) and five point mutants (Y15A, Y27A, G180A, R181A and R212A) was hampered and their silencing suppression ability was abolished. The accumulation of the mutant mRNAs and proteins, except Y398A, could be rescued or enhanced by co-infiltration with potyviral suppressor HC-Pro. When assayed with the attenuated Zucchini yellow mosaic virus vector in squash plants, the recombinants carrying the seven individual point-mutated NSs proteins displayed symptoms much milder than the recombinant carrying the wild type NSs protein, suggesting that these aa residues also affect viral pathogenicity by suppressing the host silencing mechanism.
Parallel motif extraction from very long sequences

KAUST Repository

Sahli, Majed

2013-01-01

Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16,384 cores on a supercomputer. Copyright is held by the owner/author(s).
DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

Directory of Open Access Journals (Sweden)

Patrick Sobetzko

2016-03-01

Short DNA motifs are involved in a multitude of functions such as chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool, DistAMo (Distribution Analysis of DNA Motifs). The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i) genes beside the replication origin are enriched in GATCs, (ii) genome-wide GATC distribution follows a distinct pattern and (iii) genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.
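The idea of a non-random chromosomal motif pattern can be made concrete with a much simpler stand-in than DistAMo itself. The sketch below counts a motif such as GATC in fixed-size windows along a sequence and converts the counts to z-scores against the chromosome-wide mean. It does not use codon redundancy as the published algorithm does, and the function names and window scheme are assumptions made for illustration.

```python
def motif_positions(seq, motif):
    """0-based start positions of every (possibly overlapping) motif occurrence."""
    seq, motif = seq.upper(), motif.upper()
    positions, i = [], seq.find(motif)
    while i != -1:
        positions.append(i)
        i = seq.find(motif, i + 1)
    return positions


def window_zscores(seq, motif="GATC", window=1000):
    """Per-window motif counts and their z-scores relative to all windows."""
    if not seq:
        return [], []
    n_windows = -(-len(seq) // window)  # ceiling division
    counts = [0] * n_windows
    for p in motif_positions(seq, motif):
        counts[p // window] += 1  # a hit belongs to the window of its start
    mean = sum(counts) / n_windows
    sd = (sum((c - mean) ** 2 for c in counts) / n_windows) ** 0.5
    zscores = [(c - mean) / sd if sd else 0.0 for c in counts]
    return counts, zscores
```

Windows with strongly positive z-scores are motif-enriched relative to the rest of the sequence; on a real chromosome these could then be compared with the position of the replication origin.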
Crystal Structure of the N-Terminal RNA Recognition Motif of mRNA Decay Regulator AUF1

Directory of Open Access Journals (Sweden)

Young Jun Choi

2016-01-01

AU-rich element binding/degradation factor 1 (AUF1) plays a role in destabilizing mRNAs by forming complexes with AU-rich elements (ARE) in the 3′-untranslated regions. Multiple AUF1-ARE complexes regulate the translation of encoded products related to the cell cycle, apoptosis, and inflammation. AUF1 contains two tandem RNA recognition motifs (RRM) and a Gln- (Q-) rich domain in its C-terminal region. To observe how the two RRMs are involved in recognizing ARE, we obtained the AUF1-p37 protein covering the two RRMs. However, only the N-terminal RRM (RRM1) was crystallized and its structure was determined at 1.7 Å resolution. It appears that RRM1 and RRM2 separated before crystallization. To identify which factors cause RRM1 and RRM2 to separate, we performed limited proteolysis using trypsin. The results indicated that the intact proteins were cleaved by unknown proteases that were associated with them prior to crystallization. In comparison with each of the monomers, the conformations of the β2-β3 loops were highly variable. Furthermore, a comparison with the RRM1-2 structures of HuR and hnRNP A1 revealed that a dimer of RRM1 could be one of the possible conformations of RRM1-2. Our data may provide guidance for further structural investigations of the AUF1 tandem RRM repeat and its mode of ARE binding.
Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

KAUST Repository

Alam, Tanvir

2018-03-11

Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core and the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.
The HIV-1 leader RNA conformational switch regulates RNA dimerization but does not regulate mRNA translation

NARCIS (Netherlands)

Abbink, Truus E. M.; Ooms, Marcel; Haasnoot, P. C. Joost; Berkhout, Ben

2005-01-01

The untranslated leader RNA is the most conserved part of the human immunodeficiency virus type I (HIV-1) genome. It contains many regulatory motifs that mediate a variety of steps in the viral life cycle. Previous work showed that the full-length leader RNA can adopt two alternative structures: a
PISMA: A Visual Representation of Motif Distribution in DNA Sequences

Directory of Open Access Journals (Sweden)

Rogelio Alcántara-Silva

2017-03-01

Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .
The crystal structure of the Split End protein SHARP adds a new layer of complexity to proteins containing RNA recognition motifs.

Science.gov (United States)

Arieti, Fabiana; Gabus, Caroline; Tambalo, Margherita; Huet, Tiphaine; Round, Adam; Thore, Stéphane

2014-06-01

The Split Ends (SPEN) protein was originally discovered in Drosophila in the late 1990s. Since then, homologous proteins have been identified in eukaryotic species ranging from plants to humans. Every family member contains three predicted RNA recognition motifs (RRMs) in the N-terminal region of the protein. We have determined the crystal structure, at 2.0 Å resolution, of the region containing these RRMs in the human SPEN homolog, the SMRT/HDAC1 Associated Repressor Protein (SHARP). SHARP is a co-regulator of the nuclear receptors. We demonstrate that two of the three RRMs, namely RRM3 and RRM4, interact via a highly conserved interface. Furthermore, we show that the RRM3-RRM4 block is the main platform mediating the stable association with the H12-H13 substructure found in the steroid receptor RNA activator (SRA), a long, non-coding RNA previously shown to play a crucial role in nuclear receptor transcriptional regulation. We determine that SHARP association with SRA relies on both single- and double-stranded RNA sequences. The crystal structure of the SHARP-RRM fragment, together with the associated RNA-binding studies, extends the repertoire of nucleic acid binding properties of RRM domains, suggesting a new hypothesis for a better understanding of SPEN protein functions. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

Science.gov (United States)

Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

2008-02-15

KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database, implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.
Rice MEL2, the RNA recognition motif (RRM) protein, binds in vitro to meiosis-expressed genes containing U-rich RNA consensus sequences in the 3'-UTR.

Science.gov (United States)

Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi

2015-10-01

Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is an RRM protein that functions in the properly timed transition to meiosis. In vitro, the MEL2 RRM preferentially associated with the U-rich RNA consensus UUAGUU[U/A][U/G][A/U/G]U in a sequence-dependent manner and in proportion to the amount of MEL2 protein. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency for the MEL2-binding consensus to appear in the 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes were spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes with a presumed function in meiosis, such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis.
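To illustrate how a degenerate consensus such as UUAGUU[U/A][U/G][A/U/G]U can be used to scan sequences, the following minimal sketch translates it into a regular expression and reports hit positions. It is not the survey method used in the study; the function name and the example sequence are hypothetical, and a lookahead is used only so that overlapping hits would also be reported.

```python
import re

# Consensus quoted in the abstract: UUAGUU[U/A][U/G][A/U/G]U.
# The lookahead wrapper lets finditer() report overlapping hits as well.
MEL2_CONSENSUS = re.compile(r"(?=(UUAGUU[UA][UG][AUG]U))")


def find_consensus_sites(rna):
    """Return (0-based start, matched 10-mer) for every consensus hit."""
    # Accept DNA-style input by converting T to U (an assumption of this sketch).
    rna = rna.upper().replace("T", "U")
    return [(m.start(), m.group(1)) for m in MEL2_CONSENSUS.finditer(rna)]


# Hypothetical sequence made up for illustration, not taken from rice.
example_utr = "GGCUUAGUUUUAUCCAUUAGUUAGGUAA"
sites = find_consensus_sites(example_utr)
```

Counting hits per 3'-UTR over a set of transcripts would then amount to calling `find_consensus_sites` on each sequence and tallying the non-empty results.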
Molecular dynamics simulations of RNA motifs

Czech Academy of Sciences Publication Activity Database

Csaszar, K.; Špačková, Naďa; Šponer, Jiří; Leontis, N. B.

2002-01-01

Vol. 223 (2002), p. 154. ISSN 0065-7727. [Annual Meeting of the American Chemical Society /223./. 07.04.2002-11.04.2002, Orlando] Institutional research plan: CEZ:AV0Z5004920 Keywords: molecular dynamics * RNA * hydration Subject RIV: BO - Biophysics
Parallel motif extraction from very long sequences

KAUST Repository

Sahli, Majed; Mansour, Essam; Kalnis, Panos

2013-01-01

Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern
MotifNet: a web-server for network motif analysis.

Science.gov (United States)

Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

2017-06-15

Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. Contact: estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved.
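To make the notion of a network motif concrete, here is a minimal sketch that counts one classic three-node pattern, the feed-forward loop (a→b, b→c, a→c), in a small directed graph. It is not MotifNet's implementation: the function name and the toy edge list are hypothetical, and the comparison against randomized networks that establishes statistical significance is deliberately left out.

```python
from itertools import permutations


def count_feed_forward_loops(edges):
    """Count ordered node triples (a, b, c) with edges a->b, b->c and a->c."""
    edge_set = set(edges)
    nodes = {node for edge in edge_set for node in edge}
    # Brute force over all ordered triples: fine for toy graphs only.
    return sum(
        1
        for a, b, c in permutations(nodes, 3)
        if (a, b) in edge_set and (b, c) in edge_set and (a, c) in edge_set
    )


# Hypothetical toy network containing exactly one feed-forward loop (X, Y, Z).
toy_edges = [("X", "Y"), ("Y", "Z"), ("X", "Z"), ("Z", "W")]
ffl_count = count_feed_forward_loops(toy_edges)
```

A pattern counts as a motif only if this raw count is significantly higher than in comparable random networks, which is the step a dedicated tool handles.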
Is TNF-α-targeted short hairpin RNA (shRNA) a novel potential therapeutic tool in psoriasis treatment?

DEFF Research Database (Denmark)

Stenderup, Karin; Jakobsen, Maria; Rosada, Cecilia

2008-01-01

TNF-α is a well-known target in psoriasis treatment, and biological treatments targeting TNF-α are already clinically used against psoriasis and psoriatic arthritis. Attention is however given to a novel therapeutic tool: RNA interference that controls gene silencing. This study investigates...... the efficiency of targeting TNF-α with specific short hairpin RNA (shRNA) and explores its potential in treating psoriasis. ShRNAs targeting human TNF-α mRNA were generated. Their efficiency in down-regulating TNF-α protein expression was evaluated using a Renilla luciferase screening-assay and a transient co...... TNF-α shRNA was used to transduce HEK293 cells and verify vector-derived TNF-α knockdown in vitro. In vivo, psoriasis skin was exposed to lentiviral TNF-α shRNAs by a single intra-dermal injection. Psoriasis skin for the in vivo study was obtained from psoriatic plaque skin biopsies that were...
Conserved generation of short products at piRNA loci

Directory of Open Access Journals (Sweden)

Khorshid Mohsen

2011-01-01

Background: The piRNA pathway operates in animal germ lines to ensure genome integrity through retrotransposon silencing. The Piwi protein-associated small RNAs (piRNAs) guide Piwi proteins to retrotransposon transcripts, which are degraded and thereby post-transcriptionally silenced through a ping-pong amplification process. Cleavage of the retrotransposon transcript defines at the same time the 5' end of a secondary piRNA that will in turn guide a Piwi protein to a primary piRNA precursor, thereby amplifying primary piRNAs. Although several studies provided evidence that this mechanism is conserved among metazoa, how the process is initiated and what enzymatic activities are responsible for generating the primary and secondary piRNAs are not entirely clear. Results: Here we analyzed small RNAs from three mammalian species, seeking to gain further insight into the mechanisms responsible for the piRNA amplification loop. We found that in all these species piRNA-directed targeting is accompanied by the generation of short sequences that have a very precisely defined length, 19 nucleotides, and a specific spatial relationship with the guide piRNAs. Conclusions: This suggests that the processing of the 5' product of piRNA-guided cleavage occurs while the piRNA target is engaged by the Piwi protein. Although they are not stabilized through methylation of their 3' ends, the 19-mers are abundant not only in testes lysates but also in immunoprecipitates of Miwi and Mili proteins. They will enable more accurate identification of piRNA loci in deep sequencing data sets.
Identification of sequence motifs significantly associated with antisense activity

Directory of Open Access Journals (Sweden)

Peek Andrew S

2007-06-01

mediators to speed the process along like the RNA Induced Silencing Complex (RISC) in RNAi. The independence of motif position and antisense activity also allows us to bypass consideration of this feature in the modelling process, promoting model efficiency and reducing the chance of overfitting when predicting antisense activity. The increase in SVR correlation with significant features compared to nearest-neighbour features indicates that thermodynamics alone is likely not the only factor in determining antisense efficiency.
Short hairpin RNA interference therapy for ischemic heart disease

Science.gov (United States)

Huang, Mei; Chan, Denise; Jia, Fangjun; Xie, Xiaoyan; Li, Zongjin; Hoyt, Grant; Robbins, Robert C.; Chen, Xiaoyuan; Giaccia, Amato; Wu, Joseph C.

2013-01-01

Background: During hypoxia, upregulation of hypoxia inducible factor-1 alpha (HIF-1α) transcriptional factor can activate several downstream angiogenic genes. However, HIF-1α is naturally degraded by prolyl hydroxylase-2 (PHD2) protein. Here we hypothesize that short hairpin RNA (shRNA) interference therapy targeting PHD2 can be used for treatment of myocardial ischemia and this process can be followed noninvasively by molecular imaging. Methods and Results: PHD2 was cloned from mouse embryonic stem (ES) cells by comparing the homolog gene in human and rat. The best candidate shRNA sequence for inhibiting PHD2 was inserted into the pSuper vector driven by the H1 promoter, followed by a separate hypoxia response element (HRE)-incorporated promoter driving a firefly luciferase (Fluc) reporter gene. This construct was used to transfect mouse C2C12 myoblast cell line for in vitro confirmation. Compared with the short hairpin scramble (shScramble) control, inhibition of PHD2 increased levels of HIF-1α protein and several downstream angiogenic genes by >30% (P<0.01). Afterwards, shRNA targeting PHD2 (shPHD2) plasmid was injected intramyocardially following ligation of left anterior descending (LAD) artery in mice. Animals were randomized into shPHD2 group (n=20) versus shScramble sequence as control (n=20). Bioluminescence imaging detected transgene expression for 4–5 weeks. Echocardiographic study showed the shPHD2 group had improved fractional shortening compared with the shScramble group at week 4 (33.7%±1.9% vs. 28.4%±2.8%; P<0.05). Postmortem analysis showed increased presence of small capillaries and venules in the infarcted zones by CD31 staining. Finally, Western blot analysis of explanted hearts also confirmed that animals treated with shPHD2 had significantly higher levels of HIF-1α protein. Conclusions: This is the first study to image the biological role of shRNA therapy for improving cardiac function. Inhibition of PHD2 by shRNA led to
Selective RNA targeting and regulated signaling by RIG-I is controlled by coordination of RNA and ATP binding.

Science.gov (United States)

Fitzgerald, Megan E; Rawling, David C; Potapova, Olga; Ren, Xiaoming; Kohlway, Andrew; Pyle, Anna Marie

2017-02-17

RIG-I is an innate immune receptor that detects and responds to infection by deadly RNA viruses such as influenza and hepatitis C. In the cytoplasm, RIG-I is faced with a difficult challenge: it must sensitively detect viral RNA while ignoring the abundance of host RNA. It has been suggested that RIG-I has a ‘proof-reading’ mechanism for rejecting host RNA targets, and that disruptions of this selectivity filter give rise to autoimmune diseases. Here, we directly monitor RNA proof-reading by RIG-I and we show that it is controlled by a set of conserved amino acids that couple RNA and ATP binding to the protein (Motif III). Mutations of this motif directly modulate proof-reading by eliminating or enhancing selectivity for viral RNA, with major implications for autoimmune disease and cancer. More broadly, the results provide a physical explanation for the ATP-gated behavior of SF2 RNA helicases and receptor proteins.
How short RNAs impact the human ribonuclease Dicer activity: putative regulatory feedback-loops and other RNA-mediated mechanisms controlling microRNA processing.

Science.gov (United States)

Koralewska, Natalia; Hoffmann, Weronika; Pokornowska, Maria; Milewski, Marek; Lipinska, Andrea; Bienkowska-Szewczyk, Krystyna; Figlerowicz, Marek; Kurzynska-Kokorniak, Anna

2016-01-01

Ribonuclease Dicer plays a pivotal role in RNA interference pathways by processing long double-stranded RNAs and single-stranded hairpin RNA precursors into small interfering RNAs (siRNAs) and microRNAs (miRNAs), respectively. While details of Dicer regulation by a variety of proteins are being elucidated, less is known about non-protein factors, e.g. RNA molecules, that may influence this enzyme's activity. Therefore, we decided to investigate the question of whether the RNA molecules can function not only as Dicer substrates but also as its regulators. Our previous in vitro studies indicated that the activity of human Dicer can be influenced by short RNA molecules that either bind to Dicer or interact with its substrates, or both. Those studies were carried out with commercial Dicer preparations. Nevertheless, such preparations are usually not homogeneous enough to carry out more detailed RNA-binding studies. Therefore, we have established our own system for the production of human Dicer in insect cells. In this manuscript, we characterize the RNA-binding and RNA-cleavage properties of the obtained preparation. We demonstrate that Dicer can efficiently bind single-stranded RNAs that are longer than ~20-nucleotides. Consequently, we revisit possible scenarios of Dicer regulation by single-stranded RNA species ranging from ~10- to ~60-nucleotides, in the context of their binding to this enzyme. Finally, we show that siRNA/miRNA-sized RNAs may affect miRNA production either by binding to Dicer or by participating in regulatory feedback-loops. Altogether, our studies suggest a broad regulatory role of short RNAs in Dicer functioning.

Karyological characterization and identification of four repetitive element groups (the 18S – 28S rRNA gene, telomeric sequences, microsatellite repeat motifs, Rex retroelements) of the Asian swamp eel (Monopterus albus)

Science.gov (United States)

Suntronpong, Aorarat; Thapana, Watcharaporn; Twilprawat, Panupon; Prakhongcheep, Ornjira; Somyong, Suthasinee; Muangmai, Narongrit; Peyachoknagul, Surin; Srikulnath, Kornsorn

2017-01-01

Among teleost fishes, Asian swamp eel (Monopterus albus Zuiew, 1793) possesses the lowest chromosome number, 2n = 24. To characterize the chromosome constitution and investigate the genome organization of repetitive sequences in M. albus, karyotyping and chromosome mapping were performed with the 18S – 28S rRNA gene, telomeric repeats, microsatellite repeat motifs, and Rex retroelements. The 18S – 28S rRNA genes were localized to the pericentromeric region of chromosome 4 at the same position as large propidium iodide and C-positive bands, suggesting that the molecular structure of the pericentromeric regions of chromosome 4 has evolved in a concerted manner with amplification of the 18S – 28S rRNA genes. (TTAGGG)n sequences were found at the telomeric ends of all chromosomes. Eight of 19 microsatellite repeat motifs were dispersedly mapped on different chromosomes, suggesting the independent amplification of microsatellite repeat motifs in M. albus. Monopterus albus Rex1 (MALRex1) was observed at interstitial sites of all chromosomes and in the pericentromeric regions of most chromosomes, whereas MALRex3 was scattered and localized to all chromosomes and MALRex6 to several chromosomes. This suggests that these retroelements were independently amplified or lost in M. albus. Among MALRexs (MALRex1, MALRex3, and MALRex6), MALRex6 showed higher interspecific sequence divergences from other teleost species in comparison. This suggests that the divergence of Rex6 sequences of M. albus might have occurred a relatively long time ago. PMID:29093797
How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

KAUST Repository

Alam, Tanvir; Alazmi, Meshari; Gao, Xin; Arold, Stefan T.

2014-05-29

LD motifs (leucine-aspartic acid motifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.
Exon silencing by UAGG motifs in response to neuronal excitation.

Directory of Open Access Journals (Sweden)

Ping An

2007-02-01

Alternative pre-mRNA splicing plays fundamental roles in neurons by generating functional diversity in proteins associated with the communication and connectivity of the synapse. The CI cassette of the NMDA R1 receptor is one of a variety of exons that show an increase in exon skipping in response to cell excitation, but the molecular nature of this splicing responsiveness is not yet understood. Here we investigate the molecular basis for the induced changes in splicing of the CI cassette exon in primary rat cortical cultures in response to KCl-induced depolarization using an expression assay with a tight neuron-specific readout. In this system, exon silencing in response to neuronal excitation was mediated by multiple UAGG-type silencing motifs, and transfer of the motifs to a constitutive exon conferred a similar responsiveness by gain of function. Biochemical analysis of protein binding to UAGG motifs in extracts prepared from treated and mock-treated cortical cultures showed an increase in nuclear hnRNP A1-RNA binding activity in parallel with excitation. Evidence for the role of the NMDA receptor and calcium signaling in the induced splicing response was shown by the use of specific antagonists, as well as cell-permeable inhibitors of signaling pathways. Finally, a wider role for exon-skipping responsiveness is shown to involve additional exons with UAGG-related silencing motifs, and transcripts involved in synaptic functions. These results suggest that, at the post-transcriptional level, excitable exons such as the CI cassette may be involved in strategies by which neurons mount adaptive responses to hyperstimulation.
Optimizing de novo common wheat transcriptome assembly using short-read RNA-Seq data

Directory of Open Access Journals (Sweden)

Duan Jialei

2012-08-01

Background: Rapid advances in next-generation sequencing methods have provided new opportunities for transcriptome sequencing (RNA-Seq). The unprecedented sequencing depth provided by RNA-Seq makes it a powerful and cost-efficient method for transcriptome study, and it has been widely used in model organisms and non-model organisms to identify and quantify RNA. For non-model organisms lacking well-defined genomes, de novo assembly is typically required for downstream RNA-Seq analyses, including SNP discovery and identification of genes differentially expressed by phenotypes. Although RNA-Seq has been successfully used to sequence many non-model organisms, the results of de novo assembly from short reads can still be improved by using recent bioinformatic developments. Results: In this study, we used 212.6 million paired-end reads, which accounted for 16.2 Gb, to assemble the hexaploid wheat transcriptome. Two state-of-the-art assemblers, Trinity and Trans-ABySS, which use the single and multiple k-mer methods, respectively, were used, and the whole de novo assembly process was divided into the following four steps: pre-assembly, merging different samples, removal of redundancy and scaffolding. We documented every detail of these steps and how these steps influenced assembly performance to gain insight into transcriptome assembly from short reads. After optimization, the assembled transcripts were comparable to Sanger-derived ESTs in terms of both continuity and accuracy. We also provided considerable new wheat transcript data to the community. Conclusions: It is feasible to assemble the hexaploid wheat transcriptome from short reads. Special attention should be paid to dealing with multiple samples to balance the spectrum of expression levels and redundancy. To obtain an accurate overview of RNA profiling, removal of redundancy may be crucial in de novo assembly.
Circuit motifs for contrast-adaptive differentiation in early sensory systems: the role of presynaptic inhibition and short-term plasticity.

Science.gov (United States)

Zhang, Danke; Wu, Si; Rasch, Malte J

2015-01-01

In natural signals, such as the luminance value across a visual scene, abrupt changes in intensity value are often more relevant to an organism than intensity values at other positions and times. Thus to reduce redundancy, sensory systems are specialized to detect the times and amplitudes of informative abrupt changes in the input stream rather than coding the intensity values at all times. In theory, a system that responds transiently to fast changes is called a differentiator. In principle, several different neural circuit mechanisms exist that are capable of responding transiently to abrupt input changes. However, it is unclear which circuit would be best suited for early sensory systems, where the dynamic range of the natural input signals can be very wide. We here compare the properties of different simple neural circuit motifs for implementing signal differentiation. We found that a circuit motif based on presynaptic inhibition (PI) is unique in the sense that the vesicle resources in the presynaptic site can be stably maintained over a wide range of stimulus intensities, making PI a biophysically plausible mechanism to implement a differentiator with a very wide dynamic range. Moreover, by additionally considering short-term plasticity (STP), differentiation becomes contrast adaptive in the PI-circuit but not in other potential neural circuit motifs. Numerical simulations show that the behavior of the adaptive PI-circuit is consistent with experimental observations, suggesting that adaptive presynaptic inhibition might be a good candidate neural mechanism to achieve differentiation in early sensory systems.
Mapping Hfq-RNA interaction surfaces using tryptophan fluorescence quenching

Science.gov (United States)

Robinson, Kirsten E.; Orans, Jillian; Kovach, Alexander R.; Link, Todd M.; Brennan, Richard G.

2014-01-01

Hfq is a posttranscriptional riboregulator and RNA chaperone that binds small RNAs and target mRNAs to effect their annealing and message-specific regulation in response to environmental stressors. Structures of Hfq-RNA complexes indicate that U-rich sequences prefer the proximal face and A-rich sequences the distal face; however, the Hfq-binding sites of most RNAs are unknown. Here, we present an Hfq-RNA mapping approach that uses single tryptophan-substituted Hfq proteins, all of which retain the wild-type Hfq structure, and tryptophan fluorescence quenching (TFQ) by proximal RNA binding. TFQ properly identified the respective distal and proximal binding of A15 and U6 RNA to Gram-negative Escherichia coli (Ec) Hfq and the distal face binding of (AA)3A, (AU)3A and (AC)3A to Gram-positive Staphylococcus aureus (Sa) Hfq. The inability of (GU)3G to bind the distal face of Sa Hfq reveals the (R-L)n binding motif is a more restrictive (A-L)n binding motif. Remarkably Hfq from Gram-positive Listeria monocytogenes (Lm) binds (GU)3G on its proximal face. TFQ experiments also revealed the Ec Hfq (A-R-N)n distal face-binding motif should be redefined as an (A-A-N)n binding motif. TFQ data also demonstrated that the 5′-untranslated region of hfq mRNA binds both the proximal and distal faces of Ec Hfq and the unstructured C-terminus. PMID:24288369
MicroRNA categorization using sequence motifs and k-mers.

Science.gov (United States)

Yousef, Malik; Khalifa, Waleed; Acar, İlhan Erkin; Allmer, Jens

2017-03-14

Post-transcriptional gene dysregulation can be a hallmark of diseases like cancer and microRNAs (miRNAs) play a key role in the modulation of translation efficiency. Known pre-miRNAs are listed in miRBase, and they have been discovered in a variety of organisms ranging from viruses and microbes to eukaryotic organisms. The computational detection of pre-miRNAs is of great interest, and such approaches usually employ machine learning to discriminate between miRNAs and other sequences. Many features have been proposed describing pre-miRNAs, and we have previously introduced the use of sequence motifs and k-mers as useful ones. There have been reports of xeno-miRNAs detected via next generation sequencing. However, they may be contaminations and to aid that important decision-making process, we aimed to establish a means to differentiate pre-miRNAs from different species. To achieve distinction into species, we used one species' pre-miRNAs as the positive and another species' pre-miRNAs as the negative training and test data for the establishment of machine learned models based on sequence motifs and k-mers as features. This approach resulted in higher accuracy values between distantly related species while species with closer relation produced lower accuracy values. We were able to differentiate among species with increasing success when the evolutionary distance increases. This conclusion is supported by previous reports of fast evolutionary changes in miRNAs since even in relatively closely related species a fairly good discrimination was possible.
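As a rough illustration of what "k-mers as features" means, the sketch below turns a sequence into a fixed-length vector of relative k-mer frequencies that could be passed to a classifier. The abstract does not state the value of k, the normalization, or the learning algorithm, so those choices (k = 2, relative frequencies, the function name) are assumptions made here for illustration only.

```python
from collections import Counter
from itertools import product


def kmer_features(seq, k=2, alphabet="ACGU"):
    """Map a sequence to relative k-mer frequencies over a fixed k-mer list."""
    # Accept DNA-style input by converting T to U (an assumption of this sketch).
    seq = seq.upper().replace("T", "U")
    # Slide a window of width k over the sequence and count each k-mer.
    counts = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    total = max(sum(counts.values()), 1)  # avoid division by zero
    # Enumerate all |alphabet|**k k-mers so every sequence gets the same keys.
    kmers = ["".join(p) for p in product(alphabet, repeat=k)]
    return {kmer: counts[kmer] / total for kmer in kmers}


# Hypothetical toy input; real pre-miRNA hairpins are considerably longer.
features = kmer_features("ACGU", k=2)
```

Vectors built this way for pre-miRNAs of two species could serve as the positive and negative examples described in the abstract, with sequence motifs added as further features.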
The Crc and Hfq proteins of Pseudomonas putida cooperate in catabolite repression and formation of ribonucleic acid complexes with specific target motifs.

Science.gov (United States)

Moreno, Renata; Hernández-Arranz, Sofía; La Rosa, Ruggero; Yuste, Luis; Madhushani, Anjana; Shingler, Victoria; Rojo, Fernando

2015-01-01

The Crc protein is a global regulator that has a key role in catabolite repression and optimization of metabolism in Pseudomonads. Crc inhibits gene expression post-transcriptionally, preventing translation of mRNAs bearing an AAnAAnAA motif [the catabolite activity (CA) motif] close to the translation start site. Although Crc was initially believed to bind RNA by itself, this idea was recently challenged by results suggesting that a protein co-purifying with Crc, presumably the Hfq protein, could account for the detected RNA-binding activity. Hfq is an abundant protein that has a central role in post-transcriptional gene regulation. Herein, we show that the Pseudomonas putida Hfq protein can recognize the CA motifs of RNAs through its distal face and that Crc facilitates formation of a more stable complex at these targets. Crc was unable to bind RNA in the absence of Hfq. However, pull-down assays showed that Crc and Hfq can form a co-complex with RNA containing a CA motif in vitro. Inactivation of the hfq or the crc gene impaired catabolite repression to a similar extent. We propose that Crc and Hfq cooperate in catabolite repression, probably through forming a stable co-complex with RNAs containing CA motifs to result in inhibition of translation initiation. © 2014 Society for Applied Microbiology and John Wiley & Sons Ltd.
Poly(A) motif prediction using spectral latent features from human DNA sequences

KAUST Repository

Xie, Bo; Jankovic, Boris R.; Bajic, Vladimir B.; Song, Le; Gao, Xin

2013-01-01

Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA. Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge. Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance. We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ~30% fewer error predictions relative to the other
Poly(A) motif prediction using spectral latent features from human DNA sequences

KAUST Repository

Xie, Bo

2013-06-21

Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA. Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge. Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance. We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ~30% fewer error predictions relative to the other
Assembling RNA Nanoparticles.

Science.gov (United States)

Xiao, Shou-Jun

2017-01-01

RNA nanoparticles are designed and self-assembled according to noncanonical interactions of naturally conserved RNA motifs and/or canonical Watson-Crick base-pairing interactions, which have potential applications in gene therapy and nanomedicine. These artificially engineered nanoparticles are mainly synthesized from in vitro transcribed RNAs, purified by denaturing and native polyacrylamide gel electrophoresis (PAGE), and characterized with native PAGE, AFM, and TEM technologies. The protocols of in vitro transcription, denaturing and native PAGE, and RNA nanoparticle self-assembly are described in detail.
Structural and functional characterization of an archaeal clustered regularly interspaced short palindromic repeat (CRISPR)-associated complex for antiviral defense (CASCADE).

Science.gov (United States)

Lintner, Nathanael G; Kerou, Melina; Brumfield, Susan K; Graham, Shirley; Liu, Huanting; Naismith, James H; Sdano, Matthew; Peng, Nan; She, Qunxin; Copié, Valérie; Young, Mark J; White, Malcolm F; Lawrence, C Martin

2011-06-17

In response to viral infection, many prokaryotes incorporate fragments of virus-derived DNA into loci called clustered regularly interspaced short palindromic repeats (CRISPRs). The loci are then transcribed, and the processed CRISPR transcripts are used to target invading viral DNA and RNA. The Escherichia coli "CRISPR-associated complex for antiviral defense" (CASCADE) is central in targeting invading DNA. Here we report the structural and functional characterization of an archaeal CASCADE (aCASCADE) from Sulfolobus solfataricus. Tagged Csa2 (Cas7) expressed in S. solfataricus co-purifies with Cas5a-, Cas6-, Csa5-, and Cas6-processed CRISPR-RNA (crRNA). Csa2, the dominant protein in aCASCADE, forms a stable complex with Cas5a. Transmission electron microscopy reveals a helical complex of variable length, perhaps due to substoichiometric amounts of other CASCADE components. A recombinant Csa2-Cas5a complex is sufficient to bind crRNA and complementary ssDNA. The structure of Csa2 reveals a crescent-shaped structure unexpectedly composed of a modified RNA-recognition motif and two additional domains present as insertions in the RNA-recognition motif. Conserved residues indicate potential crRNA- and target DNA-binding sites, and the H160A variant shows significantly reduced affinity for crRNA. We propose a general subunit architecture for CASCADE in other bacteria and Archaea.
Finding a Leucine in a Haystack: Searching the Proteome for ambiguous Leucine-Aspartic Acid motifs

KAUST Repository

Arold, Stefan T.

2016-01-25

Leucine-aspartic acid (LD) motifs are short helical protein-protein interaction motifs involved in cell motility, survival and communication. LD motif interactions are also implicated in cancer metastasis and are targeted by several viruses. LD motifs are notoriously difficult to detect because sequence pattern searches lead to an excessively high number of false positives. Hence, despite 20 years of research, only six LD motif–containing proteins are known in humans, three of which are close homologues of the paxillin family. To enable the proteome-wide discovery of LD motifs, we developed LD Motif Finder (LDMF), a web tool based on machine learning that combines sequence information with structural predictions to detect LD motifs with high accuracy. LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.
Extrapolative microRNA precursor based SSR mining from tea EST database in respect to agronomic traits.

Science.gov (United States)

Hazra, Anjan; Dasgupta, Nirjhar; Sengupta, Chandan; Das, Sauren

2017-07-06

Tea (Camellia sinensis (L.) Kuntze) is among the most popular drinks in the world and is widely consumed for its health benefits. These beneficial traits depend primarily on the regulatory networks of different metabolic pathways. Microsatellite markers developed from conserved genomic regions are useful for assessing the genetic diversity of closely related or self-pollinated species. Although several SSR markers have been reported in tea, trait-specific simple sequence repeat (SSR) markers useful for marker-assisted breeding have yet to be identified. MicroRNAs (miRNAs) are short, non-coding RNA molecules involved in post-transcriptional gene regulation and thereby affect the related phenotypes. The present study identifies microsatellite motifs within reported and predicted miRNA precursors, followed by the design of primers from SSR-flanking regions for PCR validation. In addition to earlier reports, two new miRNAs are predicted here from the tea expressed sequence tag (EST) database. Furthermore, 18 SSR motifs were found in 13 of the 33 predicted miRNAs. Trinucleotide motifs are the most abundant, followed by dinucleotides. Because miRNA-based SSR markers have been shown to play a significant role in genetic fingerprinting studies, these results should pave the way for developing novel markers for tagging tea-specific agronomic traits and for supporting non-conventional breeding programs.
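The SSR-detection step described in this abstract can be illustrated with a short sketch. It is not the authors' pipeline: the function name `find_ssrs` and the minimum repeat counts (four copies for dinucleotides, three for trinucleotides) are assumptions chosen only for illustration.

```python
import re

def find_ssrs(seq, min_repeats=None):
    """Return (start, unit, repeat_count) for di- and trinucleotide repeats.

    The thresholds in min_repeats are hypothetical defaults, not values
    taken from the study.
    """
    if min_repeats is None:
        min_repeats = {2: 4, 3: 3}  # unit length -> minimum number of copies
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # Group 2 is the repeat unit; \2 demands further identical copies.
        pattern = re.compile(r"(([ACGTU]{%d})\2{%d,})" % (unit_len, min_rep - 1))
        for m in pattern.finditer(seq):
            unit = m.group(2)
            if len(set(unit)) == 1:
                continue  # skip homopolymer runs such as AAAAAA
            hits.append((m.start(), unit, len(m.group(1)) // unit_len))
    return sorted(hits)

# Toy precursor-like sequence containing one (AT)n and one (CAG)n repeat.
print(find_ssrs("GGCATATATATATGCCAGCAGCAGCAGTT"))
```

Primers would then be designed from the sequence flanking each reported start position; that step is outside the scope of this sketch.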
G-Quadruplexes influence pri-microRNA processing.

Science.gov (United States)

Rouleau, Samuel G; Garant, Jean-Michel; Bolduc, François; Bisaillon, Martin; Perreault, Jean-Pierre

2018-02-01

RNA G-Quadruplexes (G4) have been shown to possess many biological functions, including the regulation of microRNA (miRNA) biogenesis and function. However, their impact on pri-miRNA processing remains unknown. We identified G4 located near the Drosha cleavage site in three distinct pri-miRNAs: pri-mir200c, pri-mir451a, and pri-mir497. The folding of the potential G4 motifs was determined in solution. Subsequently, mutations disrupting G4 folding led to important changes in the mature miRNA levels in cells. Moreover, using small antisense oligonucleotides binding to the pri-miRNA, it was possible to modulate, either positively or negatively, the mature miRNA levels. Together, these data demonstrate that G4 motifs could contribute to the regulation of pri-miRNA processing, a novel role for G4. Considering that bioinformatics screening indicates that between 9% and 50% of all pri-miRNAs contain a putative G4, these structures possess interesting potential as future therapeutic targets.
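The abstract does not say how the bioinformatic screen for putative G4 motifs was defined. As a hedged illustration only, the sketch below uses a commonly used sequence heuristic (four runs of at least three guanines separated by loops of 1 to 7 nucleotides); both the pattern and the function name `find_putative_g4` are assumptions, not the authors' criteria.

```python
import re

# Assumed heuristic: (G-run + loop) three times, then a final G-run.
# G-runs are >= 3 nt and loops are 1-7 nt; real screens may use other limits.
G4_PATTERN = re.compile(r"(?:G{3,}[ACGU]{1,7}){3}G{3,}")

def find_putative_g4(rna):
    """Return (start, end, matched_sequence) for non-overlapping G4 candidates."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    return [(m.start(), m.end(), m.group()) for m in G4_PATTERN.finditer(rna)]

# Toy sequence with four G-runs; a sequence with only three gives no hit.
print(find_putative_g4("AUGGGAGGGCUGGGUUGGGAC"))
print(find_putative_g4("AUGGGAGGGCUGGG"))
```

A sequence match of this kind only flags a candidate; as the abstract notes, actual folding has to be determined experimentally.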
Automatic annotation of protein motif function with Gene Ontology terms

Directory of Open Access Journals (Sweden)

Gopalakrishnan Vanathi

2004-09-01

Background: Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist for identifying candidate protein motifs at the whole genome level. However, a much needed and important task is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO) project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary, and these annotations serve well as a knowledge base. Results: This paper presents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifs is viewed as both a binary classification and an information retrieval problem, where PROSITE motifs are used as samples for model training and functional prediction. The mutual information of a motif and a GO term association is found to be a very useful feature. We take advantage of the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correct association. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions: In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical results demonstrate that the methods have great potential for detecting and augmenting information about the functions of newly discovered candidate protein motifs.
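The mutual-information feature mentioned in this abstract can be made concrete with a small sketch. The abstract does not give the exact formulation, so the version below is the standard mutual information of two binary variables (sequence matches the motif, sequence carries the GO term) computed from a 2x2 table of sequence counts; the function name and argument order are assumptions.

```python
import math

def mutual_information(n11, n10, n01, n00):
    """Mutual information (bits) between motif presence and GO-term presence.

    n11: sequences with motif and term     n10: motif only
    n01: term only                         n00: neither
    """
    total = n11 + n10 + n01 + n00
    if total == 0:
        raise ValueError("at least one sequence is required")
    motif_totals = {1: n11 + n10, 0: n01 + n00}
    term_totals = {1: n11 + n01, 0: n10 + n00}
    cells = {(1, 1): n11, (1, 0): n10, (0, 1): n01, (0, 0): n00}
    mi = 0.0
    for (motif, term), n in cells.items():
        if n == 0:
            continue  # a zero cell contributes nothing (0 * log 0 := 0)
        joint = n / total
        mi += joint * math.log2(n * total / (motif_totals[motif] * term_totals[term]))
    return mi

print(mutual_information(50, 0, 0, 50))    # motif and term always co-occur
print(mutual_information(25, 25, 25, 25))  # motif and term independent
```

In the approach the abstract describes, a value like this would be one input feature, alongside frequency-based features, to a logistic regression classifier.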
Structures and short linear motif of disordered transcription factor regions provide clues to the interactome of the cellular hub radical-induced cell death1

DEFF Research Database (Denmark)

O'Shea, Charlotte; Staby, Lasse; Bendsen, Sidsel Krogh

2017-01-01

Intrinsically disordered protein regions (IDRs) lack a well-defined three-dimensional structure, but often facilitate key protein functions. Some interactions between IDRs and folded protein domains rely on short linear motifs (SLiMs). These motifs are challenging to identify, but once found can...... point to larger networks of interactions, such as with proteins that serve as hubs for essential cellular functions. The stress-associated plant protein Radical-Induced Cell Death1 (RCD1) is one such hub, interacting with many transcription factors via their flexible IDRs. To identify the SLiM bound......046 formed different structures or were fuzzy in the complexes. These findings allow us to present a model of the stress-associated RCD1-transcription factor interactome and to contribute to the emerging understanding of the interactions between folded hubs and their intrinsically disordered partners....
High-Resolution RNA Maps Suggest Common Principles of Splicing and Polyadenylation Regulation by TDP-43

Directory of Open Access Journals (Sweden)

Gregor Rot

2017-05-01

Many RNA-binding proteins (RBPs) regulate both alternative exons and poly(A) site selection. To understand their regulatory principles, we developed expressRNA, a web platform encompassing computational tools for integration of iCLIP and RNA motif analyses with RNA-seq and 3′ mRNA sequencing. This reveals at nucleotide resolution the “RNA maps” describing how the RNA binding positions of RBPs relate to their regulatory functions. We use this approach to examine how TDP-43, an RBP involved in several neurodegenerative diseases, binds around its regulated poly(A) sites. Binding close to the poly(A) site generally represses, whereas binding further downstream enhances use of the site, which is similar to TDP-43 binding around regulated exons. Our RNAmotifs2 software also identifies sequence motifs that cluster together with the binding motifs of TDP-43. We conclude that TDP-43 directly regulates diverse types of pre-mRNA processing according to common position-dependent principles.
How pathogens use linear motifs to perturb host cell networks

KAUST Repository

Via, Allegra; Uyar, Bora; Brun, Christine; Zanzoni, Andreas

2015-01-01

Molecular mimicry is one of the powerful stratagems that pathogens employ to colonise their hosts and take advantage of host cell functions to guarantee their replication and dissemination. In particular, several viruses have evolved the ability to interact with host cell components through protein short linear motifs (SLiMs) that mimic host SLiMs, thus facilitating their internalisation and the manipulation of a wide range of cellular networks. Here we present convincing evidence from the literature that motif mimicry also represents an effective, widespread hijacking strategy in prokaryotic and eukaryotic parasites. Further insights into host motif mimicry would be of great help in the elucidation of the molecular mechanisms behind host cell invasion and the development of anti-infective therapeutic strategies.

A viral suppressor protein inhibits host RNA silencing by hooking up with Argonautes

KAUST Repository

Jin, Hailing; Zhu, Jian-Kang

2010-01-01

RNA viruses are particularly vulnerable to RNAi-based defenses in the host, and thus have evolved specific proteins, known as viral suppressors of RNA silencing (VSRs), as a counterdefense. In this issue of Genes & Development, Azevedo and colleagues (pp. 904-915) discovered that P38, the VSR of Turnip crinkle virus, uses its glycine/tryptophan (GW) motifs as an ARGONAUTE (AGO) hook to attract and disarm the host's essential effector of RNA silencing. Several GW motif-containing cellular proteins are known to be important partners of AGOs in RNA silencing effector complexes in yeast, plants, and animals. The GW motif appears to be a versatile and effective tool for regulating the activities of RNA silencing pathways, and the use of GW mimicry to compete for and inhibit host AGOs may be a strategy used by many pathogens to counteract host RNAi-based defenses. © 2010 by Cold Spring Harbor Laboratory Press.
Mechanism of duplex DNA destabilization by RNA-guided Cas9 nuclease during target interrogation.

Science.gov (United States)

Mekler, Vladimir; Minakhin, Leonid; Severinov, Konstantin

2017-05-23

The prokaryotic clustered regularly interspaced short palindromic repeats (CRISPR)-associated 9 (Cas9) endonuclease cleaves double-stranded DNA sequences specified by guide RNA molecules and flanked by a protospacer adjacent motif (PAM) and is widely used for genome editing in various organisms. The RNA-programmed Cas9 locates the target site by scanning genomic DNA. We sought to elucidate the mechanism of initial DNA interrogation steps that precede the pairing of target DNA with guide RNA. Using fluorometric and biochemical assays, we studied Cas9/guide RNA complexes with model DNA substrates that mimicked early intermediates on the pathway to the final Cas9/guide RNA-DNA complex. The results show that Cas9/guide RNA binding to PAM favors separation of a few PAM-proximal protospacer base pairs allowing initial target interrogation by guide RNA. The duplex destabilization is mediated, in part, by Cas9/guide RNA affinity for unpaired segments of nontarget strand DNA close to PAM. Furthermore, our data indicate that the entry of double-stranded DNA beyond a short threshold distance from PAM into the Cas9/single-guide RNA (sgRNA) interior is hindered. We suggest that the interactions unfavorable for duplex DNA binding promote DNA bending in the PAM-proximal region during early steps of Cas9/guide RNA-DNA complex formation, thus additionally destabilizing the protospacer duplex. The mechanism that emerges from our analysis explains how the Cas9/sgRNA complex is able to locate the correct target sequence efficiently while interrogating numerous nontarget sequences associated with correct PAMs.
Construction of RNA nanocages by re-engineering the packaging RNA of Phi29 bacteriophage

Science.gov (United States)

Hao, Chenhui; Li, Xiang; Tian, Cheng; Jiang, Wen; Wang, Guansong; Mao, Chengde

2014-05-01

RNA nanotechnology promises rational design of RNA nanostructures with a wide array of structural diversity and functionality. Such nanostructures could be used in applications such as small interfering RNA delivery and organization of in vivo chemical reactions. Despite impressive progress in recent years, RNA nanotechnology remains quite limited, and its programmability and complexity cannot yet rival those of its close cousin, DNA nanotechnology. Novel strategies are needed for programmed RNA self-assembly. Here, we have assembled RNA nanocages by re-engineering a natural, biological RNA motif: the packaging RNA of phi29 bacteriophage. The resulting RNA nanostructures have been thoroughly characterized by gel electrophoresis, cryogenic electron microscopy imaging and dynamic light scattering.
Nucleolin Mediates MicroRNA-directed CSF-1 mRNA Deadenylation but Increases Translation of CSF-1 mRNA*

Science.gov (United States)

Woo, Ho-Hyung; Baker, Terri; Laszlo, Csaba; Chambers, Setsuko K.

2013-01-01

CSF-1 mRNA 3′UTR contains multiple unique motifs, including a common microRNA (miRNA) target in close proximity to a noncanonical G-quadruplex and AU-rich elements (AREs). Using a luciferase reporter system fused to CSF-1 mRNA 3′UTR, disruption of the miRNA target region, G-quadruplex, and AREs together dramatically increased reporter RNA levels, suggesting important roles for these cis-acting regulatory elements in the down-regulation of CSF-1 mRNA. We find that nucleolin, which binds both G-quadruplex and AREs, enhances deadenylation of CSF-1 mRNA, promoting CSF-1 mRNA decay, while having the capacity to increase translation of CSF-1 mRNA. Through interaction with the CSF-1 3′UTR miRNA common target, we find that miR-130a and miR-301a inhibit CSF-1 expression by enhancing mRNA decay. Silencing of nucleolin prevents the miRNA-directed mRNA decay, indicating a requirement for nucleolin in miRNA activity on CSF-1 mRNA. Downstream effects followed by miR-130a and miR-301a inhibition of directed cellular motility of ovarian cancer cells were found to be dependent on nucleolin. The paradoxical effects of nucleolin on miRNA-directed CSF-1 mRNA deadenylation and on translational activation were explored further. The nucleolin protein contains four acidic stretches, four RNA recognition motifs (RRMs), and nine RGG repeats. All three domains in nucleolin regulate CSF-1 mRNA and protein levels. RRMs increase CSF-1 mRNA, whereas the acidic and RGG domains decrease CSF-1 protein levels. This suggests that nucleolin has the capacity to differentially regulate both CSF-1 RNA and protein levels. Our finding that nucleolin interacts with Ago2 indirectly via RNA and with poly(A)-binding protein C (PABPC) directly suggests a nucleolin-Ago2-PABPC complex formation on mRNA. This complex is in keeping with our suggestion that nucleolin may work with PABPC as a double-edged sword on both mRNA deadenylation and translational activation. Our findings underscore the complexity of
Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence

NARCIS (Netherlands)

Semenova, E.V.; Jore, M.M.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

2011-01-01

Prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR)/Cas (CRISPR-associated sequences) systems provide adaptive immunity against viruses when a spacer sequence of small CRISPR RNA (crRNA) matches a protospacer sequence in the viral genome. Viruses that escape CRISPR/Cas
Citrus psorosis virus RNA 1 is of negative polarity and potentially encodes in its complementary strand a 24K protein of unknown function and 280K putative RNA dependent RNA polymerase.

Science.gov (United States)

Naum-Onganía, Gabriela; Gago-Zachert, Selma; Peña, Eduardo; Grau, Oscar; Garcia, Maria Laura

2003-10-01

Citrus psorosis virus (CPsV), the type member of genus Ophiovirus, has three genomic RNAs. Complete sequencing of CPsV RNA 1 revealed a size of 8184 nucleotides and Northern blot hybridization with chain specific probes showed that its non-coding strand is preferentially encapsidated. The complementary strand of RNA 1 contains two open reading frames (ORFs) separated by a 109-nt intergenic region, one located near the 5'-end potentially encoding a 24K protein of unknown function, and another of 280K containing the core polymerase motifs characteristic of viral RNA-dependent RNA polymerases (RdRp). Comparison of the core RdRp motifs of negative-stranded RNA viruses, supports grouping CPsV, Ranunculus white mottle virus (RWMV) and Mirafiori lettuce virus (MiLV) within the same genus (Ophiovirus), constituting a monophyletic group separated from all other negative-stranded RNA viruses. Furthermore, RNAs 1 of MiLV, CPsV and RWMV are similar in size and those of MiLV and CPsV also in genomic organization and sequence.
A viral suppressor protein inhibits host RNA silencing by hooking up with Argonautes

KAUST Repository

Jin, Hailing

2010-05-01

RNA viruses are particularly vulnerable to RNAi-based defenses in the host, and thus have evolved specific proteins, known as viral suppressors of RNA silencing (VSRs), as a counterdefense. In this issue of Genes & Development, Azevedo and colleagues (pp. 904-915) discovered that P38, the VSR of Turnip crinkle virus, uses its glycine/tryptophan (GW) motifs as an ARGONAUTE (AGO) hook to attract and disarm the host's essential effector of RNA silencing. Several GW motif-containing cellular proteins are known to be important partners of AGOs in RNA silencing effector complexes in yeast, plants, and animals. The GW motif appears to be a versatile and effective tool for regulating the activities of RNA silencing pathways, and the use of GW mimicry to compete for and inhibit host AGOs may be a strategy used by many pathogens to counteract host RNAi-based defenses. © 2010 by Cold Spring Harbor Laboratory Press.
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

Science.gov (United States)

Ozaki, Haruka; Iwasaki, Wataru

2016-08-01

As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
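The abstract says every k-mer is analyzed but does not spell out the scoring. The sketch below is therefore not the MOCCS algorithm. It only illustrates the general idea of enumerating all k-mers in peak-centered sequences and asking which ones sit closest to the peak center; the function names and the mean-offset ranking are assumptions made for illustration.

```python
from collections import defaultdict

def kmer_center_offsets(sequences, k):
    """Map each k-mer to its absolute offsets from the window center."""
    offsets = defaultdict(list)
    for seq in sequences:
        seq = seq.upper()
        center = (len(seq) - k) / 2  # start index of a perfectly centered k-mer
        for i in range(len(seq) - k + 1):
            offsets[seq[i:i + k]].append(abs(i - center))
    return offsets

def rank_kmers(sequences, k, min_count=2):
    """Rank k-mers seen at least min_count times by mean distance to center."""
    offsets = kmer_center_offsets(sequences, k)
    ranked = [
        (sum(v) / len(v), kmer, len(v))
        for kmer, v in offsets.items()
        if len(v) >= min_count
    ]
    ranked.sort()
    return [(kmer, count, mean) for mean, kmer, count in ranked]

# Toy peak-centered windows that share ACGT at the center.
peaks = ["TGAACGTCCA", "CATACGTGAG", "GGCACGTTTA"]
print(rank_kmers(peaks, k=4))
```

For real data, the published implementation linked in the abstract should be used rather than this toy ranking.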
A novel RNA-recognition-motif protein is required for premeiotic G1/S-phase transition in rice (Oryza sativa L.).

Directory of Open Access Journals (Sweden)

Ken-Ichi Nonomura

2011-01-01

The molecular mechanism for meiotic entry remains largely elusive in flowering plants. Only Arabidopsis SWI1/DYAD and maize AM1, both of which are coiled-coil proteins, are known to be required for the initiation of plant meiosis. The mechanism underlying the synchrony of male meiosis, characteristic of flowering plants, has also been unclear in the plant kingdom. In other eukaryotes, RNA-recognition-motif (RRM) proteins are known to play essential roles in germ-cell development and meiosis progression. The rice MEL2 protein discovered in this study shows partial similarity to the human proline-rich RRM protein Deleted in Azoospermia-Associated Protein 1 (DAZAP1), though MEL2 also possesses ankyrin repeats and a RING finger motif. Expression analyses of several cell-cycle markers revealed that, in mel2 mutant anthers, most germ cells failed to enter premeiotic S-phase and meiosis, while some escaped the defect and underwent meiosis with a significant delay or continued mitotic cycles. Immunofluorescent detection revealed that T7 peptide-tagged MEL2 localized to the cytoplasmic perinuclear region of germ cells during premeiotic interphase in transgenic rice plants. This study is the first report of a plant RRM protein required for regulating the premeiotic G1/S-phase transition of male and female germ cells and for establishing synchrony of male meiosis. This study will contribute to elucidating the similarities and differences in reproductive systems between plants and other species.
Structure of Escherichia coli Hfq bound to polyriboadenylate RNA

DEFF Research Database (Denmark)

Link, Todd M; Valentin-Hansen, Poul; Brennan, Richard G

2009-01-01

Hfq is a small, highly abundant hexameric protein that is found in many bacteria and plays a critical role in mRNA expression and RNA stability. As an "RNA chaperone," Hfq binds AU-rich sequences and facilitates the trans annealing of small RNAs (sRNAs) to their target mRNAs, typically resulting in the down-regulation of gene expression. Hfq also plays a key role in bacterial RNA decay by binding tightly to polyadenylate [poly(A)] tracts. The structural mechanism by which Hfq recognizes and binds poly(A) is unknown. Here, we report the crystal structure of Escherichia coli Hfq bound to the poly(A) RNA, A(15). The structure reveals a unique RNA binding mechanism. Unlike uridine-containing sequences, which bind to the "proximal" face, the poly(A) tract binds to the "distal" face of Hfq using 6 tripartite binding motifs. Each motif consists of an adenosine specificity site (A site), which...
RBPmap: a web server for mapping binding sites of RNA-binding proteins.

Science.gov (United States)

Paz, Inbal; Kosti, Idit; Ares, Manuel; Cline, Melissa; Mandel-Gutfreund, Yael

2014-07-01

Regulation of gene expression is executed in many cases by RNA-binding proteins (RBPs) that bind to mRNAs as well as to non-coding RNAs. RBPs recognize their RNA target via specific binding sites on the RNA. Predicting the binding sites of RBPs is known to be a major challenge. We present a new webserver, RBPmap, freely accessible through the website http://rbpmap.technion.ac.il/ for accurate prediction and mapping of RBP binding sites. RBPmap has been developed specifically for mapping RBPs in human, mouse and Drosophila melanogaster genomes, though it supports other organisms too. RBPmap enables the users to select motifs from a large database of experimentally defined motifs. In addition, users can provide any motif of interest, given as either a consensus or a PSSM. The algorithm for mapping the motifs is based on a Weighted-Rank approach, which considers the clustering propensity of the binding sites and the overall tendency of regulatory regions to be conserved. In addition, RBPmap incorporates a position-specific background model, designed uniquely for different genomic regions, such as splice sites, 5' and 3' UTRs, non-coding RNA and intergenic regions. RBPmap was tested on high-throughput RNA-binding experiments and was proved to be highly accurate. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
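The abstract notes that a motif can be supplied as a PSSM. The sketch below shows only the basic step of scoring sequence windows against a log-odds PSSM. It does not implement RBPmap's Weighted-Rank approach, conservation filtering, or position-specific background model. The toy "UGUA" matrix, the uniform background, and the threshold are all assumptions for illustration.

```python
import math

def pssm_from_probs(probs, background=0.25):
    """Convert per-position base probabilities to log2-odds scores."""
    return [{b: math.log2(p / background) for b, p in col.items()} for col in probs]

def scan(seq, pssm, threshold):
    """Return (start, window, score) for windows scoring at or above threshold."""
    seq = seq.upper().replace("T", "U")  # accept DNA-style input as well
    width = len(pssm)
    hits = []
    for i in range(len(seq) - width + 1):
        window = seq[i:i + width]
        if any(base not in col for col, base in zip(pssm, window)):
            continue  # skip windows with ambiguous bases such as N
        score = sum(col[base] for col, base in zip(pssm, window))
        if score >= threshold:
            hits.append((i, window, score))
    return hits

def toy_column(consensus):
    """Hypothetical column: 0.85 for the consensus base, 0.05 for the others."""
    return {b: (0.85 if b == consensus else 0.05) for b in "ACGU"}

pssm = pssm_from_probs([toy_column(b) for b in "UGUA"])
print(scan("ACUGUAGGUGUCA", pssm, threshold=5.0))
```

With this toy matrix, the exact match UGUA passes the threshold, while the one-mismatch site UGUC scores well below it.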
Knockdown of Rice microRNA166 by Short Tandem Target Mimic (STTM).

Science.gov (United States)

Teotia, Sachin; Zhang, Dabing; Tang, Guiliang

2017-01-01

Small RNAs, including microRNAs (miRNAs), are abundant in plants and play key roles in controlling plant development and physiology. miRNAs regulate the expression of the target genes involved in key plant processes. Because of functional redundancy among miRNA family members in plants, an approach that silences all members simultaneously is desirable for their functional characterization. Target mimic (TM) was the first approach to achieve this goal. Short tandem target mimic (STTM) is a potent approach complementing TM for silencing miRNAs in plants. STTMs have been successfully used in dicots to block miRNA functions. Here, we describe in detail the protocol for designing an STTM construct to block miRNA functions in rice. Such an approach can be applied to silence miRNAs in other monocots as well.
Non-canonical binding interactions of the RNA recognition motif (RRM) domains of P34 protein modulate binding within the 5S ribonucleoprotein particle (5S RNP).

Science.gov (United States)

Kamina, Anyango D; Williams, Noreen

2017-01-01

RNA binding proteins are involved in many aspects of RNA metabolism. In Trypanosoma brucei, our laboratory has identified two trypanosome-specific RNA binding proteins P34 and P37 that are involved in the maturation of the 60S subunit during ribosome biogenesis. These proteins are part of the T. brucei 5S ribonucleoprotein particle (5S RNP) and P34 binds to 5S ribosomal RNA (rRNA) and ribosomal protein L5 through its N-terminus and its RNA recognition motif (RRM) domains. We generated truncated P34 proteins to determine these domains' interactions with 5S rRNA and L5. Our analyses demonstrate that RRM1 of P34 mediates the majority of binding with 5S rRNA and the N-terminus together with RRM1 contribute the most to binding with L5. We determined that the consensus ribonucleoprotein (RNP) 1 and 2 sequences, characteristic of canonical RRM domains, are not fully conserved in the RRM domains of P34. However, the aromatic amino acids previously described to mediate base stacking interactions with their RNA target are conserved in both of the RRM domains of P34. Surprisingly, mutation of these aromatic residues did not disrupt but instead enhanced 5S rRNA binding. However, we identified four arginine residues located in RRM1 of P34 that strongly impact L5 binding. These mutational analyses of P34 suggest that the binding site for 5S rRNA and L5 are near each other and specific residues within P34 regulate the formation of the 5S RNP. These studies show the unique way that the domains of P34 mediate binding with the T. brucei 5S RNP.
NoFold: RNA structure clustering without folding or alignment.

Science.gov (United States)

Middleton, Sarah A; Kim, Junhyong

2014-11-01

Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function, for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
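The central idea of this abstract, comparing two sequences through their scores against a shared panel of reference models rather than by aligning them to each other, can be sketched in a few lines. This is only an illustration under stated assumptions, not the NoFold implementation: the three-model panel, the score values, and the choice of Euclidean distance are all invented for the example.

```python
import math

def profile_distance(scores_a, scores_b):
    """Euclidean distance between two score profiles of equal length."""
    if len(scores_a) != len(scores_b):
        raise ValueError("profiles must be scored against the same models")
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(scores_a, scores_b)))

# Hypothetical scores of three sequences against three reference models
# (the real method scores against Rfam covariance models; these numbers are made up).
profiles = {
    "seq1": [5.0, 1.0, 0.0],
    "seq2": [4.0, 1.0, 0.0],
    "seq3": [0.0, 0.0, 6.0],
}

def nearest(name, profiles):
    """Name of the other sequence whose score profile is closest to `name`."""
    others = (n for n in profiles if n != name)
    return min(others, key=lambda n: profile_distance(profiles[name], profiles[n]))
```

Sequences with similar profiles (here seq1 and seq2) end up close together without ever being folded or aligned pairwise, which is what makes clustering in this score space cheap.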
Programmable RNA recognition and cleavage by CRISPR/Cas9.

Science.gov (United States)

O'Connell, Mitchell R; Oakes, Benjamin L; Sternberg, Samuel H; East-Seletsky, Alexandra; Kaplan, Matias; Doudna, Jennifer A

2014-12-11

The CRISPR-associated protein Cas9 is an RNA-guided DNA endonuclease that uses RNA-DNA complementarity to identify target sites for sequence-specific double-stranded DNA (dsDNA) cleavage. In its native context, Cas9 acts on DNA substrates exclusively because both binding and catalysis require recognition of a short DNA sequence, known as the protospacer adjacent motif (PAM), next to and on the strand opposite the twenty-nucleotide target site in dsDNA. Cas9 has proven to be a versatile tool for genome engineering and gene regulation in a large range of prokaryotic and eukaryotic cell types, and in whole organisms, but it has been thought to be incapable of targeting RNA. Here we show that Cas9 binds with high affinity to single-stranded RNA (ssRNA) targets matching the Cas9-associated guide RNA sequence when the PAM is presented in trans as a separate DNA oligonucleotide. Furthermore, PAM-presenting oligonucleotides (PAMmers) stimulate site-specific endonucleolytic cleavage of ssRNA targets, similar to PAM-mediated stimulation of Cas9-catalysed DNA cleavage. Using specially designed PAMmers, Cas9 can be specifically directed to bind or cut RNA targets while avoiding corresponding DNA sequences, and we demonstrate that this strategy enables the isolation of a specific endogenous messenger RNA from cells. These results reveal a fundamental connection between PAM binding and substrate selection by Cas9, and highlight the utility of Cas9 for programmable transcript recognition without the need for tags.
The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition

Energy Technology Data Exchange (ETDEWEB)

Tamayo, Joel V.; Teramoto, Takamasa; Chatterjee, Seema; Hall, Traci M. Tanaka; Gavis, Elizabeth R. (Princeton); (NIH)

2017-04-01

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.
A quantitative analysis of secondary RNA structure using domination based parameters on trees

Directory of Open Access Journals (Sweden)

Zou Yue

2006-03-01

Full Text Available Abstract Background It has become increasingly apparent that a comprehensive database of RNA motifs is essential in order to achieve new goals in genomic and proteomic research. Secondary RNA structures have frequently been represented by various modeling methods as graph-theoretic trees. Using graph theory as a modeling tool allows the vast resources of graphical invariants to be utilized to numerically identify secondary RNA motifs. The domination number of a graph is a graphical invariant that is sensitive to even a slight change in the structure of a tree. The invariants selected in this study are variations of the domination number of a graph. These graphical invariants are partitioned into two classes, and we define two parameters based on each of these classes. These parameters are calculated for all small order trees and a statistical analysis of the resulting data is conducted to determine if the values of these parameters can be utilized to identify which trees of orders seven and eight are RNA-like in structure. Results The statistical analysis shows that the domination based parameters correctly distinguish between the trees that represent native structures and those that are not likely candidates to represent RNA. Some of the trees previously identified as candidate structures are found to be "very" RNA like, while others are not, thereby refining the space of structures likely to be found as representing secondary RNA structure. Conclusion Search algorithms are available that mine nucleotide sequence databases. However, the number of motifs identified can be quite large, making a further search for similar motif computationally difficult. Much of the work in the bioinformatics arena is toward the development of better algorithms to address the computational problem. This work, on the other hand, uses mathematical descriptors to more clearly characterize the RNA motifs and thereby reduce the corresponding search space. These
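For readers unfamiliar with the invariant, the plain domination number of a small tree can be computed by exhaustive search, as in the sketch below. The abstract uses variations of the domination number that are not spelled out in this record, so the code shows only the basic invariant; the function name and the two example trees are illustrative choices.

```python
from itertools import combinations

def domination_number(n, edges):
    """Size of the smallest vertex set S such that every vertex is in S or adjacent to S.

    Vertices are labelled 0..n-1. Exhaustive search, suitable only for small graphs.
    """
    closed = {v: {v} for v in range(n)}  # closed neighborhoods N[v]
    for u, v in edges:
        closed[u].add(v)
        closed[v].add(u)
    for k in range(1, n + 1):
        for subset in combinations(range(n), k):
            covered = set().union(*(closed[v] for v in subset))
            if len(covered) == n:
                return k
    return 0  # only reached when the graph has no vertices

path7 = [(i, i + 1) for i in range(6)]  # path on 7 vertices
star7 = [(0, i) for i in range(1, 7)]   # star on 7 vertices
```

Two trees of the same order can have quite different values (the 7-vertex path needs three dominating vertices, the 7-vertex star only one), which is the sensitivity to tree shape that the abstract relies on.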
A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

Science.gov (United States)

Tran, Ngoc Tam L; Huang, Chun-Hsi

2014-02-20

ChIP-Seq (chromatin immunoprecipitation sequencing) is advantageous for motif finding because ChIP-Seq experiments narrow the search down to binding site locations. Recent motif finding tools facilitate motif detection by providing user-friendly Web interfaces. In this work, we reviewed nine motif finding Web tools that are capable of detecting binding site motifs in ChIP-Seq data. We showed that each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommend that users apply multiple motif finding Web tools that implement different algorithms to obtain significant motifs, overlapping similar motifs, and non-overlapping motifs. Finally, we provide suggestions for the future development of motif finding Web tools that better assist researchers in finding motifs in ChIP-Seq data.
Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

Science.gov (United States)

Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

2016-01-01

Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.
A Method to Predict the Structure and Stability of RNA/RNA Complexes.

Science.gov (United States)

Xu, Xiaojun; Chen, Shi-Jie

2016-01-01

RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.

Novel guanidinylated bioresponsive poly(amidoamine)s designed for short hairpin RNA delivery

Directory of Open Access Journals (Sweden)

Yu J

2016-12-01

Full Text Available Jiankun Yu,1 Jinmin Zhang,1 Haonan Xing,1 Yanping Sun,1 Zhen Yang,1 Tianzhi Yang,2 Cuifang Cai,1 Xiaoyun Zhao,3 Li Yang,1 Pingtian Ding1 1School of Pharmacy, Shenyang Pharmaceutical University, Shenyang, China; 2Department of Basic Pharmaceutical Sciences, School of Pharmacy, Husson University, Bangor, ME, USA; 3Department of Microbiology and Cell Biology, School of Life Science and Biopharmaceutics, Shenyang Pharmaceutical University, Shenyang, China Abstract: Two different disulfide (SS)-containing poly(amidoamine) (PAA) polymers were constructed using guanidino (Gua)-containing monomers (ie, arginine [Arg] and agmatine [Agm]) and N,N'-cystamine bisacrylamide (CBA) by Michael-addition polymerization. In order to characterize these two Gua-SS-PAA polymers and investigate their potentials as short hairpin RNA (shRNA)-delivery carriers, pSilencer 4.1-CMV FANCF shRNA was chosen as a model plasmid DNA to form complexes with these two polymers. The Gua-SS-PAAs and plasmid DNA complexes were determined with particle sizes less than 90 nm and positive ζ-potentials under 20 mV at nucleic acid:polymer weight ratios lower than 1:24. Bioresponsive release of plasmid DNA was observed from both newly constructed complexes. Significantly lower cytotoxicity was observed for both polymer complexes compared with polyethylenimine and Lipofectamine 2000, two widely used transfection reagents as reference carriers. Arg-CBA showed higher transfection efficiency and gene-silencing efficiency in MCF7 cells than Agm-CBA and the reference carriers. In addition, the cellular uptake of Arg-CBA in MCF7 cells was found to be higher and faster than Agm-CBA and the reference carriers. Similarly, plasmid DNA transport into the nucleus mediated by Arg-CBA was more than that by Agm-CBA and the reference carriers. The study suggested that guanidine and carboxyl introduced into Gua-SS-PAAs polymers resulted in a better nuclear localization effect, which played a key role in the
Identification of high-confidence RNA regulatory elements by combinatorial classification of RNA-protein binding sites.

Science.gov (United States)

Li, Yang Eric; Xiao, Mu; Shi, Binbin; Yang, Yu-Cheng T; Wang, Dong; Wang, Fei; Marcia, Marco; Lu, Zhi John

2017-09-08

Crosslinking immunoprecipitation sequencing (CLIP-seq) technologies have enabled researchers to characterize transcriptome-wide binding sites of RNA-binding protein (RBP) with high resolution. We apply a soft-clustering method, RBPgroup, to various CLIP-seq datasets to group together RBPs that specifically bind the same RNA sites. Such combinatorial clustering of RBPs helps interpret CLIP-seq data and suggests functional RNA regulatory elements. Furthermore, we validate two RBP-RBP interactions in cell lines. Our approach links proteins and RNA motifs known to possess similar biochemical and cellular properties and can, when used in conjunction with additional experimental data, identify high-confidence RBP groups and their associated RNA regulatory elements.
Characterization of short interspersed elements (SINEs) in a red alga, Porphyra yezoensis.

Science.gov (United States)

Zhang, Wenbo; Lin, Xiaofei; Peddigari, Suresh; Takechi, Katsuaki; Takano, Hiroyoshi; Takio, Susumu

2007-02-01

Short interspersed element (SINE)-like sequences referred to as PySN1 and PySN2 were identified in a red alga, Porphyra yezoensis. Both elements contained an internal promoter with motifs (A box and B box) recognized by RNA polymerase III, and target site duplications at both ends. Genomic Southern blot analysis revealed that both elements were widely and abundantly distributed on the genome. 3' and 5' RACE suggested that PySN1 was expressed as a chimera transcript with flanking SINE-unrelated sequences and possessed the poly-A tail at the same position near the 3' end of PySN1.
CHSalign: A Web Server That Builds upon Junction-Explorer and RNAJAG for Pairwise Alignment of RNA Secondary Structures with Coaxial Helical Stacking.

Directory of Open Access Journals (Sweden)

Lei Hua

Full Text Available RNA junctions are important structural elements of RNA molecules. They are formed when three or more helices come together in three-dimensional space. Recent studies have focused on the annotation and prediction of coaxial helical stacking (CHS) motifs within junctions. Here we exploit such predictions to develop an efficient alignment tool to handle RNA secondary structures with CHS motifs. Specifically, we build upon our Junction-Explorer software for predicting coaxial stacking and RNAJAG for modelling junction topologies as tree graphs to incorporate constrained tree matching and dynamic programming algorithms into a new method, called CHSalign, for aligning the secondary structures of RNA molecules containing CHS motifs. Thus, CHSalign is intended to be an efficient alignment tool for RNAs containing similar junctions. Experimental results based on thousands of alignments demonstrate that CHSalign can align two RNA secondary structures containing CHS motifs more accurately than other RNA secondary structure alignment tools. CHSalign yields a high score when aligning two RNA secondary structures with similar CHS motifs or helical arrangement patterns, and a low score otherwise. This new method has been implemented in a web server, and the program is also made freely available, at http://bioinformatics.njit.edu/CHSalign/.
A Structural Overview of RNA-Dependent RNA Polymerases from the Flaviviridae Family

Directory of Open Access Journals (Sweden)

Jiqin Wu

2015-06-01

Full Text Available RNA-dependent RNA polymerases (RdRPs) from the Flaviviridae family are representatives of viral polymerases that carry out RNA synthesis through a de novo initiation mechanism. They share a ≈ 600-residue polymerase core that displays a canonical viral RdRP architecture resembling an encircled right hand with palm, fingers, and thumb domains surrounding the active site. Polymerase catalytic motifs A–E in the palm and motifs F/G in the fingers are shared by all viral RdRPs with sequence and/or structural conservations regardless of the mechanism of initiation. Different from RdRPs carrying out primer-dependent initiation, Flaviviridae and other de novo RdRPs utilize a priming element often integrated in the thumb domain to facilitate primer-independent initiation. Upon the transition to the elongation phase, this priming element needs to undergo currently unresolved conformational rearrangements to accommodate the growth of the template-product RNA duplex. In the genera of Flavivirus and Pestivirus, the polymerase module in the C-terminal part of the RdRP protein may be regulated in cis by the N-terminal region of the same polypeptide. Either being a methyltransferase in Flavivirus or a functionally unclarified module in Pestivirus, this region could play auxiliary roles for the canonical folding and/or the catalysis of the polymerase, through defined intra-molecular interactions.
Binding properties of SUMO-interacting motifs (SIMs) in yeast.

Science.gov (United States)

Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

2015-03-01

Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins are known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are still poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrast, the stability of the parallel binding mode is more dependent on the sequence features of the SIM, such as the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.
Selection against spurious promoter motifs correlates with translational efficiency across bacteria

Energy Technology Data Exchange (ETDEWEB)

Froula, Jeffrey L.; Francino, M. Pilar

2007-05-01

Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the σ⁷⁰ subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.
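Over- or under-representation of a motif is commonly expressed as the ratio of its observed count to the count expected from base composition alone. The sketch below shows that ratio for an exact-match motif under an independent-bases null model; both the null model and the exact-match criterion are simplifying assumptions made here, and the record does not say which statistical model or motif definition the study used.

```python
def observed_expected_ratio(seq, motif):
    """Observed / expected count of `motif` in `seq` (one strand, overlaps allowed).

    Expected count assumes independent bases drawn at the frequencies found in `seq`.
    """
    seq, motif = seq.upper(), motif.upper()
    windows = len(seq) - len(motif) + 1
    if windows <= 0:
        return 0.0
    observed = sum(seq.startswith(motif, i) for i in range(windows))
    freq = {base: seq.count(base) / len(seq) for base in "ACGT"}
    probability = 1.0
    for base in motif:
        probability *= freq.get(base, 0.0)
    expected = probability * windows
    return observed / expected if expected else 0.0
```

A ratio well below 1 in a given class of sequence would be consistent with selection removing the motif there, and a ratio above 1 with selection maintaining it; in practice a significance test and a more careful background model would be needed before drawing that conclusion.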
Structure-Function Model for Kissing Loop Interactions That Initiate Dimerization of Ty1 RNA

Directory of Open Access Journals (Sweden)

Eric R. Gamache

2017-04-01

Full Text Available The genomic RNA of the retrotransposon Ty1 is packaged as a dimer into virus-like particles. The 5′ terminus of Ty1 RNA harbors cis-acting sequences required for translation initiation, packaging and initiation of reverse transcription (TIPIRT). To identify RNA motifs involved in dimerization and packaging, a structural model of the TIPIRT domain in vitro was developed from single-nucleotide resolution RNA structural data. In general agreement with previous models, the first 326 nucleotides of Ty1 RNA form a pseudoknot with a 7-bp stem (S1), a 1-nucleotide interhelical loop and an 8-bp stem (S2) that delineate two long, structured loops. Nucleotide substitutions that disrupt either pseudoknot stem greatly reduced helper-Ty1-mediated retrotransposition of a mini-Ty1, but only mutations in S2 destabilized mini-Ty1 RNA in cis and helper-Ty1 RNA in trans. Nested in different loops of the pseudoknot are two hairpins with complementary 7-nucleotide motifs at their apices. Nucleotide substitutions in either motif also reduced retrotransposition and destabilized mini- and helper-Ty1 RNA. Compensatory mutations that restore base-pairing in the S2 stem or between the hairpins rescued retrotransposition and RNA stability in cis and trans. These data inform a model whereby a Ty1 RNA kissing complex with two intermolecular kissing-loop interactions initiates dimerization and packaging.
The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

Directory of Open Access Journals (Sweden)

Roberts Richard J

2008-05-01

Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360), cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.
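The recognition sequence GRCGYC is written in IUPAC nucleotide codes, where R stands for a purine (A or G) and Y for a pyrimidine (C or T). A minimal sketch of how such a degenerate site can be located in a DNA string follows; the helper name and the test sequence are hypothetical, and only the codes needed for this site are included in the lookup table.

```python
import re

# IUPAC codes needed for GRCGYC; a full table would also cover N, W, S, and so on.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T", "R": "[AG]", "Y": "[CT]"}

def find_sites(pattern, dna):
    """Return 0-based start positions of all matches, including overlapping ones."""
    body = "".join(IUPAC[base] for base in pattern.upper())
    lookahead = re.compile("(?=(" + body + "))")  # zero-width match keeps overlaps
    return [m.start() for m in lookahead.finditer(dna.upper())]

example = "TTGACGTCAAGGCGCCTT"  # made-up sequence containing GACGTC and GGCGCC
sites = find_sites("GRCGYC", example)
```

Because GRCGYC is its own reverse complement, scanning one strand is enough to find every site in double-stranded DNA.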
RNA and RNP as Building Blocks for Nanotechnology and Synthetic Biology.

Science.gov (United States)

Ohno, Hirohisa; Saito, Hirohide

2016-01-01

Recent technologies that aimed to elucidate cellular function have revealed essential roles for RNA molecules in living systems. Our knowledge concerning functional and structural information of naturally occurring RNA and RNA-protein (RNP) complexes is increasing rapidly. RNA and RNP interaction motifs are structural units that function as building blocks to constitute a variety of complex structures. RNA-central synthetic biology and nanotechnology are constructive approaches that employ the accumulated information and build synthetic RNA (RNP)-based circuits and nanostructures. Here, we describe how to design and construct synthetic RNA (RNP)-based devices and structures at the nanometer-scale for biological and future therapeutic applications. RNA/RNP nanostructures can also be utilized as the molecular scaffold to control the localization or interactions of target molecule(s). Moreover, RNA motifs recognized by RNA-binding proteins can be applied to make protein-responsive translational "switches" that can turn gene expression "on" or "off" depending on the intracellular environment. This "synthetic RNA and RNP world" will expand tools for nanotechnology and synthetic biology. In addition, these reconstructive approaches would lead to a greater understanding of the building principles of naturally occurring RNA/RNP molecules and systems. Copyright © 2016 Elsevier Inc. All rights reserved.
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

Science.gov (United States)

Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

2011-06-20

One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

Directory of Open Access Journals (Sweden)

Martin Juliette

2011-06-01

Full Text Available Abstract Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.
Specificity and affinity motifs for Grb2 SH2-ligand interactions

NARCIS (Netherlands)

Kessels, Helmut W. H. G.; Ward, Alister C.; Schumacher, Ton N. M.

2002-01-01

Protein-protein interactions are often mediated by the recognition of short continuous amino acid stretches on target proteins by specific binding domains. Affinity-based selection strategies have successfully been used to define recognition motifs for a large series of such protein domains.
Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

Science.gov (United States)

Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

2012-01-01

Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235
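The split 8-mer notation X4-N1-30-X4 describes two 4-mers separated by a fixed gap of 1 to 30 arbitrary bases. A minimal sketch of how such pairs can be enumerated for one gap length is given below; the function name is illustrative, and the promoter localization statistics that the study applies to these counts are not reproduced.

```python
from collections import Counter

def split_8mers(seq, gap):
    """Count (left 4-mer, right 4-mer) pairs separated by exactly `gap` bases."""
    seq = seq.upper()
    span = 4 + gap + 4  # total window length for X4-N{gap}-X4
    counts = Counter()
    for i in range(len(seq) - span + 1):
        left = seq[i:i + 4]
        right = seq[i + 4 + gap:i + span]
        counts[(left, right)] += 1
    return counts

# The conserved 11-mer quoted in the abstract, scanned with a 1-bp gap.
pairs = split_8mers("CGGAAGTGACG", gap=1)
```

Repeating the scan for each gap from 1 to 30 and comparing counts between promoter and background sequences would give the kind of distance-specific signal the abstract describes, though the exact scoring used by the authors is not stated in this record.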
The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition.

Science.gov (United States)

Tamayo, Joel V; Teramoto, Takamasa; Chatterjee, Seema; Hall, Traci M Tanaka; Gavis, Elizabeth R

2017-04-04

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3' untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo's RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo's functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins. Copyright © 2017 The Author(s). Published by Elsevier Inc. All rights reserved.
The Drosophila hnRNP F/H Homolog Glorund Uses Two Distinct RNA-Binding Modes to Diversify Target Recognition

Directory of Open Access Journals (Sweden)

Joel V. Tamayo

2017-04-01

The Drosophila hnRNP F/H homolog, Glorund (Glo), regulates nanos mRNA translation by interacting with a structured UA-rich motif in the nanos 3′ untranslated region. Glo regulates additional RNAs, however, and mammalian homologs bind G-tract sequences to regulate alternative splicing, suggesting that Glo also recognizes G-tract RNA. To gain insight into how Glo recognizes both structured UA-rich and G-tract RNAs, we used mutational analysis guided by crystal structures of Glo’s RNA-binding domains and identified two discrete RNA-binding surfaces that allow Glo to recognize both RNA motifs. By engineering Glo variants that favor a single RNA-binding mode, we show that a subset of Glo’s functions in vivo is mediated solely by the G-tract binding mode, whereas regulation of nanos requires both recognition modes. Our findings suggest a molecular mechanism for the evolution of dual RNA motif recognition in Glo that may be applied to understanding the functional diversity of other RNA-binding proteins.
Phenotypic silencing of cytoplasmic genes using sequence-specific double-stranded short interfering RNA and its application in the reverse genetics of wild type negative-strand RNA viruses

Directory of Open Access Journals (Sweden)

Barik Sailen

2001-12-01

Background: Post-transcriptional gene silencing (PTGS) by short interfering RNA has opened up new directions in the phenotypic mutation of cellular genes. However, its efficacy on non-nuclear genes and its effect on the interferon pathway remain unexplored. Since directed mutation of RNA genomes is not possible through conventional mutagenesis, we have tested sequence-specific 21-nucleotide long double-stranded RNAs (dsRNAs) for their ability to silence cytoplasmic RNA genomes. Results: Short dsRNAs were generated against specific mRNAs of respiratory syncytial virus, a nonsegmented negative-stranded RNA virus with a cytoplasmic life cycle. At nanomolar concentrations, the dsRNAs specifically abrogated expression of the corresponding viral proteins, and produced the expected mutant phenotype ex vivo. The dsRNAs did not induce an interferon response, and did not inhibit cellular gene expression. The ablation of the viral proteins correlated with the loss of the specific mRNAs. In contrast, viral genomic and antigenomic RNA, which are encapsidated, were not directly affected. Conclusions: Synthetic inhibitory dsRNAs are effective in specific silencing of RNA genomes that are exclusively cytoplasmic and transcribed by RNA-dependent RNA polymerases. RNA-directed RNA gene silencing does not require cloning, expression, and mutagenesis of viral cDNA, and thus, will allow the generation of phenotypic null mutants of specific RNA viral genes under normal infection conditions and at any point in the infection cycle. This will, for the first time, permit functional genomic studies, attenuated infections, reverse genetic analysis, and studies of host-virus signaling pathways using a wild type RNA virus, unencumbered by any superinfecting virus.
Defining the RNA Internal Loops Preferred by Benzimidazole Derivatives via Two-Dimensional Combinatorial Screening and Computational Analysis

Science.gov (United States)

Velagapudi, Sai Pradeep; Seedhouse, Steven J.; French, Jonathan

2011-01-01

RNA is an important therapeutic target, however, RNA targets are generally underexploited due to a lack of understanding of the small molecules that bind RNA and the RNA motifs that bind small molecules. Herein, we describe the identification of the RNA internal loops derived from a 4096-member 3×3 nucleotide loop library that are the most specific and highest affinity binders to a series of four designer, drug-like benzimidazoles. These studies establish a potentially general protocol to define the highest affinity and most specific RNA motif targets for heterocyclic small molecules. Such information could be used to target functionally important RNAs in genomic sequence. PMID:21604752
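The library size quoted above follows from simple counting. As a minimal sketch, assuming a 3×3 internal loop has three randomized positions on each strand and that every position is drawn freely from A, C, G and U (the function name `loop_library` is hypothetical), the members can be enumerated directly:

```python
from itertools import product

BASES = "ACGU"

def loop_library(n_top=3, n_bottom=3):
    """Enumerate (top strand, bottom strand) loop sequences with every position randomized."""
    return [
        ("".join(p[:n_top]), "".join(p[n_top:]))
        for p in product(BASES, repeat=n_top + n_bottom)
    ]

library = loop_library()
print(len(library))  # 4096, i.e. 4 ** 6
print(library[0])    # ('AAA', 'AAA')
```

Six randomized positions with four choices each give 4^6 = 4096 members, which matches the library size stated in the abstract.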
MultiSETTER: web server for multiple RNA structure comparison.

Science.gov (United States)

Čech, Petr; Hoksza, David; Svozil, Daniel

2015-08-12

Understanding the architecture and function of RNA molecules requires methods for comparing and analyzing their tertiary and quaternary structures. While structural superposition of short RNAs is achievable in a reasonable time, large structures represent a much bigger challenge. Therefore, we have developed a fast and accurate algorithm for RNA pairwise structure superposition called SETTER and implemented it in the SETTER web server. However, though biological relationships can be inferred by a pairwise structure alignment, key features preserved by evolution can be identified only from a multiple structure alignment. Thus, we extended the SETTER algorithm to the alignment of multiple RNA structures and developed the MultiSETTER algorithm. In this paper, we present the updated version of the SETTER web server that implements a user-friendly interface to the MultiSETTER algorithm. The server accepts RNA structures either as a list of PDB IDs or as user-defined PDB files. After the superposition is computed, structures are visualized in 3D and several reports and statistics are generated. To the best of our knowledge, the MultiSETTER web server is the first publicly available tool for multiple RNA structure alignment. The MultiSETTER server offers visual inspection of an alignment in 3D space, which may reveal structural and functional relationships not captured by other multiple alignment methods based either on sequence or on secondary structure motifs.
POWRS: position-sensitive motif discovery.

Directory of Open Access Journals (Sweden)

Ian W Davis

Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm "POWRS" (POsition-sensitive WoRd Set) for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties. BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.

Genomic binding profiles of functionally distinct RNA polymerase III transcription complexes in human cells.

Science.gov (United States)

Moqtaderi, Zarmik; Wang, Jie; Raha, Debasish; White, Robert J; Snyder, Michael; Weng, Zhiping; Struhl, Kevin

2010-05-01

Genome-wide occupancy profiles of five components of the RNA polymerase III (Pol III) machinery in human cells identified the expected tRNA and noncoding RNA targets and revealed many additional Pol III-associated loci, mostly near short interspersed elements (SINEs). Several genes are targets of an alternative transcription factor IIIB (TFIIIB) containing Brf2 instead of Brf1 and have extremely low levels of TFIIIC. Strikingly, expressed Pol III genes, unlike nonexpressed Pol III genes, are situated in regions with a pattern of histone modifications associated with functional Pol II promoters. TFIIIC alone associates with numerous ETC loci, via the B box or a novel motif. ETCs are often near CTCF binding sites, suggesting a potential role in chromosome organization. Our results suggest that human Pol III complexes associate preferentially with regions near functional Pol II promoters and that TFIIIC-mediated recruitment of TFIIIB is regulated in a locus-specific manner.
A G-quadruplex-containing RNA activates fluorescence in a GFP-like fluorophore

Energy Technology Data Exchange (ETDEWEB)

Huang, Hao; Suslov, Nikolai B.; Li, Nan-Sheng; Shelke, Sandip A.; Evans, Molly E.; Koldobskaya, Yelena; Rice, Phoebe A.; Piccirilli, Joseph A. [UC

2014-08-21

Spinach is an in vitro–selected RNA aptamer that binds a GFP-like ligand and activates its green fluorescence. Spinach is thus an RNA analog of GFP and has potentially widespread applications for in vivo labeling and imaging. We used antibody-assisted crystallography to determine the structures of Spinach both with and without bound fluorophore at 2.2-Å and 2.4-Å resolution, respectively. Spinach RNA has an elongated structure containing two helical domains separated by an internal bulge that folds into a G-quadruplex motif of unusual topology. The G-quadruplex motif and adjacent nucleotides comprise a partially preformed binding site for the fluorophore. The fluorophore binds in a planar conformation and makes extensive aromatic stacking and hydrogen bond interactions with the RNA. Our findings provide a foundation for structure-based engineering of new fluorophore-binding RNA aptamers.
Import of desired nucleic acid sequences using addressing motif of mitochondrial ribosomal 5S-rRNA for fluorescent in vivo hybridization of mitochondrial DNA and RNA.

Science.gov (United States)

Zelenka, Jaroslav; Alán, Lukáš; Jabůrek, Martin; Ježek, Petr

2014-04-01

Based on the matrix-addressing sequence of mitochondrial ribosomal 5S-rRNA (termed MAM), which is naturally imported into mitochondria, we have constructed an import system for in vivo targeting of mitochondrial DNA (mtDNA) or mt-mRNA, in order to provide fluorescence hybridization of the desired sequences. Thus DNA oligonucleotides were constructed, containing the 5'-flanked T7 RNA polymerase promoter. After in vitro transcription and fluorescent labeling with Alexa Fluor(®) 488 or 647 dye, we obtained the fluorescent "L-ND5 probe" containing MAM and exemplar cargo, i.e., annealing sequence to a short portion of ND5 mRNA and to the light-strand mtDNA complementary to the heavy strand nd5 mt gene (5'-end 21 base pair sequence). For mitochondrial in vivo fluorescent hybridization, HepG2 cells were treated with dequalinium micelles, containing the fluorescent probes, bringing the probes proximally to the mitochondrial outer membrane and to the natural import system. A verification of import into the mitochondrial matrix of cultured HepG2 cells was provided by confocal microscopy colocalizations. Transfections using lipofectamine or probes without 5S-rRNA addressing MAM sequence or with MAM only were ineffective. Alternatively, the same DNA oligonucleotides with 5'-CACC overhang (substituting T7 promoter) were transcribed from the tetracycline-inducible pENTRH1/TO vector in human embryonic kidney T-REx®-293 cells, while mitochondrial matrix localization after import of the resulting unlabeled RNA was detected by PCR. The MAM-containing probe was then enriched by three-order of magnitude over the natural ND5 mRNA in the mitochondrial matrix. In conclusion, we present a proof-of-principle for mitochondrial in vivo hybridization and mitochondrial nucleic acid import.
Multi-resistance strategy for viral diseases and short hairpin RNA verification method in pigs

Directory of Open Access Journals (Sweden)

Jong-nam Oh

2018-04-01

Objective: Foot and mouth disease (FMD) and porcine reproductive and respiratory syndrome (PRRS) are major diseases that interrupt porcine production. Because they are viral diseases, vaccinations are of only limited effectiveness in preventing outbreaks. To establish an alternative multi-resistant strategy against FMD virus (FMDV) and PRRS virus (PRRSV), the present study introduced two genetic modification techniques to porcine cells. Methods: First, cluster of differentiation 163 (CD163), the PRRSV viral receptor, was edited with the clustered regularly interspaced short palindromic repeats-CRISPR-associated protein 9 technique. The CD163 gene sequences of edited cells and control cells differed. Second, short hairpin RNAs (shRNAs) were integrated into the cells. The shRNAs, targeting the 3D gene of FMDV and the open reading frame 7 (ORF7) gene of PRRSV, were transferred into fibroblasts. We also developed an in vitro shRNA verification method with a target gene expression vector. Results: shRNA activity was confirmed in vitro with vectors that expressed the 3D and ORF7 genes in the cells. Cells containing shRNAs showed lower transcript levels than cells with only the expression vectors. The shRNAs were integrated into CD163-edited cells to combine the two techniques, and the viral genes were suppressed in these cells. Conclusion: We established a multi-resistant strategy against viral diseases and an in vitro shRNA verification method.
Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

Directory of Open Access Journals (Sweden)

Mark D McDonnell

Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs) and 'functional' (partial subgraphs). Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.
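To make the idea of a per-node subgraph count concrete, here is a minimal sketch for a single motif only, the directed 3-cycle. It is not the paper's conversion-matrix method or its Matlab code; it assumes a 0/1 adjacency matrix without self-loops, where `adj[i][j] == 1` means an edge from node i to node j, and the names `matmul` and `cycle_participation` are hypothetical. Because extra edges among the three nodes do not exclude a match, this is a 'functional' (partial subgraph) count in the abstract's terminology.

```python
def matmul(a, b):
    """Multiply two square matrices given as lists of lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def cycle_participation(adj):
    """Per-node count of directed 3-cycles, read from the diagonal of A^3."""
    a3 = matmul(matmul(adj, adj), adj)
    return [a3[i][i] for i in range(len(adj))]

# Hypothetical 4-node network: 0 -> 1 -> 2 -> 0 forms a cycle, and 2 -> 3 dangles.
adj = [
    [0, 1, 0, 0],
    [0, 0, 1, 0],
    [1, 0, 0, 1],
    [0, 0, 0, 0],
]

per_node = cycle_participation(adj)
print(per_node)            # [1, 1, 1, 0]
print(sum(per_node) // 3)  # 1 (global count: each cycle is seen once per member node)
```

The per-node vector carries more information than the global count: node 3 participates in no cycle, which the single global number cannot show.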
Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

Science.gov (United States)

Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

2013-01-01

The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.
Modulation of i-motif thermodynamic stability by the introduction of UNA (unlocked nucleic acid) monomers

DEFF Research Database (Denmark)

Pasternak, Anna; Wengel, Jesper

2011-01-01

The influence of acyclic RNA derivatives, UNA (unlocked nucleic acid) monomers, on i-DNA thermodynamic stability has been investigated. The 22 nt human telomeric fragment was chosen as the model sequence for stability studies. UNA monomers modulate i-motif stability in a position-depending manner...
Programmed self-assembly of DNA/RNA for biomedical applications

Science.gov (United States)

Wang, Pengfei

Three self-assembly strategies were utilized for assembly of novel functional DNA/RNA nanostructures. An RNA-DNA hybrid origami method was developed to fabricate nano-objects (ribbon, rectangle, and triangle) with precisely controlled geometry. Unlike conventional DNA origami, which uses a long DNA single strand as the scaffold, a long RNA single strand was used instead, which was folded by short DNA single strands (staples) into prescribed objects through sequence-specific hybridization between RNA and DNA. Single-stranded tiles (SST) and RNA-DNA hybrid origami were utilized to fabricate a variety of barcode-like nanostructures with unique patterns by expanding a plain rectangle via introducing spacers (10-bp dsDNA segments) between parallel duplexes. Finally, complex 2D arrays and 3D polyhedra with multiple patterns within one structure were assembled from simple DNA motifs. Two demonstrations of biomedical applications of DNA nanotechnology were presented. Firstly, lambda-DNA was used as a template to direct the fabrication of multi-component magnetic nanoparticle chains. Nuclear magnetic relaxation (NMR) characterization showed superb magnetic relaxivity of the nanoparticle chains, which have great potential as MRI contrast agents. Secondly, DNA nanotechnology was introduced into the conformational study of a routinely used catalytic DNAzyme, the RNA-cleaving 10-23 DNAzyme. The relative angle between the two flanking duplexes of the catalytic core was determined (94.8°), which may provide a clue to further understanding of the cleaving mechanism of this DNAzyme from a conformational perspective.
Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

Directory of Open Access Journals (Sweden)

Yin Wang

2016-01-01

Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.
Endogenous short RNAs generated by Dicer 2 and RNA-dependent RNA polymerase 1 regulate mRNAs in the basal fungus Mucor circinelloides

Science.gov (United States)

Nicolas, Francisco Esteban; Moxon, Simon; de Haro, Juan P.; Calo, Silvia; Grigoriev, Igor V.; Torres-Martínez, Santiago; Moulton, Vincent; Ruiz-Vázquez, Rosa M.; Dalmay, Tamas

2010-01-01

Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi. PMID:20427422
Endogenous short RNAs generated by Dicer 2 and RNA-dependent RNA polymerase 1 regulate mRNAs in the basal fungus Mucor circinelloides

Energy Technology Data Exchange (ETDEWEB)

Grigoriev, Igor; Nicolas, Francisco; Moxon, Simon; Haro, Juan de; Calo, Silvia; Torres-Martinez, Santiago; Moulton, Vincent; Ruiz-Vazquez, Rosa; Dalmay, Tamas

2011-09-01

Endogenous short RNAs (esRNAs) play diverse roles in eukaryotes and usually are produced from double-stranded RNA (dsRNA) by Dicer. esRNAs are grouped into different classes based on biogenesis and function but not all classes are present in all three eukaryotic kingdoms. The esRNA register of fungi is poorly described compared to other eukaryotes and it is not clear what esRNA classes are present in this kingdom and whether they regulate the expression of protein coding genes. However, evidence that some dicer mutant fungi display altered phenotypes suggests that esRNAs play an important role in fungi. Here, we show that the basal fungus Mucor circinelloides produces new classes of esRNAs that map to exons and regulate the expression of many protein coding genes. The largest class of these exonic-siRNAs (ex-siRNAs) are generated by RNA-dependent RNA Polymerase 1 (RdRP1) and dicer-like 2 (DCL2) and target the mRNAs of protein coding genes from which they were produced. Our results expand the range of esRNAs in eukaryotes and reveal a new role for esRNAs in fungi
Adeno-Associated Viral Vector-Mediated mTOR Inhibition by Short Hairpin RNA Suppresses Laser-Induced Choroidal Neovascularization

Directory of Open Access Journals (Sweden)

Tae Kwann Park

2017-09-01

Choroidal neovascularization (CNV) is the defining characteristic feature of the wet subtype of age-related macular degeneration (AMD) and may result in irreversible blindness. Based on anti-vascular endothelial growth factor (anti-VEGF), the current therapeutic approaches to CNV are fraught with difficulties, and mammalian target of rapamycin (mTOR) has recently been proposed as a possible therapeutic target, although few studies have been conducted. Here, we show that a recombinant adeno-associated virus-delivered mTOR-inhibiting short hairpin RNA (rAAV-mTOR shRNA), which blocks the activity of both mTOR complex 1 and 2, represents a promising therapeutic approach for the treatment of CNV. Eight-week-old male C57/B6 mice were treated with the short hairpin RNA (shRNA) after generating CNV lesions in the eyes via laser photocoagulation. The recombinant adeno-associated virus (rAAV) delivery vehicle was able to effectively transduce cells in the inner retina, and significantly fewer inflammatory cells and less extensive CNV were observed in the animals treated with rAAV-mTOR shRNA when compared with control- and rAAV-scrambled shRNA-treated groups. Presumably related to the reduction of CNV, increased autophagy was detected in CNV lesions treated with rAAV-mTOR shRNA, whereas significantly fewer apoptotic cells detected in the outer nuclear layer around the CNV indicate that mTOR inhibition may also have neuroprotective effects. Taken together, these results demonstrate the therapeutic potential of mTOR inhibition, resulting from rAAV-mTOR shRNA activity, in the treatment of AMD-related CNV. Keywords: retinal neovascularization, choroidal neovascularization, adeno-associated virus, mTOR, RNA interference, mTOR shRNA, autophagy
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

Science.gov (United States)

Fauteux, François; Strömvik, Martina V

2009-01-01

Background: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs.
Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

Directory of Open Access Journals (Sweden)

Fauteux François

2009-10-01

Background: Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results: We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion: Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination
Characterization of the Zika virus induced small RNA response in Aedes aegypti cells.

Directory of Open Access Journals (Sweden)

Margus Varjak

2017-10-01

Full Text Available RNA interference (RNAi) controls arbovirus infections in mosquitoes. Two different RNAi pathways are involved in antiviral responses: the PIWI-interacting RNA (piRNA) and exogenous short interfering RNA (exo-siRNA) pathways, which are characterized by the production of virus-derived small RNAs of 25-29 and 21 nucleotides, respectively. The exo-siRNA pathway is considered to be the key mosquito antiviral response mechanism. In Aedes aegypti-derived cells, Zika virus (ZIKV)-specific siRNAs were produced and loaded into the exo-siRNA pathway effector protein Argonaute 2 (Ago2), although the knockdown of Ago2 did not enhance virus replication. Enhanced ZIKV replication was observed in a Dcr2-knockout cell line, suggesting that the exo-siRNA pathway is implicated in the antiviral response. Although ZIKV-specific piRNA-sized small RNAs were detected, these lacked the characteristic piRNA ping-pong signature motif and were bound to Ago3 but not Piwi5 or Piwi6. Silencing of PIWI proteins indicated that the knockdown of Ago3, Piwi5 or Piwi6 did not enhance ZIKV replication and only Piwi4 displayed antiviral activity. We also report that the expression of ZIKV capsid (C) protein amplified the replication of a reporter alphavirus, although, unlike yellow fever virus C protein, it does not inhibit the exo-siRNA pathway. Our findings elucidate ZIKV-mosquito RNAi interactions that are important for understanding its spread.
[Regulatory effect and mechanism of RNA binding motif protein 38 on the expression of progesterone receptor in human breast cancer ZR-75-1 cells].

Science.gov (United States)

Lou, P P; Li, C L; Xia, T S; Shi, L; Wu, J; Zhou, X J; Wang, Y; Ding, Q

2016-06-23

To investigate the regulatory mechanism of RNA binding motif protein 38 (RNPC1) on the expression of progesterone receptor (PR) in breast cancer cell line ZR-75-1. Lentiviral vector was used to induce overexpression of RNPC1 in ZR-75-1 cells. qRT-PCR and Western blot were used to assess the regulatory effect of RNPC1 on PR expression. Actinomycin was used to detect the regulatory mechanism involved. Immunohistochemical (IHC) staining was used to determine the protein expression of RNPC1 and PR in 80 breast cancer tissues. IHC staining showed that the expression of RNPC1 was significantly higher in the PR-positive breast cancer tissues than in the PR-negative breast cancer tissues (P<0.05). The qRT-PCR results showed that overexpression of RNPC1 in ZR-75-1 cells significantly upregulated the mRNA level of PR (1.764±0.028 vs. 1.001±0.037, P<0.01), whereas knockdown of RNPC1 did the opposite (0.579±0.007 vs. 1.000±0.002, P<0.01). The Western blot results also showed that overexpression of RNPC1 upregulated PR levels, while knockdown of RNPC1 resulted in downregulation of PR levels in the ZR-75-1 cells. The actinomycin assay showed that overexpression of RNPC1 increased the mRNA stability of PR. The half-life of PR mRNA was increased from 4.0 h to 6.5 h. Knockdown of RNPC1 decreased the mRNA stability of PR and the half-life of PR transcript was decreased from 4.1 h to 3.0 h. RNPC1 plays a crucial role in regulating the expression of PR in breast cancer ZR-75-1 cells.
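The half-lives reported above can be turned into decay rates and remaining fractions. The following is a minimal sketch that assumes simple first-order (exponential) mRNA decay after transcription is blocked; that kinetic model and the function names `decay_rate` and `fraction_remaining` are assumptions for illustration, not something stated in the record.

```python
import math

def decay_rate(half_life_h):
    """First-order decay constant (per hour) for a given half-life in hours."""
    return math.log(2) / half_life_h

def fraction_remaining(t_h, half_life_h):
    """Fraction of transcript left after t_h hours, assuming exponential decay."""
    return 0.5 ** (t_h / half_life_h)

# Half-lives quoted in the abstract: 4.0 h (control) and 6.5 h (RNPC1 overexpression).
for label, t_half in [("control", 4.0), ("RNPC1 overexpression", 6.5)]:
    print(label, round(decay_rate(t_half), 4), round(fraction_remaining(4.0, t_half), 3))
```

Under this assumption, a longer half-life means a smaller decay constant, so more PR transcript remains at any fixed time after the transcription block.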
Recent advances in developing small molecules targeting RNA.

Science.gov (United States)

Guan, Lirui; Disney, Matthew D

2012-01-20

RNAs are underexploited targets for small molecule drugs or chemical probes of function. This may be due, in part, to a fundamental lack of understanding of the types of small molecules that bind RNA specifically and the types of RNA motifs that specifically bind small molecules. In this review, we describe recent advances in the development and design of small molecules that bind to RNA and modulate function that aim to fill this void.
MicroRNA sequence motifs reveal asymmetry between the stem arms

DEFF Research Database (Denmark)

Gorodkin, Jan; Havgaard, Jakob Hull; Ensterö, M.

2006-01-01

The processing of micro RNAs (miRNAs) from their stemloop precursor has revealed asymmetry in the processing of the mature and its star sequence. Furthermore, the miRNA processing systems differ between organisms. To assess this at the sequence level we have investigated mature miRNAs in their gen...
Short interfering RNAs targeting a vampire-bat related rabies virus phosphoprotein mRNA.

Science.gov (United States)

Ono, Ekaterina Alexandrovna Durymanova; Taniwaki, Sueli Akemi; Brandão, Paulo

The aim of this study was to assess the in vitro and in vivo effects of short-interfering RNAs (siRNAs) against rabies virus phosphoprotein (P) mRNA in a post-infection treatment for rabies as an extension of a previous report (Braz J Microbiol. 2013 Nov 15;44(3):879-82). To this end, rabies virus strain RABV-4005 (related to the Desmodus rotundus vampire bat) was used to inoculate BHK-21 cells and mice, and the transfection with each of the siRNAs was made with Lipofectamine-2000™. In vitro results showed that siRNA 360 was able to inhibit the replication of strain RABV-4005 with a 1-log decrease in virus titer and a 5.16-fold reduction in P mRNA 24 h post-inoculation when compared to non-treated cells. In vivo, siRNA 360 was able to induce partial protection, but with no significant difference when compared to non-treated mice. These results indicate that, despite the need for improvement for in vivo applications, P mRNA might be a target for an RNAi-based treatment for rabies. Copyright © 2017 Sociedade Brasileira de Microbiologia. Published by Elsevier Editora Ltda. All rights reserved.
B Cell Receptor Activation Predominantly Regulates AKT-mTORC1/2 Substrates Functionally Related to RNA Processing.

Directory of Open Access Journals (Sweden)

Dara K Mohammad

Full Text Available Protein kinase B (AKT) phosphorylates numerous substrates on the consensus motif RXRXXpS/T, a docking site for 14-3-3 interactions. To identify novel AKT-induced phosphorylation events following B cell receptor (BCR) activation, we performed proteomics, biochemical and bioinformatics analyses. Phosphorylated consensus motif-specific antibody enrichment, followed by tandem mass spectrometry, identified 446 proteins, containing 186 novel phosphorylation events. Moreover, we found 85 proteins with up regulated phosphorylation, while in 277 it was down regulated following stimulation. Up regulation was mainly in proteins involved in ribosomal and translational regulation, DNA binding and transcription regulation. Conversely, down regulation was preferentially in RNA binding, mRNA splicing and mRNP export proteins. Immunoblotting of two identified RNA regulatory proteins, RBM25 and MEF-2D, confirmed the proteomics data. Consistent with these findings, the AKT-inhibitor (MK-2206) dramatically reduced, while the mTORC-inhibitor PP242 totally blocked phosphorylation on the RXRXXpS/T motif. This demonstrates that this motif, previously suggested as an AKT target sequence, also is a substrate for mTORC1/2. Proteins with PDZ, PH and/or SH3 domains contained the consensus motif, whereas in those with an HMG-box, H15 domains and/or NF-X1-zinc-fingers, the motif was absent. Proteins carrying the consensus motif were found in all eukaryotic clades indicating that they regulate a phylogenetically conserved set of proteins.
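A consensus such as RXRXXS/T can be located in a protein sequence with a regular expression. This is a rough sketch only: it finds candidate sites by sequence and says nothing about whether they are actually phosphorylated, and the function name `find_akt_sites` and the toy sequence are hypothetical, not taken from the study.

```python
import re

# R-X-R-X-X-[S/T]: Arg, any residue, Arg, two arbitrary residues, then Ser or Thr.
# The lookahead lets overlapping candidate sites all be reported.
AKT_CONSENSUS = re.compile(r"(?=(R.R..[ST]))")

def find_akt_sites(protein_seq):
    """Return (1-based position of the S/T residue, matched 6-mer) pairs."""
    seq = protein_seq.upper()
    return [(m.start() + 6, m.group(1)) for m in AKT_CONSENSUS.finditer(seq)]

# Hypothetical toy sequence, not a real protein.
toy = "MARERSRTSAAGRKRLLTPQ"
print(find_akt_sites(toy))  # [(8, 'RERSRT'), (18, 'RKRLLT')]
```

Reporting the position of the Ser/Thr residue, not the start of the match, keeps the output comparable with phosphosite annotations, which are usually indexed by the modified residue.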

A single-stranded architecture for cotranscriptional folding of RNA nanostructures

DEFF Research Database (Denmark)

Geary, Cody; Rothemund, Paul; Andersen, Ebbe Sloth

2014-01-01

Artificial DNA and RNA structures have been used as scaffolds for a variety of nanoscale devices. In comparison to DNA structures, RNA structures have been limited in size, but they also have advantages: RNA can fold during transcription and thus can be genetically encoded and expressed in cells....... We introduce an architecture for designing artificial RNA structures that fold from a single strand, in which arrays of antiparallel RNA helices are precisely organized by RNA tertiary motifs and a new type of crossover pattern. We constructed RNA tiles that assemble into hexagonal lattices...
Transmissible gastroenteritis coronavirus genome packaging signal is located at the 5' end of the genome and promotes viral RNA incorporation into virions in a replication-independent process.

Science.gov (United States)

Morales, Lucia; Mateos-Gomez, Pedro A; Capiscol, Carmen; del Palacio, Lorena; Enjuanes, Luis; Sola, Isabel

2013-11-01

Preferential RNA packaging in coronaviruses involves the recognition of viral genomic RNA, a crucial process for viral particle morphogenesis mediated by RNA-specific sequences, known as packaging signals. An essential packaging signal component of transmissible gastroenteritis coronavirus (TGEV) has been further delimited to the first 598 nucleotides (nt) from the 5' end of its RNA genome, by using recombinant viruses transcribing subgenomic mRNA that included potential packaging signals. The integrity of the entire sequence domain was necessary because deletion of any of the five structural motifs defined within this region abrogated specific packaging of this viral RNA. One of these RNA motifs was the stem-loop SL5, a highly conserved motif in coronaviruses located at nucleotide positions 106 to 136. Partial deletion or point mutations within this motif also abrogated packaging. Using TGEV-derived defective minigenomes replicated in trans by a helper virus, we have shown that TGEV RNA packaging is a replication-independent process. Furthermore, the last 494 nt of the genomic 3' end were not essential for packaging, although this region increased packaging efficiency. TGEV RNA sequences identified as necessary for viral genome packaging were not sufficient to direct packaging of a heterologous sequence derived from the green fluorescent protein gene. These results indicated that TGEV genome packaging is a complex process involving many factors in addition to the identified RNA packaging signal. The identification of well-defined RNA motifs within the TGEV RNA genome that are essential for packaging will be useful for designing packaging-deficient biosafe coronavirus-derived vectors and providing new targets for antiviral therapies.
The city as a motif in Slovene youth literature

Directory of Open Access Journals (Sweden)

Milena Mileva Blažić

2003-01-01

Full Text Available The article presents the city as a motif of Slovenian youth literature in four different periods, beginning in the first period of original Slovenian youth literature in the second half of the 19th century, second period in the first half of the 20th century, third period in the second half of the 20th century and after 1950, when significant books were produced in the field of short modern stories, with an emphasis on picture books and realistic narrative prose, and the fourth period after 1990. A discernible shift can be observed in the thirties of the 20th century, during the times of socialist realism. The most significant change occurred after 1960, when massive migration from rural to urban environments caused by industrialisation began. The motif of the urban environment especially marked modern realistic narrative, coined problematic narrative after 1990, with its focus on issues of growing up in such environments. The city as a motif or theme does not appear only in realistic narrative, but since the early 20th century also in fantastic narrative, thus it dichotomically presents the image of the real world in Slovenian youth realistic narrative.
Unlocked nucleic acids with a pyrene-modified uracil: Synthesis, hybridization studies, fluorescent properties and i-motif stability

DEFF Research Database (Denmark)

Perlíková, P.; Karlsen, K.K.; Pedersen, E.B.

2014-01-01

The synthesis of two new phosphoramidite building blocks for the incorporation of 5-(pyren-1-yl)uracilyl unlocked nucleic acid (UNA) monomers into oligonucleotides has been developed. Monomers containing a pyrene-modified nucleobase component were found to destabilize an i-motif structure at pH 5...... intensities upon hybridization to DNA or RNA. Efficient quenching of fluorescence of pyrene-modified UNA monomers was observed after formation of i-motif structures at pH 5.2. The stabilizing/destabilizing effect of pyrene-modified nucleic acids might be useful for designing antisense oligonucleotides...
Mature clustered, regularly interspaced, short palindromic repeats RNA (crRNA) length is measured by a ruler mechanism anchored at the precursor processing site.

Science.gov (United States)

Hatoum-Aslan, Asma; Maniv, Inbal; Marraffini, Luciano A

2011-12-27

Precise RNA processing is fundamental to all small RNA-mediated interference pathways. In prokaryotes, clustered, regularly interspaced, short palindromic repeats (CRISPR) loci encode small CRISPR RNAs (crRNAs) that protect against invasive genetic elements by antisense targeting. CRISPR loci are transcribed as a long precursor that is cleaved within repeat sequences by CRISPR-associated (Cas) proteins. In many organisms, this primary processing generates crRNA intermediates that are subject to additional nucleolytic trimming to render mature crRNAs of specific lengths. The molecular mechanisms underlying this maturation event remain poorly understood. Here, we defined the genetic requirements for crRNA primary processing and maturation in Staphylococcus epidermidis. We show that changes in the position of the primary processing site result in extended or diminished maturation to generate mature crRNAs of constant length. These results indicate that crRNA maturation occurs by a ruler mechanism anchored at the primary processing site. We also show that maturation is mediated by specific cas genes distinct from those genes involved in primary processing, showing that this event is directed by CRISPR/Cas loci.
Authentic interdomain communication in an RNA helicase reconstituted by expressed protein ligation of two helicase domains.

Science.gov (United States)

Karow, Anne R; Theissen, Bettina; Klostermeier, Dagmar

2007-01-01

RNA helicases mediate structural rearrangements of RNA or RNA-protein complexes at the expense of ATP hydrolysis. Members of the DEAD box helicase family consist of two flexibly connected helicase domains. They share nine conserved sequence motifs that are involved in nucleotide binding and hydrolysis, RNA binding, and helicase activity. Most of these motifs line the cleft between the two helicase domains, and extensive communication between them is required for RNA unwinding. The two helicase domains of the Bacillus subtilis RNA helicase YxiN were produced separately as intein fusions, and a functional RNA helicase was generated by expressed protein ligation. The ligated helicase binds adenine nucleotides with very similar affinities to the wild-type protein. Importantly, its intrinsically low ATPase activity is stimulated by RNA, and the Michaelis-Menten parameters are similar to those of the wild-type. Finally, ligated YxiN unwinds a minimal RNA substrate to an extent comparable to that of the wild-type helicase, confirming authentic interdomain communication.
Capturing microRNA targets using an RNA-induced silencing complex (RISC)-trap approach.

Science.gov (United States)

Cambronne, Xiaolu A; Shen, Rongkun; Auer, Paul L; Goodman, Richard H

2012-12-11

Identifying targets is critical for understanding the biological effects of microRNA (miRNA) expression. The challenge lies in characterizing the cohort of targets for a specific miRNA, especially when targets are being actively down-regulated in miRNA-RNA-induced silencing complex (RISC)-messenger RNA (mRNA) complexes. We have developed a robust and versatile strategy called RISCtrap to stabilize and purify targets from this transient interaction. Its utility was demonstrated by determining specific high-confidence target datasets for miR-124, miR-132, and miR-181 that contained known and previously unknown transcripts. Two previously unknown miR-132 targets identified with RISCtrap, adaptor protein CT10 regulator of kinase 1 (CRK1) and tight junction-associated protein 1 (TJAP1), were shown to be endogenously regulated by miR-132 in adult mouse forebrain. The datasets, moreover, differed in the number of targets and in the types and frequency of microRNA recognition element (MRE) motifs, thus revealing a previously underappreciated level of specificity in the target sets regulated by individual miRNAs.
Forced selection of a human immunodeficiency virus type 1 variant that uses a non-self tRNA primer for reverse transcription: Involvement of viral RNA sequences and the reverse transcriptase enzyme

NARCIS (Netherlands)

Abbink, Truus E. M.; Beerens, Nancy; Berkhout, Ben

2004-01-01

Human immunodeficiency virus type 1 uses the tRNA(3)(Lys) molecule as a selective primer for reverse transcription. This primer specificity is imposed by sequence complementarity between the tRNA primer and two motifs in the viral RNA genome: the primer-binding site (PBS) and the primer activation
CLIP-seq analysis of multi-mapped reads discovers novel functional RNA regulatory sites in the human transcriptome.

Science.gov (United States)

Zhang, Zijun; Xing, Yi

2017-09-19

Crosslinking or RNA immunoprecipitation followed by sequencing (CLIP-seq or RIP-seq) allows transcriptome-wide discovery of RNA regulatory sites. As CLIP-seq/RIP-seq reads are short, existing computational tools focus on uniquely mapped reads, while reads mapped to multiple loci are discarded. We present CLAM (CLIP-seq Analysis of Multi-mapped reads). CLAM uses an expectation-maximization algorithm to assign multi-mapped reads and calls peaks combining uniquely and multi-mapped reads. To demonstrate the utility of CLAM, we applied it to a wide range of public CLIP-seq/RIP-seq datasets involving numerous splicing factors, microRNAs and m6A RNA methylation. CLAM recovered a large number of novel RNA regulatory sites inaccessible by uniquely mapped reads. The functional significance of these sites was demonstrated by consensus motif patterns and association with alternative splicing (splicing factors), transcript abundance (AGO2) and mRNA half-life (m6A). CLAM provides a useful tool to discover novel protein-RNA interactions and RNA modification sites from CLIP-seq and RIP-seq data, and reveals the significant contribution of repetitive elements to the RNA regulatory landscape of the human transcriptome. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
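The idea of assigning multi-mapped reads with expectation-maximization can be sketched as follows. This is a generic, simplified EM over whole loci, offered as an assumption-laden illustration; it is not CLAM's actual implementation, and the function name `em_assign` and the toy read list are hypothetical.

```python
def em_assign(reads, n_iter=50):
    """Fractionally assign reads to loci with a basic EM loop.

    reads: list of lists; each inner list holds the distinct candidate loci
           a read maps to (length 1 for uniquely mapped reads).
    Returns a dict mapping each locus to its expected read count.
    """
    loci = sorted({locus for cands in reads for locus in cands})
    # Start from a uniform abundance estimate over all loci.
    theta = {locus: 1.0 / len(loci) for locus in loci}
    counts = {}
    for _ in range(n_iter):
        counts = {locus: 0.0 for locus in loci}
        # E-step: split each read among its candidates in proportion to theta.
        for cands in reads:
            total = sum(theta[locus] for locus in cands)
            for locus in cands:
                counts[locus] += theta[locus] / total
        # M-step: re-estimate abundances from the expected counts.
        n_reads = sum(counts.values())
        theta = {locus: counts[locus] / n_reads for locus in loci}
    return counts

# Toy data: three unique reads at A, one at B, one read mapping to both.
toy_reads = [["A"], ["A"], ["A"], ["B"], ["A", "B"]]
print(em_assign(toy_reads))
```

In the toy data the shared read ends up mostly at locus A, because the uniquely mapped reads indicate that A is more abundant; discarding the read entirely would lose that signal.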
Structural and Functional Motifs in Influenza Virus RNAs

Directory of Open Access Journals (Sweden)

Damien Ferhadian

2018-03-01

have now been validated experimentally and their role in the viral life cycle demonstrated. This review aims to compile the structural motifs found in the different RNA classes (vRNA, cRNA, and vmRNA of influenza viruses and their function in the viral replication cycle.
The MHC motif viewer: a visualization tool for MHC binding motifs

DEFF Research Database (Denmark)

Rapin, Nicolas; Hoof, Ilka; Lund, Ole

2010-01-01

is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...
Alanine substitutions in the GXXXG motif alter C99 cleavage by γ-secretase but not its dimerization.

Science.gov (United States)

Higashide, Hidekazu; Ishihara, Seiko; Nobuhara, Mika; Ihara, Yasuo; Funamoto, Satoru

2017-03-01

The amyloid β (Aβ) protein is a major component of senile plaques, one of the neuropathological hallmarks of Alzheimer's disease. Amyloidogenic processing of amyloid precursor protein (APP) by β- and γ-secretases leads to production of Aβ. APP contains tandem triple repeats of the GXXXG motif in its extracellular juxtamembrane and transmembrane regions. It is reported that the GXXXG motif is related to protein-protein interactions, but it remains controversial whether the GXXXG motif in APP is involved in substrate dimerization and whether dimerization affects γ-secretase-dependent cleavage. Therefore, the relationship between the GXXXG motifs, substrate dimerization, and γ-secretase-dependent cleavage sites remains unclear. Here, we applied blue native polyacrylamide gel electrophoresis to examine the effect of alanine substitutions within the GXXXG motifs of APP carboxyl terminal fragment (C99) on its dimerization and Aβ production. Surprisingly, alanine substitutions in the motif failed to alter C99 dimerization in the detergent-soluble state. Cell-based and solubilized γ-secretase assays demonstrated that increasing alanine substitutions in the motif tended to decrease long Aβ species such as Aβ42 and Aβ43 and concomitantly to increase short Aβ species. Our data suggest that the GXXXG motif is crucial for Aβ production, but not for C99 dimerization. © 2016 International Society for Neurochemistry.
Short hairpin RNA-mediated knockdown of protein expression in Entamoeba histolytica

Directory of Open Access Journals (Sweden)

Singh Upinder

2009-02-01

Full Text Available Abstract Background Entamoeba histolytica is an intestinal protozoan parasite of humans. The genome has been sequenced, but the study of individual gene products has been hampered by the lack of the ability to generate gene knockouts. We chose to test the use of RNA interference to knock down gene expression in Entamoeba histolytica. Results An episomal vector-based system, using the E. histolytica U6 promoter to drive expression of 29-basepair short hairpin RNAs, was developed to target protein-encoding genes in E. histolytica. The short hairpin RNAs successfully knocked down protein levels of all three unrelated genes tested with this system: Igl, the intermediate subunit of the galactose- and N-acetyl-D-galactosamine-inhibitable lectin; the transcription factor URE3-BP; and the membrane binding protein EhC2A. Igl levels were reduced by 72%, URE3-BP by 89%, and EhC2A by 97%. Conclusion Use of the U6 promoter to drive expression of 29-basepair short hairpin RNAs is effective at knocking down protein expression for unrelated genes in Entamoeba histolytica, providing a useful tool for the study of this parasite.
Short hairpin RNA-mediated knockdown of protein expression in Entamoeba histolytica.

Science.gov (United States)

Linford, Alicia S; Moreno, Heriberto; Good, Katelyn R; Zhang, Hanbang; Singh, Upinder; Petri, William A

2009-02-17

Entamoeba histolytica is an intestinal protozoan parasite of humans. The genome has been sequenced, but the study of individual gene products has been hampered by the lack of the ability to generate gene knockouts. We chose to test the use of RNA interference to knock down gene expression in Entamoeba histolytica. An episomal vector-based system, using the E. histolytica U6 promoter to drive expression of 29-basepair short hairpin RNAs, was developed to target protein-encoding genes in E. histolytica. The short hairpin RNAs successfully knocked down protein levels of all three unrelated genes tested with this system: Igl, the intermediate subunit of the galactose- and N-acetyl-D-galactosamine-inhibitable lectin; the transcription factor URE3-BP; and the membrane binding protein EhC2A. Igl levels were reduced by 72%, URE3-BP by 89%, and EhC2A by 97%. Use of the U6 promoter to drive expression of 29-basepair short hairpin RNAs is effective at knocking down protein expression for unrelated genes in Entamoeba histolytica, providing a useful tool for the study of this parasite.
Engineering a Functional Small RNA Negative Autoregulation Network with Model-Guided Design.

Science.gov (United States)

Hu, Chelsea Y; Takahashi, Melissa K; Zhang, Yan; Lucks, Julius B

2018-05-22

RNA regulators are powerful components of the synthetic biology toolbox. Here, we expand the repertoire of synthetic gene networks built from these regulators by constructing a transcriptional negative autoregulation (NAR) network out of small RNAs (sRNAs). NAR network motifs are core motifs of natural genetic networks, and are known for reducing network response time and steady state signal. Here we use cell-free transcription-translation (TX-TL) reactions and a computational model to design and prototype sRNA NAR constructs. Using parameter sensitivity analysis, we design a simple set of experiments that allow us to accurately predict NAR function in TX-TL. We transfer successful network designs into Escherichia coli and show that our sRNA transcriptional network reduces both network response time and steady-state gene expression. This work broadens our ability to construct increasingly sophisticated RNA genetic networks with predictable function.
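The two properties attributed to negative autoregulation, faster response and lower steady state, can be seen in a toy simulation. The model below is a generic textbook one-variable Hill-repression ODE integrated with forward Euler; it is an assumption for illustration and not the authors' TX-TL model, and all parameter values and function names are hypothetical.

```python
def simulate(beta=1.0, delta=0.1, K=1.0, n=2, repress=True, dt=0.01, t_end=200.0):
    """Integrate dx/dt = production - delta * x with forward Euler.

    With repress=True, production = beta / (1 + (x / K) ** n) (self-repression);
    otherwise production = beta (unregulated control).
    """
    x, xs = 0.0, [0.0]
    for _ in range(round(t_end / dt)):
        production = beta / (1.0 + (x / K) ** n) if repress else beta
        x += dt * (production - delta * x)
        xs.append(x)
    return xs

def half_rise_time(xs, dt=0.01):
    """Time at which the trajectory first reaches half of its final value."""
    target = 0.5 * xs[-1]
    return next(i for i, x in enumerate(xs) if x >= target) * dt

nar = simulate(repress=True)
unregulated = simulate(repress=False)
print("steady state:", round(nar[-1], 3), "vs", round(unregulated[-1], 3))
print("half-rise time:", half_rise_time(nar), "vs", half_rise_time(unregulated))
```

With these hypothetical parameters the self-repressed system settles at a lower level and reaches half of that level sooner than the unregulated control, which matches the qualitative behaviour the abstract describes.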
RNAcontext: a new method for learning the sequence and structure binding preferences of RNA-binding proteins.

Directory of Open Access Journals (Sweden)

Hilal Kazan

2010-07-01

Full Text Available Metazoan genomes encode hundreds of RNA-binding proteins (RBPs). These proteins regulate post-transcriptional gene expression and have critical roles in numerous cellular processes including mRNA splicing, export, stability and translation. Despite their ubiquity and importance, the binding preferences for most RBPs are not well characterized. In vitro and in vivo studies, using affinity selection-based approaches, have successfully identified RNA sequences associated with specific RBPs; however, it is difficult to infer RBP sequence and structural preferences without specifically designed motif finding methods. In this study, we introduce a new motif-finding method, RNAcontext, designed to elucidate RBP-specific sequence and structural preferences with greater accuracy than existing approaches. We evaluated RNAcontext on recently published in vitro and in vivo RNA affinity selected data and demonstrate that RNAcontext identifies known binding preferences for several control proteins including HuR, PTB, and Vts1p and predicts new RNA structure preferences for SF2/ASF, RBM4, FUSIP1 and SLM2. The predicted preferences for SF2/ASF are consistent with its recently reported in vivo binding sites. RNAcontext is an accurate and efficient motif finding method ideally suited for using large-scale RNA-binding affinity datasets to determine the relative binding preferences of RBPs for a wide range of RNA sequences and structures.
[Coffee and Cocoa in the Creation of Distinctive Jember Batik Motifs] (Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember)

Directory of Open Access Journals (Sweden)

Irfa'ina Rohana Salma

2015-06-01

Full Text Available ABSTRACT Batik Jember has long been synonymous with the tobacco leaf motif. The visual rendering of the tobacco leaf in Batik Jember motifs is rather weak and lacks character, because the resulting motif looks like a picture of leaves in general. It is therefore necessary to create distinctive Jember batik motif designs whose inspiration is drawn from other natural resources of Jember that have specific shapes and characteristics, so that a stronger motif identity can be obtained. These typical natural products of Jember are coffee and cocoa. The purpose of this art creation is to produce new batik motifs that are distinctive of Jember. The methods used are data collection, in-depth observation of the objects of creation, study of the sources of inspiration, motif design, and realization as batik. This work produced six batik motifs: (1) Motif Uwoh Kopi; (2) Motif Godong Kopi; (3) Motif Ceplok Kakao; (4) Motif Kakao Raja; (5) Motif Kakao Biru; and (6) Motif Wiji Mukti. According to the "Aesthetic Taste" assessment, the most widely preferred motifs are Motif Uwoh Kopi and Motif Kakao Raja. Keywords: Motif Woh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti
Transmissible Gastroenteritis Coronavirus Genome Packaging Signal Is Located at the 5′ End of the Genome and Promotes Viral RNA Incorporation into Virions in a Replication-Independent Process

Science.gov (United States)

Morales, Lucia; Mateos-Gomez, Pedro A.; Capiscol, Carmen; del Palacio, Lorena; Sola, Isabel

2013-01-01

Preferential RNA packaging in coronaviruses involves the recognition of viral genomic RNA, a crucial process for viral particle morphogenesis mediated by RNA-specific sequences, known as packaging signals. An essential packaging signal component of transmissible gastroenteritis coronavirus (TGEV) has been further delimited to the first 598 nucleotides (nt) from the 5′ end of its RNA genome, by using recombinant viruses transcribing subgenomic mRNA that included potential packaging signals. The integrity of the entire sequence domain was necessary because deletion of any of the five structural motifs defined within this region abrogated specific packaging of this viral RNA. One of these RNA motifs was the stem-loop SL5, a highly conserved motif in coronaviruses located at nucleotide positions 106 to 136. Partial deletion or point mutations within this motif also abrogated packaging. Using TGEV-derived defective minigenomes replicated in trans by a helper virus, we have shown that TGEV RNA packaging is a replication-independent process. Furthermore, the last 494 nt of the genomic 3′ end were not essential for packaging, although this region increased packaging efficiency. TGEV RNA sequences identified as necessary for viral genome packaging were not sufficient to direct packaging of a heterologous sequence derived from the green fluorescent protein gene. These results indicated that TGEV genome packaging is a complex process involving many factors in addition to the identified RNA packaging signal. The identification of well-defined RNA motifs within the TGEV RNA genome that are essential for packaging will be useful for designing packaging-deficient biosafe coronavirus-derived vectors and providing new targets for antiviral therapies. PMID:23966403
The NTP-binding motif in cowpea mosaic virus B polyprotein is essential for viral replication

NARCIS (Netherlands)

Peters, S A; Verver, J; Nollen, E A; van Lent, J W; Wellink, J; van Kammen, A

1994-01-01

We have assessed the functional importance of the NTP-binding motif (NTBM) in the cowpea mosaic virus (CPMV) B-RNA-encoded 58K domain by changing two conserved amino acids within the consensus A and B sites (GKSRTGK500S and MDD545, respectively). Both Lys-500 to Thr and Asp-545 to Pro substitutions
Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias.

Science.gov (United States)

Kjær, Jonas; Belsham, Graham J

2018-01-01

Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N16P17G18/P19, where P19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E14, S15, and N16 within the 2A sequence of infectious FMDVs, but no variants at residues P17, G18, or P19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P17 and P19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
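To put the reported codon preference in context, it helps to count how many different RNA sequences could encode NPGP. The sketch below enumerates them from the standard genetic code (Asn: AAU/AAC; Pro: CCN; Gly: GGN); the helper name `encodings` is hypothetical, and nothing here reproduces the study's own analysis.

```python
from itertools import product

# Synonymous codons in the standard genetic code for the residues in NPGP.
SYNONYMOUS_CODONS = {
    "N": ["AAU", "AAC"],
    "P": ["CCU", "CCC", "CCA", "CCG"],
    "G": ["GGU", "GGC", "GGA", "GGG"],
}

def encodings(peptide):
    """List every RNA sequence that encodes the given peptide."""
    codon_choices = (SYNONYMOUS_CODONS[aa] for aa in peptide)
    return ["".join(codons) for codons in product(*codon_choices)]

npgp = encodings("NPGP")
print(len(npgp))  # 2 * 4 * 4 * 4 = 128
```

A clear preference for one nucleotide sequence out of 128 synonymous possibilities is what makes the observation notable: selection appears to act on the RNA sequence as well as on the encoded amino acids.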

A folding algorithm for extended RNA secondary structures.

Science.gov (United States)

Höner zu Siederdissen, Christian; Bernhart, Stephan H; Stadler, Peter F; Hofacker, Ivo L

2011-07-01

RNA secondary structure contains many non-canonical base pairs of different pair families. Successful prediction of these structural features leads to improved secondary structures with applications in tertiary structure prediction and simultaneous folding and alignment. We present a theoretical model capturing both RNA pair families and extended secondary structure motifs with shared nucleotides using 2-diagrams. We accompany this model with a number of programs for parameter optimization and structure prediction. All sources (optimization routines, RNA folding, RNA evaluation, extended secondary structure visualization) are published under the GPLv3 and available at www.tbi.univie.ac.at/software/rnawolf/.
CombiMotif: A new algorithm for network motifs discovery in protein-protein interaction networks

Science.gov (United States)

Luo, Jiawei; Li, Guanghui; Song, Dan; Liang, Cheng

2014-12-01

Discovering motifs in protein-protein interaction networks has become a major challenge in computational biology, since the distribution of the number of network motifs can reveal significant systemic differences among species. However, this task can be computationally expensive because it involves graph isomorphism detection. In this paper, we present a new algorithm (CombiMotif) that incorporates combinatorial techniques to count non-induced occurrences of subgraph topologies in the form of trees. The efficiency of our algorithm is demonstrated by comparing the obtained results with the current state-of-the-art subgraph counting algorithms. We also show major differences between unicellular and multicellular organisms. The datasets and source code of CombiMotif are freely available upon request.
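
The abstract above does not describe how CombiMotif works internally, so the following is only a minimal sketch of the general idea of counting non-induced tree-shaped subgraphs combinatorially rather than by enumeration. It handles the simplest tree topology, a star with k leaves, for which the count is the sum over nodes of "degree choose k". The function name `count_stars` and the toy edge list are assumptions made for illustration.

```python
from collections import defaultdict
from math import comb


def count_stars(edges, k):
    """Count non-induced stars with k leaves (k >= 2) in an undirected simple graph.

    A star is a tree with one centre and k leaves. Any k distinct neighbours
    of a node form one non-induced star centred on that node, so the total
    is sum(C(deg(v), k)) over all nodes v. No isomorphism test is needed.
    """
    neighbors = defaultdict(set)
    for u, v in edges:
        if u != v:  # ignore self-loops
            neighbors[u].add(v)
            neighbors[v].add(u)
    return sum(comb(len(nbrs), k) for nbrs in neighbors.values())


# Hypothetical toy interaction network (not data from the paper).
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C")]
three_node_paths = count_stars(toy_edges, 2)  # a 2-leaf star is a 3-node path
claws = count_stars(toy_edges, 3)
print(three_node_paths, claws)
```

Because the count is non-induced, the triangle A-B-C contributes three 3-node paths even though its end nodes are themselves connected.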
MSDmotif: exploring protein sites and motifs

Directory of Open Access Journals (Sweden)

Henrick Kim

2008-07-01

Background: Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug design. The Protein Data Bank (PDB) is the source of this information; however, present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results: We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs, together with the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif association searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino acid within a motif, correlation of amino acid side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the Distributed Annotation System (DAS) protocol. An additional entry point facilitates XML requests with XML responses. Conclusion: MSDmotif is unique in combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.
LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

Science.gov (United States)

Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

2014-02-17

As fundamental genomic elements, meiotic recombination hotspots play important roles in life sciences. Thus uncovering their regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit to SNPs inside an established 7-mer motif bound by PRDM9, we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots, each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of
Defining RNA-Small Molecule Affinity Landscapes Enables Design of a Small Molecule Inhibitor of an Oncogenic Noncoding RNA.

Science.gov (United States)

Velagapudi, Sai Pradeep; Luo, Yiling; Tran, Tuan; Haniff, Hafeez S; Nakai, Yoshio; Fallahi, Mohammad; Martinez, Gustavo J; Childs-Disney, Jessica L; Disney, Matthew D

2017-03-22

RNA drug targets are pervasive in cells, but methods to design small molecules that target them are sparse. Herein, we report a general approach to score the affinity and selectivity of RNA motif-small molecule interactions identified via selection. Named High Throughput Structure-Activity Relationships Through Sequencing (HiT-StARTS), HiT-StARTS is statistical in nature and compares input nucleic acid sequences to selected library members that bind a ligand via high throughput sequencing. The approach allowed facile definition of the fitness landscape of hundreds of thousands of RNA motif-small molecule binding partners. These results were mined against folded RNAs in the human transcriptome and identified an avid interaction between a small molecule and the Dicer nuclease-processing site in the oncogenic microRNA (miR)-18a hairpin precursor, which is a member of the miR-17-92 cluster. Application of the small molecule, Targapremir-18a, to prostate cancer cells inhibited production of miR-18a from the cluster, de-repressed serine/threonine protein kinase 4 protein (STK4), and triggered apoptosis. Profiling the cellular targets of Targapremir-18a via Chemical Cross-Linking and Isolation by Pull Down (Chem-CLIP), a covalent small molecule-RNA cellular profiling approach, and other studies showed specific binding of the compound to the miR-18a precursor, revealing broadly applicable factors that govern small molecule drugging of noncoding RNAs.
RNA STRAND: The RNA Secondary Structure and Statistical Analysis Database

Directory of Open Access Journals (Sweden)

Andronescu Mirela

2008-08-01

Background: The ability to access, search and analyse secondary structures of a large set of known RNA molecules is very important for deriving improved RNA energy models, for evaluating computational predictions of RNA secondary structures and for a better understanding of RNA folding. Currently there is no database that can easily provide these capabilities for almost all RNA molecules with known secondary structures. Results: In this paper we describe RNA STRAND – the RNA secondary STRucture and statistical ANalysis Database, a curated database containing known secondary structures of any type and organism. Our new database provides a wide collection of known RNA secondary structures drawn from public databases, searchable and downloadable in a common format. Comprehensive statistical information on the secondary structures in our database is provided using the RNA Secondary Structure Analyser, a new tool we have developed to analyse RNA secondary structures. The information thus obtained is valuable for understanding to what extent and with what probability certain structural motifs can appear. We outline several ways in which the data provided in RNA STRAND can facilitate research on RNA structure, including the improvement of RNA energy models and evaluation of secondary structure prediction programs. In order to keep up to date with new RNA secondary structure experiments, we offer the necessary tools to add solved RNA secondary structures to our database and invite researchers to contribute to RNA STRAND. Conclusion: RNA STRAND is a carefully assembled database of trusted RNA secondary structures, with easy on-line tools for searching, analyzing and downloading user-selected entries, and is publicly available at http://www.rnasoft.ca/strand.
Lentiviral Delivery of a Vesicular Glutamate Transporter 1 (VGLUT1)-Targeting Short Hairpin RNA Vector Into the Mouse Hippocampus Impairs Cognition

NARCIS (Netherlands)

King, Madeleine V.; Kurian, Nisha; Qin, Si; Papadopoulou, Nektaria; Westerink, Ben H. C.; Cremers, Thomas I.; Epping-Jordan, Mark P.; Le Poul, Emmanuel; Ray, David E.; Fone, Kevin C. F.; Kendall, David A.; Marsden, Charles A.; Sharp, Tyson V.

Glutamate is the principal excitatory neurotransmitter in the mammalian brain, and dysregulation of glutamatergic neurotransmission is implicated in the pathophysiology of several psychiatric and neurological diseases. This study utilized novel lentiviral short hairpin RNA (shRNA) vectors to target
An optimized lentiviral vector system for conditional RNAi and efficient cloning of microRNA embedded short hairpin RNA libraries.

Science.gov (United States)

Adams, Felix F; Heckl, Dirk; Hoffmann, Thomas; Talbot, Steven R; Kloos, Arnold; Thol, Felicitas; Heuser, Michael; Zuber, Johannes; Schambach, Axel; Schwarzer, Adrian

2017-09-01

RNA interference (RNAi) and CRISPR-Cas9-based screening systems have emerged as powerful and complementary tools to unravel genetic dependencies through systematic gain- and loss-of-function studies. In recent years, a series of technical advances helped to enhance the performance of virally delivered RNAi. For instance, the incorporation of short hairpin RNAs (shRNAs) into endogenous microRNA contexts (shRNAmiRs) allows the use of Tet-regulated promoters for synchronous onset of gene knockdown and precise interrogation of gene dosage effects. However, remaining challenges include lack of efficient cloning strategies, inconsistent knockdown potencies and leaky expression. Here, we present a simple, one-step cloning approach for rapid and efficient cloning of miR-30 shRNAmiR libraries. We combined a human miR-30 backbone retaining native flanking sequences with an optimized all-in-one lentiviral vector system for conditional RNAi to generate a versatile toolbox characterized by higher doxycycline sensitivity, reduced leakiness and enhanced titer. Furthermore, refinement of existing shRNA design rules resulted in substantially improved prediction of powerful shRNAs. Our approach was validated by accurate quantification of the knockdown potency of over 250 single shRNAmiRs. To facilitate access and use by the scientific community, an online tool was developed for the automated design of refined shRNA-coding oligonucleotides ready for cloning into our system. Copyright © 2017 Elsevier Ltd. All rights reserved.
Reverse Transcription Errors and RNA-DNA Differences at Short Tandem Repeats.

Science.gov (United States)

Fungtammasan, Arkarachai; Tomaszkiewicz, Marta; Campos-Sánchez, Rebeca; Eckert, Kristin A; DeGiorgio, Michael; Makova, Kateryna D

2016-10-01

Transcript variation has important implications for organismal function in health and disease. Most transcriptome studies focus on assessing variation in gene expression levels and isoform representation. Variation at the level of transcript sequence is caused by RNA editing and transcription errors, and leads to nongenetically encoded transcript variants, or RNA-DNA differences (RDDs). Such variation has been understudied, in part because its detection is obscured by reverse transcription (RT) and sequencing errors. It has only been evaluated for intertranscript base substitution differences. Here, we investigated transcript sequence variation for short tandem repeats (STRs). We developed the first maximum-likelihood estimator (MLE) to infer RT error and RDD rates, taking next generation sequencing error rates into account. Using the MLE, we empirically evaluated RT error and RDD rates for STRs in a large-scale DNA and RNA replicated sequencing experiment conducted in a primate species. The RT error rates increased exponentially with STR length and were biased toward expansions. The RDD rates were approximately 1 order of magnitude lower than the RT error rates. The RT error rates estimated with the MLE from a primate data set were concordant with those estimated with an independent method, barcoded RNA sequencing, from a Caenorhabditis elegans data set. Our results have important implications for medical genomics, as STR allelic variation is associated with >40 diseases. STR nonallelic transcript variation can also contribute to disease phenotype. The MLE and empirical rates presented here can be used to evaluate the probability of disease-associated transcripts arising due to RDD. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures

Science.gov (United States)

Sloma, Michael F.; Mathews, David H.

2016-01-01

RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. PMID:27852924
Statistical tests to compare motif count exceptionalities

Directory of Open Access Journals (Sweden)

Vandewalle Vincent

2007-03-01

Background: Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results: We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion: The exact binomial test is particularly suited to small counts. For large counts, we advise using the likelihood ratio test, which is asymptotic but strongly correlated with the exact binomial test and very simple to use.
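
The exact binomial test mentioned in this abstract can be illustrated with a textbook construction for comparing two Poisson counts. This is a sketch under stated assumptions, not necessarily the authors' exact formulation: if the counts in the two sequences are independent Poisson variables whose means are proportional to their expected counts, then, conditional on the total, the first count is binomial with success probability expected1 / (expected1 + expected2). The function name and the input numbers below are hypothetical, and overlapping occurrences are not modeled.

```python
from math import comb


def binomial_upper_tail(count1, count2, expected1, expected2):
    """One-sided exact binomial p-value for 'the motif is more exceptional in sequence 1'.

    Assumption: under the null hypothesis both counts are Poisson with means
    proportional to expected1 and expected2, so that, given the total
    n = count1 + count2, count1 ~ Binomial(n, expected1 / (expected1 + expected2)).
    Returns P(X >= count1) under that binomial law.
    """
    n = count1 + count2
    p = expected1 / (expected1 + expected2)
    return sum(comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(count1, n + 1))


# Hypothetical example: 3 occurrences versus 0, with equal expected counts.
p_value = binomial_upper_tail(3, 0, expected1=1.0, expected2=1.0)
print(p_value)
```

The sum uses exact binomial coefficients, which is why this form suits small counts; for large counts the abstract recommends the asymptotic likelihood ratio test instead.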
A G-C-rich palindromic structural motif and a stretch of single-stranded purines are required for optimal packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA.

Science.gov (United States)

Jaballah, Soumeya Ali; Aktar, Suriya J; Ali, Jahabar; Phillip, Pretty Susan; Al Dhaheri, Noura Salem; Jabeen, Aayesha; Rizvi, Tahir A

2010-09-03

During retroviral RNA packaging, two copies of genomic RNA are preferentially packaged into the budding virus particles whereas the spliced viral RNAs and the cellular RNAs are excluded during this process. Specificity towards retroviral RNA packaging is dependent upon sequences at the 5' end of the viral genome, which at times extend into Gag sequences. It has earlier been suggested that the Mason-Pfizer monkey virus (MPMV) contains packaging sequences within the 5' untranslated region (UTR) and Gag. These studies have also suggested that the packaging determinants of MPMV that lie in the UTR are bipartite and are divided into two regions both upstream and downstream of the major splice donor. However, the precise boundaries of these discontinuous regions within the UTR and the role of the intervening sequences between these bipartite sequences towards MPMV packaging have not been investigated. Employing a combination of genetic and structural prediction analyses, we have shown that region "A", immediately downstream of the primer binding site, is composed of 50 nt, whereas region "B" is composed of the last 23 nt of UTR, and the intervening 55 nt between these two discontinuous regions do not contribute towards MPMV RNA packaging. In addition, we have identified a 14-nt G-C-rich palindromic sequence (with 100% autocomplementarity) within region A that has been predicted to fold into a structural motif and is essential for optimal MPMV RNA packaging. Furthermore, we have also identified a stretch of single-stranded purines (ssPurines) within the UTR, and 8 nt of these ssPurines are duplicated in region B. The native ssPurines or their repeat in region B, when predicted to refold as ssPurines, has been shown to be essential for RNA packaging, possibly functioning as a potential nucleocapsid binding site. Findings from this study should enhance our understanding of the steps involved in MPMV replication, including the RNA encapsidation process. Copyright (c) 2010 Elsevier Ltd
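
A palindromic sequence with 100% autocomplementarity, as described above, is one that equals its own reverse complement. A minimal sketch of that check for RNA follows. The helper names and the toy sequence are invented for illustration; the sequence is not the MPMV sequence.

```python
# Watson-Crick complement table for RNA (wobble G-U pairs are deliberately ignored).
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def reverse_complement(rna):
    """Return the reverse complement of an RNA string."""
    return rna.upper().translate(COMPLEMENT)[::-1]


def is_autocomplementary(rna):
    """True if the sequence is identical to its own reverse complement."""
    return rna.upper() == reverse_complement(rna)


def find_palindromes(rna, length):
    """List (start, subsequence) for every autocomplementary window of the given length."""
    return [
        (i, rna[i:i + length])
        for i in range(len(rna) - length + 1)
        if is_autocomplementary(rna[i:i + length])
    ]


# Invented toy sequence containing the G-C-rich palindrome GGCGCC.
toy_rna = "AAGGCGCCUU"
print(find_palindromes(toy_rna, 6))
```

A perfect palindrome of even length can pair with a second copy of itself over its whole length, or fold back on itself as a hairpin with a minimal loop. Which of these occurs in practice depends on structure prediction and experiment, as in the study above.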
An Active Immune Defense with a Minimal CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) RNA and without the Cas6 Protein*

Science.gov (United States)

Maier, Lisa-Katharina; Stachler, Aris-Edda; Saunders, Sita J.; Backofen, Rolf; Marchfelder, Anita

2015-01-01

The prokaryotic immune system CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR-associated) is a defense system that protects prokaryotes against foreign DNA. The short CRISPR RNAs (crRNAs) are central components of this immune system. In CRISPR-Cas systems type I and III, crRNAs are generated by the endonuclease Cas6. We developed a Cas6b-independent crRNA maturation pathway for the Haloferax type I-B system in vivo that expresses a functional crRNA, which we termed independently generated crRNA (icrRNA). The icrRNA is effective in triggering degradation of an invader plasmid carrying the matching protospacer sequence. The Cas6b-independent maturation of the icrRNA allowed mutation of the repeat sequence without interfering with signals important for Cas6b processing. We generated 23 variants of the icrRNA and analyzed them for activity in the interference reaction. icrRNAs with deletions or mutations of the 3′ handle are still active in triggering an interference reaction. The complete 3′ handle could be removed without loss of activity. However, manipulations of the 5′ handle mostly led to loss of interference activity. Furthermore, we could show that in the presence of an icrRNA a strain without Cas6b (Δcas6b) is still active in interference. PMID:25512373
WildSpan: mining structured motifs from protein sequences

Directory of Open Access Journals (Sweden)

Chen Chien-Yu

2011-03-01

Background: Automatic extraction of motifs from biological sequences is an important research problem in the study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards) are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions) that incorporates several pruning strategies to largely reduce the mining cost. Results: WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode
Identity and functions of CxxC-derived motifs.

Science.gov (United States)

Fomenko, Dmitri E; Gladyshev, Vadim N

2003-09-30

Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.
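
The CxxC-derived motifs named in this abstract are short sequence patterns, so locating candidate sites in a protein sequence can be sketched with regular expressions. The code below covers only the sequence-pattern step; it does not reproduce the authors' search, which also required a downstream alpha-helix. The function name and the toy protein string are assumptions for illustration.

```python
import re

# One-letter patterns for the motifs named in the abstract; "." is any residue.
MOTIF_PATTERNS = {
    "CxxC": "C..C",
    "CxxS": "C..S",
    "CxxT": "C..T",
    "TxxC": "T..C",
    "SxxC": "S..C",
}


def find_cxxc_derived_motifs(protein):
    """Return sorted (start, motif_name, matched_residues) tuples.

    A lookahead is used so that overlapping occurrences are all reported.
    Positions are 0-based.
    """
    hits = []
    for name, pattern in MOTIF_PATTERNS.items():
        for match in re.finditer(f"(?=({pattern}))", protein):
            hits.append((match.start(), name, match.group(1)))
    return sorted(hits)


# Invented toy sequence, not a real redox protein.
toy_protein = "MACGPCKTAACS"
for hit in find_cxxc_derived_motifs(toy_protein):
    print(hit)
```

Patterns this short match by chance very often, which is presumably why the study combined them with a structural criterion.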
Phytophthora have distinct endogenous small RNA populations that include short interfering and microRNAs.

Directory of Open Access Journals (Sweden)

Noah Fahlgren

In eukaryotes, RNA silencing pathways utilize 20-30-nucleotide small RNAs to regulate gene expression, specify and maintain chromatin structure, and repress viruses and mobile genetic elements. RNA silencing was likely present in the common ancestor of modern eukaryotes, but most research has focused on plant and animal RNA silencing systems. Phytophthora species belong to a phylogenetically distinct group of economically important plant pathogens that cause billions of dollars in yield losses annually as well as ecologically devastating outbreaks. We analyzed the small RNA-generating components of the genomes of P. infestans, P. sojae and P. ramorum using bioinformatics, genetic, phylogenetic and high-throughput sequencing-based methods. Each species produces two distinct populations of small RNAs that are predominantly 21- or 25-nucleotides long. The 25-nucleotide small RNAs were primarily derived from loci encoding transposable elements and we propose that these small RNAs define a pathway of short-interfering RNAs that silence repetitive genetic elements. The 21-nucleotide small RNAs were primarily derived from inverted repeats, including a novel microRNA family that is conserved among the three species, and several gene families, including Crinkler effectors and type III fibronectins. The Phytophthora microRNA is predicted to target a family of amino acid/auxin permeases, and we propose that 21-nucleotide small RNAs function at the post-transcriptional level. The functional significance of microRNA-guided regulation of amino acid/auxin permeases and the association of 21-nucleotide small RNAs with Crinkler effectors remains unclear, but this work provides a framework for testing the role of small RNAs in Phytophthora biology and pathogenesis in future work.
Phytophthora Have Distinct Endogenous Small RNA Populations That Include Short Interfering and microRNAs

Science.gov (United States)

Fahlgren, Noah; Bollmann, Stephanie R.; Kasschau, Kristin D.; Cuperus, Josh T.; Press, Caroline M.; Sullivan, Christopher M.; Chapman, Elisabeth J.; Hoyer, J. Steen; Gilbert, Kerrigan B.; Grünwald, Niklaus J.; Carrington, James C.

2013-01-01

In eukaryotes, RNA silencing pathways utilize 20-30-nucleotide small RNAs to regulate gene expression, specify and maintain chromatin structure, and repress viruses and mobile genetic elements. RNA silencing was likely present in the common ancestor of modern eukaryotes, but most research has focused on plant and animal RNA silencing systems. Phytophthora species belong to a phylogenetically distinct group of economically important plant pathogens that cause billions of dollars in yield losses annually as well as ecologically devastating outbreaks. We analyzed the small RNA-generating components of the genomes of P. infestans, P. sojae and P. ramorum using bioinformatics, genetic, phylogenetic and high-throughput sequencing-based methods. Each species produces two distinct populations of small RNAs that are predominantly 21- or 25-nucleotides long. The 25-nucleotide small RNAs were primarily derived from loci encoding transposable elements and we propose that these small RNAs define a pathway of short-interfering RNAs that silence repetitive genetic elements. The 21-nucleotide small RNAs were primarily derived from inverted repeats, including a novel microRNA family that is conserved among the three species, and several gene families, including Crinkler effectors and type III fibronectins. The Phytophthora microRNA is predicted to target a family of amino acid/auxin permeases, and we propose that 21-nucleotide small RNAs function at the post-transcriptional level. The functional significance of microRNA-guided regulation of amino acid/auxin permeases and the association of 21-nucleotide small RNAs with Crinkler effectors remains unclear, but this work provides a framework for testing the role of small RNAs in Phytophthora biology and pathogenesis in future work. PMID:24204767
OSR1 regulates a subset of inward rectifier potassium channels via a binding motif variant.

Science.gov (United States)

Taylor, Clinton A; An, Sung-Wan; Kankanamalage, Sachith Gallolu; Stippec, Steve; Earnest, Svetlana; Trivedi, Ashesh T; Yang, Jonathan Zijiang; Mirzaei, Hamid; Huang, Chou-Long; Cobb, Melanie H

2018-04-10

The with-no-lysine (K) (WNK) signaling pathway to STE20/SPS1-related proline- and alanine-rich kinase (SPAK) and oxidative stress-responsive 1 (OSR1) kinase is an important mediator of cell volume and ion transport. SPAK and OSR1 associate with upstream kinases WNK 1-4, substrates, and other proteins through their C-terminal domains which interact with linear R-F-x-V/I sequence motifs. In this study we find that SPAK and OSR1 also interact with similar affinity with a motif variant, R-x-F-x-V/I. Eight of 16 human inward rectifier K+ channels have an R-x-F-x-V motif. We demonstrate that two of these channels, Kir2.1 and Kir2.3, are activated by OSR1, while Kir4.1, which does not contain the motif, is not sensitive to changes in OSR1 or WNK activity. Mutation of the motif prevents activation of Kir2.3 by OSR1. Both siRNA knockdown of OSR1 and chemical inhibition of WNK activity disrupt NaCl-induced plasma membrane localization of Kir2.3. Our results suggest a mechanism by which WNK-OSR1 enhance Kir2.1 and Kir2.3 channel activity by increasing their plasma membrane localization. Regulation of members of the inward rectifier K+ channel family adds functional and mechanistic insight into the physiological impact of the WNK pathway.
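The two linear motifs named in this abstract, R-F-x-V/I and the variant R-x-F-x-V/I, can be written as regular expressions over one-letter amino acid codes. The following is only a minimal sketch of such a scan, not the authors' procedure; the function name `find_motifs` and the toy sequence are hypothetical.

```python
import re

# R-F-x-V/I and the variant R-x-F-x-V/I, where "x" is any residue.
# The lookahead lets overlapping occurrences be reported as well.
CANONICAL = re.compile(r"(?=(RF.[VI]))")
VARIANT = re.compile(r"(?=(R.F.[VI]))")


def find_motifs(seq, pattern):
    """Return (1-based start position, matched residues) for every hit."""
    return [(m.start() + 1, m.group(1)) for m in pattern.finditer(seq.upper())]


# Hypothetical toy sequence for illustration, not a real channel sequence.
toy = "MSARFQVLLKRAFTVGG"
print(find_motifs(toy, CANONICAL))  # hits for R-F-x-V/I
print(find_motifs(toy, VARIANT))    # hits for R-x-F-x-V/I
```

A sequence match of this kind only flags candidate sites; the abstract's conclusions rest on binding and functional assays.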

Motif decomposition of the phosphotyrosine proteome reveals a new N-terminal binding motif for SHIP2

DEFF Research Database (Denmark)

Miller, Martin Lee; Hanke, S.; Hinsby, A. M.

2008-01-01

set of 481 unique phosphotyrosine (Tyr(P)) peptides by sequence similarity to known ligands of the Src homology 2 (SH2) and the phosphotyrosine binding (PTB) domains. From 20 clusters we extracted 16 known and four new interaction motifs. Using quantitative mass spectrometry we pulled down Tyr......(P)-specific binding partners for peptides corresponding to the extracted motifs. We confirmed numerous previously known interaction motifs and found 15 new interactions mediated by phosphosites not previously known to bind SH2 or PTB. Remarkably, a novel hydrophobic N-terminal motif ((L/V/I)(L/V/I)pY) was identified...
RNA-Catalyzed Polymerization and Replication of RNA

Science.gov (United States)

Horning, D. P.; Samantha, B.; Tjhung, K. F.; Joyce, G. F.

2017-07-01

In an effort to reconstruct RNA-based life, in vitro evolution was used to obtain an RNA polymerase ribozyme that can synthesize a variety of complex functional RNAs and can catalyze the exponential amplification of short RNAs.
On topological RNA interaction structures.

Science.gov (United States)

Qin, Jing; Reidys, Christian M

2013-07-01

Recently a folding algorithm of topological RNA pseudoknot structures was presented in Reidys et al. (2011). This algorithm folds single-stranded γ-structures, that is, RNA structures composed by distinct motifs of bounded topological genus. In this article, we set the theoretical foundations for the folding of the two backbone analogues of γ structures: the RNA γ-interaction structures. These are RNA-RNA interaction structures that are constructed by a finite number of building blocks over two backbones having genus at most γ. Combinatorial properties of γ-interaction structures are of practical interest since they have direct implications for the folding of topological interaction structures. We compute the generating function of γ-interaction structures and show that it is algebraic, which implies that the numbers of interaction structures can be computed recursively. We obtain simple asymptotic formulas for 0- and 1-interaction structures. The simplest class of interaction structures are the 0-interaction structures, which represent the two backbone analogues of secondary structures.
[Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

Science.gov (United States)

Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

2015-04-01

This study aimed to explore the features of clustered regularly interspaced short palindromic repeat (CRISPR) structures in Shigella using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that CRISPRs existed in the four groups of Shigella, and that the upstream flanking sequences of the CRISPRs could be classified into the same groups as the downstream ones. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences fell into the same groups as their corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain a "stem" and a "ring". Some spacers were found to be homologous to partial sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and that the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
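The palindromic motifs mentioned in this abstract are, in the usual nucleic acid sense, stretches that equal their own reverse complement. A minimal sketch of how such stretches could be located is given below; it assumes a DNA alphabet (A, C, G, T), and the function names and toy sequence are hypothetical rather than taken from the study.

```python
# Assumes DNA letters only; other characters are left untranslated.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def find_palindromes(seq, min_len=4, max_len=8):
    """List (1-based start, window) for windows equal to their reverse complement.

    Only even lengths are tried, because an odd-length window has a central
    base that cannot pair with itself.
    """
    seq = seq.upper()
    hits = []
    for length in range(min_len, max_len + 1, 2):
        for i in range(len(seq) - length + 1):
            window = seq[i:i + length]
            if window == reverse_complement(window):
                hits.append((i + 1, window))
    return hits


# Hypothetical toy sequence containing nested palindromes.
print(find_palindromes("TTGAATTCAA"))
```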
The calmodulin-binding, short linear motif, NSCaTE is conserved in L-type channel ancestors of vertebrate Cav1.2 and Cav1.3 channels.

Directory of Open Access Journals (Sweden)

Valentina Taiakina

NSCaTE is a short linear motif of (xWxxx(I or L)xxxx), composed of residues with a high helix-forming propensity within a mostly disordered N-terminus that is conserved in L-type calcium channels from protostome invertebrates to humans. NSCaTE is an optional, lower affinity and calcium-sensitive binding site for calmodulin (CaM) which competes for CaM binding with a more ancient, C-terminal IQ domain on L-type channels. CaM bound to N- and C-terminal tails serve as dual detectors of changing intracellular Ca(2+) concentrations, promoting calcium-dependent inactivation of L-type calcium channels. NSCaTE is absent in some arthropod species, and is also lacking in vertebrate L-type isoforms, Cav1.1 and Cav1.4 channels. The pervasiveness of a methionine just downstream from NSCaTE suggests that L-type channels could generate alternative N-termini lacking NSCaTE through the choice of translational start sites. A long N-terminus with an NSCaTE motif in the L-type calcium channel homolog LCav1 from the pond snail Lymnaea stagnalis has a faster calcium-dependent inactivation than a shortened N-terminus lacking NSCaTE. NSCaTE effects are present in low concentrations of internal buffer (0.5 mM EGTA), but disappear in high buffer conditions (10 mM EGTA). Snail and mammalian NSCaTE have an alpha-helical propensity upon binding Ca(2+)-CaM and can saturate both CaM N-terminal and C-terminal domains in the absence of a competing IQ motif. NSCaTE evolved in ancestors of the first animals with internal organs for promoting a more rapid, calcium-sensitive inactivation of L-type channels.
A Study on the Motif Pattern of Dark-Cloud Cover in the Securities

Directory of Open Access Journals (Sweden)

Long Jing

2017-01-01

Morphological analysis is the analysis and mining of the chart patterns formed by securities price changes. Investors need to forecast future trends before choosing buying and selling points, which can help avoid large losses. Therefore, the analysis of K-line motif patterns in technical analysis for futures investment is significant. Based on the idea of short-term trend clustering, this paper proposes a method for detecting the Dark-Cloud Cover motif pattern in stock time series by analysing historical stock data and K-line shapes, in order to predict stock market trends. We demonstrate the effectiveness and practicality of the method through a series of experiments.
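For orientation, the Dark-Cloud Cover pattern can be checked on a pair of consecutive K-lines (candlesticks) with a simple rule. The sketch below uses the common textbook definition (a rising candle followed by a candle that opens above the previous close and closes below the midpoint of the previous body, but above its open); it is an assumption for illustration and is not the clustering-based method the paper proposes. The `Candle` class, function names and sample prices are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candle:
    open: float
    high: float
    low: float
    close: float


def is_dark_cloud_cover(prev, curr):
    """Textbook two-candle rule (assumed here; definitions vary by author)."""
    prev_bullish = prev.close > prev.open
    curr_bearish = curr.close < curr.open
    gap_up = curr.open > prev.close  # stricter variants require curr.open > prev.high
    midpoint = (prev.open + prev.close) / 2
    closes_deep = prev.open < curr.close < midpoint
    return prev_bullish and curr_bearish and gap_up and closes_deep


def find_dark_cloud_cover(candles):
    """Return indices of the second candle of every matching pair."""
    return [i for i in range(1, len(candles))
            if is_dark_cloud_cover(candles[i - 1], candles[i])]


# Hypothetical prices for illustration only.
series = [
    Candle(10.0, 12.2, 9.9, 12.0),
    Candle(12.5, 12.6, 10.6, 10.8),
    Candle(10.8, 11.0, 10.0, 10.2),
]
print(find_dark_cloud_cover(series))
```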
[Personal motif in art].

Science.gov (United States)

Gerevich, József

2015-01-01

One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.
The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

Directory of Open Access Journals (Sweden)

Hao Ding

Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI). The Q motif, consisting of nine amino acids (GFXXPXPIQ) with an invariant glutamine (Q) residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11) gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.
Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

Directory of Open Access Journals (Sweden)

David Karlin

Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P) plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16 aa), several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains) that could be detected simply by comparing orthologous proteins.
Temporal motifs in time-dependent networks

International Nuclear Information System (INIS)

Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

2011-01-01

Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network.
Noroviruses Co-opt the Function of Host Proteins VAPA and VAPB for Replication via a Phenylalanine-Phenylalanine-Acidic-Tract-Motif Mimic in Nonstructural Viral Protein NS1/2.

Science.gov (United States)

McCune, Broc T; Tang, Wei; Lu, Jia; Eaglesham, James B; Thorne, Lucy; Mayer, Anne E; Condiff, Emily; Nice, Timothy J; Goodfellow, Ian; Krezel, Andrzej M; Virgin, Herbert W

2017-07-11

The Norovirus genus contains important human pathogens, but the role of host pathways in norovirus replication is largely unknown. Murine noroviruses provide the opportunity to study norovirus replication in cell culture and in small animals. The human norovirus nonstructural protein NS1/2 interacts with the host protein VAMP-associated protein A (VAPA), but the significance of the NS1/2-VAPA interaction is unexplored. Here we report decreased murine norovirus replication in VAPA- and VAPB-deficient cells. We characterized the role of VAPA in detail. VAPA was required for the efficiency of a step(s) in the viral replication cycle after entry of viral RNA into the cytoplasm but before the synthesis of viral minus-sense RNA. The interaction of VAPA with viral NS1/2 proteins is conserved between murine and human noroviruses. Murine norovirus NS1/2 directly bound the major sperm protein (MSP) domain of VAPA through its NS1 domain. Mutations within NS1 that disrupted interaction with VAPA inhibited viral replication. Structural analysis revealed that the viral NS1 domain contains a mimic of the phenylalanine-phenylalanine-acidic-tract (FFAT) motif that enables host proteins to bind to the VAPA MSP domain. The NS1/2-FFAT mimic region interacted with the VAPA-MSP domain in a manner similar to that seen with bona fide host FFAT motifs. Amino acids in the FFAT mimic region of the NS1 domain that are important for viral replication are highly conserved across murine norovirus strains. Thus, VAPA interaction with a norovirus protein that functionally mimics host FFAT motifs is important for murine norovirus replication. IMPORTANCE Human noroviruses are a leading cause of gastroenteritis worldwide, but host factors involved in norovirus replication are incompletely understood. Murine noroviruses have been studied to define mechanisms of norovirus replication. Here we defined the importance of the interaction between the hitherto poorly studied NS1/2 norovirus protein and the
Motif enrichment tool.

Science.gov (United States)

Blatti, Charles; Sinha, Saurabh

2014-07-01

The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
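The abstract does not state which statistic MET uses to call a motif "statistically overrepresented". One common choice for this kind of gene-set question is a one-sided hypergeometric test, sketched below purely as an assumption; the function name and the example counts are hypothetical and do not describe MET's implementation.

```python
from math import comb


def hypergeom_enrichment_p(total_genes, genes_with_motif, set_size, set_with_motif):
    """P(X >= set_with_motif) under a hypergeometric null.

    total_genes      -- all genes considered (N)
    genes_with_motif -- genes whose regulatory region carries the motif (K)
    set_size         -- size of the user's gene set (n)
    set_with_motif   -- genes in the set carrying the motif (k)
    """
    upper = min(genes_with_motif, set_size)
    tail = sum(
        comb(genes_with_motif, i) * comb(total_genes - genes_with_motif, set_size - i)
        for i in range(set_with_motif, upper + 1)
    )
    return tail / comb(total_genes, set_size)


# Hypothetical counts: 10 genes in total, 5 carry the motif,
# and all 5 genes in the set of interest carry it.
print(hypergeom_enrichment_p(10, 5, 5, 5))
```

In practice a test like this would be repeated for every motif, so some form of multiple-testing correction would normally follow.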
Structural Basis for Recognition and Sequestration of UUUOH 3' Termini of Nascent RNA Polymerase III Transcripts by La, a Rheumatic Disease Autoantigen

Energy Technology Data Exchange (ETDEWEB)

Teplova,M.; Yuan, Y.; Phan, A.; Malinina, L.; Ilin, S.; Teplov, A.; Patel, D.

2006-01-01

The nuclear phosphoprotein La was identified as an autoantigen in patients with systemic lupus erythematosus and Sjogren's syndrome. La binds to and protects the UUUOH 3' termini of nascent RNA polymerase III transcripts from exonuclease digestion. We report the 1.85 Å crystal structure of the N-terminal domain of human La, consisting of La and RRM1 motifs, bound to r(U1-G2-C3-U4-G5-U6-U7-U8-U9OH). The U7-U8-U9OH 3' end, in a splayed-apart orientation, is sequestered within a basic and aromatic amino acid-lined cleft between the La and RRM1 motifs. The specificity-determining U8 residue bridges both motifs, in part through unprecedented targeting of the β-sheet edge, rather than the anticipated face, of the RRM1 motif. Our structural observations, supported by mutation studies of both La and RNA components, illustrate the principles behind RNA sequestration by a rheumatic disease autoantigen, whereby the UUUOH 3' ends of nascent RNA transcripts are protected during downstream processing and maturation events.
Microprocessor Recruitment to Elongating RNA Polymerase II Is Required for Differential Expression of MicroRNAs

Directory of Open Access Journals (Sweden)

Victoria A. Church

2017-09-01

The cellular abundance of mature microRNAs (miRNAs) is dictated by the efficiency of nuclear processing of primary miRNA transcripts (pri-miRNAs) into pre-miRNA intermediates. The Microprocessor complex of Drosha and DGCR8 carries this out, but it has been unclear what controls Microprocessor’s differential processing of various pri-miRNAs. Here, we show that Drosophila DGCR8 (Pasha) directly associates with the C-terminal domain of the RNA polymerase II elongation complex when it is phosphorylated by the Cdk9 kinase (pTEFb). When association is blocked by loss of Cdk9 activity, a global change in pri-miRNA processing is detected. Processing of pri-miRNAs with a UGU sequence motif in their apical junction domain increases, while processing of pri-miRNAs lacking this motif decreases. Therefore, phosphorylation of RNA polymerase II recruits Microprocessor for co-transcriptional processing of non-UGU pri-miRNAs that would otherwise be poorly processed. In contrast, UGU-positive pri-miRNAs are robustly processed by Microprocessor independent of RNA polymerase association.
Mouse nucleolin binds to 4.5S RNAH, a small noncoding RNA

International Nuclear Information System (INIS)

Hirose, Yutaka; Harada, Fumio

2008-01-01

4.5S RNAH is a rodent-specific small noncoding RNA that exhibits extensive homology to the B1 short interspersed element. Although 4.5S RNAH is known to associate with cellular poly(A)-terminated RNAs and retroviral genomic RNAs, its function remains unclear. In this study, we analyzed 4.5S RNAH-binding proteins in mouse nuclear extracts using gel mobility shift and RNA-protein UV cross-linking assays. We found that at least nine distinct polypeptides (p170, p110, p93, p70, p48, p40, p34, p20, and p16.5) specifically interacted with 4.5S RNAH in vitro. Using anti-La antibody, p48 was identified as mouse La protein. To identify the other 4.5S RNAH-binding proteins, we performed expression cloning from a mouse cDNA library and obtained cDNA clones derived from nucleolin mRNA. We identified p110 as nucleolin using nucleolin-specific antibodies. UV cross-linking analysis using various deletion mutants of nucleolin indicated that the third of four tandem RNA recognition motifs is a major determinant for 4.5S RNAH recognition. Immunoprecipitation of nucleolin from the subcellular fractions of mouse cell extracts revealed that a portion of the endogenous 4.5S RNAH was associated with nucleolin and that this complex was located in both the nucleoplasm and nucleolus.
Affinity maturation of a portable Fab–RNA module for chaperone-assisted RNA crystallography

Science.gov (United States)

Koirala, Deepak; Shelke, Sandip A; Dupont, Marcel; Ruiz, Stormy; DasGupta, Saurja; Bailey, Lucas J; Benner, Steven A; Piccirilli, Joseph A

2018-01-01

Antibody fragments such as Fabs possess properties that can enhance protein and RNA crystallization and therefore can facilitate macromolecular structure determination. In particular, Fab BL3–6 binds to an AAACA RNA pentaloop closed by a GC pair with ∼100 nM affinity. The Fab and hairpin have served as a portable module for RNA crystallization. The potential for general application makes it desirable to adjust the properties of this crystallization module in a manner that facilitates its use for RNA structure determination, such as ease of purification, surface entropy or binding affinity. In this work, we used both in vitro RNA selection and phage display selection to alter the epitope and paratope sides of the binding interface, respectively, for improved binding affinity. We identified a 5′-GNGACCC-3′ consensus motif in the RNA and an S97N mutation in complementarity-determining region L3 of the Fab that independently impart about an order of magnitude improvement in affinity, resulting from new hydrogen bonding interactions. Using a model RNA, these modifications facilitated crystallization under a wider range of conditions and improved diffraction. The improved features of the Fab–RNA module may facilitate its use as an affinity tag for RNA purification and imaging and as a chaperone for RNA crystallography. PMID:29309709
RNA-PAIRS: RNA probabilistic assignment of imino resonance shifts

International Nuclear Information System (INIS)

Bahrami, Arash; Clos, Lawrence J.; Markley, John L.; Butcher, Samuel E.; Eghbalnia, Hamid R.

2012-01-01

The significant biological role of RNA has further highlighted the need for improving the accuracy, efficiency and the reach of methods for investigating RNA structure and function. Nuclear magnetic resonance (NMR) spectroscopy is vital to furthering the goals of RNA structural biology because of its distinctive capabilities. However, the dispersion pattern in the NMR spectra of RNA makes automated resonance assignment, a key step in NMR investigation of biomolecules, remarkably challenging. Herein we present RNA Probabilistic Assignment of Imino Resonance Shifts (RNA-PAIRS), a method for the automated assignment of RNA imino resonances with synchronized verification and correction of predicted secondary structure. RNA-PAIRS represents an advance in modeling the assignment paradigm because it seeds the probabilistic network for assignment with experimental NMR data, and predicted RNA secondary structure, simultaneously and from the start. Subsequently, RNA-PAIRS sets in motion a dynamic network that reverberates between predictions and experimental evidence in order to reconcile and rectify resonance assignments and secondary structure information. The procedure is halted when assignments and base pairings are deemed to be most consistent with observed crosspeaks. The current implementation of RNA-PAIRS uses an initial peak list derived from proton-nitrogen heteronuclear multiple quantum correlation (¹H–¹⁵N 2D HMQC) and proton–proton nuclear Overhauser enhancement spectroscopy (¹H–¹H 2D NOESY) experiments. We have evaluated the performance of RNA-PAIRS by using it to analyze NMR datasets from 26 previously studied RNAs, including a 111-nucleotide complex. For moderately sized RNA molecules, and over a range of comparatively complex structural motifs, the average assignment accuracy exceeds 90%, while the average base pair prediction accuracy exceeds 93%. RNA-PAIRS yielded accurate assignments and base pairings consistent with imino resonances for a
RNA-PAIRS: RNA probabilistic assignment of imino resonance shifts

Energy Technology Data Exchange (ETDEWEB)

Bahrami, Arash; Clos, Lawrence J.; Markley, John L.; Butcher, Samuel E. [National Magnetic Resonance Facility at Madison (United States); Eghbalnia, Hamid R., E-mail: eghbalhd@uc.edu [University of Cincinnati, Department of Molecular and Cellular Physiology (United States)

2012-04-15

The significant biological role of RNA has further highlighted the need for improving the accuracy, efficiency and the reach of methods for investigating RNA structure and function. Nuclear magnetic resonance (NMR) spectroscopy is vital to furthering the goals of RNA structural biology because of its distinctive capabilities. However, the dispersion pattern in the NMR spectra of RNA makes automated resonance assignment, a key step in NMR investigation of biomolecules, remarkably challenging. Herein we present RNA Probabilistic Assignment of Imino Resonance Shifts (RNA-PAIRS), a method for the automated assignment of RNA imino resonances with synchronized verification and correction of predicted secondary structure. RNA-PAIRS represents an advance in modeling the assignment paradigm because it seeds the probabilistic network for assignment with experimental NMR data, and predicted RNA secondary structure, simultaneously and from the start. Subsequently, RNA-PAIRS sets in motion a dynamic network that reverberates between predictions and experimental evidence in order to reconcile and rectify resonance assignments and secondary structure information. The procedure is halted when assignments and base pairings are deemed to be most consistent with observed crosspeaks. The current implementation of RNA-PAIRS uses an initial peak list derived from proton-nitrogen heteronuclear multiple quantum correlation (¹H-¹⁵N 2D HMQC) and proton-proton nuclear Overhauser enhancement spectroscopy (¹H-¹H 2D NOESY) experiments. We have evaluated the performance of RNA-PAIRS by using it to analyze NMR datasets from 26 previously studied RNAs, including a 111-nucleotide complex. For moderately sized RNA molecules, and over a range of comparatively complex structural motifs, the average assignment accuracy exceeds 90%, while the average base pair prediction accuracy exceeds 93%. RNA-PAIRS yielded accurate assignments and base pairings consistent with imino
Ratiometric fluorescent sensing of pH values in living cells by dual-fluorophore-labeled i-motif nanoprobes.

Science.gov (United States)

Huang, Jin; Ying, Le; Yang, Xiaohai; Yang, Yanjing; Quan, Ke; Wang, He; Xie, Nuli; Ou, Min; Zhou, Qifeng; Wang, Kemin

2015-09-01

We designed a new ratiometric fluorescent nanoprobe for sensing pH values in living cells. Briefly, the nanoprobe consists of a gold nanoparticle (AuNP), short single-stranded oligonucleotides, and dual-fluorophore-labeled i-motif sequences. The short oligonucleotides are designed to bind with the i-motif sequences and immobilized on the AuNP surface via Au-S bond. At neutral pH, the dual fluorophores are separated, resulting in very low fluorescence resonance energy transfer (FRET) efficiency. At acidic pH, the i-motif strands fold into a quadruplex structure and leave the AuNP, bringing the dual fluorophores into close proximity, resulting in high FRET efficiency, which could be used as a signal for pH sensing. The nanoprobe possesses abilities of cellular transfection, enzymatic protection, fast response and quantitative pH detection. The in vitro and intracellular applications of the nanoprobe were demonstrated, which showed excellent response in the physiological pH range. Furthermore, our experimental results suggested that the nanoprobe showed excellent spatial and temporal resolution in living cells. We think that the ratiometric sensing strategy could potentially be applied to create a variety of new multicolor sensors for intracellular detection.
Structural Dynamics of the GW182 Silencing Domain Including its RNA Recognition motif (RRM) Revealed by Hydrogen-Deuterium Exchange Mass Spectrometry

Science.gov (United States)

Cieplak-Rotowska, Maja K.; Tarnowski, Krzysztof; Rubin, Marcin; Fabian, Marc R.; Sonenberg, Nahum; Dadlez, Michal; Niedzwiecka, Anna

2018-01-01

The human GW182 protein plays an essential role in micro(mi)RNA-dependent gene silencing. miRNA silencing is mediated, in part, by a GW182 C-terminal region called the silencing domain, which interacts with the poly(A) binding protein and the CCR4-NOT deadenylase complex to repress protein synthesis. Structural studies of this GW182 fragment are challenging due to its predicted intrinsically disordered character, except for its RRM domain. However, detailed insights into the properties of proteins containing disordered regions can be provided by hydrogen-deuterium exchange mass spectrometry (HDX/MS). In this work, we applied HDX/MS to define the structural state of the GW182 silencing domain. HDX/MS analysis revealed that this domain is clearly divided into a natively unstructured part, including the CCR4-NOT interacting motif 1, and a distinct RRM domain. The GW182 RRM has a very dynamic structure, since water molecules can penetrate the whole domain in 2 h. The finding of these high structural dynamics sheds new light on the RRM structure. Though this domain is one of the most frequently occurring canonical protein domains in eukaryotes, these results are - to our knowledge - the first HDX/MS characterization of an RRM. The HDX/MS studies also show that the α2 helix of the RRM can display EX1 behavior after a freezing-thawing cycle. This means that the RRM structure is sensitive to environmental conditions and can change its conformation, which suggests that the state of RRM-containing proteins should be checked by HDX/MS with regard to conformational uniformity.

Analysis of hepatitis C virus RNA dimerization and core–RNA interactions

Science.gov (United States)

Ivanyi-Nagy, Roland; Kanevsky, Igor; Gabus, Caroline; Lavergne, Jean-Pierre; Ficheux, Damien; Penin, François; Fossé, Philippe; Darlix, Jean-Luc

2006-01-01

The core protein of hepatitis C virus (HCV) has been shown previously to act as a potent nucleic acid chaperone in vitro, promoting the dimerization of the 3′-untranslated region (3′-UTR) of the HCV genomic RNA, a process probably mediated by a small, highly conserved palindromic RNA motif, named DLS (dimer linkage sequence) [G. Cristofari, R. Ivanyi-Nagy, C. Gabus, S. Boulant, J. P. Lavergne, F. Penin and J. L. Darlix (2004) Nucleic Acids Res., 32, 2623–2631]. To investigate in depth HCV RNA dimerization, we generated a series of point mutations in the DLS region. We find that both the plus-strand 3′-UTR and the complementary minus-strand RNA can dimerize in the presence of core protein, while mutations in the DLS (among them a single point mutation that abolished RNA replication in a HCV subgenomic replicon system) completely abrogate dimerization. Structural probing of plus- and minus-strand RNAs, in their monomeric and dimeric forms, indicate that the DLS is the major if not the sole determinant of UTR RNA dimerization. Furthermore, the N-terminal basic amino acid clusters of core protein were found to be sufficient to induce dimerization, suggesting that they retain full RNA chaperone activity. These findings may have important consequences for understanding the HCV replicative cycle and the genetic variability of the virus. PMID:16707664
Short-lived non-coding transcripts (SLiTs): Clues to regulatory long non-coding RNA.

Science.gov (United States)

Tani, Hidenori

2017-03-22

Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs). Although the importance of lncRNAs has been documented in previous reports, the biological and physiological functions of lncRNAs remain largely unknown. The role of lncRNAs seems an elusive problem. Here, I propose a clue to the identification of regulatory lncRNAs. The key point is RNA half-life. RNAs with a long half-life (t1/2 > 4 h) contain a significant proportion of ncRNAs, as well as mRNAs involved in housekeeping functions, whereas RNAs with a short half-life (t1/2 < 4 h) include regulatory ncRNAs and regulatory mRNAs. This novel class of ncRNAs with a short half-life can be categorized as Short-Lived non-coding Transcripts (SLiTs). I consider that SLiTs are likely to be rich in functionally uncharacterized regulatory RNAs. This review describes recent progress in research into SLiTs.
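The half-life criterion in this abstract can be sketched as a simple threshold rule. The code below assumes first-order decay, so that t1/2 = ln(2)/k, and uses the 4 h cut-off quoted for long-lived RNAs. Treating everything at or below 4 h as short-lived is an assumption of this sketch, and the function names are hypothetical.

```python
# Minimal sketch: derive an RNA half-life from a first-order decay rate and
# classify the transcript against the 4 h threshold quoted in the abstract.
# Assumption: transcripts at or below the threshold count as short-lived.
import math

LONG_HALF_LIFE_HOURS = 4.0


def half_life_from_decay_rate(k_per_hour: float) -> float:
    """Half-life in hours for first-order decay with rate constant k (1/h)."""
    if k_per_hour <= 0:
        raise ValueError("decay rate must be positive")
    return math.log(2) / k_per_hour


def classify_half_life(half_life_hours: float) -> str:
    """Label a transcript as 'long-lived' or 'short-lived'."""
    if half_life_hours > LONG_HALF_LIFE_HOURS:
        return "long-lived"
    return "short-lived"


if __name__ == "__main__":
    for k in (0.1, 0.5):  # hypothetical decay rates in 1/h
        t_half = half_life_from_decay_rate(k)
        print(f"k={k}/h  t1/2={t_half:.2f} h  {classify_half_life(t_half)}")
```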
CRISPRTarget: bioinformatic prediction and analysis of crRNA targets

NARCIS (Netherlands)

Biswas, A.; Gagnon, J.N.; Brouns, S.J.J.; Fineran, P.C.; Brown, C.M.

2013-01-01

The bacterial and archaeal CRISPR/Cas adaptive immune system targets specific protospacer nucleotide sequences in invading organisms. This requires base pairing between processed CRISPR RNA and the target protospacer. For type I and II CRISPR/Cas systems, protospacer adjacent motifs (PAM) are
UKIRAN KERAWANG ACEH GAYO SEBAGAI INSPIRASI PENCIPTAAN MOTIF BATIK KHAS GAYO

Directory of Open Access Journals (Sweden)

Irfa ina Rohana Salma

2016-12-01

ABSTRACT: The batik industry has begun to develop in Gayo, but the region does not yet have a distinctive batik motif of its own. It is therefore necessary to create distinctive Gayo batik motifs, taking inspiration from the carvings found on traditional houses, commonly called kerawang Gayo carvings. The purpose of this art creation project is to create batik motifs with a Gayo character. The method consists of idea exploration, design, and realization as batik motifs. Six distinctive Gayo batik motifs were created: (1) Motif Ceplok Gayo; (2) Motif Gayo Tegak; (3) Motif Gayo Lurus; (4) Motif Parang Gayo; (5) Motif Gayo Lembut; and (6) Motif Geometris Gayo. A preference test of the motifs with fifty respondents showed that Motif Ceplok Gayo was chosen most often, by 19% of respondents, followed by Motif Parang Gayo (18%), Motif Gayo Lembut (17%), Motif Geometris Gayo (17%), Motif Gayo Lurus (15%), and Motif Gayo Tegak (14%). On average, the motifs were well received by respondents, so all of them are suitable for production as distinctive Gayo batik. Keywords: batik Gayo, Motif Ceplok Gayo, Motif Parang Gayo.
Modelling the structure of a ceRNA-theoretical, bipartite microRNA-mRNA interaction network regulating intestinal epithelial cellular pathways using R programming.

Science.gov (United States)

Robinson, J M; Henderson, W A

2018-01-12

We report a method using functional-molecular databases and network modelling to identify hypothetical mRNA-miRNA interaction networks regulating intestinal epithelial barrier function. The model forms a data-analysis component of our cell culture experiments, which produce RNA expression data from Nanostring Technologies nCounter® system. The epithelial tight-junction (TJ) and actin cytoskeleton interact as molecular components of the intestinal epithelial barrier. Upstream regulation of TJ-cytoskeleton interaction is effected by the Rac/Rock/Rho signaling pathway and other associated pathways which may be activated or suppressed by extracellular signaling from growth factors, hormones, and immune receptors. Pathway activations affect epithelial homeostasis, contributing to degradation of the epithelial barrier associated with osmotic dysregulation, inflammation, and tumor development. The complexity underlying miRNA-mRNA interaction networks represents a roadblock for prediction and validation of competing-endogenous RNA network function. We developed a network model to identify hypothetical co-regulatory motifs in a miRNA-mRNA interaction network related to epithelial function. An mRNA-miRNA interaction list was generated using KEGG and miRWalk2.0 databases. R code was developed to quantify and visualize inherent network structures. We identified a sub-network of genes associated with cellular proliferation and cancer, including c-MYC and Cyclin D, that share a high number of targeting miRNAs.
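The idea of counting shared targeting miRNAs in a bipartite miRNA-mRNA network can be sketched as follows. The authors report using R; this sketch uses Python instead and is not their code. The interaction list, the gene and miRNA labels, and the function names are all hypothetical placeholders.

```python
# Minimal sketch: represent a bipartite miRNA-mRNA interaction list and count,
# for every pair of genes, how many targeting miRNAs they share.
# All identifiers below are placeholders, not data from the study.
from collections import defaultdict
from itertools import combinations

EXAMPLE_INTERACTIONS = [
    ("miR-a", "gene_A"),
    ("miR-b", "gene_A"),
    ("miR-a", "gene_B"),
    ("miR-b", "gene_B"),
    ("miR-c", "gene_B"),
    ("miR-c", "gene_C"),
]


def build_targeting_map(interactions):
    """Map each gene to the set of miRNAs predicted to target it."""
    targeting = defaultdict(set)
    for mirna, gene in interactions:
        targeting[gene].add(mirna)
    return dict(targeting)


def shared_mirna_counts(targeting):
    """Return {(gene1, gene2): number of shared miRNAs} for co-targeted pairs."""
    counts = {}
    for g1, g2 in combinations(sorted(targeting), 2):
        shared = targeting[g1] & targeting[g2]
        if shared:
            counts[(g1, g2)] = len(shared)
    return counts


if __name__ == "__main__":
    targeting = build_targeting_map(EXAMPLE_INTERACTIONS)
    for pair, n in sorted(shared_mirna_counts(targeting).items()):
        print(pair, n)
```

Gene pairs with high shared counts are candidates for the kind of co-regulated sub-network the abstract describes.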
Design and evaluation of antimalarial peptides derived from prediction of short linear motifs in proteins related to erythrocyte invasion.

Directory of Open Access Journals (Sweden)

Alessandra Bianchin

The purpose of this study was to investigate the blood stage of the malaria causing parasite, Plasmodium falciparum, to predict potential protein interactions between the parasite merozoite and the host erythrocyte and design peptides that could interrupt these predicted interactions. We screened the P. falciparum and human proteomes for computationally predicted short linear motifs (SLiMs) in cytoplasmic portions of transmembrane proteins that could play roles in the invasion of the erythrocyte by the merozoite, an essential step in malarial pathogenesis. We tested thirteen peptides predicted to contain SLiMs, twelve of them palmitoylated to enhance membrane targeting, and found three that blocked parasite growth in culture by inhibiting the initiation of new infections in erythrocytes. Scrambled peptides for two of the most promising peptides suggested that their activity may be reflective of amino acid properties, in particular, positive charge. However, one peptide showed effects which were stronger than those of scrambled peptides. This was derived from human red blood cell glycophorin-B. We concluded that proteome-wide computational screening of the intracellular regions of both host and pathogen adhesion proteins provides potential lead peptides for the development of anti-malarial compounds.
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

KAUST Repository

Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

2015-01-01

Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8-10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
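Two ideas from this abstract lend themselves to a short illustration: the set of all possible DNA k-mers, and the difference between mono-nucleotide and di-nucleotide descriptions of a binding site. The sketch below is not the authors' modeling method. It only enumerates k-mers and extracts position-specific features, and the function names are hypothetical.

```python
# Minimal sketch: enumerate all DNA k-mers and extract the position-specific
# mono- and di-nucleotide features that motif models of each kind would score.
from itertools import product

ALPHABET = "ACGT"


def all_kmers(k: int) -> list[str]:
    """Every DNA k-mer in lexicographic order; there are 4**k of them."""
    return ["".join(p) for p in product(ALPHABET, repeat=k)]


def mononucleotide_features(site: str) -> list[tuple[int, str]]:
    """(position, base) pairs: each position is treated independently."""
    return [(i, base) for i, base in enumerate(site)]


def dinucleotide_features(site: str) -> list[tuple[int, str]]:
    """(position, adjacent base pair): captures neighbor dependencies."""
    return [(i, site[i:i + 2]) for i in range(len(site) - 1)]


if __name__ == "__main__":
    print(len(all_kmers(8)))          # 4**8 distinct 8-mers
    print(mononucleotide_features("ACGT"))
    print(dinucleotide_features("ACGT"))
```

A di-nucleotide model has more parameters per position (16 adjacent pairs rather than 4 bases), which is one plausible reason it can fit quantitative binding data more closely.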
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

KAUST Repository

Wong, Ka-Chun

2015-06-11

Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k = 8-10). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
Large-scale discovery of promoter motifs in Drosophila melanogaster.

Directory of Open Access Journals (Sweden)

Thomas A Down

2007-01-01

A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs) that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.
The N-terminal leucine-zipper motif in PTRF/cavin-1 is essential and sufficient for its caveolae-association

Energy Technology Data Exchange (ETDEWEB)

Wei, Zhuang [State Key Laboratory of Cell Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Laboratory of System Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Zou, Xinle [State Key Laboratory of Cell Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Wang, Hongzhong; Lei, Jigang; Wu, Yuan [State Key Laboratory of Cell Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Laboratory of System Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Liao, Kan, E-mail: kliao@sibs.ac.cn [State Key Laboratory of Cell Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China); Laboratory of System Biology, Institute of Biochemistry and Cell Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031 (China)

2015-01-16

Highlight: • The N-terminal leucine-zipper motif in PTRF/cavin-1 determines caveolar association. • Different cellular localization of PTRF/cavin-1 influences its serine 389 and 391 phosphorylation state. • PTRF/cavin-1 regulates cell motility via its caveolar association. - Abstract: PTRF/cavin-1 is a protein of two lives. Its reported functions in ribosomal RNA synthesis and in caveolae formation happen in two different cellular locations: nucleus vs. plasma membrane. Here, we identified that the N-terminal leucine-zipper motif in PTRF/cavin-1 was essential for the protein to be associated with caveolae in plasma membrane. It could counteract the effect of nuclear localization sequence in the molecule (AA 235–251). Deletion of this leucine-zipper motif from PTRF/cavin-1 caused the mutant to be exclusively localized in nuclei. The fusion of this leucine-zipper motif with histone 2A, which is a nuclear protein, could induce the fusion protein to be exported from nucleus. Cell migration was greatly inhibited in PTRF/cavin-1 −/− mouse embryonic fibroblasts (MEFs). The inhibited cell motility could only be rescued by exogenous cavin-1 but not the leucine-zipper motif deleted cavin-1 mutant. Plasma membrane dynamics is an important factor in cell motility control. Our results suggested that the membrane dynamics in cell migration is affected by caveolae associated PTRF/cavin-1.
The N-terminal leucine-zipper motif in PTRF/cavin-1 is essential and sufficient for its caveolae-association

International Nuclear Information System (INIS)

Wei, Zhuang; Zou, Xinle; Wang, Hongzhong; Lei, Jigang; Wu, Yuan; Liao, Kan

2015-01-01

Highlight: • The N-terminal leucine-zipper motif in PTRF/cavin-1 determines caveolar association. • Different cellular localization of PTRF/cavin-1 influences its serine 389 and 391 phosphorylation state. • PTRF/cavin-1 regulates cell motility via its caveolar association. - Abstract: PTRF/cavin-1 is a protein of two lives. Its reported functions in ribosomal RNA synthesis and in caveolae formation happen in two different cellular locations: nucleus vs. plasma membrane. Here, we identified that the N-terminal leucine-zipper motif in PTRF/cavin-1 was essential for the protein to be associated with caveolae in plasma membrane. It could counteract the effect of nuclear localization sequence in the molecule (AA 235–251). Deletion of this leucine-zipper motif from PTRF/cavin-1 caused the mutant to be exclusively localized in nuclei. The fusion of this leucine-zipper motif with histone 2A, which is a nuclear protein, could induce the fusion protein to be exported from nucleus. Cell migration was greatly inhibited in PTRF/cavin-1 −/− mouse embryonic fibroblasts (MEFs). The inhibited cell motility could only be rescued by exogenous cavin-1 but not the leucine-zipper motif deleted cavin-1 mutant. Plasma membrane dynamics is an important factor in cell motility control. Our results suggested that the membrane dynamics in cell migration is affected by caveolae associated PTRF/cavin-1
Ciliate telomerase RNA loop IV nucleotides promote hierarchical RNP assembly and holoenzyme stability.

Science.gov (United States)

Robart, Aaron R; O'Connor, Catherine M; Collins, Kathleen

2010-03-01

Telomerase adds simple-sequence repeats to chromosome 3' ends to compensate for the loss of repeats with each round of genome replication. To accomplish this de novo DNA synthesis, telomerase uses a template within its integral RNA component. In addition to providing the template, the telomerase RNA subunit (TER) also harbors nontemplate motifs that contribute to the specialized telomerase catalytic cycle of reiterative repeat synthesis. Most nontemplate TER motifs function through linkage with the template, but in ciliate and vertebrate telomerases, a stem-loop motif binds telomerase reverse transcriptase (TERT) and reconstitutes full activity of the minimal recombinant TERT+TER RNP, even when physically separated from the template. Here, we resolve the functional requirements for this motif of ciliate TER in physiological RNP context using the Tetrahymena thermophila p65-TER-TERT core RNP reconstituted in vitro and the holoenzyme reconstituted in vivo. Contrary to expectation based on assays of the minimal recombinant RNP, we find that none of a panel of individual loop IV nucleotide substitutions impacts the profile of telomerase product synthesis when reconstituted as physiological core RNP or holoenzyme RNP. However, loop IV nucleotide substitutions do variably reduce assembly of TERT with the p65-TER complex in vitro and reduce the accumulation and stability of telomerase RNP in endogenous holoenzyme context. Our results point to a unifying model of a conformational activation role for this TER motif in the telomerase RNP enzyme.
MHC motif viewer

DEFF Research Database (Denmark)

Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

2008-01-01

. Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...
Non-Watson Crick base pairs might stabilize RNA structural motifs in ...

Indian Academy of Sciences (India)

Watson Crick base pairs, internal loops and pseudoknots have been the highlighting feature of recent structural determination of RNAs. The recent crystal structure of group-I introns has demonstrated that these might constitute RNA structural ...
Novel RNA Duplex Locks HIV-1 in a Latent State via Chromatin-mediated Transcriptional Silencing

Directory of Open Access Journals (Sweden)

Chantelle Ahlenstiel

2015-01-01

Transcriptional gene silencing (TGS) of mammalian genes can be induced by short interfering RNA (siRNA) targeting promoter regions. We previously reported potent TGS of HIV-1 by siRNA (PromA), which targets tandem NF-κB motifs within the viral 5′LTR. In this study, we screened a siRNA panel with the aim of identifying novel 5′LTR targets, to provide multiplexing potential with enhanced viral silencing and application toward developing alternate therapeutic strategies. Systematic examination identified a novel siRNA target, si143, confirmed to induce TGS as the silencing mechanism. TGS was prolonged with virus suppression >12 days, despite a limited ability to induce post-TGS. Epigenetic changes associated with silencing were suggested by partial reversal by histone deacetylase inhibitors and confirmed by chromatin immunoprecipitation analyses, which showed induction of H3K27me3 and H3K9me3, reduction in H3K9Ac, and recruitment of argonaute-1, all characteristic marks of heterochromatin and TGS. Together, these epigenetic changes mimic those associated with HIV-1 latency. Further, robust resistance to reactivation was observed in the J-Lat 9.2 cell latency model, when transduced with shPromA and/or sh143. These data support si/shRNA-mediated TGS approaches to HIV-1 and provide alternate targets to pursue a functional cure, whereby the viral reservoir is locked in latency following antiretroviral therapy cessation.
In Vivo Short-Term Topical Application of BAY 11-7082 Prevents the Acidic Bile–Induced mRNA and miRNA Oncogenic Phenotypes in Exposed Murine Hypopharyngeal Mucosa

Directory of Open Access Journals (Sweden)

Clarence T. Sasaki

2018-04-01

PURPOSE: Bile-containing gastroesophageal reflux may promote cancer at extraesophageal sites. Acidic bile can accelerate NF-κB activation and molecular events, linked to premalignant changes in murine hypopharyngeal mucosa (HM). We hypothesize that short-term in vivo topical application of NF-κB inhibitor BAY 11-7082 can prevent acidic bile–induced early preneoplastic molecular events, suggesting its potential role in disease prevention. EXPERIMENTAL DESIGN: We topically exposed HM (C57Bl/6j wild-type) to a mixture of bile acids at pH 3.0 with and without BAY 11-7082 3 times/day for 7 days. We used immunofluorescence, Western blotting, immunohistochemistry, quantitative polymerase chain reaction, and polymerase chain reaction microarrays to identify NF-κB activation and its associated oncogenic mRNA and miRNA phenotypes, in murine hypopharyngeal cells in vitro and in murine HM in vivo. RESULTS: Short-term exposure of HM to acidic bile is a potent stimulus accelerating the expression of NF-κB signaling (70 out of 84 genes) and oncogenic molecules. Topical application of BAY 11-7082 sufficiently blocks the effect of acidic bile. BAY 11-7082 eliminates NF-κB activation in regenerating basal cells of acidic bile–treated HM and prevents overexpression of molecules central to head and neck cancer, including bcl-2, STAT3, EGFR, TNF-α, and WNT5A. NF-κB inhibitor reverses the upregulated “oncomirs” miR-155 and miR-192 and the downregulated “tumor suppressors” miR-451a and miR-375 phenotypes in HM affected by acidic bile. CONCLUSION: There is novel evidence that acidic bile–induced NF-κB–related oncogenic mRNA and miRNA phenotypes are generated after short-term 7-day mucosal exposure and that topical mucosal application of BAY 11-7082 can prevent the acidic bile–induced molecular alterations associated with unregulated cell growth and proliferation of hypopharyngeal cells.
Role of the Box C/D Motif in Localization of Small Nucleolar RNAs to Coiled Bodies and Nucleoli

Science.gov (United States)

Narayanan, Aarthi; Speckmann, Wayne; Terns, Rebecca; Terns, Michael P.

1999-01-01

Small nucleolar RNAs (snoRNAs) are a large family of eukaryotic RNAs that function within the nucleolus in the biogenesis of ribosomes. One major class of snoRNAs is the box C/D snoRNAs named for their conserved box C and box D sequence elements. We have investigated the involvement of cis-acting sequences and intranuclear structures in the localization of box C/D snoRNAs to the nucleolus by assaying the intranuclear distribution of fluorescently labeled U3, U8, and U14 snoRNAs injected into Xenopus oocyte nuclei. Analysis of an extensive panel of U3 RNA variants showed that the box C/D motif, comprised of box C′, box D, and the 3′ terminal stem of U3, is necessary and sufficient for the nucleolar localization of U3 snoRNA. Disruption of the elements of the box C/D motif of U8 and U14 snoRNAs also prevented nucleolar localization, indicating that all box C/D snoRNAs use a common nucleolar-targeting mechanism. Finally, we found that wild-type box C/D snoRNAs transiently associate with coiled bodies before they localize to nucleoli and that variant RNAs that lack an intact box C/D motif are detained within coiled bodies. These results suggest that coiled bodies play a role in the biogenesis and/or intranuclear transport of box C/D snoRNAs. PMID:10397754
The modeled structure of the RNA dependent RNA polymerase of GBV-C Virus suggests a role for motif E in Flaviviridae RNA polymerases

Directory of Open Access Journals (Sweden)

Dutartre Hélène

2005-10-01

Background: The Flaviviridae virus family includes major human and animal pathogens. The RNA dependent RNA polymerase (RdRp) plays a central role in the replication process, and thus is a validated target for antiviral drugs. Despite the increasing structural and enzymatic characterization of viral RdRps, detailed molecular replication mechanisms remain unclear. The hepatitis C virus (HCV) is a major human pathogen difficult to study in cultured cells. The bovine viral diarrhea virus (BVDV) is often used as a surrogate model to screen antiviral drugs against HCV. The structure of BVDV RdRp has been recently published. It presents several differences relative to HCV RdRp. These differences raise questions about the relevance of BVDV as a surrogate model, and cast novel interest on the "GB" virus C (GBV-C). Indeed, GBV-C is genetically closer to HCV than BVDV, and can lead to productive infection of cultured cells. There is no structural data for the GBV-C RdRp yet. Results: We show in this study that the GBV-C RdRp is closest to the HCV RdRp. We report a 3D model of the GBV-C RdRp, developed using sequence-to-structure threading and comparative modeling based on the atomic coordinates of the HCV RdRp structure. Analysis of the predicted structural features in the phylogenetic context of the RNA polymerase family allows rationalizing most of the experimental data available. Both available structures and our model are explored to examine the catalytic cleft, allosteric and substrate binding sites. Conclusion: Computational methods were used to infer evolutionary relationships and to predict the structure of a viral RNA polymerase. Docking a GTP molecule into the structure allows defining a GTP binding pocket in the GBV-C RdRp, such as that of BVDV. The resulting model suggests a new proposition for the mechanism of RNA synthesis, and may prove useful to design new experiments to implement our knowledge on the initiation mechanism of RNA
Deciphering functional glycosaminoglycan motifs in development.

Science.gov (United States)

Townley, Robert A; Bülow, Hannes E

2018-03-23

Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby responsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs.
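The position weight matrix mentioned in this abstract can be computed for any alphabet once sequences are encoded as strings of equal length. The sketch below assumes a made-up single-letter code in which each letter stands for one modified disaccharide unit. The review's actual coding scheme is not reproduced here, and the pseudocount and uniform background are assumptions of the sketch.

```python
# Minimal sketch: build a position weight matrix (log2 odds against a uniform
# background) from aligned, equal-length sequences over an arbitrary alphabet.
# The one-letter "GAG code" used in the example is purely hypothetical.
import math
from collections import Counter


def position_frequency_matrix(aligned, alphabet, pseudocount=1.0):
    """Per-position symbol frequencies with additive smoothing."""
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("all aligned sequences must have the same length")
    total = len(aligned) + pseudocount * len(alphabet)
    matrix = []
    for pos in range(length):
        counts = Counter(seq[pos] for seq in aligned)
        matrix.append({s: (counts[s] + pseudocount) / total for s in alphabet})
    return matrix


def position_weight_matrix(frequencies, alphabet):
    """Convert frequencies to log2 odds relative to a uniform background."""
    background = 1.0 / len(alphabet)
    return [
        {s: math.log2(column[s] / background) for s in alphabet}
        for column in frequencies
    ]


if __name__ == "__main__":
    alphabet = "ABC"                      # hypothetical disaccharide codes
    aligned = ["AB", "AB", "AC"]          # hypothetical aligned motif instances
    freqs = position_frequency_matrix(aligned, alphabet)
    for column in position_weight_matrix(freqs, alphabet):
        print({s: round(w, 3) for s, w in column.items()})
```

Positive weights mark symbols enriched at a position relative to background, and negative weights mark depleted ones.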

Fitness for synchronization of network motifs

DEFF Research Database (Denmark)

Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

2004-01-01

We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....
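Synchronization of Kuramoto oscillators on a small motif can be sketched with a few lines of numerical integration. The code below is not the authors' procedure. It assumes the standard Kuramoto equation with unnormalized coupling, forward Euler integration, and a fully connected three-node motif, and measures synchrony with the usual order parameter r (r = 1 means perfect phase synchrony). The parameter values are arbitrary.

```python
# Minimal sketch: Kuramoto oscillators on a small network motif.
# Assumed model: dtheta_i/dt = omega_i + K * sum_j A_ij * sin(theta_j - theta_i)
# integrated with forward Euler; all parameter values are illustrative.
import cmath
import math


def kuramoto_step(phases, freqs, adjacency, coupling, dt):
    """Advance all phases by one Euler step."""
    n = len(phases)
    updated = []
    for i in range(n):
        interaction = sum(
            adjacency[i][j] * math.sin(phases[j] - phases[i]) for j in range(n)
        )
        updated.append(phases[i] + dt * (freqs[i] + coupling * interaction))
    return updated


def order_parameter(phases):
    """Kuramoto order parameter r in [0, 1]."""
    return abs(sum(cmath.exp(1j * p) for p in phases)) / len(phases)


def simulate(phases, freqs, adjacency, coupling, dt=0.01, steps=5000):
    """Integrate for a fixed number of steps and return the final phases."""
    for _ in range(steps):
        phases = kuramoto_step(phases, freqs, adjacency, coupling, dt)
    return phases


if __name__ == "__main__":
    triangle = [[0, 1, 1], [1, 0, 1], [1, 1, 0]]   # fully connected 3-node motif
    start = [0.0, 1.0, 2.0]
    final = simulate(start, [1.0, 1.0, 1.0], triangle, coupling=1.0)
    print(round(order_parameter(start), 3), round(order_parameter(final), 3))
```

Repeating such runs over random initial phases and frequencies for each motif topology would give an empirical synchronization probability of the kind the abstract refers to.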
Unifying evolutionary and thermodynamic information for RNA folding of multiple alignments

DEFF Research Database (Denmark)

Seemann, Ernst Stefan; Gorodkin, Jan; Backofen, Rolf

2008-01-01

Computational methods for determining the secondary structure of RNA sequences from given alignments are currently either based on thermodynamic folding, compensatory base pair substitutions or both. However, there is currently no approach that combines both sources of information in a single...... the corresponding probability of being single stranded. Furthermore, we found that structurally conserved RNA motifs are mostly supported by folding energies. Other problems (e.g. RNA-folding kinetics) may also benefit from employing the principles of the model we introduce. Our implementation, PETfold, was tested...
Methods to enable the design of bioactive small molecules targeting RNA.

Science.gov (United States)

Disney, Matthew D; Yildirim, Ilyas; Childs-Disney, Jessica L

2014-02-21

RNA is an immensely important target for small molecule therapeutics or chemical probes of function. However, methods that identify, annotate, and optimize RNA-small molecule interactions that could enable the design of compounds that modulate RNA function are in their infancy. This review describes recent approaches that have been developed to understand and optimize RNA motif-small molecule interactions, including structure-activity relationships through sequencing (StARTS), quantitative structure-activity relationships (QSAR), chemical similarity searching, structure-based design and docking, and molecular dynamics (MD) simulations. Case studies described include the design of small molecules targeting RNA expansions, the bacterial A-site, viral RNAs, and telomerase RNA. These approaches can be combined to afford a synergistic method to exploit the myriad RNA targets in the transcriptome.
Mutations in the RNA-binding domains of tombusvirus replicase proteins affect RNA recombination in vivo

International Nuclear Information System (INIS)

Panaviene, Zivile; Nagy, Peter D.

2003-01-01

RNA recombination, which is thought to occur due to replicase errors during viral replication, is one of the major driving forces of virus evolution. In this article, we show evidence that the replicase proteins of Cucumber necrosis virus, a tombusvirus, are directly involved in RNA recombination in vivo. Mutations within the RNA-binding domains of the replicase proteins affected the frequency of recombination observed with a prototypical defective-interfering (DI) RNA, a model template for recombination studies. Five of the 17 replicase mutants tested showed a delay in the formation of recombinants when compared to the wild-type helper virus. Interestingly, two replicase mutants accelerated recombinant formation and, in addition, these mutants also increased the level of subgenomic RNA synthesis (Virology 308 (2003), 191-205). A trans-complementation system was used to demonstrate that mutation in the p33 replicase protein resulted in an altered recombination rate. Isolated recombinants were mostly imprecise (nonhomologous), with the recombination sites clustered around a replication enhancer region and a putative cis-acting element, respectively. These RNA elements might facilitate the proposed template switching events by the tombusvirus replicase. Together with data in the article cited above, results presented here firmly establish that the conserved RNA-binding motif of the replicase proteins is involved in RNA replication, subgenomic RNA synthesis, and RNA recombination.
C/EBPα Short-Activating RNA Suppresses Metastasis of Hepatocellular Carcinoma through Inhibiting EGFR/β-Catenin Signaling Mediated EMT.

Directory of Open Access Journals (Sweden)

Hongbo Huan

Full Text Available Hepatocellular carcinoma is associated with high mortality, and tumor metastasis is an important reason for poor prognosis. However, metastasis has not been effectively prevented in clinical therapy and the mechanisms underlying metastasis have not been fully characterized. CCAAT/enhancer-binding protein-α (C/EBPα) is a transcriptional regulator with an essential role in tumor metastasis. We used short-activating RNAs (saRNA) to enhance expression of C/EBPα. Intravenous injection of C/EBPα-saRNA in a nude mouse liver orthotopic xenograft tumor model inhibited intrahepatic and distant metastasis. C/EBPα-saRNA-treated mice showed increased serum levels of albumin and decreased alanine aminotransferase (ALT) and glutamic-oxalacetic transaminase (AST), indicating a role of C/EBPα in improving liver function. Migration and invasion were inhibited in hepatoma cell lines transfected with C/EBPα-saRNA. We also observed an inhibition of epithelial-mesenchymal transition (EMT) and suppression of epidermal growth factor receptor (EGFR), EGFR phosphorylation, and β-catenin in C/EBPα-saRNA-transfected cells. Our results suggested that C/EBPα-saRNA successfully inhibited HCC metastasis by inhibiting EGFR/β-catenin signaling pathway mediated EMT in vitro and in vivo.
Free-energy landscape of a hyperstable RNA tetraloop.

Science.gov (United States)

Miner, Jacob C; Chen, Alan A; García, Angel E

2016-06-14

We report the characterization of the energy landscape and the folding/unfolding thermodynamics of a hyperstable RNA tetraloop obtained through high-performance molecular dynamics simulations at microsecond timescales. Sampling of the configurational landscape is conducted using temperature replica exchange molecular dynamics over three isochores at high, ambient, and negative pressures to determine the thermodynamic stability and the free-energy landscape of the tetraloop. The simulations reveal reversible folding/unfolding transitions of the tetraloop into the canonical A-RNA conformation and the presence of two alternative configurations, including a left-handed Z-RNA conformation and a compact purine triplet. Increasing hydrostatic pressure shows a stabilizing effect on the A-RNA conformation and a destabilization of the left-handed Z-RNA. Our results provide a comprehensive description of the folded free-energy landscape of a hyperstable RNA tetraloop and highlight the significant advances of all-atom molecular dynamics in describing the unbiased folding of a simple RNA secondary structure motif.
Different modes of interaction by TIAR and HuR with target RNA and DNA

OpenAIRE

Kim, Henry S.; Wilce, Matthew C. J.; Yoga, Yano M. K.; Pendini, Nicole R.; Gunzburg, Menachem J.; Cowieson, Nathan P.; Wilson, Gerald M.; Williams, Bryan R. G.; Gorospe, Myriam; Wilce, Jacqueline A.

2011-01-01

TIAR and HuR are mRNA-binding proteins that play important roles in the regulation of translation. They both possess three RNA recognition motifs (RRMs) and bind to AU-rich elements (AREs), with seemingly overlapping specificity. Here we show using SPR that TIAR and HuR bind to both U-rich and AU-rich RNA in the nanomolar range, with higher overall affinity for U-rich RNA. However, the higher affinity for U-rich sequences is mainly due to faster association with U-rich RNA, which we propose i...
Application of Characteristic Maluku Ornaments for the Development of Batik Motif Designs

Directory of Open Access Journals (Sweden)

Masiswo Masiswo

2016-04-01

Full Text Available Maluku has a rich variety of decorative forms inherited from its ancestors in the form of ethnic ornaments, which embody both art and craft skills. This heritage survives to the present day and can still be enjoyed as a source of spiritual satisfaction. To sustain the ethnic traditional values embodied in the ornaments of the Maluku region, the ornaments were developed into batik motifs on cloth. The development emphasizes the representation of ornament forms applied to batik craft as characteristic Maluku motifs. Three alternative batik motif designs were derived from characteristic Maluku ornaments, made into product prototypes, and tested for color fastness. In the test of color fastness to wet rubbing, the "Siwa" motif was rated excellent, and the "Siwa Talang" and "Matahari Siwa Talang" motifs were rated good. Keywords: design, Maluku, batik motif, ornament
The limits of de novo DNA motif discovery.

Directory of Open Access Journals (Sweden)

David Simcha

Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery, that is, searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of
Genomic mid-range inhomogeneity correlates with an abundance of RNA secondary structures

Directory of Open Access Journals (Sweden)

Song Jun

2008-06-01

Full Text Available Background: Genomes possess different levels of non-randomness, in particular, an inhomogeneity in their nucleotide composition. Inhomogeneity is manifest from the short-range where neighboring nucleotides influence the choice of base at a site, to the long-range, commonly known as isochores, where a particular base composition can span millions of nucleotides. A separate genomic issue that has yet to be thoroughly elucidated is the role that RNA secondary structure (SS) plays in gene expression. Results: We present novel data and approaches that show that a mid-range inhomogeneity (~30 to 1000 nt) not only exists in mammalian genomes but is also significantly associated with strong RNA SS. A whole-genome bioinformatics investigation of local SS in a set of 11,315 non-redundant human pre-mRNA sequences has been carried out. Four distinct components of these molecules (5'-UTRs, exons, introns and 3'-UTRs) were considered separately, since they differ in overall nucleotide composition, sequence motifs and periodicities. For each pre-mRNA component, the abundance of strong local SS (... Conclusion: We demonstrate that the excess of strong local SS in pre-mRNAs is linked to the little explored phenomenon of genomic mid-range inhomogeneity (MRI). MRI is an interdependence between nucleotide choice and base composition over a distance of 20–1000 nt. Additionally, we have created a public computational resource to support further study of genomic MRI.
Parole, Syntagmatics, and Paradigmatics of the Mega Mendung Batik Motif

Directory of Open Access Journals (Sweden)

Rudi - Nababan

2012-04-01

Full Text Available Any discussion of traditional batik involves the system that organizes its accompanying fine-arts elements, both the pattern of the motif and the technique of its making. The Mega Mendung motif of Cirebon has patterns and rules that traditionally differ from those of motifs from other areas. Semiotic analysis, especially using the concepts of Saussure and Peirce, shows that Cirebon batik, in this case the Mega Mendung motif, has a parole and langue system, as a distinctive fine-arts language in batik, and a visual syntagmatic and paradigmatic structure. As a fine-arts language, the batik motif is related to a sign system of symbols and icons. Keywords: visual semiotic, Cirebon’s batik.
Structural Analysis of Monomeric RNA-Dependent Polymerases: Evolutionary and Therapeutic Implications.

Directory of Open Access Journals (Sweden)

Rodrigo Jácome

Full Text Available The crystal structures of monomeric RNA-dependent RNA polymerases and reverse transcriptases of more than 20 different viruses are available in the Protein Data Bank. They all share the characteristic right-hand shape of DNA and RNA polymerases formed by the fingers, palm and thumb subdomains, and, in many cases, "fingertips" that extend from the fingers towards the thumb subdomain, giving the viral enzyme a closed right-hand appearance. Six conserved structural motifs that contain key residues for the proper functioning of the enzyme have been identified in all these RNA-dependent polymerases. These enzymes share a two-divalent-metal-ion mechanism of polymerization in which two conserved aspartate residues coordinate the interactions with the metal ions to catalyze the nucleotidyl transfer reaction. The recent availability of crystal structures of polymerases of the Orthomyxoviridae and Bunyaviridae families allowed us to make pairwise comparisons of the tertiary structures of polymerases belonging to the four main RNA viral groups, which has led to a phylogenetic tree in which single-stranded negative RNA viral polymerases have been included for the first time. This has also allowed us to use a homology-based structural prediction approach to develop a general three-dimensional model of the Ebola virus RNA-dependent RNA polymerase. Our model includes several of the conserved structural motifs and residues described in other viral RNA-dependent RNA polymerases that define the catalytic and highly conserved palm subdomain, as well as portions of the fingers and thumb subdomains. The results presented here help to understand the current use and apparent success of antivirals, i.e. Brincidofovir, Lamivudine and Favipiravir, originally aimed at other types of polymerases, to counteract the Ebola virus infection.
Two-dimensional combinatorial screening enables the bottom-up design of a microRNA-10b inhibitor.

Science.gov (United States)

Velagapudi, Sai Pradeep; Disney, Matthew D

2014-03-21

The RNA motifs that bind guanidinylated kanamycin A (G Kan A) and guanidinylated neomycin B (G Neo B) were identified via two-dimensional combinatorial screening (2DCS). The results of these studies enabled the "bottom-up" design of a small molecule inhibitor of oncogenic microRNA-10b.
Mapping the active site of vaccinia virus RNA triphosphatase

International Nuclear Information System (INIS)

Gong Chunling; Shuman, Stewart

2003-01-01

The RNA triphosphatase component of vaccinia virus mRNA capping enzyme (the product of the viral D1 gene) belongs to a family of metal-dependent phosphohydrolases that includes the RNA triphosphatases of fungi, protozoa, Chlorella virus, and baculoviruses. The family is defined by two glutamate-containing motifs (A and C) that form the metal-binding site. Most of the family members resemble the fungal and Chlorella virus enzymes, which have a complex active site located within the hydrophilic interior of a topologically closed eight-stranded β barrel (the so-called "triphosphate tunnel"). Here we queried whether vaccinia virus capping enzyme is a member of the tunnel subfamily, via mutational mapping of amino acids required for vaccinia triphosphatase activity. We identified four new essential side chains in vaccinia D1 via alanine scanning and illuminated structure-activity relationships by conservative substitutions. Our results, together with previous mutational data, highlight a constellation of six acidic and three basic amino acids that likely compose the vaccinia triphosphatase active site (Glu37, Glu39, Arg77, Lys107, Glu126, Asp159, Lys161, Glu192, and Glu194). These nine essential residues are conserved in all vertebrate and invertebrate poxvirus RNA capping enzymes. We discerned no pattern of clustering of the catalytic residues of the poxvirus triphosphatase that would suggest structural similarity to the tunnel proteins (exclusive of motifs A and C). We infer that the poxvirus triphosphatases are a distinct lineage within the metal-dependent RNA triphosphatase family. Their unique active site, which is completely different from that of the host cell's capping enzyme, recommends the poxvirus RNA triphosphatase as a molecular target for antipoxviral drug discovery.
Molecular Detection, Phylogenetic Analysis, and Identification of Transcription Motifs in Feline Leukemia Virus from Naturally Infected Cats in Malaysia

Directory of Open Access Journals (Sweden)

Faruku Bande

2014-01-01

Full Text Available A nested PCR assay was used to determine the viral RNA and proviral DNA status of naturally infected cats. Selected samples that were FeLV-positive by PCR were subjected to sequencing, phylogenetic analysis, and motif search. Of the 39 samples that were positive for FeLV p27 antigen, 87.2% (34/39) were confirmed positive with nested PCR. FeLV proviral DNA was detected in 38 (97.3%) of p27-antigen negative samples. Malaysian FeLV isolates are found to be highly similar with a homology of 91% to 100%. Phylogenetic analysis revealed that Malaysian FeLV isolates divided into two clusters, with a majority (86.2%) sharing similarity with FeLV-K01803 and fewer isolates (13.8%) with FeLV-GM1 strain. Different enhancer motifs including NF-GMa, Krox-20/WT1I-del2, BAF1, AP-2, TBP, TFIIF-beta, TRF, and TFIID are found to occur either in single, duplicate, triplicate, or sets of 5 in different positions within the U3-LTR-gag region. The present result confirms the occurrence of FeLV viral RNA and provirus DNA in naturally infected cats. Malaysian FeLV isolates are highly similar, and a majority of them are closely related to a UK isolate. This study provides the first molecular based information on FeLV in Malaysia. Additionally, different enhancer motifs likely associated with FeLV related pathogenesis have been identified.
Motif statistics and spike correlations in neuronal networks

International Nuclear Information System (INIS)

Hu, Yu; Shea-Brown, Eric; Trousdale, James; Josić, Krešimir

2013-01-01

Motifs are patterns of subgraphs of complex networks. We studied the impact of such patterns of connectivity on the level of correlated, or synchronized, spiking activity among pairs of cells in a recurrent network of integrate-and-fire neurons. For a range of network architectures, we find that the pairwise correlation coefficients, averaged across the network, can be closely approximated using only three statistics of network connectivity. These are the overall network connection probability and the frequencies of two second-order motifs: diverging motifs, in which one cell provides input to two others, and chain motifs, in which two cells are connected via a third intermediary cell. Specifically, the prevalence of diverging and chain motifs tends to increase correlation. Our method is based on linear response theory, which enables us to express spiking statistics using linear algebra, and a resumming technique, which extrapolates from second-order motifs to predict the overall effect of coupling on network correlation. Our motif-based results seek to isolate the effect of network architecture perturbatively from a known network state.
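As a rough illustration of the three connectivity statistics named in this abstract, the sketch below counts diverging and chain motifs in a small directed graph given as an adjacency matrix. The function name `motif_frequencies`, the convention that `adj[i][j] == 1` means cell j projects to cell i, and the normalization by the number of ordered triples are all assumptions made for this example; they are not taken from the paper, which may define its motif frequencies differently.

```python
from itertools import permutations

def motif_frequencies(adj):
    """Return (connection probability, diverging frequency, chain frequency).

    adj[i][j] == 1 means cell j sends input to cell i (assumed convention).
    Frequencies are normalized by the number of ordered triples of distinct cells.
    """
    n = len(adj)
    n_pairs = n * (n - 1)
    p = sum(adj[i][j] for i in range(n) for j in range(n) if i != j) / n_pairs

    diverging = 0  # k -> i and k -> j: one cell drives two others
    chain = 0      # j -> k -> i: two cells linked through an intermediary
    for i, j, k in permutations(range(n), 3):
        diverging += adj[i][k] * adj[j][k]
        chain += adj[i][k] * adj[k][j]

    n_triples = n * (n - 1) * (n - 2)
    return p, diverging / n_triples, chain / n_triples

# Cell 0 projects to cells 1 and 2 (a purely diverging arrangement).
fan_out = [[0, 0, 0],
           [1, 0, 0],
           [1, 0, 0]]
# Cell 0 -> cell 1 -> cell 2 (a pure chain).
relay = [[0, 0, 0],
         [1, 0, 0],
         [0, 1, 0]]
print(motif_frequencies(fan_out))
print(motif_frequencies(relay))
```

Counting over ordered triples means each diverging motif is seen twice (once per ordering of the two targets), which the shared normalization absorbs.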
Pumping RNA: nuclear bodybuilding along the RNP pipeline.

Science.gov (United States)

Matera, A Gregory; Shpargel, Karl B

2006-06-01

Cajal bodies (CBs) are nuclear subdomains involved in the biogenesis of several classes of small ribonucleoproteins (RNPs). A number of recent advances highlight progress in the understanding of the organization and dynamics of CB components. For example, a class of small Cajal body-specific (sca) RNPs has been discovered. Localization of scaRNPs to CBs was shown to depend on a conserved RNA motif. Intriguingly, this motif is also present in mammalian telomerase RNA and the evidence suggests that assembly of the active form of telomerase RNP occurs in and around CBs during S phase. Important steps in the assembly and modification of spliceosomal RNPs have also been shown to take place in CBs. Additional experiments have revealed the existence of kinetically distinct subclasses of CB components. Finally, the recent identification of novel markers for CBs in both Drosophila and Arabidopsis not only lays to rest questions about the evolutionary conservation of these nuclear suborganelles, but also should enable forward genetic screens for the identification of new components and pathways involved in their assembly, maintenance and function.
Bayesian centroid estimation for motif discovery.

Science.gov (United States)

Carvalho, Luis

2013-01-01

Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.
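The closing remark of this abstract, that a centroid estimator can differ from the maximum a posteriori (MAP) estimate, can be shown with a toy sketch. It assumes a plain Hamming loss over unconstrained binary site indicators, under which the centroid keeps every position whose posterior marginal exceeds 0.5. The paper's refined loss function may differ, and the sample data and function names here are hypothetical.

```python
from collections import Counter

def map_estimate(samples):
    """Most frequent whole configuration among the posterior samples."""
    return Counter(samples).most_common(1)[0][0]

def centroid_estimate(samples):
    """Minimize expected Hamming loss: keep positions with marginal > 0.5."""
    n = len(samples)
    length = len(samples[0])
    marginals = [sum(s[i] for s in samples) / n for i in range(length)]
    return tuple(int(m > 0.5) for m in marginals)

# Hypothetical posterior samples; 1 marks a position called as a binding site.
samples = [
    (1, 1, 0), (1, 1, 0), (1, 1, 0),  # the single most common configuration
    (0, 1, 1), (0, 1, 1),
    (1, 0, 1), (1, 0, 1),
]
print(map_estimate(samples))
print(centroid_estimate(samples))
```

In this toy data the MAP configuration is `(1, 1, 0)`, while every position has a marginal above one half, so the centroid is `(1, 1, 1)`, a configuration that never appears among the samples.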
Bayesian centroid estimation for motif discovery.

Directory of Open Access Journals (Sweden)

Luis Carvalho

Full Text Available Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.
Thermodynamic matchers for the construction of the cuckoo RNA family.

Science.gov (United States)

Reinkensmeier, Jan; Giegerich, Robert

2015-01-01

RNA family models describe classes of functionally related, non-coding RNAs based on sequence and structure conservation. The most important method for modeling RNA families is the use of covariance models, which are stochastic models that serve in the discovery of yet unknown, homologous RNAs. However, the performance of covariance models in finding remote homologs is poor for RNA families with high sequence conservation, while for families with high structure but low sequence conservation, these models are difficult to build in the first place. A complementary approach to RNA family modeling involves the use of thermodynamic matchers. Thermodynamic matchers are RNA folding programs, based on the established thermodynamic model, but tailored to a specific structural motif. As thermodynamic matchers focus on structure and folding energy, they show their potential in discovering homologs when high structure conservation is paired with low sequence conservation. In contrast to covariance models, construction of thermodynamic matchers does not require an input alignment, but requires human design decisions and experimentation, and hence, model construction is more laborious. Here we report a case study on an RNA family that was constructed by means of thermodynamic matchers. It starts from a set of known but structurally different members of the same RNA family. The consensus secondary structure of this family consists of 2 to 4 adjacent hairpins. Each hairpin loop carries the same motif, CCUCCUCCC, while the stems show high variability in their nucleotide content. The present study (1) describes a novel approach for the integration of the structurally varying family into a single RNA family model by means of the thermodynamic matcher methodology, and (2) provides the results of homology searches that were conducted with this model in a wide spectrum of bacterial species.
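To make the hairpin description above concrete, the following sketch scans an RNA sequence for the loop motif CCUCCUCCC flanked by a stem of complementary bases. It is a plain pattern search and not a thermodynamic matcher: no folding energies are computed. The assumptions that the loop consists of exactly the motif and that a stem needs at least four base pairs (`min_stem`) are introduced here for illustration only, as are the function names.

```python
MOTIF = "CCUCCUCCC"
# Watson-Crick pairs plus G-U wobble pairs.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}

def stem_length(seq, left, right):
    """Count consecutive base pairs extending outward from left and right."""
    n = 0
    while left >= 0 and right < len(seq) and (seq[left], seq[right]) in PAIRS:
        n += 1
        left -= 1
        right += 1
    return n

def find_motif_hairpins(seq, min_stem=4):
    """Return start positions of MOTIF occurrences closed by a stem."""
    hits = []
    start = seq.find(MOTIF)
    while start != -1:
        end = start + len(MOTIF)
        if stem_length(seq, start - 1, end) >= min_stem:
            hits.append(start)
        start = seq.find(MOTIF, start + 1)
    return hits

hairpin = "GGGAC" + MOTIF + "GUCCC"      # five complementary bases on each side
no_stem = "AAAAA" + MOTIF + "AAAAA"      # motif present, but no closing stem
print(find_motif_hairpins(hairpin), find_motif_hairpins(no_stem))
```

A real matcher would additionally score each candidate by folding energy and allow the motif to sit inside a larger loop.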

Short-hairpin RNA-mediated stable silencing of Grb2 impairs cell growth and DNA synthesis

International Nuclear Information System (INIS)

Di Fulvio, Mauricio; Henkels, Karen M.; Gomez-Cambronero, Julian

2007-01-01

Grb2 is an SH2-SH3 protein adaptor responsible for linking growth factor receptors with intracellular signaling cascades. To study the role of Grb2 in cell growth, we have generated a new COS7 cell line (COS7 shGrb2), based on RNAi technology, as null mutations in mammalian Grb2 genes are lethal in early development. This novel cell line continuously expresses a short hairpin RNA that targets endogenous Grb2. Stable COS7 shGrb2 cells had the shGrb2 integrated into the genomic DNA and carried on SiL construct (made refractory to the shRNA-mediated interference), but not with an SH2-deficient mutant (R86K). Thus, a viable knock-down and rescue protocol has demonstrated that Grb2 is crucial for cell proliferation.
Network motif frequency vectors reveal evolving metabolic network organisation.

Science.gov (United States)

Pearcy, Nicole; Crofts, Jonathan J; Chuzhanova, Nadia

2015-01-01

At the systems level many organisms of interest may be described by their patterns of interaction, and as such, are perhaps best characterised via network or graph models. Metabolic networks, in particular, are fundamental to the proper functioning of many important biological processes, and thus, have been widely studied over the past decade or so. Such investigations have revealed a number of shared topological features, such as a short characteristic path-length, large clustering coefficient and hierarchical modular structure. However, the extent to which evolutionary and functional properties of metabolism manifest via this underlying network architecture remains unclear. In this paper, we employ a novel graph embedding technique, based upon low-order network motifs, to compare metabolic network structure for 383 bacterial species categorised according to a number of biological features. In particular, we introduce a new global significance score which enables us to quantify important evolutionary relationships that exist between organisms and their physical environments. Using this new approach, we demonstrate a number of significant correlations between environmental factors, such as growth conditions and habitat variability, and network motif structure, providing evidence that organism adaptability leads to increased complexities in the resultant metabolic networks.
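The idea of embedding a network as a vector of low-order motif frequencies can be sketched as follows. This simplified version treats the graph as undirected and uses only the two connected three-node subgraphs (open path and triangle). The paper's embedding and its global significance score are not reproduced here, and the function names are hypothetical.

```python
from itertools import combinations
import math

def motif_vector(nodes, edges):
    """Fractions of connected 3-node subgraphs that are paths or triangles."""
    adj = {v: set() for v in nodes}
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    paths = triangles = 0
    for a, b, c in combinations(nodes, 3):
        links = (b in adj[a]) + (c in adj[a]) + (c in adj[b])
        if links == 2:
            paths += 1
        elif links == 3:
            triangles += 1
    total = paths + triangles
    return (paths / total, triangles / total) if total else (0.0, 0.0)

def motif_distance(u, v):
    """Euclidean distance between two motif frequency vectors."""
    return math.dist(u, v)

chain_net = motif_vector([0, 1, 2, 3], [(0, 1), (1, 2), (2, 3)])
dense_net = motif_vector([0, 1, 2, 3], [(0, 1), (1, 2), (0, 2), (2, 3)])
print(chain_net, dense_net, motif_distance(chain_net, dense_net))
```

Once every network is reduced to such a vector, networks of different sizes can be compared or clustered with ordinary vector distances.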
High-throughput SHAPE analysis reveals structures in HIV-1 genomic RNA strongly conserved across distinct biological states.

Directory of Open Access Journals (Sweden)

Kevin A Wilkinson

2008-04-01

Full Text Available Replication and pathogenesis of the human immunodeficiency virus (HIV) is tightly linked to the structure of its RNA genome, but genome structure in infectious virions is poorly understood. We invent high-throughput SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) technology, which uses many of the same tools as DNA sequencing, to quantify RNA backbone flexibility at single-nucleotide resolution and from which robust structural information can be immediately derived. We analyze the structure of HIV-1 genomic RNA in four biologically instructive states, including the authentic viral genome inside native particles. Remarkably, given the large number of plausible local structures, the first 10% of the HIV-1 genome exists in a single, predominant conformation in all four states. We also discover that noncoding regions functioning in a regulatory role have significantly lower (p-value < 0.0001) SHAPE reactivities, and hence more structure, than do viral coding regions that function as the template for protein synthesis. By directly monitoring protein binding inside virions, we identify the RNA recognition motif for the viral nucleocapsid protein. Seven structurally homologous binding sites occur in a well-defined domain in the genome, consistent with a role in directing specific packaging of genomic RNA into nascent virions. In addition, we identify two distinct motifs that are targets for the duplex destabilizing activity of this same protein. The nucleocapsid protein destabilizes local HIV-1 RNA structure in ways likely to facilitate initial movement both of the retroviral reverse transcriptase from its tRNA primer and of the ribosome in coding regions. Each of the three nucleocapsid interaction motifs falls in a specific genome domain, indicating that local protein interactions can be organized by the long-range architecture of an RNA. High-throughput SHAPE reveals a comprehensive view of HIV-1 RNA genome structure, and further
Computer-Aided Design of RNA Origami Structures.

Science.gov (United States)

Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

2017-01-01

RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices. In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.
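What it means for a sequence to be "compatible with the structure" in the third step can be sketched with a minimal check. The example assumes the 2D blueprint is written in dot-bracket notation and that only Watson-Crick and G-U wobble pairs count as valid. Both are assumptions made here, since the abstract does not name the blueprint format or the design programs, and plain dot-bracket strings cannot express pseudoknotted contacts.

```python
CANONICAL = {"AU", "UA", "GC", "CG", "GU", "UG"}

def pair_table(structure):
    """Map each opening position to its closing partner in a dot-bracket string."""
    stack, pairs = [], {}
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced ')' at position %d" % i)
            pairs[stack.pop()] = i
        elif ch != ".":
            raise ValueError("unexpected character %r" % ch)
    if stack:
        raise ValueError("unbalanced '(' at position %d" % stack[-1])
    return pairs

def is_compatible(seq, structure):
    """True if every paired position in structure holds a canonical base pair."""
    if len(seq) != len(structure):
        return False
    return all(seq[i] + seq[j] in CANONICAL
               for i, j in pair_table(structure).items())

print(is_compatible("GGGAAAACCC", "(((....)))"))
print(is_compatible("GGGAAAAAAA", "(((....)))"))
```

Compatibility is only a necessary condition: a designed sequence must also fold into the target structure in preference to alternatives, which is what the design programs mentioned in the abstract evaluate.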
Analysis of genetic polymorphism of nine short tandem repeat loci in ...

African Journals Online (AJOL)

Yomi

2012-03-15

Mar 15, 2012 ... Key words: short tandem repeat, repeat motif, genetic polymorphism, Han population, forensic genetics. INTRODUCTION. Short tandem repeat (STR) is widely .... Data analysis. The exact test of Hardy-Weinberg equilibrium was conducted with. Arlequin version 3.5 software (Computational and Molecular.
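For orientation on the Hardy-Weinberg check mentioned in this snippet, the sketch below compares observed heterozygosity at a single STR locus with the value expected under Hardy-Weinberg proportions. It is not the exact test performed by Arlequin. The genotype data are invented, and alleles are labeled by repeat number purely for illustration.

```python
from collections import Counter

def heterozygosity(genotypes):
    """Return (observed, expected) heterozygosity for one locus.

    genotypes: list of (allele, allele) pairs, one pair per individual.
    Expected heterozygosity is 1 - sum(p_i ** 2) over allele frequencies p_i.
    """
    n = len(genotypes)
    allele_counts = Counter(a for g in genotypes for a in g)
    total = 2 * n
    expected = 1.0 - sum((c / total) ** 2 for c in allele_counts.values())
    observed = sum(1 for a, b in genotypes if a != b) / n
    return observed, expected

# Hypothetical genotypes at one locus (repeat numbers 9 and 10).
locus = [(9, 9), (9, 10), (10, 10), (9, 10)]
print(heterozygosity(locus))
```

A large gap between the two values is what a formal test, such as the exact test cited in the snippet, would assess for significance.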
CONTEMPORARY USAGE OF TRADITIONAL TURKISH MOTIFS IN PRODUCT DESIGNS

Directory of Open Access Journals (Sweden)

Tulay Gumuser

2012-12-01

Full Text Available The aim of this study is to identify traditional Turkish motifs and their relation to present-day industrial designs. Traditional Turkish motifs played a very important role from the 16th century onwards. The arts of the Ottoman Empire were used because of their symbolic meanings and unique styles. When we examine these motifs, we encounter the Tiger Stripe, Three Spot (Çintemani), Rumi, Hatayi, Penç, Cloud, Crescent, Star, Crown, Hyacinth, Tulip and Carnation motifs. Nowadays, Turkish designers have begun to use these traditional Turkish motifs in their designs so as to create differences and awareness in world design. The examples of these industrial designs, using the Turkish motifs, have survived and have Ottoman heritage and historical value. In this study, the Turkish motifs will be examined with a focus on their use in contemporary Turkish industrial designs.
Matrin 3 binds and stabilizes mRNA.

Directory of Open Access Journals (Sweden)

Maayan Salton

Matrin 3 (MATR3) is a highly conserved, inner nuclear matrix protein with two zinc finger domains and two RNA recognition motifs (RRMs), whose function is largely unknown. Recently we found MATR3 to be phosphorylated by the protein kinase ATM, which activates the cellular response to double strand breaks in the DNA. Here, we show that MATR3 interacts in an RNA-dependent manner with several proteins with established roles in RNA processing, and maintains its interaction with RNA via its RRM2 domain. Deep sequencing of the bound RNA (RIP-seq) identified several small noncoding RNA species. Using microarray analysis to explore MATR3's role in transcription, we identified 77 transcripts whose amounts depended on the presence of MATR3. We validated this finding with nine transcripts which were also bound to the MATR3 complex. Finally, we demonstrated the importance of MATR3 for maintaining the stability of several of these mRNA species and conclude that it has a role in mRNA stabilization. The data suggest that the cellular level of MATR3, known to be highly regulated, modulates the stability of a group of gene transcripts.
Development of a software tool and criteria evaluation for efficient design of small interfering RNA

International Nuclear Information System (INIS)

Chaudhary, Aparna; Srivastava, Sonam; Garg, Sanjeev

2011-01-01

Research highlights: → The developed tool predicted siRNA constructs with better thermodynamic stability and total score based on positional and other criteria. → Off-target silencing below score 30 were observed for the best siRNA constructs for different genes. → Immunostimulation and cytotoxicity motifs considered and penalized in the developed tool. → Both positional and compositional criteria were observed to be important. -- Abstract: RNA interference can be used as a tool for gene silencing mediated by small interfering RNAs (siRNA). The critical step in effective and specific RNAi processing is the selection of suitable constructs. Major design criteria, i.e., Reynolds's design rules, thermodynamic stability, internal repeats, immunostimulatory motifs were emphasized and implemented in the siRNA design tool. The tool provides thermodynamic stability score, GC content and a total score based on other design criteria in the output. The viability of the tool was established with different datasets. In general, the siRNA constructs produced by the tool had better thermodynamic score and positional properties. Comparable thermodynamic scores and better total scores were observed with the existing tools. Moreover, the results generated had comparable off-target silencing effect. Criteria evaluations with additional criteria were achieved in WEKA.
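The kind of compositional criteria named in this abstract (GC content, penalized motifs) can be illustrated with a toy scoring function. This is a minimal sketch only: the GC window, the motif list, and the weights below are assumptions chosen for illustration and are not the values or the scoring scheme of the tool described in the record.

```python
# Toy siRNA candidate scoring. All thresholds, motifs and weights are
# illustrative assumptions, not taken from the tool in the abstract.
PENALIZED_MOTIFS = ("UGUGU", "GUCCUUCAA")  # assumed example motifs


def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases in the sequence."""
    seq = seq.upper()
    return sum(base in "GC" for base in seq) / len(seq)


def toy_score(seq: str, gc_low: float = 0.30, gc_high: float = 0.52) -> int:
    """Add 1 for GC content inside the window, subtract 1 per penalized motif."""
    seq = seq.upper().replace("T", "U")
    score = 0
    if gc_low <= gc_content(seq) <= gc_high:
        score += 1
    score -= sum(motif in seq for motif in PENALIZED_MOTIFS)
    return score
```

A real design tool would combine many more criteria (positional rules, thermodynamic stability, off-target checks); the sketch only shows how a compositional rule and a motif penalty can be folded into one score.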
Computational assessment of the cooperativity between RNA binding proteins and MicroRNAs in Transcript Decay.

Science.gov (United States)

Jiang, Peng; Singh, Mona; Coller, Hilary A

2013-01-01

Transcript degradation is a widespread and important mechanism for regulating protein abundance. Two major regulators of transcript degradation are RNA Binding Proteins (RBPs) and microRNAs (miRNAs). We computationally explored whether RBPs and miRNAs cooperate to promote transcript decay. We defined five RBP motifs based on the evolutionary conservation of their recognition sites in 3'UTRs as the binding motifs for Pumilio (PUM), U1A, Fox-1, Nova, and UAUUUAU. Recognition sites for some of these RBPs tended to localize at the end of long 3'UTRs. A specific group of miRNA recognition sites were enriched within 50 nts from the RBP recognition sites for PUM and UAUUUAU. The presence of both a PUM recognition site and a recognition site for preferentially co-occurring miRNAs was associated with faster decay of the associated transcripts. For PUM and its co-occurring miRNAs, binding of the RBP to its recognition sites was predicted to release nearby miRNA recognition sites from RNA secondary structures. The mammalian miRNAs that preferentially co-occur with PUM binding sites have recognition seeds that are reverse complements to the PUM recognition motif. Their binding sites have the potential to form hairpin secondary structures with proximal PUM binding sites that would normally limit RISC accessibility, but would be more accessible to miRNAs in response to the binding of PUM. In sum, our computational analyses suggest that a specific set of RBPs and miRNAs work together to affect transcript decay, with the rescue of miRNA recognition sites via RBP binding as one possible mechanism of cooperativity.
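The co-occurrence test described in this abstract (miRNA recognition sites within 50 nt of an RBP site) can be sketched as a simple proximity search. The sketch below assumes distances are measured between site start positions, and the miRNA site string used in the usage example is hypothetical; only the UAUUUAU motif and the 50-nt window come from the abstract.

```python
def find_sites(seq: str, motif: str) -> list[int]:
    """Return 0-based start positions of all (possibly overlapping) motif hits."""
    sites = []
    start = seq.find(motif)
    while start != -1:
        sites.append(start)
        start = seq.find(motif, start + 1)
    return sites


def sites_near(rbp_sites: list[int], mirna_sites: list[int], window: int = 50) -> list[int]:
    """Return miRNA site starts lying within `window` nt of any RBP site start."""
    return [m for m in mirna_sites
            if any(abs(m - r) <= window for r in rbp_sites)]


# Usage with a made-up 3'UTR; "GCACUUU" is a hypothetical miRNA target site.
utr = "A" * 10 + "UAUUUAU" + "A" * 20 + "GCACUUU" + "A" * 100 + "GCACUUU"
rbp = find_sites(utr, "UAUUUAU")
mirna = find_sites(utr, "GCACUUU")
nearby = sites_near(rbp, mirna)
```

Enrichment, as reported in the abstract, would additionally require comparing such counts against a background expectation, which is not shown here.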
RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

Science.gov (United States)

Moss, Walter N

2018-03-01

With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis- and trans-regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
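The enumeration step described in this abstract (every possible point mutation, optionally constrained to specific residues or to purine/pyrimidine-preserving changes) can be sketched as follows. The function name and parameters are hypothetical and are not RNA2DMut's interface; scoring each mutant against the Boltzmann structure ensemble would require an RNA folding package and is not shown.

```python
BASES = "ACGU"
PURINES = set("AG")


def point_mutants(seq, positions=None, same_class=False):
    """Yield (index, new_base, mutant_sequence) for single-base substitutions.

    positions:  optional iterable of 0-based indices to restrict mutation to.
    same_class: if True, keep only purine->purine and pyrimidine->pyrimidine
                changes.
    """
    seq = seq.upper()
    if positions is None:
        positions = range(len(seq))
    for i in positions:
        for base in BASES:
            if base == seq[i]:
                continue  # not a mutation
            if same_class and (base in PURINES) != (seq[i] in PURINES):
                continue  # would switch purine <-> pyrimidine
            yield i, base, seq[:i] + base + seq[i + 1:]
```

For a sequence of length n there are 3n unconstrained point mutants, and n when changes are limited to the same base class, which is why restricting residues or classes shrinks the search considerably.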
Transcription-factor occupancy at HOT regions quantitatively predicts RNA polymerase recruitment in five human cell lines.

KAUST Repository

Foley, Joseph W; Sidow, Arend

2013-01-01

BACKGROUND: High-occupancy target (HOT) regions are compact genome loci occupied by many different transcription factors (TFs). HOT regions were initially defined in invertebrate model organisms, and we here show that they are a ubiquitous feature of the human gene-regulation landscape. RESULTS: We identified HOT regions by a comprehensive analysis of ChIP-seq data from 96 DNA-associated proteins in 5 human cell lines. Most HOT regions co-localize with RNA polymerase II binding sites, but many are not near the promoters of annotated genes. At HOT promoters, TF occupancy is strongly predictive of transcription preinitiation complex recruitment and moderately predictive of initiating Pol II recruitment, but only weakly predictive of elongating Pol II and RNA transcript abundance. TF occupancy varies quantitatively within human HOT regions; we used this variation to discover novel associations between TFs. The sequence motif associated with any given TF's direct DNA binding is somewhat predictive of its empirical occupancy, but a great deal of occupancy occurs at sites without the TF's motif, implying indirect recruitment by another TF whose motif is present. CONCLUSIONS: Mammalian HOT regions are regulatory hubs that integrate the signals from diverse regulatory pathways to quantitatively tune the promoter for RNA polymerase II recruitment.
Transcription-factor occupancy at HOT regions quantitatively predicts RNA polymerase recruitment in five human cell lines.

KAUST Repository

Foley, Joseph W

2013-10-20

BACKGROUND: High-occupancy target (HOT) regions are compact genome loci occupied by many different transcription factors (TFs). HOT regions were initially defined in invertebrate model organisms, and we here show that they are a ubiquitous feature of the human gene-regulation landscape. RESULTS: We identified HOT regions by a comprehensive analysis of ChIP-seq data from 96 DNA-associated proteins in 5 human cell lines. Most HOT regions co-localize with RNA polymerase II binding sites, but many are not near the promoters of annotated genes. At HOT promoters, TF occupancy is strongly predictive of transcription preinitiation complex recruitment and moderately predictive of initiating Pol II recruitment, but only weakly predictive of elongating Pol II and RNA transcript abundance. TF occupancy varies quantitatively within human HOT regions; we used this variation to discover novel associations between TFs. The sequence motif associated with any given TF's direct DNA binding is somewhat predictive of its empirical occupancy, but a great deal of occupancy occurs at sites without the TF's motif, implying indirect recruitment by another TF whose motif is present. CONCLUSIONS: Mammalian HOT regions are regulatory hubs that integrate the signals from diverse regulatory pathways to quantitatively tune the promoter for RNA polymerase II recruitment.
SiteBinder: an improved approach for comparing multiple protein structural motifs.

Science.gov (United States)

Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

2012-02-27

There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.
Evolutionary relationships in the ilarviruses: nucleotide sequence of prunus necrotic ringspot virus RNA 3.

Science.gov (United States)

Sánchez-Navarro, J A; Pallás, V

1997-01-01

The complete nucleotide sequence of an isolate of prunus necrotic ringspot virus (PNRSV) RNA 3 has been determined. Elucidation of the amino acid sequence of the proteins encoded by the two large open reading frames (ORFs) allowed us to carry out comparative and phylogenetic studies on the movement (MP) and coat (CP) proteins in the ilarvirus group. Amino acid sequence comparison of the MP revealed a highly conserved basic sequence motif with an amphipathic alpha-helical structure preceding the conserved motif of the '30K superfamily' proposed by Mushegian and Koonin [26] for MP's. Within this '30K' motif a strictly conserved transmembrane domain is present in all ilarviruses sequenced so far. At the amino-terminal end, prune dwarf virus (PDV) has an extension not present in other ilarviruses but which is observed in all bromo- and cucumoviruses, suggesting a common ancestor or a recombinational event in the Bromoviridae family. Examination of the N-terminus of the CP's of all ilarviruses revealed a highly basic region, part of which resembles the Arg-rich motif that has been characterized in the RNA-binding protein family. This motif has also been found in the other members of the Bromoviridae family, suggesting its involvement in a structural function. Furthermore this region is required for infectivity in ilarviruses. The similarities found in this Arg-rich motif are discussed in terms of this process known as genome activation. Finally, phylogenetic analysis of both the MP and CP proteins revealed a higher relationship of AlMV to PNRSV, apple mosaic virus (ApMV) and PDV than any other member of the ilarvirus group. In that sense, AlMV should be considered as a true ilarvirus instead of forming a distinct group of viruses.
Analisis Unsur Matematika pada Motif Sulam Usus

Directory of Open Access Journals (Sweden)

Fredi Ganda Putra

2017-12-01

According to interviews with sources, intestine embroidery (sulam usus) began as a genuine craft art. It is called intestine embroidery because the technique joins strands of cloth resembling intestines, shaped according to a pattern and embroidered together with thread. The technique was originally used to make a covering for the traditional women's attire of Lampung, often referred to as bebe. However, many people, including many who live in Lampung, do not know or recognize intestine embroidery, because most know only tapis as characteristic of Lampung, although intestine embroidery is another of its cultural products. Many are also unaware that the intestine embroidery motif embodies mathematical knowledge. The research question is whether the intestine embroidery motif contains mathematical elements based on the concept of geometry, and the purpose of this study is to determine whether it does. The subjects of this study were 4 people selected by purposive sampling. Descriptive analysis of the data and the discussion yielded the following: (1) The intestine embroidery motif carries both mathematical and cultural meaning, often called ethnomathematics (Etnomatematika). In terms of cultural meaning, intestine embroidery is linked to earlier cultures, such as Hindu-Buddhist beliefs, and its motifs and decorative patterns resemble ornamental varieties found elsewhere in Indonesia. (2) In terms of the relationship between the intestine embroidery motif and mathematics, there are mathematical elements such as geometric elements in the form of one-dimensional and two-dimensional geometry, and the
Complete motif analysis of sequence requirements for translation initiation at non-AUG start codons.

Science.gov (United States)

Diaz de Arce, Alexander J; Noderer, William L; Wang, Clifford L

2018-01-25

The initiation of mRNA translation from start codons other than AUG was previously believed to be rare and of relatively low impact. More recently, evidence has suggested that as much as half of all translation initiation utilizes non-AUG start codons, codons that deviate from AUG by a single base. Furthermore, non-AUG start codons have been shown to be involved in regulation of expression and disease etiology. Yet the ability to gauge expression based on the sequence of a translation initiation site (start codon and its flanking bases) has been limited. Here we have performed a comprehensive analysis of translation initiation sites that utilize non-AUG start codons. By combining genetic-reporter, cell-sorting, and high-throughput sequencing technologies, we have analyzed the expression associated with all possible variants of the -4 to +4 positions of non-AUG translation initiation site motifs. This complete motif analysis revealed that 1) with the right sequence context, certain non-AUG start codons can generate expression comparable to that of AUG start codons, 2) sequence context affects each non-AUG start codon differently, and 3) initiation at non-AUG start codons is highly sensitive to changes in the flanking sequences. Complete motif analysis has the potential to be a key tool for experimental and diagnostic genomics. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
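The search space in this abstract can be made concrete with a short enumeration. The sketch below lists the codons that differ from AUG by a single base and generates every flanking context around a given start codon. It assumes that "the -4 to +4 positions" means four upstream bases (-4 to -1) plus one downstream base (+4), with the codon itself at +1 to +3; that reading and the function names are assumptions, not details stated in the record.

```python
from itertools import product

BASES = "ACGU"


def near_cognate_codons(ref: str = "AUG") -> list[str]:
    """Return all codons that differ from `ref` at exactly one position."""
    return sorted(
        ref[:i] + base + ref[i + 1:]
        for i in range(len(ref))
        for base in BASES
        if base != ref[i]
    )


def tis_variants(codon: str):
    """Yield every 8-nt site: 4 upstream bases + codon + 1 downstream base."""
    for flank in product(BASES, repeat=5):
        upstream, downstream = "".join(flank[:4]), flank[4]
        yield upstream + codon + downstream


codons = near_cognate_codons()
n_contexts = sum(1 for _ in tis_variants("CUG"))
```

Under these assumptions there are 9 near-cognate codons and 4^5 = 1024 flanking contexts per codon, which indicates the scale of library a complete motif analysis has to cover.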
Motif signatures of transcribed enhancers

KAUST Repository

Kleftogiannis, Dimitrios

2017-09-14

In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in a spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancer genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow successful discrimination of a) TrEn from random controls, a proxy for non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.
Short-term calorie restriction feminizes the mRNA profiles of drug metabolizing enzymes and transporters in livers of mice

International Nuclear Information System (INIS)

Fu, Zidong Donna; Klaassen, Curtis D.

2014-01-01

Calorie restriction (CR) is one of the most effective anti-aging interventions in mammals. A modern theory suggests that aging results from a decline in detoxification capabilities and thus accumulation of damaged macromolecules. The present study aimed to determine how short-term CR alters mRNA profiles of genes that encode metabolism and detoxification machinery in the liver. Male C57BL/6 mice were fed CR (0, 15, 30, or 40%) diets for one month, followed by mRNA quantification of 98 xenobiotic processing genes (XPGs) in the liver, including 7 uptake transporters, 39 phase-I enzymes, 37 phase-II enzymes, 10 efflux transporters, and 5 transcription factors. In general, 15% CR did not alter mRNAs of most XPGs, whereas 30 and 40% CR altered over half of the XPGs (32 increased and 29 decreased). CR up-regulated some phase-I enzymes (fold increase), such as Cyp4a14 (12), Por (2.3), Nqo1 (1.4), Fmo2 (5.4), and Fmo3 (346), and numerous phase-II enzymes, such as Sult1a1 (1.2), Sult1d1 (2.0), Sult1e1 (33), Sult3a1 (2.2), Gsta4 (1.3), Gstm2 (1.3), Gstm3 (1.7), and Mgst3 (2.2). CR feminized the mRNA profiles of 32 XPGs in livers of male mice. For instance, CR decreased the male-predominantly expressed Oatp1a1 (97%) and increased the female-predominantly expressed Oatp1a4 (11). In conclusion, short-term CR alters the mRNA levels of over half of the 98 XPGs quantified in livers of male mice, and over half of these alterations appear to be due to feminization of the liver. - Highlights: • Utilized a graded CR model in male mice • The mRNA profiles of xenobiotic processing genes (XPGs) in liver were investigated. • CR up-regulates many phase-II enzymes. • CR tends to feminize the mRNA profiles of XPGs
Short-term calorie restriction feminizes the mRNA profiles of drug metabolizing enzymes and transporters in livers of mice

Energy Technology Data Exchange (ETDEWEB)

Fu, Zidong Donna [Department of Pharmacology, Toxicology, and Therapeutics, University of Kansas Medical Center, Kansas City, KS 66160 (United States); Klaassen, Curtis D., E-mail: cklaasse@kumc.edu [Department of Internal Medicine, University of Kansas Medical Center, Kansas City, KS 66160 (United States)

2014-01-01

Calorie restriction (CR) is one of the most effective anti-aging interventions in mammals. A modern theory suggests that aging results from a decline in detoxification capabilities and thus accumulation of damaged macromolecules. The present study aimed to determine how short-term CR alters mRNA profiles of genes that encode metabolism and detoxification machinery in the liver. Male C57BL/6 mice were fed CR (0, 15, 30, or 40%) diets for one month, followed by mRNA quantification of 98 xenobiotic processing genes (XPGs) in the liver, including 7 uptake transporters, 39 phase-I enzymes, 37 phase-II enzymes, 10 efflux transporters, and 5 transcription factors. In general, 15% CR did not alter mRNAs of most XPGs, whereas 30 and 40% CR altered over half of the XPGs (32 increased and 29 decreased). CR up-regulated some phase-I enzymes (fold increase), such as Cyp4a14 (12), Por (2.3), Nqo1 (1.4), Fmo2 (5.4), and Fmo3 (346), and numerous phase-II enzymes, such as Sult1a1 (1.2), Sult1d1 (2.0), Sult1e1 (33), Sult3a1 (2.2), Gsta4 (1.3), Gstm2 (1.3), Gstm3 (1.7), and Mgst3 (2.2). CR feminized the mRNA profiles of 32 XPGs in livers of male mice. For instance, CR decreased the male-predominantly expressed Oatp1a1 (97%) and increased the female-predominantly expressed Oatp1a4 (11). In conclusion, short-term CR alters the mRNA levels of over half of the 98 XPGs quantified in livers of male mice, and over half of these alterations appear to be due to feminization of the liver. - Highlights: • Utilized a graded CR model in male mice • The mRNA profiles of xenobiotic processing genes (XPGs) in liver were investigated. • CR up-regulates many phase-II enzymes. • CR tends to feminize the mRNA profiles of XPGs.
Structural basis underlying CAC RNA recognition by the RRM domain of dimeric RNA-binding protein RBPMS

Energy Technology Data Exchange (ETDEWEB)

Teplova, Marianna; Farazi, Thalia A.; Tuschl, Thomas; Patel, Dinshaw J.

2015-09-08

RNA-binding protein with multiple splicing (designated RBPMS) is a higher vertebrate mRNA-binding protein containing a single RNA recognition motif (RRM). RBPMS has been shown to be involved in mRNA transport, localization and stability, with key roles in axon guidance, smooth muscle plasticity, as well as regulation of cancer cell proliferation and migration. We report on structure-function studies of the RRM domain of RBPMS bound to a CAC-containing single-stranded RNA. These results provide insights into potential topologies of complexes formed by the RBPMS RRM domain and the tandem CAC repeat binding sites as detected by photoactivatable-ribonucleoside-enhanced crosslinking and immunoprecipitation. These studies establish that the RRM domain of RBPMS forms a symmetrical dimer in the free state, with each monomer binding sequence-specifically to all three nucleotides of a CAC segment in the RNA bound state. Structure-guided mutations within the dimerization and RNA-binding interfaces of RBPMS RRM on RNA complex formation resulted in both disruption of dimerization and a decrease in RNA-binding affinity as observed by size exclusion chromatography and isothermal titration calorimetry. As anticipated from biochemical binding studies, over-expression of dimerization or RNA-binding mutants of Flag-HA-tagged RBPMS were no longer able to track with stress granules in HEK293 cells, thereby documenting the deleterious effects of such mutations in vivo.

SLiMScape 3.x: a Cytoscape 3 app for discovery of Short Linear Motifs in protein interaction networks [version 1; referees: 2 approved]

Directory of Open Access Journals (Sweden)

Emily Olorin

2015-08-01

Short linear motifs (SLiMs) are small protein sequence patterns that mediate a large number of critical protein-protein interactions, involved in processes such as complex formation, signal transduction, localisation and stabilisation. SLiMs show rapid evolutionary dynamics and are frequently the targets of molecular mimicry by pathogens. Identifying enriched sequence patterns due to convergent evolution in non-homologous proteins has proven to be a successful strategy for computational SLiM prediction. Tools of the SLiMSuite package use this strategy, using a statistical model to identify SLiM enrichment based on the evolutionary relationships, amino acid composition and predicted disorder of the input proteins. The quality of input data is critical for successful SLiM prediction. Cytoscape provides a user-friendly, interactive environment to explore interaction networks and select proteins based on common features, such as shared interaction partners. SLiMScape embeds tools of the SLiMSuite package for de novo SLiM discovery (SLiMFinder and QSLiMFinder) and identifying occurrences/enrichment of known SLiMs (SLiMProb) within this interactive framework. SLiMScape makes it easier to (1) generate high quality hypothesis-driven datasets for these tools, and (2) visualise predicted SLiM occurrences within the context of the network. To generate new predictions, users can select nodes from a protein network or provide a set of Uniprot identifiers. SLiMProb also requires additional query motif input. Jobs are then run remotely on the SLiMSuite server (http://rest.slimsuite.unsw.edu.au) for subsequent retrieval and visualisation. SLiMScape can also be used to retrieve and visualise results from jobs run directly on the server. SLiMScape and SLiMSuite are open source and freely available via GitHub under GNU licenses.
Cellular La protein shields nonsegmented negative-strand RNA viral leader RNA from RIG-I and enhances virus growth by diverse mechanisms.

Science.gov (United States)

Bitko, Vira; Musiyenko, Alla; Bayfield, Mark A; Maraia, Richard J; Barik, Sailen

2008-08-01

The La antigen (SS-B) associates with a wide variety of cellular and viral RNAs to affect gene expression in multiple systems. We show that La is the major cellular protein found to be associated with the abundant 44-nucleotide viral leader RNA (leRNA) early after infection with respiratory syncytial virus (RSV), a nonsegmented negative-strand RNA virus. Consistent with this, La redistributes from the nucleus to the cytoplasm in RSV-infected cells. Upon RNA interference knockdown of La, leRNA is redirected to associate with the RNA-binding protein RIG-I, a known activator of interferon (IFN) gene expression, and this is accompanied by the early induction of IFN mRNA. These results suggest that La shields leRNA from RIG-I, abrogating the early viral activation of type I IFN. We mapped the leRNA binding function to RNA recognition motif 1 of La and showed that while wild-type La greatly enhanced RSV growth, a La mutant defective in RSV leRNA binding also did not support RSV growth. Comparative studies of RSV and Sendai virus and the use of IFN-negative Vero cells indicated that La supports the growth of nonsegmented negative-strand RNA viruses by both IFN suppression and a potentially novel IFN-independent mechanism.
APOBEC3G inhibits HIV-1 RNA elongation by inactivating the viral trans-activation response element.

Science.gov (United States)

Nowarski, Roni; Prabhu, Ponnandy; Kenig, Edan; Smith, Yoav; Britan-Rosich, Elena; Kotler, Moshe

2014-07-29

Deamination of cytidine residues in viral DNA is a major mechanism by which APOBEC3G (A3G) inhibits vif-deficient human immunodeficiency virus type 1 (HIV-1) replication. dC-to-dU transition following RNase-H activity leads to viral cDNA degradation, production of non-functional proteins, formation of undesired stop codons and decreased viral protein synthesis. Here, we demonstrate that A3G provides an additional layer of defense against HIV-1 infection dependent on inhibition of proviral transcription. HIV-1 transcription elongation is regulated by the trans-activation response (TAR) element, a short stem-loop RNA structure required for elongation factors binding. Vif-deficient HIV-1-infected cells accumulate short viral transcripts and produce lower amounts of full-length HIV-1 transcripts due to A3G deamination of the TAR apical loop cytidine, highlighting the requirement for TAR loop integrity in HIV-1 transcription. We further show that free single-stranded DNA (ssDNA) termini are not essential for A3G activity and a gap of CCC motif blocked with juxtaposed DNA or RNA on either or 3'+5' ends is sufficient for A3G deamination. These results identify A3G as an efficient mutator and that deamination of (-)SSDNA results in an early block of HIV-1 transcription. Copyright © 2014 Elsevier Ltd. All rights reserved.
A ΩXaV motif in the Rift Valley fever virus NSs protein is essential for degrading p62, forming nuclear filaments and virulence.

Science.gov (United States)

Cyr, Normand; de la Fuente, Cynthia; Lecoq, Lauriane; Guendel, Irene; Chabot, Philippe R; Kehn-Hall, Kylene; Omichinski, James G

2015-05-12

Rift Valley fever virus (RVFV) is a single-stranded RNA virus capable of inducing fatal hemorrhagic fever in humans. A key component of RVFV virulence is its ability to form nuclear filaments through interactions between the viral nonstructural protein NSs and the host general transcription factor TFIIH. Here, we identify an interaction between a ΩXaV motif in NSs and the p62 subunit of TFIIH. This motif in NSs is similar to ΩXaV motifs found in nucleotide excision repair (NER) factors and transcription factors known to interact with p62. Structural and biophysical studies demonstrate that NSs binds to p62 in a similar manner as these other factors. Functional studies in RVFV-infected cells show that the ΩXaV motif is required for both nuclear filament formation and degradation of p62. Consistent with the fact that the RVFV can be distinguished from other Bunyaviridae-family viruses due to its ability to form nuclear filaments in infected cells, the motif is absent in the NSs proteins of other Bunyaviridae-family viruses. Taken together, our studies demonstrate that p62 binding to NSs through the ΩXaV motif is essential for degrading p62, forming nuclear filaments and enhancing RVFV virulence. In addition, these results show how the RVFV incorporates a simple motif into the NSs protein that enables it to functionally mimic host cell proteins that bind the p62 subunit of TFIIH.
Prediction and Dissection of Protein-RNA Interactions by Molecular Descriptors.

Science.gov (United States)

Liu, Zhi-Ping; Chen, Luonan

2016-01-01

Protein-RNA interactions play crucial roles in numerous biological processes. However, detecting the interactions and binding sites between protein and RNA by traditional experiments is still time consuming and labor intensive. Thus, it is important to develop bioinformatics methods for predicting protein-RNA interactions and binding sites. Accurate prediction of protein-RNA interactions and recognition will greatly help to decipher the interaction mechanisms between protein and RNA, as well as to improve RNA-related protein engineering and drug design. In this work, we summarize the current bioinformatics strategies for predicting protein-RNA interactions and dissecting protein-RNA interaction mechanisms from local structure binding motifs. In particular, we focus on feature-based machine learning methods, in which the molecular descriptors of protein and RNA are extracted and integrated as feature vectors representing the interaction events and recognition residues. In addition, the available methods are classified and compared comprehensively. The molecular descriptors are expected to elucidate the binding mechanisms of protein-RNA interaction and reveal the functional implications from a structural complementarity perspective.
Triadic motifs in the dependence networks of virtual societies

Science.gov (United States)

Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

2014-06-01

In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.
Direct AUC optimization of regulatory motifs.

Science.gov (United States)

Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

2017-07-15

The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the large number of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have proven promising for harnessing the discovery power of the large amount of accumulated high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information in the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real-world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequence specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8. Contact: dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.
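
The AUC criterion named in this abstract can be illustrated with a small sketch. It assumes each sequence is scored by its best-matching window under a log-odds position weight matrix; the function names and the toy matrix are hypothetical and are not taken from CDAUC.

```python
def best_window_score(seq, pwm):
    """Highest PWM score over all windows whose width equals the motif width."""
    width = len(pwm)
    return max(
        sum(pwm[i][seq[start + i]] for i in range(width))
        for start in range(len(seq) - width + 1)
    )


def auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ranked correctly; ties count one half."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Toy log-odds matrix favouring the motif "ACG" (illustrative values only).
toy_pwm = [
    {"A": 1.0, "C": -1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": 1.0, "G": -1.0, "T": -1.0},
    {"A": -1.0, "C": -1.0, "G": 1.0, "T": -1.0},
]
bound = ["TTACGTT", "ACGAAAA"]      # hypothetical positive sequences
unbound = ["TTTTTTT", "AAGAAAA"]    # hypothetical negative sequences

pos = [best_window_score(s, toy_pwm) for s in bound]
neg = [best_window_score(s, toy_pwm) for s in unbound]
motif_auc = auc(pos, neg)
```

Because this AUC depends on the scores only through their relative ordering, changing a single matrix entry alters it only when some positive/negative pair swaps order, which is consistent with the piece-wise constant sub-problems the abstract describes.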
Molecular basis for the wide range of affinity found in Csr/Rsm protein-RNA recognition.

Science.gov (United States)

Duss, Olivier; Michel, Erich; Diarra dit Konté, Nana; Schubert, Mario; Allain, Frédéric H-T

2014-04-01

The carbon storage regulator/regulator of secondary metabolism (Csr/Rsm) type of small non-coding RNAs (sRNAs) is widespread throughout bacteria and acts by sequestering the global translation repressor protein CsrA/RsmE from the ribosome binding site of a subset of mRNAs. Although we have previously described the molecular basis of a high affinity RNA target bound to RsmE, it remains unknown how other lower affinity targets are recognized by the same protein. Here, we have determined the nuclear magnetic resonance solution structures of five separate GGA binding motifs of the sRNA RsmZ of Pseudomonas fluorescens in complex with RsmE. The structures explain how the variation of sequence and structural context of the GGA binding motifs modulates the binding affinity for RsmE by five orders of magnitude (∼10 nM to ∼3 mM, Kd). Furthermore, we see that conformational adaptation of protein side-chains and RNA enables recognition of different RNA sequences by the same protein, contributing to binding affinity without conferring specificity. Overall, our findings illustrate how the variability in the Csr/Rsm protein-RNA recognition allows a fine-tuning of the competition between mRNAs and sRNAs for the CsrA/RsmE protein.
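
For orientation, the quoted affinity range can be converted into a free-energy difference with the standard relation between dissociation constant and binding free energy. The temperature of 298 K is an assumption made here for illustration; the abstract does not state the experimental conditions.

```latex
\[
\frac{K_d^{\text{weak}}}{K_d^{\text{tight}}}
  \approx \frac{3\times10^{-3}\,\mathrm{M}}{1\times10^{-8}\,\mathrm{M}}
  = 3\times10^{5},
\qquad
\log_{10}\!\left(3\times10^{5}\right) \approx 5.5
\]
\[
\Delta\Delta G^{\circ}
  = RT \ln\frac{K_d^{\text{weak}}}{K_d^{\text{tight}}}
  \approx \left(0.593\ \mathrm{kcal\,mol^{-1}}\right)\times 12.6
  \approx 7.5\ \mathrm{kcal\,mol^{-1}}
  \quad (T = 298\ \mathrm{K})
\]
```

Under that assumption, the "five orders of magnitude" correspond to roughly 7.5 kcal/mol (about 31 kJ/mol) between the tightest and weakest GGA sites.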
DMINDA: an integrated web server for DNA motif identification and analyses.

Science.gov (United States)

Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

2014-07-01

DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.
A sialoreceptor binding motif in the Mycoplasma synoviae adhesin VlhA.

Directory of Open Access Journals (Sweden)

Meghan May

Mycoplasma synoviae depends on its adhesin VlhA to mediate cytadherence to sialylated host cell receptors. Allelic variants of VlhA arise through recombination between an assemblage of promoterless vlhA pseudogenes and a single transcription promoter site, creating lineages of M. synoviae that each express a different vlhA allele. The predicted full-length VlhA sequences adjacent to the promoter of nine lineages of M. synoviae varying in avidity of cytadherence were aligned with that of the reference strain MS53 and with a 60-a.a. hemagglutinating VlhA C-terminal fragment from a Tunisian lineage of strain WVU1853(T). Seven different sequence variants of an imperfectly conserved, single-copy, 12-a.a. candidate cytadherence motif were evident amid the flanking variable residues of the 11 total sequences examined. The motif was predicted to adopt a short hairpin structure in a low-complexity region near the C-terminus of VlhA. Biotinylated synthetic oligopeptides representing four selected variants of the 12-a.a. motif, with the whole synthesized 60-a.a. fragment as a positive control, differed (P<0.01) in the extent they bound to chicken erythrocyte membranes. All bound to a greater extent (P<0.01) than scrambled or irrelevant VlhA domain negative control peptides did. Experimentally introduced branched-chain amino acid (BCAA) substitutions Val3Ile and Leu7Ile did not significantly alter binding, whereas fold-destabilizing substitutions Thr4Gly and Ala9Gly tended to reduce it (P<0.05). Binding was also reduced to background levels (P<0.01) when the peptides were exposed to desialylated membranes, or were pre-saturated with free sialic acid before exposure to untreated membranes. From this evidence we conclude that the motif P-X-(BCAA)-X-F-X-(BCAA)-X-A-K-X-G binds sialic acid and likely mediates VlhA-dependent M. synoviae attachment to host cells. This conserved mechanism retains the potential for fine-scale rheostasis in binding avidity, which could be a
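
The 12-residue consensus reported in this abstract can be written as a regular expression. The sketch below assumes that BCAA means valine, isoleucine, or leucine and that X is any residue; the example peptides are hypothetical and are not sequences from the study.

```python
import re

BCAA = "VIL"  # branched-chain amino acids: valine, isoleucine, leucine
# P-X-(BCAA)-X-F-X-(BCAA)-X-A-K-X-G, wrapped in a lookahead so that
# overlapping occurrences are also reported.
MOTIF = re.compile(rf"(?=(P.[{BCAA}].F.[{BCAA}].AK.G))")


def find_motif(protein):
    """Return (0-based start, matched 12-mer) for every occurrence in the sequence."""
    return [(m.start(), m.group(1)) for m in MOTIF.finditer(protein)]


wild_type = "PTVTFKLNAKDG"   # hypothetical 12-mer that fits the consensus
val3ile = "PTITFKLNAKDG"     # BCAA-for-BCAA swap at position 3
ala9gly = "PTVTFKLNGKDG"     # replaces the conserved Ala at position 9
thr4gly = "PTVGFKLNAKDG"     # position 4 is an X position in the consensus

hits = find_motif("MM" + wild_type + "KK")
```

A strict pattern like this only checks sequence. The Thr4Gly variant still matches because position 4 is unconstrained, even though the abstract reports that this substitution tended to reduce binding, and the motif is described as imperfectly conserved, so a tolerant search may be more appropriate in practice.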
Efficient motif finding algorithms for large-alphabet inputs

Directory of Open Access Journals (Sweden)

Pavlovic Vladimir

2010-10-01

Background: We consider the problem of identifying motifs, recurring or conserved patterns, in biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results: The proposed algorithm (1) improves search efficiency compared to existing algorithms, and (2) scales well with the size of the alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA) we observed a reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobulin families. Conclusions: Our algorithm reduces the computational complexity of current motif finding algorithms and demonstrates strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.
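
To make the planted-motif problem concrete, here is a naive exact baseline, not the algorithm proposed in the paper. It reports every string of length l that occurs within Hamming distance d in all input sequences; all function names are illustrative.

```python
from itertools import combinations, product


def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def neighbors(kmer, d, alphabet):
    """All strings within Hamming distance d of kmer."""
    result = {kmer}
    for positions in combinations(range(len(kmer)), d):
        for letters in product(alphabet, repeat=d):
            candidate = list(kmer)
            for pos, letter in zip(positions, letters):
                candidate[pos] = letter
            result.add("".join(candidate))
    return result


def occurs_within(motif, seq, d):
    """True if some window of seq is within distance d of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d for i in range(len(seq) - l + 1))


def planted_motifs(seqs, l, d, alphabet="ACGT"):
    """Every length-l string that has an instance (up to d mismatches) in all sequences."""
    first = seqs[0]
    candidates = set()
    for i in range(len(first) - l + 1):
        candidates |= neighbors(first[i:i + l], d, alphabet)
    return sorted(m for m in candidates if all(occurs_within(m, s, d) for s in seqs))


toy = ["TTACGTTT", "GGACCTGG", "CCAGGTCC"]  # ACGT planted with at most 1 mismatch
found = planted_motifs(toy, l=4, d=1)
```

The neighbourhood of each window grows with the alphabet size raised to the power d, which illustrates why large alphabets (for example, the 20 amino acids) make exact motif search expensive.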
Base pair probability estimates improve the prediction accuracy of RNA non-canonical base pairs.

Science.gov (United States)

Sloma, Michael F; Mathews, David H

2017-11-01

Prediction of RNA tertiary structure from sequence is an important problem, but generating accurate structure models for even short sequences remains difficult. Predictions of RNA tertiary structure tend to be least accurate in loop regions, where non-canonical pairs are important for determining the details of structure. Non-canonical pairs can be predicted using a knowledge-based model of structure that scores nucleotide cyclic motifs, or NCMs. In this work, a partition function algorithm is introduced that allows the estimation of base pairing probabilities for both canonical and non-canonical interactions. Pairs that are predicted to be probable are more likely to be found in the true structure than pairs of lower probability. Pair probability estimates can be further improved by predicting the structure conserved across multiple homologous sequences using the TurboFold algorithm. These pairing probabilities, used in concert with prior knowledge of the canonical secondary structure, allow accurate inference of non-canonical pairs, an important step towards accurate prediction of the full tertiary structure. Software to predict non-canonical base pairs and pairing probabilities is now provided as part of the RNAstructure software package.
RMOD: a tool for regulatory motif detection in signaling network.

Directory of Open Access Journals (Sweden)

Jinki Kim

Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it to a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using signaling networks of various sizes generated from the integration of various human signaling pathways, and its speed and scalability outperformed those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manage the whole process, from building the signaling network and querying regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and the mechanisms underlying their regulatory activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.
Proteomic profiling of human keratinocytes undergoing UVB-induced alternative differentiation reveals TRIpartite Motif Protein 29 as a survival factor.

Directory of Open Access Journals (Sweden)

Véronique Bertrand-Vallery

BACKGROUND: Repeated exposures to UVB of human keratinocytes lacking functional p16(INK-4a) and able to differentiate induce an alternative state of differentiation rather than stress-induced premature senescence. METHODOLOGY/PRINCIPAL FINDINGS: A 2D-DIGE proteomic profiling of this alternative state of differentiation was performed herein at various times after the exposures to UVB. Sixty-nine differentially abundant protein species were identified by mass spectrometry, many of which are involved in keratinocyte differentiation and survival. Among these protein species was TRIpartite Motif Protein 29 (TRIM29). Increased abundance of TRIM29 following UVB exposures was validated by Western blot using a specific antibody and was further analysed by immunochemistry and by RT-PCR. TRIM29 was found to be very abundant in keratinocytes and reconstructed epidermis. Knocking down the expression of TRIM29 by short-hairpin RNA interference decreased the viability of keratinocytes after UVB exposure. The abundance of involucrin mRNA, a marker of late differentiation, increased concomitantly. In TRIM29-knocked down reconstructed epidermis, the presence of pyknotic cells revealed cell injury. Increased abundance of TRIM29 was also observed upon exposure to DNA damaging agents and PKC activation. The UVB-induced increase of TRIM29 abundance was dependent on a PKC signaling pathway, likely PKCdelta. CONCLUSIONS/SIGNIFICANCE: These findings suggest that TRIM29 allows keratinocytes to enter a protective alternative differentiation process rather than die massively after stress.
CasA mediates Cas3-catalyzed target degradation during CRISPR RNA-guided interference.

Science.gov (United States)

Hochstrasser, Megan L; Taylor, David W; Bhat, Prashant; Guegler, Chantal K; Sternberg, Samuel H; Nogales, Eva; Doudna, Jennifer A

2014-05-06

In bacteria, the clustered regularly interspaced short palindromic repeats (CRISPR)-associated (Cas) DNA-targeting complex Cascade (CRISPR-associated complex for antiviral defense) uses CRISPR RNA (crRNA) guides to bind complementary DNA targets at sites adjacent to a trinucleotide signature sequence called the protospacer adjacent motif (PAM). The Cascade complex then recruits Cas3, a nuclease-helicase that catalyzes unwinding and cleavage of foreign double-stranded DNA (dsDNA) bearing a sequence matching that of the crRNA. Cascade comprises the CasA-E proteins and one crRNA, forming a structure that binds and unwinds dsDNA to form an R loop in which the target strand of the DNA base pairs with the 32-nt RNA guide sequence. Single-particle electron microscopy reconstructions of dsDNA-bound Cascade with and without Cas3 reveal that Cascade positions the PAM-proximal end of the DNA duplex at the CasA subunit and near the site of Cas3 association. The finding that the DNA target and Cas3 colocalize with CasA implicates this subunit in a key target-validation step during DNA interference. We show biochemically that base pairing of the PAM region is unnecessary for target binding but critical for Cas3-mediated degradation. In addition, the L1 loop of CasA, previously implicated in PAM recognition, is essential for Cas3 activation following target binding by Cascade. Together, these data show that the CasA subunit of Cascade functions as an essential partner of Cas3 by recognizing DNA target sites and positioning Cas3 adjacent to the PAM to ensure cleavage.
Microbial expression of proteins containing long repetitive Arg-Gly-Asp cell adhesive motifs created by overlap elongation PCR

International Nuclear Information System (INIS)

Kurihara, Hiroyuki; Shinkai, Masashige; Nagamune, Teruyuki

2004-01-01

We developed a novel method for creating repetitive DNA libraries using overlap elongation PCR, and prepared a DNA library encoding repetitive Arg-Gly-Asp (RGD) cell adhesive motifs. We obtained various length DNAs encoding repetitive RGD from a short monomer DNA (18 bp) after a thermal cyclic reaction without a DNA template for amplification, and isolated DNAs encoding 2, 21, and 43 repeats of the RGD motif. We cloned these DNAs into a protein expression vector and overexpressed them as thioredoxin fusion proteins: RGD2, RGD21, and RGD43, respectively. The solubility of RGD43 in water was low and it formed a fibrous precipitate in water. Scanning electron microscopy revealed that RGD43 formed a branched 3D-network structure in the solid state. To evaluate the function of the cell adhesive motifs in RGD43, mouse fibroblast cells were cultivated on the RGD43 scaffold. The fibroblast cells adhered to the RGD43 scaffold and extended long filopodia
[Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

Science.gov (United States)

Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

2009-11-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to perform a genome-wide analysis of CRISPR in extremely halophilic archaea whose whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were highly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
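
Two of the analyses listed in this abstract, GC content and the search for palindromic motifs, are simple enough to sketch. The code below is a minimal illustration and not the authors' pipeline; it detects only perfect reverse-complement palindromes of a fixed length, and the test sequence is made up.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def gc_content(seq):
    """Fraction of G and C bases in the sequence."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def palindromic_sites(seq, length):
    """0-based starts of windows that equal their own reverse complement."""
    return [
        i
        for i in range(len(seq) - length + 1)
        if seq[i:i + length] == reverse_complement(seq[i:i + length])
    ]


flank = "TTGAATTCAA"                 # hypothetical flanking sequence
sites = palindromic_sites(flank, 6)  # GAATTC starts at index 2
gc = gc_content(flank)
```

Repeats that form stem-loops are often imperfect inverted repeats separated by a short loop, so a real analysis would typically allow mismatches and a spacer rather than requiring an exact palindrome.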
DNA motif alignment by evolving a population of Markov chains.

Science.gov (United States)

Bi, Chengpeng

2009-01-30

Deciphering cis-regulatory elements, or de novo motif-finding in genomes, remains elusive although much algorithmic effort has been expended. Markov chain Monte Carlo (MCMC) methods such as Gibbs motif samplers have been widely employed to solve the de novo motif-finding problem through local sequence alignment. Nonetheless, MCMC-based motif samplers still suffer from local maxima, as does expectation-maximization (EM). Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often run independently a multitude of times, but without information exchange between different chains. Hence a new algorithm design enabling such information exchange would be worthwhile. This paper presents a novel motif-finding algorithm that evolves a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of stochastically sampled local alignments. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, dubbed the IMC motif algorithm, is also devised for comparison with its PMC counterpart. Experimental studies demonstrate that performance can be improved if pooled information is used to run a population of motif samplers. The new PMC algorithm improved convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

Nucleocapsid-Independent Specific Viral RNA Packaging via Viral Envelope Protein and Viral RNA Signal

OpenAIRE

Narayanan, Krishna; Chen, Chun-Jen; Maeda, Junko; Makino, Shinji

2003-01-01

For any of the enveloped RNA viruses studied to date, recognition of a specific RNA packaging signal by the virus's nucleocapsid (N) protein is the first step described in the process of viral RNA packaging. In the murine coronavirus a selective interaction between the viral transmembrane envelope protein M and the viral ribonucleoprotein complex, composed of N protein and viral RNA containing a short cis-acting RNA element, the packaging signal, determines the selective RNA packaging into vi...
A Multifunctional Envelope-Type Nano Device Containing a pH-Sensitive Cationic Lipid for Efficient Delivery of Short Interfering RNA to Hepatocytes In Vivo.

Science.gov (United States)

Sato, Yusuke; Harashima, Hideyoshi; Kohara, Michinori

2016-01-01

Various types of nanoparticles have been developed to date with the intent of efficiently delivering short interfering RNA (siRNA) to hepatocytes. To achieve efficient siRNA delivery, various aspects of the delivery processes and physical properties need to be considered. We recently developed an original lipid nanoparticle, a multifunctional envelope-type nano device (MEND) containing YSK05, a pH-sensitive cationic lipid (YSK05-MEND). The YSK05-MEND with siRNA in its formulation showed hepatocyte-specific uptake and robust gene silencing in hepatocytes after intravenous administration. Here, we describe the procedures used for the preparation and characterization of the YSK05-MEND.
DNA motif elucidation using belief propagation.

Science.gov (United States)

Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

2013-09-01

Protein-binding microarray (PBM) is a high-throughput platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 to 10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagation. We describe an HMM-based approach using belief propagation (kmerHMM), which accepts and preprocesses PBM probe raw data into median binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagation. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. In particular, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/~wkc/kmerHMM.
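
The preprocessing step mentioned in this abstract, reducing probe intensities to a median intensity per k-mer and ranking the k-mers, might look roughly like the sketch below. The data layout and function names are assumptions made for illustration, not the kmerHMM implementation, and reverse-complement merging is left out.

```python
from collections import defaultdict
from statistics import median


def kmer_median_intensities(probes, k):
    """Map each k-mer to the median intensity of the probes that contain it.

    probes: iterable of (probe_sequence, signal_intensity) pairs.
    """
    by_kmer = defaultdict(list)
    for seq, intensity in probes:
        # Use a set so a probe contributes once per k-mer even if it repeats.
        for kmer in {seq[i:i + k] for i in range(len(seq) - k + 1)}:
            by_kmer[kmer].append(intensity)
    return {kmer: median(values) for kmer, values in by_kmer.items()}


def top_kmers(medians, n):
    """The n k-mers with the highest median intensity (ties broken alphabetically)."""
    ranked = sorted(medians.items(), key=lambda item: (-item[1], item[0]))
    return ranked[:n]


toy_probes = [("ACGTA", 10.0), ("CGTAC", 4.0), ("TACGT", 7.0)]  # made-up data
medians = kmer_median_intensities(toy_probes, k=4)
best = top_kmers(medians, 2)
```

The median is used rather than the mean so that a single unusually bright or dim probe does not dominate the estimate for a k-mer.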
The identification of functional motifs in temporal gene expression analysis

Directory of Open Access Journals (Sweden)

Michael G. Surette

2005-01-01

The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy that adopts the false discovery rate (FDR) and estimates motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur) binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. SAS code for the simulation and analysis of gene expression data is available from the first author upon request.
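
The abstract does not say which FDR procedure was used. As one common possibility, the Benjamini-Hochberg step-up procedure is sketched here; the p-values are invented and the function name is illustrative.

```python
def benjamini_hochberg(pvalues, alpha=0.05):
    """Indices of hypotheses rejected at FDR level alpha (Benjamini-Hochberg)."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    cutoff = 0
    for rank, idx in enumerate(order, start=1):
        # Keep the largest rank whose p-value is below its stepped threshold.
        if pvalues[idx] <= alpha * rank / m:
            cutoff = rank
    return sorted(order[:cutoff])


# Made-up p-values for five candidate motifs.
motif_pvalues = [0.001, 0.045, 0.025, 0.20, 0.008]
selected = benjamini_hochberg(motif_pvalues, alpha=0.05)
```

With these values the stepped thresholds are 0.01, 0.02, 0.03, 0.04 and 0.05, so the three smallest p-values pass and the candidate with p = 0.045 is dropped, even though it would survive an uncorrected 0.05 cutoff.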
C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families

Directory of Open Access Journals (Sweden)

Cutler Sean R

2007-06-01

Full Text Available Abstract Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*), the ER-retention signal (K/HDEL*), the ER-retrieval signal for membrane bound proteins (KKxx*), the prenylation signal (CC*) and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists
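A z-statistic for C-terminal tripeptide over-representation can be sketched as follows. The background model here (independent residues drawn at proteome-wide amino acid frequencies, with a binomial variance) is an assumption chosen for simplicity; the abstract does not specify the authors' null model, and the masking, randomization, and gene-family clustering steps are omitted.

```python
from collections import Counter
from math import sqrt


def cterm_tripeptide_z(proteins):
    """Return {tripeptide: z-score} for the last three residues of each protein.

    Expected counts assume independent residues at proteome-wide
    frequencies (an illustrative null model, not the published one).
    """
    usable = [p for p in proteins if len(p) >= 3]
    n = len(usable)
    total = sum(len(p) for p in proteins)
    aa_freq = Counter("".join(proteins))
    observed = Counter(p[-3:] for p in usable)

    scores = {}
    for tri, obs in observed.items():
        p_bg = 1.0
        for aa in tri:
            p_bg *= aa_freq[aa] / total
        if p_bg >= 1.0:
            continue  # degenerate single-letter proteome: variance is zero
        expected = n * p_bg
        scores[tri] = (obs - expected) / sqrt(n * p_bg * (1.0 - p_bg))
    return scores


# Toy proteome: three sequences end in SKL, one in AAA.
toy = ["MAAASKL", "MGGGSKL", "MTTTSKL", "MAGTAAA"]
z = cterm_tripeptide_z(toy)
top = max(z, key=z.get)
```

On real proteomes the counts would be taken per protein family rather than per protein, as the abstract describes.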
Identification of novel conserved functional motifs across most Influenza A viral strains

Directory of Open Access Journals (Sweden)

El-Azab Iman

2011-01-01

Full Text Available Abstract Background Influenza A virus poses a continuous threat to global public health. Design of novel universal drugs and vaccine requires a careful analysis of different strains of Influenza A viral genome from diverse hosts and subtypes. We performed a systematic in silico analysis of Influenza A viral segments of all available Influenza A viral strains and subtypes and grouped them based on host, subtype, and years isolated, and through multiple sequence alignments we extrapolated conserved regions, motifs, and accessible regions for functional mapping and annotation. Results Across all species and strains 87 highly conserved regions (conservation percentage ≥ 90%) and 19 functional motifs (conservation percentage = 100%) were found in PB2, PB1, PA, NP, M, and NS segments. The conservation percentage of these segments ranged between 94 - 98% in human strains (the most conserved), 85 - 93% in swine strains (the most variable), and 91 - 94% in avian strains. The most conserved segment was different in each host (PB1 for human strains, NS for avian strains, and M for swine strains). Target accessibility prediction yielded 324 accessible regions, with a single stranded probability > 0.5, of which 78 coincided with conserved regions. Some of the interesting annotations in these regions included sites for protein-protein interactions, the RNA binding groove, and the proton ion channel. Conclusions The influenza virus has evolved to adapt to its host through variations in the GC content and conservation percentage of the conserved regions. Nineteen universal conserved functional motifs were discovered, of which some were accessible regions with interesting biological functions. These regions will serve as a foundation for universal drug targets as well as universal vaccine design.
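One simple way to turn a multiple sequence alignment into conserved regions with a percentage threshold is shown below. It is a sketch under stated assumptions: conservation is taken per column as the share of sequences carrying the most common character, gap characters are counted like any other symbol, and a region is a run of consecutive columns that all meet the threshold. The abstract does not give the authors' exact definition, so the function names and rules here are illustrative.

```python
from collections import Counter


def column_conservation(alignment):
    """Percentage of sequences sharing the most common symbol in each column."""
    n = len(alignment)
    return [
        100.0 * Counter(col).most_common(1)[0][1] / n
        for col in zip(*alignment)
    ]


def conserved_regions(alignment, threshold=90.0, min_len=1):
    """Return half-open (start, end) column ranges where every column
    has conservation >= threshold (threshold must be positive)."""
    regions, start = [], None
    scores = column_conservation(alignment)
    # A trailing 0.0 sentinel closes a run that reaches the last column.
    for i, score in enumerate(scores + [0.0]):
        if score >= threshold and start is None:
            start = i
        elif score < threshold and start is not None:
            if i - start >= min_len:
                regions.append((start, i))
            start = None
    return regions


# Toy alignment of four equal-length sequences.
aln = ["ACGTA", "ACGAA", "ACCTA", "ACGTA"]
regions = conserved_regions(aln, threshold=90.0)
```

Setting `threshold=100.0` would pick out only fully invariant stretches, analogous to the 100% criterion the abstract uses for functional motifs.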
De novo design of RNA-binding proteins with a prion-like domain related to ALS/FTD proteinopathies.

Science.gov (United States)

Mitsuhashi, Kana; Ito, Daisuke; Mashima, Kyoko; Oyama, Munenori; Takahashi, Shinichi; Suzuki, Norihiro

2017-12-04

Aberrant RNA-binding proteins form the core of the neurodegeneration cascade in spectrums of disease, such as amyotrophic lateral sclerosis (ALS)/frontotemporal dementia (FTD). Six ALS-related molecules, TDP-43, FUS, TAF15, EWSR1, heterogeneous nuclear (hn)RNPA1 and hnRNPA2 are RNA-binding proteins containing candidate mutations identified in ALS patients and share several common features, including harboring an aggregation-prone prion-like domain (PrLD) containing a glycine/serine-tyrosine-glycine/serine (G/S-Y-G/S)-motif-enriched low-complexity sequence and rich in glutamine and/or asparagine. Additionally, these six molecules are components of RNA granules involved in RNA quality control and become mislocated from the nucleus to form cytoplasmic inclusion bodies (IBs) in the ALS/FTD-affected brain. To reveal the essential mechanisms involved in ALS/FTD-related cytotoxicity associated with RNA-binding proteins containing PrLDs, we designed artificial RNA-binding proteins harboring G/S-Y-G/S-motif repeats with and without enriched glutamine residues and nuclear-import/export-signal sequences and examined their cytotoxicity in vitro. These proteins recapitulated features of ALS-linked molecules, including insoluble aggregation, formation of cytoplasmic IBs and components of RNA granules, and cytotoxicity instigation. These findings indicated that these artificial RNA-binding proteins mimicked features of ALS-linked molecules and allowed the study of mechanisms associated with gain of toxic functions related to ALS/FTD pathogenesis.
Evasion of short interfering RNA-directed antiviral silencing in Musa acuminata persistently infected with six distinct banana streak pararetroviruses.

Science.gov (United States)

Rajeswaran, Rajendran; Seguin, Jonathan; Chabannes, Matthieu; Duroy, Pierre-Olivier; Laboureau, Nathalie; Farinelli, Laurent; Iskra-Caruana, Marie-Line; Pooggin, Mikhail M

2014-10-01

Vegetatively propagated crop plants often suffer from infections with persistent RNA and DNA viruses. Such viruses appear to evade the plant defenses that normally restrict viral replication and spread. The major antiviral defense mechanism is based on RNA silencing generating viral short interfering RNAs (siRNAs) that can potentially repress viral genes posttranscriptionally through RNA cleavage and transcriptionally through DNA cytosine methylation. Here we examined the RNA silencing machinery of banana plants persistently infected with six pararetroviruses after many years of vegetative propagation. Using deep sequencing, we reconstructed consensus master genomes of the viruses and characterized virus-derived and endogenous small RNAs. Consistent with the presence of endogenous siRNAs that can potentially establish and maintain DNA methylation, the banana genomic DNA was extensively methylated in both healthy and virus-infected plants. A novel class of abundant 20-nucleotide (nt) endogenous small RNAs with 5'-terminal guanosine was identified. In all virus-infected plants, 21- to 24-nt viral siRNAs accumulated at relatively high levels (up to 22% of the total small RNA population) and covered the entire circular viral DNA genomes in both orientations. The hotspots of 21-nt and 22-nt siRNAs occurred within open reading frame (ORF) I and II and the 5' portion of ORF III, while 24-nt siRNAs were more evenly distributed along the viral genome. Despite the presence of abundant viral siRNAs of different size classes, the viral DNA was largely free of cytosine methylation. Thus, the virus is able to evade siRNA-directed DNA methylation and thereby avoid transcriptional silencing. This evasion of silencing likely contributes to the persistence of pararetroviruses in banana plants. We report that DNA pararetroviruses in Musa acuminata banana plants are able to evade DNA cytosine methylation and transcriptional gene silencing, despite being targeted by the host silencing
Tiny giants of gene regulation: experimental strategies for microRNA functional studies

Science.gov (United States)

Steinkraus, Bruno R.; Toegel, Markus

2016-01-01

The discovery over two decades ago of short regulatory microRNAs (miRNAs) has led to the inception of a vast biomedical research field dedicated to understanding these powerful orchestrators of gene expression. Here we aim to provide a comprehensive overview of the methods and techniques underpinning the experimental pipeline employed for exploratory miRNA studies in animals. Some of the greatest challenges in this field have been uncovering the identity of miRNA–target interactions and deciphering their significance with regard to particular physiological or pathological processes. These endeavors relied almost exclusively on the development of powerful research tools encompassing novel bioinformatics pipelines, high‐throughput target identification platforms, and functional target validation methodologies. Thus, in an unparalleled manner, the biomedical technology revolution unceasingly enhanced and refined our ability to dissect miRNA regulatory networks and understand their roles in vivo in the context of cells and organisms. Recurring motifs of target recognition have led to the creation of a large number of multifactorial bioinformatics analysis platforms, which have proved instrumental in guiding experimental miRNA studies. Subsequently, the need for discovery of miRNA–target binding events in vivo drove the emergence of a slew of high‐throughput multiplex strategies, which now provide a viable prospect for elucidating genome‐wide miRNA–target binding maps in a variety of cell types and tissues. Finally, deciphering the functional relevance of miRNA post‐transcriptional gene silencing under physiological conditions, prompted the evolution of a host of technologies enabling systemic manipulation of miRNA homeostasis as well as high‐precision interference with their direct, endogenous targets. WIREs Dev Biol 2016, 5:311–362. doi: 10.1002/wdev.223 For further resources related to this article, please visit the WIREs website. PMID:26950183
An approach to the construction of tailor-made amphiphilic peptides that strongly and selectively bind to hairpin RNA targets.

Science.gov (United States)

Lee, Su Jin; Hyun, Soonsil; Kieft, Jeffrey S; Yu, Jaehoon

2009-02-18

The hairpin RNA motif is one of the most frequently observed secondary structures and is often targeted by therapeutic agents. An amphiphilic peptide with seven lysine and eight leucine residues and its derivatives were designed for use as ligands against RNA hairpin motifs. We hypothesized that variations in both the hydrophobic leucine-rich and hydrophilic lysine-rich spheres of these amphiphilic peptides would create extra attractive interactions with hairpin RNA targets. A series of alanine-scanned peptides were probed to identify the most influential lysine residues in the hydrophilic sphere. The binding affinities of these modified peptides with several hairpins, such as RRE, TAR from HIV, a short hairpin from IRES of HCV, and a hairpin from the 16S A-site stem from rRNA, were determined. Since the hairpin from IRES of HCV was the most susceptible to the initial series of alanine-scanned peptides, studies investigating how further variations in the peptides affect binding employed the IRES hairpin. Next, the important Lys residues were substituted by shorter chain amines, such as ornithine, to place the peptide deeper into the hairpin groove. In a few cases, a 70-fold improved binding was observed for peptides that contained the specifically located shorter amine side chains. To further explore changes in binding affinities brought about by alterations in the hydrophobic sphere, tryptophan residues were introduced in place of leucine. A few peptides with tryptophan in specific positions also displayed 70-fold improved binding affinities. Finally, double mutant peptides incorporating both specifically located shorter amine side chains in the hydrophilic region and tryptophan residues in the hydrophobic region were synthesized. The binding affinities of peptides containing the simple double modification were observed to be 80 times lower, and their binding specificities were increased 40-fold. The results of this effort provide important information about
Biochemical characterization of a recombinant Japanese encephalitis virus RNA-dependent RNA polymerase

Directory of Open Access Journals (Sweden)

Kim Chan-Mi

2007-07-01

Full Text Available Abstract Background Japanese encephalitis virus (JEV) NS5 is a viral nonstructural protein that carries both methyltransferase and RNA-dependent RNA polymerase (RdRp) domains. It is a key component of the viral RNA replicase complex that presumably includes other viral nonstructural and cellular proteins. The biochemical properties of JEV NS5 have not been characterized due to the lack of a robust in vitro RdRp assay system, and the molecular mechanisms for the initiation of RNA synthesis by JEV NS5 remain to be elucidated. Results To characterize the biochemical properties of JEV RdRp, we expressed in Escherichia coli and purified an enzymatically active full-length recombinant JEV NS5 protein with a hexahistidine tag at the N-terminus. The purified NS5 protein, but not the mutant NS5 protein with an Ala substitution at the first Asp of the RdRp-conserved GDD motif, exhibited template- and primer-dependent RNA synthesis activity using a poly(A) RNA template. The NS5 protein was able to use both plus- and minus-strand 3'-untranslated regions of the JEV genome as templates in the absence of a primer, with the latter RNA being a better template. Analysis of the RNA synthesis initiation site using the 3'-end 83 nucleotides of the JEV genome as a minimal RNA template revealed that the NS5 protein specifically initiates RNA synthesis from an internal site, U81, at the two nucleotides upstream of the 3'-end of the template. Conclusion As a first step toward the understanding of the molecular mechanisms for JEV RNA replication and ultimately for the in vitro reconstitution of viral RNA replicase complex, we for the first time established an in vitro JEV RdRp assay system with a functional full-length recombinant JEV NS5 protein and characterized the mechanisms of RNA synthesis from nonviral and viral RNA templates. The full-length recombinant JEV NS5 will be useful for the elucidation of the structure-function relationship of this enzyme and for the
Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

Science.gov (United States)

Gade, Chandrasekhar Reddy; Sharma, Nagendra K

2017-12-15

This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNA i-motif structure with DNA cytosine pentamer (dC5) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents under in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.
Verification of the MOTIF code version 3.0

International Nuclear Information System (INIS)

Chan, T.; Guvanasen, V.; Nakka, B.W.; Reid, J.A.K.; Scheier, N.W.; Stanchell, F.W.

1996-12-01

As part of the Canadian Nuclear Fuel Waste Management Program (CNFWMP), AECL has developed a three-dimensional finite-element code, MOTIF (Model Of Transport In Fractured/porous media), for detailed modelling of groundwater flow, heat transport and solute transport in a fractured rock mass. The code solves the transient and steady-state equations of groundwater flow, solute (including one-species radionuclide) transport, and heat transport in variably saturated fractured/porous media. The initial development was completed in 1985 (Guvanasen 1985) and version 3.0 was completed in 1986. This version is documented in detail in Guvanasen and Chan (in preparation). This report describes a series of fourteen verification cases which has been used to test the numerical solution techniques and coding of MOTIF, as well as demonstrate some of the MOTIF analysis capabilities. For each case the MOTIF solution has been compared with a corresponding analytical or independently developed alternate numerical solution. Several of the verification cases were included in Level 1 of the International Hydrologic Code Intercomparison Project (HYDROCOIN). The MOTIF results for these cases were also described in the HYDROCOIN Secretariat's compilation and comparison of results submitted by the various project teams (Swedish Nuclear Power Inspectorate 1988). It is evident from the graphical comparisons presented that the MOTIF solutions for the fourteen verification cases are generally in excellent agreement with known analytical or numerical solutions obtained from independent sources. This series of verification studies has established the ability of the MOTIF finite-element code to accurately model the groundwater flow and solute and heat transport phenomena for which it is intended. (author). 20 refs., 14 tabs., 32 figs
VDR regulation of microRNA differs across prostate cell models suggesting extremely flexible control of transcription.

Science.gov (United States)

Singh, Prashant K; Long, Mark D; Battaglia, Sebastiano; Hu, Qiang; Liu, Song; Sucheston-Campbell, Lara E; Campbell, Moray J

2015-01-01

The Vitamin D Receptor (VDR) is a member of the nuclear receptor superfamily and is of therapeutic interest in cancer and other settings. Regulation of microRNA (miRNA) by the VDR appears to be important to mediate its actions, for example, to control cell growth. To identify if and to what extent VDR-regulated miRNA patterns change in prostate cancer progression, we undertook miRNA microarray analyses in 7 cell models representing non-malignant and malignant prostate cells (RWPE-1, RWPE-2, HPr1, HPr1AR, LNCaP, LNCaP-C4-2, and PC-3). To focus on primary VDR regulatory events, we undertook expression analyses after 30 minutes treatment with 1α,25(OH)2D3. Across all models, 111 miRNAs were significantly modulated by 1α,25(OH)2D3 treatment. Of these, only 5 miRNAs were modulated in more than one cell model, and of these, only 3 miRNAs were modulated in the same direction. The patterns of miRNA regulation, and the networks they targeted, significantly distinguished the different cell types. Integration of 1α,25(OH)2D3-regulated miRNAs with published VDR ChIP-seq data showed significant enrichment of VDR peaks in flanking regions of miRNAs. Furthermore, mRNA and miRNA expression analyses in non-malignant RWPE-1 cells revealed patterns of miRNA and mRNA co-regulation; specifically, 13 significant reciprocal patterns were identified and these patterns were also observed in TCGA prostate cancer data. Lastly, motif search analysis revealed differential motif enrichment within VDR peaks flanking mRNA compared to miRNA genes. Together, this study revealed that miRNAs are rapidly regulated in a highly cell-type specific manner, and are significantly co-integrated with mRNA regulation.
Short-term calorie restriction feminizes the mRNA profiles of drug metabolizing enzymes and transporters in livers of mice.

Science.gov (United States)

Fu, Zidong Donna; Klaassen, Curtis D

2014-01-01

Calorie restriction (CR) is one of the most effective anti-aging interventions in mammals. A modern theory suggests that aging results from a decline in detoxification capabilities and thus accumulation of damaged macromolecules. The present study aimed to determine how short-term CR alters mRNA profiles of genes that encode metabolism and detoxification machinery in the liver. Male C57BL/6 mice were fed CR (0, 15, 30, or 40%) diets for one month, followed by mRNA quantification of 98 xenobiotic processing genes (XPGs) in the liver, including 7 uptake transporters, 39 phase-I enzymes, 37 phase-II enzymes, 10 efflux transporters, and 5 transcription factors. In general, 15% CR did not alter mRNAs of most XPGs, whereas 30 and 40% CR altered over half of the XPGs (32 increased and 29 decreased). CR up-regulated some phase-I enzymes (fold increase), such as Cyp4a14 (12), Por (2.3), Nqo1 (1.4), Fmo2 (5.4), and Fmo3 (346), and numerous phase-II enzymes, such as Sult1a1 (1.2), Sult1d1 (2.0), Sult1e1 (33), Sult3a1 (2.2), Gsta4 (1.3), Gstm2 (1.3), Gstm3 (1.7), and Mgst3 (2.2). CR feminized the mRNA profiles of 32 XPGs in livers of male mice. For instance, CR decreased the male-predominantly expressed Oatp1a1 (97%) and increased the female-predominantly expressed Oatp1a4 (11). In conclusion, short-term CR alters the mRNA levels of over half of the 98 XPGs quantified in livers of male mice, and over half of these alterations appear to be due to feminization of the liver. Copyright © 2013 Elsevier Inc. All rights reserved.
Short hairpin RNA targeting 2B gene of coxsackievirus B3 exhibits potential antiviral effects both in vitro and in vivo

Directory of Open Access Journals (Sweden)

Yao Hailan

2012-08-01

Full Text Available Abstract Background Coxsackievirus B3 is an important infectious agent of viral myocarditis, pancreatitis and aseptic meningitis, but there are no specific antiviral therapeutic reagents in clinical use. RNA interference-based technology has been developed to prevent the viral infection. Methods To evaluate the impact of RNA interference on viral replication, cytopathogenicity and animal survival, short hairpin RNAs targeting the viral 2B region (shRNA-2B) expressed by a recombinant vector (pGCL-2B) or a recombinant lentivirus (Lenti-2B) were transfected in HeLa cells or transduced in mice infected with CVB3. Results ShRNA-2B exhibited a significant effect on inhibition of viral production in HeLa cells. Furthermore, shRNA-2B improved mouse survival rate, reduced the viral tissue titers and attenuated tissue damage compared with those of the shRNA-NC treated control group. Lenti-2B was more effective than pGCL-2B at inhibiting viral replication in vivo. Conclusions Coxsackievirus B3 2B is an effective target of gene silencing against coxsackievirus B3 infection, suggesting that shRNA-2B is a potential agent for further development into a treatment for enteroviral diseases.
Purification and functional motifs of the recombinant ATPase of orf virus.

Science.gov (United States)

Lin, Fong-Yuan; Chan, Kun-Wei; Wang, Chi-Young; Wong, Min-Liang; Hsu, Wei-Li

2011-10-01

Our previous study showed that the recombinant ATPase encoded by the A32L gene of orf virus displayed ATP hydrolysis activity as predicted from its amino acid sequence. This viral ATPase contains four known functional motifs (motifs I-IV) and a novel AYDG motif; they are essential for ATP hydrolysis reaction by binding ATP and magnesium ions. The motifs I and II correspond with the Walker A and B motifs of the typical ATPase, respectively. To examine the biochemical roles of these five conserved motifs, recombinant ATPases of five deletion mutants derived from the Taiping strain were expressed and purified. Their ATPase functions were assayed and compared with those of two wild type strains, Taiping and Nantou, isolated in Taiwan. Our results showed that deletions at motifs I-III or IV exhibited lower activity than that of the wild type. Interestingly, deletion of the AYDG motif decreased the ATPase activity more significantly than deletions of motifs I-IV. Divalent ions such as magnesium and calcium were essential for ATPase activity. Moreover, our recombinant proteins of orf virus also demonstrated GTPase activity, though weaker than the original ATPase activity. Copyright © 2011 Elsevier Inc. All rights reserved.
Chemical correction of pre-mRNA splicing defects associated with sequestration of muscleblind-like 1 protein by expanded r(CAG) transcripts

Science.gov (United States)

Kumar, Amit; Parkesh, Raman; Sznajder, Lukasz J.; Childs-Disney, Jessica; Sobczak, Krzysztof; Disney, Matthew D.

2012-01-01

Recently, it was reported that expanded r(CAG) triplet repeats (r(CAG)exp) associated with untreatable neurological diseases cause pre-mRNA mis-splicing likely due to sequestration of muscleblind-like 1 (MBNL1) splicing factor. Bioactive small molecules that bind the 5’CAG/3’GAC motif found in r(CAG)exp hairpin structure were identified by using RNA binding studies and virtual screening/chemical similarity searching. Specifically, a benzylguanidine-containing small molecule was found to improve pre-mRNA alternative splicing of MBNL1-sensitive exons in cells expressing the toxic r(CAG)exp. The compound was identified by first studying the binding of RNA 1×1 nucleotide internal loops to small molecules known to have affinity for nucleic acids. Those studies identified 4',6-diamidino-2-phenylindole (DAPI) as a specific binder to RNAs with the 5’CAG/3’GAC motif. DAPI was then used as a query molecule in a shape- and chemistry alignment-based virtual screen to identify compounds with improved properties, which identified 4-guanidinophenyl 4-guanidinobenzoate as a small molecule capable of improving pre-mRNA splicing defects associated with the r(CAG)exp-MBNL1 complex. This compound may facilitate the development of therapeutics to treat diseases caused by r(CAG)exp and could serve as a useful chemical tool to dissect the mechanisms of r(CAG)exp toxicity. The approach used in these studies, defining the small RNA motifs that bind known nucleic acid binders and then using virtual screening to optimize them for bioactivity, may be generally applicable for designing small molecules that target other RNAs in human genomic sequence. PMID:22252896

A boy with 46,X,+mar presenting gynecomastia and short stature

Directory of Open Access Journals (Sweden)

Ki Eun Kim

2017-12-01

Full Text Available A 15-year-old boy was referred due to gynecomastia and short stature. He was overweight and showed the knuckle-dimple sign on the left hand, a short fourth toe on the left foot, and male external genitalia with a small phallus. His levels of estradiol and follicle-stimulating hormone were increased, and his testosterone concentration was normal. Other hormonal tests were within the normal range. Radiographs showed short fourth and fifth metacarpals and fourth metatarsal bones. The karyotype was reported as 46,X,+mar, and the marker chromosome was shown to originate from the Y chromosome, which was identified by fluorescence in situ hybridization. Polymerase chain reaction and direct sequencing were used to clarify the deleted loci of the Y chromosome by making use of Y-specific sequence-tagged sites (STSs). The sex-determining region Y and centromere were verified, and there were microdeletions on the long arm of the Y chromosome. The azoospermia factor (AZF) b region was partially deleted, and AZFa and AZFc were completely deleted. Two STS probes of sY143 and the Y chromosome RNA recognition motif in AZFb showed positive signals corresponding to Yq11.223. The karyotype of the patient was interpreted as 46,X,der(Y)del(Y)(q11.21q11.222)del(Y)(q11.23qter). Herein, we report a rare case of a boy presenting with gynecomastia and short stature with 46,X,+mar, which originated from the Y chromosome, which was identified to have Yq microdeletions.
The long non-coding RNA GAS5 cooperates with the eukaryotic translation initiation factor 4E to regulate c-Myc translation.

Directory of Open Access Journals (Sweden)

Guangzhen Hu

Full Text Available Long noncoding RNAs (lncRNAs) are important regulators of transcription; however, their involvement in protein translation is not well known. Here we explored whether the lncRNA GAS5 is associated with translation initiation machinery and regulates translation. GAS5 was enriched with eukaryotic translation initiation factor-4E (eIF4E) in an RNA-immunoprecipitation assay using lymphoma cell lines. We identified two RNA binding motifs within eIF4E protein and the deletion of each motif inhibited the binding of GAS5 with eIF4E. To confirm the role of GAS5 in translation regulation, GAS5 siRNA and in vitro transcribed GAS5 RNA were used to knock down or overexpress GAS5, respectively. GAS5 siRNA had no effect on global protein translation but did specifically increase c-Myc protein level without an effect on c-Myc mRNA. The mechanism of this increase in c-Myc protein was enhanced association of c-Myc mRNA with the polysome without any effect on protein stability. In contrast, overexpression of in vitro transcribed GAS5 RNA suppressed c-Myc protein without affecting c-Myc mRNA. Interestingly, GAS5 was found to be bound with c-Myc mRNA, suggesting that GAS5 regulates c-Myc translation through lncRNA-mRNA interaction. Our findings have uncovered a role of GAS5 lncRNA in translation regulation through its interactions with eIF4E and c-Myc mRNA.
Extensive Mutagenesis of the Conserved Box E Motif in Duck Hepatitis B Virus P Protein Reveals Multiple Functions in Replication and a Common Structure with the Primer Grip in HIV-1 Reverse Transcriptase

OpenAIRE

Wang, Yong-Xiang; Luo, Cheng; Zhao, Dan; Beck, Jürgen; Nassal, Michael

2012-01-01

Hepadnaviruses, including the pathogenic hepatitis B virus (HBV), replicate their small DNA genomes through protein-primed reverse transcription, mediated by the terminal protein (TP) domain in their P proteins and an RNA stem-loop, ϵ, on the pregenomic RNA (pgRNA). No direct structural data are available for P proteins, but their reverse transcriptase (RT) domains contain motifs that are conserved in all RTs (box A to box G), implying a similar architecture; however, experimental support for...
An experimental test of a fundamental food web motif.

Science.gov (United States)

Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

2010-06-07

Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure, the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths is one such motif. This motif occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and, through clear experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.
Identification of constrained peptides that bind to and preferentially inhibit the activity of the hepatitis C viral RNA-dependent RNA polymerase

International Nuclear Information System (INIS)

Amin, Anthony; Zaccardi, Joe; Mullen, Stanley; Olland, Stephane; Orlowski, Mark; Feld, Boris; Labonte, Patrick; Mak, Paul

2003-01-01

A class of disulfide constrained peptides containing a core motif FPWG was identified from a screen of phage displayed library using the HCV RNA-dependent RNA polymerase (NS5B) as a bait. Surface plasmon resonance studies showed that three highly purified synthetic constrained peptides bound to immobilized NS5B with estimated Kd values ranging from 30 to 60 μM. In addition, these peptides inhibited the NS5B activity in vitro with IC50 ranging from 6 to 48 μM, whereas in contrast they had no inhibitory effect on the enzymatic activities of calf thymus polymerase α, human polymerase β, RSV polymerase, and HIV reverse transcriptase in vitro. Two peptides demonstrated conformation-dependent inhibition since their synthetic linear versions were not inhibitory in the NS5B assay. A constrained peptide with the minimum core motif FPWG retained selective inhibition of NS5B activity with an IC50 of 50 μM. Alanine scan analyses of a representative constrained peptide, FPWGNTW, indicated that residues F1 and W7 were critical for the inhibitory effect of this peptide, although residues P2 and N5 had some measurable inhibitory effect as well. Further analyses of the mechanism of inhibition indicated that these peptides inhibited the formation of preelongation complexes required for the elongation reaction. However, once the preelongation complex was formed, its activity was refractory to peptide inhibition. Furthermore, the constrained peptide FPWGNTW inhibited de novo initiated RNA synthesis by NS5B from a poly(rC) template. These data indicate that the peptides confer selective inhibition of NS5B activity by binding to the enzyme and perturbing an early step preceding the processive elongation step of RNA synthesis
Highly scalable Ab initio genomic motif identification

KAUST Repository

Marchand, Benoit; Bajic, Vladimir B.; Kaushik, Dinesh

2011-01-01

We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motif-finding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.
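As a rough illustration of what the quoted efficiency figure implies, the sketch below converts a relative parallel efficiency into a relative speedup. The function name and the definition used (efficiency = speedup divided by the ratio of core counts) are assumptions made for illustration; the abstract does not state how the authors computed their figure.

```python
def relative_speedup(efficiency: float, base_cores: int, cores: int) -> float:
    """Speedup over the baseline run implied by a relative parallel efficiency.

    Assumes efficiency = speedup / (cores / base_cores), a common definition
    that the abstract itself does not spell out.
    """
    if base_cores <= 0 or cores <= 0:
        raise ValueError("core counts must be positive")
    return efficiency * (cores / base_cores)


# 94% efficiency on 65,536 cores relative to 256 cores
print(relative_speedup(0.94, 256, 65536))
```

Under that assumption the 65,536-core run is roughly 240 times faster than the 256-core run, out of an ideal 256. The separate figure of 250,000 in the abstract is measured against the original serial code and includes serial optimizations, so it is not directly comparable.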
Target motifs affecting natural immunity by a constitutive CRISPR-Cas system in Escherichia coli.

Directory of Open Access Journals (Sweden)

Cristóbal Almendros

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) and CRISPR-associated (cas) genes make up the CRISPR-Cas systems of various bacteria and archaea and bring about degradation of invading nucleic acids containing sequences (protospacers) that are complementary to repeat-intervening spacers. It has been demonstrated that the base sequence identity of a protospacer with the cognate spacer and the presence of a protospacer adjacent motif (PAM) influence CRISPR-mediated interference efficiency. By using an original transformation assay with plasmids targeted by a resident spacer, here we show that natural CRISPR-mediated immunity against invading DNA occurs in wild type Escherichia coli. Unexpectedly, the strongest activity is observed with protospacer-adjoining nucleotides (interference motifs) that differ from the PAM both in sequence and location. Hence, our results document for the first time native CRISPR activity in E. coli and demonstrate that positions next to the PAM in invading DNA influence their recognition and degradation by these prokaryotic immune systems.
Mechanisms of zero-lag synchronization in cortical motifs.

Directory of Open Access Journals (Sweden)

Leonardo L Gollo

2014-04-01

Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: Of these, the phenomenon of "dynamical relaying"--a mechanism that relies on a specific network motif--has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and a comparison to dynamics on other motifs are lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair--a "resonance pair"--plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying) from those that do not (such as the common driving triad). Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain.
Cooperation of an RNA Packaging Signal and a Viral Envelope Protein in Coronavirus RNA Packaging

OpenAIRE

Narayanan, Krishna; Makino, Shinji

2001-01-01

Murine coronavirus mouse hepatitis virus (MHV) produces a genome-length mRNA, mRNA 1, and six or seven species of subgenomic mRNAs in infected cells. Among these mRNAs, only mRNA 1 is efficiently packaged into MHV particles. MHV N protein binds to all MHV mRNAs, whereas envelope M protein interacts only with mRNA 1. This M protein-mRNA 1 interaction most probably determines the selective packaging of mRNA 1 into MHV particles. A short cis-acting MHV RNA packaging signal is necessary and suffi...
Transduction motif analysis of gastric cancer based on a human signaling network

Energy Technology Data Exchange (ETDEWEB)

Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)]

2014-04-04

To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed with Cytoscape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used to mine gastric cancer-related three-vertex motifs in the network. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was found to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.
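To make the notion of a three-vertex motif concrete, here is a minimal brute-force sketch that counts connected three-node subgraphs in a directed network and groups them by isomorphism class. It is not FANMOD's algorithm and does not compute SMD scores; the function names and the tiny example network are hypothetical, and the cubic loop is only practical for small graphs.

```python
from collections import Counter
from itertools import combinations, permutations


def canonical_form(trio, edges):
    """Smallest adjacency code over all orderings of the three nodes."""
    return min(
        tuple(int((u, v) in edges) for u in perm for v in perm if u != v)
        for perm in permutations(trio)
    )


def count_three_node_motifs(edge_list):
    """Count weakly connected 3-node subgraphs by isomorphism class."""
    edges = {(u, v) for u, v in edge_list if u != v}  # ignore self-loops
    nodes = sorted({n for edge in edges for n in edge})
    linked = {frozenset(edge) for edge in edges}  # undirected view
    counts = Counter()
    for trio in combinations(nodes, 3):
        pairs = sum(frozenset(p) in linked for p in combinations(trio, 2))
        if pairs >= 2:  # three nodes are connected iff two pairs are linked
            counts[canonical_form(trio, edges)] += 1
    return counts


# Hypothetical toy network: one feed-forward loop and one cascade.
toy = [(1, 2), (2, 3), (1, 3), (4, 5), (5, 6)]
for code, n in count_three_node_motifs(toy).items():
    print(code, n)
```

In a real analysis the observed counts would then be compared against randomized networks to decide which motif classes are over-represented, which is the part tools such as FANMOD handle.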
RNA-binding properties and mapping of the RNA-binding domain from the movement protein of Prunus necrotic ringspot virus.

Science.gov (United States)

Herranz, M Carmen; Pallás, Vicente

2004-03-01

The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is involved in intercellular virus transport. In this study, putative RNA-binding properties of the PNRSV MP were studied. The PNRSV MP was produced in Escherichia coli using an expression vector. Electrophoretic mobility shift assays (EMSAs) using DIG-labelled riboprobes demonstrated that PNRSV MP bound ssRNA cooperatively without sequence specificity. Two different ribonucleoprotein complexes were found to be formed depending on the molar MP : PNRSV RNA ratio. The different responses of the complexes to urea treatment strongly suggested that they have different structural properties. Deletion mutagenesis followed by Northwestern analysis allowed location of a nucleic acid binding domain to aa 56-88. This 33 aa RNA-binding motif is the smallest region delineated among members of the family Bromoviridae for which RNA-binding properties have been demonstrated. This domain is highly conserved within all phylogenetic subgroups previously described for PNRSV isolates. Interestingly, the RNA-binding domain described here and the one described for Alfamovirus are located at the N terminus of their corresponding MPs, whereas similar domains previously characterized in members of the genera Bromovirus and Cucumovirus are present at the C terminus, strongly reflecting their corresponding phylogenetic relationships. The evolutionary implications of this observation are discussed.
Armadillo motifs involved in vesicular transport.

Directory of Open Access Journals (Sweden)

Harald Striegl

Armadillo (ARM) repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.
Genetic determinants of PAM-dependent DNA targeting and pre-crRNA processing in Sulfolobus islandicus

DEFF Research Database (Denmark)

Peng, Wenfang; Li, Huan; Hallstrøm, Søren

2013-01-01

-adjacent motif (PAM)-dependent DNA targeting activity and mature CRISPR RNA (crRNA) production in this organism, mutants deleting individual genes of the type IA system or removing each of other Cas modules were constructed. Characterization of these mutants revealed that Cas7, Cas5, Cas6, Cas3' and Cas3......" are essential for PAM-dependent DNA targeting activity, whereas Csa5, along with all other Cas modules, is dispensable for the targeting in the crenarchaeon. Cas6 is implicated as the only enzyme for pre-crRNA processing and the crRNA maturation is independent of the DNA targeting activity. Importantly, we show...
Characterizing Motif Dynamics of Electric Brain Activity Using Symbolic Analysis

Directory of Open Access Journals (Sweden)

Massimiliano Zanin

2014-10-01

Motifs are small recurring circuits of interactions which constitute the backbone of networked systems. Characterizing motif dynamics is therefore key to understanding the functioning of such systems. Here we propose a method to define and quantify the temporal variability and time scales of electroencephalogram (EEG) motifs of resting brain activity. Given a triplet of EEG sensors, links between them are calculated by means of linear correlation; each pattern of links (i.e., each motif) is then associated with a symbol, and its appearance frequency is analyzed by means of Shannon entropy. Our results show that each motif becomes observable with different coupling thresholds and evolves at its own time scale, with fronto-temporal sensors emerging at high thresholds and changing at fast time scales, and parietal ones at low thresholds and changing at slower rates. Finally, while motif dynamics differed across individuals, for each subject, it showed robustness across experimental conditions, indicating that it could represent an individual dynamical signature.
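The pipeline described above (correlation links within a sensor triplet, one symbol per link pattern, Shannon entropy of the symbol sequence) can be sketched as follows. This is a simplified illustration, not the authors' code: thresholding the absolute Pearson correlation, the non-overlapping windows, and all function names are assumptions.

```python
import math
from collections import Counter


def pearson(x, y):
    """Pearson correlation of two equal-length sequences (0.0 if constant)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        return 0.0
    return sxy / math.sqrt(sxx * syy)


def motif_symbol(a, b, c, threshold):
    """Encode which of the three sensor pairs are linked as an integer 0..7."""
    links = (
        abs(pearson(a, b)) >= threshold,
        abs(pearson(a, c)) >= threshold,
        abs(pearson(b, c)) >= threshold,
    )
    return sum(int(bit) << i for i, bit in enumerate(links))


def motif_sequence(a, b, c, window, threshold):
    """One symbol per non-overlapping window of the three signals."""
    return [
        motif_symbol(a[i:i + window], b[i:i + window], c[i:i + window], threshold)
        for i in range(0, len(a) - window + 1, window)
    ]


def shannon_entropy(symbols):
    """Shannon entropy (bits) of the empirical symbol distribution."""
    n = len(symbols)
    return -sum((k / n) * math.log2(k / n) for k in Counter(symbols).values())
```

A sequence that always shows the same motif has zero entropy, while one that visits all eight link patterns equally often reaches the maximum of 3 bits; sweeping `threshold` is one way to see at which coupling strength each motif becomes observable.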
Discriminative motif discovery via simulated evolution and random under-sampling.

Science.gov (United States)

Song, Tao; Gu, Hong

2014-01-01

Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main factors affecting the performance of discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Model (HMM) training, a random under-sampling method is introduced to handle the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than those found by methods that do not consider the data imbalance problem, and that they recover most of the known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.
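For readers unfamiliar with random under-sampling, the idea is simply to discard randomly chosen examples from the larger class until both classes have the same size. The sketch below is a generic illustration with hypothetical names and toy sequences; the abstract does not describe the authors' exact sampling scheme.

```python
import random


def random_undersample(positives, negatives, seed=None):
    """Balance two classes by randomly discarding items from the larger one."""
    rng = random.Random(seed)
    positives, negatives = list(positives), list(negatives)
    size = min(len(positives), len(negatives))
    if len(positives) > size:
        positives = rng.sample(positives, size)
    if len(negatives) > size:
        negatives = rng.sample(negatives, size)
    return positives, negatives


# Hypothetical toy data: 3 positive sequences against 10 negatives.
pos = ["MKTLL", "MRSLV", "MKALL"]
neg = [f"SEQ{i}" for i in range(10)]
balanced_pos, balanced_neg = random_undersample(pos, neg, seed=0)
print(len(balanced_pos), len(balanced_neg))
```

Fixing `seed` makes the draw reproducible; repeating the draw with different seeds and training one model per draw is a common way to use more of the discarded majority-class data.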
Improved i-motif thermal stability by insertion of anthraquinone monomers

DEFF Research Database (Denmark)

Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

2017-01-01

In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...
Structural and biochemical studies on ATP binding and hydrolysis by the Escherichia coli RNA chaperone Hfq.

Directory of Open Access Journals (Sweden)

Hermann Hämmerle

In Escherichia coli the RNA chaperone Hfq is involved in riboregulation by assisting base-pairing between small regulatory RNAs (sRNAs) and mRNA targets. Several structural and biochemical studies revealed RNA binding sites on either surface of the donut-shaped Hfq hexamer. Whereas sRNAs are believed to contact preferentially the YKH motifs present on the proximal site, poly(A)15 and ADP were shown to bind to tripartite binding motifs (ARE) circularly positioned on the distal site. Hfq has been reported to bind and to hydrolyze ATP. Here, we present the crystal structure of a C-terminally truncated variant of E. coli Hfq (Hfq65) in complex with ATP, showing that it binds to the distal R-sites. In addition, we revisited the reported ATPase activity of full-length Hfq purified to homogeneity. At variance with previous reports, no ATPase activity was observed for Hfq. In addition, FRET assays neither indicated an impact of ATP on annealing of two model oligoribonucleotides nor did the presence of ATP induce strand displacement. Moreover, ATP did not lead to destabilization of binary and ternary Hfq-RNA complexes, unless a vast stoichiometric excess of ATP was used. Taken together, these studies strongly suggest that ATP is dispensable for and does not interfere with Hfq-mediated RNA transactions.
Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias

DEFF Research Database (Denmark)

Kjær, Jonas; Belsham, Graham J.

2018-01-01

Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long) which induces a non-proteolytic, co-translational, "cleavage" at its own C......-terminus. A conserved feature among variants of 2A is the C-terminal motif N16P17G18/P19 where P19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E14, S15 and N16 within the 2A sequence of infectious FMDVs but no variants at residues P17, G18...... or P19 have been identified. In this study, using highly degenerate primers, we analysed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after 2, 3 or 4 passages. However...
Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

Directory of Open Access Journals (Sweden)

Farré Domènec

2007-12-01

Background: The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results: We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion: The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

mESAdb: microRNA expression and sequence analysis database.

Science.gov (United States)

Kaya, Koray D; Karakülah, Gökhan; Yakicier, Cengiz M; Acar, Aybar C; Konu, Ozlen

2011-01-01

microRNA expression and sequence analysis database (http://konulab.fen.bilkent.edu.tr/mirna/) (mESAdb) is a regularly updated database for the multivariate analysis of sequences and expression of microRNAs from multiple taxa. mESAdb is modular and has a user interface implemented in PHP and JavaScript and coupled with statistical analysis and visualization packages written for the R language. The database primarily comprises mature microRNA sequences and their target data, along with selected human, mouse and zebrafish expression data sets. mESAdb analysis modules allow (i) mining of microRNA expression data sets for subsets of microRNAs selected manually or by motif; (ii) pair-wise multivariate analysis of expression data sets within and between taxa; and (iii) association of microRNA subsets with annotation databases, HUGE Navigator, KEGG and GO. The use of existing and customized R packages facilitates future addition of data sets and analysis tools. Furthermore, the ability to upload and analyze user-specified data sets makes mESAdb an interactive and expandable analysis tool for microRNA sequence and expression data.
Computational analyses of synergism in small molecular network motifs.

Directory of Open Access Journals (Sweden)

Yili Zhang

2014-03-01

Full Text Available Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically to alter the responses of the motifs to stimuli. Synergism (or antagonism was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.
Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences

Directory of Open Access Journals (Sweden)

O. V. Galzitskaya

2015-01-01

How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion: the occurrence of simple motifs, which can be imprints of evolutionary history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing motifs from the first group (one of the two amino acids occurs once at the terminal position) made up about 20%; for motifs from the second group (one of the two amino acids occurs once within the pattern) and the third group (the two amino acids occur randomly), the figures were 30% and 50%, respectively. For bacterial proteomes, the proportions were 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62) is higher than that between fungi and plants (0.54). Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: Diplomonadida are closer to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are closer to each other than to other kingdoms of Eukaryota.
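One reading of the figure 11780, offered here as an assumption since the abstract does not spell out the construction, is that it counts every six-residue string built from exactly two distinct amino acids out of the 20 standard ones: 190 unordered pairs times 62 arrangements that use both letters. The sketch below enumerates these strings and counts proteins containing a given motif; the function names and the toy proteome are hypothetical.

```python
from itertools import combinations, product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def two_letter_motifs(length=6):
    """All strings of the given length that use exactly two distinct residues."""
    motifs = []
    for a, b in combinations(AMINO_ACIDS, 2):
        for pattern in product((a, b), repeat=length):
            if a in pattern and b in pattern:  # skip the two homopolymers
                motifs.append("".join(pattern))
    return motifs


def proteins_with_motif(motif, proteome):
    """Number of protein sequences in which the motif occurs at least once."""
    return sum(motif in sequence for sequence in proteome)


motifs = two_letter_motifs()
print(len(motifs))  # 190 pairs * (2**6 - 2) arrangements

toy_proteome = ["MAAQAAAQLK", "GGSGGGSGGG", "MKVLAAQAAA"]
print(proteins_with_motif("AAQAAA", toy_proteome))
```

Per-proteome vectors of such counts, one entry per motif, are the kind of data from which the correlation coefficients between kingdoms mentioned above could be computed.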
Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy.

Science.gov (United States)

Matkovich, Scot J; Dorn, Gerald W

2015-01-01

MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses.
MicroRNA-encoding long non-coding RNAs

Directory of Open Access Journals (Sweden)

Zhu Xiaopeng

2008-05-01

Background: Recent analysis of the mouse transcriptional data has revealed the existence of ~34,000 messenger-like non-coding RNAs (ml-ncRNAs). Whereas the functional properties of these ml-ncRNAs are beginning to be unravelled, no functional information is available for the large majority of these transcripts. Results: A few ml-ncRNAs have been shown to have genomic loci that overlap with microRNA loci, leading us to suspect that a fraction of ml-ncRNAs may encode microRNAs. We therefore developed an algorithm (PriMir) for specifically detecting potential microRNA-encoding transcripts in the entire set of 34,030 mouse full-length ml-ncRNAs. In combination with mouse-rat sequence conservation, this algorithm detected 97 strong miRNA-encoding candidates (80 of them novel), and for 52 of these we obtained experimental evidence for the existence of their corresponding mature microRNA by microarray and stem-loop RT-PCR. Sequence analysis of the microRNA-encoding RNAs revealed an internal motif, whose presence correlates strongly (R² = 0.9, P-value = 2.2 × 10⁻¹⁶) with the occurrence of stem-loops with characteristics of known pre-miRNAs, indicating the presence of a larger number of microRNA-encoding RNAs (from 300 up to 800) in the ml-ncRNA population. Conclusion: Our work highlights a unique group of ml-ncRNAs and offers clues to their functions.
Crystallographic and Modeling Studies of RNase III Suggest a Mechanism for Double-Stranded RNA Cleavage | Center for Cancer Research

Science.gov (United States)

Background: Ribonuclease III belongs to the family of Mg2+-dependent endonucleases that show specificity for double-stranded RNA (dsRNA). RNase III is conserved in all known bacteria and eukaryotes and has 1–2 copies of a 9-residue consensus sequence, known as the RNase III signature motif. The bacterial RNase III proteins are the simplest, consisting of two domains: an
Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

Science.gov (United States)

Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

2015-06-01

Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.
Kinetic discrimination of self/non-self RNA by the ATPase activity of RIG-I and MDA5.

Science.gov (United States)

Louber, Jade; Brunel, Joanna; Uchikawa, Emiko; Cusack, Stephen; Gerlier, Denis

2015-07-28

The cytoplasmic RIG-like receptors are responsible for the early detection of viruses and other intracellular microbes by activating the innate immune response mediated by type I interferons (IFNs). RIG-I and MDA5 detect virus-specific RNA motifs with short 5'-tri/diphosphorylated, blunt-end double-stranded RNA (dsRNA) and >0.5-2 kb long dsRNA as canonical agonists, respectively. However, in vitro, they can bind to many RNA species, while in cells there is an activation threshold. As SF2 helicase/ATPase family members, ATP hydrolysis is dependent on co-operative RNA and ATP binding. Whereas simultaneous ATP and cognate RNA binding is sufficient to activate RIG-I by releasing autoinhibition of the signaling domains, the physiological role of the ATPase activity of RIG-I and MDA5 remains controversial. A cross-analysis of a rationally designed panel of RNA binding and ATPase mutants and truncated receptors, using type I IFN promoter activation as readout, allows us to refine our understanding of the structure-function relationships of RIG-I and MDA5. RNA activation of RIG-I depends on multiple critical RNA binding sites in its helicase domain as confirmed by functional evidence using novel mutations. We found that RIG-I or MDA5 mutants with low ATP hydrolysis activity exhibit constitutive activity but this was fully reverted when associated with mutations preventing RNA binding to the helicase domain. We propose that the turnover kinetics of the ATPase domain enables the discrimination of self/non-self RNA by both RIG-I and MDA5. Non-cognate, possibly self, RNA binding would lead to fast ATP turnover and RNA disassociation and thus insufficient time for the caspase activation and recruitment domains (CARDs) to promote downstream signaling, whereas tighter cognate RNA binding provides a longer time window for downstream events to be engaged. The exquisite fine-tuning of RIG-I and MDA5 RNA-dependent ATPase activity coupled to CARD release allows a robust IFN response
dsRNA binding properties of RDE-4 and TRBP reflect their distinct roles in RNAi.

Science.gov (United States)

Parker, Greg S; Maity, Tuhin Subhra; Bass, Brenda L

2008-12-26

Double-stranded RNA (dsRNA)-binding proteins facilitate Dicer functions in RNA interference. Caenorhabditis elegans RDE-4 facilitates cleavage of long dsRNA to small interfering RNA (siRNA), while human trans-activation response RNA-binding protein (TRBP) functions downstream to pass siRNA to the RNA-induced silencing complex. We show that these distinct in vivo roles are reflected in in vitro binding properties. RDE-4 preferentially binds long dsRNA, while TRBP binds siRNA with an affinity that is independent of dsRNA length. These properties are mechanistically based on the fact that RDE-4 binds cooperatively, via contributions from multiple domains, while TRBP binds noncooperatively. Our studies offer a paradigm for how dsRNA-binding proteins, which are not sequence specific, discern dsRNA length. Additionally, analyses of the ability of RDE-4 deletion constructs and RDE-4/TRBP chimeras to reconstitute Dicer activity suggest RDE-4 promotes activity using its dsRNA-binding motif 2 to bind dsRNA, its linker region to interact with Dicer, and its C-terminus for Dicer activation.
Methods and statistics for combining motif match scores.

Science.gov (United States)

Bailey, T L; Gribskov, M

1998-01-01

Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http://www.sdsc.edu/MEME.
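As an illustration of method 3), the sketch below computes the significance of a product of p-values using the standard closed form for a product of n independent, uniformly distributed variates: Pr(product <= P) = P * sum over k from 0 to n-1 of (-ln P)^k / k!. It assumes the per-motif p-values are independent; the function name `combined_p_value` is illustrative and is not taken from MAST.

```python
import math


def combined_p_value(p_values):
    """P-value of the product of n independent, uniform p-values."""
    n = len(p_values)
    if n == 0:
        raise ValueError("need at least one p-value")
    if any(p <= 0.0 or p > 1.0 for p in p_values):
        raise ValueError("p-values must lie in (0, 1]")
    # Work with x = -ln(product) to avoid underflow for very small p-values.
    x = -sum(math.log(p) for p in p_values)
    # Pr(product <= observed) = exp(-x) * sum_{k=0}^{n-1} x^k / k!
    term = 1.0
    total = 1.0
    for k in range(1, n):
        term *= x / k
        total += term
    return math.exp(-x) * total
```

For a single motif the function returns the input p-value unchanged. For several motifs the combined value is larger than the raw product, which corrects for the fact that a product of many p-values is small by chance alone.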
Evaluation of microRNA alignment techniques

Science.gov (United States)

Kaspi, Antony; El-Osta, Assam

2016-01-01

Genomic alignment of small RNA (smRNA) sequences such as microRNAs poses considerable challenges due to their short length (∼21 nucleotides [nt]) as well as the large size and complexity of plant and animal genomes. While several tools have been developed for high-throughput mapping of longer mRNA-seq reads (>30 nt), there are few that are specifically designed for mapping of smRNA reads including microRNAs. The accuracy of these mappers has not been systematically determined in the case of smRNA-seq. In addition, it is unknown whether these aligners accurately map smRNA reads containing sequence errors and polymorphisms. By using simulated read sets, we determine the alignment sensitivity and accuracy of 16 short-read mappers and quantify their robustness to mismatches, indels, and nontemplated nucleotide additions. These were explored in the context of a plant genome (Oryza sativa, ∼500 Mbp) and a mammalian genome (Homo sapiens, ∼3.1 Gbp). Analysis of simulated and real smRNA-seq data demonstrates that mapper selection impacts differential expression results and interpretation. These results will inform on best practice for smRNA mapping and enable more accurate smRNA detection and quantification of expression and RNA editing. PMID:27284164
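A minimal sketch of how a mapper could be scored against a simulated read set follows. The definitions are assumptions and may differ from those used in the study: sensitivity is taken as the fraction of simulated reads that were aligned at all, and accuracy as the fraction of aligned reads placed at their true origin. The function name and the `(chrom, pos)` record layout are hypothetical.

```python
def alignment_metrics(truth, calls):
    """Score one mapper on a simulated read set.

    truth: dict read_id -> (chrom, pos) giving the simulated origin.
    calls: dict read_id -> (chrom, pos) for reads the mapper aligned.
    """
    total = len(truth)
    aligned = [r for r in truth if r in calls]
    correct = [r for r in aligned if calls[r] == truth[r]]
    # Assumed definitions; other conventions exist.
    sensitivity = len(aligned) / total if total else 0.0
    accuracy = len(correct) / len(aligned) if aligned else 0.0
    return sensitivity, accuracy
```

Running the same function on read sets simulated with mismatches, indels, or nontemplated additions would give one way to compare robustness across mappers.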
Viral Small-RNA Analysis of Bombyx mori Larval Midgut during Persistent and Pathogenic Cytoplasmic Polyhedrosis Virus Infection.

Science.gov (United States)

Zografidis, Aris; Van Nieuwerburgh, Filip; Kolliopoulou, Anna; Apostolou-Karampelis, Konstantinos; Head, Steven R; Deforce, Dieter; Smagghe, Guy; Swevers, Luc

2015-11-01

The lepidopteran innate immune response against RNA viruses remains poorly understood, while in other insects several studies have highlighted an essential role for the exo-RNAi pathway in combating viral infection. Here, by using deep-sequencing technology for viral small-RNA (vsRNA) assessment, we provide evidence that exo-RNAi is operative in the silkworm Bombyx mori against both persistent and pathogenic infection of B. mori cytoplasmic polyhedrosis virus (BmCPV) which is characterized by a segmented double-stranded RNA (dsRNA) genome. Further, we show that Dicer-2 predominantly targets viral dsRNA and produces 20-nucleotide (nt) vsRNAs, whereas an additional pathway is responsive to viral mRNA derived from segment 10. Importantly, vsRNA distributions, which define specific hot and cold spot profiles for each viral segment, to a considerable degree overlap between Dicer-2-related (19 to 21 nt) and Dicer-2-unrelated vsRNAs, suggesting a common origin for these profiles. We found a degenerate motif significantly enriched at the cut sites of vsRNAs of various lengths, which links an unknown RNase to the origins of vsRNA biogenesis and distribution. Accordingly, the indicated RNase activity may be an important early factor for the host's antiviral defense in Lepidoptera. This work contributes to the elucidation of the lepidopteran antiviral response against infection of segmented double-stranded RNA (dsRNA) virus (CPV; Reoviridae) and highlights the importance of viral small-RNA (vsRNA) analysis for getting insights into host-pathogen interactions. Three vsRNA pathways are implicated in antiviral defense. For dsRNA, two pathways are proposed, either based on Dicer-2 cleavage to generate 20-nucleotide vsRNAs or based on the activity of an uncharacterized endo-RNase that cleaves the viral RNA substrate at a degenerate motif. The analysis also indicates the existence of a degradation pathway that targets the positive strand of segment 10.
An Inhibitory Motif on the 5’UTR of Several Rotavirus Genome Segments Affects Protein Expression and Reverse Genetics Strategies

Science.gov (United States)

Papa, Guido; Eichwald, Catherine; Burrone, Oscar R.

2016-01-01

The rotavirus genome consists of eleven segments of dsRNA, each encoding one single protein. Viral mRNAs contain an open reading frame (ORF) flanked by relatively short untranslated regions (UTRs), whose role in the viral cycle remains elusive. Here we investigated the role of 5’UTRs in T7 polymerase-driven cDNA expression in uninfected cells. The 5’UTRs of eight genome segments (gs3, gs5-6, gs7-11) of the simian SA11 strain showed a strong inhibitory effect on the expression of viral proteins. Decreased protein expression was due to both compromised transcription and translation and was independent of the ORF and the 3’UTR sequences. Analysis of several mutants of the 21-nucleotide long 5’UTR of gs11 defined an inhibitory motif (IM) represented by its primary sequence rather than its secondary structure. IM was mapped to the 5’ terminal 6-nucleotide long pyrimidine-rich tract 5’-GGY(U/A)UY-3’. The 5’ terminal position within the mRNA was shown to be essentially required, as inhibitory activity was lost when IM was moved to an internal position. We identified two mutations (insertion of a G upstream of the 5’UTR and the U to A mutation of the fifth nucleotide of IM) that render IM non-functional and increase the transcription and translation rate to levels that could considerably improve the efficiency of virus helper-free reverse genetics strategies. PMID:27846320
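The consensus 5’-GGY(U/A)UY-3’ and its positional requirement can be written as an anchored regular expression. This is only a sketch of the sequence rule stated in the abstract. The example sequences in the usage note are invented for illustration and are not SA11 UTRs, and the function name `has_terminal_im` is hypothetical.

```python
import re

# IUPAC Y = pyrimidine (C or U in RNA); (U/A) = U or A at position 4.
# The '^' anchor encodes the requirement that the motif sits at the
# 5' terminus of the transcript rather than at an internal position.
IM_PATTERN = re.compile(r"^GG[CU][UA]U[CU]")


def has_terminal_im(five_prime_utr):
    """Return True if the sequence starts with the GGY(U/A)UY consensus."""
    rna = five_prime_utr.upper().replace("T", "U")  # accept cDNA-style input
    return IM_PATTERN.match(rna) is not None
```

Under this rule a made-up sequence such as `GGCUUUUAAAG` matches, while prepending a G (`GGGCUUUU`) or changing the fifth nucleotide from U to A (`GGCUAUUA`) breaks the match, mirroring the two inactivating mutations described above.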
CRISPR-Cas: evolution of an RNA-based adaptive immunity system in prokaryotes.

Science.gov (United States)

Koonin, Eugene V; Makarova, Kira S

2013-05-01

The CRISPR-Cas (clustered regularly interspaced short palindromic repeats, CRISPR-associated genes) is an adaptive immunity system in bacteria and archaea that functions via a distinct self-non-self recognition mechanism that is partially analogous to the mechanism of eukaryotic RNA interference (RNAi). The CRISPR-Cas system incorporates fragments of virus or plasmid DNA into the CRISPR repeat cassettes and employs the processed transcripts of these spacers as guide RNAs to cleave the cognate foreign DNA or RNA. The Cas proteins, however, are not homologous to the proteins involved in RNAi and comprise numerous, highly diverged families. The majority of the Cas proteins contain diverse variants of the RNA recognition motif (RRM), a widespread RNA-binding domain. Despite the fast evolution that is typical of the cas genes, the presence of diverse versions of the RRM in most Cas proteins provides for a simple scenario for the evolution of the three distinct types of CRISPR-cas systems. In addition to several proteins that are directly implicated in the immune response, the cas genes encode a variety of proteins that are homologous to prokaryotic toxins that typically possess nuclease activity. The predicted toxins associated with CRISPR-Cas systems include the essential Cas2 protein, proteins of COG1517 that, in addition to a ligand-binding domain and a helix-turn-helix domain, typically contain different nuclease domains and several other predicted nucleases. The tight association of the CRISPR-Cas immunity systems with predicted toxins that, upon activation, would induce dormancy or cell death suggests that adaptive immunity and dormancy/suicide response are functionally coupled. Such coupling could manifest in the persistence state being induced and potentially providing conditions for more effective action of the immune system or in cell death being triggered when immunity fails.
Trans-acting translational regulatory RNA binding proteins.

Science.gov (United States)

Harvey, Robert F; Smith, Tom S; Mulroney, Thomas; Queiroz, Rayner M L; Pizzinga, Mariavittoria; Dezi, Veronica; Villenueva, Eneko; Ramakrishna, Manasa; Lilley, Kathryn S; Willis, Anne E

2018-05-01

The canonical molecular machinery required for global mRNA translation and its control has been well defined, with distinct sets of proteins involved in the processes of translation initiation, elongation and termination. Additionally, noncanonical, trans-acting regulatory RNA-binding proteins (RBPs) are necessary to provide mRNA-specific translation, and these interact with 5' and 3' untranslated regions and coding regions of mRNA to regulate ribosome recruitment and transit. Recently it has also been demonstrated that trans-acting ribosomal proteins direct the translation of specific mRNAs. Importantly, it has been shown that subsets of RBPs often work in concert, forming distinct regulatory complexes upon different cellular perturbation, creating an RBP combinatorial code, which, through the translation of specific subsets of mRNAs, dictates cell fate. With the development of new methodologies, a plethora of novel RNA binding proteins have recently been identified, although the function of many of these proteins within mRNA translation is unknown. In this review we will discuss these methodologies and their shortcomings when applied to the study of translation, which need to be addressed to enable a better understanding of trans-acting translational regulatory proteins. Moreover, we discuss the protein domains that are responsible for RNA binding as well as the RNA motifs to which they bind, and the role of trans-acting ribosomal proteins in directing the translation of specific mRNAs. This article is categorized under: RNA Interactions with Proteins and Other Molecules > RNA-Protein Complexes; Translation > Translation Regulation; Translation > Translation Mechanisms. © 2018 Medical Research Council and University of Cambridge. WIREs RNA published by Wiley Periodicals, Inc.
Using the Hepatitis C Virus RNA-Dependent RNA Polymerase as a Model to Understand Viral Polymerase Structure, Function and Dynamics

Directory of Open Access Journals (Sweden)

Ester Sesmero

2015-07-01

Viral polymerases replicate and transcribe the genomes of several viruses of global health concern such as Hepatitis C virus (HCV), human immunodeficiency virus (HIV) and Ebola virus. For this reason they are key targets for therapies to treat viral infections. Although there is little sequence similarity across the different types of viral polymerases, all of them present a right-hand shape and certain structural motifs that are highly conserved. These features allow their functional properties to be compared, with the goal of broadly applying the knowledge acquired from studying specific viral polymerases to other viral polymerases about which less is known. Here we review the structural and functional properties of the HCV RNA-dependent RNA polymerase (NS5B) in order to understand the fundamental processes underlying the replication of viral genomes. We discuss recent insights into the process by which RNA replication occurs in NS5B as well as the role that conformational changes play in this process.
The primary structure of L37--a rat ribosomal protein with a zinc finger-like motif.

Science.gov (United States)

Chan, Y L; Paz, V; Olvera, J; Wool, I G

1993-04-30

The amino acid sequence of the rat 60S ribosomal subunit protein L37 was deduced from the sequence of nucleotides in a recombinant cDNA. Ribosomal protein L37 has 96 amino acids, the NH2-terminal methionine is removed after translation of the mRNA, and has a molecular weight of 10,939. Ribosomal protein L37 has a single zinc finger-like motif of the C2-C2 type. Hybridization of the cDNA to digests of nuclear DNA suggests that there are 13 or 14 copies of the L37 gene. The mRNA for the protein is about 500 nucleotides in length. Rat L37 is related to Saccharomyces cerevisiae ribosomal protein YL35 and to Caenorhabditis elegans L37. We have identified in the data base a DNA sequence that encodes the chicken homolog of rat L37.
Non-Watson-Crick basepairing and hydration in RNA motifs: molecular dynamics of 5S rRNA loop E

Czech Academy of Sciences Publication Activity Database

Réblová, K.; Špačková, Naďa; Štefl, R.; Csaszar, K.; Koča, J.; Leontis, N. B.; Šponer, Jiří

2003-01-01

Vol. 84, No. 6 (2003), pp. 3564-3582. ISSN 0006-3495. R&D Projects: GA MŠk LN00A016. Grant - others: National Institutes of Health (US) 2R15 GM55898; National Science Foundation (US) CHE-9732563. Institutional research plan: CEZ:AV0Z5004920. Keywords: non-Watson-Crick base pairs * ribosomal RNA * Loop E. Subject RIV: BO - Biophysics. Impact factor: 4.463, year: 2003
Low-dimensional morphospace of topological motifs in human fMRI brain networks

Directory of Open Access Journals (Sweden)

Sarah E. Morgan

2018-06-01

We present a low-dimensional morphospace of fMRI brain networks, where axes are defined in a data-driven manner based on the network motifs. The morphospace allows us to identify the key variations in healthy fMRI networks in terms of their underlying motifs, and we observe that two principal components (PCs) can account for 97% of the motif variability. The first PC of the motif distribution is correlated with efficiency and inversely correlated with transitivity. Hence this axis approximately conforms to the well-known economical small-world trade-off between integration and segregation in brain networks. Finally, we show that the economical clustering generative model proposed by Vértes et al. (2012) can approximately reproduce the motif morphospace of the real fMRI brain networks, in contrast to other generative models. Overall, the motif morphospace provides a powerful way to visualize the relationships between network properties and to investigate generative or constraining factors in the formation of complex human brain functional networks. Motifs have been described as the building blocks of complex networks. Meanwhile, a morphospace allows networks to be placed in a common space and can reveal the relationships between different network properties and elucidate the driving forces behind network topology. We combine the concepts of motifs and morphospaces to create the first motif morphospace of fMRI brain networks. Crucially, the morphospace axes are defined by the motifs, in a data-driven manner. We observe strong correlations between the networks’ positions in morphospace and their global topological properties, suggesting that motif morphospaces are a powerful way to capture the topology of networks in a low-dimensional space and to compare generative models of brain networks. Motif morphospaces could also be used to study other complex networks’ topologies.
Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

Science.gov (United States)

Roy, Indranil; Aluru, Srinivas

2016-01-01

Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26, 11). We propose a novel algorithm for the (l, d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the Micron Automata Processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39, 18) and (40, 17). The paper serves as a useful guide to solving problems using this new accelerator technology.
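To make the problem statement concrete, here is a brute-force reference sketch of the (l, d) motif search with quorum q. It enumerates every candidate of length l over the alphabet, so its cost grows as |alphabet|^l and it is only usable for toy inputs. It is not the NFA-based algorithm of the paper, and all function names are illustrative.

```python
from itertools import product


def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def occurs_within(motif, seq, d):
    """True if some window of seq is within d substitutions of motif."""
    l = len(motif)
    return any(hamming(motif, seq[i:i + l]) <= d
               for i in range(len(seq) - l + 1))


def ld_motif_search(seqs, l, d, q, alphabet="ACGT"):
    """Return all length-l motifs occurring, with at most d substitutions,
    in at least q of the given sequences."""
    motifs = []
    for cand in product(alphabet, repeat=l):
        motif = "".join(cand)
        support = sum(occurs_within(motif, s, d) for s in seqs)
        if support >= q:
            motifs.append(motif)
    return motifs
```

For example, with the invented sequences `["TTACGTAA", "GGACCTCC", "ACGAGGGG"]`, the call `ld_motif_search(seqs, 4, 1, 3)` includes `"ACGT"`, whereas requiring exact matches (`d=0`) yields no motif shared by all three.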

Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network

Directory of Open Access Journals (Sweden)

Barabási Albert-László

2004-01-01

Background: Transcriptional regulation of cellular functions is carried out through a complex network of interactions among transcription factors and the promoter regions of genes and operons regulated by them. To better understand the system-level function of such networks, simplification of their architecture was previously achieved by identifying the motifs present in the network, which are small, overrepresented, topologically distinct regulatory interaction patterns (subgraphs). However, the interaction of such motifs with each other, and their form of integration into the full network, has not been previously examined. Results: By studying the transcriptional regulatory network of the bacterium Escherichia coli, we demonstrate that the two previously identified motif types in the network (i.e., feed-forward loops and bi-fan motifs) do not exist in isolation, but rather aggregate into homologous motif clusters that largely overlap with known biological functions. Moreover, these clusters further coalesce into a supercluster, thus establishing distinct topological hierarchies that show global statistical properties similar to the whole network. Targeted removal of motif links disintegrates the network into small, isolated clusters, while random disruptions of an equal number of links do not cause such an effect. Conclusion: Individual motifs aggregate into homologous motif clusters and a supercluster forming the backbone of the E. coli transcriptional regulatory network and play a central role in defining its global topological organization.
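The two motif types named above have simple graph definitions: a feed-forward loop is a triple X→Y, Y→Z, X→Z, and a bi-fan is a pair of regulators that both regulate the same pair of targets. The sketch below counts them in a directed edge list. It counts non-induced occurrences (extra edges among the nodes are tolerated), which is an assumption and may differ from the counting convention used in the study. Function names are illustrative.

```python
from itertools import combinations, permutations


def count_feed_forward_loops(edges):
    """Count node triples (x, y, z) with edges x->y, y->z and x->z."""
    edge_set = set(edges)
    nodes = {n for e in edges for n in e}
    count = 0
    for x, y, z in permutations(nodes, 3):
        if (x, y) in edge_set and (y, z) in edge_set and (x, z) in edge_set:
            count += 1
    return count


def count_bi_fans(edges):
    """Count pairs of regulators sharing a pair of common targets."""
    targets = {}
    for src, dst in edges:
        if src != dst:  # ignore autoregulation
            targets.setdefault(src, set()).add(dst)
    count = 0
    for a, b in combinations(sorted(targets), 2):
        shared = (targets[a] & targets[b]) - {a, b}
        # Each unordered pair of shared targets forms one bi-fan.
        count += len(shared) * (len(shared) - 1) // 2
    return count
```

The triple enumeration is cubic in the number of nodes, which is acceptable for small regulatory networks but would need an adjacency-based search for larger graphs.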
MODA: an efficient algorithm for network motif discovery in biological networks.

Science.gov (United States)

Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

2009-10-01

In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/
RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps

Science.gov (United States)

Drory Retwitzer, Matan; Polishchuk, Maya; Churkin, Elena; Kifer, Ilona; Yakhini, Zohar; Barash, Danny

2015-01-01

Searching for RNA sequence-structure patterns is becoming an essential tool for RNA practitioners. Novel discoveries of regulatory non-coding RNAs in targeted organisms and the motivation to find them across a wide range of organisms have prompted the use of computational RNA pattern matching as an enhancement to sequence similarity. State-of-the-art programs differ by the flexibility of patterns allowed as queries and by their simplicity of use. In particular—no existing method is available as a user-friendly web server. A general program that searches for RNA sequence-structure patterns is RNA Structator. However, it is not available as a web server and does not provide the option to allow flexible gap pattern representation with an upper bound of the gap length being specified at any position in the sequence. Here, we introduce RNAPattMatch, a web-based application that is user friendly and makes sequence/structure RNA queries accessible to practitioners of various background and proficiency. It also extends RNA Structator and allows a more flexible variable gaps representation, in addition to analysis of results using energy minimization methods. RNAPattMatch service is available at http://www.cs.bgu.ac.il/rnapattmatch. A standalone version of the search tool is also available to download at the site. PMID:25940619
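The idea of a flexible gap with an upper bound on its length can be illustrated on the sequence part of a query alone by translating it into a regular expression. The mini-syntax below (a list of IUPAC strings interleaved with `(min_gap, max_gap)` tuples) is hypothetical and is not RNAPattMatch's query format. The sketch also ignores the structure component and the energy-based analysis entirely.

```python
import re

# Small subset of IUPAC RNA codes, enough for the illustration.
IUPAC = {"A": "A", "C": "C", "G": "G", "U": "U",
         "R": "[AG]", "Y": "[CU]", "N": "[ACGU]"}


def compile_pattern(parts):
    """Build a regex from IUPAC strings and (min_gap, max_gap) tuples."""
    out = []
    for part in parts:
        if isinstance(part, tuple):
            lo, hi = part
            out.append("[ACGU]{%d,%d}" % (lo, hi))  # bounded variable gap
        else:
            out.append("".join(IUPAC[ch] for ch in part))
    return re.compile("".join(out))


def find_matches(parts, rna):
    """Return (start, matched_text) for each non-overlapping match."""
    return [(m.start(), m.group())
            for m in compile_pattern(parts).finditer(rna)]
```

With `parts = ["GGAC", (0, 3), "GUCC"]`, the invented sequence `AAGGACUUGUCCAA` gives one hit starting at position 2, while a four-nucleotide gap (`GGACUUUUGUCC`) exceeds the bound and is not reported.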
The role of incoherent microRNA-mediated feedforward loops in noise buffering.

Directory of Open Access Journals (Sweden)

Matteo Osella

2011-03-01

MicroRNAs are endogenous non-coding RNAs which negatively regulate the expression of protein-coding genes in plants and animals. They are known to play an important role in several biological processes and, together with transcription factors, form a complex and highly interconnected regulatory network. Looking at the structure of this network, it is possible to recognize a few overrepresented motifs which are expected to perform important elementary regulatory functions. Among them, a special role is played by the microRNA-mediated feedforward loop in which a master transcription factor regulates a microRNA and, together with it, a set of target genes. In this paper we show analytically and through simulations that the incoherent version of this motif can couple the fine-tuning of a target protein level with an efficient noise control, thus conferring precision and stability to the overall gene expression program, especially in the presence of fluctuations in upstream regulators. Among the other results, a nontrivial prediction of our model is that the optimal attenuation of fluctuations coincides with a modest repression of the target expression. This feature is coherent with the expected fine-tuning function and in agreement with experimental observations of the actual impact of a wide class of microRNAs on the protein output of their targets. Finally, we describe the impact on noise-buffering efficiency of the cross-talk between microRNA targets that can naturally arise if the microRNA-mediated circuit is not considered as isolated, but embedded in a larger network of regulations.
Dynamic motifs in socio-economic networks

Science.gov (United States)

Zhang, Xin; Shao, Shuai; Stanley, H. Eugene; Havlin, Shlomo

2014-12-01

Socio-economic networks are of central importance in economic life. We develop a method of identifying and studying motifs in socio-economic networks by focusing on “dynamic motifs,” i.e., evolutionary connection patterns that, because of “node acquaintances” in the network, occur much more frequently than random patterns. We examine two evolving bi-partite networks: i) the world-wide commercial ship chartering market and ii) the ship build-to-order market. We find similar dynamic motifs in both bipartite networks, even though they describe different economic activities. We also find that “influence” and “persistence” are strong factors in the interaction behavior of organizations. When two companies are doing business with the same customer, it is highly probable that another customer who currently only has business relationship with one of these two companies, will become customer of the second in the future. This is the effect of influence. Persistence means that companies with close business ties to customers tend to maintain their relationships over a long period of time.
Characterization of a novel single-stranded RNA mycovirus in Pleurotus ostreatus

International Nuclear Information System (INIS)

Yu, Hyun Jae; Lim, Dongbin; Lee, Hyun-Sook

2003-01-01

A mycovirus, named oyster mushroom spherical virus (OMSV), was isolated from cultivated oyster mushrooms with a severe epidemic of oyster mushroom Die-back disease. OMSV was a 27-nm spherical virus encapsidating a single-stranded RNA (ssRNA) of 5.784 kb with a coat protein of approximately 28.5 kDa. The nucleotide sequence of the virus revealed that its genomic RNA was positive strand, containing 5784 bases with seven open reading frames (ORF). ORF1 had the motifs of RNA-dependent RNA polymerases (RdRp) and helicase. ORF2 encoded a coat protein. ORF3 to 7 could encode putative polypeptides of approximately 12, 12.5, 21, 14.5, and 23 kDa, respectively, but none of them showed significant similarity to any other known polypeptides. The 5' end of the viral RNA was uncapped and the 3' end was polyadenylated with 74 bases. Genomic structure and organization and the derived amino acid sequence of RdRp and helicase domain were similar to those of tymoviruses, a plant virus group
Shortcomings of short hairpin RNA-based transgenic RNA interference in mouse oocytes

Czech Academy of Sciences Publication Activity Database

Sarnová, Lenka; Malík, Radek; Sedláček, Radislav; Svoboda, Petr

2010-01-01

Vol. 9, No. 8 (2010), pp. 1-10. ISSN 1477-5751. R&D Projects: GA MŠk ME09039. Grant - others: EMBO SDIG (DE) project 1483. Institutional research plan: CEZ:AV0Z50520514. Keywords: transgenic RNAi * shRNA * oocyte. Subject RIV: EB - Genetics ; Molecular Biology. http://www.jnrbm.com/content/9/1/8
Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

Science.gov (United States)

Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

2017-11-01

Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
Multiple POU-binding motifs, recognized by tissue-specific nuclear factors, are important for Dll1 gene expression in neural stem cells

International Nuclear Information System (INIS)

Nakayama, Kohzo; Nagase, Kazuko; Tokutake, Yuriko; Koh, Chang-Sung; Hiratochi, Masahiro; Ohkawara, Takeshi; Nakayama, Noriko

2004-01-01

We cloned the 5'-flanking region of the mouse homolog of the Delta gene (Dll1) and demonstrated that the sequence between nucleotide position -514 and -484 in the 5'-flanking region of Dll1 played a critical role in the regulation of its tissue-specific expression in neural stem cells (NSCs). Further, we showed that multiple POU-binding motifs, located within this short sequence of 30 bp, were essential for transcriptional activation of Dll1 and also that multiple tissue-specific nuclear factors recognized these POU-binding motifs in various combinations through differentiation of NSCs. Thus, POU-binding factors may play an important role in Dll1 expression in developing NSCs
Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

Science.gov (United States)

Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

2008-10-01

One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.
Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler

Science.gov (United States)

Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

2013-01-01

One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281
Shortcomings of short hairpin RNA-based transgenic RNA interference in mouse oocytes

Czech Academy of Sciences Publication Activity Database

Sarnová, Lenka; Malík, Radek; Sedláček, Radislav; Svoboda, Petr

2010-01-01

Vol. 9, No. 8 (2010), pp. 1-10. ISSN 1477-5751. R&D Projects: GA MŠk ME09039. Grant - others: EMBO SDIG(DE) project 1483. Institutional research plan: CEZ:AV0Z50520514. Keywords: transgenic RNAi * shRNA * oocyte. Subject RIV: EB - Genetics; Molecular Biology. http://www.jnrbm.com/content/9/1/8
Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

Science.gov (United States)

Kinjo, Akira R.; Nakamura, Haruki

2012-01-01

Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478
5S rRNA-derived and tRNA-derived SINEs in fruit bats.

Science.gov (United States)

Gogolevsky, Konstantin P; Vassetzky, Nikita S; Kramerov, Dmitri A

2009-05-01

Most short retroposons (SINEs) descend from cellular tRNA or 7SL RNA. Here, four new SINEs were found in megabats (Megachiroptera) but not in microbats or other mammals. Two of them, MEG-RS and MEG-RL, descend from another cellular RNA, 5S rRNA; one (MEG-T2) is a tRNA-derived SINE; and MEG-TR is a hybrid tRNA/5S rRNA SINE. Insertion locus analysis suggests that these SINEs were active in recent fruit bat evolution. Comparison of MEG-RS and MEG-RL with the few other 5S rRNA-derived SINEs demonstrates that the internal RNA polymerase III promoter is their most invariant region, while the secondary structure is more variable. The mechanisms underlying the modular structure of these and other SINEs, as well as their variation, are discussed. A scenario for the evolution of MEG SINEs is proposed.
A 6-Nucleotide Regulatory Motif within the AbcR Small RNAs of Brucella abortus Mediates Host-Pathogen Interactions.

Science.gov (United States)

Sheehan, Lauren M; Caswell, Clayton C

2017-06-06

In Brucella abortus, two small RNAs (sRNAs), AbcR1 and AbcR2, are responsible for regulating transcripts encoding ABC-type transport systems. AbcR1 and AbcR2 are required for Brucella virulence, as a double chromosomal deletion of both sRNAs results in attenuation in mice. Although these sRNAs are responsible for targeting transcripts for degradation, the mechanism utilized by the AbcR sRNAs to regulate mRNA in Brucella has not been described. Here, two motifs (M1 and M2) were identified in AbcR1 and AbcR2, and complementary motif sequences were defined in AbcR-regulated transcripts. Site-directed mutagenesis of M1 or M2 or of both M1 and M2 in the sRNAs revealed transcripts to be targeted by one or both motifs. Electrophoretic mobility shift assays revealed direct, concentration-dependent binding of both AbcR sRNAs to a target mRNA sequence. These experiments genetically and biochemically characterized two indispensable motifs within the AbcR sRNAs that bind to and regulate transcripts. Additionally, cellular and animal models of infection demonstrated that only M2 in the AbcR sRNAs is required for Brucella virulence. Furthermore, one of the M2-regulated targets, BAB2_0612, was found to be critical for the virulence of B. abortus in a mouse model of infection. Although these sRNAs are highly conserved among Alphaproteobacteria, the present report displays how gene regulation mediated by the AbcR sRNAs has diverged to meet the intricate regulatory requirements of each particular organism and its unique biological niche. IMPORTANCE Small RNAs (sRNAs) are important components of bacterial regulation, allowing organisms to quickly adapt to changes in their environments. The AbcR sRNAs are highly conserved throughout the Alphaproteobacteria and negatively regulate myriad transcripts, many encoding ABC-type transport systems. In Brucella abortus, AbcR1 and AbcR2 are functionally redundant, as only a double abcR1 abcR2 (abcR1/2) deletion results in attenuation in
A simple and robust vector-based shRNA expression system used for RNA interference.

Science.gov (United States)

Wang, Xue-jun; Li, Ying; Huang, Hai; Zhang, Xiu-juan; Xie, Pei-wen; Hu, Wei; Li, Dan-dan; Wang, Sheng-qi

2013-01-01

RNA interference (RNAi) mediated by small interfering RNAs (siRNAs) or short hairpin RNAs (shRNAs) has become a powerful genetic tool for conducting functional studies. Vector-based shRNA-expression strategies capable of inducing RNAi in viable cells have been developed previously; however, these vector systems have disadvantages, being either error-prone or cost-prohibitive. In this report we describe the development of a simple, robust shRNA expression system utilizing 1 long oligonucleotide or 2 short oligonucleotides for half the cost of conventional shRNA construction methods and with a >95% cloning success rate. The shRNA loop sequence and stem structure were also compared and carefully selected for better RNAi efficiency. Furthermore, an easier strategy was developed based on isocaudomers, which permit rapid combination of the most efficient promoter-shRNA cassettes. Finally, using this method, the conserved target sites for hepatitis B virus (HBV) knockdown were systematically screened, and HBV antigen expression was shown to be successfully suppressed in the presence of connected multiple shRNAs both in vitro and in vivo. This novel design provides an inexpensive and effective way to clone and express single or multiple shRNAs from the same vector with the capacity for potent and effective silencing of target genes.
Ups and Downs of Poised RNA Polymerase II in B-Cells.

Directory of Open Access Journals (Sweden)

Phuong Dao

2016-04-01

Recent genome-wide analyses have uncovered a high accumulation of RNA polymerase II (Pol II) at the 5' end of genes. This elevated Pol II presence at promoters, referred to here as Pol II poising, is mainly (but not exclusively) attributed to temporal pausing of transcription during early elongation which, in turn, has been proposed to be a regulatory step for processes that need to be activated "on demand". Yet, the full genome-wide regulatory role of Pol II poising is yet to be delineated. To elucidate the role of Pol II poising in B cell activation, we compared Pol II profiles in resting and activated B cells. We found that while Pol II poised genes generally overlap functionally among different B cell states and correspond to the functional groups previously identified for other cell types, non-poised genes are B cell state specific. Focusing on the changes in transcription activity upon B cell activation, we found that the majority of such changes were from poised to non-poised state. The genes showing this type of transition were functionally enriched in translation, RNA processing and mRNA metabolic process. Interestingly, we also observed a transition from non-poised to poised state. Within this set of genes we identified several Immediate Early Genes (IEGs), which were highly expressed in resting B cells and shifted from non-poised to poised state after B cell activation. Thus Pol II poising does not only mark genes for rapid expression in the future, but it is also associated with genes that are silenced after a burst of their expression. Finally, we performed comparative analysis of the presence of G4 motifs in the context of poised versus non-poised but active genes. Interestingly, we observed a differential enrichment of these motifs upstream versus downstream of TSS depending on poising status. The enrichment of G4 sequence motifs upstream of TSS of non-poised active genes suggests a potential role of quadruplexes in expression
Probing structural changes of self assembled i-motif DNA

KAUST Repository

Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

2015-01-01

We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change.
Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

Directory of Open Access Journals (Sweden)

Down Thomas A

2010-09-01

Background: DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS"), but the large majority of these are proven by chromatin immunoprecipitation and high-throughput sequencing (ChIP-seq) not to be biological transcription factor binding sites ("empirical TFBS"). We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results: Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs, as well as the subset of those that reside outside CpG islands, have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands, whereas only 7% of all conserved consensus motifs were in CpG islands. Finally, we further identified a minority subset of TFs whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs, implying that these TFs may be responsible for establishing or maintaining an un-methylated DNA state, or that their binding is not regulated by DNA methylation. Conclusions: Our analysis supports the hypothesis that, at least for a subset of TFs, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.
Demonstration of helicase activity in the nonstructural protein, NSs, of the negative-sense RNA virus, groundnut bud necrosis virus.

Science.gov (United States)

Bhushan, Lokesh; Abraham, Ambily; Choudhury, Nirupam Roy; Rana, Vipin Singh; Mukherjee, Sunil Kumar; Savithri, Handanahal Subbarao

2015-04-01

The nonstructural protein NSs, encoded by the S RNA of groundnut bud necrosis virus (GBNV) (genus Tospovirus, family Bunyaviridae) has earlier been shown to possess nucleic-acid-stimulated NTPase and 5' α phosphatase activity. ATP hydrolysis is an essential function of a true helicase. Therefore, NSs was tested for DNA helicase activity. The results demonstrated that GBNV NSs possesses bidirectional DNA helicase activity. An alanine mutation in the Walker A motif (K189A rNSs) decreased DNA helicase activity substantially, whereas a mutation in the Walker B motif resulted in a marginal decrease in this activity. The parallel loss of the helicase and ATPase activity in the K189A mutant confirms that NSs acts as a non-canonical DNA helicase. Furthermore, both the wild-type and K189A NSs could function as RNA silencing suppressors, demonstrating that the suppressor activity of NSs is independent of its helicase or ATPase activity. This is the first report of a true helicase from a negative-sense RNA virus.

Structure of Drosophila Oskar reveals a novel RNA binding protein

Science.gov (United States)

Yang, Na; Yu, Zhenyu; Hu, Menglong; Wang, Mingzhu; Lehmann, Ruth; Xu, Rui-Ming

2015-01-01

Oskar (Osk) protein plays critical roles during Drosophila germ cell development, yet its functions in germ-line formation and body patterning remain poorly understood. This situation contrasts sharply with the vast knowledge about the function and mechanism of osk mRNA localization. Osk is predicted to have an N-terminal LOTUS domain (Osk-N), which has been suggested to bind RNA, and a C-terminal hydrolase-like domain (Osk-C) of unknown function. Here, we report the crystal structures of Osk-N and Osk-C. Osk-N shows a homodimer of winged-helix–fold modules, but without detectable RNA-binding activity. Osk-C has a lipase-fold structure but lacks critical catalytic residues at the putative active site. Surprisingly, we found that Osk-C binds the 3′UTRs of osk and nanos mRNA in vitro. Mutational studies identified a region of Osk-C important for mRNA binding. These results suggest possible functions of Osk in the regulation of stability, regulation of translation, and localization of relevant mRNAs through direct interaction with their 3′UTRs, and provide structural insights into a novel protein–RNA interaction motif involving a hydrolase-related domain. PMID:26324911
BlockLogo: Visualization of peptide and sequence motif conservation

DEFF Research Database (Denmark)

Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

2013-01-01

BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...
Packaging of Mason-Pfizer monkey virus (MPMV) genomic RNA depends upon conserved long-range interactions (LRIs) between U5 and gag sequences.

Science.gov (United States)

Kalloush, Rawan M; Vivet-Boudou, Valérie; Ali, Lizna M; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A

2016-06-01

MPMV has great potential for development as a vector for gene therapy. In this respect, precisely defining the sequences and structural motifs that are important for dimerization and packaging of its genomic RNA (gRNA) is of utmost importance. A distinguishing feature of the MPMV gRNA packaging signal is two phylogenetically conserved long-range interactions (LRIs) between U5 and gag complementary sequences, LRI-I and LRI-II. To test their biological significance in the MPMV life cycle, we introduced mutations into these structural motifs and tested their effects on MPMV gRNA packaging and propagation. Furthermore, we probed the structure of key mutants using SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension). Disrupting base-pairing of the LRIs affected gRNA packaging and propagation, demonstrating their significance to the MPMV life cycle. A double mutant restoring a heterologous LRI-I was fully functional, whereas a similar LRI-II mutant failed to restore gRNA packaging and propagation. These results demonstrate that while LRI-I acts at the structural level, maintaining base-pairing is not sufficient for LRI-II function. In addition, in vitro RNA dimerization assays indicated that the loss of RNA packaging in LRI mutants could not be attributed to defects in dimerization. Our findings suggest that U5-gag LRIs play an important architectural role in maintaining the structure of the 5' region of the MPMV gRNA, expanding the crucial role of LRIs to the nonlentiviral group of retroviruses. © 2016 Kalloush et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

Directory of Open Access Journals (Sweden)

Ahmad A. Malik

2017-05-01

Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine). The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid), suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score) are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity).
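As an illustration of the GXGGRRKK linker described in this abstract, the following minimal sketch scans a protein sequence for that pattern, treating X as any single residue. The function name and the toy sequence are assumptions made for this example and are not taken from the paper; the authors' own motif definitions for the K-, Y- and S-segments are not given in the abstract and are not reproduced here.

```python
import re

# "X can be any amino acid", so X maps to "." in the regular expression.
LINKER = re.compile(r"G.GGRRKK")

def find_linker(protein: str) -> list[tuple[int, str]]:
    """Return (0-based start, matched text) for each GXGGRRKK occurrence."""
    return [(m.start(), m.group()) for m in LINKER.finditer(protein.upper())]

# Made-up sequence for illustration only; it is not a real dehydrin.
PREFIX = "MSSSSSSDE"
toy = PREFIX + "GAGGRRKK" + "GLKEKIKEKLPG"

for start, text in find_linker(toy):
    print(start, text)
```

A plain regular expression is enough here because the motif has a fixed length and a single wildcard position; degenerate motifs with variable spacing would need a richer pattern.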
Discovery of novel interacting partners of PSMD9, a proteasomal chaperone: Role of an Atypical and versatile PDZ-domain motif interaction and identification of putative functional modules

Directory of Open Access Journals (Sweden)

Nikhil Sangith

2014-01-01

PSMD9 (Proteasome Macropain non-ATPase subunit 9), a proteasomal assembly chaperone, harbors an uncharacterized PDZ-like domain. Here we report the identification of five novel interacting partners of PSMD9 and provide the first glimpse at the structure of the PDZ-domain, including the molecular details of the interaction. We based our strategy on two propositions: (a) proteins with conserved C-termini may share common functions and (b) PDZ domains interact with C-terminal residues of proteins. By screening C-terminal peptides and then testing interactions using full-length recombinant proteins, we discovered hnRNPA1 (an RNA binding protein), S14 (a ribosomal protein), CSH1 (a growth hormone), E12 (a transcription factor) and IL6 receptor as novel PSMD9-interacting partners. Through multiple techniques and structural insights, we clearly demonstrate for the first time that the human PDZ domain interacts with the predicted Short Linear Sequence Motif (SLIM) at the C-termini of the client proteins. These interactions are also recapitulated in mammalian cells. Together, these results are suggestive of the role of PSMD9 in transcriptional regulation, mRNA processing and editing, hormone and receptor activity and protein translation. Our proof-of-principle experiments endorse a novel and quick method for the identification of putative interacting partners of similar PDZ-domain proteins from the proteome and for discovering novel functions.
UPF201 Archaeal Specific Family Members Reveals Structural Similarity to RNA-Binding Proteins but Low Likelihood for RNA-Binding Function

Energy Technology Data Exchange (ETDEWEB)

Rao, K.N.; Swaminathan, S.; Burley, S. K.

2008-12-11

We have determined X-ray crystal structures of four members of an archaeal specific family of proteins of unknown function (UPF0201; Pfam classification: DUF54) to advance our understanding of the genetic repertoire of archaea. Despite low pairwise amino acid sequence identities (10-40%) and the absence of conserved sequence motifs, the three-dimensional structures of these proteins are remarkably similar to one another. Their common polypeptide chain fold, encompassing a five-stranded antiparallel β-sheet and five α-helices, proved to be quite unexpectedly similar to that of the RRM-type RNA-binding domain of the ribosomal L5 protein, which is responsible for binding the 5S rRNA. Structure-based sequence alignments enabled construction of a phylogenetic tree relating UPF0201 family members to L5 ribosomal proteins and other structurally similar RNA binding proteins, thereby expanding our understanding of the evolutionary purview of the RRM superfamily. Analyses of the surfaces of these newly determined UPF0201 structures suggest that they probably do not function as RNA binding proteins, and that this domain specific family of proteins has acquired a novel function in archaebacteria, which awaits experimental elucidation.
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

Directory of Open Access Journals (Sweden)

Christian J. Michel

2017-12-01

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together
Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

Science.gov (United States)

Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

2017-12-03

A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first
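The frame comparison described in this abstract can be pictured with a small counting sketch: for a given set of trinucleotides, count how often its members occur in the reading frame (frame 0) and in the two shifted frames. The abstract does not list the 20 trinucleotides of X, so the three-member code below is a hypothetical stand-in, and the function name and toy sequence are likewise assumptions made for illustration. It is not the authors' statistical procedure.

```python
from collections import Counter

def frame_counts(seq: str, code: set[str]) -> Counter:
    """Count trinucleotides belonging to `code` in frames 0, 1 and 2 of `seq`."""
    seq = seq.upper()
    counts = Counter({0: 0, 1: 0, 2: 0})
    for frame in range(3):
        # Step through complete, non-overlapping trinucleotides of this frame.
        for i in range(frame, len(seq) - 2, 3):
            if seq[i:i + 3] in code:
                counts[frame] += 1
    return counts

# Hypothetical stand-in for the circular code X (the real set has 20 members).
toy_code = {"GAA", "GAC", "CTG"}
# Made-up sequence built from toy_code members in frame 0.
toy_gene = "GAAGACCTGGAA"

counts = frame_counts(toy_gene, toy_code)
print({frame: counts[frame] for frame in range(3)})
```

Passing the code set as a parameter keeps the same function usable for an R-style random code, which is how a comparison like the one in the abstract would be set up.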
Structural basis of genomic RNA (gRNA) dimerization and packaging determinants of mouse mammary tumor virus (MMTV).

Science.gov (United States)

Aktar, Suriya J; Vivet-Boudou, Valérie; Ali, Lizna M; Jabeen, Ayesha; Kalloush, Rawan M; Richer, Delphine; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A

2014-11-14

two MMTV RNAs, leading to gRNA dimerization and its subsequent encapsidation into the assembling virus particles. The results presented here enhance our understanding of the MMTV gRNA dimerization and packaging processes and the role of structural motifs with respect to RNA-RNA and possibly RNA-protein interactions that might be taking place during the MMTV life cycle.
Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

Directory of Open Access Journals (Sweden)

Nils E. R. Zimmermann

2017-11-01

Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants, as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.
Transcription and translation of the rpsJ, rplN and rRNA operons of the tubercle bacillus.

Science.gov (United States)

Cortes, Teresa; Cox, Robert Ashley

2015-04-01

Several species of the genus Mycobacterium are human pathogens, notably the tubercle bacillus (Mycobacterium tuberculosis). The rate of proliferation of a bacterium is reflected in the rate of ribosome synthesis. This report describes a quantitative analysis of the early stages of the synthesis of ribosomes of M. tuberculosis. Specifically, it examines the roles of three large operons: the rrn operon (1.7 microns) encoding rrs (16S rRNA), rrl (23S rRNA) and rrf (5S rRNA); the rpsJ operon (1.93 microns), which encodes 11 ribosomal proteins; and the rplN operon (1.45 microns), which encodes 10 ribosomal proteins. A mathematical framework based on properties of population-average cells was developed to identify the number of transcripts of the rpsJ and rplN operons needed to maintain exponential growth. The values obtained were supported by RNaseq data. The motif 5'-gcagac-3' was found close to the 5' end of transcripts of mycobacterial rplN operons, suggesting it may form part of the RpsH feedback binding site because the same motif is present in the ribosome within the region of rrs that forms the binding site for RpsH. Medical Research Council.
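To make the motif observation in this abstract concrete, the sketch below reports where 5'-gcagac-3' occurs near the start of a transcript. The abstract does not quantify "close to the 5' end", so the `window` parameter and its default of 50 nt are assumptions, as are the function name and the made-up toy transcript.

```python
def motif_offsets(transcript: str, motif: str = "gcagac", window: int = 50) -> list[int]:
    """Return 0-based offsets of `motif` that start within the first `window` nt."""
    t, m = transcript.lower(), motif.lower()
    hits = []
    start = t.find(m)
    # Walk through successive occurrences until one starts outside the window.
    while start != -1 and start < window:
        hits.append(start)
        start = t.find(m, start + 1)
    return hits

# Made-up transcript for illustration only; not a real rplN sequence.
toy = "auggcagacuuagcagacgg"
print(motif_offsets(toy))             # occurrences within the default window
print(motif_offsets(toy, window=10))  # a tighter, equally arbitrary window
```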
FUCHS—towards full circular RNA characterization using RNAseq

Directory of Open Access Journals (Sweden)

Franziska Metge

2017-02-01

Circular RNAs (circRNAs) belong to a recently re-discovered species of RNA that emerge during RNA maturation through a process called back-splicing. A downstream 5′ splice site is linked to an upstream 3′ splice site to form a circular transcript instead of a canonical linear transcript. Recent advances in next-generation sequencing (NGS) have brought circRNAs back into the focus of many scientists. Since then, several studies have reported that circRNAs are differentially expressed across tissue types and developmental stages, implying that they are actively regulated and not merely a by-product of splicing. Though functional studies have shown that some circRNAs could act as miRNA-sponges, the function of most circRNAs remains unknown. To expand our understanding of possible roles of circular RNAs, we propose FUCHS (FUll CHaracterization of circular RNA using RNA-Sequencing), a new pipeline that fully characterizes candidate circRNA structure from RNAseq data. Currently, most computational prediction pipelines use back-spliced reads to identify circular RNAs. FUCHS extends this concept by considering all RNA-seq information from long reads (typically >150 bp) to learn more about the exon coverage, the number of double break point fragments, the different circular isoforms arising from one host-gene, and the alternatively spliced exons within the same circRNA boundaries. This new knowledge will enable the user to carry out differential motif enrichment and miRNA seed analysis to determine potential regulators during circRNA biogenesis. FUCHS is an easy-to-use Python based pipeline that contributes a new aspect to circRNA research.
[Cloning of cDNA for RNA polymerase subunit from the fission yeast Schizosaccharomyces pombe by heterospecific complementation in Saccharomyces cerevisiae].

Science.gov (United States)

Shpakovskiĭ, G V; Lebedenko, E N; Thuriaux, P

1997-02-01

The rpb10 cDNA of the fission yeast Schizosaccharomyces pombe, encoding one of the five small subunits common to all three nuclear DNA-dependent RNA polymerases, was isolated from an expression cDNA library by two independent approaches: PCR-based screening and direct suppression by means of heterospecific complementation of a temperature-sensitive mutant defective in the corresponding gene of Saccharomyces cerevisiae. The cloned Sz. pombe cDNA encodes a protein, Rpb10, of 71 amino acids with a molecular mass of 8,275 Da, sharing 51 amino acids (71% identity) with the subunit ABC10 beta of RNA polymerases I-III from S. cerevisiae. All eukaryotic members of this protein family have the same general organization, featuring two highly conserved motifs (RCFT/SCGK and RYCCRRM) around an atypical zinc finger and an additional invariant HVDLIEK motif toward the C-terminal end. The last motif is characteristic only of homologs from eukaryotes. In keeping with this remarkable structural conservation, the Sz. pombe cDNA also fully complemented an S. cerevisiae deletion mutant lacking subunit ABC10 beta (null allele rpb10-delta 1::HIS3).
The Mapping of Predicted Triplex DNA:RNA in the Drosophila Genome Reveals a Prominent Location in Development- and Morphogenesis-Related Genes

Directory of Open Access Journals (Sweden)

Claude Pasquier

2017-07-01

Double-stranded DNA is able to form triple-helical structures by accommodating a third nucleotide strand. A nucleic acid triplex occurs according to Hoogsteen rules that predict the stability and affinity of the third strand bound to the Watson–Crick duplex. The “triplex-forming oligonucleotide” (TFO) can be a short sequence of RNA that binds to the major groove of the targeted duplex only when this duplex presents a sequence of purine or pyrimidine bases in one of the DNA strands. Many nuclear proteins are known to bind triplex DNA or DNA:RNA, but their biological functions are unexplored. We identified sequences that are capable of engaging as the “triplex-forming oligonucleotide” in both the pre-lncRNA and pre-mRNA collections of Drosophila melanogaster. These motifs were matched against the Drosophila genome in order to identify putative sequences of triplex formation in intergenic regions, promoters, and introns/exons. Most of the identified TFOs appear to be located in the intronic region of the analyzed genes. Computational prediction of the most targeted genes by TFOs originating from pre-lncRNAs and pre-mRNAs revealed that they are restrictively associated with development- and morphogenesis-related gene networks. The refined analysis by Gene Ontology enrichment demonstrates that some individual TFOs present genome-wide scale matches that are located in numerous genes and regulatory sequences. The triplex DNA:RNA computational mapping at the genome-wide scale suggests broad interference in the regulatory process of the gene networks orchestrated by TFO RNAs acting in association simultaneously at multiple sites.
A PDZ-Like Motif in the Biliary Transporter ABCB4 Interacts with the Scaffold Protein EBP50 and Regulates ABCB4 Cell Surface Expression.

Directory of Open Access Journals (Sweden)

Quitterie Venot

ABCB4/MDR3, a member of the ABC superfamily, is an ATP-dependent phosphatidylcholine translocator expressed at the canalicular membrane of hepatocytes. Defects in the ABCB4 gene are associated with rare biliary diseases. It is essential to understand the mechanisms of its canalicular membrane expression, particularly for the development of new therapies. The stability of several ABC transporters is regulated through their binding to PDZ (PSD95/DglA/ZO-1) domain-containing proteins. The ABCB4 protein ends with the sequence glutamine-asparagine-leucine (QNL), which shows some similarity to PDZ-binding motifs. The aim of our study was to assess the potential role of the QNL motif in the surface expression of ABCB4 and to determine if PDZ domain-containing proteins are involved. We found that truncation of the QNL motif decreased the stability of ABCB4 in HepG2-transfected cells. The deleted mutant ABCB4-ΔQNL also displayed accelerated endocytosis. EBP50, a PDZ protein highly expressed in the liver, strongly colocalized and coimmunoprecipitated with ABCB4, and this interaction required the QNL motif. Down-regulation of EBP50 by siRNA or by expression of an EBP50 dominant-negative mutant caused a significant decrease in the level of ABCB4 protein expression, and in the amount of ABCB4 localized at the canalicular membrane. Interaction of ABCB4 with EBP50 through its PDZ-like motif plays a critical role in the regulation of ABCB4 expression and stability at the canalicular plasma membrane.
One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

Science.gov (United States)

Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

2014-12-01

G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.
Modeling of the Ebola Virus Delta Peptide Reveals a Potential Lytic Sequence Motif

Directory of Open Access Journals (Sweden)

William R. Gallaher

2015-01-01

Filoviruses, such as Ebola and Marburg viruses, cause severe outbreaks of human infection, including the extensive epidemic of Ebola virus disease (EVD) in West Africa in 2014. In the course of examining mutations in the glycoprotein gene associated with 2014 Ebola virus (EBOV) sequences, a differential level of conservation was noted between the soluble form of glycoprotein (sGP) and the full length glycoprotein (GP), which are both encoded by the GP gene via RNA editing. In the region of the proteins encoded after the RNA editing site, sGP was more conserved than the overlapping region of GP when compared to a distant outlier species, Tai Forest ebolavirus. Half of the amino acids comprising the “delta peptide”, a 40 amino acid carboxy-terminal fragment of sGP, were identical between otherwise widely divergent species. A lysine-rich amphipathic peptide motif was noted at the carboxyl terminus of delta peptide with high structural relatedness to the cytolytic peptide of the non-structural protein 4 (NSP4) of rotavirus. EBOV delta peptide is a candidate viroporin, a cationic pore-forming peptide, and may contribute to EBOV pathogenesis.
MicroRNA-15b regulates reversion-inducing cysteine-rich protein with Kazal motifs (RECK) expression in human uterine leiomyoma.

Science.gov (United States)

Guan, Yichun; Guo, Lankai; Zukerberg, Lawrence; Rueda, Bo R; Styer, Aaron K

2016-08-17

Human uterine leiomyoma (fibroids; LYO) are the most common benign neoplasms in reproductive-aged women. Dysregulated extracellular matrix and irregular LYO reversion-inducing cysteine-rich protein with Kazal motifs (RECK) expression are thought to be mediated by aberrant microRNA (miR) expression. The relationship of miR-15b and RECK expression in LYO has not been studied. The expression levels of miR-15b and RECK were determined by quantitative RT-PCR, Western blot, and immunohistochemistry in cultures derived from commercial primary leiomyoma (cpLYO) and myometrial (cpMYO) cell lines and leiomyoma (pLYO) and myometrium (pMYO) tissue from surgical samples respectively. The relationship between miR-15b and RECK expression in cpLYO and pLYO (compared to their respective myometrial controls) was evaluated following transfection of cell cultures with either miR-15b mimic or inhibitor. Elevated levels of miR-15b were observed in cpLYO (2.82-fold; p = 0.04) and pLYO cell (1.30-fold; p = 0.0001) cultures respectively compared to corresponding MYO cell controls. Following transfection with miR-15b mimic, cpLYO cells (0.62-fold; p < 0.0001) and pLYO cells (0.68-fold; p < 0.0001) demonstrated reduced RECK protein expression. Following transfection with miR-15b inhibitor, cpLYO cells (1.20-fold; p < 0.0001) and pLYO cells (1.31-fold; p = 0.0007) demonstrated elevated RECK protein expression. RECK protein expression was reduced in pLYO tissues (0.73-fold; p < 0.0001) and pLYO (0.47-fold; p = 0.047) cells when compared to the corresponding MYO tissue controls. Our findings suggest that miR-15b negatively regulates RECK expression in LYO, and increased miR-15b and decreased RECK expression may contribute to the pathobiology of LYO. The functional significance of miR-15b and RECK expression warrants further investigation as potential therapeutic targets for the treatment of human LYO.
Intracellular production of hydrogels and synthetic RNA granules by multivalent molecular interactions

Science.gov (United States)

Nakamura, Hideki; Lee, Albert A.; Afshar, Ali Sobhi; Watanabe, Shigeki; Rho, Elmer; Razavi, Shiva; Suarez, Allister; Lin, Yu-Chun; Tanigawa, Makoto; Huang, Brian; Derose, Robert; Bobb, Diana; Hong, William; Gabelli, Sandra B.; Goutsias, John; Inoue, Takanari

2018-01-01

Some protein components of intracellular non-membrane-bound entities, such as RNA granules, are known to form hydrogels in vitro. The physico-chemical properties and functional role of these intracellular hydrogels are difficult to study, primarily due to technical challenges in probing these materials in situ. Here, we present iPOLYMER, a strategy for a rapid induction of protein-based hydrogels inside living cells that explores the chemically inducible dimerization paradigm. Biochemical and biophysical characterizations aided by computational modelling show that the polymer network formed in the cytosol resembles a physiological hydrogel-like entity that acts as a size-dependent molecular sieve. We functionalize these polymers with RNA-binding motifs that sequester polyadenine-containing nucleotides to synthetically mimic RNA granules. These results show that iPOLYMER can be used to synthetically reconstitute the nucleation of biologically functional entities, including RNA granules in intact cells.
Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

Directory of Open Access Journals (Sweden)

Launey Thomas

2011-06-01

Background: The interactions between PDZ (PSD-95, Dlg, ZO-1) domains and PDZ-binding motifs play central roles in signal transduction within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C-) terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins at the genome level. Results: Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V) or type-II (x-x-V-x-I/V) PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode). We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA) bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with the PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions: Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.
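
The two consensus patterns quoted in this abstract can be read as regular expressions anchored at the C-terminus. The following is a minimal sketch under that reading, not the authors' pipeline: the function name classify_c_terminus is hypothetical, and treating 'x' as "any single letter" is an assumption.

```python
import re

# Patterns transcribed from the abstract. Reading 'x' as any residue
# (any capital letter) is an assumption made for this sketch.
TYPE_I = re.compile(r"[A-Z]{2}[ST][A-Z][ILV]$")   # x-x-S/T-x-I/L/V
TYPE_II = re.compile(r"[A-Z]{2}V[A-Z][IV]$")      # x-x-V-x-I/V


def classify_c_terminus(protein: str) -> str:
    """Label the last five residues as 'type-I', 'type-II' or 'none'.

    Hypothetical helper for illustration only.
    """
    seq = protein.strip().upper()
    if TYPE_I.search(seq):
        return "type-I"
    if TYPE_II.search(seq):
        return "type-II"
    return "none"
```

Because the third position is S/T in one pattern and V in the other, the two classes cannot overlap, so the order of the checks does not matter.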

Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs

LENUS (Irish Health Repository)

Casey, Fergal

2011-08-22

Background: Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results: We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion: We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.
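
The four triplet classes named in this abstract (NNN, NNP, NPP, PPP) can be counted by enumerating closed triangles in a signed network. The sketch below illustrates only that counting step, assuming each interaction has already been labelled 'N' or 'P'; the function name classify_triplets and the input layout are hypothetical, and the brute-force enumeration is not the authors' method.

```python
from itertools import combinations


def classify_triplets(edges):
    """Count triangles by sign class.

    edges: dict mapping frozenset({gene_a, gene_b}) -> 'N' or 'P'
    (a hypothetical input layout chosen for this sketch).
    """
    nodes = sorted({gene for pair in edges for gene in pair})
    counts = {"NNN": 0, "NNP": 0, "NPP": 0, "PPP": 0}
    for a, b, c in combinations(nodes, 3):
        pairs = [frozenset((a, b)), frozenset((a, c)), frozenset((b, c))]
        # Only fully connected triplets count as motifs.
        if all(p in edges for p in pairs):
            label = "".join(sorted(edges[p] for p in pairs))
            counts[label] += 1
    return counts
```

Enumerating every node triple is cubic in the number of genes, which is fine for a small illustration but would need an adjacency-based triangle search for a genome-scale map.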
Fingerprint motifs of phytases | Fan | African Journal of Biotechnology

African Journals Online (AJOL)

Among the total of potential 173 phytases gained in 11 plant genomes through MAST, PAPhys are the major phytases, and HAPhys are the minor, and other phytase groups are not found in planta. Keywords: Phytase, fingerprint motif, multiple EM for motif elicitation (MEME), MAST African Journal of Biotechnology Vol.
Patterns of oligonucleotide sequences in viral and host cell RNA identify mediators of the host innate immune system.

Directory of Open Access Journals (Sweden)

Benjamin D Greenbaum

The innate immune response provides a first line of defense against pathogens by targeting generic differential features that are present in foreign organisms but not in the host. These innate responses generate selection forces acting both in pathogens and hosts that further determine their co-evolution. Here we analyze the nucleic acid sequence fingerprints of these selection forces acting in parallel on both host innate immune genes and ssRNA viral genomes. We do this by identifying dinucleotide biases in the coding regions of innate immune response genes in plasmacytoid dendritic cells, and then use this signal to identify other significant host innate immune genes. The persistence of these biases in the orthologous groups of genes in humans and chickens is also examined. We then compare the significant motifs in highly expressed genes of the innate immune system to those in ssRNA viruses and study the evolution of these motifs in the H1N1 influenza genome. We argue that the significant under-represented motif pattern of CpG in an AU context, which is found in both the ssRNA viruses and innate genes and has decreased throughout the history of H1N1 influenza replication in humans, is immunostimulatory and has been selected against during the co-evolution of viruses and host innate immune genes. This shows how differences in host immune biology can drive the evolution of viruses that jump into species with different immune priorities than the original host.
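
One common way to quantify a dinucleotide bias such as CpG under-representation is the observed/expected odds ratio, f(XY) / (f(X) f(Y)). The sketch below computes that ratio as an illustration; the abstract does not state which statistic the authors used, so this measure and the function name dinucleotide_odds are assumptions.

```python
from collections import Counter


def dinucleotide_odds(seq: str) -> dict:
    """Observed/expected ratio for every dinucleotide present in seq.

    A ratio below 1 means the pair is under-represented relative to its
    mononucleotide frequencies. Illustrative measure only.
    """
    seq = seq.upper().replace("T", "U")  # treat DNA input as RNA
    n = len(seq)
    mono = Counter(seq)
    di = Counter(seq[i:i + 2] for i in range(n - 1))
    ratios = {}
    for pair, count in di.items():
        expected = (mono[pair[0]] / n) * (mono[pair[1]] / n)
        ratios[pair] = (count / (n - 1)) / expected
    return ratios
```

Looking up ratios.get("CG") on a coding sequence then gives a single number to compare between host genes and viral genomes; pairs that never occur are simply absent from the result.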
Passive Repetitive Stretching for a Short Duration within a Week Increases Myogenic Regulatory Factors and Myosin Heavy Chain mRNA in Rats' Skeletal Muscles

Directory of Open Access Journals (Sweden)

Yurie Kamikawa

2013-01-01

Stretching stimulates muscle growth. Stretching for hours or days has an effect on muscle hypertrophy. However, how continuous and repetitive stretching differ in their effects on muscle growth is not well known. To clarify the difference between continuous and repetitive stretching within a short duration, we investigated the expression of muscle-related genes in stretched skeletal muscles. We used 8-week-old male Wistar rats for this study. The animals' medial gastrocnemius muscle was stretched continuously or repetitively for 15 min daily and 4 times/week under anesthesia. After stretching, muscles were removed and total RNA was extracted. Then, reverse transcription quantitative real-time PCR was done to evaluate the mRNA expression of MyoD, myogenin, and embryonic myosin heavy chain (MyHC). Muscles, either stretched continuously or repetitively, increased mRNA expression of MyoD, myogenin, and embryonic MyHC more than unstretched muscles. Notably, repetitive stretching resulted in more substantial effects on embryonic MyHC gene expression than continuous stretching. In conclusion, passive stretching for a short duration within a week is effective in increasing myogenic factor expression, and repetitive stretching had a greater effect on skeletal muscle growth than continuous stretching. These findings are applicable in clinical muscle-strengthening therapy.
Synergy between NMR measurements and MD simulations of protein/RNA complexes: application to the RRMs, the most common RNA recognition motifs

Czech Academy of Sciences Publication Activity Database

Krepl, Miroslav; Clery, A.; Blatter, M.; Allain, F.H.T.; Šponer, Jiří

2016-01-01

Vol. 44, No. 13 (2016), pp. 6452-6470 ISSN 0305-1048 Institutional support: RVO:68081707 Keywords: molecular-dynamics simulations * particle mesh Ewald * pre-ribosomal RNA Subject RIV: BO - Biophysics Impact factor: 10.162, year: 2016
A simple and robust vector-based shRNA expression system used for RNA interference.

Directory of Open Access Journals (Sweden)

Xue-jun Wang

BACKGROUND: RNA interference (RNAi) mediated by small interfering RNAs (siRNAs) or short hairpin RNAs (shRNAs) has become a powerful genetic tool for conducting functional studies. Previously, vector-based shRNA-expression strategies capable of inducing RNAi in viable cells have been developed; however, these vector systems have some disadvantages, being either error-prone or cost-prohibitive. RESULTS: In this report we describe the development of a simple, robust shRNA expression system utilizing 1 long oligonucleotide or 2 short oligonucleotides for half the cost of conventional shRNA construction methods and with a >95% cloning success rate. The shRNA loop sequence and stem structure were also compared and carefully selected for better RNAi efficiency. Furthermore, an easier strategy was developed based on isocaudomers, which permit rapid combination of the most efficient promoter-shRNA cassettes. Finally, using this method, the conserved target sites for hepatitis B virus (HBV) knockdown were systematically screened and HBV antigen expression was shown to be successfully suppressed in the presence of connected multiple shRNAs both in vitro and in vivo. CONCLUSION: This novel design describes an inexpensive and effective way to clone and express single or multiple shRNAs from the same vector with the capacity for potent and effective silencing of target genes.
Analysis of Protein-RNA and Protein-Peptide Interactions in Equine Infectious Anemia

Energy Technology Data Exchange (ETDEWEB)

Lee, Jae-Hyung [Iowa State Univ., Ames, IA (United States)

2007-01-01

Macromolecular interactions are essential for virtually all cellular functions including signal transduction processes, metabolic processes, regulation of gene expression and immune responses. This dissertation focuses on the characterization of two important macromolecular interactions involved in the relationship between Equine Infectious Anemia Virus (EIAV) and its host cell in the horse: (1) the interaction between the EIAV Rev protein and its binding site, the Rev-responsive element (RRE) and (2) interactions between equine MHC class I molecules and epitope peptides derived from EIAV proteins. EIAV, one of the most divergent members of the lentivirus family, has a single-stranded RNA genome and carries several regulatory and structural proteins within its viral particle. Rev is an essential EIAV-encoded regulatory protein that interacts with the viral RRE, a specific binding site in the viral mRNA. Using a combination of experimental and computational methods, the interactions between EIAV Rev and RRE were characterized in detail. EIAV Rev was shown to have a bipartite RNA binding domain containing two arginine-rich motifs (ARMs). The RRE secondary structure was determined and specific structural motifs that act as cis-regulatory elements for EIAV Rev-RRE interaction were identified. Interestingly, a structural motif located in the high affinity Rev binding site is well conserved in several diverse lentiviral genomes, including HIV-1. Macromolecular interactions involved in the immune response of the horse to EIAV infection were investigated by analyzing complexes between MHC class I proteins and epitope peptides derived from EIAV Rev, Env and Gag proteins. Computational modeling results provided a mechanistic explanation for the experimental finding that a single amino acid change in the peptide binding domain of the equine MHC class I molecule differentially affects the recognition of specific epitopes by EIAV-specific CTL. Together, the findings in this
Anion induced conformational preference of CαNN motif residues in functional proteins.

Science.gov (United States)

Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb

2017-12-01

Among different ligand binding motifs, the anion-binding CαNN motif, consisting of peptide backbone atoms of three consecutive residues, is observed to be important for recognition of free anions, like sulphate or biphosphate, and participates in different key functions. Here we study the interaction of sulphate and biphosphate with the CαNN motif present in different proteins. Instead of the total protein, a peptide fragment has been studied, keeping the CαNN motif flanked by other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in the absence of the anion. The anion gives stability to one of these conformations. However, the anion-induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.
Small finger protein of avian and murine retroviruses has nucleic acid annealing activity and positions the replication primer tRNA onto genomic RNA.

Science.gov (United States)

Prats, A C; Sarih, L; Gabus, C; Litvak, S; Keith, G; Darlix, J L

1988-06-01

Retrovirus virions carry a diploid genome associated with a large number of small viral finger protein molecules which are required for encapsidation. Our present results show that finger protein p12 of Rous sarcoma virus (RSV) and p10 of murine leukaemia virus (MuLV) positions replication primer tRNA on the replication initiation site (PBS) at the 5' end of the RNA genome. An RSV mutant with a Val-Pro insertion in the finger motif of p12 is able to partially encapsidate genomic RNA but is not infectious because mutated p12 is incapable of positioning the replication primer, tRNATrp. Since all known replication competent retroviruses, and the plant virus CaMV, code for finger proteins analogous to RSV p12 or MuLV p10, the initial stage of reverse transcription in avian, mammalian and human retroviruses and in CaMV is probably controlled in an analogous way.
Gene regulatory and signaling networks exhibit distinct topological distributions of motifs

Science.gov (United States)

Ferreira, Gustavo Rodrigues; Nakaya, Helder Imoto; Costa, Luciano da Fontoura

2018-04-01

The biological processes of cellular decision making and differentiation involve a plethora of signaling pathways and gene regulatory circuits. These networks in turn exhibit a multitude of motifs playing crucial parts in regulating network activity. Here we compare the topological placement of motifs in gene regulatory and signaling networks and observe that it suggests different evolutionary strategies in motif distribution for distinct cellular subnetworks.
Dicer-independent processing of short hairpin RNAs

NARCIS (Netherlands)

Liu, Ying Poi; Schopman, Nick C. T.; Berkhout, Ben

2013-01-01

Short hairpin RNAs (shRNAs) are widely used to induce RNA interference (RNAi). We tested a variety of shRNAs that differed in stem length and terminal loop size and revealed strikingly different RNAi activities and shRNA-processing patterns. Interestingly, we identified a specific shRNA design that
Binding of transcription termination protein nun to nascent RNA and template DNA.

Science.gov (United States)

Watnick, R S; Gottesman, M E

1999-12-17

The amino-terminal arginine-rich motif of coliphage HK022 Nun binds phage lambda nascent transcript, whereas the carboxyl-terminal domain interacts with RNA polymerase (RNAP) and blocks transcription elongation. RNA binding is inhibited by zinc (Zn2+) and stimulated by Escherichia coli NusA. To study these interactions, the Nun carboxyl terminus was extended by a cysteine residue conjugated to a photochemical cross-linker. The carboxyl terminus contacted NusA and made Zn2+-dependent intramolecular contacts. When Nun was added to a paused transcription elongation complex, it cross-linked to the DNA template. Nun may arrest transcription by anchoring RNAP to DNA.
A proposed vestigial translation initiation motif in VP1 of hepatitis A virus.

Science.gov (United States)

Kang, Jeong-Ah; Funkhouser, Ann W

2002-07-01

The internal ribosome entry site (IRES) of picornaviruses has a 3' polypyrimidine tract (PPT) 16-24 bases upstream of an AUG triplet (PPT/AUG motif). This motif is critical in determining the efficiency of cap-independent translation. HAV has a conserved PPT/AUG motif consisting of a nine base sequence (AGGUUUUUC) 23 bases upstream of the preferred AUG start codon. This HAV-specific PPT/AUG motif is repeated and conserved in VP1 of HAV, but not of other picornaviruses. We proposed that the PPT/AUG motif in the open reading frame initiated translation and/or had an impact on the life cycle of the virus. In vitro translation of mutant bicistronic mRNAs and growth in cell culture of mutant viruses provided no evidence that the VP1 PPT/AUG motif had any impact on either translation or growth. HAV differs from other picornaviruses in its inefficient growth in cell culture. Since the HAV-specific PPT/AUG motif is found in only 1 in 300,000 reported viral sequences outside the hepatovirus genus, this motif may be a vestigial translation initiation element and may have played a role in determining the unusual phenotype of HAV.
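
The PPT/AUG arrangement described in this abstract can be located with a simple sequence scan. The sketch below is an illustration only: the function name find_ppt_aug is hypothetical, the default 16-24 base window is the picornavirus range quoted above, and measuring the spacing from the end of the PPT to the first base of the AUG is an assumption, since the abstract does not say which positions the distance refers to.

```python
def find_ppt_aug(rna, ppt="AGGUUUUUC", min_gap=16, max_gap=24):
    """Return (ppt_start, aug_start) index pairs for PPT/AUG arrangements.

    The gap is counted from the base after the PPT to the A of the AUG,
    which is an assumption of this sketch. Indices are 0-based.
    """
    rna = rna.upper().replace("T", "U")  # accept cDNA-style input
    hits = []
    start = rna.find(ppt)
    while start != -1:
        ppt_end = start + len(ppt)
        for gap in range(min_gap, max_gap + 1):
            aug = ppt_end + gap
            if rna[aug:aug + 3] == "AUG":
                hits.append((start, aug))
        start = rna.find(ppt, start + 1)
    return hits
```

Running the same scan over a VP1 coding sequence and over the 5' untranslated region would show whether the motif recurs at both sites, which is the repetition the abstract reports for HAV.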
Negative in vitro selection identifies the rRNA recognition motif for ErmE methyltransferase

DEFF Research Database (Denmark)

Nielsen, Allan K.; Douthwaite, Stephen; Vester, Birte

1999-01-01

-mer RNA. The RNAs were passed through a series of rounds of methylation with ErmE. After each round, RNAs were selected that had partially or completely lost their ability to be methylated. After several rounds of methylation/selection, 187 subclones were analyzed. Forty-three of the subclones...
mRNA expression of a cadmium-responsive gene is a sensitive biomarker of cadmium exposure in the soil collembolan Folsomia candida

International Nuclear Information System (INIS)

Nakamori, Taizo; Fujimori, Akira; Kinoshita, Keiji; Ban-nai, Tadaaki; Kubota, Yoshihisa; Yoshida, Satoshi

2010-01-01

The gene expression of environmental organisms is useful as a biomarker of environmental pollution. One of its advantages is high sensitivity. We identified the cDNA of a novel cadmium-responsive gene in the soil collembolan Folsomia candida. The deduced protein, designated 'metallothionein-like motif containing protein' (MTC), was cysteine-rich and contained a metallothionein-like motif with similarity to metallothionein, but had a much longer sequence than metallothionein and contained repeated sequences of amino acids. Expression of MTC mRNA was sensitively induced by cadmium exposure at 0.3 mg/kg of dry food, a concentration at which toxic effects are not observed, but expression was not affected by γ-ray exposure (an inducer of oxidative stress). These findings suggest that MTC is involved in cadmium-binding processes rather than in oxidative-stress responses. In conclusion, we suggest that gene expression of MTC may be a candidate biomarker for detecting low levels of cadmium contamination in soil. - The mRNA expression of a gene potentially encoding a metallothionein-like motif containing protein is sensitively induced by cadmium exposure in the soil collembolan Folsomia candida.
mRNA expression of a cadmium-responsive gene is a sensitive biomarker of cadmium exposure in the soil collembolan Folsomia candida

Energy Technology Data Exchange (ETDEWEB)

Nakamori, Taizo, E-mail: taizo@ynu.ac.j [Environmental Radiation Effects Research Group, National Institute of Radiological Sciences, 4-9-1 Anagawa, Inage-ku, Chiba 263-8555 (Japan); Fujimori, Akira [Heavy-Ion Radiobiology Research Group, National Institute of Radiological Sciences, 4-9-1 Anagawa, Inage-ku, Chiba 263-8555 (Japan); Kinoshita, Keiji [Nagoya University Avian Bioscience Research Centre, Graduate School of Bioagricultural Sciences, Furo-cho, Chikusa-ku, Nagoya 464-8601 (Japan); Ban-nai, Tadaaki; Kubota, Yoshihisa; Yoshida, Satoshi [Environmental Radiation Effects Research Group, National Institute of Radiological Sciences, 4-9-1 Anagawa, Inage-ku, Chiba 263-8555 (Japan)

2010-05-15

The gene expression of environmental organisms is useful as a biomarker of environmental pollution. One of its advantages is high sensitivity. We identified the cDNA of a novel cadmium-responsive gene in the soil collembolan Folsomia candida. The deduced protein, designated 'metallothionein-like motif containing protein' (MTC), was cysteine-rich and contained a metallothionein-like motif with similarity to metallothionein, but had a much longer sequence than metallothionein and contained repeated sequences of amino acids. Expression of MTC mRNA was sensitively induced by cadmium exposure at 0.3 mg/kg of dry food, a concentration at which toxic effects are not observed, but expression was not affected by gamma-ray exposure (an inducer of oxidative stress). These findings suggest that MTC is involved in cadmium-binding processes rather than in oxidative-stress responses. In conclusion, we suggest that gene expression of MTC may be a candidate biomarker for detecting low levels of cadmium contamination in soil. - The mRNA expression of a gene potentially encoding a metallothionein-like motif containing protein is sensitively induced by cadmium exposure in the soil collembolan Folsomia candida.
CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures

Directory of Open Access Journals (Sweden)

Hamed Bostan

2012-01-01

Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD) is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.
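As a rough illustration of what a "flanking motif" around a cysteine could look like, the sketch below extracts a fixed-width window around every cysteine in a protein sequence. The function name, the window width `flank` and the padding character are assumptions made for this example; the abstract does not state the window size or extraction procedure used by CMD, and this is not the CMD API.

```python
def cysteine_flanking_motifs(sequence, flank=3, pad="-"):
    """Return (1-based position, motif) pairs for every cysteine.

    Each motif is the cysteine plus `flank` residues on either side.
    Windows that run past a sequence end are padded with `pad`.
    Hypothetical helper for illustration only.
    """
    seq = sequence.upper()
    padded = pad * flank + seq + pad * flank
    motifs = []
    for i, residue in enumerate(seq):
        if residue == "C":
            # In the padded string the cysteine sits at i + flank,
            # so the window starts at i and spans 2 * flank + 1 residues.
            motifs.append((i + 1, padded[i : i + 2 * flank + 1]))
    return motifs


if __name__ == "__main__":
    for position, motif in cysteine_flanking_motifs("MKCAAC", flank=2):
        print(position, motif)
```

Padding keeps every motif the same length, which makes motifs near the termini comparable with internal ones; whether CMD treats terminal cysteines this way is not stated in the abstract.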
Adenovirus delivered short hairpin RNA targeting a conserved site in the 5' non-translated region inhibits all four serotypes of dengue viruses.

Directory of Open Access Journals (Sweden)

Anil Babu Korrapati

BACKGROUND: Dengue is a mosquito-borne viral disease caused by four closely related serotypes of Dengue viruses (DENVs). This disease, whose symptoms range from mild fever to potentially fatal haemorrhagic fever and hypovolemic shock, threatens nearly half the global population. There is neither a preventive vaccine nor an effective antiviral therapy against dengue disease. The difference between severe and mild disease appears to be dependent on the viral load. Early diagnosis may enable timely therapeutic intervention to blunt disease severity by reducing the viral load. Harnessing the therapeutic potential of RNA interference (RNAi) to attenuate DENV replication may offer one approach to dengue therapy. METHODOLOGY/PRINCIPAL FINDINGS: We screened the non-translated regions (NTRs) of the RNA genomes of representative members of the four DENV serotypes for putative siRNA targets mapping to known transcription/translation regulatory elements. We identified a target site in the 5' NTR that maps to the 5' upstream AUG region, a highly conserved cis-acting element essential for viral replication. We used a replication-defective human adenovirus type 5 (AdV5) vector to deliver a short-hairpin RNA (shRNA) targeting this site into cells. We show that this shRNA matures to the cognate siRNA and is able to inhibit effectively antigen secretion, viral RNA replication and infectious virus production by all four DENV serotypes. CONCLUSION/SIGNIFICANCE: The data demonstrate the feasibility of using AdV5-mediated delivery of shRNAs targeting conserved sites in the viral genome to achieve inhibition of all four DENV serotypes. This paves the way towards exploration of RNAi as a possible therapeutic strategy to curtail DENV infection.
Review article: The mountain motif in the plot of Matthew

Directory of Open Access Journals (Sweden)

Gert J. Volschenk

2010-09-01

This article reviewed T.L. Donaldson’s book, Jesus on the mountain: A study in Matthean theology, published in 1985 by JSOT Press, Sheffield, and focused on the mountain motif in the structure and plot of the Gospel of Matthew, in addition to the work of Donaldson on the mountain motif as a literary motif and as theological symbol. The mountain is a primary theological setting for Jesus’ ministry and thus is an important setting, serving as one of the literary devices by which Matthew structured and progressed his narrative. The Zion theological and eschatological significance and Second Temple Judaism serve as the historical and theological background for the mountain motif. The last mountain setting (Mt 28:16–20) is the culmination of the three theological themes in the plot of Matthew, namely Christology, ecclesiology and salvation history.
From benchmarking HITS-CLIP peak detection programs to a new method for identification of miRNA-binding sites from Ago2-CLIP data.

Science.gov (United States)

Bottini, Silvia; Hamouda-Tekaya, Nedra; Tanasa, Bogdan; Zaragosi, Laure-Emmanuelle; Grandjean, Valerie; Repetto, Emanuela; Trabucchi, Michele

2017-05-19

Experimental evidence indicates that about 60% of miRNA-binding activity does not follow the canonical rule about the seed matching between miRNA and target mRNAs, but rather a non-canonical miRNA targeting activity outside the seed or with seed-like motifs. Here, we propose a new unbiased method to identify canonical and non-canonical miRNA-binding sites from peaks identified by Ago2 Cross-Linked ImmunoPrecipitation associated to high-throughput sequencing (CLIP-seq). Since the quality of peaks is of pivotal importance for the final output of the proposed method, we provide a comprehensive benchmarking of four peak detection programs, namely CIMS, PIPE-CLIP, Piranha and Pyicoclip, on four publicly available Ago2-HITS-CLIP datasets and one unpublished in-house Ago2-dataset in stem cells. We measured the sensitivity, the specificity and the position accuracy toward miRNA binding sites identification, and the agreement with TargetScan. Secondly, we developed a new pipeline, called miRBShunter, to identify canonical and non-canonical miRNA-binding sites based on de novo motif identification from Ago2 peaks and prediction of miRNA::RNA heteroduplexes. miRBShunter was tested and experimentally validated on the in-house Ago2-dataset and on an Ago2-PAR-CLIP dataset in human stem cells. Overall, we provide guidelines to choose a suitable peak detection program and a new method for miRNA-target identification. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
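The "canonical rule" of seed matching mentioned in this abstract can be sketched in a few lines. The example assumes the common textbook definition of the seed as miRNA nucleotides 2 to 8 and looks for its exact reverse complement in a target sequence; the function names and the toy target are invented for illustration. This is not the miRBShunter pipeline, which the abstract describes as relying on de novo motif identification and heteroduplex prediction.

```python
# Watson-Crick complement table for RNA.
COMPLEMENT = str.maketrans("ACGU", "UGCA")


def seed_match_site(mirna, start=2, end=8):
    """Return the target-site sequence complementary to the miRNA seed.

    The seed is taken as nucleotides start..end (1-based, inclusive),
    an assumed textbook definition rather than anything from the paper.
    """
    seed = mirna.upper().replace("T", "U")[start - 1 : end]
    return seed.translate(COMPLEMENT)[::-1]  # reverse complement


def find_canonical_sites(mirna, target):
    """Return 0-based start positions of exact seed matches in `target`."""
    site = seed_match_site(mirna)
    target = target.upper().replace("T", "U")
    positions = []
    pos = target.find(site)
    while pos != -1:
        positions.append(pos)
        pos = target.find(site, pos + 1)  # allow overlapping matches
    return positions


if __name__ == "__main__":
    let7a = "UGAGGUAGUAGGUUGUAUAGUU"  # let-7a, used here as an example
    print(seed_match_site(let7a))
    print(find_canonical_sites(let7a, "AAACUACCUCAAA"))  # toy target
```

A scan like this only finds perfect seed matches, which by the abstract's own estimate would miss the majority of binding events; that gap is the motivation for the non-canonical site search the authors propose.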

RDE-4 preferentially binds long dsRNA and its dimerization is necessary for cleavage of dsRNA to siRNA.

Science.gov (United States)

Parker, Greg S; Eckert, Debra M; Bass, Brenda L

2006-05-01

In organisms ranging from Arabidopsis to humans, Dicer requires dsRNA-binding proteins (dsRBPs) to carry out its roles in RNA interference (RNAi) and micro-RNA (miRNA) processing. In Caenorhabditis elegans, the dsRBP RDE-4 acts with Dicer during the initiation of RNAi, when long dsRNA is cleaved to small interfering RNAs (siRNAs). RDE-4 is not required in subsequent steps, and how RDE-4 distinguishes between long dsRNA and short siRNA is unclear. We report the first detailed analysis of RDE-4 binding, using purified recombinant RDE-4 and various truncated proteins. We find that, similar to other dsRBPs, RDE-4 is not sequence-specific. However, consistent with its in vivo roles, RDE-4 binds with higher affinity to long dsRNA. We also observe that RDE-4 is a homodimer in solution, and that the C-terminal domain of the protein is required for dimerization. Using extracts from wild-type and rde-4 mutant C. elegans, we show that the C-terminal dimerization domain is required for the production of siRNA. Our findings suggest a model for RDE-4 function during the initiation of RNAi.
Turning limited experimental information into 3D models of RNA.

Science.gov (United States)

Flores, Samuel Coulbourn; Altman, Russ B

2010-09-01

Our understanding of RNA functions in the cell is evolving rapidly. As for proteins, the detailed three-dimensional (3D) structure of RNA is often key to understanding its function. Although crystallography and nuclear magnetic resonance (NMR) can determine the atomic coordinates of some RNA structures, many 3D structures present technical challenges that make these methods difficult to apply. The great flexibility of RNA, its charged backbone, dearth of specific surface features, and propensity for kinetic traps all conspire with its long folding time, to challenge in silico methods for physics-based folding. On the other hand, base-pairing interactions (either in runs to form helices or isolated tertiary contacts) and motifs are often available from relatively low-cost experiments or informatics analyses. We present RNABuilder, a novel code that uses internal coordinate mechanics to satisfy user-specified base pairing and steric forces under chemical constraints. The code recapitulates the topology and characteristic L-shape of tRNA and obtains an accurate noncrystallographic structure of the Tetrahymena ribozyme P4/P6 domain. The algorithm scales nearly linearly with molecule size, opening the door to the modeling of significantly larger structures.
Adenoviral short hairpin RNA therapy targeting phosphodiesterase 5a relieves cardiac remodeling and dysfunction following myocardial infarction

Science.gov (United States)

Li, Longhu; Haider, Husnain Kh.; Wang, Linlin; Lu, Gang

2012-01-01

We previously showed that treatment with tadalafil, a long-acting phosphodiesterase-5a (PDE5a) inhibitor, effectively prevented adverse left ventricular (LV) remodeling of the infarcted heart. We hypothesized that short-hairpin RNA (shRNA) therapy targeting PDE5a would simulate the effects of pharmacological intervention for treatment of postinfarction LV remodeling and dysfunction. Experimental model of myocardial infarction was developed in female mice by permanent ligation of left coronary artery. Immediately after that, an adenoviral vector encoding for shRNA sequence targeting PDE5a (Ad-shPDE5a) was injected intramyocardially, which specifically inhibited PDE5a in the heart. Four weeks later, Ad-shPDE5a treated mice showed significant mitigation of the left ventricle (LV) dilatation and dysfunction as indicated by smaller LV cavity and more preserved ejection fraction and fractional shortening. Infarction size and fibrosis were significantly reduced in Ad-shPDE5a-treated mice. Additionally, more salvaged cardiomyocytes, significantly reduced collagen contents, and higher blood vessel density were observed in Ad-shPDE5a-treated mice. The cytoprotective effects of Ad-shPDE5a were demonstrated in vitro in Ad-shPDE5a transfected cardiomyocytes cultured under oxygen glucose deprivation. Among downstream mediators of PDE5a signaling, cyclic GMP (cGMP) and cGMP-dependent protein kinase G (PKG) were activated with concomitant reduction in caspase-3 activity. However, no significant change in PKA and cAMP activities were observed in Ad-shPDE5a-treated hearts. Inhibition with shRNA improved cardiac remodeling and dysfunction by reducing infarction size and cardiac fibrosis and increased cGMP and PKG activity. These findings suggest that PDE5 inhibition with Ad-shPDE5a is a novel approach for treatment of myocardial infarction. PMID:22447941
MicroRNA-directed siRNA biogenesis in Caenorhabditis elegans.

Science.gov (United States)

Corrêa, Régis L; Steiner, Florian A; Berezikov, Eugene; Ketting, René F

2010-04-08

RNA interference (RNAi) is a post-transcriptional silencing process, triggered by double-stranded RNA (dsRNA), leading to the destabilization of homologous mRNAs. A distinction has been made between endogenous RNAi-related pathways and the exogenous RNAi pathway, the latter being essential for the experimental use of RNAi. Previous studies have shown that, in Caenorhabditis elegans, a complex containing the enzymes Dicer and the Argonaute RDE-1 process dsRNA. Dicer is responsible for cleaving dsRNA into short interfering RNAs (siRNAs) while RDE-1 acts as the siRNA acceptor. RDE-1 then guides a multi-protein complex to homologous targets to trigger mRNA destabilization. However, endogenous role(s) for RDE-1, if any, have remained unexplored. We here show that RDE-1 functions as a scavenger protein, taking up small RNA molecules from many different sources, including the microRNA (miRNA) pathway. This is in striking contrast to Argonaute proteins functioning directly in the miRNA pathway, ALG-1 and ALG-2: these proteins exclusively bind miRNAs. While playing no significant role in the biogenesis of the main pool of miRNAs, RDE-1 binds endogenous miRNAs and triggers RdRP activity on at least one perfectly matching, endogenous miRNA target. The resulting secondary siRNAs are taken up by a set of Argonaute proteins known to act as siRNA acceptors in exogenous RNAi, resulting in strong mRNA destabilization. Our results show that RDE-1 in an endogenous setting is actively screening the transcriptome using many different small RNAs, including miRNAs, as a guide, with implications for the evolution of transcripts with a potential to be recognized by Dicer.
Polyadenylation of RNA transcribed from mammalian SINEs by RNA polymerase III: Complex requirements for nucleotide sequences.

Science.gov (United States)

Borodulina, Olga R; Golubchikova, Julia S; Ustyantsev, Ilia G; Kramerov, Dmitri A

2016-02-01

It is generally accepted that only transcripts synthesized by RNA polymerase II (e.g., mRNA) were subject to AAUAAA-dependent polyadenylation. However, we previously showed that RNA transcribed by RNA polymerase III (pol III) from mouse B2 SINE could be polyadenylated in an AAUAAA-dependent manner. Many species of mammalian SINEs end with the pol III transcriptional terminator (TTTTT) and contain hexamers AATAAA in their A-rich tail. Such SINEs were united into Class T(+), whereas SINEs lacking the terminator and AATAAA sequences were classified as T(-). Here we studied the structural features of SINE pol III transcripts that are necessary for their polyadenylation. Eight and six SINE families from classes T(+) and T(-), respectively, were analyzed. The replacement of AATAAA with AACAAA in T(+) SINEs abolished the RNA polyadenylation. Interestingly, insertion of the polyadenylation signal (AATAAA) and pol III transcription terminator in T(-) SINEs did not result in polyadenylation. The detailed analysis of three T(+) SINEs (B2, DIP, and VES) revealed areas important for the polyadenylation of their pol III transcripts: the polyadenylation signal and terminator in A-rich tail, β region positioned immediately downstream of the box B of pol III promoter, and τ region located upstream of the tail. In DIP and VES (but not in B2), the τ region is a polypyrimidine motif which is also characteristic of many other T(+) SINEs. Most likely, SINEs of different mammals acquired these structural features independently as a result of parallel evolution. Copyright © 2015 Elsevier B.V. All rights reserved.
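The T(+)/T(-) distinction described in this abstract rests on two sequence features in the tail: the AATAAA hexamer and the TTTTT terminator. A minimal sketch of that rule follows, assuming that searching the last `tail_length` nucleotides is an adequate stand-in for "the A-rich tail"; the function name, the tail length and the "unclassified" label for mixed cases are choices made for this example, not the authors' procedure.

```python
def classify_sine(sequence, tail_length=40):
    """Classify a SINE consensus as T(+), T(-) or unclassified.

    T(+): tail contains both AATAAA and the pol III terminator TTTTT.
    T(-): tail contains neither.
    Anything else is reported as "unclassified" (an assumption here).
    """
    tail = sequence.upper().replace("U", "T")[-tail_length:]
    has_signal = "AATAAA" in tail
    has_terminator = "TTTTT" in tail
    if has_signal and has_terminator:
        return "T(+)"
    if not has_signal and not has_terminator:
        return "T(-)"
    return "unclassified"


if __name__ == "__main__":
    body = "GGCC" * 5  # toy SINE body, not a real sequence
    print(classify_sine(body + "AAAATAAAAAGCTTTTT"))
    print(classify_sine(body + "AAAAAAAAAA"))
    print(classify_sine(body + "AACAAAGCTTTTT"))
```

As the abstract notes, the presence of both elements is not sufficient for polyadenylation: inserting them into T(-) SINEs did not produce polyadenylated transcripts, so a sequence-only label like this one says nothing about the β and τ regions that were also found to matter.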
On the origin of distribution patterns of motifs in biological networks

Directory of Open Access Journals (Sweden)

Lesk Arthur M

2008-08-01

Background: Inventories of small subgraphs in biological networks have identified commonly-recurring patterns, called motifs. The inference that these motifs have been selected for function rests on the idea that their occurrences are significantly more frequent than random. Results: Our analysis of several large biological networks suggests, in contrast, that the frequencies of appearance of common subgraphs are similar in natural and corresponding random networks. Conclusion: Indeed, certain topological features of biological networks give rise naturally to the common appearance of the motifs. We therefore question whether frequencies of occurrences are reasonable evidence that the structures of motifs have been selected for their functional contribution to the operation of networks.
Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

Science.gov (United States)

König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

2013-01-01

G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations does destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141
A common minimal motif for the ligands of HLA-B*27 class I molecules.

Science.gov (United States)

Barriga, Alejandro; Lorente, Elena; Johnstone, Carolina; Mir, Carmen; del Val, Margarita; López, Daniel

2014-01-01

CD8(+) T cells identify and kill infected cells through the specific recognition of short viral antigens bound to human major histocompatibility complex (HLA) class I molecules. The colossal number of polymorphisms in HLA molecules makes it essential to characterize the antigen-presenting properties common to large HLA families or supertypes. In this context, the HLA-B*27 family comprising at least 100 different alleles, some of them widely distributed in the human population, is involved in the cellular immune response against pathogens and also associated to autoimmune spondyloarthritis being thus a relevant target of study. To this end, HLA binding assays performed using nine HLA-B*2705-restricted ligands endogenously processed and presented in virus-infected cells revealed a common minimal peptide motif for efficient binding to the HLA-B*27 family. The motif was independently confirmed using four unrelated peptides. This experimental approach, which could be easily transferred to other HLA class I families and supertypes, has implications for the validation of new bioinformatics tools in the functional clustering of HLA molecules, for the identification of antiviral cytotoxic T lymphocyte responses, and for future vaccine development.
A common minimal motif for the ligands of HLA-B*27 class I molecules.

Directory of Open Access Journals (Sweden)

Alejandro Barriga

CD8(+) T cells identify and kill infected cells through the specific recognition of short viral antigens bound to human major histocompatibility complex (HLA) class I molecules. The colossal number of polymorphisms in HLA molecules makes it essential to characterize the antigen-presenting properties common to large HLA families or supertypes. In this context, the HLA-B*27 family comprising at least 100 different alleles, some of them widely distributed in the human population, is involved in the cellular immune response against pathogens and also associated to autoimmune spondyloarthritis being thus a relevant target of study. To this end, HLA binding assays performed using nine HLA-B*2705-restricted ligands endogenously processed and presented in virus-infected cells revealed a common minimal peptide motif for efficient binding to the HLA-B*27 family. The motif was independently confirmed using four unrelated peptides. This experimental approach, which could be easily transferred to other HLA class I families and supertypes, has implications for the validation of new bioinformatics tools in the functional clustering of HLA molecules, for the identification of antiviral cytotoxic T lymphocyte responses, and for future vaccine development.
Innate immune restriction and antagonism of viral RNA lacking 2'-O methylation

Energy Technology Data Exchange (ETDEWEB)

Hyde, Jennifer L. [Departments of Medicine, Washington University School of Medicine, St Louis., MO 63110 (United States); Diamond, Michael S., E-mail: diamond@borcim.wustl.edu [Departments of Medicine, Washington University School of Medicine, St Louis., MO 63110 (United States); Molecular Microbiology, Washington University School of Medicine, St Louis., MO 63110 (United States); Pathology & Immunology, Washington University School of Medicine, St Louis., MO 63110 (United States); The Center for Human Immunology and Immunotherapy Programs, Washington University School of Medicine, St Louis., MO 63110 (United States)

2015-05-15

N-7 and 2′-O methylation of host cell mRNA occurs in the nucleus and results in the generation of cap structures (cap 0, m7GpppN; cap 1, m7GpppNm) that control gene expression by modulating nuclear export, splicing, turnover, and protein synthesis. Remarkably, RNA cap modification also contributes to mammalian cell host defense as viral RNA lacking 2′-O methylation is sensed and inhibited by IFIT1, an interferon (IFN) stimulated gene (ISG). Accordingly, pathogenic viruses that replicate in the cytoplasm have evolved mechanisms to circumvent IFIT1 restriction and facilitate infection of mammalian cells. These include: (a) generating cap 1 structures on their RNA through cap-snatching or virally-encoded 2′-O methyltransferases, (b) using cap-independent means of translation, or (c) using RNA secondary structural motifs to antagonize IFIT1 binding. This review will discuss new insights as to how specific modifications at the 5′-end of viral RNA modulate host pathogen recognition responses to promote infection and disease.
RAG-3D: a search tool for RNA 3D substructures

Science.gov (United States)

Zahran, Mai; Sevim Bayrak, Cigdem; Elmetwaly, Shereef; Schlick, Tamar

2015-01-01

To address many challenges in RNA structure/function prediction, the characterization of RNA's modular architectural units is required. Using the RNA-As-Graphs (RAG) database, we have previously explored the existence of secondary structure (2D) submotifs within larger RNA structures. Here we present RAG-3D—a dataset of RNA tertiary (3D) structures and substructures plus a web-based search tool—designed to exploit graph representations of RNAs for the goal of searching for similar 3D structural fragments. The objects in RAG-3D consist of 3D structures translated into 3D graphs, cataloged based on the connectivity between their secondary structure elements. Each graph is additionally described in terms of its subgraph building blocks. The RAG-3D search tool then compares a query RNA 3D structure to those in the database to obtain structurally similar structures and substructures. This comparison reveals conserved 3D RNA features and thus may suggest functional connections. Though RNA search programs based on similarity in sequence, 2D, and/or 3D structural elements are available, our graph-based search tool may be advantageous for illuminating similarities that are not obvious; using motifs rather than sequence space also reduces search times considerably. Ultimately, such substructuring could be useful for RNA 3D structure prediction, structure/function inference and inverse folding. PMID:26304547
Fast social-like learning of complex behaviors based on motor motifs

Science.gov (United States)

Calvo Tapia, Carlos; Tyukin, Ivan Y.; Makarov, Valeri A.

2018-05-01

Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n - 1)! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n - 1) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.
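The two quantities quoted in this abstract, (n - 1)! possible motif sequences and at most (n - 1) learning cycles, can be made concrete for the six-motif robot experiment. The snippet below only evaluates those two expressions as stated in the abstract; the function names are invented here and nothing about the learning model itself is reproduced.

```python
import math


def possible_sequences(n):
    """Number of motif sequences quoted in the abstract: (n - 1)!."""
    return math.factorial(n - 1)


def max_learning_cycles(n):
    """Upper bound on learning cycles quoted in the abstract: n - 1."""
    return n - 1


if __name__ == "__main__":
    n = 6  # six motor motifs, as in the robot experiment
    print(possible_sequences(n), max_learning_cycles(n))
```

For n = 6 this gives 120 candidate sequences against a bound of 5 learning cycles, which illustrates the gap between the size of the search space and the claimed learning time.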
Thermal Stability of Modified i-Motif Oligonucleotides with Naphthalimide Intercalating Nucleic Acids

DEFF Research Database (Denmark)

El-Sayed, Ahmed Ali; Pedersen, Erik B.; Khaireldin, Nahid Y.

2016-01-01

In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion of naphthalimide (1H-benzo[de]isoquinoline-1,3(2H)-dione) as the intercalating nucleic acid. The stabilities of i-motif structures with inserted naphthalimide intercalating nucleotides were studied using UV melting temperatures (Tm) and circular dichroism spectra at different pH values and conditions (crowding...
RRM domain of Arabidopsis splicing factor SF1 is important for pre-mRNA splicing of a specific set of genes

KAUST Repository

Lee, Keh Chien

2017-04-11

The RNA recognition motif of Arabidopsis splicing factor SF1 affects the alternative splicing of FLOWERING LOCUS M pre-mRNA and a heat shock transcription factor HsfA2 pre-mRNA. Splicing factor 1 (SF1) plays a crucial role in 3' splice site recognition by binding directly to the intron branch point. Although plant SF1 proteins possess an RNA recognition motif (RRM) domain that is absent in its fungal and metazoan counterparts, the role of the RRM domain in SF1 function has not been characterized. Here, we show that the RRM domain differentially affects the full function of the Arabidopsis thaliana AtSF1 protein under different experimental conditions. For example, the deletion of RRM domain influences AtSF1-mediated control of flowering time, but not the abscisic acid sensitivity response during seed germination. The alternative splicing of FLOWERING LOCUS M (FLM) pre-mRNA is involved in flowering time control. We found that the RRM domain of AtSF1 protein alters the production of alternatively spliced FLM-β transcripts. We also found that the RRM domain affects the alternative splicing of a heat shock transcription factor HsfA2 pre-mRNA, thereby mediating the heat stress response. Taken together, our results suggest the importance of RRM domain for AtSF1-mediated alternative splicing of a subset of genes involved in the regulation of flowering and adaptation to heat stress.
Spliceosomal small nuclear RNAs of Tetrahymena thermophila and some possible snRNA-snRNA base-pairing interactions

DEFF Research Database (Denmark)

Orum, H; Nielsen, Henrik; Engberg, J

1991-01-01

We have identified and characterized the full set of spliceosomal small nuclear RNAs (snRNAs; U1, U2, U4, U5 and U6) from the ciliated protozoan Tetrahymena thermophila. With the exception of U4 snRNA, the sizes of the T. thermophila snRNAs are closely similar to their metazoan homologues. The T. thermophila snRNAs all have unique 5' ends, which start with an adenine residue. In contrast, with the exception of U6, their 3' ends show some size heterogeneity. The primary sequences of the T. thermophila snRNAs contain the sequence motifs shown, or proposed, to be of functional importance in other...
Motif formation and industry specific topologies in the Japanese business firm network

Science.gov (United States)

Maluck, Julian; Donner, Reik V.; Takayasu, Hideki; Takayasu, Misako

2017-05-01

Motifs and roles are basic quantities for the characterization of interactions among 3-node subsets in complex networks. In this work, we investigate how the distribution of 3-node motifs can be influenced by modifying the rules of an evolving network model while keeping the statistics of simpler network characteristics, such as the link density and the degree distribution, invariant. We exemplify this problem for the special case of the Japanese Business Firm Network, where a well-studied and relatively simple yet realistic evolving network model is available, and compare the resulting motif distribution in the real-world and simulated networks. To better approximate the motif distribution of the real-world network in the model, we introduce both subgraph dependent and global additional rules. We find that a specific rule that allows only for the merging process between nodes with similar link directionality patterns reduces the observed excess of densely connected motifs with bidirectional links. Our study improves the mechanistic understanding of motif formation in evolving network models to better describe the characteristic features of real-world networks with a scale-free topology.
Stem/Progenitor Cell Proteoglycans Decorated with 7-D-4, 4-C-3 and 3-B-3(-) Chondroitin Sulphate Motifs Are Morphogenetic Markers Of Tissue Development.

Science.gov (United States)

Hayes, Anthony J; Smith, Susan M; Caterson, Bruce; Melrose, James

2018-06-11

This study reviewed the occurrence of chondroitin sulphate (CS) motifs 4-C-3, 7-D-4 and 3-B-3(-) which are expressed by progenitor cells in tissues undergoing morphogenesis. These motifs have a transient early expression pattern during tissue development and also appear in mature tissues during pathological remodeling and attempted repair processes by activated adult stem cells. The CS motifs are information and recognition modules, which may regulate cellular behavior and delineate stem cell niches in developmental tissues. One of the difficulties in determining the precise role of stem cells in tissue development and repair processes is their short engraftment period and the lack of specific markers, which differentiate the activated stem cell lineages from the resident cells. The CS sulphation motifs 7-D-4, 4-C-3 and 3-B-3 (-) decorate cell surface proteoglycans on activated stem/progenitor cells and appear to identify these cells in transitional areas of tissue development and in tissue repair and may be applicable to determining a more precise role for stem cells in tissue morphogenesis. This article is protected by copyright. All rights reserved. © 2018 AlphaMed Press.
Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

KAUST Repository

Wong, Ka-Chun; Li, Yue; Peng, Chengbin

2015-01-01

Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in humans. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. In particular, we introduce a novel measurement called motif pairing multiplicity, which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken together, we believe the coupling DNA motif pairs identified in this study can shed light on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.
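The definition of motif pairing multiplicity given above can be illustrated with a minimal sketch. It assumes that the coupling results are available as a plain list of motif pairs; the function name and the motif labels are hypothetical and are not taken from the study.

```python
from collections import defaultdict


def motif_pairing_multiplicity(pairs):
    """Map each motif to the number of distinct motifs it is paired with.

    `pairs` is an iterable of (motif_a, motif_b) tuples; the input format
    is an assumption made for illustration only.
    """
    partners = defaultdict(set)
    for a, b in pairs:
        # Pairing is treated as symmetric; repeated pairs count once.
        partners[a].add(b)
        partners[b].add(a)
    return {motif: len(others) for motif, others in partners.items()}


# Hypothetical motif pairs observed on chromatin interactions.
example_pairs = [("M1", "M2"), ("M1", "M3"), ("M2", "M3"),
                 ("M1", "M4"), ("M1", "M2")]
multiplicity = motif_pairing_multiplicity(example_pairs)
print(multiplicity)
```

Using sets means that a pair observed on several interactions contributes only once, which matches the "number of motifs paired with a given motif" wording; whether the original analysis weights repeated observations is not stated in the abstract.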
Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

KAUST Repository

Wong, Ka-Chun

2015-09-27

Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in humans. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. In particular, we introduce a novel measurement called motif pairing multiplicity, which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken together, we believe the coupling DNA motif pairs identified in this study can shed light on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.
RNA sequence determinants of a coupled termination-reinitiation strategy for downstream open reading frame translation in Helminthosporium victoriae virus 190S and other victoriviruses (Family Totiviridae).

Science.gov (United States)

Li, Hua; Havens, Wendy M; Nibert, Max L; Ghabrial, Said A

2011-07-01

The genome-length, dicistronic mRNA of the double-stranded RNA fungal virus Helminthosporium victoriae virus 190S (genus Victorivirus, family Totiviridae) contains two long open reading frames (ORFs) that overlap in the tetranucleotide AUGA. Translation of the downstream ORF, which encodes the RNA-dependent RNA polymerase (RdRp), has been proposed to depend on ribosomal reinitiation following termination of the upstream ORF, which encodes the capsid protein. In the current study, we examined the RNA sequence determinants for RdRp translation in this virus and demonstrated that a coupled termination-reinitiation (stop-restart) strategy is indeed used. Signals for termination-reinitiation are found within a 32-nucleotide stretch of RNA immediately upstream of the AUGA motif, including a predicted pseudoknot structure. The close proximity in which this predicted structure is followed by the upstream ORF's stop codon appears to be especially important for promoting translation of the downstream ORF. The normal strong preferences for an AUG start codon and the canonical sequence context to favor translation initiation appear somewhat relaxed for the downstream ORF. Similar sequence motifs and predicted RNA structures in other victoriviruses suggest that they all share a related stop-restart strategy for RdRp translation. Members of the genus Victorivirus thus provide new and unique opportunities for exploring the molecular mechanisms of translational coupling, which remain only partly understood in this and other systems.

Efficient sequential and parallel algorithms for finding edit distance based motifs.

Science.gov (United States)

Pal, Soumitra; Xiao, Peng; Rajasekaran, Sanguthevar

2016-08-18

Motif search is an important step in extracting meaningful patterns from biological data. The general problem of motif search is intractable and there is a pressing need to develop efficient, exact and approximation algorithms to solve this problem. In this paper, we present several novel, exact, sequential and parallel algorithms for solving the (l,d) Edit-distance-based Motif Search (EMS) problem: given two integers l,d and n biological strings, find all strings of length l that appear in each input string with at most d errors of types substitution, insertion and deletion. One popular technique to solve the problem is to explore for each input string the set of all possible l-mers that belong to the d-neighborhood of any substring of the input string and output those which are common for all input strings. We introduce a novel and provably efficient neighborhood exploration technique. We show that it is enough to consider the candidates in the neighborhood which are at a distance exactly d. We compactly represent these candidate motifs using wildcard characters and efficiently explore them with very few repetitions. Our sequential algorithm uses a trie-based data structure to efficiently store and sort the candidate motifs. Our parallel algorithm in a multi-core shared memory setting uses arrays for storing and a novel modification of radix-sort for sorting the candidate motifs. The algorithms for EMS are customarily evaluated on several challenging instances such as (8,1), (12,2), (16,3), (20,4), and so on. The best previously known algorithm, EMS1, is sequential and in an estimated 3 days solves up to instance (16,3). Our sequential algorithms are more than 20 times faster on (16,3). On other hard instances such as (9,2), (11,3), (13,4), our algorithms are much faster. Our parallel algorithm has more than 600% scaling performance while using 16 threads. Our algorithms have pushed up the state-of-the-art of EMS solvers and we believe that the techniques introduced in
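To make the (l,d) EMS problem statement concrete, the following is a brute-force sketch and not the neighborhood-exploration algorithm described in the abstract. It assumes a DNA alphabet and enumerates every candidate l-mer, so it is practical only for very small l; the function names are illustrative.

```python
from itertools import product


def min_substring_edit_distance(pattern, text):
    """Smallest edit distance between `pattern` and any substring of `text`."""
    # prev[j] is the best distance between the pattern prefix processed so far
    # and some substring of `text` ending at position j. The all-zero first
    # row lets a match start anywhere in `text`.
    prev = [0] * (len(text) + 1)
    for i, pc in enumerate(pattern, start=1):
        curr = [i] + [0] * len(text)
        for j, tc in enumerate(text, start=1):
            curr[j] = min(
                prev[j - 1] + (pc != tc),  # match or substitution
                prev[j] + 1,               # pattern character left unmatched
                curr[j - 1] + 1,           # text character left unmatched
            )
        prev = curr
    return min(prev)


def ems_brute_force(strings, l, d, alphabet="ACGT"):
    """Return all l-mers that occur in every input string with at most d edits."""
    motifs = []
    for chars in product(alphabet, repeat=l):
        candidate = "".join(chars)
        if all(min_substring_edit_distance(candidate, s) <= d for s in strings):
            motifs.append(candidate)
    return motifs


print(ems_brute_force(["ACGT", "ACGA"], l=3, d=0))
```

The enumeration costs on the order of 4^l candidate checks, which is why the abstract's approach of exploring only the d-neighborhoods of input substrings matters for instances such as (16,3).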
A mutation in the Arabidopsis HYL1 gene encoding a dsRNA binding protein affects responses to abscisic acid, auxin, and cytokinin

Science.gov (United States)

Lu, C.; Fedoroff, N.

2000-01-01

Both physiological and genetic evidence indicate interconnections among plant responses to different hormones. We describe a pleiotropic recessive Arabidopsis transposon insertion mutation, designated hyponastic leaves (hyl1), that alters the plant's responses to several hormones. The mutant is characterized by shorter stature, delayed flowering, leaf hyponasty, reduced fertility, decreased rate of root growth, and an altered root gravitropic response. It also exhibits less sensitivity to auxin and cytokinin and hypersensitivity to abscisic acid (ABA). The auxin transport inhibitor 2,3,5-triiodobenzoic acid normalizes the mutant phenotype somewhat, whereas another auxin transport inhibitor, N-(1-naphthyl)phthalamic acid, exacerbates the phenotype. The gene, designated HYL1, encodes a 419-amino acid protein that contains two double-stranded RNA (dsRNA) binding motifs, a nuclear localization motif, and a C-terminal repeat structure suggestive of a protein-protein interaction domain. We present evidence that the HYL1 gene is ABA-regulated and encodes a nuclear dsRNA binding protein. We hypothesize that the HYL1 protein is a regulatory protein functioning at the transcriptional or post-transcriptional level.
Physical-chemical property based sequence motifs and methods regarding same

Science.gov (United States)

Braun, Werner [Friendswood, TX]; Mathura, Venkatarajan S [Sarasota, FL]; Schein, Catherine H [Friendswood, TX]

2008-09-09

A data analysis system, program, and/or method, e.g., a data mining/data exploration method, using physical-chemical property motifs. For example, a sequence database may be searched for identifying segments thereof having physical-chemical properties similar to the physical-chemical property motifs.
Mechanism of Genome Interrogation: How CRISPR RNA-Guided Cas9 Proteins Locate Specific Targets on DNA.

Science.gov (United States)

Shvets, Alexey A; Kolomeisky, Anatoly B

2017-10-03

The ability to precisely edit and modify a genome opens endless opportunities to investigate fundamental properties of living systems as well as to advance various medical techniques and bioengineering applications. This possibility is now close to reality due to a recent discovery of the adaptive bacterial immune system, which is based on clustered regularly interspaced short palindromic repeats (CRISPR)-associated proteins (Cas) that utilize RNA to find and cut the double-stranded DNA molecules at specific locations. Here we develop a quantitative theoretical approach to analyze the mechanism of target search on DNA by CRISPR RNA-guided Cas9 proteins, which is followed by a selective cleavage of nucleic acids. It is based on a discrete-state stochastic model that takes into account the most relevant physical-chemical processes in the system. Using a method of first-passage processes, a full dynamic description of the target search is presented. It is found that the location of specific sites on DNA by CRISPR Cas9 proteins is governed by binding first to protospacer adjacent motif sequences on DNA, which is followed by reversible transitions into DNA interrogation states. In addition, the search dynamics is strongly influenced by the off-target cutting. Our theoretical calculations allow us to explain the experimental observations and to give experimentally testable predictions. Thus, the presented theoretical model clarifies some molecular aspects of the genome interrogation by CRISPR RNA-guided Cas9 proteins. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

Science.gov (United States)

Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

2018-03-01

Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli. Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.
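As a small illustration of the feed-forward loop motif discussed above, the sketch below enumerates all node triples (X, Y, Z) with edges X→Y, Y→Z and X→Z in a directed edge list. It is a simple enumeration under the assumption that extra edges among the three nodes are allowed; it is not the aggregation analysis used in the study, and the function name is illustrative.

```python
def feed_forward_loops(edges):
    """Return sorted (x, y, z) triples with edges x->y, y->z and x->z."""
    successors = {}
    for u, v in edges:
        if u != v:  # self-loops cannot take part in a three-node motif
            successors.setdefault(u, set()).add(v)

    loops = []
    for x, x_targets in successors.items():
        for y in x_targets:
            for z in successors.get(y, ()):
                # z must also be regulated directly by x to close the loop.
                if z != x and z in x_targets:
                    loops.append((x, y, z))
    return sorted(loops)


# Hypothetical toy network with a single feed-forward loop A -> B -> C, A -> C.
toy_edges = [("A", "B"), ("B", "C"), ("A", "C"), ("C", "D")]
print(feed_forward_loops(toy_edges))
```

Counting how many loops share a node or an edge would be a natural next step towards the kind of motif aggregation the abstract describes, but the exact connection types it distinguishes are not specified here.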
Condition-specific RNA editing in the coral symbiont Symbiodinium microadriaticum

KAUST Repository

Liew, Yi Jin

2017-03-01

RNA editing is a rare post-transcriptional event that provides cells with an additional level of gene expression regulation. It has been implicated in various processes including adaptation, viral defence and RNA interference; however, its potential role as a mechanism in acclimatization has just recently been recognised. Here, we show that RNA editing occurs in 1.6% of all nuclear-encoded genes of Symbiodinium microadriaticum, a dinoflagellate symbiont of reef-building corals. All base-substitution edit types were present, and statistically significant motifs were associated with three edit types. Strikingly, a subset of genes exhibited condition-specific editing patterns in response to different stressors that resulted in significant increases of non-synonymous changes. We posit that this previously unrecognised mechanism extends this organism’s capability to respond to stress beyond what is encoded by the genome. This in turn may provide further acclimatization capacity to these organisms, and by extension, their coral hosts.
Condition-specific RNA editing in the coral symbiont Symbiodinium microadriaticum

KAUST Repository

Liew, Yi Jin; Li, Yong; Baumgarten, Sebastian; Voolstra, Christian R.; Aranda, Manuel

2017-01-01

RNA editing is a rare post-transcriptional event that provides cells with an additional level of gene expression regulation. It has been implicated in various processes including adaptation, viral defence and RNA interference; however, its potential role as a mechanism in acclimatization has just recently been recognised. Here, we show that RNA editing occurs in 1.6% of all nuclear-encoded genes of Symbiodinium microadriaticum, a dinoflagellate symbiont of reef-building corals. All base-substitution edit types were present, and statistically significant motifs were associated with three edit types. Strikingly, a subset of genes exhibited condition-specific editing patterns in response to different stressors that resulted in significant increases of non-synonymous changes. We posit that this previously unrecognised mechanism extends this organism’s capability to respond to stress beyond what is encoded by the genome. This in turn may provide further acclimatization capacity to these organisms, and by extension, their coral hosts.
RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

Science.gov (United States)

Castro-Mondragon, Jaime Abraham; Jaeger, Sébastien; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

2017-07-27

Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

Directory of Open Access Journals (Sweden)

Vassilev Boris

2010-04-01

Background: A large proportion of an organism's genome encodes membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results: We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. Of these 213 motifs, 58 were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94%) appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1) a dimer interface motif found in voltage-gated chloride channels, (2) a proton transfer motif found in heme-copper oxidases, and (3) a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions: Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.
Quantitative Phosphoproteomic Analysis Provides Insight into the Response to Short-Term Drought Stress in Ammopiptanthus mongolicus Roots

Directory of Open Access Journals (Sweden)

Huigai Sun

2017-10-01

Drought is one of the major abiotic stresses that negatively affect plant growth and development. Ammopiptanthus mongolicus is an ecologically important shrub in the mid-Asia desert region and is used as a model for abiotic tolerance research in trees. Protein phosphorylation participates in the regulation of various biological processes; however, knowledge of phosphorylation events associated with drought stress signaling and response in plants is still limited. Here, we conducted a quantitative phosphoproteomic analysis of the response of A. mongolicus roots to short-term drought stress. Data are available via the iProx database with project ID IPX0000971000. In total, 7841 phosphorylation sites were found from the 2019 identified phosphopeptides, corresponding to 1060 phosphoproteins. Drought stress results in significant changes in the abundance of 103 phosphopeptides, corresponding to 90 differentially-phosphorylated phosphoproteins (DPPs). Motif-x analysis identified two motifs, [pSP] and [RXXpS], from these DPPs. Functional enrichment and protein-protein interaction analysis showed that the DPPs were mainly involved in signal transduction and transcriptional regulation, osmotic adjustment, stress response and defense, RNA splicing and transport, protein synthesis, folding and degradation, and epigenetic regulation. These drought-responsive phosphoproteins, and the related signaling and metabolic pathways, probably play important roles in drought stress signaling and response in A. mongolicus roots. Our results provide new information for understanding the molecular mechanism of the abiotic stress response in plants at the posttranslational level.
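The two motifs reported above can be read as positional patterns around a phosphorylated serine: in [pSP] a proline directly follows the phosphoserine, and in [RXXpS] an arginine sits three residues upstream, with X standing for any residue. The sketch below checks a single site against both patterns. It assumes this reading of the notation, a one-letter amino acid sequence and a 0-based site index; the function name and the example peptide are hypothetical.

```python
def classify_phosphosite(sequence, site):
    """Return the motif labels matched by the phosphoserine at `site`.

    `sequence` is a one-letter amino acid string and `site` is the 0-based
    index of the phosphorylated residue (both assumptions for illustration).
    """
    motifs = []
    if sequence[site] != "S":
        return motifs  # both motifs are defined around a phosphoserine
    if site + 1 < len(sequence) and sequence[site + 1] == "P":
        motifs.append("pSP")    # proline at position +1
    if site >= 3 and sequence[site - 3] == "R":
        motifs.append("RXXpS")  # arginine at position -3
    return motifs


# Hypothetical peptide: the serine at index 5 has R at -3 and P at +1.
print(classify_phosphosite("AARKLSPEE", 5))
```

Motif-x itself derives such patterns statistically from a set of aligned phosphopeptides against a background; this sketch only shows how an already reported motif could be matched to individual sites.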
RNA graph partitioning for the discovery of RNA modularity: a novel application of graph partition algorithm to biology.

Directory of Open Access Journals (Sweden)

Namhee Kim

also suggest design strategies for novel RNA motifs.
Vfold: a web server for RNA structure and folding thermodynamics prediction.

Science.gov (United States)

Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

2014-01-01

The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".
[Cover motifs of the Tidsskrift. A 14-year cavalcade].

Science.gov (United States)

Nylenna, M

1998-12-10

In 1985 the Journal of the Norwegian Medical Association changed its cover policy, moving the table of contents inside the Journal and introducing cover illustrations. This article provides an analysis of all cover illustrations published over this 14-year period, 420 covers in all. There is a great variation in cover motifs and designs and a development towards more general motifs. The initial emphasis on historical and medical aspects is now less pronounced, while the use of works of art and nature motifs has increased, and the cover now more often has a direct bearing on the specific contents of the issue. Professor of medical history Oivind Larsen has photographed two thirds of the covers and contributed 95% of the inside essay-style reflections on the cover motif. Over the years, he has expanded the role of the historian of medicine disseminating knowledge to include that of the raconteur with a personal tone of voice. The Journal's covers are now one of its most characteristic features, emblematic of the Journal's ambition of standing for quality and timelessness vis-à-vis the news media, and of its aim of bridging the gap between medicine and the humanities.
Insights into the motif preference of APOBEC3 enzymes.

Directory of Open Access Journals (Sweden)

Diako Ebrahimi

We used a multivariate data analysis approach to identify motifs associated with HIV hypermutation by different APOBEC3 enzymes. The analysis showed that APOBEC3G targets G mainly within GG, TG, TGG, GGG, TGGG and also GGGT. The G nucleotides flanked by a C at the 3' end (in +1 and +2 positions) were indicated as disfavoured targets by APOBEC3G. The G nucleotides within GGGG were found to be targeted at a frequency much lower than expected. We found that the infrequent G-to-A mutation within GGGG is not limited to the inaccessibility, to APOBEC3, of poly Gs in the central and 3' polypurine tracts (PPTs), which remain double stranded during HIV reverse transcription. GGGG motifs outside the PPTs were also disfavoured. The motifs GGAG and GAGG were also found to be disfavoured targets for APOBEC3. The motif-dependent mutation of G within the HIV genome by members of the APOBEC3 family other than APOBEC3G was limited to GA→AA changes. The results did not show evidence of other types of context-dependent G-to-A changes in the HIV genome.
Structural Insights into RNA Recognition by the Alternate-Splicing Regulator CUG-Binding Protein 1

Energy Technology Data Exchange (ETDEWEB)

M Teplova; J Song; H Gaw; A Teplov; D Patel

2011-12-31

CUG-binding protein 1 (CUGBP1) regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of myotonic dystrophy. CUGBP1 harbors three RRM domains and preferentially targets UGU-rich mRNA elements. We describe crystal structures of CUGBP1 RRM1 and tandem RRM1/2 domains bound to RNAs containing tandem UGU(U/G) elements. Both RRM1 in RRM1-RNA and RRM2 in RRM1/2-RNA complexes use similar principles to target UGU(U/G) elements, with recognition mediated by face-to-edge stacking and water-mediated hydrogen-bonding networks. The UG step adopts a left-handed Z-RNA conformation, with the syn guanine recognized through Hoogsteen edge-protein backbone hydrogen-bonding interactions. NMR studies on the RRM1/2-RNA complex establish that both RRM domains target tandem UGUU motifs in solution, whereas filter-binding assays identify a preference for recognition of GU over AU or GC steps. We discuss the implications of CUGBP1-mediated targeting and sequestration of UGU(U/G) elements on pre-mRNA alternative-splicing regulation, translational regulation, and mRNA decay.
Deep Sequencing Insights in Therapeutic shRNA Processing and siRNA Target Cleavage Precision.

Science.gov (United States)

Denise, Hubert; Moschos, Sterghios A; Sidders, Benjamin; Burden, Frances; Perkins, Hannah; Carter, Nikki; Stroud, Tim; Kennedy, Michael; Fancy, Sally-Ann; Lapthorn, Cris; Lavender, Helen; Kinloch, Ross; Suhy, David; Corbau, Romu

2014-02-04

TT-034 (PF-05095808) is a recombinant adeno-associated virus serotype 8 (AAV8) agent expressing three short hairpin RNA (shRNA) pro-drugs that target the hepatitis C virus (HCV) RNA genome. The cytosolic enzyme Dicer cleaves each shRNA into multiple, potentially active small interfering RNA (siRNA) drugs. Using next-generation sequencing (NGS) to identify and characterize active shRNA maturation products, we observed that each TT-034-encoded shRNA could be processed into as many as 95 separate siRNA strands. Few of these appeared active as determined by Sanger 5' RNA Ligase-Mediated Rapid Amplification of cDNA Ends (5-RACE) and through synthetic shRNA and siRNA analogue studies. Moreover, NGS scrutiny applied on 5-RACE products (RACE-seq) suggested that synthetic siRNAs could direct cleavage in not one, but up to five separate positions on targeted RNA, in a sequence-dependent manner. These data support an on-target mechanism of action for TT-034 without cytotoxicity and question the accepted precision of substrate processing by the key RNA interference (RNAi) enzymes Dicer and siRNA-induced silencing complex (siRISC). Molecular Therapy-Nucleic Acids (2014) 3, e145; doi:10.1038/mtna.2013.73; published online 4 February 2014.
I-motif DNA structures are formed in the nuclei of human cells

Science.gov (United States)

Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

2018-06-01

Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.
Structural insights into RNA processing by the human RISC-loading complex.

Science.gov (United States)

Wang, Hong-Wei; Noland, Cameron; Siridechadilok, Bunpote; Taylor, David W; Ma, Enbo; Felderer, Karin; Doudna, Jennifer A; Nogales, Eva

2009-11-01

Targeted gene silencing by RNA interference (RNAi) requires loading of a short guide RNA (small interfering RNA (siRNA) or microRNA (miRNA)) onto an Argonaute protein to form the functional center of an RNA-induced silencing complex (RISC). In humans, Argonaute2 (AGO2) assembles with the guide RNA-generating enzyme Dicer and the RNA-binding protein TRBP to form a RISC-loading complex (RLC), which is necessary for efficient transfer of nascent siRNAs and miRNAs from Dicer to AGO2. Here, using single-particle EM analysis, we show that human Dicer has an L-shaped structure. The RLC Dicer's N-terminal DExH/D domain, located in a short 'base branch', interacts with TRBP, whereas its C-terminal catalytic domains in the main body are proximal to AGO2. A model generated by docking the available atomic structures of Dicer and Argonaute homologs into the RLC reconstruction suggests a mechanism for siRNA transfer from Dicer to AGO2.
Design of character-based DNA barcode motif for species identification: A computational approach and its validation in fishes.

Science.gov (United States)

Chakraborty, Mohua; Dhar, Bishal; Ghosh, Sankar Kumar

2017-11-01

DNA barcodes are generally interpreted using distance-based and character-based methods. The former uses clustering of comparable groups, based on the relative genetic distance, while the latter is based on the presence or absence of discrete nucleotide substitutions. The distance-based approach has a limitation in defining a universal species boundary across the taxa as the rate of mtDNA evolution is not constant throughout the taxa. However, the character-based approach more accurately defines this using a unique set of nucleotide characters. The character-based analysis of the full-length barcode has some inherent limitations, like sequencing of the full-length barcode, use of a sparse-data matrix and lack of a uniform diagnostic position for each group. A short continuous stretch of a fragment can be used to resolve the limitations. Here, we observe that a 154-bp fragment, from the transversion-rich domain of 1367 COI barcode sequences, can successfully delimit species in the three most diverse orders of freshwater fishes. This fragment is used to design species-specific barcode motifs for 109 species by the character-based method, which successfully identifies the correct species using a pattern-matching program. The motifs also correctly identify geographically isolated populations of the Cypriniformes species. Further, this region is validated as a species-specific mini-barcode for freshwater fishes by successful PCR amplification and sequencing of the motif (154 bp) using the designed primers. We anticipate that use of such motifs will enhance the diagnostic power of DNA barcodes, and the mini-barcode approach will greatly benefit the field-based system of rapid species identification. © 2017 John Wiley & Sons Ltd.
Transcriptome landscape of Lactococcus lactis reveals many novel RNAs including a small regulatory RNA involved in carbon uptake and metabolism.

Science.gov (United States)

van der Meulen, Sjoerd B; de Jong, Anne; Kok, Jan

2016-01-01

RNA sequencing has revolutionized genome-wide transcriptome analyses, and the identification of non-coding regulatory RNAs in bacteria has thus increased concurrently. Here we reveal the transcriptome map of the lactic acid bacterial paradigm Lactococcus lactis MG1363 by employing differential RNA sequencing (dRNA-seq) and a combination of manual and automated transcriptome mining. This resulted in a high-resolution genome annotation of L. lactis and the identification of 60 cis-encoded antisense RNAs (asRNAs), 186 trans-encoded putative regulatory RNAs (sRNAs) and 134 novel small ORFs. Based on the putative targets of asRNAs, a novel classification is proposed. Several transcription factor DNA binding motifs were identified in the promoter sequences of (a)sRNAs, providing insight into the interplay between lactococcal regulatory RNAs and transcription factors. The presence and lengths of 14 putative sRNAs were experimentally confirmed by differential Northern hybridization, including the abundant RNA 6S that is differentially expressed depending on the available carbon source. For another sRNA, LLMGnc_147, functional analysis revealed that it is involved in carbon uptake and metabolism. L. lactis contains 13% leaderless mRNAs (lmRNAs) that, from an analysis of overrepresentation in GO classes, seem predominantly involved in nucleotide metabolism and DNA/RNA binding. Moreover, an A-rich sequence motif immediately following the start codon was uncovered, which could provide novel insight into the translation of lmRNAs. Altogether, this first experimental genome-wide assessment of the transcriptome landscape of L. lactis and subsequent sRNA studies provide an extensive basis for the investigation of regulatory RNAs in L. lactis and related lactococcal species.

A novel glutamine–RNA interaction identified by screening libraries in mammalian cells

OpenAIRE

Tan, Ruoying; Frankel, Alan D.

1998-01-01

The arginine-rich motif provides a versatile framework for RNA recognition in which few amino acids other than arginine are needed to mediate specific binding. Using a mammalian screening system based on transcriptional activation by HIV Tat, we identified novel arginine-rich peptides from combinatorial libraries that bind tightly to the Rev response element of HIV. Remarkably, a single glutamine, but not asparagine, within a stretch of polyarginine can mediate high-affinity binding. These re...
Identification of a putative nuclear export signal motif in human NANOG homeobox domain

International Nuclear Information System (INIS)

Park, Sung-Won; Do, Hyun-Jin; Huh, Sun-Hyung; Sung, Boreum; Uhm, Sang-Jun; Song, Hyuk; Kim, Nam-Hyung; Kim, Jae-Hwan

2012-01-01

Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif (¹²⁵MQELSNILNL¹³⁴) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant lacking both NLSs and retaining only the putative NES motif (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD containing only one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. In addition, the CRM1-specific nuclear export inhibitor LMB blocked the potent NES-mediated export of hNANOG, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in the CRM1-mediated nuclear export pathway.
Leucine-based receptor sorting motifs are dependent on the spacing relative to the plasma membrane

DEFF Research Database (Denmark)

Geisler, C; Dietrich, J; Nielsen, B L

1998-01-01

Many integral membrane proteins contain leucine-based motifs within their cytoplasmic domains that mediate internalization and intracellular sorting. Two types of leucine-based motifs have been identified. One type is dependent on phosphorylation, whereas the other type, which includes an acidic...... amino acid, is constitutively active. In this study, we have investigated how the spacing relative to the plasma membrane affects the function of both types of leucine-based motifs. For phosphorylation-dependent leucine-based motifs, a minimal spacing of 7 residues between the plasma membrane...... and the phospho-acceptor was required for phosphorylation and thereby activation of the motifs. For constitutively active leucine-based motifs, a minimal spacing of 6 residues between the plasma membrane and the acidic residue was required for optimal activity of the motifs. In addition, we found that the acidic...
Structural modelling and phylogenetic analyses of PgeIF4A2 (Eukaryotic translation initiation factor) from Pennisetum glaucum reveal signature motifs with a role in stress tolerance and development.

Science.gov (United States)

Agarwal, Aakrati; Mudgil, Yashwanti; Pandey, Saurabh; Fartyal, Dhirendra; Reddy, Malireddy K

2016-01-01

Eukaryotic translation initiation factor 4A (eIF4A) is an indispensable component of the translation machinery and also plays a role in developmental processes and stress alleviation in plants and animals. Different eIF4A isoforms are present in the cytosol of the cell, namely eIF4A1, eIF4A2, and eIF4A3, and their expression is tightly regulated in cap-dependent translation. We revealed the structural model of the PgeIF4A2 protein using the crystal structure of Homo sapiens eIF4A3 (PDB ID: 2J0S) as the template in Modeller 9.12. The resultant PgeIF4A2 model structure was refined by PROCHECK, ProSA, Verify3D and RMSD, which showed that the model structure is reliable, with 77% amino acid sequence identity with the template. Investigation revealed two conserved signatures, the ATP-dependent RNA helicase DEAD-box conserved site (VLDEADEML) and the RNA helicase DEAD-box type Q-motif, in the sheet-turn-helix and α-helical regions, respectively. All these conserved motifs are responsible for the response during developmental stages and stress tolerance in plants.
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

Science.gov (United States)

Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

2016-08-09

Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance
Identification and functional analysis of novel phosphorylation sites in the RNA surveillance protein Upf1.

Science.gov (United States)

Lasalde, Clarivel; Rivera, Andrea V; León, Alfredo J; González-Feliciano, José A; Estrella, Luis A; Rodríguez-Cruz, Eva N; Correa, María E; Cajigas, Iván J; Bracho, Dina P; Vega, Irving E; Wilkinson, Miles F; González, Carlos I

2014-02-01

One third of inherited genetic diseases are caused by mRNAs harboring premature termination codons as a result of nonsense mutations. These aberrant mRNAs are degraded by the Nonsense-Mediated mRNA Decay (NMD) pathway. A central component of the NMD pathway is Upf1, an RNA-dependent ATPase and helicase. Upf1 is a known phosphorylated protein, but only portions of this large protein have been examined for phosphorylation sites and the functional relevance of its phosphorylation has not been elucidated in Saccharomyces cerevisiae. Using tandem mass spectrometry analyses, we report the identification of 11 putative phosphorylated sites in S. cerevisiae Upf1. Five of these phosphorylated residues are located within the ATPase and helicase domains and are conserved in higher eukaryotes, suggesting a biological significance for their phosphorylation. Indeed, functional analysis demonstrated that a small carboxy-terminal motif harboring at least three phosphorylated amino acids is important for three Upf1 functions: ATPase activity, NMD activity and the ability to promote translation termination efficiency. We provide evidence that two tyrosines within this phospho-motif (Y-738 and Y-742) act redundantly to promote ATP hydrolysis, NMD efficiency and translation termination fidelity.
Targeting functional motifs of a protein family

Science.gov (United States)

Bhadola, Pradeep; Deo, Nivedita

2016-10-01

The structural organization of a protein family is investigated by devising a method based on random matrix theory (RMT), which uses the physicochemical properties of the amino acids with multiple sequence alignment. A graphical method to represent protein sequences using physicochemical properties is devised that gives a fast, easy, and informative way of comparing the evolutionary distances between protein sequences. A correlation matrix associated with each property is calculated, where the noise reduction and information filtering is done using RMT involving an ensemble of Wishart matrices. The analysis of the eigenvalue statistics of the correlation matrix for the β-lactamase family shows the universal features as observed in the Gaussian orthogonal ensemble (GOE). The property-based approach captures the short- as well as the long-range correlation (approximately following GOE) between the eigenvalues, whereas the previous approach (treating amino acids as characters) gives the usual short-range correlations, while the long-range correlations are the same as that of an uncorrelated series. The distribution of the eigenvector components for the eigenvalues outside the bulk (RMT bound) deviates significantly from RMT observations and contains important information about the system. The information content of each eigenvector of the correlation matrix is quantified by introducing an entropic estimate, which shows that for the β-lactamase family the smallest eigenvectors (low eigenmodes) are highly localized as well as informative. These small eigenvectors, when processed, give clusters involving positions that have well-defined biological and structural importance matching with experiments. The approach is crucial for the recognition of structural motifs as shown in β-lactamase (and other families) and selectively identifies the important positions for targets to deactivate (activate) the enzymatic actions.
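Two quantities mentioned in this abstract, the RMT bound on the eigenvalue bulk and an entropy measure of eigenvector localization, can be sketched as follows. This is a minimal illustration under stated assumptions: the bounds are the standard Marchenko-Pastur edges for a correlation matrix of uncorrelated unit-variance variables, and the entropy is the Shannon entropy of squared eigenvector components. The paper's exact entropic estimate is not given in the abstract and may differ, and the function names are hypothetical.

```python
import math


def marchenko_pastur_bounds(n_variables, n_samples):
    """Bulk eigenvalue bounds for a Wishart correlation matrix.

    Assumes unit-variance, uncorrelated variables and n_samples >= n_variables.
    """
    ratio = n_variables / n_samples
    lower = (1 - math.sqrt(ratio)) ** 2
    upper = (1 + math.sqrt(ratio)) ** 2
    return lower, upper


def eigenvector_entropy(components):
    """Shannon entropy of the squared, normalised eigenvector components.

    Low entropy means the eigenvector is localized on a few positions.
    """
    weights = [c * c for c in components]
    total = sum(weights)
    probs = [w / total for w in weights if w > 0]
    return -sum(p * math.log(p) for p in probs)


if __name__ == "__main__":
    print(marchenko_pastur_bounds(100, 400))
    print(eigenvector_entropy([1.0, 0.0, 0.0, 0.0]))   # fully localized
    print(eigenvector_entropy([0.5, 0.5, 0.5, 0.5]))   # fully delocalized
```

Eigenvalues falling outside the returned bounds would be the candidates for carrying non-random information. An eigenvector spread evenly over n positions reaches the maximum entropy ln(n).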
New archetypes in self-assembled Phe-Phe motif induced nanostructures from nucleoside conjugated-diphenylalanines.

Science.gov (United States)

Datta, Dhrubajyoti; Tiwari, Omshanker; Ganesh, Krishna N

2018-02-15

During the last two decades, the molecular self-assembly of the short peptide diphenylalanine (Phe-Phe) motif has attracted increasing focus due to its unique morphological structure and utility for potential applications in biomaterial chemistry, sensors and bioelectronics. Due to the ease of their synthetic modifications and a plethora of available experimental tools, the self-assembly of free and protected diphenylalanine scaffolds (H-Phe-Phe-OH, Boc-Phe-Phe-OH and Boc-Phe-Phe-OMe) has unfurled interesting tubular, vesicular or fibrillar morphologies. Developing on this theme, here we attempt to examine the effect of structure and properties (hydrophobic and H-bonding) modifying the functional C-terminus conjugated substituents on Boc-Phe-Phe on its self-assembly process. The consequent self-sorting due to H-bonding, van der Waals force and π-π interactions, generates monodisperse nano-vesicles from these peptides characterized via their SEM, HRTEM, AFM pictures and DLS experiments. The stability of these vesicles to different external stimuli such as pH and temperature, encapsulation of fluorescent probes inside the vesicles and their release by external trigger are reported. The results point to a new direction in the study and applications of the Phe-Phe motif to rationally engineer new functional nano-architectures.
Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1

Science.gov (United States)

Wang, Iren; Hennig, Janosch; Jagtap, Pravin Kumar Ankush; Sonntag, Miriam; Valcárcel, Juan; Sattler, Michael

2014-01-01

Alternative pre-messenger ribonucleic acid (pre-mRNA) splicing is an essential process in eukaryotic gene regulation. The T-cell intracellular antigen-1 (TIA-1) is an apoptosis-promoting factor that modulates alternative splicing of transcripts, including the pre-mRNA encoding the membrane receptor Fas. TIA-1 is a multi-domain ribonucleic acid (RNA) binding protein that recognizes poly-uridine tract RNA sequences to facilitate 5′ splice site recognition by the U1 small nuclear ribonucleoprotein (snRNP). Here, we characterize the RNA interaction and conformational dynamics of TIA-1 by nuclear magnetic resonance (NMR), isothermal titration calorimetry (ITC) and small angle X-ray scattering (SAXS). Our NMR-derived solution structure of TIA-1 RRM2–RRM3 (RRM2,3) reveals that RRM2 adopts a canonical RNA recognition motif (RRM) fold, while RRM3 is preceded by a non-canonical helix α0. NMR and SAXS data show that all three RRMs are largely independent structural modules in the absence of RNA, while RNA binding induces a compact arrangement. RRM2,3 binds to pyrimidine-rich FAS pre-mRNA or poly-uridine (U9) RNA with nanomolar affinities. RRM1 has little intrinsic RNA binding affinity and does not strongly contribute to RNA binding in the context of RRM1,2,3. Our data unravel the role of binding avidity and the contributions of the TIA-1 RRMs for recognition of pyrimidine-rich RNAs. PMID:24682828
Structure of the exon junction core complex with a trapped DEAD-box ATPase bound to RNA

DEFF Research Database (Denmark)

Andersen, Christian Brix Folsted; Ballut, Lionel; Johansen, Jesper Sanderhoff

2006-01-01

exon junction core complex containing the DEAD-box adenosine triphosphatase (ATPase) eukaryotic initiation factor 4AIII (eIF4AIII) bound to an ATP analog, MAGOH, Y14, a fragment of MLN51, and a polyuracil mRNA mimic. eIF4AIII interacts with the phosphate-ribose backbone of six consecutive nucleotides...... and prevents part of the bound RNA from being double stranded. The MAGOH and Y14 subunits lock eIF4AIII in a prehydrolysis state, and activation of the ATPase probably requires only modest conformational changes in eIF4AIII motif I....
Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

Science.gov (United States)

Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

2018-02-01

The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to the accumulation of information about a huge number of DNA sequences. Many web services are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. The enormous diversity of motifs makes them challenging to find. To avoid these difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of motif discovery programs declines dramatically as the query set size increases. As a result, only a fraction of the top "peak" ChIP-Seq segments can be analyzed, or the area of analysis must be narrowed. Thus, motif discovery in massive datasets remains a challenging issue. The Argo_Compute Unified Device Architecture (CUDA) web service is designed to process massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in the 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed motifs that correspond to known transcription factor binding sites.
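To make the "15-letter IUPAC code" concrete, the following minimal sketch matches a fixed-length degenerate motif against a DNA sequence on the CPU. It only illustrates what a degenerate motif match means. It is not Argo_CUDA's exhaustive GPU search, and the function name is hypothetical.

```python
# The 15-letter IUPAC nucleotide code: each symbol stands for a set of bases.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def find_motif(sequence, motif):
    """Return 0-based start positions where the degenerate motif matches."""
    sequence = sequence.upper()
    motif = motif.upper()
    width = len(motif)
    hits = []
    for start in range(len(sequence) - width + 1):
        window = sequence[start:start + width]
        # Every base in the window must belong to the set named by the motif symbol.
        if all(base in IUPAC[code] for base, code in zip(window, motif)):
            hits.append(start)
    return hits


if __name__ == "__main__":
    print(find_motif("ACGTACGT", "RCG"))
```

An exhaustive discovery tool would additionally enumerate candidate motifs and score their enrichment; this sketch covers only the matching step for one given motif.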
iFORM: Incorporating Find Occurrence of Regulatory Motifs.

Science.gov (United States)

Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

2016-01-01

Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.
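Fisher's combined probability test, named in this abstract as the integration step, can be sketched in a few lines. The sketch assumes the per-program p-values are independent and already available; how iFORM derives those p-values is not described in the abstract, and the function name is hypothetical.

```python
import math


def fisher_combined_pvalue(pvalues):
    """Combine independent p-values with Fisher's method.

    The statistic X = -2 * sum(ln p_i) follows a chi-square distribution with
    2k degrees of freedom under the null; for even degrees of freedom the
    survival function has the closed form used below.
    """
    if not pvalues or any(not (0 < p <= 1) for p in pvalues):
        raise ValueError("p-values must lie in (0, 1]")
    k = len(pvalues)
    half_stat = -sum(math.log(p) for p in pvalues)  # X / 2
    term = 1.0
    total = 1.0
    # Sum of (X/2)^i / i! for i = 0 .. k-1, built up incrementally.
    for i in range(1, k):
        term *= half_stat / i
        total += term
    return math.exp(-half_stat) * total


if __name__ == "__main__":
    print(fisher_combined_pvalue([0.01, 0.01]))
```

With a single p-value the function returns it unchanged. Several moderately small p-values combine into a smaller one, which is the property that makes the method useful for pooling evidence from multiple scanners.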
Lucky Motifs in Chinese Folk Art: Interpreting Paper-cut from Chinese Shaanxi

OpenAIRE

Xuxiao WANG

2013-01-01

Paper-cut is not simply a form of traditional Chinese folk art. Lucky motifs developed in paper-cut certainly acquired profound cultural connotations. As paper-cut is a time-honoured skill across the nation, interpreting those motifs requires cultural receptiveness and anthropological sensitivity. The author of this article analyzes examples of paper-cut from Northern Shaanxi, China, to identify the cohesive motifs and explore the auspiciousness of the specific concepts of Fu, Lu, Shou, Xi. T...
A validated pipeline for detection of SNVs and short InDels from RNA Sequencing

Directory of Open Access Journals (Sweden)

Nitin Mandloi

2017-12-01

In this study, we have developed a pipeline to detect germline variants from RNA-seq data. The pipeline steps include pre-processing, alignment, GATK best practices for RNA-seq, and variant filtering. The pre-processing step includes base and adapter trimming and removal of contamination reads from rRNA, tRNA, mitochondrial DNA and repeat regions. Alignment of the pre-processed reads is performed using STAR/HiSAT. We then applied GATK best practices for RNA-seq data to call germline variants. We benchmarked our pipeline on NA12878 RNA-seq data downloaded from SRA (SRR1258218). After variant calling, the quality-passed variants were compared against the gold standard variants provided by the GIAB consortium. Of the total ~3.6 million high-quality variants reported as gold standard variants for this sample (considering the whole genome), our pipeline identified ~58,104 variants to be expressed in RNA-seq. Our pipeline achieved more than 99% sensitivity in detection of germline variants.
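The sensitivity figure in this abstract is the fraction of gold standard variants that the pipeline recovers. A minimal sketch of that calculation follows, assuming variants are represented as (chromosome, position, ref, alt) tuples and that the truth set has already been restricted to the regions under evaluation. The abstract does not describe how that restriction was done, and the toy variants below are invented.

```python
def sensitivity(called, truth):
    """Fraction of truth variants that are also present in the call set."""
    truth = set(truth)
    if not truth:
        raise ValueError("truth set is empty")
    true_positives = len(truth & set(called))
    return true_positives / len(truth)


if __name__ == "__main__":
    # Toy variants for illustration only: (chromosome, position, ref, alt).
    truth = [("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"),
             ("chr2", 50, "G", "A"), ("chr2", 75, "T", "C")]
    called = [("chr1", 100, "A", "G"), ("chr1", 200, "C", "T"),
              ("chr2", 50, "G", "A"), ("chr3", 10, "A", "T")]
    print(sensitivity(called, truth))
```

Calls absent from the truth set (the chr3 variant here) do not affect sensitivity; they would count against precision instead.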
Genome-wide identification of VQ motif-containing proteins and their expression profiles under abiotic stresses in maize

Directory of Open Access Journals (Sweden)

Weibin Song

2016-01-01

VQ motif-containing proteins play crucial roles in abiotic stress responses in plants. Recent studies have shown that some VQ proteins physically interact with WRKY transcription factors to activate downstream genes. In the present study, we identified and characterized genes encoding VQ motif-containing proteins using the most recent version of the maize genome sequence. In total, 61 VQ genes were identified. In a cluster analysis, these genes clustered into nine groups together with their homologous genes in rice and Arabidopsis. Most of the VQ genes identified in maize (57 out of 61) were found to be single-copy genes. Analyses of RNA-seq data obtained using seedlings under long-term drought treatment showed that the expression levels of most ZmVQ genes (41 out of 61 members) changed during the drought stress response. Quantitative real-time PCR analyses showed that most of the ZmVQ genes were responsive to NaCl treatment. Also, approximately half of the ZmVQ genes were co-expressed with ZmWRKY genes. The identification of these VQ genes in the maize genome and knowledge of their expression profiles under drought and osmotic stresses will provide a solid foundation for exploring their specific functions in the abiotic stress responses of maize.
dPORE-miRNA: Polymorphic regulation of microRNA genes

KAUST Repository

Schmeier, Sebastian; Schaefer, Ulf; MacPherson, Cameron R.; Bajic, Vladimir B.

2011-01-01

Background: MicroRNAs (miRNAs) are short non-coding RNA molecules that act as post-transcriptional regulators and affect the regulation of protein-coding genes. Mostly transcribed by PolII, miRNA genes are regulated at the transcriptional level similarly to protein-coding genes. In this study we focus on human miRNAs. These miRNAs are involved in a variety of pathways and can affect many diseases. Our interest is in possible deregulation of the transcription initiation of the miRNA-encoding genes, which is facilitated by variations in the genomic sequence of transcriptional control regions (promoters). Methodology: Our aim is to provide an online resource to facilitate the investigation of the potential effects of single nucleotide polymorphisms (SNPs) on miRNA gene regulation. We analyzed SNPs that overlap predicted transcription factor binding sites (TFBSs) in promoters of miRNA genes. We also accounted for the creation of novel TFBSs due to polymorphisms not present in the reference genome. The resulting changes in the original TFBSs and potential creation of new TFBSs were incorporated into the Dragon Database of Polymorphic Regulation of miRNA genes (dPORE-miRNA). Conclusions: The dPORE-miRNA database enables researchers to explore potential effects of SNPs on the regulation of miRNAs. dPORE-miRNA can be interrogated with regard to: (a) miRNAs (their targets, or involvement in diseases, or biological pathways), (b) SNPs, or (c) transcription factors. dPORE-miRNA can be accessed at http://cbrc.kaust.edu.sa/dpore and http://apps.sanbi.ac.za/dpore/. Its use is free for academic and non-profit users. © 2011 Schmeier et al.
The fission yeast RNA binding protein Mmi1 regulates meiotic genes by controlling intron specific splicing and polyadenylation coupled RNA turnover.

Directory of Open Access Journals (Sweden)

Huei-Mei Chen

The polyA tails of mRNAs are monitored by the exosome as a quality control mechanism. We find that fission yeast, Schizosaccharomyces pombe, adopts this RNA quality control mechanism to regulate a group of 30 or more meiotic genes at the level of both splicing and RNA turnover. In vegetative cells the RNA binding protein Mmi1 binds to the primary transcripts of these genes. We find the novel motif U(U/C/G)AAAC highly over-represented in targets of Mmi1. Mmi1 can specifically regulate the splicing of particular introns in a transcript: it inhibits the splicing of introns that are in the vicinity of putative Mmi1 binding sites, while allowing the splicing of other introns that are far from such sites. In addition, binding of Mmi1, particularly near the 3' end, alters 3' processing to promote extremely long polyA tails of up to a kilobase. The hyperadenylated transcripts are then targeted for degradation by the nuclear exonuclease Rrp6. The nuclear polyA binding protein Pab2 assists this hyperadenylation-mediated RNA decay. Rrp6 also targets other hyperadenylated transcripts, which become hyperadenylated in an unknown, but Mmi1-independent way. Thus, hyperadenylation may be a general signal for RNA degradation. In addition, binding of Mmi1 can affect the efficiency of 3' cleavage. Inactivation of Mmi1 in meiosis allows meiotic expression, through splicing and RNA stabilization, of at least 29 target genes, which are apparently constitutively transcribed.
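The hexamer reported in this abstract, U followed by U, C or G and then AAAC, translates directly into a regular expression. The sketch below is a minimal illustration of locating such sites in a transcript. The function name is hypothetical, and the conversion of T to U is an added convenience for DNA-style input, not something stated in the abstract.

```python
import re

# U, then one of U/C/G, then AAAC. The lookahead keeps overlapping hits.
MMI1_MOTIF = re.compile(r"(?=(U[UCG]AAAC))")


def find_mmi1_motifs(rna):
    """Return (0-based start, matched hexamer) for every motif occurrence."""
    rna = rna.upper().replace("T", "U")  # accept DNA-style input as well
    return [(m.start(), m.group(1)) for m in MMI1_MOTIF.finditer(rna)]


if __name__ == "__main__":
    print(find_mmi1_motifs("GGUUAAACCCUCAAACAUAAAC"))
```

Site positions found this way could then be compared with intron coordinates to ask which introns lie near a putative binding site, which is the kind of proximity the abstract discusses.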
MOMFER: A Search Engine of Thompson's Motif-Index of Folk Literature

NARCIS (Netherlands)

Karsdorp, F.B.; van der Meulen, Marten; Meder, Theo; van den Bosch, Antal

2015-01-01

More than fifty years after the first edition of Thompson's seminal Motif-Index of Folk Literature, we present an online search engine tailored to fully disclose the index digitally. This search engine, called MOMFER, greatly enhances the searchability of the Motif-Index and provides exciting new
Formation of RNA phosphodiester bond by histidine-containing dipeptides

DEFF Research Database (Denmark)

Wieczorek, Rafal; Dörr, Mark; Chotera, Agata

2013-01-01

A new scenario for prebiotic formation of nucleic acid oligomers is presented. Peptide catalysis is applied to achieve condensation of activated RNA monomers into short RNA chains. As catalysts, L-dipeptides containing a histidine residue, primarily Ser-His, were used. Reactions were carried out...... in self-organised environment, a water-ice eutectic phase, with low concentrations of reactants. Incubation periods up to 30 days resulted in the formation of short oligomers of RNA. During the oligomerisation, an active intermediate (dipeptide-mononucleotide) is produced, which is the reactive species...... by a transamination mechanism. Because peptides are much more likely products of spontaneous condensation than nucleotide chains, their potential as catalysts for the formation of RNA is interesting from the origin-of-life perspective. Finally, the formation of the dipeptide-mononucleotide intermediate and its...
