WorldWideScience

Sample records for rna structure analysis

  1. RNA STRAND: The RNA Secondary Structure and Statistical Analysis Database

    Directory of Open Access Journals (Sweden)

    Andronescu Mirela

    2008-08-01

    Full Text Available Abstract Background The ability to access, search and analyse secondary structures of a large set of known RNA molecules is very important for deriving improved RNA energy models, for evaluating computational predictions of RNA secondary structures and for a better understanding of RNA folding. Currently there is no database that can easily provide these capabilities for almost all RNA molecules with known secondary structures. Results In this paper we describe RNA STRAND – the RNA secondary STRucture and statistical ANalysis Database, a curated database containing known secondary structures of any type and organism. Our new database provides a wide collection of known RNA secondary structures drawn from public databases, searchable and downloadable in a common format. Comprehensive statistical information on the secondary structures in our database is provided using the RNA Secondary Structure Analyser, a new tool we have developed to analyse RNA secondary structures. The information thus obtained is valuable for understanding to which extent and with which probability certain structural motifs can appear. We outline several ways in which the data provided in RNA STRAND can facilitate research on RNA structure, including the improvement of RNA energy models and evaluation of secondary structure prediction programs. In order to keep up-to-date with new RNA secondary structure experiments, we offer the necessary tools to add solved RNA secondary structures to our database and invite researchers to contribute to RNA STRAND. Conclusion RNA STRAND is a carefully assembled database of trusted RNA secondary structures, with easy on-line tools for searching, analyzing and downloading user selected entries, and is publicly available at http://www.rnasoft.ca/strand.

  2. RNA2DMut: a web tool for the design and analysis of RNA structure mutations.

    Science.gov (United States)

    Moss, Walter N

    2018-03-01

    With the widespread application of high-throughput sequencing, novel RNA sequences are being discovered at an astonishing rate. The analysis of function, however, lags behind. In both the cis - and trans -regulatory functions of RNA, secondary structure (2D base-pairing) plays essential regulatory roles. In order to test RNA function, it is essential to be able to design and analyze mutations that can affect structure. This was the motivation for the creation of the RNA2DMut web tool. With RNA2DMut, users can enter in RNA sequences to analyze, constrain mutations to specific residues, or limit changes to purines/pyrimidines. The sequence is analyzed at each base to determine the effect of every possible point mutation on 2D structure. The metrics used in RNA2DMut rely on the calculation of the Boltzmann structure ensemble and do not require a robust 2D model of RNA structure for designing mutations. This tool can facilitate a wide array of uses involving RNA: for example, in designing and evaluating mutants for biological assays, interrogating RNA-protein interactions, identifying key regions to alter in SELEX experiments, and improving RNA folding and crystallization properties for structural biology. Additional tools are available to help users introduce other mutations (e.g., indels and substitutions) and evaluate their effects on RNA structure. Example calculations are shown for five RNAs that require 2D structure for their function: the MALAT1 mascRNA, an influenza virus splicing regulatory motif, the EBER2 viral noncoding RNA, the Xist lncRNA repA region, and human Y RNA 5. RNA2DMut can be accessed at https://rna2dmut.bb.iastate.edu/. © 2018 Moss; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  3. Global Analysis of RNA Secondary Structure in Two Metazoans

    Directory of Open Access Journals (Sweden)

    Fan Li

    2012-01-01

    Full Text Available The secondary structure of RNA is necessary for its maturation, regulation, processing, and function. However, the global influence of RNA folding in eukaryotes is still unclear. Here, we use a high-throughput, sequencing-based, structure-mapping approach to identify the paired (double-stranded RNA [dsRNA] and unpaired (single-stranded RNA [ssRNA] components of the Drosophila melanogaster and Caenorhabditis elegans transcriptomes, which allows us to identify conserved features of RNA secondary structure in metazoans. From this analysis, we find that ssRNAs and dsRNAs are significantly correlated with specific epigenetic modifications. Additionally, we find key structural patterns across protein-coding transcripts that indicate that RNA folding demarcates regions of protein translation and likely affects microRNA-mediated regulation of mRNAs in animals. Finally, we identify and characterize 546 mRNAs whose folding pattern is significantly correlated between these metazoans, suggesting that their structure has some function. Overall, our findings provide a global assessment of RNA folding in animals.

  4. RNAstructure: software for RNA secondary structure prediction and analysis.

    Science.gov (United States)

    Reuter, Jessica S; Mathews, David H

    2010-03-15

    To understand an RNA sequence's mechanism of action, the structure must be known. Furthermore, target RNA structure is an important consideration in the design of small interfering RNAs and antisense DNA oligonucleotides. RNA secondary structure prediction, using thermodynamics, can be used to develop hypotheses about the structure of an RNA sequence. RNAstructure is a software package for RNA secondary structure prediction and analysis. It uses thermodynamics and utilizes the most recent set of nearest neighbor parameters from the Turner group. It includes methods for secondary structure prediction (using several algorithms), prediction of base pair probabilities, bimolecular structure prediction, and prediction of a structure common to two sequences. This contribution describes new extensions to the package, including a library of C++ classes for incorporation into other programs, a user-friendly graphical user interface written in JAVA, and new Unix-style text interfaces. The original graphical user interface for Microsoft Windows is still maintained. The extensions to RNAstructure serve to make RNA secondary structure prediction user-friendly. The package is available for download from the Mathews lab homepage at http://rna.urmc.rochester.edu/RNAstructure.html.

  5. A method for rapid similarity analysis of RNA secondary structures

    Directory of Open Access Journals (Sweden)

    Liu Na

    2006-11-01

    Full Text Available Abstract Background Owing to the rapid expansion of RNA structure databases in recent years, efficient methods for structure comparison are in demand for function prediction and evolutionary analysis. Usually, the similarity of RNA secondary structures is evaluated based on tree models and dynamic programming algorithms. We present here a new method for the similarity analysis of RNA secondary structures. Results Three sets of real data have been used as input for the example applications. Set I includes the structures from 5S rRNAs. Set II includes the secondary structures from RNase P and RNase MRP. Set III includes the structures from 16S rRNAs. Reasonable phylogenetic trees are derived for these three sets of data by using our method. Moreover, our program runs faster as compared to some existing ones. Conclusion The famous Lempel-Ziv algorithm can efficiently extract the information on repeated patterns encoded in RNA secondary structures and makes our method an alternative to analyze the similarity of RNA secondary structures. This method will also be useful to researchers who are interested in evolutionary analysis.

  6. Comparative structural analysis of human DEAD-box RNA helicases.

    Directory of Open Access Journals (Sweden)

    Patrick Schütz

    2010-09-01

    Full Text Available DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members.

  7. Comparative structural analysis of human DEAD-box RNA helicases.

    Science.gov (United States)

    Schütz, Patrick; Karlberg, Tobias; van den Berg, Susanne; Collins, Ruairi; Lehtiö, Lari; Högbom, Martin; Holmberg-Schiavone, Lovisa; Tempel, Wolfram; Park, Hee-Won; Hammarström, Martin; Moche, Martin; Thorsell, Ann-Gerd; Schüler, Herwig

    2010-09-30

    DEAD-box RNA helicases play various, often critical, roles in all processes where RNAs are involved. Members of this family of proteins are linked to human disease, including cancer and viral infections. DEAD-box proteins contain two conserved domains that both contribute to RNA and ATP binding. Despite recent advances the molecular details of how these enzymes convert chemical energy into RNA remodeling is unknown. We present crystal structures of the isolated DEAD-domains of human DDX2A/eIF4A1, DDX2B/eIF4A2, DDX5, DDX10/DBP4, DDX18/myc-regulated DEAD-box protein, DDX20, DDX47, DDX52/ROK1, and DDX53/CAGE, and of the helicase domains of DDX25 and DDX41. Together with prior knowledge this enables a family-wide comparative structural analysis. We propose a general mechanism for opening of the RNA binding site. This analysis also provides insights into the diversity of DExD/H- proteins, with implications for understanding the functions of individual family members.

  8. On infrared spectroscopic analysis of transfer RNA secondary structure

    Energy Technology Data Exchange (ETDEWEB)

    Semenov, M A; Starikov, E B

    1987-07-14

    Various techniques of IR spectroscopy in the 1550-1750 cm/sup -1/ region employed to analyse the tRNA secondary structure are discussed and a novel improved method is proposed. The main novel features of this method are the approximation of tRNA helical region spectra by catalogue carbonyl absorption bands and approximation of tRNA nonhelical region spectra by those of homopolyribonucleotides. The IR spectra of tRNA/sub yeast//sup phe/ and tRNA/sub E.coli//sup fmet/ in the carbonyl vibration region are explained on the basis of calculated transition moment coupling.

  9. RNA-Pareto: interactive analysis of Pareto-optimal RNA sequence-structure alignments.

    Science.gov (United States)

    Schnattinger, Thomas; Schöning, Uwe; Marchfelder, Anita; Kestler, Hans A

    2013-12-01

    Incorporating secondary structure information into the alignment process improves the quality of RNA sequence alignments. Instead of using fixed weighting parameters, sequence and structure components can be treated as different objectives and optimized simultaneously. The result is not a single, but a Pareto-set of equally optimal solutions, which all represent different possible weighting parameters. We now provide the interactive graphical software tool RNA-Pareto, which allows a direct inspection of all feasible results to the pairwise RNA sequence-structure alignment problem and greatly facilitates the exploration of the optimal solution set.

  10. Kinetic analysis of the effects of target structure on siRNA efficiency

    Science.gov (United States)

    Chen, Jiawen; Zhang, Wenbing

    2012-12-01

    RNAi efficiency for target cleavage and protein expression is related to the target structure. Considering the RNA-induced silencing complex (RISC) as a multiple turnover enzyme, we investigated the effect of target mRNA structure on siRNA efficiency with kinetic analysis. The 4-step model was used to study the target cleavage kinetic process: hybridization nucleation at an accessible target site, RISC-mRNA hybrid elongation along with mRNA target structure melting, target cleavage, and enzyme reactivation. At this model, the terms accounting for the target accessibility, stability, and the seed and the nucleation site effects are all included. The results are in good agreement with that of experiments which show different arguments about the structure effects on siRNA efficiency. It shows that the siRNA efficiency is influenced by the integrated factors of target's accessibility, stability, and the seed effects. To study the off-target effects, a simple model of one siRNA binding to two mRNA targets was designed. By using this model, the possibility for diminishing the off-target effects by the concentration of siRNA was discussed.

  11. RNA-Seq-Based Transcript Structure Analysis with TrBorderExt.

    Science.gov (United States)

    Wang, Yejun; Sun, Ming-An; White, Aaron P

    2018-01-01

    RNA-Seq has become a routine strategy for genome-wide gene expression comparisons in bacteria. Despite lower resolution in transcript border parsing compared with dRNA-Seq, TSS-EMOTE, Cappable-seq, Term-seq, and others, directional RNA-Seq still illustrates its advantages: low cost, quantification and transcript border analysis with a medium resolution (±10-20 nt). To facilitate mining of directional RNA-Seq datasets especially with respect to transcript structure analysis, we developed a tool, TrBorderExt, which can parse transcript start sites and termination sites accurately in bacteria. A detailed protocol is described in this chapter for how to use the software package step by step to identify bacterial transcript borders from raw RNA-Seq data. The package was developed with Perl and R programming languages, and is accessible freely through the website: http://www.szu-bioinf.org/TrBorderExt .

  12. RNA three-way junctions can act as flexible RNA structural elements in large RNA molecules: a molecular simulation analysis

    Czech Academy of Sciences Publication Activity Database

    Beššeová, Ivana; Réblová, Kamila; Leontis, N.B.; Šponer, Jiří

    2009-01-01

    Roč. 26, č. 6 (2009), s. 830-831 ISSN 0739-1102. [The 17th Conversation . 16.06.2009-20.06.2009, Albany] Institutional research plan: CEZ:AV0Z50040507; CEZ:AV0Z50040702 Keywords : RNA three-way junctions * RNA Subject RIV: BO - Biophysics

  13. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes.

    Science.gov (United States)

    Parker, Brian J; Moltke, Ida; Roth, Adam; Washietl, Stefan; Wen, Jiayu; Kellis, Manolis; Breaker, Ronald; Pedersen, Jakob Skou

    2011-11-01

    Regulatory RNA structures are often members of families with multiple paralogous instances across the genome. Family members share functional and structural properties, which allow them to be studied as a whole, facilitating both bioinformatic and experimental characterization. We have developed a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein-coding regions comprising 725 individual structures, including 48 families with known structural RNA elements. Known families identified include both noncoding RNAs, e.g., miRNAs and the recently identified MALAT1/MEN β lincRNA family; and cis-regulatory structures, e.g., iron-responsive elements. We also identify tens of new families supported by strong evolutionary evidence and other statistical evidence, such as GO term enrichments. For some of these, detailed analysis has led to the formulation of specific functional hypotheses. Examples include two hypothesized auto-regulatory feedback mechanisms: one involving six long hairpins in the 3'-UTR of MAT2A, a key metabolic gene that produces the primary human methyl donor S-adenosylmethionine; the other involving a tRNA-like structure in the intron of the tRNA maturation gene POP1. We experimentally validate the predicted MAT2A structures. Finally, we identify potential new regulatory networks, including large families of short hairpins enriched in immunity-related genes, e.g., TNF, FOS, and CTLA4, which include known transcript destabilizing elements. Our findings exemplify the diversity of post-transcriptional regulation and provide a resource for further characterization of new regulatory mechanisms and families of noncoding RNAs.

  14. Hydration sites of unpaired RNA bases: a statistical analysis of the PDB structures

    Directory of Open Access Journals (Sweden)

    Carugo Oliviero

    2011-10-01

    Full Text Available Abstract Background Hydration is crucial for RNA structure and function. X-ray crystallography is the most commonly used method to determine RNA structures and hydration and, therefore, statistical surveys are based on crystallographic results, the number of which is quickly increasing. Results A statistical analysis of the water molecule distribution in high-resolution X-ray structures of unpaired RNA nucleotides showed that: different bases have the same penchant to be surrounded by water molecules; clusters of water molecules indicate possible hydration sites, which, in some cases, match those of the major and minor grooves of RNA and DNA double helices; complex hydrogen bond networks characterize the solvation of the nucleotides, resulting in a significant rigidity of the base and its surrounding water molecules. Interestingly, the hydration sites around unpaired RNA bases do not match, in general, the positions that are occupied by the second nucleotide when the base-pair is formed. Conclusions The hydration sites around unpaired RNA bases were found. They do not replicate the atom positions of complementary bases in the Watson-Crick pairs.

  15. Rich RNA Structure Landscapes Revealed by Mutate-and-Map Analysis.

    Directory of Open Access Journals (Sweden)

    Pablo Cordero

    2015-11-01

    Full Text Available Landscapes exhibiting multiple secondary structures arise in natural RNA molecules that modulate gene expression, protein synthesis, and viral infection [corrected]. We report herein that high-throughput chemical experiments can isolate an RNA's multiple alternative secondary structures as they are stabilized by systematic mutagenesis (mutate-and-map, M2 and that a computational algorithm, REEFFIT, enables unbiased reconstruction of these states' structures and populations. In an in silico benchmark on non-coding RNAs with complex landscapes, M2-REEFFIT recovers 95% of RNA helices present with at least 25% population while maintaining a low false discovery rate (10% and conservative error estimates. In experimental benchmarks, M2-REEFFIT recovers the structure landscapes of a 35-nt MedLoop hairpin, a 110-nt 16S rRNA four-way junction with an excited state, a 25-nt bistable hairpin, and a 112-nt three-state adenine riboswitch with its expression platform, molecules whose characterization previously required expert mutational analysis and specialized NMR or chemical mapping experiments. With this validation, M2-REEFFIT enabled tests of whether artificial RNA sequences might exhibit complex landscapes in the absence of explicit design. An artificial flavin mononucleotide riboswitch and a randomly generated RNA sequence are found to interconvert between three or more states, including structures for which there was no design, but that could be stabilized through mutations. These results highlight the likely pervasiveness of rich landscapes with multiple secondary structures in both natural and artificial RNAs and demonstrate an automated chemical/computational route for their empirical characterization.

  16. A quantitative analysis of secondary RNA structure using domination based parameters on trees

    Directory of Open Access Journals (Sweden)

    Zou Yue

    2006-03-01

    Full Text Available Abstract Background It has become increasingly apparent that a comprehensive database of RNA motifs is essential in order to achieve new goals in genomic and proteomic research. Secondary RNA structures have frequently been represented by various modeling methods as graph-theoretic trees. Using graph theory as a modeling tool allows the vast resources of graphical invariants to be utilized to numerically identify secondary RNA motifs. The domination number of a graph is a graphical invariant that is sensitive to even a slight change in the structure of a tree. The invariants selected in this study are variations of the domination number of a graph. These graphical invariants are partitioned into two classes, and we define two parameters based on each of these classes. These parameters are calculated for all small order trees and a statistical analysis of the resulting data is conducted to determine if the values of these parameters can be utilized to identify which trees of orders seven and eight are RNA-like in structure. Results The statistical analysis shows that the domination based parameters correctly distinguish between the trees that represent native structures and those that are not likely candidates to represent RNA. Some of the trees previously identified as candidate structures are found to be "very" RNA like, while others are not, thereby refining the space of structures likely to be found as representing secondary RNA structure. Conclusion Search algorithms are available that mine nucleotide sequence databases. However, the number of motifs identified can be quite large, making a further search for similar motif computationally difficult. Much of the work in the bioinformatics arena is toward the development of better algorithms to address the computational problem. This work, on the other hand, uses mathematical descriptors to more clearly characterize the RNA motifs and thereby reduce the corresponding search space. These

  17. Homology Modeling and Analysis of Structure Predictions of the Bovine Rhinitis B Virus RNA Dependent RNA Polymerase (RdRp

    Directory of Open Access Journals (Sweden)

    Devendra K. Rai

    2012-07-01

    Full Text Available Bovine Rhinitis B Virus (BRBV is a picornavirus responsible for mild respiratory infection of cattle. It is probably the least characterized among the aphthoviruses. BRBV is the closest relative known to Foot and Mouth Disease virus (FMDV with a ~43% identical polyprotein sequence and as much as 67% identical sequence for the RNA dependent RNA polymerase (RdRp, which is also known as 3D polymerase (3Dpol. In the present study we carried out phylogenetic analysis, structure based sequence alignment and prediction of three-dimensional structure of BRBV 3Dpol using a combination of different computational tools. Model structures of BRBV 3Dpol were verified for their stereochemical quality and accuracy. The BRBV 3Dpol structure predicted by SWISS-MODEL exhibited highest scores in terms of stereochemical quality and accuracy, which were in the range of 2Å resolution crystal structures. The active site, nucleic acid binding site and overall structure were observed to be in agreement with the crystal structure of unliganded as well as template/primer (T/P, nucleotide tri-phosphate (NTP and pyrophosphate (PPi bound FMDV 3Dpol (PDB, 1U09 and 2E9Z. The closest proximity of BRBV and FMDV 3Dpol as compared to human rhinovirus type 16 (HRV-16 and rabbit hemorrhagic disease virus (RHDV 3Dpols is also substantiated by phylogeny analysis and root-mean square deviation (RMSD between C-α traces of the polymerase structures. The absence of positively charged α-helix at C terminal, significant differences in non-covalent interactions especially salt bridges and CH-pi interactions around T/P channel of BRBV 3Dpol compared to FMDV 3Dpol, indicate that despite a very high homology to FMDV 3Dpol, BRBV 3Dpol may adopt a different mechanism for handling its substrates and adapting to physiological requirements. Our findings will be valuable in the

  18. RNA-SSPT: RNA Secondary Structure Prediction Tools.

    Science.gov (United States)

    Ahmad, Freed; Mahboob, Shahid; Gulzar, Tahsin; Din, Salah U; Hanif, Tanzeela; Ahmad, Hifza; Afzal, Muhammad

    2013-01-01

    The prediction of RNA structure is useful for understanding evolution for both in silico and in vitro studies. Physical methods like NMR studies to predict RNA secondary structure are expensive and difficult. Computational RNA secondary structure prediction is easier. Comparative sequence analysis provides the best solution. But secondary structure prediction of a single RNA sequence is challenging. RNA-SSPT is a tool that computationally predicts secondary structure of a single RNA sequence. Most of the RNA secondary structure prediction tools do not allow pseudoknots in the structure or are unable to locate them. Nussinov dynamic programming algorithm has been implemented in RNA-SSPT. The current studies shows only energetically most favorable secondary structure is required and the algorithm modification is also available that produces base pairs to lower the total free energy of the secondary structure. For visualization of RNA secondary structure, NAVIEW in C language is used and modified in C# for tool requirement. RNA-SSPT is built in C# using Dot Net 2.0 in Microsoft Visual Studio 2005 Professional edition. The accuracy of RNA-SSPT is tested in terms of Sensitivity and Positive Predicted Value. It is a tool which serves both secondary structure prediction and secondary structure visualization purposes.

  19. Crystal structure analysis reveals functional flexibility in the selenocysteine-specific tRNA from mouse.

    Directory of Open Access Journals (Sweden)

    Oleg M Ganichkin

    Full Text Available Selenocysteine tRNAs (tRNA(Sec exhibit a number of unique identity elements that are recognized specifically by proteins of the selenocysteine biosynthetic pathways and decoding machineries. Presently, these identity elements and the mechanisms by which they are interpreted by tRNA(Sec-interacting factors are incompletely understood.We applied rational mutagenesis to obtain well diffracting crystals of murine tRNA(Sec. tRNA(Sec lacking the single-stranded 3'-acceptor end ((ΔGCCARNA(Sec yielded a crystal structure at 2.0 Å resolution. The global structure of (ΔGCCARNA(Sec resembles the structure of human tRNA(Sec determined at 3.1 Å resolution. Structural comparisons revealed flexible regions in tRNA(Sec used for induced fit binding to selenophosphate synthetase. Water molecules located in the present structure were involved in the stabilization of two alternative conformations of the anticodon stem-loop. Modeling of a 2'-O-methylated ribose at position U34 of the anticodon loop as found in a sub-population of tRNA(Secin vivo showed how this modification favors an anticodon loop conformation that is functional during decoding on the ribosome. Soaking of crystals in Mn(2+-containing buffer revealed eight potential divalent metal ion binding sites but the located metal ions did not significantly stabilize specific structural features of tRNA(Sec.We provide the most highly resolved structure of a tRNA(Sec molecule to date and assessed the influence of water molecules and metal ions on the molecule's conformation and dynamics. Our results suggest how conformational changes of tRNA(Sec support its interaction with proteins.

  20. RNA Thermodynamic Structural Entropy.

    Science.gov (United States)

    Garcia-Martin, Juan Antonio; Clote, Peter

    2015-01-01

    Conformational entropy for atomic-level, three dimensional biomolecules is known experimentally to play an important role in protein-ligand discrimination, yet reliable computation of entropy remains a difficult problem. Here we describe the first two accurate and efficient algorithms to compute the conformational entropy for RNA secondary structures, with respect to the Turner energy model, where free energy parameters are determined from UV absorption experiments. An algorithm to compute the derivational entropy for RNA secondary structures had previously been introduced, using stochastic context free grammars (SCFGs). However, the numerical value of derivational entropy depends heavily on the chosen context free grammar and on the training set used to estimate rule probabilities. Using data from the Rfam database, we determine that both of our thermodynamic methods, which agree in numerical value, are substantially faster than the SCFG method. Thermodynamic structural entropy is much smaller than derivational entropy, and the correlation between length-normalized thermodynamic entropy and derivational entropy is moderately weak to poor. In applications, we plot the structural entropy as a function of temperature for known thermoswitches, such as the repression of heat shock gene expression (ROSE) element, we determine that the correlation between hammerhead ribozyme cleavage activity and total free energy is improved by including an additional free energy term arising from conformational entropy, and we plot the structural entropy of windows of the HIV-1 genome. Our software RNAentropy can compute structural entropy for any user-specified temperature, and supports both the Turner'99 and Turner'04 energy parameters. It follows that RNAentropy is state-of-the-art software to compute RNA secondary structure conformational entropy. Source code is available at https://github.com/clotelab/RNAentropy/; a full web server is available at http

  1. RNA Thermodynamic Structural Entropy.

    Directory of Open Access Journals (Sweden)

    Juan Antonio Garcia-Martin

    Full Text Available Conformational entropy for atomic-level, three dimensional biomolecules is known experimentally to play an important role in protein-ligand discrimination, yet reliable computation of entropy remains a difficult problem. Here we describe the first two accurate and efficient algorithms to compute the conformational entropy for RNA secondary structures, with respect to the Turner energy model, where free energy parameters are determined from UV absorption experiments. An algorithm to compute the derivational entropy for RNA secondary structures had previously been introduced, using stochastic context free grammars (SCFGs. However, the numerical value of derivational entropy depends heavily on the chosen context free grammar and on the training set used to estimate rule probabilities. Using data from the Rfam database, we determine that both of our thermodynamic methods, which agree in numerical value, are substantially faster than the SCFG method. Thermodynamic structural entropy is much smaller than derivational entropy, and the correlation between length-normalized thermodynamic entropy and derivational entropy is moderately weak to poor. In applications, we plot the structural entropy as a function of temperature for known thermoswitches, such as the repression of heat shock gene expression (ROSE element, we determine that the correlation between hammerhead ribozyme cleavage activity and total free energy is improved by including an additional free energy term arising from conformational entropy, and we plot the structural entropy of windows of the HIV-1 genome. Our software RNAentropy can compute structural entropy for any user-specified temperature, and supports both the Turner'99 and Turner'04 energy parameters. It follows that RNAentropy is state-of-the-art software to compute RNA secondary structure conformational entropy. Source code is available at https://github.com/clotelab/RNAentropy/; a full web server is available at http

  2. High-throughput SHAPE analysis reveals structures in HIV-1 genomic RNA strongly conserved across distinct biological states.

    Directory of Open Access Journals (Sweden)

    Kevin A Wilkinson

    2008-04-01

    Full Text Available Replication and pathogenesis of the human immunodeficiency virus (HIV is tightly linked to the structure of its RNA genome, but genome structure in infectious virions is poorly understood. We invent high-throughput SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension technology, which uses many of the same tools as DNA sequencing, to quantify RNA backbone flexibility at single-nucleotide resolution and from which robust structural information can be immediately derived. We analyze the structure of HIV-1 genomic RNA in four biologically instructive states, including the authentic viral genome inside native particles. Remarkably, given the large number of plausible local structures, the first 10% of the HIV-1 genome exists in a single, predominant conformation in all four states. We also discover that noncoding regions functioning in a regulatory role have significantly lower (p-value < 0.0001 SHAPE reactivities, and hence more structure, than do viral coding regions that function as the template for protein synthesis. By directly monitoring protein binding inside virions, we identify the RNA recognition motif for the viral nucleocapsid protein. Seven structurally homologous binding sites occur in a well-defined domain in the genome, consistent with a role in directing specific packaging of genomic RNA into nascent virions. In addition, we identify two distinct motifs that are targets for the duplex destabilizing activity of this same protein. The nucleocapsid protein destabilizes local HIV-1 RNA structure in ways likely to facilitate initial movement both of the retroviral reverse transcriptase from its tRNA primer and of the ribosome in coding regions. Each of the three nucleocapsid interaction motifs falls in a specific genome domain, indicating that local protein interactions can be organized by the long-range architecture of an RNA. High-throughput SHAPE reveals a comprehensive view of HIV-1 RNA genome structure, and further

  3. Structural Analysis of Monomeric RNA-Dependent Polymerases: Evolutionary and Therapeutic Implications.

    Directory of Open Access Journals (Sweden)

    Rodrigo Jácome

    Full Text Available The crystal structures of monomeric RNA-dependent RNA polymerases and reverse transcriptases of more than 20 different viruses are available in the Protein Data Bank. They all share the characteristic right-hand shape of DNA- and RNA polymerases formed by the fingers, palm and thumb subdomains, and, in many cases, "fingertips" that extend from the fingers towards the thumb subdomain, giving the viral enzyme a closed right-hand appearance. Six conserved structural motifs that contain key residues for the proper functioning of the enzyme have been identified in all these RNA-dependent polymerases. These enzymes share a two divalent metal-ion mechanism of polymerization in which two conserved aspartate residues coordinate the interactions with the metal ions to catalyze the nucleotidyl transfer reaction. The recent availability of crystal structures of polymerases of the Orthomyxoviridae and Bunyaviridae families allowed us to make pairwise comparisons of the tertiary structures of polymerases belonging to the four main RNA viral groups, which has led to a phylogenetic tree in which single-stranded negative RNA viral polymerases have been included for the first time. This has also allowed us to use a homology-based structural prediction approach to develop a general three-dimensional model of the Ebola virus RNA-dependent RNA polymerase. Our model includes several of the conserved structural motifs and residues described in other viral RNA-dependent RNA polymerases that define the catalytic and highly conserved palm subdomain, as well as portions of the fingers and thumb subdomains. The results presented here help to understand the current use and apparent success of antivirals, i.e. Brincidofovir, Lamivudine and Favipiravir, originally aimed at other types of polymerases, to counteract the Ebola virus infection.

  4. Structural Analysis of ‘key’ Interactions in Functional RNA Molecules

    KAUST Repository

    Chawla, Mohit

    2018-04-01

    The main objective of the thesis is to carry out structural bioinformatics study along with usage of advanced quantum chemical methods to look at the structural stability and energetics of RNA building blocks. The main focus of the work described here lies on understanding the reasons behind the intrinsic stability of key interactions in nucleic acids. Crystal structures of RNA molecules exhibit fascinating variety of non-covalent interactions, which play an important role in maintaining the three dimensional structures. An accurate atomic level description of these interactions in the structural building blocks of RNA is a key to understand the structure-function relationship in these molecules. An effort has been made to link the conclusions drawn from quantum chemical computations on RNA base pairs in wide biochemical context of their occurrence in RNA structures. The initial attention was on the impact of natural and non-natural modifications of the nucleic acid bases on the structure and stability of base pairs that they are involved in. In the remaining sections we cover other molecular interactions shaping nucleic acids, as the interaction between ribose and the bases, and the fluoride sensing riboswitch system in order to investigate structure and dynamics of nucleic acids at the atomic level and to gain insight into the physical chemistry behind.

  5. Structural Analysis of ‘key’ Interactions in Functional RNA Molecules

    KAUST Repository

    Chawla, Mohit

    2018-01-01

    The main objective of the thesis is to carry out structural bioinformatics study along with usage of advanced quantum chemical methods to look at the structural stability and energetics of RNA building blocks. The main focus of the work described here lies on understanding the reasons behind the intrinsic stability of key interactions in nucleic acids. Crystal structures of RNA molecules exhibit fascinating variety of non-covalent interactions, which play an important role in maintaining the three dimensional structures. An accurate atomic level description of these interactions in the structural building blocks of RNA is a key to understand the structure-function relationship in these molecules. An effort has been made to link the conclusions drawn from quantum chemical computations on RNA base pairs in wide biochemical context of their occurrence in RNA structures. The initial attention was on the impact of natural and non-natural modifications of the nucleic acid bases on the structure and stability of base pairs that they are involved in. In the remaining sections we cover other molecular interactions shaping nucleic acids, as the interaction between ribose and the bases, and the fluoride sensing riboswitch system in order to investigate structure and dynamics of nucleic acids at the atomic level and to gain insight into the physical chemistry behind.

  6. Topology of RNA-RNA interaction structures

    DEFF Research Database (Denmark)

    Andersen, Jørgen Ellegaard; Huang, Fenix Wenda; Penner, Robert

    2012-01-01

    Abstract The topological filtration of interacting RNA complexes is studied, and the role is analyzed of certain diagrams called irreducible shadows, which form suitable building blocks for more general structures. We prove that, for two interacting RNAs, called interaction structures, there exist...

  7. PRince: a web server for structural and physicochemical analysis of protein-RNA interface.

    Science.gov (United States)

    Barik, Amita; Mishra, Abhishek; Bahadur, Ranjit Prasad

    2012-07-01

    We have developed a web server, PRince, which analyzes the structural features and physicochemical properties of the protein-RNA interface. Users need to submit a PDB file containing the atomic coordinates of both the protein and the RNA molecules in complex form (in '.pdb' format). They should also mention the chain identifiers of interacting protein and RNA molecules. The size of the protein-RNA interface is estimated by measuring the solvent accessible surface area buried in contact. For a given protein-RNA complex, PRince calculates structural, physicochemical and hydration properties of the interacting surfaces. All these parameters generated by the server are presented in a tabular format. The interacting surfaces can also be visualized with software plug-in like Jmol. In addition, the output files containing the list of the atomic coordinates of the interacting protein, RNA and interface water molecules can be downloaded. The parameters generated by PRince are novel, and users can correlate them with the experimentally determined biophysical and biochemical parameters for better understanding the specificity of the protein-RNA recognition process. This server will be continuously upgraded to include more parameters. PRince is publicly accessible and free for use. Available at http://www.facweb.iitkgp.ernet.in/~rbahadur/prince/home.html.

  8. New families of human regulatory RNA structures identified by comparative analysis of vertebrate genomes

    DEFF Research Database (Denmark)

    Parker, Brian John; Moltke, Ida; Roth, Adam

    2011-01-01

    a comparative method, EvoFam, for genome-wide identification of families of regulatory RNA structures, based on primary sequence and secondary structure similarity. We apply EvoFam to a 41-way genomic vertebrate alignment. Genome-wide, we identify 220 human, high-confidence families outside protein...

  9. Methods for RNA Analysis

    DEFF Research Database (Denmark)

    Olivarius, Signe

    of the transcriptome, 5’ end capture of RNA is combined with next-generation sequencing for high-throughput quantitative assessment of transcription start sites by two different methods. The methods presented here allow for functional investigation of coding as well as noncoding RNA and contribute to future...... RNAs rely on interactions with proteins, the establishment of protein-binding profiles is essential for the characterization of RNAs. Aiming to facilitate RNA analysis, this thesis introduces proteomics- as well as transcriptomics-based methods for the functional characterization of RNA. First, RNA...

  10. Characteristics and Prediction of RNA Structure

    Directory of Open Access Journals (Sweden)

    Hengwu Li

    2014-01-01

    Full Text Available RNA secondary structures with pseudoknots are often predicted by minimizing free energy, which is NP-hard. Most RNAs fold during transcription from DNA into RNA through a hierarchical pathway wherein secondary structures form prior to tertiary structures. Real RNA secondary structures often have local instead of global optimization because of kinetic reasons. The performance of RNA structure prediction may be improved by considering dynamic and hierarchical folding mechanisms. This study is a novel report on RNA folding that accords with the golden mean characteristic based on the statistical analysis of the real RNA secondary structures of all 480 sequences from RNA STRAND, which are validated by NMR or X-ray. The length ratios of domains in these sequences are approximately 0.382L, 0.5L, 0.618L, and L, where L is the sequence length. These points are just the important golden sections of sequence. With this characteristic, an algorithm is designed to predict RNA hierarchical structures and simulate RNA folding by dynamically folding RNA structures according to the above golden section points. The sensitivity and number of predicted pseudoknots of our algorithm are better than those of the Mfold, HotKnots, McQfold, ProbKnot, and Lhw-Zhu algorithms. Experimental results reflect the folding rules of RNA from a new angle that is close to natural folding.

  11. Exploring RNA structure by integrative molecular modelling

    DEFF Research Database (Denmark)

    Masquida, Benoît; Beckert, Bertrand; Jossinet, Fabrice

    2010-01-01

    RNA molecular modelling is adequate to rapidly tackle the structure of RNA molecules. With new structured RNAs constituting a central class of cellular regulators discovered every year, the need for swift and reliable modelling methods is more crucial than ever. The pragmatic method based...... on interactive all-atom molecular modelling relies on the observation that specific structural motifs are recurrently found in RNA sequences. Once identified by a combination of comparative sequence analysis and biochemical data, the motifs composing the secondary structure of a given RNA can be extruded...

  12. Inverse folding of RNA pseudoknot structures

    Directory of Open Access Journals (Sweden)

    Li Linda YM

    2010-06-01

    Full Text Available Abstract Background RNA exhibits a variety of structural configurations. Here we consider a structure to be tantamount to the noncrossing Watson-Crick and G-U-base pairings (secondary structure and additional cross-serial base pairs. These interactions are called pseudoknots and are observed across the whole spectrum of RNA functionalities. In the context of studying natural RNA structures, searching for new ribozymes and designing artificial RNA, it is of interest to find RNA sequences folding into a specific structure and to analyze their induced neutral networks. Since the established inverse folding algorithms, RNAinverse, RNA-SSD as well as INFO-RNA are limited to RNA secondary structures, we present in this paper the inverse folding algorithm Inv which can deal with 3-noncrossing, canonical pseudoknot structures. Results In this paper we present the inverse folding algorithm Inv. We give a detailed analysis of Inv, including pseudocodes. We show that Inv allows to design in particular 3-noncrossing nonplanar RNA pseudoknot 3-noncrossing RNA structures-a class which is difficult to construct via dynamic programming routines. Inv is freely available at http://www.combinatorics.cn/cbpc/inv.html. Conclusions The algorithm Inv extends inverse folding capabilities to RNA pseudoknot structures. In comparison with RNAinverse it uses new ideas, for instance by considering sets of competing structures. As a result, Inv is not only able to find novel sequences even for RNA secondary structures, it does so in the context of competing structures that potentially exhibit cross-serial interactions.

  13. On topological RNA interaction structures.

    Science.gov (United States)

    Qin, Jing; Reidys, Christian M

    2013-07-01

    Recently a folding algorithm of topological RNA pseudoknot structures was presented in Reidys et al. (2011). This algorithm folds single-stranded γ-structures, that is, RNA structures composed by distinct motifs of bounded topological genus. In this article, we set the theoretical foundations for the folding of the two backbone analogues of γ structures: the RNA γ-interaction structures. These are RNA-RNA interaction structures that are constructed by a finite number of building blocks over two backbones having genus at most γ. Combinatorial properties of γ-interaction structures are of practical interest since they have direct implications for the folding of topological interaction structures. We compute the generating function of γ-interaction structures and show that it is algebraic, which implies that the numbers of interaction structures can be computed recursively. We obtain simple asymptotic formulas for 0- and 1-interaction structures. The simplest class of interaction structures are the 0-interaction structures, which represent the two backbone analogues of secondary structures.

  14. Efficient RNA structure comparison algorithms.

    Science.gov (United States)

    Arslan, Abdullah N; Anandan, Jithendar; Fry, Eric; Monschke, Keith; Ganneboina, Nitin; Bowerman, Jason

    2017-12-01

    Recently proposed relative addressing-based ([Formula: see text]) RNA secondary structure representation has important features by which an RNA structure database can be stored into a suffix array. A fast substructure search algorithm has been proposed based on binary search on this suffix array. Using this substructure search algorithm, we present a fast algorithm that finds the largest common substructure of given multiple RNA structures in [Formula: see text] format. The multiple RNA structure comparison problem is NP-hard in its general formulation. We introduced a new problem for comparing multiple RNA structures. This problem has more strict similarity definition and objective, and we propose an algorithm that solves this problem efficiently. We also develop another comparison algorithm that iteratively calls this algorithm to locate nonoverlapping large common substructures in compared RNAs. With the new resulting tools, we improved the RNASSAC website (linked from http://faculty.tamuc.edu/aarslan ). This website now also includes two drawing tools: one specialized for preparing RNA substructures that can be used as input by the search tool, and another one for automatically drawing the entire RNA structure from a given structure sequence.

  15. Functional and Structural Analysis of a Highly-Expressed Yersinia pestis Small RNA following Infection of Cultured Macrophages.

    Directory of Open Access Journals (Sweden)

    Nan Li

    Full Text Available Non-coding small RNAs (sRNAs are found in practically all bacterial genomes and play important roles in regulating gene expression to impact bacterial metabolism, growth, and virulence. We performed transcriptomics analysis to identify sRNAs that are differentially expressed in Yersinia pestis that invaded the human macrophage cell line THP-1, compared to pathogens that remained extracellular in the presence of host. Using ultra high-throughput sequencing, we identified 37 novel and 143 previously known sRNAs in Y. pestis. In particular, the sRNA Ysr170 was highly expressed in intracellular Yersinia and exhibited a log2 fold change ~3.6 higher levels compared to extracellular bacteria. We found that knock-down of Ysr170 expression attenuated infection efficiency in cell culture and growth rate in response to different stressors. In addition, we applied selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE analysis to determine the secondary structure of Ysr170 and observed structural changes resulting from interactions with the aminoglycoside antibiotic gentamycin and the RNA chaperone Hfq. Interestingly, gentamicin stabilized helix 4 of Ysr170, which structurally resembles the native gentamicin 16S ribosomal binding site. Finally, we modeled the tertiary structure of Ysr170 binding to gentamycin using RNA motif modeling. Integration of these experimental and structural methods can provide further insight into the design of small molecules that can inhibit function of sRNAs required for pathogen virulence.

  16. New windows into retroviral RNA structures.

    Science.gov (United States)

    Jayaraman, Dhivya; Kenyon, Julia Claire

    2018-01-25

    The multiple roles of both viral and cellular RNAs have become increasingly apparent in recent years, and techniques to model them have become significantly more powerful, enabling faster and more accurate visualization of RNA structures. Techniques such as SHAPE (selective 2'OH acylation analysed by primer extension) have revolutionized the field, and have been used to examine RNAs belonging to many and diverse retroviruses. Secondary structure probing reagents such as these have been aided by the development of faster methods of analysis either via capillary or next-generation sequencing, allowing the analysis of entire genomes, and of retroviral RNA structures within virions. Techniques to model the three-dimensional structures of these large RNAs have also recently developed. The flexibility of retroviral RNAs, both structural and functional, is clear from the results of these new experimental techniques. Retroviral RNA structures and structural changes control many stages of the lifecycle, and both the RNA structures themselves and their interactions with ligands are potential new drug targets. In addition, our growing understanding of retroviral RNA structures is aiding our knowledge of cellular RNA form and function.

  17. Analysis of energy-based algorithms for RNA secondary structure prediction

    Directory of Open Access Journals (Sweden)

    Hajiaghayi Monir

    2012-02-01

    Full Text Available Abstract Background RNA molecules play critical roles in the cells of organisms, including roles in gene regulation, catalysis, and synthesis of proteins. Since RNA function depends in large part on its folded structures, much effort has been invested in developing accurate methods for prediction of RNA secondary structure from the base sequence. Minimum free energy (MFE predictions are widely used, based on nearest neighbor thermodynamic parameters of Mathews, Turner et al. or those of Andronescu et al. Some recently proposed alternatives that leverage partition function calculations find the structure with maximum expected accuracy (MEA or pseudo-expected accuracy (pseudo-MEA methods. Advances in prediction methods are typically benchmarked using sensitivity, positive predictive value and their harmonic mean, namely F-measure, on datasets of known reference structures. Since such benchmarks document progress in improving accuracy of computational prediction methods, it is important to understand how measures of accuracy vary as a function of the reference datasets and whether advances in algorithms or thermodynamic parameters yield statistically significant improvements. Our work advances such understanding for the MFE and (pseudo-MEA-based methods, with respect to the latest datasets and energy parameters. Results We present three main findings. First, using the bootstrap percentile method, we show that the average F-measure accuracy of the MFE and (pseudo-MEA-based algorithms, as measured on our largest datasets with over 2000 RNAs from diverse families, is a reliable estimate (within a 2% range with high confidence of the accuracy of a population of RNA molecules represented by this set. However, average accuracy on smaller classes of RNAs such as a class of 89 Group I introns used previously in benchmarking algorithm accuracy is not reliable enough to draw meaningful conclusions about the relative merits of the MFE and MEA-based algorithms

  18. Optimization of oligonucleotide arrays and RNA amplification protocols for analysis of transcript structure and alternative splicing.

    Science.gov (United States)

    Castle, John; Garrett-Engele, Phil; Armour, Christopher D; Duenwald, Sven J; Loerch, Patrick M; Meyer, Michael R; Schadt, Eric E; Stoughton, Roland; Parrish, Mark L; Shoemaker, Daniel D; Johnson, Jason M

    2003-01-01

    Microarrays offer a high-resolution means for monitoring pre-mRNA splicing on a genomic scale. We have developed a novel, unbiased amplification protocol that permits labeling of entire transcripts. Also, hybridization conditions, probe characteristics, and analysis algorithms were optimized for detection of exons, exon-intron edges, and exon junctions. These optimized protocols can be used to detect small variations and isoform mixtures, map the tissue specificity of known human alternative isoforms, and provide a robust, scalable platform for high-throughput discovery of alternative splicing.

  19. Reproducible analysis of sequencing-based RNA structure probing data with user-friendly tools

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz Jan; Sidiropoulos, Nikos; Vinther, Jeppe

    2015-01-01

    time also made analysis of the data challenging for scientists without formal training in computational biology. Here, we discuss different strategies for data analysis of massive parallel sequencing-based structure-probing data. To facilitate reproducible and standardized analysis of this type of data...

  20. SL1 revisited: functional analysis of the structure and conformation of HIV-1 genome RNA.

    Science.gov (United States)

    Sakuragi, Sayuri; Yokoyama, Masaru; Shioda, Tatsuo; Sato, Hironori; Sakuragi, Jun-Ichi

    2016-11-11

    The dimer initiation site/dimer linkage sequence (DIS/DLS) region of HIV is located on the 5' end of the viral genome and suggested to form complex secondary/tertiary structures. Within this structure, stem-loop 1 (SL1) is believed to be most important and an essential key to dimerization, since the sequence and predicted secondary structure of SL1 are highly stable and conserved among various virus subtypes. In particular, a six-base palindromic sequence is always present at the hairpin loop of SL1 and the formation of kissing-loop structure at this position between the two strands of genomic RNA is suggested to trigger dimerization. Although the higher-order structure model of SL1 is well accepted and perhaps even undoubted lately, there could be stillroom for consideration to depict the functional SL1 structure while in vivo (in virion or cell). In this study, we performed several analyses to identify the nucleotides and/or basepairing within SL1 which are necessary for HIV-1 genome dimerization, encapsidation, recombination and infectivity. We unexpectedly found that some nucleotides that are believed to contribute the formation of the stem do not impact dimerization or infectivity. On the other hand, we found that one G-C basepair involved in stem formation may serve as an alternative dimer interactive site. We also report on our further investigation of the roles of the palindromic sequences on viral replication. Collectively, we aim to assemble a more-comprehensive functional map of SL1 on the HIV-1 viral life cycle. We discovered several possibilities for a novel structure of SL1 in HIV-1 DLS. The newly proposed structure model suggested that the hairpin loop of SL1 appeared larger, and genome dimerization process might consist of more complicated mechanism than previously understood. Further investigations would be still required to fully understand the genome packaging and dimerization of HIV.

  1. Fatgraph models of RNA structure

    Directory of Open Access Journals (Sweden)

    Huang Fenix

    2017-01-01

    Full Text Available In this review paper we discuss fatgraphs as a conceptual framework for RNA structures. We discuss various notions of coarse-grained RNA structures and relate them to fatgraphs.We motivate and discuss the main intuition behind the fatgraph model and showcase its applicability to canonical as well as noncanonical base pairs. Recent discoveries regarding novel recursions of pseudoknotted (pk configurations as well as their translation into context-free grammars for pk-structures are discussed. This is shown to allow for extending the concept of partition functions of sequences w.r.t. a fixed structure having non-crossing arcs to pk-structures. We discuss minimum free energy folding of pk-structures and combine these above results outlining how to obtain an inverse folding algorithm for PK structures.

  2. Small RNA sequence analysis of adenovirus VA RNA-derived miRNAs reveals an unexpected serotype-specific difference in structure and abundance.

    Directory of Open Access Journals (Sweden)

    Wael Kamel

    Full Text Available Human adenoviruses (HAds encode for one or two highly abundant virus-associated RNAs, designated VA RNAI and VA RNAII, which fold into stable hairpin structures resembling miRNA precursors. Here we show that the terminal stem of the VA RNAs originating from Ad4, Ad5, Ad11 and Ad37, all undergo Dicer dependent processing into virus-specific miRNAs (so-called mivaRNAs. We further show that the mivaRNA duplex is subjected to a highly asymmetric RISC loading with the 3'-strand from all VA RNAs being the favored strand, except for the Ad37 VA RNAII, where the 5'-mivaRNAII strand was preferentially assembled into RISC. Although the mivaRNA seed sequences are not fully conserved between the HAds a bioinformatics prediction approach suggests that a large fraction of the VA RNAII-, but not the VA RNAI-derived mivaRNAs still are able to target the same cellular genes. Using small RNA deep sequencing we demonstrate that the Dicer processing event in the terminal stem of the VA RNAs is not unique and generates 3'-mivaRNAs with a slight variation of the position of the 5' terminal nucleotide in the RISC loaded guide strand. Also, we show that all analyzed VA RNAs, except Ad37 VA RNAI and Ad5 VA RNAII, utilize an alternative upstream A start site in addition to the classical +1 G start site. Further, the 5'-mivaRNAs with an A start appears to be preferentially incorporated into RISC. Although the majority of mivaRNA research has been done using Ad5 as the model system our analysis demonstrates that the mivaRNAs expressed in Ad11- and Ad37-infected cells are the most abundant mivaRNAs associated with Ago2-containing RISC. Collectively, our results show an unexpected variability in Dicer processing of the VA RNAs and a serotype-specific loading of mivaRNAs into Ago2-based RISC.

  3. Accelerated probabilistic inference of RNA structure evolution

    Directory of Open Access Journals (Sweden)

    Holmes Ian

    2005-03-01

    Full Text Available Abstract Background Pairwise stochastic context-free grammars (Pair SCFGs are powerful tools for evolutionary analysis of RNA, including simultaneous RNA sequence alignment and secondary structure prediction, but the associated algorithms are intensive in both CPU and memory usage. The same problem is faced by other RNA alignment-and-folding algorithms based on Sankoff's 1985 algorithm. It is therefore desirable to constrain such algorithms, by pre-processing the sequences and using this first pass to limit the range of structures and/or alignments that can be considered. Results We demonstrate how flexible classes of constraint can be imposed, greatly reducing the computational costs while maintaining a high quality of structural homology prediction. Any score-attributed context-free grammar (e.g. energy-based scoring schemes, or conditionally normalized Pair SCFGs is amenable to this treatment. It is now possible to combine independent structural and alignment constraints of unprecedented general flexibility in Pair SCFG alignment algorithms. We outline several applications to the bioinformatics of RNA sequence and structure, including Waterman-Eggert N-best alignments and progressive multiple alignment. We evaluate the performance of the algorithm on test examples from the RFAM database. Conclusion A program, Stemloc, that implements these algorithms for efficient RNA sequence alignment and structure prediction is available under the GNU General Public License.

  4. RNA Structural Alignments, Part I

    DEFF Research Database (Denmark)

    Havgaard, Jakob Hull; Gorodkin, Jan

    2014-01-01

    Simultaneous alignment and secondary structure prediction of RNA sequences is often referred to as "RNA structural alignment." A class of the methods for structural alignment is based on the principles proposed by Sankoff more than 25 years ago. The Sankoff algorithm simultaneously folds and aligns...... is so high that it took more than a decade before the first implementation of a Sankoff style algorithm was published. However, with the faster computers available today and the improved heuristics used in the implementations the Sankoff-based methods have become practical. This chapter describes...... the methods based on the Sankoff algorithm. All the practical implementations of the algorithm use heuristics to make them run in reasonable time and memory. These heuristics are also described in this chapter....

  5. On RNA-RNA interaction structures of fixed topological genus.

    Science.gov (United States)

    Fu, Benjamin M M; Han, Hillary S W; Reidys, Christian M

    2015-04-01

    Interacting RNA complexes are studied via bicellular maps using a filtration via their topological genus. Our main result is a new bijection for RNA-RNA interaction structures and a linear time uniform sampling algorithm for RNA complexes of fixed topological genus. The bijection allows to either reduce the topological genus of a bicellular map directly, or to lose connectivity by decomposing the complex into a pair of single stranded RNA structures. Our main result is proved bijectively. It provides an explicit algorithm of how to rewire the corresponding complexes and an unambiguous decomposition grammar. Using the concept of genus induction, we construct bicellular maps of fixed topological genus g uniformly in linear time. We present various statistics on these topological RNA complexes and compare our findings with biological complexes. Furthermore we show how to construct loop-energy based complexes using our decomposition grammar. Copyright © 2015 Elsevier Inc. All rights reserved.

  6. Protein Structure and the Sequential Structure of mRNA

    DEFF Research Database (Denmark)

    Brunak, Søren; Engelbrecht, Jacob

    1996-01-01

    entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment, By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets, These signals do not originate from......A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed, We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting...... protein, The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain, A complete search for GenBank nucleotide sequences coding for structural...

  7. Analysis of bacterial core communities in the central Baltic by comparative RNA-DNA-based fingerprinting provides links to structure-function relationships.

    Science.gov (United States)

    Brettar, Ingrid; Christen, Richard; Höfle, Manfred G

    2012-01-01

    Understanding structure-function links of microbial communities is a central theme of microbial ecology since its beginning. To this end, we studied the spatial variability of the bacterioplankton community structure and composition across the central Baltic Sea at four stations, which were up to 450 km apart and at a depth profile representative for the central part (Gotland Deep, 235 m). Bacterial community structure was followed by 16S ribosomal RNA (rRNA)- and 16S rRNA gene-based fingerprints using single-strand conformation polymorphism (SSCP) electrophoresis. Species composition was determined by sequence analysis of SSCP bands. High similarities of the bacterioplankton communities across several hundred kilometers were observed in the surface water using RNA- and DNA-based fingerprints. In these surface communities, the RNA- and DNA-based fingerprints resulted in very different pattern, presumably indicating large difference between the active members of the community as represented by RNA-based fingerprints and the present members represented by the DNA-based fingerprints. This large discrepancy changed gradually over depth, resulting in highly similar RNA- and DNA-based fingerprints in the anoxic part of the water column below 130 m depth. A conceivable mechanism explaining this high similarity could be the reduced oxidative stress in the anoxic zone. The stable communities on the surface and in the anoxic zone indicate the strong influence of the hydrography on the bacterioplankton community structure. Comparative analysis of RNA- and DNA-based community structure provided criteria for the identification of the core community, its key members and their links to biogeochemical functions.

  8. Visualization of RNA structure models within the Integrative Genomics Viewer.

    Science.gov (United States)

    Busan, Steven; Weeks, Kevin M

    2017-07-01

    Analyses of the interrelationships between RNA structure and function are increasingly important components of genomic studies. The SHAPE-MaP strategy enables accurate RNA structure probing and realistic structure modeling of kilobase-length noncoding RNAs and mRNAs. Existing tools for visualizing RNA structure models are not suitable for efficient analysis of long, structurally heterogeneous RNAs. In addition, structure models are often advantageously interpreted in the context of other experimental data and gene annotation information, for which few tools currently exist. We have developed a module within the widely used and well supported open-source Integrative Genomics Viewer (IGV) that allows visualization of SHAPE and other chemical probing data, including raw reactivities, data-driven structural entropies, and data-constrained base-pair secondary structure models, in context with linear genomic data tracks. We illustrate the usefulness of visualizing RNA structure in the IGV by exploring structure models for a large viral RNA genome, comparing bacterial mRNA structure in cells with its structure under cell- and protein-free conditions, and comparing a noncoding RNA structure modeled using SHAPE data with a base-pairing model inferred through sequence covariation analysis. © 2017 Busan and Weeks; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  9. Variation in secondary structure of the 16S rRNA molecule in cyanobacteria with implications for phylogenetic analysis

    Czech Academy of Sciences Publication Activity Database

    Řeháková, Klára; Johansen, J. R.; Bowen, M.B.; Martin, M.P.; Sheil, C.A.

    2014-01-01

    Roč. 14, č. 2 (2014), s. 161-178 ISSN 1802-5439 Institutional support: RVO:60077344 Keywords : 16S rRNA secondary structure * cyanobacteria * phylogeny Subject RIV: EE - Microbiology, Virology Impact factor: 1.930, year: 2014

  10. Rapid NMR screening of RNA secondary structure and binding

    International Nuclear Information System (INIS)

    Helmling, Christina; Keyhani, Sara; Sochor, Florian; Fürtig, Boris; Hengesbach, Martin; Schwalbe, Harald

    2015-01-01

    Determination of RNA secondary structures by NMR spectroscopy is a useful tool e.g. to elucidate RNA folding space or functional aspects of regulatory RNA elements. However, current approaches of RNA synthesis and preparation are usually time-consuming and do not provide analysis with single nucleotide precision when applied for a large number of different RNA sequences. Here, we significantly improve the yield and 3′ end homogeneity of RNA preparation by in vitro transcription. Further, by establishing a native purification procedure with increased throughput, we provide a shortcut to study several RNA constructs simultaneously. We show that this approach yields μmol quantities of RNA with purities comparable to PAGE purification, while avoiding denaturation of the RNA

  11. Rapid NMR screening of RNA secondary structure and binding

    Energy Technology Data Exchange (ETDEWEB)

    Helmling, Christina; Keyhani, Sara; Sochor, Florian; Fürtig, Boris; Hengesbach, Martin; Schwalbe, Harald, E-mail: schwalbe@nmr.uni-frankfurt.de [Johann Wolfgang Goethe-Universität, Institut für Organische Chemie und Chemische Biologie, Center for Biomolecular Magnetic Resonance (BMRZ) (Germany)

    2015-09-15

    Determination of RNA secondary structures by NMR spectroscopy is a useful tool e.g. to elucidate RNA folding space or functional aspects of regulatory RNA elements. However, current approaches of RNA synthesis and preparation are usually time-consuming and do not provide analysis with single nucleotide precision when applied for a large number of different RNA sequences. Here, we significantly improve the yield and 3′ end homogeneity of RNA preparation by in vitro transcription. Further, by establishing a native purification procedure with increased throughput, we provide a shortcut to study several RNA constructs simultaneously. We show that this approach yields μmol quantities of RNA with purities comparable to PAGE purification, while avoiding denaturation of the RNA.

  12. Predicting RNA Structure Using Mutual Information

    DEFF Research Database (Denmark)

    Freyhult, E.; Moulton, V.; Gardner, P. P.

    2005-01-01

    , to display and predict conserved RNA secondary structure (including pseudoknots) from an alignment. Results: We show that MIfold can be used to predict simple pseudoknots, and that the performance can be adjusted to make it either more sensitive or more selective. We also demonstrate that the overall...... package. Conclusion: MIfold provides a useful supplementary tool to programs such as RNA Structure Logo, RNAalifold and COVE, and should be useful for automatically generating structural predictions for databases such as Rfam. Availability: MIfold is freely available from http......Background: With the ever-increasing number of sequenced RNAs and the establishment of new RNA databases, such as the Comparative RNA Web Site and Rfam, there is a growing need for accurately and automatically predicting RNA structures from multiple alignments. Since RNA secondary structure...

  13. ncRNA consensus secondary structure derivation using grammar strings.

    Science.gov (United States)

    Achawanantakun, Rujira; Sun, Yanni; Takyar, Seyedeh Shohreh

    2011-04-01

    Many noncoding RNAs (ncRNAs) function through both their sequences and secondary structures. Thus, secondary structure derivation is an important issue in today's RNA research. The state-of-the-art structure annotation tools are based on comparative analysis, which derives consensus structure of homologous ncRNAs. Despite promising results from existing ncRNA aligning and consensus structure derivation tools, there is a need for more efficient and accurate ncRNA secondary structure modeling and alignment methods. In this work, we introduce a consensus structure derivation approach based on grammar string, a novel ncRNA secondary structure representation that encodes an ncRNA's sequence and secondary structure in the parameter space of a context-free grammar (CFG) and a full RNA grammar including pseudoknots. Being a string defined on a special alphabet constructed from a grammar, grammar string converts ncRNA alignment into sequence alignment. We derive consensus secondary structures from hundreds of ncRNA families from BraliBase 2.1 and 25 families containing pseudoknots using grammar string alignment. Our experiments have shown that grammar string-based structure derivation competes favorably in consensus structure quality with Murlet and RNASampler. Source code and experimental data are available at http://www.cse.msu.edu/~yannisun/grammar-string.

  14. Functional RNA structures throughout the Hepatitis C Virus genome.

    Science.gov (United States)

    Adams, Rebecca L; Pirakitikulr, Nathan; Pyle, Anna Marie

    2017-06-01

    The single-stranded Hepatitis C Virus (HCV) genome adopts a set of elaborate RNA structures that are involved in every stage of the viral lifecycle. Recent advances in chemical probing, sequencing, and structural biology have facilitated analysis of RNA folding on a genome-wide scale, revealing novel structures and networks of interactions. These studies have underscored the active role played by RNA in every function of HCV and they open the door to new types of RNA-targeted therapeutics. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Probing of RNA structures in a positive sense RNA virus reveals selection pressures for structural elements

    Science.gov (United States)

    Watters, Kyle E; Choudhary, Krishna; Aviran, Sharon; Perry, Keith L

    2018-01-01

    Abstract In single stranded (+)-sense RNA viruses, RNA structural elements (SEs) play essential roles in the infection process from replication to encapsidation. Using selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq) and covariation analysis, we explore the structural features of the third genome segment of cucumber mosaic virus (CMV), RNA3 (2216 nt), both in vitro and in plant cell lysates. Comparing SHAPE-Seq and covariation analysis results revealed multiple SEs in the coat protein open reading frame and 3′ untranslated region. Four of these SEs were mutated and serially passaged in Nicotiana tabacum plants to identify biologically selected changes to the original mutated sequences. After passaging, loop mutants showed partial reversion to their wild-type sequence and SEs that were structurally disrupted by mutations were restored to wild-type-like structures via synonymous mutations in planta. These results support the existence and selection of virus open reading frame SEs in the host organism and provide a framework for further studies on the role of RNA structure in viral infection. Additionally, this work demonstrates the applicability of high-throughput chemical probing in plant cell lysates and presents a new method for calculating SHAPE reactivities from overlapping reverse transcriptase priming sites. PMID:29294088

  16. Four RNA families with functional transient structures.

    Science.gov (United States)

    Zhu, Jing Yun A; Meyer, Irmtraud M

    2015-01-01

    Protein-coding and non-coding RNA transcripts perform a wide variety of cellular functions in diverse organisms. Several of their functional roles are expressed and modulated via RNA structure. A given transcript, however, can have more than a single functional RNA structure throughout its life, a fact which has been previously overlooked. Transient RNA structures, for example, are only present during specific time intervals and cellular conditions. We here introduce four RNA families with transient RNA structures that play distinct and diverse functional roles. Moreover, we show that these transient RNA structures are structurally well-defined and evolutionarily conserved. Since Rfam annotates one structure for each family, there is either no annotation for these transient structures or no such family. Thus, our alignments either significantly update and extend the existing Rfam families or introduce a new RNA family to Rfam. For each of the four RNA families, we compile a multiple-sequence alignment based on experimentally verified transient and dominant (dominant in terms of either the thermodynamic stability and/or attention received so far) RNA secondary structures using a combination of automated search via covariance model and manual curation. The first alignment is the Trp operon leader which regulates the operon transcription in response to tryptophan abundance through alternative structures. The second alignment is the HDV ribozyme which we extend to the 5' flanking sequence. This flanking sequence is involved in the regulation of the transcript's self-cleavage activity. The third alignment is the 5' UTR of the maturation protein from Levivirus which contains a transient structure that temporarily postpones the formation of the final inhibitory structure to allow translation of maturation protein. The fourth and last alignment is the SAM riboswitch which regulates the downstream gene expression by assuming alternative structures upon binding of SAM. All

  17. SHAPE analysis of the FIV Leader RNA reveals a structural switch potentially controlling viral packaging and genome dimerization.

    Science.gov (United States)

    Kenyon, Julia C; Tanner, Sian J; Legiewicz, Michal; Phillip, Pretty S; Rizvi, Tahir A; Le Grice, Stuart F J; Lever, Andrew M L

    2011-08-01

    Feline immunodeficiency virus (FIV) infects many species of cat, and is related to HIV, causing a similar pathology. High-throughput selective 2' hydroxyl acylation analysed by primer extension (SHAPE), a technique that allows structural interrogation at each nucleotide, was used to map the secondary structure of the FIV packaging signal RNA. Previous studies of this RNA showed four conserved stem-loops, extensive long-range interactions (LRIs) and a small, palindromic stem-loop (SL5) within the gag open reading frame (ORF) that may act as a dimerization initiation site (DIS), enabling the virus to package two copies of its genome. Our analyses of wild-type (wt) and mutant RNAs suggest that although the four conserved stem-loops are static structures, the 5' and 3' regions previously shown to form LRI also adopt an alternative, yet similarly conserved conformation, in which the putative DIS is occluded, and which may thus favour translational and splicing functions over encapsidation. SHAPE and in vitro dimerization assays were used to examine SL5 mutants. Dimerization contacts appear to be made between palindromic loop sequences in SL5. As this stem-loop is located within the gag ORF, recognition of a dimeric RNA provides a possible mechanism for the specific packaging of genomic over spliced viral RNAs.

  18. Computational RNA secondary structure design: empirical complexity and improved methods

    Directory of Open Access Journals (Sweden)

    Condon Anne

    2007-01-01

    Full Text Available Abstract Background We investigate the empirical complexity of the RNA secondary structure design problem, that is, the scaling of the typical difficulty of the design task for various classes of RNA structures as the size of the target structure is increased. The purpose of this work is to understand better the factors that make RNA structures hard to design for existing, high-performance algorithms. Such understanding provides the basis for improving the performance of one of the best algorithms for this problem, RNA-SSD, and for characterising its limitations. Results To gain insights into the practical complexity of the problem, we present a scaling analysis on random and biologically motivated structures using an improved version of the RNA-SSD algorithm, and also the RNAinverse algorithm from the Vienna package. Since primary structure constraints are relevant for designing RNA structures, we also investigate the correlation between the number and the location of the primary structure constraints when designing structures and the performance of the RNA-SSD algorithm. The scaling analysis on random and biologically motivated structures supports the hypothesis that the running time of both algorithms scales polynomially with the size of the structure. We also found that the algorithms are in general faster when constraints are placed only on paired bases in the structure. Furthermore, we prove that, according to the standard thermodynamic model, for some structures that the RNA-SSD algorithm was unable to design, there exists no sequence whose minimum free energy structure is the target structure. Conclusion Our analysis helps to better understand the strengths and limitations of both the RNA-SSD and RNAinverse algorithms, and suggests ways in which the performance of these algorithms can be further improved.

  19. Analysis of intermolecular RNA-RNA recombination by rubella virus

    International Nuclear Information System (INIS)

    Adams, Sandra D.; Tzeng, W.-P.; Chen, M.-H.; Frey, Teryl K.

    2003-01-01

    To investigate whether rubella virus (RUB) undergoes intermolecular RNA-RNA recombination, cells were cotransfected with pairs of in vitro transcripts from genomic cDNA plasmid vectors engineered to contain nonoverlapping deletions: the replicative transcript maintained the 5'-proximal nonstructural (NS) ORF (which contained the replicase, making it RNA replication competent), had a deletion in the 3'-proximal structural protein (SP) ORF, and maintained the 3' end of the genome, including the putative 3' cis-acting elements (CSE), while the nonreplicative transcript consisted of the 3' half of the genome including the SP-ORF and 3' CSE. Cotransfection yielded plaque-forming virus that synthesized the standard genomic and subgenomic RNAs and thus was generated by RNA-RNA recombination. Using transcripts tagged with a 3'-terminal deletion, it was found that recombinants contained the 3' end derived from the replicative strand, indicating a cis-preference for initiation of negative-strand synthesis. In cotransfections in which the replicative transcript lacked the 3' CSE, recombination occurred, albeit at lower efficiency, indicating that initiation in trans from the NS-ORF can occur. The 3' CSE was sufficient as a nonreplicative transcript, showing that it can serve as a promoter for negative-strand RNA synthesis. While deletion mutagenesis showed that the presence of the junction untranslated region (J-UTR) between the ORFs appeared to be necessary on both transcripts for recombination in this region of the genome, analysis with transcripts tagged with restriction sites showed that the J-UTR was not a hot spot for recombination compared to neighboring regions in both ORFs. Sequence analysis of recombinants revealed that both precise (homologous) and imprecise recombination (aberrant, homologous resulting in duplications) occurred; however, imprecise recombination only involved the J-UTR or the 3' end of the NS-ORF and the J-UTR (maintaining the NS-ORF), indicating

  20. Small catalytic RNA: Structure, function and application

    Energy Technology Data Exchange (ETDEWEB)

    Monforte, Joseph Albert [Univ. of California, Berkeley, CA (United States)

    1991-04-01

    We have utilized a combination of photochemical cross-linking techniques and site-directed mutagenesis to obtain secondary and tertiary structure information for the self-cleaving, self-ligating subsequence of RNA from the negative strand of Satellite Tobacco Ringspot Virus. We have found that the helical regions fold about a hinge to promoting four different possible tertiary interactions, creating a molecular of similar shape to a paperclip. A model suggesting that the ``paperclip`` and ``hammerhead`` RNAs share a similar three dimensional structure is proposed. We have used a self-cleaving RNA molecule related to a subsequence of plant viroids, a ``hammerhead,`` to study the length-dependent folding of RNA produced during transcription by RNA polymerase. We have used this method to determine the length of RNA sequestered within elongating E. coli and T7 RNA polymerase complexes. The data show that for E. coli RNA polymerase 121±s are sequestered within the ternary complex, which is consistent with the presence of an RNA-DNA hybrid within the transcription bubble, as proposed by others. The result for T7 RNA polymerase differs from E. coli RNA polymerase, with only 10{plus_minus}1 nucleotides sequestered within the ternary complex, setting a new upper limit for the minimum RNA-DNA required for a stable elongating complex. Comparisons between E. coli and T7 RNA polymerase are made. The relevance of the results to models or transcription termination, abortive initiation, and initiation to elongation mode transitions are discussed.

  1. RNA 3D modules in genome-wide predictions of RNA 2D structure

    DEFF Research Database (Denmark)

    Theis, Corinna; Zirbel, Craig L; Zu Siederdissen, Christian Höner

    2015-01-01

    . These modules can, for example, occur inside structural elements which in RNA 2D predictions appear as internal loops. Hence one question is if the use of such RNA 3D information can improve the prediction accuracy of RNA secondary structure at a genome-wide level. Here, we use RNAz in combination with 3D......Recent experimental and computational progress has revealed a large potential for RNA structure in the genome. This has been driven by computational strategies that exploit multiple genomes of related organisms to identify common sequences and secondary structures. However, these computational...... approaches have two main challenges: they are computationally expensive and they have a relatively high false discovery rate (FDR). Simultaneously, RNA 3D structure analysis has revealed modules composed of non-canonical base pairs which occur in non-homologous positions, apparently by independent evolution...

  2. Direct, rapid RNA sequence analysis

    International Nuclear Information System (INIS)

    Peattie, D.A.

    1987-01-01

    The original methods of RNA sequence analysis were based on enzymatic production and chromatographic separation of overlapping oligonucleotide fragments from within an RNA molecule followed by identification of the mononucleotides comprising the oligomer. Over the past decade the field of nucleic acid sequencing has changed dramatically, however, and RNA molecules now can be sequenced in a variety of more streamlined fashions. Most of the more recent advances in RNA sequencing have involved one-dimensional electrophoretic separation of 32 P-end-labeled oligoribonucleotides on polyacrylamide gels. In this chapter the author discusses two of these methods for determining the nucleotide sequences of RNA molecules rapidly: the chemical method and the enzymatic method. Both methods are direct and degradative, i.e., they rely on fragmatic and chemical approaches should be utilized. The single-strand-specific ribonucleases (A, T 1 , T 2 , and S 1 ) provide an efficient means to locate double-helical regions rapidly, and the chemical reactions provide a means to determine the RNA sequence within these regions. In addition, the chemical reactions allow one to assign interactions to specific atoms and to distinguish secondary interactions from tertiary ones. If the RNA molecule is small enough to be sequenced directly by the enzymatic or chemical method, the probing reactions can be done easily at the same time as sequencing reactions

  3. A comprehensive comparison of comparative RNA structure prediction approaches

    DEFF Research Database (Denmark)

    Gardner, P. P.; Giegerich, R.

    2004-01-01

    -finding and multiple-sequence-alignment algorithms. Results Here we evaluate a number of RNA folding algorithms using reliable RNA data-sets and compare their relative performance. Conclusions We conclude that comparative data can enhance structure prediction but structure-prediction-algorithms vary widely in terms......Background An increasing number of researchers have released novel RNA structure analysis and prediction algorithms for comparative approaches to structure prediction. Yet, independent benchmarking of these algorithms is rarely performed as is now common practice for protein-folding, gene...

  4. Structural analysis of malaria-parasite lysyl-tRNA synthetase provides a platform for drug development.

    Science.gov (United States)

    Khan, Sameena; Garg, Ankur; Camacho, Noelia; Van Rooyen, Jason; Kumar Pole, Anil; Belrhali, Hassan; Ribas de Pouplana, Lluis; Sharma, Vinay; Sharma, Amit

    2013-05-01

    Aminoacyl-tRNA synthetases are essential enzymes that transmit information from the genetic code to proteins in cells and are targets for antipathogen drug development. Elucidation of the crystal structure of cytoplasmic lysyl-tRNA synthetase from the malaria parasite Plasmodium falciparum (PfLysRS) has allowed direct comparison with human LysRS. The authors' data suggest that PfLysRS is dimeric in solution, whereas the human counterpart can also adopt tetrameric forms. It is shown for the first time that PfLysRS is capable of synthesizing the signalling molecule Ap4a (diadenosine tetraphosphate) using ATP as a substrate. The PfLysRS crystal structure is in the apo form, such that binding to ATP will require rotameric changes in four conserved residues. Differences in the active-site regions of parasite and human LysRSs suggest the possibility of exploiting PfLysRS for selective inhibition. These investigations on PfLysRS further validate malarial LysRSs as attractive antimalarial targets and provide new structural space for the development of inhibitors that target pathogen LysRSs selectively.

  5. Facilitating RNA structure prediction with microarrays.

    Science.gov (United States)

    Kierzek, Elzbieta; Kierzek, Ryszard; Turner, Douglas H; Catrina, Irina E

    2006-01-17

    Determining RNA secondary structure is important for understanding structure-function relationships and identifying potential drug targets. This paper reports the use of microarrays with heptamer 2'-O-methyl oligoribonucleotides to probe the secondary structure of an RNA and thereby improve the prediction of that secondary structure. When experimental constraints from hybridization results are added to a free-energy minimization algorithm, the prediction of the secondary structure of Escherichia coli 5S rRNA improves from 27 to 92% of the known canonical base pairs. Optimization of buffer conditions for hybridization and application of 2'-O-methyl-2-thiouridine to enhance binding and improve discrimination between AU and GU pairs are also described. The results suggest that probing RNA with oligonucleotide microarrays can facilitate determination of secondary structure.

  6. Secondary Structural Models (16S rRNA of Polyhydroxyalkanoates Producing Bacillus Species Isolated from Different Rhizospheric Soil: Phylogenetics and Chemical Analysis

    Directory of Open Access Journals (Sweden)

    Swati Mohapatra

    2016-09-01

    Full Text Available Polyhydroxyalkanoates (PHAs producing bacterial isolates are gaining more importance over the world due to the synthesis of a biodegradable polymer which is extremely desirable to substitute synthetic plastics. PHAs are produced by various microorganisms under certain stress conditions. In this study, sixteen bacterial isolates characterized previously by partial 16S rRNA gene sequencing (NCBI Accession No. KF626466 to KF626481 were again stained by Nile red after three years of preservation in order to confirm their ability to accumulate PHAs. Also, phylogenetic analysis carried out in the present investigation evidenced that the bacterial species belonging to genus Bacillus are the dominant flora of the rhizospheric region, with a potentiality of biodegradable polymer (PHAs production. Again, RNA secondary structure prediction hypothesized that there is no direct correlation between RNA folding pattern stability with a rate of PHAs production among the selected isolates of genus Bacillus.

  7. Combining NMR and small angle X-ray and neutron scattering in the structural analysis of a ternary protein-RNA complex

    International Nuclear Information System (INIS)

    Hennig, Janosch; Wang, Iren; Sonntag, Miriam; Gabel, Frank; Sattler, Michael

    2013-01-01

    Many processes in the regulation of gene expression and signaling involve the formation of protein complexes involving multi-domain proteins. Individual domains that mediate protein-protein and protein-nucleic acid interactions are typically connected by flexible linkers, which contribute to conformational dynamics and enable the formation of complexes with distinct binding partners. Solution techniques are therefore required for structural analysis and to characterize potential conformational dynamics. Nuclear magnetic resonance spectroscopy (NMR) provides such information but often only sparse data are obtained with increasing molecular weight of the complexes. It is therefore beneficial to combine NMR data with additional structural restraints from complementary solution techniques. Small angle X-ray/neutron scattering (SAXS/SANS) data can be efficiently combined with NMR-derived information, either for validation or by providing additional restraints for structural analysis. Here, we show that the combination of SAXS and SANS data can help to refine structural models obtained from data-driven docking using HADDOCK based on sparse NMR data. The approach is demonstrated with the ternary protein-protein-RNA complex involving two RNA recognition motif (RRM) domains of Sex-lethal, the N-terminal cold shock domain of Upstream-to-N-Ras, and msl-2 mRNA. Based on chemical shift perturbations we have mapped protein-protein and protein-RNA interfaces and complemented this NMR-derived information with SAXS data, as well as SANS measurements on subunit-selectively deuterated samples of the ternary complex. Our results show that, while the use of SAXS data is beneficial, the additional combination with contrast variation in SANS data resolves remaining ambiguities and improves the docking based on chemical shift perturbations of the ternary protein-RNA complex.

  8. Combining NMR and small angle X-ray and neutron scattering in the structural analysis of a ternary protein-RNA complex

    Energy Technology Data Exchange (ETDEWEB)

    Hennig, Janosch; Wang, Iren; Sonntag, Miriam [Institute of Structural Biology, Helmholtz Zentrum Muenchen (Germany); Gabel, Frank [Extremophiles and Large Molecular Assemblies Group (ELMA), Institut de Biologie Structurale (IBS) CEA-CNRS-UJF (France); Sattler, Michael, E-mail: sattler@helmholtz-muenchen.de [Institute of Structural Biology, Helmholtz Zentrum Muenchen (Germany)

    2013-05-15

    Many processes in the regulation of gene expression and signaling involve the formation of protein complexes involving multi-domain proteins. Individual domains that mediate protein-protein and protein-nucleic acid interactions are typically connected by flexible linkers, which contribute to conformational dynamics and enable the formation of complexes with distinct binding partners. Solution techniques are therefore required for structural analysis and to characterize potential conformational dynamics. Nuclear magnetic resonance spectroscopy (NMR) provides such information but often only sparse data are obtained with increasing molecular weight of the complexes. It is therefore beneficial to combine NMR data with additional structural restraints from complementary solution techniques. Small angle X-ray/neutron scattering (SAXS/SANS) data can be efficiently combined with NMR-derived information, either for validation or by providing additional restraints for structural analysis. Here, we show that the combination of SAXS and SANS data can help to refine structural models obtained from data-driven docking using HADDOCK based on sparse NMR data. The approach is demonstrated with the ternary protein-protein-RNA complex involving two RNA recognition motif (RRM) domains of Sex-lethal, the N-terminal cold shock domain of Upstream-to-N-Ras, and msl-2 mRNA. Based on chemical shift perturbations we have mapped protein-protein and protein-RNA interfaces and complemented this NMR-derived information with SAXS data, as well as SANS measurements on subunit-selectively deuterated samples of the ternary complex. Our results show that, while the use of SAXS data is beneficial, the additional combination with contrast variation in SANS data resolves remaining ambiguities and improves the docking based on chemical shift perturbations of the ternary protein-RNA complex.

  9. Large scale comparative codon-pair context analysis unveils general rules that fine-tune evolution of mRNA primary structure.

    Directory of Open Access Journals (Sweden)

    Gabriela Moura

    Full Text Available BACKGROUND: Codon usage and codon-pair context are important gene primary structure features that influence mRNA decoding fidelity. In order to identify general rules that shape codon-pair context and minimize mRNA decoding error, we have carried out a large scale comparative codon-pair context analysis of 119 fully sequenced genomes. METHODOLOGIES/PRINCIPAL FINDINGS: We have developed mathematical and software tools for large scale comparative codon-pair context analysis. These methodologies unveiled general and species specific codon-pair context rules that govern evolution of mRNAs in the 3 domains of life. We show that evolution of bacterial and archeal mRNA primary structure is mainly dependent on constraints imposed by the translational machinery, while in eukaryotes DNA methylation and tri-nucleotide repeats impose strong biases on codon-pair context. CONCLUSIONS: The data highlight fundamental differences between prokaryotic and eukaryotic mRNA decoding rules, which are partially independent of codon usage.

  10. MicroRNA-target binding structures mimic microRNA duplex structures in humans.

    Directory of Open Access Journals (Sweden)

    Xi Chen

    Full Text Available Traditionally, researchers match a microRNA guide strand to mRNA sequences using sequence comparisons to predict its potential target genes. However, many of the predictions can be false positives due to limitations in sequence comparison alone. In this work, we consider the association of two related RNA structures that share a common guide strand: the microRNA duplex and the microRNA-target binding structure. We have analyzed thousands of such structure pairs and found many of them share high structural similarity. Therefore, we conclude that when predicting microRNA target genes, considering just the microRNA guide strand matches to gene sequences may not be sufficient--the microRNA duplex structure formed by the guide strand and its companion passenger strand must also be considered. We have developed software to translate RNA binding structure into encoded representations, and we have also created novel automatic comparison methods utilizing such encoded representations to determine RNA structure similarity. Our software and methods can be utilized in the other RNA secondary structure comparisons as well.

  11. Small catalytic RNA: Structure, function and application

    Energy Technology Data Exchange (ETDEWEB)

    Monforte, J.A.

    1991-04-01

    We have utilized a combination of photochemical cross-linking techniques and site-directed mutagenesis to obtain secondary and tertiary structure information for the self-cleaving, self-ligating subsequence of RNA from the negative strand of Satellite Tobacco Ringspot Virus. We have found that the helical regions fold about a hinge to promoting four different possible tertiary interactions, creating a molecular of similar shape to a paperclip. A model suggesting that the paperclip'' and hammerhead'' RNAs share a similar three dimensional structure is proposed. We have used a self-cleaving RNA molecule related to a subsequence of plant viroids, a hammerhead,'' to study the length-dependent folding of RNA produced during transcription by RNA polymerase. We have used this method to determine the length of RNA sequestered within elongating E. coli and T7 RNA polymerase complexes. The data show that for E. coli RNA polymerase 12{plus minus}1 nucleotides are sequestered within the ternary complex, which is consistent with the presence of an RNA-DNA hybrid within the transcription bubble, as proposed by others. The result for T7 RNA polymerase differs from E. coli RNA polymerase, with only 10{plus minus}1 nucleotides sequestered within the ternary complex, setting a new upper limit for the minimum RNA-DNA required for a stable elongating complex. Comparisons between E. coli and T7 RNA polymerase are made. The relevance of the results to models or transcription termination, abortive initiation, and initiation to elongation mode transitions are discussed.

  12. New Comparative Analysis Based on the Secondary Structure of SSU-rRNA Gene Reveals the Evolutionary Trend and the Family-Genus Characters of Mobilida (Ciliophora, Peritrichia).

    Science.gov (United States)

    Zhang, Yong; Zhao, Yuan-Jun; Wang, Qin; Tang, Fa-Hui

    2015-08-01

    In order to reveal the structural evolutionary trend of Mobilida ciliates, twenty-six SSU-rRNA sequences of mobilid species, including seven ones newly sequenced in the present work, were used for comparative phylogenic analysis based on the RNA secondary structure. The research results indicate that all the secondary structures except domains Helix 10, Helix 12, and Helix 37 could be regarded as the criterions in classification between the family Trichodinidae and Urceolariida, and four regions including Helix E10-1, Helix 29, Helix 43, and Helix 45-Helix 46 could be as criterions in classification between the genus Trichodinella and Trichodina in family Trichodinidae. After the analysis of common structural feature within the Mobilida, it was found that the secondary structure of V6 could prove the family Urceolariidae primitive status. This research has further suggested that the genus Trichodina could be divergent earlier than Trichodinella in the family Trichodinidae. In addition, the relationship between the secondary structure and topology of phylogenic tree that the branching order of most clades corresponds with the secondary structure of species within each clade of phylogenetic tree was first uncovered and discussed in the present study.

  13. Free energy minimization to predict RNA secondary structures and computational RNA design.

    Science.gov (United States)

    Churkin, Alexander; Weinbrand, Lina; Barash, Danny

    2015-01-01

    Determining the RNA secondary structure from sequence data by computational predictions is a long-standing problem. Its solution has been approached in two distinctive ways. If a multiple sequence alignment of a collection of homologous sequences is available, the comparative method uses phylogeny to determine conserved base pairs that are more likely to form as a result of billions of years of evolution than by chance. In the case of single sequences, recursive algorithms that compute free energy structures by using empirically derived energy parameters have been developed. This latter approach of RNA folding prediction by energy minimization is widely used to predict RNA secondary structure from sequence. For a significant number of RNA molecules, the secondary structure of the RNA molecule is indicative of its function and its computational prediction by minimizing its free energy is important for its functional analysis. A general method for free energy minimization to predict RNA secondary structures is dynamic programming, although other optimization methods have been developed as well along with empirically derived energy parameters. In this chapter, we introduce and illustrate by examples the approach of free energy minimization to predict RNA secondary structures.

  14. Approaches to link RNA secondary structures with splicing regulation

    DEFF Research Database (Denmark)

    Plass, Mireya; Eyras, Eduardo

    2014-01-01

    In higher eukaryotes, alternative splicing is usually regulated by protein factors, which bind to the pre-mRNA and affect the recognition of splicing signals. There is recent evidence that the secondary structure of the pre-mRNA may also play an important role in this process, either by facilitat...... describes the steps in the analysis of the secondary structure of the pre-mRNA and its possible relation to splicing. As a working example, we use the case of yeast and the problem of the recognition of the 3' splice site (3'ss).......In higher eukaryotes, alternative splicing is usually regulated by protein factors, which bind to the pre-mRNA and affect the recognition of splicing signals. There is recent evidence that the secondary structure of the pre-mRNA may also play an important role in this process, either...

  15. RNA secondary structure prediction using soft computing.

    Science.gov (United States)

    Ray, Shubhra Sankar; Pal, Sankar K

    2013-01-01

    Prediction of RNA structure is invaluable in creating new drugs and understanding genetic diseases. Several deterministic algorithms and soft computing-based techniques have been developed for more than a decade to determine the structure from a known RNA sequence. Soft computing gained importance with the need to get approximate solutions for RNA sequences by considering the issues related with kinetic effects, cotranscriptional folding, and estimation of certain energy parameters. A brief description of some of the soft computing-based techniques, developed for RNA secondary structure prediction, is presented along with their relevance. The basic concepts of RNA and its different structural elements like helix, bulge, hairpin loop, internal loop, and multiloop are described. These are followed by different methodologies, employing genetic algorithms, artificial neural networks, and fuzzy logic. The role of various metaheuristics, like simulated annealing, particle swarm optimization, ant colony optimization, and tabu search is also discussed. A relative comparison among different techniques, in predicting 12 known RNA secondary structures, is presented, as an example. Future challenging issues are then mentioned.

  16. Retroviral RNA Dimerization: From Structure to Functions

    Directory of Open Access Journals (Sweden)

    Noé Dubois

    2018-03-01

    Full Text Available The genome of the retroviruses is a dimer composed by two homologous copies of genomic RNA (gRNA molecules of positive polarity. The dimerization process allows two gRNA molecules to be non-covalently linked together through intermolecular base-pairing. This step is critical for the viral life cycle and is highly conserved among retroviruses with the exception of spumaretroviruses. Furthermore, packaging of two gRNA copies into viral particles presents an important evolutionary advantage for immune system evasion and drug resistance. Recent studies reported RNA switches models regulating not only gRNA dimerization, but also translation and packaging, and a spatio-temporal characterization of viral gRNA dimerization within cells are now at hand. This review summarizes our current understanding on the structural features of the dimerization signals for a variety of retroviruses (HIVs, MLV, RSV, BLV, MMTV, MPMV…, the mechanisms of RNA dimer formation and functional implications in the retroviral cycle.

  17. Structure-function analysis of human TYW2 enzyme required for the biosynthesis of a highly modified Wybutosine (yW base in phenylalanine-tRNA.

    Directory of Open Access Journals (Sweden)

    Virginia Rodriguez

    Full Text Available Posttranscriptional modifications are critical for structure and function of tRNAs. Wybutosine (yW and its derivatives are hyper-modified guanosines found at the position 37 of eukaryotic and archaeal tRNA(Phe. TYW2 is an enzyme that catalyzes α-amino-α-carboxypropyl transfer activity at the third step of yW biogenesis. Using complementation of a ΔTYW2 strain, we demonstrate here that human TYW2 (hTYW2 is active in yeast and can synthesize the yW of yeast tRNA(Phe. Structure-guided analysis identified several conserved residues in hTYW2 that interact with S-adenosyl-methionine (AdoMet, and mutation studies revealed that K225 and E265 are critical residues for the enzymatic activity. We previously reported that the human TYW2 is overexpressed in breast cancer. However, no difference in the tRNA(Phe modification status was observed in either normal mouse tissue or a mouse tumor model that overexpresses Tyw2, indicating that hTYW2 may have a role in tumorigenesis unrelated to yW biogenesis.

  18. The art of editing RNA structural alignments

    DEFF Research Database (Denmark)

    Andersen, Ebbe Sloth

    2014-01-01

    Manual editing of RNA structural alignments may be considered more art than science, since it still requires an expert biologist to take multiple levels of information into account and be slightly creative when constructing high-quality alignments. Even though the task is rather tedious, it is re......Manual editing of RNA structural alignments may be considered more art than science, since it still requires an expert biologist to take multiple levels of information into account and be slightly creative when constructing high-quality alignments. Even though the task is rather tedious...

  19. Analysis of RNA binding by the dengue virus NS5 RNA capping enzyme.

    Directory of Open Access Journals (Sweden)

    Brittney R Henderson

    Full Text Available Flaviviruses are small, capped positive sense RNA viruses that replicate in the cytoplasm of infected cells. Dengue virus and other related flaviviruses have evolved RNA capping enzymes to form the viral RNA cap structure that protects the viral genome and directs efficient viral polyprotein translation. The N-terminal domain of NS5 possesses the methyltransferase and guanylyltransferase activities necessary for forming mature RNA cap structures. The mechanism for flavivirus guanylyltransferase activity is currently unknown, and how the capping enzyme binds its diphosphorylated RNA substrate is important for deciphering how the flavivirus guanylyltransferase functions. In this report we examine how flavivirus NS5 N-terminal capping enzymes bind to the 5' end of the viral RNA using a fluorescence polarization-based RNA binding assay. We observed that the K(D for RNA binding is approximately 200 nM Dengue, Yellow Fever, and West Nile virus capping enzymes. Removal of one or both of the 5' phosphates reduces binding affinity, indicating that the terminal phosphates contribute significantly to binding. RNA binding affinity is negatively affected by the presence of GTP or ATP and positively affected by S-adensyl methoninine (SAM. Structural superpositioning of the dengue virus capping enzyme with the Vaccinia virus VP39 protein bound to RNA suggests how the flavivirus capping enzyme may bind RNA, and mutagenesis analysis of residues in the putative RNA binding site demonstrate that several basic residues are critical for RNA binding. Several mutants show differential binding to 5' di-, mono-, and un-phosphorylated RNAs. The mode of RNA binding appears similar to that found with other methyltransferase enzymes, and a discussion of diphosphorylated RNA binding is presented.

  20. Solving the RNA polymerase I structural puzzle

    Energy Technology Data Exchange (ETDEWEB)

    Moreno-Morcillo, María [European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg (Germany); Taylor, Nicholas M. I. [Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu 9, 28040 Madrid (Spain); Gruene, Tim [Georg-August-University, Tammannstrasse 4, 37077 Göttingen (Germany); Legrand, Pierre [SOLEIL Synchrotron, L’Orme de Merisiers, Saint Aubin, Gif-sur-Yvette (France); Rashid, Umar J. [European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg (Germany); Ruiz, Federico M. [Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu 9, 28040 Madrid (Spain); Steuerwald, Ulrich; Müller, Christoph W. [European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg (Germany); Fernández-Tornero, Carlos, E-mail: cftornero@cib.csic.es [Consejo Superior de Investigaciones Científicas, Ramiro de Maeztu 9, 28040 Madrid (Spain); European Molecular Biology Laboratory, Meyerhofstrasse 1, 69117 Heidelberg (Germany)

    2014-10-01

    Details of the RNA polymerase I crystal structure determination provide a framework for solution of the structures of other multi-subunit complexes. Simple crystallographic experiments are described to extract relevant biological information such as the location of the enzyme active site. Knowing the structure of multi-subunit complexes is critical to understand basic cellular functions. However, when crystals of these complexes can be obtained they rarely diffract beyond 3 Å resolution, which complicates X-ray structure determination and refinement. The crystal structure of RNA polymerase I, an essential cellular machine that synthesizes the precursor of ribosomal RNA in the nucleolus of eukaryotic cells, has recently been solved. Here, the crucial steps that were undertaken to build the atomic model of this multi-subunit enzyme are reported, emphasizing how simple crystallographic experiments can be used to extract relevant biological information. In particular, this report discusses the combination of poor molecular replacement and experimental phases, the application of multi-crystal averaging and the use of anomalous scatterers as sequence markers to guide tracing and to locate the active site. The methods outlined here will likely serve as a reference for future structural determination of large complexes at low resolution.

  1. Heart structure-specific transcriptomic atlas reveals conserved microRNA-mRNA interactions.

    Science.gov (United States)

    Vacchi-Suzzi, Caterina; Hahne, Florian; Scheubel, Philippe; Marcellin, Magali; Dubost, Valerie; Westphal, Magdalena; Boeglen, Catherine; Büchmann-Møller, Stine; Cheung, Ming Sin; Cordier, André; De Benedetto, Christopher; Deurinck, Mark; Frei, Moritz; Moulin, Pierre; Oakeley, Edward; Grenet, Olivier; Grevot, Armelle; Stull, Robert; Theil, Diethilde; Moggs, Jonathan G; Marrer, Estelle; Couttet, Philippe

    2013-01-01

    MicroRNAs are short non-coding RNAs that regulate gene expression at the post-transcriptional level and play key roles in heart development and cardiovascular diseases. Here, we have characterized the expression and distribution of microRNAs across eight cardiac structures (left and right ventricles, apex, papillary muscle, septum, left and right atrium and valves) in rat, Beagle dog and cynomolgus monkey using microRNA sequencing. Conserved microRNA signatures enriched in specific heart structures across these species were identified for cardiac valve (miR-let-7c, miR-125b, miR-127, miR-199a-3p, miR-204, miR-320, miR-99b, miR-328 and miR-744) and myocardium (miR-1, miR-133b, miR-133a, miR-208b, miR-30e, miR-499-5p, miR-30e*). The relative abundance of myocardium-enriched (miR-1) and valve-enriched (miR-125b-5p and miR-204) microRNAs was confirmed using in situ hybridization. MicroRNA-mRNA interactions potentially relevant for cardiac functions were explored using anti-correlation expression analysis and microRNA target prediction algorithms. Interactions between miR-1/Timp3, miR-125b/Rbm24, miR-204/Tgfbr2 and miR-208b/Csnk2a2 were identified and experimentally investigated in human pulmonary smooth muscle cells and luciferase reporter assays. In conclusion, we have generated a high-resolution heart structure-specific mRNA/microRNA expression atlas for three mammalian species that provides a novel resource for investigating novel microRNA regulatory circuits involved in cardiac molecular physiopathology.

  2. Heart structure-specific transcriptomic atlas reveals conserved microRNA-mRNA interactions.

    Directory of Open Access Journals (Sweden)

    Caterina Vacchi-Suzzi

    Full Text Available MicroRNAs are short non-coding RNAs that regulate gene expression at the post-transcriptional level and play key roles in heart development and cardiovascular diseases. Here, we have characterized the expression and distribution of microRNAs across eight cardiac structures (left and right ventricles, apex, papillary muscle, septum, left and right atrium and valves in rat, Beagle dog and cynomolgus monkey using microRNA sequencing. Conserved microRNA signatures enriched in specific heart structures across these species were identified for cardiac valve (miR-let-7c, miR-125b, miR-127, miR-199a-3p, miR-204, miR-320, miR-99b, miR-328 and miR-744 and myocardium (miR-1, miR-133b, miR-133a, miR-208b, miR-30e, miR-499-5p, miR-30e*. The relative abundance of myocardium-enriched (miR-1 and valve-enriched (miR-125b-5p and miR-204 microRNAs was confirmed using in situ hybridization. MicroRNA-mRNA interactions potentially relevant for cardiac functions were explored using anti-correlation expression analysis and microRNA target prediction algorithms. Interactions between miR-1/Timp3, miR-125b/Rbm24, miR-204/Tgfbr2 and miR-208b/Csnk2a2 were identified and experimentally investigated in human pulmonary smooth muscle cells and luciferase reporter assays. In conclusion, we have generated a high-resolution heart structure-specific mRNA/microRNA expression atlas for three mammalian species that provides a novel resource for investigating novel microRNA regulatory circuits involved in cardiac molecular physiopathology.

  3. Structural insights into RISC assembly facilitated by dsRNA-binding domains of human RNA helicase A (DHX9).

    Science.gov (United States)

    Fu, Qinqin; Yuan, Y Adam

    2013-03-01

    Intensive research interest has focused on small RNA-processing machinery and the RNA-induced silencing complex (RISC), key cellular machines in RNAi pathways. However, the structural mechanism regarding RISC assembly, the primary step linking small RNA processing and RNA-mediated gene silencing, is largely unknown. Human RNA helicase A (DHX9) was reported to function as an RISC-loading factor, and such function is mediated mainly by its dsRNA-binding domains (dsRBDs). Here, we report the crystal structures of human RNA helicase A (RHA) dsRBD1 and dsRBD2 domains in complex with dsRNAs, respectively. Structural analysis not only reveals higher siRNA duplex-binding affinity displayed by dsRBD1, but also identifies a crystallographic dsRBD1 pair of physiological significance in cooperatively recognizing dsRNAs. Structural observations are further validated by isothermal titration calorimetric (ITC) assay. Moreover, co-immunoprecipitation (co-IP) assay coupled with mutagenesis demonstrated that both dsRBDs are required for RISC association, and such association is mediated by dsRNA. Hence, our structural and functional efforts have revealed a potential working model for siRNA recognition by RHA tandem dsRBDs, and together they provide direct structural insights into RISC assembly facilitated by RHA.

  4. Disease-associated mutations that alter the RNA structural ensemble.

    Directory of Open Access Journals (Sweden)

    Matthew Halvorsen

    2010-08-01

    Full Text Available Genome-wide association studies (GWAS often identify disease-associated mutations in intergenic and non-coding regions of the genome. Given the high percentage of the human genome that is transcribed, we postulate that for some observed associations the disease phenotype is caused by a structural rearrangement in a regulatory region of the RNA transcript. To identify such mutations, we have performed a genome-wide analysis of all known disease-associated Single Nucleotide Polymorphisms (SNPs from the Human Gene Mutation Database (HGMD that map to the untranslated regions (UTRs of a gene. Rather than using minimum free energy approaches (e.g. mFold, we use a partition function calculation that takes into consideration the ensemble of possible RNA conformations for a given sequence. We identified in the human genome disease-associated SNPs that significantly alter the global conformation of the UTR to which they map. For six disease-states (Hyperferritinemia Cataract Syndrome, beta-Thalassemia, Cartilage-Hair Hypoplasia, Retinoblastoma, Chronic Obstructive Pulmonary Disease (COPD, and Hypertension, we identified multiple SNPs in UTRs that alter the mRNA structural ensemble of the associated genes. Using a Boltzmann sampling procedure for sub-optimal RNA structures, we are able to characterize and visualize the nature of the conformational changes induced by the disease-associated mutations in the structural ensemble. We observe in several cases (specifically the 5' UTRs of FTL and RB1 SNP-induced conformational changes analogous to those observed in bacterial regulatory Riboswitches when specific ligands bind. We propose that the UTR and SNP combinations we identify constitute a "RiboSNitch," that is a regulatory RNA in which a specific SNP has a structural consequence that results in a disease phenotype. Our SNPfold algorithm can help identify RiboSNitches by leveraging GWAS data and an analysis of the mRNA structural ensemble.

  5. Selective 2′-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile, and accurate RNA structure analysis

    Science.gov (United States)

    Smola, Matthew J.; Rice, Greggory M.; Busan, Steven; Siegfried, Nathan A.; Weeks, Kevin M.

    2016-01-01

    SHAPE chemistries exploit small electrophilic reagents that react with the 2′-hydroxyl group to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues based on the ability of reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as for simple model RNAs. This protocol describes the experimental steps, implemented over three days, required to perform SHAPE probing and construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. These steps include RNA folding and SHAPE structure probing, mutational profiling by reverse transcription, library construction, and sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots, and provides useful troubleshooting information, often within an hour. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures, and visualize probable and alternative helices, often in under a day. We illustrate these algorithms with the E. coli thiamine pyrophosphate riboswitch, E. coli 16S rRNA, and HIV-1 genomic RNAs. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles, and entire transcriptomes. The straightforward MaP strategy greatly expands the number, length, and complexity of analyzable RNA structures. PMID:26426499

  6. Structural imprints in vivo decode RNA regulatory mechanisms.

    Science.gov (United States)

    Spitale, Robert C; Flynn, Ryan A; Zhang, Qiangfeng Cliff; Crisalli, Pete; Lee, Byron; Jung, Jong-Wha; Kuchelmeister, Hannes Y; Batista, Pedro J; Torre, Eduardo A; Kool, Eric T; Chang, Howard Y

    2015-03-26

    Visualizing the physical basis for molecular behaviour inside living cells is a great challenge for biology. RNAs are central to biological regulation, and the ability of RNA to adopt specific structures intimately controls every step of the gene expression program. However, our understanding of physiological RNA structures is limited; current in vivo RNA structure profiles include only two of the four nucleotides that make up RNA. Here we present a novel biochemical approach, in vivo click selective 2'-hydroxyl acylation and profiling experiment (icSHAPE), which enables the first global view, to our knowledge, of RNA secondary structures in living cells for all four bases. icSHAPE of the mouse embryonic stem cell transcriptome versus purified RNA folded in vitro shows that the structural dynamics of RNA in the cellular environment distinguish different classes of RNAs and regulatory elements. Structural signatures at translational start sites and ribosome pause sites are conserved from in vitro conditions, suggesting that these RNA elements are programmed by sequence. In contrast, focal structural rearrangements in vivo reveal precise interfaces of RNA with RNA-binding proteins or RNA-modification sites that are consistent with atomic-resolution structural data. Such dynamic structural footprints enable accurate prediction of RNA-protein interactions and N(6)-methyladenosine (m(6)A) modification genome wide. These results open the door for structural genomics of RNA in living cells and reveal key physiological structures controlling gene expression.

  7. RNA structural constraints in the evolution of the influenza A virus genome NP segment

    NARCIS (Netherlands)

    A.P. Gultyaev (Alexander); A. Tsyganov-Bodounov (Anton); M.I. Spronken (Monique); S. Van Der Kooij (Sander); R.A.M. Fouchier (Ron); R.C.L. Olsthoorn (René)

    2014-01-01

    textabstractConserved RNA secondary structures were predicted in the nucleoprotein (NP) segment of the influenza A virus genome using comparative sequence and structure analysis. A number of structural elements exhibiting nucleotide covariations were identified over the whole segment length,

  8. Structure and functional analysis of the RNA- and viral phosphoprotein-binding domain of respiratory syncytial virus M2-1 protein.

    Directory of Open Access Journals (Sweden)

    Marie-Lise Blondot

    Full Text Available Respiratory syncytial virus (RSV protein M2-1 functions as an essential transcriptional cofactor of the viral RNA-dependent RNA polymerase (RdRp complex by increasing polymerase processivity. M2-1 is a modular RNA binding protein that also interacts with the viral phosphoprotein P, another component of the RdRp complex. These binding properties are related to the core region of M2-1 encompassing residues S58 to K177. Here we report the NMR structure of the RSV M2-1(58-177 core domain, which is structurally homologous to the C-terminal domain of Ebola virus VP30, a transcription co-factor sharing functional similarity with M2-1. The partial overlap of RNA and P interaction surfaces on M2-1(58-177, as determined by NMR, rationalizes the previously observed competitive behavior of RNA versus P. Using site-directed mutagenesis, we identified eight residues located on these surfaces that are critical for an efficient transcription activity of the RdRp complex. Single mutations of these residues disrupted specifically either P or RNA binding to M2-1 in vitro. M2-1 recruitment to cytoplasmic inclusion bodies, which are regarded as sites of viral RNA synthesis, was impaired by mutations affecting only binding to P, but not to RNA, suggesting that M2-1 is associated to the holonucleocapsid by interacting with P. These results reveal that RNA and P binding to M2-1 can be uncoupled and that both are critical for the transcriptional antitermination function of M2-1.

  9. Structural Chemistry of Human RNA Methyltransferases.

    Science.gov (United States)

    Schapira, Matthieu

    2016-03-18

    RNA methyltransferases (RNMTs) play important roles in RNA stability, splicing, and epigenetic mechanisms. They constitute a promising target class that is underexplored by the medicinal chemistry community. Information of relevance to drug design can be extracted from the rich structural coverage of human RNMTs. In this work, the structural chemistry of this protein family is analyzed in depth. Unlike most methyltransferases, RNMTs generally feature a substrate-binding site that is largely open on the cofactor-binding pocket, favoring the design of bisubstrate inhibitors. Substrate purine or pyrimidines are often sandwiched between hydrophobic walls that can accommodate planar ring systems. When the substrate base is laying on a shallow surface, a 5' flanking base is sometimes anchored in a druggable cavity. The cofactor-binding site is structurally more diverse than in protein methyltransferases and more druggable in SPOUT than in Rossman-fold enzymes. Finally, conformational plasticity observed both at the substrate and cofactor binding sites may be a challenge for structure-based drug design. The landscape drawn here may inform ongoing efforts toward the discovery of the first human RNMT inhibitors.

  10. Stochastic sampling of the RNA structural alignment space.

    Science.gov (United States)

    Harmanci, Arif Ozgun; Sharma, Gaurav; Mathews, David H

    2009-07-01

    A novel method is presented for predicting the common secondary structures and alignment of two homologous RNA sequences by sampling the 'structural alignment' space, i.e. the joint space of their alignments and common secondary structures. The structural alignment space is sampled according to a pseudo-Boltzmann distribution based on a pseudo-free energy change that combines base pairing probabilities from a thermodynamic model and alignment probabilities from a hidden Markov model. By virtue of the implicit comparative analysis between the two sequences, the method offers an improvement over single sequence sampling of the Boltzmann ensemble. A cluster analysis shows that the samples obtained from joint sampling of the structural alignment space cluster more closely than samples generated by the single sequence method. On average, the representative (centroid) structure and alignment of the most populated cluster in the sample of structures and alignments generated by joint sampling are more accurate than single sequence sampling and alignment based on sequence alone, respectively. The 'best' centroid structure that is closest to the known structure among all the centroids is, on average, more accurate than structure predictions of other methods. Additionally, cluster analysis identifies, on average, a few clusters, whose centroids can be presented as alternative candidates. The source code for the proposed method can be downloaded at http://rna.urmc.rochester.edu.

  11. Structural basis of genomic RNA (gRNA) dimerization and packaging determinants of mouse mammary tumor virus (MMTV).

    Science.gov (United States)

    Aktar, Suriya J; Vivet-Boudou, Valérie; Ali, Lizna M; Jabeen, Ayesha; Kalloush, Rawan M; Richer, Delphine; Mustafa, Farah; Marquet, Roland; Rizvi, Tahir A

    2014-11-14

    One of the hallmarks of retroviral life cycle is the efficient and specific packaging of two copies of retroviral gRNA in the form of a non-covalent RNA dimer by the assembling virions. It is becoming increasingly clear that the process of dimerization is closely linked with gRNA packaging, and in some retroviruses, the latter depends on the former. Earlier mutational analysis of the 5' end of the MMTV genome indicated that MMTV gRNA packaging determinants comprise sequences both within the 5' untranslated region (5' UTR) and the beginning of gag. The RNA secondary structure of MMTV gRNA packaging sequences was elucidated employing selective 2'hydroxyl acylation analyzed by primer extension (SHAPE). SHAPE analyses revealed the presence of a U5/Gag long-range interaction (U5/Gag LRI), not predicted by minimum free-energy structure predictions that potentially stabilizes the global structure of this region. Structure conservation along with base-pair covariations between different strains of MMTV further supported the SHAPE-validated model. The 5' region of the MMTV gRNA contains multiple palindromic (pal) sequences that could initiate intermolecular interaction during RNA dimerization. In vitro RNA dimerization, SHAPE analysis, and structure prediction approaches on a series of pal mutants revealed that MMTV RNA utilizes a palindromic point of contact to initiate intermolecular interactions between two gRNAs, leading to dimerization. This contact point resides within pal II (5' CGGCCG 3') at the 5' UTR and contains a canonical "GC" dyad and therefore likely constitutes the MMTV RNA dimerization initiation site (DIS). Further analyses of these pal mutants employing in vivo genetic approaches indicate that pal II, as well as pal sequences located in the primer binding site (PBS) are both required for efficient MMTV gRNA packaging. Employing structural prediction, biochemical, and genetic approaches, we show that pal II functions as a primary point of contact between

  12. RNA.

    Science.gov (United States)

    Darnell, James E., Jr.

    1985-01-01

    Ribonucleic acid (RNA) converts genetic information into protein and usually must be processed to serve its function. RNA types, chemical structure, protein synthesis, translation, manufacture, and processing are discussed. Concludes that the first genes might have been spliced RNA and that humans might be closer than bacteria to primitive…

  13. Regulatory effects of cotranscriptional RNA structure formation and transitions.

    Science.gov (United States)

    Liu, Sheng-Rui; Hu, Chun-Gen; Zhang, Jin-Zhi

    2016-09-01

    RNAs, which play significant roles in many fundamental biological processes of life, fold into sophisticated and precise structures. RNA folding is a dynamic and intricate process, which conformation transition of coding and noncoding RNAs form the primary elements of genetic regulation. The cellular environment contains various intrinsic and extrinsic factors that potentially affect RNA folding in vivo, and experimental and theoretical evidence increasingly indicates that the highly flexible features of the RNA structure are affected by these factors, which include the flanking sequence context, physiochemical conditions, cis RNA-RNA interactions, and RNA interactions with other molecules. Furthermore, distinct RNA structures have been identified that govern almost all steps of biological processes in cells, including transcriptional activation and termination, transcriptional mutagenesis, 5'-capping, splicing, 3'-polyadenylation, mRNA export and localization, and translation. Here, we briefly summarize the dynamic and complex features of RNA folding along with a wide variety of intrinsic and extrinsic factors that affect RNA folding. We then provide several examples to elaborate RNA structure-mediated regulation at the transcriptional and posttranscriptional levels. Finally, we illustrate the regulatory roles of RNA structure and discuss advances pertaining to RNA structure in plants. WIREs RNA 2016, 7:562-574. doi: 10.1002/wrna.1350 For further resources related to this article, please visit the WIREs website. © 2016 Wiley Periodicals, Inc.

  14. Complete probabilistic analysis of RNA shapes

    Directory of Open Access Journals (Sweden)

    Voß Björn

    2006-02-01

    Full Text Available Abstract Background Soon after the first algorithms for RNA folding became available, it was recognised that the prediction of only one energetically optimal structure is insufficient to achieve reliable results. An in-depth analysis of the folding space as a whole appeared necessary to deduce the structural properties of a given RNA molecule reliably. Folding space analysis comprises various methods such as suboptimal folding, computation of base pair probabilities, sampling procedures and abstract shape analysis. Common to many approaches is the idea of partitioning the folding space into classes of structures, for which certain properties can be derived. Results In this paper we extend the approach of abstract shape analysis. We show how to compute the accumulated probabilities of all structures that share the same shape. While this implies a complete (non-heuristic analysis of the folding space, the computational effort depends only on the size of the shape space, which is much smaller. This approach has been integrated into the tool RNAshapes, and we apply it to various RNAs. Conclusion Analyses of conformational switches show the existence of two shapes with probabilities approximately 23 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabikdaYaqaaiabiodaZaaaaaa@2EA2@ vs. 13 MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaadaWcaaqaaiabigdaXaqaaiabiodaZaaaaaa@2EA0@, whereas the analysis of a microRNA precursor reveals one shape with a probability near to 1.0. Furthermore, it is shown that a shape can outperform an energetically more favourable one by

  15. Automated and fast building of three-dimensional RNA structures.

    Science.gov (United States)

    Zhao, Yunjie; Huang, Yangyu; Gong, Zhou; Wang, Yanjie; Man, Jianfen; Xiao, Yi

    2012-01-01

    Building tertiary structures of non-coding RNA is required to understand their functions and design new molecules. Current algorithms of RNA tertiary structure prediction give satisfactory accuracy only for small size and simple topology and many of them need manual manipulation. Here, we present an automated and fast program, 3dRNA, for RNA tertiary structure prediction with reasonable accuracy for RNAs of larger size and complex topology.

  16. RNA structure alignment by a unit-vector approach.

    Science.gov (United States)

    Capriotti, Emidio; Marti-Renom, Marc A

    2008-08-15

    The recent discovery of tiny RNA molecules such as microRNAs and small interfering RNA are transforming the view of RNA as a simple information transfer molecule. Similar to proteins, the native three-dimensional structure of RNA determines its biological activity. Therefore, classifying the current structural space is paramount for functionally annotating RNA molecules. The increasing numbers of RNA structures deposited in the PDB requires more accurate, automatic and benchmarked methods for RNA structure comparison. In this article, we introduce a new algorithm for RNA structure alignment based on a unit-vector approach. The algorithm has been implemented in the SARA program, which results in RNA structure pairwise alignments and their statistical significance. The SARA program has been implemented to be of general applicability even when no secondary structure can be calculated from the RNA structures. A benchmark against the ARTS program using a set of 1275 non-redundant pairwise structure alignments results in inverted approximately 6% extra alignments with at least 50% structurally superposed nucleotides and base pairs. A first attempt to perform RNA automatic functional annotation based on structure alignments indicates that SARA can correctly assign the deepest SCOR classification to >60% of the query structures. The SARA program is freely available through a World Wide Web server http://sgu.bioinfo.cipf.es/services/SARA/. Supplementary data are available at Bioinformatics online.

  17. Investigation of RNA structure in satellite panicum mosaic virus

    International Nuclear Information System (INIS)

    Makino, D.L.; Day, J.; Larson, S.B.; McPherson, A.

    2006-01-01

    Three new crystal forms of satellite panicum mosaic virus (SPMV) were grown and their structures solved from X-ray diffraction data using molecular replacement techniques. The crystals were grown under conditions of pH and ionic strength that were appreciably different then those used for the original structure determination. In rhombohedral crystals grown at pH 8.5 and low ionic strength PEG 3350 solutions, Fourier syntheses revealed segments, ten amino acid residues long, of amino-terminal polypeptides not previously seen, as well as masses of electron density within concavities on the interior of the capsid, which appeared in the neighborhoods of icosahedral five- and threefold axes. The densities were compatible with secondary structural domains of RNA, and they included a segment of double helical RNA of about four to five base pairs oriented, at least approximately, along the fivefold axes. The distribution of RNA observed for SPMV appears to be distinctly different than the encapsidated nucleic acid conformation previously suggested for another satellite virus, satellite tobacco mosaic virus. This study further shows that analysis of viruses in crystals grown under different chemical conditions may reveal additional information regarding the structure of encapsidated RNA

  18. The crystal structure of tRNA

    Indian Academy of Sciences (India)

    Madhu

    of yeast alanine tRNA by Robert Holley's group at Cornell. University ... decode nonsense codons) with John Smith and Brenner. However, my ... tRNA from 10 g of unfractionated tRNA. ... tRNA crystals were, in fact, protein (Hendrikson et al.

  19. A folding algorithm for extended RNA secondary structures.

    Science.gov (United States)

    Höner zu Siederdissen, Christian; Bernhart, Stephan H; Stadler, Peter F; Hofacker, Ivo L

    2011-07-01

    RNA secondary structure contains many non-canonical base pairs of different pair families. Successful prediction of these structural features leads to improved secondary structures with applications in tertiary structure prediction and simultaneous folding and alignment. We present a theoretical model capturing both RNA pair families and extended secondary structure motifs with shared nucleotides using 2-diagrams. We accompany this model with a number of programs for parameter optimization and structure prediction. All sources (optimization routines, RNA folding, RNA evaluation, extended secondary structure visualization) are published under the GPLv3 and available at www.tbi.univie.ac.at/software/rnawolf/.

  20. Selective 2'-hydroxyl acylation analyzed by primer extension and mutational profiling (SHAPE-MaP) for direct, versatile and accurate RNA structure analysis.

    Science.gov (United States)

    Smola, Matthew J; Rice, Greggory M; Busan, Steven; Siegfried, Nathan A; Weeks, Kevin M

    2015-11-01

    Selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistries exploit small electrophilic reagents that react with 2'-hydroxyl groups to interrogate RNA structure at single-nucleotide resolution. Mutational profiling (MaP) identifies modified residues by using reverse transcriptase to misread a SHAPE-modified nucleotide and then counting the resulting mutations by massively parallel sequencing. The SHAPE-MaP approach measures the structure of large and transcriptome-wide systems as accurately as can be done for simple model RNAs. This protocol describes the experimental steps, implemented over 3 d, that are required to perform SHAPE probing and to construct multiplexed SHAPE-MaP libraries suitable for deep sequencing. Automated processing of MaP sequencing data is accomplished using two software packages. ShapeMapper converts raw sequencing files into mutational profiles, creates SHAPE reactivity plots and provides useful troubleshooting information. SuperFold uses these data to model RNA secondary structures, identify regions with well-defined structures and visualize probable and alternative helices, often in under 1 d. SHAPE-MaP can be used to make nucleotide-resolution biophysical measurements of individual RNA motifs, rare components of complex RNA ensembles and entire transcriptomes.

  1. Primary structure of human pancreatic elastase 2 determined by sequence analysis of the cloned mRNA

    International Nuclear Information System (INIS)

    Fletcher, T.S.; Shen, W.F.; Largman, C.

    1987-01-01

    A cDNA encoding elastase 2 has been cloned from a human pancreatic cDNA library. The cDNA contains a translation initiation site and a poly(A) recognition site and encodes a protein of 269 amino acids, including a proposed 16-residue signal peptide. The amino acid sequence of the deduced mature protein contains a 12-residue activation peptide containing a cysteine at residue 1 similar to that of chymotryspin. The proposed active enzyme contains all of the characteristic active-site amino acids, including His-57, Asp-102, and Ser-195. The S1 binding pocket is bounded by Gly-216 and Ser-226, making this pocket intermediate in size between chymotrypsins and elastase 1 or protease E, consistent with the substrate specificity of elastase 2 for long-chain aliphatic or aromatic amino acids. Computer modeling studies using the amino acid sequence of elastase 2 superimposed on the X-ray structure of porcine elastase 1 suggest that a change of Gln-192 in elastase 1 to Asn-192 in elastase 2 may account for the lower catalytic efficiency of the latter enzyme. Several basic residues appear to be near the ends of the extended binding pocket of elastases which might serve to anchor the enzyme to the elastin substrate. These studies indicate that elastases 2 and elastase 1 both contain an Arg-65A as well as a basic dipeptide at 223/224 which is not present in chymotrypsins. In addition, Arg-217A is present in humaan elastase 2 but absent in rat pancreatic protein which has been proposed to be an elastase 2 on the basis of sequence homology, but which was not isolated during screening of rat pancreatic tissue extracts for elastolytic activity

  2. Sequence-structure relationships in RNA loops: establishing the basis for loop homology modeling.

    Science.gov (United States)

    Schudoma, Christian; May, Patrick; Nikiforova, Viktoria; Walther, Dirk

    2010-01-01

    The specific function of RNA molecules frequently resides in their seemingly unstructured loop regions. We performed a systematic analysis of RNA loops extracted from experimentally determined three-dimensional structures of RNA molecules. A comprehensive loop-structure data set was created and organized into distinct clusters based on structural and sequence similarity. We detected clear evidence of the hallmark of homology present in the sequence-structure relationships in loops. Loops differing by structures. Thus, our results support the application of homology modeling for RNA loop model building. We established a threshold that may guide the sequence divergence-based selection of template structures for RNA loop homology modeling. Of all possible sequences that are, under the assumption of isosteric relationships, theoretically compatible with actual sequences observed in RNA structures, only a small fraction is contained in the Rfam database of RNA sequences and classes implying that the actual RNA loop space may consist of a limited number of unique loop structures and conserved sequences. The loop-structure data sets are made available via an online database, RLooM. RLooM also offers functionalities for the modeling of RNA loop structures in support of RNA engineering and design efforts.

  3. Reconstruction of ancestral RNA sequences under multiple structural constraints

    OpenAIRE

    Tremblay-Savard, Olivier; Reinharz, Vladimir; Waldisp?hl, J?r?me

    2016-01-01

    Background Secondary structures form the scaffold of multiple sequence alignment of non-coding RNA (ncRNA) families. An accurate reconstruction of ancestral ncRNAs must use this structural signal. However, the inference of ancestors of a single ncRNA family with a single consensus structure may bias the results towards sequences with high affinity to this structure, which are far from the true ancestors. Methods In this paper, we introduce achARNement, a maximum parsimony approach that, given...

  4. High-throughput determination of RNA structure by proximity ligation.

    Science.gov (United States)

    Ramani, Vijay; Qiu, Ruolan; Shendure, Jay

    2015-09-01

    We present an unbiased method to globally resolve RNA structures through pairwise contact measurements between interacting regions. RNA proximity ligation (RPL) uses proximity ligation of native RNA followed by deep sequencing to yield chimeric reads with ligation junctions in the vicinity of structurally proximate bases. We apply RPL in both baker's yeast (Saccharomyces cerevisiae) and human cells and generate contact probability maps for ribosomal and other abundant RNAs, including yeast snoRNAs, the RNA subunit of the signal recognition particle and the yeast U2 spliceosomal RNA homolog. RPL measurements correlate with established secondary structures for these RNA molecules, including stem-loop structures and long-range pseudoknots. We anticipate that RPL will complement the current repertoire of computational and experimental approaches in enabling the high-throughput determination of secondary and tertiary RNA structures.

  5. Random generation of RNA secondary structures according to native distributions

    Directory of Open Access Journals (Sweden)

    Nebel Markus E

    2011-10-01

    Full Text Available Abstract Background Random biological sequences are a topic of great interest in genome analysis since, according to a powerful paradigm, they represent the background noise from which the actual biological information must differentiate. Accordingly, the generation of random sequences has been investigated for a long time. Similarly, random object of a more complicated structure like RNA molecules or proteins are of interest. Results In this article, we present a new general framework for deriving algorithms for the non-uniform random generation of combinatorial objects according to the encoding and probability distribution implied by a stochastic context-free grammar. Briefly, the framework extends on the well-known recursive method for (uniform random generation and uses the popular framework of admissible specifications of combinatorial classes, introducing weighted combinatorial classes to allow for the non-uniform generation by means of unranking. This framework is used to derive an algorithm for the generation of RNA secondary structures of a given fixed size. We address the random generation of these structures according to a realistic distribution obtained from real-life data by using a very detailed context-free grammar (that models the class of RNA secondary structures by distinguishing between all known motifs in RNA structure. Compared to well-known sampling approaches used in several structure prediction tools (such as SFold ours has two major advantages: Firstly, after a preprocessing step in time O(n2 for the computation of all weighted class sizes needed, with our approach a set of m random secondary structures of a given structure size n can be computed in worst-case time complexity Om⋅n⋅ log(n while other algorithms typically have a runtime in O(m⋅n2. Secondly, our approach works with integer arithmetic only which is faster and saves us from all the discomforting details of using floating point arithmetic with

  6. rCAD: A Novel Database Schema for the Comparative Analysis of RNA.

    Science.gov (United States)

    Ozer, Stuart; Doshi, Kishore J; Xu, Weijia; Gutell, Robin R

    2011-12-31

    Beyond its direct involvement in protein synthesis with mRNA, tRNA, and rRNA, RNA is now being appreciated for its significance in the overall metabolism and regulation of the cell. Comparative analysis has been very effective in the identification and characterization of RNA molecules, including the accurate prediction of their secondary structure. We are developing an integrative scalable data management and analysis system, the RNA Comparative Analysis Database (rCAD), implemented with SQL Server to support RNA comparative analysis. The platformagnostic database schema of rCAD captures the essential relationships between the different dimensions of information for RNA comparative analysis datasets. The rCAD implementation enables a variety of comparative analysis manipulations with multiple integrated data dimensions for advanced RNA comparative analysis workflows. In this paper, we describe details of the rCAD schema design and illustrate its usefulness with two usage scenarios.

  7. Modelling the structure of a ceRNA-theoretical, bipartite microRNA-mRNA interaction network regulating intestinal epithelial cellular pathways using R programming.

    Science.gov (United States)

    Robinson, J M; Henderson, W A

    2018-01-12

    We report a method using functional-molecular databases and network modelling to identify hypothetical mRNA-miRNA interaction networks regulating intestinal epithelial barrier function. The model forms a data-analysis component of our cell culture experiments, which produce RNA expression data from Nanostring Technologies nCounter ® system. The epithelial tight-junction (TJ) and actin cytoskeleton interact as molecular components of the intestinal epithelial barrier. Upstream regulation of TJ-cytoskeleton interaction is effected by the Rac/Rock/Rho signaling pathway and other associated pathways which may be activated or suppressed by extracellular signaling from growth factors, hormones, and immune receptors. Pathway activations affect epithelial homeostasis, contributing to degradation of the epithelial barrier associated with osmotic dysregulation, inflammation, and tumor development. The complexity underlying miRNA-mRNA interaction networks represents a roadblock for prediction and validation of competing-endogenous RNA network function. We developed a network model to identify hypothetical co-regulatory motifs in a miRNA-mRNA interaction network related to epithelial function. A mRNA-miRNA interaction list was generated using KEGG and miRWalk2.0 databases. R-code was developed to quantify and visualize inherent network structures. We identified a sub-network with a high number of shared, targeting miRNAs, of genes associated with cellular proliferation and cancer, including c-MYC and Cyclin D.

  8. Differential accumulation of nif structural gene mRNA in Azotobacter vinelandii.

    Science.gov (United States)

    Hamilton, Trinity L; Jacobson, Marty; Ludwig, Marcus; Boyd, Eric S; Bryant, Donald A; Dean, Dennis R; Peters, John W

    2011-09-01

    Northern analysis was employed to investigate mRNA produced by mutant strains of Azotobacter vinelandii with defined deletions in the nif structural genes and in the intergenic noncoding regions. The results indicate that intergenic RNA secondary structures effect the differential accumulation of transcripts, supporting the high Fe protein-to-MoFe protein ratio required for optimal diazotrophic growth.

  9. On the importance of cotranscriptional RNA structure formation

    Science.gov (United States)

    Lai, Daniel; Proctor, Jeff R.; Meyer, Irmtraud M.

    2013-01-01

    The expression of genes, both coding and noncoding, can be significantly influenced by RNA structural features of their corresponding transcripts. There is by now mounting experimental and some theoretical evidence that structure formation in vivo starts during transcription and that this cotranscriptional folding determines the functional RNA structural features that are being formed. Several decades of research in bioinformatics have resulted in a wide range of computational methods for predicting RNA secondary structures. Almost all state-of-the-art methods in terms of prediction accuracy, however, completely ignore the process of structure formation and focus exclusively on the final RNA structure. This review hopes to bridge this gap. We summarize the existing evidence for cotranscriptional folding and then review the different, currently used strategies for RNA secondary-structure prediction. Finally, we propose a range of ideas on how state-of-the-art methods could be potentially improved by explicitly capturing the process of cotranscriptional structure formation. PMID:24131802

  10. Computer-Aided Design of RNA Origami Structures.

    Science.gov (United States)

    Sparvath, Steffen L; Geary, Cody W; Andersen, Ebbe S

    2017-01-01

    RNA nanostructures can be used as scaffolds to organize, combine, and control molecular functionalities, with great potential for applications in nanomedicine and synthetic biology. The single-stranded RNA origami method allows RNA nanostructures to be folded as they are transcribed by the RNA polymerase. RNA origami structures provide a stable framework that can be decorated with functional RNA elements such as riboswitches, ribozymes, interaction sites, and aptamers for binding small molecules or protein targets. The rich library of RNA structural and functional elements combined with the possibility to attach proteins through aptamer-based binding creates virtually limitless possibilities for constructing advanced RNA-based nanodevices.In this chapter we provide a detailed protocol for the single-stranded RNA origami design method using a simple 2-helix tall structure as an example. The first step involves 3D modeling of a double-crossover between two RNA double helices, followed by decoration with tertiary motifs. The second step deals with the construction of a 2D blueprint describing the secondary structure and sequence constraints that serves as the input for computer programs. In the third step, computer programs are used to design RNA sequences that are compatible with the structure, and the resulting outputs are evaluated and converted into DNA sequences to order.

  11. Advances in RNA Structure Determination | Center for Cancer Research

    Science.gov (United States)

    The recent years have witnessed a revolution in the field of RNA structure and function. Until recently the main contribution of RNA in cellular and disease functions was considered to be a role defined by the central dogma, namely DNA codes for mRNAs, which in turn encode for proteins, a notion facilitated by non-coding ribosomal RNA and tRNA. It was also assumed at the time

  12. NoFold: RNA structure clustering without folding or alignment.

    Science.gov (United States)

    Middleton, Sarah A; Kim, Junhyong

    2014-11-01

    Structures that recur across multiple different transcripts, called structure motifs, often perform a similar function-for example, recruiting a specific RNA-binding protein that then regulates translation, splicing, or subcellular localization. Identifying common motifs between coregulated transcripts may therefore yield significant insight into their binding partners and mechanism of regulation. However, as most methods for clustering structures are based on folding individual sequences or doing many pairwise alignments, this results in a tradeoff between speed and accuracy that can be problematic for large-scale data sets. Here we describe a novel method for comparing and characterizing RNA secondary structures that does not require folding or pairwise alignment of the input sequences. Our method uses the idea of constructing a distance function between two objects by their respective distances to a collection of empirical examples or models, which in our case consists of 1973 Rfam family covariance models. Using this as a basis for measuring structural similarity, we developed a clustering pipeline called NoFold to automatically identify and annotate structure motifs within large sequence data sets. We demonstrate that NoFold can simultaneously identify multiple structure motifs with an average sensitivity of 0.80 and precision of 0.98 and generally exceeds the performance of existing methods. We also perform a cross-validation analysis of the entire set of Rfam families, achieving an average sensitivity of 0.57. We apply NoFold to identify motifs enriched in dendritically localized transcripts and report 213 enriched motifs, including both known and novel structures. © 2014 Middleton and Kim; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  13. Solving RNA's structural secrets: interaction with antibodies and crystal structure of a nuclease resistant RNA

    International Nuclear Information System (INIS)

    Wallace, S.T.

    1998-10-01

    This Ph.D. thesis concerns the structural characterization of RNA. The work is split into two sections: 1) in vitro selection and characterization of RNAs which bind antibiotics and 2) crystal structure of a nuclease resistant RNA molecule used in antisense applications. Understanding antibiotic-RNA interactions is crucial in aiding rational drug design. We were interested in studying antibiotic interactions with RNAs small enough to characterize at the molecular and possibly at the atomic level. In order to do so, we previously performed in vitro selection to find small RNAs which bind to the peptide antibiotic viomycin and the aminoglycoside antibiotic streptomycin. The characterization of the viomycin-binding RNAs revealed the necessity of a pseudoknot-structure in order to interact with the antibiotic. The RNAs which were selected to interact with streptomycin require the presence of magnesium to bind the antibiotic. One of the RNAs, upon interacting with streptomycin undergoes a significant conformational change spanning the entire RNA sequence needed to bind the antibiotic. In a quest to design oligodeoxynucleotides (ODNs) which are able to specifically bid and inactivate the mRNA of a gene, it is necessary to fulfill two criteria: 1) increase binding affinity between the ODN and the target RNA and 2) increase the ODN's resistance to nuclease degradation. An ODN with an aminopropyl modification at the 2' position of its ribose has emerged as the most successful candidate at fulfilling both criteria. It is the most nuclease resistant modification known to date. We were interested in explaining how this modification is able to circumvent degradation by nucleases. A dodecamer containing a single 2'-O-aminopropyl modified nucleotide was crystallized and the structure was solved to a resolution of 1.6 A. In an attempt to explain the nuclease resistance, the crystal coordinates were modeled into the active exonuclease site of DNA polymerase I. We propose the

  14. RNA folding: structure prediction, folding kinetics and ion electrostatics.

    Science.gov (United States)

    Tan, Zhijie; Zhang, Wenbing; Shi, Yazhou; Wang, Fenghua

    2015-01-01

    Beyond the "traditional" functions such as gene storage, transport and protein synthesis, recent discoveries reveal that RNAs have important "new" biological functions including the RNA silence and gene regulation of riboswitch. Such functions of noncoding RNAs are strongly coupled to the RNA structures and proper structure change, which naturally leads to the RNA folding problem including structure prediction and folding kinetics. Due to the polyanionic nature of RNAs, RNA folding structure, stability and kinetics are strongly coupled to the ion condition of solution. The main focus of this chapter is to review the recent progress in the three major aspects in RNA folding problem: structure prediction, folding kinetics and ion electrostatics. This chapter will introduce both the recent experimental and theoretical progress, while emphasize the theoretical modelling on the three aspects in RNA folding.

  15. RNA FRABASE 2.0: an advanced web-accessible database with the capacity to search the three-dimensional fragments within RNA structures

    Directory of Open Access Journals (Sweden)

    Wasik Szymon

    2010-05-01

    Full Text Available Abstract Background Recent discoveries concerning novel functions of RNA, such as RNA interference, have contributed towards the growing importance of the field. In this respect, a deeper knowledge of complex three-dimensional RNA structures is essential to understand their new biological functions. A number of bioinformatic tools have been proposed to explore two major structural databases (PDB, NDB in order to analyze various aspects of RNA tertiary structures. One of these tools is RNA FRABASE 1.0, the first web-accessible database with an engine for automatic search of 3D fragments within PDB-derived RNA structures. This search is based upon the user-defined RNA secondary structure pattern. In this paper, we present and discuss RNA FRABASE 2.0. This second version of the system represents a major extension of this tool in terms of providing new data and a wide spectrum of novel functionalities. An intuitionally operated web server platform enables very fast user-tailored search of three-dimensional RNA fragments, their multi-parameter conformational analysis and visualization. Description RNA FRABASE 2.0 has stored information on 1565 PDB-deposited RNA structures, including all NMR models. The RNA FRABASE 2.0 search engine algorithms operate on the database of the RNA sequences and the new library of RNA secondary structures, coded in the dot-bracket format extended to hold multi-stranded structures and to cover residues whose coordinates are missing in the PDB files. The library of RNA secondary structures (and their graphics is made available. A high level of efficiency of the 3D search has been achieved by introducing novel tools to formulate advanced searching patterns and to screen highly populated tertiary structure elements. RNA FRABASE 2.0 also stores data and conformational parameters in order to provide "on the spot" structural filters to explore the three-dimensional RNA structures. An instant visualization of the 3D RNA

  16. Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

    Science.gov (United States)

    Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

    2008-07-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate si

  17. Mapping RNA Structure In Vitro with SHAPE Chemistry and Next-Generation Sequencing (SHAPE-Seq).

    Science.gov (United States)

    Watters, Kyle E; Lucks, Julius B

    2016-01-01

    Mapping RNA structure with selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) chemistry has proven to be a versatile method for characterizing RNA structure in a variety of contexts. SHAPE reagents covalently modify RNAs in a structure-dependent manner to create adducts at the 2'-OH group of the ribose backbone at nucleotides that are structurally flexible. The positions of these adducts are detected using reverse transcriptase (RT) primer extension, which stops one nucleotide before the modification, to create a pool of cDNAs whose lengths reflect the location of SHAPE modification. Quantification of the cDNA pools is used to estimate the "reactivity" of each nucleotide in an RNA molecule to the SHAPE reagent. High reactivities indicate nucleotides that are structurally flexible, while low reactivities indicate nucleotides that are inflexible. These SHAPE reactivities can then be used to infer RNA structures by restraining RNA structure prediction algorithms. Here, we provide a state-of-the-art protocol describing how to perform in vitro RNA structure probing with SHAPE chemistry using next-generation sequencing to quantify cDNA pools and estimate reactivities (SHAPE-Seq). The use of next-generation sequencing allows for higher throughput, more consistent data analysis, and multiplexing capabilities. The technique described herein, SHAPE-Seq v2.0, uses a universal reverse transcription priming site that is ligated to the RNA after SHAPE modification. The introduced priming site allows for the structural analysis of an RNA independent of its sequence.

  18. A Method to Predict the Structure and Stability of RNA/RNA Complexes.

    Science.gov (United States)

    Xu, Xiaojun; Chen, Shi-Jie

    2016-01-01

    RNA/RNA interactions are essential for genomic RNA dimerization and regulation of gene expression. Intermolecular loop-loop base pairing is a widespread and functionally important tertiary structure motif in RNA machinery. However, computational prediction of intermolecular loop-loop base pairing is challenged by the entropy and free energy calculation due to the conformational constraint and the intermolecular interactions. In this chapter, we describe a recently developed statistical mechanics-based method for the prediction of RNA/RNA complex structures and stabilities. The method is based on the virtual bond RNA folding model (Vfold). The main emphasis in the method is placed on the evaluation of the entropy and free energy for the loops, especially tertiary kissing loops. The method also uses recursive partition function calculations and two-step screening algorithm for large, complicated structures of RNA/RNA complexes. As case studies, we use the HIV-1 Mal dimer and the siRNA/HIV-1 mutant (T4) to illustrate the method.

  19. A Two-Way Street: Regulatory Interplay between RNA Polymerase and Nascent RNA Structure.

    Science.gov (United States)

    Zhang, Jinwei; Landick, Robert

    2016-04-01

    The vectorial (5'-to-3' at varying velocity) synthesis of RNA by cellular RNA polymerases (RNAPs) creates a rugged kinetic landscape, demarcated by frequent, sometimes long-lived, pauses. In addition to myriad gene-regulatory roles, these pauses temporally and spatially program the co-transcriptional, hierarchical folding of biologically active RNAs. Conversely, these RNA structures, which form inside or near the RNA exit channel, interact with the polymerase and adjacent protein factors to influence RNA synthesis by modulating pausing, termination, antitermination, and slippage. Here, we review the evolutionary origin, mechanistic underpinnings, and regulatory consequences of this interplay between RNAP and nascent RNA structure. We categorize and rationalize the extensive linkage between the transcriptional machinery and its product, and provide a framework for future studies. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Functional analysis of the cloverleaf-like structure in the 3' untranslated region of bamboo mosaic potexvirus RNA revealed dual roles in viral RNA replication and long distance movement

    International Nuclear Information System (INIS)

    Chen, I-H.; Meng Hsiao; Hsu, Y.-H.; Tsai, C.-H.

    2003-01-01

    The 3' untranslated region (UTR) of bamboo mosaic potexvirus (BaMV) RNA was identified to fold into a tertiary structure comprising a cloverleaf-like structure designated ABC domain followed by a major stem-loop D, which in turn is followed by a pseudoknot E and a poly(A) tail. The coat protein accumulation level of the mutant, BaMV40A/ΔABC, lacking ABC domain was just 15% that of wild-type when inoculated into protoplasts of Nicotiana benthamiana. This suggested that ABC domain might play an important role in BaMV RNA replication. To define the precise role of each of the three stem-loops of ABC domain in RNA replication, three mutants BaMV40B and C each lacking stem-loop A, B, and C, respectively, were created. Our results showed that accumulation of viral products of mutants BaMV40B and C were not as efficient as wild-type. On the contrary, level of accumulation of viral products of BaMVA was similar to that of wild-type in protoplasts and inoculated leaves. Interestingly, the accumulation of viral products was not as efficient as that of wild-type in systemic leaves, implying that stem-loop A is dispensable for replication, but signifies a role in systemic accumulation. Using UV cross-linking and competition experiments, it was demonstrated that the E. coli expressed helicase domain of BaMV ORF1 can preferentially interact with the ABC domain

  1. Prediction of RNA secondary structure using generalized centroid estimators.

    Science.gov (United States)

    Hamada, Michiaki; Kiryu, Hisanori; Sato, Kengo; Mituyama, Toutai; Asai, Kiyoshi

    2009-02-15

    Recent studies have shown that the methods for predicting secondary structures of RNAs on the basis of posterior decoding of the base-pairing probabilities has an advantage with respect to prediction accuracy over the conventionally utilized minimum free energy methods. However, there is room for improvement in the objective functions presented in previous studies, which are maximized in the posterior decoding with respect to the accuracy measures for secondary structures. We propose novel estimators which improve the accuracy of secondary structure prediction of RNAs. The proposed estimators maximize an objective function which is the weighted sum of the expected number of the true positives and that of the true negatives of the base pairs. The proposed estimators are also improved versions of the ones used in previous works, namely CONTRAfold for secondary structure prediction from a single RNA sequence and McCaskill-MEA for common secondary structure prediction from multiple alignments of RNA sequences. We clarify the relations between the proposed estimators and the estimators presented in previous works, and theoretically show that the previous estimators include additional unnecessary terms in the evaluation measures with respect to the accuracy. Furthermore, computational experiments confirm the theoretical analysis by indicating improvement in the empirical accuracy. The proposed estimators represent extensions of the centroid estimators proposed in Ding et al. and Carvalho and Lawrence, and are applicable to a wide variety of problems in bioinformatics. Supporting information and the CentroidFold software are available online at: http://www.ncrna.org/software/centroidfold/.

  2. Structure of Escherichia coli Hfq bound to polyriboadenylate RNA

    DEFF Research Database (Denmark)

    Link, Todd M; Valentin-Hansen, Poul; Brennan, Richard G

    2009-01-01

    (A) RNA, A(15). The structure reveals a unique RNA binding mechanism. Unlike uridine-containing sequences, which bind to the "proximal" face, the poly(A) tract binds to the "distal" face of Hfq using 6 tripartite binding motifs. Each motif consists of an adenosine specificity site (A site), which......Hfq is a small, highly abundant hexameric protein that is found in many bacteria and plays a critical role in mRNA expression and RNA stability. As an "RNA chaperone," Hfq binds AU-rich sequences and facilitates the trans annealing of small RNAs (sRNAs) to their target mRNAs, typically resulting...... in the down-regulation of gene expression. Hfq also plays a key role in bacterial RNA decay by binding tightly to polyadenylate [poly(A)] tracts. The structural mechanism by which Hfq recognizes and binds poly(A) is unknown. Here, we report the crystal structure of Escherichia coli Hfq bound to the poly...

  3. An automated procedure for covariation-based detection of RNA structure

    International Nuclear Information System (INIS)

    Winker, S.; Overbeek, R.; Woese, C.R.; Olsen, G.J.; Pfluger, N.

    1989-12-01

    This paper summarizes our investigations into the computational detection of secondary and tertiary structure of ribosomal RNA. We have developed a new automated procedure that not only identifies potential bondings of secondary and tertiary structure, but also provides the covariation evidence that supports the proposed bondings, and any counter-evidence that can be detected in the known sequences. A small number of previously unknown bondings have been detected in individual RNA molecules (16S rRNA and 7S RNA) through the use of our automated procedure. Currently, we are systematically studying mitochondrial rRNA. Our goal is to detect tertiary structure within 16S rRNA and quaternary structure between 16S and 23S rRNA. Our ultimate hope is that automated covariation analysis will contribute significantly to a refined picture of ribosome structure. Our colleagues in biology have begun experiments to test certain hypotheses suggested by an examination of our program's output. These experiments involve sequencing key portions of the 23S ribosomal RNA for species in which the known 16S ribosomal RNA exhibits variation (from the dominant pattern) at the site of a proposed bonding. The hope is that the 23S ribosomal RNA of these species will exhibit corresponding complementary variation or generalized covariation. 24 refs

  4. An automated procedure for covariation-based detection of RNA structure

    Energy Technology Data Exchange (ETDEWEB)

    Winker, S.; Overbeek, R.; Woese, C.R.; Olsen, G.J.; Pfluger, N.

    1989-12-01

    This paper summarizes our investigations into the computational detection of secondary and tertiary structure of ribosomal RNA. We have developed a new automated procedure that not only identifies potential bondings of secondary and tertiary structure, but also provides the covariation evidence that supports the proposed bondings, and any counter-evidence that can be detected in the known sequences. A small number of previously unknown bondings have been detected in individual RNA molecules (16S rRNA and 7S RNA) through the use of our automated procedure. Currently, we are systematically studying mitochondrial rRNA. Our goal is to detect tertiary structure within 16S rRNA and quaternary structure between 16S and 23S rRNA. Our ultimate hope is that automated covariation analysis will contribute significantly to a refined picture of ribosome structure. Our colleagues in biology have begun experiments to test certain hypotheses suggested by an examination of our program's output. These experiments involve sequencing key portions of the 23S ribosomal RNA for species in which the known 16S ribosomal RNA exhibits variation (from the dominant pattern) at the site of a proposed bonding. The hope is that the 23S ribosomal RNA of these species will exhibit corresponding complementary variation or generalized covariation. 24 refs.

  5. Structural and functional basis for RNA cleavage by Ire1

    Directory of Open Access Journals (Sweden)

    Stroud Robert M

    2011-07-01

    Full Text Available Abstract Background The unfolded protein response (UPR controls the protein folding capacity of the endoplasmic reticulum (ER. Central to this signaling pathway is the ER-resident bifunctional transmembrane kinase/endoribonuclease Ire1. The endoribonuclease (RNase domain of Ire1 initiates a non-conventional mRNA splicing reaction, leading to the production of a transcription factor that controls UPR target genes. The mRNA splicing reaction is an obligatory step of Ire1 signaling, yet its mechanism has remained poorly understood due to the absence of substrate-bound crystal structures of Ire1, the lack of structural similarity between Ire1 and other RNases, and a scarcity of quantitative enzymological data. Here, we experimentally define the active site of Ire1 RNase and quantitatively evaluate the contribution of the key active site residues to catalysis. Results This analysis and two new crystal structures suggest that Ire1 RNase uses histidine H1061 and tyrosine Y1043 as the general acid-general base pair contributing ≥ 7.6 kcal/mol and 1.4 kcal/mol to transition state stabilization, respectively, and asparagine N1057 and arginine R1056 for coordination of the scissile phosphate. Investigation of the stem-loop recognition revealed that additionally to the stem-loops derived from the classic Ire1 substrates HAC1 and Xbp1 mRNA, Ire1 can site-specifically and rapidly cleave anticodon stem-loop (ASL of unmodified tRNAPhe, extending known substrate specificity of Ire1 RNase. Conclusions Our data define the catalytic center of Ire1 RNase and suggest a mechanism of RNA cleavage: each RNase monomer apparently contains a separate catalytic apparatus for RNA cleavage, whereas two RNase subunits contribute to RNA stem-loop docking. Conservation of the key residues among Ire1 homologues suggests that the mechanism elucidated here for yeast Ire1 applies to Ire1 in metazoan cells, and to the only known Ire1 homologue RNase L.

  6. Primary structure, gene organization and polypeptide expression of poliovirus RNA

    Energy Technology Data Exchange (ETDEWEB)

    Kitamura, N. (State Univ. of New York, Stony Brook); Semler, B.L.; Rothberg, P.G.

    1981-06-18

    The primary structure of the poliovirus genome has been determined. The RNA molecule is 7433 nucleotides long, polyadenylated at the 3' terminus, and covalently linked to a small protein (VPg) at the 5' terminus. An open reading frame of 2207 consecutive triplets spans over 89% of the nucleotide sequence and codes for the viral polyprotein NCVPOO. Twelve viral polypeptides have been mapped by amino acid sequence analysis and were found to be proteolytic cleavage products of the polyprotein, cleavages occurring predominantly at Gln-Gly pairs.

  7. A Viral RNA Structural Element Alters Host Recognition of Nonself RNA

    Energy Technology Data Exchange (ETDEWEB)

    Hyde, J. L.; Gardner, C. L.; Kimura, T.; White, J. P.; Liu, G.; Trobaugh, D. W.; Huang, C.; Tonelli, M.; Paessler, S.; Takeda, K.; Klimstra, W. B.; Amarasinghe, G. K.; Diamond, M. S.

    2014-01-30

    Although interferon (IFN) signaling induces genes that limit viral infection, many pathogenic viruses overcome this host response. As an example, 2'-O methylation of the 5' cap of viral RNA subverts mammalian antiviral responses by evading restriction of Ifit1, an IFN-stimulated gene that regulates protein synthesis. However, alphaviruses replicate efficiently in cells expressing Ifit1 even though their genomic RNA has a 5' cap lacking 2'-O methylation. We show that pathogenic alphaviruses use secondary structural motifs within the 5' untranslated region (UTR) of their RNA to alter Ifit1 binding and function. Mutations within the 5'-UTR affecting RNA structural elements enabled restriction by or antagonism of Ifit1 in vitro and in vivo. These results identify an evasion mechanism by which viruses use RNA structural motifs to avoid immune restriction.

  8. Phylogeography, genetic variability and structure of Acanthamoeba metapopulations in Iran inferred by 18S ribosomal RNA sequences: A systematic review and meta-analysis.

    Science.gov (United States)

    Spotin, Adel; Moslemzadeh, Hamid Reza; Mahami-Oskouei, Mahmoud; Ahmadpour, Ehsan; Niyyati, Maryam; Hejazi, Seyed Hossein; Memari, Fatemeh; Noori, Jafar

    2017-09-01

    To verify phylogeography and genetic structure of Acanthamoeba populations among the Iranian clinical isolates and natural/artificial environments distributed in various regions of the country. We searched electronic databases including Medline, PubMed, Science Direct, Scopus and Google Scholar from 2005 to 2016. To explore the genetic variability of Acanthamoeba sp, 205 sequences were retrieved from keratitis patients, immunosuppressed cases and environmental sources as of various geographies of Iran. T4 genotype was the predominant strain in Iran, and the rare genotypes belonged to T2, T3, T5 (Acanthamoeba lenticulata), T6, T9, T11, T13 and T15 (Acanthamoeba jacobsi). A total of 47 unique haplotypes of T4 were identified. A parsimonious network of the sequence haplotypes demonstrated star-like feature containing haplogroups IR6 (34.1%) and IR7 (31.2%) as the most common haplotypes. In accordance with the analysis of molecular variance, the high value of haplotype diversity (0.612-0.848) of Acanthamoeba T4 represented genetic variability within populations. Neutrality indices of the 18S ribosomal RNA demonstrated negative values in all populations which represented a considerable divergence from neutrality. The majority of genetic diversity belonged to the infected contact lens and dust samples in immunodeficiency and ophthalmology wards, which indicated potential routes for exposure to a pathogenic Acanthamoeba sp. in at-risk individuals. A pairwise fixation index (F ST ) was from low to high values (0.02433-0.41892). The statistically F ST points out that T4 is genetically differentiated between north-west, north-south and central-south metapopulations, but not differentiated between west-central, west-south, central-south, and north-central isolates. An occurrence of IR6 and IR7 displays that possibly a gene flow of Acanthamoeba T4 occurred after the founder effect or bottleneck experience through ecological changes or host mobility. This is the first

  9. Evolutionary rate variation and RNA secondary structure prediction

    DEFF Research Database (Denmark)

    Knudsen, B.; Andersen, E.S.; Damgaard, C.

    2004-01-01

    Predicting RNA secondary structure using evolutionary history can be carried out by using an alignment of related RNA sequences with conserved structure. Accurately determining evolutionary substitution rates for base pairs and single stranded nucleotides is a concern for methods based on this type...... by applying rates derived from tRNA and rRNA to the prediction of the much more rapidly evolving 5'-region of HIV-1. We find that the HIV-1 prediction is in agreement with experimental data, even though the relative evolutionary rate between A and G is significantly increased, both in stem and loop regions...

  10. BEAM web server: a tool for structural RNA motif discovery.

    Science.gov (United States)

    Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2018-03-15

    RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable to handle tens of thousands of RNA in input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated to a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure in a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.

  11. The Pekin duck programmed death-ligand 1: cDNA cloning, genomic structure, molecular characterization and mRNA expression analysis.

    Science.gov (United States)

    Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S

    2015-04-01

    Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.

  12. RNA-TVcurve: a Web server for RNA secondary structure comparison based on a multi-scale similarity of its triple vector curve representation.

    Science.gov (United States)

    Li, Ying; Shi, Xiaohu; Liang, Yanchun; Xie, Juan; Zhang, Yu; Ma, Qin

    2017-01-21

    RNAs have been found to carry diverse functionalities in nature. Inferring the similarity between two given RNAs is a fundamental step to understand and interpret their functional relationship. The majority of functional RNAs show conserved secondary structures, rather than sequence conservation. Those algorithms relying on sequence-based features usually have limitations in their prediction performance. Hence, integrating RNA structure features is very critical for RNA analysis. Existing algorithms mainly fall into two categories: alignment-based and alignment-free. The alignment-free algorithms of RNA comparison usually have lower time complexity than alignment-based algorithms. An alignment-free RNA comparison algorithm was proposed, in which novel numerical representations RNA-TVcurve (triple vector curve representation) of RNA sequence and corresponding secondary structure features are provided. Then a multi-scale similarity score of two given RNAs was designed based on wavelet decomposition of their numerical representation. In support of RNA mutation and phylogenetic analysis, a web server (RNA-TVcurve) was designed based on this alignment-free RNA comparison algorithm. It provides three functional modules: 1) visualization of numerical representation of RNA secondary structure; 2) detection of single-point mutation based on secondary structure; and 3) comparison of pairwise and multiple RNA secondary structures. The inputs of the web server require RNA primary sequences, while corresponding secondary structures are optional. For the primary sequences alone, the web server can compute the secondary structures using free energy minimization algorithm in terms of RNAfold tool from Vienna RNA package. RNA-TVcurve is the first integrated web server, based on an alignment-free method, to deliver a suite of RNA analysis functions, including visualization, mutation analysis and multiple RNAs structure comparison. The comparison results with two popular RNA

  13. DNA structure in human RNA polymerase II promoters

    DEFF Research Database (Denmark)

    Pedersen, Anders Gorm; Baldi, Pierre; Chauvin, Yves

    1998-01-01

    with a very low level of sequence similarity. The sequences, which include both TATA-containing and TATA-less promoters, are aligned by hidden Markov models. Using three different models of sequence-derived DNA bendability, the aligned promoters display a common structural profile with bendability being low...... protein in a manner reminiscent of DNA in a nucleosome. This notion is further supported by the finding that the periodic bendability is caused mainly by the complementary triplet pairs CAG/CTG and GGC/GCC, which previously have been found to correlate with nucleosome positioning. We present models where......The fact that DNA three-dimensional structure is important for transcriptional regulation begs the question of whether eukaryotic promoters contain general structural features independently of what genes they control. We present an analysis of a large set of human RNA polymerase II promoters...

  14. CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction

    Science.gov (United States)

    Puton, Tomasz; Kozlowski, Lukasz P.; Rother, Kristian M.; Bujnicki, Janusz M.

    2013-01-01

    We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks. PMID:23435231

  15. CompaRNA: a server for continuous benchmarking of automated methods for RNA secondary structure prediction.

    Science.gov (United States)

    Puton, Tomasz; Kozlowski, Lukasz P; Rother, Kristian M; Bujnicki, Janusz M

    2013-04-01

    We present a continuous benchmarking approach for the assessment of RNA secondary structure prediction methods implemented in the CompaRNA web server. As of 3 October 2012, the performance of 28 single-sequence and 13 comparative methods has been evaluated on RNA sequences/structures released weekly by the Protein Data Bank. We also provide a static benchmark generated on RNA 2D structures derived from the RNAstrand database. Benchmarks on both data sets offer insight into the relative performance of RNA secondary structure prediction methods on RNAs of different size and with respect to different types of structure. According to our tests, on the average, the most accurate predictions obtained by a comparative approach are generated by CentroidAlifold, MXScarna, RNAalifold and TurboFold. On the average, the most accurate predictions obtained by single-sequence analyses are generated by CentroidFold, ContextFold and IPknot. The best comparative methods typically outperform the best single-sequence methods if an alignment of homologous RNA sequences is available. This article presents the results of our benchmarks as of 3 October 2012, whereas the rankings presented online are continuously updated. We will gladly include new prediction methods and new measures of accuracy in the new editions of CompaRNA benchmarks.

  16. RNA secondary structures of the bacteriophage phi6 packaging regions.

    OpenAIRE

    Pirttimaa, M J; Bamford, D H

    2000-01-01

    Bacteriophage phi6 genome consists of three segments of double-stranded RNA. During maturation, single-stranded copies of these segments are packaged into preformed polymerase complex particles. Only phi6 RNA is packaged, and each particle contains only one copy of each segment. An in vitro packaging and replication assay has been developed for phi6, and the packaging signals (pac sites) have been mapped to the 5' ends of the RNA segments. In this study, we propose secondary structure models ...

  17. Structure of an RNA dimer of a regulatory element from human thymidylate synthase mRNA

    OpenAIRE

    Dibrov, Sergey; McLean, Jaime; Hermann, Thomas

    2011-01-01

    An oligonucleotide representing a regulatory element of human thymidylate synthase mRNA has been crystallized as a dimer. The structure of the asymmetric dimer has been determined at 1.97 Å resolution.

  18. Improved measurements of RNA structure conservation with generalized centroid estimators

    Directory of Open Access Journals (Sweden)

    Yohei eOkada

    2011-08-01

    Full Text Available Identification of non-protein-coding RNAs (ncRNAs in genomes is acrucial task for not only molecular cell biology but alsobioinformatics. Secondary structures of ncRNAs are employed as a keyfeature of ncRNA analysis since biological functions of ncRNAs aredeeply related to their secondary structures. Although the minimumfree energy (MFE structure of an RNA sequence is regarded as the moststable structure, MFE alone could not be an appropriate measure foridentifying ncRNAs since the free energy is heavily biased by thenucleotide composition. Therefore, instead of MFE itself, severalalternative measures for identifying ncRNAs have been proposed such asthe structure conservation index (SCI and the base pair distance(BPD, both of which employ MFE structures. However, thesemeasurements are unfortunately not suitable for identifying ncRNAs insome cases including the genome-wide search and incur high falsediscovery rate. In this study, we propose improved measurements basedon SCI and BPD, applying generalized centroid estimators toincorporate the robustness against low quality multiple alignments.Our experiments show that our proposed methods achieve higher accuracythan the original SCI and BPD for not only human-curated structuralalignments but also low quality alignments produced by CLUSTALW. Furthermore, the centroid-based SCI on CLUSTAL W alignments is moreaccurate than or comparable with that of the original SCI onstructural alignments generated with RAF, a high quality structuralaligner, for which two-fold expensive computational time is requiredon average. We conclude that our methods are more suitable forgenome-wide alignments which are of low quality from the point of viewon secondary structures than the original SCI and BPD.

  19. Automated identification of RNA 3D modules with discriminative power in RNA structural alignments

    DEFF Research Database (Denmark)

    Theis, Corinna; Höner zu Siederdissen, Christian; Hofacker, Ivo L.

    2013-01-01

    Recent progress in predicting RNA structure is moving towards filling the 'gap' in 2D RNA structure prediction where, for example, predicted internal loops often form non-canonical base pairs. This is increasingly recognized with the steady increase of known RNA 3D modules. There is a general...... comparative evidence. Subsequently, the modules, initially represented by a graph, are turned into models for the RMDetect program, which allows to test their discriminative power using real and randomized Rfam alignments. An initial extraction of 22495 3D modules in all PDB files results in 977 internal loop...

  20. High-Throughput Sequencing Based Methods of RNA Structure Investigation

    DEFF Research Database (Denmark)

    Kielpinski, Lukasz Jan

    In this thesis we describe the development of four related methods for RNA structure probing that utilize massive parallel sequencing. Using them, we were able to gather structural data for multiple, long molecules simultaneously. First, we have established an easy to follow experimental...... and computational protocol for detecting the reverse transcription termination sites (RTTS-Seq). This protocol was subsequently applied to hydroxyl radical footprinting of three dimensional RNA structures to give a probing signal that correlates well with the RNA backbone solvent accessibility. Moreover, we applied...

  1. MultiSETTER: web server for multiple RNA structure comparison.

    Science.gov (United States)

    Čech, Petr; Hoksza, David; Svozil, Daniel

    2015-08-12

    Understanding the architecture and function of RNA molecules requires methods for comparing and analyzing their tertiary and quaternary structures. While structural superposition of short RNAs is achievable in a reasonable time, large structures represent much bigger challenge. Therefore, we have developed a fast and accurate algorithm for RNA pairwise structure superposition called SETTER and implemented it in the SETTER web server. However, though biological relationships can be inferred by a pairwise structure alignment, key features preserved by evolution can be identified only from a multiple structure alignment. Thus, we extended the SETTER algorithm to the alignment of multiple RNA structures and developed the MultiSETTER algorithm. In this paper, we present the updated version of the SETTER web server that implements a user friendly interface to the MultiSETTER algorithm. The server accepts RNA structures either as the list of PDB IDs or as user-defined PDB files. After the superposition is computed, structures are visualized in 3D and several reports and statistics are generated. To the best of our knowledge, the MultiSETTER web server is the first publicly available tool for a multiple RNA structure alignment. The MultiSETTER server offers the visual inspection of an alignment in 3D space which may reveal structural and functional relationships not captured by other multiple alignment methods based either on a sequence or on secondary structure motifs.

  2. A combinatorial enumeration problem of RNA secondary structures

    African Journals Online (AJOL)

    use

    2011-12-21

    Dec 21, 2011 ... interesting combinatorial questions (Chen et al., 2005;. Liu, 2006; Schmitt and Waterman 1994; Stein and. Waterman 1978). The research on the enumeration of. RNA secondary structures becomes one of the hot topics in Computational Molecular Biology. An RNA molecule is described by its sequences of.

  3. Computing the Partition Function for Kinetically Trapped RNA Secondary Structures

    Science.gov (United States)

    Lorenz, William A.; Clote, Peter

    2011-01-01

    An RNA secondary structure is locally optimal if there is no lower energy structure that can be obtained by the addition or removal of a single base pair, where energy is defined according to the widely accepted Turner nearest neighbor model. Locally optimal structures form kinetic traps, since any evolution away from a locally optimal structure must involve energetically unfavorable folding steps. Here, we present a novel, efficient algorithm to compute the partition function over all locally optimal secondary structures of a given RNA sequence. Our software, RNAlocopt runs in time and space. Additionally, RNAlocopt samples a user-specified number of structures from the Boltzmann subensemble of all locally optimal structures. We apply RNAlocopt to show that (1) the number of locally optimal structures is far fewer than the total number of structures – indeed, the number of locally optimal structures approximately equal to the square root of the number of all structures, (2) the structural diversity of this subensemble may be either similar to or quite different from the structural diversity of the entire Boltzmann ensemble, a situation that depends on the type of input RNA, (3) the (modified) maximum expected accuracy structure, computed by taking into account base pairing frequencies of locally optimal structures, is a more accurate prediction of the native structure than other current thermodynamics-based methods. The software RNAlocopt constitutes a technical breakthrough in our study of the folding landscape for RNA secondary structures. For the first time, locally optimal structures (kinetic traps in the Turner energy model) can be rapidly generated for long RNA sequences, previously impossible with methods that involved exhaustive enumeration. Use of locally optimal structure leads to state-of-the-art secondary structure prediction, as benchmarked against methods involving the computation of minimum free energy and of maximum expected accuracy. Web server

  4. Computing the partition function for kinetically trapped RNA secondary structures.

    Directory of Open Access Journals (Sweden)

    William A Lorenz

    Full Text Available An RNA secondary structure is locally optimal if there is no lower energy structure that can be obtained by the addition or removal of a single base pair, where energy is defined according to the widely accepted Turner nearest neighbor model. Locally optimal structures form kinetic traps, since any evolution away from a locally optimal structure must involve energetically unfavorable folding steps. Here, we present a novel, efficient algorithm to compute the partition function over all locally optimal secondary structures of a given RNA sequence. Our software, RNAlocopt runs in O(n3 time and O(n2 space. Additionally, RNAlocopt samples a user-specified number of structures from the Boltzmann subensemble of all locally optimal structures. We apply RNAlocopt to show that (1 the number of locally optimal structures is far fewer than the total number of structures--indeed, the number of locally optimal structures approximately equal to the square root of the number of all structures, (2 the structural diversity of this subensemble may be either similar to or quite different from the structural diversity of the entire Boltzmann ensemble, a situation that depends on the type of input RNA, (3 the (modified maximum expected accuracy structure, computed by taking into account base pairing frequencies of locally optimal structures, is a more accurate prediction of the native structure than other current thermodynamics-based methods. The software RNAlocopt constitutes a technical breakthrough in our study of the folding landscape for RNA secondary structures. For the first time, locally optimal structures (kinetic traps in the Turner energy model can be rapidly generated for long RNA sequences, previously impossible with methods that involved exhaustive enumeration. Use of locally optimal structure leads to state-of-the-art secondary structure prediction, as benchmarked against methods involving the computation of minimum free energy and of maximum expected

  5. RNACompress: Grammar-based compression and informational complexity measurement of RNA secondary structure

    Directory of Open Access Journals (Sweden)

    Chen Chun

    2008-03-01

    Full Text Available Abstract Background With the rapid emergence of RNA databases and newly identified non-coding RNAs, an efficient compression algorithm for RNA sequence and structural information is needed for the storage and analysis of such data. Although several algorithms for compressing DNA sequences have been proposed, none of them are suitable for the compression of RNA sequences with their secondary structures simultaneously. This kind of compression not only facilitates the maintenance of RNA data, but also supplies a novel way to measure the informational complexity of RNA structural data, raising the possibility of studying the relationship between the functional activities of RNA structures and their complexities, as well as various structural properties of RNA based on compression. Results RNACompress employs an efficient grammar-based model to compress RNA sequences and their secondary structures. The main goals of this algorithm are two fold: (1 present a robust and effective way for RNA structural data compression; (2 design a suitable model to represent RNA secondary structure as well as derive the informational complexity of the structural data based on compression. Our extensive tests have shown that RNACompress achieves a universally better compression ratio compared with other sequence-specific or common text-specific compression algorithms, such as Gencompress, winrar and gzip. Moreover, a test of the activities of distinct GTP-binding RNAs (aptamers compared with their structural complexity shows that our defined informational complexity can be used to describe how complexity varies with activity. These results lead to an objective means of comparing the functional properties of heteropolymers from the information perspective. Conclusion A universal algorithm for the compression of RNA secondary structure as well as the evaluation of its informational complexity is discussed in this paper. We have developed RNACompress, as a useful tool

  6. Ensemble-based prediction of RNA secondary structures.

    Science.gov (United States)

    Aghaeepour, Nima; Hoos, Holger H

    2013-04-24

    Accurate structure prediction methods play an important role for the understanding of RNA function. Energy-based, pseudoknot-free secondary structure prediction is one of the most widely used and versatile approaches, and improved methods for this task have received much attention over the past five years. Despite the impressive progress that as been achieved in this area, existing evaluations of the prediction accuracy achieved by various algorithms do not provide a comprehensive, statistically sound assessment. Furthermore, while there is increasing evidence that no prediction algorithm consistently outperforms all others, no work has been done to exploit the complementary strengths of multiple approaches. In this work, we present two contributions to the area of RNA secondary structure prediction. Firstly, we use state-of-the-art, resampling-based statistical methods together with a previously published and increasingly widely used dataset of high-quality RNA structures to conduct a comprehensive evaluation of existing RNA secondary structure prediction procedures. The results from this evaluation clarify the performance relationship between ten well-known existing energy-based pseudoknot-free RNA secondary structure prediction methods and clearly demonstrate the progress that has been achieved in recent years. Secondly, we introduce AveRNA, a generic and powerful method for combining a set of existing secondary structure prediction procedures into an ensemble-based method that achieves significantly higher prediction accuracies than obtained from any of its component procedures. Our new, ensemble-based method, AveRNA, improves the state of the art for energy-based, pseudoknot-free RNA secondary structure prediction by exploiting the complementary strengths of multiple existing prediction procedures, as demonstrated using a state-of-the-art statistical resampling approach. In addition, AveRNA allows an intuitive and effective control of the trade-off between

  7. DCJ-RNA - double cut and join for RNA secondary structures.

    Science.gov (United States)

    Badr, Ghada H; Al-Aqel, Haifa A

    2017-10-16

    Genome rearrangements are essential processes for evolution and are responsible for existing varieties of genome architectures. Many studies have been conducted to obtain an algorithm that identifies the minimum number of inversions that are necessary to transform one genome into another; this allows for genome sequence representation in polynomial time. Studies have not been conducted on the topic of rearranging a genome when it is represented as a secondary structure. Unlike sequences, the secondary structure preserves the functionality of the genome. Sequences can be different, but they all share the same structure and, therefore, the same functionality. This paper proposes a double cut and join for RNA secondary structures (DCJ-RNA) algorithm. This algorithm allows for the description of evolutionary scenarios that are based on secondary structures rather than sequences. The main aim of this paper is to suggest an efficient algorithm that can help researchers compare two ribonucleic acid (RNA) secondary structures based on rearrangement operations. The results, which are based on real datasets, show that the algorithm is able to count the minimum number of rearrangement operations, as well as to report an optimum scenario that can increase the similarity between the two structures. The algorithm calculates the distance between structures and reports a scenario based on the minimum rearrangement operations required to make the given structure similar to the other. DCJ-RNA can also be used to measure the distance between the two structures. This can help identify the common functionalities between different species.

  8. Structural basis of RNA folding and recognition in an AMP-RNA aptamer complex.

    Science.gov (United States)

    Jiang, F; Kumar, R A; Jones, R A; Patel, D J

    1996-07-11

    The catalytic properties of RNA and its well known role in gene expression and regulation are the consequence of its unique solution structures. Identification of the structural determinants of ligand recognition by RNA molecules is of fundamental importance for understanding the biological functions of RNA, as well as for the rational design of RNA Sequences with specific catalytic activities. Towards this latter end, Szostak et al. used in vitro selection techniques to isolate RNA sequences ('aptamers') containing a high-affinity binding site for ATP, the universal currency of cellular energy, and then used this motif to engineer ribozymes with polynucleotide kinase activity. Here we present the solution structure, as determined by multidimensional NMR spectroscopy and molecular dynamics calculations, of both uniformly and specifically 13C-, 15N-labelled 40-mer RNA containing the ATP-binding motif complexed with AMP. The aptamer adopts an L-shaped structure with two nearly orthogonal stems, each capped proximally by a G x G mismatch pair, binding the AMP ligand at their junction in a GNRA-like motif.

  9. Topological classification and enumeration of RNA structures by genus

    DEFF Research Database (Denmark)

    Andersen, Joergen Ellegard; Penner, Robert C.; Reidys, Christian

    2013-01-01

    To an RNA pseudoknot structure is naturally associated a topological surface, which has its associated genus, and structures can thus be classified by the genus. Based on earlier work of Harer-Zagier, we compute the generating function for the number of those structures of fixed genus and minimum...

  10. Structural insights into RNA processing by the human RISC-loading complex.

    Science.gov (United States)

    Wang, Hong-Wei; Noland, Cameron; Siridechadilok, Bunpote; Taylor, David W; Ma, Enbo; Felderer, Karin; Doudna, Jennifer A; Nogales, Eva

    2009-11-01

    Targeted gene silencing by RNA interference (RNAi) requires loading of a short guide RNA (small interfering RNA (siRNA) or microRNA (miRNA)) onto an Argonaute protein to form the functional center of an RNA-induced silencing complex (RISC). In humans, Argonaute2 (AGO2) assembles with the guide RNA-generating enzyme Dicer and the RNA-binding protein TRBP to form a RISC-loading complex (RLC), which is necessary for efficient transfer of nascent siRNAs and miRNAs from Dicer to AGO2. Here, using single-particle EM analysis, we show that human Dicer has an L-shaped structure. The RLC Dicer's N-terminal DExH/D domain, located in a short 'base branch', interacts with TRBP, whereas its C-terminal catalytic domains in the main body are proximal to AGO2. A model generated by docking the available atomic structures of Dicer and Argonaute homologs into the RLC reconstruction suggests a mechanism for siRNA transfer from Dicer to AGO2.

  11. HD-RNAS: An automated hierarchical database of RNA structures

    Directory of Open Access Journals (Sweden)

    Shubhra Sankar eRay

    2012-04-01

    Full Text Available One of the important goals of most biological investigations is to classify and organize the experimental findings so that they are readily useful for deriving generalized rules. Although there is a huge amount of information on RNA structures in PDB, there are redundant files, ambiguous synthetic sequences etc. Moreover, a systematic hierarchical organization, reflecting RNA classification, is missing in PDB. In this investigation, we have classified all the available RNA crystal structures from PDB through a programmatic approach. Hence, it would be now a simple assignment to regularly update the classification as and when new structures are released. The classification can further determine (i a non-redundant set of RNA structures and (ii if available, a set of structures of identical sequence and function, which can highlight structural polymorphism, ligand-induced conformational alterations etc. Presently, we have classified the available structures (2095 PDB entries having RNA chain longer than 9 nucleotides solved by X-ray crystallography or NMR spectroscopy into nine functional classes. The structures of same function and same source are mostly seen to be similar with subtle differences depending on their functional complexation. The web-server is available online at http://www.saha.ac.in/biop/www/HD-RNAS.html and is updated regularly.

  12. RNA secondary structures of the bacteriophage phi6 packaging regions.

    Science.gov (United States)

    Pirttimaa, M J; Bamford, D H

    2000-06-01

    Bacteriophage phi6 genome consists of three segments of double-stranded RNA. During maturation, single-stranded copies of these segments are packaged into preformed polymerase complex particles. Only phi6 RNA is packaged, and each particle contains only one copy of each segment. An in vitro packaging and replication assay has been developed for phi6, and the packaging signals (pac sites) have been mapped to the 5' ends of the RNA segments. In this study, we propose secondary structure models for the pac sites of phi6 single-stranded RNA segments. Our models accommodate data from structure-specific chemical modifications, free energy minimizations, and phylogenetic comparisons. Previously reported pac site deletion studies are also discussed. Each pac site possesses a unique architecture, that, however, contains common structural elements.

  13. Integrated structural biology to unravel molecular mechanisms of protein-RNA recognition.

    Science.gov (United States)

    Schlundt, Andreas; Tants, Jan-Niklas; Sattler, Michael

    2017-04-15

    Recent advances in RNA sequencing technologies have greatly expanded our knowledge of the RNA landscape in cells, often with spatiotemporal resolution. These techniques identified many new (often non-coding) RNA molecules. Large-scale studies have also discovered novel RNA binding proteins (RBPs), which exhibit single or multiple RNA binding domains (RBDs) for recognition of specific sequence or structured motifs in RNA. Starting from these large-scale approaches it is crucial to unravel the molecular principles of protein-RNA recognition in ribonucleoprotein complexes (RNPs) to understand the underlying mechanisms of gene regulation. Structural biology and biophysical studies at highest possible resolution are key to elucidate molecular mechanisms of RNA recognition by RBPs and how conformational dynamics, weak interactions and cooperative binding contribute to the formation of specific, context-dependent RNPs. While large compact RNPs can be well studied by X-ray crystallography and cryo-EM, analysis of dynamics and weak interaction necessitates the use of solution methods to capture these properties. Here, we illustrate methods to study the structure and conformational dynamics of protein-RNA complexes in solution starting from the identification of interaction partners in a given RNP. Biophysical and biochemical techniques support the characterization of a protein-RNA complex and identify regions relevant in structural analysis. Nuclear magnetic resonance (NMR) is a powerful tool to gain information on folding, stability and dynamics of RNAs and characterize RNPs in solution. It provides crucial information that is complementary to the static pictures derived from other techniques. NMR can be readily combined with other solution techniques, such as small angle X-ray and/or neutron scattering (SAXS/SANS), electron paramagnetic resonance (EPR), and Förster resonance energy transfer (FRET), which provide information about overall shapes, internal domain

  14. RNA secondary structure image - fRNAdb | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us fRNAdb RNA secondary structure image Data detail Data name RNA secondary structure image DOI... 10.18908/lsdba.nbdc00452-005 Description of data contents RNA secondary structure images - png.zip: RNA secondary structure image...s (PNG) - pdf.zip: RNA secondary structure images (PDF) - thumbnail.zip: Thumbnails of... RNA secondary structure images Data file File name: RNA_secondary_structure_image... File URL: ftp://ftp.biosciencedbc.jp/archive/frnadb/LATEST/RNA_secondary_structure_image File size: 9.6 GB

  15. RNA amplification for successful gene profiling analysis

    Directory of Open Access Journals (Sweden)

    Wang Ena

    2005-07-01

    Full Text Available Abstract The study of clinical samples is often limited by the amount of material available to study. While proteins cannot be multiplied in their natural form, DNA and RNA can be amplified from small specimens and used for high-throughput analyses. Therefore, genetic studies offer the best opportunity to screen for novel insights of human pathology when little material is available. Precise estimates of DNA copy numbers in a given specimen are necessary. However, most studies investigate static variables such as the genetic background of patients or mutations within pathological specimens without a need to assess proportionality of expression among different genes throughout the genome. Comparative genomic hybridization of DNA samples represents a crude exception to this rule since genomic amplification or deletion is compared among different specimens directly. For gene expression analysis, however, it is critical to accurately estimate the proportional expression of distinct RNA transcripts since such proportions directly govern cell function by modulating protein expression. Furthermore, comparative estimates of relative RNA expression at different time points portray the response of cells to environmental stimuli, indirectly informing about broader biological events affecting a particular tissue in physiological or pathological conditions. This cognitive reaction of cells is similar to the detection of electroencephalographic patterns which inform about the status of the brain in response to external stimuli. As our need to understand human pathophysiology at the global level increases, the development and refinement of technologies for high fidelity messenger RNA amplification have become the focus of increasing interest during the past decade. The need to increase the abundance of RNA has been met not only for gene specific amplification, but, most importantly for global transcriptome wide, unbiased amplification. Now gene

  16. In vivo genome-wide profiling of RNA secondary structure reveals novel regulatory features.

    Science.gov (United States)

    Ding, Yiliang; Tang, Yin; Kwok, Chun Kit; Zhang, Yu; Bevilacqua, Philip C; Assmann, Sarah M

    2014-01-30

    RNA structure has critical roles in processes ranging from ligand sensing to the regulation of translation, polyadenylation and splicing. However, a lack of genome-wide in vivo RNA structural data has limited our understanding of how RNA structure regulates gene expression in living cells. Here we present a high-throughput, genome-wide in vivo RNA structure probing method, structure-seq, in which dimethyl sulphate methylation of unprotected adenines and cytosines is identified by next-generation sequencing. Application of this method to Arabidopsis thaliana seedlings yielded the first in vivo genome-wide RNA structure map at nucleotide resolution for any organism, with quantitative structural information across more than 10,000 transcripts. Our analysis reveals a three-nucleotide periodic repeat pattern in the structure of coding regions, as well as a less-structured region immediately upstream of the start codon, and shows that these features are strongly correlated with translation efficiency. We also find patterns of strong and weak secondary structure at sites of alternative polyadenylation, as well as strong secondary structure at 5' splice sites that correlates with unspliced events. Notably, in vivo structures of messenger RNAs annotated for stress responses are poorly predicted in silico, whereas mRNA structures of genes related to cell function maintenance are well predicted. Global comparison of several structural features between these two categories shows that the mRNAs associated with stress responses tend to have more single-strandedness, longer maximal loop length and higher free energy per nucleotide, features that may allow these RNAs to undergo conformational changes in response to environmental conditions. Structure-seq allows the RNA structurome and its biological roles to be interrogated on a genome-wide scale and should be applicable to any organism.

  17. Structure of noncoding RNA is a determinant of function of RNA binding proteins in transcriptional regulation

    Directory of Open Access Journals (Sweden)

    Oyoshi Takanori

    2012-01-01

    Full Text Available Abstract The majority of the noncoding regions of mammalian genomes have been found to be transcribed to generate noncoding RNAs (ncRNAs, resulting in intense interest in their biological roles. During the past decade, numerous ncRNAs and aptamers have been identified as regulators of transcription. 6S RNA, first described as a ncRNA in E. coli, mimics an open promoter structure, which has a large bulge with two hairpin/stalk structures that regulate transcription through interactions with RNA polymerase. B2 RNA, which has stem-loops and unstructured single-stranded regions, represses transcription of mRNA in response to various stresses, including heat shock in mouse cells. The interaction of TLS (translocated in liposarcoma with CBP/p300 was induced by ncRNAs that bind to TLS, and this in turn results in inhibition of CBP/p300 histone acetyltransferase (HAT activity in human cells. Transcription regulator EWS (Ewing's sarcoma, which is highly related to TLS, and TLS specifically bind to G-quadruplex structures in vitro. The carboxy terminus containing the Arg-Gly-Gly (RGG repeat domains in these proteins are necessary for cis-repression of transcription activation and HAT activity by the N-terminal glutamine-rich domain. Especially, the RGG domain in the carboxy terminus of EWS is important for the G-quadruplex specific binding. Together, these data suggest that functions of EWS and TLS are modulated by specific structures of ncRNAs.

  18. Trans-acting RNAs as molecular probes for monitoring time-dependent structural change of an RNA complex adapting two structures.

    Science.gov (United States)

    Maeda, Yuri; Furuta, Hiroyuki; Ikawa, Yoshiya

    2011-03-01

    As dynamic structural changes are pivotal for the functions of some classes of RNA molecule, it is important to develop methods to monitor structural changes in RNA in a time-dependent manner without chemical modification. Based on previous reports that trans-acting RNAs can be used as probes for analysis and control of 3D structures of target RNAs, we applied this method to monitor time-dependent structural changes in RNA. We designed and performed a proof-of-principle study using a simple model RNA complex that adopts two different structures as a target. The time-dependent structural changes in the target RNA were successfully monitored using two trans-acting RNAs, which stably form a ternary complex with the bimolecular target RNA and act as a catalyst to join two RNA fragments of the target complex, respectively. Copyright © 2010 The Society for Biotechnology, Japan. Published by Elsevier B.V. All rights reserved.

  19. Study of RNA structures with a connection to random matrix theory

    International Nuclear Information System (INIS)

    Bhadola, Pradeep; Deo, Nivedita

    2015-01-01

    This manuscript investigates the level of complexity and thermodynamic properties of the real RNA structures and compares the properties with the random RNA sequences. A discussion on the similarities of thermodynamical properties of the real structures with the non linear random matrix model of RNA folding is presented. The structural information contained in the PDB file is exploited to get the base pairing information. The complexity of an RNA structure is defined by a topological quantity called genus which is calculated from the base pairing information. Thermodynamic analysis of the real structures is done numerically. The real structures have a minimum free energy which is very small compared to the randomly generated sequences of the same length. This analysis suggests that there are specific patterns in the structures which are preserved during the evolution of the sequences and certain sequences are discarded by the evolutionary process. Further analyzing the sequences of a fixed length reveal that the RNA structures exist in ensembles i.e. although all the sequences in the ensemble have different series of nucleotides (sequence) they fold into structures that have the same pairs of hydrogen bonding as well as the same minimum free energy. The specific heat of the RNA molecule is numerically estimated at different lengths. The specific heat curve with temperature shows a bump and for some RNA, a double peak behavior is observed. The same behavior is seen in the study of the random matrix model with non linear interaction of RNA folding. The bump in the non linear matrix model can be controlled by the change in the interaction strength.

  20. Correlation of RNA secondary structure statistics with thermodynamic stability and applications to folding.

    Science.gov (United States)

    Wu, Johnny C; Gardner, David P; Ozer, Stuart; Gutell, Robin R; Ren, Pengyu

    2009-08-28

    The accurate prediction of the secondary and tertiary structure of an RNA with different folding algorithms is dependent on several factors, including the energy functions. However, an RNA higher-order structure cannot be predicted accurately from its sequence based on a limited set of energy parameters. The inter- and intramolecular forces between this RNA and other small molecules and macromolecules, in addition to other factors in the cell such as pH, ionic strength, and temperature, influence the complex dynamics associated with transition of a single stranded RNA to its secondary and tertiary structure. Since all of the factors that affect the formation of an RNAs 3D structure cannot be determined experimentally, statistically derived potential energy has been used in the prediction of protein structure. In the current work, we evaluate the statistical free energy of various secondary structure motifs, including base-pair stacks, hairpin loops, and internal loops, using their statistical frequency obtained from the comparative analysis of more than 50,000 RNA sequences stored in the RNA Comparative Analysis Database (rCAD) at the Comparative RNA Web (CRW) Site. Statistical energy was computed from the structural statistics for several datasets. While the statistical energy for a base-pair stack correlates with experimentally derived free energy values, suggesting a Boltzmann-like distribution, variation is observed between different molecules and their location on the phylogenetic tree of life. Our statistical energy values calculated for several structural elements were utilized in the Mfold RNA-folding algorithm. The combined statistical energy values for base-pair stacks, hairpins and internal loop flanks result in a significant improvement in the accuracy of secondary structure prediction; the hairpin flanks contribute the most.

  1. Reconstruction of ancestral RNA sequences under multiple structural constraints

    Directory of Open Access Journals (Sweden)

    Olivier Tremblay-Savard

    2016-11-01

    Full Text Available Abstract Background Secondary structures form the scaffold of multiple sequence alignment of non-coding RNA (ncRNA families. An accurate reconstruction of ancestral ncRNAs must use this structural signal. However, the inference of ancestors of a single ncRNA family with a single consensus structure may bias the results towards sequences with high affinity to this structure, which are far from the true ancestors. Methods In this paper, we introduce achARNement, a maximum parsimony approach that, given two alignments of homologous ncRNA families with consensus secondary structures and a phylogenetic tree, simultaneously calculates ancestral RNA sequences for these two families. Results We test our methodology on simulated data sets, and show that achARNement outperforms classical maximum parsimony approaches in terms of accuracy, but also reduces by several orders of magnitude the number of candidate sequences. To conclude this study, we apply our algorithms on the Glm clan and the FinP-traJ clan from the Rfam database. Conclusions Our results show that our methods reconstruct small sets of high-quality candidate ancestors with better agreement to the two target structures than with classical approaches. Our program is freely available at: http://csb.cs.mcgill.ca/acharnement .

  2. Reconstruction of ancestral RNA sequences under multiple structural constraints.

    Science.gov (United States)

    Tremblay-Savard, Olivier; Reinharz, Vladimir; Waldispühl, Jérôme

    2016-11-11

    Secondary structures form the scaffold of multiple sequence alignment of non-coding RNA (ncRNA) families. An accurate reconstruction of ancestral ncRNAs must use this structural signal. However, the inference of ancestors of a single ncRNA family with a single consensus structure may bias the results towards sequences with high affinity to this structure, which are far from the true ancestors. In this paper, we introduce achARNement, a maximum parsimony approach that, given two alignments of homologous ncRNA families with consensus secondary structures and a phylogenetic tree, simultaneously calculates ancestral RNA sequences for these two families. We test our methodology on simulated data sets, and show that achARNement outperforms classical maximum parsimony approaches in terms of accuracy, but also reduces by several orders of magnitude the number of candidate sequences. To conclude this study, we apply our algorithms on the Glm clan and the FinP-traJ clan from the Rfam database. Our results show that our methods reconstruct small sets of high-quality candidate ancestors with better agreement to the two target structures than with classical approaches. Our program is freely available at: http://csb.cs.mcgill.ca/acharnement .

  3. Strategies for measuring evolutionary conservation of RNA secondary structures

    Directory of Open Access Journals (Sweden)

    Hofacker Ivo L

    2008-02-01

    Full Text Available Abstract Background Evolutionary conservation of RNA secondary structure is a typical feature of many functional non-coding RNAs. Since almost all of the available methods used for prediction and annotation of non-coding RNA genes rely on this evolutionary signature, accurate measures for structural conservation are essential. Results We systematically assessed the ability of various measures to detect conserved RNA structures in multiple sequence alignments. We tested three existing and eight novel strategies that are based on metrics of folding energies, metrics of single optimal structure predictions, and metrics of structure ensembles. We find that the folding energy based SCI score used in the RNAz program and a simple base-pair distance metric are by far the most accurate. The use of more complex metrics like for example tree editing does not improve performance. A variant of the SCI performed particularly well on highly conserved alignments and is thus a viable alternative when only little evolutionary information is available. Surprisingly, ensemble based methods that, in principle, could benefit from the additional information contained in sub-optimal structures, perform particularly poorly. As a general trend, we observed that methods that include a consensus structure prediction outperformed equivalent methods that only consider pairwise comparisons. Conclusion Structural conservation can be measured accurately with relatively simple and intuitive metrics. They have the potential to form the basis of future RNA gene finders, that face new challenges like finding lineage specific structures or detecting mis-aligned sequences.

  4. tRNA-like structure regulates translation of Brome mosaic virus RNA.

    Science.gov (United States)

    Barends, Sharief; Rudinger-Thirion, Joëlle; Florentz, Catherine; Giegé, Richard; Pleij, Cornelis W A; Kraal, Barend

    2004-04-01

    For various groups of plant viruses, the genomic RNAs end with a tRNA-like structure (TLS) instead of the 3' poly(A) tail of common mRNAs. The actual function of these TLSs has long been enigmatic. Recently, however, it became clear that for turnip yellow mosaic virus, a tymovirus, the valylated TLS(TYMV) of the single genomic RNA functions as a bait for host ribosomes and directs them to the internal initiation site of translation (with N-terminal valine) of the second open reading frame for the polyprotein. This discovery prompted us to investigate whether the much larger TLSs of a different genus of viruses have a comparable function in translation. Brome mosaic virus (BMV), a bromovirus, has a tripartite RNA genome with a subgenomic RNA4 for coat protein expression. All four RNAs carry a highly conserved and bulky 3' TLS(BMV) (about 200 nucleotides) with determinants for tyrosylation. We discovered TLS(BMV)-catalyzed self-tyrosylation of the tyrosyl-tRNA synthetase but could not clearly detect tyrosine incorporation into any virus-encoded protein. We established that BMV proteins do not need TLS(BMV) tyrosylation for their initiation. However, disruption of the TLSs strongly reduced the translation of genomic RNA1, RNA2, and less strongly, RNA3, whereas coat protein expression from RNA4 remained unaffected. This aberrant translation could be partially restored by providing the TLS(BMV) in trans. Intriguingly, a subdomain of the TLS(BMV) could even almost fully restore translation to the original pattern. We discuss here a model with a central and dominant role for the TLS(BMV) during the BMV infection cycle.

  5. RNA structure and scalar coupling constants

    Energy Technology Data Exchange (ETDEWEB)

    Tinoco, I. Jr.; Cai, Z.; Hines, J.V.; Landry, S.M.; SantaLucia, J. Jr.; Shen, L.X.; Varani, G. [Univ. of California, Berkeley, CA (United States)

    1994-12-01

    Signs and magnitudes of scalar coupling constants-spin-spin splittings-comprise a very large amount of data that can be used to establish the conformations of RNA molecules. Proton-proton and proton-phosphorus splittings have been used the most, but the availability of {sup 13}C-and {sup 15}N-labeled molecules allow many more coupling constants to be used for determining conformation. We will systematically consider the torsion angles that characterize a nucleotide unit and the coupling constants that depend on the values of these torsion angles. Karplus-type equations have been established relating many three-bond coupling constants to torsion angles. However, one- and two-bond coupling constants can also depend on conformation. Serianni and coworkers measured carbon-proton coupling constants in ribonucleosides and have calculated their values as a function of conformation. The signs of two-bond coupling can be very useful because it is easier to measure a sign than an accurate magnitude.

  6. Structure of Drosophila Oskar reveals a novel RNA binding protein

    Science.gov (United States)

    Yang, Na; Yu, Zhenyu; Hu, Menglong; Wang, Mingzhu; Lehmann, Ruth; Xu, Rui-Ming

    2015-01-01

    Oskar (Osk) protein plays critical roles during Drosophila germ cell development, yet its functions in germ-line formation and body patterning remain poorly understood. This situation contrasts sharply with the vast knowledge about the function and mechanism of osk mRNA localization. Osk is predicted to have an N-terminal LOTUS domain (Osk-N), which has been suggested to bind RNA, and a C-terminal hydrolase-like domain (Osk-C) of unknown function. Here, we report the crystal structures of Osk-N and Osk-C. Osk-N shows a homodimer of winged-helix–fold modules, but without detectable RNA-binding activity. Osk-C has a lipase-fold structure but lacks critical catalytic residues at the putative active site. Surprisingly, we found that Osk-C binds the 3′UTRs of osk and nanos mRNA in vitro. Mutational studies identified a region of Osk-C important for mRNA binding. These results suggest possible functions of Osk in the regulation of stability, regulation of translation, and localization of relevant mRNAs through direct interaction with their 3′UTRs, and provide structural insights into a novel protein–RNA interaction motif involving a hydrolase-related domain. PMID:26324911

  7. RNA secondary structure diagrams for very large molecules: RNAfdl

    DEFF Research Database (Denmark)

    Hecker, Nikolai; Wiegels, Tim; Torda, Andrew E.

    2013-01-01

    There are many programs that can read the secondary structure of an RNA molecule and draw a diagram, but hardly any that can cope with 10 3 bases. RNAfdl is slow but capable of producing intersection-free diagrams for ribosome-sized structures, has a graphical user interface for adjustments...

  8. Nucleosome Positioning and NDR Structure at RNA Polymerase III Promoters

    DEFF Research Database (Denmark)

    Helbo, Alexandra Søgaard; Lay, Fides D; Jones, Peter A

    2017-01-01

    Chromatin is structurally involved in the transcriptional regulation of all genes. While the nucleosome positioning at RNA polymerase II (pol II) promoters has been extensively studied, less is known about the chromatin structure at pol III promoters in human cells. We use a high...

  9. The crystal structure and RNA-binding of an orthomyxovirus nucleoprotein.

    Directory of Open Access Journals (Sweden)

    Wenjie Zheng

    2013-09-01

    Full Text Available Genome packaging for viruses with segmented genomes is often a complex problem. This is particularly true for influenza viruses and other orthomyxoviruses, whose genome consists of multiple negative-sense RNAs encapsidated as ribonucleoprotein (RNP complexes. To better understand the structural features of orthomyxovirus RNPs that allow them to be packaged, we determined the crystal structure of the nucleoprotein (NP of a fish orthomyxovirus, the infectious salmon anemia virus (ISAV (genus Isavirus. As the major protein component of the RNPs, ISAV-NP possesses a bi-lobular structure similar to the influenza virus NP. Because both RNA-free and RNA-bound ISAV NP forms stable dimers in solution, we were able to measure the NP RNA binding affinity as well as the stoichiometry using recombinant proteins and synthetic oligos. Our RNA binding analysis revealed that each ISAV-NP binds ~12 nts of RNA, shorter than the 24-28 nts originally estimated for the influenza A virus NP based on population average. The 12-nt stoichiometry was further confirmed by results from electron microscopy and dynamic light scattering. Considering that RNPs of ISAV and the influenza viruses have similar morphologies and dimensions, our findings suggest that NP-free RNA may exist on orthomyxovirus RNPs, and selective RNP packaging may be accomplished through direct RNA-RNA interactions.

  10. Primary and secondary structure of U8 small nuclear RNA

    International Nuclear Information System (INIS)

    Reddy, R.; Henning, D.; Busch, H.

    1985-01-01

    U8 small nuclear RNA is a new, capped, 140 nucleotides long RNA species found in Novikoff hepatoma cells. Its sequence is: m3GpppAmUmCGUCAGGA GGUUAAUCCU UACCUGUCCC UCCUUUCGGA GGGCAGAUAG AAAAUGAUGA UUGGAGCUUG CAUGAUCUGC UGAUUAUAGC AUUUCCGUGU AAUCAGGACC UGACAACAUC CUGAUUGCUU CUAUCUGAUUOH. This RNA is present in approximately 25,000 copies/cell, and it is enriched in nucleolar preparations. Like U1, U2, U4/U6, and U5 RNAs, U8 RNA was also present as a ribonucleoprotein associated with the Sm antigen. The rat U8 RNA was highly homologous (greater than 90%) to a recently characterized 5.4 S RNA from mouse cells infected with spleen focus-forming virus. In addition to the U8 RNA, three other U small nuclear RNAs were found in anti-Sm antibody immunoprecipitates from labeled rat and HeLa cells. Each of these contained a m3GpppAm cap structure; their apparent chain lengths were 60, 130, and 65 nucleotides. These U small nuclear RNAs are designated U7, U9, and U10 RNAs, respectively

  11. Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer

    International Nuclear Information System (INIS)

    Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

    2010-01-01

    The RNA-binding protein Hfq from B. subtilis was crystallized using the hanging-drop vapour-diffusion method in two crystal forms that belonged to space groups I422 and F222; diffraction data were collected to 2.2 Å resolution from both forms. The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq–RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq–RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 Å, while the type 2 Hfq–RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 Å. Diffraction data were collected to a resolution of 2.20 Å from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis

  12. Use of tiling array data and RNA secondary structure predictions to identify noncoding RNA genes

    DEFF Research Database (Denmark)

    Weile, Christian; Gardner, Paul P; Hedegaard, Mads M

    2007-01-01

    neuroblastoma cell line SK-N-AS. Using this strategy, we identify thousands of human candidate RNA genes. To further verify the expression of these genes, we focused on candidate genes that had a stable hairpin structures or a high level of covariance. Using northern blotting, we verify the expression of 2 out...

  13. The ins and outs of lncRNA structure: How, why and what comes next?

    Science.gov (United States)

    Blythe, Amanda J; Fox, Archa H; Bond, Charles S

    2016-01-01

    The field of structural biology has the unique advantage of being able to provide a comprehensive picture of biological mechanisms at the molecular and atomic level. Long noncoding RNAs (lncRNAs) represent the new frontier in the molecular biology of complex organisms yet remain the least characterised of all the classes of RNA. Thousands of new lncRNAs are being reported each year yet very little structural data exists for this rapidly expanding field. The length of lncRNAs ranges from 200 nt to over 100 kb in length and they generally exhibit low cellular abundance. Therefore, obtaining sufficient quantities of lncRNA to use for structural analysis is challenging. However, as technologies develop structures of lncRNAs are starting to emerge providing important information regarding their mechanism of action. Here we review the current methods used to determine the structure of lncRNA and lncRNA:protein complexes and describe the significant contribution structural biology has and will make to the field of lncRNA research. This article is part of a Special Issue entitled: Clues to long noncoding RNA taxonomy1, edited by Dr. Tetsuro Hirose and Dr. Shinichi Nakagawa. Copyright © 2015 Elsevier B.V. All rights reserved.

  14. The modeled structure of the RNA dependent RNA polymerase of GBV-C Virus suggests a role for motif E in Flaviviridae RNA polymerases

    Directory of Open Access Journals (Sweden)

    Dutartre Hélène

    2005-10-01

    Full Text Available Abstract Background The Flaviviridae virus family includes major human and animal pathogens. The RNA dependent RNA polymerase (RdRp plays a central role in the replication process, and thus is a validated target for antiviral drugs. Despite the increasing structural and enzymatic characterization of viral RdRps, detailed molecular replication mechanisms remain unclear. The hepatitis C virus (HCV is a major human pathogen difficult to study in cultured cells. The bovine viral diarrhea virus (BVDV is often used as a surrogate model to screen antiviral drugs against HCV. The structure of BVDV RdRp has been recently published. It presents several differences relative to HCV RdRp. These differences raise questions about the relevance of BVDV as a surrogate model, and cast novel interest on the "GB" virus C (GBV-C. Indeed, GBV-C is genetically closer to HCV than BVDV, and can lead to productive infection of cultured cells. There is no structural data for the GBV-C RdRp yet. Results We show in this study that the GBV-C RdRp is closest to the HCV RdRp. We report a 3D model of the GBV-C RdRp, developed using sequence-to-structure threading and comparative modeling based on the atomic coordinates of the HCV RdRp structure. Analysis of the predicted structural features in the phylogenetic context of the RNA polymerase family allows rationalizing most of the experimental data available. Both available structures and our model are explored to examine the catalytic cleft, allosteric and substrate binding sites. Conclusion Computational methods were used to infer evolutionary relationships and to predict the structure of a viral RNA polymerase. Docking a GTP molecule into the structure allows defining a GTP binding pocket in the GBV-C RdRp, such as that of BVDV. The resulting model suggests a new proposition for the mechanism of RNA synthesis, and may prove useful to design new experiments to implement our knowledge on the initiation mechanism of RNA

  15. Analysis of RNA metabolism in fission yeast

    DEFF Research Database (Denmark)

    Wise, Jo Ann; Nielsen, Olaf

    2017-01-01

    Here we focus on the biogenesis and function of messenger RNA (mRNA) in fission yeast cells. Following a general introduction that also briefly touches on other classes of RNA, we provide an overview of methods used to analyze mRNAs throughout their life cycles....

  16. Network Properties of the Ensemble of RNA Structures

    Science.gov (United States)

    Clote, Peter; Bayegan, Amir

    2015-01-01

    We describe the first dynamic programming algorithm that computes the expected degree for the network, or graph G = (V, E) of all secondary structures of a given RNA sequence a = a 1, …, a n. Here, the nodes V correspond to all secondary structures of a, while an edge exists between nodes s, t if the secondary structure t can be obtained from s by adding, removing or shifting a base pair. Since secondary structure kinetics programs implement the Gillespie algorithm, which simulates a random walk on the network of secondary structures, the expected network degree may provide a better understanding of kinetics of RNA folding when allowing defect diffusion, helix zippering, and related conformation transformations. We determine the correlation between expected network degree, contact order, conformational entropy, and expected number of native contacts for a benchmarking dataset of RNAs. Source code is available at http://bioinformatics.bc.edu/clotelab/RNAexpNumNbors. PMID:26488894

  17. Analysis of extracellular RNA by digital PCR

    Directory of Open Access Journals (Sweden)

    Kenji eTakahashi

    2014-06-01

    Full Text Available The transfer of extracellular RNA is emerging as an important mechanism for intracellular communication. The ability for the transfer of functionally active RNA molecules from one cell to another within vesicles such as exosomes enables a cell to modulate cellular signaling and biological processes within recipient cells. The study of extracellular RNA requires sensitive methods for the detection of these molecules. In this methods article, we will describe protocols for the detection of such extracellular RNA using sensitive detection technologies such as digital PCR. These protocols should be valuable to researchers interested in the role and contribution of extracellular RNA to tumor cell biology.

  18. Landscape and variation of RNA secondary structure across the human transcriptome.

    OpenAIRE

    Wan, Y; Qu, K; Zhang, QC; Flynn, RA; Manor, O; Ouyang, Z; Zhang, J; Spitale, RC; Snyder, MP; Segal, E; Chang, HY

    2014-01-01

    In parallel to the genetic code for protein synthesis, a second layer of information is embedded in all RNA transcripts in the form of RNA structure. RNA structure influences practically every step in the gene expression program. However, the nature of most RNA structures or effects of sequence variation on structure are not known. Here we report the initial landscape and variation of RNA secondary structures (RSSs) in a human family trio (mother, father and their child). This provides a comp...

  19. A combinatorial enumeration problem of RNA secondary structures

    African Journals Online (AJOL)

    use

    2011-12-21

    Dec 21, 2011 ... connection between Discrete Mathematics and Compu- tational Molecular Biology (Chen et al, 2005; Hofacker et ... in Computational Molecular Biology. An RNA molecule is described by its sequences of bases ... Here, a mathematical definition of secondary structure is given (Stein and Waterman 1978).

  20. lncRNA Structure: Message to the Heart.

    Science.gov (United States)

    Fazal, Furqan M; Chang, Howard Y

    2016-10-06

    In this issue, Xue et al. (2016) describe the secondary structure of the heart-specific long non-coding RNA Braveheart, leading to the discovery of a short, asymmetric G-rich loop that controls cardiac lineage commitment by interacting with the transcription factor CNBP. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Expression, crystallization and preliminary crystallographic analysis of RNA-binding protein Hfq (YmaH) from Bacillus subtilis in complex with an RNA aptamer.

    Science.gov (United States)

    Baba, Seiki; Someya, Tatsuhiko; Kawai, Gota; Nakamura, Kouji; Kumasaka, Takashi

    2010-05-01

    The Hfq protein is a hexameric RNA-binding protein which regulates gene expression by binding to RNA under the influence of diverse environmental stresses. Its ring structure binds various types of RNA, including mRNA and sRNA. RNA-bound structures of Hfq from Escherichia coli and Staphylococcus aureus have been revealed to have poly(A) RNA at the distal site and U-rich RNA at the proximal site, respectively. Here, crystals of a complex of the Bacillus subtilis Hfq protein with an A/G-repeat 7-mer RNA (Hfq-RNA) that were prepared using the hanging-drop vapour-diffusion technique are reported. The type 1 Hfq-RNA crystals belonged to space group I422, with unit-cell parameters a = b = 123.70, c = 119.13 A, while the type 2 Hfq-RNA crystals belonged to space group F222, with unit-cell parameters a = 91.92, b = 92.50, c = 114.92 A. Diffraction data were collected to a resolution of 2.20 A from both crystal forms. The hexameric structure of the Hfq protein was clearly shown by self-rotation analysis.

  2. Accurate SHAPE-directed RNA secondary structure modeling, including pseudoknots.

    Science.gov (United States)

    Hajdin, Christine E; Bellaousov, Stanislav; Huggins, Wayne; Leonard, Christopher W; Mathews, David H; Weeks, Kevin M

    2013-04-02

    A pseudoknot forms in an RNA when nucleotides in a loop pair with a region outside the helices that close the loop. Pseudoknots occur relatively rarely in RNA but are highly overrepresented in functionally critical motifs in large catalytic RNAs, in riboswitches, and in regulatory elements of viruses. Pseudoknots are usually excluded from RNA structure prediction algorithms. When included, these pairings are difficult to model accurately, especially in large RNAs, because allowing this structure dramatically increases the number of possible incorrect folds and because it is difficult to search the fold space for an optimal structure. We have developed a concise secondary structure modeling approach that combines SHAPE (selective 2'-hydroxyl acylation analyzed by primer extension) experimental chemical probing information and a simple, but robust, energy model for the entropic cost of single pseudoknot formation. Structures are predicted with iterative refinement, using a dynamic programming algorithm. This melded experimental and thermodynamic energy function predicted the secondary structures and the pseudoknots for a set of 21 challenging RNAs of known structure ranging in size from 34 to 530 nt. On average, 93% of known base pairs were predicted, and all pseudoknots in well-folded RNAs were identified.

  3. Selective small-molecule inhibition of an RNA structural element

    Energy Technology Data Exchange (ETDEWEB)

    Howe, John A.; Wang, Hao; Fischmann, Thierry O.; Balibar, Carl J.; Xiao, Li; Galgoci, Andrew M.; Malinverni, Juliana C.; Mayhood, Todd; Villafania, Artjohn; Nahvi, Ali; Murgolo, Nicholas; Barbieri, Christopher M.; Mann, Paul A.; Carr, Donna; Xia, Ellen; Zuck, Paul; Riley, Dan; Painter, Ronald E.; Walker, Scott S.; Sherborne, Brad; de Jesus, Reynalda; Pan, Weidong; Plotkin, Michael A.; Wu, Jin; Rindgen, Diane; Cummings, John; Garlisi, Charles G.; Zhang, Rumin; Sheth, Payal R.; Gill, Charles J.; Tang, Haifeng; Roemer , Terry (Merck)

    2015-09-30

    Riboswitches are non-coding RNA structures located in messenger RNAs that bind endogenous ligands, such as a specific metabolite or ion, to regulate gene expression. As such, riboswitches serve as a novel, yet largely unexploited, class of emerging drug targets. Demonstrating this potential, however, has proven difficult and is restricted to structurally similar antimetabolites and semi-synthetic analogues of their cognate ligand, thus greatly restricting the chemical space and selectivity sought for such inhibitors. Here we report the discovery and characterization of ribocil, a highly selective chemical modulator of bacterial riboflavin riboswitches, which was identified in a phenotypic screen and acts as a structurally distinct synthetic mimic of the natural ligand, flavin mononucleotide, to repress riboswitch-mediated ribB gene expression and inhibit bacterial cell growth. Our findings indicate that non-coding RNA structural elements may be more broadly targeted by synthetic small molecules than previously expected.

  4. Secondary structure of 5S RNA: NMR experiments on RNA molecules partially labeled with Nitrogen-15

    International Nuclear Information System (INIS)

    Gewirth, D.T.; Abo, S.R.; Leontis, N.B.; Moore, P.B.

    1987-01-01

    A method has been found for reassembling fragment 1 of Escherichia coli 5S RNA from mixtures containing strand III (bases 69-87) and the complex consisting of strand II (bases 89-120) and strand IV (bases 1-11). The reassembled molecule is identical with unreconstituted fragment 1. With this technique, fragment 1 molecules have been constructed 15 N-labeled either in strand III or in the strand II-strand IV complex. Spectroscopic data obtained with these partially labeled molecules show that the terminal helix of 5S RNA includes the GU and GC base pairs at positions 9 and 10 which the standard model for 5S secondary structure predicts but that these base pairs are unstable both in the fragment and in native 5S RNA. The data also assign three resonances to the helix V region of the molecule (bases 70-77 and 99-106). None of these resonances has a normal chemical shift even though two of them correspond to AU or GU base pairs in the standard model. The implications of these findings for the authors understanding of the structure of 5S RNA and its complex with ribosomal protein L25 are discussed

  5. Secondary structural entropy in RNA switch (Riboswitch) identification.

    Science.gov (United States)

    Manzourolajdad, Amirhossein; Arnold, Jonathan

    2015-04-28

    RNA regulatory elements play a significant role in gene regulation. Riboswitches, a widespread group of regulatory RNAs, are vital components of many bacterial genomes. These regulatory elements generally function by forming a ligand-induced alternative fold that controls access to ribosome binding sites or other regulatory sites in RNA. Riboswitch-mediated mechanisms are ubiquitous across bacterial genomes. A typical class of riboswitch has its own unique structural and biological complexity, making de novo riboswitch identification a formidable task. Traditionally, riboswitches have been identified through comparative genomics based on sequence and structural homology. The limitations of structural-homology-based approaches, coupled with the assumption that there is a great diversity of undiscovered riboswitches, suggests the need for alternative methods for riboswitch identification, possibly based on features intrinsic to their structure. As of yet, no such reliable method has been proposed. We used structural entropy of riboswitch sequences as a measure of their secondary structural dynamics. Entropy values of a diverse set of riboswitches were compared to that of their mutants, their dinucleotide shuffles, and their reverse complement sequences under different stochastic context-free grammar folding models. Significance of our results was evaluated by comparison to other approaches, such as the base-pairing entropy and energy landscapes dynamics. Classifiers based on structural entropy optimized via sequence and structural features were devised as riboswitch identifiers and tested on Bacillus subtilis, Escherichia coli, and Synechococcus elongatus as an exploration of structural entropy based approaches. The unusually long untranslated region of the cotH in Bacillus subtilis, as well as upstream regions of certain genes, such as the sucC genes were associated with significant structural entropy values in genome-wide examinations. Various tests show that there

  6. Quantitative DMS mapping for automated RNA secondary structure inference

    OpenAIRE

    Cordero, Pablo; Kladwang, Wipapat; VanLang, Christopher C.; Das, Rhiju

    2012-01-01

    For decades, dimethyl sulfate (DMS) mapping has informed manual modeling of RNA structure in vitro and in vivo. Here, we incorporate DMS data into automated secondary structure inference using a pseudo-energy framework developed for 2'-OH acylation (SHAPE) mapping. On six non-coding RNAs with crystallographic models, DMS- guided modeling achieves overall false negative and false discovery rates of 9.5% and 11.6%, comparable or better than SHAPE-guided modeling; and non-parametric bootstrappin...

  7. Integrating chemical footprinting data into RNA secondary structure prediction.

    Directory of Open Access Journals (Sweden)

    Kourosh Zarringhalam

    Full Text Available Chemical and enzymatic footprinting experiments, such as shape (selective 2'-hydroxyl acylation analyzed by primer extension, yield important information about RNA secondary structure. Indeed, since the [Formula: see text]-hydroxyl is reactive at flexible (loop regions, but unreactive at base-paired regions, shape yields quantitative data about which RNA nucleotides are base-paired. Recently, low error rates in secondary structure prediction have been reported for three RNAs of moderate size, by including base stacking pseudo-energy terms derived from shape data into the computation of minimum free energy secondary structure. Here, we describe a novel method, RNAsc (RNA soft constraints, which includes pseudo-energy terms for each nucleotide position, rather than only for base stacking positions. We prove that RNAsc is self-consistent, in the sense that the nucleotide-specific probabilities of being unpaired in the low energy Boltzmann ensemble always become more closely correlated with the input shape data after application of RNAsc. From this mathematical perspective, the secondary structure predicted by RNAsc should be 'correct', in as much as the shape data is 'correct'. We benchmark RNAsc against the previously mentioned method for eight RNAs, for which both shape data and native structures are known, to find the same accuracy in 7 out of 8 cases, and an improvement of 25% in one case. Furthermore, we present what appears to be the first direct comparison of shape data and in-line probing data, by comparing yeast asp-tRNA shape data from the literature with data from in-line probing experiments we have recently performed. With respect to several criteria, we find that shape data appear to be more robust than in-line probing data, at least in the case of asp-tRNA.

  8. DMS-MaPseq for genome-wide or targeted RNA structure probing in vivo.

    Science.gov (United States)

    Zubradt, Meghan; Gupta, Paromita; Persad, Sitara; Lambowitz, Alan M; Weissman, Jonathan S; Rouskin, Silvi

    2017-01-01

    Coupling of structure-specific in vivo chemical modification to next-generation sequencing is transforming RNA secondary structure studies in living cells. The dominant strategy for detecting in vivo chemical modifications uses reverse transcriptase truncation products, which introduce biases and necessitate population-average assessments of RNA structure. Here we present dimethyl sulfate (DMS) mutational profiling with sequencing (DMS-MaPseq), which encodes DMS modifications as mismatches using a thermostable group II intron reverse transcriptase. DMS-MaPseq yields a high signal-to-noise ratio, can report multiple structural features per molecule, and allows both genome-wide studies and focused in vivo investigations of even low-abundance RNAs. We apply DMS-MaPseq for the first analysis of RNA structure within an animal tissue and to identify a functional structure involved in noncanonical translation initiation. Additionally, we use DMS-MaPseq to compare the in vivo structure of pre-mRNAs with their mature isoforms. These applications illustrate DMS-MaPseq's capacity to dramatically expand in vivo analysis of RNA structure.

  9. Comprehensive analysis of RNA-Seq data reveals extensive RNA editing in a human transcriptome

    DEFF Research Database (Denmark)

    Peng, Zhiyu; Cheng, Yanbing; Tan, Bertrand Chin-Ming

    2012-01-01

    a computational pipeline that carefully controls for false positives while calling RNA editing events from genome and whole-transcriptome data of the same individual. We identified 22,688 RNA editing events in noncoding genes and introns, untranslated regions and coding sequences of protein-coding genes. Most......RNA editing is a post-transcriptional event that recodes hereditary information. Here we describe a comprehensive profile of the RNA editome of a male Han Chinese individual based on analysis of ∼767 million sequencing reads from poly(A)(+), poly(A)(-) and small RNA samples. We developed...... changes (∼93%) converted A to I(G), consistent with known editing mechanisms based on adenosine deaminase acting on RNA (ADAR). We also found evidence of other types of nucleotide changes; however, these were validated at lower rates. We found 44 editing sites in microRNAs (miRNAs), suggesting a potential...

  10. Translation of the flavivirus kunjin NS3 gene in cis but not its RNA sequence or secondary structure is essential for efficient RNA packaging.

    Science.gov (United States)

    Pijlman, Gorben P; Kondratieva, Natasha; Khromykh, Alexander A

    2006-11-01

    Our previous studies using trans-complementation analysis of Kunjin virus (KUN) full-length cDNA clones harboring in-frame deletions in the NS3 gene demonstrated the inability of these defective complemented RNAs to be packaged into virus particles (W. J. Liu, P. L. Sedlak, N. Kondratieva, and A. A. Khromykh, J. Virol. 76:10766-10775). In this study we aimed to establish whether this requirement for NS3 in RNA packaging is determined by the secondary RNA structure of the NS3 gene or by the essential role of the translated NS3 gene product. Multiple silent mutations of three computer-predicted stable RNA structures in the NS3 coding region of KUN replicon RNA aimed at disrupting RNA secondary structure without affecting amino acid sequence did not affect RNA replication and packaging into virus-like particles in the packaging cell line, thus demonstrating that the predicted conserved RNA structures in the NS3 gene do not play a role in RNA replication and/or packaging. In contrast, double frameshift mutations in the NS3 coding region of full-length KUN RNA, producing scrambled NS3 protein but retaining secondary RNA structure, resulted in the loss of ability of these defective RNAs to be packaged into virus particles in complementation experiments in KUN replicon-expressing cells. Furthermore, the more robust complementation-packaging system based on established stable cell lines producing large amounts of complemented replicating NS3-deficient replicon RNAs and infection with KUN virus to provide structural proteins also failed to detect any secreted virus-like particles containing packaged NS3-deficient replicon RNAs. These results have now firmly established the requirement of KUN NS3 protein translated in cis for genome packaging into virus particles.

  11. Crowdsourcing RNA structural alignments with an online computer game.

    Science.gov (United States)

    Waldispühl, Jérôme; Kam, Arthur; Gardner, Paul P

    2015-01-01

    The annotation and classification of ncRNAs is essential to decipher molecular mechanisms of gene regulation in normal and disease states. A database such as Rfam maintains alignments, consensus secondary structures, and corresponding annotations for RNA families. Its primary purpose is the automated, accurate annotation of non-coding RNAs in genomic sequences. However, the alignment of RNAs is computationally challenging, and the data stored in this database are often subject to improvements. Here, we design and evaluate Ribo, a human-computing game that aims to improve the accuracy of RNA alignments already stored in Rfam. We demonstrate the potential of our techniques and discuss the feasibility of large scale collaborative annotation and classification of RNA families.

  12. Sparse RNA folding revisited: space-efficient minimum free energy structure prediction.

    Science.gov (United States)

    Will, Sebastian; Jabbari, Hosna

    2016-01-01

    RNA secondary structure prediction by energy minimization is the central computational tool for the analysis of structural non-coding RNAs and their interactions. Sparsification has been successfully applied to improve the time efficiency of various structure prediction algorithms while guaranteeing the same result; however, for many such folding problems, space efficiency is of even greater concern, particularly for long RNA sequences. So far, space-efficient sparsified RNA folding with fold reconstruction was solved only for simple base-pair-based pseudo-energy models. Here, we revisit the problem of space-efficient free energy minimization. Whereas the space-efficient minimization of the free energy has been sketched before, the reconstruction of the optimum structure has not even been discussed. We show that this reconstruction is not possible in trivial extension of the method for simple energy models. Then, we present the time- and space-efficient sparsified free energy minimization algorithm SparseMFEFold that guarantees MFE structure prediction. In particular, this novel algorithm provides efficient fold reconstruction based on dynamically garbage-collected trace arrows. The complexity of our algorithm depends on two parameters, the number of candidates Z and the number of trace arrows T; both are bounded by [Formula: see text], but are typically much smaller. The time complexity of RNA folding is reduced from [Formula: see text] to [Formula: see text]; the space complexity, from [Formula: see text] to [Formula: see text]. Our empirical results show more than 80 % space savings over RNAfold [Vienna RNA package] on the long RNAs from the RNA STRAND database (≥2500 bases). The presented technique is intentionally generalizable to complex prediction algorithms; due to their high space demands, algorithms like pseudoknot prediction and RNA-RNA-interaction prediction are expected to profit even stronger than "standard" MFE folding. SparseMFEFold is free

  13. The 5S rRNA loop E: chemical probing and phylogenetic data versus crystal structure.

    Science.gov (United States)

    Leontis, N B; Westhof, E

    1998-09-01

    A significant fraction of the bases in a folded, structured RNA molecule participate in noncanonical base pairing interactions, often in the context of internal loops or multi-helix junction loops. The appearance of each new high-resolution RNA structure provides welcome data to guide efforts to understand and predict RNA 3D structure, especially when the RNA in question is a functionally conserved molecule. The recent publication of the crystal structure of the "Loop E" region of bacterial 5S ribosomal RNA is such an event [Correll CC, Freeborn B, Moore PB, Steitz TA, 1997, Cell 91:705-712]. In addition to providing more examples of already established noncanonical base pairs, such as purine-purine sheared pairings, trans-Hoogsteen UA, and GU wobble pairs, the structure provides the first high-resolution views of two new purine-purine pairings and a new GU pairing. The goal of the present analysis is to expand the capabilities of both chemical probing and phylogenetic analysis to predict with greater accuracy the structures of RNA molecules. First, in light of existing chemical probing data, we investigate what lessons could be learned regarding the interpretation of this widely used method of RNA structure probing. Then we analyze the 3D structure with reference to molecular phylogeny data (assuming conservation of function) to discover what alternative base pairings are geometrically compatible with the structure. The comparisons between previous modeling efforts and crystal structures show that the intricate involvements of ions and water molecules in the maintenance of non-Watson-Crick pairs render the process of correctly identifying the interacting sites in such pairs treacherous, except in cases of trans-Hoogsteen A/U or sheared A/G pairs for the adenine N1 site. The phylogenetic analysis identifies A/A, A/C, A/U and C/A, C/C, and C/U pairings isosteric with sheared A/G, as well as A/A and A/C pairings isosteric with both G/U and G/G bifurcated pairings

  14. Crystal structures of two eukaryotic nucleases involved in RNA metabolism

    DEFF Research Database (Denmark)

    Jonstrup, Anette Thyssen; Midtgaard, Søren Fuglsang; Van, Lan Bich

    RNA serves a number of functions in the cell: mRNAs are the carriers of information between gene and protein, tRNAs and rRNAs are involved in the synthesis of proteins, whereas a number of additional RNA species are responsible for other functions in the cell. The quality of the different RNAs...... RNAs. We have solved the structures of two nucleases involved in 3'-5' degradation of RNA; the S. pombe Pop2p and the S. cerevisiae Rrp6p. Pop2p is part of the main cytoplasmatic deadenylation complex in yeast, which also contains the nuclease Ccr4p. Deadenylation, where the poly(A)-tail is removed...... specific transcripts. Here, we present the crystal structure of the S. pombe Pop2p protein to 1.4 Å resolution. The high resolution structure provides a clear picture of the active site architecture. Structural alignment of single nucleotides and poly(A)-oligonucleotides from earlier co-crystal structures...

  15. A Structural Overview of RNA-Dependent RNA Polymerases from the Flaviviridae Family

    Directory of Open Access Journals (Sweden)

    Jiqin Wu

    2015-06-01

    Full Text Available RNA-dependent RNA polymerases (RdRPs from the Flaviviridae family are representatives of viral polymerases that carry out RNA synthesis through a de novo initiation mechanism. They share a ≈ 600-residue polymerase core that displays a canonical viral RdRP architecture resembling an encircled right hand with palm, fingers, and thumb domains surrounding the active site. Polymerase catalytic motifs A–E in the palm and motifs F/G in the fingers are shared by all viral RdRPs with sequence and/or structural conservations regardless of the mechanism of initiation. Different from RdRPs carrying out primer-dependent initiation, Flaviviridae and other de novo RdRPs utilize a priming element often integrated in the thumb domain to facilitate primer-independent initiation. Upon the transition to the elongation phase, this priming element needs to undergo currently unresolved conformational rearrangements to accommodate the growth of the template-product RNA duplex. In the genera of Flavivirus and Pestivirus, the polymerase module in the C-terminal part of the RdRP protein may be regulated in cis by the N-terminal region of the same polypeptide. Either being a methyltransferase in Flavivirus or a functionally unclarified module in Pestivirus, this region could play auxiliary roles for the canonical folding and/or the catalysis of the polymerase, through defined intra-molecular interactions.

  16. Sequence analysis of L RNA of Lassa virus

    International Nuclear Information System (INIS)

    Vieth, Simon; Torda, Andrew E.; Asper, Marcel; Schmitz, Herbert; Guenther, Stephan

    2004-01-01

    The L RNA of three Lassa virus strains originating from Nigeria, Ghana/Ivory Coast, and Sierra Leone was sequenced and the data subjected to structure predictions and phylogenetic analyses. The L gene products had 2218-2221 residues, diverged by 18% at the amino acid level, and contained several conserved regions. Only one region of 504 residues (positions 1043-1546) could be assigned a function, namely that of an RNA polymerase. Secondary structure predictions suggest that this domain is very similar to RNA-dependent RNA polymerases of known structure encoded by plus-strand RNA viruses, permitting a model to be built. Outside the polymerase region, there is little structural data, except for regions of strong alpha-helical content and probably a coiled-coil domain at the N terminus. No evidence for reassortment or recombination during Lassa virus evolution was found. The secondary structure-assisted alignment of the RNA polymerase region permitted a reliable reconstruction of the phylogeny of all negative-strand RNA viruses, indicating that Arenaviridae are most closely related to Nairoviruses. In conclusion, the data provide a basis for structural and functional characterization of the Lassa virus L protein and reveal new insights into the phylogeny of negative-strand RNA viruses

  17. The ribosome structure controls and directs mRNA entry, translocation and exit dynamics

    International Nuclear Information System (INIS)

    Kurkcuoglu, Ozge; Doruker, Pemra; Jernigan, Robert L; Sen, Taner Z; Kloczkowski, Andrzej

    2008-01-01

    The protein-synthesizing ribosome undergoes large motions to effect the translocation of tRNAs and mRNA; here, the domain motions of this system are explored with a coarse-grained elastic network model using normal mode analysis. Crystal structures are used to construct various model systems of the 70S complex with/without tRNA, elongation factor Tu and the ribosomal proteins. Computed motions reveal the well-known ratchet-like rotational motion of the large subunits, as well as the head rotation of the small subunit and the high flexibility of the L1 and L7/L12 stalks, even in the absence of ribosomal proteins. This result indicates that these experimentally observed motions during translocation are inherently controlled by the ribosomal shape and only partially dependent upon GTP hydrolysis. Normal mode analysis further reveals the mobility of A- and P-tRNAs to increase in the absence of the E-tRNA. In addition, the dynamics of the E-tRNA is affected by the absence of the ribosomal protein L1. The mRNA in the entrance tunnel interacts directly with helicase proteins S3 and S4, which constrain the mRNA in a clamp-like fashion, as well as with protein S5, which likely orients the mRNA to ensure correct translation. The ribosomal proteins S7, S11 and S18 may also be involved in assuring translation fidelity by constraining the mRNA at the exit site of the channel. The mRNA also interacts with the 16S 3' end forming the Shine–Dalgarno complex at the initiation step; the 3' end may act as a 'hook' to reel in the mRNA to facilitate its exit

  18. A phase transition in energy-filtered RNA secondary structures

    DEFF Research Database (Denmark)

    Han, Hillary Siwei; reidys, Christian

    2012-01-01

    In this paper we study the effect of energy parameters on minimum free energy (mfe) RNA secondary structures. Employing a simplified combinatorial energy model, that is only dependent on the diagram representation and that is not sequence specific, we prove the following dichotomy result. Mfe...... this phase transition from a discrete limit to a central limit distribution and subsequently put our result into the context of quantifying the effect of sparsification of the folding of these respective mfe-structures. We show that the sparsification of realistic mfe-structures leads to a constant time...

  19. Bioinformatical approaches to RNA structure prediction & Sequencing of an ancient human genome

    DEFF Research Database (Denmark)

    Lindgreen, Stinus

    Stinus Lindgreen has been working in two different fields during his Ph.D. The first part has been focused on computational approaches to predict the structure of non-coding RNA molecules at the base pairing level. This has resulted in the analysis of various measures of the base pairing potentia...

  20. Mutational analysis of Sep-tRNA:Cys-tRNA synthase reveals critical residues for tRNA-dependent cysteine formation.

    Science.gov (United States)

    Helgadóttir, Sunna; Sinapah, Sylvie; Söll, Dieter; Ling, Jiqiang

    2012-01-02

    In methanogenic archaea, Sep-tRNA:Cys-tRNA synthase (SepCysS) converts Sep-tRNA(Cys) to Cys-tRNA(Cys). The mechanism of tRNA-dependent cysteine formation remains unclear due to the lack of functional studies. In this work, we mutated 19 conserved residues in Methanocaldococcus jannaschii SepCysS, and employed an in vivo system to determine the activity of the resulting variants. Our results show that three active-site cysteines (Cys39, Cys42 and Cys247) are essential for SepCysS activity. In addition, combined with structural modeling, our mutational and functional analyses also reveal multiple residues that are important for the binding of PLP, Sep and tRNA. Our work thus represents the first systematic functional analysis of conserved residues in archaeal SepCysSs, providing insights into the catalytic and substrate binding mechanisms of this poorly characterized enzyme. Copyright © 2011 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  1. Quantitation of base substitutions in eukaryotic 5S rRNA: selection for the maintenance of RNA secondary structure.

    Science.gov (United States)

    Curtiss, W C; Vournakis, J N

    1984-01-01

    Eukaryotic 5S rRNA sequences from 34 diverse species were compared by the following method: (1) The sequences were aligned; (2) the positions of substitutions were located by comparison of all possible pairs of sequences; (3) the substitution sites were mapped to an assumed general base pairing model; and (4) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. An analysis of the sequence and structure variability in each region of the molecule is presented. It was found that the degree of base substitution varies over a wide range, from absolute conservation to occurrence of over 90% of the possible observable substitutions. The substitutions are located primarily in stem regions of the 5S rRNA secondary structure. More than 88% of the substitutions in helical regions maintain base pairing. The disruptive substitutions are primarily located at the edges of helical regions, resulting in shortening of the helical regions and lengthening of the adjacent nonpaired regions. Base stacking patterns determined by the R-Y model are mapped onto the general secondary structure. Intrastrand and interstrand stacking could stabilize alternative coaxial structures and limit the conformational flexibility of nonpaired regions. Two short contiguous regions are 100% conserved in all species. This may reflect evolutionary constraints imposed at the DNA level by the requirement for binding of a 5S gene transcription initiation factor during gene expression.

  2. Random versus Deterministic Descent in RNA Energy Landscape Analysis

    Directory of Open Access Journals (Sweden)

    Luke Day

    2016-01-01

    Full Text Available Identifying sets of metastable conformations is a major research topic in RNA energy landscape analysis, and recently several methods have been proposed for finding local minima in landscapes spawned by RNA secondary structures. An important and time-critical component of such methods is steepest, or gradient, descent in attraction basins of local minima. We analyse the speed-up achievable by randomised descent in attraction basins in the context of large sample sets where the size has an order of magnitude in the region of ~106. While the gain for each individual sample might be marginal, the overall run-time improvement can be significant. Moreover, for the two nongradient methods we analysed for partial energy landscapes induced by ten different RNA sequences, we obtained that the number of observed local minima is on average larger by 7.3% and 3.5%, respectively. The run-time improvement is approximately 16.6% and 6.8% on average over the ten partial energy landscapes. For the large sample size we selected for descent procedures, the coverage of local minima is very high up to energy values of the region where the samples were randomly selected from the partial energy landscapes; that is, the difference to the total set of local minima is mainly due to the upper area of the energy landscapes.

  3. Evolution of Tertiary Structure of Viral RNA Dependent Polymerases

    Czech Academy of Sciences Publication Activity Database

    Černý, Jiří; Černá, B.; Valdés, James J.; Grubhoffer, Libor; Růžek, Daniel

    2014-01-01

    Roč. 9, č. 5 (2014), e96070 E-ISSN 1932-6203 R&D Projects: GA ČR GAP502/11/2116; GA ČR GAP302/12/2490; GA MŠk(CZ) EE2.3.30.0032 Institutional support: RVO:60077344 Keywords : Q-BETA replicase * C virus RNA * crystal structure Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.234, year: 2014

  4. Assembly of proteins and 5 S rRNA to transcripts of the major structural domains of 23 S rRNA

    DEFF Research Database (Denmark)

    Ostergaard, P; Phan, H; Johansen, L B

    1998-01-01

    The six major structural domains of 23 S rRNA from Escherichia coli, and all combinations thereof, were synthesized as separate T7 transcripts and reconstituted with total 50 S subunit proteins. Analysis by one and two-dimensional gel electrophoresis demonstrated the presence of at least one prim...... approach was used to map the putative binding regions on domain V of protein L9 and the 5 S RNA-L5-L18 complex....

  5. A probabilistic model for the evolution of RNA structure

    Directory of Open Access Journals (Sweden)

    Holmes Ian

    2004-10-01

    Full Text Available Abstract Background For the purposes of finding and aligning noncoding RNA gene- and cis-regulatory elements in multiple-genome datasets, it is useful to be able to derive multi-sequence stochastic grammars (and hence multiple alignment algorithms systematically, starting from hypotheses about the various kinds of random mutation event and their rates. Results Here, we consider a highly simplified evolutionary model for RNA, called "The TKF91 Structure Tree" (following Thorne, Kishino and Felsenstein's 1991 model of sequence evolution with indels, which we have implemented for pairwise alignment as proof of principle for such an approach. The model, its strengths and its weaknesses are discussed with reference to four examples of functional ncRNA sequences: a riboswitch (guanine, a zipcode (nanos, a splicing factor (U4 and a ribozyme (RNase P. As shown by our visualisations of posterior probability matrices, the selected examples illustrate three different signatures of natural selection that are highly characteristic of ncRNA: (i co-ordinated basepair substitutions, (ii co-ordinated basepair indels and (iii whole-stem indels. Conclusions Although all three types of mutation "event" are built into our model, events of type (i and (ii are found to be better modeled than events of type (iii. Nevertheless, we hypothesise from the model's performance on pairwise alignments that it would form an adequate basis for a prototype multiple alignment and genefinding tool.

  6. Self containment, a property of modular RNA structures, distinguishes microRNAs.

    Directory of Open Access Journals (Sweden)

    Miler T Lee

    2008-08-01

    Full Text Available RNA molecules will tend to adopt a folded conformation through the pairing of bases on a single strand; the resulting so-called secondary structure is critical to the function of many types of RNA. The secondary structure of a particular substring of functional RNA may depend on its surrounding sequence. Yet, some RNAs such as microRNAs retain their specific structures during biogenesis, which involves extraction of the substructure from a larger structural context, while other functional RNAs may be composed of a fusion of independent substructures. Such observations raise the question of whether particular functional RNA substructures may be selected for invariance of secondary structure to their surrounding nucleotide context. We define the property of self containment to be the tendency for an RNA sequence to robustly adopt the same optimal secondary structure regardless of whether it exists in isolation or is a substring of a longer sequence of arbitrary nucleotide content. We measured degree of self containment using a scoring method we call the self-containment index and found that miRNA stem loops exhibit high self containment, consistent with the requirement for structural invariance imposed by the miRNA biogenesis pathway, while most other structured RNAs do not. Further analysis revealed a trend toward higher self containment among clustered and conserved miRNAs, suggesting that high self containment may be a characteristic of novel miRNAs acquiring new genomic contexts. We found that miRNAs display significantly enhanced self containment compared to other functional RNAs, but we also found a trend toward natural selection for self containment in most functional RNA classes. We suggest that self containment arises out of selection for robustness against perturbations, invariance during biogenesis, and modular composition of structural function. Analysis of self containment will be important for both annotation and design of functional

  7. An empirical strategy to detect bacterial transcript structure from directional RNA-seq transcriptome data.

    Science.gov (United States)

    Wang, Yejun; MacKenzie, Keith D; White, Aaron P

    2015-05-07

    As sequencing costs are being lowered continuously, RNA-seq has gradually been adopted as the first choice for comparative transcriptome studies with bacteria. Unlike microarrays, RNA-seq can directly detect cDNA derived from mRNA transcripts at a single nucleotide resolution. Not only does this allow researchers to determine the absolute expression level of genes, but it also conveys information about transcript structure. Few automatic software tools have yet been established to investigate large-scale RNA-seq data for bacterial transcript structure analysis. In this study, 54 directional RNA-seq libraries from Salmonella serovar Typhimurium (S. Typhimurium) 14028s were examined for potential relationships between read mapping patterns and transcript structure. We developed an empirical method, combined with statistical tests, to automatically detect key transcript features, including transcriptional start sites (TSSs), transcriptional termination sites (TTSs) and operon organization. Using our method, we obtained 2,764 TSSs and 1,467 TTSs for 1331 and 844 different genes, respectively. Identification of TSSs facilitated further discrimination of 215 putative sigma 38 regulons and 863 potential sigma 70 regulons. Combining the TSSs and TTSs with intergenic distance and co-expression information, we comprehensively annotated the operon organization in S. Typhimurium 14028s. Our results show that directional RNA-seq can be used to detect transcriptional borders at an acceptable resolution of ±10-20 nucleotides. Technical limitations of the RNA-seq procedure may prevent single nucleotide resolution. The automatic transcript border detection methods, statistical models and operon organization pipeline that we have described could be widely applied to RNA-seq studies in other bacteria. Furthermore, the TSSs, TTSs, operons, promoters and unstranslated regions that we have defined for S. Typhimurium 14028s may constitute valuable resources that can be used for

  8. Multiple Regression Analysis of mRNA-miRNA Associations in Colorectal Cancer Pathway

    Science.gov (United States)

    Wang, Fengfeng; Wong, S. C. Cesar; Chan, Lawrence W. C.; Cho, William C. S.; Yip, S. P.; Yung, Benjamin Y. M.

    2014-01-01

    Background. MicroRNA (miRNA) is a short and endogenous RNA molecule that regulates posttranscriptional gene expression. It is an important factor for tumorigenesis of colorectal cancer (CRC), and a potential biomarker for diagnosis, prognosis, and therapy of CRC. Our objective is to identify the related miRNAs and their associations with genes frequently involved in CRC microsatellite instability (MSI) and chromosomal instability (CIN) signaling pathways. Results. A regression model was adopted to identify the significantly associated miRNAs targeting a set of candidate genes frequently involved in colorectal cancer MSI and CIN pathways. Multiple linear regression analysis was used to construct the model and find the significant mRNA-miRNA associations. We identified three significantly associated mRNA-miRNA pairs: BCL2 was positively associated with miR-16 and SMAD4 was positively associated with miR-567 in the CRC tissue, while MSH6 was positively associated with miR-142-5p in the normal tissue. As for the whole model, BCL2 and SMAD4 models were not significant, and MSH6 model was significant. The significant associations were different in the normal and the CRC tissues. Conclusion. Our results have laid down a solid foundation in exploration of novel CRC mechanisms, and identification of miRNA roles as oncomirs or tumor suppressor mirs in CRC. PMID:24895601

  9. SimRNA: a coarse-grained method for RNA folding simulations and 3D structure prediction.

    Science.gov (United States)

    Boniecki, Michal J; Lach, Grzegorz; Dawson, Wayne K; Tomala, Konrad; Lukasz, Pawel; Soltysinski, Tomasz; Rother, Kristian M; Bujnicki, Janusz M

    2016-04-20

    RNA molecules play fundamental roles in cellular processes. Their function and interactions with other biomolecules are dependent on the ability to form complex three-dimensional (3D) structures. However, experimental determination of RNA 3D structures is laborious and challenging, and therefore, the majority of known RNAs remain structurally uncharacterized. Here, we present SimRNA: a new method for computational RNA 3D structure prediction, which uses a coarse-grained representation, relies on the Monte Carlo method for sampling the conformational space, and employs a statistical potential to approximate the energy and identify conformations that correspond to biologically relevant structures. SimRNA can fold RNA molecules using only sequence information, and, on established test sequences, it recapitulates secondary structure with high accuracy, including correct prediction of pseudoknots. For modeling of complex 3D structures, it can use additional restraints, derived from experimental or computational analyses, including information about secondary structure and/or long-range contacts. SimRNA also can be used to analyze conformational landscapes and identify potential alternative structures. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Evaluation of the suitability of free-energy minimization using nearest-neighbor energy parameters for RNA secondary structure prediction

    Directory of Open Access Journals (Sweden)

    Cobaugh Christian W

    2004-08-01

    Full Text Available Abstract Background A detailed understanding of an RNA's correct secondary and tertiary structure is crucial to understanding its function and mechanism in the cell. Free energy minimization with energy parameters based on the nearest-neighbor model and comparative analysis are the primary methods for predicting an RNA's secondary structure from its sequence. Version 3.1 of Mfold has been available since 1999. This version contains an expanded sequence dependence of energy parameters and the ability to incorporate coaxial stacking into free energy calculations. We test Mfold 3.1 by performing the largest and most phylogenetically diverse comparison of rRNA and tRNA structures predicted by comparative analysis and Mfold, and we use the results of our tests on 16S and 23S rRNA sequences to assess the improvement between Mfold 2.3 and Mfold 3.1. Results The average prediction accuracy for a 16S or 23S rRNA sequence with Mfold 3.1 is 41%, while the prediction accuracies for the majority of 16S and 23S rRNA structures tested are between 20% and 60%, with some having less than 20% prediction accuracy. The average prediction accuracy was 71% for 5S rRNA and 69% for tRNA. The majority of the 5S rRNA and tRNA sequences have prediction accuracies greater than 60%. The prediction accuracy of 16S rRNA base-pairs decreases exponentially as the number of nucleotides intervening between the 5' and 3' halves of the base-pair increases. Conclusion Our analysis indicates that the current set of nearest-neighbor energy parameters in conjunction with the Mfold folding algorithm are unable to consistently and reliably predict an RNA's correct secondary structure. For 16S or 23S rRNA structure prediction, Mfold 3.1 offers little improvement over Mfold 2.3. However, the nearest-neighbor energy parameters do work well for shorter RNA sequences such as tRNA or 5S rRNA, or for larger rRNAs when the contact distance between the base-pairs is less than 100 nucleotides.

  11. Fine-tuning structural RNA alignments in the twilight zone

    Directory of Open Access Journals (Sweden)

    Schirmer Stefanie

    2010-04-01

    Full Text Available Abstract Background A widely used method to find conserved secondary structure in RNA is to first construct a multiple sequence alignment, and then fold the alignment, optimizing a score based on thermodynamics and covariance. This method works best around 75% sequence similarity. However, in a "twilight zone" below 55% similarity, the sequence alignment tends to obscure the covariance signal used in the second phase. Therefore, while the overall shape of the consensus structure may still be found, the degree of conservation cannot be estimated reliably. Results Based on a combination of available methods, we present a method named planACstar for improving structure conservation in structural alignments in the twilight zone. After constructing a consensus structure by alignment folding, planACstar abandons the original sequence alignment, refolds the sequences individually, but consistent with the consensus, aligns the structures, irrespective of sequence, by a pure structure alignment method, and derives an improved sequence alignment from the alignment of structures, to be re-submitted to alignment folding, etc.. This circle may be iterated as long as structural conservation improves, but normally, one step suffices. Conclusions Employing the tools ClustalW, RNAalifold, and RNAforester, we find that for sequences with 30-55% sequence identity, structural conservation can be improved by 10% on average, with a large variation, measured in terms of RNAalifold's own criterion, the structure conservation index.

  12. Labeling of eukaryotic messenger RNA 5' terminus with phosphorus -32: use of tobacco acid pyrophosphatase for removal of cap structures

    International Nuclear Information System (INIS)

    Lockard, R.E.; Rieser, L.; Vournakis, J.N.

    1981-01-01

    In recent years, there has been a growing appreciation of the potential applications of 5'- 32 P-end-labeled mRNA, not only for screening recombinant clones and mapping gene structure, but also for revealing possible nucleotide sequence and structural signals within mRNA molecules themselves, which may be important for eukaryotic mRNA processing and turnover and for controlling differential rates of translational initiation. Three major problems, however, have retarded progress in this area, lack of methods for efficient and reproducible removal of m7G5ppp5'-cap structures, which maintain the integrity of an RNA molecule; inability to generate a sufficient amount of labeled mRNA, owing to the limited availability of most pure mRNA species; and the frequent problem of RNA degradation during in vitro end-labeling owing to RNAse contamination. The procedures presented here permit one to decap and label minute quantities of mRNA, effectively. Tobacco acid pyrophosphatase is relatively efficient in removing cap structures from even nanogram quantities of available mRNA, and enough radioactivity can be easily generated from minute amounts ofintact mRNA with very high-specific-activity [gamma- 32 P]ATP and the inhibition of ribonuclease contamination with diethylpyrocarbonate. These procedures can be modified and applied to almost any other type of RNA molecule as well. In Section III of this volume, we explore in detail how effectively 5'-end-labeled mRNA can be used not only for nucleotide sequence analysis, but also for mapping mRNA secondary structure

  13. Chimira: analysis of small RNA sequencing data and microRNA modifications.

    Science.gov (United States)

    Vitsios, Dimitrios M; Enright, Anton J

    2015-10-15

    Chimira is a web-based system for microRNA (miRNA) analysis from small RNA-Seq data. Sequences are automatically cleaned, trimmed, size selected and mapped directly to miRNA hairpin sequences. This generates count-based miRNA expression data for subsequent statistical analysis. Moreover, it is capable of identifying epi-transcriptomic modifications in the input sequences. Supported modification types include multiple types of 3'-modifications (e.g. uridylation, adenylation), 5'-modifications and also internal modifications or variation (ADAR editing or single nucleotide polymorphisms). Besides cleaning and mapping of input sequences to miRNAs, Chimira provides a simple and intuitive set of tools for the analysis and interpretation of the results (see also Supplementary Material). These allow the visual study of the differential expression between two specific samples or sets of samples, the identification of the most highly expressed miRNAs within sample pairs (or sets of samples) and also the projection of the modification profile for specific miRNAs across all samples. Other tools have already been published in the past for various types of small RNA-Seq analysis, such as UEA workbench, seqBuster, MAGI, OASIS and CAP-miRSeq, CPSS for modifications identification. A comprehensive comparison of Chimira with each of these tools is provided in the Supplementary Material. Chimira outperforms all of these tools in total execution speed and aims to facilitate simple, fast and reliable analysis of small RNA-Seq data allowing also, for the first time, identification of global microRNA modification profiles in a simple intuitive interface. Chimira has been developed as a web application and it is accessible here: http://www.ebi.ac.uk/research/enright/software/chimira. aje@ebi.ac.uk Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  14. Analysis of hepatitis C virus RNA dimerization and core–RNA interactions

    Science.gov (United States)

    Ivanyi-Nagy, Roland; Kanevsky, Igor; Gabus, Caroline; Lavergne, Jean-Pierre; Ficheux, Damien; Penin, François; Fossé, Philippe; Darlix, Jean-Luc

    2006-01-01

    The core protein of hepatitis C virus (HCV) has been shown previously to act as a potent nucleic acid chaperone in vitro, promoting the dimerization of the 3′-untranslated region (3′-UTR) of the HCV genomic RNA, a process probably mediated by a small, highly conserved palindromic RNA motif, named DLS (dimer linkage sequence) [G. Cristofari, R. Ivanyi-Nagy, C. Gabus, S. Boulant, J. P. Lavergne, F. Penin and J. L. Darlix (2004) Nucleic Acids Res., 32, 2623–2631]. To investigate in depth HCV RNA dimerization, we generated a series of point mutations in the DLS region. We find that both the plus-strand 3′-UTR and the complementary minus-strand RNA can dimerize in the presence of core protein, while mutations in the DLS (among them a single point mutation that abolished RNA replication in a HCV subgenomic replicon system) completely abrogate dimerization. Structural probing of plus- and minus-strand RNAs, in their monomeric and dimeric forms, indicate that the DLS is the major if not the sole determinant of UTR RNA dimerization. Furthermore, the N-terminal basic amino acid clusters of core protein were found to be sufficient to induce dimerization, suggesting that they retain full RNA chaperone activity. These findings may have important consequences for understanding the HCV replicative cycle and the genetic variability of the virus. PMID:16707664

  15. Analysis of hepatitis C virus RNA dimerization and core-RNA interactions.

    Science.gov (United States)

    Ivanyi-Nagy, Roland; Kanevsky, Igor; Gabus, Caroline; Lavergne, Jean-Pierre; Ficheux, Damien; Penin, François; Fossé, Philippe; Darlix, Jean-Luc

    2006-01-01

    The core protein of hepatitis C virus (HCV) has been shown previously to act as a potent nucleic acid chaperone in vitro, promoting the dimerization of the 3'-untranslated region (3'-UTR) of the HCV genomic RNA, a process probably mediated by a small, highly conserved palindromic RNA motif, named DLS (dimer linkage sequence) [G. Cristofari, R. Ivanyi-Nagy, C. Gabus, S. Boulant, J. P. Lavergne, F. Penin and J. L. Darlix (2004) Nucleic Acids Res., 32, 2623-2631]. To investigate in depth HCV RNA dimerization, we generated a series of point mutations in the DLS region. We find that both the plus-strand 3'-UTR and the complementary minus-strand RNA can dimerize in the presence of core protein, while mutations in the DLS (among them a single point mutation that abolished RNA replication in a HCV subgenomic replicon system) completely abrogate dimerization. Structural probing of plus- and minus-strand RNAs, in their monomeric and dimeric forms, indicate that the DLS is the major if not the sole determinant of UTR RNA dimerization. Furthermore, the N-terminal basic amino acid clusters of core protein were found to be sufficient to induce dimerization, suggesting that they retain full RNA chaperone activity. These findings may have important consequences for understanding the HCV replicative cycle and the genetic variability of the virus.

  16. ISEV position paper: extracellular vesicle RNA analysis and bioinformatics

    Directory of Open Access Journals (Sweden)

    Andrew F. Hill

    2013-12-01

    Full Text Available Extracellular vesicles (EVs are the collective term for the various vesicles that are released by cells into the extracellular space. Such vesicles include exosomes and microvesicles, which vary by their size and/or protein and genetic cargo. With the discovery that EVs contain genetic material in the form of RNA (evRNA has come the increased interest in these vesicles for their potential use as sources of disease biomarkers and potential therapeutic agents. Rapid developments in the availability of deep sequencing technologies have enabled the study of EV-related RNA in detail. In October 2012, the International Society for Extracellular Vesicles (ISEV held a workshop on “evRNA analysis and bioinformatics.” Here, we report the conclusions of one of the roundtable discussions where we discussed evRNA analysis technologies and provide some guidelines to researchers in the field to consider when performing such analysis.

  17. Structural basis underlying CAC RNA recognition by the RRM domain of dimeric RNA-binding protein RBPMS

    Energy Technology Data Exchange (ETDEWEB)

    Teplova, Marianna; Farazi, Thalia A.; Tuschl, Thomas; Patel, Dinshaw J.

    2015-09-08

    Abstract

    RNA-binding protein with multiple splicing (designated RBPMS) is a higher vertebrate mRNA-binding protein containing a single RNA recognition motif (RRM). RBPMS has been shown to be involved in mRNA transport, localization and stability, with key roles in axon guidance, smooth muscle plasticity, as well as regulation of cancer cell proliferation and migration. We report on structure-function studies of the RRM domain of RBPMS bound to a CAC-containing single-stranded RNA. These results provide insights into potential topologies of complexes formed by the RBPMS RRM domain and the tandem CAC repeat binding sites as detected by photoactivatable-ribonucleoside-enhanced crosslinking and immunoprecipitation. These studies establish that the RRM domain of RBPMS forms a symmetrical dimer in the free state, with each monomer binding sequence-specifically to all three nucleotides of a CAC segment in the RNA bound state. Structure-guided mutations within the dimerization and RNA-binding interfaces of RBPMS RRM on RNA complex formation resulted in both disruption of dimerization and a decrease in RNA-binding affinity as observed by size exclusion chromatography and isothermal titration calorimetry. As anticipated from biochemical binding studies, over-expression of dimerization or RNA-binding mutants of Flag-HA-tagged RBPMS were no longer able to track with stress granules in HEK293 cells, thereby documenting the deleterious effects of such mutationsin vivo.

  18. A toolbox for miRNA analysis

    Czech Academy of Sciences Publication Activity Database

    Svoboda, Petr

    2015-01-01

    Roč. 589, č. 14 (2015), s. 1694-1701 ISSN 0014-5793 R&D Projects: GA MŠk LO1419 Institutional support: RVO:68378050 Keywords : MicroRNA * Antagomir * Inhibition * Reporter * Locked nucleic acids Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.519, year: 2015

  19. The identification and functional annotation of RNA structures conserved in vertebrates

    DEFF Research Database (Denmark)

    Seemann, Ernst Stefan; Mirza, Aashiq Hussain; Hansen, Claus

    2017-01-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for Conserved RNA Structures (CRSs), leveraging structure-b......-structured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality.......Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for Conserved RNA Structures (CRSs), leveraging structure......-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ~516k human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (i) co-localize consistently with binding sites of the same RNA binding proteins...

  20. RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

    Science.gov (United States)

    Ellington, Roni; Wachira, James; Nkwanta, Asamoah

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses…

  1. ConStruct: Improved construction of RNA consensus structures

    Directory of Open Access Journals (Sweden)

    Steger Gerhard

    2008-04-01

    Full Text Available Abstract Background Aligning homologous non-coding RNAs (ncRNAs correctly in terms of sequence and structure is an unresolved problem, due to both mathematical complexity and imperfect scoring functions. High quality alignments, however, are a prerequisite for most consensus structure prediction approaches, homology searches, and tools for phylogeny inference. Automatically created ncRNA alignments often need manual corrections, yet this manual refinement is tedious and error-prone. Results We present an extended version of CONSTRUCT, a semi-automatic, graphical tool suitable for creating RNA alignments correct in terms of both consensus sequence and consensus structure. To this purpose CONSTRUCT combines sequence alignment, thermodynamic data and various measures of covariation. One important feature is that the user is guided during the alignment correction step by a consensus dotplot, which displays all thermodynamically optimal base pairs and the corresponding covariation. Once the initial alignment is corrected, optimal and suboptimal secondary structures as well as tertiary interaction can be predicted. We demonstrate CONSTRUCT's ability to guide the user in correcting an initial alignment, and show an example for optimal secondary consensus structure prediction on very hard to align SECIS elements. Moreover we use CONSTRUCT to predict tertiary interactions from sequences of the internal ribosome entry site of CrP-like viruses. In addition we show that alignments specifically designed for benchmarking can be easily be optimized using CONSTRUCT, although they share very little sequence identity. Conclusion CONSTRUCT's graphical interface allows for an easy alignment correction based on and guided by predicted and known structural constraints. It combines several algorithms for prediction of secondary consensus structure and even tertiary interactions. The CONSTRUCT package can be downloaded from the URL listed in the Availability and

  2. The integrated analysis of RNA-seq and microRNA-seq depicts miRNA-mRNA networks involved in Japanese flounder (Paralichthys olivaceus) albinism.

    Science.gov (United States)

    Wang, Na; Wang, Ruoqing; Wang, Renkai; Tian, Yongsheng; Shao, Changwei; Jia, Xiaodong; Chen, Songlin

    2017-01-01

    Albinism, a phenomenon characterized by pigmentation deficiency on the ocular side of Japanese flounder (Paralichthys olivaceus), has caused significant damage. Limited mRNA and microRNA (miRNA) information is available on fish pigmentation deficiency. In this study, a high-throughput sequencing strategy was employed to identify the mRNA and miRNAs involved in P. olivaceus albinism. Based on P. olivaceus genome, RNA-seq identified 21,787 know genes and 711 new genes by transcripts assembly. Of those, 235 genes exhibited significantly different expression pattern (fold change ≥2 or ≤0.5 and q-value≤0.05), including 194 down-regulated genes and 41 up-regulated genes in albino versus normally pigmented individuals. These genes were enriched to 81 GO terms and 9 KEGG pathways (p≤0.05). Among those, the pigmentation related pathways-Melanogenesis and tyrosine metabolism were contained. High-throughput miRNA sequencing identified a total of 475 miRNAs, including 64 novel miRNAs. Furthermore, 33 differentially expressed miRNAs containing 13 up-regulated and 20 down-regulated miRNAs were identified in albino versus normally pigmented individuals (fold change ≥1.5 or ≤0.67 and p≤0.05). The next target prediction discovered a variety of putative target genes, of which, 134 genes including Tyrosinase (TYR), Tyrosinase-related protein 1 (TYRP1), Microphthalmia-associated transcription factor (MITF) were overlapped with differentially expressed genes derived from RNA-seq. These target genes were significantly enriched to 254 GO terms and 103 KEGG pathways (p<0.001). Of those, tyrosine metabolism, lysosomes, phototransduction pathways, etc., attracted considerable attention due to their involvement in regulating skin pigmentation. Expression patterns of differentially expressed mRNA and miRNAs were validated in 10 mRNA and 10 miRNAs by qRT-PCR. With high-throughput mRNA and miRNA sequencing and analysis, a series of interested mRNA and miRNAs involved in fish

  3. Thermodynamics and kinetics of RNA tertiary structure formation in the junctionless hairpin ribozyme.

    Science.gov (United States)

    White, Neil A; Hoogstraten, Charles G

    2017-09-01

    The hairpin ribozyme consists of two RNA internal loops that interact to form the catalytically active structure. This docking transition is a rare example of intermolecular formation of RNA tertiary structure without coupling to helix annealing. We have used temperature-dependent surface plasmon resonance (SPR) to characterize the thermodynamics and kinetics of RNA tertiary structure formation for the junctionless form of the ribozyme, in which loops A and B reside on separate molecules. We find docking to be strongly enthalpy-driven and to be accompanied by substantial activation barriers for association and dissociation, consistent with the structural reorganization of both internal loops upon complex formation. Comparisons with the parallel analysis of a ribozyme variant carrying a 2'-O-methyl modification at the self-cleavage site and with published data in other systems reveal a surprising diversity of thermodynamic signatures, emphasizing the delicate balance of contributions to the free energy of formation of RNA tertiary structure. Copyright © 2017 Elsevier B.V. All rights reserved.

  4. Viral replication. Structural basis for RNA replication by the hepatitis C virus polymerase.

    Science.gov (United States)

    Appleby, Todd C; Perry, Jason K; Murakami, Eisuke; Barauskas, Ona; Feng, Joy; Cho, Aesop; Fox, David; Wetmore, Diana R; McGrath, Mary E; Ray, Adrian S; Sofia, Michael J; Swaminathan, S; Edwards, Thomas E

    2015-02-13

    Nucleotide analog inhibitors have shown clinical success in the treatment of hepatitis C virus (HCV) infection, despite an incomplete mechanistic understanding of NS5B, the viral RNA-dependent RNA polymerase. Here we study the details of HCV RNA replication by determining crystal structures of stalled polymerase ternary complexes with enzymes, RNA templates, RNA primers, incoming nucleotides, and catalytic metal ions during both primed initiation and elongation of RNA synthesis. Our analysis revealed that highly conserved active-site residues in NS5B position the primer for in-line attack on the incoming nucleotide. A β loop and a C-terminal membrane-anchoring linker occlude the active-site cavity in the apo state, retract in the primed initiation assembly to enforce replication of the HCV genome from the 3' terminus, and vacate the active-site cavity during elongation. We investigated the incorporation of nucleotide analog inhibitors, including the clinically active metabolite formed by sofosbuvir, to elucidate key molecular interactions in the active site. Copyright © 2015, American Association for the Advancement of Science.

  5. Landscape and variation of RNA secondary structure across the human transcriptome.

    Science.gov (United States)

    Wan, Yue; Qu, Kun; Zhang, Qiangfeng Cliff; Flynn, Ryan A; Manor, Ohad; Ouyang, Zhengqing; Zhang, Jiajing; Spitale, Robert C; Snyder, Michael P; Segal, Eran; Chang, Howard Y

    2014-01-30

    In parallel to the genetic code for protein synthesis, a second layer of information is embedded in all RNA transcripts in the form of RNA structure. RNA structure influences practically every step in the gene expression program. However, the nature of most RNA structures or effects of sequence variation on structure are not known. Here we report the initial landscape and variation of RNA secondary structures (RSSs) in a human family trio (mother, father and their child). This provides a comprehensive RSS map of human coding and non-coding RNAs. We identify unique RSS signatures that demarcate open reading frames and splicing junctions, and define authentic microRNA-binding sites. Comparison of native deproteinized RNA isolated from cells versus refolded purified RNA suggests that the majority of the RSS information is encoded within RNA sequence. Over 1,900 transcribed single nucleotide variants (approximately 15% of all transcribed single nucleotide variants) alter local RNA structure. We discover simple sequence and spacing rules that determine the ability of point mutations to impact RSSs. Selective depletion of 'riboSNitches' versus structurally synonymous variants at precise locations suggests selection for specific RNA shapes at thousands of sites, including 3' untranslated regions, binding sites of microRNAs and RNA-binding proteins genome-wide. These results highlight the potentially broad contribution of RNA structure and its variation to gene regulation.

  6. Statistical properties of thermodynamically predicted RNA secondary structures in viral genomes

    Science.gov (United States)

    Spanò, M.; Lillo, F.; Miccichè, S.; Mantegna, R. N.

    2008-10-01

    By performing a comprehensive study on 1832 segments of 1212 complete genomes of viruses, we show that in viral genomes the hairpin structures of thermodynamically predicted RNA secondary structures are more abundant than expected under a simple random null hypothesis. The detected hairpin structures of RNA secondary structures are present both in coding and in noncoding regions for the four groups of viruses categorized as dsDNA, dsRNA, ssDNA and ssRNA. For all groups, hairpin structures of RNA secondary structures are detected more frequently than expected for a random null hypothesis in noncoding rather than in coding regions. However, potential RNA secondary structures are also present in coding regions of dsDNA group. In fact, we detect evolutionary conserved RNA secondary structures in conserved coding and noncoding regions of a large set of complete genomes of dsDNA herpesviruses.

  7. Entropy-based model for miRNA isoform analysis.

    Directory of Open Access Journals (Sweden)

    Shengqin Wang

    Full Text Available MiRNAs have been widely studied due to their important post-transcriptional regulatory roles in gene expression. Many reports have demonstrated the evidence of miRNA isoform products (isomiRs in high-throughput small RNA sequencing data. However, the biological function involved in these molecules is still not well investigated. Here, we developed a Shannon entropy-based model to estimate isomiR expression profiles of high-throughput small RNA sequencing data extracted from miRBase webserver. By using the Kolmogorov-Smirnov statistical test (KS test, we demonstrated that the 5p and 3p miRNAs present more variants than the single arm miRNAs. We also found that the isomiR variant, except the 3' isomiR variant, is strongly correlated with Minimum Free Energy (MFE of pre-miRNA, suggesting the intrinsic feature of pre-miRNA should be one of the important factors for the miRNA regulation. The functional enrichment analysis showed that the miRNAs with high variation, particularly the 5' end variation, are enriched in a set of critical functions, supporting these molecules should not be randomly produced. Our results provide a probabilistic framework for miRNA isoforms analysis, and give functional insights into pre-miRNA processing.

  8. Structure-function relationship of substituted bromomethylcoumarins in nucleoside specificity of RNA alkylation.

    Science.gov (United States)

    Kellner, Stefanie; Kollar, Laura Bettina; Ochel, Antonia; Ghate, Manjunath; Helm, Mark

    2013-01-01

    Selective alkylation of RNA nucleotides is an important field of RNA biochemistry, e.g. in applications of fluorescent labeling or in structural probing experiments, yet detailed structure-function studies of labeling agents are rare. Here, bromomethylcoumarins as reactive compounds for fluorescent labeling of RNA are developed as an attractive scaffold on which electronic properties can be modulated by varying the substituents. Six different 4-bromomethyl-coumarins of various substitution patterns were tested for nucleotide specificity of RNA alkylation using tRNA from Escherichia coli as substrate. Using semi-quantitative LC-MS/MS analysis, reactions at mildly acidic and slightly alkaline pH were compared. For all tested compounds, coumarin conjugates with 4-thiouridine, pseudouridine, guanosine, and uridine were identified, with the latter largely dominating. This data set shows that selectivity of ribonucleotide alkylation depends on the substitution pattern of the reactive dye, and even more strongly on the modulation of the reaction conditions. The latter should be therefore carefully optimized when striving to achieve selectivity. Interestingly, the highest selectivity for labeling of a modified nucleoside, namely of 4-thiouridine, was achieved with a compound whose selectivity was somewhat less dependent on reaction conditions than the other compounds. In summary, bromomethylcoumarin derivatives are a highly interesting class of compounds, since their selectivity for 4-thiouridine can be efficiently tuned by variation of substitution pattern and reaction conditions.

  9. A Comparative Taxonomy of Parallel Algorithms for RNA Secondary Structure Prediction

    Science.gov (United States)

    Al-Khatib, Ra’ed M.; Abdullah, Rosni; Rashid, Nur’Aini Abdul

    2010-01-01

    RNA molecules have been discovered playing crucial roles in numerous biological and medical procedures and processes. RNA structures determination have become a major problem in the biology context. Recently, computer scientists have empowered the biologists with RNA secondary structures that ease an understanding of the RNA functions and roles. Detecting RNA secondary structure is an NP-hard problem, especially in pseudoknotted RNA structures. The detection process is also time-consuming; as a result, an alternative approach such as using parallel architectures is a desirable option. The main goal in this paper is to do an intensive investigation of parallel methods used in the literature to solve the demanding issues, related to the RNA secondary structure prediction methods. Then, we introduce a new taxonomy for the parallel RNA folding methods. Based on this proposed taxonomy, a systematic and scientific comparison is performed among these existing methods. PMID:20458364

  10. Geographical variation in soil bacterial community structure in tropical forests in Southeast Asia and temperate forests in Japan based on pyrosequencing analysis of 16S rRNA.

    Science.gov (United States)

    Ito, Natsumi; Iwanaga, Hiroko; Charles, Suliana; Diway, Bibian; Sabang, John; Chong, Lucy; Nanami, Satoshi; Kamiya, Koichi; Lum, Shawn; Siregar, Ulfah J; Harada, Ko; Miyashita, Naohiko T

    2017-09-12

    Geographical variation in soil bacterial community structure in 26 tropical forests in Southeast Asia (Malaysia, Indonesia and Singapore) and two temperate forests in Japan was investigated to elucidate the environmental factors and mechanisms that influence biogeography of soil bacterial diversity and composition. Despite substantial environmental differences, bacterial phyla were represented in similar proportions, with Acidobacteria and Proteobacteria the dominant phyla in all forests except one mangrove forest in Sarawak, although highly significant heterogeneity in frequency of individual phyla was detected among forests. In contrast, species diversity (α-diversity) differed to a much greater extent, being nearly six-fold higher in the mangrove forest (Chao1 index = 6,862) than in forests in Singapore and Sarawak (~1,250). In addition, natural mixed dipterocarp forests had lower species diversity than acacia and oil palm plantations, indicating that aboveground tree composition does not influence soil bacterial diversity. Shannon and Chao1 indices were correlated positively, implying that skewed operational taxonomic unit (OTU) distribution was associated with the abundance of overall and rare (singleton) OTUs. No OTUs were represented in all 28 forests, and forest-specific OTUs accounted for over 70% of all detected OTUs. Forests that were geographically adjacent and/or of the same forest type had similar bacterial species composition, and a positive correlation was detected between species divergence (β-diversity) and direct distance between forests. Both α- and β-diversities were correlated with soil pH. These results suggest that soil bacterial communities in different forests evolve largely independently of each other and that soil bacterial communities adapt to their local environment, modulated by bacterial dispersal (distance effect) and forest type. Therefore, we conclude that the biogeography of soil bacteria communities described here is non

  11. A semi-supervised learning approach for RNA secondary structure prediction.

    Science.gov (United States)

    Yonemoto, Haruka; Asai, Kiyoshi; Hamada, Michiaki

    2015-08-01

    RNA secondary structure prediction is a key technology in RNA bioinformatics. Most algorithms for RNA secondary structure prediction use probabilistic models, in which the model parameters are trained with reliable RNA secondary structures. Because of the difficulty of determining RNA secondary structures by experimental procedures, such as NMR or X-ray crystal structural analyses, there are still many RNA sequences that could be useful for training whose secondary structures have not been experimentally determined. In this paper, we introduce a novel semi-supervised learning approach for training parameters in a probabilistic model of RNA secondary structures in which we employ not only RNA sequences with annotated secondary structures but also ones with unknown secondary structures. Our model is based on a hybrid of generative (stochastic context-free grammars) and discriminative models (conditional random fields) that has been successfully applied to natural language processing. Computational experiments indicate that the accuracy of secondary structure prediction is improved by incorporating RNA sequences with unknown secondary structures into training. To our knowledge, this is the first study of a semi-supervised learning approach for RNA secondary structure prediction. This technique will be useful when the number of reliable structures is limited. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. The UEA Small RNA Workbench: A Suite of Computational Tools for Small RNA Analysis.

    Science.gov (United States)

    Mohorianu, Irina; Stocks, Matthew Benedict; Applegate, Christopher Steven; Folkes, Leighton; Moulton, Vincent

    2017-01-01

    RNA silencing (RNA interference, RNAi) is a complex, highly conserved mechanism mediated by short, typically 20-24 nt in length, noncoding RNAs known as small RNAs (sRNAs). They act as guides for the sequence-specific transcriptional and posttranscriptional regulation of target mRNAs and play a key role in the fine-tuning of biological processes such as growth, response to stresses, or defense mechanism.High-throughput sequencing (HTS) technologies are employed to capture the expression levels of sRNA populations. The processing of the resulting big data sets facilitated the computational analysis of the sRNA patterns of variation within biological samples such as time point experiments, tissue series or various treatments. Rapid technological advances enable larger experiments, often with biological replicates leading to a vast amount of raw data. As a result, in this fast-evolving field, the existing methods for sequence characterization and prediction of interaction (regulatory) networks periodically require adapting or in extreme cases, a complete redesign to cope with the data deluge. In addition, the presence of numerous tools focused only on particular steps of HTS analysis hinders the systematic parsing of the results and their interpretation.The UEA small RNA Workbench (v1-4), described in this chapter, provides a user-friendly, modular, interactive analysis in the form of a suite of computational tools designed to process and mine sRNA datasets for interesting characteristics that can be linked back to the observed phenotypes. First, we show how to preprocess the raw sequencing output and prepare it for downstream analysis. Then we review some quality checks that can be used as a first indication of sources of variability between samples. Next we show how the Workbench can provide a comparison of the effects of different normalization approaches on the distributions of expression, enhanced methods for the identification of differentially expressed

  13. Community Structure of Denitrifiers, Bacteria, and Archaea along Redox Gradients in Pacific Northwest Marine Sediments by Terminal Restriction Fragment Length Polymorphism Analysis of Amplified Nitrite Reductase (nirS) and 16S rRNA Genes

    Science.gov (United States)

    Braker, Gesche; Ayala-del-Río, Héctor L.; Devol, Allan H.; Fesefeldt, Andreas; Tiedje, James M.

    2001-01-01

    Steep vertical gradients of oxidants (O2 and NO3−) in Puget Sound and Washington continental margin sediments indicate that aerobic respiration and denitrification occur within the top few millimeters to centimeters. To systematically explore the underlying communities of denitrifiers, Bacteria, and Archaea along redox gradients at distant geographic locations, nitrite reductase (nirS) genes and bacterial and archaeal 16S rRNA genes (rDNAs) were PCR amplified and analyzed by terminal restriction fragment length polymorphism (T-RFLP) analysis. The suitablility of T-RFLP analysis for investigating communities of nirS-containing denitrifiers was established by the correspondence of dominant terminal restriction fragments (T-RFs) of nirS to computer-simulated T-RFs of nirS clones. These clones belonged to clusters II, III, and IV from the same cores and were analyzed in a previous study (G. Braker, J. Zhou, L. Wu, A. H. Devol, and J. M. Tiedje, Appl. Environ. Microbiol. 66:2096–2104, 2000). T-RFLP analysis of nirS and bacterial rDNA revealed a high level of functional and phylogenetic diversity, whereas the level of diversity of Archaea was lower. A comparison of T-RFLPs based on the presence or absence of T-RFs and correspondence analysis based on the frequencies and heights of T-RFs allowed us to group sediment samples according to the sampling location and thus clearly distinguish Puget Sound and the Washington margin populations. However, changes in community structure within sediment core sections during the transition from aerobic to anaerobic conditions were minor. Thus, within the top layers of marine sediments, redox gradients seem to result from the differential metabolic activities of populations of similar communities, probably through mixing by marine invertebrates rather than from the development of distinct communities. PMID:11282647

  14. Structure of Hepatitis E Virion-Sized Particle Reveals an RNA-Dependent Viral Assembly Pathway

    Energy Technology Data Exchange (ETDEWEB)

    Xing, L.; Wall, J.; Li, T.-C.; Mayazaki, N.; Simon, M. N.; Moore, M.; Wang, C.-Y.; Takeda, N.; Wakita, T.; Miyamura, T.; Cheng, R. H.

    2010-10-22

    Hepatitis E virus (HEV) induces acute hepatitis in humans with a high fatality rate in pregnant women. There is a need for anti-HEV research to understand the assembly process of HEV native capsid. Here, we produced a large virion-sized and a small T=1 capsid by expressing the HEV capsid protein in insect cells with and without the N-terminal 111 residues, respectively, for comparative structural analysis. The virion-sized capsid demonstrates a T=3 icosahedral lattice and contains RNA fragment in contrast to the RNA-free T=1 capsid. However, both capsids shared common decameric organization. The in vitro assembly further demonstrated that HEV capsid protein had the intrinsic ability to form decameric intermediate. Our data suggest that RNA binding is the extrinsic factor essential for the assembly of HEV native capsids.

  15. RADIA: RNA and DNA integrated analysis for somatic mutation detection.

    Directory of Open Access Journals (Sweden)

    Amie J Radenbaugh

    Full Text Available The detection of somatic single nucleotide variants is a crucial component to the characterization of the cancer genome. Mutation calling algorithms thus far have focused on comparing the normal and tumor genomes from the same individual. In recent years, it has become routine for projects like The Cancer Genome Atlas (TCGA to also sequence the tumor RNA. Here we present RADIA (RNA and DNA Integrated Analysis, a novel computational method combining the patient-matched normal and tumor DNA with the tumor RNA to detect somatic mutations. The inclusion of the RNA increases the power to detect somatic mutations, especially at low DNA allelic frequencies. By integrating an individual's DNA and RNA, we are able to detect mutations that would otherwise be missed by traditional algorithms that examine only the DNA. We demonstrate high sensitivity (84% and very high precision (98% and 99% for RADIA in patient data from endometrial carcinoma and lung adenocarcinoma from TCGA. Mutations with both high DNA and RNA read support have the highest validation rate of over 99%. We also introduce a simulation package that spikes in artificial mutations to patient data, rather than simulating sequencing data from a reference genome. We evaluate sensitivity on the simulation data and demonstrate our ability to rescue back mutations at low DNA allelic frequencies by including the RNA. Finally, we highlight mutations in important cancer genes that were rescued due to the incorporation of the RNA.

  16. Fast flexible modeling of RNA structure using internal coordinates.

    Science.gov (United States)

    Flores, Samuel Coulbourn; Sherman, Michael A; Bruns, Christopher M; Eastman, Peter; Altman, Russ Biagio

    2011-01-01

    Modeling the structure and dynamics of large macromolecules remains a critical challenge. Molecular dynamics (MD) simulations are expensive because they model every atom independently, and are difficult to combine with experimentally derived knowledge. Assembly of molecules using fragments from libraries relies on the database of known structures and thus may not work for novel motifs. Coarse-grained modeling methods have yielded good results on large molecules but can suffer from difficulties in creating more detailed full atomic realizations. There is therefore a need for molecular modeling algorithms that remain chemically accurate and economical for large molecules, do not rely on fragment libraries, and can incorporate experimental information. RNABuilder works in the internal coordinate space of dihedral angles and thus has time requirements proportional to the number of moving parts rather than the number of atoms. It provides accurate physics-based response to applied forces, but also allows user-specified forces for incorporating experimental information. A particular strength of RNABuilder is that all Leontis-Westhof basepairs can be specified as primitives by the user to be satisfied during model construction. We apply RNABuilder to predict the structure of an RNA molecule with 160 bases from its secondary structure, as well as experimental information. Our model matches the known structure to 10.2 Angstroms RMSD and has low computational expense.

  17. The secondary structure of large-subunit rRNA divergent domains, a marker for protist evolution

    DEFF Research Database (Denmark)

    Lenaers, G; Nielsen, Henrik; Engberg, J

    1988-01-01

    The secondary structure of the large-subunit ribosomal RNA (24-26S rRNA) has been studied with emphasis on comparative analysis of the folding patterns of the divergent domains in the available protist sequences, that is Prorocentrum micans (dinoflagellate), Saccharomyces carlsbergensis (yeast......), Tetrahymena thermophila (ciliate), Physarum polycephalum and Dictyostelium discoideum (slime moulds), Crithidia fasciculata and Giardia lamblia (parasitic flagellates). The folding for the D3, D7a and D10 divergent domains has been refined and a consensus model for the protist 24-26S rRNA structure...

  18. Reconstruction and analysis of the lncRNA-miRNA-mRNA network based on competitive endogenous RNA reveal functional lncRNAs in rheumatoid arthritis.

    Science.gov (United States)

    Jiang, Hui; Ma, Rong; Zou, Shubiao; Wang, Yongzhong; Li, Zhuqing; Li, Weiping

    2017-06-01

    Rheumatoid arthritis (RA) is an autoimmune disease with an unknown etiology, occurring in approximately 1.0% of general population. More and more studies have suggested that long non-coding RNAs (lncRNAs) could play important roles in various biological processes and be associated with the pathogenesis of different kinds of diseases including RA. Although a large number of lncRNAs have been found, our knowledge of their function and physiological/pathological significance is still in its infancy. In order to reveal functional lncRNAs and identify the key lncRNAs in RA, we reconstructed a global triple network based on the competitive endogenous RNA (ceRNA) theory using the data from National Center for Biotechnology Information Gene Expression Omnibus and our previous paper. Meanwhile, Gene Ontology (GO) and pathway analysis were performed using Cytoscape plug-in BinGO and Database for Annotation, Visualization, and Integration Discovery (DAVID), respectively. We found that the lncRNA-miRNA-mRNA network was composed of 7 lncRNA nodes, 90 mRNA nodes, 24 miRNA nodes, and 301 edges. The functional assay showed that 147 GO terms and 23 pathways were enriched. In addition, three lncRNAs (S5645.1, XR_006437.1, J01878) were highly related to RA, and therefore, were selected as key lncRNAs. This study suggests that specific lncRNAs are associated with the development of RA, and three lncRNAs (S5645.1, XR_006437.1, J01878) could be used as potential diagnostic biomarkers and therapeutic targets.

  19. Biomarker MicroRNAs for Diagnosis of Oral Squamous Cell Carcinoma Identified Based on Gene Expression Data and MicroRNA-mRNA Network Analysis

    Science.gov (United States)

    Zhang, Hui; Li, Tangxin; Zheng, Linqing

    2017-01-01

    Oral squamous cell carcinoma is one of the most malignant tumors with high mortality rate worldwide. Biomarker discovery is critical for early diagnosis and precision treatment of this disease. MicroRNAs are small noncoding RNA molecules which often regulate essential biological processes and are good candidates for biomarkers. By integrative analysis of both the cancer-associated gene expression data and microRNA-mRNA network, miR-148b-3p, miR-629-3p, miR-27a-3p, and miR-142-3p were screened as novel diagnostic biomarkers for oral squamous cell carcinoma based on their unique regulatory abilities in the network structure of the conditional microRNA-mRNA network and their important functions. These findings were confirmed by literature verification and functional enrichment analysis. Future experimental validation is expected for the further investigation of their molecular mechanisms. PMID:29098014

  20. Molecular structure and thermodynamic predictions to create highly sensitive microRNA biosensors

    International Nuclear Information System (INIS)

    Larkey, Nicholas E.; Brucks, Corinne N.; Lansing, Shan S.; Le, Sophia D.; Smith, Natasha M.; Tran, Victoria; Zhang, Lulu; Burrows, Sean M.

    2016-01-01

    Many studies have established microRNAs (miRNAs) as post-transcriptional regulators in a variety of intracellular molecular processes. Abnormal changes in miRNA have been associated with several diseases. However, these changes are sometimes subtle and occur at nanomolar levels or lower. Several biosensing hurdles for in situ cellular/tissue analysis of miRNA limit detection of small amounts of miRNA. Of these limitations the most challenging are selectivity and sensor degradation creating high background signals and false signals. Recently we developed a reporter+probe biosensor for let-7a that showed potential to mitigate false signal from sensor degradation. Here we designed reporter+probe biosensors for miR-26a-2-3p and miR-27a-5p to better understand the effect of thermodynamics and molecular structures of the biosensor constituents on the analytical performance. Signal changes from interactions between Cy3 and Cy5 on the reporters were used to understand structural aspects of the reporter designs. Theoretical thermodynamic values, single stranded conformations, hetero- and homodimerization structures, and equilibrium concentrations of the reporters and probes were used to interpret the experimental observations. Studies of the sensitivity and selectivity revealed 5–9 nM detection limits in the presence and absence of interfering off-analyte miRNAs. These studies will aid in determining how to rationally design reporter+probe biosensors to overcome hurdles associated with highly sensitive miRNA biosensing. - Highlights: • Challenges facing highly sensitive miRNA biosensor designs are addressed. • Thermodynamic and molecular structure design metrics for reporter+probe biosensors are proposed. • The influence of ideal and non-ideal reporter hairpin structures on reporter+probe formation and signal change are discussed. • 5–9 nM limits of detection were observed with no interference from off-analytes.

  1. Molecular structure and thermodynamic predictions to create highly sensitive microRNA biosensors

    Energy Technology Data Exchange (ETDEWEB)

    Larkey, Nicholas E.; Brucks, Corinne N.; Lansing, Shan S.; Le, Sophia D.; Smith, Natasha M.; Tran, Victoria; Zhang, Lulu; Burrows, Sean M., E-mail: sean.burrows@oregonstate.edu

    2016-02-25

    Many studies have established microRNAs (miRNAs) as post-transcriptional regulators in a variety of intracellular molecular processes. Abnormal changes in miRNA have been associated with several diseases. However, these changes are sometimes subtle and occur at nanomolar levels or lower. Several biosensing hurdles for in situ cellular/tissue analysis of miRNA limit detection of small amounts of miRNA. Of these limitations the most challenging are selectivity and sensor degradation creating high background signals and false signals. Recently we developed a reporter+probe biosensor for let-7a that showed potential to mitigate false signal from sensor degradation. Here we designed reporter+probe biosensors for miR-26a-2-3p and miR-27a-5p to better understand the effect of thermodynamics and molecular structures of the biosensor constituents on the analytical performance. Signal changes from interactions between Cy3 and Cy5 on the reporters were used to understand structural aspects of the reporter designs. Theoretical thermodynamic values, single stranded conformations, hetero- and homodimerization structures, and equilibrium concentrations of the reporters and probes were used to interpret the experimental observations. Studies of the sensitivity and selectivity revealed 5–9 nM detection limits in the presence and absence of interfering off-analyte miRNAs. These studies will aid in determining how to rationally design reporter+probe biosensors to overcome hurdles associated with highly sensitive miRNA biosensing. - Highlights: • Challenges facing highly sensitive miRNA biosensor designs are addressed. • Thermodynamic and molecular structure design metrics for reporter+probe biosensors are proposed. • The influence of ideal and non-ideal reporter hairpin structures on reporter+probe formation and signal change are discussed. • 5–9 nM limits of detection were observed with no interference from off-analytes.

  2. Simultaneous characterization of cellular RNA structure and function with in-cell SHAPE-Seq.

    Science.gov (United States)

    Watters, Kyle E; Abbott, Timothy R; Lucks, Julius B

    2016-01-29

    Many non-coding RNAs form structures that interact with cellular machinery to control gene expression. A central goal of molecular and synthetic biology is to uncover design principles linking RNA structure to function to understand and engineer this relationship. Here we report a simple, high-throughput method called in-cell SHAPE-Seq that combines in-cell probing of RNA structure with a measurement of gene expression to simultaneously characterize RNA structure and function in bacterial cells. We use in-cell SHAPE-Seq to study the structure-function relationship of two RNA mechanisms that regulate translation in Escherichia coli. We find that nucleotides that participate in RNA-RNA interactions are highly accessible when their binding partner is absent and that changes in RNA structure due to RNA-RNA interactions can be quantitatively correlated to changes in gene expression. We also characterize the cellular structures of three endogenously expressed non-coding RNAs: 5S rRNA, RNase P and the btuB riboswitch. Finally, a comparison between in-cell and in vitro folded RNA structures revealed remarkable similarities for synthetic RNAs, but significant differences for RNAs that participate in complex cellular interactions. Thus, in-cell SHAPE-Seq represents an easily approachable tool for biologists and engineers to uncover relationships between sequence, structure and function of RNAs in the cell. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. RNA-dependent RNA polymerase of hepatitis C virus binds to its coding region RNA stem-loop structure, 5BSL3.2, and its negative strand.

    Science.gov (United States)

    Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko

    2010-05-01

    The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.

  4. An efficient algorithm for planar drawing of RNA structures with pseudoknots of any type.

    Science.gov (United States)

    Byun, Yanga; Han, Kyungsook

    2016-06-01

    An RNA pseudoknot is a tertiary structural element in which bases of a loop pair with complementary bases are outside the loop. A drawing of RNA secondary structures is a tree, but a drawing of RNA pseudoknots is a graph that has an inner cycle within a pseudoknot and possibly outer cycles formed between the pseudoknot and other structural elements. Visualizing a large-scale RNA structure with pseudoknots as a planar drawing is challenging because a planar drawing of an RNA structure requires both pseudoknots and an entire structure enclosing the pseudoknots to be embedded into a plane without overlapping or crossing. This paper presents an efficient heuristic algorithm for visualizing a pseudoknotted RNA structure as a planar drawing. The algorithm consists of several parts for finding crossing stems and page mapping the stems, for the layout of stem-loops and pseudoknots, and for overlap detection between structural elements and resolving it. Unlike previous algorithms, our algorithm generates a planar drawing for a large RNA structure with pseudoknots of any type and provides a bracket view of the structure. It generates a compact and aesthetic structure graph for a large pseudoknotted RNA structure in O([Formula: see text]) time, where n is the number of stems of the RNA structure.

  5. Considerations in the identification of functional RNA structural elements in genomic alignments

    Directory of Open Access Journals (Sweden)

    Blencowe Benjamin J

    2007-01-01

    Full Text Available Abstract Background Accurate identification of novel, functional noncoding (nc RNA features in genome sequence has proven more difficult than for exons. Current algorithms identify and score potential RNA secondary structures on the basis of thermodynamic stability, conservation, and/or covariance in sequence alignments. Neither the algorithms nor the information gained from the individual inputs have been independently assessed. Furthermore, due to issues in modelling background signal, it has been difficult to gauge the precision of these algorithms on a genomic scale, in which even a seemingly small false-positive rate can result in a vast excess of false discoveries. Results We developed a shuffling algorithm, shuffle-pair.pl, that simultaneously preserves dinucleotide frequency, gaps, and local conservation in pairwise sequence alignments. We used shuffle-pair.pl to assess precision and recall of six ncRNA search tools (MSARI, QRNA, ddbRNA, RNAz, Evofold, and several variants of simple thermodynamic stability on a test set of 3046 alignments of known ncRNAs. Relative to mononucleotide shuffling, preservation of dinucleotide content in shuffling the alignments resulted in a drastic increase in estimated false-positive detection rates for ncRNA elements, precluding evaluation of higher order alignments, which cannot not be adequately shuffled maintaining both dinucleotides and alignment structure. On pairwise alignments, none of the covariance-based tools performed markedly better than thermodynamic scoring alone. Although the high false-positive rates call into question the veracity of any individual predicted secondary structural element in our analysis, we nevertheless identified intriguing global trends in human genome alignments. The distribution of ncRNA prediction scores in 75-base windows overlapping UTRs, introns, and intergenic regions analyzed using both thermodynamic stability and EvoFold (which has no thermodynamic component was

  6. Structural explanation for the role of Mn2+ in the activity of phi6 RNA-dependent RNA polymerase.

    Science.gov (United States)

    Poranen, Minna M; Salgado, Paula S; Koivunen, Minni R L; Wright, Sam; Bamford, Dennis H; Stuart, David I; Grimes, Jonathan M

    2008-11-01

    The biological role of manganese (Mn(2+)) has been a long-standing puzzle, since at low concentrations it activates several polymerases whilst at higher concentrations it inhibits. Viral RNA polymerases possess a common architecture, reminiscent of a closed right hand. The RNA-dependent RNA polymerase (RdRp) of bacteriophage 6 is one of the best understood examples of this important class of polymerases. We have probed the role of Mn(2+) by biochemical, biophysical and structural analyses of the wild-type enzyme and of a mutant form with an altered Mn(2+)-binding site (E491 to Q). The E491Q mutant has much reduced affinity for Mn(2+), reduced RNA binding and a compromised elongation rate. Loss of Mn(2+) binding structurally stabilizes the enzyme. These data and a re-examination of the structures of other viral RNA polymerases clarify the role of manganese in the activation of polymerization: Mn(2+) coordination of a catalytic aspartate is necessary to allow the active site to properly engage with the triphosphates of the incoming NTPs. The structural flexibility caused by Mn(2+) is also important for the enzyme dynamics, explaining the requirement for manganese throughout RNA polymerization.

  7. Structural similarities and functional differences clarify evolutionary relationships between tRNA healing enzymes and the myelin enzyme CNPase.

    Science.gov (United States)

    Muruganandam, Gopinath; Raasakka, Arne; Myllykoski, Matti; Kursula, Inari; Kursula, Petri

    2017-05-16

    Eukaryotic tRNA splicing is an essential process in the transformation of a primary tRNA transcript into a mature functional tRNA molecule. 5'-phosphate ligation involves two steps: a healing reaction catalyzed by polynucleotide kinase (PNK) in association with cyclic phosphodiesterase (CPDase), and a sealing reaction catalyzed by an RNA ligase. The enzymes that catalyze tRNA healing in yeast and higher eukaryotes are homologous to the members of the 2H phosphoesterase superfamily, in particular to the vertebrate myelin enzyme 2',3'-cyclic nucleotide 3'-phosphodiesterase (CNPase). We employed different biophysical and biochemical methods to elucidate the overall structural and functional features of the tRNA healing enzymes yeast Trl1 PNK/CPDase and lancelet PNK/CPDase and compared them with vertebrate CNPase. The yeast and the lancelet enzymes have cyclic phosphodiesterase and polynucleotide kinase activity, while vertebrate CNPase lacks PNK activity. In addition, we also show that the healing enzymes are structurally similar to the vertebrate CNPase by applying synchrotron radiation circular dichroism spectroscopy and small-angle X-ray scattering. We provide a structural analysis of the tRNA healing enzyme PNK and CPDase domains together. Our results support evolution of vertebrate CNPase from tRNA healing enzymes with a loss of function at its N-terminal PNK-like domain.

  8. Production of HIV-1 vif mRNA Is Modulated by Natural Nucleotide Variations and SLSA1 RNA Structure in SA1D2prox Genomic Region

    Directory of Open Access Journals (Sweden)

    Masako Nomaguchi

    2017-12-01

    Full Text Available Genomic RNA of HIV-1 contains localized structures critical for viral replication. Its structural analysis has demonstrated a stem-loop structure, SLSA1, in a nearby region of HIV-1 genomic splicing acceptor 1 (SA1. We have previously shown that the expression level of vif mRNA is considerably altered by some natural single-nucleotide variations (nSNVs clustering in SLSA1 structure. In this study, besides eleven nSNVs previously identified by us, we totally found nine new nSNVs in the SLSA1-containing sequence from SA1, splicing donor 2, and through to the start codon of Vif that significantly affect the vif mRNA level, and designated the sequence SA1D2prox (142 nucleotides for HIV-1 NL4-3. We then examined by extensive variant and mutagenesis analyses how SA1D2prox sequence and SLSA1 secondary structure are related to vif mRNA level. While the secondary structure and stability of SLSA1 was largely changed by nSNVs and artificial mutations introduced to restore the original NL4-3 form from altered ones by nSNVs, no clear association of the two SLSA1 properties with vif mRNA level was observed. In contrast, when naturally occurring SA1D2prox sequences that contain multiple nSNVs were examined, we attained significant inverse correlation between the vif level and SLSA1 stability. These results may suggest that SA1D2prox sequence adapts over time, and also that the altered SA1D2prox sequence, SLSA1 stability, and vif level are mutually related. In total, we show here that the entire SA1D2prox sequence and SLSA1 stability critically contribute to the modulation of vif mRNA level.

  9. Topological structure of the space of phenotypes: the case of RNA neutral networks.

    Directory of Open Access Journals (Sweden)

    Jacobo Aguirre

    Full Text Available The evolution and adaptation of molecular populations is constrained by the diversity accessible through mutational processes. RNA is a paradigmatic example of biopolymer where genotype (sequence and phenotype (approximated by the secondary structure fold are identified in a single molecule. The extreme redundancy of the genotype-phenotype map leads to large ensembles of RNA sequences that fold into the same secondary structure and can be connected through single-point mutations. These ensembles define neutral networks of phenotypes in sequence space. Here we analyze the topological properties of neutral networks formed by 12-nucleotides RNA sequences, obtained through the exhaustive folding of sequence space. A total of 4(12 sequences fragments into 645 subnetworks that correspond to 57 different secondary structures. The topological analysis reveals that each subnetwork is far from being random: it has a degree distribution with a well-defined average and a small dispersion, a high clustering coefficient, and an average shortest path between nodes close to its minimum possible value, i.e. the Hamming distance between sequences. RNA neutral networks are assortative due to the correlation in the composition of neighboring sequences, a feature that together with the symmetries inherent to the folding process explains the existence of communities. Several topological relationships can be analytically derived attending to structural restrictions and generic properties of the folding process. The average degree of these phenotypic networks grows logarithmically with their size, such that abundant phenotypes have the additional advantage of being more robust to mutations. This property prevents fragmentation of neutral networks and thus enhances the navigability of sequence space. In summary, RNA neutral networks show unique topological properties, unknown to other networks previously described.

  10. Temporal Translational Control by a Metastable RNA Structure

    DEFF Research Database (Denmark)

    Møller-Jensen, Jakob; Franch, Thomas; Gerdes, Kenn

    2001-01-01

    Programmed cell death by the hok/sok locus of plasmid R1 relies on a complex translational control mechanism. The highly stable hok mRNA is activated by 3'-end exonucleolytical processing. Removal of the mRNA 3' end releases a 5'-end sequence that triggers refolding of the mRNA. The refolded hok m......-transcriptional control mechanism....

  11. Genome-scale characterization of RNA tertiary structures and their functional impact by RNA solvent accessibility prediction.

    Science.gov (United States)

    Yang, Yuedong; Li, Xiaomei; Zhao, Huiying; Zhan, Jian; Wang, Jihua; Zhou, Yaoqi

    2017-01-01

    As most RNA structures are elusive to structure determination, obtaining solvent accessible surface areas (ASAs) of nucleotides in an RNA structure is an important first step to characterize potential functional sites and core structural regions. Here, we developed RNAsnap, the first machine-learning method trained on protein-bound RNA structures for solvent accessibility prediction. Built on sequence profiles from multiple sequence alignment (RNAsnap-prof), the method provided robust prediction in fivefold cross-validation and an independent test (Pearson correlation coefficients, r, between predicted and actual ASA values are 0.66 and 0.63, respectively). Application of the method to 6178 mRNAs revealed its positive correlation to mRNA accessibility by dimethyl sulphate (DMS) experimentally measured in vivo (r = 0.37) but not in vitro (r = 0.07), despite the lack of training on mRNAs and the fact that DMS accessibility is only an approximation to solvent accessibility. We further found strong association across coding and noncoding regions between predicted solvent accessibility of the mutation site of a single nucleotide variant (SNV) and the frequency of that variant in the population for 2.2 million SNVs obtained in the 1000 Genomes Project. Moreover, mapping solvent accessibility of RNAs to the human genome indicated that introns, 5' cap of 5' and 3' cap of 3' untranslated regions, are more solvent accessible, consistent with their respective functional roles. These results support conformational selections as the mechanism for the formation of RNA-protein complexes and highlight the utility of genome-scale characterization of RNA tertiary structures by RNAsnap. The server and its stand-alone downloadable version are available at http://sparks-lab.org. © 2016 Yang et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  12. Schizosaccharomyces pombe Polysome Profile Analysis and RNA Purification.

    Science.gov (United States)

    Wolf, Dieter A; Bähler, Jürg; Wise, Jo Ann

    2017-04-03

    Polysome profile analysis is widely used by investigators studying the mechanism and regulation of translation. The method described here uses high-velocity centrifugation of whole cell extracts on linear sucrose gradients to separate 40S and 60S ribosomal subunits from 80S monosomes and polysomes. Cycloheximide is included in the lysis buffer to "freeze" polysomes by blocking translation. After centrifugation, the gradient is fractionated and RNA (and/or protein) is prepared from each fraction for subsequent analysis of individual species using northern or western blots. The entire RNA population in each fraction can be analyzed by hybridization to microarrays or by high-throughput RNA sequencing, and the proteins present can be identified by mass spectrometry analysis. © 2017 Cold Spring Harbor Laboratory Press.

  13. Functional 5' UTR mRNA structures in eukaryotic translation regulation and how to find them.

    Science.gov (United States)

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2018-03-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5' untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5' UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms.

  14. Vfold: a web server for RNA structure and folding thermodynamics prediction.

    Science.gov (United States)

    Xu, Xiaojun; Zhao, Peinan; Chen, Shi-Jie

    2014-01-01

    The ever increasing discovery of non-coding RNAs leads to unprecedented demand for the accurate modeling of RNA folding, including the predictions of two-dimensional (base pair) and three-dimensional all-atom structures and folding stabilities. Accurate modeling of RNA structure and stability has far-reaching impact on our understanding of RNA functions in human health and our ability to design RNA-based therapeutic strategies. The Vfold server offers a web interface to predict (a) RNA two-dimensional structure from the nucleotide sequence, (b) three-dimensional structure from the two-dimensional structure and the sequence, and (c) folding thermodynamics (heat capacity melting curve) from the sequence. To predict the two-dimensional structure (base pairs), the server generates an ensemble of structures, including loop structures with the different intra-loop mismatches, and evaluates the free energies using the experimental parameters for the base stacks and the loop entropy parameters given by a coarse-grained RNA folding model (the Vfold model) for the loops. To predict the three-dimensional structure, the server assembles the motif scaffolds using structure templates extracted from the known PDB structures and refines the structure using all-atom energy minimization. The Vfold-based web server provides a user friendly tool for the prediction of RNA structure and stability. The web server and the source codes are freely accessible for public use at "http://rna.physics.missouri.edu".

  15. Evolution of the RNase P RNA structural domain in Leptospira spp

    NARCIS (Netherlands)

    Ravishankar, Vigneshwaran; Ahmed, Ahmed; Sivagnanam, Ulaganathan; Muthuraman, Krishnaraja; Karthikaichamy, Anbarasu; Wilson, Herald A.; Devendran, Ajay; Hartskeerl, Rudy A.; Raj, Stephen M. L.

    2014-01-01

    We have employed the RNase P RNA (RPR) gene, which is present as single copy in chromosome I of Leptospira spp. to investigate the phylogeny of structural domains present in the RNA subunit of the tRNA processing enzyme, RNase P. RPR gene sequences of 150 strains derived from NCBI database along

  16. Non-Watson Crick base pairs might stabilize RNA structural motifs in ...

    Indian Academy of Sciences (India)

    Watson Crick base pairs, internal loops and pseudoknots have been the highlighting feature of recent structural determination of RNAs. The recent crystal structure of group-I introns has demonstrated that these might constitute RNA structural ...

  17. DSAP: deep-sequencing small RNA analysis pipeline.

    Science.gov (United States)

    Huang, Po-Jung; Liu, Yi-Chung; Lee, Chi-Ching; Lin, Wei-Chen; Gan, Richie Ruei-Chi; Lyu, Ping-Chiang; Tang, Petrus

    2010-07-01

    DSAP is an automated multiple-task web service designed to provide a total solution to analyzing deep-sequencing small RNA datasets generated by next-generation sequencing technology. DSAP uses a tab-delimited file as an input format, which holds the unique sequence reads (tags) and their corresponding number of copies generated by the Solexa sequencing platform. The input data will go through four analysis steps in DSAP: (i) cleanup: removal of adaptors and poly-A/T/C/G/N nucleotides; (ii) clustering: grouping of cleaned sequence tags into unique sequence clusters; (iii) non-coding RNA (ncRNA) matching: sequence homology mapping against a transcribed sequence library from the ncRNA database Rfam (http://rfam.sanger.ac.uk/); and (iv) known miRNA matching: detection of known miRNAs in miRBase (http://www.mirbase.org/) based on sequence homology. The expression levels corresponding to matched ncRNAs and miRNAs are summarized in multi-color clickable bar charts linked to external databases. DSAP is also capable of displaying miRNA expression levels from different jobs using a log(2)-scaled color matrix. Furthermore, a cross-species comparative function is also provided to show the distribution of identified miRNAs in different species as deposited in miRBase. DSAP is available at http://dsap.cgu.edu.tw.

  18. Poliovirus RNA synthesis in vitro: structural elements and antibody inhibition

    International Nuclear Information System (INIS)

    Semler, B.L.; Hanecak, R.; Dorner, L.F.; Anderson, C.W.; Wimmer, E.

    1983-01-01

    The poliovirus RNA polymerase complex has been analyzed by immunoautoradiography using antibody probes derived from purified replicase (P3) region viral polypeptides. Antibody preparations made against the polio RNA polymerase, P3-4b, detected a previously unreported cellular protein that copurifies with the RNA polymerase. An IgG fraction purified from rabbit antiserum to polypeptide P3-2, a precursor fo the RNA polymerase, specifically inhibits poliovirus RNA synthesis in vitro. The authors have also immunoprecipitated a 60,000-dalton protein (P3-4a) with antiserum to protein P3-4b and have determined the precise genomic map position of this protein by automated Edman degradation. Protein P3-4a originates by cleavage of the RNA polymerase precursor at a glutamine-glucine amino acid pair not previously reported to be a viral cleavage site

  19. An RNA-binding compound that stabilizes the HIV-1 gRNA packaging signal structure and specifically blocks HIV-1 RNA encapsidation.

    Science.gov (United States)

    Ingemarsdotter, Carin K; Zeng, Jingwei; Long, Ziqi; Lever, Andrew M L; Kenyon, Julia C

    2018-03-14

    NSC260594, a quinolinium derivative from the NCI diversity set II compound library, was previously identified in a target-based assay as an inhibitor of the interaction between the HIV-1 (ψ) stem-loop 3 (SL3) RNA and Gag. This compound was shown to exhibit potent antiviral activity. Here, the effects of this compound on individual stages of the viral lifecycle were examined by qRT-PCR, ELISA and Western blot, to see if its actions were specific to the viral packaging stage. The structural effects of NSC260594 binding to the HIV-1 gRNA were also examined by SHAPE and dimerization assays. Treatment of cells with NSC260594 did not reduce the number of integration events of incoming virus, and treatment of virus producing cells did not affect the level of intracellular Gag protein or viral particle release as determined by immunoblot. However, NSC260594 reduced the incorporation of gRNA into virions by up to 82%, without affecting levels of gRNA inside the cell. This reduction in packaging correlated closely with the reduction in infectivity of the released viral particles. To establish the structural effects of NSC260594 on the HIV-1 gRNA, we performed SHAPE analyses to pinpoint RNA structural changes. NSC260594 had a stabilizing effect on the wild type RNA that was not confined to SL3, but that was propagated across the structure. A packaging mutant lacking SL3 did not show this effect. NSC260594 acts as a specific inhibitor of HIV-1 RNA packaging. No other viral functions are affected. Its action involves preventing the interaction of Gag with SL3 by stabilizing this small RNA stem-loop which then leads to stabilization of the global packaging signal region (psi or ψ). This confirms data, previously only shown in analyses of isolated SL3 oligonucleotides, that SL3 is structurally labile in the presence of Gag and that this is critical for the complete psi region to be able to adopt different conformations. Since replication is otherwise unaffected by NSC260594

  20. RStrucFam: a web server to associate structure and cognate RNA for RNA-binding proteins from sequence information.

    Science.gov (United States)

    Ghosh, Pritha; Mathew, Oommen K; Sowdhamini, Ramanathan

    2016-10-07

    RNA-binding proteins (RBPs) interact with their cognate RNA(s) to form large biomolecular assemblies. They are versatile in their functionality and are involved in a myriad of processes inside the cell. RBPs with similar structural features and common biological functions are grouped together into families and superfamilies. It will be useful to obtain an early understanding and association of RNA-binding property of sequences of gene products. Here, we report a web server, RStrucFam, to predict the structure, type of cognate RNA(s) and function(s) of proteins, where possible, from mere sequence information. The web server employs Hidden Markov Model scan (hmmscan) to enable association to a back-end database of structural and sequence families. The database (HMMRBP) comprises of 437 HMMs of RBP families of known structure that have been generated using structure-based sequence alignments and 746 sequence-centric RBP family HMMs. The input protein sequence is associated with structural or sequence domain families, if structure or sequence signatures exist. In case of association of the protein with a family of known structures, output features like, multiple structure-based sequence alignment (MSSA) of the query with all others members of that family is provided. Further, cognate RNA partner(s) for that protein, Gene Ontology (GO) annotations, if any and a homology model of the protein can be obtained. The users can also browse through the database for details pertaining to each family, protein or RNA and their related information based on keyword search or RNA motif search. RStrucFam is a web server that exploits structurally conserved features of RBPs, derived from known family members and imprinted in mathematical profiles, to predict putative RBPs from sequence information. Proteins that fail to associate with such structure-centric families are further queried against the sequence-centric RBP family HMMs in the HMMRBP database. Further, all other essential

  1. Oasis: online analysis of small RNA deep sequencing data.

    Science.gov (United States)

    Capece, Vincenzo; Garcia Vizcaino, Julio C; Vidal, Ramon; Rahman, Raza-Ur; Pena Centeno, Tonatiuh; Shomroni, Orr; Suberviola, Irantzu; Fischer, Andre; Bonn, Stefan

    2015-07-01

    Oasis is a web application that allows for the fast and flexible online analysis of small-RNA-seq (sRNA-seq) data. It was designed for the end user in the lab, providing an easy-to-use web frontend including video tutorials, demo data and best practice step-by-step guidelines on how to analyze sRNA-seq data. Oasis' exclusive selling points are a differential expression module that allows for the multivariate analysis of samples, a classification module for robust biomarker detection and an advanced programming interface that supports the batch submission of jobs. Both modules include the analysis of novel miRNAs, miRNA targets and functional analyses including GO and pathway enrichment. Oasis generates downloadable interactive web reports for easy visualization, exploration and analysis of data on a local system. Finally, Oasis' modular workflow enables for the rapid (re-) analysis of data. Oasis is implemented in Python, R, Java, PHP, C++ and JavaScript. It is freely available at http://oasis.dzne.de. stefan.bonn@dzne.de Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  2. Structure and assembly of the essential RNA ring component of a viral DNA packaging motor.

    Science.gov (United States)

    Ding, Fang; Lu, Changrui; Zhao, Wei; Rajashankar, Kanagalaghatta R; Anderson, Dwight L; Jardine, Paul J; Grimes, Shelley; Ke, Ailong

    2011-05-03

    Prohead RNA (pRNA) is an essential component in the assembly and operation of the powerful bacteriophage 29 DNA packaging motor. The pRNA forms a multimeric ring via intermolecular base-pairing interactions between protomers that serves to guide the assembly of the ring ATPase that drives DNA packaging. Here we report the quaternary structure of this rare multimeric RNA at 3.5 Å resolution, crystallized as tetrameric rings. Strong quaternary interactions and the inherent flexibility helped rationalize how free pRNA is able to adopt multiple oligomerization states in solution. These characteristics also allowed excellent fitting of the crystallographic pRNA protomers into previous prohead/pRNA cryo-EM reconstructions, supporting the presence of a pentameric, but not hexameric, pRNA ring in the context of the DNA packaging motor. The pentameric pRNA ring anchors itself directly to the phage prohead by interacting specifically with the fivefold symmetric capsid structures that surround the head-tail connector portal. From these contacts, five RNA superhelices project from the pRNA ring, where they serve as scaffolds for binding and assembly of the ring ATPase, and possibly mediate communication between motor components. Construction of structure-based designer pRNAs with little sequence similarity to the wild-type pRNA were shown to fully support the packaging of 29 DNA.

  3. Methyltransferase That Modifies Guanine 966 of the 16 S rRNA: FUNCTIONAL IDENTIFICATION AND TERTIARY STRUCTURE*

    Science.gov (United States)

    Lesnyak, Dmitry V.; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V.; Bogdanov, Alexey A.; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A.

    2010-01-01

    N2-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding for m2G966 specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from the plasmid carrying the yhhF gene. Moreover, purified m2G966 methyltransferase, in the presence of S-adenosylomethionine (AdoMet), is able to methylate 30 S ribosomal subunits that were purified from yhhF knock-out strain in vitro. The methylation is specific for G966 base of the 16 S rRNA. The m2G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05 Å. The structure closely resembles RsmC rRNA methyltransferase, specific for m2G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m2G966 methyltransferase. Based on the experimental data and current nomenclature the protein expressed from the yhhF gene was renamed to RsmD. A model for interaction of RsmD with ribosome has been proposed. PMID:17189261

  4. Methyltransferase that modifies guanine 966 of the 16 S rRNA: functional identification and tertiary structure.

    Science.gov (United States)

    Lesnyak, Dmitry V; Osipiuk, Jerzy; Skarina, Tatiana; Sergiev, Petr V; Bogdanov, Alexey A; Edwards, Aled; Savchenko, Alexei; Joachimiak, Andrzej; Dontsova, Olga A

    2007-02-23

    N(2)-Methylguanine 966 is located in the loop of Escherichia coli 16 S rRNA helix 31, forming a part of the P-site tRNA-binding pocket. We found yhhF to be a gene encoding for m(2)G966 specific 16 S rRNA methyltransferase. Disruption of the yhhF gene by kanamycin resistance marker leads to a loss of modification at G966. The modification could be rescued by expression of recombinant protein from the plasmid carrying the yhhF gene. Moreover, purified m(2)G966 methyltransferase, in the presence of S-adenosylomethionine (AdoMet), is able to methylate 30 S ribosomal subunits that were purified from yhhF knock-out strain in vitro. The methylation is specific for G966 base of the 16 S rRNA. The m(2)G966 methyltransferase was crystallized, and its structure has been determined and refined to 2.05A(.) The structure closely resembles RsmC rRNA methyltransferase, specific for m(2)G1207 of the 16 S rRNA. Structural comparisons and analysis of the enzyme active site suggest modes for binding AdoMet and rRNA to m(2)G966 methyltransferase. Based on the experimental data and current nomenclature the protein expressed from the yhhF gene was renamed to RsmD. A model for interaction of RsmD with ribosome has been proposed.

  5. RNAPattMatch: a web server for RNA sequence/structure motif detection based on pattern matching with flexible gaps

    Science.gov (United States)

    Drory Retwitzer, Matan; Polishchuk, Maya; Churkin, Elena; Kifer, Ilona; Yakhini, Zohar; Barash, Danny

    2015-01-01

    Searching for RNA sequence-structure patterns is becoming an essential tool for RNA practitioners. Novel discoveries of regulatory non-coding RNAs in targeted organisms and the motivation to find them across a wide range of organisms have prompted the use of computational RNA pattern matching as an enhancement to sequence similarity. State-of-the-art programs differ by the flexibility of patterns allowed as queries and by their simplicity of use. In particular—no existing method is available as a user-friendly web server. A general program that searches for RNA sequence-structure patterns is RNA Structator. However, it is not available as a web server and does not provide the option to allow flexible gap pattern representation with an upper bound of the gap length being specified at any position in the sequence. Here, we introduce RNAPattMatch, a web-based application that is user friendly and makes sequence/structure RNA queries accessible to practitioners of various background and proficiency. It also extends RNA Structator and allows a more flexible variable gaps representation, in addition to analysis of results using energy minimization methods. RNAPattMatch service is available at http://www.cs.bgu.ac.il/rnapattmatch. A standalone version of the search tool is also available to download at the site. PMID:25940619

  6. Genome-wide analysis of miRNA and mRNA transcriptomes during amelogenesis.

    Science.gov (United States)

    Yin, Kaifeng; Hacia, Joseph G; Zhong, Zhe; Paine, Michael L

    2014-11-19

    In the rodent incisor during amelogenesis, as ameloblast cells transition from secretory stage to maturation stage, their morphology and transcriptome profiles change dramatically. Prior whole genome transcriptome analysis has given a broad picture of the molecular activities dominating both stages of amelogenesis, but this type of analysis has not included miRNA transcript profiling. In this study, we set out to document which miRNAs and corresponding target genes change significantly as ameloblasts transition from secretory- to maturation-stage amelogenesis. Total RNA samples from both secretory- and maturation-stage rat enamel organs were subjected to genome-wide miRNA and mRNA transcript profiling. We identified 59 miRNAs that were differentially expressed at the maturation stage relative to the secretory stage of enamel development (False Discovery Rate (FDR)<0.05, fold change (FC)≥1.8). In parallel, transcriptome profiling experiments identified 1,729 mRNA transcripts that were differentially expressed in the maturation stage compared to the secretory stage (FDR<0.05, FC≥1.8). Based on bioinformatics analyses, 5.8% (629 total) of these differentially expressed genes (DEGS) were highlighted as being the potential targets of 59 miRNAs that were differentially expressed in the opposite direction, in the same tissue samples. Although the number of predicted target DEGs was not higher than baseline expectations generated by examination of stably expressed miRNAs, Gene Ontology (GO) analysis showed that these 629 DEGS were enriched for ion transport, pH regulation, calcium handling, endocytotic, and apoptotic activities. Seven differentially expressed miRNAs (miR-21, miR-31, miR-488, miR-153, miR-135b, miR-135a and miR298) in secretory- and/or maturation-stage enamel organs were confirmed by in situ hybridization. Further, we used luciferase reporter assays to provide evidence that two of these differentially expressed miRNAs, miR-153 and miR-31, are potential

  7. RNA secondary structure prediction with pseudoknots: Contribution of algorithm versus energy model.

    Science.gov (United States)

    Jabbari, Hosna; Wark, Ian; Montemagno, Carlo

    2018-01-01

    RNA is a biopolymer with various applications inside the cell and in biotechnology. Structure of an RNA molecule mainly determines its function and is essential to guide nanostructure design. Since experimental structure determination is time-consuming and expensive, accurate computational prediction of RNA structure is of great importance. Prediction of RNA secondary structure is relatively simpler than its tertiary structure and provides information about its tertiary structure, therefore, RNA secondary structure prediction has received attention in the past decades. Numerous methods with different folding approaches have been developed for RNA secondary structure prediction. While methods for prediction of RNA pseudoknot-free structure (structures with no crossing base pairs) have greatly improved in terms of their accuracy, methods for prediction of RNA pseudoknotted secondary structure (structures with crossing base pairs) still have room for improvement. A long-standing question for improving the prediction accuracy of RNA pseudoknotted secondary structure is whether to focus on the prediction algorithm or the underlying energy model, as there is a trade-off on computational cost of the prediction algorithm versus the generality of the method. The aim of this work is to argue when comparing different methods for RNA pseudoknotted structure prediction, the combination of algorithm and energy model should be considered and a method should not be considered superior or inferior to others if they do not use the same scoring model. We demonstrate that while the folding approach is important in structure prediction, it is not the only important factor in prediction accuracy of a given method as the underlying energy model is also as of great value. Therefore we encourage researchers to pay particular attention in comparing methods with different energy models.

  8. The identification and functional annotation of RNA structures conserved in vertebrates

    DEFF Research Database (Denmark)

    Seemann, Ernst Stefan; Mirza, Aashiq Hussain; Hansen, Claus

    2017-01-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for Conserved RNA Structures (CRSs), leveraging structure...... (RBPs) or (ii) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared...... expression and shared structure despite low abundance and low sequence identity. About 30k CRS regions are located near coding or long non-coding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their non-structured...

  9. Genome-Wide Analysis of the RNA Helicase Gene Family in Gossypium raimondii

    Directory of Open Access Journals (Sweden)

    Jie Chen

    2014-03-01

    Full Text Available The RNA helicases, which help to unwind stable RNA duplexes, and have important roles in RNA metabolism, belong to a class of motor proteins that play important roles in plant development and responses to stress. Although this family of genes has been the subject of systematic investigation in Arabidopsis, rice, and tomato, it has not yet been characterized in cotton. In this study, we identified 161 putative RNA helicase genes in the genome of the diploid cotton species Gossypium raimondii. We classified these genes into three subfamilies, based on the presence of either a DEAD-box (51 genes, DEAH-box (52 genes, or DExD/H-box (58 genes in their coding regions. Chromosome location analysis showed that the genes that encode RNA helicases are distributed across all 13 chromosomes of G. raimondii. Syntenic analysis revealed that 62 of the 161 G. raimondii helicase genes (38.5% are within the identified syntenic blocks. Sixty-six (40.99% helicase genes from G. raimondii have one or several putative orthologs in tomato. Additionally, GrDEADs have more conserved gene structures and more simple domains than GrDEAHs and GrDExD/Hs. Transcriptome sequencing data demonstrated that many of these helicases, especially GrDEADs, are highly expressed at the fiber initiation stage and in mature leaves. To our knowledge, this is the first report of a genome-wide analysis of the RNA helicase gene family in cotton.

  10. An efficient genetic algorithm for structural RNA pairwise alignment and its application to non-coding RNA discovery in yeast

    Directory of Open Access Journals (Sweden)

    Taneda Akito

    2008-12-01

    Full Text Available Abstract Background Aligning RNA sequences with low sequence identity has been a challenging problem since such a computation essentially needs an algorithm with high complexities for taking structural conservation into account. Although many sophisticated algorithms for the purpose have been proposed to date, further improvement in efficiency is necessary to accelerate its large-scale applications including non-coding RNA (ncRNA discovery. Results We developed a new genetic algorithm, Cofolga2, for simultaneously computing pairwise RNA sequence alignment and consensus folding, and benchmarked it using BRAliBase 2.1. The benchmark results showed that our new algorithm is accurate and efficient in both time and memory usage. Then, combining with the originally trained SVM, we applied the new algorithm to novel ncRNA discovery where we compared S. cerevisiae genome with six related genomes in a pairwise manner. By focusing our search to the relatively short regions (50 bp to 2,000 bp sandwiched by conserved sequences, we successfully predict 714 intergenic and 1,311 sense or antisense ncRNA candidates, which were found in the pairwise alignments with stable consensus secondary structure and low sequence identity (≤ 50%. By comparing with the previous predictions, we found that > 92% of the candidates is novel candidates. The estimated rate of false positives in the predicted candidates is 51%. Twenty-five percent of the intergenic candidates has supports for expression in cell, i.e. their genomic positions overlap those of the experimentally determined transcripts in literature. By manual inspection of the results, moreover, we obtained four multiple alignments with low sequence identity which reveal consensus structures shared by three species/sequences. Conclusion The present method gives an efficient tool complementary to sequence-alignment-based ncRNA finders.

  11. Evolution of primary and secondary structures in 5S and 5.8S rRNA

    International Nuclear Information System (INIS)

    Curtiss, W.C.

    1986-01-01

    The secondary structure of Bombyx mori 5S rRNA was studied using the sing-strand specific S1 nuclease and the base pair specific cobra venom ribonuclease. The RNA was end-labeled with [ 32 P] at either the 5' or 3' end and sequenced using enzymatic digestion techniques. These enzymatic data coupled with thermodynamic structure prediction were used to generate a secondary structure for 5S rRNA. A computer algorithm has been implemented to aid in the comparison of a large set of homologous RNAs. Eukaryotic 5S rRNA sequences from thirty four diverse species were compared by (1) alignment or the sequences, (2) the positions of substitutions were located with respect to the aligned sequence and secondary structure, and (3) the R-Y model of base stacking was used to study stacking pattern relationships in the structure. Eukaryotic 5S rRNA was found to have significant sequence variation throughout much of the molecule while maintaining a relatively constant secondary structure. A detailed analysis of the sequence and structure variability in each region of the molecule is presented

  12. Correlation between sequence conservation and structural thermodynamics of microRNA precursors from human, mouse, and chicken genomes

    Directory of Open Access Journals (Sweden)

    Wang Shengqi

    2010-10-01

    Full Text Available Abstract Background Previous studies have shown that microRNA precursors (pre-miRNAs have considerably more stable secondary structures than other native RNAs (tRNA, rRNA, and mRNA and artificial RNA sequences. However, pre-miRNAs with ultra stable secondary structures have not been investigated. It is not known if there is a tendency in pre-miRNA sequences towards or against ultra stable structures? Furthermore, the relationship between the structural thermodynamic stability of pre-miRNA and their evolution remains unclear. Results We investigated the correlation between pre-miRNA sequence conservation and structural stability as measured by adjusted minimum folding free energies in pre-miRNAs isolated from human, mouse, and chicken. The analysis revealed that conserved and non-conserved pre-miRNA sequences had structures with similar average stabilities. However, the relatively ultra stable and unstable pre-miRNAs were more likely to be non-conserved than pre-miRNAs with moderate stability. Non-conserved pre-miRNAs had more G+C than A+U nucleotides, while conserved pre-miRNAs contained more A+U nucleotides. Notably, the U content of conserved pre-miRNAs was especially higher than that of non-conserved pre-miRNAs. Further investigations showed that conserved and non-conserved pre-miRNAs exhibited different structural element features, even though they had comparable levels of stability. Conclusions We proposed that there is a correlation between structural thermodynamic stability and sequence conservation for pre-miRNAs from human, mouse, and chicken genomes. Our analyses suggested that pre-miRNAs with relatively ultra stable or unstable structures were less favoured by natural selection than those with moderately stable structures. Comparison of nucleotide compositions between non-conserved and conserved pre-miRNAs indicated the importance of U nucleotides in the pre-miRNA evolutionary process. Several characteristic structural elements were

  13. General enumeration of RNA secondary structures based on new ...

    African Journals Online (AJOL)

    akpobome

    coding, transferring and retrieving genetic information, and in directing cell metabolism. The nucleic acid includes DNA and RNA molecule. RNA molecule is a single-stranded nucleic acid of four different kinds of nucleotides. The four nucleotides only differ by one part, called bases. Hence, one usually identifies nucleotides.

  14. GARN: Sampling RNA 3D Structure Space with Game Theory and Knowledge-Based Scoring Strategies.

    Science.gov (United States)

    Boudard, Mélanie; Bernauer, Julie; Barth, Dominique; Cohen, Johanne; Denise, Alain

    2015-01-01

    Cellular processes involve large numbers of RNA molecules. The functions of these RNA molecules and their binding to molecular machines are highly dependent on their 3D structures. One of the key challenges in RNA structure prediction and modeling is predicting the spatial arrangement of the various structural elements of RNA. As RNA folding is generally hierarchical, methods involving coarse-grained models hold great promise for this purpose. We present here a novel coarse-grained method for sampling, based on game theory and knowledge-based potentials. This strategy, GARN (Game Algorithm for RNa sampling), is often much faster than previously described techniques and generates large sets of solutions closely resembling the native structure. GARN is thus a suitable starting point for the molecular modeling of large RNAs, particularly those with experimental constraints. GARN is available from: http://garn.lri.fr/.

  15. TurboFold: Iterative probabilistic estimation of secondary structures for multiple RNA sequences

    Directory of Open Access Journals (Sweden)

    Sharma Gaurav

    2011-04-01

    significance threshold are shown to be more accurate for TurboFold than for alternative methods that estimate base pairing probabilities. TurboFold-MEA, which uses base pairing probabilities from TurboFold in a maximum expected accuracy algorithm for secondary structure prediction, has accuracy comparable to the best performing secondary structure prediction methods. The computational and memory requirements for TurboFold are modest and, in terms of sequence length and number of sequences, scale much more favorably than joint alignment and folding algorithms. Conclusions TurboFold is an iterative probabilistic method for predicting secondary structures for multiple RNA sequences that efficiently and accurately combines the information from the comparative analysis between sequences with the thermodynamic folding model. Unlike most other multi-sequence structure prediction methods, TurboFold does not enforce strict commonality of structures and is therefore useful for predicting structures for homologous sequences that have diverged significantly. TurboFold can be downloaded as part of the RNAstructure package at http://rna.urmc.rochester.edu.

  16. Investigation of RNA Structure by High-Throughput SHAPE-Based Probing Methods

    DEFF Research Database (Denmark)

    Poulsen, Line Dahl

    of highthroughput SHAPE-based approaches to investigate RNA structure based on novel SHAPE reagents that permit selection of full-length cDNAs. The SHAPE Selection (SHAPES) method is applied to the foot-and-mouth disease virus (FMDV) plus strand RNA genome, and the data is used to construct a genome-wide structural...... that they are functional. The SHAPES method is further applied to the hepatitis C virus (HCV), where the data is used to refine known and predicted structures. Over the past years, the interest of studying RNA structure in their native environment has been increased, and to allow studying RNA structure inside living cells...... using the SHAPE Selection approach, I introduce a biotinylated probing reagent. This chemical can cross cell membranes and reacts with RNA inside the cells, allowing the structural conformations to be studied in the context of physiological relevant conditions in living cells. The methods and results...

  17. Structural Studies of RNA Helicases Involved in Eukaryotic Pre-mRNA Splicing, Ribosome Biogenesis, and Translation Initiation

    DEFF Research Database (Denmark)

    He, Yangzi

    and ligates the neighbouring exons to generate mature mRNAs. Prp43 is an RNA helicase of the DEAH/RHA family. In yeast, once mRNAs are released, Prp43 catalyzes the disassembly of spliceosomes. The 18S, 5.8S and 25S rRNAs are transcribed as a single polycistronic transcript—the 35S pre......-rRNA. It is nucleolytically cleaved and chemically modified to generate mature rRNAs, which assemble with ribosomal proteins to form the ribosome. Prp43 is required for the processing of the 18S rRNA. Using X-ray crystallography, I determined a high resolution structure of Prp43 bound to ADP, the first structure of a DEAH....../RHA helicase. It defined the conserved structural features of all DEAH/RHA helicases, and unveiled a novel nucleotide binding site. Additionally a preliminary low resolution structure of a ternary complex comprising Prp43, a non-hydrolyzable ATP analogue, and a single-stranded RNA, was obtained. The ribosome...

  18. R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

    Science.gov (United States)

    2011-01-01

    Background With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file. PMID:21205310

  19. R2R--software to speed the depiction of aesthetic consensus RNA secondary structures.

    Science.gov (United States)

    Weinberg, Zasha; Breaker, Ronald R

    2011-01-04

    With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.

  20. R2R - software to speed the depiction of aesthetic consensus RNA secondary structures

    Directory of Open Access Journals (Sweden)

    Weinberg Zasha

    2011-01-01

    Full Text Available Abstract Background With continuing identification of novel structured noncoding RNAs, there is an increasing need to create schematic diagrams showing the consensus features of these molecules. RNA structural diagrams are typically made either with general-purpose drawing programs like Adobe Illustrator, or with automated or interactive programs specific to RNA. Unfortunately, the use of applications like Illustrator is extremely time consuming, while existing RNA-specific programs produce figures that are useful, but usually not of the same aesthetic quality as those produced at great cost in Illustrator. Additionally, most existing RNA-specific applications are designed for drawing single RNA molecules, not consensus diagrams. Results We created R2R, a computer program that facilitates the generation of aesthetic and readable drawings of RNA consensus diagrams in a fraction of the time required with general-purpose drawing programs. Since the inference of a consensus RNA structure typically requires a multiple-sequence alignment, the R2R user annotates the alignment with commands directing the layout and annotation of the RNA. R2R creates SVG or PDF output that can be imported into Adobe Illustrator, Inkscape or CorelDRAW. R2R can be used to create consensus sequence and secondary structure models for novel RNA structures or to revise models when new representatives for known RNA classes become available. Although R2R does not currently have a graphical user interface, it has proven useful in our efforts to create 100 schematic models of distinct noncoding RNA classes. Conclusions R2R makes it possible to obtain high-quality drawings of the consensus sequence and structural models of many diverse RNA structures with a more practical amount of effort. R2R software is available at http://breaker.research.yale.edu/R2R and as an Additional file.

  1. The chemical structure of DNA sequence signals for RNA transcription

    Science.gov (United States)

    George, D. G.; Dayhoff, M. O.

    1982-01-01

    The proposed recognition sites for RNA transcription for E. coli NRA polymerase, bacteriophage T7 RNA polymerase, and eukaryotic RNA polymerase Pol II are evaluated in the light of the requirements for efficient recognition. It is shown that although there is good experimental evidence that specific nucleic acid sequence patterns are involved in transcriptional regulation in bacteria and bacterial viruses, among the sequences now available, only in the case of the promoters recognized by bacteriophage T7 polymerase does it seem likely that the pattern is sufficient. It is concluded that the eukaryotic pattern that is investigated is not restrictive enough to serve as a recognition site.

  2. Crystal-Structure-Guided Design of Self-Assembling RNA Nanotriangles.

    Science.gov (United States)

    Boerneke, Mark A; Dibrov, Sergey M; Hermann, Thomas

    2016-03-14

    RNA nanotechnology uses RNA structural motifs to build nanosized architectures that assemble through selective base-pair interactions. Herein, we report the crystal-structure-guided design of highly stable RNA nanotriangles that self-assemble cooperatively from short oligonucleotides. The crystal structure of an 81 nucleotide nanotriangle determined at 2.6 Å resolution reveals the so-far smallest circularly closed nanoobject made entirely of double-stranded RNA. The assembly of the nanotriangle architecture involved RNA corner motifs that were derived from ligand-responsive RNA switches, which offer the opportunity to control self-assembly and dissociation. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. RNA CoMPASS: a dual approach for pathogen and host transcriptome analysis of RNA-seq datasets.

    Directory of Open Access Journals (Sweden)

    Guorong Xu

    Full Text Available High-throughput RNA sequencing (RNA-seq has become an instrumental assay for the analysis of multiple aspects of an organism's transcriptome. Further, the analysis of a biological specimen's associated microbiome can also be performed using RNA-seq data and this application is gaining interest in the scientific community. There are many existing bioinformatics tools designed for analysis and visualization of transcriptome data. Despite the availability of an array of next generation sequencing (NGS analysis tools, the analysis of RNA-seq data sets poses a challenge for many biomedical researchers who are not familiar with command-line tools. Here we present RNA CoMPASS, a comprehensive RNA-seq analysis pipeline for the simultaneous analysis of transcriptomes and metatranscriptomes from diverse biological specimens. RNA CoMPASS leverages existing tools and parallel computing technology to facilitate the analysis of even very large datasets. RNA CoMPASS has a web-based graphical user interface with intrinsic queuing to control a distributed computational pipeline. RNA CoMPASS was evaluated by analyzing RNA-seq data sets from 45 B-cell samples. Twenty-two of these samples were derived from lymphoblastoid cell lines (LCLs generated by the infection of naïve B-cells with the Epstein Barr virus (EBV, while another 23 samples were derived from Burkitt's lymphomas (BL, some of which arose in part through infection with EBV. Appropriately, RNA CoMPASS identified EBV in all LCLs and in a fraction of the BLs. Cluster analysis of the human transcriptome component of the RNA CoMPASS output clearly separated the BLs (which have a germinal center-like phenotype from the LCLs (which have a blast-like phenotype with evidence of activated MYC signaling and lower interferon and NF-kB signaling in the BLs. Together, this analysis illustrates the utility of RNA CoMPASS in the simultaneous analysis of transcriptome and metatranscriptome data. RNA CoMPASS is freely

  4. Probabilistic Structural Analysis Program

    Science.gov (United States)

    Pai, Shantaram S.; Chamis, Christos C.; Murthy, Pappu L. N.; Stefko, George L.; Riha, David S.; Thacker, Ben H.; Nagpal, Vinod K.; Mital, Subodh K.

    2010-01-01

    NASA/NESSUS 6.2c is a general-purpose, probabilistic analysis program that computes probability of failure and probabilistic sensitivity measures of engineered systems. Because NASA/NESSUS uses highly computationally efficient and accurate analysis techniques, probabilistic solutions can be obtained even for extremely large and complex models. Once the probabilistic response is quantified, the results can be used to support risk-informed decisions regarding reliability for safety-critical and one-of-a-kind systems, as well as for maintaining a level of quality while reducing manufacturing costs for larger-quantity products. NASA/NESSUS has been successfully applied to a diverse range of problems in aerospace, gas turbine engines, biomechanics, pipelines, defense, weaponry, and infrastructure. This program combines state-of-the-art probabilistic algorithms with general-purpose structural analysis and lifting methods to compute the probabilistic response and reliability of engineered structures. Uncertainties in load, material properties, geometry, boundary conditions, and initial conditions can be simulated. The structural analysis methods include non-linear finite-element methods, heat-transfer analysis, polymer/ceramic matrix composite analysis, monolithic (conventional metallic) materials life-prediction methodologies, boundary element methods, and user-written subroutines. Several probabilistic algorithms are available such as the advanced mean value method and the adaptive importance sampling method. NASA/NESSUS 6.2c is structured in a modular format with 15 elements.

  5. Probing RNA native conformational ensembles with structural constraints

    DEFF Research Database (Denmark)

    Fonseca, Rasmus; van den Bedem, Henry; Bernauer, Julie

    2016-01-01

    substates, which are difficult to characterize experimentally and computationally. Here, we present an innovative, entirely kinematic computational procedure to efficiently explore the native ensemble of RNA molecules. Our procedure projects degrees of freedom onto a subspace of conformation space defined...

  6. Structure and Dynamics of the tRNA-like Structure Domain of Brome Mosaic Virus

    Science.gov (United States)

    Vieweger, Mario; Nesbitt, David

    2014-03-01

    Conformational switching is widely accepted as regulatory mechanism in gene expression in bacterial systems. More recently, similar regulation mechanisms are emerging for viral systems. One of the most abundant and best studied systems is the tRNA-like structure domain that is found in a number of plant viruses across eight genera. In this work, the folding dynamics of the tRNA-like structure domain of Brome Mosaic Virus are investigated using single-molecule Fluorescence Resonance Energy Transfer techniques. In particular, Burst fluorescence is applied to observe metal-ion induced folding in freely diffusing RNA constructs resembling the 3'-terminal 169nt of BMV RNA3. Histograms of EFRET probabilities reveal a complex equilibrium of three distinct populations. A step-wise kinetic model for TLS folding is developed in accord with the evolution of conformational populations and structural information in the literature. In this mechanism, formation of functional TLS domains from unfolded RNAs requires two consecutive steps; 1) hybridization of a long-range stem interaction followed by 2) formation of a 3' pseudoknot. This three-state equilibrium is well described by step-wise dissociation constants K1(328(30) μM) and K2(1092(183) μM) for [Mg2+] and K1(74(6) mM) and K2(243(52) mM) for [Na+]-induced folding. The kinetic model is validated by oligo competition with the STEM interaction. Implications of this conformational folding mechanism are discussed in regards to regulation of virus replication.

  7. Rclick: a web server for comparison of RNA 3D structures.

    Science.gov (United States)

    Nguyen, Minh N; Verma, Chandra

    2015-03-15

    RNA molecules play important roles in key biological processes in the cell and are becoming attractive for developing therapeutic applications. Since the function of RNA depends on its structure and dynamics, comparing and classifying the RNA 3D structures is of crucial importance to molecular biology. In this study, we have developed Rclick, a web server that is capable of superimposing RNA 3D structures by using clique matching and 3D least-squares fitting. Our server Rclick has been benchmarked and compared with other popular servers and methods for RNA structural alignments. In most cases, Rclick alignments were better in terms of structure overlap. Our server also recognizes conformational changes between structures. For this purpose, the server produces complementary alignments to maximize the extent of detectable similarity. Various examples showcase the utility of our web server for comparison of RNA, RNA-protein complexes and RNA-ligand structures. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  8. Sequence analysis of RNase MRP RNA reveals its origination from eukaryotic RNase P RNA

    Science.gov (United States)

    Zhu, Yanglong; Stribinskis, Vilius; Ramos, Kenneth S.; Li, Yong

    2006-01-01

    RNase MRP is a eukaryote-specific endoribonuclease that generates RNA primers for mitochondrial DNA replication and processes precursor rRNA. RNase P is a ubiquitous endoribonuclease that cleaves precursor tRNA transcripts to produce their mature 5′ termini. We found extensive sequence homology of catalytic domains and specificity domains between their RNA subunits in many organisms. In Candida glabrata, the internal loop of helix P3 is 100% conserved between MRP and P RNAs. The helix P8 of MRP RNA from microsporidia Encephalitozoon cuniculi is identical to that of P RNA. Sequence homology can be widely spread over the whole molecule of MRP RNA and P RNA, such as those from Dictyostelium discoideum. These conserved nucleotides between the MRP and P RNAs strongly support the hypothesis that the MRP RNA is derived from the P RNA molecule in early eukaryote evolution. PMID:16540690

  9. Structural and functional characterisation of Aichi virus RNA dependent RNA polymerase

    Czech Academy of Sciences Publication Activity Database

    Dubánková, Anna; Humpolíčková, Jana; Šilhán, Jan; Bäumlová, Adriana; Chalupská, Dominika; Klíma, Martin; Bouřa, Evžen

    2017-01-01

    Roč. 15, č. 1 (2017), s. 7-8 ISSN 2336-7202. [Mezioborové setkání mladých biologů, biochemiků a chemiků /17./. 30.05.2017-01.06.2017, Milovy] Institutional support: RVO:61388963 Keywords : Aichi virus * RNA replication Subject RIV: CE - Biochemistry

  10. Structure-Function Model for Kissing Loop Interactions That Initiate Dimerization of Ty1 RNA

    Directory of Open Access Journals (Sweden)

    Eric R. Gamache

    2017-04-01

    Full Text Available The genomic RNA of the retrotransposon Ty1 is packaged as a dimer into virus-like particles. The 5′ terminus of Ty1 RNA harbors cis-acting sequences required for translation initiation, packaging and initiation of reverse transcription (TIPIRT. To identify RNA motifs involved in dimerization and packaging, a structural model of the TIPIRT domain in vitro was developed from single-nucleotide resolution RNA structural data. In general agreement with previous models, the first 326 nucleotides of Ty1 RNA form a pseudoknot with a 7-bp stem (S1, a 1-nucleotide interhelical loop and an 8-bp stem (S2 that delineate two long, structured loops. Nucleotide substitutions that disrupt either pseudoknot stem greatly reduced helper-Ty1-mediated retrotransposition of a mini-Ty1, but only mutations in S2 destabilized mini-Ty1 RNA in cis and helper-Ty1 RNA in trans. Nested in different loops of the pseudoknot are two hairpins with complementary 7-nucleotide motifs at their apices. Nucleotide substitutions in either motif also reduced retrotransposition and destabilized mini- and helper-Ty1 RNA. Compensatory mutations that restore base-pairing in the S2 stem or between the hairpins rescued retrotransposition and RNA stability in cis and trans. These data inform a model whereby a Ty1 RNA kissing complex with two intermolecular kissing-loop interactions initiates dimerization and packaging.

  11. JNSViewer-A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures.

    Science.gov (United States)

    Shi, Jieming; Li, Xi; Dong, Min; Graham, Mitchell; Yadav, Nehul; Liang, Chun

    2017-01-01

    Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome) were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html.

  12. JNSViewer—A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures

    Science.gov (United States)

    Dong, Min; Graham, Mitchell; Yadav, Nehul

    2017-01-01

    Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome) were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html. PMID:28582416

  13. JNSViewer-A JavaScript-based Nucleotide Sequence Viewer for DNA/RNA secondary structures.

    Directory of Open Access Journals (Sweden)

    Jieming Shi

    Full Text Available Many tools are available for visualizing RNA or DNA secondary structures, but there is scarce implementation in JavaScript that provides seamless integration with the increasingly popular web computational platforms. We have developed JNSViewer, a highly interactive web service, which is bundled with several popular tools for DNA/RNA secondary structure prediction and can provide precise and interactive correspondence among nucleotides, dot-bracket data, secondary structure graphs, and genic annotations. In JNSViewer, users can perform RNA secondary structure predictions with different programs and settings, add customized genic annotations in GFF format to structure graphs, search for specific linear motifs, and extract relevant structure graphs of sub-sequences. JNSViewer also allows users to choose a transcript or specific segment of Arabidopsis thaliana genome sequences and predict the corresponding secondary structure. Popular genome browsers (i.e., JBrowse and BrowserGenome were integrated into JNSViewer to provide powerful visualizations of chromosomal locations, genic annotations, and secondary structures. In addition, we used StructureFold with default settings to predict some RNA structures for Arabidopsis by incorporating in vivo high-throughput RNA structure profiling data and stored the results in our web server, which might be a useful resource for RNA secondary structure studies in plants. JNSViewer is available at http://bioinfolab.miamioh.edu/jnsviewer/index.html.

  14. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler.

    Science.gov (United States)

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O'Connor, Mary; Shapiro, Bruce A

    2008-10-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes.

  15. Computational strategies for the automated design of RNA nanoscale structures from building blocks using NanoTiler☆

    Science.gov (United States)

    Bindewald, Eckart; Grunewald, Calvin; Boyle, Brett; O’Connor, Mary; Shapiro, Bruce A.

    2013-01-01

    One approach to designing RNA nanoscale structures is to use known RNA structural motifs such as junctions, kissing loops or bulges and to construct a molecular model by connecting these building blocks with helical struts. We previously developed an algorithm for detecting internal loops, junctions and kissing loops in RNA structures. Here we present algorithms for automating or assisting many of the steps that are involved in creating RNA structures from building blocks: (1) assembling building blocks into nanostructures using either a combinatorial search or constraint satisfaction; (2) optimizing RNA 3D ring structures to improve ring closure; (3) sequence optimisation; (4) creating a unique non-degenerate RNA topology descriptor. This effectively creates a computational pipeline for generating molecular models of RNA nanostructures and more specifically RNA ring structures with optimized sequences from RNA building blocks. We show several examples of how the algorithms can be utilized to generate RNA tecto-shapes. PMID:18838281

  16. RBiomirGS: an all-in-one miRNA gene set analysis solution featuring target mRNA mapping and expression profile integration

    Directory of Open Access Journals (Sweden)

    Jing Zhang

    2018-01-01

    Full Text Available Background With the continuous discovery of microRNA’s (miRNA association with a wide range of biological and cellular processes, expression profile-based functional characterization of such post-transcriptional regulation is crucial for revealing its significance behind particular phenotypes. Profound advancement in bioinformatics has been made to enable in depth investigation of miRNA’s role in regulating cellular and molecular events, resulting in a huge quantity of software packages covering different aspects of miRNA functional analysis. Therefore, an all-in-one software solution is in demand for a comprehensive yet highly efficient workflow. Here we present RBiomirGS, an R package for a miRNA gene set (GS analysis. Methods The package utilizes multiple databases for target mRNA mapping, estimates miRNA effect on the target mRNAs through miRNA expression profile and conducts a logistic regression-based GS enrichment. Additionally, human ortholog Entrez ID conversion functionality is included for target mRNAs. Results By incorporating all the core steps into one package, RBiomirGS eliminates the need for switching between different software packages. The modular structure of RBiomirGS enables various access points to the analysis, with which users can choose the most relevant functionalities for their workflow. Conclusions With RBiomirGS, users are able to assess the functional significance of the miRNA expression profile under the corresponding experimental condition by minimal input and intervention. Accordingly, RBiomirGS encompasses an all-in-one solution for miRNA GS analysis. RBiomirGS is available on GitHub (http://github.com/jzhangc/RBiomirGS. More information including instruction and examples can be found on website (http://kenstoreylab.com/?page_id=2865.

  17. The tRNA-like structure of Turnip yellow mosaic virus RNA is a 3'-translational enhancer

    International Nuclear Information System (INIS)

    Matsuda, Daiki; Dreher, Theo W.

    2004-01-01

    Many positive stand RNA viral genomes lack the poly(A) tail that is characteristic of cellular mRNAs and that promotes translation in cis. The 3' untranslated regions (UTRs) of such genomes are expected to provide similar translation-enhancing properties as a poly(A) tail, yet the great variety of 3' sequences suggests that this is accomplished in a range of ways. We have identified a translational enhancer present in the 3' UTR of Turnip yellow mosaic virus (TYMV) RNA using luciferase reporter RNAs with generic 5' sequences transfected into plant cells. The 3' terminal 109 nucleotides comprising the tRNA-like structure (TLS) and an upstream pseudoknot (UPSK) act in synergy with a 5'-cap to enhance translation, with a minor contribution in stabilizing the RNA. Maximum enhancement requires that the RNA be capable of aminoacylation, but either the native valine or engineered methionine is acceptable. Mutations that decrease the affinity for translation elongation factor eEF1A (but also diminish aminoacylation efficiency) strongly decrease translational enhancement, suggesting that eEF1A is mechanistically involved. The UPSK seems to act as an important, though nonspecific, spacer element ensuring proper presentation of a functional TLS. Our studies have uncovered a novel type of translational enhancer and a new role for a plant viral TLS

  18. MCTBI: a web server for predicting metal ion effects in RNA structures.

    Science.gov (United States)

    Sun, Li-Zhen; Zhang, Jing-Xiang; Chen, Shi-Jie

    2017-08-01

    Metal ions play critical roles in RNA structure and function. However, web servers and software packages for predicting ion effects in RNA structures are notably scarce. Furthermore, the existing web servers and software packages mainly neglect ion correlation and fluctuation effects, which are potentially important for RNAs. We here report a new web server, the MCTBI server (http://rna.physics.missouri.edu/MCTBI), for the prediction of ion effects for RNA structures. This server is based on the recently developed MCTBI, a model that can account for ion correlation and fluctuation effects for nucleic acid structures and can provide improved predictions for the effects of metal ions, especially for multivalent ions such as Mg 2+ effects, as shown by extensive theory-experiment test results. The MCTBI web server predicts metal ion binding fractions, the most probable bound ion distribution, the electrostatic free energy of the system, and the free energy components. The results provide mechanistic insights into the role of metal ions in RNA structure formation and folding stability, which is important for understanding RNA functions and the rational design of RNA structures. © 2017 Sun et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  19. Structure of Hepatitis C Virus Polymerase in Complex with Primer-Template RNA

    Energy Technology Data Exchange (ETDEWEB)

    Mosley, Ralph T.; Edwards, Thomas E.; Murakami, Eisuke; Lam, Angela M.; Grice, Rena L.; Du, Jinfa; Sofia, Michael J.; Furman, Philip A.; Otto, Michael J. (Pharmasset); (Emerald)

    2012-08-01

    The replication of the hepatitis C viral (HCV) genome is accomplished by the NS5B RNA-dependent RNA polymerase (RdRp), for which mechanistic understanding and structure-guided drug design efforts have been hampered by its propensity to crystallize in a closed, polymerization-incompetent state. The removal of an autoinhibitory {beta}-hairpin loop from genotype 2a HCV NS5B increases de novo RNA synthesis by >100-fold, promotes RNA binding, and facilitated the determination of the first crystallographic structures of HCV polymerase in complex with RNA primer-template pairs. These crystal structures demonstrate the structural realignment required for primer-template recognition and elongation, provide new insights into HCV RNA synthesis at the molecular level, and may prove useful in the structure-based design of novel antiviral compounds. Additionally, our approach for obtaining the RNA primer-template-bound structure of HCV polymerase may be generally applicable to solving RNA-bound complexes for other viral RdRps that contain similar regulatory {beta}-hairpin loops, including bovine viral diarrhea virus, dengue virus, and West Nile virus.

  20. A comparative method for finding and folding RNA secondary structures within protein-coding regions

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Meyer, Irmtraud Margret; Forsberg, Roald

    2004-01-01

    that RNA-DECODER's parameters can be automatically trained to successfully fold known secondary structures within the HCV genome. We scan the genomes of HCV and polio virus for conserved secondary-structure elements, and analyze performance as a function of available evolutionary information. On known...... secondary structures, RNA-DECODER shows a sensitivity similar to the programs MFOLD, PFOLD and RNAALIFOLD. When scanning the entire genomes of HCV and polio virus for structure elements, RNA-DECODER's results indicate a markedly higher specificity than MFOLD, PFOLD and RNAALIFOLD....

  1. De Novo Discovery of Structured ncRNA Motifs in Genomic Sequences

    DEFF Research Database (Denmark)

    Ruzzo, Walter L; Gorodkin, Jan

    2014-01-01

    De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphas...... on an approach based on the CMfinder CMfinder program as a case study. Applications to genomic screens for novel de novo structured ncRNA ncRNA s, including structured RNA elements in untranslated portions of protein-coding genes, are presented.......De novo discovery of "motifs" capturing the commonalities among related noncoding ncRNA structured RNAs is among the most difficult problems in computational biology. This chapter outlines the challenges presented by this problem, together with some approaches towards solving them, with an emphasis...

  2. Covariant Evolutionary Event Analysis for Base Interaction Prediction Using a Relational Database Management System for RNA.

    Science.gov (United States)

    Xu, Weijia; Ozer, Stuart; Gutell, Robin R

    2009-01-01

    With an increasingly large amount of sequences properly aligned, comparative sequence analysis can accurately identify not only common structures formed by standard base pairing but also new types of structural elements and constraints. However, traditional methods are too computationally expensive to perform well on large scale alignment and less effective with the sequences from diversified phylogenetic classifications. We propose a new approach that utilizes coevolutional rates among pairs of nucleotide positions using phylogenetic and evolutionary relationships of the organisms of aligned sequences. With a novel data schema to manage relevant information within a relational database, our method, implemented with a Microsoft SQL Server 2005, showed 90% sensitivity in identifying base pair interactions among 16S ribosomal RNA sequences from Bacteria, at a scale 40 times bigger and 50% better sensitivity than a previous study. The results also indicated covariation signals for a few sets of cross-strand base stacking pairs in secondary structure helices, and other subtle constraints in the RNA structure.

  3. Structure of Escherichia coli Arginyl-tRNA Synthetase in Complex with tRNAArg: Pivotal Role of the D-loop.

    Science.gov (United States)

    Stephen, Preyesh; Ye, Sheng; Zhou, Ming; Song, Jian; Zhang, Rongguang; Wang, En-Duo; Giegé, Richard; Lin, Sheng-Xiang

    2018-05-25

    Aminoacyl-tRNA synthetases are essential components in protein biosynthesis. Arginyl-tRNA synthetase (ArgRS) belongs to the small group of aminoacyl-tRNA synthetases requiring cognate tRNA for amino acid activation. The crystal structure of Escherichia coli (Eco) ArgRS has been solved in complex with tRNA Arg at 3.0-Å resolution. With this first bacterial tRNA complex, we are attempting to bridge the gap existing in structure-function understanding in prokaryotic tRNA Arg recognition. The structure shows a tight binding of tRNA on the synthetase through the identity determinant A20 from the D-loop, a tRNA recognition snapshot never elucidated structurally. This interaction of A20 involves 5 amino acids from the synthetase. Additional contacts via U20a and U16 from the D-loop reinforce the interaction. The importance of D-loop recognition in EcoArgRS functioning is supported by a mutagenesis analysis of critical amino acids that anchor tRNA Arg on the synthetase; in particular, mutations at amino acids interacting with A20 affect binding affinity to the tRNA and specificity of arginylation. Altogether the structural and functional data indicate that the unprecedented ArgRS crystal structure represents a snapshot during functioning and suggest that the recognition of the D-loop by ArgRS is an important trigger that anchors tRNA Arg on the synthetase. In this process, A20 plays a major role, together with prominent conformational changes in several ArgRS domains that may eventually lead to the mature ArgRS:tRNA complex and the arginine activation. Functional implications that could be idiosyncratic to the arginine identity of bacterial ArgRSs are discussed. Copyright © 2018 Elsevier Ltd. All rights reserved.

  4. piRNA analysis framework from small RNA-Seq data by a novel cluster prediction tool - PILFER.

    Science.gov (United States)

    Ray, Rishav; Pandey, Priyanka

    2017-12-19

    With the increasing number of studies focusing on PIWI-interacting RNA (piRNAs), it is now pertinent to develop efficient tools dedicated towards piRNA analysis. We have developed a novel cluster prediction tool called PILFER (PIrna cLuster FindER), which can accurately predict piRNA clusters from small RNA sequencing data. PILFER is an open source, easy to use tool, and can be executed even on a personal computer with minimum resources. It uses a sliding-window mechanism by integrating the expression of the reads along with the spatial information to predict the piRNA clusters. We have additionally defined a piRNA analysis pipeline incorporating PILFER to detect and annotate piRNAs and their clusters from raw small RNA sequencing data and implemented it on publicly available data from healthy germline and somatic tissues. We compared PILFER with other existing piRNA cluster prediction tools and found it to be statistically more accurate and superior in many aspects such as the robustness of PILFER clusters is higher and memory efficiency is more. Overall, PILFER provides a fast and accurate solution to piRNA cluster prediction. Copyright © 2017 Elsevier Inc. All rights reserved.

  5. TRANSAT-- method for detecting the conserved helices of functional RNA structures, including transient, pseudo-knotted and alternative structures.

    Science.gov (United States)

    Wiebe, Nicholas J P; Meyer, Irmtraud M

    2010-06-24

    The prediction of functional RNA structures has attracted increased interest, as it allows us to study the potential functional roles of many genes. RNA structure prediction methods, however, assume that there is a unique functional RNA structure and also do not predict functional features required for in vivo folding. In order to understand how functional RNA structures form in vivo, we require sophisticated experiments or reliable prediction methods. So far, there exist only a few, experimentally validated transient RNA structures. On the computational side, there exist several computer programs which aim to predict the co-transcriptional folding pathway in vivo, but these make a range of simplifying assumptions and do not capture all features known to influence RNA folding in vivo. We want to investigate if evolutionarily related RNA genes fold in a similar way in vivo. To this end, we have developed a new computational method, Transat, which detects conserved helices of high statistical significance. We introduce the method, present a comprehensive performance evaluation and show that Transat is able to predict the structural features of known reference structures including pseudo-knotted ones as well as those of known alternative structural configurations. Transat can also identify unstructured sub-sequences bound by other molecules and provides evidence for new helices which may define folding pathways, supporting the notion that homologous RNA sequence not only assume a similar reference RNA structure, but also fold similarly. Finally, we show that the structural features predicted by Transat differ from those assuming thermodynamic equilibrium. Unlike the existing methods for predicting folding pathways, our method works in a comparative way. This has the disadvantage of not being able to predict features as function of time, but has the considerable advantage of highlighting conserved features and of not requiring a detailed knowledge of the cellular

  6. Structure of Lipid Nanoparticles Containing siRNA or mRNA by Dynamic Nuclear Polarization-Enhanced NMR Spectroscopy.

    Science.gov (United States)

    Viger-Gravel, Jasmine; Schantz, Anna; Pinon, Arthur C; Rossini, Aaron J; Schantz, Staffan; Emsley, Lyndon

    2018-02-22

    Here, we show how dynamic nuclear polarization (DNP) NMR spectroscopy experiments permit the atomic level structural characterization of loaded and empty lipid nanoparticles (LNPs). The LNPs used here were synthesized by the microfluidic mixing technique and are composed of ionizable cationic lipid (DLin-MC3-DMA), a phospholipid (distearoylphosphatidylcholine, DSPC), cholesterol, and poly(ethylene glycol) (PEG) (dimyristoyl phosphatidyl ethanolamine (DMPE)-PEG 2000), as well as encapsulated cargoes that are either phosphorothioated siRNA (50 or 100%) or mRNA. We show that LNPs form physically stable complexes with bioactive drug siRNA for a period of 94 days. Relayed DNP experiments are performed to study 1 H- 1 H spin diffusion and to determine the spatial location of the various components of the LNP by studying the average enhancement factors as a function of polarization time. We observe a striking feature of LNPs in the presence and in the absence of encapsulating siRNA or mRNA by comparing our experimental results to numerical spin-diffusion modeling. We observe that LNPs form a layered structure, and we detect that DSPC and DMPE-PEG 2000 lipids form a surface rich layer in the presence (or absence) of the cargoes and that the cholesterol and ionizable cationic lipid are embedded in the core. Furthermore, relayed DNP 31 P solid-state NMR experiments allow the location of the cargo encapsulated in the LNPs to be determined. On the basis of the results, we propose a new structural model for the LNPs that features a homogeneous core with a tendency for layering of DSPC and DMPE-PEG at the surface.

  7. Evolving stochastic context-free grammars for RNA secondary structure prediction

    DEFF Research Database (Denmark)

    Anderson, James WJ; Tataru, Paula Cristina; Stains, Joe

    2012-01-01

    Background Stochastic Context-Free Grammars (SCFGs) were applied successfully to RNA secondary structure prediction in the early 90s, and used in combination with comparative methods in the late 90s. The set of SCFGs potentially useful for RNA secondary structure prediction is very large, but a few...... to structure prediction as has been previously suggested. Results These search techniques were applied to predict RNA secondary structure on a maximal data set and revealed new and interesting grammars, though none are dramatically better than classic grammars. In general, results showed that many grammars...... with quite different structure could have very similar predictive ability. Many ambiguous grammars were found which were at least as effective as the best current unambiguous grammars. Conclusions Overall the method of evolving SCFGs for RNA secondary structure prediction proved effective in finding many...

  8. CentroidFold: a web server for RNA secondary structure prediction

    OpenAIRE

    Sato, Kengo; Hamada, Michiaki; Asai, Kiyoshi; Mituyama, Toutai

    2009-01-01

    The CentroidFold web server (http://www.ncrna.org/centroidfold/) is a web application for RNA secondary structure prediction powered by one of the most accurate prediction engine. The server accepts two kinds of sequence data: a single RNA sequence and a multiple alignment of RNA sequences. It responses with a prediction result shown as a popular base-pair notation and a graph representation. PDF version of the graph representation is also available. For a multiple alignment sequence, the ser...

  9. RNA secondary structure prediction by using discrete mathematics: an interdisciplinary research experience for undergraduate students.

    Science.gov (United States)

    Ellington, Roni; Wachira, James; Nkwanta, Asamoah

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses discrete mathematical techniques and identifies specified base pairs as parameters. The goal of the REU was to introduce upper-level undergraduate students to the principles and challenges of interdisciplinary research in molecular biology and discrete mathematics. At the beginning of the project, students from the biology and mathematics departments of a mid-sized university received instruction on the role of secondary structure in the function of eukaryotic RNAs and RNA viruses, RNA related to combinatorics, and the National Center for Biotechnology Information resources. The student research projects focused on RNA secondary structure prediction on a regulatory region of the yellow fever virus RNA genome and on an untranslated region of an mRNA of a gene associated with the neurological disorder epilepsy. At the end of the project, the REU students gave poster and oral presentations, and they submitted written final project reports to the program director. The outcome of the REU was that the students gained transferable knowledge and skills in bioinformatics and an awareness of the applications of discrete mathematics to biological research problems.

  10. RNA Secondary Structure Prediction by Using Discrete Mathematics: An Interdisciplinary Research Experience for Undergraduate Students

    Science.gov (United States)

    Ellington, Roni; Wachira, James

    2010-01-01

    The focus of this Research Experience for Undergraduates (REU) project was on RNA secondary structure prediction by using a lattice walk approach. The lattice walk approach is a combinatorial and computational biology method used to enumerate possible secondary structures and predict RNA secondary structure from RNA sequences. The method uses discrete mathematical techniques and identifies specified base pairs as parameters. The goal of the REU was to introduce upper-level undergraduate students to the principles and challenges of interdisciplinary research in molecular biology and discrete mathematics. At the beginning of the project, students from the biology and mathematics departments of a mid-sized university received instruction on the role of secondary structure in the function of eukaryotic RNAs and RNA viruses, RNA related to combinatorics, and the National Center for Biotechnology Information resources. The student research projects focused on RNA secondary structure prediction on a regulatory region of the yellow fever virus RNA genome and on an untranslated region of an mRNA of a gene associated with the neurological disorder epilepsy. At the end of the project, the REU students gave poster and oral presentations, and they submitted written final project reports to the program director. The outcome of the REU was that the students gained transferable knowledge and skills in bioinformatics and an awareness of the applications of discrete mathematics to biological research problems. PMID:20810968

  11. Visualizing RNA Secondary Structure Base Pair Binding Probabilities using Nested Concave Hulls

    OpenAIRE

    Sansen , Joris; Bourqui , Romain; Thebault , Patricia; Allali , Julien; Auber , David

    2015-01-01

    International audience; The challenge 1 of the BIOVIS 2015 design contest consists in designing an intuitive visual depiction of base pairs binding probabilities for secondary structure of ncRNA. Our representation depicts the potential nucleotide pairs binding using nested concave hulls over the computed MFE ncRNA secondary structure. Thus, it allows to identify regions with a high level of uncertainty in the MFE computation and the structures which seem to match to reality.

  12. The Globular State of the Single-Stranded RNA: Effect of the Secondary Structure Rearrangements

    Science.gov (United States)

    Grigoryan, Zareh A.; Karapetian, Armen T.

    2015-01-01

    The mutual influence of the slow rearrangements of secondary structure and fast collapse of the long single-stranded RNA (ssRNA) in approximation of coarse-grained model is studied with analytic calculations. It is assumed that the characteristic time of the secondary structure rearrangement is much longer than that for the formation of the tertiary structure. A nonequilibrium phase transition of the 2nd order has been observed. PMID:26345143

  13. The Globular State of the Single-Stranded RNA: Effect of the Secondary Structure Rearrangements

    Directory of Open Access Journals (Sweden)

    Zareh A. Grigoryan

    2015-01-01

    Full Text Available The mutual influence of the slow rearrangements of secondary structure and fast collapse of the long single-stranded RNA (ssRNA in approximation of coarse-grained model is studied with analytic calculations. It is assumed that the characteristic time of the secondary structure rearrangement is much longer than that for the formation of the tertiary structure. A nonequilibrium phase transition of the 2nd order has been observed.

  14. An image processing approach to computing distances between RNA secondary structures dot plots

    Directory of Open Access Journals (Sweden)

    Sapiro Guillermo

    2009-02-01

    Full Text Available Abstract Background Computing the distance between two RNA secondary structures can contribute in understanding the functional relationship between them. When used repeatedly, such a procedure may lead to finding a query RNA structure of interest in a database of structures. Several methods are available for computing distances between RNAs represented as strings or graphs, but none utilize the RNA representation with dot plots. Since dot plots are essentially digital images, there is a clear motivation to devise an algorithm for computing the distance between dot plots based on image processing methods. Results We have developed a new metric dubbed 'DoPloCompare', which compares two RNA structures. The method is based on comparing dot plot diagrams that represent the secondary structures. When analyzing two diagrams and motivated by image processing, the distance is based on a combination of histogram correlations and a geometrical distance measure. We introduce, describe, and illustrate the procedure by two applications that utilize this metric on RNA sequences. The first application is the RNA design problem, where the goal is to find the nucleotide sequence for a given secondary structure. Examples where our proposed distance measure outperforms others are given. The second application locates peculiar point mutations that induce significant structural alternations relative to the wild type predicted secondary structure. The approach reported in the past to solve this problem was tested on several RNA sequences with known secondary structures to affirm their prediction, as well as on a data set of ribosomal pieces. These pieces were computationally cut from a ribosome for which an experimentally derived secondary structure is available, and on each piece the prediction conveys similarity to the experimental result. Our newly proposed distance measure shows benefit in this problem as well when compared to standard methods used for assessing

  15. Web-Beagle: a web server for the alignment of RNA secondary structures.

    Science.gov (United States)

    Mattei, Eugenio; Pietrosanto, Marco; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2015-07-01

    Web-Beagle (http://beagle.bio.uniroma2.it) is a web server for the pairwise global or local alignment of RNA secondary structures. The server exploits a new encoding for RNA secondary structure and a substitution matrix of RNA structural elements to perform RNA structural alignments. The web server allows the user to compute up to 10 000 alignments in a single run, taking as input sets of RNA sequences and structures or primary sequences alone. In the latter case, the server computes the secondary structure prediction for the RNAs on-the-fly using RNAfold (free energy minimization). The user can also compare a set of input RNAs to one of five pre-compiled RNA datasets including lncRNAs and 3' UTRs. All types of comparison produce in output the pairwise alignments along with structural similarity and statistical significance measures for each resulting alignment. A graphical color-coded representation of the alignments allows the user to easily identify structural similarities between RNAs. Web-Beagle can be used for finding structurally related regions in two or more RNAs, for the identification of homologous regions or for functional annotation. Benchmark tests show that Web-Beagle has lower computational complexity, running time and better performances than other available methods. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  16. Purification, crystallization and preliminary X-ray diffraction analysis of the RNA-dependent RNA polymerase from Thosea asigna virus

    International Nuclear Information System (INIS)

    Ferrero, Diego; Buxaderas, Mònica; Rodriguez, José F.; Verdaguer, Núria

    2012-01-01

    The RNA-dependent RNA polymerase of Thosea asigna virus has been purified and crystallized in two different crystal forms. Preliminary characterization of P2 1 2 1 2 and C222 1 crystals is reported. Co-crystallization experiments in the presence of lutetium produced a heavy-atom derivative suitable for structure determination. Thosea asigna virus (TaV) is a positive-sense, single-stranded RNA (ssRNA) virus that belongs to the Permutotetravirus genera within the recently created Permutotetraviridae family. The genome of TaV consists of an RNA segment of about 5.700 nucleotides with two open reading frames, encoding for the replicase and capsid protein. The particular TaV replicase does not contain N7-methyl transferase and helicase domains but includes a structurally unique RNA-dependent RNA polymerase (RdRp) with a sequence permutation in the domain where the active site is anchored. This architecture is also found in double-stranded RNA viruses of the Birnaviridae family. Here we report the purification and preliminary crystallographic studies TaV RdRp. The enzyme was crystallized by the sitting-drop vapour diffusion method using PEG 8K and lithium sulfate as precipitants. Two different crystal forms were obtained: native RdRp crystallized in space group P2 1 2 1 2 and diffracts up to 2.1 Å and the RdRp-Lu 3+ derivative co-crystals belong to the C222 1 space group, diffracting to 3.0 Å resolution. The structure of TaV RdRp represents the first structure of a non-canonical RdRp from ssRNA viruses

  17. The identification and functional annotation of RNA structures conserved in vertebrates.

    Science.gov (United States)

    Seemann, Stefan E; Mirza, Aashiq H; Hansen, Claus; Bang-Berthelsen, Claus H; Garde, Christian; Christensen-Dalsgaard, Mikkel; Torarinsson, Elfar; Yao, Zizhen; Workman, Christopher T; Pociot, Flemming; Nielsen, Henrik; Tommerup, Niels; Ruzzo, Walter L; Gorodkin, Jan

    2017-08-01

    Structured elements of RNA molecules are essential in, e.g., RNA stabilization, localization, and protein interaction, and their conservation across species suggests a common functional role. We computationally screened vertebrate genomes for conserved RNA structures (CRSs), leveraging structure-based, rather than sequence-based, alignments. After careful correction for sequence identity and GC content, we predict ∼516,000 human genomic regions containing CRSs. We find that a substantial fraction of human-mouse CRS regions (1) colocalize consistently with binding sites of the same RNA binding proteins (RBPs) or (2) are transcribed in corresponding tissues. Additionally, a CaptureSeq experiment revealed expression of many of our CRS regions in human fetal brain, including 662 novel ones. For selected human and mouse candidate pairs, qRT-PCR and in vitro RNA structure probing supported both shared expression and shared structure despite low abundance and low sequence identity. About 30,000 CRS regions are located near coding or long noncoding RNA genes or within enhancers. Structured (CRS overlapping) enhancer RNAs and extended 3' ends have significantly increased expression levels over their nonstructured counterparts. Our findings of transcribed uncharacterized regulatory regions that contain CRSs support their RNA-mediated functionality. © 2017 Seemann et al.; Published by Cold Spring Harbor Laboratory Press.

  18. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    Science.gov (United States)

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  19. Structural analysis for Diagnosis

    DEFF Research Database (Denmark)

    Izadi-Zamanabadi, Roozbeh; Blanke, M.

    2001-01-01

    Aiming at design of algorithms for fault diagnosis, structural analysis of systems offers concise yet easy overall analysis. Graph-based matching, which is the essential technique to obtain redundant information for diagnosis, is re-considered in this paper. Matching is re-formulated as a problem...... of relating faults to known parameters and measurements of a system. Using explicit fault modelling, minimal over-determined subsystems are shown to provide necessary redundancy relations from the matching. Details of the method are presented and a realistic example used to clearly describe individual steps...

  20. Structural analysis for diagnosis

    DEFF Research Database (Denmark)

    Izadi-Zamanabadi, Roozbeh; Blanke, M.

    2002-01-01

    Aiming at design of algorithms for fault diagnosis, structural analysis of systems offers concise yet easy overall analysis. Graph-based matching, which is the essential tech-nique to obtain redundant information for diagnosis, is reconsidered in this paper. Matching is reformulated as a problem...... of relating faults to known parameters and measurements of a system. Using explicit fault modelling, minimal overdetermined subsystems are shown to provide necessary redundancy relations from the matching. Details of the method are presented and a realistic example used to clearly describe individual steps....

  1. Structural analysis of DAEs

    DEFF Research Database (Denmark)

    Poulsen, Mikael Zebbelin

    2002-01-01

    , by the implementation of the Simpy tool box. This is an object oriented system implemented in the Python language. It can be used for analysis of DAEs, ODEs and non-linear equation and uses e.g. symbolic representations of expressions and equations. The presentations of theory and algorithms for structural index......Differential algebraic equations (DAEs) constitute a fundamental model class for many modelling purposes in engineering and other sciences, especially for dynamical simulation of component based systems. This thesis describes a practical methodology and approach for analysing general DAE...... analysis of DAE is original in the sense that it is based on a new matrix representation of the structural information of a general DAE system instead of a graph oriented representation. Also the presentation of the theory is found to be more complete compared to other presentations, since it e.g. proves...

  2. Application of ion mobility-mass spectrometry to microRNA analysis.

    Science.gov (United States)

    Takebayashi, Kosuke; Hirose, Kenji; Izumi, Yoshihiro; Bamba, Takeshi; Fukusaki, Eiichiro

    2013-03-01

    Liquid chromatography/mass spectrometry is widely used for studying sequence determination and modification analysis of small RNAs. However, the efficiency of liquid chromatography-based separation of intact small RNA species is insufficient, since the physiochemical properties among small RNAs are very similar. In this study, we focused on ion mobility-mass spectrometry (IM-MS), which is a gas-phase separation technique coupled with mass spectrometry; we have evaluated the utility of IM-MS for microRNA (miRNA) analysis. A multiply charged deprotonated ion derived from an 18-24-nt-long miRNA was formed by electrospray ionization, and then the time, called the "drift time", taken by each ion to migrate through a buffer gas was measured. Each multivalent ion was temporally separated on the basis of the charge state and structural formation; 3 types of unique mass-mobility correlation patterns (i.e., chainlike-form, hairpin-form, and dimer-form) were present on the two-dimensional mobility-mass spectrum. Moreover, we found that the ion size (sequence length) and the secondary structures of the small RNAs strongly contributed to the IM-MS-based separation, although solvent conditions such as pH had no effect. Therefore, sequence isomers could also be discerned by the selection of each specific charged ion, i.e., the 6(-) charged ion reflected a majority among chainlike-, hairpin-, and other structures. We concluded that the IM-MS provides additional capability for separation; thus, this analytical method will be a powerful tool for comprehensive small RNA analysis. Copyright © 2012. Published by Elsevier B.V.

  3. Getting the most out of RNA-seq data analysis

    Directory of Open Access Journals (Sweden)

    Tsung Fei Khang

    2015-10-01

    Full Text Available Background. A common research goal in transcriptome projects is to find genes that are differentially expressed in different phenotype classes. Biologists might wish to validate such gene candidates experimentally, or use them for downstream systems biology analysis. Producing a coherent differential gene expression analysis from RNA-seq count data requires an understanding of how numerous sources of variation such as the replicate size, the hypothesized biological effect size, and the specific method for making differential expression calls interact. We believe an explicit demonstration of such interactions in real RNA-seq data sets is of practical interest to biologists.Results. Using two large public RNA-seq data sets—one representing strong, and another mild, biological effect size—we simulated different replicate size scenarios, and tested the performance of several commonly-used methods for calling differentially expressed genes in each of them. We found that, when biological effect size was mild, RNA-seq experiments should focus on experimental validation of differentially expressed gene candidates. Importantly, at least triplicates must be used, and the differentially expressed genes should be called using methods with high positive predictive value (PPV, such as NOISeq or GFOLD. In contrast, when biological effect size was strong, differentially expressed genes mined from unreplicated experiments using NOISeq, ASC and GFOLD had between 30 to 50% mean PPV, an increase of more than 30-fold compared to the cases of mild biological effect size. Among methods with good PPV performance, having triplicates or more substantially improved mean PPV to over 90% for GFOLD, 60% for DESeq2, 50% for NOISeq, and 30% for edgeR. At a replicate size of six, we found DESeq2 and edgeR to be reasonable methods for calling differentially expressed genes at systems level analysis, as their PPV and sensitivity trade-off were superior to the other methods

  4. The PETfold and PETcofold web servers for intra- and intermolecular structures of multiple RNA sequences

    DEFF Research Database (Denmark)

    Seemann, Ernst Stefan; Menzel, Karl Peter; Backofen, Rolf

    2011-01-01

    gene. We present web servers to analyze multiple RNA sequences for common RNA structure and for RNA interaction sites. The web servers are based on the recent PET (Probabilistic Evolutionary and Thermodynamic) models PETfold and PETcofold, but add user friendly features ranging from a graphical layer...... to interactive usage of the predictors. Additionally, the web servers provide direct access to annotated RNA alignments, such as the Rfam 10.0 database and multiple alignments of 16 vertebrate genomes with human. The web servers are freely available at: http://rth.dk/resources/petfold/...

  5. A novel knowledge-based potential for RNA 3D structure evaluation

    Science.gov (United States)

    Yang, Yi; Gu, Qi; Zhang, Ben-Gong; Shi, Ya-Zhou; Shao, Zhi-Gang

    2018-03-01

    Ribonucleic acids (RNAs) play a vital role in biology, and knowledge of their three-dimensional (3D) structure is required to understand their biological functions. Recently structural prediction methods have been developed to address this issue, but a series of RNA 3D structures are generally predicted by most existing methods. Therefore, the evaluation of the predicted structures is generally indispensable. Although several methods have been proposed to assess RNA 3D structures, the existing methods are not precise enough. In this work, a new all-atom knowledge-based potential is developed for more accurately evaluating RNA 3D structures. The potential not only includes local and nonlocal interactions but also fully considers the specificity of each RNA by introducing a retraining mechanism. Based on extensive test sets generated from independent methods, the proposed potential correctly distinguished the native state and ranked near-native conformations to effectively select the best. Furthermore, the proposed potential precisely captured RNA structural features such as base-stacking and base-pairing. Comparisons with existing potential methods show that the proposed potential is very reliable and accurate in RNA 3D structure evaluation. Project supported by the National Science Foundation of China (Grants Nos. 11605125, 11105054, 11274124, and 11401448).

  6. StralSV: assessment of sequence variability within similar 3D structures and application to polio RNA-dependent RNA polymerase

    Energy Technology Data Exchange (ETDEWEB)

    Zemla, A; Lang, D; Kostova, T; Andino, R; Zhou, C

    2010-11-29

    Most of the currently used methods for protein function prediction rely on sequence-based comparisons between a query protein and those for which a functional annotation is provided. A serious limitation of sequence similarity-based approaches for identifying residue conservation among proteins is the low confidence in assigning residue-residue correspondences among proteins when the level of sequence identity between the compared proteins is poor. Multiple sequence alignment methods are more satisfactory - still, they cannot provide reliable results at low levels of sequence identity. Our goal in the current work was to develop an algorithm that could overcome these difficulties and facilitate the identification of structurally (and possibly functionally) relevant residue-residue correspondences between compared protein structures. Here we present StralSV, a new algorithm for detecting closely related structure fragments and quantifying residue frequency from tight local structure alignments. We apply StralSV in a study of the RNA-dependent RNA polymerase of poliovirus and demonstrate that the algorithm can be used to determine regions of the protein that are relatively unique or that shared structural similarity with structures that are distantly related. By quantifying residue frequencies among many residue-residue pairs extracted from local alignments, one can infer potential structural or functional importance of specific residues that are determined to be highly conserved or that deviate from a consensus. We further demonstrate that considerable detailed structural and phylogenetic information can be derived from StralSV analyses. StralSV is a new structure-based algorithm for identifying and aligning structure fragments that have similarity to a reference protein. StralSV analysis can be used to quantify residue-residue correspondences and identify residues that may be of particular structural or functional importance, as well as unusual or unexpected

  7. Structure, dynamics and RNA binding of the multi-domain splicing factor TIA-1

    Science.gov (United States)

    Wang, Iren; Hennig, Janosch; Jagtap, Pravin Kumar Ankush; Sonntag, Miriam; Valcárcel, Juan; Sattler, Michael

    2014-01-01

    Alternative pre-messenger ribonucleic acid (pre-mRNA) splicing is an essential process in eukaryotic gene regulation. The T-cell intracellular antigen-1 (TIA-1) is an apoptosis-promoting factor that modulates alternative splicing of transcripts, including the pre-mRNA encoding the membrane receptor Fas. TIA-1 is a multi-domain ribonucleic acid (RNA) binding protein that recognizes poly-uridine tract RNA sequences to facilitate 5′ splice site recognition by the U1 small nuclear ribonucleoprotein (snRNP). Here, we characterize the RNA interaction and conformational dynamics of TIA-1 by nuclear magnetic resonance (NMR), isothermal titration calorimetry (ITC) and small angle X-ray scattering (SAXS). Our NMR-derived solution structure of TIA-1 RRM2–RRM3 (RRM2,3) reveals that RRM2 adopts a canonical RNA recognition motif (RRM) fold, while RRM3 is preceded by an non-canonical helix α0. NMR and SAXS data show that all three RRMs are largely independent structural modules in the absence of RNA, while RNA binding induces a compact arrangement. RRM2,3 binds to pyrimidine-rich FAS pre-mRNA or poly-uridine (U9) RNA with nanomolar affinities. RRM1 has little intrinsic RNA binding affinity and does not strongly contribute to RNA binding in the context of RRM1,2,3. Our data unravel the role of binding avidity and the contributions of the TIA-1 RRMs for recognition of pyrimidine-rich RNAs. PMID:24682828

  8. Advancing viral RNA structure prediction: measuring the thermodynamics of pyrimidine-rich internal loops.

    Science.gov (United States)

    Phan, Andy; Mailey, Katherine; Saeki, Jessica; Gu, Xiaobo; Schroeder, Susan J

    2017-05-01

    Accurate thermodynamic parameters improve RNA structure predictions and thus accelerate understanding of RNA function and the identification of RNA drug binding sites. Many viral RNA structures, such as internal ribosome entry sites, have internal loops and bulges that are potential drug target sites. Current models used to predict internal loops are biased toward small, symmetric purine loops, and thus poorly predict asymmetric, pyrimidine-rich loops with >6 nucleotides (nt) that occur frequently in viral RNA. This article presents new thermodynamic data for 40 pyrimidine loops, many of which can form UU or protonated CC base pairs. Uracil and protonated cytosine base pairs stabilize asymmetric internal loops. Accurate prediction rules are presented that account for all thermodynamic measurements of RNA asymmetric internal loops. New loop initiation terms for loops with >6 nt are presented that do not follow previous assumptions that increasing asymmetry destabilizes loops. Since the last 2004 update, 126 new loops with asymmetry or sizes greater than 2 × 2 have been measured. These new measurements significantly deepen and diversify the thermodynamic database for RNA. These results will help better predict internal loops that are larger, pyrimidine-rich, and occur within viral structures such as internal ribosome entry sites. © 2017 Phan et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  9. Exact calculation of loop formation probability identifies folding motifs in RNA secondary structures

    Science.gov (United States)

    Sloma, Michael F.; Mathews, David H.

    2016-01-01

    RNA secondary structure prediction is widely used to analyze RNA sequences. In an RNA partition function calculation, free energy nearest neighbor parameters are used in a dynamic programming algorithm to estimate statistical properties of the secondary structure ensemble. Previously, partition functions have largely been used to estimate the probability that a given pair of nucleotides form a base pair, the conditional stacking probability, the accessibility to binding of a continuous stretch of nucleotides, or a representative sample of RNA structures. Here it is demonstrated that an RNA partition function can also be used to calculate the exact probability of formation of hairpin loops, internal loops, bulge loops, or multibranch loops at a given position. This calculation can also be used to estimate the probability of formation of specific helices. Benchmarking on a set of RNA sequences with known secondary structures indicated that loops that were calculated to be more probable were more likely to be present in the known structure than less probable loops. Furthermore, highly probable loops are more likely to be in the known structure than the set of loops predicted in the lowest free energy structures. PMID:27852924

  10. Capturing alternative secondary structures of RNA by decomposition of base-pairing probabilities.

    Science.gov (United States)

    Hagio, Taichi; Sakuraba, Shun; Iwakiri, Junichi; Mori, Ryota; Asai, Kiyoshi

    2018-02-19

    It is known that functional RNAs often switch their functions by forming different secondary structures. Popular tools for RNA secondary structures prediction, however, predict the single 'best' structures, and do not produce alternative structures. There are bioinformatics tools to predict suboptimal structures, but it is difficult to detect which alternative secondary structures are essential. We proposed a new computational method to detect essential alternative secondary structures from RNA sequences by decomposing the base-pairing probability matrix. The decomposition is calculated by a newly implemented software tool, RintW, which efficiently computes the base-pairing probability distributions over the Hamming distance from arbitrary reference secondary structures. The proposed approach has been demonstrated on ROSE element RNA thermometer sequence and Lysine RNA ribo-switch, showing that the proposed approach captures conformational changes in secondary structures. We have shown that alternative secondary structures are captured by decomposing base-paring probabilities over Hamming distance. Source code is available from http://www.ncRNA.org/RintW .

  11. URS DataBase: universe of RNA structures and their motifs.

    Science.gov (United States)

    Baulin, Eugene; Yacovlev, Victor; Khachko, Denis; Spirin, Sergei; Roytberg, Mikhail

    2016-01-01

    The Universe of RNA Structures DataBase (URSDB) stores information obtained from all RNA-containing PDB entries (2935 entries in October 2015). The content of the database is updated regularly. The database consists of 51 tables containing indexed data on various elements of the RNA structures. The database provides a web interface allowing user to select a subset of structures with desired features and to obtain various statistical data for a selected subset of structures or for all structures. In particular, one can easily obtain statistics on geometric parameters of base pairs, on structural motifs (stems, loops, etc.) or on different types of pseudoknots. The user can also view and get information on an individual structure or its selected parts, e.g. RNA-protein hydrogen bonds. URSDB employs a new original definition of loops in RNA structures. That definition fits both pseudoknot-free and pseudoknotted secondary structures and coincides with the classical definition in case of pseudoknot-free structures. To our knowledge, URSDB is the first database supporting searches based on topological classification of pseudoknots and on extended loop classification.Database URL: http://server3.lpm.org.ru/urs/. © The Author(s) 2016. Published by Oxford University Press.

  12. Structural organization of the transfer RNA operon I of Vibrio cholerae

    Indian Academy of Sciences (India)

    Unknown

    [Ghatak A, Majumdar A and Ghosh R K 2005 Structural organization of the transfer RNA operon I of Vibrio cholerae: Differences ..... clonal relationship are of utmost importance. ... rately derived from environmental, nontoxigenic, non-O1.

  13. Determination of low-energy structures of a small RNA hairpin using ...

    Indian Academy of Sciences (India)

    2012-06-25

    Jun 25, 2012 ... such as water clusters, Argon clusters and different biomo- lecules. ... Keywords. Energy landscape; Monte Carlo; optimization; RNA; tertiary structure .... tui large ribosomal subunit (Residues 310–321, PDB ID: 1JJ2).

  14. The crystal structure and small-angle X-ray analysis of CsdL/TcdA reveal a new tRNA binding motif in the MoeB/E1 superfamily.

    Directory of Open Access Journals (Sweden)

    Miguel López-Estepa

    Full Text Available Cyclic N6-threonylcarbamoyladenosine ('cyclic t6A', ct(6A is a non-thiolated hypermodification found in transfer RNAs (tRNAs in bacteria, protists, fungi and plants. In bacteria and yeast cells ct(6A has been shown to enhance translation fidelity and efficiency of ANN codons by improving the faithful discrimination of aminoacylated tRNAs by the ribosome. To further the understanding of ct(6A biology we have determined the high-resolution crystal structures of CsdL/TcdA in complex with AMP and ATP, an E1-like activating enzyme from Escherichia coli, which catalyzes the ATP-dependent dehydration of t6A to form ct(6A. CsdL/TcdA is a dimer whose structural integrity and dimer interface depend critically on strongly bound K+ and Na+ cations. By using biochemical assays and small-angle X-ray scattering we show that CsdL/TcdA can associate with tRNA with a 1:1 stoichiometry and with the proper position and orientation for the cyclization of t6A. Furthermore, we show by nuclear magnetic resonance that CsdL/TcdA engages in transient interactions with CsdA and CsdE, which, in the latter case, involve catalytically important residues. These short-lived interactions may underpin the precise channeling of sulfur atoms from cysteine to CsdL/TcdA as previously characterized. In summary, the combination of structural, biophysical and biochemical methods applied to CsdL/TcdA has afforded a more thorough understanding of how the structure of this E1-like enzyme has been fine tuned to accomplish ct(6A synthesis on tRNAs while providing support for the notion that CsdA and CsdE are able to functionally interact with CsdL/TcdA.

  15. BioVLAB-MMIA-NGS: microRNA-mRNA integrated analysis using high-throughput sequencing data.

    Science.gov (United States)

    Chae, Heejoon; Rhee, Sungmin; Nephew, Kenneth P; Kim, Sun

    2015-01-15

    It is now well established that microRNAs (miRNAs) play a critical role in regulating gene expression in a sequence-specific manner, and genome-wide efforts are underway to predict known and novel miRNA targets. However, the integrated miRNA-mRNA analysis remains a major computational challenge, requiring powerful informatics systems and bioinformatics expertise. The objective of this study was to modify our widely recognized Web server for the integrated mRNA-miRNA analysis (MMIA) and its subsequent deployment on the Amazon cloud (BioVLAB-MMIA) to be compatible with high-throughput platforms, including next-generation sequencing (NGS) data (e.g. RNA-seq). We developed a new version called the BioVLAB-MMIA-NGS, deployed on both Amazon cloud and on a high-performance publicly available server called MAHA. By using NGS data and integrating various bioinformatics tools and databases, BioVLAB-MMIA-NGS offers several advantages. First, sequencing data is more accurate than array-based methods for determining miRNA expression levels. Second, potential novel miRNAs can be detected by using various computational methods for characterizing miRNAs. Third, because miRNA-mediated gene regulation is due to hybridization of an miRNA to its target mRNA, sequencing data can be used to identify many-to-many relationship between miRNAs and target genes with high accuracy. http://epigenomics.snu.ac.kr/biovlab_mmia_ngs/. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  16. Integrated Analysis of Long Noncoding RNA and mRNA Expression Profile in Advanced Laryngeal Squamous Cell Carcinoma.

    Directory of Open Access Journals (Sweden)

    Ling Feng

    Full Text Available Long non-coding RNA (lncRNA plays an important role in tumorigenesis. However, the expression pattern and function of lncRNAs in laryngeal squamous cell carcinoma (LSCC are still unclear. To investigate the aberrantly expressed lncRNAs and mRNAs in advanced LSCC, we screened lncRNA and mRNA expression profiles in 9 pairs of primary Stage IVA LSCC tissues and adjacent non-neoplastic tissues by lncRNA and mRNA integrated microarrays. Gene Ontology and pathway analysis were performed to find out the significant function and pathway of the differentially expressed mRNAs, gene-gene functional interaction network and ceRNA network were constructed to select core mRNAs, and lncRNA-mRNA expression correlation network was built to identify the interactions between lncRNA and mRNA. qRT-PCR was performed to further validate the expressions of selected lncRNAs and mRNAs in advanced LSCC. We found 1459 differentially expressed lncRNAs and 2381 differentially expressed mRNAs, including 846 up-regulated lncRNAs and 613 down-regulated lncRNAs, 1542 up-regulated mRNAs and 839 down-regulated mRNAs. The mRNAs ITGB1, HIF1A, and DDIT4 were selected as core mRNAs, which are mainly involved in biological processes, such as matrix organization, cell cycle, adhesion, and metabolic pathway. LncRNA-mRNA expression correlation network showed LncRNA NR_027340, MIR31HG were positively correlated with ITGB1, HIF1A respectively. LncRNA SOX2-OT was negatively correlated with DDIT4. qRT-PCR further validated the expression of these lncRNAs and mRNAs. The work provides convincing evidence that the identified lncRNAs and mRNAs are potential biomarkers in advanced LSCC for further future studies.

  17. RNA-Seq as an Emerging Tool for Marine Dinoflagellate Transcriptome Analysis: Process and Challenges

    Directory of Open Access Journals (Sweden)

    Muhamad Afiq Akbar

    2018-01-01

    Full Text Available Dinoflagellates are the large group of marine phytoplankton with primary studies interest regarding their symbiosis with coral reef and the abilities to form harmful algae blooms (HABs. Toxin produced by dinoflagellates during events of HABs cause severe negative impact both in the economy and health sector. However, attempts to understand the dinoflagellates genomic features are hindered by their complex genome organization. Transcriptomics have been employed to understand dinoflagellates genome structure, profile genes and gene expression. RNA-seq is one of the latest methods for transcriptomics study. This method is capable of profiling the dinoflagellates transcriptomes and has several advantages, including highly sensitive, cost effective and deeper sequence coverage. Thus, in this review paper, the current workflow of dinoflagellates RNA-seq starts with the extraction of high quality RNA and is followed by cDNA sequencing using the next-generation sequencing platform, dinoflagellates transcriptome assembly and computational analysis will be discussed. Certain consideration needs will be highlighted such as difficulty in dinoflagellates sequence annotation, post-transcriptional activity and the effect of RNA pooling when using RNA-seq.

  18. In vitro synthesis of minus-strand RNA by an isolated cereal yellow dwarf virus RNA-dependent RNA polymerase requires VPg and a stem-loop structure at the 3' end of the virus RNA.

    Science.gov (United States)

    Osman, Toba A M; Coutts, Robert H A; Buck, Kenneth W

    2006-11-01

    Cereal yellow dwarf virus (CYDV) RNA has a 5'-terminal genome-linked protein (VPg). We have expressed the VPg region of the CYDV genome in bacteria and used the purified protein (bVPg) to raise an antiserum which was able to detect free VPg in extracts of CYDV-infected oat plants. A template-dependent RNA-dependent RNA polymerase (RdRp) has been produced from a CYDV membrane-bound RNA polymerase by treatment with BAL 31 nuclease. The RdRp was template specific, being able to utilize templates from CYDV plus- and minus-strand RNAs but not those of three unrelated viruses, Red clover necrotic mosaic virus, Cucumber mosaic virus, and Tobacco mosaic virus. RNA synthesis catalyzed by the RdRp required a 3'-terminal GU sequence and the presence of bVPg. Additionally, synthesis of minus-strand RNA on a plus-strand RNA template required the presence of a putative stem-loop structure near the 3' terminus of CYDV RNA. The base-paired stem, a single-nucleotide (A) bulge in the stem, and the sequence of a tetraloop were all required for the template activity. Evidence was produced showing that minus-strand synthesis in vitro was initiated by priming by bVPg at the 3' end of the template. The data are consistent with a model in which the RdRp binds to the stem-loop structure which positions the active site to recognize the 3'-terminal GU sequence for initiation of RNA synthesis by the addition of an A residue to VPg.

  19. The FOLDALIGN web server for pairwise structural RNA alignment and mutual motif search

    DEFF Research Database (Denmark)

    Havgaard, Jakob Hull; Lyngsø, Rune B.; Gorodkin, Jan

    2005-01-01

    FOLDALIGN is a Sankoff-based algorithm for making structural alignments of RNA sequences. Here, we present a web server for making pairwise alignments between two RNA sequences, using the recently updated version of FOLDALIGN. The server can be used to scan two sequences for a common structural RNA...... motif of limited size, or the entire sequences can be aligned locally or globally. The web server offers a graphical interface, which makes it simple to make alignments and manually browse the results. the web server can be accessed at http://foldalign.kvl.dk...

  20. Unzippers, Resolvers and Sensors: A Structural and Functional Biochemistry Tale of RNA Helicases

    Directory of Open Access Journals (Sweden)

    Ana Lúcia Leitão

    2015-01-01

    Full Text Available The centrality of RNA within the biological world is an irrefutable fact that currently attracts increasing attention from the scientific community. The panoply of functional RNAs requires the existence of specific biological caretakers, RNA helicases, devoted to maintain the proper folding of those molecules, resolving unstable structures. However, evolution has taken advantage of the specific position and characteristics of RNA helicases to develop new functions for these proteins, which are at the interface of the basic processes for transference of information from DNA to proteins. RNA helicases are involved in many biologically relevant processes, not only as RNA chaperones, but also as signal transducers, scaffolds of molecular complexes, and regulatory elements. Structural biology studies during the last decade, founded in X-ray crystallography, have characterized in detail several RNA-helicases. This comprehensive review summarizes the structural knowledge accumulated in the last two decades within this family of proteins, with special emphasis on the structure-function relationships of the most widely-studied families of RNA helicases: the DEAD-box, RIG-I-like and viral NS3 classes.

  1. RNA Sequencing and Bioinformatics Analysis Implicate the Regulatory Role of a Long Noncoding RNA-mRNA Network in Hepatic Stellate Cell Activation.

    Science.gov (United States)

    Guo, Can-Jie; Xiao, Xiao; Sheng, Li; Chen, Lili; Zhong, Wei; Li, Hai; Hua, Jing; Ma, Xiong

    2017-01-01

    To analyze the long noncoding (lncRNA)-mRNA expression network and potential roles in rat hepatic stellate cells (HSCs) during activation. LncRNA expression was analyzed in quiescent and culture-activated HSCs by RNA sequencing, and differentially expressed lncRNAs verified by quantitative reverse transcription polymerase chain reaction (qRT-PCR) were subjected to bioinformatics analysis. In vivo analyses of differential lncRNA-mRNA expression were performed on a rat model of liver fibrosis. We identified upregulation of 12 lncRNAs and 155 mRNAs and downregulation of 12 lncRNAs and 374 mRNAs in activated HSCs. Additionally, we identified the differential expression of upregulated lncRNAs (NONRATT012636.2, NONRATT016788.2, and NONRATT021402.2) and downregulated lncRNAs (NONRATT007863.2, NONRATT019720.2, and NONRATT024061.2) in activated HSCs relative to levels observed in quiescent HSCs, and Gene Ontology and Kyoto Encyclopedia of Genes and Genomes pathway analyses showed that changes in lncRNAs associated with HSC activation revealed 11 significantly enriched pathways according to their predicted targets. Moreover, based on the predicted co-expression network, the relative dynamic levels of NONRATT013819.2 and lysyl oxidase (Lox) were compared during HSC activation both in vitro and in vivo. Our results confirmed the upregulation of lncRNA NONRATT013819.2 and Lox mRNA associated with the extracellular matrix (ECM)-related signaling pathway in HSCs and fibrotic livers. Our results detailing a dysregulated lncRNA-mRNA network might provide new treatment strategies for hepatic fibrosis based on findings indicating potentially critical roles for NONRATT013819.2 and Lox in ECM remodeling during HSC activation. © 2017 The Author(s). Published by S. Karger AG, Basel.

  2. Thermodynamic heuristics with case-based reasoning: combined insights for RNA pseudoknot secondary structure.

    Science.gov (United States)

    Al-Khatib, Ra'ed M; Rashid, Nur'Aini Abdul; Abdullah, Rosni

    2011-08-01

    The secondary structure of RNA pseudoknots has been extensively inferred and scrutinized by computational approaches. Experimental methods for determining RNA structure are time consuming and tedious; therefore, predictive computational approaches are required. Predicting the most accurate and energy-stable pseudoknot RNA secondary structure has been proven to be an NP-hard problem. In this paper, a new RNA folding approach, termed MSeeker, is presented; it includes KnotSeeker (a heuristic method) and Mfold (a thermodynamic algorithm). The global optimization of this thermodynamic heuristic approach was further enhanced by using a case-based reasoning technique as a local optimization method. MSeeker is a proposed algorithm for predicting RNA pseudoknot structure from individual sequences, especially long ones. This research demonstrates that MSeeker improves the sensitivity and specificity of existing RNA pseudoknot structure predictions. The performance and structural results from this proposed method were evaluated against seven other state-of-the-art pseudoknot prediction methods. The MSeeker method had better sensitivity than the DotKnot, FlexStem, HotKnots, pknotsRG, ILM, NUPACK and pknotsRE methods, with 79% of the predicted pseudoknot base-pairs being correct.

  3. A comparative analysis of measles virus RNA by oligonucleotide fingerprinting

    International Nuclear Information System (INIS)

    Stephenson, J.R.; Meulen, V. ter

    1982-01-01

    Isolates from two cases of acute measles, one case of acute measles encephalitis and three patients with subacute sclerosing panencephalitis were compared. This comparison was based upon the electrophoretic analysis of T 1 oligonucleotides from single-stranded, full-length RNA isolated from cytoplasmic nucleocapsids. Although all viruses have oligonucleotides in common, each isolate generated a unique pattern of oligonucleotides. However, no group of oligonucleotides was observed which would allow differentiation between viruses isolated from acute infections and those isolated from CNS diseases; indicating that probably all measles viruses differ in their nucleotide sequence, regardless of origin. (Author)

  4. Full-length RNA structure prediction of the HIV-1 genome reveals a conserved core domain

    DEFF Research Database (Denmark)

    Sükösd, Zsuzsanna; Andersen, Ebbe Sloth; Seemann, Ernst Stefan

    2015-01-01

    of the HIV-1 genome is highly variable in most regions, with a limited number of stable and conserved RNA secondary structures. Most interesting, a set of long distance interactions form a core organizing structure (COS) that organize the genome into three major structural domains. Despite overlapping...

  5. Structural Basis for dsRNA Recognition by NS1 Protein of Influenza A Virus

    Energy Technology Data Exchange (ETDEWEB)

    Cheng, A.; Wong, S; Yuan, Y

    2009-01-01

    Influenza A viruses are important human pathogens causing periodic pandemic threats. Nonstructural protein 1 (NS1) protein of influenza A virus (NS1A) shields the virus against host defense. Here, we report the crystal structure of NS1A RNA-binding domain (RBD) bound to a double-stranded RNA (dsRNA) at 1.7A. NS1A RBD forms a homodimer to recognize the major groove of A-form dsRNA in a length-independent mode by its conserved concave surface formed by dimeric anti-parallel alpha-helices. dsRNA is anchored by a pair of invariable arginines (Arg38) from both monomers by extensive hydrogen bonds. In accordance with the structural observation, isothermal titration calorimetry assay shows that the unique Arg38-Arg38 pair and two Arg35-Arg46 pairs are crucial for dsRNA binding, and that Ser42 and Thr49 are also important for dsRNA binding. Agrobacterium co-infiltration assay further supports that the unique Arg38 pair plays important roles in dsRNA binding in vivo.

  6. Structured Analysis - IDEF0

    DEFF Research Database (Denmark)

    Larsen, Michael Holm

    1999-01-01

    This note introduces the IDEF0 modelling language (semantics and syntax), and associated rules and techniques, for developing structured graphical representations of a system or enterprise. Use of this standard for IDEF0 permits the construction of models comprising system functions (activities...... that require a modelling technique for the analysis, development, re-engineering, integration, or acquisition of information systems; and incorporate a systems or enterprise modelling technique into a business process analysis or software engineering methodology.This note is a summary of the Standard...... for Integration Definition for Function Modelling (IDEF0). I.e. the Draft Federal Information Processing Standards Publication 183, 1993, December 21, Announcing the Standard for Integration Definition for Function Modelling (IDEF0)....

  7. Parallel mRNA, proteomics and miRNA expression analysis in cell line models of the intestine.

    Science.gov (United States)

    O'Sullivan, Finbarr; Keenan, Joanne; Aherne, Sinead; O'Neill, Fiona; Clarke, Colin; Henry, Michael; Meleady, Paula; Breen, Laura; Barron, Niall; Clynes, Martin; Horgan, Karina; Doolan, Padraig; Murphy, Richard

    2017-11-07

    To identify miRNA-regulated proteins differentially expressed between Caco2 and HT-29: two principal cell line models of the intestine. Exponentially growing Caco-2 and HT-29 cells were harvested and prepared for mRNA, miRNA and proteomic profiling. mRNA microarray profiling analysis was carried out using the Affymetrix GeneChip Human Gene 1.0 ST array. miRNA microarray profiling analysis was carried out using the Affymetrix Genechip miRNA 3.0 array. Quantitative Label-free LC-MS/MS proteomic analysis was performed using a Dionex Ultimate 3000 RSLCnano system coupled to a hybrid linear ion trap/Orbitrap mass spectrometer. Peptide identities were validated in Proteome Discoverer 2.1 and were subsequently imported into Progenesis QI software for further analysis. Hierarchical cluster analysis for all three parallel datasets (miRNA, proteomics, mRNA) was conducted in the R software environment using the Euclidean distance measure and Ward's clustering algorithm. The prediction of miRNA and oppositely correlated protein/mRNA interactions was performed using TargetScan 6.1. GO biological process, molecular function and cellular component enrichment analysis was carried out for the DE miRNA, protein and mRNA lists via the Pathway Studio 11.3 Web interface using their Mammalian database. Differential expression (DE) profiling comparing the intestinal cell lines HT-29 and Caco-2 identified 1795 Genes, 168 Proteins and 160 miRNAs as DE between the two cell lines. At the gene level, 1084 genes were upregulated and 711 were downregulated in the Caco-2 cell line relative to the HT-29 cell line. At the protein level, 57 proteins were found to be upregulated and 111 downregulated in the Caco-2 cell line relative to the HT-29 cell line. Finally, at the miRNAs level, 104 were upregulated and 56 downregulated in the Caco-2 cell line relative to the HT-29 cell line. Gene ontology (GO) analysis of the DE mRNA identified cell adhesion, migration and ECM organization, cellular lipid

  8. Structure of the second RRM domain of Nrd1, a fission yeast MAPK target RNA binding protein, and implication for its RNA recognition and regulation

    Energy Technology Data Exchange (ETDEWEB)

    Kobayashi, Ayaho; Kanaba, Teppei [Graduate School of Science and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji 192-0397 (Japan); Satoh, Ryosuke [Institute of Microbial Chemistry, 3-14-23 Kamiosaki, Shinagawa-ku 141-0021, Tokyo (Japan); Fujiwara, Toshinobu [Institute of Microbial Chemistry, 3-14-23 Kamiosaki, Shinagawa-ku 141-0021, Tokyo (Japan); Graduate School of Pharmaceutical Sciences, Nagoya City University, 3-1 Tanabe-dori, Mizuho-ku,Nagoya 467-8603 (Japan); Ito, Yutaka [Graduate School of Science and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji 192-0397 (Japan); Sugiura, Reiko [Laboratory of Molecular Pharmacogenomics, School of Pharmaceutical Sciences, Kinki University, 3-4-1 Kowakae, Higashi-Osaka 577-8502 (Japan); Mishima, Masaki, E-mail: mishima-masaki@tmu.ac.jp [Graduate School of Science and Engineering, Tokyo Metropolitan University, Minamiosawa 1-1, Hachioji 192-0397 (Japan)

    2013-07-19

    Highlights: •Solution structure of the second RRM of Nrd1 was determined. •RNA binding site of the second RRM was estimated. •Regulatory mechanism of RNA binding by phosphorylation is discussed. -- Abstract: Negative regulator of differentiation 1 (Nrd1) is known as a negative regulator of sexual differentiation in fission yeast. Recently, it has been revealed that Nrd1 also regulates cytokinesis, in which physical separation of the cell is achieved by a contractile ring comprising many proteins including actin and myosin. Cdc4, a myosin II light chain, is known to be required for cytokinesis. Nrd1 binds and stabilizes Cdc4 mRNA, and thereby suppressing the cytokinesis defects of the cdc4 mutants. Interestingly, Pmk1 MAPK phosphorylates Nrd1, resulting in markedly reduced RNA binding activity. Furthermore, Nrd1 localizes to stress granules in response to various stresses, and Pmk1 phosphorylation enhances the localization. Nrd1 consists of four RRM domains, although the mechanism by which Pmk1 regulates the RNA binding activity of Nrd1 is unknown. In an effort to delineate the relationship between Nrd1 structure and function, we prepared each RNA binding domain of Nrd1 and examined RNA binding to chemically synthesized oligo RNA using NMR. The structure of the second RRM domain of Nrd1 was determined and the RNA binding site on the second RRM domain was mapped by NMR. A plausible mechanism pertaining to the regulation of RNA binding activity by phosphorylation is also discussed.

  9. Structure of the second RRM domain of Nrd1, a fission yeast MAPK target RNA binding protein, and implication for its RNA recognition and regulation

    International Nuclear Information System (INIS)

    Kobayashi, Ayaho; Kanaba, Teppei; Satoh, Ryosuke; Fujiwara, Toshinobu; Ito, Yutaka; Sugiura, Reiko; Mishima, Masaki

    2013-01-01

    Highlights: •Solution structure of the second RRM of Nrd1 was determined. •RNA binding site of the second RRM was estimated. •Regulatory mechanism of RNA binding by phosphorylation is discussed. -- Abstract: Negative regulator of differentiation 1 (Nrd1) is known as a negative regulator of sexual differentiation in fission yeast. Recently, it has been revealed that Nrd1 also regulates cytokinesis, in which physical separation of the cell is achieved by a contractile ring comprising many proteins including actin and myosin. Cdc4, a myosin II light chain, is known to be required for cytokinesis. Nrd1 binds and stabilizes Cdc4 mRNA, and thereby suppressing the cytokinesis defects of the cdc4 mutants. Interestingly, Pmk1 MAPK phosphorylates Nrd1, resulting in markedly reduced RNA binding activity. Furthermore, Nrd1 localizes to stress granules in response to various stresses, and Pmk1 phosphorylation enhances the localization. Nrd1 consists of four RRM domains, although the mechanism by which Pmk1 regulates the RNA binding activity of Nrd1 is unknown. In an effort to delineate the relationship between Nrd1 structure and function, we prepared each RNA binding domain of Nrd1 and examined RNA binding to chemically synthesized oligo RNA using NMR. The structure of the second RRM domain of Nrd1 was determined and the RNA binding site on the second RRM domain was mapped by NMR. A plausible mechanism pertaining to the regulation of RNA binding activity by phosphorylation is also discussed

  10. Prediction of proton chemical shifts in RNA - Their use in structure refinement and validation

    International Nuclear Information System (INIS)

    Cromsigt, Jenny A.M.T.C.; Hilbers, Cees W.; Wijmenga, Sybren S.

    2001-01-01

    An analysis is presented of experimental versus calculated chemical shifts of the non-exchangeable protons for 28 RNA structures deposited in the Protein Data Bank, covering a wide range of structural building blocks. We have used existing models for ring-current and magnetic-anisotropy contributions to calculate the proton chemical shifts from the structures. Two different parameter sets were tried: (i) parameters derived by Ribas-Prado and Giessner-Prettre (GP set) [(1981) J. Mol. Struct.,76, 81-92.]; (ii) parameters derived by Case [(1995) J. Biomol. NMR, 6, 341-346]. Both sets lead to similar results. The detailed analysis was carried using the GP set. The root-mean-square-deviation between the predicted and observed chemical shifts of the complete database is 0.16 ppm with a Pearson correlation coefficient of 0.79. For protons in the usually well-defined A-helix environment these numbers are, 0.08 ppm and 0.96, respectively. As a result of this good correspondence, a reliable analysis could be made of the structural dependencies of the 1 H chemical shifts revealing their physical origin. For example, a down-field shift of either H2' or H3' or both indicates a high-syn/syn χ-angle. In an A-helix it is essentially the 5'-neighbor that affects the chemical shifts of H5, H6 and H8 protons. The H5, H6 and H8 resonances can therefore be assigned in an A-helix on the basis of their observed chemical shifts. In general, the chemical shifts were found to be quite sensitive to structural changes. We therefore propose that a comparison between calculated and observed 1 H chemical shifts is a good tool for validation and refinement of structures derived from NOEs and J-couplings

  11. Terminal structures of West Nile virus genomic RNA and their interactions with viral NS5 protein

    International Nuclear Information System (INIS)

    Dong Hongping; Zhang Bo; Shi Peiyong

    2008-01-01

    Genome cyclization is essential for flavivirus replication. We used RNases to probe the structures formed by the 5'-terminal 190 nucleotides and the 3'-terminal 111 nucleotides of the West Nile virus (WNV) genomic RNA. When analyzed individually, the two RNAs adopt stem-loop structures as predicted by the thermodynamic-folding program. However, when mixed together, the two RNAs form a duplex that is mediated through base-pairings of two sets of RNA elements (5'CS/3'CSI and 5'UAR/3'UAR). Formation of the RNA duplex facilitates a conformational change that leaves the 3'-terminal nucleotides of the genome (position - 8 to - 16) to be single-stranded. Viral NS5 binds specifically to the 5'-terminal stem-loop (SL1) of the genomic RNA. The 5'SL1 RNA structure is essential for WNV replication. The study has provided further evidence to suggest that flavivirus genome cyclization and NS5/5'SL1 RNA interaction facilitate NS5 binding to the 3' end of the genome for the initiation of viral minus-strand RNA synthesis

  12. An analytical platform for mass spectrometry-based identification and chemical analysis of RNA in ribonucleoprotein complexes.

    Science.gov (United States)

    Taoka, Masato; Yamauchi, Yoshio; Nobe, Yuko; Masaki, Shunpei; Nakayama, Hiroshi; Ishikawa, Hideaki; Takahashi, Nobuhiro; Isobe, Toshiaki

    2009-11-01

    We describe here a mass spectrometry (MS)-based analytical platform of RNA, which combines direct nano-flow reversed-phase liquid chromatography (RPLC) on a spray tip column and a high-resolution LTQ-Orbitrap mass spectrometer. Operating RPLC under a very low flow rate with volatile solvents and MS in the negative mode, we could estimate highly accurate mass values sufficient to predict the nucleotide composition of a approximately 21-nucleotide small interfering RNA, detect post-transcriptional modifications in yeast tRNA, and perform collision-induced dissociation/tandem MS-based structural analysis of nucleolytic fragments of RNA at a sub-femtomole level. Importantly, the method allowed the identification and chemical analysis of small RNAs in ribonucleoprotein (RNP) complex, such as the pre-spliceosomal RNP complex, which was pulled down from cultured cells with a tagged protein cofactor as bait. We have recently developed a unique genome-oriented database search engine, Ariadne, which allows tandem MS-based identification of RNAs in biological samples. Thus, the method presented here has broad potential for automated analysis of RNA; it complements conventional molecular biology-based techniques and is particularly suited for simultaneous analysis of the composition, structure, interaction, and dynamics of RNA and protein components in various cellular RNP complexes.

  13. Tree decomposition based fast search of RNA structures including pseudoknots in genomes.

    Science.gov (United States)

    Song, Yinglei; Liu, Chunmei; Malmberg, Russell; Pan, Fangfang; Cai, Liming

    2005-01-01

    Searching genomes for RNA secondary structure with computational methods has become an important approach to the annotation of non-coding RNAs. However, due to the lack of efficient algorithms for accurate RNA structure-sequence alignment, computer programs capable of fast and effectively searching genomes for RNA secondary structures have not been available. In this paper, a novel RNA structure profiling model is introduced based on the notion of a conformational graph to specify the consensus structure of an RNA family. Tree decomposition yields a small tree width t for such conformation graphs (e.g., t = 2 for stem loops and only a slight increase for pseudo-knots). Within this modelling framework, the optimal alignment of a sequence to the structure model corresponds to finding a maximum valued isomorphic subgraph and consequently can be accomplished through dynamic programming on the tree decomposition of the conformational graph in time O(k(t)N(2)), where k is a small parameter; and N is the size of the projiled RNA structure. Experiments show that the application of the alignment algorithm to search in genomes yields the same search accuracy as methods based on a Covariance model with a significant reduction in computation time. In particular; very accurate searches of tmRNAs in bacteria genomes and of telomerase RNAs in yeast genomes can be accomplished in days, as opposed to months required by other methods. The tree decomposition based searching tool is free upon request and can be downloaded at our site h t t p ://w.uga.edu/RNA-informatics/software/index.php.

  14. Identification of RNA species in the RNA-toxin complex and structure of the complex in Clostridium botulinum type E.

    Science.gov (United States)

    Kitamura, Masaru

    2002-02-15

    Clostridium botulinum type E toxin was isolated in the form of a complex with RNA(s) from bacterial cells. Characterization of the complexed RNA remains to be elucidated. The RNA is identified here as ribosomal RNA (rRNA) having 23S and 16S components. The RNA-toxin complexes were found to be made up of three types with different molecular sizes. The three types of RNA-toxin complex are toxin bound to both the 23S and 16S rRNA, toxin bound to the 16S rRNA and a small amount of 23S rRNA, and toxin bound only to the 16S rRNA. ©2002 Elsevier Science (USA).

  15. The structure of the hypothetical protein smu.1377c from Streptococcus mutans suggests a role in tRNA modification

    International Nuclear Information System (INIS)

    Fu, Tian-Min; Liu, Xiang; Li, Lanfen; Su, Xiao-Dong

    2010-01-01

    The crystal structure of smu.1377c, a hypothetical protein from S. mutans, shows a similar fold to Sua5-YciO-YrdC-family proteins and indicates its functional role in tRNA modification. Members of the Sua5-YciO-YrdC protein family are found in both eukaryotes and prokaryotes and possess a conserved α/β twisted open-sheet fold. The Escherichia coli protein YrdC has been shown to be involved in modification of tRNA. The crystal structure of smu.1377c, a hypothetical protein from Streptococcus mutans, has been determined to 2.25 Å resolution. From structure analysis and comparison, it is shown that smu.1377c is a member of the Sua5-YciO-YrdC family and that it may play the same role as E. coli YrdC

  16. RNAmutants: a web server to explore the mutational landscape of RNA secondary structures

    Science.gov (United States)

    Waldispühl, Jerome; Devadas, Srinivas; Berger, Bonnie; Clote, Peter

    2009-01-01

    The history and mechanism of molecular evolution in DNA have been greatly elucidated by contributions from genetics, probability theory and bioinformatics—indeed, mathematical developments such as Kimura's neutral theory, Kingman's coalescent theory and efficient software such as BLAST, ClustalW, Phylip, etc., provide the foundation for modern population genetics. In contrast to DNA, the function of most noncoding RNA depends on tertiary structure, experimentally known to be largely determined by secondary structure, for which dynamic programming can efficiently compute the minimum free energy secondary structure. For this reason, understanding the effect of pointwise mutations in RNA secondary structure could reveal fundamental properties of structural RNA molecules and improve our understanding of molecular evolution of RNA. The web server RNAmutants provides several efficient tools to compute the ensemble of low-energy secondary structures for all k-mutants of a given RNA sequence, where k is bounded by a user-specified upper bound. As we have previously shown, these tools can be used to predict putative deleterious mutations and to analyze regulatory sequences from the hepatitis C and human immunodeficiency genomes. Web server is available at http://bioinformatics.bc.edu/clotelab/RNAmutants/, and downloadable binaries at http://rnamutants.csail.mit.edu/. PMID:19531740

  17. Structural Insights into RNA Recognition by the Alternate-Splicing Regulator CUG-Binding Protein 1

    Energy Technology Data Exchange (ETDEWEB)

    M Teplova; J Song; H Gaw; A Teplov; D Patel

    2011-12-31

    CUG-binding protein 1 (CUGBP1) regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of myotonic dystrophy. CUGBP1 harbors three RRM domains and preferentially targets UGU-rich mRNA elements. We describe crystal structures of CUGBP1 RRM1 and tandem RRM1/2 domains bound to RNAs containing tandem UGU(U/G) elements. Both RRM1 in RRM1-RNA and RRM2 in RRM1/2-RNA complexes use similar principles to target UGU(U/G) elements, with recognition mediated by face-to-edge stacking and water-mediated hydrogen-bonding networks. The UG step adopts a left-handed Z-RNA conformation, with the syn guanine recognized through Hoogsteen edge-protein backbone hydrogen-bonding interactions. NMR studies on the RRM1/2-RNA complex establish that both RRM domains target tandem UGUU motifs in solution, whereas filter-binding assays identify a preference for recognition of GU over AU or GC steps. We discuss the implications of CUGBP1-mediated targeting and sequestration of UGU(U/G) elements on pre-mRNA alternative-splicing regulation, translational regulation, and mRNA decay.

  18. Integrated Analysis of Long Noncoding RNA and Coding RNA Expression in Esophageal Squamous Cell Carcinoma

    Directory of Open Access Journals (Sweden)

    Wei Cao

    2013-01-01

    Full Text Available Tumorigenesis is a complex dynamic biological process that includes multiple steps of genetic and epigenetic alterations, aberrant expression of noncoding RNA, and changes in the expression profiles of coding genes. We call the collection of those perturbations in genome space the “cancer initiatome.” Long noncoding RNAs (lncRNAs are pervasively transcribed in the genome and they have key regulatory functions in chromatin remodeling and gene expression. Spatiotemporal variation in the expression of lncRNAs has been observed in development and disease states, including cancer. A few dysregulated lncRNAs have been studied in cancers, but the role of lncRNAs in the cancer initiatome remains largely unknown, especially in esophageal squamous cell carcinoma (ESCC. We conducted a genome-wide screen of the expression of lncRNAs and coding RNAs from ESCC and matched adjacent nonneoplastic normal tissues. We identified differentially expressed lncRNAs and coding RNAs in ESCC relative to their matched normal tissue counterparts and validated the result using polymerase chain reaction analysis. Furthermore, we identified differentially expressed lncRNAs that are co-located and co-expressed with differentially expressed coding RNAs in ESCC and the results point to a potential interaction between lncRNAs and neighboring coding genes that affect ether lipid metabolism, and the interaction may contribute to the development of ESCC. These data provide compelling evidence for a potential novel genomic biomarker of esophageal squamous cell cancer.

  19. nRC: non-coding RNA Classifier based on structural features.

    Science.gov (United States)

    Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

    2017-01-01

    Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.

  20. Structure and Dynamics of RNA Repeat Expansions That Cause Huntington's Disease and Myotonic Dystrophy Type 1.

    Science.gov (United States)

    Chen, Jonathan L; VanEtten, Damian M; Fountain, Matthew A; Yildirim, Ilyas; Disney, Matthew D

    2017-07-11

    RNA repeat expansions cause a host of incurable, genetically defined diseases. The most common class of RNA repeats consists of trinucleotide repeats. These long, repeating transcripts fold into hairpins containing 1 × 1 internal loops that can mediate disease via a variety of mechanism(s) in which RNA is the central player. Two of these disorders are Huntington's disease and myotonic dystrophy type 1, which are caused by r(CAG) and r(CUG) repeats, respectively. We report the structures of two RNA constructs containing three copies of a r(CAG) [r(3×CAG)] or r(CUG) [r(3×CUG)] motif that were modeled with nuclear magnetic resonance spectroscopy and simulated annealing with restrained molecular dynamics. The 1 × 1 internal loops of r(3×CAG) are stabilized by one-hydrogen bond (cis Watson-Crick/Watson-Crick) AA pairs, while those of r(3×CUG) prefer one- or two-hydrogen bond (cis Watson-Crick/Watson-Crick) UU pairs. Assigned chemical shifts for the residues depended on the identity of neighbors or next nearest neighbors. Additional insights into the dynamics of these RNA constructs were gained by molecular dynamics simulations and a discrete path sampling method. Results indicate that the global structures of the RNA are A-form and that the loop regions are dynamic. The results will be useful for understanding the dynamic trajectory of these RNA repeats but also may aid in the development of therapeutics.

  1. GC content around splice sites affects splicing through pre-mRNA secondary structures

    Directory of Open Access Journals (Sweden)

    Chen Liang

    2011-01-01

    Full Text Available Abstract Background Alternative splicing increases protein diversity by generating multiple transcript isoforms from a single gene through different combinations of exons or through different selections of splice sites. It has been reported that RNA secondary structures are involved in alternative splicing. Here we perform a genomic study of RNA secondary structures around splice sites in humans (Homo sapiens, mice (Mus musculus, fruit flies (Drosophila melanogaster, and nematodes (Caenorhabditis elegans to further investigate this phenomenon. Results We observe that GC content around splice sites is closely associated with the splice site usage in multiple species. RNA secondary structure is the possible explanation, because the structural stability difference among alternative splice sites, constitutive splice sites, and skipped splice sites can be explained by the GC content difference. Alternative splice sites tend to be GC-enriched and exhibit more stable RNA secondary structures in all of the considered species. In humans and mice, splice sites of first exons and long exons tend to be GC-enriched and hence form more stable structures, indicating the special role of RNA secondary structures in promoter proximal splicing events and the splicing of long exons. In addition, GC-enriched exon-intron junctions tend to be overrepresented in tissue-specific alternative splice sites, indicating the functional consequence of the GC effect. Compared with regions far from splice sites and decoy splice sites, real splice sites are GC-enriched. We also found that the GC-content effect is much stronger than the nucleotide-order effect to form stable secondary structures. Conclusion All of these results indicate that GC content is related to splice site usage and it may mediate the splicing process through RNA secondary structures.

  2. Initiation of translation in bacteria by a structured eukaryotic IRES RNA.

    Science.gov (United States)

    Colussi, Timothy M; Costantino, David A; Zhu, Jianyu; Donohue, John Paul; Korostelev, Andrei A; Jaafar, Zane A; Plank, Terra-Dawn M; Noller, Harry F; Kieft, Jeffrey S

    2015-03-05

    The central dogma of gene expression (DNA to RNA to protein) is universal, but in different domains of life there are fundamental mechanistic differences within this pathway. For example, the canonical molecular signals used to initiate protein synthesis in bacteria and eukaryotes are mutually exclusive. However, the core structures and conformational dynamics of ribosomes that are responsible for the translation steps that take place after initiation are ancient and conserved across the domains of life. We wanted to explore whether an undiscovered RNA-based signal might be able to use these conserved features, bypassing mechanisms specific to each domain of life, and initiate protein synthesis in both bacteria and eukaryotes. Although structured internal ribosome entry site (IRES) RNAs can manipulate ribosomes to initiate translation in eukaryotic cells, an analogous RNA structure-based mechanism has not been observed in bacteria. Here we report our discovery that a eukaryotic viral IRES can initiate translation in live bacteria. We solved the crystal structure of this IRES bound to a bacterial ribosome to 3.8 Å resolution, revealing that despite differences between bacterial and eukaryotic ribosomes this IRES binds directly to both and occupies the space normally used by transfer RNAs. Initiation in both bacteria and eukaryotes depends on the structure of the IRES RNA, but in bacteria this RNA uses a different mechanism that includes a form of ribosome repositioning after initial recruitment. This IRES RNA bridges billions of years of evolutionary divergence and provides an example of an RNA structure-based translation initiation signal capable of operating in two domains of life.

  3. NMR-study of dynamic structural transtions in RNA-molecules

    OpenAIRE

    Fürtig, Boris

    2007-01-01

    The following thesis is concerned with the elucidation of structural changes of RNA molecules during the time course of dynamic processes that are commonly denoted as folding reactions. In contrast to the field of protein folding, the concept of RNA folding comprises not only folding reactions itself but also refolding- or conformational switching- and assembly processes (see chapter III). The method in this thesis to monitor these diverse processes is high resolution liquid-state NMR spectro...

  4. An evolutionary model for protein-coding regions with conserved RNA structure

    DEFF Research Database (Denmark)

    Pedersen, Jakob Skou; Forsberg, Roald; Meyer, Irmtraud Margret

    2004-01-01

    in the RNA structure. The overlap of these fundamental dependencies is sufficient to cause "contagious" context dependencies which cascade across many nucleotide sites. Such large-scale dependencies challenge the use of traditional phylogenetic models in evolutionary inference because they explicitly assume...... components of traditional phylogenetic models. We applied this to a data set of full-genome sequences from the hepatitis C virus where five RNA structures are mapped within the coding region. This allowed us to partition the effects of selection on different structural elements and to test various hypotheses......Here we present a model of nucleotide substitution in protein-coding regions that also encode the formation of conserved RNA structures. In such regions, apparent evolutionary context dependencies exist, both between nucleotides occupying the same codon and between nucleotides forming a base pair...

  5. Structures of two exonucleases involved in controlled RNA turnover in yeast

    DEFF Research Database (Denmark)

    Jonstrup, Anette Thyssen; Midtgaard, Søren Fuglsang; Van, Lan Bich

    divalent cations. The Pop2p structure reveals that the ability of this enzyme to degrade poly(A)/(U)/(C), but not poly(G) may be determined by structural hindrance of interaction with this specific nucleotide. In Rrp6p, mutations known to confer specific RNA degradation phenotypes in yeast nuclei can now...... rid of aberrant RNAs. Here we describe the structures of two 3'-5' exonucleases involved in controlled RNA decay in yeast, Pop2p and Rrp6p. Rrp6p is associated with the nuclear exosome where it participates in the degradation of improperly processed precursor mRNAs and trimming of stable RNAs [1]. Pop...

  6. Structure and function of initiator methionine tRNA from the mitochondria of Neurospora crassa

    International Nuclear Information System (INIS)

    Heckman, J.E.; Hecker, L.I.; Schwartzbach, S.D.; Barnett, W.E.; Baumstark, B.; RajBhandary, U.L.

    1978-01-01

    Initiator methionine tRNA from the mitochondria of Neurospora crassa has been purified and sequenced. This mitochondrial tRNA can be aminoacylated and formylated by E. coli enzymes, and is capable of initiating protein synthesis in E. coli extracts. The nucleotide composition of the mitochondrial initiator tRNA (the first mitochondrial tRNA subjected to sequence analysis) is very rich in A + U, like that reported for total mitochondrial tRNA. In two of the unique features which differentiate procaryotic from eucaryotic cytoplasmic initiator tRNAs, the mitochondrial tRNA appears to resemble the eucaryotic initiator tRNAs. Thus unlike procaryotic initiator tRNAs in which the 5' terminal nucleotide cannot form a Watson-Crick base pair to the fifth nucleotide from 3' end, the mitochondrial tRNA can form such a base pair; and like the eucaryotic cytoplasmic initiator tRNAs, the mitochondrial initiator tRNA lacks the sequence - T psiCG(or A) in loop IV. The corresponding sequence in the mitochondrial tRNA, however, is -UGCA- and not -AU(or psi)CG- as found in all eucaryotic cytoplasmic initiator tRNAs. In spite of some similarity of the mitochondrial initiator tRNA to both eucaryotic and procaryotic initiator tRNAs, the mitochondrial initiator tRNA is basically different from both these tRNAs. Between these two classes of initiator tRNAs, however, it is more homologous in sequence to procaryotic (56 to 60%) than to eucaryotic cytoplasmic initiator tRNAs

  7. Integration analysis of microRNA and mRNA paired expression profiling identifies deregulated microRNA-transcription factor-gene regulatory networks in ovarian endometriosis.

    Science.gov (United States)

    Zhao, Luyang; Gu, Chenglei; Ye, Mingxia; Zhang, Zhe; Li, Li'an; Fan, Wensheng; Meng, Yuanguang

    2018-01-22

    The etiology and pathophysiology of endometriosis remain unclear. Accumulating evidence suggests that aberrant microRNA (miRNA) and transcription factor (TF) expression may be involved in the pathogenesis and development of endometriosis. This study therefore aims to survey the key miRNAs, TFs and genes and further understand the mechanism of endometriosis. Paired expression profiling of miRNA and mRNA in ectopic endometria compared with eutopic endometria were determined by high-throughput sequencing techniques in eight patients with ovarian endometriosis. Binary interactions and circuits among the miRNAs, TFs, and corresponding genes were identified by the Pearson correlation coefficients. miRNA-TF-gene regulatory networks were constructed using bioinformatic methods. Eleven selected miRNAs and TFs were validated by quantitative reverse transcription-polymerase chain reaction in 22 patients. Overall, 107 differentially expressed miRNAs and 6112 differentially expressed mRNAs were identified by comparing the sequencing of the ectopic endometrium group and the eutopic endometrium group. The miRNA-TF-gene regulatory network consists of 22 miRNAs, 12 TFs and 430 corresponding genes. Specifically, some key regulators from the miR-449 and miR-34b/c cluster, miR-200 family, miR-106a-363 cluster, miR-182/183, FOX family, GATA family, and E2F family as well as CEBPA, SOX9 and HNF4A were suggested to play vital regulatory roles in the pathogenesis of endometriosis. Integration analysis of the miRNA and mRNA expression profiles presents a unique insight into the regulatory network of this enigmatic disorder and possibly provides clues regarding replacement therapy for endometriosis.

  8. Prediction of RNA secondary structures: from theory to models and real molecules

    International Nuclear Information System (INIS)

    Schuster, Peter

    2006-01-01

    RNA secondary structures are derived from RNA sequences, which are strings built form the natural four letter nucleotide alphabet, {AUGC}. These coarse-grained structures, in turn, are tantamount to constrained strings over a three letter alphabet. Hence, the secondary structures are discrete objects and the number of sequences always exceeds the number of structures. The sequences built from two letter alphabets form perfect structures when the nucleotides can form a base pair, as is the case with {GC} or {AU}, but the relation between the sequences and structures differs strongly from the four letter alphabet. A comprehensive theory of RNA structure is presented, which is based on the concepts of sequence space and shape space, being a space of structures. It sets the stage for modelling processes in ensembles of RNA molecules like evolutionary optimization or kinetic folding as dynamical phenomena guided by mappings between the two spaces. The number of minimum free energy (mfe) structures is always smaller than the number of sequences, even for two letter alphabets. Folding of RNA molecules into mfe energy structures constitutes a non-invertible mapping from sequence space onto shape space. The preimage of a structure in sequence space is defined as its neutral network. Similarly the set of suboptimal structures is the preimage of a sequence in shape space. This set represents the conformation space of a given sequence. The evolutionary optimization of structures in populations is a process taking place in sequence space, whereas kinetic folding occurs in molecular ensembles that optimize free energy in conformation space. Efficient folding algorithms based on dynamic programming are available for the prediction of secondary structures for given sequences. The inverse problem, the computation of sequences for predefined structures, is an important tool for the design of RNA molecules with tailored properties. Simultaneous folding or cofolding of two or more RNA

  9. 3dRPC: a web server for 3D RNA-protein structure prediction.

    Science.gov (United States)

    Huang, Yangyu; Li, Haotian; Xiao, Yi

    2018-04-01

    RNA-protein interactions occur in many biological processes. To understand the mechanism of these interactions one needs to know three-dimensional (3D) structures of RNA-protein complexes. 3dRPC is an algorithm for prediction of 3D RNA-protein complex structures and consists of a docking algorithm RPDOCK and a scoring function 3dRPC-Score. RPDOCK is used to sample possible complex conformations of an RNA and a protein by calculating the geometric and electrostatic complementarities and stacking interactions at the RNA-protein interface according to the features of atom packing of the interface. 3dRPC-Score is a knowledge-based potential that uses the conformations of nucleotide-amino-acid pairs as statistical variables and that is used to choose the near-native complex-conformations obtained from the docking method above. Recently, we built a web server for 3dRPC. The users can easily use 3dRPC without installing it locally. RNA and protein structures in PDB (Protein Data Bank) format are the only needed input files. It can also incorporate the information of interface residues or residue-pairs obtained from experiments or theoretical predictions to improve the prediction. The address of 3dRPC web server is http://biophy.hust.edu.cn/3dRPC. yxiao@hust.edu.cn.

  10. Bi-objective integer programming for RNA secondary structure prediction with pseudoknots.

    Science.gov (United States)

    Legendre, Audrey; Angel, Eric; Tahi, Fariza

    2018-01-15

    RNA structure prediction is an important field in bioinformatics, and numerous methods and tools have been proposed. Pseudoknots are specific motifs of RNA secondary structures that are difficult to predict. Almost all existing methods are based on a single model and return one solution, often missing the real structure. An alternative approach would be to combine different models and return a (small) set of solutions, maximizing its quality and diversity in order to increase the probability that it contains the real structure. We propose here an original method for predicting RNA secondary structures with pseudoknots, based on integer programming. We developed a generic bi-objective integer programming algorithm allowing to return optimal and sub-optimal solutions optimizing simultaneously two models. This algorithm was then applied to the combination of two known models of RNA secondary structure prediction, namely MEA and MFE. The resulting tool, called BiokoP, is compared with the other methods in the literature. The results show that the best solution (structure with the highest F 1 -score) is, in most cases, given by BiokoP. Moreover, the results of BiokoP are homogeneous, regardless of the pseudoknot type or the presence or not of pseudoknots. Indeed, the F 1 -scores are always higher than 70% for any number of solutions returned. The results obtained by BiokoP show that combining the MEA and the MFE models, as well as returning several optimal and several sub-optimal solutions, allow to improve the prediction of secondary structures. One perspective of our work is to combine better mono-criterion models, in particular to combine a model based on the comparative approach with the MEA and the MFE models. This leads to develop in the future a new multi-objective algorithm to combine more than two models. BiokoP is available on the EvryRNA platform: https://EvryRNA.ibisc.univ-evry.fr .

  11. Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

    Directory of Open Access Journals (Sweden)

    Masfique Mehedi

    Full Text Available Ebolavirus (EBOV, the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.

  12. Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

    Science.gov (United States)

    Mehedi, Masfique; Hoenen, Thomas; Robertson, Shelly; Ricklefs, Stacy; Dolan, Michael A; Taylor, Travis; Falzarano, Darryl; Ebihara, Hideki; Porcella, Stephen F; Feldmann, Heinz

    2013-01-01

    Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.

  13. RNA Structural Dynamics As Captured by Molecular Simulations: A Comprehensive Overview

    Science.gov (United States)

    2018-01-01

    With both catalytic and genetic functions, ribonucleic acid (RNA) is perhaps the most pluripotent chemical species in molecular biology, and its functions are intimately linked to its structure and dynamics. Computer simulations, and in particular atomistic molecular dynamics (MD), allow structural dynamics of biomolecular systems to be investigated with unprecedented temporal and spatial resolution. We here provide a comprehensive overview of the fast-developing field of MD simulations of RNA molecules. We begin with an in-depth, evaluatory coverage of the most fundamental methodological challenges that set the basis for the future development of the field, in particular, the current developments and inherent physical limitations of the atomistic force fields and the recent advances in a broad spectrum of enhanced sampling methods. We also survey the closely related field of coarse-grained modeling of RNA systems. After dealing with the methodological aspects, we provide an exhaustive overview of the available RNA simulation literature, ranging from studies of the smallest RNA oligonucleotides to investigations of the entire ribosome. Our review encompasses tetranucleotides, tetraloops, a number of small RNA motifs, A-helix RNA, kissing-loop complexes, the TAR RNA element, the decoding center and other important regions of the ribosome, as well as assorted others systems. Extended sections are devoted to RNA–ion interactions, ribozymes, riboswitches, and protein/RNA complexes. Our overview is written for as broad of an audience as possible, aiming to provide a much-needed interdisciplinary bridge between computation and experiment, together with a perspective on the future of the field. PMID:29297679

  14. Mathematical and Biological Modelling of RNA Secondary Structure and Its Effects on Gene Expression

    Directory of Open Access Journals (Sweden)

    T. A. Hughes

    2006-01-01

    Full Text Available Secondary structures within the 5′ untranslated regions of messenger RNAs can have profound effects on the efficiency of translation of their messages and thereby on gene expression. Consequently they can act as important regulatory motifs in both physiological and pathological settings. Current approaches to predicting the secondary structure of these RNA sequences find the structure with the global-minimum free energy. However, since RNA folds progressively from the 5′ end when synthesised or released from the translational machinery, this may not be the most probable structure. We discuss secondary structure prediction based on local-minimisation of free energy with thermodynamic fluctuations as nucleotides are added to the 3′ end and show that these can result in different secondary structures. We also discuss approaches for studying the extent of the translational inhibition specified by structures within the 5′ untranslated region.

  15. MicroRNA Polymorphisms in Cancer: A Literature Analysis

    International Nuclear Information System (INIS)

    Pipan, Veronika; Zorc, Minja; Kunej, Tanja

    2015-01-01

    Single nucleotide polymorphisms (SNPs) located in microRNA (miRNA) genes (miR-SNPs) have attracted increasing attention in recent years due to their involvement in the development of various types of cancer. Therefore, a systematic review on this topic was needed. From 55 scientific publications we collected 20 SNPs, which are located within 18 miRNA encoding genes and have been associated with 16 types of cancer. Among 20 miRNA gene polymorphisms 13 are located within the premature miRNA region, five within mature, and two within mature seed miRNA region. We graphically visualized a network of miRNA-cancer associations which revealed miRNA genes and cancer types with the highest number of connections. Our study showed that, despite a large number of variations currently known to be located within miRNA genes in humans, most of them have not yet been tested for association with cancer. MicroRNA SNPs collected in this study represent only 0.43% of known miRNA gene variations (20/4687). Results of the present study will be useful to researchers investigating the clinical use of miRNAs, such as the roles of miRNAs as diagnostic markers and therapeutic targets

  16. MicroRNA Polymorphisms in Cancer: A Literature Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Pipan, Veronika; Zorc, Minja; Kunej, Tanja, E-mail: tanja.kunej@bf.uni-lj.si [Department of Animal Science, Biotechnical Faculty, University of Ljubljana, Groblje 3, SI-1230 Domzale (Slovenia)

    2015-09-09

    Single nucleotide polymorphisms (SNPs) located in microRNA (miRNA) genes (miR-SNPs) have attracted increasing attention in recent years due to their involvement in the development of various types of cancer. Therefore, a systematic review on this topic was needed. From 55 scientific publications we collected 20 SNPs, which are located within 18 miRNA encoding genes and have been associated with 16 types of cancer. Among 20 miRNA gene polymorphisms 13 are located within the premature miRNA region, five within mature, and two within mature seed miRNA region. We graphically visualized a network of miRNA-cancer associations which revealed miRNA genes and cancer types with the highest number of connections. Our study showed that, despite a large number of variations currently known to be located within miRNA genes in humans, most of them have not yet been tested for association with cancer. MicroRNA SNPs collected in this study represent only 0.43% of known miRNA gene variations (20/4687). Results of the present study will be useful to researchers investigating the clinical use of miRNAs, such as the roles of miRNAs as diagnostic markers and therapeutic targets.

  17. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components.

    Science.gov (United States)

    Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

    2015-09-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  18. Construction and analysis of lncRNA-lncRNA synergistic networks to reveal clinically relevant lncRNAs in cancer.

    Science.gov (United States)

    Li, Yongsheng; Chen, Juan; Zhang, Jinwen; Wang, Zishan; Shao, Tingting; Jiang, Chunjie; Xu, Juan; Li, Xia

    2015-09-22

    Long non-coding RNAs (lncRNAs) play key roles in diverse biological processes. Moreover, the development and progression of cancer often involves the combined actions of several lncRNAs. Here we propose a multi-step method for constructing lncRNA-lncRNA functional synergistic networks (LFSNs) through co-regulation of functional modules having three features: common coexpressed genes of lncRNA pairs, enrichment in the same functional category and close proximity within protein interaction networks. Applied to three cancers, we constructed cancer-specific LFSNs and found that they exhibit a scale free and modular architecture. In addition, cancer-associated lncRNAs tend to be hubs and are enriched within modules. Although there is little synergistic pairing of lncRNAs across cancers, lncRNA pairs involved in the same cancer hallmarks by regulating same or different biological processes. Finally, we identify prognostic biomarkers within cancer lncRNA expression datasets using modules derived from LFSNs. In summary, this proof-of-principle study indicates synergistic lncRNA pairs can be identified through integrative analysis of genome-wide expression data sets and functional information.

  19. General enumeration of RNA secondary structures based on new ...

    African Journals Online (AJOL)

    Crick base pairs between AU and GC. Based on the new representation, this paper also computes the number of various types of constrained secondary structures taking the minimum stack length 1 and minimum size m for each bonding loop as ...

  20. sRNAnalyzer-a flexible and customizable small RNA sequencing data analysis pipeline.

    Science.gov (United States)

    Wu, Xiaogang; Kim, Taek-Kyun; Baxter, David; Scherler, Kelsey; Gordon, Aaron; Fong, Olivia; Etheridge, Alton; Galas, David J; Wang, Kai

    2017-12-01

    Although many tools have been developed to analyze small RNA sequencing (sRNA-Seq) data, it remains challenging to accurately analyze the small RNA population, mainly due to multiple sequence ID assignment caused by short read length. Additional issues in small RNA analysis include low consistency of microRNA (miRNA) measurement results across different platforms, miRNA mapping associated with miRNA sequence variation (isomiR) and RNA editing, and the origin of those unmapped reads after screening against all endogenous reference sequence databases. To address these issues, we built a comprehensive and customizable sRNA-Seq data analysis pipeline-sRNAnalyzer, which enables: (i) comprehensive miRNA profiling strategies to better handle isomiRs and summarization based on each nucleotide position to detect potential SNPs in miRNAs, (ii) different sequence mapping result assignment approaches to simulate results from microarray/qRT-PCR platforms and a local probabilistic model to assign mapping results to the most-likely IDs, (iii) comprehensive ribosomal RNA filtering for accurate mapping of exogenous RNAs and summarization based on taxonomy annotation. We evaluated our pipeline on both artificial samples (including synthetic miRNA and Escherichia coli cultures) and biological samples (human tissue and plasma). sRNAnalyzer is implemented in Perl and available at: http://srnanalyzer.systemsbiology.net/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  1. sRNAnalyzer—a flexible and customizable small RNA sequencing data analysis pipeline

    Science.gov (United States)

    Kim, Taek-Kyun; Baxter, David; Scherler, Kelsey; Gordon, Aaron; Fong, Olivia; Etheridge, Alton; Galas, David J.

    2017-01-01

    Abstract Although many tools have been developed to analyze small RNA sequencing (sRNA-Seq) data, it remains challenging to accurately analyze the small RNA population, mainly due to multiple sequence ID assignment caused by short read length. Additional issues in small RNA analysis include low consistency of microRNA (miRNA) measurement results across different platforms, miRNA mapping associated with miRNA sequence variation (isomiR) and RNA editing, and the origin of those unmapped reads after screening against all endogenous reference sequence databases. To address these issues, we built a comprehensive and customizable sRNA-Seq data analysis pipeline—sRNAnalyzer, which enables: (i) comprehensive miRNA profiling strategies to better handle isomiRs and summarization based on each nucleotide position to detect potential SNPs in miRNAs, (ii) different sequence mapping result assignment approaches to simulate results from microarray/qRT-PCR platforms and a local probabilistic model to assign mapping results to the most-likely IDs, (iii) comprehensive ribosomal RNA filtering for accurate mapping of exogenous RNAs and summarization based on taxonomy annotation. We evaluated our pipeline on both artificial samples (including synthetic miRNA and Escherichia coli cultures) and biological samples (human tissue and plasma). sRNAnalyzer is implemented in Perl and available at: http://srnanalyzer.systemsbiology.net/. PMID:29069500

  2. Informatics for RNA Sequencing: A Web Resource for Analysis on the Cloud.

    Directory of Open Access Journals (Sweden)

    Malachi Griffith

    2015-08-01

    Full Text Available Massively parallel RNA sequencing (RNA-seq has rapidly become the assay of choice for interrogating RNA transcript abundance and diversity. This article provides a detailed introduction to fundamental RNA-seq molecular biology and informatics concepts. We make available open-access RNA-seq tutorials that cover cloud computing, tool installation, relevant file formats, reference genomes, transcriptome annotations, quality-control strategies, expression, differential expression, and alternative splicing analysis methods. These tutorials and additional training resources are accompanied by complete analysis pipelines and test datasets made available without encumbrance at www.rnaseq.wiki.

  3. A 3'-end structure in RNA2 of a crinivirus is essential for viral RNA synthesis and contributes to replication-associated translation activity.

    Science.gov (United States)

    Mongkolsiriwattana, Chawin; Zhou, Jaclyn S; Ng, James C K

    2016-10-03

    The terminal ends in the genome of RNA viruses contain features that regulate viral replication and/or translation. We have identified a Y-shaped structure (YSS) in the 3' terminal regions of the bipartite genome of Lettuce chlorosis virus (LCV), a member in the genus Crinivirus (family Closteroviridae). The YSS is the first in this family of viruses to be determined using Selective 2'-Hydroxyl Acylation Analyzed by Primer Extension (SHAPE). Using luciferase constructs/replicons, in vivo and in vitro assays showed that the 5' and YSS-containing 3' terminal regions of LCV RNA1 supported translation activity. In contrast, similar regions from LCV RNA2, including those upstream of the YSS, did not. LCV RNA2 mutants with nucleotide deletions or replacements that affected the YSS were replication deficient. In addition, the YSS of LCV RNA1 and RNA2 were interchangeable without affecting viral RNA synthesis. Translation and significant replication were observed for specific LCV RNA2 replicons only in the presence of LCV RNA1, but both processes were impaired when the YSS and/or its upstream region were incomplete or altered. These results are evidence that the YSS is essential to the viral replication machinery, and contributes to replication enhancement and replication-associated translation activity in the RNA2 replicons.

  4. Nuclei of aged myofibres undergo structural and functional changes suggesting impairment in RNA processing

    Directory of Open Access Journals (Sweden)

    C Pellicciari

    2009-06-01

    Full Text Available Advancing adult age is associated with a progressive decrease in skeletal muscle mass, strength and quality known as sarcopenia. The mechanisms underlying age-related skeletal muscle wasting and weakness are manifold and still remain to be fully elucidated. Despite the increasing evidence that the progress of muscle diseases leading to muscle atrophy/dystrophy may be related to defective RNA processing, no data on the morpho-functional features of skeletal muscle nuclei in sarcopenia are available at present. In this view, we have investigated, by combining morphometry and immunocytochemistry at light and electron microscopy, the fine structure of myonuclei as well as the distribution and amount of RNA processing factors in skeletal myofibres of biceps brachii and quadriceps femoris from adult and old rats. Results demonstrate that the myonuclei of aged type II fibres show an increased amount of condensed chromatin and lower amounts of phosphorylated polymerase II and DNA/RNA hybrid molecules, clearly indicating a decrease in pre-mRNA transcription rate compared to adult animals. In addition, myonuclei of aged fibres show decreased amounts of nucleoplasmic splicing factors and an accumulation of cleavage factors, polyadenilated RNA and perichromatin granules, suggesting a reduction in the processing and transport rate of premRNA. During ageing, it seems therefore that in rat myonuclei the entire production chain of mRNA, from synthesis to cytoplasmic export, is less efficient. This failure likely contributes to the reduced responsiveness of muscle cells to anabolic stimuli in the elderly.

  5. Structural insights into mechanisms of the small RNA methyltransferase HEN1

    Energy Technology Data Exchange (ETDEWEB)

    Huang, Ying; Ji, Lijuan; Huang, Qichen; Vassylyev, Dmitry G.; Chen, Xuemei; Ma, Jin-Biao; (UAB); (UCR)

    2010-02-22

    RNA silencing is a conserved regulatory mechanism in fungi, plants and animals that regulates gene expression and defence against viruses and transgenes. Small silencing RNAs of {approx}20-30 nucleotides and their associated effector proteins, the Argonaute family proteins, are the central components in RNA silencing. A subset of small RNAs, such as microRNAs and small interfering RNAs (siRNAs) in plants, Piwi-interacting RNAs in animals and siRNAs in Drosophila, requires an additional crucial step for their maturation; that is, 2'-O-methylation on the 3' terminal nucleotide. A conserved S-adenosyl-L-methionine-dependent RNA methyltransferase, HUA ENHANCER 1 (HEN1), and its homologues are responsible for this specific modification. Here we report the 3.1 {angstrom} crystal structure of full-length HEN1 from Arabidopsis in complex with a 22-nucleotide small RNA duplex and cofactor product S-adenosyl-L-homocysteine. Highly cooperative recognition of the small RNA substrate by multiple RNA binding domains and the methyltransferase domain in HEN1 measures the length of the RNA duplex and determines the substrate specificity. Metal ion coordination by both 2' and 3' hydroxyls on the 3'-terminal nucleotide and four invariant residues in the active site of the methyltransferase domain suggests a novel Mg{sup 2+}-dependent 2'-O-methylation mechanism.

  6. Structure determination of an 11-subunit exosome in complex with RNA by molecular replacement

    International Nuclear Information System (INIS)

    Makino, Debora Lika; Conti, Elena

    2013-01-01

    The crystallographic steps towards the structure determination of a complete eukaryotic exosome complex bound to RNA are presented. Phasing of this 11-protein subunit complex was carried out via molecular replacement. The RNA exosome is an evolutionarily conserved multi-protein complex involved in the 3′ degradation of a variety of RNA transcripts. In the nucleus, the exosome participates in the maturation of structured RNAs, in the surveillance of pre-mRNAs and in the decay of a variety of noncoding transcripts. In the cytoplasm, the exosome degrades mRNAs in constitutive and regulated turnover pathways. Several structures of subcomplexes of eukaryotic exosomes or related prokaryotic exosome-like complexes are known, but how the complete assembly is organized to fulfil processive RNA degradation has been unclear. An atomic snapshot of a Saccharomyces cerevisiae 420 kDa exosome complex bound to an RNA substrate in the pre-cleavage state of a hydrolytic reaction has been determined. Here, the crystallographic steps towards the structural elucidation, which was carried out by molecular replacement, are presented

  7. Structure of the prolyl-tRNA synthetase from the eukaryotic pathogen Giardia lamblia

    Energy Technology Data Exchange (ETDEWEB)

    Larson, Eric T.; Kim, Jessica E.; Napuli, Alberto J.; Verlinde, Christophe L. M. J.; Fan, Erkang; Zucker, Frank H.; Van Voorhis, Wesley C.; Buckner, Frederick S.; Hol, Wim G. J.; Merritt, Ethan A., E-mail: merritt@u.washington.edu [Medical Structural Genomics of Pathogenic Protozoa, (United States); University of Washington, Seattle, WA 98195 (United States)

    2012-09-01

    The structure of Giardia prolyl-tRNA synthetase cocrystallized with proline and ATP shows evidence for half-of-the-sites activity, leading to a corresponding mixture of reaction substrates and product (prolyl-AMP) in the two active sites of the dimer. The genome of the human intestinal parasite Giardia lamblia contains only a single aminoacyl-tRNA synthetase gene for each amino acid. The Giardia prolyl-tRNA synthetase gene product was originally misidentified as a dual-specificity Pro/Cys enzyme, in part owing to its unexpectedly high off-target activation of cysteine, but is now believed to be a normal representative of the class of archaeal/eukaryotic prolyl-tRNA synthetases. The 2.2 Å resolution crystal structure of the G. lamblia enzyme presented here is thus the first structure determination of a prolyl-tRNA synthetase from a eukaryote. The relative occupancies of substrate (proline) and product (prolyl-AMP) in the active site are consistent with half-of-the-sites reactivity, as is the observed biphasic thermal denaturation curve for the protein in the presence of proline and MgATP. However, no corresponding induced asymmetry is evident in the structure of the protein. No thermal stabilization is observed in the presence of cysteine and ATP. The implied low affinity for the off-target activation product cysteinyl-AMP suggests that translational fidelity in Giardia is aided by the rapid release of misactivated cysteine.

  8. Rational Modular RNA Engineering Based on In Vivo Profiling of Structural Accessibility.

    Science.gov (United States)

    Leistra, Abigail N; Amador, Paul; Buvanendiran, Aishwarya; Moon-Walker, Alex; Contreras, Lydia M

    2017-12-15

    Bacterial small RNAs (sRNAs) have been established as powerful parts for controlling gene expression. However, development and application of engineered sRNAs has primarily focused on regulating novel synthetic targets. In this work, we demonstrate a rational modular RNA engineering approach that uses in vivo structural accessibility measurements to tune the regulatory activity of a multisubstrate sRNA for differential control of its native target network. Employing the CsrB global sRNA regulator as a model system, we use published in vivo structural accessibility data to infer the contribution of its local structures (substructures) to function and select a subset for engineering. We then modularly recombine the selected substructures, differentially representing those of presumed high or low functional contribution, to build a library of 21 CsrB variants. Using fluorescent translational reporter assays, we demonstrate that the CsrB variants achieve a 5-fold gradient of control of well-characterized Csr network targets. Interestingly, results suggest that less conserved local structures within long, multisubstrate sRNAs may represent better targets for rational engineering than their well-conserved counterparts. Lastly, mapping the impact of sRNA variants on a signature Csr network phenotype indicates the potential of this approach for tuning the activity of global sRNA regulators in the context of metabolic engineering applications.

  9. Nonlinear Structural Analysis

    Indian Academy of Sciences (India)

    The Structures Panel of the Aeronautics Research and Development Board of India ... A great variety of topics was covered, including themes such as nonlinear finite ... or shell structures, and three are on the composite form of construction, ...

  10. Base pairing and structural insights into the 5-formylcytosine in RNA duplex

    Science.gov (United States)

    Wang, Rui; Luo, Zhipu; He, Kaizhang; Delaney, Michael O.; Chen, Doris; Sheng, Jia

    2016-01-01

    Abstract 5-Formylcytidine (f5C), a previously discovered natural nucleotide in the mitochondrial tRNA of many species including human, has been recently detected as the oxidative product of 5-methylcytidine (m5C) through 5-hydroxymethylcytidine (hm5C) in total RNA of mammalian cells. The discovery indicated that these cytosine derivatives in RNA might also play important epigenetic roles similar as in DNA, which has been intensively investigated in the past few years. In this paper, we studied the base pairing specificity of f5C in different RNA duplex contexts. We found that the 5-formyl group could increase duplex thermal stability and enhance base pairing specificity. We present three high-resolution crystal structures of an octamer RNA duplex [5′-GUA(f5C)GUAC-3′]2 that have been solved under three crystallization conditions with different buffers and pH values. Our results showed that the 5-formyl group is located in the same plane as the cytosine base and forms an intra-residue hydrogen bond with the amino group in the N4 position. In addition, this modification increases the base stacking between the f5C and the neighboring bases while not causing significant global and local structure perturbations. This work provides insights into the effects of 5-formylcytosine on RNA duplex. PMID:27079978

  11. Annotating the protein-RNA interaction sites in proteins using evolutionary information and protein backbone structure.

    Science.gov (United States)

    Li, Tao; Li, Qian-Zhong

    2012-11-07

    RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets: RBP86 containing 86 protein chains, RBP107 containing 107 proteins chains and RBP109 containing 109 proteins chains, better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. In order to make further examination for the efficiency of our method, the RBP107 dataset is used as training set, RBP86 and RBP109 datasets are used as the independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA binding sites of novel proteins.

  12. Structure of E. coli 16S RNA elucidated by psoralen crosslinking

    International Nuclear Information System (INIS)

    Thompson, J.F.; Hearst, J.E.

    1983-01-01

    E. coli 16S RNA in solution was photoreacted with hydroxymethyltrimethylpsoralen and long wave ultraviolet light. Positions of crosslinks were determined to high resolution by partially digesting the RNA with T 1 RNase, separating the crosslinked fragments by two-dimensional gel electrophoresis, reversing the crosslink, and sequencing the separated fragments. This method yielded the locations of crosslinks to +/-15 nucleotides. Even finer placement has been made on the basis of our knowledge of psoralen reactivity. Thirteen unique crosslinks were mapped. Seven crosslinks confirmed regions of secondary structure which had been predicted in published phylogenetic models, three crosslinks discriminated between phylogenetic models, and three proved the existence of new structures. The new structures were all long-range interactions which appear to be in dynamic equilibrium with local secondary structure. Because this technique yields direct information about the secondary structure of large RNAs, it should prove invaluable in studying the structure of other RNAs of all sizes

  13. Utility of RNA Sequencing for Analysis of Maize Reproductive Transcriptomes

    Directory of Open Access Journals (Sweden)

    Rebecca M. Davidson

    2011-11-01

    Full Text Available Transcriptome sequencing is a powerful method for studying global expression patterns in large, complex genomes. Evaluation of sequence-based expression profiles during reproductive development would provide functional annotation to genes underlying agronomic traits. We generated transcriptome profiles for 12 diverse maize ( L. reproductive tissues representing male, female, developing seed, and leaf tissues using high throughput transcriptome sequencing. Overall, ∼80% of annotated genes were expressed. Comparative analysis between sequence and hybridization-based methods demonstrated the utility of ribonucleic acid sequencing (RNA-seq for expression determination and differentiation of paralagous genes (∼85% of maize genes. Analysis of 4975 gene families across reproductive tissues revealed expression divergence is proportional to family size. In all pairwise comparisons between tissues, 7 (pre- vs. postemergence cobs to 48% (pollen vs. ovule of genes were differentially expressed. Genes with expression restricted to a single tissue within this study were identified with the highest numbers observed in leaves, endosperm, and pollen. Coexpression network analysis identified 17 gene modules with complex and shared expression patterns containing many previously described maize genes. The data and analyses in this study provide valuable tools through improved gene annotation, gene family characterization, and a core set of candidate genes to further characterize maize reproductive development and improve grain yield potential.

  14. Defining the structural requirements for a helix in 23 S ribosomal RNA that confers erythromycin resistance

    DEFF Research Database (Denmark)

    Douthwaite, S; Powers, T; Lee, J Y

    1989-01-01

    The helix spanning nucleotides 1198 to 1247 (helix 1200-1250) in Escherichia coli 23 S ribosomal RNA (rRNA) is functionally important in protein synthesis, and deletions in this region confer erythromycin resistance. In order to define the structural requirements for resistance, we have dissected...... deletion mutants show a sensitive phenotype. Deletions that extend into the base-pairing between GCC1208 and GGU1240 result in non-functional 23 S RNAs, which consequently do not confer resistance. A number of phylogenetically conserved nucleotides have been shown to be non-essential for 23 S RNA function....... However, removal of either these or non-conserved nucleotides from helix 1200-1250 measurably reduces the efficiency of 23 S RNA in forming functional ribosomes. We have used chemical probing and a modified primer extension method to investigate erythromycin binding to wild-type and resistant ribosomes...

  15. A structured sparse regression method for estimating isoform expression level from multi-sample RNA-seq data.

    Science.gov (United States)

    Zhang, L; Liu, X J

    2016-06-03

    With the rapid development of next-generation high-throughput sequencing technology, RNA-seq has become a standard and important technique for transcriptome analysis. For multi-sample RNA-seq data, the existing expression estimation methods usually deal with each single-RNA-seq sample, and ignore that the read distributions are consistent across multiple samples. In the current study, we propose a structured sparse regression method, SSRSeq, to estimate isoform expression using multi-sample RNA-seq data. SSRSeq uses a non-parameter model to capture the general tendency of non-uniformity read distribution for all genes across multiple samples. Additionally, our method adds a structured sparse regularization, which not only incorporates the sparse specificity between a gene and its corresponding isoform expression levels, but also reduces the effects of noisy reads, especially for lowly expressed genes and isoforms. Four real datasets were used to evaluate our method on isoform expression estimation. Compared with other popular methods, SSRSeq reduced the variance between multiple samples, and produced more accurate isoform expression estimations, and thus more meaningful biological interpretations.

  16. The Structure of the RNA m5C Methyltransferase YebU from Escherichia coli Reveals a C-terminal RNA-recruiting PUA Domain

    DEFF Research Database (Denmark)

    Hallberg, B. Martin; Ericsson, Ulrika B.; Johnson, Kenneth A

    2006-01-01

    potential that differ from other RNA-MTase structures, suggesting that YebU interacts with its RNA target in a different manner. Docking of YebU onto the 30 S subunit indicates that the PUA and MTase domains make several contacts with 16 S rRNA as well as with the ribosomal protein S12. The ribosomal...... protein interactions would explain why the assembled 30 S subunit, and not naked 16 S rRNA, is the preferred substrate for YebU....... by X-ray crystallography, and we present a molecular model for how YebU specifically recognizes, binds and methylates its ribosomal substrate. The YebU protein has an N-terminal SAM-binding catalytic domain with structural similarity to the equivalent domains in several other m(5)C RNA MTases including...

  17. Structural profiles of human miRNA families from pairwise clustering

    DEFF Research Database (Denmark)

    Kaczkowski, Bogumil; Þórarinsson, Elfar; Reiche, Kristin

    2009-01-01

    secondary structure already predicted, little is known about the patterns of structural conservation among pre-miRNAs. We address this issue by clustering the human pre-miRNA sequences based on pairwise, sequence and secondary structure alignment using FOLDALIGN, followed by global multiple alignment...... of obtained clusters by WAR. As a result, the common secondary structure was successfully determined for four FOLDALIGN clusters: the RF00027 structural family of the Rfam database and three clusters with previously undescribed consensus structures. Availability: http://genome.ku.dk/resources/mirclust...

  18. Modeling the structure of RNA molecules with small-angle X-ray scattering data.

    Directory of Open Access Journals (Sweden)

    Michal Jan Gajda

    Full Text Available We propose a novel fragment assembly method for low-resolution modeling of RNA and show how it may be used along with small-angle X-ray solution scattering (SAXS data to model low-resolution structures of particles having as many as 12 independent secondary structure elements. We assessed this model-building procedure by using both artificial data on a previously proposed benchmark and publicly available data. With the artificial data, SAXS-guided models show better similarity to native structures than ROSETTA decoys. The publicly available data showed that SAXS-guided models can be used to reinterpret RNA structures previously deposited in the Protein Data Bank. Our approach allows for fast and efficient building of de novo models of RNA using approximate secondary structures that can be readily obtained from existing bioinformatic approaches. We also offer a rigorous assessment of the resolving power of SAXS in the case of small RNA structures, along with a small multimetric benchmark of the proposed method.

  19. NOBAI: a web server for character coding of geometrical and statistical features in RNA structure

    Science.gov (United States)

    Knudsen, Vegeir; Caetano-Anollés, Gustavo

    2008-01-01

    The Numeration of Objects in Biology: Alignment Inferences (NOBAI) web server provides a web interface to the applications in the NOBAI software package. This software codes topological and thermodynamic information related to the secondary structure of RNA molecules as multi-state phylogenetic characters, builds character matrices directly in NEXUS format and provides sequence randomization options. The web server is an effective tool that facilitates the search for evolutionary history embedded in the structure of functional RNA molecules. The NOBAI web server is accessible at ‘http://www.manet.uiuc.edu/nobai/nobai.php’. This web site is free and open to all users and there is no login requirement. PMID:18448469

  20. Functional 5′ UTR mRNA structures in eukaryotic translation regulation and how to find them

    Science.gov (United States)

    Leppek, Kathrin; Das, Rhiju; Barna, Maria

    2017-01-01

    RNA molecules can fold into intricate shapes that can provide an additional layer of control of gene expression beyond that of their sequence. In this Review, we discuss the current mechanistic understanding of structures in 5′ untranslated regions (UTRs) of eukaryotic mRNAs and the emerging methodologies used to explore them. These structures may regulate cap-dependent translation initiation through helicase-mediated remodelling of RNA structures and higher-order RNA interactions, as well as cap-independent translation initiation through internal ribosome entry sites (IRESs), mRNA modifications and other specialized translation pathways. We discuss known 5′ UTR RNA structures and how new structure probing technologies coupled with prospective validation, particularly compensatory mutagenesis, are likely to identify classes of structured RNA elements that shape post-transcriptional control of gene expression and the development of multicellular organisms. PMID:29165424

  1. Rtools: a web server for various secondary structural analyses on single RNA sequences.

    Science.gov (United States)

    Hamada, Michiaki; Ono, Yukiteru; Kiryu, Hisanori; Sato, Kengo; Kato, Yuki; Fukunaga, Tsukasa; Mori, Ryota; Asai, Kiyoshi

    2016-07-08

    The secondary structures, as well as the nucleotide sequences, are the important features of RNA molecules to characterize their functions. According to the thermodynamic model, however, the probability of any secondary structure is very small. As a consequence, any tool to predict the secondary structures of RNAs has limited accuracy. On the other hand, there are a few tools to compensate the imperfect predictions by calculating and visualizing the secondary structural information from RNA sequences. It is desirable to obtain the rich information from those tools through a friendly interface. We implemented a web server of the tools to predict secondary structures and to calculate various structural features based on the energy models of secondary structures. By just giving an RNA sequence to the web server, the user can get the different types of solutions of the secondary structures, the marginal probabilities such as base-paring probabilities, loop probabilities and accessibilities of the local bases, the energy changes by arbitrary base mutations as well as the measures for validations of the predicted secondary structures. The web server is available at http://rtools.cbrc.jp, which integrates software tools, CentroidFold, CentroidHomfold, IPKnot, CapR, Raccess, Rchange and RintD. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  2. Evolution of the RNase P RNA structural domain in Leptospira spp.

    Science.gov (United States)

    Ravishankar, Vigneshwaran; Ahmed, Ahmed; Sivagnanam, Ulaganathan; Muthuraman, Krishnaraja; Karthikaichamy, Anbarasu; Wilson, Herald A; Devendran, Ajay; Hartskeerl, Rudy A; Raj, Stephen M L

    2014-12-01

    We have employed the RNase P RNA (RPR) gene, which is present as single copy in chromosome I of Leptospira spp. to investigate the phylogeny of structural domains present in the RNA subunit of the tRNA processing enzyme, RNase P. RPR gene sequences of 150 strains derived from NCBI database along with sequences determined from 8 reference strains were examined to fathom strain specific structural differences present in leptospiral RPR. Sequence variations in the RPR gene impacted on the configuration of loops, stems and bulges found in the RPR highlighting species and strain specific structural motifs. In vitro transcribed leptospiral RPR ribozymes are demonstrated to process pre-tRNA into mature tRNA in consonance with the positioning of Leptospira in the taxonomic domain of bacteria. RPR sequence datasets used to construct a phylogenetic tree exemplified the segregation of strains into their respective lineages with a (re)speciation of strain SH 9 to Leptospira borgpetersenii, strains Fiocruz LV 3954 and Fiocruz LV 4135 to Leptospira santarosai, strain CBC 613 to Leptospira kirschneri and strain HAI 1536 to Leptospira noguchii. Furthermore, it allowed characterization of an isolate P2653, presumptively characterized as either serovar Hebdomadis, Kremastos or Longnan to Leptospira weilii, serovar Longnan. Copyright © 2014 Institut Pasteur. Published by Elsevier Masson SAS. All rights reserved.

  3. Integrated Analysis of Dysregulated ncRNA and mRNA Expression Profiles in Humans Exposed to Carbon Nanotubes.

    Directory of Open Access Journals (Sweden)

    Anna A Shvedova

    Full Text Available As the application of carbon nanotubes (CNT in consumer products continues to rise, studies have expanded to determine the associated risks of exposure on human and environmental health. In particular, several lines of evidence indicate that exposure to multi-walled carbon nanotubes (MWCNT could pose a carcinogenic risk similar to asbestos fibers. However, to date the potential markers of MWCNT exposure are not yet explored in humans.In the present study, global mRNA and ncRNA expression profiles in the blood of exposed workers, having direct contact with MWCNT aerosol for at least 6 months (n = 8, were compared with expression profiles of non-exposed (n = 7 workers (e.g., professional and/or technical staff from the same manufacturing facility.Significant changes in the ncRNA and mRNA expression profiles were observed between exposed and non-exposed worker groups. An integrative analysis of ncRNA-mRNA correlations was performed to identify target genes, functional relationships, and regulatory networks in MWCNT-exposed workers. The coordinated changes in ncRNA and mRNA expression profiles revealed a set of miRNAs and their target genes with roles in cell cycle regulation/progression/control, apoptosis and proliferation. Further, the identified pathways and signaling networks also revealed MWCNT potential to trigger pulmonary and cardiovascular effects as well as carcinogenic outcomes in humans, similar to those previously described in rodents exposed to MWCNTs.This study is the first to investigate aberrant changes in mRNA and ncRNA expression profiles in the blood of humans exposed to MWCNT. The significant changes in several miRNAs and mRNAs expression as well as their regulatory networks are important for getting molecular insights into the MWCNT-induced toxicity and pathogenesis in humans. Further large-scale prospective studies are necessary to validate the potential applicability of such changes in mRNAs and miRNAs as prognostic markers

  4. Integrated analysis of miRNA and mRNA expression in childhood medulloblastoma compared with neural stem cells.

    Directory of Open Access Journals (Sweden)

    Laura A Genovesi

    Full Text Available Medulloblastoma (MB is the most common malignant brain tumor in children and a leading cause of cancer-related mortality and morbidity. Several molecular sub-types of MB have been identified, suggesting they may arise from distinct cells of origin. Data from animal models indicate that some MB sub-types arise from multipotent cerebellar neural stem cells (NSCs. Hence, microRNA (miRNA expression profiles of primary MB samples were compared to CD133+ NSCs, aiming to identify deregulated miRNAs involved in MB pathogenesis. Expression profiling of 662 miRNAs in primary MB specimens, MB cell lines, and human CD133+ NSCs and CD133- neural progenitor cells was performed by qRT-PCR. Clustering analysis identified two distinct sub-types of MB primary specimens, reminiscent of sub-types obtained from their mRNA profiles. 21 significantly up-regulated and 12 significantly down-regulated miRNAs were identified in MB primary specimens relative to CD133+ NSCs (p<0.01. The majority of up-regulated miRNAs mapped to chromosomal regions 14q32 and 17q. Integration of the predicted targets of deregulated miRNAs with mRNA expression data from the same specimens revealed enrichment of pathways regulating neuronal migration, nervous system development and cell proliferation. Transient over-expression of a down-regulated miRNA, miR-935, resulted in significant down-regulation of three of the seven predicted miR-935 target genes at the mRNA level in a MB cell line, confirming the validity of this approach. This study represents the first integrated analysis of MB miRNA and mRNA expression profiles and is the first to compare MB miRNA expression profiles to those of CD133+ NSCs. We identified several differentially expressed miRNAs that potentially target networks of genes and signaling pathways that may be involved in the transformation of normal NSCs to brain tumor stem cells. Based on this integrative approach, our data provide an important platform for future

  5. 16S ribosomal RNA sequence analysis for determination of phylogenetic relationship among methylotrophs.

    Science.gov (United States)

    Tsuji, K; Tsien, H C; Hanson, R S; DePalma, S R; Scholtz, R; LaRoche, S

    1990-01-01

    16S ribosomal RNAs (rRNA) of 12 methylotrophic bacteria have been almost completely sequenced to establish their phylogenetic relationships. Methylotrophs that are physiologically related are phylogenetically diverse and are scattered among the purple eubacteria (class Proteobacteria). Group I methylotrophs can be classified in the beta- and the gamma-subdivisions and group II methylotrophs in the alpha-subdivision of the purple eubacteria, respectively. Pink-pigmented facultative and non-pigmented obligate group II methylotrophs form two distinctly separate branches within the alpha-subdivision. The secondary structures of the 16S rRNA sequences of 'Methylocystis parvus' strain OBBP, 'Methylosinus trichosporium' strain OB3b, 'Methylosporovibrio methanica' strain 81Z and Hyphomicrobium sp. strain DM2 are similar, and these non-pigmented obligate group II methylotrophs form one tight cluster in the alpha-subdivision. The pink-pigmented facultative methylotrophs, Methylobacterium extorquens strain AM1, Methylobacterium sp. strain DM4 and Methylobacterium organophilum strain XX form another cluster within the alpha-subdivision. Although similar in phenotypic characteristics, Methylobacterium organophilum strain XX and Methylobacterium extorquens strain AM1 are clearly distinguishable by their 16S rRNA sequences. The group I methylotrophs, Methylophilus methylotrophus strain AS1 and methylotrophic species DM11, which do not utilize methane, are similar in 16S rRNA sequence to bacteria in the beta-subdivision. The methane-utilizing, obligate group I methanotrophs, Methylococcus capsulatus strain BATH and Methylomonas methanica, are placed in the gamma-subdivision. The results demonstrate that it is possible to distinguish and classify the methylotrophic bacteria using 16S rRNA sequence analysis.

  6. RNA is an integral component of chromatin that contributes to its structural organization.

    Directory of Open Access Journals (Sweden)

    Antonio Rodríguez-Campos

    Full Text Available Chromatin structure is influenced by multiples factors, such as pH, temperature, nature and concentration of counterions, post-translational modifications of histones and binding of structural non-histone proteins. RNA is also known to contribute to the regulation of chromatin structure as chromatin-induced gene silencing was shown to depend on the RNAi machinery in S. pombe, plants and Drosophila. Moreover, both in Drosophila and mammals, dosage compensation requires the contribution of specific non-coding RNAs. However, whether RNA itself plays a direct structural role in chromatin is not known. Here, we report results that indicate a general structural role for RNA in eukaryotic chromatin. RNA is found associated to purified chromatin prepared from chicken liver, or cultured Drosophila S2 cells, and treatment with RNase A alters the structural properties of chromatin. Our results indicate that chromatin-associated RNAs, which account for 2%-5% of total chromatin-associated nucleic acids, are polyA(- and show a size similar to that of the DNA contained in the corresponding chromatin fragments. Chromatin-associated RNA(s are not likely to correspond to nascent transcripts as they are also found bound to chromatin when cells are treated with alpha-amanitin. After treatment with RNase A, chromatin fragments of molecular weight >3.000 bp of DNA showed reduced sedimentation through sucrose gradients and increased sensitivity to micrococcal nuclease digestion. This structural transition, which is observed both at euchromatic and heterochromatic regions, proceeds without loss of histone H1 or any significant change in core-histone composition and integrity.

  7. Crystal Structure of the N-Terminal RNA Recognition Motif of mRNA Decay Regulator AUF1

    Directory of Open Access Journals (Sweden)

    Young Jun Choi

    2016-01-01

    Full Text Available AU-rich element binding/degradation factor 1 (AUF1 plays a role in destabilizing mRNAs by forming complexes with AU-rich elements (ARE in the 3′-untranslated regions. Multiple AUF1-ARE complexes regulate the translation of encoded products related to the cell cycle, apoptosis, and inflammation. AUF1 contains two tandem RNA recognition motifs (RRM and a Gln- (Q- rich domain in their C-terminal region. To observe how the two RRMs are involved in recognizing ARE, we obtained the AUF1-p37 protein covering the two RRMs. However, only N-terminal RRM (RRM1 was crystallized and its structure was determined at 1.7 Å resolution. It appears that the RRM1 and RRM2 separated before crystallization. To demonstrate which factors affect the separate RRM1-2, we performed limited proteolysis using trypsin. The results indicated that the intact proteins were cleaved by unknown proteases that were associated with them prior to crystallization. In comparison with each of the monomers, the conformations of the β2-β3 loops were highly variable. Furthermore, a comparison with the RRM1-2 structures of HuR and hnRNP A1 revealed that a dimer of RRM1 could be one of the possible conformations of RRM1-2. Our data may provide a guidance for further structural investigations of AUF1 tandem RRM repeat and its mode of ARE binding.

  8. Genome-wide comparative in silico analysis of the RNA helicase gene family in Zea mays and Glycine max: a comparison with Arabidopsis and Oryza sativa.

    Science.gov (United States)

    Xu, Ruirui; Zhang, Shizhong; Huang, Jinguang; Zheng, Chengchao

    2013-01-01

    RNA helicases are enzymes that are thought to unwind double-stranded RNA molecules in an energy-dependent fashion through the hydrolysis of NTP. RNA helicases are associated with all processes involving RNA molecules, including nuclear transcription, editing, splicing, ribosome biogenesis, RNA export, and organelle gene expression. The involvement of RNA helicase in response to stress and in plant growth and development has been reported previously. While their importance in Arabidopsis and Oryza sativa has been partially studied, the function of RNA helicase proteins is poorly understood in Zea mays and Glycine max. In this study, we identified a total of RNA helicase genes in Arabidopsis and other crop species genome by genome-wide comparative in silico analysis. We classified the RNA helicase genes into three subfamilies according to the structural features of the motif II region, such as DEAD-box, DEAH-box and DExD/H-box, and different species showed different patterns of alternative splicing. Secondly, chromosome location analysis showed that the RNA helicase protein genes were distributed across all chromosomes with different densities in the four species. Thirdly, phylogenetic tree analyses identified the relevant homologs of DEAD-box, DEAH-box and DExD/H-box RNA helicase proteins in each of the four species. Fourthly, microarray expression data showed that many of these predicted RNA helicase genes were expressed in different developmental stages and different tissues under normal growth conditions. Finally, real-time quantitative PCR analysis showed that the expression levels of 10 genes in Arabidopsis and 13 genes in Zea mays were in close agreement with the microarray expression data. To our knowledge, this is the first report of a comparative genome-wide analysis of the RNA helicase gene family in Arabidopsis, Oryza sativa, Zea mays and Glycine max. This study provides valuable information for understanding the classification and putative functions of

  9. A genome-wide analysis of the RNA-guided silencing pathway in coffee reveals insights into its regulatory mechanisms.

    Directory of Open Access Journals (Sweden)

    Christiane Noronha Fernandes-Brum

    Full Text Available microRNAs (miRNAs are derived from self-complementary hairpin structures, while small-interfering RNAs (siRNAs are derived from double-stranded RNA (dsRNA or hairpin precursors. The core mechanism of sRNA production involves DICER-like (DCL in processing the smallRNAs (sRNAs and ARGONAUTE (AGO as effectors of silencing, and siRNA biogenesis also involves action of RNA-Dependent RNA Polymerase (RDR, Pol IV and Pol V in biogenesis. Several other proteins interact with the core proteins to guide sRNA biogenesis, action, and turnover. We aimed to unravel the components and functions of the RNA-guided silencing pathway in a non-model plant species of worldwide economic relevance. The sRNA-guided silencing complex members have been identified in the Coffea canephora genome, and they have been characterized at the structural, functional, and evolutionary levels by computational analyses. Eleven AGO proteins, nine DCL proteins (which include a DCL1-like protein that was not previously annotated, and eight RDR proteins were identified. Another 48 proteins implicated in smallRNA (sRNA pathways were also identified. Furthermore, we identified 235 miRNA precursors and 317 mature miRNAs from 113 MIR families, and we characterized ccp-MIR156, ccp-MIR172, and ccp-MIR390. Target prediction and gene ontology analyses of 2239 putative targets showed that significant pathways in coffee are targeted by miRNAs. We provide evidence of the expansion of the loci related to sRNA pathways, insights into the activities of these proteins by domain and catalytic site analyses, and gene expression analysis. The number of MIR loci and their targeted pathways highlight the importance of miRNAs in coffee. We identified several roles of sRNAs in C. canephora, which offers substantial insight into better understanding the transcriptional and post-transcriptional regulation of this major crop.

  10. Extracellular plasma RNA from colon cancer patients is confined in a vesicle-like structure and is mRNA-enriched

    Science.gov (United States)

    García, José Miguel; García, Vanesa; Peña, Cristina; Domínguez, Gemma; Silva, Javier; Diaz, Raquel; Espinosa, Pablo; Citores, Maria Jesús; Collado, Manuel; Bonilla, Félix

    2008-01-01

    Little is yet known about the origin and protective mechanism of free nucleic acids in plasma. We investigated the possibility of these free nucleic acids being particle associated. Plasma samples from colon cancer patients and cell culture media were subjected to various antibody incubations, ultracentrifugation, and RNA extraction protocols for total RNA, epithelial RNA, and mRNA. Flow cytometry using a Ber-EP4 antibody and confocal laser microscopy after staining with propidium iodide were also performed. mRNA levels of the LISCH7 and SDHA genes were determined in cells and in culture media. Ber-EP4 antibody and polystyrene beads coated with oligo dT sequences were employed. We observed that, after incubation, total RNA and mRNA were always detected after membrane digestion, and that epithelial RNA was detected before this procedure. In ultracentrifugation, mRNA was caught in the supernatant only if a former lysis mediated or in the pellet if there was no previous digestion. Flow cytometry determinations showed that antibody-coated microbeads keep acellular structures bearing epithelial antigens apart. Confocal laser microscopy made 1- to 2-μm-diameter particles perceptible in the vicinity of magnetic polystyrene beads. Relevant differences were observed between mRNA of cells and culture media, as there was a considerable difference in LISCH7 mRNA levels between HT29 and IMR90 cell co-cultures and their culture media. Our results support the view that extracellular RNA found in plasma from cancer patients circulates in association with or is protected in a multiparticle complex, and that an active release mechanism by tumor cells may be a possible origin. PMID:18456845

  11. Analysis of the RNA Content of the Yeast "Saccharomyces Cerevisiae"

    Science.gov (United States)

    Deutch, Charles E.; Marshall, Pamela A.

    2008-01-01

    In this article, the authors describe an interconnected set of relatively simple laboratory experiments in which students determine the RNA content of yeast cells and use agarose gel electrophoresis to separate and analyze the major species of cellular RNA. This set of experiments focuses on RNAs from the yeast "Saccharomyces cerevisiae", a…

  12. GDCRNATools: an R/Bioconductor package for integrative analysis of lncRNA, miRNA, and mRNA data in GDC.

    Science.gov (United States)

    Li, Ruidong; Qu, Han; Wang, Shibo; Wei, Julong; Zhang, Le; Ma, Renyuan; Lu, Jianming; Zhu, Jianguo; Zhong, Wei-De; Jia, Zhenyu

    2018-03-02

    The large-scale multidimensional omics data in the Genomic Data Commons (GDC) provides opportunities to investigate the crosstalk among different RNA species and their regulatory mechanisms in cancers. Easy-to-use bioinformatics pipelines are needed to facilitate such studies. We have developed a user-friendly R/Bioconductor package, named GDCRNATools, for downloading, organizing, and analyzing RNA data in GDC with an emphasis on deciphering the lncRNA-mRNA related competing endogenous RNAs (ceRNAs) regulatory network in cancers. Many widely used bioinformatics tools and databases are utilized in our package. Users can easily pack preferred downstream analysis pipelines or integrate their own pipelines into the workflow. Interactive shiny web apps built in GDCRNATools greatly improve visualization of results from the analysis. GDCRNATools is an R/Bioconductor package that is freely available at Bioconductor (http://bioconductor.org/packages/devel/bioc/html/GDCRNATools.html). Detailed instructions, manual and example code are also available in Github (https://github.com/Jialab-UCR/GDCRNATools). arthur.jia@ucr.edu or zhongwd2009@live.cn or doctorzhujianguo@163.com.

  13. Evaluating the effect of disturbed ensemble distributions on SCFG based statistical sampling of RNA secondary structures

    Directory of Open Access Journals (Sweden)

    Scheid Anika

    2012-07-01

    Full Text Available Abstract Background Over the past years, statistical and Bayesian approaches have become increasingly appreciated to address the long-standing problem of computational RNA structure prediction. Recently, a novel probabilistic method for the prediction of RNA secondary structures from a single sequence has been studied which is based on generating statistically representative and reproducible samples of the entire ensemble of feasible structures for a particular input sequence. This method samples the possible foldings from a distribution implied by a sophisticated (traditional or length-dependent stochastic context-free grammar (SCFG that mirrors the standard thermodynamic model applied in modern physics-based prediction algorithms. Specifically, that grammar represents an exact probabilistic counterpart to the energy model underlying the Sfold software, which employs a sampling extension of the partition function (PF approach to produce statistically representative subsets of the Boltzmann-weighted ensemble. Although both sampling approaches have the same worst-case time and space complexities, it has been indicated that they differ in performance (both with respect to prediction accuracy and quality of generated samples, where neither of these two competing approaches generally outperforms the other. Results In this work, we will consider the SCFG based approach in order to perform an analysis on how the quality of generated sample sets and the corresponding prediction accuracy changes when different degrees of disturbances are incorporated into the needed sampling probabilities. This is motivated by the fact that if the results prove to be resistant to large errors on the distinct sampling probabilities (compared to the exact ones, then it will be an indication that these probabilities do not need to be computed exactly, but it may be sufficient and more efficient to approximate them. Thus, it might then be possible to decrease the worst

  14. Flow cytometric analysis of RNA synthesis by detection of bromouridine incorporation

    DEFF Research Database (Denmark)

    Larsen, J K; Jensen, Peter Østrup; Larsen, J

    2001-01-01

    RNA synthesis has traditionally been investigated by a laborious and time-consuming radiographic method involving incorporation of tritiated uridine. Now a faster non-radioactive alternative has emerged, based on immunocytochemical detection. This method utilizes the brominated RNA precursor...... bromouridine, which is taken into a cell, phosphorylated, and incorporated into nascent RNA. The BrU-substituted RNA is detected by permeabilizing the cells and staining with certain anti-BrdU antibodies. This dynamic approach yields information complementing that provided by cellular RNA content analysis...

  15. Structures and their analysis

    CERN Document Server

    Fuchs, Maurice Bernard

    2016-01-01

    Addressing structures, this book presents a classic discipline in a modern setting by combining illustrated examples with insights into the solutions. It is the fruit of the author’s many years of teaching the subject and of just as many years of research into the design of optimal structures. Although intended for an advanced level of instruction it has an undergraduate course at its core. Further, the book was written with the advantage of having massive computer power in the background, an aspect which changes the entire approach to many engineering disciplines and in particular to structures. This paradigm shift has dislodged the force (flexibility) method from its former prominence and paved the way for the displacement (stiffness) method, despite the multitude of linear equations it spawns. In this book, however, both methods are taught: the force method offers a perfect vehicle for understanding structural behavior, bearing in mind that it is the displacement method which does the heavy number crunch...

  16. The suboptimal structures find the optimal RNAs: homology search for bacterial non-coding RNAsusing suboptimal RNA structures

    Czech Academy of Sciences Publication Activity Database

    Pánek, Josef; Krásný, Libor; Bobek, Jan; Ježková, E.; Korelusová, Jana; Vohradský, Jiří

    -, - (2010), s. 1-9 ISSN 1362-4962 R&D Projects: GA MŠk 2B06065; GA ČR GA303/09/0475; GA ČR GA310/07/1009 Institutional research plan: CEZ:AV0Z50200510 Keywords : ncRNAs * RNA structures Subject RIV: EE - Microbiology, Virology

  17. Finding the most significant common sequence and structure motifs in a set of RNA sequences

    DEFF Research Database (Denmark)

    Gorodkin, Jan; Heyer, L.J.; Stormo, G.D.

    1997-01-01

    We present a computational scheme to locally align a collection of RNA sequences using sequence and structure constraints, In addition, the method searches for the resulting alignments with the most significant common motifs, among all possible collections, The first part utilizes a simplified...

  18. SHAPE selection (SHAPES) enrich for RNA structure signal in SHAPE sequencing-based probing data

    DEFF Research Database (Denmark)

    Poulsen, Line Dahl; Kielpinski, Lukasz Jan; Salama, Sofie R

    2015-01-01

    transcriptase. Here, we introduce a SHAPE Selection (SHAPES) reagent, N-propanone isatoic anhydride (NPIA), which retains the ability of SHAPE reagents to accurately probe RNA structure, but also allows covalent coupling between the SHAPES reagent and a biotin molecule. We demonstrate that SHAPES...

  19. A possible contribution of mRNA secondary structure to translation initiation efficiency in Lactococcus lactis

    NARCIS (Netherlands)

    Guchte, Maarten van de; Lende, Ted van der; Kok, Jan; Venema, Gerard

    1991-01-01

    Gene expression signals derived from Lactococcus lactis were linked to lacZ-fused genes with different 5'-nucleotide sequences. Computer predictions of mRNA secondary structure were combined with lacZ expression studies to direct base-substitutions that could possibly influence gene expression.

  20. Structure of Pseudoknot PK26 Shows 3D Domain Swapping in an RNA

    Science.gov (United States)

    Lietzke, Susan E; Barnes, Cindy L.

    1998-01-01

    3D domain swapping provides a facile pathway for the evolution of oligomeric proteins and allosteric mechanisms and a means for using monomer-oligomer equilibria to regulate biological activity. The term "3D domain swapping" describes the exchange of identical domains between two protein monomers to create an oligomer. 3D domain swapping has, so far, only been recognized in proteins. In this study, the structure of the pseudoknot PK26 is reported and it is a clear example of 3D domain swapping in RNA. PK26 was chosen for study because RNA pseudoknots are required structures in several biological processes and they arise frequently in in vitro selection experiments directed against protein targets. PK26 specifically inhibits HIV-1 reverse transcriptase with nanomolar affinity. We have now determined the 3.1 A resolution crystal structure of PK26 and find that it forms a 3D domain swapped dimer. PK26 shows extensive base pairing between and within strands. Formation of the dimer requires the linker region between the pseudoknot folds to adopt a unique conformation that allows a base within a helical stem to skip one base in the stacking register. Rearrangement of the linker would permit a monomeric pseudoknot to form. This structure shows how RNA can use 3D domain swapping to build large scale oligomers like the putative hexamer in the packaging RNA of bacteriophage Phi29.

  1. Glassy transition in a disordered model for the RNA secondary structure

    International Nuclear Information System (INIS)

    Pagnani, A.; Parisi, G.; Ricci-Tersenghi, F.

    2000-04-01

    We numerically study a disordered model for the RNA secondary structure and we find that it undergoes a phase transition, with a breaking of the replica symmetry in the low temperature region (like in spin glasses). Our results are based on the exact evaluation of the partition function. (author)

  2. Structural and biochemical studies on ATP binding and hydrolysis by the Escherichia coli RNA chaperone Hfq.

    Directory of Open Access Journals (Sweden)

    Hermann Hämmerle

    Full Text Available In Escherichia coli the RNA chaperone Hfq is involved in riboregulation by assisting base-pairing between small regulatory RNAs (sRNAs and mRNA targets. Several structural and biochemical studies revealed RNA binding sites on either surface of the donut shaped Hfq-hexamer. Whereas sRNAs are believed to contact preferentially the YKH motifs present on the proximal site, poly(A(15 and ADP were shown to bind to tripartite binding motifs (ARE circularly positioned on the distal site. Hfq has been reported to bind and to hydrolyze ATP. Here, we present the crystal structure of a C-terminally truncated variant of E. coli Hfq (Hfq(65 in complex with ATP, showing that it binds to the distal R-sites. In addition, we revisited the reported ATPase activity of full length Hfq purified to homogeneity. At variance with previous reports, no ATPase activity was observed for Hfq. In addition, FRET assays neither indicated an impact of ATP on annealing of two model oligoribonucleotides nor did the presence of ATP induce strand displacement. Moreover, ATP did not lead to destabilization of binary and ternary Hfq-RNA complexes, unless a vast stoichiometric excess of ATP was used. Taken together, these studies strongly suggest that ATP is dispensable for and does not interfere with Hfq-mediated RNA transactions.

  3. Analysis of the complement and molecular evolution of tRNA genes in cow

    Directory of Open Access Journals (Sweden)

    Barris Wesley C

    2009-04-01

    Full Text Available Abstract Background Detailed information regarding the number and organization of transfer RNA (tRNA genes at the genome level is becoming readily available with the increase of DNA sequencing of whole genomes. However the identification of functional tRNA genes is challenging for species that have large numbers of repetitive elements containing tRNA derived sequences, such as Bos taurus. Reliable identification and annotation of entire sets of tRNA genes allows the evolution of tRNA genes to be understood on a genomic scale. Results In this study, we explored the B. taurus genome using bioinformatics and comparative genomics approaches to catalogue and analyze cow tRNA genes. The initial analysis of the cow genome using tRNAscan-SE identified 31,868 putative tRNA genes and 189,183 pseudogenes, where 28,830 of the 31,868 predicted tRNA genes were classified as repetitive elements by the RepeatMasker program. We then used comparative genomics to further discriminate between functional tRNA genes and tRNA-derived sequences for the remaining set of 3,038 putative tRNA genes. For our analysis, we used the human, chimpanzee, mouse, rat, horse, dog, chicken and fugu genomes to predict that the number of active tRNA genes in cow lies in the vicinity of 439. Of this set, 150 tRNA genes were 100% identical in their sequences across all nine vertebrate genomes studied. Using clustering analyses, we identified a new tRNA-GlyCCC subfamily present in all analyzed mammalian genomes. We suggest that this subfamily originated from an ancestral tRNA-GlyGCC gene via a point mutation prior to the radiation of the mammalian lineages. Lastly, in a separate analysis we created phylogenetic profiles for each putative cow tRNA gene using a representative set of genomes to gain an overview of common evolutionary histories of tRNA genes. Conclusion The use of a combination of bioinformatics and comparative genomics approaches has allowed the confident identification of a

  4. Structure of the Hantavirus Nucleoprotein Provides Insights into the Mechanism of RNA Encapsidation

    Directory of Open Access Journals (Sweden)

    Daniel Olal

    2016-03-01

    Full Text Available Hantaviruses are etiological agents of life-threatening hemorrhagic fever with renal syndrome and hantavirus cardiopulmonary syndrome. The nucleoprotein (N of hantavirus is essential for viral transcription and replication, thus representing an attractive target for therapeutic intervention. We have determined the crystal structure of hantavirus N to 3.2 Å resolution. The structure reveals a two-lobed, mostly α-helical structure that is distantly related to that of orthobunyavirus Ns. A basic RNA binding pocket is located at the intersection between the two lobes. We provide evidence that oligomerization is mediated by amino- and C-terminal arms that bind to the adjacent monomers. Based on these findings, we suggest a model for the oligomeric ribonucleoprotein (RNP complex. Our structure provides mechanistic insights into RNA encapsidation in the genus Hantavirus and constitutes a template for drug discovery efforts aimed at combating hantavirus infections.

  5. Detailed analysis of RNA-protein interactions within the bacterial ribosomal protein L5/5S rRNA complex.

    Science.gov (United States)

    Perederina, Anna; Nevskaya, Natalia; Nikonov, Oleg; Nikulin, Alexei; Dumas, Philippe; Yao, Min; Tanaka, Isao; Garber, Maria; Gongadze, George; Nikonov, Stanislav

    2002-12-01

    The crystal structure of ribosomal protein L5 from Thermus thermophilus complexed with a 34-nt fragment comprising helix III and loop C of Escherichia coli 5S rRNA has been determined at 2.5 A resolution. The protein specifically interacts with the bulged nucleotides at the top of loop C of 5S rRNA. The rRNA and protein contact surfaces are strongly stabilized by intramolecular interactions. Charged and polar atoms forming the network of conserved intermolecular hydrogen bonds are located in two narrow planar parallel layers belonging to the protein and rRNA, respectively. The regions, including these atoms conserved in Bacteria and Archaea, can be considered an RNA-protein recognition module. Comparison of the T. thermophilus L5 structure in the RNA-bound form with the isolated Bacillus stearothermophilus L5 structure shows that the RNA-recognition module on the protein surface does not undergo significant changes upon RNA binding. In the crystal of the complex, the protein interacts with another RNA molecule in the asymmetric unit through the beta-sheet concave surface. This protein/RNA interface simulates the interaction of L5 with 23S rRNA observed in the Haloarcula marismortui 50S ribosomal subunit.

  6. Analysis of Pteridium ribosomal RNA sequences by rapid direct sequencing.

    Science.gov (United States)

    Tan, M K

    1991-08-01

    A total of 864 bases from 5 regions interspersed in the 18S and 26S rRNA molecules from various clones of Pteridium covering the general geographical distribution of the genus was analysed using a rapid rRNA sequencing technique. No base difference has been detected amongst the three major lineages, two of which apparently separated before the breakup of the ancient supercontinent, Pangaea. These regions of the rRNA sequences have thus been conserved for at least 160 million years and are here compared with other eukaryotic, especially plant rRNAs.

  7. Collapse Analysis of Timber Structures

    DEFF Research Database (Denmark)

    Kirkegaard, Poul Henning; Sørensen, John Dalsgaard

    2008-01-01

    of Structures and a probabilistic modelling of the timber material proposed in the Probabilistic Model Code (PMC) of the Joint Committee on Structural Safety (JCSS). Due to the framework in the Danish Code the timber structure has to be evaluated with respect to the following criteria where at least one shall...... to criteria a) and b) the timber frame structure has one column with a reliability index a bit lower than an assumed target level. By removal three columns one by one no significant extensive failure of the entire structure or significant parts of it are obtained. Therefore the structure can be considered......A probabilistic based collapse analysis has been performed for a glulam frame structure supporting the roof over the main court in a Norwegian sports centre. The robustness analysis is based on the framework for robustness analysis introduced in the Danish Code of Practice for the Safety...

  8. Potential microRNA-mediated oncogenic intercellular communication revealed by pan-cancer analysis

    Science.gov (United States)

    Li, Yue; Zhang, Zhaolei

    2014-11-01

    Carcinogenesis consists of oncogenesis and metastasis, and intriguingly microRNAs (miRNAs) are involved in both processes. Although aberrant miRNA activities are prevalent in diverse tumor types, the exact mechanisms for how they regulate cancerous processes are not always clear. To this end, we performed a large-scale pan-cancer analysis via a novel probabilistic approach to infer recurrent miRNA-target interactions implicated in 12 cancer types using data from The Cancer Genome Atlas. We discovered ~20,000 recurrent miRNA regulations, which are enriched for cancer-related miRNAs/genes. Notably, miRNA 200 family (miR-200/141/429) is among the most prominent miRNA regulators, which is known to be involved in metastasis. Importantly, the recurrent miRNA regulatory network is not only enriched for cancer pathways but also for extracellular matrix (ECM) organization and ECM-receptor interactions. The results suggest an intriguing cancer mechanism involving miRNA-mediated cell-to-cell communication, which possibly involves delivery of tumorigenic miRNA messengers to adjacent cells via exosomes. Finally, survival analysis revealed 414 recurrent-prognostic associations, where both gene and miRNA involved in each interaction conferred significant prognostic power in one or more cancer types. Together, our comprehensive pan-cancer analysis provided not only biological insights into metastasis but also brought to bear the clinical relevance of the proposed recurrent miRNA-gene associations.

  9. Cyanobacteria contain a structural homologue of the Hfq protein with altered RNA binding properties

    DEFF Research Database (Denmark)

    Bøggild, Andreas; Overgaard, Martin; Valentin-Hansen, Poul

    2009-01-01

    Hfq proteins are common in many species of enterobacteria, where they participate in RNA folding and translational regulation through pairing of small RNAs and messenger RNAs. Hfq proteins share the distinctive Sm fold, and form ring-shaped structures similar to those of the Sm/Lsm proteins...... proteins from the cyanobacteria Synechocystis sp. PCC 6803 and Anabaena PCC 7120 at 1.3 and 2.3 A resolution, respectively, and show that they retain the classic Sm fold despite low sequence conservation. In addition, the intersubunit contacts and RNA-binding site are divergent, and we show biochemically...

  10. Cyanobacteria contain a structural homologue of the Hfq protein with altered RNA-binding properties

    DEFF Research Database (Denmark)

    Bøggild, Andreas; Overgaard, Martin; Valentin-Hansen, Poul

    2009-01-01

    Hfq proteins are common in many species of enterobacteria, where they participate in RNA folding and translational regulation through pairing of small RNAs and messenger RNAs. Hfq proteins share the distinctive Sm fold, and form ring-shaped structures similar to those of the Sm/Lsm proteins...... proteins from the cyanobacteria Synechocystis sp. PCC 6803 and Anabaena PCC 7120 at 1.3 and 2.3 A resolution, respectively, and show that they retain the classic Sm fold despite low sequence conservation. In addition, the intersubunit contacts and RNA-binding site are divergent, and we show biochemically...

  11. Structural relationships among the multiple forms of DNA-dependent RNA polymerase II from cultured parsley cells

    International Nuclear Information System (INIS)

    Link, G.; Bogorad, L.; Kidd, G.H.; Richter, G.

    1978-01-01

    DNA-dependent RNA polymerase II (or B) was purified from cultured parsley cells, and its molecular structure was examined in detail. Upon centrifugation through glycerol gradients, RNA polymerase II sediments as a single band with an apparent sedimentation constant of 15S. No contamination with RNA polymerases I or III could be detected when the activity of purified RNA polymerase II was assayed in the presence of high concentrations of α-amanitin. Analysis of purified RNA polymerase II be nondenaturing and denaturing polyacrylamide gel electrophoresis revealed that this enzyme exists in multiple forms. They were designated II(O), II(A), and II(B). It is suggested that each form has a subunit of Mr = 140000 as well as smaller polypeptides in common. They differ, however, in the molecular weights of their largest subunits which is 220000 in form II(O), 200000 in form II(A), and 180000 in form II(B). These large subunits were labelled with 125 I, digested with trypsin, and tryptic digests were compared by two-dimensional analysis on thin-layer plates (Elder et al. (1977) J. Biol. Chem. 252, 6510-6515). Fingerprints of tryptic digests from the polypeptides with Mr = 220000, Mr = 200000, and Mr = 180000 were similar. It is, therefore, suggested that these subunits are stucturally related. A tryptic digest was also produced from the subunit with Mr = 140000. Its fingerprint was found to yield a considerably different distribution of peptides as compared to those from the three large subunits. (orig.) [de

  12. Locked nucleoside analogues expand the potential of DNAzymes to cleave structured RNA targets

    Directory of Open Access Journals (Sweden)

    Wengel Jesper

    2006-06-01

    Full Text Available Abstract Background DNAzymes cleave at predetermined sequences within RNA. A prerequisite for cleavage is that the DNAzyme can gain access to its target, and thus the DNAzyme must be capable of unfolding higher-order structures that are present in the RNA substrate. However, in many cases the RNA target sequence is hidden in a region that is too tightly structured to be accessed under physiological conditions by DNAzymes. Results We investigated how incorporation of LNA (locked nucleic acid monomers into DNAzymes improves their ability to gain access and cleave at highly-structured RNA targets. The binding arms of DNAzymes were varied in length and were substituted with up to three LNA and α-L-LNA monomers (forming LNAzymes. For one DNAzyme, the overall cleavage reaction proceeded fifty times faster after incorporation of two α-L-LNA monomers per binding arm (kobs increased from 0.014 min-1 to 0.78 min-1. Conclusion The data demonstrate how hydrolytic performance can be enhanced by design of LNAzymes, and indicate that there are optimal lengths for the binding arms and for the number of modified LNA monomers.

  13. Structural systems reliability analysis

    International Nuclear Information System (INIS)

    Frangopol, D.

    1975-01-01

    For an exact evaluation of the reliability of a structure it appears necessary to determine the distribution densities of the loads and resistances and to calculate the correlation coefficients between loads and between resistances. These statistical characteristics can be obtained only on the basis of a long activity period. In case that such studies are missing the statistical properties formulated here give upper and lower bounds of the reliability. (orig./HP) [de

  14. Natural and artificial binders of polyriboadenylic acid and their effect on RNA structure

    Directory of Open Access Journals (Sweden)

    Giovanni N. Roviello

    2015-06-01

    Full Text Available The employment of molecular tools with nucleic acid binding ability to specifically control crucial cellular functions represents an important scientific area at the border between biochemistry and pharmaceutical chemistry. In this review we describe several molecular systems of natural or artificial origin, which are able to bind polyriboadenylic acid (poly(rA both in its single-stranded or structured forms. Due to the fundamental role played by the poly(rA tail in the maturation and stability of mRNA, as well as in the initiation of the translation process, compounds able to bind this RNA tract, influencing the mRNA fate, are of special interest for developing innovative biomedical strategies mainly in the field of anticancer therapy.

  15. A technical assessment of the porcine ejaculated spermatozoa for a sperm-specific RNA-seq analysis.

    Science.gov (United States)

    Gòdia, Marta; Mayer, Fabiana Quoos; Nafissi, Julieta; Castelló, Anna; Rodríguez-Gil, Joan Enric; Sánchez, Armand; Clop, Alex

    2018-04-26

    The study of the boar sperm transcriptome by RNA-seq can provide relevant information on sperm quality and fertility and might contribute to animal breeding strategies. However, the analysis of the spermatozoa RNA is challenging as these cells harbor very low amounts of highly fragmented RNA, and the ejaculates also contain other cell types with larger amounts of non-fragmented RNA. Here, we describe a strategy for a successful boar sperm purification, RNA extraction and RNA-seq library preparation. Using these approaches our objectives were: (i) to evaluate the sperm recovery rate (SRR) after boar spermatozoa purification by density centrifugation using the non-porcine-specific commercial reagent BoviPure TM ; (ii) to assess the correlation between SRR and sperm quality characteristics; (iii) to evaluate the relationship between sperm cell RNA load and sperm quality traits and (iv) to compare different library preparation kits for both total RNA-seq (SMARTer Universal Low Input RNA and TruSeq RNA Library Prep kit) and small RNA-seq (NEBNext Small RNA and TailorMix miRNA Sample Prep v2) for high-throughput sequencing. Our results show that pig SRR (~22%) is lower than in other mammalian species and that it is not significantly dependent of the sperm quality parameters analyzed in our study. Moreover, no relationship between the RNA yield per sperm cell and sperm phenotypes was found. We compared a RNA-seq library preparation kit optimized for low amounts of fragmented RNA with a standard kit designed for high amount and quality of input RNA and found that for sperm, a protocol designed to work on low-quality RNA is essential. We also compared two small RNA-seq kits and did not find substantial differences in their performance. We propose the methodological workflow described for the RNA-seq screening of the boar spermatozoa transcriptome. FPKM: fragments per kilobase of transcript per million mapped reads; KRT1: keratin 1; miRNA: micro-RNA; miscRNA: miscellaneous

  16. RNA Crystallization

    Science.gov (United States)

    Golden, Barbara L.; Kundrot, Craig E.

    2003-01-01

    RNA molecules may be crystallized using variations of the methods developed for protein crystallography. As the technology has become available to syntheisize and purify RNA molecules in the quantities and with the quality that is required for crystallography, the field of RNA structure has exploded. The first consideration when crystallizing an RNA is the sequence, which may be varied in a rational way to enhance crystallizability or prevent formation of alternate structures. Once a sequence has been designed, the RNA may be synthesized chemically by solid-state synthesis, or it may be produced enzymatically using RNA polymerase and an appropriate DNA template. Purification of milligram quantities of RNA can be accomplished by HPLC or gel electrophoresis. As with proteins, crystallization of RNA is usually accomplished by vapor diffusion techniques. There are several considerations that are either unique to RNA crystallization or more important for RNA crystallization. Techniques for design, synthesis, purification, and crystallization of RNAs will be reviewed here.

  17. Systematic Analysis of RNA Regulatory Network in Rat Brain after Ischemic Stroke

    Directory of Open Access Journals (Sweden)

    Juan Liu

    2018-01-01

    Full Text Available Although extensive studies have identified large number of microRNAs (miRNAs and long noncoding RNAs (lncRNAs in ischemic stroke, the RNA regulation network response to focal ischemia remains poorly understood. In this study, we simultaneously interrogate the expression profiles of lncRNAs, miRNAs, and mRNAs changes during focal ischemia induced by transient middle cerebral artery occlusion. A set of 1924 novel lncRNAs were identified and may involve brain injury and DNA repair as revealed by coexpression network analysis. Furthermore, many short interspersed elements (SINE mediated lncRNA:mRNA duplexes were identified, implying that lncRNAs mediate Staufen1-mediated mRNA decay (SMD which may play a role during focal ischemia. Moreover, based on the competitive endogenous RNA (ceRNA hypothesis, a stroke regulatory ceRNA network which reveals functional lncRNA:miRNA:mRNA interactions was revealed in ischemic stroke. In brief, this work reports a large number of novel lncRNAs responding to focal ischemia and constructs a systematic RNA regulation network which highlighted the role of ncRNAs in ischemic stroke.

  18. Fast pairwise structural RNA alignments by pruning of the dynamical programming matrix

    DEFF Research Database (Denmark)

    Havgaard, Jakob Hull; Torarinsson, Elfar; Gorodkin, Jan

    2007-01-01

    and backtracked in a normal fashion. Finally, the FOLDALIGN algorithm has also been updated with a better memory implementation and an improved energy model. With these improvements in the algorithm, the FOLDALIGN software package provides the molecular biologist with an efficient and user-friendly tool...... the advantage of providing the constraints dynamically. This has been included in a new implementation of the FOLDALIGN algorithm for pairwise local or global structural alignment of RNA sequences. It is shown that time and memory requirements are dramatically lowered while overall performance is maintained....... Furthermore, a new divide and conquer method is introduced to limit the memory requirement during global alignment and backtrack of local alignment. All branch points in the computed RNA structure are found and used to divide the structure into smaller unbranched segments. Each segment is then realigned...

  19. Structural Basis for Specific Inhibition of tRNA Synthetase by an ATP Competitive Inhibitor.

    Science.gov (United States)

    Fang, Pengfei; Han, Hongyan; Wang, Jing; Chen, Kaige; Chen, Xin; Guo, Min

    2015-06-18

    Pharmaceutical inhibitors of aminoacyl-tRNA synthetases demand high species and family specificity. The antimalarial ATP-mimetic cladosporin selectively inhibits Plasmodium falciparum LysRS (PfLysRS). How the binding to a universal ATP site achieves the specificity is unknown. Here we report three crystal structures of cladosporin with human LysRS, PfLysRS, and a Pf-like human LysRS mutant. In all three structures, cladosporin occupies the class defining ATP-binding pocket, replacing the adenosine portion of ATP. Three residues holding the methyltetrahydropyran moiety of cladosporin are critical for the specificity of cladosporin against LysRS over other class II tRNA synthetase families. The species-exclusive inhibition of PfLysRS is linked to a structural divergence beyond the active site that mounts a lysine-specific stabilizing response to binding cladosporin. These analyses reveal that inherent divergence of tRNA synthetase structural assembly may allow for highly specific inhibition even through the otherwise universal substrate binding pocket and highlight the potential for structure-driven drug development. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Know Your Enemy: Successful Bioinformatic Approaches to Predict Functional RNA Structures in Viral RNAs

    Science.gov (United States)

    Lim, Chun Shen; Brown, Chris M.

    2018-01-01

    Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID:29354101

  1. Three RNA recognition motifs participate in RNA recognition and structural organization by the pro-apoptotic factor TIA-1

    Science.gov (United States)

    Bauer, William J.; Heath, Jason; Jenkins, Jermaine L.; Kielkopf, Clara L.

    2012-01-01

    T-cell intracellular antigen-1 (TIA-1) regulates developmental and stress-responsive pathways through distinct activities at the levels of alternative pre-mRNA splicing and mRNA translation. The TIA-1 polypeptide contains three RNA recognition motifs (RRMs). The central RRM2 and C-terminal RRM3 associate with cellular mRNAs. The N-terminal RRM1 enhances interactions of a C-terminal Q-rich domain of TIA-1 with the U1-C splicing factor, despite linear separation of the domains in the TIA-1 sequence. Given the expanded functional repertoire of the RRM family, it was unknown whether TIA-1 RRM1 contributes to RNA binding as well as documented protein interactions. To address this question, we used isothermal titration calorimetry and small-angle X-ray scattering (SAXS) to dissect the roles of the TIA-1 RRMs in RNA recognition. Notably, the fas RNA exhibited two binding sites with indistinguishable affinities for TIA-1. Analyses of TIA-1 variants established that RRM1 was dispensable for binding AU-rich fas sites, yet all three RRMs were required to bind a polyU RNA with high affinity. SAXS analyses demonstrated a `V' shape for a TIA-1 construct comprising the three RRMs, and revealed that its dimensions became more compact in the RNA-bound state. The sequence-selective involvement of TIA-1 RRM1 in RNA recognition suggests a possible role for RNA sequences in regulating the distinct functions of TIA-1. Further implications for U1-C recruitment by the adjacent TIA-1 binding sites of the fas pre-mRNA and the bent TIA-1 shape, which organizes the N- and C-termini on the same side of the protein, are discussed. PMID:22154808

  2. Mechanism of microRNA-target interaction: molecular dynamics simulations and thermodynamics analysis.

    Directory of Open Access Journals (Sweden)

    Yonghua Wang

    Full Text Available MicroRNAs (miRNAs are endogenously produced approximately 21-nt riboregulators that associate with Argonaute (Ago proteins to direct mRNA cleavage or repress the translation of complementary RNAs. Capturing the molecular mechanisms of miRNA interacting with its target will not only reinforce the understanding of underlying RNA interference but also fuel the design of more effective small-interfering RNA strands. To address this, in the present work the RNA-bound (Ago-miRNA, Ago-miRNA-target and RNA-free Ago forms were analyzed by performing both molecular dynamics simulations and thermodynamic analysis. Based on the principal component analysis results of the simulation trajectories as well as the correlation analysis in fluctuations of residues, we discover that: 1 three important (PAZ, Mid and PIWI domains exist in Argonaute which define the global dynamics of the protein; 2 the interdomain correlated movements are so crucial for the interaction of Ago-RNAs that they not only facilitate the relaxation of the interactions between residues surrounding the RNA binding channel but also induce certain conformational changes; and 3 it is just these conformational changes that expand the cavity of the active site and open putative pathways for both the substrate uptake and product release. In addition, by thermodynamic analysis we also discover that for both the guide RNA 5'-end recognition and the facilitated site-specific cleavage of the target, the presence of two metal ions (of Mg(2+ plays a predominant role, and this conclusion is consistent with the observed enzyme catalytic cleavage activity in the ternary complex (Ago-miRNA-mRNA. Our results find that it is the set of arginine amino acids concentrated in the nucleotide-binding channel in Ago, instead of the conventionally-deemed seed base-paring, that makes greater contributions in stabilizing the binding of the nucleic acids to Ago.

  3. Mechanism of microRNA-target interaction: molecular dynamics simulations and thermodynamics analysis.

    Science.gov (United States)

    Wang, Yonghua; Li, Yan; Ma, Zhi; Yang, Wei; Ai, Chunzhi

    2010-07-29

    MicroRNAs (miRNAs) are endogenously produced approximately 21-nt riboregulators that associate with Argonaute (Ago) proteins to direct mRNA cleavage or repress the translation of complementary RNAs. Capturing the molecular mechanisms of miRNA interacting with its target will not only reinforce the understanding of underlying RNA interference but also fuel the design of more effective small-interfering RNA strands. To address this, in the present work the RNA-bound (Ago-miRNA, Ago-miRNA-target) and RNA-free Ago forms were analyzed by performing both molecular dynamics simulations and thermodynamic analysis. Based on the principal component analysis results of the simulation trajectories as well as the correlation analysis in fluctuations of residues, we discover that: 1) three important (PAZ, Mid and PIWI) domains exist in Argonaute which define the global dynamics of the protein; 2) the interdomain correlated movements are so crucial for the interaction of Ago-RNAs that they not only facilitate the relaxation of the interactions between residues surrounding the RNA binding channel but also induce certain conformational changes; and 3) it is just these conformational changes that expand the cavity of the active site and open putative pathways for both the substrate uptake and product release. In addition, by thermodynamic analysis we also discover that for both the guide RNA 5'-end recognition and the facilitated site-specific cleavage of the target, the presence of two metal ions (of Mg(2+)) plays a predominant role, and this conclusion is consistent with the observed enzyme catalytic cleavage activity in the ternary complex (Ago-miRNA-mRNA). Our results find that it is the set of arginine amino acids concentrated in the nucleotide-binding channel in Ago, instead of the conventionally-deemed seed base-paring, that makes greater contributions in stabilizing the binding of the nucleic acids to Ago.

  4. MicroRNA expression analysis of feline and canine parvovirus infection in vivo (felis.

    Directory of Open Access Journals (Sweden)

    Pei Zhou

    Full Text Available Feline panleukopenia is a common contagious disease with high morbidity and mortality. At present, feline parvovirus (FPV and canine parvovirus (CPV variants are the pathogens of feline panleukopenia. Many studies have shown that miRNAs are involved in virus-host interactions. Nevertheless, miRNA expression profiling of FPV (original virus or CPV-2b (new virus in cats has not been reported. To investigate these profiles, three 10-week-old cats were orally inoculated with 106 TCID50 of the viruses (FPV and CPV-2b, and the jejunums of one cat in each group were sectioned for miRNA sequencing at 5 days post-inoculation (dpi. This study is the first attempt to use miRNA analysis to understand the molecular basis of FPV and CPV infection in cats. The miRNA expression profiles of the jejunums of cats infected with FPV and CPV were obtained, and a subset of miRNAs was validated by real-time qPCR. The results show that a variety of metabolism-related pathways, cytokine- and pathogen-host interaction-related pathways, and pathology- and cellar structure-related pathways, as well as others, were affected. Specifically, the JAK-STAT signaling pathway, which is critical for cytokines and growth factors, was enriched. This description of the miRNAs involved in regulating FPV and CPV infection in vivo provides further insight into the mechanisms of viral infection and adaptation and might provide an alternative antiviral strategy for disease control and prevention.

  5. Structural Basis for Polypyrimidine Tract Recognition by the Essential Pre-mRNA Splicing Factor U2AF65

    International Nuclear Information System (INIS)

    Sickmier, E.; Frato, K.; Shen, H.; Paranawithana, S.; Green, M.; Kielkopf, C.

    2006-01-01

    The essential pre-mRNA splicing factor, U2AF 65 , guides the early stages of splice site choice by recognizing a polypyrimidine (Py)-tract consensus sequence near the 3'-splice site. Since Py-tracts are relatively poorly conserved in higher eukaryotes, U2AF 65 is faced with the problem of specifying uridine-rich sequences, yet tolerating a variety of nucleotide substitutions found in natural Py-tracts. To better understand these apparently contradictory RNA binding characteristics, the X-ray structure of the U2AF 65 RNA binding domain bound to a Py-tract composed of seven uridines has been determined at 2.5Angstroms resolution. Specific hydrogen bonds between U2AF 65 and the uracil bases provide an explanation for polyuridine recognition. Flexible sidechains and bound water molecules form the majority of the base contacts, and potentially could rearrange when the U2AF 65 structure adapts to different Py-tract sequences. The energetic importance of conserved residues for Py-tract binding is established by analysis of site-directed mutant U2AF 65 proteins using surface plasmon resonance

  6. Structures of riboswitch RNA reaction states by mix-and-inject XFEL serial crystallography

    Science.gov (United States)

    Stagno, J. R.; Liu, Y.; Bhandari, Y. R.; Conrad, C. E.; Panja, S.; Swain, M.; Fan, L.; Nelson, G.; Li, C.; Wendel, D. R.; White, T. A.; Coe, J. D.; Wiedorn, M. O.; Knoska, J.; Oberthuer, D.; Tuckey, R. A.; Yu, P.; Dyba, M.; Tarasov, S. G.; Weierstall, U.; Grant, T. D.; Schwieters, C. D.; Zhang, J.; Ferré-D'Amaré, A. R.; Fromme, P.; Draper, D. E.; Liang, M.; Hunter, M. S.; Boutet, S.; Tan, K.; Zuo, X.; Ji, X.; Barty, A.; Zatsepin, N. A.; Chapman, H. N.; Spence, J. C. H.; Woodson, S. A.; Wang, Y.-X.

    2017-01-01

    Riboswitches are structural RNA elements that are generally located in the 5‧ untranslated region of messenger RNA. During regulation of gene expression, ligand binding to the aptamer domain of a riboswitch triggers a signal to the downstream expression platform. A complete understanding of the structural basis of this mechanism requires the ability to study structural changes over time. Here we use femtosecond X-ray free electron laser (XFEL) pulses to obtain structural measurements from crystals so small that diffusion of a ligand can be timed to initiate a reaction before diffraction. We demonstrate this approach by determining four structures of the adenine riboswitch aptamer domain during the course of a reaction, involving two unbound apo structures, one ligand-bound intermediate, and the final ligand-bound conformation. These structures support a reaction mechanism model with at least four states and illustrate the structural basis of signal transmission. The three-way junction and the P1 switch helix of the two apo conformers are notably different from those in the ligand-bound conformation. Our time-resolved crystallographic measurements with a 10-second delay captured the structure of an intermediate with changes in the binding pocket that accommodate the ligand. With at least a 10-minute delay, the RNA molecules were fully converted to the ligand-bound state, in which the substantial conformational changes resulted in conversion of the space group. Such notable changes in crystallo highlight the important opportunities that micro- and nanocrystals may offer in these and similar time-resolved diffraction studies. Together, these results demonstrate the potential of ‘mix-and-inject’ time-resolved serial crystallography to study biochemically important interactions between biomacromolecules and ligands, including those that involve large conformational changes.

  7. Structural basis of the non-coding RNA RsmZ acting as a protein sponge.

    Science.gov (United States)

    Duss, Olivier; Michel, Erich; Yulikov, Maxim; Schubert, Mario; Jeschke, Gunnar; Allain, Frédéric H-T

    2014-05-29

    MicroRNA and protein sequestration by non-coding RNAs (ncRNAs) has recently generated much interest. In the bacterial Csr/Rsm system, which is considered to be the most general global post-transcriptional regulatory system responsible for bacterial virulence, ncRNAs such as CsrB or RsmZ activate translation initiation by sequestering homodimeric CsrA-type proteins from the ribosome-binding site of a subset of messenger RNAs. However, the mechanism of ncRNA-mediated protein sequestration is not understood at the molecular level. Here we show for Pseudomonas fluorescens that RsmE protein dimers assemble sequentially, specifically and cooperatively onto the ncRNA RsmZ within a narrow affinity range. This assembly yields two different native ribonucleoprotein structures. Using a powerful combination of nuclear magnetic resonance and electron paramagnetic resonance spectroscopy we elucidate these 70-kilodalton solution structures, thereby revealing the molecular mechanism of the sequestration process and how RsmE binding protects the ncRNA from RNase E degradation. Overall, our findings suggest that RsmZ is well-tuned to sequester, store and release RsmE and therefore can be viewed as an ideal protein 'sponge'.

  8. Structure of a class II TrmH tRNA-modifying enzyme from Aquifex aeolicus

    International Nuclear Information System (INIS)

    Pleshe, Elizabeth; Truesdell, John; Batey, Robert T.

    2005-01-01

    The crystal structure of Aquifex aeolicus TrmH, a member of the a/b-knot superfamily responsible for O methylation of G18 of tRNAs, was determined to 1.85 Å resolution using the molecular-replacement method. Biological RNAs contain a variety of post-transcriptional modifications that facilitate their efficient function in the cellular environment. One of the two most common forms of modification is methylation of the 2′-hydroxyl group of the ribose sugar, which is performed by a number of S-adenosylmethionine (SAM) dependent methyltransferases. In bacteria, many of these modifications in tRNA and rRNA are carried out by the α/β-knot superfamily of enzymes, whose SAM-binding pocket is created by a characteristic deep trefoil knot. TrmH, an enzyme found throughout all three kingdoms of life, modifies the universally conserved guanosine 18 position of tRNA. The crystal structure of TrmH from the thermophilic bacterium Aquifex aeolicus has been determined at 1.85 Å resolution using data collected from a synchrotron-radiation source. The protein reveals a fold typical of members of the SpoU clan of proteins, a subfamily of the α/β-knot superfamily, with α-helical extensions at the N- and C-termini that are likely to be involved in tRNA binding

  9. Viroids: from genotype to phenotype just relying on RNA sequence and structural motifs

    Directory of Open Access Journals (Sweden)

    Ricardo eFlores

    2012-06-01

    Full Text Available As a consequence of two unique physical properties, small size and circularity, viroid RNAs do not code for proteins and thus depend on RNA sequence/structural motifs for interacting with host proteins that mediate their invasion, replication, spread, and circumvention of defensive barriers. Viroid genomes fold up on themselves adopting collapsed secondary structures wherein stretches of nucleotides stabilized by Watson-Crick pairs are flanked by apparently unstructured loops. However, compelling data show that they are instead stabilized by alternative non-canonical pairs and that specific loops in the rod-like secondary structure, characteristic of Potato spindle tuber viroid and most other members of the family Pospiviroidae, are critical for replication and systemic trafficking. In contrast, rather than folding into a rod-like secondary structure, most members of the family Avsunvioidae adopt multibranched conformations occasionally stabilized by kissing loop interactions critical for viroid viability in vivo. Besides these most stable secondary structures, viroid RNAs alternatively adopt during replication transient metastable conformations containing elements of local higher-order structure, prominent among which are the hammerhead ribozymes catalyzing a key replicative step in the family Avsunvioidae, and certain conserved hairpins that also mediate replication steps in the family Pospiviroidae. Therefore, different RNA structures ⎯either global or local ⎯ determine different functions, thus highlighting the need for in-depth structural studies on viroid RNAs.

  10. Systematic identification of non-coding RNA 2,2,7-trimethylguanosine cap structures in Caenorhabditis elegans

    Directory of Open Access Journals (Sweden)

    Skogerbø Geir

    2007-09-01

    Full Text Available Abstract Background The 2,2,7-trimethylguanosine (TMG cap structure is an important functional characteristic of ncRNAs with critical cellular roles, such as some snRNAs. Here we used immunoprecipitation with both K121 and R1131 anti-TMG antibodies to systematically identify the TMG cap structures for all presently characterized ncRNAs in C. elegans. Results The two anti-TMG antibodies precipitated a similar group of the C. elegans ncRNAs. All snRNAs known to have a TMG cap structure were found in the precipitate, indicating that our identification system was efficient. Other ncRNA families related to splicing, such as SL RNAs and Sm Y RNAs, were also found in the precipitate, as were 7 C/D box snoRNAs. Further analysis showed that the SL RNAs and the Sm Y RNAs shared a very similar Sm binding site element (AAU4–5GGA, which sequence composition differed somewhat from those of other U snRNAs. There were also 16 ncRNAs without an Sm binding site element in the precipitate, suggesting that for these ncRNAs, TMG formation may occur independently of Sm proteins. Conclusion Our results showed that most ncRNAs predicted to be transcribed by RNA polymerase II had a TMG cap, while those predicted to be transcribed by RNA plymerase III or located in introns did not have a TMG cap structure. Compared to ncRNAs without a TMG cap, TMG-capped ncRNAs tended to have higher expression levels. Five functionally non-annotated ncRNAs also have a TMG cap structure, which might be helpful for identifying the cellular roles of these ncRNAs.

  11. Structural biology. Structures of the CRISPR-Cmr complex reveal mode of RNA target positioning

    NARCIS (Netherlands)

    Taylor, D.W.; Zhu, Y.; Staals, R.H.J.; Kornfeld, J.E.; Shinkai, A.; Oost, van der J.; Nogales, E.; Doudna, J.A.

    2015-01-01

    Adaptive immunity in bacteria involves RNA-guided surveillance complexes that use CRISPR (clustered regularly interspaced short palindromic repeats)-associated (Cas) proteins together with CRISPR RNAs (crRNAs) to target invasive nucleic acids for degradation. Whereas type I and type II CRISPR-Cas

  12. Predicting Consensus Structures for RNA Alignments Via Pseudo-Energy Minimization