viral non-coding sequence: Topics by WorldWideScience.org

Sample records for viral non-coding sequence

Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

Science.gov (United States)

Hoffmann, Robert D; Palmgren, Michael

2016-06-13

Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.
Highly conserved non-coding sequences are associated with vertebrate development.

Directory of Open Access Journals (Sweden)

Adam Woolfe

2005-01-01

In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH), in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development
gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.

Science.gov (United States)

Nakagawa, So; Takahashi, Mahoko Ueda

2016-01-01

In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species. Database URL: http://geve.med.u-tokai.ac.jp. © The Author(s) 2016. Published by Oxford University Press.
Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

Directory of Open Access Journals (Sweden)

Maggi Giorgio P

2008-06-01

Background: The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent) on the availability of annotated proteins. Results: In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion: Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence and is thus suitable for the analysis of new or poorly annotated genomes.
DNA watermarks in non-coding regulatory sequences

Directory of Open Access Journals (Sweden)

Pyka Martin

2009-07-01

Background: DNA watermarks can be applied to identify the unauthorized use of genetically modified organisms. It has been shown that coding regions can be used to encrypt information into living organisms by using the DNA-Crypt algorithm. Yet, if the sequence of interest is a non-coding DNA sequence, either the function of a resulting functional RNA molecule or a regulatory sequence, such as a promoter, could be affected. For our studies we used the small cytoplasmic RNA 1 in yeast and the lac promoter region of Escherichia coli. Findings: The lac promoter was deactivated by the integrated watermark. In addition, the RNA molecules displayed altered configurations after introducing a watermark, but surprisingly were functionally intact, which has been verified by analyzing the growth characteristics of both wild type and watermarked scR1 transformed yeast cells. In a third approach we introduced a second overlapping watermark into the lac promoter, which did not affect the promoter activity. Conclusion: Even though the watermarked RNA and one of the watermarked promoters did not show any significant differences compared to the wild type RNA and wild type promoter region, respectively, it cannot be generalized that other RNA molecules or regulatory sequences behave accordingly. Therefore, we do not recommend integrating watermark sequences into regulatory regions.
Dengue virus genomic variation associated with mosquito adaptation defines the pattern of viral non-coding RNAs and fitness in human cells.

Directory of Open Access Journals (Sweden)

Claudia V Filomatori

2017-03-01

The Flavivirus genus includes a large number of medically relevant pathogens that cycle between humans and arthropods. This host alternation imposes a selective pressure on the viral population. Here, we found that dengue virus, the most important viral human pathogen transmitted by insects, evolved a mechanism to differentially regulate the production of viral non-coding RNAs in mosquitos and humans, with a significant impact on viral fitness in each host. Flavivirus infections accumulate non-coding RNAs derived from the viral 3'UTRs (known as sfRNAs), relevant in viral pathogenesis and immune evasion. We found that dengue virus host adaptation leads to the accumulation of different species of sfRNAs in vertebrate and invertebrate cells. This process does not depend on differences in the host machinery; but it was found to be dependent on the selection of specific mutations in the viral 3'UTR. Dissecting the viral population and studying phenotypes of cloned variants, the molecular determinants for the switch in the sfRNA pattern during host change were mapped to a single RNA structure. Point mutations selected in mosquito cells were sufficient to change the pattern of sfRNAs, induce higher type I interferon responses and reduce viral fitness in human cells, explaining the rapid clearance of certain viral variants after host change. In addition, using epidemic and pre-epidemic Zika viruses, similar patterns of sfRNAs were observed in mosquito and human infected cells, but they were different from those observed during dengue virus infections, indicating that distinct selective pressures act on the 3'UTR of these closely related viruses. In summary, we present a novel mechanism by which dengue virus evolved an RNA structure that is under strong selective pressure in the two hosts, as regulator of non-coding RNA accumulation and viral fitness. This work provides new ideas about the impact of host adaptation on the variability and evolution of
Comparative Annotation of Viral Genomes with Non-Conserved Gene Structure

DEFF Research Database (Denmark)

de Groot, Saskia; Mailund, Thomas; Hein, Jotun

2007-01-01

Motivation: Detecting genes in viral genomes is a complex task. Due to the biological necessity of them being constrained in length, RNA viruses in particular tend to code in overlapping reading frames. Since one amino acid is encoded by a triplet of nucleic acids, up to three genes may be coded...... allows for coding in unidirectional nested and overlapping reading frames, to annotate two homologous aligned viral genomes. Our method does not insist on conserved gene structure between the two sequences, thus making it applicable for the pairwise comparison of more distantly related sequences. Results...... and HIV2, as well as of two different Hepatitis Viruses, attaining results of ~87% sensitivity and ~98.5% specificity. We subsequently incorporate prior knowledge by "knowing" the gene structure of one sequence and annotating the other conditional on it. Boosting accuracy close to perfect we demonstrate...
Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

LENUS (Irish Health Repository)

Ivanov, Ivaylo P

2011-05-01

In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.
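The notion of "codons differing from AUG by a single nucleotide" can be made concrete with a short sketch. The Python below is illustrative only and is not the phylogenetic method of the record above: it enumerates the nine near-cognate codons and lists those that lie in frame with an annotated start, with no intervening in-frame stop codon. The function names and the assumption that the annotated AUG begins immediately after the supplied 5' UTR string are hypothetical.

```python
BASES = "ACGU"
STOPS = {"UAA", "UAG", "UGA"}


def near_cognate_codons(start="AUG"):
    """Return every codon that differs from `start` at exactly one position."""
    codons = []
    for pos in range(3):
        for base in BASES:
            if base != start[pos]:
                codons.append(start[:pos] + base + start[pos + 1:])
    return sorted(codons)


def upstream_in_frame_candidates(utr5):
    """List (offset, codon) pairs for near-cognate codons in `utr5`.

    Assumption (hypothetical setup): the annotated AUG starts right after
    `utr5`, so in-frame codons begin at len(utr5) - 3, len(utr5) - 6, ...
    The walk stops at the first in-frame stop codon, because an extension
    cannot read through it.
    """
    candidates = set(near_cognate_codons())
    hits = []
    for i in range(len(utr5) - 3, -1, -3):
        codon = utr5[i:i + 3]
        if codon in STOPS:
            break
        if codon in candidates:
            hits.append((i, codon))
    return hits[::-1]  # report in 5'-to-3' order


if __name__ == "__main__":
    print(near_cognate_codons())
    print(upstream_in_frame_candidates("UAGCUGGCCACGGCA"))
```

A candidate found this way is only a starting point; the record above relies on conservation across orthologous genes to decide whether an extension is likely to be translated.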
Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions

DEFF Research Database (Denmark)

Lavstsen, Thomas; Salanti, Ali; Jensen, Anja T R

2003-01-01

and organization of the 3D7 PfEMP1 repertoire was investigated on the basis of the complete genome sequence. METHODS: Using two tree-building methods we analysed the coding and non-coding sequences of 3D7 var and rif genes as well as var genes of other parasite strains. RESULTS: var genes can be sub...
Automated degenerate PCR primer design for high-throughput sequencing improves efficiency of viral sequencing

Directory of Open Access Journals (Sweden)

Li Kelvin

2012-11-01

Background: In a high-throughput environment, to PCR amplify and sequence a large set of viral isolates from populations that are potentially heterogeneous and continuously evolving, the use of degenerate PCR primers is an important strategy. Degenerate primers allow for the PCR amplification of a wider range of viral isolates with only one set of pre-mixed primers, thus increasing amplification success rates and minimizing the necessity for genome finishing activities. To successfully select a large set of degenerate PCR primers necessary to tile across an entire viral genome and maximize their success, this process is best performed computationally. Results: We have developed a fully automated degenerate PCR primer design system that plays a key role in the J. Craig Venter Institute’s (JCVI) high-throughput viral sequencing pipeline. A consensus viral genome, or a set of consensus segment sequences in the case of a segmented virus, is specified using IUPAC ambiguity codes in the consensus template sequence to represent the allelic diversity of the target population. PCR primer pairs are then selected computationally to produce a minimal amplicon set capable of tiling across the full length of the specified target region. As part of the tiling process, primer pairs are computationally screened to meet the criteria for successful PCR with one of two described amplification protocols. The actual sequencing success rates for designed primers for measles virus, mumps virus, human parainfluenza virus 1 and 3, human respiratory syncytial virus A and B and human metapneumovirus are described, where >90% of designed primer pairs were able to consistently successfully amplify >75% of the isolates. Conclusions: Augmenting our previously developed and published JCVI Primer Design Pipeline, we achieved similarly high sequencing success rates with only minor software modifications. The recommended methodology for the construction of the consensus
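As a small illustration of what a degenerate primer written in IUPAC ambiguity codes stands for, the sketch below expands such a primer into the concrete sequences it represents and computes its degeneracy (the number of those sequences). It is not part of the JCVI pipeline described above; the function names are hypothetical, and only the standard IUPAC nucleotide code table is assumed.

```python
from itertools import product

# Standard IUPAC nucleotide ambiguity codes (DNA alphabet).
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG", "N": "ACGT",
}


def degeneracy(primer):
    """Number of distinct concrete sequences encoded by a degenerate primer."""
    n = 1
    for base in primer.upper():
        n *= len(IUPAC[base])
    return n


def expand(primer):
    """Enumerate every concrete sequence a degenerate primer can stand for."""
    pools = [IUPAC[base] for base in primer.upper()]
    return ["".join(combo) for combo in product(*pools)]


if __name__ == "__main__":
    primer = "ACRTNY"  # made-up example primer, not from the source
    print(degeneracy(primer))
    print(expand("ARY"))
```

Because degeneracy grows multiplicatively with each ambiguous position, a design system would typically cap it; any particular cap would be a design choice not stated in the record above.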
Short sequence motifs, overrepresented in mammalian conserved non-coding sequences

Energy Technology Data Exchange (ETDEWEB)

Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov, Alexey S.; Dubchak, Inna

2007-02-21

Background: A substantial fraction of non-coding DNA sequences of multicellular eukaryotes is under selective constraint. In particular, ~5 percent of the human genome consists of conserved non-coding sequences (CNSs). CNSs differ from other genomic sequences in their nucleotide composition and must play important functional roles, which mostly remain obscure. Results: We investigated relative abundances of short sequence motifs in all human CNSs present in the human/mouse whole-genome alignments vs. three background sets of sequences: (i) weakly conserved or unconserved non-coding sequences (non-CNSs); (ii) near-promoter sequences (located between nucleotides -500 and -1500, relative to a start of transcription); and (iii) random sequences with the same nucleotide composition as that of CNSs. When compared to non-CNSs and near-promoter sequences, CNSs possess an excess of AT-rich motifs, often containing runs of identical nucleotides. In contrast, when compared to random sequences, CNSs contain an excess of GC-rich motifs which, however, lack CpG dinucleotides. Thus, abundance of short sequence motifs in human CNSs, taken as a whole, is mostly determined by their overall compositional properties and not by overrepresentation of any specific short motifs. These properties are: (i) high AT-content of CNSs, (ii) a tendency, probably due to context-dependent mutation, of A's and T's to clump, (iii) presence of short GC-rich regions, and (iv) avoidance of CpG contexts, due to their hypermutability. Only a small number of short motifs, overrepresented in all human CNSs are similar to binding sites of transcription factors from the FOX family. Conclusion: Human CNSs as a whole appear to be too broad a class of sequences to possess strong footprints of any short sequence-specific functions. Such footprints should be studied at the level of functional subclasses of CNSs, such as those which flank genes with a particular pattern of expression. Overall properties of CNSs are affected by
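The compositional quantities named in this record (AT-content, relative abundance of a short motif against a background set, and CpG avoidance) can be sketched in a few lines of Python. This is a minimal illustration, not the authors' procedure: the function names are hypothetical, k-mers are counted with overlapping windows on a single strand, and CpG depletion is measured with the commonly used observed/expected ratio (CpG count × length) / (C count × G count), which is an assumption rather than something the abstract specifies.

```python
from collections import Counter


def kmer_counts(seq, k):
    """Count overlapping k-mers on the given strand."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def at_content(seq):
    """Fraction of A and T nucleotides in the sequence."""
    seq = seq.upper()
    return (seq.count("A") + seq.count("T")) / len(seq)


def cpg_obs_exp(seq):
    """Observed/expected CpG ratio; values well below 1 indicate CpG depletion."""
    seq = seq.upper()
    c, g = seq.count("C"), seq.count("G")
    if c == 0 or g == 0:
        return 0.0
    cpg = sum(1 for i in range(len(seq) - 1) if seq[i:i + 2] == "CG")
    return cpg * len(seq) / (c * g)


def enrichment(motif, foreground, background):
    """Ratio of a motif's relative frequency in foreground vs. background."""
    k = len(motif)

    def freq(seq):
        counts = kmer_counts(seq, k)
        total = sum(counts.values())
        return counts[motif.upper()] / total if total else 0.0

    fg, bg = freq(foreground), freq(background)
    return fg / bg if bg else float("inf")


if __name__ == "__main__":
    cns = "AAATTTAAATGCCATTTTA"      # toy sequences, not real data
    background = "ACGTACGGCTAGCATCGA"
    print(at_content(cns), cpg_obs_exp(cns))
    print(enrichment("AAA", cns, background))
```

A raw frequency ratio like this says nothing about statistical significance; comparing against composition-matched random sequences, as the record describes for its third background set, is one way to separate motif-specific signal from overall base composition.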
Non-codingRNA sequence variations in human chronic lymphocytic leukemia and colorectal cancer.

Science.gov (United States)

Wojcik, Sylwia E; Rossi, Simona; Shimizu, Masayoshi; Nicoloso, Milena S; Cimmino, Amelia; Alder, Hansjuerg; Herlea, Vlad; Rassenti, Laura Z; Rai, Kanti R; Kipps, Thomas J; Keating, Michael J; Croce, Carlo M; Calin, George A

2010-02-01

Cancer is a genetic disease in which the interplay between alterations in protein-coding genes and non-coding RNAs (ncRNAs) plays a fundamental role. In recent years, the full coding component of the human genome was sequenced in various cancers, whereas such attempts related to ncRNAs are still fragmentary. We screened genomic DNAs for sequence variations in 148 microRNAs (miRNAs) and ultraconserved regions (UCRs) loci in patients with chronic lymphocytic leukemia (CLL) or colorectal cancer (CRC) by Sanger technique and further tried to elucidate the functional consequences of some of these variations. We found sequence variations in miRNAs in both sporadic and familial CLL cases, mutations of UCRs in CLLs and CRCs and, in certain instances, detected functional effects of these variations. Furthermore, by integrating our data with previously published data on miRNA sequence variations, we have created a catalog of DNA sequence variations in miRNAs/ultraconserved genes in human cancers. These findings argue that ncRNAs are targeted by both germ line and somatic mutations as well as by single-nucleotide polymorphisms with functional significance for human tumorigenesis. Sequence variations in ncRNA loci are frequent and some have functional and biological significance. Such information can be exploited to further investigate on a genome-wide scale the frequency of genetic variations in ncRNAs and their functional meaning, as well as for the development of new diagnostic and prognostic markers for leukemias and carcinomas.
Detecting non-coding selective pressure in coding regions

Directory of Open Access Journals (Sweden)

Blanchette Mathieu

2007-02-01

Background: Comparative genomics approaches, where orthologous DNA regions are compared and inter-species conserved regions are identified, have proven extremely powerful for identifying non-coding regulatory regions located in intergenic or intronic regions. However, non-coding functional elements can also be located within coding regions, as is common for exonic splicing enhancers, some transcription factor binding sites, and RNA secondary structure elements affecting mRNA stability, localization, or translation. Since these functional elements are located in regions that are themselves highly conserved because they are coding for a protein, they generally escaped detection by comparative genomics approaches. Results: We introduce a comparative genomics approach for detecting non-coding functional elements located within coding regions. Codon evolution is modeled as a mixture of codon substitution models, where each component of the mixture describes the evolution of codons under a specific type of coding selective pressure. We show how to compute the posterior distribution of the entropy and parsimony scores under this null model of codon evolution. The method is applied to a set of growth hormone 1 orthologous mRNA sequences and a known exonic splicing element is detected. The analysis of a set of CORTBP2 orthologous genes reveals a region of several hundred base pairs under strong non-coding selective pressure whose function remains unknown. Conclusion: Non-coding functional elements, in particular those involved in post-transcriptional regulation, are likely to be much more prevalent than is currently known. With the numerous genome sequencing projects underway, comparative genomics approaches like that proposed here are likely to become increasingly powerful at detecting such elements.
Viral Metagenomics: MetaView Software

Energy Technology Data Exchange (ETDEWEB)

Zhou, C; Smith, J

2007-10-22

The purpose of this report is to design and develop a tool for analysis of raw sequence read data from viral metagenomics experiments. The tool should compare read sequences of known viral nucleic acid sequence data and enable a user to attempt to determine, with some degree of confidence, what virus groups may be present in the sample. This project was conducted in two phases. In phase 1 we surveyed the literature and examined existing metagenomics tools to educate ourselves and to more precisely define the problem of analyzing raw read data from viral metagenomic experiments. In phase 2 we devised an approach and built a prototype code and database. This code takes viral metagenomic read data in fasta format as input and accesses all complete viral genomes from Kpath for sequence comparison. The system executes at the UNIX command line, producing output that is stored in an Oracle relational database. We provide here a description of the approach we came up with for handling un-assembled, short read data sets from viral metagenomics experiments. We include a discussion of the current MetaView code capabilities and additional functionality that we believe should be added, should additional funding be acquired to continue the work.
Combining genomic sequencing methods to explore viral diversity and reveal potential virus-host interactions

Directory of Open Access Journals (Sweden)

Cheryl-Emiliane Tien Chow

2015-04-01

Viral diversity and virus-host interactions in oxygen-starved regions of the ocean, also known as oxygen minimum zones (OMZs), remain relatively unexplored. Microbial community metabolism in OMZs alters nutrient and energy flow through marine food webs, resulting in biological nitrogen loss and greenhouse gas production. Thus, viruses infecting OMZ microbes have the potential to modulate community metabolism with resulting feedback on ecosystem function. Here, we describe viral communities inhabiting oxic surface (10 m) and oxygen-starved basin (200 m) waters of Saanich Inlet, a seasonally anoxic fjord on the coast of Vancouver Island, British Columbia, using viral metagenomics and complete viral fosmid sequencing on samples collected between April 2007 and April 2010. Of 6459 open reading frames (ORFs) predicted across all 34 viral fosmids, 77.6% (n=5010) had no homology to reference viral genomes. These fosmids recruited a higher proportion of viral metagenomic sequences from Saanich Inlet than from nearby northeastern subarctic Pacific Ocean (Line P) waters, indicating differences in the viral communities between coastal and open ocean locations. While functional annotations of fosmid ORFs were limited, recruitment to NCBI’s non-redundant ‘nr’ database and publicly available single-cell genomes identified putative viruses infecting marine thaumarchaeal and SUP05 proteobacteria to provide potential host linkages with relevance to coupled biogeochemical cycling processes in OMZ waters. Taken together, these results highlight the power of coupled analyses of multiple sequence data types, such as viral metagenomic and fosmid sequence data with prokaryotic single cell genomes, to chart viral diversity, elucidate genomic and ecological contexts for previously unclassifiable viral sequences, and identify novel host interactions in natural and engineered ecosystems.
Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

Science.gov (United States)

Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

1995-01-01

We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) an n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.
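The two tests mentioned in this record can be sketched roughly as follows. This is an illustrative reading, not the authors' code: n-grams are taken with overlapping windows, the n-gram entropy is the Shannon entropy of the observed n-gram frequencies in bits, and redundancy is taken as 1 - H(n) / (n log2 4), which is one common definition and is assumed here rather than quoted from the paper. The function names are hypothetical.

```python
import math
from collections import Counter


def ngram_counts(seq, n):
    """Count overlapping n-grams (n-tuples) in a sequence."""
    seq = seq.upper()
    return Counter(seq[i:i + n] for i in range(len(seq) - n + 1))


def ngram_entropy(seq, n):
    """Shannon entropy, in bits, of the observed n-gram distribution."""
    counts = ngram_counts(seq, n)
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def ngram_redundancy(seq, n, alphabet_size=4):
    """Assumed definition: 1 - H(n) / (n * log2(alphabet_size))."""
    return 1.0 - ngram_entropy(seq, n) / (n * math.log2(alphabet_size))


def zipf_ranks(seq, n):
    """(rank, count) pairs with n-grams ordered from most to least frequent."""
    ordered = ngram_counts(seq, n).most_common()
    return [(rank, count) for rank, (_, count) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    toy = "ATATATATGCGCATATATTA"  # toy sequence, not real data
    print(ngram_entropy(toy, 2), ngram_redundancy(toy, 2))
    print(zipf_ranks(toy, 2))
```

On a log-log plot of count against rank, a roughly straight line would correspond to the power-law regime the record reports for noncoding regions. For short sequences and large n the entropy estimate is biased downward because most n-grams are never observed, which is one of the intrinsic limitations the abstract alludes to.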
SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

Science.gov (United States)

Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

2016-06-15

Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns, called read mapping profiles, can be detected; these are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. Contact: yasu@bio.keio.ac.jp. Supplementary data are available
Identification of coding and non-coding mutational hotspots in cancer genomes.

Science.gov (United States)

Piraino, Scott W; Furney, Simon J

2017-01-05

The identification of mutations that play a causal role in tumour development, so-called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutational hotspots and can be used to differentiate candidate driver regions from
A viral metagenomic approach on a non-metagenomic experiment: Mining next generation sequencing datasets from pig DNA identified several porcine parvoviruses for a retrospective evaluation of viral infections.

Directory of Open Access Journals (Sweden)

Samuele Bovo

Shot-gun next generation sequencing (NGS) on whole DNA extracted from specimens collected from mammals often produces reads that are not mapped (i.e. unmapped reads) on the host reference genome and that are usually discarded as by-products of the experiments. In this study, we mined Ion Torrent reads obtained by sequencing DNA isolated from archived blood samples collected from 100 performance tested Italian Large White pigs. Two reduced representation libraries were prepared from two DNA pools constructed each from 50 equimolar DNA samples. Bioinformatic analyses were carried out to mine unmapped reads on the reference pig genome that were obtained from the two NGS datasets. In silico analyses included read mapping and sequence assembly approaches for a viral metagenomic analysis using the NCBI Viral Genome Resource. Our approach identified sequences matching several viruses of the Parvoviridae family: porcine parvovirus 2 (PPV2), PPV4, PPV5 and PPV6 and porcine bocavirus 1-H18 isolate (PBoV1-H18). The presence of these viruses was confirmed by PCR and Sanger sequencing of individual DNA samples. PPV2, PPV4, PPV5, PPV6 and PBoV1-H18 were all identified in samples collected in 1998-2007, 1998-2000, 1997-2000, 1998-2004 and 2003, respectively. For most of these viruses (PPV4, PPV5, PPV6 and PBoV1-H18) previous studies reported their first occurrence much later (from 5 to more than 10 years) than our identification period and in different geographic areas. Our study provided a retrospective evaluation of apparently asymptomatic parvovirus infected pigs, providing information that could be important to define occurrence and prevalence of different parvoviruses in Southern Europe. This study demonstrated the potential of mining NGS datasets not originally derived from metagenomics experiments for viral metagenomics analyses in a livestock species.
Sequence-based heuristics for faster annotation of non-coding RNA families.

Science.gov (United States)

Weinberg, Zasha; Ruzzo, Walter L

2006-01-01

Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. The source code is available under GNU Public License at the supplementary web site.

Sequencing illustrates the transcriptional response of Legionella pneumophila during infection and identifies seventy novel small non-coding RNAs.

LENUS (Irish Health Repository)

Weissenmayer, Barbara A

2011-01-01

Second generation sequencing has prompted a number of groups to re-interrogate the transcriptomes of several bacterial and archaeal species. One of the central findings has been the identification of complex networks of small non-coding RNAs that play central roles in transcriptional regulation in all growth conditions and for the pathogen's interaction with and survival within host cells. Legionella pneumophila is a gram-negative facultative intracellular human pathogen with a distinct biphasic lifestyle. One of its primary environmental hosts is the free-living amoeba Acanthamoeba castellanii, and its infection by L. pneumophila mimics that seen in human macrophages. Here we present analysis of strand specific sequencing of the transcriptional response of L. pneumophila during exponential and post-exponential broth growth and during the replicative and transmissive phase of infection inside A. castellanii. We extend previous microarray based studies as well as uncovering evidence of a complex regulatory architecture underpinned by numerous non-coding RNAs. Over seventy new non-coding RNAs could be identified; many of them appear to be strain specific and in configurations not previously reported. We discover a family of non-coding RNAs preferentially expressed during infection conditions and identify a second copy of 6S RNA in L. pneumophila. We show that the newly discovered putative 6S RNA as well as a number of other non-coding RNAs show evidence for antisense transcription. The nature and extent of the non-coding RNAs and their expression patterns suggest that these may well play central roles in the regulation of Legionella spp. specific traits and offer clues as to how L. pneumophila adapts to its intracellular niche. The expression profiles outlined in the study have been deposited into Genbank's Gene Expression Omnibus (GEO) database under the series accession GSE27232.
Viral metagenomics: Analysis of begomoviruses by illumina high-throughput sequencing

KAUST Repository

Idris, Ali; Al-Saleh, Mohammed; Piatek, Marek J.; Al-Shahwan, Ibrahim; Ali, Shahjahan; Brown, Judith K.

2014-01-01

Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and are ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant
Non-Viral Risk Factors in Nasopharyngeal Carcinoma (Faktor Risiko Non Viral Pada Karsinoma Nasofaring)

Directory of Open Access Journals (Sweden)

Sukri Rahman

2015-09-01

Background: Nasopharyngeal carcinoma is a malignant tumor of the nasopharyngeal epithelium whose cause remains unknown. Epstein-Barr virus infection has been reported as the dominant factor in its occurrence, but non-viral factors also contribute to the onset of nasopharyngeal malignancy. Purpose: To identify the non-viral factors that may increase the incidence of nasopharyngeal carcinoma, so that these factors can be prevented and avoided. Literature review: Nasopharyngeal carcinoma is a malignant tumor of the nasopharyngeal epithelium whose cause is associated with viral and non-viral factors, including cigarette smoke, salted fish, formaldehyde, genetics, firewood smoke, wood dust, chronic ear, nose and throat infections, alcohol, and traditional medicine. Conclusion: Clinical and scientific proof of non-viral factors as a cause of nasopharyngeal carcinoma cannot yet be established with certainty. Non-viral factors are among the risk factors that can increase the incidence of nasopharyngeal malignancy. Keywords: nasopharyngeal carcinoma, risk factors, non-viral.
Annotating pathogenic non-coding variants in genic regions.

Science.gov (United States)

Gelfman, Sahar; Wang, Quanli; McSweeney, K Melodi; Ren, Zhong; La Carpia, Francesca; Halvorsen, Matt; Schoch, Kelly; Ratzon, Fanni; Heinzen, Erin L; Boland, Michael J; Petrovski, Slavé; Goldstein, David B

2017-08-09

Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes. While non-coding synonymous and intronic variants are often not under strong selective constraint, they can be pathogenic through affecting splicing or transcription. Here, the authors develop a score that uses sequence context alterations to predict pathogenicity of synonymous and non-coding genetic variants, and provide a web server of pre-computed scores.
Viral metagenomics: Analysis of begomoviruses by illumina high-throughput sequencing

KAUST Repository

Idris, Ali

2014-03-12

Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and are ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes) (genus, Begomovirus; family, Geminiviridae) were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA). Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS). CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, makes possible deep sequence coverage at a lower cost, the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally infected with begomoviruses under field conditions. © 2014 by the authors; licensee MDPI, Basel, Switzerland.
Viral Metagenomics: Analysis of Begomoviruses by Illumina High-Throughput Sequencing

Directory of Open Access Journals (Sweden)

Ali Idris

2014-03-01

Traditional DNA sequencing methods are inefficient, lack the ability to discern the least abundant viral sequences, and are ineffective for determining the extent of variability in viral populations. Here, populations of single-stranded DNA plant begomoviral genomes and their associated beta- and alpha-satellite molecules (virus-satellite complexes) (genus, Begomovirus; family, Geminiviridae) were enriched from total nucleic acids isolated from symptomatic, field-infected plants, using rolling circle amplification (RCA). Enriched virus-satellite complexes were subjected to Illumina-Next Generation Sequencing (NGS). CASAVA and SeqMan NGen programs were implemented, respectively, for quality control and for de novo and reference-guided contig assembly of viral-satellite sequences. The authenticity of the begomoviral sequences, and the reproducibility of the Illumina-NGS approach for begomoviral deep sequencing projects, were validated by comparing NGS results with those obtained using traditional molecular cloning and Sanger sequencing of viral components and satellite DNAs, also enriched by RCA or amplified by polymerase chain reaction. As the use of NGS approaches, together with advances in software development, makes possible deep sequence coverage at a lower cost, the approach described herein will streamline the exploration of begomovirus diversity and population structure from naturally infected plants, irrespective of viral abundance. This is the first report of the implementation of Illumina-NGS to explore the diversity and identify begomoviral-satellite SNPs directly from plants naturally infected with begomoviruses under field conditions.
Functional interrogation of non-coding DNA through CRISPR genome editing.

Science.gov (United States)

Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

2017-05-15

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
Annotating non-coding regions of the genome.

Science.gov (United States)

Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

2010-08-01

Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.
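The first steps named in this abstract (a per-base signal, smoothing, then segmentation into small blocks) can be sketched roughly as follows. This is only an illustrative Python sketch: the moving-average smoother, the fixed threshold, and the names `smooth` and `segment` are assumptions made here and are not taken from the paper.

```python
from typing import List, Tuple


def smooth(signal: List[float], window: int = 3) -> List[float]:
    """Centered moving average; the window shrinks at the sequence ends."""
    half = window // 2
    out = []
    for i in range(len(signal)):
        lo = max(0, i - half)
        hi = min(len(signal), i + half + 1)
        out.append(sum(signal[lo:hi]) / (hi - lo))
    return out


def segment(signal: List[float], threshold: float) -> List[Tuple[int, int]]:
    """Return half-open [start, end) blocks where the signal is >= threshold."""
    blocks = []
    start = None
    for i, value in enumerate(signal):
        if value >= threshold and start is None:
            start = i                      # a block opens here
        elif value < threshold and start is not None:
            blocks.append((start, i))      # the block closes before position i
            start = None
    if start is not None:                  # block runs to the end of the signal
        blocks.append((start, len(signal)))
    return blocks


# Hypothetical per-base signal (e.g. read depth at 11 consecutive positions).
raw = [0, 0, 5, 6, 5, 0, 0, 0, 4, 4, 0]
blocks = segment(smooth(raw, window=3), threshold=3.0)
print(blocks)
```

With these toy values the narrow second peak is flattened below the threshold by smoothing, which illustrates why the choice of window and threshold matters before any later clustering of blocks.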
CRISPR/Cas9-mediated viral interference in plants

KAUST Repository

Ali, Zahir

2015-11-11

Background: The CRISPR/Cas9 system provides bacteria and archaea with molecular immunity against invading phages and conjugative plasmids. Recently, CRISPR/Cas9 has been used for targeted genome editing in diverse eukaryotic species. Results: In this study, we investigate whether the CRISPR/Cas9 system could be used in plants to confer molecular immunity against DNA viruses. We deliver sgRNAs specific for coding and non-coding sequences of tomato yellow leaf curl virus (TYLCV) into Nicotiana benthamiana plants stably overexpressing the Cas9 endonuclease, and subsequently challenge these plants with TYLCV. Our data demonstrate that the CRISPR/Cas9 system targeted TYLCV for degradation and introduced mutations at the target sequences. All tested sgRNAs exhibit interference activity, but those targeting the stem-loop sequence within the TYLCV origin of replication in the intergenic region (IR) are the most effective. N. benthamiana plants expressing CRISPR/Cas9 exhibit delayed or reduced accumulation of viral DNA, abolishing or significantly attenuating symptoms of infection. Moreover, this system could simultaneously target multiple DNA viruses. Conclusions: These data establish the efficacy of the CRISPR/Cas9 system for viral interference in plants, thereby extending the utility of this technology and opening the possibility of producing plants resistant to multiple viral infections.
Improved osteogenic vector for non-viral gene therapy

Directory of Open Access Journals (Sweden)

ARA Hacobian

2016-03-01

Therapeutic compensation of deficient bone regeneration is a challenging task and a topic of on-going search for novel treatment strategies. One promising approach for improvement involves non-viral gene delivery using the bone morphogenetic protein-2 (BMP-2) gene to provide transient, local and sustained expression of the growth factor. However, since efficiency of non-viral gene delivery is low, this study focused on the improvement of a BMP-2 gene expression system, aiming for compensation of poor transfection efficiency. First, the native BMP-2 gene sequence was modified by codon optimisation and altered by inserting a highly truncated artificial intron (96 bp). Transfection of multiple cell lines and rat adipose-derived mesenchymal stem cells with plasmids harbouring the improved BMP-2 sequence led to a several-fold increased expression rate and subsequent osteogenic differentiation. Additionally, comparing expression kinetics of elongation factor 1 alpha (EF1α) promoter with a state of the art CMV promoter revealed significantly higher BMP-2 expression when under the influence of the EF1α promoter. Results obtained by quantification of bone markers as well as osteogenic assays showed reduced sensitivity to promoter silencing effects of the EF1α promoter in rat adipose-derived mesenchymal stem cells. Finally, screening of several protein secretion signals using either luciferase or BMP-2 as reporter protein revealed no superior candidates for potential replacement of the native BMP-2 secretion signal. Taken together, by enhancing the exogenous BMP-2 expression system, low transfection efficiencies in therapeutic applications can be compensated, making safe non-viral systems even more suitable for tissue regeneration approaches.
Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data.

Science.gov (United States)

Ragan, Chikako; Mowry, Bryan J; Bauer, Denis C

2012-09-01

Recent advances in RNA sequencing technology (RNA-Seq) enable comprehensive profiling of RNAs by producing millions of short sequence reads from size-fractionated RNA libraries. Although conventional tools for detecting and distinguishing non-coding RNAs (ncRNAs) from reference-genome data can be applied to sequence data, ncRNA detection can be improved by harnessing the full information content provided by this new technology. Here we present NorahDesk, the first unbiased and universally applicable method for small ncRNA detection from RNA-Seq data. NorahDesk utilizes the coverage distribution of small RNA sequence data as well as thermodynamic assessments of secondary structure to reliably predict and annotate ncRNA classes. Using publicly available mouse sequence data from brain, skeletal muscle, testis and ovary, we evaluated our method with an emphasis on the performance for microRNAs (miRNAs) and piwi-interacting small RNAs (piRNAs). We compared our method with Dario and mirDeep2 and found that NorahDesk produces longer transcripts with higher read coverage. This feature makes it the first method particularly suitable for the prediction of both known and novel piRNAs.
Determining mutant spectra of three RNA viral samples using ultra-deep sequencing

Energy Technology Data Exchange (ETDEWEB)

Chen, H

2012-06-06

RNA viruses have extremely high mutation rates that enable the virus to adapt to new host environments and even jump from one species to another. As part of a viral transmission study, three viral samples collected from naturally infected animals were sequenced using Illumina paired-end technology at ultra-deep coverage. In order to determine the mutant spectra within the viral quasispecies, it is critical to understand the sequencing error rates and control for false positive calls of viral variants (point mutations). I will estimate the sequencing error rate from two control sequences and characterize the mutant spectra in the natural samples with this error rate.
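One way to picture the idea of using a control-derived error rate to guard against false positive variant calls is the Python sketch below. The abstract does not say which statistical test was used; the binomial tail test, the significance level `alpha`, the function names, and all counts here are assumptions chosen for illustration only.

```python
from math import comb


def error_rate(mismatches: int, bases: int) -> float:
    """Per-base error rate observed on a control of known sequence."""
    return mismatches / bases


def binomial_tail(k: int, n: int, p: float) -> float:
    """P(X >= k) for X ~ Binomial(n, p), computed as 1 - P(X <= k - 1).

    Clamped at 0 because floating-point cancellation can leave a tiny
    negative value when the true tail is far below machine precision.
    """
    lower = sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))
    return max(0.0, 1.0 - lower)


def is_variant(alt_reads: int, depth: int, p_err: float, alpha: float = 1e-6) -> bool:
    """Call a site variant if sequencing error alone is unlikely to explain
    the number of reads carrying the alternative base (assumed criterion)."""
    return binomial_tail(alt_reads, depth, p_err) < alpha


# Hypothetical control: 20 mismatches observed over 20,000 sequenced bases.
p_err = error_rate(20, 20_000)

print(is_variant(alt_reads=3, depth=1000, p_err=p_err))   # compatible with error
print(is_variant(alt_reads=20, depth=1000, p_err=p_err))  # far above error level
```

In this toy setting about one erroneous read per 1000 is expected, so three alternative reads are not called, while twenty are. A real analysis would also need to handle position-specific and strand-specific error, which this sketch ignores.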
Possible Relevance of Receptor-Receptor Interactions between Viral- and Host-Coded Receptors for Viral-Induced Disease

Directory of Open Access Journals (Sweden)

Luigi F. Agnati

2007-01-01

It has been demonstrated that some viruses, such as the cytomegalovirus, code for G-protein coupled receptors not only to elude the immune system, but also to redirect cellular signaling in the receptor networks of the host cells. In view of the existence of receptor-receptor interactions, the hypothesis is introduced that these viral-coded receptors not only operate as constitutively active monomers, but also can affect other receptor function by interacting with receptors of the host cell. Furthermore, it is suggested that viruses could also insert not single receptors (monomers), but clusters of receptors (receptor mosaics), altering the cell metabolism in a profound way. The prevention of viral receptor-induced changes in host receptor networks may give rise to novel antiviral drugs that counteract viral-induced disease.
Know Your Enemy: Successful Bioinformatic Approaches to Predict Functional RNA Structures in Viral RNAs

Science.gov (United States)

Lim, Chun Shen; Brown, Chris M.

2018-01-01

Structured RNA elements may control virus replication, transcription and translation, and their distinct features are being exploited by novel antiviral strategies. Viral RNA elements continue to be discovered using combinations of experimental and computational analyses. However, the wealth of sequence data, notably from deep viral RNA sequencing, viromes, and metagenomes, necessitates computational approaches being used as an essential discovery tool. In this review, we describe practical approaches being used to discover functional RNA elements in viral genomes. In addition to success stories in new and emerging viruses, these approaches have revealed some surprising new features of well-studied viruses, e.g., human immunodeficiency virus, hepatitis C virus, influenza, and dengue viruses. Some notable discoveries were facilitated by new comparative analyses of diverse viral genome alignments. Importantly, comparative approaches for finding RNA elements embedded in coding and non-coding regions differ. With the exponential growth of computer power we have progressed from stem-loop prediction on single sequences to cutting edge 3D prediction, and from command line to user friendly web interfaces. Despite these advances, many powerful, user friendly prediction tools and resources are underutilized by the virology community. PMID: 29354101
Downregulation of viral RNA translation by hepatitis C virus non-structural protein NS5A requires the poly(U/UC) sequence in the 3' UTR.

Science.gov (United States)

Hoffman, Brett; Li, Zhubing; Liu, Qiang

2015-08-01

Hepatitis C virus (HCV) non-structural protein 5A (NS5A) is essential for viral replication; however, its effect on HCV RNA translation remains controversial partially due to the use of reporters lacking the 3' UTR, where NS5A binds to the poly(U/UC) sequence. We investigated the role of NS5A in HCV translation using a monocistronic RNA containing a Renilla luciferase gene flanked by the HCV UTRs. We found that NS5A downregulated viral RNA translation in a dose-dependent manner. This downregulation required both the 5' and 3' UTRs of HCV because substitution of either sequence with the 5' and 3' UTRs of enterovirus 71 or a cap structure at the 5' end eliminated the effects of NS5A on translation. Translation of the HCV genomic RNA was also downregulated by NS5A. The inhibition of HCV translation by NS5A required the poly(U/UC) sequence in the 3' UTR as NS5A did not affect translation when it was deleted. In addition, we showed that, whilst the amphipathic α-helix of NS5A has no effect on viral translation, the three domains of NS5A can inhibit translation independently, also dependent on the presence of the poly(U/UC) sequence in the 3' UTR. These results suggested that NS5A downregulated HCV RNA translation through a mechanism involving the poly(U/UC) sequence in the 3' UTR.
The Regulatory and Kinase Domains but Not the Interdomain Linker Determine Human Double-stranded RNA-activated Kinase (PKR) Sensitivity to Inhibition by Viral Non-coding RNAs.

Science.gov (United States)

Sunita, S; Schwartz, Samantha L; Conn, Graeme L

2015-11-20

Double-stranded RNA (dsRNA)-activated protein kinase (PKR) is an important component of the innate immune system that presents a crucial first line of defense against viral infection. PKR has a modular architecture comprising a regulatory N-terminal dsRNA binding domain and a C-terminal kinase domain interposed by an unstructured ∼80-residue interdomain linker (IDL). Guided by sequence alignment, we created IDL deletions in human PKR (hPKR) and regulatory/kinase domain swap human-rat chimeric PKRs to assess the contributions of each domain and the IDL to regulation of the kinase activity by RNA. Using circular dichroism spectroscopy, limited proteolysis, kinase assays, and isothermal titration calorimetry, we show that each PKR protein is properly folded with similar domain boundaries and that each exhibits comparable polyinosinic-cytidylic (poly(rI:rC)) dsRNA activation profiles and binding affinities for adenoviral virus-associated RNA I (VA RNAI) and HIV-1 trans-activation response (TAR) RNA. From these results we conclude that the IDL of PKR is not required for RNA binding or mediating changes in protein conformation or domain interactions necessary for PKR regulation by RNA. In contrast, inhibition of rat PKR by VA RNAI and TAR RNA was found to be weaker than for hPKR by 7- and >300-fold, respectively, and each human-rat chimeric domain-swapped protein showed intermediate levels of inhibition. These findings indicate that PKR sequence or structural elements in the kinase domain, present in hPKR but absent in rat PKR, are exploited by viral non-coding RNAs to accomplish efficient inhibition of PKR. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.
On Coding Non-Contiguous Letter Combinations

Directory of Open Access Journals (Sweden)

Frédéric Dandurand

2011-06-01

Starting from the hypothesis that printed word identification initially involves the parallel mapping of visual features onto location-specific letter identities, we analyze the type of information that would be involved in optimally mapping this location-specific orthographic code onto a location-invariant lexical code. We assume that some intermediate level of coding exists between individual letters and whole words, and that this involves the representation of letter combinations. We then investigate the nature of this intermediate level of coding given the constraints of optimality. This intermediate level of coding is expected to compress data while retaining as much information as possible about word identity. Information conveyed by letters is a function of how much they constrain word identity and how visible they are. Optimization of this coding is a combination of minimizing resources (using the most compact representations) and maximizing information. We show that in a large proportion of cases, non-contiguous letter sequences contain more information than contiguous sequences, while at the same time requiring less precise coding. Moreover, we found that the best predictor of human performance in orthographic priming experiments was within-word ranking of conditional probabilities, rather than average conditional probabilities. We conclude that from an optimality perspective, readers learn to select certain contiguous and non-contiguous letter combinations as information that provides the best cue to word identity.
Long non-coding RNAs: Mechanism of action and functional utility

OpenAIRE

Bhat, Shakil Ahmad; Ahmad, Syed Mudasir; Mumtaz, Peerzada Tajamul; Malik, Abrar Ahad; Dar, Mashooq Ahmad; Urwat, Uneeb; Shah, Riaz Ahmad; Ganai, Nazir Ahmad

2016-01-01

Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptome has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most lncRNAs are poorly annotated. Recent work on lncRNAs highlights their effects in many biological ...
Peripheral non-viral MIDGE vector-driven delivery of β-endorphin in inflammatory pain

Directory of Open Access Journals (Sweden)

Busch Melanie

2009-12-01

Background: Leukocytes infiltrating inflamed tissue produce and release opioid peptides such as β-endorphin, which activate opioid receptors on peripheral terminals of sensory nerves resulting in analgesia. Gene therapy is an attractive strategy to enhance continuous production of endogenous opioids. However, classical viral and plasmid vectors for gene delivery are hampered by immunogenicity, recombination, oncogene activation, anti-bacterial antibody production or changes in physiological gene expression. Non-viral, non-plasmid minimalistic, immunologically defined gene expression (MIDGE) vectors may overcome these problems as they carry only elements needed for gene transfer. Here, we investigated the effects of a nuclear localization sequence (NLS)-coupled MIDGE encoding the β-endorphin precursor proopiomelanocortin (POMC) on complete Freund's adjuvant-induced inflammatory pain in rats. Results: POMC-MIDGE-NLS injected into inflamed paws appeared to be taken up by leukocytes resulting in higher concentrations of β-endorphin in these cells. POMC-MIDGE-NLS treatment reversed enhanced mechanical sensitivity compared with control MIDGE-NLS. However, both effects were moderate, not always statistically significant or directly correlated with each other. Also, the anti-hyperalgesic actions could not be increased by enhancing β-endorphin secretion or by modifying POMC-MIDGE-NLS to code for multiple copies of β-endorphin. Conclusion: Although MIDGE vectors circumvent side-effects associated with classical viral and plasmid vectors, the current POMC-MIDGE-NLS did not result in reliable analgesic effectiveness in our pain model. This was possibly associated with insufficient and variable efficacy in transfection and/or β-endorphin production. Our data point at the importance of the reproducibility of gene therapy strategies for the control of chronic pain.
Supplementary Material for: CRISPR/Cas9-mediated viral interference in plants

KAUST Repository

Ali, Zahir; Abulfaraj, Aala A.; Idris, Ali; Ali, Shakila; Tashkandi, Manal; Mahfouz, Magdy

2015-01-01

Background: The CRISPR/Cas9 system provides bacteria and archaea with molecular immunity against invading phages and conjugative plasmids. Recently, CRISPR/Cas9 has been used for targeted genome editing in diverse eukaryotic species. Results: In this study, we investigate whether the CRISPR/Cas9 system could be used in plants to confer molecular immunity against DNA viruses. We deliver sgRNAs specific for coding and non-coding sequences of tomato yellow leaf curl virus (TYLCV) into Nicotiana benthamiana plants stably overexpressing the Cas9 endonuclease, and subsequently challenge these plants with TYLCV. Our data demonstrate that the CRISPR/Cas9 system targeted TYLCV for degradation and introduced mutations at the target sequences. All tested sgRNAs exhibit interference activity, but those targeting the stem-loop sequence within the TYLCV origin of replication in the intergenic region (IR) are the most effective. N. benthamiana plants expressing CRISPR/Cas9 exhibit delayed or reduced accumulation of viral DNA, abolishing or significantly attenuating symptoms of infection. Moreover, this system could simultaneously target multiple DNA viruses. Conclusions: These data establish the efficacy of the CRISPR/Cas9 system for viral interference in plants, thereby extending the utility of this technology and opening the possibility of producing plants resistant to multiple viral infections.

A Reference Viral Database (RVDB) To Enhance Bioinformatics Analysis of High-Throughput Sequencing for Novel Virus Detection.

Science.gov (United States)

Goodacre, Norman; Aljanahi, Aisha; Nandakumar, Subhiksha; Mikailov, Mike; Khan, Arifa S

2018-01-01

Detection of distantly related viruses by high-throughput sequencing (HTS) is bioinformatically challenging because of the lack of a public database containing all viral sequences, without abundant nonviral sequences, which can extend runtime and obscure viral hits. Our reference viral database (RVDB) includes all viral, virus-related, and virus-like nucleotide sequences (excluding bacterial viruses), regardless of length, and with overall reduced cellular sequences. Semantic selection criteria (SEM-I) were used to select viral sequences from GenBank, resulting in a first-generation viral database (VDB). This database was manually and computationally reviewed, resulting in refined, semantic selection criteria (SEM-R), which were applied to a new download of updated GenBank sequences to create a second-generation VDB. Viral entries in the latter were clustered at 98% by CD-HIT-EST to reduce redundancy while retaining high viral sequence diversity. The viral identity of the clustered representative sequences (creps) was confirmed by BLAST searches in NCBI databases and HMMER searches in PFAM and DFAM databases. The resulting RVDB contained a broad representation of viral families, sequence diversity, and a reduced cellular content; it includes full-length and partial sequences and endogenous nonretroviral elements, endogenous retroviruses, and retrotransposons. Testing of RVDBv10.2 with an in-house HTS transcriptomic data set indicated a significantly faster run for virus detection than interrogating the entirety of the NCBI nonredundant nucleotide database, which contains all viral sequences but also nonviral sequences. RVDB is publicly available for facilitating HTS analysis, particularly for novel virus detection. It is meant to be updated on a regular basis to include new viral sequences added to GenBank. IMPORTANCE To facilitate bioinformatics analysis of high-throughput sequencing (HTS) data for the detection of both known and novel viruses, we have
Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

Directory of Open Access Journals (Sweden)

Balinda Sheila N

2010-08-01

Background: Foot-and-mouth disease (FMD) is endemic in East Africa with the majority of the reported outbreaks attributed to serotype O virus. In this study, phylogenetic analyses of the polyprotein coding region of serotype O FMD viruses from Kenya and Uganda have been undertaken to infer evolutionary relationships and processes responsible for the generation and maintenance of diversity within this serotype. FMD virus RNA was obtained from six samples following virus isolation in cell culture and in one case by direct extraction from an oropharyngeal sample. Following RT-PCR, the single long open reading frame, encoding the polyprotein, was sequenced. Results: Phylogenetic comparisons of the VP1 coding region showed that the recent East African viruses belong to one lineage within the EA-2 topotype while an older Kenyan strain, K/52/1992, is a representative of the topotype EA-1. Evolutionary relationships between the coding regions for the leader protease (L), the capsid region and almost the entire coding region are monophyletic except for the K/52/1992 which is distinct. Furthermore, phylogenetic relationships for the P2 and P3 regions suggest that the K/52/1992 is a probable recombinant between serotypes A and O. A bootscan analysis of K/52/1992 with East African FMD serotype A viruses (A21/KEN/1964 and A23/KEN/1965) and serotype O viral isolate (K/117/1999) revealed that the P2 region is probably derived from a serotype A strain while the P3 region appears to be a mosaic derived from both serotypes A and O. Conclusions: Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2), a probable indication that hardly any FMD introductions of this serotype have occurred from outside the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A
Annotation of selection strengths in viral genomes

DEFF Research Database (Denmark)

McCauley, Stephen; de Groot, Saskia; Mailund, Thomas

2007-01-01

Motivation: Viral genomes tend to code in overlapping reading frames to maximize information content. This may result in atypical codon bias and particular evolutionary constraints. Due to the fast mutation rate of viruses, there is additional strong evidence for varying selection between intra- and intergenomic regions. The presence of multiple coding regions complicates the concept of Ka/Ks ratio, and thus begs for an alternative approach when investigating selection strengths. Building on the paper by McCauley & Hein (2006), we develop a method for annotating a viral genome coding in overlapping ... may thus achieve an annotation both of coding regions as well as selection strengths, allowing us to investigate different selection patterns and hypotheses. Results: We illustrate our method by applying it to a multiple alignment of four HIV2 sequences, as well as four Hepatitis B sequences. We ...
Retrotransposons and non-protein coding RNAs

DEFF Research Database (Denmark)

Mourier, Tobias; Willerslev, Eske

2009-01-01

does not merely represent spurious transcription. We review examples of functional RNAs transcribed from retrotransposons, and address the collection of non-protein coding RNAs derived from transposable element sequences, including numerous human microRNAs and the neuronal BC RNAs. Finally, we review...
Golay sequences coded coherent optical OFDM for long-haul transmission

Science.gov (United States)

Qin, Cui; Ma, Xiangrong; Hua, Tao; Zhao, Jing; Yu, Huilong; Zhang, Jian

2017-09-01

We propose to use binary Golay sequences in coherent optical orthogonal frequency division multiplexing (CO-OFDM) to improve the long-haul transmission performance. The Golay sequences are generated by binary Reed-Muller codes, which have low peak-to-average power ratio and certain error correction capability. A low-complexity decoding algorithm for the Golay sequences is then proposed to recover the signal. Under the same spectral efficiency, the QPSK modulated OFDM with binary Golay sequences coding with and without discrete Fourier transform (DFT) spreading (DFTS-QPSK-GOFDM and QPSK-GOFDM) are compared with the normal BPSK modulated OFDM with and without DFT spreading (DFTS-BPSK-OFDM and BPSK-OFDM) after long-haul transmission. At a 7% forward error correction code threshold (Q² factor of 8.5 dB), it is shown that DFTS-QPSK-GOFDM outperforms DFTS-BPSK-OFDM by extending the transmission distance by 29% and 18%, in non-dispersion managed and dispersion managed links, respectively.
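As a hedged illustration of the property that makes Golay sequences attractive here: the two members of a binary Golay complementary pair have aperiodic autocorrelations that cancel at every nonzero shift, which is what bounds the peak-to-average power ratio of the resulting multicarrier signal. The sketch below builds a pair with the standard doubling recursion, not with the Reed-Muller construction or the decoder described in the abstract; the function names are illustrative assumptions.

```python
def golay_pair(m):
    """Return a binary (+1/-1) Golay complementary pair of length 2**m."""
    a, b = [1], [1]
    for _ in range(m):
        # Doubling step: (a, b) -> (a followed by b, a followed by -b).
        a, b = a + b, a + [-x for x in b]
    return a, b


def autocorr(seq, shift):
    """Aperiodic autocorrelation of seq at a non-negative shift."""
    return sum(seq[i] * seq[i + shift] for i in range(len(seq) - shift))


a, b = golay_pair(3)  # length-8 pair
# Sum of the two autocorrelations at every shift from 0 to 7.
sums = [autocorr(a, k) + autocorr(b, k) for k in range(len(a))]
print(sums)
```

For a pair of length N, the summed autocorrelation equals 2N at shift zero and vanishes at all other shifts.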
Prevalence of transcription promoters within archaeal operons and coding sequences.

Science.gov (United States)

Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

2009-01-01

Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes, events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.
Structure, sequence and expression of the hepatitis delta (δ) viral genome

Science.gov (United States)

Wang, Kang-Sheng; Choo, Qui-Lim; Weiner, Amy J.; Ou, Jing-Hsiung; Najarian, Richard C.; Thayer, Richard M.; Mullenbach, Guy T.; Denniston, Katherine J.; Gerin, John L.; Houghton, Michael

1986-10-01

Biochemical and electron microscopic data indicate that the human hepatitis δ viral agent contains a covalently closed circular and single-stranded RNA genome that has certain similarities with viroid-like agents from plants. The sequence of the viral genome (1,678 nucleotides) has been determined and an open reading frame within the complementary strand has been shown to encode an antigen that binds specifically to antisera from patients with chronic hepatitis δ viral infections.
Sequence elements correlating with circulating viral load in genotype 1b hepatitis C virus infection

International Nuclear Information System (INIS)

Watanabe, Hideki; Nagayama, Kazuyoshi; Enomoto, Nobuyuki; Itakura, Jun; Tanabe, Yoko; Hamano, Kosei; Izumi, Namiki; Sato, Chifumi; Watanabe, Mamoru

2003-01-01

The correlation between hepatitis C virus (HCV) genomic sequences and circulating HCV RNA levels was assessed to investigate the genetic elements affecting viral load. The interferon sensitivity-determining region (ISDR) sequence and the serum viral load were strongly correlated in 226 patients examined. Analysis of the entire HCV genome from six patients (three with a high and the others with a low viral load) with similar ISDR sequences identified several candidate residues associated with viral load. The amino acid (aa) sequences of these candidate residues and flanking regions in 67 additional patients revealed that only the residue at aa 962 varied significantly between the HCV patients with low and high serum loads (P = 0.042). At this position, alanine was observed more frequently in the patients with a high viral load. In conclusion, our results strongly suggest that serum HCV RNA loads are inversely correlated with amino acid substitutions in the ISDR, and aa 962 was identified as a possible second determinant of serum HCV RNA load.
Physical non-viral gene delivery methods for tissue engineering.

Science.gov (United States)

Mellott, Adam J; Forrest, M Laird; Detamore, Michael S

2013-03-01

The integration of gene therapy into tissue engineering to control differentiation and direct tissue formation is not a new concept; however, successful delivery of nucleic acids into primary cells, progenitor cells, and stem cells has proven exceptionally challenging. Viral vectors are generally highly effective at delivering nucleic acids to a variety of cell populations, both dividing and non-dividing, yet these viral vectors are marred by significant safety concerns. Non-viral vectors are preferred for gene therapy, despite lower transfection efficiencies, and possess many customizable attributes that are desirable for tissue engineering applications. However, there is no single non-viral gene delivery strategy that "fits-all" cell types and tissues. Thus, there is a compelling opportunity to examine different non-viral vectors, especially physical vectors, and compare their relative degrees of success. This review examines the advantages and disadvantages of physical non-viral methods (i.e., microinjection, ballistic gene delivery, electroporation, sonoporation, laser irradiation, magnetofection, and electric field-induced molecular vibration), with particular attention given to electroporation because of its versatility, with further special emphasis on Nucleofection™. In addition, attributes of cellular character that can be used to improve differentiation strategies are examined for tissue engineering applications. Ultimately, electroporation exhibits a high transfection efficiency in many cell types, which is highly desirable for tissue engineering applications, but electroporation and other physical non-viral gene delivery methods are still limited by poor cell viability. Overcoming the challenge of poor cell viability in highly efficient physical non-viral techniques is the key to using gene delivery to enhance tissue engineering applications.
Physical non-viral gene delivery methods for tissue engineering

Science.gov (United States)

Mellott, Adam J.; Forrest, M. Laird; Detamore, Michael S.

2016-01-01

The integration of gene therapy into tissue engineering to control differentiation and direct tissue formation is not a new concept; however, successful delivery of nucleic acids into primary cells, progenitor cells, and stem cells has proven exceptionally challenging. Viral vectors are generally highly effective at delivering nucleic acids to a variety of cell populations, both dividing and non-dividing, yet these viral vectors are marred by significant safety concerns. Non-viral vectors are preferred for gene therapy, despite lower transfection efficiencies, and possess many customizable attributes that are desirable for tissue engineering applications. However, there is no single non-viral gene delivery strategy that “fits-all” cell types and tissues. Thus, there is a compelling opportunity to examine different non-viral vectors, especially physical vectors, and compare their relative degrees of success. This review examines the advantages and disadvantages of physical non-viral methods (i.e., microinjection, ballistic gene delivery, electroporation, sonoporation, laser irradiation, magnetofection, and electric field-induced molecular vibration), with particular attention given to electroporation because of its versatility, with further special emphasis on Nucleofection™. In addition, attributes of cellular character that can be used to improve differentiation strategies are examined for tissue engineering applications. Ultimately, electroporation exhibits a high transfection efficiency in many cell types, which is highly desirable for tissue engineering applications, but electroporation and other physical non-viral gene delivery methods are still limited by poor cell viability. Overcoming the challenge of poor cell viability in highly efficient physical non-viral techniques is the key to using gene delivery to enhance tissue engineering applications. PMID:23099792
Cytomegalovirus sequence variability, amplicon length, and DNase-sensitive non-encapsidated genomes are obstacles to standardization and commutability of plasma viral load results.

Science.gov (United States)

Naegele, Klaudia; Lautenschlager, Irmeli; Gosert, Rainer; Loginov, Raisa; Bir, Katia; Helanterä, Ilkka; Schaub, Stefan; Khanna, Nina; Hirsch, Hans H

2018-04-22

Cytomegalovirus (CMV) management post-transplantation relies on quantification in blood, but inter-laboratory and inter-assay variability impairs commutability. An international multicenter study demonstrated that variability is mitigated by standardizing plasma volumes, automating DNA extraction and amplification, and calibration to the 1st-CMV-WHO-International-Standard as in the FDA-approved Roche-CAP/CTM-CMV. However, Roche-CAP/CTM-CMV showed under-quantification and false-negative results in a quality assurance program (UK-NEQAS-2014). The aims of this study were to evaluate factors contributing to quantification variability of CMV viral load and to develop an optimized CMV-UL54-QNAT. The UL54 target of the UK-NEQAS-2014 variant was sequenced and compared to 329 available CMV GenBank sequences. Four Basel-CMV-UL54-QNAT assays of 361 bp, 254 bp, 151 bp, and 95 bp amplicons were developed that only differed in reverse primer positions. The assays were validated using plasmid dilutions, UK-NEQAS-2014 sample, as well as 107 frozen and 69 prospectively collected plasma samples from transplant patients submitted for CMV QNAT, with and without DNase-digestion prior to nucleic acid extraction. Eight of 43 mutations were identified as relevant in the UK-NEQAS-2014 target. All Basel-CMV-UL54 QNATs quantified the UK-NEQAS-2014 but revealed 10-fold increasing CMV loads as amplicon size decreased. The inverse correlation of amplicon size and viral loads was confirmed using 1st-WHO-International-Standard and patient samples. DNase pre-treatment reduced plasma CMV loads by >90% indicating the presence of unprotected CMV genomic DNA. Sequence variability, amplicon length, and non-encapsidated genomes obstruct standardization and commutability of CMV loads needed to develop thresholds for clinical research and management. Besides regular sequence surveys, matrix and extraction standardization, we propose developing reference calibrators using 100 bp amplicons.
On the classification of long non-coding RNAs

KAUST Repository

Ma, Lina; Bajic, Vladimir B.; Zhang, Zhang

2013-01-01

Long non-coding RNAs (lncRNAs) have been found to perform various functions in a wide variety of important biological processes. To make easier interpretation of lncRNA functionality and conduct deep mining on these transcribed sequences
Diverse Array of New Viral Sequences Identified in Worldwide Populations of the Asian Citrus Psyllid (Diaphorina citri) Using Viral Metagenomics.

Science.gov (United States)

Nouri, Shahideh; Salem, Nidá; Nigg, Jared C; Falk, Bryce W

2015-12-16

The Asian citrus psyllid, Diaphorina citri, is the natural vector of the causal agent of Huanglongbing (HLB), or citrus greening disease. Together, HLB and D. citri represent a major threat to world citrus production. As there is no cure for HLB, insect vector management is considered one strategy to help control the disease, and D. citri viruses might be useful. In this study, we used a metagenomic approach to analyze viral sequences associated with the global population of D. citri. By sequencing small RNAs and the transcriptome coupled with bioinformatics analysis, we showed that the virus-like sequences of D. citri are diverse. We identified novel viral sequences belonging to the picornavirus superfamily, the Reoviridae, Parvoviridae, and Bunyaviridae families, and an unclassified positive-sense single-stranded RNA virus. Moreover, a Wolbachia prophage-related sequence was identified. This is the first comprehensive survey to assess the viral community from worldwide populations of an agricultural insect pest. Our results provide valuable information on new putative viruses, some of which may have the potential to be used as biocontrol agents. Insects have the most species of all animals, and are hosts to, and vectors of, a great variety of known and unknown viruses. Some of these most likely have the potential to be important fundamental and/or practical resources. In this study, we used high-throughput next-generation sequencing (NGS) technology and bioinformatics analysis to identify putative viruses associated with Diaphorina citri, the Asian citrus psyllid. D. citri is the vector of the bacterium causing Huanglongbing (HLB), currently the most serious threat to citrus worldwide. Here, we report several novel viral sequences associated with D. citri.
Efficient error correction for next-generation sequencing of viral amplicons.

Science.gov (United States)

Skums, Pavel; Dimitrova, Zoya; Campo, David S; Vaughan, Gilberto; Rossi, Livia; Forbi, Joseph C; Yokosawa, Jonny; Zelikovsky, Alex; Khudyakov, Yury

2012-06-25

Next-generation sequencing allows the analysis of an unprecedented number of viral sequence variants from infected patients, presenting a novel opportunity for understanding virus evolution, drug resistance and immune escape. However, sequencing in bulk is error prone. Thus, the generated data require error identification and correction. Most error-correction methods to date are not optimized for amplicon analysis and assume that the error rate is randomly distributed. Recent quality assessment of amplicon sequences obtained using 454-sequencing showed that the error rate is strongly linked to the presence and size of homopolymers, position in the sequence and length of the amplicon. All these parameters are strongly sequence specific and should be incorporated into the calibration of error-correction algorithms designed for amplicon sequencing. In this paper, we present two new efficient error correction algorithms optimized for viral amplicons: (i) k-mer-based error correction (KEC) and (ii) empirical frequency threshold (ET). Both were compared to a previously published clustering algorithm (SHORAH), in order to evaluate their relative performance on 24 experimental datasets obtained by 454-sequencing of amplicons with known sequences. All three algorithms show similar accuracy in finding true haplotypes. However, KEC and ET were significantly more efficient than SHORAH in removing false haplotypes and estimating the frequency of true ones. Both algorithms, KEC and ET, are highly suitable for rapid recovery of error-free haplotypes obtained by 454-sequencing of amplicons from heterogeneous viruses. The implementations of the algorithms and data sets used for their testing are available at: http://alan.cs.gsu.edu/NGS/?q=content/pyrosequencing-error-correction-algorithm.
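The general idea behind k-mer-based error detection can be sketched as follows. This is a minimal illustration only, not the KEC or ET algorithm from the paper: it assumes that k-mers created by sequencing errors are rare across the read set, and it only flags reads that contain a rare k-mer. The function names, the toy reads, and the parameters `k` and `min_count` are hypothetical.

```python
from collections import Counter


def kmer_counts(reads, k):
    """Count every k-mer across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def suspicious_reads(reads, k, min_count):
    """Return reads containing at least one k-mer seen fewer than min_count times."""
    counts = kmer_counts(reads, k)
    return [
        read for read in reads
        if any(counts[read[i:i + k]] < min_count
               for i in range(len(read) - k + 1))
    ]


# Toy data: four identical reads and one read with a single substitution.
reads = ["ACGTACGTAC"] * 4 + ["ACGTTCGTAC"]
flagged = suspicious_reads(reads, k=4, min_count=2)
print(flagged)
```

A real amplicon pipeline would go on to correct the flagged positions and would calibrate the threshold against homopolymer length and position, as the abstract notes; neither step is attempted here.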
A standardized framework for accurate, high-throughput genotyping of recombinant and non-recombinant viral sequences.

Science.gov (United States)

Alcantara, Luiz Carlos Junior; Cassol, Sharon; Libin, Pieter; Deforche, Koen; Pybus, Oliver G; Van Ranst, Marc; Galvão-Castro, Bernardo; Vandamme, Anne-Mieke; de Oliveira, Tulio

2009-07-01

Human immunodeficiency virus type-1 (HIV-1), hepatitis B and C and other rapidly evolving viruses are characterized by extremely high levels of genetic diversity. To facilitate diagnosis and the development of prevention and treatment strategies that efficiently target the diversity of these viruses, and other pathogens such as human T-lymphotropic virus type-1 (HTLV-1), human herpes virus type-8 (HHV8) and human papillomavirus (HPV), we developed a rapid high-throughput-genotyping system. The method involves the alignment of a query sequence with a carefully selected set of pre-defined reference strains, followed by phylogenetic analysis of multiple overlapping segments of the alignment using a sliding window. Each segment of the query sequence is assigned the genotype and sub-genotype of the reference strain with the highest bootstrap (>70%) and bootscanning (>90%) scores. Results from all windows are combined and displayed graphically using color-coded genotypes. The new Virus-Genotyping Tools provide accurate classification of recombinant and non-recombinant viruses and are currently being assessed for their diagnostic utility. They have been incorporated into several HIV drug resistance algorithms including the Stanford (http://hivdb.stanford.edu) and two European databases (http://www.umcutrecht.nl/subsite/spread-programme/ and http://www.hivrdb.org.uk/) and have been successfully used to genotype a large number of sequences in these and other databases. The tools are a PHP/JAVA web application and are freely accessible on a number of servers including: http://bioafrica.mrc.ac.za/rega-genotype/html/, http://lasp.cpqgm.fiocruz.br/virus-genotype/html/, http://jose.med.kuleuven.be/genotypetool/html/.
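The sliding-window idea described above can be sketched in a few lines. This is a simplified stand-in, not the published tool: it replaces the phylogenetic bootstrap and bootscanning step with plain per-window sequence identity, and it assumes the query and references are already aligned to the same length. All names and the toy sequences are hypothetical.

```python
def window_identity(query, ref, start, size):
    """Fraction of matching positions between query and ref in one window."""
    q, r = query[start:start + size], ref[start:start + size]
    return sum(x == y for x, y in zip(q, r)) / len(q)


def genotype_windows(query, references, size, step):
    """Assign each window of an aligned query to the most similar reference."""
    calls = []
    for start in range(0, len(query) - size + 1, step):
        best = max(
            references,
            key=lambda name: window_identity(query, references[name], start, size),
        )
        calls.append((start, best))
    return calls


# Toy aligned sequences: the query matches "A" in its first half and "B" in its second.
references = {"A": "AAAAAAAAAAAA", "B": "CCCCCCCCCCCC"}
query = "AAAAAACCCCCC"
calls = genotype_windows(query, references, size=6, step=6)
print(calls)
```

A change of assigned genotype between windows, as in the toy query, is the kind of signal a sliding-window method uses to suggest a recombinant; the real system additionally requires the support thresholds quoted in the abstract before making a call.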
Some Algebraic Aspects of Morse Code Sequences

OpenAIRE

Johann Cigler

2003-01-01

Morse code sequences are very useful to give combinatorial interpretations of various properties of Fibonacci numbers. In this note we study some algebraic and combinatorial aspects of Morse code sequences and obtain several q-analogues of Fibonacci numbers and Fibonacci polynomials and their generalizations.
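The combinatorial link mentioned above is easy to check numerically. In this setting a Morse code sequence of length n is a string of dots (length 1) and dashes (length 2) whose lengths sum to n; the first symbol is either a dot followed by a sequence of length n - 1 or a dash followed by one of length n - 2, so the counts satisfy the Fibonacci recurrence. The short sketch below enumerates the sequences directly; the function name is illustrative.

```python
def morse_sequences(n):
    """List all dot/dash sequences of total length n (dot = 1, dash = 2)."""
    if n < 0:
        return []
    if n == 0:
        return [""]  # the empty sequence
    return (["." + s for s in morse_sequences(n - 1)] +
            ["-" + s for s in morse_sequences(n - 2)])


counts = [len(morse_sequences(n)) for n in range(8)]
print(counts)  # [1, 1, 2, 3, 5, 8, 13, 21]
```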
Some Algebraic Aspects of Morse Code Sequences

Directory of Open Access Journals (Sweden)

Johann Cigler

2003-06-01

Morse code sequences are very useful to give combinatorial interpretations of various properties of Fibonacci numbers. In this note we study some algebraic and combinatorial aspects of Morse code sequences and obtain several q-analogues of Fibonacci numbers and Fibonacci polynomials and their generalizations.
TFIIS-Dependent Non-coding Transcription Regulates Developmental Genome Rearrangements.

Directory of Open Access Journals (Sweden)

Kamila Maliszewska-Olejniczak

2015-07-01

Because of their nuclear dimorphism, ciliates provide a unique opportunity to study the role of non-coding RNAs (ncRNAs) in the communication between germline and somatic lineages. In these unicellular eukaryotes, a new somatic nucleus develops at each sexual cycle from a copy of the zygotic (germline) nucleus, while the old somatic nucleus degenerates. In the ciliate Paramecium tetraurelia, the genome is massively rearranged during this process through the reproducible elimination of repeated sequences and the precise excision of over 45,000 short, single-copy Internal Eliminated Sequences (IESs). Different types of ncRNAs resulting from genome-wide transcription were shown to be involved in the epigenetic regulation of genome rearrangements. To understand how ncRNAs are produced from the entire genome, we have focused on a homolog of the TFIIS elongation factor, which regulates RNA polymerase II transcriptional pausing. Six TFIIS-paralogs, representing four distinct families, can be found in the P. tetraurelia genome. Using RNA interference, we showed that TFIIS4, which encodes a development-specific TFIIS protein, is essential for the formation of a functional somatic genome. Molecular analyses and high-throughput DNA sequencing upon TFIIS4 RNAi demonstrated that TFIIS4 is involved in all kinds of genome rearrangements, including excision of ~48% of IESs. Localization of a GFP-TFIIS4 fusion revealed that TFIIS4 appears specifically in the new somatic nucleus at an early developmental stage, before IES excision. RT-PCR experiments showed that TFIIS4 is necessary for the synthesis of IES-containing non-coding transcripts. We propose that these IES+ transcripts originate from the developing somatic nucleus and serve as pairing substrates for germline-specific short RNAs that target elimination of their homologous sequences. Our study, therefore, connects the onset of zygotic non-coding transcription to the control of genome plasticity in Paramecium.
Non-infectious plasmid engineered to simulate multiple viral threat agents.

Science.gov (United States)

Carrera, Monica; Sagripanti, Jose-Luis

2009-07-01

The aim of this study was to design and construct a non-virulent simulant to replace several pathogenic viruses in the development of detection and identification methods in biodefense. A non-infectious simulant was designed and engineered to include the nucleic acid signature of VEEV (Venezuelan Equine Encephalitis virus), Influenza virus, Rift Valley Fever virus, Machupo virus, Lassa virus, Yellow Fever virus, Ebola virus, Eastern Equine Encephalitis virus, Junin virus, Marburg virus, Dengue virus, and Crimean-Congo virus, all in a single construct. The nucleic acid sequences of all isolates available for each virus species were aligned using ClustalW software in order to obtain conserved regions of the viral genomes. Specific primers were designed to permit the identification and differentiation between viral threat agents. A chimera of 3143 base pairs was engineered to produce 13 PCR amplicons of different sizes. PCR amplification of the simulant with virus-specific primers revealed products of the predicted length, in bands of similar intensity, and without detectable unspecific products by electrophoresis analysis. The simulant described could reduce the need to use infectious viruses in the development of detection and diagnostic methods, and could also be useful as a non-virulent positive control in nucleic acid-based tests against biological threat agents.
De novo ORFs in Drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences.

Directory of Open Access Journals (Sweden)

Josephine A Reinhardt

How non-coding DNA gives rise to new protein-coding genes (de novo genes) is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs), while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important.

Microbial and viral-like rhodopsins present in coastal marine sediments from four polar and subpolar regions

Energy Technology Data Exchange (ETDEWEB)

López, José L.; Golemba, Marcelo; Hernández, Edgardo; Lozada, Mariana; Dionisi, Hebe; Jansson, Janet K.; Carroll, Jolynn; Lundgren, Leif; Sjöling, Sara; Mac Cormack, Walter P.; Sobecky, Patricia

2016-11-03

Rhodopsins are broadly distributed. In this work, we analyzed 23 metagenomes corresponding to marine sediment samples from four regions that share cold climate conditions (Norway; Sweden; Argentina and Antarctica). To investigate the evolution of viral rhodopsin genes, an initial set of 6224 bacterial rhodopsin sequences according to COG5524 was retrieved from the 23 metagenomes. After selection by the presence of transmembrane domains and alignment, 123 viral (51) and non-viral (72) sequences (>50 amino acids) were finally included in further analysis. Viral rhodopsin genes were homologs of Phaeocystis globosa virus and Organic lake Phycodnavirus. Non-viral microbial rhodopsin genes were ascribed to Bacteroidetes, Planctomycetes, Firmicutes, Actinobacteria, Cyanobacteria, Proteobacteria, Deinococcus-Thermus and Cryptophyta and Fungi. A rescreening with Blastp, using the previously described viral sequences as queries, retrieved 30 sequences (>100 amino acids). Phylogeographic analysis revealed a geographical clustering of the sequences affiliated to the viral group. This clustering was not observed for the microbial non-viral sequences. The phylogenetic reconstruction allowed us to propose the existence of a putative ancestor of viral rhodopsin genes related to Actinobacteria and Chloroflexi. This is the first report of a phylogeographic association of the viral rhodopsin sequences from marine sediments.
Junk DNA enhances pEI-based non-viral gene delivery

NARCIS (Netherlands)

Gaal, E.V.B. van; Oosting, R.S.; Hennink, W.E.; Crommelin, D.J.A.; Mastrobattista, E.

Gene therapy aims at delivering exogenous DNA into the nuclei of target cells to establish expression of a therapeutic protein. Non-viral gene delivery is examined as a safer alternative to viral approaches, but is presently characterized by a low efficiency. In the past years several non-viral
Recent Advances in Non-viral Vectors for Gene Delivery

Science.gov (United States)

Guo, Xia; Huang, Leaf

2011-01-01

Non-viral vectors, typically based on cationic lipids or polymers, are preferred due to safety concerns with viral vectors. So far, non-viral vectors can proficiently transfect cells in culture, but obtaining efficient nanomedicines is far from evident. Overcoming the hurdles associated with non-viral vectors is important for improving the delivery efficiency and therapeutic effect of nucleic acids. The drawbacks include the strong interaction of cationic delivery vehicles with blood components, uptake by the reticuloendothelial system (RES), toxicity, targeting ability of the carriers to the cells of interest, and so on. PEGylation is the predominant method used to reduce the binding of plasma proteins with non-viral vectors and minimize the clearance by RES after intravenous administration. The nanoparticles that are not rapidly cleared from the circulation accumulate in the tumors due to the enhanced permeability and retention effect, and the targeting ligands attached to the distal end of the PEGylated components allow binding to the receptors on the target cell surface. Neutral or anionic liposomes have also been developed for systemic delivery of nucleic acids in experimental animal models. Designing and synthesizing novel cationic lipids and polymers, and binding nucleic acid with peptides, targeting ligands, polymers, or environmentally sensitive moieties, also attract much attention as ways of resolving the problems encountered by non-viral vectors. The application of inorganic nanoparticles in nucleic acid delivery is an emerging field, too. Recently, different classes of non-viral vectors appear to be converging, and the features of different classes of non-viral vectors could be combined in one strategy. More hurdles associated with efficient nucleic acid delivery therefore might be expected to be overcome. In this account, we will focus on these novel non-viral vectors, which are classified into multifunctional hybrid nucleic acid vectors, novel
Long non-coding RNAs: Mechanism of action and functional utility

Directory of Open Access Journals (Sweden)

Shakil Ahmad Bhat

2016-10-01

Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptome has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most long non-coding RNAs (lncRNAs) are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological and pathological processes. LncRNAs are dysfunctional in a variety of human diseases varying from cancerous to non-cancerous diseases. Characterization of these lncRNA genes and their modes of action may allow their use for diagnosis, monitoring of progression and targeted therapies in various diseases. In this review, we summarize the functional perspectives as well as the mechanism of action of lncRNAs. Keywords: LncRNA, X-chromosome inactivation, Genome imprinting, Transcription regulation, Cancer, Immunity
A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

Science.gov (United States)

Kress, W John; Erickson, David L

2007-06-06

A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.
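The single-locus versus two-locus comparison described in this abstract can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that a species pair counts as discriminated when its aligned sequences differ at any position for one or more of the chosen loci; the abstract does not state the exact criterion used, and the genus names, sequences, and function names below are hypothetical toy values.

```python
def differs(a: str, b: str) -> bool:
    """True if two aligned sequences differ at any position where both have a base."""
    return any(x != y for x, y in zip(a, b) if x != "-" and y != "-")


def discrimination_rate(genera: dict, loci: list) -> float:
    """Fraction of genera whose species pair differs at one or more of the given loci."""
    hits = sum(
        any(differs(*pairs[locus]) for locus in loci)
        for pairs in genera.values()
    )
    return hits / len(genera)


# Hypothetical toy data: one aligned sequence pair (two species) per genus and locus.
toy = {
    "GenusA": {"rbcL": ("ACGT", "ACGT"), "trnH-psbA": ("AAGT", "ACGT")},
    "GenusB": {"rbcL": ("ACGT", "ACGA"), "trnH-psbA": ("AAGT", "AAGT")},
    "GenusC": {"rbcL": ("ACGT", "ACGT"), "trnH-psbA": ("AAGT", "AAGT")},
    "GenusD": {"rbcL": ("ACGT", "ACGT"), "trnH-psbA": ("AA-T", "AACT")},
}

single = discrimination_rate(toy, ["rbcL"])
paired = discrimination_rate(toy, ["rbcL", "trnH-psbA"])
```

In this toy set each locus alone separates one genus out of four, while the pair separates two, which mirrors the qualitative pattern reported above without reproducing its numbers.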
Full Genome Sequence and sfRNA Interferon Antagonist Activity of Zika Virus from Recife, Brazil.

Directory of Open Access Journals (Sweden)

Claire L Donald

2016-10-01

The outbreak of Zika virus (ZIKV) in the Americas has transformed a previously obscure mosquito-transmitted arbovirus of the Flaviviridae family into a major public health concern. Little is currently known about the evolution and biology of ZIKV and the factors that contribute to the associated pathogenesis. Determining genomic sequences of clinical viral isolates and characterization of elements within these are an important prerequisite to advance our understanding of viral replicative processes and virus-host interactions. We obtained a ZIKV isolate from a patient who presented with classical ZIKV-associated symptoms, and used high throughput sequencing and other molecular biology approaches to determine its full genome sequence, including non-coding regions. Genome regions were characterized and compared to the sequences of other isolates where available. Furthermore, we identified a subgenomic flavivirus RNA (sfRNA) in ZIKV-infected cells that has antagonist activity against RIG-I induced type I interferon induction, with a lesser effect on MDA-5 mediated action. The full-length genome sequence including non-coding regions of a South American ZIKV isolate from a patient with classical symptoms will support efforts to develop genetic tools for this virus. Detection of sfRNA that counteracts interferon responses is likely to be important for further understanding of pathogenesis and virus-host interactions.
The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features

Science.gov (United States)

Tu, Shiqi; Yuan, Guo-Cheng; Shao, Zhen

2017-01-01

Recently, long non-coding RNAs (lncRNAs) have emerged as an important class of molecules involved in many cellular processes. One of their primary functions is to shape the epigenetic landscape through interactions with chromatin modifying proteins. However, mechanisms contributing to the specificity of such interactions remain poorly understood. Here we took the human and mouse lncRNAs that were experimentally determined to have physical interactions with Polycomb repressive complex 2 (PRC2), and systematically investigated the sequence features of these lncRNAs by developing a new computational pipeline for sequence composition analysis, in which each sequence is considered as a series of transitions between adjacent nucleotides. Through that, PRC2-binding lncRNAs were found to be associated with a set of distinctive and evolutionarily conserved sequence features, which can be utilized to distinguish them from the others with considerable accuracy. We further identified fragments of PRC2-binding lncRNAs that are enriched with these sequence features, and found they show strong PRC2-binding signals and are more highly conserved across species than the other parts, implying their functional importance.
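The idea of treating a sequence as a series of transitions between adjacent nucleotides can be sketched as a table of 16 dinucleotide transition frequencies. The Python below is only an illustrative reading of that description, not the authors' pipeline; the function name, the RNA alphabet choice, and the handling of ambiguous bases are assumptions.

```python
from collections import Counter
from itertools import product

ALPHABET = "ACGU"  # assumed RNA alphabet; T is mapped to U below


def transition_frequencies(seq: str) -> dict:
    """Relative frequency of each adjacent-nucleotide transition, e.g. 'AC'."""
    seq = seq.upper().replace("T", "U")
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    # Skip pairs that contain ambiguous symbols such as N.
    pairs = [p for p in pairs if all(c in ALPHABET for c in p)]
    counts = Counter(pairs)
    total = sum(counts.values())
    return {
        a + b: (counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


features = transition_frequencies("ACGUACGU")
```

Each sequence then becomes a fixed-length vector of 16 values, which is one simple way such features could be fed to a classifier.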
A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

Science.gov (United States)

Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

2012-01-01

Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.
An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

Directory of Open Access Journals (Sweden)

Ru Huang

Imprinted macro non-protein-coding (nc) RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues: differentiated ES cells and fetal head using an optimized RNA-Seq strategy. The data produced is highly reproducible in different sequencing locations and is able to detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose length ranges between 80-118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.
Challenges and opportunities in estimating viral genetic diversity from next-generation sequencing data

Directory of Open Access Journals (Sweden)

Niko Beerenwinkel

2012-09-01

Many viruses, including the clinically relevant RNA viruses HIV and HCV, exist in large populations and display high genetic heterogeneity within and between infected hosts. Assessing intra-patient viral genetic diversity is essential for understanding the evolutionary dynamics of viruses, for designing effective vaccines, and for the success of antiviral therapy. Next-generation sequencing technologies allow the rapid and cost-effective acquisition of thousands to millions of short DNA sequences from a single sample. However, this approach entails several challenges in experimental design and computational data analysis. Here, we review the entire process of inferring viral diversity from sample collection to computing measures of genetic diversity. We discuss sample preparation, including reverse transcription and amplification, and the effect of experimental conditions on diversity estimates due to in vitro base substitutions, insertions, deletions, and recombination. The use of different next-generation sequencing platforms and their sequencing error profiles are compared in the context of various applications of diversity estimation, ranging from the detection of single nucleotide variants to the reconstruction of whole-genome haplotypes. We describe the statistical and computational challenges arising from these technical artifacts, and we review existing approaches, including available software, for their solution. Finally, we discuss open problems, and highlight successful biomedical applications and potential future clinical use of next-generation sequencing to estimate viral diversity.
Genetic Code Analysis Toolkit: A novel tool to explore the coding properties of the genetic code and DNA sequences

Science.gov (United States)

Kraljić, K.; Strüngmann, L.; Fimmel, E.; Gumbel, M.

2018-01-01

The genetic code is degenerate and it is assumed that redundancy provides error detection and correction mechanisms in the translation process. However, the biological meaning of the code's structure is still under current research. This paper presents a Genetic Code Analysis Toolkit (GCAT) which provides workflows and algorithms for the analysis of the structure of nucleotide sequences. In particular, sets or sequences of codons can be transformed and tested for circularity, comma-freeness, dichotomic partitions and others. GCAT comes with a fertile editor custom-built to work with the genetic code and a batch mode for multi-sequence processing. With the ability to read FASTA files or load sequences from GenBank, the tool can be used for the mathematical and statistical analysis of existing sequence data. GCAT is Java-based and provides a plug-in concept for extensibility. Availability: Open source. Homepage: http://www.gcat.bio/
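As an illustration of one of the properties mentioned, the sketch below tests a set of codons for comma-freeness using the standard definition: a set is comma-free if, for every ordered pair of its codons written back to back, neither of the two frame-shifted trinucleotides belongs to the set. This is a minimal Python sketch and not GCAT code (GCAT itself is Java-based); the function name is hypothetical.

```python
from itertools import product


def is_comma_free(codons: set) -> bool:
    """True if no codon of the set appears in a shifted frame of any two concatenated codons."""
    if any(len(c) != 3 for c in codons):
        raise ValueError("every codon must have exactly three letters")
    for x, y in product(codons, repeat=2):
        pair = x + y
        # Frames shifted by one and by two positions across the codon boundary.
        if pair[1:4] in codons or pair[2:5] in codons:
            return False
    return True


single = is_comma_free({"ACG"})
periodic = is_comma_free({"AAA"})
shifted = is_comma_free({"ACG", "CGA"})
```

A codon such as AAA can never belong to a comma-free set, because reading AAAAAA in a shifted frame yields AAA again.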
Inferring the Clonal Structure of Viral Populations from Time Series Sequencing.

Directory of Open Access Journals (Sweden)

Donatien F Chedom

2015-11-01

RNA virus populations will undergo processes of mutation and selection resulting in a mixed population of viral particles. High throughput sequencing of a viral population subsequently contains a mixed signal of the underlying clones. We would like to identify the underlying evolutionary structures. We utilize two sources of information to attempt this: within-segment linkage information and mutation prevalence. We demonstrate that clone haplotypes, their prevalence, and maximum parsimony reticulate evolutionary structures can be identified, although the solutions may not be unique, even for complete sets of information. This is applied to a chain of influenza infection, where we infer evolutionary structures, including reassortment, and demonstrate some of the difficulties of interpretation that arise from deep sequencing due to artifacts such as template switching during PCR amplification.
Clinical polyomavirus BK variants with agnogene deletion are non-functional but rescued by trans-complementation

International Nuclear Information System (INIS)

Myhre, Marit Renee; Olsen, Gunn-Hege; Gosert, Rainer; Hirsch, Hans H.; Rinaldo, Christine Hanssen

2010-01-01

High-level replication of polyomavirus BK (BKV) in kidney transplant recipients is associated with the emergence of BKV variants with rearranged (rr) non-coding control region (NCCR) increasing viral early gene expression and cytopathology. Cloning and sequencing revealed the presence of a BKV quasispecies which included non-functional variants when assayed in a recombinant virus assay. Here we report that the rr-NCCR of BKV variants RH-3 and RH-12, both bearing a NCCR deletion including the 5' end of the agnoprotein coding sequence, mediated early and late viral reporter gene expression in kidney cells. However, in a recombinant virus they failed to produce infectious progeny despite large T-antigen and VP1 expression and the formation of nuclear virus-like particles. Infectious progeny was generated when the agnogene was reconstructed in cis or agnoprotein provided in trans from a co-existing BKV rr-NCCR variant. We conclude that complementation can rescue non-functional BKV variants in vitro and possibly in vivo.
Tandem Oligonucleotide Probe Annealing and Elongation To Discriminate Viral Sequence

DEFF Research Database (Denmark)

Taskova, Maria; Uhd, Jesper; Miotke, Laura

2017-01-01

opportunities in transcriptome analysis, virology, and other fields. Herein, we report for the first time a "click" chemistry approach to oligonucleotide probe elongation as a novel approach to specifically detect a viral sequence. We hybridized a library of short, terminally labeled probes to Ebola virus RNA...
Deep sequencing of the viral phoH gene reveals temporal variation, depth-specific composition, and persistent dominance of the same viral phoH genes in the Sargasso Sea

Directory of Open Access Journals (Sweden)

Dawn B. Goldsmith

2015-06-01

Deep sequencing of the viral phoH gene, a host-derived auxiliary metabolic gene, was used to track viral diversity throughout the water column at the Bermuda Atlantic Time-series Study (BATS) site in the summer (September) and winter (March) of three years. Viral phoH sequences reveal differences in the viral communities throughout a depth profile and between seasons in the same year. Variation was also detected between the same seasons in subsequent years, though these differences were not as great as the summer/winter distinctions. Over 3,600 phoH operational taxonomic units (OTUs; 97% sequence identity) were identified. Despite high richness, most phoH sequences belong to a few large, common OTUs whereas the majority of the OTUs are small and rare. While many OTUs make sporadic appearances at just a few times or depths, a small number of OTUs dominate the community throughout the seasons, depths, and years.
SRComp: short read sequence compression using burstsort and Elias omega coding.

Directory of Open Access Journals (Sweden)

Jeremy John Selva

Next-generation sequencing (NGS) technologies permit the rapid production of vast amounts of data at low cost. Economical data storage and transmission hence becomes an increasingly important challenge for NGS experiments. In this paper, we introduce a new non-reference based read sequence compression tool called SRComp. It works by first employing a fast string-sorting algorithm called burstsort to sort read sequences in lexicographical order and then Elias omega-based integer coding to encode the sorted read sequences. SRComp has been benchmarked on four large NGS datasets, where experimental results show that it can run 5-35 times faster than current state-of-the-art read sequence compression tools such as BEETL and SCALCE, while retaining comparable compression efficiency for large collections of short read sequences. SRComp is a read sequence compression tool that is particularly valuable in certain applications where compression time is of major concern.
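For readers unfamiliar with the integer code named in this abstract, the following Python sketch implements the standard Elias omega code for positive integers. It shows only the integer code itself; how SRComp maps sorted reads to integers is not described here, so nothing below should be read as SRComp's implementation, and the function names are hypothetical.

```python
def elias_omega_encode(n: int) -> str:
    """Encode a positive integer as an Elias omega bit string."""
    if n < 1:
        raise ValueError("Elias omega coding is defined for integers >= 1")
    code = "0"  # terminating bit
    while n > 1:
        bits = bin(n)[2:]      # binary representation without the '0b' prefix
        code = bits + code     # prepend the current group
        n = len(bits) - 1      # next group encodes this group's length minus one
    return code


def elias_omega_decode(bits: str) -> list:
    """Decode a concatenation of Elias omega codewords into a list of integers."""
    values = []
    pos = 0
    while pos < len(bits):
        n = 1
        while bits[pos] == "1":
            length = n + 1                        # a group has n + 1 bits
            n = int(bits[pos:pos + length], 2)
            pos += length
        pos += 1                                  # consume the terminating 0
        values.append(n)
    return values


stream = "".join(elias_omega_encode(v) for v in [1, 2, 3, 4, 16])
decoded = elias_omega_decode(stream)
```

Small integers receive short codewords (1 is a single bit), which is why codes of this family suit data in which small values dominate.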
An Auto sequence Code to Integrate a Neutron Unfolding Code with the PC-MCA Accuspec

International Nuclear Information System (INIS)

Darsono

2000-01-01

In neutron spectrometry using the proton recoil method, a neutron unfolding code is needed to unfold the measured proton spectrum into the neutron spectrum. The unfolding step in the existing neutron spectrometry, which was successfully installed last year, was done separately. This manuscript reports that an auto sequence code integrating the neutron unfolding code UNFSPEC.EXE with the software facility of the PC-MCA Accuspec has been written and run successfully, so that the new neutron spectrometry becomes compact. The auto sequence code was written according to the rules of the application program facility of the PC-MCA Accuspec and then compiled using AC-EXE. Tests of the auto sequence code showed that binning widths of 20, 30, and 40 give slightly different spectrum shapes. A binning width of around 30 gives a better spectrum, in the sense of a smaller error, than the others. (author)
Analysis of high-depth sequence data for studying viral diversity: a comparison of next generation sequencing platforms using Segminator II

Directory of Open Access Journals (Sweden)

Archer John

2012-03-01

Background: Next generation sequencing provides detailed insight into the variation present within viral populations, introducing the possibility of treatment strategies that are both reactive and predictive. Current software tools, however, need to be scaled up to accommodate high-depth viral data sets, which are often temporally or spatially linked. In addition, due to the development of novel sequencing platforms and chemistries, each with implicit strengths and weaknesses, it will be helpful for researchers to be able to routinely compare and combine data sets from different platforms/chemistries. In particular, error associated with a specific sequencing process must be quantified so that true biological variation may be identified. Results: Segminator II was developed to allow for the efficient comparison of data sets derived from different sources. We demonstrate its usage by comparing large data sets from 12 influenza H1N1 samples sequenced on both the 454 Life Sciences and Illumina platforms, permitting quantification of platform error. For mismatches median error rates at 0.10 and 0.12%, respectively, suggested that both platforms performed similarly. For insertions and deletions median error rates within the 454 data (at 0.3 and 0.2%, respectively) were significantly higher than those within the Illumina data (0.004 and 0.006%, respectively). In agreement with previous observations these higher rates were strongly associated with homopolymeric stretches on the 454 platform. Outside of such regions both platforms had similar indel error profiles. Additionally, we apply our software to the identification of low frequency variants. Conclusion: We have demonstrated, using Segminator II, that it is possible to distinguish platform specific error from biological variation using data derived from two different platforms. We have used this approach to quantify the amount of error present within the 454 and Illumina platforms in
Oxygen minimum zones harbour novel viral communities with low diversity.

Science.gov (United States)

Cassman, Noriko; Prieto-Davó, Alejandra; Walsh, Kevin; Silva, Genivaldo G Z; Angly, Florent; Akhter, Sajia; Barott, Katie; Busch, Julia; McDole, Tracey; Haggerty, J Matthew; Willner, Dana; Alarcón, Gadiel; Ulloa, Osvaldo; DeLong, Edward F; Dutilh, Bas E; Rohwer, Forest; Dinsdale, Elizabeth A

2012-11-01

Oxygen minimum zones (OMZs) are oceanographic features that affect ocean productivity and biodiversity, and contribute to ocean nitrogen loss and greenhouse gas emissions. Here we describe the viral communities associated with the Eastern Tropical South Pacific (ETSP) OMZ off Iquique, Chile for the first time through abundance estimates and viral metagenomic analysis. The viral-to-microbial ratio (VMR) in the ETSP OMZ fluctuated in the oxycline and declined in the anoxic core to below one on several occasions. The number of viral genotypes (unique genomes as defined by sequence assembly) ranged from 2040 at the surface to 98 in the oxycline, which is the lowest viral diversity recorded to date in the ocean. Within the ETSP OMZ viromes, only 4.95% of genotypes were shared between surface and anoxic core viromes using reciprocal BLASTn sequence comparison. ETSP virome comparison with surface marine viromes (Sargasso Sea, Gulf of Mexico, Kingman Reef, Chesapeake Bay) revealed a dissimilarity of ETSP OMZ viruses to those from other oceanic regions. From the 1.4 million non-redundant DNA sequences sampled within the altered oxygen conditions of the ETSP OMZ, more than 97.8% were novel. Of the average 3.2% of sequences that showed similarity to the SEED non-redundant database, phage sequences dominated the surface viromes, eukaryotic virus sequences dominated the oxycline viromes, and phage sequences dominated the anoxic core viromes. The viral community of the ETSP OMZ was characterized by fluctuations in abundance, taxa and diversity across the oxygen gradient. The ecological significance of these changes was difficult to predict; however, it appears that the reduction in oxygen coincides with an increased shedding of eukaryotic viruses in the oxycline, and a shift to unique viral genotypes in the anoxic core. © 2012 Society for Applied Microbiology and Blackwell Publishing Ltd.

Non-Protein Coding RNAs

CERN Document Server

Walter, Nils G; Batey, Robert T

2009-01-01

This book assembles chapters from experts in the Biophysics of RNA to provide a broadly accessible snapshot of the current status of this rapidly expanding field. The 2006 Nobel Prize in Physiology or Medicine was awarded to the discoverers of RNA interference, highlighting just one example of a large number of non-protein coding RNAs. Because non-protein coding RNAs outnumber protein coding genes in mammals and other higher eukaryotes, it is now thought that the complexity of organisms is correlated with the fraction of their genome that encodes non-protein coding RNAs. Essential biological processes as diverse as cell differentiation, suppression of infecting viruses and parasitic transposons, higher-level organization of eukaryotic chromosomes, and gene expression itself are found to largely be directed by non-protein coding RNAs. The biophysical study of these RNAs employs X-ray crystallography, NMR, ensemble and single molecule fluorescence spectroscopy, optical tweezers, cryo-electron microscopy, and ot...
Flavivirus RNAi suppression: decoding non-coding RNA.

Science.gov (United States)

Pijlman, Gorben P

2014-08-01

Flaviviruses are important human pathogens that are transmitted by invertebrate vectors, mostly mosquitoes and ticks. During replication in their vector, flaviviruses are subject to a potent innate immune response known as antiviral RNA interference (RNAi). This defense mechanism is associated with the production of small interfering (si)RNA that lead to degradation of viral RNA. To what extent flaviviruses would benefit from counteracting antiviral RNAi is subject of debate. Here, the experimental evidence to suggest the existence of flavivirus RNAi suppressors is discussed. I will highlight the putative role of non-coding, subgenomic flavivirus RNA in suppression of RNAi in insect and mammalian cells. Novel insights from ongoing research will reveal how arthropod-borne viruses modulate innate immunity including antiviral RNAi. Copyright © 2014 Elsevier B.V. All rights reserved.
Long Non-Coding RNAs: Emerging and Versatile Regulators in Host–Virus Interactions

Directory of Open Access Journals (Sweden)

Xing-Yu Meng

2017-11-01

Long non-coding RNAs (lncRNAs) are a class of non-protein-coding RNA molecules, which are involved in various biological processes, including chromatin modification, cell differentiation, pre-mRNA transcription and splicing, protein translation, etc. During the last decade, increasing evidence has suggested the involvement of lncRNAs in both immune and antiviral responses as positive or negative regulators. The immunity-associated lncRNAs modulate diverse and multilayered immune checkpoints, including activation or repression of innate immune signaling components, such as interleukin (IL)-8, IL-10, retinoic acid inducible gene I, toll-like receptors 1, 3, and 8, and interferon (IFN) regulatory factor 7, transcriptional regulation of various IFN-stimulated genes, and initiation of the cell apoptosis pathways. Additionally, some virus-encoded lncRNAs facilitate viral replication through individually or synergistically inhibiting the host antiviral responses or regulating multiple steps of the virus life cycle. Moreover, some viruses are reported to hijack host-encoded lncRNAs to establish persistent infections. Based on these amazing discoveries, lncRNAs are an emerging hotspot in host–virus interactions. In this review, we summarized the current findings of the host- or virus-encoded lncRNAs and the underlying mechanisms, discussed their impacts on immune responses and viral replication, and highlighted their critical roles in host–virus interactions.
Quantitative Profiling of Peptides from RNAs classified as non-coding

Science.gov (United States)

Prabakaran, Sudhakaran; Hemberg, Martin; Chauhan, Ruchi; Winter, Dominic; Tweedie-Cullen, Ry Y.; Dittrich, Christian; Hong, Elizabeth; Gunawardena, Jeremy; Steen, Hanno; Kreiman, Gabriel; Steen, Judith A.

2014-01-01

Only a small fraction of the mammalian genome codes for messenger RNAs destined to be translated into proteins, and it is generally assumed that a large portion of transcribed sequences, including introns and several classes of non-coding RNAs (ncRNAs), do not give rise to peptide products. A systematic examination of translation and physiological regulation of ncRNAs has not been conducted. Here, we use computational methods to identify the products of non-canonical translation in mouse neurons by analyzing unannotated transcripts in combination with proteomic data. This study supports the existence of non-canonical translation products from both intragenic and extragenic genomic regions, including peptides derived from anti-sense transcripts and introns. Moreover, the studied novel translation products exhibit temporal regulation similar to that of proteins known to be involved in neuronal activity processes. These observations highlight a potentially large and complex set of biologically regulated translational events from transcripts formerly thought to lack coding potential. PMID:25403355
Non-viral Nucleic Acid Delivery Strategies to the Central Nervous System

Directory of Open Access Journals (Sweden)

James-Kevin Tan

2016-11-01

With an increased prevalence and understanding of central nervous system injuries and neurological disorders, nucleic acid therapies are gaining promise as a way to regenerate lost neurons or halt disease progression. While more viral vectors have been used clinically as tools for gene delivery, non-viral vectors are gaining interest due to lower safety concerns and the ability to deliver all types of nucleic acids. Nevertheless, there are still a number of barriers to nucleic acid delivery. In this focused review, we explore the in vivo challenges hindering non-viral nucleic acid delivery to the central nervous system and the strategies and vehicles used to overcome them. Advantages and disadvantages of different routes of administration, including systemic injection, cerebrospinal fluid injection, intraparenchymal injection, and peripheral administration, are discussed. Non-viral vehicles and treatment strategies that have overcome delivery barriers and demonstrated in vivo gene transfer to the central nervous system are presented. These approaches can be used as guidelines in developing synthetic gene delivery vectors for central nervous system applications and will ultimately bring non-viral vectors closer to clinical application.
Performance Analysis for Cooperative Communication System with QC-LDPC Codes Constructed with Integer Sequences

Directory of Open Access Journals (Sweden)

Yan Zhang

2015-01-01

This paper presents four different integer sequences for constructing quasi-cyclic low-density parity-check (QC-LDPC) codes on a mathematical basis. The paper introduces the coding principle and the coding procedure. The QC-LDPC codes constructed from the four integer sequences are compared with LDPC codes built using the PEG algorithm, array codes, and MacKay codes, respectively. Then, the integer sequence QC-LDPC codes are used in coded cooperative communication. Simulation results show that the integer sequence constructed QC-LDPC codes are effective, and that their overall performance is better than that of other types of LDPC codes in coded cooperative communication. The QC-LDPC code constructed from the Dayan integer sequence performs best.
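The abstract does not say how the integer sequences are mapped to a parity-check matrix, so the following is only a minimal sketch of the general QC-LDPC idea: an exponent matrix of cyclic shifts is expanded into circulant permutation blocks. The mapping in `exponents_from_sequence` (shift = block-row index times sequence term, modulo the block size) and the example sequence are assumptions made for illustration, not the construction used in the paper.

```python
def circulant_permutation(size, shift):
    """size x size identity matrix with its columns cyclically shifted by `shift`."""
    return [[1 if col == (row + shift) % size else 0 for col in range(size)]
            for row in range(size)]


def exponents_from_sequence(sequence, block_rows, size):
    """Hypothetical mapping: shift for block (i, j) is i * sequence[j] mod size."""
    return [[(i * term) % size for term in sequence] for i in range(block_rows)]


def expand_exponent_matrix(exponents, size):
    """Replace every shift value by its circulant block to obtain the binary matrix H."""
    parity_check = []
    for block_row in exponents:
        blocks = [circulant_permutation(size, shift) for shift in block_row]
        for r in range(size):
            # Concatenate row r of every block in this block row.
            parity_check.append([bit for block in blocks for bit in block[r]])
    return parity_check


# Illustrative integer sequence (placeholder values, not taken from the paper).
sequence = [1, 2, 3, 5, 8]
block_size = 7
exponents = exponents_from_sequence(sequence, block_rows=3, size=block_size)
H = expand_exponent_matrix(exponents, block_size)
```

Because every block is a permutation matrix, each row of `H` has weight equal to the number of block columns and each column has weight equal to the number of block rows, which is the regular sparse structure that makes such codes "low density".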
Rfam: annotating families of non-coding RNA sequences.

Science.gov (United States)

Daub, Jennifer; Eberhardt, Ruth Y; Tate, John G; Burge, Sarah W

2015-01-01

The primary task of the Rfam database is to collate experimentally validated noncoding RNA (ncRNA) sequences from the published literature and facilitate the prediction and annotation of new homologues in novel nucleotide sequences. We group homologous ncRNA sequences into "families" and related families are further grouped into "clans." We collate and manually curate data cross-references for these families from other databases and external resources. Our Web site offers researchers a simple interface to Rfam and provides tools with which to annotate their own sequences using our covariance models (CMs), through our tools for searching, browsing, and downloading information on Rfam families. In this chapter, we will work through examples of annotating a query sequence, collating family information, and searching for data.
Coding visual features extracted from video sequences.

Science.gov (United States)

Baroffio, Luca; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

2014-05-01

Visual features are successfully exploited in several applications (e.g., visual search, object recognition and tracking, etc.) due to their ability to efficiently represent image content. Several visual analysis tasks require features to be transmitted over a bandwidth-limited network, thus calling for coding techniques to reduce the required bit budget, while attaining a target level of efficiency. In this paper, we propose, for the first time, a coding architecture designed for local features (e.g., SIFT, SURF) extracted from video sequences. To achieve high coding efficiency, we exploit both spatial and temporal redundancy by means of intraframe and interframe coding modes. In addition, we propose a coding mode decision based on rate-distortion optimization. The proposed coding scheme can be conveniently adopted to implement the analyze-then-compress (ATC) paradigm in the context of visual sensor networks. That is, sets of visual features are extracted from video frames, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast to the traditional compress-then-analyze (CTA) paradigm, in which video sequences acquired at a node are compressed and then sent to a central unit for further processing. In this paper, we compare these coding paradigms using metrics that are routinely adopted to evaluate the suitability of visual features in the context of content-based retrieval, object recognition, and tracking. Experimental results demonstrate that, thanks to the significant coding gains achieved by the proposed coding scheme, ATC outperforms CTA with respect to all evaluation metrics.
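The rate-distortion based mode decision mentioned above can be illustrated with a minimal sketch. It assumes the usual Lagrangian formulation, in which the mode minimising J = D + lambda * R is chosen; the abstract does not state the exact cost function, and the mode names, distortion values, and bit counts below are hypothetical.

```python
def choose_coding_mode(candidates, lagrange_multiplier):
    """Return the mode name with the lowest Lagrangian cost J = D + lambda * R.

    `candidates` maps a mode name to a (distortion, rate_in_bits) pair.
    """
    def cost(item):
        _, (distortion, rate_bits) = item
        return distortion + lagrange_multiplier * rate_bits

    best_mode, _ = min(candidates.items(), key=cost)
    return best_mode


# Hypothetical measurements for one local feature descriptor.
candidates = {
    "intra": (2.0, 100),  # lower distortion, more bits
    "inter": (2.5, 40),   # exploits temporal redundancy, fewer bits
}

mode_when_bits_are_costly = choose_coding_mode(candidates, lagrange_multiplier=0.01)
mode_when_bits_are_cheap = choose_coding_mode(candidates, lagrange_multiplier=0.001)
```

With these illustrative numbers, a larger multiplier favours the cheaper interframe mode, while a smaller one favours the lower-distortion intraframe mode.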
Orion: Detecting regions of the human non-coding genome that are intolerant to variation using population genetics.

Science.gov (United States)

Gussow, Ayal B; Copeland, Brett R; Dhindsa, Ryan S; Wang, Quanli; Petrovski, Slavé; Majoros, William H; Allen, Andrew S; Goldstein, David B

2017-01-01

There is broad agreement that genetic mutations occurring outside of the protein-coding regions play a key role in human disease. Despite this consensus, we are not yet capable of discerning which portions of non-coding sequence are important in the context of human disease. Here, we present Orion, an approach that detects regions of the non-coding genome that are depleted of variation, suggesting that the regions are intolerant of mutations and subject to purifying selection in the human lineage. We show that Orion is highly correlated with known intolerant regions as well as regions that harbor putatively pathogenic variation. This approach provides a mechanism to identify pathogenic variation in the human non-coding genome and will have immediate utility in the diagnostic interpretation of patient genomes and in large case control studies using whole-genome sequences.
Screening and Identification of putative long non coding RNAs from transcriptome data of a high yielding blackgram (Vigna mungo), Cv. T9

Directory of Open Access Journals (Sweden)

Pankaj Kumar Singh

2018-04-01

Blackgram (Vigna mungo) is one of the primary legumes cultivated throughout India, Cv. T9 being one of its common high-yielding cultivars. This article reports RNA sequencing data and a pipeline for prediction of novel long non-coding RNAs from the sequenced data. The raw data generated during sequencing are available at the Sequence Read Archive (SRA) of NCBI with accession number SRX1558530. Keywords: Blackgram, Long non-coding RNA, Legumes, RNA sequencing data
Optimizing viral and non-viral gene transfer methods for genetic modification of porcine mesenchymal stem cells

DEFF Research Database (Denmark)

Stiehler, Maik; Duch, Mogens; Mygind, Tina

2006-01-01

INTRODUCTION: Mesenchymal stem cells (MSCs) provide an excellent source of pluripotent progenitor cells for tissue-engineering applications due to their proliferation capacity and differentiation potential. Genetic modification of MSCs with genes encoding tissue-specific growth factors...... viral and non-viral ex vivo gene delivery systems with respect to gene transfer efficiency, maintenance of transgene expression, and safety issues using primary porcine MSCs as target cells. MATERIALS AND METHODS: MSCs were purified from bone marrow aspirates from the proximal tibiae of four 3-month......-old Danish landrace pigs by Ficoll step gradient separation and polystyrene adherence technique. Vectors expressing enhanced green fluorescent protein (eGFP) and human bone morphogenetic protein-2 (BMP-2) were transferred to the cells by different non-viral methods and by use of recombinant adeno...
Genome-wide characterization of long intergenic non-coding RNAs (lincRNAs) provides new insight into viral diseases in honey bees Apis cerana and Apis mellifera.

Science.gov (United States)

Jayakodi, Murukarthick; Jung, Je Won; Park, Doori; Ahn, Young-Joon; Lee, Sang-Choon; Shin, Sang-Yoon; Shin, Chanseok; Yang, Tae-Jin; Kwon, Hyung Wook

2015-09-04

Long non-coding RNAs (lncRNAs) are a class of RNAs that do not encode proteins. Recently, lncRNAs have gained special attention for their roles in various biological processes and diseases. In an attempt to identify long intergenic non-coding RNAs (lincRNAs) and their possible involvement in honey bee development and diseases, we analyzed RNA-seq datasets generated from Asian honey bee (Apis cerana) and western honey bee (Apis mellifera). We identified 2470 lincRNAs with an average length of 1011 bp from A. cerana and 1514 lincRNAs with an average length of 790 bp in A. mellifera. Comparative analysis revealed that 5% of the total lincRNAs derived from both species are unique in each species. Our comparative digital gene expression analysis revealed a high degree of tissue-specific expression among the seven major tissues of honey bee, different from mRNA expression patterns. A total of 863 (57%) and 464 (18%) lincRNAs showed tissue-dependent expression in A. mellifera and A. cerana, respectively, most preferentially in ovary and fat body tissues. Importantly, we identified 11 lincRNAs that are specifically regulated upon viral infection in honey bees, and 10 of them appear to play roles during infection with various viruses. This study provides the first comprehensive set of lincRNAs for honey bees and opens the door to discover lincRNAs associated with biological and hormone signaling pathways as well as various diseases of honey bee.
Whole transcriptome sequencing enables discovery and analysis of viruses in archived primary central nervous system lymphomas.

Directory of Open Access Journals (Sweden)

Christopher DeBoever

Primary central nervous system lymphomas (PCNSL) have a dramatically increased prevalence among persons living with AIDS and are known to be associated with human Epstein Barr virus (EBV) infection. Previous work suggests that in some cases, co-infection with other viruses may be important for PCNSL pathogenesis. Viral transcription in tumor samples can be measured using next generation transcriptome sequencing. We demonstrate the ability of transcriptome sequencing to identify viruses, characterize viral expression, and identify viral variants by sequencing four archived AIDS-related PCNSL tissue samples and analyzing raw sequencing reads. EBV was detected in all four PCNSL samples and cytomegalovirus (CMV), JC polyomavirus (JCV), and HIV were also discovered, consistent with clinical diagnoses. CMV was found to express three long non-coding RNAs recently reported as expressed during active infection. Single nucleotide variants were observed in each of the viruses observed and three indels were found in CMV. No viruses were found in several control tumor types including 32 diffuse large B-cell lymphoma samples. This study demonstrates the ability of next generation transcriptome sequencing to accurately identify viruses, including DNA viruses, in solid human cancer tissue samples.
Machine-Checked Sequencer for Critical Embedded Code Generator

Science.gov (United States)

Izerrouken, Nassima; Pantel, Marc; Thirioux, Xavier

This paper presents the development of a correct-by-construction block sequencer for GeneAuto, a qualifiable (according to the DO178B/ED12B recommendation) automatic code generator. It transforms Simulink models to MISRA C code for safety critical systems. Our approach, which combines a classical development process with formal specification and verification using proof assistants, led to preliminary fruitful exchanges with certification authorities. We present parts of the classical user and tool requirements and the derived formal specifications, implementation and verification for the correctness and termination of the block sequencer. This sequencer has been successfully applied to real-size industrial use cases from various transportation domain partners and led to the detection of requirement errors and a correct-by-construction implementation.
Bluetongue virus non-structural protein 1 is a positive regulator of viral protein synthesis

Directory of Open Access Journals (Sweden)

Boyce Mark

2012-08-01

Abstract. Background: Bluetongue virus (BTV) is a double-stranded RNA (dsRNA) virus of the Reoviridae family, which encodes its genes in ten linear dsRNA segments. BTV mRNAs are synthesised by the viral RNA-dependent RNA polymerase (RdRp) as exact plus sense copies of the genome segments. Infection of mammalian cells with BTV rapidly replaces cellular protein synthesis with viral protein synthesis, but the regulation of viral gene expression in the Orbivirus genus has not been investigated. Results: Using an mRNA reporter system based on genome segment 10 of BTV fused with GFP we identify the protein characteristic of this genus, non-structural protein 1 (NS1), as sufficient to upregulate translation. The wider applicability of this phenomenon among the viral genes is demonstrated using the untranslated regions (UTRs) of BTV genome segments flanking the quantifiable Renilla luciferase ORF in chimeric mRNAs. The UTRs of viral mRNAs are shown to be determinants of the amount of protein synthesised, with the pre-expression of NS1 increasing the quantity in each case. The increased expression induced by pre-expression of NS1 is confirmed in virus infected cells by generating a replicating virus which expresses the reporter fused with genome segment 10, using reverse genetics. Moreover, NS1-mediated upregulation of expression is restricted to mRNAs which lack the cellular 3′ poly(A) sequence, identifying the 3′ end as a necessary determinant in specifically increasing the translation of viral mRNA in the presence of cellular mRNA. Conclusions: NS1 is identified as a positive regulator of viral protein synthesis. We propose a model of translational regulation where NS1 upregulates the synthesis of viral proteins, including itself, and creates a positive feedback loop of NS1 expression, which rapidly increases the expression of all the viral proteins. The efficient translation of viral reporter mRNAs among cellular mRNAs can account for the observed
Bluetongue virus non-structural protein 1 is a positive regulator of viral protein synthesis.

Science.gov (United States)

Boyce, Mark; Celma, Cristina C P; Roy, Polly

2012-08-29

Bluetongue virus (BTV) is a double-stranded RNA (dsRNA) virus of the Reoviridae family, which encodes its genes in ten linear dsRNA segments. BTV mRNAs are synthesised by the viral RNA-dependent RNA polymerase (RdRp) as exact plus sense copies of the genome segments. Infection of mammalian cells with BTV rapidly replaces cellular protein synthesis with viral protein synthesis, but the regulation of viral gene expression in the Orbivirus genus has not been investigated. Using an mRNA reporter system based on genome segment 10 of BTV fused with GFP we identify the protein characteristic of this genus, non-structural protein 1 (NS1) as sufficient to upregulate translation. The wider applicability of this phenomenon among the viral genes is demonstrated using the untranslated regions (UTRs) of BTV genome segments flanking the quantifiable Renilla luciferase ORF in chimeric mRNAs. The UTRs of viral mRNAs are shown to be determinants of the amount of protein synthesised, with the pre-expression of NS1 increasing the quantity in each case. The increased expression induced by pre-expression of NS1 is confirmed in virus infected cells by generating a replicating virus which expresses the reporter fused with genome segment 10, using reverse genetics. Moreover, NS1-mediated upregulation of expression is restricted to mRNAs which lack the cellular 3' poly(A) sequence identifying the 3' end as a necessary determinant in specifically increasing the translation of viral mRNA in the presence of cellular mRNA. NS1 is identified as a positive regulator of viral protein synthesis. We propose a model of translational regulation where NS1 upregulates the synthesis of viral proteins, including itself, and creates a positive feedback loop of NS1 expression, which rapidly increases the expression of all the viral proteins. The efficient translation of viral reporter mRNAs among cellular mRNAs can account for the observed replacement of cellular protein synthesis with viral protein
Advanced Design of Dumbbell-shaped Genetic Minimal Vectors Improves Non-coding and Coding RNA Expression.

Science.gov (United States)

Jiang, Xiaoou; Yu, Han; Teo, Cui Rong; Tan, Genim Siu Xian; Goh, Sok Chin; Patel, Parasvi; Chua, Yiqiang Kevin; Hameed, Nasirah Banu Sahul; Bertoletti, Antonio; Patzel, Volker

2016-09-01

Dumbbell-shaped DNA minimal vectors lacking nontherapeutic genes and bacterial sequences are considered a stable, safe alternative to viral, nonviral, and naked plasmid-based gene-transfer systems. We investigated novel molecular features of dumbbell vectors aiming to reduce vector size and to improve the expression of noncoding or coding RNA. We minimized small hairpin RNA (shRNA) or microRNA (miRNA) expressing dumbbell vectors in size down to 130 bp generating the smallest genetic expression vectors reported. This was achieved by using a minimal H1 promoter with integrated transcriptional terminator transcribing the RNA hairpin structure around the dumbbell loop. Such vectors were generated with high conversion yields using a novel protocol. Minimized shRNA-expressing dumbbells showed accelerated kinetics of delivery and transcription leading to enhanced gene silencing in human tissue culture cells. In primary human T cells, minimized miRNA-expressing dumbbells revealed higher stability and triggered stronger target gene suppression as compared with plasmids and miRNA mimics. Dumbbell-driven gene expression was enhanced up to 56- or 160-fold by implementation of an intron and the SV40 enhancer compared with control dumbbells or plasmids. Advanced dumbbell vectors may represent one option to close the gap between durable expression that is achievable with integrating viral vectors and short-term effects triggered by naked RNA.
Overexpression of long non-coding RNAs following exposure to xenobiotics in the aquatic midge Chironomus riparius

International Nuclear Information System (INIS)

Martínez-Guitarte, José-Luis; Planelló, Rosario; Morcillo, Gloria

2012-01-01

Non-coding RNAs (ncRNAs) represent an important transcriptional output of eukaryotic genomes. In addition to their functional relevance as housekeeping and regulatory elements, recent studies have suggested their involvement in rather unexpected cellular functions. The aim of this work was to analyse the transcriptional behaviour of non-coding RNAs in the toxic response to pollutants in Chironomus riparius, a reference organism in aquatic toxicology. Three well-characterized long non-coding sequences were studied: telomeric repeats, Cla repetitive elements and the SINE CTRT1. Transcription levels were evaluated by RT-PCR after 24-h exposures to three current aquatic contaminants: bisphenol A (BPA), benzyl butyl phthalate (BBP) and the heavy metal cadmium (Cd). Upregulation of telomeric transcripts was found after BPA treatments. Moreover, BPA significantly activated Cla transcription, which also appeared to be increased by cadmium, whereas BBP did not affect the transcription levels of these sequences. Transcription of SINE CTRT1 was not altered by any of the chemicals tested. These data are discussed in the light of previous studies that have shown a response by long ncRNAs (lncRNAs) to cellular stressors, indicating a relationship with environmental stimuli. Our results demonstrated for the first time the ability of bisphenol A to activate non-coding sequences mainly located at telomeres and centromeres. Overall, this study provides evidence that xenobiotics can induce specific responses in ncRNAs derived from repetitive sequences that could be relevant in the toxic response, and also suggests that ncRNAs could represent a novel class of potential biomarkers in toxicological assessment.
Overexpression of long non-coding RNAs following exposure to xenobiotics in the aquatic midge Chironomus riparius

Energy Technology Data Exchange (ETDEWEB)

Martinez-Guitarte, Jose-Luis, E-mail: jlmartinez@ccia.uned.es [Grupo de Biologia y Toxicologia Ambiental, Facultad de Ciencias, Universidad Nacional de Educacion a Distancia, UNED, Senda del Rey 9, 28040 Madrid (Spain); Planello, Rosario; Morcillo, Gloria [Grupo de Biologia y Toxicologia Ambiental, Facultad de Ciencias, Universidad Nacional de Educacion a Distancia, UNED, Senda del Rey 9, 28040 Madrid (Spain)

2012-04-15

Non-coding RNAs (ncRNAs) represent an important transcriptional output of eukaryotic genomes. In addition to their functional relevance as housekeeping and regulatory elements, recent studies have suggested their involvement in rather unexpected cellular functions. The aim of this work was to analyse the transcriptional behaviour of non-coding RNAs in the toxic response to pollutants in Chironomus riparius, a reference organism in aquatic toxicology. Three well-characterized long non-coding sequences were studied: telomeric repeats, Cla repetitive elements and the SINE CTRT1. Transcription levels were evaluated by RT-PCR after 24-h exposures to three current aquatic contaminants: bisphenol A (BPA), benzyl butyl phthalate (BBP) and the heavy metal cadmium (Cd). Upregulation of telomeric transcripts was found after BPA treatments. Moreover, BPA significantly activated Cla transcription, which also appeared to be increased by cadmium, whereas BBP did not affect the transcription levels of these sequences. Transcription of SINE CTRT1 was not altered by any of the chemicals tested. These data are discussed in the light of previous studies that have shown a response by long ncRNAs (lncRNAs) to cellular stressors, indicating a relationship with environmental stimuli. Our results demonstrated for the first time the ability of bisphenol A to activate non-coding sequences mainly located at telomeres and centromeres. Overall, this study provides evidence that xenobiotics can induce specific responses in ncRNAs derived from repetitive sequences that could be relevant in the toxic response, and also suggests that ncRNAs could represent a novel class of potential biomarkers in toxicological assessment.
Sequence- and interactome-based prediction of viral protein hotspots targeting host proteins: a case study for HIV Nef.

Directory of Open Access Journals (Sweden)

Mahdi Sarmady

Virus proteins alter protein pathways of the host toward the synthesis of viral particles by breaking and making edges via binding to host proteins. In this study, we developed a computational approach to predict viral sequence hotspots for binding to host proteins based on sequences of viral and host proteins and literature-curated virus-host protein interactome data. We use a motif discovery algorithm repeatedly on collections of sequences of viral proteins and immediate binding partners of their host targets and choose only those motifs that are conserved on viral sequences and highly statistically enriched among binding partners of virus protein targeted host proteins. Our results match experimental data on binding sites of Nef to host proteins such as MAPK1, VAV1, LCK, HCK, HLA-A, CD4, FYN, and GNB2L1 with high statistical significance, but the approach is a poor predictor of Nef binding sites on highly flexible, hoop-like regions. Predicted hotspots recapture CD8 cell epitopes of HIV Nef, highlighting their importance in modulating virus-host interactions. Host proteins potentially targeted or outcompeted by Nef appear to crowd the T cell receptor, natural killer cell mediated cytotoxicity, and neurotrophin signaling pathways. Scanning of HIV Nef motifs on multiple alignments of hepatitis C protein NS5A produces results consistent with literature, indicating the potential value of the hotspot discovery in advancing our understanding of virus-host crosstalk.

CRISPR/Cas9-Mediated Immunity to Geminiviruses: Differential Interference and Evasion

KAUST Repository

Ali, Zahir; Ali, Shawkat; Tashkandi, Manal; Zaidi, Syed Shan-e-Ali; Mahfouz, Magdy M.

2016-01-01

The CRISPR/Cas9 system has recently been used to confer molecular immunity against several eukaryotic viruses, including plant DNA geminiviruses. Here, we provide a detailed analysis of the efficiencies of targeting different coding and non-coding sequences in the genomes of multiple geminiviruses. Moreover, we analyze the ability of geminiviruses to evade the CRISPR/Cas9 machinery. Our results demonstrate that the CRISPR/Cas9 machinery can efficiently target coding and non-coding sequences and interfere with various geminiviruses. Furthermore, targeting the coding sequences of different geminiviruses resulted in the generation of viral variants capable of replication and systemic movement. By contrast, targeting the noncoding intergenic region sequences of geminiviruses resulted in interference, but with inefficient recovery of mutated viral variants, which thus limited the generation of variants capable of replication and movement. Taken together, our results indicate that targeting noncoding, intergenic sequences provides viral interference activity and significantly limits the generation of viral variants capable of replication and systemic infection, which is essential for developing durable resistance strategies for long-term virus control.
CRISPR/Cas9-Mediated Immunity to Geminiviruses: Differential Interference and Evasion

KAUST Repository

Ali, Zahir

2016-05-26

The CRISPR/Cas9 system has recently been used to confer molecular immunity against several eukaryotic viruses, including plant DNA geminiviruses. Here, we provide a detailed analysis of the efficiencies of targeting different coding and non-coding sequences in the genomes of multiple geminiviruses. Moreover, we analyze the ability of geminiviruses to evade the CRISPR/Cas9 machinery. Our results demonstrate that the CRISPR/Cas9 machinery can efficiently target coding and non-coding sequences and interfere with various geminiviruses. Furthermore, targeting the coding sequences of different geminiviruses resulted in the generation of viral variants capable of replication and systemic movement. By contrast, targeting the noncoding intergenic region sequences of geminiviruses resulted in interference, but with inefficient recovery of mutated viral variants, which thus limited the generation of variants capable of replication and movement. Taken together, our results indicate that targeting noncoding, intergenic sequences provides viral interference activity and significantly limits the generation of viral variants capable of replication and systemic infection, which is essential for developing durable resistance strategies for long-term virus control.
Anti-metastatic effects of viral and non-viral mediated Nk4 delivery to tumours.

Science.gov (United States)

Buhles, Alexandra; Collins, Sara A; van Pijkeren, Jan P; Rajendran, Simon; Miles, Michelle; O'Sullivan, Gerald C; O'Hanlon, Deirdre M; Tangney, Mark

2009-03-09

The most common cause of death of cancer sufferers is through the occurrence of metastases. The metastatic behaviour of tumour cells is regulated by extracellular growth factors such as hepatocyte growth factor (HGF), a ligand for the c-Met receptor tyrosine kinase, and aberrant expression/activation of the c-Met receptor is closely associated with metastatic progression. Nk4 (also known as Interleukin (IL)32b) is a competitive antagonist of the HGF c-Met system and inhibits c-Met signalling and tumour metastasis. Nk4 has an additional anti-angiogenic activity independent of its HGF-antagonist function. Angiogenesis-inhibitory as well as cancer-specific apoptosis inducing effects make the Nk4 sequence an attractive candidate for gene therapy of cancer. This study investigates the inhibition of tumour metastasis by gene therapy mediated production of Nk4 by the primary tumour. Optimal delivery of anti-cancer genes is vital in order to achieve the highest therapeutic responses. Non-viral plasmid delivery methods have the advantage of safety and ease of production, providing immediate transgene expression, albeit short-lived in most tumours. Sustained presence of anti-angiogenic molecules is preferable with anti-angiogenic therapies, and the long-term expression mediated by Adeno-associated Virus (AAV) might represent a more appropriate delivery in this respect. However, the incubation time required by AAV vectors to reach appropriate gene expression levels hampers efficacy in many fast-growing murine tumour models. Here, we describe murine trials assessing the effects of Nk4 on the spontaneously metastatic Lewis Lung Carcinoma (LLC) model when delivered to primary tumour via plasmid lipofection or AAV2 vector. Intratumoural AAV-Nk4 administration produced the highest therapeutic response with significant reduction in both primary tumour growth and incidence of lung metastases. Plasmid-mediated therapy also significantly reduced metastatic growth, but with moderate
Phage and Nucleocytoplasmic Large Viral Sequences Dominate Coral Viromes from the Arabian Gulf.

Science.gov (United States)

Mahmoud, Huda; Jose, Liny

2017-01-01

Corals that naturally thrive under extreme conditions are gaining increasing attention due to their importance as living models to understand the impact of global warming on world corals. Here, we present the first metagenomic study of viral communities in corals thriving in a thermally variable water body in which the temperature fluctuates between 11 and 39°C in different seasons. The viral assemblages of two of the most abundant massive (Porites harrisoni) and branching (Acropora downingi) corals in offshore and inshore reef systems in the northern Arabian Gulf were investigated. Samples were collected from five reef systems during summer, autumn and winter of 2011/2012. The two coral viromes contain 12 viral families, including 10 dsDNA viral families [Siphoviridae, Podoviridae, Myoviridae, Phycodnaviridae, Baculoviridae, Herpesviridae, Adenoviridae, Alloherpesviridae, Mimiviridae and one unclassified family], one ssDNA viral family (Microviridae) and one RNA viral family (Retroviridae). Overall, sequences significantly similar to Podoviridae were the most abundant in the P. harrisoni and A. downingi viromes. Various morphological types of virus-like particles (VLPs) were confirmed in the healthy coral tissue by transmission electron microscopy, including large tailless VLPs and electron-dense core VLPs. Tailed bacteriophages were isolated from coral tissue using a plaque assay. Higher functional gene diversity was recorded in A. downingi than in P. harrisoni, and comparative metagenomics revealed that the Gulf viral assemblages are functionally distinct from Pacific Ocean coral viral communities.
Phage and Nucleocytoplasmic Large Viral Sequences Dominate Coral Viromes from the Arabian Gulf

Directory of Open Access Journals (Sweden)

Huda Mahmoud

2017-10-01

Corals that naturally thrive under extreme conditions are gaining increasing attention due to their importance as living models to understand the impact of global warming on world corals. Here, we present the first metagenomic study of viral communities in corals thriving in a thermally variable water body in which the temperature fluctuates between 11 and 39°C in different seasons. The viral assemblages of two of the most abundant massive (Porites harrisoni) and branching (Acropora downingi) corals in offshore and inshore reef systems in the northern Arabian Gulf were investigated. Samples were collected from five reef systems during summer, autumn and winter of 2011/2012. The two coral viromes contain 12 viral families, including 10 dsDNA viral families [Siphoviridae, Podoviridae, Myoviridae, Phycodnaviridae, Baculoviridae, Herpesviridae, Adenoviridae, Alloherpesviridae, Mimiviridae and one unclassified family], one ssDNA viral family (Microviridae) and one RNA viral family (Retroviridae). Overall, sequences significantly similar to Podoviridae were the most abundant in the P. harrisoni and A. downingi viromes. Various morphological types of virus-like particles (VLPs) were confirmed in the healthy coral tissue by transmission electron microscopy, including large tailless VLPs and electron-dense core VLPs. Tailed bacteriophages were isolated from coral tissue using a plaque assay. Higher functional gene diversity was recorded in A. downingi than in P. harrisoni, and comparative metagenomics revealed that the Gulf viral assemblages are functionally distinct from Pacific Ocean coral viral communities.
Algebraic solution of the synthesis problem for coded sequences

International Nuclear Information System (INIS)

Leukhin, Anatolii N

2005-01-01

The algebraic solution of a 'complex' problem of synthesis of phase-coded (PC) sequences with the zero level of side lobes of the cyclic autocorrelation function (ACF) is proposed. It is shown that the solution of the synthesis problem is connected with the existence of difference sets for a given code dimension. The problem of estimating the number of possible code combinations for a given code dimension is solved. It is pointed out that the problem of synthesis of PC sequences is related to the fundamental problems of discrete mathematics and, first of all, to a number of combinatorial problems, which can be solved, as the number factorisation problem, by algebraic methods by using the theory of Galois fields and groups. (fourth seminar to the memory of d.n. klyshko)
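The property this abstract targets, a cyclic autocorrelation function with zero side lobes, can be checked numerically. The sketch below is not the author's algebraic construction: it uses a Zadoff-Chu sequence, a well-known polyphase family with this property, purely as a stand-in. The function names `zadoff_chu` and `cyclic_acf` are illustrative choices, not names from the paper.

```python
import cmath
from math import gcd


def zadoff_chu(length, root=1):
    """Return an odd-length Zadoff-Chu polyphase sequence (illustrative stand-in)."""
    if length % 2 == 0 or gcd(root, length) != 1:
        raise ValueError("this sketch needs an odd length coprime with the root")
    return [cmath.exp(-1j * cmath.pi * root * n * (n + 1) / length)
            for n in range(length)]


def cyclic_acf(seq):
    """Cyclic (periodic) autocorrelation: R[k] = sum_n x[n] * conj(x[(n + k) mod N])."""
    size = len(seq)
    return [sum(seq[n] * seq[(n + k) % size].conjugate() for n in range(size))
            for k in range(size)]


seq = zadoff_chu(7)
acf = cyclic_acf(seq)
main_lobe = abs(acf[0])                        # equals the sequence length
max_side_lobe = max(abs(v) for v in acf[1:])   # zero up to rounding error
print(f"main lobe = {main_lobe:.3f}, largest side lobe = {max_side_lobe:.2e}")
```

Every element has unit magnitude, so only the phases carry the code, which is what "phase-coded" means here. Any other candidate sequence can be passed to `cyclic_acf` in the same way to inspect its side lobes.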
A Next-Generation Sequencing Approach Uncovers Viral Transcripts Incorporated in Poxvirus Virions

Directory of Open Access Journals (Sweden)

Marica Grossegesse

2017-10-01

Transcripts are known to be incorporated in particles of DNA viruses belonging to the families of Herpesviridae and Mimiviridae, but the presence of transcripts in other DNA viruses, such as poxviruses, has not been analyzed yet. Therefore, we first established a next-generation-sequencing (NGS)-based protocol, enabling the unbiased identification of transcripts in virus particles. Subsequently, we applied our protocol to analyze RNA in an emerging zoonotic member of the Poxviridae family, namely Cowpox virus. Our results revealed the incorporation of 19 viral transcripts, while host identifications were restricted to ribosomal and mitochondrial RNA. Most viral transcripts had an unknown and immunomodulatory function, suggesting that transcript incorporation may be beneficial for poxvirus immune evasion. Notably, the most abundant transcript originated from the D5L/I1R gene that encodes a viral inhibitor of the host cytoplasmic DNA sensing machinery.
Continuous Non-malleable Codes

DEFF Research Database (Denmark)

Faust, Sebastian; Mukherjee, Pratyay; Nielsen, Jesper Buus

2014-01-01

or modify it to the encoding of a completely unrelated value. This paper introduces an extension of the standard non-malleability security notion - so-called continuous non-malleability - where we allow the adversary to tamper continuously with an encoding. This is in contrast to the standard notion of non...... is necessary to achieve continuous non-malleability in the split-state model. Moreover, we illustrate that none of the existing constructions satisfies our uniqueness property and hence is not secure in the continuous setting. We construct a split-state code satisfying continuous non-malleability. Our scheme...... is based on the inner product function, collision-resistant hashing and non-interactive zero-knowledge proofs of knowledge and requires an untamperable common reference string. We apply continuous non-malleable codes to protect arbitrary cryptographic primitives against tampering attacks. Previous...
Non-binary Entanglement-assisted Stabilizer Quantum Codes

OpenAIRE

Riguang, Leng; Zhi, Ma

2011-01-01

In this paper, we show how to construct non-binary entanglement-assisted stabilizer quantum codes by using pre-shared entanglement between the sender and receiver. We also give an algorithm to determine the circuit for non-binary entanglement-assisted stabilizer quantum codes and some illustrated examples. The codes we constructed do not require the dual-containing constraint, and many non-binary classical codes, like non-binary LDPC codes, which do not satisfy the condition, can be used to c...
MASTR: multiple alignment and structure prediction of non-coding RNAs using simulated annealing

DEFF Research Database (Denmark)

Lindgreen, Stinus; Gardner, Paul P; Krogh, Anders

2007-01-01

function that considers sequence conservation, covariation and basepairing probabilities. The results show that the method is very competitive to similar programs available today, both in terms of accuracy and computational efficiency. AVAILABILITY: Source code available from http://mastr.binf.ku.dk/......MOTIVATION: As more non-coding RNAs are discovered, the importance of methods for RNA analysis increases. Since the structure of ncRNA is intimately tied to the function of the molecule, programs for RNA structure prediction are necessary tools in this growing field of research. Furthermore......, it is known that RNA structure is often evolutionarily more conserved than sequence. However, few existing methods are capable of simultaneously considering multiple sequence alignment and structure prediction. RESULT: We present a novel solution to the problem of simultaneous structure prediction...
Fast comparison of IS radar code sequences for lag profile inversion

Directory of Open Access Journals (Sweden)

M. S. Lehtinen

2008-08-01

A fast method for theoretically comparing the posteriori variances produced by different phase code sequences in incoherent scatter radar (ISR) experiments is introduced. Alternating codes of types 1 and 2 are known to be optimal for selected range resolutions, but the code sets are inconveniently long for many purposes like ground clutter estimation and in cases where coherent echoes from lower ionospheric layers are to be analyzed in addition to standard F-layer spectra.

The method is used in practice for searching binary code quads that have estimation accuracy almost equal to that of much longer alternating code sets. Though the code sequences can consist of as few as four different transmission envelopes, the lag profile estimation variances are near to the theoretical minimum. Thus the short code sequence is equally good as a full cycle of alternating codes with the same pulse length and bit length. The short code groups cannot be directly decoded, but the decoding is done in connection with more computationally expensive lag profile inversion in data analysis.

The actual code searches as well as the analysis and real data results from the found short code searches are explained in other papers sent to the same issue of this journal. We also discuss interesting subtle differences found between the different alternating codes by this method. We assume that thermal noise dominates the incoherent scatter signal.
Sequence Coding and Search System for licensee event reports: code listings. Volume 2

International Nuclear Information System (INIS)

Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

1985-04-01

Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This system provides a structured format for detailed coding of component, system, and unit effects as well as personnel errors. The database contains all current LERs submitted by nuclear power plant utilities for events occurring since 1981 and is updated on a continual basis. Volume 2 contains all valid and acceptable codes used for searching and encoding the LER data. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 2
Non-coding RNA networks in cancer.

Science.gov (United States)

Anastasiadou, Eleni; Jacob, Leni S; Slack, Frank J

2018-01-01

Thousands of unique non-coding RNA (ncRNA) sequences exist within cells. Work from the past decade has altered our perception of ncRNAs from 'junk' transcriptional products to functional regulatory molecules that mediate cellular processes including chromatin remodelling, transcription, post-transcriptional modifications and signal transduction. The networks in which ncRNAs engage can influence numerous molecular targets to drive specific cell biological responses and fates. Consequently, ncRNAs act as key regulators of physiological programmes in developmental and disease contexts. Particularly relevant in cancer, ncRNAs have been identified as oncogenic drivers and tumour suppressors in every major cancer type. Thus, a deeper understanding of the complex networks of interactions that ncRNAs coordinate would provide a unique opportunity to design better therapeutic interventions.
Association of coral algal symbionts with a diverse viral community responsive to heat shock

KAUST Repository

Brüwer, Jan D.

2017-08-17

Stony corals provide the structural foundation of coral reef ecosystems and are termed holobionts given they engage in symbioses, in particular with photosynthetic dinoflagellates of the genus Symbiodinium. Besides Symbiodinium, corals also engage with bacteria affecting metabolism, immunity, and resilience of the coral holobiont, but the role of associated viruses is largely unknown. In this regard, the increase of studies using RNA sequencing (RNA-Seq) to assess gene expression provides an opportunity to elucidate viral signatures encompassed within the data via careful delineation of sequence reads and their source of origin. Here, we re-analyzed an RNA-Seq dataset from a cultured coral symbiont (Symbiodinium microadriaticum, Clade A1) across four experimental treatments (control, cold shock, heat shock, dark shock) to characterize associated viral diversity, abundance, and gene expression. Our approach comprised the filtering and removal of host sequence reads, subsequent phylogenetic assignment of sequence reads of putative viral origin, and the assembly and analysis of differentially expressed viral genes. About 15.46% (123 million) of all sequence reads were non-host-related, of which <1% could be classified as archaea, bacteria, or virus. Of these, 18.78% were annotated as virus and comprised a diverse community consistent across experimental treatments. Further, non-host related sequence reads assembled into 56,064 contigs, including 4856 contigs of putative viral origin that featured 43 differentially expressed genes during heat shock. The differentially expressed genes included viral kinases, ubiquitin, and ankyrin repeat proteins (amongst others), which are suggested to help the virus proliferate and inhibit the algal host's antiviral response. Our results suggest that a diverse viral community is associated with coral algal endosymbionts of the genus Symbiodinium, which prompts further research on their ecological role in coral health and resilience.
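The read and contig counts quoted in this abstract imply a few figures that it does not state. The short calculation below derives them as a rough sketch: the inputs are the rounded values from the abstract, so the outputs are approximations, and the variable names are illustrative. It also assumes that "<1%" refers to the non-host reads, which makes the last figure an upper bound only.

```python
# Values quoted in the abstract (rounded there, so results are approximate).
non_host_reads = 123e6        # "123 million" non-host-related reads
non_host_fraction = 0.1546    # "15.46%" of all sequence reads
total_contigs = 56_064
viral_contigs = 4_856

# Implied size of the full RNA-Seq dataset.
total_reads = non_host_reads / non_host_fraction

# Share of assembled non-host contigs with putative viral origin.
viral_contig_share = viral_contigs / total_contigs

# Assumption: "<1%" applies to the non-host reads, so this is an upper bound
# on reads annotated as virus (18.78% of the classified reads).
max_viral_reads = 0.01 * non_host_reads * 0.1878

print(f"total reads      ~ {total_reads / 1e6:.0f} million")
print(f"viral contigs    ~ {viral_contig_share:.1%} of contigs")
print(f"viral reads      < {max_viral_reads:,.0f}")
```

Under these assumptions the dataset holds roughly 800 million reads, and a little under 9% of the non-host contigs are putatively viral.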
The oncogenic potential of BK-polyomavirus is linked to viral integration into the human genome.

Science.gov (United States)

Kenan, Daniel J; Mieczkowski, Piotr A; Burger-Calderon, Raquel; Singh, Harsharan K; Nickeleit, Volker

2015-11-01

It has been suggested that BK-polyomavirus is linked to oncogenesis via high expression levels of large T-antigen in some urothelial neoplasms arising following kidney transplantation. However, a causal association between BK-polyomavirus, large T-antigen expression and oncogenesis has never been demonstrated in humans. Here we describe an investigation using high-throughput sequencing of tumour DNA obtained from an urothelial carcinoma arising in a renal allograft. We show that a novel BK-polyomavirus strain, named CH-1, is integrated into exon 26 of the myosin-binding protein C1 gene (MYBPC1) on chromosome 12 in tumour cells but not in normal renal cells. Integration of the BK-polyomavirus results in a number of discrete alterations in viral gene expression, including: (a) disruption of VP1 protein expression and robust expression of large T-antigen; (b) preclusion of viral replication; and (c) deletions in the non-coding control region (NCCR), with presumed alterations in promoter feedback loops. Viral integration disrupts one MYBPC1 gene copy and likely alters its expression. Circular episomal BK-polyomavirus gene sequences are not found, and the renal allograft shows no productive polyomavirus infection or polyomavirus nephropathy. These findings support the hypothesis that integration of polyomaviruses is essential to tumourigenesis. It is likely that dysregulation of large T-antigen, with persistent over-expression in non-lytic cells, promotes cell growth, genetic instability and neoplastic transformation. © 2015 The Authors. The Journal of Pathology published by John Wiley & Sons Ltd on behalf of Pathological Society of Great Britain and Ireland.
Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

DEFF Research Database (Denmark)

Balinda, Sheila; Siegismund, Hans; Muwanika, Vincent

2010-01-01

from both serotypes A and O. Conclusions Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2), a probable indication that hardly any FMD introductions of this serotype have occurred from outside...... the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A and O has been obtained. In addition to characterization using the VP1 coding region, analyses involving the non-structural protein coding...
Laboratory procedures to generate viral metagenomes.

Science.gov (United States)

Thurber, Rebecca V; Haynes, Matthew; Breitbart, Mya; Wegley, Linda; Rohwer, Forest

2009-01-01

This collection of laboratory protocols describes the steps to collect viruses from various samples with the specific aim of generating viral metagenome sequence libraries (viromes). Viral metagenomics, the study of uncultured viral nucleic acid sequences from different biomes, relies on several concentration, purification, extraction, sequencing and heuristic bioinformatic methods. No single technique can provide an all-inclusive approach, and therefore the protocols presented here will be discussed in terms of hypothetical projects. However, care must be taken to individualize each step depending on the source and type of viral-particles. This protocol is a description of the processes we have successfully used to: (i) concentrate viral particles from various types of samples, (ii) eliminate contaminating cells and free nucleic acids and (iii) extract, amplify and purify viral nucleic acids. Overall, a sample can be processed to isolate viral nucleic acids suitable for high-throughput sequencing in approximately 1 week.
nRC: non-coding RNA Classifier based on structural features.

Science.gov (United States)

Fiannaca, Antonino; La Rosa, Massimo; La Paglia, Laura; Rizzo, Riccardo; Urso, Alfonso

2017-01-01

Non-coding RNA (ncRNA) are small non-coding sequences involved in gene expression regulation of many biological processes and diseases. The recent discovery of a large set of different ncRNAs with biologically relevant roles has opened the way to develop methods able to discriminate between the different ncRNA classes. Moreover, the lack of knowledge about the complete mechanisms in regulative processes, together with the development of high-throughput technologies, has required the help of bioinformatics tools in addressing biologists and clinicians with a deeper comprehension of the functional roles of ncRNAs. In this work, we introduce a new ncRNA classification tool, nRC (non-coding RNA Classifier). Our approach is based on features extraction from the ncRNA secondary structure together with a supervised classification algorithm implementing a deep learning architecture based on convolutional neural networks. We tested our approach for the classification of 13 different ncRNA classes. We obtained classification scores, using the most common statistical measures. In particular, we reach an accuracy and sensitivity score of about 74%. The proposed method outperforms other similar classification methods based on secondary structure features and machine learning algorithms, including the RNAcon tool that, to date, is the reference classifier. nRC tool is freely available as a docker image at https://hub.docker.com/r/tblab/nrc/. The source code of nRC tool is also available at https://github.com/IcarPA-TBlab/nrc.
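The accuracy and sensitivity scores reported for nRC are standard multi-class measures computed from a confusion matrix. The sketch below shows one common way to compute them. The 3-class matrix is invented toy data, not nRC output, and macro-averaging the per-class sensitivity is an assumption, since the abstract does not say how the 13 classes were averaged.

```python
def accuracy(confusion):
    """Fraction of all samples on the diagonal (rows = true class, columns = predicted)."""
    total = sum(sum(row) for row in confusion)
    correct = sum(confusion[i][i] for i in range(len(confusion)))
    return correct / total


def macro_sensitivity(confusion):
    """Mean per-class recall: true positives divided by all samples of that true class."""
    recalls = [row[i] / sum(row) for i, row in enumerate(confusion)]
    return sum(recalls) / len(recalls)


# Hypothetical toy confusion matrix for three classes, 10 samples each.
toy_confusion = [
    [8, 1, 1],
    [2, 7, 1],
    [0, 3, 7],
]

toy_accuracy = accuracy(toy_confusion)
toy_sensitivity = macro_sensitivity(toy_confusion)
print(f"accuracy = {toy_accuracy:.3f}, macro sensitivity = {toy_sensitivity:.3f}")
```

With equally sized classes, as in this toy matrix, accuracy and macro-averaged sensitivity coincide, which is one way a paper can report a single figure of about 74% for both.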
Regulated expression of the human cytomegalovirus pp65 gene: Octamer sequence in the promoter is required for activation by viral gene products

International Nuclear Information System (INIS)

Depto, A.S.; Stenberg, R.M.

1989-01-01

To better understand the regulation of late gene expression in human cytomegalovirus (CMV)-infected cells, the authors examined expression of the gene that codes for the 65-kilodalton lower-matrix phosphoprotein (pp65). Analysis of RNA isolated at 72 h from cells infected with CMV Towne or ts66, a DNA-negative temperature-sensitive mutant, supported the fact that pp65 is expressed at low levels prior to viral DNA replication but maximally expressed after the initiation of viral DNA replication. To investigate promoter activation in a transient expression assay, the pp65 promoter was cloned into the indicator plasmid containing the gene for chloramphenicol acetyltransferase (CAT). Transfection of the promoter-CAT construct and subsequent superinfection with CMV resulted in activation of the promoter at early times after infection. Cotransfection with plasmids capable of expressing immediate-early (IE) proteins demonstrated that the promoter was activated by IE proteins and that both IE regions 1 and 2 were necessary. These studies suggest that interactions between IE proteins and this octamer sequence may be important for the regulation and expression of this CMV gene
Analysis of Non-binary Hybrid LDPC Codes

OpenAIRE

Sassatelli, Lucile; Declercq, David

2008-01-01

In this paper, we analyse asymptotically a new class of LDPC codes called Non-binary Hybrid LDPC codes, which has been recently introduced. We use density evolution techniques to derive a stability condition for hybrid LDPC codes, and prove their threshold behavior. We study this stability condition to conclude on asymptotic advantages of hybrid LDPC codes compared to their non-hybrid counterparts.

Novel classes of non-coding RNAs and cancer

Directory of Open Access Journals (Sweden)

Sana Jiri

2012-05-01

For many years, the central dogma of molecular biology has been that RNA functions mainly as an informational intermediate between a DNA sequence and its encoded protein. But one of the great surprises of modern biology was the discovery that protein-coding genes represent less than 2% of the total genome sequence, and subsequently the fact that at least 90% of the human genome is actively transcribed. Thus, the human transcriptome was found to be more complex than a collection of protein-coding genes and their splice variants. Although initially argued to be spurious transcriptional noise or accumulated evolutionary debris arising from the early assembly of genes and/or the insertion of mobile genetic elements, recent evidence suggests that the non-coding RNAs (ncRNAs) may play major biological roles in cellular development, physiology and pathologies. NcRNAs can be grouped into two major classes based on transcript size: small ncRNAs and long ncRNAs. Each of these classes can be further divided, and novel subclasses are still being discovered and characterized. Although in recent years the small ncRNAs called microRNAs were studied most frequently, with more than ten thousand hits in the PubMed database, evidence has recently begun to accumulate describing the molecular mechanisms by which a wide range of novel RNA species function, providing insight into their functional roles in cellular biology and in human disease. In this review, we summarize newly discovered classes of ncRNAs, and highlight their functioning in cancer biology and potential usage as biomarkers or therapeutic targets.
Cooperative heteroassembly of the adenoviral L4-22K and IVa2 proteins onto the viral packaging sequence DNA.

Science.gov (United States)

Yang, Teng-Chieh; Maluf, Nasib Karl

2012-02-21

Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Origins and challenges of viral dark matter.

Science.gov (United States)

Krishnamurthy, Siddharth R; Wang, David

2017-07-15

The accurate classification of viral dark matter - metagenomic sequences that originate from viruses but do not align to any reference virus sequences - is one of the major obstacles in comprehensively defining the virome. Depending on the sample, viral dark matter can make up anywhere from 40 to 90% of sequences. This review focuses on the specific nature of dark matter as it relates to viral sequences. We identify three factors that contribute to the existence of viral dark matter: the divergence and length of virus sequences, the limitations of alignment-based classification, and limited representation of viruses in reference sequence databases. We then discuss current methods that have been developed to at least partially circumvent these limitations and thereby reduce the extent of viral dark matter. Copyright © 2017 Elsevier B.V. All rights reserved.
Non-Coding RNAs in Hodgkin Lymphoma

Directory of Open Access Journals (Sweden)

Anna Cordeiro

2017-05-01

MicroRNAs (miRNAs), small non-coding RNAs that regulate gene expression by binding to the 3’-UTR of their target genes, can act as oncogenes or tumor suppressors. Recently, other types of non-coding RNAs (piwiRNAs and long non-coding RNAs) have also been identified. Hodgkin lymphoma (HL) is a B cell origin disease characterized by the presence of only 1% of tumor cells, known as Hodgkin and Reed-Sternberg (HRS) cells, which interact with the microenvironment to evade apoptosis. Several studies have reported specific miRNA signatures that can differentiate HL lymph nodes from reactive lymph nodes, identify histologic groups within classical HL, and distinguish HRS cells from germinal center B cells. Moreover, some signatures are associated with survival or response to chemotherapy. Most of the miRNAs in the signatures regulate genes related to apoptosis, cell cycle arrest, or signaling pathways. Here we review findings on miRNAs in HL, as well as on other non-coding RNAs.
Generation of transgene-free induced pluripotent stem cells with non-viral methods.

Science.gov (United States)

Wang, Tao; Zhao, Hua-shan; Zhang, Qiu-ling; Xu, Chang-lin; Liu, Chang-bai

2013-03-01

Induced pluripotent stem (iPS) cells were originally generated from mouse fibroblasts by enforced expression of Yamanaka factors (Oct3/4, Sox2, Klf4, and c-Myc). The technique was quickly reproduced with human fibroblasts or mesenchymal stem cells. Although they have shown therapeutic potential in animal models of sickle cell anemia and Parkinson's disease, iPS cells generated by viral methods do not suit all clinical applications. Various non-viral methods have appeared in recent years for application of iPS cells in cell transplantation therapy. These methods mainly include DNA vector-based approaches, transfection of mRNA, and transduction of reprogramming proteins. This review summarizes these non-viral methods and compares their advantages, disadvantages, efficiency, and safety.
Profile hidden Markov models for the detection of viruses within metagenomic sequence data.

Directory of Open Access Journals (Sweden)

Peter Skewes-Cox

Rapid, sensitive, and specific virus detection is an important component of clinical diagnostics. Massively parallel sequencing enables new diagnostic opportunities that complement traditional serological and PCR based techniques. While massively parallel sequencing promises the benefits of being more comprehensive and less biased than traditional approaches, it presents new analytical challenges, especially with respect to detection of pathogen sequences in metagenomic contexts. To a first approximation, the initial detection of viruses can be achieved simply through alignment of sequence reads or assembled contigs to a reference database of pathogen genomes with tools such as BLAST. However, recognition of highly divergent viral sequences is problematic, and may be further complicated by the inherently high mutation rates of some viral types, especially RNA viruses. In these cases, increased sensitivity may be achieved by leveraging position-specific information during the alignment process. Here, we constructed HMMER3-compatible profile hidden Markov models (profile HMMs) from all the virally annotated proteins in RefSeq in an automated fashion using a custom-built bioinformatic pipeline. We then tested the ability of these viral profile HMMs ("vFams") to accurately classify sequences as viral or non-viral. Cross-validation experiments with full-length gene sequences showed that the vFams were able to recall 91% of left-out viral test sequences without erroneously classifying any non-viral sequences into viral protein clusters. Thorough reanalysis of previously published metagenomic datasets with a set of the best-performing vFams showed that they were more sensitive than BLAST for detecting sequences originating from more distant relatives of known viruses. To facilitate the use of the vFams for rapid detection of remote viral homologs in metagenomic data, we provide two sets of vFams, comprising more than 4,000 vFams each, in the HMMER3
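The idea of "leveraging position-specific information" can be illustrated with a much simpler relative of a profile HMM: a position-specific log-odds scoring matrix built from an ungapped alignment. The sketch below is not HMMER and not the vFam pipeline. It models match positions only (no insert or delete states), uses a uniform background and add-one pseudocounts as simplifying assumptions, and the five-residue alignment is invented toy data.

```python
from math import log

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_profile(alignment, pseudocount=1.0):
    """Per-column log-odds scores against a uniform background (match states only)."""
    background = 1.0 / len(AMINO_ACIDS)
    depth = len(alignment)
    profile = []
    for column in zip(*alignment):
        scores = {}
        for residue in AMINO_ACIDS:
            freq = (column.count(residue) + pseudocount) / (
                depth + pseudocount * len(AMINO_ACIDS))
            scores[residue] = log(freq / background)
        profile.append(scores)
    return profile


def score(profile, sequence):
    """Sum of position-specific log-odds; positive means profile-like."""
    return sum(column[residue] for column, residue in zip(profile, sequence))


# Hypothetical toy family: four aligned sequences of equal length.
toy_alignment = ["MKVLA", "MRVLG", "MKILA", "MKVIA"]
toy_profile = build_profile(toy_alignment)

divergent_member = score(toy_profile, "MRIIG")  # matches no single sequence well
unrelated = score(toy_profile, "WWWWW")
print(f"divergent member: {divergent_member:.2f}, unrelated: {unrelated:.2f}")
```

The sequence `MRIIG` differs from every individual family member in at least two positions, yet each of its residues was seen somewhere in the corresponding column, so it still scores positively. This is the intuition behind profiles recovering distant relatives that a pairwise comparison might miss.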
Decoding the non-coding RNAs in Alzheimer's disease.

Science.gov (United States)

Schonrock, Nicole; Götz, Jürgen

2012-11-01

Non-coding RNAs (ncRNAs) are integral components of biological networks with fundamental roles in regulating gene expression. They can integrate sequence information from the DNA code, epigenetic regulation and functions of multimeric protein complexes to potentially determine the epigenetic status and transcriptional network in any given cell. Humans potentially contain more ncRNAs than any other species, especially in the brain, where they may well play a significant role in human development and cognitive ability. This review discusses their emerging role in Alzheimer's disease (AD), a human pathological condition characterized by the progressive impairment of cognitive functions. We discuss the complexity of the ncRNA world and how this is reflected in the regulation of the amyloid precursor protein and Tau, two proteins with central functions in AD. By understanding this intricate regulatory network, there is hope for a better understanding of disease mechanisms and ultimately developing diagnostic and therapeutic tools.
The origins and evolutionary history of human non-coding RNA regulatory networks.

Science.gov (United States)

Sherafatian, Masih; Mowla, Seyed Javad

2017-04-01

The evolutionary history and origin of the regulatory function of animal non-coding RNAs are not well understood. Lack of conservation of long non-coding RNAs and the small sizes of microRNAs have been major obstacles in their phylogenetic analysis. In this study, we tried to shed more light on the evolution of ncRNA regulatory networks by changing our phylogenetic strategy to focus on the evolutionary pattern of their protein coding targets. We used available target databases of miRNAs and lncRNAs to find their protein coding targets in human. We were able to recognize evolutionary hallmarks of ncRNA targets by phylostratigraphic analysis. We found the conventional 3'-UTR and lesser known 5'-UTR targets of miRNAs to be enriched at three consecutive phylostrata. Firstly, in the eukaryata phylostratum corresponding to the emergence of miRNAs, our study revealed that miRNA targets function primarily in cell cycle processes. Moreover, the same overrepresentation of the targets observed in the next two consecutive phylostrata, opisthokonta and eumetazoa, corresponded to the expansion periods of miRNAs in animal evolution. Coding sequence targets of miRNAs showed a delayed rise at the opisthokonta phylostratum, compared to the 3' and 5' UTR targets of miRNAs. The lncRNA regulatory network was the latest to evolve, at eumetazoa.
Optimizing viral and non-viral gene transfer methods for genetic modification of porcine mesenchymal stem cells

DEFF Research Database (Denmark)

Stiehler, Maik; Duch, Mogens R.; Mygind, Tina

2006-01-01

-old Danish landrace pigs by Ficoll step gradient separation and polystyrene adherence technique. Vectors expressing enhanced green fluorescent protein (eGFP) and human bone morphogenetic protein-2 (BMP-2) were transferred to the cells by different non-viral methods and by use of recombinant adeno...
The Non-Coding RNA Ontology (NCRO): a comprehensive resource for the unification of non-coding RNA biology.

Science.gov (United States)

Huang, Jingshan; Eilbeck, Karen; Smith, Barry; Blake, Judith A; Dou, Dejing; Huang, Weili; Natale, Darren A; Ruttenberg, Alan; Huan, Jun; Zimmermann, Michael T; Jiang, Guoqian; Lin, Yu; Wu, Bin; Strachan, Harrison J; He, Yongqun; Zhang, Shaojie; Wang, Xiaowei; Liu, Zixing; Borchert, Glen M; Tan, Ming

2016-01-01

In recent years, sequencing technologies have enabled the identification of a wide range of non-coding RNAs (ncRNAs). Unfortunately, annotation and integration of ncRNA data have lagged behind their identification. Given the large quantity of information being obtained in this area, there is an urgent need to integrate what is being discovered by a broad range of relevant communities. To this end, the Non-Coding RNA Ontology (NCRO) is being developed to provide a systematically structured and precisely defined controlled vocabulary for the domain of ncRNAs, thereby facilitating the discovery, curation, analysis, exchange, and reasoning of data about structures of ncRNAs, their molecular and cellular functions, and their impacts upon phenotypes. The goal of NCRO is to serve as a common resource for annotations of diverse research in a way that will significantly enhance integrative and comparative analysis of the myriad resources currently housed in disparate sources. It is our belief that the NCRO can perform an important role in the comprehensive unification of ncRNA biology and, indeed, fill a critical gap in both the Open Biological and Biomedical Ontologies (OBO) Library and the National Center for Biomedical Ontology (NCBO) BioPortal. Our initial focus is on the ontological representation of small regulatory ncRNAs, which we see as the first step in providing a resource for the annotation of data about all forms of ncRNAs. The NCRO is free and open to all users, accessible at: http://purl.obolibrary.org/obo/ncro.owl.
Non-viral delivery systems for CRISPR/Cas9-based genome editing: Challenges and opportunities.

Science.gov (United States)

Li, Ling; Hu, Shuo; Chen, Xiaoyuan

2018-07-01

In recent years, CRISPR (clustered regularly interspaced short palindromic repeat)/Cas (CRISPR-associated) genome editing systems have become one of the most robust platforms in basic biomedical research and therapeutic applications. To date, efficient in vivo delivery of the CRISPR/Cas9 system to the targeted cells remains a challenge. Although viral vectors have been widely used in the delivery of the CRISPR/Cas9 system in vitro and in vivo, their fundamental shortcomings, such as the risk of carcinogenesis, limited insertion size, immune responses and difficulty in large-scale production, severely limit their further applications. Alternative non-viral delivery systems for CRISPR/Cas9 are urgently needed. With the rapid development of non-viral vectors, lipid- or polymer-based nanocarriers have shown great potential for CRISPR/Cas9 delivery. In this review, we analyze the pros and cons of delivering CRISPR/Cas9 systems in the form of plasmid, mRNA, or protein and then discuss the limitations and challenges of CRISPR/Cas9-based genome editing. Furthermore, current non-viral vectors that have been applied for CRISPR/Cas9 delivery in vitro and in vivo are outlined in detail. Finally, critical obstacles to non-viral delivery of the CRISPR/Cas9 system are highlighted and promising strategies to overcome these barriers are proposed.
Long Non-Coding RNAs in Metabolic Organs and Energy Homeostasis

Directory of Open Access Journals (Sweden)

Maude Giroud

2017-11-01

Surprisingly, single-cell organisms can have more protein-coding genes than humans, so the number of protein-coding genes does not explain the complexity of an organism. In contrast, the relative amount of non-protein-coding sequences increases consistently with organismal complexity. Moreover, the mammalian transcriptome predominantly comprises non-(protein)-coding RNAs (ncRNAs), of which the long ncRNAs (lncRNAs) constitute the most abundant part. lncRNAs are highly species- and tissue-specific, with very versatile modes of action in accordance with their binding to a large spectrum of molecules and their diverse localization. lncRNAs are transcriptional regulators that add a further regulatory layer in biological processes and pathophysiological conditions. Here, we review lncRNAs affecting metabolic organs with a focus on the liver, pancreas, skeletal muscle, cardiac muscle, brain, and adipose organ. In addition, we discuss the impact of lncRNAs on metabolic diseases such as obesity and diabetes. Compared with the substantial number of lncRNA loci in the human genome, the functionally characterized lncRNAs are just the tip of the iceberg. Our knowledge of lncRNAs in energy homeostasis is still in its infancy, meaning that the rest of the iceberg is a treasure chest yet to be discovered.
First insight into the viral community of the cnidarian model metaorganism Aiptasia using RNA-Seq data

KAUST Repository

Brüwer, Jan D.

2018-03-01

Current research posits that all multicellular organisms live in symbioses with associated microorganisms and form so-called metaorganisms or holobionts. Cnidarian metaorganisms are of specific interest given that stony corals provide the foundation of the globally threatened coral reef ecosystems. To gain first insight into viruses associated with the coral model system Aiptasia (sensu Exaiptasia pallida), we analyzed an existing RNA-Seq dataset of aposymbiotic, partially populated, and fully symbiotic Aiptasia CC7 anemones with Symbiodinium. Our approach included the selective removal of anemone host and algal endosymbiont sequences and subsequent microbial sequence annotation. Of a total of 297 million raw sequence reads, 8.6 million (∼3%) remained after host and endosymbiont sequence removal. Of these, 3,293 sequences could be assigned as of viral origin. Taxonomic annotation of these sequences suggests that Aiptasia is associated with a diverse viral community, comprising 116 viral taxa covering 40 families. The viral assemblage was dominated by viruses from the families Herpesviridae (12.00%), Partitiviridae (9.93%), and Picornaviridae (9.87%). Despite an overall stable viral assemblage, we found that some viral taxa exhibited significant changes in their relative abundance when Aiptasia engaged in a symbiotic relationship with Symbiodinium. Elucidation of viral taxa consistently present across all conditions revealed a core virome of 15 viral taxa from 11 viral families, encompassing many viruses previously reported as members of coral viromes. Despite the non-random selection of viral genetic material due to the nature of the sequencing data analyzed, our study provides a first insight into the viral community associated with Aiptasia. Similarities of the Aiptasia viral community with those of corals corroborate the application of Aiptasia as a model system to study coral holobionts. Further, the change in abundance of certain viral taxa across different
Vaccines against viral hemorrhagic fevers: non-human primate models.

Science.gov (United States)

Carrion, Ricardo; Patterson, Jean L

2011-06-01

Viral hemorrhagic fevers are a group of disease syndromes caused by infection with certain RNA viruses. The disease is marked by a febrile response, malaise, coagulopathy and vascular permeability culminating in death. Case fatality rates can reach 90% depending on the etiologic agent. Currently, there is no approved antiviral treatment. Because of the high case fatality, risk of importation and the potential to use these agents as biological weapons, development of countermeasures to these agents is a high priority. The sporadic nature of disease outbreaks and the ethical issues associated with conducting a human trial for such diseases make human studies impractical; therefore, development of countermeasures must occur in relevant animal models. Non-human primates are superior models to study infectious disease because their immune system is similar to humans and they are good predictors of efficacy in vaccine development and other intervention strategies. This review article summarizes viral hemorrhagic fever non-human primate models.
The Non-Coding Regulatory RNA Revolution in Archaea

Directory of Open Access Journals (Sweden)

Diego Rivera Gelsinger

2018-03-01

Small non-coding RNAs (sRNAs) are ubiquitously found in the three domains of life, playing large-scale roles in gene regulation, transposable element silencing and defense against foreign elements. While a substantial body of experimental work has been done to uncover the function of sRNAs in Bacteria and Eukarya, the functional roles of sRNAs in Archaea are still poorly understood. Recently, high-throughput studies using RNA-sequencing revealed that sRNAs are broadly expressed in the Archaea, comprising thousands of transcripts within the transcriptome during non-challenged and stressed conditions. Antisense sRNAs, which overlap a portion of a gene on the opposite strand (cis-acting), are the most abundantly expressed non-coding RNAs and they can be classified based on their binding patterns to mRNAs (3′ untranslated region (UTR), 5′ UTR, CDS-binding). These antisense sRNAs target many genes and pathways, suggesting extensive roles in gene regulation. Intergenic sRNAs are less abundantly expressed and their targets are difficult to find because of a lack of complete overlap between sRNAs and target mRNAs (trans-acting). While many sRNAs have been validated experimentally, a regulatory role has only been reported for very few of them. Further work is needed to elucidate sRNA-RNA binding mechanisms, the molecular determinants of sRNA-mediated regulation, whether protein components are involved and how sRNAs integrate with complex regulatory networks.
Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

Directory of Open Access Journals (Sweden)

Hongbo Wang

2017-09-01

Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer.
Identification of Novel Long Non-coding and Circular RNAs in Human Papillomavirus-Mediated Cervical Cancer

Science.gov (United States)

Wang, Hongbo; Zhao, Yingchao; Chen, Mingyue; Cui, Jie

2017-01-01

Cervical cancer is the third most common cancer worldwide and the fourth leading cause of cancer-associated mortality in women. Accumulating evidence indicates that long non-coding RNAs (lncRNAs) and circular RNAs (circRNAs) may play key roles in the carcinogenesis of different cancers; however, little is known about the mechanisms of lncRNAs and circRNAs in the progression and metastasis of cervical cancer. In this study, we explored the expression profiles of lncRNAs, circRNAs, miRNAs, and mRNAs in HPV16 (human papillomavirus genotype 16) mediated cervical squamous cell carcinoma and matched adjacent non-tumor (ATN) tissues from three patients with high-throughput RNA sequencing (RNA-seq). In total, we identified 19 lncRNAs, 99 circRNAs, 28 miRNAs, and 304 mRNAs that were commonly differentially expressed (DE) in different patients. Among the non-coding RNAs, 3 lncRNAs and 44 circRNAs are novel to our knowledge. Functional enrichment analysis showed that DE lncRNAs, miRNAs, and mRNAs were enriched in pathways crucial to cancer as well as other gene ontology (GO) terms. Furthermore, the co-expression network and function prediction suggested that all 19 DE lncRNAs could play different roles in the carcinogenesis and development of cervical cancer. The competing endogenous RNA (ceRNA) network based on DE coding and non-coding RNAs showed that each miRNA targeted a number of lncRNAs and circRNAs. The link between part of the miRNAs in the network and cervical cancer has been validated in previous studies, and these miRNAs targeted the majority of the novel non-coding RNAs, thus suggesting that these novel non-coding RNAs may be involved in cervical cancer. Taken together, our study shows that DE non-coding RNAs could be further developed as diagnostic and therapeutic biomarkers of cervical cancer. The complex ceRNA network also lays the foundation for future research of the roles of coding and non-coding RNAs in cervical cancer. PMID:28970820
Blackout sequence modeling for Atucha-I with MARCH3 code

International Nuclear Information System (INIS)

Baron, J.; Bastianelli, B.

1997-01-01

The modeling of a blackout sequence in the Atucha I nuclear power plant is presented in this paper as a preliminary phase for a level II probabilistic safety assessment. The sequence is analyzed with the MARCH3 code from the STCP (Source Term Code Package), based on a specific model developed for Atucha that takes into account its peculiarities. The analysis includes all the severe accident phases: the initial transient (loss of heat sink), loss of coolant through the safety valves, core uncovery, heatup, metal-water reaction, melting and relocation, heatup and failure of the pressure vessel, core-concrete interaction in the reactor cavity, and heatup and failure of the multi-compartment containment building due to quasi-static overpressurization. The results make it possible to visualize the time sequence of these events and provide the basis for source term studies. (author)
A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology.

Science.gov (United States)

Kravatsky, Yuri; Chechetkin, Vladimir; Fedoseeva, Daria; Gorbacheva, Maria; Kravatskaya, Galina; Kretova, Olga; Tchurikov, Nickolai

2017-11-23

The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines the available programs and the ad hoc scripts based on an original algorithm of the search for the conserved targets in the deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against the possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
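As a rough illustration of the kind of per-site analysis such a pipeline performs, the sketch below computes target conservation at each position and flags positions whose mismatch count is unlikely under sequencing error alone. It is not the VirMut algorithm: the one-sided binomial test, the `error_rate` and `alpha` defaults, the function names, and the assumption that reads are already aligned to the target without gaps are all hypothetical choices made for this example.

```python
from math import comb


def binomial_tail(n, k, p):
    """Return P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k, n + 1))


def site_report(reference, reads, error_rate=0.01, alpha=0.001):
    """Per-position conservation and a mutation flag for ungapped reads.

    Assumptions (illustrative only): every read has the same length as
    `reference` and is already aligned to it; `error_rate` is the assumed
    per-base sequencing error; a site is flagged when the chance of seeing
    at least the observed number of mismatches from errors alone is < alpha.
    """
    depth = len(reads)
    report = []
    for pos, ref_base in enumerate(reference):
        mismatches = sum(1 for read in reads if read[pos] != ref_base)
        conservation = (depth - mismatches) / depth if depth else 0.0
        p_value = binomial_tail(depth, mismatches, error_rate) if depth else 1.0
        report.append(
            {
                "position": pos,
                "mismatches": mismatches,
                "conservation": conservation,
                "flagged": p_value < alpha,
            }
        )
    return report


if __name__ == "__main__":
    target = "ACGT"
    sample_reads = ["ACGT"] * 90 + ["ACTT"] * 10
    for row in site_report(target, sample_reads):
        print(row)
```

Summing the tail term by term is fine for small depths; for millions of reads a statistics library would be the more practical choice.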
Progress in developing cationic vectors for non-viral systemic gene therapy against cancer.

Science.gov (United States)

Morille, Marie; Passirani, Catherine; Vonarbourg, Arnaud; Clavreul, Anne; Benoit, Jean-Pierre

2008-01-01

Initially, gene therapy was viewed as an approach for treating hereditary diseases, but its potential role in the treatment of acquired diseases such as cancer is now widely recognized. The understanding of the molecular mechanisms involved in cancer and the development of nucleic acid delivery systems are two advances that have led to this development. Systemic gene delivery systems are needed for therapeutic application to cells inaccessible by percutaneous injection and for multi-located tumor sites, i.e. metastases. Non-viral vectors based on the use of cationic lipids or polymers appear to have promising potential, given the safety problems encountered with viral vectors. The current challenge is to achieve transfection with these non-viral vectors that is as effective as with viral ones. Based on the advantages and disadvantages of existing vectors and on the hurdles encountered with these carriers, the aim of this review is to describe the "perfect vector" for systemic gene therapy against cancer.

Emerging Roles of Small Epstein-Barr Virus Derived Non-Coding RNAs in Epithelial Malignancy

Science.gov (United States)

Lung, Raymond Wai-Ming; Tong, Joanna Hung-Man; To, Ka-Fai

2013-01-01

Latent Epstein-Barr virus (EBV) infection is an etiological factor in the progression of several human epithelial malignancies such as nasopharyngeal carcinoma (NPC) and a subset of gastric carcinoma. Reports have shown that EBV produces several viral oncoproteins, yet their pathological roles in carcinogenesis are not fully elucidated. Studies on the recently discovered EBV-encoded microRNAs (ebv-miRNAs) showed that these small molecules function as post-transcriptional gene regulators and may play a role in the carcinogenesis process. In NPC and EBV-positive gastric carcinoma (EBVaGC), 22 viral miRNAs, which are located in the long alternatively spliced EBV transcripts named BamH1 A rightward transcripts (BARTs), are abundantly expressed. The importance of several miR-BARTs in carcinogenesis has recently been demonstrated. These novel findings enhance our understanding of the oncogenic properties of EBV and may lead to a more effective design of therapeutic regimens to combat EBV-associated malignancies. This article will review the pathological roles of miR-BARTs in modulating the expression of cancer-related genes in both host and viral genomes. The expression of other small non-coding RNAs in NPC and the expression pattern of miR-BARTs in rare EBV-associated epithelial cancers will also be discussed. PMID:23979421
Assembly of viral genomes from metagenomes

Directory of Open Access Journals (Sweden)

Saskia L Smits

2014-12-01

Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used not only to detect novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with relatively large genomes, which were divergent from previously known viruses of the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near-complete viral genomes.
Generic detection of poleroviruses using an RT-PCR assay targeting the RdRp coding sequence.

Science.gov (United States)

Lotos, Leonidas; Efthimiou, Konstantinos; Maliogka, Varvara I; Katis, Nikolaos I

2014-03-01

In this study a two-step RT-PCR assay was developed for the generic detection of poleroviruses. The RdRp coding region was selected as the primers' target, since it differs significantly from that of other members of the family Luteoviridae and its sequence can be more informative than other regions in the viral genome. Species-specific RT-PCR assays targeting the same region were also developed for the detection of the six most widespread poleroviral species in Greece (Beet mild yellowing virus, Beet western yellows virus, Cucurbit aphid-borne virus, Carrot red leaf virus, Potato leafroll virus and Turnip yellows virus) and for the collection of isolates. These isolates, along with other characterized ones, were used to evaluate the generic PCR's detection range. The developed assay efficiently amplified a 593 bp RdRp fragment from 46 isolates of 10 different Polerovirus species. Phylogenetic analysis using the generic PCR's amplicon sequence showed that, although it cannot accurately infer evolutionary relationships within the genus, it can differentiate poleroviruses at the species level. Overall, the described generic assay could be applied for the reliable detection of Polerovirus infections and, in combination with the specific PCRs, for the identification of new and uncharacterized species in the genus.
MicroRNA-Related Polymorphisms in Infectious Diseases—Tiny Changes With a Huge Impact on Viral Infections and Potential Clinical Applications

Directory of Open Access Journals (Sweden)

Joel Henrique Ellwanger

2018-06-01

MicroRNAs (miRNAs) are single-stranded non-coding RNA sequences of approximately 22 nucleotides that act post-transcriptionally on gene expression. miRNAs are important gene regulators in physiological contexts, but they also impact the pathogenesis of various diseases. The role of miRNAs in viral infections has been explored by different authors in both population-based and functional studies. However, the effect of miRNA polymorphisms on the susceptibility to viral infections and on the clinical course of these diseases is still an emerging topic. Thus, this review will compile and organize the findings described in studies that evaluated the effects of genetic variations in miRNA genes and in their binding sites, in the context of human viral diseases. In addition to discussing the basic aspects of miRNA biology, we will cover the studies that investigated miRNA polymorphisms in infections caused by hepatitis B virus, hepatitis C virus, human immunodeficiency virus, Epstein–Barr virus, and human papillomavirus. Finally, emerging topics concerning the importance of miRNA genetic variants will be presented, focusing on the context of viral infectious diseases.
Hominoid-specific de novo protein-coding genes originating from long non-coding RNAs.

Directory of Open Access Journals (Sweden)

Chen Xie

2012-09-01

Tinkering with pre-existing genes has long been known as a major way to create new genes. Recently, however, motherless protein-coding genes have been found to have emerged de novo from ancestral non-coding DNAs. How these genes originated is not well addressed to date. Here we identified 24 hominoid-specific de novo protein-coding genes with precise origination timing in vertebrate phylogeny. Strand-specific RNA-Seq analyses were performed in five rhesus macaque tissues (liver, prefrontal cortex, skeletal muscle, adipose, and testis), which were then integrated with public transcriptome data from human, chimpanzee, and rhesus macaque. On the basis of comparing the RNA expression profiles in the three species, we found that most of the hominoid-specific de novo protein-coding genes encoded polyadenylated non-coding RNAs in rhesus macaque or chimpanzee with a similar transcript structure and correlated tissue expression profile. According to the rule of parsimony, the majority of these hominoid-specific de novo protein-coding genes appear to have acquired a regulated transcript structure and expression profile before acquiring coding potential. Interestingly, although the expression profile was largely correlated, the coding genes in human often showed higher transcriptional abundance than their non-coding counterparts in rhesus macaque. The major findings we report in this manuscript are robust and insensitive to the parameters used in the identification and analysis of de novo genes. Our results suggest that at least a portion of long non-coding RNAs, especially those with active and regulated transcription, may serve as a birth pool for protein-coding genes, which are then further optimized at the transcriptional level.
Function and Application Areas in Medicine of Non-Coding RNA

Directory of Open Access Journals (Sweden)

Figen Guzelgul

2009-06-01

RNA is the genetic material that converts the genetic code it receives from DNA into protein. While less than 2% of RNA is translated into protein, more than 98% cannot be translated and is termed non-coding RNA. Introns make up 70% of non-coding RNAs, while the remainder consists of exons. Non-coding RNAs are classified in two ways: by size, as long or small non-coding RNAs, and by function, as housekeeping or regulatory non-coding RNAs. For many years, these non-coding RNAs were considered non-functional. Today, however, it has been shown that they play a role in regulating genes and in the structural, functional and catalytic roles of the RNAs that are translated into protein. Because of their role in gene silencing mechanisms, non-coding RNAs have led to significant developments, particularly in medicine. RNAi technology, which is used to design drugs for the treatment of various diseases, is a ray of hope for medicine. [Archives Medical Review Journal 2009; 18(3): 141-155]
Importance of Viral Sequence Length and Number of Variable and Informative Sites in Analysis of HIV Clustering.

Science.gov (United States)

Novitsky, Vlad; Moyo, Sikhulile; Lei, Quanhong; DeGruttola, Victor; Essex, M

2015-05-01

To improve the methodology of HIV cluster analysis, we addressed how analysis of HIV clustering is associated with parameters that can affect the outcome of viral clustering. The extent of HIV clustering and tree certainty was compared between 401 HIV-1C near full-length genome sequences and subgenomic regions retrieved from the LANL HIV Database. Sliding window analysis was based on 99 windows of 1,000 bp and 45 windows of 2,000 bp. Potential associations between the extent of HIV clustering and sequence length and the number of variable and informative sites were evaluated. The near full-length genome HIV sequences showed the highest extent of HIV clustering and the highest tree certainty. At the bootstrap threshold of 0.80 in maximum likelihood (ML) analysis, 58.9% of near full-length HIV-1C sequences but only 15.5% of partial pol sequences (ViroSeq) were found in clusters. Among HIV-1 structural genes, pol showed the highest extent of clustering (38.9% at a bootstrap threshold of 0.80), although it was significantly lower than in the near full-length genome sequences. The extent of HIV clustering was significantly higher for sliding windows of 2,000 bp than 1,000 bp. We found a strong association between the sequence length and proportion of HIV sequences in clusters, and a moderate association between the number of variable and informative sites and the proportion of HIV sequences in clusters. In HIV cluster analysis, the extent of detectable HIV clustering is directly associated with the length of viral sequences used, as well as the number of variable and informative sites. Near full-length genome sequences could provide the most informative HIV cluster analysis. Selected subgenomic regions with a high extent of HIV clustering and high tree certainty could also be considered as a second choice.
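A sliding-window scan of this kind can be sketched as follows. The abstract gives the window sizes (1,000 bp and 2,000 bp) but not the step between windows, so `step` is left as a free parameter here; the function names are hypothetical, and counting only unambiguous A/C/G/T characters is an assumption of this example. A site is counted as variable when at least two different nucleotides occur in the column, and as parsimony-informative when at least two different nucleotides each occur in at least two sequences.

```python
from collections import Counter


def column_stats(column):
    """Return (is_variable, is_informative) for one alignment column.

    Gaps and ambiguity codes are ignored (an assumption of this sketch).
    """
    counts = Counter(base for base in column.upper() if base in "ACGT")
    is_variable = len(counts) >= 2
    is_informative = sum(1 for n in counts.values() if n >= 2) >= 2
    return is_variable, is_informative


def sliding_windows(alignment, window, step):
    """Count variable and informative sites in each window.

    `alignment` is a list of equal-length aligned sequences. Returns a list
    of (start, n_variable, n_informative) tuples, one per complete window.
    """
    length = len(alignment[0])
    results = []
    for start in range(0, length - window + 1, step):
        n_variable = 0
        n_informative = 0
        for i in range(start, start + window):
            column = "".join(seq[i] for seq in alignment)
            variable, informative = column_stats(column)
            n_variable += variable
            n_informative += informative
        results.append((start, n_variable, n_informative))
    return results


if __name__ == "__main__":
    toy_alignment = ["ACGTAC", "ACGAAC", "ATGAAC", "ATGTAG"]
    print(sliding_windows(toy_alignment, window=3, step=3))
```

Per-window counts like these can then be compared against the proportion of sequences found in clusters for the same window.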
Positive Selection or Free to Vary? Assessing the Functional Significance of Sequence Change Using Molecular Dynamics.

Directory of Open Access Journals (Sweden)

Jane R Allison

Evolutionary arms races between pathogens and their hosts may be manifested as selection for rapid evolutionary change of key genes, and are sometimes detectable through sequence-level analyses. In the case of protein-coding genes, such analyses frequently predict that specific codons are under positive selection. However, detecting positive selection can be non-trivial, and false positive predictions are a common concern in such analyses. It is therefore helpful to place such predictions within a structural and functional context. Here, we focus on the p19 protein from tombusviruses. P19 is a homodimer that sequesters siRNAs, thereby preventing the host RNAi machinery from shutting down viral infection. Sequence analysis of the p19 gene is complicated by the fact that it is constrained at the sequence level by overprinting of a viral movement protein gene. Using homology modeling, in silico mutation and molecular dynamics simulations, we assess how non-synonymous changes to two residues involved in forming the dimer interface (one invariant, and one predicted to be under positive selection) impact molecular function. Interestingly, we find that neither observed variation nor potential variation (where a non-synonymous change to p19 would be synonymous for the overprinted movement protein) significantly impacts protein structure or RNA binding. Consequently, while several methods identify residues at the dimer interface as being under positive selection, MD results suggest they are functionally indistinguishable from a site that is free to vary. Our analyses serve as a caveat to using sequence-level analyses in isolation to detect and assess positive selection, and emphasize the importance of also accounting for how non-synonymous changes impact structure and function.
A Bioinformatic Pipeline for Monitoring of the Mutational Stability of Viral Drug Targets with Deep-Sequencing Technology

Directory of Open Access Journals (Sweden)

Yuri Kravatsky

2017-11-01

Full Text Available The efficient development of antiviral drugs, including efficient antiviral small interfering RNAs (siRNAs), requires continuous monitoring of the strict correspondence between a drug and the related highly variable viral DNA/RNA target(s). Deep sequencing is able to provide an assessment of both the general target conservation and the frequency of particular mutations in the different target sites. The aim of this study was to develop a reliable bioinformatic pipeline for the analysis of millions of short, deep sequencing reads corresponding to selected highly variable viral sequences that are drug target(s). The suggested bioinformatic pipeline combines available programs and ad hoc scripts based on an original algorithm that searches for conserved targets in deep sequencing data. We also present the statistical criteria for the threshold of reliable mutation detection and for the assessment of variations between corresponding data sets. These criteria are robust against possible sequencing errors in the reads. As an example, the bioinformatic pipeline is applied to the study of the conservation of RNA interference (RNAi) targets in human immunodeficiency virus 1 (HIV-1) subtype A. The developed pipeline is freely available to download at the website http://virmut.eimb.ru/. Brief comments and comparisons between VirMut and other pipelines are also presented.
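The abstract above mentions a threshold for reliable mutation detection that is robust to sequencing errors, but does not state the criterion itself. As a purely illustrative sketch (not VirMut's actual method), one common way to set such a threshold is a one-sided binomial test: assume each read shows a given mismatch by error independently with probability `error_rate`, and ask how many variant reads are needed before chance alone becomes unlikely. The function name, the error model and the default `alpha` are all assumptions made for this example.

```python
from math import comb


def min_variant_reads(depth, error_rate, alpha=0.01):
    """Smallest read count k such that P(X >= k) < alpha,
    where X ~ Binomial(depth, error_rate) models error-only mismatches.

    Hypothetical helper: the error model and alpha are illustrative
    assumptions, not the criterion used by any published pipeline.
    """
    tail = 1.0  # P(X >= 0)
    for k in range(depth + 1):
        if tail < alpha:
            return k
        # Subtract P(X == k) so that tail becomes P(X >= k + 1).
        tail -= comb(depth, k) * error_rate**k * (1 - error_rate) ** (depth - k)
    # Even all reads agreeing would not be significant at this alpha.
    return depth + 1


if __name__ == "__main__":
    # Example: 100 reads cover a site and the per-read error rate is 1%.
    print(min_variant_reads(depth=100, error_rate=0.01))
```

A variant seen in at least this many reads at a site would then be hard to attribute to sequencing error alone under the stated assumptions; real pipelines typically also correct for testing many sites at once.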
Chromosome preference of disease genes and vectorization for the prediction of non-coding disease genes.

Science.gov (United States)

Peng, Hui; Lan, Chaowang; Liu, Yuansheng; Liu, Tao; Blumenstein, Michael; Li, Jinyan

2017-10-03

Disease-related protein-coding genes have been widely studied, but disease-related non-coding genes remain largely unknown. This work introduces a new vector to represent diseases, and applies the newly vectorized data for a positive-unlabeled learning algorithm to predict and rank disease-related long non-coding RNA (lncRNA) genes. This novel vector representation for diseases consists of two sub-vectors. The first is composed of 45 elements, characterizing the information entropies of the disease gene distribution over 45 chromosome substructures. This idea is supported by our observation that some substructures (e.g., the chromosome 6 p-arm) are highly preferred by disease-related protein coding genes, while some (e.g., the 21 p-arm) are not favored at all. The second sub-vector is 30-dimensional, characterizing the distribution of disease gene enriched KEGG pathways in comparison with our manually created pathway groups. The second sub-vector complements the first to differentiate between various diseases. Our prediction method outperforms the state-of-the-art methods on benchmark datasets for prioritizing disease related lncRNA genes. The method also works well when only the sequence information of an lncRNA gene is known, or even when a given disease has no currently recognized long non-coding genes.
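The abstract above does not give the formula behind the 45-element entropy sub-vector. One plausible reading, offered here only as a sketch, is that each element is the Shannon entropy term -p log2(p), where p is the fraction of a disease's known genes that fall in that chromosome substructure. The function name, the substructure labels and this per-element definition are assumptions for illustration, not the authors' published method.

```python
import math
from collections import Counter


def entropy_terms(gene_locations, substructures):
    """Return one entropy term -p*log2(p) per substructure.

    gene_locations: substructure label for each known gene of one disease.
    substructures:  ordered list of labels defining the vector positions.
    Hypothetical definition chosen for illustration only.
    """
    counts = Counter(gene_locations)
    total = sum(counts[s] for s in substructures)
    terms = []
    for s in substructures:
        p = counts[s] / total if total else 0.0
        # By convention, an empty substructure contributes zero entropy.
        terms.append(-p * math.log2(p) if p > 0 else 0.0)
    return terms


if __name__ == "__main__":
    # Toy data: four disease genes spread over three of four arms.
    arms = ["1q", "6p", "21p", "Xq"]
    vector = entropy_terms(["6p", "6p", "1q", "21p"], arms)
    print(dict(zip(arms, vector)))
```

With the real 45 substructures as the `substructures` list, this would yield a fixed-length vector per disease that could be concatenated with a second, pathway-based sub-vector.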
HBVRegDB: Annotation, comparison, detection and visualization of regulatory elements in hepatitis B virus sequences

Directory of Open Access Journals (Sweden)

Firth Andrew E

2007-12-01

Full Text Available Abstract Background: The many Hepadnaviridae sequences available have widely varied functional annotation. The genomes are very compact (~3.2 kb) but contain multiple layers of functional regulatory elements in addition to coding regions. Key regions are subject to purifying selection, as mutations in these regions will produce non-functional viruses. Results: These genomic sequences have been organized into a structured database to facilitate research at the molecular level. HBVRegDB is a comparative genomic analysis tool with an integrated underlying sequence database. The database contains genomic sequence data from representative viruses. In addition to INSDC and RefSeq annotation, HBVRegDB also contains expert and systematically calculated annotations (e.g. promoters) and comparative genome analysis results (e.g. blastn, tblastx). It also contains analyses based on curated HBV alignments. Information about conserved regions, including primary conservation (e.g. CDS-Plotcon) and RNA secondary structure predictions (e.g. Alidot), is integrated into the database. A large amount of data is graphically presented using GBrowse (Generic Genome Browser) adapted for analysis of viral genomes. Flexible query access is provided based on any annotated genomic feature. Novel regulatory motifs can be found by analysing the annotated sequences. Conclusion: HBVRegDB serves as a knowledge database and as a comparative genomic analysis tool for molecular biologists investigating HBV. It is publicly available and complementary to other viral and HBV focused datasets and tools: http://hbvregdb.otago.ac.nz. The availability of multiple and highly annotated sequences of viral genomes in one database combined with comparative analysis tools facilitates detection of novel genomic elements.
Developmental programming of long non-coding RNAs during postnatal liver maturation in mice.

Directory of Open Access Journals (Sweden)

Lai Peng

Full Text Available The liver is a vital organ with critical functions in metabolism, protein synthesis, and immune defense. Most of the liver functions are not mature at birth and many changes happen during postnatal liver development. However, it is unclear what changes occur in the liver after birth, at what developmental stages they occur, and how the developmental processes are regulated. Long non-coding RNAs (lncRNAs) are involved in organ development and cell differentiation. Here, we analyzed the transcriptome of lncRNAs in mouse liver from perinatal (day -2) to adult (day 60) by RNA-Sequencing, with an attempt to understand the role of lncRNAs in liver maturation. We found around 15,000 genes expressed, including about 2,000 lncRNAs. Most lncRNAs were expressed at a lower level than coding RNAs. Both coding RNAs and lncRNAs displayed three major ontogenic patterns: enriched at neonatal, adolescent, or adult stages. Neighboring coding and non-coding RNAs tended to exhibit highly correlated ontogenic expression patterns. Gene ontology (GO) analysis revealed that some lncRNAs enriched at neonatal ages have their neighbor protein coding genes also enriched at neonatal ages and associated with cell proliferation, immune activation related processes, tissue organization pathways, and hematopoiesis; other lncRNAs enriched at adolescent ages have their neighbor protein coding genes associated with different metabolic processes. These data reveal significant functional transition during postnatal liver development and imply the potential importance of lncRNAs in liver maturation.
Non-Molecular-Clock-Like Evolution following Viral Origins in Homo sapiens

Directory of Open Access Journals (Sweden)

Wendy Mok

2007-01-01

Full Text Available Researchers routinely adopt molecular clock assumptions in conducting sequence analyses to estimate dates for viral origins in humans. We used computational methods to examine the extent to which this practice can result in inaccurate ‘retrodiction.’ Failing to account for dynamic molecular evolution can greatly affect estimates of index case dates, resulting, for instance, in an overestimated age for the SARS-CoV-human infection.
A Retrospective Examination of Feline Leukemia Subgroup Characterization: Viral Interference Assays to Deep Sequencing.

Science.gov (United States)

Chiu, Elliott S; Hoover, Edward A; VandeWoude, Sue

2018-01-10

Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.
On the classification of long non-coding RNAs

KAUST Repository

Ma, Lina

2013-06-01

Long non-coding RNAs (lncRNAs) have been found to perform various functions in a wide variety of important biological processes. To ease interpretation of lncRNA functionality and to enable deep mining of these transcribed sequences, it is convenient to classify lncRNAs into different groups. Here, we summarize classification methods of lncRNAs according to their four major features, namely, genomic location and context, effect exerted on DNA sequences, mechanism of functioning and their targeting mechanism. In combination with the presently available function annotations, we explore potential relationships between different classification categories, and generalize and compare biological features of different lncRNAs within each category. Finally, we present our view on potential further studies. We believe that the classifications of lncRNAs as indicated above are of fundamental importance for lncRNA studies, helpful for further investigation of specific lncRNAs, for formulation of new hypotheses based on different features of lncRNAs and for exploration of the underlying lncRNA functional mechanisms. © 2013 Landes Bioscience.
Bioreducible poly(amido amine)s for non-viral gene delivery

NARCIS (Netherlands)

Lin, C.

2008-01-01

This thesis describes the design and development of bioreducible poly(amido amine)s as non-viral vectors for gene delivery in vitro and in vivo. The structural influences of these polymers on their physico-chemical properties and gene delivery properties, transfection capability and cytotoxicity in
Frameshift mutations in infectious cDNA clones of Citrus tristeza virus: a strategy to minimize the toxicity of viral sequences to Escherichia coli

International Nuclear Information System (INIS)

Satyanarayana, Tatineni; Gowda, Siddarame; Ayllon, Maria A.; Dawson, William O.

2003-01-01

The advent of reverse genetics revolutionized the study of positive-stranded RNA viruses that were amenable for cloning as cDNAs into high-copy-number plasmids of Escherichia coli. However, some viruses are inherently refractory to cloning in high-copy-number plasmids due to toxicity of viral sequences to E. coli. We report a strategy that is a compromise between infectivity of the RNA transcripts and toxicity to E. coli effected by introducing frameshift mutations into 'slippery sequences' near the viral 'toxicity sequences' in the viral cDNA. Citrus tristeza virus (CTV) has cDNA sequences that are toxic to E. coli. The original full-length infectious cDNA of CTV and a derivative replicon, CTV-ΔCla, cloned into pUC119, resulted in unusually limited E. coli growth. However, upon sequencing of these cDNAs, an additional uridinylate (U) was found in a stretch of U's between nts 3726 and 3731 that resulted in a change to a reading frame with a stop codon at nt 3734. Yet, in vitro produced RNA transcripts from these clones infected protoplasts, and the resulting progeny virus was repaired. Correction of the frameshift mutation in the CTV cDNA constructs resulted in increased infectivity of in vitro produced RNA transcripts, but also caused a substantial increase of toxicity to E. coli, now requiring 3 days to develop visible colonies. Frameshift mutations created in sequences not suspected to facilitate reading frame shifting and silent mutations introduced into oligo(U) regions resulted in complete loss of infectivity, suggesting that the oligo(U) region facilitated the repair of the frameshift mutation. Additional frameshift mutations introduced into other oligo(U) regions also resulted in transcripts with reduced infectivity similarly to the original clones with the +1 insertion. However, only the frameshift mutations introduced into oligo(U) regions that were near and before the toxicity region improved growth and stability in E. coli. These data demonstrate that
Molecular Evolution of the non-coding Eosinophil Granule Ontogeny Transcript EGOT

Directory of Open Access Journals (Sweden)

Dominic Rose

2011-10-01

Full Text Available Eukaryotic genomes are pervasively transcribed. A large fraction of the transcriptional output consists of long, mRNA-like, non-protein-coding transcripts (mlncRNAs). The evolutionary history of mlncRNAs is still largely uncharted territory. In this contribution, we explore in detail the evolutionary traces of the eosinophil granule ontogeny transcript (EGOT), an experimentally confirmed representative of an abundant class of totally intronic non-coding transcripts (TINs). EGOT is located antisense to an intron of the ITPR1 gene. We computationally identify putative EGOT orthologs in the genomes of 32 different amniotes, including orthologs from primates, rodents, ungulates, carnivores, afrotherians, and xenarthrans, as well as putative candidates from basal amniotes, such as opossum or platypus. We investigate the EGOT gene phylogeny and analyse patterns of sequence conservation and the evolutionary conservation of the EGOT gene structure. We show that EGO-B, the spliced isoform, may be present throughout the placental mammals, but most likely dates back even further. We demonstrate here for the first time that the whole EGOT locus is highly structured, containing several evolutionarily conserved and thermodynamically stable secondary structures. Our analyses allow us to postulate novel functional roles of a hitherto poorly understood region at the intron of EGO-B which is highly conserved at the sequence level. The region contains a novel ITPR1 exon and also conserved RNA secondary structures together with a conserved TATA-like element, which putatively acts as a promoter of an independent regulatory element.
Non-coding RNAs and epigenome: de novo DNA methylation, allelic exclusion and X-inactivation

Directory of Open Access Journals (Sweden)

V. A. Halytskiy

2013-12-01

Full Text Available Non-coding RNAs are a widespread class of cellular RNAs. They participate in many important cellular processes: signaling, posttranscriptional silencing, protein biosynthesis, splicing, maintenance of genome stability, telomere lengthening, and X-inactivation. Nevertheless, the activity of these RNAs is not restricted to the posttranscriptional sphere but also covers processes that change or maintain epigenetic information. Non-coding RNAs can directly bind to DNA targets and cause their repression through recruitment of DNA methyltransferases as well as chromatin-modifying enzymes. Such events constitute the molecular mechanism of RNA-dependent DNA methylation. It is possible that RNA-DNA interaction is a universal mechanism triggering de novo DNA methylation. Allelic exclusion may also be based on the described mechanism. This phenomenon takes place when a non-coding RNA, whose precursor is transcribed from one allele, triggers DNA methylation in all other alleles present in the cell. Note that miRNA-mediated transcriptional silencing resembles allelic exclusion, because both the miRNA gene and the genes that can be targeted by this miRNA contain elements with the same sequences. It can be assumed that RNA-dependent DNA methylation and allelic exclusion originated to counteract the activity of mobile genetic elements. Probably, thinning and deregulation of the cellular non-coding RNA pattern allows reactivation of silent mobile genetic elements, resulting in genome instability that leads to ageing and carcinogenesis. In the course of X-inactivation, DNA methylation and subsequent heterochromatinization of the X chromosome can be triggered by direct hybridization of the 5′-end of the large non-coding RNA Xist with DNA targets in remote regions of the X chromosome.
An integrative approach to predicting the functional effects of small indels in non-coding regions of the human genome.

Science.gov (United States)

Ferlaino, Michael; Rogers, Mark F; Shihab, Hashem A; Mort, Matthew; Cooper, David N; Gaunt, Tom R; Campbell, Colin

2017-10-06

Small insertions and deletions (indels) have a significant influence in human disease and, in terms of frequency, they are second only to single nucleotide variants as pathogenic mutations. As the majority of mutations associated with complex traits are located outside the exome, it is crucial to investigate the potential pathogenic impact of indels in non-coding regions of the human genome. We present FATHMM-indel, an integrative approach to predict the functional effect, pathogenic or neutral, of indels in non-coding regions of the human genome. Our method exploits various genomic annotations in addition to sequence data. When validated on benchmark data, FATHMM-indel significantly outperforms CADD and GAVIN, state of the art models in assessing the pathogenic impact of non-coding variants. FATHMM-indel is available via a web server at indels.biocompute.org.uk. FATHMM-indel can accurately predict the functional impact and prioritise small indels throughout the whole non-coding genome.

HIVBrainSeqDB: a database of annotated HIV envelope sequences from brain and other anatomical sites

Directory of Open Access Journals (Sweden)

O'Connor Niall

2010-12-01

Full Text Available Abstract Background: The population of HIV replicating within a host consists of independently evolving and interacting sub-populations that can be genetically distinct within anatomical compartments. HIV replicating within the brain causes neurocognitive disorders in up to 20-30% of infected individuals and is a viral sanctuary site for the development of drug resistance. The primary determinant of HIV neurotropism is macrophage tropism, which is primarily determined by the viral envelope (env) gene. However, studies of genetic aspects of HIV replicating in the brain are hindered because existing repositories of HIV sequences are not focused on neurotropic virus nor annotated with neurocognitive and neuropathological status. To address this need, we constructed the HIV Brain Sequence Database. Results: The HIV Brain Sequence Database is a public database of HIV envelope sequences, directly sequenced from brain and other tissues from the same patients. Sequences are annotated with clinical data including viral load, CD4 count, antiretroviral status, neurocognitive impairment, and neuropathological diagnosis, all curated from the original publication. Tissue source is coded using an anatomical ontology, the Foundational Model of Anatomy, to capture the maximum level of detail available, while maintaining ontological relationships between tissues and their subparts. 44 tissue types are represented within the database, grouped into 4 categories: (i) brain, brainstem, and spinal cord; (ii) meninges, choroid plexus, and CSF; (iii) blood and lymphoid; and (iv) other (bone marrow, colon, lung, liver, etc.). Patient coding is correlated across studies, allowing sequences from the same patient to be grouped to increase statistical power. Using Cytoscape, we visualized relationships between studies, patients and sequences, illustrating interconnections between studies and the varying depth of sequencing, patient number, and tissue representation across studies
Coding patient emotional cues and concerns in medical consultations: the Verona coding definitions of emotional sequences (VR-CoDES).

NARCIS (Netherlands)

Zimmermann, C.; Piccolo, L. del; Bensing, J.; Bergvik, S.; Haes, H. de; Eide, H.; Fletcher, I.; Goss, C.; Heaven, C.; Humphris, G.; Young-Mi, K.; Langewitz, W.; Meeuwesen, L.; Nuebling, M.; Rimondini, M.; Salmon, P.; Dulmen, S. van; Wissow, L.; Zandbelt, L.; Finset, A.

2011-01-01

Objective: To present the Verona Coding Definitions of Emotional Sequences (VR-CoDES CC), a consensus based system for coding patient expressions of emotional distress in medical consultations, defined as Cues or Concerns. Methods: The system was developed by an international group of communication
Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

Directory of Open Access Journals (Sweden)

Guo Xiang

2008-12-01

Full Text Available Abstract Background Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know
A Retrospective Examination of Feline Leukemia Subgroup Characterization: Viral Interference Assays to Deep Sequencing

Directory of Open Access Journals (Sweden)

Elliott S. Chiu

2018-01-01

Full Text Available Feline leukemia virus (FeLV) was the first feline retrovirus discovered, and is associated with multiple fatal disease syndromes in cats, including lymphoma. The original research conducted on FeLV employed classical virological techniques. As methods have evolved to allow FeLV genetic characterization, investigators have continued to unravel the molecular pathology associated with this fascinating agent. In this review, we discuss how FeLV classification, transmission, and disease-inducing potential have been defined sequentially by viral interference assays, Sanger sequencing, PCR, and next-generation sequencing. In particular, we highlight the influences of endogenous FeLV and host genetics that represent FeLV research opportunities on the near horizon.
Non-binary Hybrid LDPC Codes: Structure, Decoding and Optimization

OpenAIRE

Sassatelli, Lucile; Declercq, David

2007-01-01

In this paper, we propose to study and optimize a very general class of LDPC codes whose variable nodes belong to finite sets with different orders. We named this class of codes Hybrid LDPC codes. Although efficient optimization techniques exist for binary LDPC codes and more recently for non-binary LDPC codes, they both exhibit drawbacks due to different reasons. Our goal is to capitalize on the advantages of both families by building codes with binary (or small finite set order) and non-bin...
Metformin-Induced Changes of the Coding Transcriptome and Non-Coding RNAs in the Livers of Non-Alcoholic Fatty Liver Disease Mice.

Science.gov (United States)

Guo, Jun; Zhou, Yuan; Cheng, Yafen; Fang, Weiwei; Hu, Gang; Wei, Jie; Lin, Yajun; Man, Yong; Guo, Lixin; Sun, Mingxiao; Cui, Qinghua; Li, Jian

2018-01-01

Recent studies have suggested that changes in non-coding mRNA play a key role in the progression of non-alcoholic fatty liver disease (NAFLD). Metformin is now recommended and effective for the treatment of NAFLD. We hope the current analyses of the non-coding mRNA transcriptome will provide a better presentation of the potential roles of mRNAs and long non-coding RNAs (lncRNAs) that underlie NAFLD and metformin intervention. The present study mainly analysed changes in the coding transcriptome and non-coding RNAs after the application of a five-week metformin intervention. Liver samples from three groups of mice were harvested for transcriptome profiling, which covered mRNA, lncRNA, microRNA (miRNA) and circular RNA (circRNA), using a microarray technique. A systematic alleviation of high-fat diet (HFD)-induced transcriptome alterations by metformin was observed. The metformin treatment largely reversed the correlations with diabetes-related pathways. Our analysis also suggested interaction networks between differentially expressed lncRNAs and known hepatic disease genes and interactions between circRNA and their disease-related miRNA partners. Eight HFD-responsive lncRNAs and three metformin-responsive lncRNAs were noted due to their widespread associations with disease genes. Moreover, seven miRNAs that interacted with multiple differentially expressed circRNAs were highlighted because they were likely to be associated with metabolic or liver diseases. The present study identified novel changes in the coding transcriptome and non-coding RNAs in the livers of NAFLD mice after metformin treatment that might shed light on the underlying mechanism by which metformin impedes the progression of NAFLD. © 2018 The Author(s). Published by S. Karger AG, Basel.
Non-Binary Protograph-Based LDPC Codes: Analysis, Enumerators and Designs

OpenAIRE

Sun, Yizeng

2013-01-01

Non-binary LDPC codes can outperform binary LDPC codes using sum-product algorithm with higher computation complexity. Non-binary LDPC codes based on protographs have the advantage of simple hardware architecture. In the first part of this thesis, we will use EXIT chart analysis to compute the thresholds of different protographs over GF(q). Based on threshold computation, some non-binary protograph-based LDPC codes are designed and their frame error rates are compared with binary LDPC codes. ...
ViralORFeome: an integrated database to generate a versatile collection of viral ORFs.

Science.gov (United States)

Pellet, J; Tafforeau, L; Lucas-Hourani, M; Navratil, V; Meyniel, L; Achaz, G; Guironnet-Paquet, A; Aublin-Gex, A; Caignard, G; Cassonnet, P; Chaboud, A; Chantier, T; Deloire, A; Demeret, C; Le Breton, M; Neveu, G; Jacotot, L; Vaglio, P; Delmotte, S; Gautier, C; Combet, C; Deleage, G; Favre, M; Tangy, F; Jacob, Y; Andre, P; Lotteau, V; Rabourdin-Combe, C; Vidalain, P O

2010-01-01

Large collections of protein-encoding open reading frames (ORFs) established in a versatile recombination-based cloning system have been instrumental to study protein functions in high-throughput assays. Such 'ORFeome' resources have been developed for several organisms but in virology, plasmid collections covering a significant fraction of the virosphere are still needed. In this perspective, we present ViralORFeome 1.0 (http://www.viralorfeome.com), an open-access database and management system that provides an integrated set of bioinformatic tools to clone viral ORFs in the Gateway(R) system. ViralORFeome provides a convenient interface to navigate through virus genome sequences, to design ORF-specific cloning primers, to validate the sequence of generated constructs and to browse established collections of virus ORFs. Most importantly, ViralORFeome has been designed to manage all possible variants or mutants of a given ORF so that the cloning procedure can be applied to any emerging virus strain. A subset of plasmid constructs generated with ViralORFeome platform has been tested with success for heterologous protein expression in different expression systems at proteome scale. ViralORFeome should provide our community with a framework to establish a large collection of virus ORF clones, an instrumental resource to determine functions, activities and binding partners of viral proteins.
A Tiny RNA that Packs a Big Punch: The Critical Role of a Viral miR-155 Ortholog in Lymphomagenesis in Marek’s Disease

Directory of Open Access Journals (Sweden)

Guoqing Zhuang

2017-06-01

Full Text Available MicroRNAs (miRNAs) are small non-coding RNAs that have been identified in animals, plants, and viruses. These small RNAs play important roles in post-transcriptional regulation of various cellular processes, including development, differentiation, and all aspects of cancer biology. Rapid-onset T-cell lymphoma of chickens, namely Marek’s disease (MD), induced by Gallid alphaherpesvirus 2 (GaHV2), could provide an ideal natural animal model for herpesvirus-related cancer research. GaHV2 encodes 26 mature miRNAs derived from 14 precursors assembled in three distinct gene clusters in the viral genome. One of the most highly expressed GaHV2 miRNAs, miR-M4-5p, shows high sequence similarity to the cellular miR-155 and the miR-K12-11 encoded by Kaposi’s sarcoma-associated herpesvirus, particularly in the miRNA “seed region.” As with miR-K12-11, miR-M4-5p shares a common set of host and viral target genes with miR-155, suggesting that they may target the same regulatory cellular networks; however, differences in regulatory function between miR-155 and miR-M4-5p may distinguish non-viral and viral mediated tumorigenesis. In this review, we focus on the functions of miR-M4-5p as the viral ortholog of miR-155 to explore how the virus mimics a host pathway to benefit the viral life cycle and trigger virus-induced tumorigenesis.
Non coding RNA: sequence-specific guide for chromatin modification and DNA damage signaling

Directory of Open Access Journals (Sweden)

Sofia Francia

2015-11-01

Full Text Available Chromatin conformation shapes the environment in which our genome is transcribed into RNA. Transcription is a source of DNA damage, thus it often occurs concomitantly to DNA damage signaling. Growing amounts of evidence suggest that different types of RNAs can, independently from their protein-coding properties, directly affect chromatin conformation, transcription and splicing, as well as promote the activation of the DNA damage response (DDR) and DNA repair. Therefore, transcription paradoxically functions to both threaten and safeguard genome integrity. On the other hand, DNA damage signaling is known to modulate chromatin to suppress transcription of the surrounding genetic unit. It is thus intriguing to understand how transcription can modulate DDR signaling while, in turn, DDR signaling represses transcription of chromatin around the DNA lesion. An unexpected player in this field is the RNA interference (RNAi) machinery, which plays roles in transcription, splicing and chromatin modulation in several organisms. Non-coding RNAs (ncRNAs) and several protein factors involved in the RNAi pathway are well known master regulators of chromatin, while only recent reports suggest that ncRNAs are involved in DDR signaling and homology-mediated DNA repair. Here, we discuss the experimental evidence supporting the idea that ncRNAs act at the genomic loci from which they are transcribed to modulate chromatin, DDR signaling and DNA repair.
Viral Bacterial Artificial Chromosomes: Generation, Mutagenesis, and Removal of Mini-F Sequences

Directory of Open Access Journals (Sweden)

B. Karsten Tischer

2012-01-01

Full Text Available Maintenance and manipulation of large DNA and RNA virus genomes had presented an obstacle for virological research. BAC vectors provided a solution to both problems as they can harbor large DNA sequences and can efficiently be modified using well-established mutagenesis techniques in Escherichia coli. Numerous DNA virus genomes of herpesvirus and pox virus were cloned into mini-F vectors. In addition, several reverse genetic systems for RNA viruses such as members of Coronaviridae and Flaviviridae could be established based on BAC constructs. Transfection into susceptible eukaryotic cells of virus DNA cloned as a BAC allows reconstitution of recombinant viruses. In this paper, we provide an overview on the strategies that can be used for the generation of virus BAC vectors and also on systems that are currently available for various virus species. Furthermore, we address common mutagenesis techniques that allow modification of BACs from single-nucleotide substitutions to deletion of viral genes or insertion of foreign sequences. Finally, we review the reconstitution of viruses from BAC vectors and the removal of the bacterial sequences from the virus genome during this process.
PATACSDB—the database of polyA translational attenuators in coding sequences

Directory of Open Access Journals (Sweden)

Malgorzata Habich

2016-02-01

Full Text Available Recent additions to the repertoire of gene expression regulatory mechanisms are polyadenylate (polyA) tracks encoding poly-lysine runs in protein sequences. Such tracks stall the translation apparatus and induce frameshifting independently of the effects of charged nascent poly-lysine sequence on the ribosome exit channel. As such, they substantially influence the stability of mRNA and the amount of protein produced from a given transcript. Single base changes in these regions are enough to exert a measurable response on both protein and mRNA abundance; this makes each of these sequences a potentially interesting case study for the effects of synonymous mutation, gene dosage balance and natural frameshifting. Here we present PATACSDB, a resource that contains a comprehensive list of polyA tracks from over 250 eukaryotic genomes. Our data are based on the Ensembl genomic database of coding sequences and filtered with the 12A-1 algorithm, which selects polyA tracks with a minimal length of 12 A’s, allowing for one mismatched base. The PATACSDB database is accessible at: http://sysbio.ibb.waw.pl/patacsdb. The source code is available at http://github.com/habich/PATACSDB, and it includes the scripts with which the database can be recreated.
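The 12A-1 filter described in this abstract can be illustrated with a minimal Python sketch. It assumes one plausible reading of the rule (any 12-base window with at most one non-A base counts as a hit, and overlapping hits are merged); the function name and parameters are hypothetical and are not taken from the PATACSDB source code.

```python
def find_polya_tracks(seq, min_len=12, max_mismatch=1):
    """Return (start, end) half-open intervals of candidate polyA tracks.

    Assumed rule: a window of `min_len` bases is a hit when it contains
    at most `max_mismatch` non-A bases; overlapping hits are merged.
    """
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - min_len + 1):
        window = seq[start:start + min_len]
        if min_len - window.count("A") <= max_mismatch:
            end = start + min_len
            if hits and start <= hits[-1][1]:
                # Extend the previous interval instead of adding a new one.
                hits[-1] = (hits[-1][0], end)
            else:
                hits.append((start, end))
    return hits


# Toy coding-sequence fragment: 5 A's, one G mismatch, then 6 A's.
example = "GGC" + "AAAAA" + "G" + "AAAAAA" + "CCT"
tracks = find_polya_tracks(example)
print(tracks)
```

Under this reading the toy fragment yields a single 12-base track that tolerates the internal G; a stricter interpretation (for example, requiring the track to start and end with A) would need an extra check.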
Hypothesis for heritable, anti-viral immunity in crustaceans and insects

Directory of Open Access Journals (Sweden)

Flegel Timothy W

2009-09-01

Full Text Available Abstract. Background: It is known that crustaceans and insects can persistently carry one or more viral pathogens at low levels, without signs of disease. They may transmit them to their offspring or to naïve individuals, often with lethal consequences. The underlying molecular mechanisms have not been elucidated, but the process has been called viral accommodation. Since tolerance to one virus does not confer tolerance to another, tolerance is pathogen-specific, so the requirement for a specific pathogen response mechanism (memory) was included in the original viral accommodation concept. Later, it was hypothesized that specific responses were based on the presence of viruses in persistent infections. However, recent developments suggest that specific responses may be based on viral sequences inserted into the host genome. Presentation of the hypothesis: Non-retroviral fragments of both RNA and DNA viruses have been found in insect and crustacean genomes. In addition, reverse-transcriptase (RT) and integrase (IN) sequences are also common in their genomes. It is hypothesized that shrimp and other arthropods use these RT to recognize "foreign" mRNA of both RNA and DNA viruses and use the integrases (IN) to randomly insert short cDNA sequences into their genomes. By chance, some of these sequences result in production of immunospecific RNA (imRNA) capable of stimulating RNAi that suppresses viral propagation. Individuals with protective inserts would pass these on to the next generation, together with similar protective inserts for other viruses that could be amalgamated rapidly in individual offspring by random assortment of chromosomes. The most successful individuals would be environmentally selected from billions of offspring. Conclusion: This hypothesis for immunity based on an imRNA generation mechanism fits with the general principle of invertebrate immunity based on a non-host, "pattern recognition" process. If proven correct, understanding the
New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell-Culture Adapted Viral cDNA.

Science.gov (United States)

1987-10-13

after multiple passages in vivo and in vitro. J. Gen. Virol. 67, 1741-1744. Sabin, A.B. (1985). Oral poliovirus vaccine: history of its development... New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell-Culture Adapted Viral cDNA. Annual Report... Title (Include Security Classification): New Approaches to Attenuated Hepatitis A Vaccine Development: Cloning and Sequencing of Cell
A random forest classifier for detecting rare variants in NGS data from viral populations

Directory of Open Access Journals (Sweden)

Raunaq Malhotra

Full Text Available We propose a random forest classifier for distinguishing rare variants from sequencing errors in Next Generation Sequencing (NGS) data from viral populations. The method utilizes counts of k-mers of varying length from the reads of a viral population to train a random forest classifier, called MultiRes, that classifies k-mers as erroneous or rare variants. Our algorithm is rooted in concepts from signal processing and uses a frame-based representation of k-mers. Frames are sets of non-orthogonal basis functions that were traditionally used in signal processing for noise removal. We define discrete spatial signals for genomes and sequenced reads, and show that k-mers of a given size constitute a frame. We evaluate MultiRes on simulated and real viral population datasets, which consist of many low-frequency variants, and compare it to the error detection methods used in correction tools known in the literature. MultiRes has 4 to 500 times fewer false-positive k-mer predictions compared to other methods, which is essential for accurate estimation of viral population diversity and de novo assembly. It has high recall of the true k-mers, comparable to other error correction methods. MultiRes also has greater than 95% recall for detecting single nucleotide polymorphisms (SNPs) and fewer false-positive SNPs, while detecting a higher number of rare variants compared to other variant calling methods for viral populations. The software is available freely from the GitHub link https://github.com/raunaq-m/MultiRes. Keywords: Sequencing error detection, Reference free methods, Next-generation sequencing, Viral populations, Multi-resolution frames, Random forest classifier
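The input to such a classifier is a table of k-mer counts at several values of k. The following Python sketch shows only that counting step, under the assumption that reads are plain strings; the function names are hypothetical, and the frame-based representation and the random forest of MultiRes are not reproduced here.

```python
from collections import Counter


def count_kmers(reads, k):
    """Count every k-mer of length k over all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts


def multi_k_counts(reads, ks):
    """Map each k in `ks` to its k-mer count table (hypothetical helper)."""
    return {k: count_kmers(reads, k) for k in ks}


# Two toy reads; real input would be millions of sequenced reads.
reads = ["ACGTAC", "CGTACG"]
profile = multi_k_counts(reads, ks=(2, 3))
print(profile[2].most_common())
print(profile[3].most_common())
```

A k-mer that is rare at one k but whose shorter sub-k-mers are abundant is the kind of signal a multi-resolution method could exploit; how MultiRes actually combines the counts is described in the paper, not here.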
Assembly of viral genomes from metagenomes

NARCIS (Netherlands)

S.L. Smits (Saskia); R. Bodewes (Rogier); A. Ruiz-Gonzalez (Aritz); V. Baumgärtner (Volkmar); M.P.G. Koopmans D.V.M. (Marion); A.D.M.E. Osterhaus (Albert); A. Schürch (Anita)

2014-01-01

Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow
De novo assembly of highly diverse viral populations

Directory of Open Access Journals (Sweden)

Yang Xiao

2012-09-01

Full Text Available Abstract. Background: Extensive genetic diversity in viral populations within infected hosts and the divergence of variants from existing reference genomes impede the analysis of deep viral sequencing data. A de novo population consensus assembly is valuable both as a single linear representation of the population and as a backbone on which intra-host variants can be accurately mapped. The availability of consensus assemblies and robustly mapped variants is crucial to the genetic study of viral disease progression, transmission dynamics, and viral evolution. Existing de novo assembly techniques fail to robustly assemble ultra-deep sequence data from genetically heterogeneous populations such as viruses into full-length genomes due to the presence of extensive genetic variability, contaminants, and variable sequence coverage. Results: We present VICUNA, a de novo assembly algorithm suitable for generating consensus assemblies from genetically heterogeneous populations. We demonstrate its effectiveness on Dengue, Human Immunodeficiency and West Nile viral populations, representing a range of intra-host diversity. Compared to state-of-the-art assemblers designed for haploid or diploid systems, VICUNA recovers full-length consensus and captures insertion/deletion polymorphisms in diverse samples. Final assemblies maintain a high base calling accuracy. The VICUNA program is publicly available at: http://www.broadinstitute.org/scientific-community/science/projects/viral-genomics/viral-genomics-analysis-software. Conclusions: We developed VICUNA, a publicly available software tool that enables consensus assembly of ultra-deep sequence derived from diverse viral populations. While VICUNA was developed for the analysis of viral populations, its application to other heterogeneous sequence data sets such as metagenomic or tumor cell population samples may prove beneficial in these fields of research.
Application of the verona coding definitions of emotional sequences (VR-CoDES) on a pediatric data set.

Science.gov (United States)

Vatne, Torun M; Finset, Arnstein; Ørnes, Knut; Ruland, Cornelia M

2010-09-01

Adult patients present concerns as defined in the Verona Coding Definitions of Emotional Sequences (VR-CoDES), but we do not know how children express their concerns during medical consultations. This study aimed to evaluate the applicability of VR-CoDES to pediatric oncology consultations. Twenty-eight pediatric consultations were coded with VR-CoDES, and the material was also qualitatively analyzed for descriptive purposes. Five consultations were randomly selected for reliability testing and descriptive statistics were computed. Perfect inter-rater reliability for concerns and moderate reliability for cues were obtained. Cues and/or concerns were present in over half of the consultations. Cues were more frequent than concerns, with the majority of cues being verbal hints to hidden concerns or non-verbal cues. Intensity of expressions, limitations in vocabulary, commonality of statements, and complexity of the setting complicated the use of VR-CoDES. Child-specific cues were observed: use of the imperative, cues about past experiences, and use of onomatopoeia. Children with cancer express concerns during medical consultations. VR-CoDES is a reliable tool for coding concerns in pediatric data sets. For future applications in pediatric settings an appendix should be developed to incorporate the child-specific traits. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.
Forced selection of a human immunodeficiency virus type 1 variant that uses a non-self tRNA primer for reverse transcription: Involvement of viral RNA sequences and the reverse transcriptase enzyme

NARCIS (Netherlands)

Abbink, Truus E. M.; Beerens, Nancy; Berkhout, Ben

2004-01-01

Human immunodeficiency virus type 1 uses the tRNA(3)(Lys) molecule as a selective primer for reverse transcription. This primer specificity is imposed by sequence complementarity between the tRNA primer and two motifs in the viral RNA genome: the primer-binding site (PBS) and the primer activation
Nano-sized calcium phosphate (CaP) carriers for non-viral gene delivery

Energy Technology Data Exchange (ETDEWEB)

Lee, Donghyun, E-mail: dhlee@cau.ac.kr [Department of Biomedical Engineering, Division of Integrative Engineering, Chung-Ang University, 221 Heukseok-Dong, Dongjak-Gu, Seoul 156-756 (Korea, Republic of); Upadhye, Kalpesh [Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA 15260 (United States); Kumta, Prashant N., E-mail: pkumta@pitt.edu [Department of Bioengineering, University of Pittsburgh, Pittsburgh, PA 15260 (United States); Department of Mechanical Engineering and Materials Science, University of Pittsburgh, Pittsburgh, PA 15260 (United States); Department of Chemical and Petroleum Engineering, University of Pittsburgh, Pittsburgh, PA 15260 (United States); Center for Complex Engineered Multifunctional Materials, University of Pittsburgh, Pittsburgh, PA 15261 (United States)

2012-02-25

Highlights: Nanostructured calcium phosphates (NanoCaPs): comprehensive review. Non-viral gene delivery mechanisms: detailed mechanisms are outlined. Barriers to non-viral gene delivery: detailed barriers are discussed. Abstract: Gene therapy has garnered much interest due to its potential for curing multiple inherited and/or acquired diseases. As a result, there has been intense activity from multiple research groups for developing effective delivery methods and carriers, which is a critical step in advancing gene delivery technologies. In order for the carriers to effectively deliver the genetic payloads, multiple extracellular and intracellular barriers need to be overcome. Although overcoming these challenges to improve the effectiveness is critical, the development of safe gene delivery agents is even more vital to assure their use in clinical applications. The development of safe and effective strategies has therefore been a major challenge impeding gene therapy progress. In this regard, calcium phosphate (CaP) based nanoparticles have been considered as one of the candidate non-viral gene delivery vehicles, but have been plagued by inconsistent and low transfection efficiencies, limiting their progress. There has been major research effort to improve the consistency and effectiveness of CaP based vectors. Currently, it is thought that by controlling the various synthesis factors such as Ca/P ratio, mode of mixing, and type of calcium phosphate phase, such variability and inefficiency could be modulated. This review attempts to provide a comprehensive analysis of the current research activity in the development of CaP based ceramic and polymer-ceramic hybrid systems for non-viral gene delivery. Preliminary transfection results of hydroxyapatite (HA or NanoCaPs), amorphous calcium phosphate (ACP) and brushite phases are also compared to assess the

Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

KAUST Repository

Zhang, Zhang

2010-11-08

Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution, and a large number of publicly available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge. Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences. Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms. Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.
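The two compositional parameters named in this abstract, GC content and purine content at each of the three codon positions, are straightforward to measure on an observed coding sequence. The Python sketch below computes only these observed values; the function name is hypothetical, and the two theoretical models proposed by the authors are not implemented here.

```python
def codon_position_composition(cds):
    """Return observed GC and purine (A+G) fractions per codon position.

    Trailing bases that do not complete a codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    result = {}
    for pos in range(3):
        bases = cds[pos:usable:3]  # every third base, starting at pos
        n = len(bases)
        gc = sum(b in "GC" for b in bases) / n if n else 0.0
        purine = sum(b in "AG" for b in bases) / n if n else 0.0
        result[pos + 1] = {"gc": gc, "purine": purine}
    return result


# Toy sequence with two codons: ATG GCC.
composition = codon_position_composition("ATGGCC")
for position, values in composition.items():
    print(position, values)
```

Comparing such observed values with model expectations is where the deviations discussed in the abstract would show up.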
Discovery of Cationic Polymers for Non-viral Gene Delivery using Combinatorial Approaches

Science.gov (United States)

Barua, Sutapa; Ramos, James; Potta, Thrimoorthy; Taylor, David; Huang, Huang-Chiao; Montanez, Gabriela; Rege, Kaushal

2015-01-01

Gene therapy is an attractive treatment option for diseases of genetic origin, including several cancers and cardiovascular diseases. While viruses are effective vectors for delivering exogenous genes to cells, concerns related to insertional mutagenesis, immunogenicity, lack of tropism, decay and high production costs necessitate the discovery of non-viral methods. Significant efforts have been focused on cationic polymers as non-viral alternatives for gene delivery. Recent studies have employed combinatorial syntheses and parallel screening methods for enhancing the efficacy of gene delivery, biocompatibility of the delivery vehicle, and overcoming cellular level barriers as they relate to polymer-mediated transgene uptake, transport, transcription, and expression. This review summarizes and discusses recent advances in combinatorial syntheses and parallel screening of cationic polymer libraries for the discovery of efficient and safe gene delivery systems. PMID:21843141
[Immunotherapy for refractory viral infections].

Science.gov (United States)

Morio, Tomohiro; Fujita, Yuriko; Takahashi, Satoshi

Various antiviral agents have been developed, which are sometimes associated with toxicity, development of virus-resistant strain, and high cost. Virus-specific T-cell (VST) therapy provides an alternative curative therapy that can be effective for a prolonged time without eliciting drug resistance. VSTs can be directly separated using several types of capture devices and can be obtained by stimulating peripheral blood mononuclear cells with viral antigens (virus, protein, or peptide) loaded on antigen-presenting cells (APC). APC can be transduced with virus-antigen coding plasmid or pulsed with overlapping peptides. VST therapy has been studied in drug non-responsive viral infections after hematopoietic cell transplantation (HCT). Several previous studies have demonstrated the efficacy of VST therapy without significant severe GVHD. In addition, VSTs from a third-party donor have been prepared and administered for post-HCT viral infection. Although target viruses of VSTs include herpes virus species and polyomavirus species, a wide variety of pathogens, such as papillomavirus, intracellular bacteria, and fungi, can be treated by pathogen-specific T-cells. Perhaps, these specific T-cells could be used for opportunistic infections in other immunocompromised hosts in the near future.
Optical orthogonal code-division multiple-access system - Part 2: Multibits/sequence-period OOCDMA

Science.gov (United States)

Kwon, Hyuck M.

1994-08-01

In a recently proposed optical orthogonal code division multiple-access (OOCDMA) system, one bit of user's data is transmitted per sequence-period, and a threshold is employed for the final bit decision. In this paper, a system that can transmit multiple bits per sequence-period is introduced, and avalanche photodiode (APD) noise, thermal noise, and interference are included. This system, derived by exploiting orthogonal properties of the OOCDMA code sequence and using a maximum search (instead of a threshold) in the final decision, is log2 F times higher in throughput, where F is the sequence-period. For example, the bit error probability is four orders of magnitude better at -56 dBW received laser power, with F = 1000 chips, 10 'marks' in a sequence, and 10 users at a 30 Mb/s data rate for the one-bit/sequence-period system and a 270 Mb/s data rate for the multibits/sequence-period system. Furthermore, an exact analysis is performed for the log2 F bits/sequence-period system with a hard-limiter placed before the receiver, and its performance is compared to the performance without a hard-limiter, for the chip-synchronous case. The improvement from using a hard-limiter is significant in the log2 F bits/sequence-period OOCDMA system.
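The data rates quoted in this abstract are consistent with rounding log2 F down to a whole number of bits per sequence-period: for F = 1000 chips that gives 9 bits, and 9 x 30 Mb/s = 270 Mb/s. The short Python check below makes that arithmetic explicit; the rounding rule is an inference from the quoted numbers, not a statement from the paper, and the function names are hypothetical.

```python
import math


def bits_per_sequence_period(chips):
    """Whole bits that one of `chips` positions can encode (assumed floor)."""
    return math.floor(math.log2(chips))


def multibit_rate_mbps(single_bit_rate_mbps, chips):
    """Data rate when each sequence-period carries floor(log2 F) bits."""
    return single_bit_rate_mbps * bits_per_sequence_period(chips)


bits = bits_per_sequence_period(1000)
rate = multibit_rate_mbps(30, 1000)
print(bits, rate)
```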
Metagenomic analysis of viral diversity in respiratory samples from patients with respiratory tract infections in Kuwait.

Science.gov (United States)

Madi, Nada; Al-Nakib, Widad; Mustafa, Abu Salim; Habibi, Nazima

2018-03-01

A metagenomic approach based on target-independent next-generation sequencing has become a known method for the detection of both known and novel viruses in clinical samples. This study aimed to use the metagenomic sequencing approach to characterize the viral diversity in respiratory samples from patients with respiratory tract infections. We investigated 86 respiratory samples received from various hospitals in Kuwait between 2015 and 2016 for the diagnosis of respiratory tract infections. Viruses were characterized by metagenomic next-generation sequencing. According to the metagenomic analysis, an average of 145,019 reads were identified, and 2% of these reads were of viral origin. Metagenomic analysis of the viral sequences revealed many known respiratory viruses, which were detected in 30.2% of the clinical samples. In addition, sequences of non-respiratory viruses were detected in 14% of the clinical samples, while sequences of non-human viruses were detected in 55.8% of the clinical samples. The average genome coverage of the viruses was 12%, with the highest genome coverage of 99.2% for respiratory syncytial virus and the lowest of 1% for torque teno midi virus 2. Our results showed 47.7% agreement between multiplex Real-Time PCR and metagenomic sequencing in the detection of respiratory viruses in the clinical samples. Though there are some difficulties in applying this method to clinical samples, such as specimen quality, these observations are indicative of the promising utility of the metagenomic sequencing approach for the identification of respiratory viruses in patients with respiratory tract infections. © 2017 Wiley Periodicals, Inc.
Revised Mimivirus major capsid protein sequence reveals intron-containing gene structure and extra domain

Directory of Open Access Journals (Sweden)

Suzan-Monti Marie

2009-05-01

Full Text Available Abstract. Background: Acanthamoebae polyphaga Mimivirus (APM) is the largest known dsDNA virus. The viral particle has a nearly icosahedral structure with an internal capsid shell surrounded by a dense layer of fibrils. A Capsid protein sequence, D13L, was deduced from the APM L425 coding gene and was shown to be the most abundant protein found within the viral particle. However, this protein remained poorly characterised until now. A revised protein sequence deposited in a database suggested an additional N-terminal stretch of 142 amino acids missing from the original deduced sequence. This result led us to investigate the L425 gene structure and the biochemical properties of the complete APM major Capsid protein. Results: This study describes the full-length 3430 bp Capsid coding gene and characterises the corresponding 593-amino-acid Capsid protein 1. The recombinant full-length protein allowed the production of a specific monoclonal antibody able to detect the Capsid protein 1 within the viral particle. This protein appeared to be post-translationally modified by glycosylation and phosphorylation. We proposed a secondary structure prediction of APM Capsid protein 1 compared to the Capsid protein structure of Paramecium Bursaria Chlorella Virus 1, another member of the Nucleo-Cytoplasmic Large DNA virus family. Conclusion: The characterisation of the full-length L425 Capsid coding gene of Acanthamoebae polyphaga Mimivirus provides new insights into the structure of the main Capsid protein. The production of a full-length recombinant protein will be useful for further structural studies.
Non-Viral Deoxyribonucleoside Kinases

DEFF Research Database (Denmark)

Christiansen, Louise Slot; Munch-Petersen, Birgitte; Knecht, Wolfgang

2015-01-01

Deoxyribonucleoside kinases (dNKs) phosphorylate deoxyribonucleosides to their corresponding monophosphate compounds. dNKs also phosphorylate deoxyribonucleoside analogues that are used in the treatment of cancer or viral infections. The study of the mammalian dNKs has therefore always been of gr...
Immunity: Insect Immune Memory Goes Viral.

Science.gov (United States)

Ligoxygakis, Petros

2017-11-20

Adaptive memory in insect immunity has been controversial. In this issue, Andino and co-workers propose that acquisition of viral sequences in the host genome gives rise to anti-sense, anti-viral piRNAs. Such sequences can be regarded as both a genomic archive of past infections and as an armour of potential heritable memory. Copyright © 2017 Elsevier Ltd. All rights reserved.
Bistability in self-activating genes regulated by non-coding RNAs

International Nuclear Information System (INIS)

Miro-Bueno, Jesus

2015-01-01

Non-coding RNA molecules are able to regulate gene expression and play an essential role in cells. On the other hand, bistability is an important behaviour of genetic networks. Here, we propose and study an ODE model in order to show how non-coding RNA can produce bistability in a simple way. The model comprises a single gene with positive feedback that is repressed by non-coding RNA molecules. We show how the values of all the reaction rates involved in the model are able to control the transitions between the high and low states. This new model can be interesting to clarify the role of non-coding RNA molecules in genetic networks. As well, these results can be interesting in synthetic biology for developing new genetic memories and biomolecular devices based on non-coding RNAs
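How a self-activating gene repressed by non-coding RNA can switch between a low and a high state can be sketched with a one-variable caricature. The Python example below is not the ODE model of the paper: the Hill-type feedback term, the way the ncRNA level divides the feedback strength, and every parameter value are assumptions chosen only so that the toy system is bistable without ncRNA.

```python
def simulate_expression(x0, ncrna, t_end=100.0, dt=0.01,
                        basal=0.05, feedback=2.0, decay=1.0):
    """Euler-integrate dx/dt = basal + (feedback/(1+ncrna)) * x^2/(1+x^2) - decay*x.

    x is the gene product level; `ncrna` is a fixed repressor level that
    weakens the positive feedback (hypothetical functional form).
    """
    x = x0
    for _ in range(int(t_end / dt)):
        activation = (feedback / (1.0 + ncrna)) * x * x / (1.0 + x * x)
        x += dt * (basal + activation - decay * x)
    return x


# Without ncRNA the final state depends on the starting level (bistability).
low_state = simulate_expression(x0=0.3, ncrna=0.0)
high_state = simulate_expression(x0=1.0, ncrna=0.0)
# With enough ncRNA the high state disappears and the gene settles low.
repressed_state = simulate_expression(x0=1.0, ncrna=1.0)
print(round(low_state, 3), round(high_state, 3), round(repressed_state, 3))
```

In this toy version the ncRNA level acts as the control knob that moves the system between a bistable regime and a single low state, which is the qualitative behaviour the abstract describes for reaction rates in general.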
RNA-Seq analysis of D. radiodurans find non coding RNAs expressed in response to radiation stress

International Nuclear Information System (INIS)

Gadewal, Nikhil; Mukhopadhyaya, Rita

2015-01-01

In bacteria, the discovery of noncoding RNAs, functional RNA molecules that are not translated into protein, became possible with the advent of Next Generation Sequencing technology. Bacterial noncoding RNAs are typically 50-300 nucleotides long and work as internal signals controlling various levels of gene expression. Deep sequencing of total cellular RNA captures all coding and noncoding transcripts with their differential levels of expression in the transcriptome. It provides a powerful approach to study bacterial gene expression and mechanisms of gene regulation. We subjected the 3 h transcriptome of Deinococcus radiodurans R1 cells post exposure to 6 kGy gamma radiation to 100 x 2 cycles of deep sequencing on the Illumina HiSeq 2000 to look for ncRNA transcripts. The bioinformatics pipeline for analysis and interpretation of the RNA-Seq data was built in house using publicly available software. Our sequence data aligned with 21 putative ncRNAs expressed in the intergenic regions of the annotated genome of D. radiodurans. Verification of 2 ncRNA candidates and 3 transcription factor genes by Real Time PCR confirmed the presence of these transcripts in the 3 h transcriptome sequenced by us. Any relationship between ncRNAs and control of radiation-induced gene expression in D. radiodurans can be proved only after specific gene knockouts in future. (author)
SEAPATH: A microcomputer code for evaluating physical security effectiveness using adversary sequence diagrams

International Nuclear Information System (INIS)

Darby, J.L.

1986-01-01

The Adversary Sequence Diagram (ASD) concept was developed by Sandia National Laboratories (SNL) to examine physical security system effectiveness. Sandia also developed a mainframe computer code, PANL, to analyze the ASD. The authors have developed a microcomputer code, SEAPATH, which also analyzes ASDs. The authors are supporting SNL in software development of the SAVI code; SAVI utilizes the SEAPATH algorithm to identify and quantify paths
Kinetic models of gene expression including non-coding RNAs

Energy Technology Data Exchange (ETDEWEB)

Zhdanov, Vladimir P., E-mail: zhdanov@catalysis.r

2011-03-15

In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.
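The simplest class of models mentioned in this abstract, without kinetic feedbacks, can be sketched as mean-field equations in which an ncRNA pairs with its target mRNA and the complex is degraded. The Python example below integrates one such system with the Euler method; the specific equations and all rate constants are illustrative assumptions, not values or notation from the review.

```python
def simulate_titration(k_r, t_end=300.0, dt=0.01,
                       k_m=1.0, d_m=0.1, d_r=0.1, k_pair=1.0,
                       k_p=2.0, d_p=0.2):
    """Integrate a minimal mRNA (m), ncRNA (r), protein (p) model.

    dm/dt = k_m - d_m*m - k_pair*m*r   (mRNA lost by decay and pairing)
    dr/dt = k_r - d_r*r - k_pair*m*r   (ncRNA lost by decay and pairing)
    dp/dt = k_p*m - d_p*p              (protein translated from free mRNA)
    """
    m = r = p = 0.0
    for _ in range(int(t_end / dt)):
        pairing = k_pair * m * r
        dm = k_m - d_m * m - pairing
        dr = k_r - d_r * r - pairing
        dp = k_p * m - d_p * p
        m, r, p = m + dt * dm, r + dt * dr, p + dt * dp
    return m, r, p


m_free, _, p_free = simulate_titration(k_r=0.0)       # no ncRNA synthesis
m_sil, r_sil, p_sil = simulate_titration(k_r=0.5)     # ncRNA at half the mRNA rate
print(round(m_free, 2), round(m_sil, 2), round(p_free, 1), round(p_sil, 1))
```

With these assumed rates, ncRNA synthesis at half the mRNA synthesis rate roughly halves the steady-state mRNA and protein levels, which illustrates the silencing-by-pairing mechanism in its most basic form.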
Comparative analysis of seven viral nuclear export signals (NESs) reveals the crucial role of nuclear export mediated by the third NES consensus sequence of nucleoprotein (NP) in influenza A virus replication.

Directory of Open Access Journals (Sweden)

Nopporn Chutiwitoonchai

The assembly of influenza virus progeny virions requires machinery that exports viral genomic ribonucleoproteins from the cell nucleus. Currently, seven nuclear export signal (NES) consensus sequences have been identified in different viral proteins, including NS1, NS2, M1, and NP. The present study examined the roles of viral NES consensus sequences and their significance in terms of viral replication and nuclear export. Mutation of the NP-NES3 consensus sequence resulted in a failure to rescue viruses using a reverse genetics approach, whereas mutation of the NS2-NES1 and NS2-NES2 sequences led to a strong reduction in viral replication kinetics compared with the wild-type sequence. While the viral replication kinetics for other NES mutant viruses were also lower than those of the wild-type, the difference was not so marked. Immunofluorescence analysis after transient expression of NP-NES3, NS2-NES1, or NS2-NES2 proteins in host cells showed that they accumulated in the cell nucleus. These results suggest that the NP-NES3 consensus sequence is mostly required for viral replication. Therefore, each of the hydrophobic (Φ) residues within this NES consensus sequence (Φ1, Φ2, Φ3, or Φ4) was mutated, and its viral replication and nuclear export function were analyzed. No viruses harboring NP-NES3 Φ2 or Φ3 mutants could be rescued. Consistent with this, the NP-NES3 Φ2 and Φ3 mutants showed reduced binding affinity with CRM1 in a pull-down assay, and both accumulated in the cell nucleus. Indeed, a nuclear export assay revealed that these mutant proteins showed lower nuclear export activity than the wild-type protein. Moreover, the Φ2 and Φ3 residues (along with other Φ residues within the NP-NES3 consensus) were highly conserved among different influenza A viruses, including human, avian, and swine. Taken together, these results suggest that the Φ2 and Φ3 residues within the NP-NES3 protein are important for its nuclear export function
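As an illustration of what a search for an NES consensus with four hydrophobic positions might look like, the sketch below scans a protein sequence with a regular expression. The abstract does not state which consensus definition was used; the pattern here assumes the commonly cited classical form Φ1-x(2,3)-Φ2-x(2,3)-Φ3-x-Φ4, with Φ taken as L, I, V, F, or M and x as any residue. The function name is hypothetical.

```python
import re

# Assumed classical NES consensus: Φ1-x(2,3)-Φ2-x(2,3)-Φ3-x-Φ4,
# where Φ is one of L, I, V, F, M and x is any residue.
NES_PATTERN = re.compile(r"[LIVFM].{2,3}[LIVFM].{2,3}[LIVFM].[LIVFM]")


def find_nes_candidates(protein):
    """Return (start, matched_segment) for non-overlapping consensus matches."""
    return [(m.start(), m.group()) for m in NES_PATTERN.finditer(protein.upper())]


if __name__ == "__main__":
    # Toy input: a ten-residue segment that fits the assumed pattern.
    print(find_nes_candidates("LALKLAGLDI"))
```

A pattern match only flags a candidate; as the study above shows, functional relevance has to be tested experimentally, for example by mutating individual Φ positions.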
The Evolution of Bony Vertebrate Enhancers at Odds with Their Coding Sequence Landscape.

Science.gov (United States)

Yousaf, Aisha; Sohail Raza, Muhammad; Ali Abbasi, Amir

2015-08-06

Enhancers lie at the heart of transcriptional and developmental gene regulation. Therefore, changes in enhancer sequences usually disrupt the target gene expression and result in disease phenotypes. Despite the well-established role of enhancers in development and disease, evolutionary sequence studies are lacking. The current study attempts to unravel the puzzle of bony vertebrates' conserved noncoding elements (CNE) enhancer evolution. Bayesian phylogenetics of enhancer sequences spotlights promising interordinal relationships among placental mammals, proposing a closer relationship between humans and laurasiatherians while placing rodents at the basal position. Clock-based estimates of enhancer evolution provided a dynamic picture of interspecific rate changes across the bony vertebrate lineage. Moreover, coelacanth in the study augmented our appreciation of the vertebrate cis-regulatory evolution during water-land transition. Intriguingly, we observed a pronounced upsurge in enhancer evolution in land-dwelling vertebrates. These novel findings triggered us to further investigate the evolutionary trend of coding as well as CNE nonenhancer repertoires, to highlight the relative evolutionary dynamics of diverse genomic landscapes. Surprisingly, the evolutionary rates of enhancer sequences were clearly at odds with those of the coding and the CNE nonenhancer sequences during vertebrate adaptation to land, with land vertebrates exhibiting significantly reduced rates of coding sequence evolution in comparison to their fast evolving regulatory landscape. The observed variation in tetrapod cis-regulatory elements caused the fine-tuning of associated gene regulatory networks. Therefore, the increased evolutionary rate of tetrapods' enhancer sequences might be responsible for the variation in developmental regulatory circuits during the process of vertebrate adaptation to land. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for
Comparison of two Next Generation sequencing platforms for full genome sequencing of Classical Swine Fever Virus

DEFF Research Database (Denmark)

Fahnøe, Ulrik; Pedersen, Anders Gorm; Höper, Dirk

2013-01-01

Next Generation Sequencing (NGS) is becoming more adopted into viral research and will be the preferred technology in the years to come. We have recently sequenced several strains of Classical Swine Fever Virus (CSFV) by NGS on both Genome Sequencer FLX (GS FLX) and Iontorrent PGM platforms ... to the consensus sequence. Additionally, we got an average sequence depth for the genome of 4000 for the Iontorrent PGM and 400 for the FLX platform making the mapping suitable for single nucleotide variant (SNV) detection. The analysis revealed a single non-silent SNV A10665G leading to the amino acid change D...
Weight Distribution for Non-binary Cluster LDPC Code Ensemble

Science.gov (United States)

Nozaki, Takayuki; Maehara, Masaki; Kasai, Kenta; Sakaniwa, Kohichi

In this paper, we derive the average weight distributions for the irregular non-binary cluster low-density parity-check (LDPC) code ensembles. Moreover, we give the exponential growth rate of the average weight distribution in the limit of large code length. We show that there exist $(2,d_c)$-regular non-binary cluster LDPC code ensembles whose normalized typical minimum distances are strictly positive.
From structure prediction to genomic screens for novel non-coding RNAs

DEFF Research Database (Denmark)

Gorodkin, Jan; Hofacker, Ivo L.

2011-01-01

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction ... This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early ... upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other.
Short-lived non-coding transcripts (SLiTs): Clues to regulatory long non-coding RNA.

Science.gov (United States)

Tani, Hidenori

2017-03-22

Whole transcriptome analyses have revealed a large number of novel long non-coding RNAs (lncRNAs). Although the importance of lncRNAs has been documented in previous reports, the biological and physiological functions of lncRNAs remain largely unknown. The role of lncRNAs seems an elusive problem. Here, I propose a clue to the identification of regulatory lncRNAs. The key point is RNA half-life. RNAs with a long half-life (t1/2 > 4 h) contain a significant proportion of ncRNAs, as well as mRNAs involved in housekeeping functions, whereas RNAs with a short half-life (t1/2 < 4 h) include known regulatory ncRNAs and regulatory mRNAs. This novel class of ncRNAs with a short half-life can be categorized as Short-Lived non-coding Transcripts (SLiTs). I consider that SLiTs are likely to be rich in functionally uncharacterized regulatory RNAs. This review describes recent progress in research into SLiTs.
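The half-life criterion can be made concrete with a small sketch. It assumes simple first-order decay, so that t1/2 = ln 2 / k for a decay rate constant k, and it uses the 4 h boundary quoted above. The function names are hypothetical, and treating a half-life of exactly 4 h as short-lived is an arbitrary choice made here.

```python
import math


def half_life_hours(decay_rate_per_hour):
    """Half-life under first-order decay: t1/2 = ln(2) / k."""
    if decay_rate_per_hour <= 0:
        raise ValueError("decay rate must be positive")
    return math.log(2) / decay_rate_per_hour


def classify_by_half_life(t_half, threshold_hours=4.0):
    """Label a transcript by comparing its half-life with the 4 h boundary."""
    return "long-lived" if t_half > threshold_hours else "short-lived"


if __name__ == "__main__":
    for k in (0.1, 0.5):  # illustrative decay constants in 1/h
        t = half_life_hours(k)
        print(f"k = {k}/h -> t1/2 = {t:.2f} h -> {classify_by_half_life(t)}")
```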
Statistical properties and fractals of nucleotide clusters in DNA sequences

International Nuclear Information System (INIS)

Sun Tingting; Zhang Linxi; Chen Jin; Jiang Zhouting

2004-01-01

Statistical properties of nucleotide clusters in DNA sequences and their fractals are investigated in this paper. The average size of nucleotide clusters in non-coding sequence is larger than that in coding sequence. We investigate the cluster-size distribution $P(S)$ for human chromosomes 21 and 22, and the results are different from previous works. The cluster-size distribution $P(S_1+S_2)$ with the total size of sequential Pu-cluster and Py-cluster $S_1+S_2$ is studied. We observe that $P(S_1+S_2)$ follows an exponential decay both in coding and non-coding sequences. However, we get different results for human chromosomes 21 and 22. The probability distribution $P(S_1,S_2)$ of nucleotide clusters with the size of sequential Pu-cluster and Py-cluster $S_1$ and $S_2$ respectively, is also examined. In the meantime, some of the linear correlations are obtained in the double logarithmic plots of the fluctuation $F(l)$ versus nucleotide cluster distance $l$ along the DNA chain. The power spectrums of nucleotide clusters are also discussed, and it is concluded that the curves are flat and hardly changed and the 1/3 frequency is observed in neither coding nor non-coding sequence. These investigations can provide some insights into the nucleotide clusters of DNA sequences.
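A cluster-size distribution of this kind can be sketched as follows. The sketch assumes that a cluster is a maximal run of consecutive purines (A, G) or pyrimidines (C, T) and that the input contains only those four letters; the abstract does not spell out the authors' exact definition, and the function names are hypothetical.

```python
from collections import Counter
from itertools import groupby

PURINES = set("AG")  # every other base is treated as a pyrimidine (assumption)


def cluster_sizes(seq):
    """Return (kind, size) for each maximal run of purines ('Pu') or pyrimidines ('Py')."""
    kinds = ("Pu" if base in PURINES else "Py" for base in seq.upper())
    return [(kind, sum(1 for _ in run)) for kind, run in groupby(kinds)]


def size_distribution(seq):
    """Empirical P(S): the fraction of clusters that have size S."""
    counts = Counter(size for _, size in cluster_sizes(seq))
    total = sum(counts.values())
    return {size: n / total for size, n in sorted(counts.items())}


if __name__ == "__main__":
    toy = "AAGCTTA"  # toy sequence, not real chromosome data
    print(cluster_sizes(toy))
    print(size_distribution(toy))
```

Summing the sizes of each adjacent Pu/Py pair from `cluster_sizes` would give the combined quantity S1 + S2 discussed in the abstract.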

Target-dependent enrichment of virions determines the reduction of high-throughput sequencing in virus discovery.

Directory of Open Access Journals (Sweden)

Randi Holm Jensen

Viral infections cause many different diseases stemming both from well-characterized viral pathogens and from emerging viruses, and the search for novel viruses continues to be of great importance. High-throughput sequencing is an important technology for this purpose. However, viral nucleic acids often constitute a minute proportion of the total genetic material in a sample from infected tissue. Techniques to enrich viral targets in high-throughput sequencing have been reported, but the sensitivity of such methods is not well established. This study compares different library preparation techniques targeting both DNA and RNA with and without virion enrichment. By optimizing the selection of intact virus particles, both by physical and enzymatic approaches, we assessed the effectiveness of the specific enrichment of viral sequences as compared to non-enriched sample preparations by selectively looking for and counting read sequences obtained from shotgun sequencing. Using shotgun sequencing of total DNA or RNA, viral targets were detected at concentrations corresponding to the predicted level, providing a foundation for estimating the effectiveness of virion enrichment. Virion enrichment typically produced a 1000-fold increase in the proportion of DNA virus sequences. For RNA virions the gain was less pronounced with a maximum 13-fold increase. This enrichment varied between the different sample concentrations, with no clear trend. Although less sequencing was required to identify target sequences, it was not evident from our data that a lower detection level was achieved by virion enrichment compared to shotgun sequencing.
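The fold increase in the proportion of viral reads can be computed as a ratio of two read fractions. The sketch below is a minimal illustration; the read counts are invented for the example and the function names are hypothetical, not taken from the study.

```python
def viral_fraction(viral_reads, total_reads):
    """Proportion of reads assigned to the viral target."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    return viral_reads / total_reads


def fold_enrichment(enriched, unenriched):
    """Ratio of viral read fractions; each argument is (viral_reads, total_reads)."""
    baseline = viral_fraction(*unenriched)
    if baseline == 0:
        raise ValueError("no viral reads in the unenriched sample")
    return viral_fraction(*enriched) / baseline


if __name__ == "__main__":
    # Made-up counts: 5 viral reads per million before enrichment, 5000 after.
    print(fold_enrichment((5000, 1_000_000), (5, 1_000_000)))
```

Comparing fractions rather than raw counts keeps the measure independent of sequencing depth, which differs between libraries.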
Covalent Strategies for Targeting Messenger and Non-Coding RNAs: An Updated Review on siRNA, miRNA and antimiR Conjugates

Directory of Open Access Journals (Sweden)

Santiago Grijalvo

2018-02-01

Oligonucleotide-based therapy has become an alternative to classical approaches in the search for novel therapeutics involving gene-related diseases. Several mechanisms have been described that demonstrate the pivotal role of oligonucleotides in modulating gene expression. Antisense oligonucleotides (ASOs) and more recently siRNAs and miRNAs have made important contributions either in reducing aberrant protein levels by sequence-specific targeting of messenger RNAs (mRNAs) or in restoring the anomalous levels of non-coding RNAs (ncRNAs) that are involved in a good number of diseases including cancer. In addition to formulation approaches, which have helped accelerate the entry of ASOs, siRNAs and miRNAs into clinical trials, the covalent linkage between non-viral vectors and nucleic acids has also added value and opened new perspectives for the development of promising nucleic acid-based therapeutics. This review article is mainly focused on the strategies carried out for covalently modifying siRNA and miRNA molecules. Examples involving cell-penetrating peptides (CPPs), carbohydrates, polymers, lipids and aptamers are discussed for the synthesis of siRNA conjugates, whereas in the case of miRNA-based drugs, this review article places special emphasis on using antagomiRs, locked nucleic acids (LNAs), peptide nucleic acids (PNAs) as well as nanoparticles. The biomedical applications of siRNA and miRNA conjugates are also discussed.
Binary Linear-Time Erasure Decoding for Non-Binary LDPC codes

OpenAIRE

Savin, Valentin

2009-01-01

In this paper, we first introduce the extended binary representation of non-binary codes, which corresponds to a covering graph of the bipartite graph associated with the non-binary code. Then we show that non-binary codewords correspond to binary codewords of the extended representation that further satisfy some simplex-constraint: that is, bits lying over the same symbol-node of the non-binary graph must form a codeword of a simplex code. Applied to the binary erasure channel, this descript...
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction.

Science.gov (United States)

Thiebaut, Flávia; Rojas, Cristian A; Grativol, Clícia; Calixto, Edmundo P da R; Motta, Mariana R; Ballesteros, Helkin G F; Peixoto, Barbara; de Lima, Berenice N S; Vieira, Lucas M; Walter, Maria Emilia; de Armas, Elvismary M; Entenza, Júlio O P; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S; Ferreira, Paulo C G

2017-12-20

Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408, a copper-microRNA, was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5'RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

Science.gov (United States)

Grativol, Clícia; Motta, Mariana R.; Ballesteros, Helkin G. F.; Peixoto, Barbara; Vieira, Lucas M.; Walter, Maria Emilia; de Armas, Elvismary M.; Entenza, Júlio O. P.; Lifschitz, Sergio; Farinelli, Laurent; Hemerly, Adriana S.

2017-01-01

Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408—a copper-microRNA—was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly. PMID:29657296
Roles of Non-Coding RNA in Sugarcane-Microbe Interaction

Directory of Open Access Journals (Sweden)

Flávia Thiebaut

2017-12-01

Studies have highlighted the importance of non-coding RNA regulation in plant-microbe interaction. However, the roles of sugarcane microRNAs (miRNAs) in the regulation of disease responses have not been investigated. Firstly, we screened the sRNA transcriptome of sugarcane infected with Acidovorax avenae. Conserved and novel miRNAs were identified. Additionally, small interfering RNAs (siRNAs) were aligned to differentially expressed sequences from the sugarcane transcriptome. Interestingly, many siRNAs aligned to a transcript encoding a copper-transporter gene whose expression was induced in the presence of A. avenae, while the siRNAs were repressed in the presence of A. avenae. Moreover, a long intergenic non-coding RNA was identified as a potential target or decoy of miR408. To extend the bioinformatics analysis, we carried out independent inoculations and the expression patterns of six miRNAs were validated by quantitative reverse transcription-PCR (qRT-PCR). Among these miRNAs, miR408, a copper-microRNA, was downregulated. The cleavage of a putative miR408 target, a laccase, was confirmed by a modified 5′RACE (rapid amplification of cDNA ends) assay. MiR408 was also downregulated in samples infected with other pathogens, but it was upregulated in the presence of a beneficial diazotrophic bacteria. Our results suggest that regulation by miR408 is important in sugarcane sensing whether microorganisms are either pathogenic or beneficial, triggering specific miRNA-mediated regulatory mechanisms accordingly.
Linear-Time Non-Malleable Codes in the Bit-Wise Independent Tampering Model

DEFF Research Database (Denmark)

Cramer, Ronald; Damgård, Ivan Bjerre; Döttling, Nico

Non-malleable codes were introduced by Dziembowski et al. (ICS 2010) as coding schemes that protect a message against tampering attacks. Roughly speaking, a code is non-malleable if decoding an adversarially tampered encoding of a message m produces the original message m or a value m' (eventually abort) completely unrelated with m. It is known that non-malleability is possible only for restricted classes of tampering functions. Since their introduction, a long line of works has established feasibility results of non-malleable codes against different families of tampering functions. However ... non-malleable codes of Agrawal et al. (TCC 2015) and of Cheraghchi and Guruswami (TCC 2014) and improves the previous result in the bit-wise tampering model: it builds the first non-malleable codes with linear-time complexity and optimal-rate (i.e. rate 1 - o(1)).
A metagenomic viral discovery approach identifies potential zoonotic and novel mammalian viruses in Neoromicia bats within South Africa.

Science.gov (United States)

Geldenhuys, Marike; Mortlock, Marinda; Weyer, Jacqueline; Bezuidt, Oliver; Seamark, Ernest C J; Kearney, Teresa; Gleasner, Cheryl; Erkkila, Tracy H; Cui, Helen; Markotter, Wanda

2018-01-01

Species within the Neoromicia bat genus are abundant and widely distributed in Africa. It is common for these insectivorous bats to roost in anthropogenic structures in urban regions. Additionally, Neoromicia capensis have previously been identified as potential hosts for Middle East respiratory syndrome (MERS)-related coronaviruses. This study aimed to ascertain the gastrointestinal virome of these bats, as viruses excreted in fecal material or which may be replicating in rectal or intestinal tissues have the greatest opportunities of coming into contact with other hosts. Samples were collected in five regions of South Africa over eight years. Initial virome composition was determined by viral metagenomic sequencing by pooling samples and enriching for viral particles. Libraries were sequenced on the Illumina MiSeq and NextSeq500 platforms, producing a combined 37 million reads. Bioinformatics analysis of the high throughput sequencing data detected the full genome of a novel species of the Circoviridae family, and also identified sequence data from the Adenoviridae, Coronaviridae, Herpesviridae, Parvoviridae, Papillomaviridae, Phenuiviridae, and Picornaviridae families. Metagenomic sequencing data was insufficient to determine the viral diversity of certain families due to the fragmented coverage of genomes and lack of suitable sequencing depth, as some viruses were detected from the analysis of reads-data only. Follow up conventional PCR assays targeting conserved gene regions for the Adenoviridae, Coronaviridae, and Herpesviridae families were used to confirm metagenomic data and generate additional sequences to determine genetic diversity. The complete coding genome of a MERS-related coronavirus was recovered with additional amplicon sequencing on the MiSeq platform. The new genome shared 97.2% overall nucleotide identity to a previous Neoromicia-associated MERS-related virus, also from South Africa. Conventional PCR analysis detected diverse adenovirus and
Non-Coding RNAs in Arabidopsis

DEFF Research Database (Denmark)

van Wonterghem, Miranda

This work revolves around elucidating the mechanisms of micro RNAs (miRNAs) in Arabidopsis thaliana. I identified a new class of nuclear non-coding RNAs derived from protein coding genes. The genes are miRNA targets with extensive gene body methylation. The RNA species are nuclear localized and de...
Non-contiguous finished genome sequence and description of Streptococcus varani sp. nov.

Directory of Open Access Journals (Sweden)

S. Bakour

2016-05-01

Strain FF10T (= CSUR P1489 = DSM 100884) was isolated from the oral cavity of a lizard (Varanus niloticus) in Dakar, Senegal. Here we used a polyphasic study including phenotypic and genomic analyses to describe the strain FF10T. Results support strain FF10T being a Gram-positive coccus, facultative anaerobic bacterium, catalase-negative, non-motile and non-spore forming. The sequenced genome counts 2.46 Mb with one chromosome but no plasmid. It exhibits a G+C content of 40.4% and contains 2471 protein-coding and 45 RNA genes. On the basis of these data, we propose the creation of Streptococcus varani sp. nov.
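For readers unfamiliar with the G+C statistic quoted above, it is the percentage of called bases that are G or C. A minimal sketch follows; the function name is hypothetical and ambiguous bases such as N are assumed to be excluded from the denominator.

```python
def gc_content(seq):
    """Percentage of G and C among the A/C/G/T bases of a sequence."""
    seq = seq.upper()
    called = sum(seq.count(base) for base in "ACGT")  # ignores N and other symbols
    if called == 0:
        raise ValueError("sequence contains no A, C, G or T")
    return 100.0 * (seq.count("G") + seq.count("C")) / called


if __name__ == "__main__":
    print(gc_content("GGCCAATTAT"))  # toy sequence, not the FF10T genome
```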
New technologies accelerate the exploration of non-coding RNAs in horticultural plants

Energy Technology Data Exchange (ETDEWEB)

Liu, Degao; Mewalal, Ritesh; Hu, Rongbin; Tuskan, Gerald A.; Yang, Xiaohan

2017-07-05

Non-coding RNAs (ncRNAs), that is, RNAs not translated into proteins, are crucial regulators of a variety of biological processes in plants. While protein-encoding genes have been relatively well-annotated in sequenced genomes, accounting for a small portion of the genome space in plants, the universe of plant ncRNAs is rapidly expanding. Recent advances in experimental and computational technologies have generated a great momentum for discovery and functional characterization of ncRNAs. Here we summarize the classification and known biological functions of plant ncRNAs, review the application of next-generation sequencing (NGS) technology and ribosome profiling technology to ncRNA discovery in horticultural plants and discuss the application of new technologies, especially the new genome-editing tool clustered regularly interspaced short palindromic repeat (CRISPR)/CRISPR-associated protein 9 (Cas9) systems, to functional characterization of plant ncRNAs.
Exercise Improves Host Response to Influenza Viral Infection in Obese and Non-Obese Mice through Different Mechanisms

Science.gov (United States)

Warren, Kristi J.; Olson, Molly M.; Thompson, Nicholas J.; Cahill, Mackenzie L.; Wyatt, Todd A.; Yoon, Kyoungjin J.; Loiacono, Christina M.; Kohut, Marian L.

2015-01-01

Obesity has been associated with greater severity of influenza virus infection and impaired host defense. Exercise may confer health benefits even when weight loss is not achieved, but it has not been determined if regular exercise improves immune defense against influenza A virus (IAV) in the obese condition. In this study, diet-induced obese mice and lean control mice exercised for eight weeks followed by influenza viral infection. Exercise reduced disease severity in both obese and non-obese mice, but the mechanisms differed. Exercise reversed the obesity-associated delay in bronchoalveolar-lavage (BAL) cell infiltration, restored BAL cytokine and chemokine production, and increased ciliary beat frequency and IFNα-related gene expression. In non-obese mice, exercise treatment reduced lung viral load, increased Type-I-IFN-related gene expression early during infection, but reduced BAL inflammatory cytokines and chemokines. In both obese and non-obese mice, exercise increased serum anti-influenza virus specific IgG2c antibody, increased CD8+ T cell percentage in BAL, and reduced TNFα by influenza viral NP-peptide-responding CD8+ T cells. Overall, the results suggest that exercise “restores” the immune response of obese mice to a phenotype similar to non-obese mice by improving the delay in immune activation. In contrast, in non-obese mice exercise treatment results in an early reduction in lung viral load and limited inflammatory response. PMID:26110868
Formation of a unique cluster of G-quadruplex structures in the HIV-1 Nef coding region: implications for antiviral activity.

Directory of Open Access Journals (Sweden)

Rosalba Perrone

G-quadruplexes are tetraplex structures of nucleic acids that can form in G-rich sequences. Their presence and functional role have been established in telomeres, oncogene promoters and coding regions of the human chromosome. In particular, they have been proposed to be directly involved in gene regulation at the level of transcription. Because the HIV-1 Nef protein is a fundamental factor for efficient viral replication, infectivity and pathogenesis in vitro and in vivo, we investigated G-quadruplex formation in the HIV-1 nef gene to assess the potential for viral inhibition through G-quadruplex stabilization. A comprehensive computational analysis of the nef coding region of available strains showed the presence of three conserved sequences that were uniquely clustered. Biophysical testing proved that G-quadruplex conformations were efficiently stabilized or induced by G-quadruplex ligands in all three sequences. Upon incubation with a G-quadruplex ligand, Nef expression was reduced in a reporter gene assay and Nef-dependent enhancement of HIV-1 infectivity was significantly repressed in an antiviral assay. These data constitute the first evidence of the possibility to regulate HIV-1 gene expression and infectivity through G-quadruplex targeting and therefore open a new avenue for viral treatment.
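The abstract does not say which algorithm was used for the computational analysis. As a rough illustration of how putative G-quadruplex-forming sequences are often located, the sketch below applies a widely used pattern of four runs of at least three guanines separated by loops of one to seven nucleotides. The pattern, the loop-length limits, and the function name are assumptions, not the authors' method.

```python
import re

# Assumed folding-rule pattern: four G-runs (>= 3 G each) separated by
# three loops of 1 to 7 nucleotides. U is accepted so RNA input also works.
G4_PATTERN = re.compile(
    r"G{3,}[ACGTU]{1,7}G{3,}[ACGTU]{1,7}G{3,}[ACGTU]{1,7}G{3,}"
)


def find_putative_g4(seq):
    """Return (start, matched_sequence) for non-overlapping pattern matches."""
    return [(m.start(), m.group()) for m in G4_PATTERN.finditer(seq.upper())]


if __name__ == "__main__":
    toy = "TTGGGAGGGTGGGAGGGTT"  # toy sequence, not taken from nef
    print(find_putative_g4(toy))
```

A regular-expression hit is only a candidate; whether the structure actually folds has to be confirmed experimentally, as the biophysical testing described above did.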
V-GAP: Viral genome assembly pipeline

KAUST Repository

Nakamura, Yoji

2015-10-22

Next-generation sequencing technologies have allowed the rapid determination of the complete genomes of many organisms. Although shotgun sequences from large genome organisms are still difficult to reconstruct perfect contigs each of which represents a full chromosome, those from small genomes have been assembled successfully into a very small number of contigs. In this study, we show that shotgun reads from phage genomes can be reconstructed into a single contig by controlling the number of read sequences used in de novo assembly. We have developed a pipeline to assemble small viral genomes with good reliability using a resampling method from shotgun data. This pipeline, named V-GAP (Viral Genome Assembly Pipeline), will contribute to the rapid genome typing of viruses, which are highly divergent, and thus will meet the increasing need for viral genome comparisons in metagenomic studies.
V-GAP: Viral genome assembly pipeline

KAUST Repository

Nakamura, Yoji; Yasuike, Motoshige; Nishiki, Issei; Iwasaki, Yuki; Fujiwara, Atushi; Kawato, Yasuhiko; Nakai, Toshihiro; Nagai, Satoshi; Kobayashi, Takanori; Gojobori, Takashi; Ototake, Mitsuru

2015-01-01

Next-generation sequencing technologies have allowed the rapid determination of the complete genomes of many organisms. Although shotgun sequences from large genome organisms are still difficult to reconstruct perfect contigs each of which represents a full chromosome, those from small genomes have been assembled successfully into a very small number of contigs. In this study, we show that shotgun reads from phage genomes can be reconstructed into a single contig by controlling the number of read sequences used in de novo assembly. We have developed a pipeline to assemble small viral genomes with good reliability using a resampling method from shotgun data. This pipeline, named V-GAP (Viral Genome Assembly Pipeline), will contribute to the rapid genome typing of viruses, which are highly divergent, and thus will meet the increasing need for viral genome comparisons in metagenomic studies.
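The idea of controlling the number of reads passed to the assembler can be sketched as repeated random subsampling. This is a generic illustration and not V-GAP itself: the function names, subset size, and replicate count are hypothetical, and the assembly step is deliberately left out.

```python
import random


def subsample_reads(reads, n, seed=0):
    """Draw n reads without replacement, reproducibly for a given seed."""
    if n >= len(reads):
        return list(reads)  # nothing to thin out
    return random.Random(seed).sample(reads, n)


def resample(reads, n, replicates, seed=0):
    """Build several independent subsets, each with its own seed."""
    return [subsample_reads(reads, n, seed + i) for i in range(replicates)]


if __name__ == "__main__":
    reads = [f"read{i}" for i in range(100)]  # placeholder read identifiers
    for subset in resample(reads, n=10, replicates=3):
        print(subset)
```

Each subset would then be assembled separately and the resulting contigs compared across replicates; that comparison is outside the scope of this sketch.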
Multiple Access Interference Reduction Using Received Response Code Sequence for DS-CDMA UWB System

Science.gov (United States)

Toh, Keat Beng; Tachikawa, Shin'ichi

This paper proposes a combination of novel Received Response (RR) sequence at the transmitter and a Matched Filter-RAKE (MF-RAKE) combining scheme receiver system for the Direct Sequence-Code Division Multiple Access Ultra Wideband (DS-CDMA UWB) multipath channel model. This paper also demonstrates the effectiveness of the RR sequence in Multiple Access Interference (MAI) reduction for the DS-CDMA UWB system. It suggests that by using conventional binary code sequence such as the M sequence or the Gold sequence, there is a possibility of generating extra MAI in the UWB system. Therefore, it is quite difficult to collect the energy efficiently although the RAKE reception method is applied at the receiver. The main purpose of the proposed system is to overcome the performance degradation for UWB transmission due to the occurrence of MAI during multiple accessing in the DS-CDMA UWB system. The proposed system improves the system performance by improving the RAKE reception performance using the RR sequence which can reduce the MAI effect significantly. Simulation results verify that significant improvement can be obtained by the proposed system in the UWB multipath channel models.
MicroRNA-encoding long non-coding RNAs

Directory of Open Access Journals (Sweden)

Zhu Xiaopeng

2008-05-01

Full Text Available Abstract Background Recent analysis of the mouse transcriptional data has revealed the existence of ~34,000 messenger-like non-coding RNAs (ml-ncRNAs). Whereas the functional properties of these ml-ncRNAs are beginning to be unravelled, no functional information is available for the large majority of these transcripts. Results A few ml-ncRNA have been shown to have genomic loci that overlap with microRNA loci, leading us to suspect that a fraction of ml-ncRNA may encode microRNAs. We therefore developed an algorithm (PriMir) for specifically detecting potential microRNA-encoding transcripts in the entire set of 34,030 mouse full-length ml-ncRNAs. In combination with mouse-rat sequence conservation, this algorithm detected 97 (80 of them were novel) strong miRNA-encoding candidates, and for 52 of these we obtained experimental evidence for the existence of their corresponding mature microRNA by microarray and stem-loop RT-PCR. Sequence analysis of the microRNA-encoding RNAs revealed an internal motif, whose presence correlates strongly (R² = 0.9, P-value = 2.2 × 10⁻¹⁶) with the occurrence of stem-loops with characteristics of known pre-miRNAs, indicating the presence of a larger number of microRNA-encoding RNAs (from 300 up to 800) in the ml-ncRNAs population. Conclusion Our work highlights a unique group of ml-ncRNAs and offers clues to their functions.
Use of profile hidden Markov models in viral discovery: current insights

Directory of Open Access Journals (Sweden)

Reyes A

2017-07-01

Full Text Available Alejandro Reyes,1–3 João Marcelo P Alves,4 Alan Mitchell Durham,5 Arthur Gruber4 1Department of Biological Sciences, Universidad de los Andes, Bogotá, Colombia; 2Department of Pathology and Immunology, Center for Genome Sciences and Systems Biology, Washington University in Saint Louis, St Louis, MO, USA; 3Max Planck Tandem Group in Computational Biology, Universidad de los Andes, Bogotá, Colombia; 4Department of Parasitology, Institute of Biomedical Sciences, 5Department of Computer Science, Institute of Mathematics and Statistics, Universidade de São Paulo, São Paulo, Brazil Abstract: Sequence similarity searches are the bioinformatic cornerstone of molecular sequence analysis for all domains of life. However, large amounts of divergence between organisms, such as those seen among viruses, can significantly hamper analyses. Profile hidden Markov models (profile HMMs) are among the most successful approaches for dealing with this problem, which represent an invaluable tool for viral identification efforts. Profile HMMs are statistical models that convert information from a multiple sequence alignment into a set of probability values that reflect position-specific variation levels in all members of evolutionarily related sequences. Since profile HMMs represent a wide spectrum of variation, these models show higher sensitivity than conventional similarity methods such as BLAST for the detection of remote homologs. In recent years, there has been an effort to compile viral sequences from different viral taxonomic groups into integrated databases, such as Prokaryotic Virus Orthologous Groups (pVOGs) and database of profile HMMs (vFam database), which provide functional annotation, multiple sequence alignments, and profile HMMs. Since these databases rely on viral sequences collected from GenBank and RefSeq, they suffer to a variable extent from uneven taxonomic sampling, with low sequence representation of many viral groups, which affects the
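To illustrate how a multiple sequence alignment can be turned into position-specific probability values, the sketch below computes per-column emission probabilities with a pseudocount. It is a deliberate simplification and not a full profile HMM: it has no insert or delete states and no transition probabilities. The nucleotide alphabet, the `pseudocount` value, and the toy alignment are all assumptions made for illustration.

```python
from collections import Counter

ALPHABET = "ACGT"  # assumed alphabet; a protein profile would use 20 residues


def column_probabilities(alignment, pseudocount=1.0):
    """Return one {symbol: probability} dict per alignment column.

    Gap characters and any symbol outside ALPHABET are ignored, and a
    pseudocount keeps unseen symbols from getting zero probability.
    """
    length = len(alignment[0])
    if any(len(row) != length for row in alignment):
        raise ValueError("all aligned sequences must have the same length")
    profile = []
    for col in range(length):
        counts = Counter(row[col] for row in alignment if row[col] in ALPHABET)
        total = sum(counts.values()) + pseudocount * len(ALPHABET)
        profile.append({s: (counts[s] + pseudocount) / total for s in ALPHABET})
    return profile


# Toy alignment (assumed data): three sequences, four columns.
toy_alignment = ["ACGT", "ACGA", "ACGT"]
profile = column_probabilities(toy_alignment)
```

Columns where the aligned sequences agree give one symbol most of the probability mass, while variable columns spread it out, which is the position-specific variation the abstract refers to.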
LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Task 1.4.2 Report

Energy Technology Data Exchange (ETDEWEB)

Slezak, T; Borucki, M; Lam, M; Lenhoff, R; Vitalis, E

2010-01-26

Good progress has been made on both bacterial and viral sequencing by the TMTI centers. While access to appropriate samples is a limiting factor to throughput, excellent progress has been made with respect to getting agreements in place with key sources of relevant materials. Sharing of sequenced genomes funded by TMTI has been extremely limited to date. The April 2010 exercise should force a resolution to this, but additional managerial pressures may be needed to ensure that rapid sharing of TMTI-funded sequencing occurs, regardless of collaborator constraints concerning ultimate publication(s). Policies to permit TMTI-internal rapid sharing of sequenced genomes should be written into all TMTI agreements with collaborators now being negotiated. TMTI needs to establish a Web-based system for tracking samples destined for sequencing. This includes metadata on sample origins and contributor, information on sample shipment/receipt, prioritization by TMTI, assignment to one or more sequencing centers (including possible TMTI-sponsored sequencing at a contributor site), and status history of the sample sequencing effort. While this system could be a component of the AFRL system, it is not part of any current development effort. Policy and standardized procedures are needed to ensure appropriate verification of all TMTI samples prior to the investment in sequencing. PCR, arrays, and classical biochemical tests are examples of potential verification methods. Verification is needed to detect mislabeled, degraded, mixed or contaminated samples. Regular QC exercises are needed to ensure that the TMTI-funded centers are meeting all standards for producing quality genomic sequence data.
Comparison of Nucleotide Sequence of P2C Region in Diabetogenic and Non-Diabetogenic Coxsackie Virus B5 Isolates

Directory of Open Access Journals (Sweden)

Cheng-Chong Chou

2004-11-01

Full Text Available Enteroviruses are environmental triggers in the pathogenesis of type 1 diabetes mellitus (DM). A sequence of six identical amino acids (PEVKEK) is shared by the 2C protein of Coxsackie virus B and the glutamic acid decarboxylase (GAD) molecules. Between 1995 and 2002, we investigated 22 Coxsackie virus B5 (CVB5) isolates from southern Taiwan. Four of these isolates were obtained from four new-onset type 1 DM patients with diabetic ketoacidosis. We compared a 300 nucleotide sequence in the 2C protein gene (p2C) in 24 CVB5 isolates (4 diabetogenic, 18 non-diabetogenic and 2 prototype). We found 0.3-10% nucleotide differences. In the four isolates from type 1 DM patients, there was only 2.4-3.4% nucleotide difference, and there was only 1.7-7.1% nucleotide difference between type 1 DM isolates and non-diabetogenic isolates. Comparison of the nucleotide sequence between prototype virus and 22 CVB5 isolates revealed 18.4-24.1% difference. Twenty-one CVB5 isolates from type 1 DM and non-type 1 DM patients contained the PEVKEK sequence, as shown by the p2C nucleotide sequence. Our data showed that the viral p2C sequence with homology with GAD is highly conserved in CVB5 isolates. There was no difference between diabetogenic and non-diabetogenic CVB5 isolates. All four type 1 DM patients had at least one of the genetic susceptibility alleles HLA-DR, DQA1, DQB1. Other genetic and autoimmune factors such as HLA genetic susceptibility and GAD may also play important roles in the pathogenesis in type 1 DM.
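The percentage nucleotide differences quoted above are pairwise comparisons over an aligned region. A minimal sketch of that arithmetic follows, assuming the sequences are already aligned to equal length. The function names and the toy sequences are hypothetical, and the abstract does not state which software the authors used.

```python
from itertools import combinations


def percent_difference(seq_a, seq_b):
    """Percentage of aligned positions at which two sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must be non-empty")
    mismatches = sum(a != b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * mismatches / len(seq_a)


def pairwise_range(seqs):
    """Lowest and highest pairwise percent difference in a set of sequences."""
    values = [
        percent_difference(seqs[x], seqs[y]) for x, y in combinations(seqs, 2)
    ]
    return min(values), max(values)


# Toy 10-nucleotide sequences (assumed data, not from the study).
toy = {
    "isolate_1": "ACGTACGTAC",
    "isolate_2": "ACGTACGTAT",
    "isolate_3": "ACGAACGTAT",
}
low, high = pairwise_range(toy)
```

Reporting the minimum and maximum over all pairs gives a range in the same form as the figures in the abstract, such as 2.4-3.4% within one group of isolates.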

VirSorter: mining viral signal from microbial genomic data

Directory of Open Access Journals (Sweden)

Simon Roux

2015-05-01

Full Text Available Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made
VirSorter: mining viral signal from microbial genomic data

Science.gov (United States)

Roux, Simon; Enault, Francois; Hurwitz, Bonnie L.

2015-01-01

Viruses of microbes impact all ecosystems where microbes drive key energy and substrate transformations including the oceans, humans and industrial fermenters. However, despite this recognized importance, our understanding of viral diversity and impacts remains limited by too few model systems and reference genomes. One way to fill these gaps in our knowledge of viral diversity is through the detection of viral signal in microbial genomic data. While multiple approaches have been developed and applied for the detection of prophages (viral genomes integrated in a microbial genome), new types of microbial genomic data are emerging that are more fragmented and larger scale, such as Single-cell Amplified Genomes (SAGs) of uncultivated organisms or genomic fragments assembled from metagenomic sequencing. Here, we present VirSorter, a tool designed to detect viral signal in these different types of microbial sequence data in both a reference-dependent and reference-independent manner, leveraging probabilistic models and extensive virome data to maximize detection of novel viruses. Performance testing shows that VirSorter’s prophage prediction capability compares to that of available prophage predictors for complete genomes, but is superior in predicting viral sequences outside of a host genome (i.e., from extrachromosomal prophages, lytic infections, or partially assembled prophages). Furthermore, VirSorter outperforms existing tools for fragmented genomic and metagenomic datasets, and can identify viral signal in assembled sequence (contigs) as short as 3kb, while providing near-perfect identification (>95% Recall and 100% Precision) on contigs of at least 10kb. Because VirSorter scales to large datasets, it can also be used in “reverse” to more confidently identify viral sequence in viral metagenomes by sorting away cellular DNA whether derived from gene transfer agents, generalized transduction or contamination. Finally, VirSorter is made available through the i
Comparative genomics beyond sequence-based alignments

DEFF Research Database (Denmark)

Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.

2008-01-01

Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...
Code of practice for safety in laboratory - non ionising radiation

International Nuclear Information System (INIS)

Ramli Jaya; Mohd Yusof Mohd Ali; Khoo Boo Huat; Khatijah Hashim

1995-01-01

The code identifies the non-ionizing radiation encountered in laboratories and the associated hazards. The code is intended as a laboratory standard reference document for general information on safety requirements relating to the usage of non-ionizing radiations in laboratories. The non-ionizing radiations covered in this code are ultraviolet radiation, visible light, radio-frequency radiation, lasers, sound waves and ultrasonic radiation. (author)
Non-contiguous finished genome sequence and description of Collinsella massiliensis sp. nov.

Science.gov (United States)

Padmanabhan, Roshan; Dubourg, Gregory; Nguyen, Thi-Thien; Couderc, Carine; Rossi-Tamisier, Morgane; Caputo, Aurelia; Raoult, Didier; Fournier, Pierre-Edouard

2014-06-15

Collinsella massiliensis strain GD3(T) is the type strain of Collinsella massiliensis sp. nov., a new species within the genus Collinsella. This strain, whose genome is described here, was isolated from the fecal flora of a 53-year-old French Caucasoid woman who had been admitted to an intensive care unit for Guillain-Barré syndrome. Collinsella massiliensis is a Gram-positive, obligate anaerobic, non-motile and non-sporulating bacillus. Here, we describe the features of this organism, together with the complete genome sequence and annotation. The genome is 2,319,586 bp long (1 chromosome, no plasmid), exhibits a G+C content of 65.8% and contains 2,003 protein-coding and 54 RNA genes, including 1 rRNA operon.
Inter- and intra-host viral diversity in a large seasonal DENV2 outbreak.

Directory of Open Access Journals (Sweden)

Camila Malta Romano

Full Text Available BACKGROUND: High genetic diversity at both inter- and intra-host levels is a hallmark of RNA viruses due to the error-prone nature of their genome replication. Several groups have evaluated the extent of viral variability using different RNA virus deep sequencing methods. Although much of this effort has been dedicated to pathogens that cause chronic infections in humans, few studies investigated arthropod-borne, acute viral infections. METHODS AND PRINCIPAL FINDINGS: We deep sequenced the complete genome of ten DENV2 isolates from representative classical and severe cases sampled in a large outbreak in Brazil using two different approaches. Analysis of the consensus genomes confirmed the larger extent of the 2010 epidemic in comparison to a previous epidemic caused by the same viruses in another city two years before (genetic distance = 0.002 and 0.0008 respectively). Analysis of viral populations within the host revealed a high level of conservation. After excluding homopolymer regions of 454/Roche generated sequences, we found 10 to 44 variable sites per genome population at a frequency of >1%, resulting in very low intra-host genetic diversity. While up to 60% of all variable sites at intra-host level were non-synonymous changes, only 10% of inter-host variability resulted from non-synonymous mutations, indicative of purifying selection at the population level. CONCLUSIONS AND SIGNIFICANCE: Despite the error-prone nature of RNA-dependent RNA-polymerase, dengue viruses maintain low levels of intra-host variability.
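Counting variable sites above a 1% frequency threshold can be sketched as follows, assuming per-position base counts have already been extracted from the aligned deep-sequencing reads. The function name, the input layout, and the toy counts are hypothetical, and the exact definition used in the study (for example, how homopolymer regions were masked) is not given in the abstract.

```python
def variable_sites(site_counts, min_freq=0.01):
    """Return 1-based positions where a non-consensus base exceeds min_freq.

    site_counts: one {base: read_count} dict per genome position.
    """
    sites = []
    for pos, counts in enumerate(site_counts, start=1):
        depth = sum(counts.values())
        if depth == 0:
            continue  # no coverage at this position
        ranked = sorted(counts.values(), reverse=True)
        # Most frequent base is treated as consensus; take the next one.
        top_minor = ranked[1] if len(ranked) > 1 else 0
        if top_minor / depth > min_freq:
            sites.append(pos)
    return sites


# Toy counts for three positions (assumed data).
toy_counts = [
    {"A": 1000},            # invariant
    {"A": 980, "G": 20},    # minor base at 2%
    {"C": 995, "T": 5},     # minor base at 0.5%, below threshold
]
variable = variable_sites(toy_counts)
```

In practice the threshold has to sit above the sequencing error rate of the platform, otherwise errors would be counted as true intra-host variants.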
Novel microRNA-like viral small regulatory RNAs arising during human hepatitis A virus infection.

Science.gov (United States)

Shi, Jiandong; Sun, Jing; Wang, Bin; Wu, Meini; Zhang, Jing; Duan, Zhiqing; Wang, Haixuan; Hu, Ningzhu; Hu, Yunzhang

2014-10-01

MicroRNAs (miRNAs), including host miRNAs and viral miRNAs, play vital roles in regulating host-virus interactions. DNA viruses encode miRNAs that regulate the viral life cycle. However, it is generally believed that cytoplasmic RNA viruses do not encode miRNAs, owing to inaccessible cellular miRNA processing machinery. Here, we provide a comprehensive genome-wide analysis and identification of miRNAs that were derived from hepatitis A virus (HAV; Hu/China/H2/1982), which is a typical cytoplasmic RNA virus. Using deep-sequencing and in silico approaches, we identified 2 novel virally encoded miRNAs, named hav-miR-1-5p and hav-miR-2-5p. Both of the novel virally encoded miRNAs were clearly detected in infected cells. Analysis of Dicer enzyme silencing demonstrated that HAV-derived miRNA biogenesis is Dicer dependent. Furthermore, we confirmed that HAV mature miRNAs were generated from viral miRNA precursors (pre-miRNAs) in host cells. Notably, naturally derived HAV miRNAs were biologically and functionally active and induced post-transcriptional gene silencing (PTGS). Genomic location analysis revealed novel miRNAs located in the coding region of the viral genome. Overall, our results show that HAV naturally generates functional miRNA-like small regulatory RNAs during infection. This is the first report of miRNAs derived from the coding region of genomic RNA of a cytoplasmic RNA virus. These observations demonstrate that a cytoplasmic RNA virus can naturally generate functional miRNAs, as DNA viruses do. These findings also contribute to improved understanding of host-RNA virus interactions mediated by RNA virus-derived miRNAs. © FASEB.
Development of non-linear vibration analysis code for CANDU fuelling machine

International Nuclear Information System (INIS)

Murakami, Hajime; Hirai, Takeshi; Horikoshi, Kiyomi; Mizukoshi, Kaoru; Takenaka, Yasuo; Suzuki, Norio.

1988-01-01

This paper describes the development of a non-linear, dynamic analysis code for the CANDU 600 fuelling machine (F-M), which includes a number of non-linearities such as gap with or without Coulomb friction, special multi-linear spring connections, etc. The capabilities and features of the code and the mathematical treatment for the non-linearities are explained. The modeling and numerical methodology for the non-linearities employed in the code are verified experimentally. Finally, the simulation analyses for the full-scale F-M vibration testing are carried out, and the applicability of the code to such multi-degree of freedom systems as F-M is demonstrated. (author)
Non-Coding Transcript Heterogeneity in Mesothelioma: Insights from Asbestos-Exposed Mice.

Science.gov (United States)

Felley-Bosco, Emanuela; Rehrauer, Hubert

2018-04-11

Mesothelioma is an aggressive, rapidly fatal cancer, and a better understanding of its molecular heterogeneity may help in developing more efficient therapeutic strategies. Non-coding RNAs represent a larger part of the transcriptome but their contribution to diseases is not fully understood yet. We used recently obtained RNA-seq data from asbestos-exposed mice and performed data mining of publicly available datasets in order to evaluate how non-coding RNAs contribute to mesothelioma heterogeneity. Nine non-coding RNAs are specifically elevated in mesothelioma tumors and contribute to human mesothelioma heterogeneity. Because some of them have known oncogenic properties, this study supports the concept of non-coding RNAs as cancer progenitor genes.
Recombination and population mosaic of a multifunctional viral gene, adeno-associated virus cap.

Directory of Open Access Journals (Sweden)

Yasuhiro Takeuchi

Full Text Available Homologous recombination is a dominant force in evolution and results in genetic mosaics. To detect evidence of recombination events and assess the biological significance of genetic mosaics, genome sequences for various viral populations of reasonably large size are now available in the GenBank. We studied a multi-functional viral gene, the adeno-associated virus (AAV) cap gene, which codes for three capsid proteins, VP1, VP2 and VP3. VP1-3 share a common C-terminal domain corresponding to VP3, which forms the viral core structure, while the VP1 unique N-terminal part contains an enzymatic domain with phospholipase A2 activity. Our recombinant detection program (RecI) revealed five novel recombination events, four of which have their cross-over points in the N-terminal, VP1 and VP2 unique region. Comparison of phylogenetic trees for different cap gene regions confirmed discordant phylogenies for the recombinant sequences. Furthermore, differences in the phylogenetic tree structures for the VP1 unique (VP1u) region and the rest of cap highlighted the mosaic nature of cap gene in the AAV population: two dominant forms of VP1u sequences were identified and these forms are linked to diverse sequences in the rest of cap gene. This observation together with the finding of frequent recombination in the VP1 and 2 unique regions suggests that this region is a recombination hot spot. Recombination events in this region preserve protein blocks of distinctive functions and contribute to convergence in VP1u and divergence of the rest of cap. Additionally the possible biological significance of two dominant VP1u forms is inferred.
[Transposition errors during learning to reproduce a sequence by the right- and the left-hand movements: simulation of positional and movement coding].

Science.gov (United States)

Liakhovetskiĭ, V A; Bobrova, E V; Skopin, G N

2012-01-01

Transposition errors during the reproduction of a hand movement sequence make it possible to receive important information on the internal representation of this sequence in the motor working memory. Analysis of such errors showed that learning to reproduce sequences of the left-hand movements improves the system of positional coding (coding of positions), while learning of the right-hand movements improves the system of vector coding (coding of movements). Learning of the right-hand movements after the left-hand performance involved the system of positional coding "imposed" by the left hand. Learning of the left-hand movements after the right-hand performance activated the system of vector coding. Transposition errors during learning to reproduce movement sequences can be explained by a neural network using either vector coding or both vector and positional coding.
LLNL Genomic Assessment: Viral and Bacterial Sequencing Needs for TMTI, Tier 1 Report

Energy Technology Data Exchange (ETDEWEB)

Slezak, T; Borucki, M; Lenhoff, R; Vitalis, E

2009-09-29

The Lawrence Livermore National Lab Bioinformatics group has recently taken on a role in DTRA's Transformation Medical Technologies Initiative (TMTI). The high-level goal of TMTI is to accelerate the development of broad-spectrum countermeasures. To achieve those goals, TMTI has a near term need to obtain more sequence information across a large range of pathogens, near neighbors, and across a broad geographical and host range. Our role in this project is to research available sequence data for the organisms of interest and identify critical microbial sequence and knowledge gaps that need to be filled to meet TMTI objectives. This effort includes: (1) assessing current genomic sequence for each agent including phylogenetic and geographical diversity, host range, date of isolation range, virulence, sequence availability of key near neighbors, and other characteristics; (2) identifying Subject Matter Experts (SMEs) and potential holders of isolate collections, contacting appropriate SMEs with known expertise and isolate collections to obtain information on isolate availability and specific recommendations; (3) identifying sequence as well as knowledge gaps (e.g., virulence, host range, and antibiotic resistance determinants); (4) providing specific recommendations as to the most valuable strains to be placed on the DTRA sequencing queue. We acknowledge that criteria for prioritization of isolates for sequencing fall into two categories aligning with priority queues 1 and 2 as described in the summary. (Priority queue 0 relates to DTRA operational isolates whose availability is not predictable in advance.) 1. Selection of isolates that appear to have likelihood to provide information on virulence and antibiotic resistance. This will include sequence of known virulent strains. Particularly valuable would be virulent strains that have genetically similar yet avirulent, or non human transmissible, counterparts that can be used for comparison to help
Direct metagenomic detection of viral pathogens in nasal and fecal specimens using an unbiased high-throughput sequencing approach.

Directory of Open Access Journals (Sweden)

Shota Nakamura

Full Text Available With the severe acute respiratory syndrome epidemic of 2003 and renewed attention on avian influenza viral pandemics, new surveillance systems are needed for the earlier detection of emerging infectious diseases. We applied a "next-generation" parallel sequencing platform for viral detection in nasopharyngeal and fecal samples collected during seasonal influenza virus (Flu) infections and norovirus outbreaks from 2005 to 2007 in Osaka, Japan. Random RT-PCR was performed to amplify RNA extracted from 0.1-0.25 ml of nasopharyngeal aspirates (N = 3) and fecal specimens (N = 5), and more than 10 microg of cDNA was synthesized. Unbiased high-throughput sequencing of these 8 samples yielded 15,298-32,335 (average 24,738) reads in a single 7.5 h run. In nasopharyngeal samples, although whole genome analysis was not available because the majority (>90%) of reads were host genome-derived, 20-460 Flu-reads were detected, which was sufficient for subtype identification. In fecal samples, bacteria and host cells were removed by centrifugation, resulting in gain of 484-15,260 reads of norovirus sequence (78-98% of the whole genome was covered), except for one specimen that was under-detectable by RT-PCR. These results suggest that our unbiased high-throughput sequencing approach is useful for directly detecting pathogenic viruses without advance genetic information. Although its cost and technological availability make it unlikely that this system will very soon be the diagnostic standard worldwide, this system could be useful for the earlier discovery of novel emerging viruses and bioterrorism, which are difficult to detect with conventional procedures.
Verona Coding Definitions of Emotional Sequences (VR-CoDES): Conceptual framework and future directions.

Science.gov (United States)

Piccolo, Lidia Del; Finset, Arnstein; Mellblom, Anneli V; Figueiredo-Braga, Margarida; Korsvold, Live; Zhou, Yuefang; Zimmermann, Christa; Humphris, Gerald

2017-12-01

To discuss the theoretical and empirical framework of VR-CoDES and potential future direction in research based on the coding system. The paper is based on selective review of papers relevant to the construction and application of VR-CoDES. VR-CoDES system is rooted in patient-centered and biopsychosocial model of healthcare consultations and on a functional approach to emotion theory. According to the VR-CoDES, emotional interaction is studied in terms of sequences consisting of an eliciting event, an emotional expression by the patient and the immediate response by the clinician. The rationale for the emphasis on sequences, on detailed classification of cues and concerns, and on the choices of explicit vs. non-explicit responses and providing vs. reducing room for further disclosure, as basic categories of the clinician responses, is described. Results from research on VR-CoDES may help raise awareness of emotional sequences. Future directions in applying VR-CoDES in research may include studies on predicting patient and clinician behavior within the consultation, qualitative analyses of longer sequences including several VR-CoDES triads, and studies of effects of emotional communication on health outcomes. VR-CoDES may be applied to develop interventions to promote good handling of patients' emotions in healthcare encounters. Copyright © 2017 Elsevier B.V. All rights reserved.
lncRScan-SVM: A Tool for Predicting Long Non-Coding RNAs Using Support Vector Machine.

Science.gov (United States)

Sun, Lei; Liu, Hui; Zhang, Lin; Meng, Jia

2015-01-01

Functional long non-coding RNAs (lncRNAs) have been bringing novel insight into biological study, however it is still not trivial to accurately distinguish the lncRNA transcripts (LNCTs) from the protein coding ones (PCTs). As various information and data about lncRNAs are preserved by previous studies, it is appealing to develop novel methods to identify the lncRNAs more accurately. Our method lncRScan-SVM aims at classifying PCTs and LNCTs using support vector machine (SVM). The gold-standard datasets for lncRScan-SVM model training, lncRNA prediction and method comparison were constructed according to the GENCODE gene annotations of human and mouse respectively. By integrating features derived from gene structure, transcript sequence, potential codon sequence and conservation, lncRScan-SVM outperforms other approaches, which is evaluated by several criteria such as sensitivity, specificity, accuracy, Matthews correlation coefficient (MCC) and area under curve (AUC). In addition, several known human lncRNA datasets were assessed using lncRScan-SVM. LncRScan-SVM is an efficient tool for predicting the lncRNAs, and it is quite useful for current lncRNA study.
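Most of the evaluation criteria named above follow directly from a binary confusion matrix. The sketch below computes sensitivity, specificity, accuracy and MCC using their standard definitions. The function name and the toy counts are assumptions for illustration and are not results from lncRScan-SVM. AUC is left out because it needs per-transcript scores rather than counts.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts.

    Here a "positive" could be, for example, a transcript called lncRNA.
    """
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # MCC is conventionally set to 0 when any marginal count is zero.
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Toy counts (assumed): 50 true lncRNAs and 50 true protein-coding transcripts.
metrics = classification_metrics(tp=40, fp=5, tn=45, fn=10)
```

MCC is useful alongside accuracy because it stays informative when the two classes are of very different sizes.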
Coding chaotic billiards. Pt. 3

International Nuclear Information System (INIS)

Ullmo, D.; Giannoni, M.J.

1993-01-01

Non-tiling compact billiard defined on the pseudosphere is studied 'a la Morse coding'. As for most bounded systems, the coding is non exact. However, two sets of approximate grammar rules can be obtained, one specifying forbidden codes, and the other allowed ones. In-between some sequences remain in the 'unknown' zone, but their relative amount can be reduced to zero if one lets the length of the approximate grammar rules go to infinity. The relationship between these approximate grammar rules and the 'pruning front' introduced by Cvitanovic et al. is discussed. (authors). 13 refs., 10 figs., 1 tab
Analysis of the AD sequence in Zion plant using the March 1.1 code

International Nuclear Information System (INIS)

Oriolo, F.; Paci, S.

1985-01-01

This report presents the analyses of the AD sequences for the Zion power plant, made at Pisa University in the framework of its participation in the Source Term Working Group. After a short description of the plant and the sequence under analysis, the model used for the reference computation and the results obtained using the March 1.1 code are shown. Together with the reference computation, a series of parametric tests has also been made concerning some input code variables, in order to ascertain their influence on the transient trend. The results of these analyses are shown in Appendix
Min-Max decoding for non binary LDPC codes

OpenAIRE

Savin, Valentin

2008-01-01

Iterative decoding of non-binary LDPC codes is currently performed using either the Sum-Product or the Min-Sum algorithms or slightly different versions of them. In this paper, several low-complexity quasi-optimal iterative algorithms are proposed for decoding non-binary codes. The Min-Max algorithm is one of them and it has the benefit of two possible LLR domain implementations: a standard implementation, whose complexity scales as the square of the Galois field's cardinality and a reduced c...
Sequence Coding and Search System for licensee event reports: coder's manual. Volume 4

International Nuclear Information System (INIS)

Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

1985-04-01

Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This four volume report documents and describes SCSS in detail. Volumes 3 and 4 provide a technical processor, new to SCSS, the information and methodology necessary to capture descriptive data from the LER and to codify that data into a structured format and serve as reference material for the more experienced technical processor, and contain information that is essential for the more advanced user who needs to be familiar with the intricate coding techniques in order to retrieve specific details in a sequence. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 4
Sequence Coding and Search System for licensee event reports: coder's manual. Volume 3

International Nuclear Information System (INIS)

Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

1985-04-01

Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This four-volume report documents and describes SCSS in detail. Volumes 3 and 4 provide a technical processor, new to SCSS, the information and methodology necessary to capture descriptive data from the LER and to codify that data into a structured format. They also serve as reference material for the more experienced technical processor, and contain information that is essential for the more advanced user who needs to be familiar with the intricate coding techniques in order to retrieve specific details in a sequence. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 3

Sequence Coding and Search System Backfit Quality Assurance Program Plan

International Nuclear Information System (INIS)

Lovell, C.J.; Stepina, P.L.

1985-03-01

The Sequence Coding and Search System is a computer-based encoding system for events described in Licensee Event Reports (LERs). This data system contains LERs from 1981 to present. Backfit of the data system to include LERs prior to 1981 is required. This report documents the Quality Assurance Program Plan that EG&G Idaho, Inc. will follow while encoding 1980 LERs.
microRNA dependent and independent deregulation of long non-coding RNAs by an oncogenic herpesvirus.

Directory of Open Access Journals (Sweden)

Sunantha Sethuraman

2017-07-01

Kaposi's sarcoma (KS) is a highly prevalent cancer in AIDS patients, especially in sub-Saharan Africa. Kaposi's sarcoma-associated herpesvirus (KSHV) is the etiological agent of KS and other cancers like Primary Effusion Lymphoma (PEL). In KS and PEL, all tumors harbor latent KSHV episomes and express latency-associated viral proteins and microRNAs (miRNAs). The exact molecular mechanisms by which latent KSHV drives tumorigenesis are not completely understood. Recent developments have highlighted the importance of aberrant long non-coding RNA (lncRNA) expression in cancer. Deregulation of lncRNAs by miRNAs is a newly described phenomenon. We hypothesized that KSHV-encoded miRNAs deregulate human lncRNAs to drive tumorigenesis. We performed lncRNA expression profiling of endothelial cells infected with wild-type and miRNA-deleted KSHV and identified 126 lncRNAs as putative viral miRNA targets. Here we show that KSHV deregulates host lncRNAs both in a miRNA-dependent fashion by direct interaction and in a miRNA-independent fashion through latency-associated proteins. Several lncRNAs that were previously implicated in cancer, including MEG3, ANRIL and UCA1, are deregulated by KSHV. Our results also demonstrate that KSHV-mediated UCA1 deregulation contributes to increased proliferation and migration of endothelial cells.
Effects of metal-rich particulate matter exposure on exogenous and endogenous viral sequence methylation in healthy steel-workers.

Science.gov (United States)

Mercorio, Roberta; Bonzini, Matteo; Angelici, Laura; Iodice, Simona; Delbue, Serena; Mariani, Jacopo; Apostoli, Pietro; Pesatori, Angela Cecilia; Bollati, Valentina

2017-11-01

Inhaled particles have been shown to produce systemic changes in DNA methylation. Global hypomethylation has been associated with viral sequence reactivation, possibly linked to the activation of pro-inflammatory pathways occurring after exposure. This observation provides a rationale to investigate viral sequence (both exogenous and endogenous) methylation in association with metal-rich particulate matter exposure. To verify this hypothesis, we chose the Wp promoter of the Epstein-Barr Virus (EBV-Wp) and the promoter of the human-endogenous-retrovirus w (HERV-w), respectively as a paradigm of an exogenous and an endogenous retroviral sequence, to be investigated by bisulfite PCR Pyrosequencing. We enrolled 63 male workers in an electric furnace steel plant, exposed to high levels of metal-rich particulate matter. Comparing samples obtained on the first day of a work week (time 0-baseline, after 2 days off work) and the samples obtained after 3 days of work (time 1-post exposure), the mean methylation of EBV-Wp was significantly higher at baseline compared to post-exposure (mean baseline = 56.7%5mC; mean post-exposure = 47.9%5mC; p-value = 0.009), whereas the mean methylation of HERV-w did not significantly differ. Individual exposure to inhalable particles and metals was estimated based on measures in all working areas and time spent by the study subjects in each area. In a regression model adjusted for age, body mass index and smoking, PM and metal components had a positive association with EBV-Wp methylation (i.e. PM10: β = 5.99, p-value < 0.038; nickel: β = 17.82, p-value = 0.02; arsenic: β = 13.59, p-value < 0.015). The difference observed comparing baseline and post-exposure samples may be suggestive of a rapid change in EBV methylation induced by air particles, while correlation between EBV methylation and PM/metal exposure may represent a more stable adaptive mechanism. Future studies investigating a larger panel of viral sequences could better elucidate
Development and preliminary evaluation of a multiplexed amplification and next generation sequencing method for viral hemorrhagic fever diagnostics.

Directory of Open Access Journals (Sweden)

Annika Brinkmann

2017-11-01

We describe the development and evaluation of a novel method for targeted amplification and Next Generation Sequencing (NGS)-based identification of viral hemorrhagic fever (VHF) agents and assess the feasibility of this approach in diagnostics. An ultrahigh-multiplex panel was designed with primers to amplify all known variants of VHF-associated viruses and relevant controls. The performance of the panel was evaluated via serially quantified nucleic acids from Yellow fever virus, Rift Valley fever virus, Crimean-Congo hemorrhagic fever (CCHF) virus, Ebola virus, Junin virus and Chikungunya virus in a semiconductor-based sequencing platform. A comparison of direct NGS and targeted amplification-NGS was performed. The panel was further tested via a real-time nanopore sequencing-based platform, using clinical specimens from CCHF patients. The multiplex primer panel comprises two pools of 285 and 256 primer pairs for the identification of 46 virus species causing hemorrhagic fevers, encompassing 6,130 genetic variants of the strains involved. In silico validation revealed that the panel detected over 97% of all known genetic variants of the targeted virus species. High levels of specificity and sensitivity were observed for the tested virus strains. Targeted amplification ensured viral read detection in specimens with the lowest virus concentration (1-10 genome equivalents) and enabled significant increases in specific reads over background for all viruses investigated. In clinical specimens, the panel enabled detection of the causative agent and its characterization within 10 minutes of sequencing, with sample-to-result time of less than 3.5 hours. Virus enrichment via targeted amplification followed by NGS is an applicable strategy for the diagnosis of VHFs which can be adapted for high-throughput or nanopore sequencing platforms and employed for surveillance or outbreak monitoring.
Development and preliminary evaluation of a multiplexed amplification and next generation sequencing method for viral hemorrhagic fever diagnostics.

Science.gov (United States)

Brinkmann, Annika; Ergünay, Koray; Radonić, Aleksandar; Kocak Tufan, Zeliha; Domingo, Cristina; Nitsche, Andreas

2017-11-01

We describe the development and evaluation of a novel method for targeted amplification and Next Generation Sequencing (NGS)-based identification of viral hemorrhagic fever (VHF) agents and assess the feasibility of this approach in diagnostics. An ultrahigh-multiplex panel was designed with primers to amplify all known variants of VHF-associated viruses and relevant controls. The performance of the panel was evaluated via serially quantified nucleic acids from Yellow fever virus, Rift Valley fever virus, Crimean-Congo hemorrhagic fever (CCHF) virus, Ebola virus, Junin virus and Chikungunya virus in a semiconductor-based sequencing platform. A comparison of direct NGS and targeted amplification-NGS was performed. The panel was further tested via a real-time nanopore sequencing-based platform, using clinical specimens from CCHF patients. The multiplex primer panel comprises two pools of 285 and 256 primer pairs for the identification of 46 virus species causing hemorrhagic fevers, encompassing 6,130 genetic variants of the strains involved. In silico validation revealed that the panel detected over 97% of all known genetic variants of the targeted virus species. High levels of specificity and sensitivity were observed for the tested virus strains. Targeted amplification ensured viral read detection in specimens with the lowest virus concentration (1-10 genome equivalents) and enabled significant increases in specific reads over background for all viruses investigated. In clinical specimens, the panel enabled detection of the causative agent and its characterization within 10 minutes of sequencing, with sample-to-result time of less than 3.5 hours. Virus enrichment via targeted amplification followed by NGS is an applicable strategy for the diagnosis of VHFs which can be adapted for high-throughput or nanopore sequencing platforms and employed for surveillance or outbreak monitoring.
FishPathogens.eu/vhsv: a user-friendly viral haemorrhagic septicaemia virus isolate and sequence database

DEFF Research Database (Denmark)

Jonstrup, Søren Peter; Gray, Tanya; Kahns, Søren

2009-01-01

A database has been created, http://www.FishPathogens.eu, with the aim of providing a single repository for collating important information on significant pathogens of aquaculture, relevant to their control and management. This database will be developed, maintained and managed as part of the European Community Reference Laboratory for Fish Diseases function. This concept has been initially developed for viral haemorrhagic septicaemia virus and will be extended in future to include information on other significant aquaculture pathogens. Information included for each isolate comprises sequence...... to obtain data from any selected part of the genome of interest. The output of the sequence search can be readily retrieved as a FASTA file ready to be imported into a sequence alignment tool of choice, facilitating further molecular epidemiological study.
The Canonical Immediate Early 3 Gene Product pIE611 of Mouse Cytomegalovirus Is Dispensable for Viral Replication but Mediates Transcriptional and Posttranscriptional Regulation of Viral Gene Products.

Science.gov (United States)

Rattay, Stephanie; Trilling, Mirko; Megger, Dominik A; Sitek, Barbara; Meyer, Helmut E; Hengel, Hartmut; Le-Trilling, Vu Thuy Khanh

2015-08-01

Transcription of mouse cytomegalovirus (MCMV) immediate early ie1 and ie3 is controlled by the major immediate early promoter/enhancer (MIEP) and requires differential splicing. Based on complete loss of genome replication of an MCMV mutant carrying a deletion of the ie3-specific exon 5, the multifunctional IE3 protein (611 amino acids; pIE611) is considered essential for viral replication. Our analysis of ie3 transcription resulted in the identification of novel ie3 isoforms derived from alternatively spliced ie3 transcripts. Construction of an IE3-hemagglutinin (IE3-HA) virus by insertion of an in-frame HA epitope sequence allowed detection of the IE3 isoforms in infected cells, verifying that the newly identified transcripts code for proteins. This prompted the construction of an MCMV mutant lacking ie611 but retaining the coding capacity for the newly identified isoforms ie453 and ie310. Using Δie611 MCMV, we demonstrated the dispensability of the canonical ie3 gene product pIE611 for viral replication. To determine the role of pIE611 for viral gene expression during MCMV infection in an unbiased global approach, we used label-free quantitative mass spectrometry to delineate pIE611-dependent changes of the MCMV proteome. Interestingly, further analysis revealed transcriptional as well as posttranscriptional regulation of MCMV gene products by pIE611. Cytomegaloviruses are pathogenic betaherpesviruses persisting in a lifelong latency from which reactivation can occur under conditions of immunosuppression, immunoimmaturity, or inflammation. The switch from latency to reactivation requires expression of immediate early genes. Therefore, understanding of immediate early gene regulation might add insights into viral pathogenesis. The mouse cytomegalovirus (MCMV) immediate early 3 protein (611 amino acids; pIE611) is considered essential for viral replication. The identification of novel protein isoforms derived from alternatively spliced ie3 transcripts prompted
Modelling of blackout sequence at Atucha-1 using the MARCH3 code

International Nuclear Information System (INIS)

Baron, J.; Bastianelli, B.

1997-01-01

This paper presents the modelling of a complete blackout at the Atucha-1 NPP as a preliminary phase of a Level II probabilistic safety analysis. The MARCH3 code of the STCP (Source Term Code Package) is used, based on a plant model made in accordance with the particularities of the plant design. The analysis covers all the severe accident phases. The results show the time sequence of the events and provide the basis for source term studies. (author). 6 refs., 2 figs
The Pacific Ocean virome (POV): a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology.

Directory of Open Access Journals (Sweden)

Bonnie L Hurwitz

Bacteria and their viruses (phage) are fundamental drivers of many ecosystem processes including global biogeochemistry and horizontal gene transfer. While databases and resources for studying function in uncultured bacterial communities are relatively advanced, many fewer exist for their viral counterparts. The issue is largely technical in that the majority (often 90%) of viral sequences are functionally 'unknown' making viruses a virtually untapped resource of functional and physiological information. Here, we provide a community resource that organizes this unknown sequence space into 27 K high confidence protein clusters using 32 viral metagenomes from four biogeographic regions in the Pacific Ocean that vary by season, depth, and proximity to land, and include some of the first deep pelagic ocean viral metagenomes. These protein clusters more than double currently available viral protein clusters, including those from environmental datasets. Further, a protein cluster guided analysis of functional diversity revealed that richness decreased (i) from deep to surface waters, (ii) from winter to summer, (iii) and with distance from shore in surface waters only. These data provide a framework from which to draw on for future metadata-enabled functional inquiries of the vast viral unknown.
The Pacific Ocean virome (POV): a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology.

Science.gov (United States)

Hurwitz, Bonnie L; Sullivan, Matthew B

2013-01-01

Bacteria and their viruses (phage) are fundamental drivers of many ecosystem processes including global biogeochemistry and horizontal gene transfer. While databases and resources for studying function in uncultured bacterial communities are relatively advanced, many fewer exist for their viral counterparts. The issue is largely technical in that the majority (often 90%) of viral sequences are functionally 'unknown' making viruses a virtually untapped resource of functional and physiological information. Here, we provide a community resource that organizes this unknown sequence space into 27 K high confidence protein clusters using 32 viral metagenomes from four biogeographic regions in the Pacific Ocean that vary by season, depth, and proximity to land, and include some of the first deep pelagic ocean viral metagenomes. These protein clusters more than double currently available viral protein clusters, including those from environmental datasets. Further, a protein cluster guided analysis of functional diversity revealed that richness decreased (i) from deep to surface waters, (ii) from winter to summer, (iii) and with distance from shore in surface waters only. These data provide a framework from which to draw on for future metadata-enabled functional inquiries of the vast viral unknown.
Non-contiguous finished genome sequence of the opportunistic oral pathogen Prevotella multisaccharivorax type strain (PPPA20T)

Energy Technology Data Exchange (ETDEWEB)

Pati, Amrita [U.S. Department of Energy, Joint Genome Institute]; Gronow, Sabine [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Lu, Megan [Los Alamos National Laboratory (LANL)]; Lapidus, Alla L. [U.S. Department of Energy, Joint Genome Institute]; Nolan, Matt [U.S. Department of Energy, Joint Genome Institute]; Lucas, Susan [U.S. Department of Energy, Joint Genome Institute]; Hammon, Nancy [U.S. Department of Energy, Joint Genome Institute]; Deshpande, Shweta [U.S. Department of Energy, Joint Genome Institute]; Cheng, Jan-Fang [U.S. Department of Energy, Joint Genome Institute]; Tapia, Roxanne [Los Alamos National Laboratory (LANL)]; Han, Cliff [Los Alamos National Laboratory (LANL)]; Goodwin, Lynne A. [Los Alamos National Laboratory (LANL)]; Pitluck, Sam [U.S. Department of Energy, Joint Genome Institute]; Liolios, Konstantinos [U.S. Department of Energy, Joint Genome Institute]; Pagani, Ioanna [U.S. Department of Energy, Joint Genome Institute]; Mavromatis, K [U.S. Department of Energy, Joint Genome Institute]; Mikhailova, Natalia [U.S. Department of Energy, Joint Genome Institute]; Huntemann, Marcel [U.S. Department of Energy, Joint Genome Institute]; Chen, Amy [U.S. Department of Energy, Joint Genome Institute]; Palaniappan, Krishna [U.S. Department of Energy, Joint Genome Institute]; Land, Miriam L [ORNL]; Hauser, Loren John [ORNL]; Detter, J. Chris [U.S. Department of Energy, Joint Genome Institute]; Brambilla, Evelyne-Marie [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Rohde, Manfred [HZI - Helmholtz Centre for Infection Research, Braunschweig, Germany]; Goker, Markus [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Woyke, Tanja [U.S. Department of Energy, Joint Genome Institute]; Bristow, James [U.S. Department of Energy, Joint Genome Institute]; Eisen, Jonathan [U.S. Department of Energy, Joint Genome Institute]; Markowitz, Victor [U.S. Department of Energy, Joint Genome Institute]; Hugenholtz, Philip [U.S. Department of Energy, Joint Genome Institute]; Kyrpides, Nikos C [U.S. Department of Energy, Joint Genome Institute]; Klenk, Hans-Peter [DSMZ - German Collection of Microorganisms and Cell Cultures GmbH, Braunschweig, Germany]; Ivanova, N [U.S. Department of Energy, Joint Genome Institute]

2011-01-01

Prevotella multisaccharivorax Sakamoto et al. 2005 is a species of the large genus Prevotella, which belongs to the family Prevotellaceae. The species is of medical interest because its members are able to cause diseases in the human oral cavity such as periodontitis, root caries and others. Although 77 Prevotella genomes have already been sequenced or are targeted for sequencing, this is only the second completed genome sequence of a type strain of a species within the genus Prevotella to be published. The 3,388,644 bp long genome is assembled in three non-contiguous contigs, harbors 2,876 protein-coding and 75 RNA genes and is a part of the Genomic Encyclopedia of Bacteria and Archaea project.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina.

Science.gov (United States)

Valles, Steven M; Porter, Sanford D; Calcaterra, Luis A

2018-01-01

Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina MiSeq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Rapid amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.
Complete coding sequence of Zika virus from Martinique outbreak in 2015

Directory of Open Access Journals (Sweden)

G. Piorkowski

2016-05-01

Zika virus is an Aedes-borne Flavivirus causing fever, arthralgia, myalgia, and rash, associated with Guillain–Barré syndrome and suspected to induce microcephaly in the fetus. We report here the complete coding sequence of the first characterized Caribbean Zika virus strain, isolated from a patient from Martinique in December 2015.
Sensitive detection of viral transcripts in human tumor transcriptomes.

Directory of Open Access Journals (Sweden)

Sven-Eric Schelhorn

In excess of 12% of human cancer incidents have a viral cofactor. Epidemiological studies of idiopathic human cancers indicate that additional tumor viruses remain to be discovered. Recent advances in sequencing technology have enabled systematic screenings of human tumor transcriptomes for viral transcripts. However, technical problems such as low abundances of viral transcripts in large volumes of sequencing data, viral sequence divergence, and homology between viral and human factors significantly confound identification of tumor viruses. We have developed a novel computational approach for detecting viral transcripts in human cancers that takes the aforementioned confounding factors into account and is applicable to a wide variety of viruses and tumors. We apply the approach to conducting the first systematic search for viruses in neuroblastoma, the most common cancer in infancy. The diverse clinical progression of this disease as well as related epidemiological and virological findings are highly suggestive of a pathogenic cofactor. However, a viral etiology of neuroblastoma is currently contested. We mapped 14 transcriptomes of neuroblastoma as well as positive and negative controls to the human and all known viral genomes in order to detect both known and unknown viruses. Analysis of controls, comparisons with related methods, and statistical estimates demonstrate the high sensitivity of our approach. Detailed investigation of putative viral transcripts within neuroblastoma samples did not provide evidence for the existence of any known human viruses. Likewise, de novo assembly and analysis of chimeric transcripts did not result in expression signatures associated with novel human pathogens. While confounding factors such as sample dilution or viral clearance in progressed tumors may mask viral cofactors in the data, in principle, this is rendered less likely by the high sensitivity of our approach and the number of biological replicates
Solid lipid nanoparticles mediate non-viral delivery of plasmid DNA to dendritic cells

Science.gov (United States)

Penumarthi, Alekhya; Parashar, Deepti; Abraham, Amanda N.; Dekiwadia, Chaitali; Macreadie, Ian; Shukla, Ravi; Smooker, Peter M.

2017-06-01

There is an increasing demand for novel DNA vaccine delivery systems, mainly for the non-viral type as they are considered relatively safe. Therefore, solid lipid nanoparticles (SLNs) were investigated for their suitability as a non-viral DNA vaccine delivery system. SLNs were synthesised by a modified solvent-emulsification method in order to study their potential to conjugate with plasmid DNA and deliver them in vitro to dendritic cells using eGFP as the reporter plasmid. The DNA-SLN complexes were characterised by electron microscopy, gel retardation assays and dynamic light scattering. The cytotoxicity assay data supported their biocompatibility and was used to estimate safe threshold concentration resulting in high transfection rate. The transfection efficiency of these complexes in a dendritic cell line was shown to increase significantly compared to plasmid alone, and was comparable to that mediated by lipofectamine. Transmission electron microscopy studies delineated the pathway of cellular uptake. Endosomal escape was observed supporting the mechanism of transfection.
Classifying Coding DNA with Nucleotide Statistics

Directory of Open Access Journals (Sweden)

Nicolas Carels

2009-10-01

In this report, we compared the success rate of classification of coding sequences (CDS) vs. introns by Codon Structure Factor (CSF) and by a method that we called Universal Feature Method (UFM). UFM is based on the scoring of purine bias (Rrr) and stop codon frequency. We show that the success rate of CDS/intron classification by UFM is higher than by CSF. UFM classifies ORFs as coding or non-coding through a score based on (i) the stop codon distribution, (ii) the product of purine probabilities in the three positions of nucleotide triplets, (iii) the product of Cytosine (C), Guanine (G), and Adenine (A) probabilities in the 1st, 2nd, and 3rd positions of triplets, respectively, (iv) the probabilities of G in 1st and 2nd position of triplets and (v) the distance of their GC3 vs. GC2 levels to the regression line of the universal correlation. More than 80% of CDSs (true positives) of Homo sapiens (>250 bp), Drosophila melanogaster (>250 bp) and Arabidopsis thaliana (>200 bp) are successfully classified with a false positive rate lower than or equal to 5%. The method releases coding sequences in their coding strand and coding frame, which allows their automatic translation into protein sequences with 95% confidence. The method is a natural consequence of the compositional bias of nucleotides in coding sequences.
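Two of the features named in the abstract above, the stop codon frequency and the product of purine probabilities at the three codon positions, can be computed directly from a nucleotide string. The following is a minimal sketch of those two statistics only; it is not the authors' UFM implementation, the function names are hypothetical, and the standard genetic code's stop codons (TAA, TAG, TGA) are assumed.

```python
# Illustrative only: per-frame statistics loosely matching features (i) and (ii)
# of the abstract. Function names are hypothetical, not from the UFM paper.

PURINES = {"A", "G"}
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def codons(seq, frame=0):
    """Split seq into complete triplets, starting at the given frame offset."""
    seq = seq.upper()
    return [seq[i:i + 3] for i in range(frame, len(seq) - 2, 3)]


def stop_codon_frequency(seq, frame=0):
    """Fraction of triplets in this frame that are stop codons."""
    cs = codons(seq, frame)
    return sum(c in STOP_CODONS for c in cs) / len(cs) if cs else 0.0


def purine_probabilities(seq, frame=0):
    """Purine (A or G) fraction at codon positions 1, 2 and 3."""
    cs = codons(seq, frame)
    if not cs:
        return (0.0, 0.0, 0.0)
    return tuple(sum(c[k] in PURINES for c in cs) / len(cs) for k in range(3))


def purine_product(seq, frame=0):
    """Product of the three position-specific purine fractions."""
    p1, p2, p3 = purine_probabilities(seq, frame)
    return p1 * p2 * p3


if __name__ == "__main__":
    example = "ATGGCTGAATAA"  # codons: ATG GCT GAA TAA
    print(stop_codon_frequency(example))
    print(purine_probabilities(example))
    print(purine_product(example))
```

Evaluating these statistics in each reading frame, and on the reverse complement for the other strand, gives the kind of frame-specific signal the abstract describes; how the published method weights and combines the features is not stated in the abstract and is not reproduced here.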
Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.

Science.gov (United States)

2004-12-09

We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.
Source coherence impairments in a direct detection direct sequence optical code-division multiple-access system.

Science.gov (United States)

Fsaifes, Ihsan; Lepers, Catherine; Lourdiane, Mounia; Gallion, Philippe; Beugin, Vincent; Guignard, Philippe

2007-02-01

We demonstrate that direct sequence optical code-division multiple-access (DS-OCDMA) encoders and decoders using sampled fiber Bragg gratings (S-FBGs) behave as multipath interferometers. In that case, chip pulses of the prime sequence codes generated by spreading in time-coherent data pulses can result from multiple reflections in the interferometers that can superimpose within a chip time duration. We show that the autocorrelation function has to be considered as the sum of complex amplitudes of the combined chip as the laser source coherence time is much greater than the integration time of the photodetector. To reduce the sensitivity of the DS-OCDMA system to the coherence time of the laser source, we analyze the use of sparse and nonperiodic quadratic congruence and extended quadratic congruence codes.
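For readers unfamiliar with the prime sequence codes mentioned in the abstract above, the sketch below builds them using the textbook construction: for a prime p, codeword i places one pulse in each of p blocks of p chips, at offset (i * j) mod p in block j. This is an assumption about which construction is meant; the abstract does not give it, the function names are hypothetical, and the code models incoherent intensity correlation only, not the coherent complex-amplitude sum the paper analyzes.

```python
# Illustrative only: textbook prime sequence codes for OCDMA, assuming p is prime.
# Function names are hypothetical; this models intensity (0/1) chips, not fields.

def prime_sequence_code(p, i):
    """Return the length p*p binary codeword with index i (0 <= i < p)."""
    code = [0] * (p * p)
    for j in range(p):
        # Block j spans chips j*p .. j*p + p - 1; one pulse per block.
        code[j * p + (i * j) % p] = 1
    return code


def periodic_correlation(a, b, shift=0):
    """Periodic correlation of two equal-length 0/1 sequences at a chip shift."""
    n = len(a)
    return sum(a[k] * b[(k + shift) % n] for k in range(n))


if __name__ == "__main__":
    p = 5
    family = [prime_sequence_code(p, i) for i in range(p)]
    for i, c in enumerate(family):
        print(i, "".join(map(str, c)))
    # Autocorrelation peak of a codeword equals its weight p.
    print(periodic_correlation(family[1], family[1]))
    # Two distinct codewords overlap in a single chip at zero shift.
    print(periodic_correlation(family[1], family[2]))
```

Each codeword has weight p, so the zero-shift autocorrelation peak is p, while distinct codewords share only the first chip at zero shift.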
IdentiCS – Identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

Directory of Open Access Journals (Sweden)

Zeng An-Ping

2004-08-01

Background: A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS) and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. Results: In this work a fast method is proposed to use unannotated genome sequence for predicting CDSs and for an in silico reconstruction of metabolic networks. Instead of using predicted genes or CDSs to query public databases, entries from public DNA or protein databases are used as queries to search a local database of the unannotated genome sequence to predict CDSs. Functions are assigned to the predicted CDSs simultaneously. The well-annotated genome of Salmonella typhimurium LT2 is used as an example to demonstrate the applicability of the method. 97.7% of the CDSs in the original annotation are correctly identified. The use of SWISS-PROT-TrEMBL databases resulted in an identification of 98.9% of CDSs that have EC-numbers in the published annotation. Furthermore, two versions of sequences of the bacterium Klebsiella pneumoniae with different genome coverage (3.9 and 7.9 fold, respectively) are examined. The results suggest that a 3.9-fold coverage of the bacterial genome could be sufficiently used for the in silico reconstruction of the metabolic network. Compared to other gene finding methods such as CRITICA our method is more suitable for exploiting sequences of low genome coverage. Based on the new method, a program called IdentiCS (Identification of Coding Sequences from Unfinished Genome Sequences) is delivered that combines the identification of CDSs with the reconstruction, comparison and visualization of metabolic networks (free to download

Non-coding RNA in Deinococcus radiodurans

International Nuclear Information System (INIS)

Chen Zhongzhong; Wang Liangyan; Lin Jun; Tian Bing; Hua Yuejin

2006-01-01

Research on DNA damage and repair pathways of Deinococcus radiodurans shows its extreme resistance to ionizing radiation, ultraviolet radiation and reactive oxygen species. Non-coding RNAs (ncRNAs) are involved in a variety of processes such as transcriptional regulation, RNA processing and modification, mRNA translation, and protein transportation and stability. The conserved secondary structures of intergenic regions of Deinococcus radiodurans R1 were predicted using a Stochastic Context Free Grammar (SCFG) scan strategy. Results showed that 28 ncRNA families were present in the non-coding regions of the genome of Deinococcus radiodurans R1. Among these families, IRE is the largest, followed by Histone3, tRNA and SECIS. DicF, ctRNA-pGA1 and tmRNA are families discovered in bacteria. Results from the comparison with other organisms showed that these ncRNAs can be applied to the study of the biological function of Deinococcus radiodurans and supply a reference for the further study of DNA damage and repair mechanisms of this bacterium. (authors)
Small non coding RNAs in adipocyte biology and obesity.

Science.gov (United States)

Amri, Ez-Zoubir; Scheideler, Marcel

2017-11-15

Obesity has reached epidemic proportions world-wide and constitutes a substantial risk factor for hypertension, type 2 diabetes, cardiovascular diseases and certain cancers. So far, regulation of energy intake by dietary and pharmacological treatments has met limited success. The main interest of current research is focused on understanding the role of different pathways involved in adipose tissue function and modulation of its mass. Whole-genome sequencing studies revealed that the majority of the human genome is transcribed, with thousands of non-protein-coding RNAs (ncRNA), which comprise small and long ncRNAs. ncRNAs regulate gene expression at the transcriptional and post-transcriptional level. Numerous studies described the involvement of ncRNAs in the pathogenesis of many diseases including obesity and associated metabolic disorders. ncRNAs represent potential diagnostic biomarkers and promising therapeutic targets. In this review, we focused on small ncRNAs involved in the formation and function of adipocytes and obesity. Copyright © 2017 Elsevier B.V. All rights reserved.
Toward understanding non-coding RNA roles in intracranial aneurysms and subarachnoid hemorrhage

Directory of Open Access Journals (Sweden)

Huang Fengzhen

2017-05-01

Subarachnoid hemorrhage (SAH) is a common and frequently life-threatening cerebrovascular disease, which is mostly related to a ruptured intracranial aneurysm. Its complications include rebleeding, early brain injury, cerebral vasospasm, delayed cerebral ischemia, chronic hydrocephalus, and also non-neurological problems. Non-coding RNAs (ncRNAs), comprising microRNAs (miRNAs), small interfering RNAs (siRNAs) and long non-coding RNAs (lncRNAs), play an important role in intracranial aneurysms and SAH. Here, we review the expression profiles of non-coding RNAs and their related mechanisms in intracranial aneurysms and SAH. Moreover, we suggest that these non-coding RNAs function as novel molecular biomarkers to predict intracranial aneurysms and SAH, and may yield new therapies after SAH in the future.
Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop.

Science.gov (United States)

Brister, James Rodney; Bao, Yiming; Kuiken, Carla; Lefkowitz, Elliot J; Le Mercier, Philippe; Leplae, Raphael; Madupu, Ramana; Scheuermann, Richard H; Schobel, Seth; Seto, Donald; Shrivastava, Susmita; Sterk, Peter; Zeng, Qiandong; Klimke, William; Tatusova, Tatiana

2010-10-01

Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world's biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.
Towards Viral Genome Annotation Standards, Report from the 2010 NCBI Annotation Workshop

Directory of Open Access Journals (Sweden)

Qiandong Zeng

2010-10-01

Improvements in DNA sequencing technologies portend a new era in virology and could possibly lead to a giant leap in our understanding of viral evolution and ecology. Yet, as viral genome sequences begin to fill the world’s biological databases, it is critically important to recognize that the scientific promise of this era is dependent on consistent and comprehensive genome annotation. With this in mind, the NCBI Genome Annotation Workshop recently hosted a study group tasked with developing sequence, function, and metadata annotation standards for viral genomes. This report describes the issues involved in viral genome annotation and reviews policy recommendations presented at the NCBI Annotation Workshop.
Multiplexing Short Primers for Viral Family PCR

Energy Technology Data Exchange (ETDEWEB)

Gardner, S N; Hiddessen, A L; Hara, C A; Williams, P L; Wagner, M; Colston, B W

2008-06-26

We describe a Multiplex Primer Prediction (MPP) algorithm to build multiplex compatible primer sets for large, diverse, and unalignable sets of target sequences. The MPP algorithm is scalable to larger target sets than other available software, and it does not require a multiple sequence alignment. We applied it to questions in viral detection, and demonstrated that there are no universally conserved priming sequences among viruses and that it could require an unfeasibly large number of primers (~3700 18-mers or ~2000 10-mers) to generate amplicons from all sequenced viruses. We then designed primer sets separately for each viral family, and for several diverse species such as foot-and-mouth disease virus, hemagglutinin and neuraminidase segments of influenza A virus, Norwalk virus, and HIV-1.
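The abstract does not describe how MPP selects primers, so the following is not the MPP algorithm. It is a minimal sketch of one generic, alignment-free way to approach the same kind of problem: a greedy set cover over k-mers, in which the k-mer shared by the most still-uncovered targets is chosen repeatedly. The function names and the toy sequences are hypothetical, and real primer design would also need reverse primers, melting temperatures, and amplicon length limits, which are ignored here.

```python
from collections import defaultdict


def kmers(seq, k):
    """Return the set of all length-k substrings of seq."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def greedy_kmer_cover(targets, k):
    """Greedily pick k-mers until every target of length >= k contains one."""
    # Map each k-mer to the indices of the targets that contain it.
    index = defaultdict(set)
    for t_id, seq in enumerate(targets):
        for km in kmers(seq, k):
            index[km].add(t_id)

    uncovered = {t_id for t_id, seq in enumerate(targets) if len(seq) >= k}
    chosen = []
    while uncovered:
        # Sorting makes tie-breaking deterministic (alphabetical order).
        best = max(sorted(index), key=lambda km: len(index[km] & uncovered))
        chosen.append(best)
        uncovered -= index[best]
    return chosen


# Toy example: two related sequences share a 4-mer, the third shares none.
toy_targets = ["ACGTAC", "TTACGT", "GGGCCC"]
print(greedy_kmer_cover(toy_targets, 4))
```

When targets share no k-mer, as with the third toy sequence, each needs its own entry, which mirrors the abstract's point that a set of targets without conserved priming sequences requires a large number of primers.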
Non-coding RNAs in the Ovarian Follicle

Directory of Open Access Journals (Sweden)

Rosalia Battaglia

2017-05-01

The mammalian ovarian follicle is the complex reproductive unit comprising germ cell, somatic cells (Cumulus and Granulosa cells), and follicular fluid (FF): paracrine communication among the different cell types through FF ensures the development of a mature oocyte ready for fertilization. This paper is focused on non-coding RNAs in ovarian follicles and their predicted role in the pathways involved in oocyte growth and maturation. We determined the expression profiles of microRNAs in human oocytes and FF by high-throughput analysis and identified 267 microRNAs in FF and 176 in oocytes. Most of these were FF microRNAs, while 9 were oocyte specific. By bioinformatic analysis, independently performed on FF and oocyte microRNAs, we identified the most significant Biological Processes and the pathways regulated by their validated targets. We found many pathways shared between the two compartments and some specific for oocyte microRNAs. Moreover, we found 41 long non-coding RNAs able to interact with oocyte microRNAs and potentially involved in the regulation of folliculogenesis. These data are important in basic reproductive research and could also be useful for clinical applications. In fact, the characterization of non-coding RNAs in ovarian follicles could improve reproductive disease diagnosis, provide biomarkers of oocyte quality in Assisted Reproductive Treatment, and allow the development of therapies for infertility disorders.
Hidden Structural Codes in Protein Intrinsic Disorder.

Science.gov (United States)

Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo

2017-10-17

Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovers the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could have not been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.
Decoding the function of nuclear long non-coding RNAs.

Science.gov (United States)

Chen, Ling-Ling; Carmichael, Gordon G

2010-06-01

Long non-coding RNAs (lncRNAs) are mRNA-like, non-protein-coding RNAs that are pervasively transcribed throughout eukaryotic genomes. Rather than silently accumulating in the nucleus, many of these are now known or suspected to play important roles in nuclear architecture or in the regulation of gene expression. In this review, we highlight some recent progress in how lncRNAs regulate these important nuclear processes at the molecular level. Copyright 2010 Elsevier Ltd. All rights reserved.
A long and abundant non-coding RNA in Lactobacillus salivarius.

Science.gov (United States)

Cousin, Fabien J; Lynch, Denise B; Chuat, Victoria; Bourin, Maxence J B; Casey, Pat G; Dalmasso, Marion; Harris, Hugh M B; McCann, Angela; O'Toole, Paul W

2017-09-01

Lactobacillus salivarius, found in the intestinal microbiota of humans and animals, is studied as an example of the sub-dominant intestinal commensals that may impart benefits upon their host. Strains typically harbour at least one megaplasmid that encodes functions contributing to contingency metabolism and environmental adaptation. RNA sequencing (RNA-seq) transcriptomic analysis of L. salivarius strain UCC118 identified the presence of a novel unusually abundant long non-coding RNA (lncRNA) encoded by the megaplasmid, and which represented more than 75% of the total RNA-seq reads after depletion of rRNA species. The expression level of this 520 nt lncRNA in L. salivarius UCC118 exceeded that of the 16S rRNA, it accumulated during growth, was very stable over time and was also expressed during intestinal transit in a mouse. This lncRNA sequence is specific to the L. salivarius species; however, among 45 L. salivarius genomes analysed, not all (only 34) harboured the sequence for the lncRNA. This lncRNA was produced in 27 tested L. salivarius strains, but at strain-specific expression levels. High-level lncRNA expression correlated with high megaplasmid copy number. Transcriptome analysis of a deletion mutant lacking this lncRNA identified altered expression levels of genes in a number of pathways, but a definitive function of this new lncRNA was not identified. This lncRNA presents distinctive and unique properties, and suggests potential basic and applied scientific developments of this phenomenon.
Linear-time non-malleable codes in the bit-wise independent tampering model

NARCIS (Netherlands)

R.J.F. Cramer (Ronald); I.B. Damgård (Ivan); N.M. Döttling (Nico); I. Giacomelli (Irene); C. Xing (Chaoping)

2017-01-01

Non-malleable codes were introduced by Dziembowski et al. (ICS 2010) as coding schemes that protect a message against tampering attacks. Roughly speaking, a code is non-malleable if decoding an adversarially tampered encoding of a message m produces the original message m or a value m′
CONDOR: a database resource of developmentally associated conserved non-coding elements

Directory of Open Access Journals (Sweden)

Smith Sarah

2007-08-01

Background: Comparative genomics is currently one of the most popular approaches to study the regulatory architecture of vertebrate genomes. Fish-mammal genomic comparisons have proved powerful in identifying conserved non-coding elements likely to be distal cis-regulatory modules such as enhancers, silencers or insulators that control the expression of genes involved in the regulation of early development. The scientific community is showing increasing interest in characterizing the function, evolution and language of these sequences. Despite this, there remains little in the way of user-friendly access to a large dataset of such elements in conjunction with the analysis and the visualization tools needed to study them. Description: Here we present CONDOR (COnserved Non-coDing Orthologous Regions), available at: http://condor.fugu.biology.qmul.ac.uk. In an interactive and intuitive way the website displays data on > 6800 non-coding elements associated with over 120 early developmental genes and conserved across vertebrates. The database regularly incorporates results of ongoing in vivo zebrafish enhancer assays of the CNEs carried out in-house, which currently number ~100. Included and highlighted within this set are elements derived from duplication events both at the origin of vertebrates and more recently in the teleost lineage, thus providing valuable data for studying the divergence of regulatory roles between paralogs. CONDOR therefore provides a number of tools and facilities to allow scientists to progress in their own studies on the function and evolution of developmental cis-regulation. Conclusion: By providing access to data with an approachable graphics interface, the CONDOR database presents a rich resource for further studies into the regulation and evolution of genes involved in early development.
Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

Science.gov (United States)

Richardson, Dale N.; Wiehe, Thomas

Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.
LeARN: a platform for detecting, clustering and annotating non-coding RNAs

Directory of Open Access Journals (Sweden)

Schiex Thomas

2008-01-01

Background: In the last decade, sequencing projects have led to the development of a number of annotation systems dedicated to the structural and functional annotation of protein-coding genes. These annotation systems manage the annotation of the non-protein coding genes (ncRNAs) in a very crude way, allowing neither the editing of the secondary structures nor the clustering of ncRNA genes into families, which are crucial for appropriate annotation of these molecules. Results: LeARN is a flexible software package which handles the complete process of ncRNA annotation by integrating the layers of automatic detection and human curation. Conclusion: This software provides the infrastructure to deal properly with ncRNAs in the framework of any annotation project. It fills the gap between existing prediction software, which detects independent ncRNA occurrences, and public ncRNA repositories, which do not offer the flexibility and interactivity required for annotation projects. The software is freely available from the download section of the website http://bioinfo.genopole-toulouse.prd.fr/LeARN
Understanding Image Virality

Science.gov (United States)

2015-06-07

Example non-viral images. Figure 1: Top: Images with high viral scores in our dataset depict internet “celebrity” memes ex. “Grumpy Cat”; Bottom: Images...of images that is most similar to ours is the concurrently introduced viral meme generator of Wang et al., that combines NLP and Computer Vision (low...doing any of our tasks. The test included questions about widely spread Reddit memes and jargon so that anyone familiar with Reddit can easily get a high
Cyclic and constacyclic codes over a non-chain ring

Directory of Open Access Journals (Sweden)

Ayşegül Bayram

2014-09-01

In this study, we consider linear and especially cyclic codes over the non-chain ring $Z_{p}[v]/\langle v^{p}-v\rangle$ where $p$ is a prime. This is a generalization of the case $p=3$. Further, in this work the structure of constacyclic codes is studied as well. This study relies mainly on a Gray map which preserves the distance between codes over this ring and $p$-ary codes; moreover, this map sheds light on the structure of these codes. Furthermore, a MacWilliams type identity is presented together with some illustrative examples.
Viral Reservoirs in Lymph Nodes of FIV-Infected Progressor and Long-Term Non-Progressor Cats during the Asymptomatic Phase.

Directory of Open Access Journals (Sweden)

C D Eckstrand

Examination of a cohort of cats experimentally infected with feline immunodeficiency virus (FIV) for 5.75 years revealed detectable proviral DNA in peripheral blood mononuclear cells (PBMCs) harvested during the asymptomatic phase, undetectable plasma viral RNA (FIV gag), and rarely detectable cell-associated viral RNA. Despite apparent viral latency in peripheral CD4+ T cells, circulating CD4+ T cell numbers progressively declined in progressor animals. The aim of this study was to explore this dichotomy of peripheral blood viral latency in the face of progressive immunopathology. The viral replication status, cellular immunophenotypes, and histopathologic features were compared between popliteal lymph nodes (PLNs) and peripheral blood. Also, we identified and further characterized one of the FIV-infected cats identified as a long-term non-progressor (LTNP). PLN-derived leukocytes from FIV-infected cats during the chronic asymptomatic phase demonstrated active viral gag transcription and FIV protein translation as determined by real-time RT-PCR, Western blot and in situ immunohistochemistry, whereas viral RNA in blood leukocytes was either undetectable or intermittently detectable and viral protein was not detected. Active transcription of viral RNA was detectable in PLN-derived CD4+ and CD21+ leukocytes. Replication competent provirus was reactivated ex vivo from PLN-derived leukocytes from three of four FIV-infected cats. Progressor cats showed a persistent and dramatically decreased proportion and absolute count of CD4+ T cells in blood, and a decreased proportion of CD4+ T cells in PLNs. A single long-term non-progressor (LTNP) cat persistently demonstrated an absolute peripheral blood CD4+ T cell count indistinguishable from uninfected animals, a lower proviral load in unfractionated blood and PLN leukocytes, and very low amounts of viral RNA in the PLN. Collectively our data indicate that PLNs harbor important reservoirs of ongoing viral
High efficiency non-viral transfection of retinal and iris pigment epithelial cells with pigment epithelium-derived factor.

Science.gov (United States)

Thumann, G; Stöcker, M; Maltusch, C; Salz, A K; Barth, S; Walter, P; Johnen, S

2010-02-01

Transplantation of pigment epithelial cells in patients with age-related macular degeneration and Parkinson's disease has the potential to improve functional rehabilitation. Genetic modification of cells before transplantation may allow the delivery of neuroprotective factors to achieve functional improvement. As transplantation of cells modified using viral vectors is complicated by the possible dissemination of viral particles and severe immune reactions, we have explored non-viral methods to insert genetic material in pigment epithelial cells. Using lipofection or nucleofection ARPE-19 cells, freshly isolated and primary retinal and iris pigment epithelial (IPE) cells were transfected with plasmids encoding green fluorescent protein (GFP) and with three plasmids encoding recombinant pigment epithelium-derived factor (PEDF) and GFP. Transfection efficiency was evaluated by fluorescence microscopy and stability of protein expression by immunoblotting. Pigment epithelial cells were successfully transfected with plasmid encoding GFP. Expression of GFP in ARPE-19 was transient, but was observed for up to 1 year in IPE cells. Analysis of pigment epithelial cells transfected with PEDF plasmids revealed that PEDF fusion proteins were successfully expressed and functionally active. In conclusion, efficient transfer of genetic information in pigment epithelial cells can be achieved using non-viral transfection protocols.
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Background: Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR) index can be generated to map repetitive regions in genomic sequences. Results: We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC) clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion: An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences) regions in uncharacterised genomic sequences. The restriction that a particular
Non-coding, mRNA-like RNAs database Y2K.

Science.gov (United States)

Erdmann, V A; Szymanski, M; Hochberg, A; Groot, N; Barciszewski, J

2000-01-01

In the last few years much data has accumulated on various non-translatable RNA transcripts that are synthesised in different cells. They lack protein coding capacity and it seems that they work mainly or exclusively at the RNA level. All known non-coding RNA transcripts are collected in the database: http://www.man.poznan.pl/5SData/ncRNA/index.html

Licensee Event Report sequence coding and search procedure workshop

International Nuclear Information System (INIS)

Cottrell, W.B.; Gallaher, R.B.

1981-01-01

Since mid-1980, the Office for Analysis and Evaluation of Operational Data (AEOD) of the Nuclear Regulatory Commission (NRC) has been developing procedures for the systematic review and analysis of Licensee Event Reports (LERs). These procedures generally address several areas of concern, including identification of significant trends and patterns, event sequence of occurrences, component failures, and system and plant effects. The AEOD and NSIC conducted a workshop on the new coding procedure at the American Museum of Science and Energy in Oak Ridge, TN, on November 24, 1980
Appearances can be deceptive: revealing a hidden viral infection with deep sequencing in a plant quarantine context.

Science.gov (United States)

Candresse, Thierry; Filloux, Denis; Muhire, Brejnev; Julian, Charlotte; Galzi, Serge; Fort, Guillaume; Bernardo, Pauline; Daugrois, Jean-Heindrich; Fernandez, Emmanuel; Martin, Darren P; Varsani, Arvind; Roumagnac, Philippe

2014-01-01

Comprehensive inventories of plant viral diversity are essential for effective quarantine and sanitation efforts. The safety of regulated plant material exchanges presently relies heavily on techniques such as PCR or nucleic acid hybridisation, which are only suited to the detection and characterisation of specific, well characterised pathogens. Here, we demonstrate the utility of sequence-independent next generation sequencing (NGS) of both virus-derived small interfering RNAs (siRNAs) and virion-associated nucleic acids (VANA) for the detailed identification and characterisation of viruses infecting two quarantined sugarcane plants. Both plants originated from Egypt and were known to be infected with Sugarcane streak Egypt Virus (SSEV; Genus Mastrevirus, Family Geminiviridae), but were revealed by the NGS approaches to also be infected by a second highly divergent mastrevirus, here named Sugarcane white streak Virus (SWSV). This novel virus had escaped detection by all routine quarantine detection assays and was found to also be present in sugarcane plants originating from Sudan. Complete SWSV genomes were cloned and sequenced from six plants and all were found to share >91% genome-wide identity. With the exception of two SWSV variants, which potentially express unusually large RepA proteins, the SWSV isolates display genome characteristics very typical to those of all other previously described mastreviruses. An analysis of virus-derived siRNAs for SWSV and SSEV showed them to be strongly influenced by secondary structures within both genomic single stranded DNA and mRNA transcripts. In addition, the distribution of siRNA size frequencies indicates that these mastreviruses are likely subject to both transcriptional and post-transcriptional gene silencing. Our study stresses the potential advantages of NGS-based virus metagenomic screening in a plant quarantine setting and indicates that such techniques could dramatically reduce the numbers of non
Appearances can be deceptive: revealing a hidden viral infection with deep sequencing in a plant quarantine context.

Directory of Open Access Journals (Sweden)

Thierry Candresse

Comprehensive inventories of plant viral diversity are essential for effective quarantine and sanitation efforts. The safety of regulated plant material exchanges presently relies heavily on techniques such as PCR or nucleic acid hybridisation, which are only suited to the detection and characterisation of specific, well characterised pathogens. Here, we demonstrate the utility of sequence-independent next generation sequencing (NGS) of both virus-derived small interfering RNAs (siRNAs) and virion-associated nucleic acids (VANA) for the detailed identification and characterisation of viruses infecting two quarantined sugarcane plants. Both plants originated from Egypt and were known to be infected with Sugarcane streak Egypt Virus (SSEV; Genus Mastrevirus, Family Geminiviridae), but were revealed by the NGS approaches to also be infected by a second highly divergent mastrevirus, here named Sugarcane white streak Virus (SWSV). This novel virus had escaped detection by all routine quarantine detection assays and was found to also be present in sugarcane plants originating from Sudan. Complete SWSV genomes were cloned and sequenced from six plants and all were found to share >91% genome-wide identity. With the exception of two SWSV variants, which potentially express unusually large RepA proteins, the SWSV isolates display genome characteristics very typical to those of all other previously described mastreviruses. An analysis of virus-derived siRNAs for SWSV and SSEV showed them to be strongly influenced by secondary structures within both genomic single stranded DNA and mRNA transcripts. In addition, the distribution of siRNA size frequencies indicates that these mastreviruses are likely subject to both transcriptional and post-transcriptional gene silencing. Our study stresses the potential advantages of NGS-based virus metagenomic screening in a plant quarantine setting and indicates that such techniques could dramatically reduce the numbers of non
[Influence of "prehistory" of sequential movements of the right and the left hand on reproduction: coding of positions, movements and sequence structure].

Science.gov (United States)

Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R

2011-01-01

The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.
Two-terminal video coding.

Science.gov (United States)

Yang, Yang; Stanković, Vladimir; Xiong, Zixiang; Zhao, Wei

2009-03-01

Following recent works on the rate region of the quadratic Gaussian two-terminal source coding problem and limit-approaching code designs, this paper examines multiterminal source coding of two correlated, i.e., stereo, video sequences to save the sum rate over independent coding of both sequences. Two multiterminal video coding schemes are proposed. In the first scheme, the left sequence of the stereo pair is coded by H.264/AVC and used at the joint decoder to facilitate Wyner-Ziv coding of the right video sequence. The first I-frame of the right sequence is successively coded by H.264/AVC Intracoding and Wyner-Ziv coding. An efficient stereo matching algorithm based on loopy belief propagation is then adopted at the decoder to produce pixel-level disparity maps between the corresponding frames of the two decoded video sequences on the fly. Based on the disparity maps, side information for both motion vectors and motion-compensated residual frames of the right sequence are generated at the decoder before Wyner-Ziv encoding. In the second scheme, source splitting is employed on top of classic and Wyner-Ziv coding for compression of both I-frames to allow flexible rate allocation between the two sequences. Experiments with both schemes on stereo video sequences using H.264/AVC, LDPC codes for Slepian-Wolf coding of the motion vectors, and scalar quantization in conjunction with LDPC codes for Wyner-Ziv coding of the residual coefficients give a slightly lower sum rate than separate H.264/AVC coding of both sequences at the same video quality.
Raw Sewage Harbors Diverse Viral Populations

Science.gov (United States)

Cantalupo, Paul G.; Calgua, Byron; Zhao, Guoyan; Hundesa, Ayalkibet; Wier, Adam D.; Katz, Josh P.; Grabe, Michael; Hendrix, Roger W.; Girones, Rosina; Wang, David; Pipas, James M.

2011-01-01

ABSTRACT: At this time, about 3,000 different viruses are recognized, but metagenomic studies suggest that these viruses are a small fraction of the viruses that exist in nature. We have explored viral diversity by deep sequencing nucleic acids obtained from virion populations enriched from raw sewage. We identified 234 known viruses, including 17 that infect humans. Plant, insect, and algal viruses as well as bacteriophages were also present. These viruses represented 26 taxonomic families and included viruses with single-stranded DNA (ssDNA), double-stranded DNA (dsDNA), positive-sense ssRNA [ssRNA(+)], and dsRNA genomes. Novel viruses that could be placed in specific taxa represented 51 different families, making untreated wastewater the most diverse viral metagenome (genetic material recovered directly from environmental samples) examined thus far. However, the vast majority of sequence reads bore little or no sequence relation to known viruses and thus could not be placed into specific taxa. These results show that the vast majority of the viruses on Earth have not yet been characterized. Untreated wastewater provides a rich matrix for identifying novel viruses and for studying virus diversity. IMPORTANCE: At this time, virology is focused on the study of a relatively small number of viral species. Specific viruses are studied either because they are easily propagated in the laboratory or because they are associated with disease. The lack of knowledge of the size and characteristics of the viral universe and the diversity of viral genomes is a roadblock to understanding important issues, such as the origin of emerging pathogens and the extent of gene exchange among viruses. Untreated wastewater is an ideal system for assessing viral diversity because virion populations from large numbers of individuals are deposited and because raw sewage itself provides a rich environment for the growth of diverse host species and thus their viruses. These studies suggest that
MicroRNAs in the host response to viral infections of veterinary importance

Directory of Open Access Journals (Sweden)

Mohamed Samir Ahmed

2016-10-01

The discovery of small regulatory non-coding RNAs has been an exciting advance in the field of genomics. MicroRNAs (miRNAs) are endogenous RNA molecules, approximately 22 nucleotides in length, that regulate gene expression, mostly at the post-transcriptional level. MiRNA profiling technologies have made it possible to identify and quantify novel miRNAs and to study their regulation and potential roles in disease pathogenesis. Although miRNAs have been extensively investigated in viral infections of humans, their implications in viral diseases affecting animals of veterinary importance are much less understood. The number of annotated miRNAs in different animal species is growing continuously, and novel roles in regulating host-pathogen interactions are being discovered, for instance miRNA-mediated augmentation of viral transcription and replication. In this review, we present an overview of synthesis and function of miRNAs and an update on the current state of research on host-encoded miRNAs in the genesis of viral infectious diseases in their natural animal host as well as in selected in vivo and in vitro laboratory models.
A comprehensive and quantitative exploration of thousands of viral genomes

Science.gov (United States)

Mahmoudabadi, Gita

2018-01-01

The complete assembly of viral genomes from metagenomic datasets (short genomic sequences gathered from environmental samples) has proven to be challenging, so there are significant blind spots when we view viral genomes through the lens of metagenomics. One approach to overcoming this problem is to leverage the thousands of complete viral genomes that are publicly available. Here we describe our efforts to assemble a comprehensive resource that provides a quantitative snapshot of viral genomic trends – such as gene density, noncoding percentage, and abundances of functional gene categories – across thousands of viral genomes. We have also developed a coarse-grained method for visualizing viral genome organization for hundreds of genomes at once, and have explored the extent of the overlap between bacterial and bacteriophage gene pools. Existing viral classification systems were developed prior to the sequencing era, so we present our analysis in a way that allows us to assess the utility of the different classification systems for capturing genomic trends. PMID:29624169
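The genomic trends named in this abstract (gene density and noncoding percentage) can be computed from nothing more than a genome length and a list of gene coordinates. The following is a minimal sketch under stated assumptions: the function name `genome_summary`, the 1-based inclusive coordinate convention, and the choice of "genes per kb" as the density unit are all illustrative and are not taken from the resource described above.

```python
def genome_summary(genome_length, genes):
    """Return (genes per kb, noncoding percentage).

    genes: list of (start, end) pairs, 1-based inclusive (an assumed convention).
    Overlapping genes are counted once when measuring coding coverage.
    """
    covered = 0
    last_end = 0
    for start, end in sorted(genes):
        # Skip bases already covered by an earlier gene.
        start = max(start, last_end + 1)
        if end >= start:
            covered += end - start + 1
            last_end = end
    noncoding_pct = 100.0 * (genome_length - covered) / genome_length
    gene_density = len(genes) / (genome_length / 1000.0)
    return gene_density, noncoding_pct


if __name__ == "__main__":
    # Hypothetical 10 kb genome with three genes, two of which overlap.
    print(genome_summary(10000, [(1, 3000), (2500, 6000), (7001, 9000)]))
```

Merging overlaps before measuring coverage matters for viral genomes, where overlapping reading frames are common and would otherwise inflate the coding fraction.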
Capsid coding sequences of foot-and-mouth disease viruses are determinants of pathogenicity in pigs.

Science.gov (United States)

Lohse, Louise; Jackson, Terry; Bøtner, Anette; Belsham, Graham J

2012-05-24

The surface exposed capsid proteins, VP1, VP2 and VP3, of foot-and-mouth disease virus (FMDV) determine its antigenicity and the ability of the virus to interact with host-cell receptors. Hence, modification of these structural proteins may alter the properties of the virus. In the present study we compared the pathogenicity of different FMDVs in young pigs. In total 32 pigs, 7 weeks old, were exposed to virus, either by direct inoculation or through contact with inoculated pigs, using cell culture adapted (O1K B64), chimeric (O1K/A-TUR and O1K/O-UKG) or field strain (O-UKG/34/2001) viruses. The O1K B64 virus and the two chimeric viruses are identical to each other except for the capsid coding region. Animals exposed to O1K B64 did not exhibit signs of disease, while pigs exposed to each of the other viruses showed typical clinical signs of foot-and-mouth disease (FMD). All pigs infected with the O1K/O-UKG chimera or the field strain (O-UKG/34/2001) developed fulminant disease. Furthermore, 3 of 4 in-contact pigs exposed to the O1K/O-UKG virus died in the acute phase of infection, likely from myocardial infection. However, in the group exposed to the O1K/A-TUR chimeric virus, only 1 pig showed symptoms of disease within the time frame of the experiment (10 days). All pigs that developed clinical disease showed a high level of viral RNA in serum and infected pigs that survived the acute phase of infection developed a serotype specific antibody response. It is concluded that the capsid coding sequences are determinants of FMDV pathogenicity in pigs.
Graphene materials as 2D non-viral gene transfer vector platforms.

Science.gov (United States)

Vincent, M; de Lázaro, I; Kostarelos, K

2017-03-01

Advances in genomics and gene therapy could offer solutions to many diseases that remain incurable today; however, one of the critical obstacles to clinical progress is the difficulty of designing efficient and safe delivery vectors for the appropriate genetic cargo. Safety and large-scale production concerns counter-balance the high gene transfer efficiency achieved with viral vectors, while non-viral strategies have yet to become sufficiently efficient. The extraordinary physicochemical, optical and photothermal properties of graphene-based materials (GBMs) could offer two-dimensional components for the design of nucleic acid carrier systems. We discuss here such properties and their implications for the optimization of gene delivery. While the design of such vectors is still in its infancy, we provide here an exhaustive and up-to-date analysis of the studies that have explored GBMs as gene transfer vectors, focusing on the functionalization strategies followed to improve vector performance and on the biological effects attained.
Prospecting for viral natural enemies of the fire ant Solenopsis invicta in Argentina.

Directory of Open Access Journals (Sweden)

Steven M Valles

Metagenomics and next generation sequencing were employed to discover new virus natural enemies of the fire ant, Solenopsis invicta Buren, in its native range (i.e., Formosa, Argentina) with the ultimate goal of testing and releasing new viral pathogens into U.S. S. invicta populations to provide natural, sustainable control of this ant. RNA was purified from worker ants from 182 S. invicta colonies, which was pooled into 4 groups according to location. A library was created from each group and sequenced using Illumina MiSeq technology. After a series of winnowing methods to remove S. invicta genes, known S. invicta virus genes, and all other non-virus gene sequences, 61,944 unique singletons were identified with virus identity. These were assembled de novo yielding 171 contiguous sequences with significant identity to non-plant virus genes. Fifteen contiguous sequences exhibited very high expression rates and were detected in all four gene libraries. One contig (Contig_29) exhibited the highest expression level overall and across all four gene libraries. Random amplification of cDNA ends analyses expanded this contiguous sequence yielding a complete virus genome, which we have provisionally named Solenopsis invicta virus 5 (SINV-5). SINV-5 is a positive-sense, single-stranded RNA virus with genome characteristics consistent with insect-infecting viruses from the family Dicistroviridae. Moreover, the replicative genome strand of SINV-5 was detected in worker ants indicating that S. invicta serves as host for the virus. Many additional sequences were identified that are likely of viral origin. These sequences await further investigation to determine their origins and relationship with S. invicta. This study expands knowledge of the RNA virome diversity found within S. invicta populations.
Instruction sequence based non-uniform complexity classes

NARCIS (Netherlands)

Bergstra, J.A.; Middelburg, C.A.

2013-01-01

We present an approach to non-uniform complexity in which single-pass instruction sequences play a key part, and answer various questions that arise from this approach. We introduce several kinds of non-uniform complexity classes. One kind includes a counterpart of the well-known non-uniform
Comparison of tissue sample processing methods for harvesting the viral metagenome and a snapshot of the RNA viral community in a turkey gut.

Science.gov (United States)

Shah, Jigna D; Baller, Joshua; Zhang, Ying; Silverstein, Kevin; Xing, Zheng; Cardona, Carol J

2014-12-01

RNA viruses have been associated with enteritis in poultry and have been isolated from diseased birds. The same viral agents have also been detected in healthy flocks, bringing into question their role in health and disease. In order to better understand eukaryotic viruses in the gut, this project focused on evaluating alternative methods to purify and concentrate viral particles, which do not involve the use of density gradients, for generating viral metagenome data. In this study, the sequence outcomes of three tissue processing methods have been evaluated and a data analysis pipeline has been established for RNA viruses from the gastrointestinal tract. In addition, with the use of the best method and increased sequencing depth, a glimpse of the RNA viral community in the gastrointestinal tract of a clinically normal 5-week old turkey is presented. The viruses from the Reoviridae and Astroviridae families together accounted for 76.3% of total viruses identified. The rarefaction curve at the species level further indicated that the majority of the species diversity was included with the increased sequencing depth, implying that viruses from other viral families were present in very low abundance. Copyright © 2014 Elsevier B.V. All rights reserved.
Cas9 specifies functional viral targets during CRISPR-Cas adaptation.

Science.gov (United States)

Heler, Robert; Samai, Poulami; Modell, Joshua W; Weiner, Catherine; Goldberg, Gregory W; Bikard, David; Marraffini, Luciano A

2015-03-12

Clustered regularly interspaced short palindromic repeat (CRISPR) loci and their associated (Cas) proteins provide adaptive immunity against viral infection in prokaryotes. Upon infection, short phage sequences known as spacers integrate between CRISPR repeats and are transcribed into small RNA molecules that guide the Cas9 nuclease to the viral targets (protospacers). Streptococcus pyogenes Cas9 cleavage of the viral genome requires the presence of a 5'-NGG-3' protospacer adjacent motif (PAM) sequence immediately downstream of the viral target. It is not known whether and how viral sequences flanked by the correct PAM are chosen as new spacers. Here we show that Cas9 selects functional spacers by recognizing their PAM during spacer acquisition. The replacement of cas9 with alleles that lack the PAM recognition motif or recognize an NGGNG PAM eliminated or changed PAM specificity during spacer acquisition, respectively. Cas9 associates with other proteins of the acquisition machinery (Cas1, Cas2 and Csn2), presumably to provide PAM-specificity to this process. These results establish a new function for Cas9 in the genesis of prokaryotic immunological memory.
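The PAM rule stated in this abstract (a 5'-NGG-3' motif immediately downstream of the viral target) is easy to illustrate on a plain DNA string. The sketch below is only an illustration of that rule, not the authors' analysis: the function name `find_protospacers` and the default target length of 20 nt are assumptions chosen for the example, and only the strand given as input is scanned.

```python
import re


def find_protospacers(seq, spacer_len=20):
    """Return (start, protospacer, pam) for each NGG PAM on the given strand.

    spacer_len is an assumed target length, not a value from the abstract.
    """
    seq = seq.upper()
    hits = []
    # A lookahead is used so that overlapping PAM sites are all reported.
    for match in re.finditer(r"(?=([ACGT]GG))", seq):
        pam_start = match.start()
        if pam_start >= spacer_len:
            start = pam_start - spacer_len
            hits.append((start, seq[start:pam_start], match.group(1)))
    return hits


if __name__ == "__main__":
    # Hypothetical sequence: 20 nt of target followed directly by a TGG PAM.
    for hit in find_protospacers("ACGTTGCATCGATCGATTACTGGCA"):
        print(hit)
```

To cover both strands, the same function could be run on the reverse complement of the input.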
Enhanced Protein Production in Escherichia coli by Optimization of Cloning Scars at the Vector-Coding Sequence Junction

DEFF Research Database (Denmark)

Mirzadeh, Kiavash; Martinez, Virginia; Toddo, Stephen

2015-01-01

are poorly expressed even when they are codon-optimized and expressed from vectors with powerful genetic elements. In this study, we show that poor expression can be caused by certain nucleotide sequences (e.g., cloning scars) at the junction between the vector and the coding sequence. Since these sequences...
HPV integration hijacks and multimerizes a cellular enhancer to generate a viral-cellular super-enhancer that drives high viral oncogene expression

Science.gov (United States)

Redmond, Catherine J.; Dooley, Katharine E.; Fu, Haiqing; Gillison, Maura L.; Akagi, Keiko; Symer, David E.; Aladjem, Mirit I.

2018-01-01

Integration of human papillomavirus (HPV) genomes into cellular chromatin is common in HPV-associated cancers. Integration is random, and each site is unique depending on how and where the virus integrates. We recently showed that tandemly integrated HPV16 could result in the formation of a super-enhancer-like element that drives transcription of the viral oncogenes. Here, we characterize the chromatin landscape and genomic architecture of this integration locus to elucidate the mechanisms that promoted de novo super-enhancer formation. Using next-generation sequencing and molecular combing/fiber-FISH, we show that ~26 copies of HPV16 are integrated into an intergenic region of chromosome 2p23.2, interspersed with 25 kb of amplified, flanking cellular DNA. This interspersed, co-amplified viral-host pattern is frequent in HPV-associated cancers and here we designate it as Type III integration. An abundant viral-cellular fusion transcript encoding the viral E6/E7 oncogenes is expressed from the integration locus and the chromatin encompassing both the viral enhancer and a region in the adjacent amplified cellular sequences is strongly enriched in the super-enhancer markers H3K27ac and Brd4. Notably, the peak in the amplified cellular sequence corresponds to an epithelial-cell-type specific enhancer. Thus, HPV16 integration generated a super-enhancer-like element composed of tandem interspersed copies of the viral upstream regulatory region and a cellular enhancer, to drive high levels of oncogene expression. PMID:29364907
Construction and sequencing of an infectious clone of the human parvovirus B19

International Nuclear Information System (INIS)

Zhi Ning; Zadori, Zoltan; Brown, Kevin E.; Tijssen, Peter

2004-01-01

Human parvovirus B19 has a nonenveloped, icosahedral capsid packaging a linear single-stranded DNA genome of 5.6 kb with long inverted terminal repeats (ITR) at both the 5' and 3' ends. Previous attempts to construct a full-length B19 clone were unsuccessful due to deletions in the ITR sequences. We cloned the complete parvovirus B19 genome with intact ITRs from an aplastic crisis patient. Sequence analysis of the complete viral genome indicated that both 5' and 3' ITRs have two sequence configurations and several base changes within the ITRs compared to previously published sequences. After transfection of the plasmid into permissive cells, spliced and non-spliced viral transcripts and viral capsid proteins could be detected. Southern blot analysis of the DNA purified from the plasmid-transfected cells confirmed parvovirus B19 DNA replication. Production of infectious virus by the B19 plasmid was shown by inoculation of cell lysate derived from transfected cells into fresh cells. Together, these results indicate the first successful production of an infectious clone for parvovirus B19 virus.
EG-10 LONG NON-CODING RNAs IN GLIOBLASTOMA

Science.gov (United States)

Pastori, Chiara; Kapranov, Philipp; Penas, Clara; Laurent, Georges St.; Ayad, Nagi; Wahlestedt, Claes

2014-01-01

Glioblastoma (GBM) is the most common, aggressive and incurable primary brain tumor in adults. Genome studies have confirmed that GBM is extremely heterogeneous with many genetically different subgroups. Consequently, there is much current interest in epigenetic drugs that may be active across genetically distinct tumors. In support of this, some epigenetic drugs have recently shown efficacy against several cancers including glioblastoma. Much recent interest has also been devoted to long non-coding RNAs (lncRNAs), which can modulate gene expression by regulating chromatin architecture, in part through the interaction with epigenetic protein machineries. To date, however, only a few lncRNAs have been studied in human cancer. We therefore embarked on a comprehensive genomic and functional analysis of lncRNAs in GBM. Using the Helicos Single Molecule Sequencing platform, glioblastoma samples were sequenced, resulting in the identification of hundreds of dysregulated lncRNAs. Among these, the lncRNA HOTAIR was found to be massively increased in GBM. This observation parallels findings in other cancers where HOTAIR's increased expression has been linked to poor prognosis due to metastatic events. Interestingly, here we show that in glioblastoma HOTAIR does not promote metastasis, but instead sustains the ability of these cells to proliferate. In fact, we demonstrate that HOTAIR knockdown in GBM strongly impairs cell proliferation and induces apoptosis in vitro and in vivo. Further, we implicate HOTAIR in the mechanism of action of certain epigenetic drugs. In summary, long noncoding RNAs (newly discovered epigenomic factors) play a vital role in GBM and deserve attention as entirely novel drug targets as well as biomarkers.
Analysis of the complete genome of the first Irkut virus isolate from China: comparison across the Lyssavirus genus.

Science.gov (United States)

Liu, Ye; Li, Nan; Zhang, Shoufeng; Zhang, Fei; Lian, Hai; Wang, Ying; Zhang, Jinxia; Hu, Rongliang

2013-12-01

The genome of Irkut virus, isolate IRKV-THChina12, the first non-rabies lyssavirus from China (of bat origin), has been completely sequenced. In general, coding and non-coding regions of this viral genome are similar to those of other lyssaviruses. However, alignment of the deduced amino acid sequences of the structural proteins of IRKV-THChina12 with those of other lyssavirus representatives revealed significant variability between viral species. The nucleoprotein and matrix protein were found to be the most conserved, followed by the large protein, glycoprotein and phosphoprotein. Differences in the antigenic sites in glycoprotein may result in only partial protection of the available rabies biologics against Irkut virus, which is of particular concern for pre- and post-exposure rabies prophylaxis. Copyright © 2013 Elsevier Inc. All rights reserved.
FOURTH SEMINAR TO THE MEMORY OF D.N. KLYSHKO: Algebraic solution of the synthesis problem for coded sequences

Science.gov (United States)

Leukhin, Anatolii N.

2005-08-01

The algebraic solution of a 'complex' problem of synthesis of phase-coded (PC) sequences with the zero level of side lobes of the cyclic autocorrelation function (ACF) is proposed. It is shown that the solution of the synthesis problem is connected with the existence of difference sets for a given code dimension. The problem of estimating the number of possible code combinations for a given code dimension is solved. It is pointed out that the problem of synthesis of PC sequences is related to the fundamental problems of discrete mathematics and, first of all, to a number of combinatorial problems, which can be solved, as the number factorisation problem, by algebraic methods by using the theory of Galois fields and groups.
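The property this abstract targets, a phase-coded sequence whose cyclic autocorrelation function has zero side lobes, can be checked numerically. The sketch below uses a Zadoff-Chu sequence of odd length as a well-known example of such a sequence; it is swapped in purely for illustration and is not the difference-set construction discussed in the paper. The function names are hypothetical.

```python
import cmath


def zadoff_chu(length, root=1):
    """Unit-modulus phase-coded sequence.

    Assumes length is odd and gcd(root, length) == 1.
    """
    return [
        cmath.exp(-1j * cmath.pi * root * n * (n + 1) / length)
        for n in range(length)
    ]


def cyclic_acf(x):
    """Cyclic (periodic) autocorrelation of x at every lag 0..len(x)-1."""
    size = len(x)
    return [
        sum(x[(n + lag) % size] * x[n].conjugate() for n in range(size))
        for lag in range(size)
    ]


if __name__ == "__main__":
    acf = cyclic_acf(zadoff_chu(7))
    # The main lobe (lag 0) equals the length; other lags vanish up to rounding.
    print([round(abs(value), 9) for value in acf])
```

The direct double sum costs O(N^2) operations, which is adequate for checking short codes; an FFT-based correlation would be the usual choice for long sequences.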

RAMBO-K: Rapid and Sensitive Removal of Background Sequences from Next Generation Sequencing Data.

Directory of Open Access Journals (Sweden)

Simon H Tausch

The assembly of viral or endosymbiont genomes from Next Generation Sequencing (NGS) data is often hampered by the predominant abundance of reads originating from the host organism. These reads increase the memory and CPU time usage of the assembler and can lead to misassemblies. We developed RAMBO-K (Read Assignment Method Based On K-mers), a tool which allows rapid and sensitive removal of unwanted host sequences from NGS datasets. Reaching a speed of 10 Megabases/s on 4 CPU cores and a standard hard drive, RAMBO-K is faster than any tool we tested, while showing a consistently high sensitivity and specificity across different datasets. RAMBO-K rapidly and reliably separates reads from different species without data preprocessing. It is suitable as a straightforward standard solution for workflows dealing with mixed datasets. Binaries and source code (Java and Python) are available from http://sourceforge.net/projects/rambok/.
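The general idea of assigning reads by their k-mer content can be sketched in a few lines. This is not RAMBO-K's implementation, whose scoring model the abstract does not describe: the log-odds score, the pseudocount, and the function names `kmer_counts` and `log_odds_score` are all assumptions made for illustration.

```python
import math
from collections import Counter


def kmer_counts(seqs, k):
    """Count every k-mer occurring in a collection of reference sequences."""
    counts = Counter()
    for seq in seqs:
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def log_odds_score(read, foreground, background, k, pseudo=1.0):
    """Positive score: the read's k-mers look more like the foreground."""
    # The pseudocount keeps unseen k-mers from producing log(0).
    fg_total = sum(foreground.values()) + pseudo * 4 ** k
    bg_total = sum(background.values()) + pseudo * 4 ** k
    score = 0.0
    for i in range(len(read) - k + 1):
        kmer = read[i:i + k]
        score += math.log((foreground[kmer] + pseudo) / fg_total)
        score -= math.log((background[kmer] + pseudo) / bg_total)
    return score


if __name__ == "__main__":
    # Hypothetical toy references standing in for virus and host sequences.
    virus = kmer_counts(["ACGTACGTACGT"], 4)
    host = kmer_counts(["TTTTTTTTTTTT"], 4)
    print(log_odds_score("ACGTACGT", virus, host, 4))
    print(log_odds_score("TTTTTTTT", virus, host, 4))
```

Reads scoring above a chosen cutoff would be kept as foreground and the rest discarded as host background; where to place that cutoff is a sensitivity versus specificity trade-off.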
SinEx DB: a database for single exon coding sequences in mammalian genomes.

Science.gov (United States)

Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

2016-01-01

Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas many public databases of eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with MySQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions is available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches against the in-house SinEx DB of SEGs and (iii) an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs. Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.
A new method for detecting signal regions in ordered sequences of real numbers, and application to viral genomic data.

Science.gov (United States)

Gog, Julia R; Lever, Andrew M L; Skittrall, Jordan P

2018-01-01

We present a fast, robust and parsimonious approach to detecting signals in an ordered sequence of numbers. Our motivation is in seeking a suitable method to take a sequence of scores corresponding to properties of positions in virus genomes, and find outlying regions of low scores. Suitable statistical methods without using complex models or making many assumptions are surprisingly lacking. We resolve this by developing a method that detects regions of low score within sequences of real numbers. The method makes no assumptions a priori about the length of such a region; it gives the explicit location of the region and scores it statistically. It does not use detailed mechanistic models so the method is fast and will be useful in a wide range of applications. We present our approach in detail, and test it on simulated sequences. We show that it is robust to a wide range of signal morphologies, and that it is able to capture multiple signals in the same sequence. Finally we apply it to viral genomic data to identify regions of evolutionary conservation within influenza and rotavirus.
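The task described in this abstract, locating a region of unusually low scores without fixing its length in advance, can be illustrated with a much simpler stand-in: a minimum-sum contiguous run of mean-centred scores, found with a Kadane-style scan. This is not the authors' method and it provides no statistical score; the function name and the mean-centring step are assumptions made for the example.

```python
def lowest_scoring_region(scores):
    """Return (start, end, total) for the contiguous run whose mean-centred
    scores have the smallest sum; end is exclusive.
    """
    mean = sum(scores) / len(scores)
    best = (0, 1, scores[0] - mean)
    cur_start, cur_sum = 0, 0.0
    for i, score in enumerate(scores):
        # A positive running sum can only hurt, so restart the run here.
        if cur_sum > 0:
            cur_start, cur_sum = i, 0.0
        cur_sum += score - mean
        if cur_sum < best[2]:
            best = (cur_start, i + 1, cur_sum)
    return best


if __name__ == "__main__":
    # Hypothetical per-position scores with a dip at positions 3 to 5.
    print(lowest_scoring_region([5, 5, 5, 1, 1, 1, 5, 5, 5, 5]))
```

The scan runs in linear time and, like the approach in the abstract, makes no prior assumption about region length, but assessing significance would need a separate step such as permutation of the scores.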
RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

DEFF Research Database (Denmark)

Wernersson, Rasmus; Pedersen, Anders Gorm

2003-01-01

The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...... proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...
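The final step listed in this abstract, building a DNA alignment from aligned peptide sequences, amounts to threading each sequence's codons onto its gapped protein row. The sketch below is not RevTrans code: it assumes the translation and protein alignment steps have already been done, that the DNA is in frame and free of stop codons, and that gaps are written as "-"; the function name `thread_codons` is hypothetical.

```python
def thread_codons(aligned_protein, dna):
    """Map an ungapped coding DNA sequence onto its gapped protein row.

    Each residue consumes one codon; each protein gap becomes three DNA gaps.
    """
    residues = len(aligned_protein.replace("-", ""))
    if len(dna) != 3 * residues:
        raise ValueError("DNA length does not match the protein row")
    out = []
    pos = 0
    for aa in aligned_protein:
        if aa == "-":
            out.append("---")
        else:
            out.append(dna[pos:pos + 3])
            pos += 3
    return "".join(out)


if __name__ == "__main__":
    # Hypothetical two-sequence protein alignment and the matching coding DNA.
    rows = {"seq1": ("MK-L", "ATGAAACTG"), "seq2": ("MKRL", "ATGAAGCGTCTT")}
    for name, (protein_row, dna) in rows.items():
        print(name, thread_codons(protein_row, dna))
```

Because gaps are always inserted in multiples of three, the resulting DNA alignment preserves the reading frame of every sequence.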
Genic non-coding microsatellites in the rice genome: characterization, marker design and use in assessing genetic and evolutionary relationships among domesticated groups

Directory of Open Access Journals (Sweden)

Singh Nagendra

2009-03-01

Background: Completely sequenced plant genomes provide scope for designing a large number of microsatellite markers, which are useful in various aspects of crop breeding and genetic analysis. With the objective of developing genic but non-coding microsatellite (GNMS) markers for the rice (Oryza sativa L.) genome, we characterized the frequency and relative distribution of microsatellite repeat-motifs in 18,935 predicted protein coding genes including 14,308 putative promoter sequences. Results: We identified 19,555 perfect GNMS repeats with densities ranging from 306.7/Mb in chromosome 1 to 450/Mb in chromosome 12 with an average of 357.5 GNMS per Mb. The average microsatellite density was maximum in the 5' untranslated regions (UTRs), followed by those in introns, promoters, 3'UTRs and minimum in the coding sequences (CDS). Primers were designed for 17,966 (92%) GNMS repeats, including 4,288 (94%) hypervariable class I types, which were bin-mapped on the rice genome. The GNMS markers were most polymorphic in the intronic region (73.3%), followed by markers in the promoter region (53.3%), and least in the CDS (26.6%). The robust polymerase chain reaction (PCR) amplification efficiency and high polymorphic potential of GNMS markers over genic coding and random genomic microsatellite markers suggest their immediate use in efficient genotyping applications in rice. A set of these markers could assess genetic diversity and establish phylogenetic relationships among domesticated rice cultivar groups. We also demonstrated the usefulness of orthologous and paralogous conserved non-coding microsatellite (CNMS) markers, identified in the putative rice promoter sequences, for comparative physical mapping and understanding of evolutionary and gene regulatory complexities among rice and other members of the grass family. The divergence between long-grained aromatics and subspecies japonica was estimated to be more recent (0.004 Mya) compared to short
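Identifying perfect microsatellite repeats, as this record describes, can be illustrated with a regular-expression scan for tandem copies of a short motif. The sketch below is a generic illustration and not the pipeline used in the study: the thresholds (`min_repeats=4`, motifs of 1 to 6 nt) and the function name `find_perfect_ssrs` are assumptions, since the abstract does not give the search criteria.

```python
import re


def find_perfect_ssrs(seq, min_repeats=4, max_motif=6):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats.

    min_repeats and max_motif are illustrative thresholds, not the study's.
    """
    seq = seq.upper()
    hits = []
    for size in range(1, max_motif + 1):
        # One motif of the given size followed by at least min_repeats - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_repeats - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit, e.g. "AA" or "ATAT".
            if any(
                motif == motif[:d] * (size // d)
                for d in range(1, size)
                if size % d == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


if __name__ == "__main__":
    # Hypothetical promoter fragment containing an (AG)6 repeat.
    print(find_perfect_ssrs("CCGTAGAGAGAGAGAGTTCA"))
```

Scanning one motif size at a time avoids the trap of a single lazy pattern that can latch onto a short, shifted motif and miss the longer repeat around it.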
Functional Role of Infective Viral Particles on Metal Reduction

Energy Technology Data Exchange (ETDEWEB)

Coates, John D.

2014-04-01

A proposed strategy for the remediation of uranium (U) contaminated sites was based on the immobilization of U by reducing the oxidized soluble U, U(VI), to form a reduced insoluble end product, U(IV). Previous studies identified Geobacter sp., including G. sulfurreducens and G. metallireducens, as predominant U(VI)-reducing bacteria under acetate-oxidizing and U(VI)-reducing conditions. Examination of the finished genome sequence annotation of the canonical metal reducing species Geobacter sulfurreducens strain PCA and G. metallireducens strain GS-15 as well as the draft genome sequence of G. uraniumreducens strain Rf4 identified phage related proteins. In addition, the completed genome for Anaeromyxobacter dehalogenans and the draft genome sequence of Desulfovibrio desulfuricans strain G20, two more model metal-reducing bacteria, also revealed phage related sequences. The presence of these gene sequences indicated that Geobacter spp., Anaeromyxobacter spp., and Desulfovibrio spp. are susceptible to viral infection. Furthermore, viral populations in soils and sedimentary environments in the order of 6.4×10⁶–2.7×10¹⁰ VLPs cm⁻³ have been observed. In some cases, viral populations exceed bacterial populations in these environments, suggesting that a relationship may exist between viruses and bacteria. Our preliminary screens of samples collected from the ESR FRC indicated that virus-like particles were present in significant numbers. The objective of this study was to investigate the potential functional role viruses play in metal reduction, specifically Fe(III) and U(VI) reduction, the environmental parameters affecting viral infection of metal reducing bacteria, and the subsequent effects on U transport.
Construction of a mutagenesis cartridge for poliovirus genome-linked viral protein: isolation and characterization of viable and nonviable mutants

International Nuclear Information System (INIS)

Kuhn, R.J.; Tada, H.; Ypma-Wong, M.F.; Dunn, J.J.; Semler, B.L.; Wimmer, E.

1988-01-01

By following a strategy of genetic analysis of poliovirus, the authors have constructed a synthetic mutagenesis cartridge spanning the genome-linked viral protein coding region and flanking cleavage sites in an infectious cDNA clone of the type I (Mahoney) genome. The insertion of new restriction sites within the infectious clone has allowed them to replace the wild-type sequences with short complementary pairs of synthetic oligonucleotides containing various mutations. A set of mutations have been made that create methionine codons within the genome-linked viral protein region. The resulting viruses have growth characteristics similar to wild type. Experiments that led to an alteration of the tyrosine residue responsible for the linkage to RNA have resulted in nonviable virus. In one mutant, proteolytic processing assayed in vitro appeared unimpaired by the mutation. They suggest that the position of the tyrosine residue is important for genome-linked viral protein function(s)
Extracellular vesicle associated long non-coding RNAs functionally enhance cell viability

Directory of Open Access Journals (Sweden)

Chris Hewson

2016-10-01

Cells communicate with one another to create microenvironments and share resources. One avenue by which cells communicate is through the action of exosomes. Exosomes are extracellular vesicles that are released by one cell and taken up by neighbouring cells. But how exosomes instigate communication between cells has remained largely unknown. We present evidence here that particular long non-coding RNA molecules are preferentially packaged into exosomes. We also find that a specific class of these exosome associated non-coding RNAs functionally modulate cell viability by direct interactions with l-lactate dehydrogenase B (LDHB), high-mobility group protein 17 (HMG-17), and CSF2RB, proteins involved in metabolism, nucleosomal architecture and cell signalling, respectively. Knowledge of this endogenous cell to cell pathway, those proteins interacting with exosome associated non-coding transcripts and their interacting domains, could lead to a better understanding of not only cell to cell interactions but also the development of exosome targeted approaches in patient specific cell-based therapies. Keywords: Non-coding RNA, Extracellular RNA, Exosomes, Retroelement, Pseudogene
Progressive changes in non-coding RNA profile in leucocytes with age

Science.gov (United States)

Muñoz-Culla, Maider; Irizar, Haritz; Gorostidi, Ana; Alberro, Ainhoa; Osorio-Querejeta, Iñaki; Ruiz-Martínez, Javier; Olascoaga, Javier; de Munain, Adolfo López; Otaegui, David

2017-01-01

It has been observed that immune cell deterioration occurs in the elderly, as well as a chronic low-grade inflammation called inflammaging. These cellular changes must be driven by numerous changes in gene expression and, in fact, both protein-coding and non-coding RNA expression alterations have been observed in peripheral blood mononuclear cells from elderly people. In the present work we have studied the expression of small non-coding RNA (microRNA and small nucleolar RNA, snoRNA) from healthy individuals from 24 to 79 years old. We have observed that the expression of 69 non-coding RNAs (56 microRNAs and 13 snoRNAs) changes progressively with chronological age. According to our results, the age range from 47 to 54 is critical given that it is the period when the expression trend (increasing or decreasing) of age-related small non-coding RNAs is more pronounced. Furthermore, age-related miRNAs regulate genes that are involved in immune, cell cycle and cancer-related processes, which had already been associated with human aging. Therefore, human aging could be studied as a result of progressive molecular changes, and different age ranges should be analysed to cover the whole aging process. PMID:28448962
ChimericSeq: An open-source, user-friendly interface for analyzing NGS data to identify and characterize viral-host chimeric sequences

Science.gov (United States)

Shieh, Fwu-Shan; Jongeneel, Patrick; Steffen, Jamin D.; Lin, Selena; Jain, Surbhi; Song, Wei

2017-01-01

Identification of viral integration sites has been important in understanding the pathogenesis and progression of diseases associated with particular viral infections. The advent of next-generation sequencing (NGS) has enabled researchers to understand the impact that viral integration has on the host, such as tumorigenesis. Current computational methods to analyze NGS data of virus-host junction sites have been limited in terms of their accessibility to a broad user base. In this study, we developed a software application (named ChimericSeq) that is the first program of its kind to offer a graphical user interface and compatibility with both Windows and Mac operating systems, and that is optimized for effectively identifying and annotating virus-host chimeric reads within NGS data. In addition, ChimericSeq’s pipeline implements custom filtering to remove artifacts and detect reads with quantitative analytical reporting to provide functional significance to discovered integration sites. The improved accessibility of ChimericSeq through a GUI in both Windows and Mac has the potential to expand NGS analytical support to a broader spectrum of the scientific community. PMID:28829778
Viral Diversity Threshold for Adaptive Immunity in Prokaryotes

Science.gov (United States)

Weinberger, Ariel D.; Wolf, Yuri I.; Lobkovsky, Alexander E.; Gilmore, Michael S.; Koonin, Eugene V.

2012-01-01

ABSTRACT Bacteria and archaea face continual onslaughts of rapidly diversifying viruses and plasmids. Many prokaryotes maintain adaptive immune systems known as clustered regularly interspaced short palindromic repeats (CRISPR) and CRISPR-associated genes (Cas). CRISPR-Cas systems are genomic sensors that serially acquire viral and plasmid DNA fragments (spacers) that are utilized to target and cleave matching viral and plasmid DNA in subsequent genomic invasions, offering critical immunological memory. Only 50% of sequenced bacteria possess CRISPR-Cas immunity, in contrast to over 90% of sequenced archaea. To probe why half of bacteria lack CRISPR-Cas immunity, we combined comparative genomics and mathematical modeling. Analysis of hundreds of diverse prokaryotic genomes shows that CRISPR-Cas systems are substantially more prevalent in thermophiles than in mesophiles. With sequenced bacteria disproportionately mesophilic and sequenced archaea mostly thermophilic, the presence of CRISPR-Cas appears to depend more on environmental temperature than on bacterial-archaeal taxonomy. Mutation rates are typically severalfold higher in mesophilic prokaryotes than in thermophilic prokaryotes. To quantitatively test whether accelerated viral mutation leads microbes to lose CRISPR-Cas systems, we developed a stochastic model of virus-CRISPR coevolution. The model competes CRISPR-Cas-positive (CRISPR-Cas+) prokaryotes against CRISPR-Cas-negative (CRISPR-Cas−) prokaryotes, continually weighing the antiviral benefits conferred by CRISPR-Cas immunity against its fitness costs. Tracking this cost-benefit analysis across parameter space reveals viral mutation rate thresholds beyond which CRISPR-Cas cannot provide sufficient immunity and is purged from host populations. These results offer a simple, testable viral diversity hypothesis to explain why mesophilic bacteria disproportionately lack CRISPR-Cas immunity. More generally, fundamental limits on the adaptability of biological
Non-coding RNAs and plant male sterility: current knowledge and future prospects.

Science.gov (United States)

Mishra, Ankita; Bohra, Abhishek

2018-02-01

Recent findings assign a functional role to non-coding (nc) RNA molecules in regulatory networks that confer male sterility to plants. Male sterility in plants offers a great opportunity for improving crop performance through application of hybrid technology. In this respect, cytoplasmic male sterility (CMS) and sterility induced by photoperiod (PGMS)/temperature (TGMS) have greatly facilitated development of high-yielding hybrids in crops. Participation of ncRNA molecules in plant reproductive development is becoming increasingly evident. Recent breakthroughs in rice definitively associate ncRNAs with PGMS and TGMS. In the case of CMS, the exact mechanism through which the mitochondrial ORFs exert influence on the development of the male gametophyte remains obscure in several crops. High-throughput sequencing has enabled genome-wide discovery and validation of these regulatory molecules and their target genes, describing their potential roles in relation to CMS. Discovery of ncRNA localized in plant mtDNA with its possible implication in CMS induction is intriguing in this respect. Still, conclusive evidence linking ncRNA with CMS phenotypes is currently unavailable, demanding complementary genetic approaches such as transgenics to substantiate the preliminary findings. Here, we review the recent literature on the contribution of ncRNAs in conferring male sterility to plants, with an emphasis on microRNAs. Also, we present a perspective on improved understanding of ncRNA-mediated regulatory pathways that control male sterility in plants. A refined understanding of plant male sterility would strengthen the crop hybrid industry to deliver hybrids with improved performance.
From structure prediction to genomic screens for novel non-coding RNAs.

Science.gov (United States)

Gorodkin, Jan; Hofacker, Ivo L

2011-08-01

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is comprised of regular Watson-Crick and GU wobble base pairs. This and the increased amount of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other.
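As a rough illustration of single-sequence folding with Watson-Crick and GU wobble pairs, the following Python sketch uses Nussinov-style dynamic programming to maximize the number of nested base pairs. This is a deliberate simplification swapped in for illustration: it counts pairs rather than minimizing free energy as the energy-directed methods in the review do, and the function names and the three-nucleotide minimum loop size are assumptions.

```python
def can_pair(a, b):
    """True for Watson-Crick (A-U, G-C) and G-U wobble pairs."""
    return {a, b} in ({"A", "U"}, {"G", "C"}, {"G", "U"})

def max_base_pairs(seq, min_loop=3):
    """Maximum number of nested base pairs (Nussinov-style DP).

    min_loop is the minimum number of unpaired bases enclosed by a pair;
    the value 3 is an assumed, commonly used hairpin constraint.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    best = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            score = best[i][j - 1]  # case: position j stays unpaired
            for k in range(i, j - min_loop):  # case: k pairs with j
                if can_pair(seq[k], seq[j]):
                    left = best[i][k - 1] if k > i else 0
                    score = max(score, left + 1 + best[k + 1][j - 1])
            best[i][j] = score
    return best[0][n - 1]

if __name__ == "__main__":
    print(max_base_pairs("GGGAAAUCC"))
```

Comparative approaches extend this kind of recursion by scoring alignment columns, rewarding structure-preserving (compensatory) changes across related sequences instead of pairs in a single sequence.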
Coding and decoding libraries of sequence-defined functional copolymers synthesized via photoligation.

Science.gov (United States)

Zydziak, Nicolas; Konrad, Waldemar; Feist, Florian; Afonin, Sergii; Weidner, Steffen; Barner-Kowollik, Christopher

2016-11-30

Designing artificial macromolecules with absolute sequence order represents a considerable challenge. Here we report an advanced light-induced avenue to monodisperse sequence-defined functional linear macromolecules up to decamers via a unique photochemical approach. The versatility of the synthetic strategy, which combines sequential and modular concepts, enables the synthesis of perfect macromolecules varying in chemical constitution and topology. Specific functions are placed at arbitrary positions along the chain via the successive addition of monomer units and blocks, leading to a library of functional homopolymers, alternating copolymers and block copolymers. The in-depth characterization of each sequence-defined chain confirms the precision nature of the macromolecules. Decoding of the functional information contained in the molecular structure is achieved via tandem mass spectrometry without recourse to their synthetic history, showing that the sequence information can be read. We submit that the presented photochemical strategy is a viable and advanced concept for coding individual monomer units along a macromolecular chain.
BIRTH: a beam deposition code for non-circular tokamak plasmas

International Nuclear Information System (INIS)

Otsuka, Michio; Nagami, Masayuki; Matsuda, Toshiaki

1982-09-01

A new beam deposition code has been developed which is capable of calculating fast ion deposition profiles including the orbit correction. The code incorporates any injection geometry and a non-circular cross section plasma with a variable elongation and an outward shift of the magnetic flux surface. Typical CPU time on a DEC-10 computer is 10-20 seconds with the orbit correction and 5-10 seconds without it. This is shorter by an order of magnitude than that of other codes, e.g., Monte Carlo codes. The power deposition profile calculated by this code is in good agreement with that calculated by a Monte Carlo code. (author)
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Directory of Open Access Journals (Sweden)

Maley Carlo C

2008-10-01

Abstract Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to detect
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Science.gov (United States)

Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

2008-01-01

Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to
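The notions of sequence space coverage and oligomer entropy can be sketched in a few lines of Python. The definitions below are assumptions chosen for illustration (coverage as the fraction of the 4^k possible k-mers observed at least once, entropy as the Shannon entropy of the observed k-mer frequency distribution); the paper may define its measures differently, and the function names are hypothetical.

```python
import math
from collections import Counter

def kmer_counts(seq, k):
    """Count overlapping k-mers, skipping any window with non-ACGT symbols."""
    seq = seq.upper()
    return Counter(
        seq[i:i + k]
        for i in range(len(seq) - k + 1)
        if set(seq[i:i + k]) <= set("ACGT")
    )

def sequence_space_coverage(seq, k):
    """Fraction of the 4**k possible k-mers that occur at least once."""
    return len(kmer_counts(seq, k)) / 4 ** k

def kmer_entropy(seq, k):
    """Shannon entropy in bits of the observed k-mer distribution."""
    counts = kmer_counts(seq, k)
    total = sum(counts.values())
    if total == 0:
        return 0.0
    return -sum(c / total * math.log2(c / total) for c in counts.values())

if __name__ == "__main__":
    print(sequence_space_coverage("ACGTACGT", 2), kmer_entropy("ACGTACGT", 2))
```

For genome-scale input, storing every k-mer in a Counter becomes memory-hungry as k grows, so dedicated k-mer counting tools are the usual choice; the sketch is meant only to pin down the quantities.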
Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

Directory of Open Access Journals (Sweden)

Inês C Conceição

BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations) and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes). CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1) the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2) the high
Characterization of non-coding DNA satellites associated with sweepoviruses (genus Begomovirus, Geminiviridae) - definition of a distinct class of begomovirus-associated satellites

Directory of Open Access Journals (Sweden)

Gloria Lozano

2016-02-01

Begomoviruses (family Geminiviridae) are whitefly-transmitted, plant-infecting single-stranded DNA viruses that cause crop losses throughout the warmer parts of the World. Sweepoviruses are a phylogenetically distinct group of begomoviruses that infect plants of the family Convolvulaceae, including sweet potato (Ipomoea batatas). Two classes of subviral molecules are often associated with begomoviruses, particularly in the Old World: the betasatellites and the alphasatellites. An analysis of sweet potato and Ipomoea indica samples from Spain and Merremia dissecta samples from Venezuela identified small non-coding subviral molecules in association with several distinct sweepoviruses. The sequences of 18 clones were obtained and found to be structurally similar to tomato leaf curl virus-satellite (ToLCV-sat), the first DNA satellite identified in association with a begomovirus, with a region with significant sequence identity to the conserved region of betasatellites, an A-rich sequence, a predicted stem-loop structure containing the nonanucleotide TAATATTAC, and a second predicted stem-loop. These sweepovirus-associated satellites join an increasing number of ToLCV-sat-like non-coding satellites identified recently. Although sharing some features with betasatellites, evidence is provided to suggest that the ToLCV-sat-like satellites are distinct from betasatellites and should be considered a separate class of satellites, for which the collective name deltasatellites is proposed.

IPNV with high and low virulence: host immune responses and viral mutations during infection

Directory of Open Access Journals (Sweden)

Skjesol Astrid

2011-08-01

Abstract Background Infectious pancreatic necrosis virus (IPNV) is an aquatic member of the Birnaviridae family that causes widespread disease in salmonids. IPNV is represented by multiple strains with markedly different virulence. Comparison of isolates reveals hypervariable regions (HVR), which are presumably associated with pathogenicity. However, little is known about the rates and modes of sequence divergence and molecular mechanisms that determine virulence. Also, how the host response may influence IPNV virulence is poorly described. Methods In this study we compared two field isolates of IPNV (NFH-Ar and NFH-El). The sequence changes, replication and mortality were assessed following experimental challenge of Atlantic salmon. Gene expression analyses with qPCR and microarray were applied to examine the immune responses in head kidney. Results Significant differences in mortality were observed between the two isolates, and viral load in the pancreas at 13 days post infection (d p.i.) was more than 4 orders of magnitude greater for NFH-Ar in comparison with NFH-El. Sequence comparison of five viral genes from the IPNV isolates revealed different mutation rates and Ka/Ks ratios. A strong tendency towards non-synonymous mutations was found in the HVR of VP2 and in VP3. All mutations in VP5 produced precocious stop codons. Prior to the challenge, NFH-Ar and NFH-El possessed high and low virulence motifs in VP2, respectively. Nucleotide substitutions were noticed already during passage of viruses in CHSE-214 cells and their accumulation continued in the challenged fish. The sequence changes were notably directed towards low virulence. Co-ordinated activation of anti-viral genes with diverse functions (IFN-a1 and c, sensors - Rig-I, MDA-5, TLR8 and 9, signal transducers - Srk2, MyD88, effectors - Mx, galectin 9, galectin binding protein, antigen presentation - b2-microglobulin) was observed at 13 d p.i. (NFH-Ar) and 29 d p.i. (both isolates
The role of viral population diversity in adaptation of bovine coronavirus to new host environments.

Directory of Open Access Journals (Sweden)

Monica K Borucki

The high mutation rate of RNA viruses enables a diverse genetic population of viral genotypes to exist within a single infected host. In-host genetic diversity could better position the virus population to respond and adapt to a diverse array of selective pressures such as host-switching events. Multiple new coronaviruses, including SARS, have been identified in human samples just within the last ten years, demonstrating the potential of coronaviruses as emergent human pathogens. Deep sequencing was used to characterize genomic changes in coronavirus quasispecies during simulated host-switching. Three bovine nasal samples infected with bovine coronavirus were used to infect human and bovine macrophage and lung cell lines. The virus reproduced relatively well in macrophages, but the lung cell lines were not infected efficiently enough to allow passage of non lab-adapted samples. Approximately 12 kb of the genome was amplified before and after passage and sequenced at average coverages of nearly 950× (454 sequencing) and 38,000× (Illumina). The consensus sequence of many of the passaged samples had a 12 nucleotide insert in the spike gene, and multiple point mutations were associated with the presence of the insert. Deep sequencing revealed that the insert was present but very rare in the unpassaged samples and could quickly shift to dominate the population when placed in a different environment. The insert coded for three arginine residues, occurred in a region associated with fusion entry into host cells, and may allow infection of new cell types via heparin sulfate binding. Analysis of the deep sequencing data indicated that two distinct genotypes circulated at different frequency levels in each sample, and supported the hypothesis that the mutations present in passaged strains were "selected" from a pre-existing pool rather than through de novo mutation and subsequent population fixation.
DNA methylation of miRNA coding sequences putatively associated with childhood obesity.

Science.gov (United States)

Mansego, M L; Garcia-Lacarte, M; Milagro, F I; Marti, A; Martinez, J A

2017-02-01

Epigenetic mechanisms may be involved in obesity onset and its consequences. The aim of the present study was to evaluate whether DNA methylation status in microRNA (miRNA) coding regions is associated with childhood obesity. DNA isolated from white blood cells of 24 children (identification sample: 12 obese and 12 non-obese) from the Grupo Navarro de Obesidad Infantil study was hybridized in a 450 K methylation microarray. Several CpGs whose DNA methylation levels were statistically different between obese and non-obese were validated by MassArray® in 95 children (validation sample) from the same study. Microarray analysis identified 16 differentially methylated CpGs between both groups (6 hypermethylated and 10 hypomethylated). DNA methylation levels in miR-1203, miR-412 and miR-216A coding regions significantly correlated with body mass index standard deviation score (BMI-SDS) and explained up to 40% of the variation of BMI-SDS. The network analysis identified 19 well-defined obesity-relevant biological pathways from the KEGG database. MassArray® validation identified three regions located in or near miR-1203, miR-412 and miR-216A coding regions differentially methylated between obese and non-obese children. The current work identified three CpG sites located in coding regions of three miRNAs (miR-1203, miR-412 and miR-216A) that were differentially methylated between obese and non-obese children, suggesting a role of miRNA epigenetic regulation in childhood obesity. © 2016 World Obesity Federation.
A viral metagenomic approach on a nonmetagenomic experiment

DEFF Research Database (Denmark)

Bovo, Samuele; Mazzoni, Gianluca; Ribani, Anisa

2017-01-01

Shot-gun next generation sequencing (NGS) on whole DNA extracted from specimens collected from mammals often produces reads that are not mapped (i.e. unmapped reads) on the host reference genome and that are usually discarded as by-products of the experiments. In this study, we mined Ion Torrent...... reads obtained by sequencing DNA isolated from archived blood samples collected from 100 performance tested Italian Large White pigs. Two reduced representation libraries were prepared from two DNA pools constructed each from 50 equimolar DNA samples. Bioinformatic analyses were carried out to mine...... unmapped reads on the reference pig genome that were obtained from the two NGS datasets. In silico analyses included read mapping and sequence assembly approaches for a viral metagenomic analysis using the NCBI Viral Genome Resource. Our approach identified sequences matching several viruses...
Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

Directory of Open Access Journals (Sweden)

Rachel Caldwell

2015-01-01

There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript) length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs) between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length.
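For readers unfamiliar with quantile regression, the Python sketch below fits a straight line for a chosen quantile by minimizing the pinball (check) loss. It is a toy under stated assumptions, not the authors' analysis: it handles a single predictor, relies on the fact that an optimal line can be found among lines through pairs of data points, and uses an exhaustive search that is only practical for small samples. All names are hypothetical.

```python
from itertools import combinations

def pinball_loss(residual, q):
    """Check loss: weight q on positive residuals, (1 - q) on negative ones."""
    return q * residual if residual >= 0 else (q - 1) * residual

def fit_quantile_line(xs, ys, q=0.5):
    """Return (intercept, slope) minimizing total pinball loss for quantile q.

    Brute force over all lines through two data points (roughly O(n^3)),
    so intended for illustration on small inputs only.
    """
    points = list(zip(xs, ys))
    best = None
    for (x1, y1), (x2, y2) in combinations(points, 2):
        if x1 == x2:
            continue  # a vertical line cannot be written as y = a + b*x
        slope = (y2 - y1) / (x2 - x1)
        intercept = y1 - slope * x1
        loss = sum(pinball_loss(y - (intercept + slope * x), q) for x, y in points)
        if best is None or loss < best[0]:
            best = (loss, intercept, slope)
    if best is None:
        raise ValueError("need at least two points with distinct x values")
    return best[1], best[2]

if __name__ == "__main__":
    # Hypothetical data: x could stand for sequence length, y for expression.
    print(fit_quantile_line([0, 1, 2, 3, 4], [0, 1, 2, 3, 100], q=0.5))
```

Fitting several quantiles (for example 0.1, 0.5 and 0.9) to the same data shows whether a length-expression relationship differs between weakly and highly expressed genes, which is the kind of variation a single mean regression would hide.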
An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

Science.gov (United States)

Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

2011-01-01

cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.
Genome-wide identification and characterization of long intergenic non-coding RNAs in Ganoderma lucidum.

Directory of Open Access Journals (Sweden)

Jianqin Li

Full Text Available Ganoderma lucidum is a white-rot fungus best-known for its medicinal activities. We have previously sequenced its genome and annotated the protein coding genes. However, long non-coding RNAs in G. lucidum genome have not been analyzed. In this study, we have identified and characterized long intergenic non-coding RNAs (lincRNAs) in G. lucidum systematically. We developed a computational pipeline, which was used to analyze RNA-Seq data derived from G. lucidum samples collected from three developmental stages. A total of 402 lincRNA candidates were identified, with an average length of 609 bp. Analysis of their adjacent protein-coding genes (apcGenes) revealed that 46 apcGenes belong to the pathways of triterpenoid biosynthesis and lignin degradation, or families of cytochrome P450, mating type B genes, and carbohydrate-active enzymes. To determine if lincRNAs and these apcGenes have any interactions, the corresponding pairs of lincRNAs and apcGenes were analyzed in detail. We developed a modified 3' RACE method to analyze the transcriptional direction of a transcript. Among the 46 lincRNAs, 37 were found unidirectionally transcribed, and 9 were found bidirectionally transcribed. The expression profiles of 16 of these 37 lincRNAs were found to be highly correlated with those of the apcGenes across the three developmental stages. Among them, 11 are positively correlated (r>0.8) and 5 are negatively correlated (r<-0.8). The co-localization and co-expression of lincRNAs and those apcGenes playing important functions is consistent with the notion that lincRNAs might be important regulators for cellular processes. In summary, this represents the very first study to identify and characterize lincRNAs in the genomes of basidiomycetes. The results obtained here have laid the foundation for study of potential lincRNA-mediated expression regulation of genes in G. lucidum.
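The correlation thresholds quoted in this abstract (r>0.8 and r<-0.8) can be illustrated with a minimal Python sketch. It assumes that r is Pearson's correlation coefficient computed over one expression value per developmental stage, which the abstract does not state explicitly. The function names and the expression values in the usage note are hypothetical and are not data from the study.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        # A flat expression profile has no defined correlation.
        return float("nan")
    return sxy / sqrt(sxx * syy)


def classify_pair(linc_expr, gene_expr, cutoff=0.8):
    """Label a lincRNA/apcGene pair using the cutoff quoted in the abstract."""
    r = pearson_r(linc_expr, gene_expr)
    if r > cutoff:
        return "positive"
    if r < -cutoff:
        return "negative"
    return "uncorrelated"
```

For example, `classify_pair([1.0, 2.0, 3.0], [2.0, 4.0, 6.0])` labels the pair as positively correlated. With only three stages per profile, such coefficients are easily pushed toward the extremes, so a threshold of this kind is best read as a screening criterion.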
An Explicit Construction of a sequence of codes attaining the Tsfasman-Vladut-Zink Bound:The first steps

DEFF Research Database (Denmark)

Høholdt, Tom; Voss, Cornelia

1997-01-01

We present a sequence of codes attaining the Tsfasman-Vladut-Zink bound. The construction is based on the tower of Artin-Schreier extensions described by Garcia and Stichtenoth (1995). We also determine the dual codes. The first steps of the constructions are explicitly given as generator matrices...
Histomorphological changes in hepatitis C non-responders with respect to viral genotypes

International Nuclear Information System (INIS)

Adnan, U.; Mirza, T.; Naz, E.; Aziz, S.

2013-01-01

Objective: To evaluate the distinct histopathological changes of chronic hepatitis C (CHC) non-responders in association with viral genotypes. Methods: This cross-sectional study was conducted at the histopathology section of the Dow Diagnostic Research and Reference Laboratory, Dow University of Health Sciences in collaboration with Sarwar Zuberi Liver Centre, Civil Hospital, Karachi from September 2009 to August 2011. Seventy-five non-responders (end-treatment-response [ETR] positive patients) from a consecutive series of viral-RNA positive CHC patients with known genotypes were selected. Their genotypes and pertinent clinical history was recorded. They were subjected to liver biopsies which were assessed for grade, stage, steatosis, stainable iron and characteristic histological lesions. Results: Majority of the patients (63, 84%) had genotype 3 while 12(16%) cases had genotype 1. The genotype 1 patients had significantly higher scores of inflammation (p<0.03) and fibrosis (p<0.04) as compared to genotype 3. Steatosis was significantly present in all genotype 3 patients in higher scores (p<0.001) compared to genotype 1. Stainable iron scores were generally low in the patients in this study, however, it was more commonly seen in genotype 3. The distribution of characteristic histological lesions was noteworthy in both the groups, irrespective of genotype. Conclusion: In this series, the predominant genotype was 3. However, genotype 1 patients were more prone to the aggressive nature of the disease with significantly higher scores of inflammation and fibrosis. Steatosis was characteristically observed in genotype 3 group. Stainable iron could not be attributed as a cause of non-response. (author)
Complex organisation and structure of the ghrelin antisense strand gene GHRLOS, a candidate non-coding RNA gene

Directory of Open Access Journals (Sweden)

Herington Adrian C

2008-10-01

Full Text Available Abstract Background The peptide hormone ghrelin has many important physiological and pathophysiological roles, including the stimulation of growth hormone (GH) release, appetite regulation, gut motility and proliferation of cancer cells. We previously identified a gene on the opposite strand of the ghrelin gene, ghrelinOS (GHRLOS), which spans the promoter and untranslated regions of the ghrelin gene (GHRL). Here we further characterise GHRLOS. Results We have described GHRLOS mRNA isoforms that extend over 1.4 kb of the promoter region and 106 nucleotides of exon 4 of the ghrelin gene, GHRL. These GHRLOS transcripts initiate 4.8 kb downstream of the terminal exon 4 of GHRL and are present in the 3' untranslated exon of the adjacent gene TATDN2 (TatD DNase domain containing 2). Interestingly, we have also identified a putative non-coding TATDN2-GHRLOS chimaeric transcript, indicating that GHRLOS RNA biogenesis is extremely complex. Moreover, we have discovered that the 3' region of GHRLOS is also antisense, in a tail-to-tail fashion to a novel terminal exon of the neighbouring SEC13 gene, which is important in protein transport. Sequence analyses revealed that GHRLOS is riddled with stop codons, and that there is little nucleotide and amino-acid sequence conservation of the GHRLOS gene between vertebrates. The gene spans 44 kb on 3p25.3, is extensively spliced and harbours multiple variable exons. We have also investigated the expression of GHRLOS and found evidence of differential tissue expression. It is highly expressed in tissues which are emerging as major sites of non-coding RNA expression (the thymus, brain, and testis), as well as in the ovary and uterus. In contrast, very low levels were found in the stomach where sense, GHRL derived RNAs are highly expressed. Conclusion GHRLOS RNA transcripts display several distinctive features of non-coding (ncRNA) genes, including 5' capping, polyadenylation, extensive splicing and short open reading
ChIPBase: a database for decoding the transcriptional regulation of long non-coding RNA and microRNA genes from ChIP-Seq data.

Science.gov (United States)

Yang, Jian-Hua; Li, Jun-Hao; Jiang, Shan; Zhou, Hui; Qu, Liang-Hu

2013-01-01

Long non-coding RNAs (lncRNAs) and microRNAs (miRNAs) represent two classes of important non-coding RNAs in eukaryotes. Although these non-coding RNAs have been implicated in organismal development and in various human diseases, surprisingly little is known about their transcriptional regulation. Recent advances in chromatin immunoprecipitation with next-generation DNA sequencing (ChIP-Seq) have provided methods of detecting transcription factor binding sites (TFBSs) with unprecedented sensitivity. In this study, we describe ChIPBase (http://deepbase.sysu.edu.cn/chipbase/), a novel database that we have developed to facilitate the comprehensive annotation and discovery of transcription factor binding maps and transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. The current release of ChIPBase includes high-throughput sequencing data that were generated by 543 ChIP-Seq experiments in diverse tissues and cell lines from six organisms. By analysing millions of TFBSs, we identified tens of thousands of TF-lncRNA and TF-miRNA regulatory relationships. Furthermore, two web-based servers were developed to annotate and discover transcriptional regulatory relationships of lncRNAs and miRNAs from ChIP-Seq data. In addition, we developed two genome browsers, deepView and genomeView, to provide integrated views of multidimensional data. Moreover, our web implementation supports diverse query types and the exploration of TFs, lncRNAs, miRNAs, gene ontologies and pathways.
Coding sequence of human rho cDNAs clone 6 and clone 9

Energy Technology Data Exchange (ETDEWEB)

Chardin, P; Madaule, P; Tavitian, A

1988-03-25

The authors have isolated human cDNAs including the complete coding sequence for two rho proteins corresponding to the incomplete isolates previously described as clone 6 and clone 9. The deduced a.a. sequences, when compared to the a.a. sequence deduced from clone 12 cDNA, show that there are at least three highly homologous rho genes in humans. They suggest that clone 12 be named rhoA, clone 6 rhoB, and clone 9 rhoC. RhoA, B and C proteins display approx. 30% a.a. identity with ras proteins, mainly clustered in four highly homologous internal regions corresponding to the GTP binding site; however, at least one significant difference is found: the three rho proteins have an alanine at the position corresponding to ras glycine 13, suggesting that rho and ras proteins might have slightly different biochemical properties.
Application of Melcor code for the calculation of TMLB sequence in PWR with natural circulation into the vessel

International Nuclear Information System (INIS)

Marten-Fuertes, F.

1995-01-01

The use of computer codes to analyze the phenomena of severe accidents is very important for making decisions in nuclear safety. This paper presents the MELCOR code as used to calculate the TMLB sequence of a PWR with natural circulation in the vessel. The main goal of this code is its application for the PSA (probabilistic safety analysis).
Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

Directory of Open Access Journals (Sweden)

Masfique Mehedi

Full Text Available Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.
Ebola virus RNA editing depends on the primary editing site sequence and an upstream secondary structure.

Science.gov (United States)

Mehedi, Masfique; Hoenen, Thomas; Robertson, Shelly; Ricklefs, Stacy; Dolan, Michael A; Taylor, Travis; Falzarano, Darryl; Ebihara, Hideki; Porcella, Stephen F; Feldmann, Heinz

2013-01-01

Ebolavirus (EBOV), the causative agent of a severe hemorrhagic fever and a biosafety level 4 pathogen, increases its genome coding capacity by producing multiple transcripts encoding for structural and nonstructural glycoproteins from a single gene. This is achieved through RNA editing, during which non-template adenosine residues are incorporated into the EBOV mRNAs at an editing site encoding for 7 adenosine residues. However, the mechanism of EBOV RNA editing is currently not understood. In this study, we report for the first time that minigenomes containing the glycoprotein gene editing site can undergo RNA editing, thereby eliminating the requirement for a biosafety level 4 laboratory to study EBOV RNA editing. Using a newly developed dual-reporter minigenome, we have characterized the mechanism of EBOV RNA editing, and have identified cis-acting sequences that are required for editing, located between 9 nt upstream and 9 nt downstream of the editing site. Moreover, we show that a secondary structure in the upstream cis-acting sequence plays an important role in RNA editing. EBOV RNA editing is glycoprotein gene-specific, as a stretch encoding for 7 adenosine residues located in the viral polymerase gene did not serve as an editing site, most likely due to an absence of the necessary cis-acting sequences. Finally, the EBOV protein VP30 was identified as a trans-acting factor for RNA editing, constituting a novel function for this protein. Overall, our results provide novel insights into the RNA editing mechanism of EBOV, further understanding of which might result in novel intervention strategies against this viral pathogen.
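The abstract above describes an editing site made of a run of 7 adenosines, with required cis-acting sequences lying within 9 nt on either side. As a rough illustration only, the Python sketch below scans a sequence for adenosine runs of at least a given length and reports the flanking windows. The function name, parameters, and toy sequence are hypothetical; no real EBOV sequence is used, and the sketch does not reproduce the authors' minigenome assay or predict editing.

```python
import re


def find_a_runs(seq, min_len=7, flank=9):
    """Return every run of >= min_len adenosines with its flanking windows.

    Coordinates are 0-based and half-open. Flanks are truncated at the
    sequence ends.
    """
    seq = seq.upper()
    hits = []
    for match in re.finditer(r"A{%d,}" % min_len, seq):
        start, end = match.span()
        hits.append(
            {
                "start": start,
                "end": end,
                "upstream": seq[max(0, start - flank):start],
                "downstream": seq[end:end + flank],
            }
        )
    return hits
```

Applied to a toy string such as `"GCGCGCGCG" + "A" * 7 + "CUCUCUCUC"`, the function reports one run at positions 9 to 16 together with its two 9-nt flanks. As the abstract notes for the polymerase gene, the presence of such a run alone does not make it an editing site, so the flanks are returned for separate inspection.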
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

DEFF Research Database (Denmark)

Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha

2017-01-01

variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced...... individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics...... from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D....
PARN and TOE1 Constitute a 3′ End Maturation Module for Nuclear Non-coding RNAs

Directory of Open Access Journals (Sweden)

Ahyeon Son

2018-04-01

Full Text Available Summary: Poly(A)-specific ribonuclease (PARN) and target of EGR1 protein 1 (TOE1) are nuclear granule-associated deadenylases, whose mutations are linked to multiple human diseases. Here, we applied mTAIL-seq and RNA sequencing (RNA-seq) to systematically identify the substrates of PARN and TOE1 and elucidate their molecular functions. We found that PARN and TOE1 do not modulate the length of mRNA poly(A) tails. Rather, they promote the maturation of nuclear small non-coding RNAs (ncRNAs). PARN and TOE1 act redundantly on some ncRNAs, most prominently small Cajal body-specific RNAs (scaRNAs). scaRNAs are strongly downregulated when PARN and TOE1 are compromised together, leading to defects in small nuclear RNA (snRNA) pseudouridylation. They also function redundantly in the biogenesis of telomerase RNA component (TERC), which shares sequence motifs found in H/ACA box scaRNAs. Our findings extend the knowledge of nuclear ncRNA biogenesis, and they provide insights into the pathology of PARN/TOE1-associated genetic disorders whose therapeutic treatments are currently unavailable. By analyzing the 3′ termini of transcriptome, Son et al. reveal the targets of PARN and TOE1, two nuclear deadenylases with disease associations. Both deadenylases are involved in nuclear small non-coding RNA maturation, but not in mRNA deadenylation. Their combined activity is particularly important for biogenesis of scaRNAs and TERC. Keywords: PARN, TOE1, CAF1Z, deadenylase, 3′ end maturation, adenylation, deadenylation, scaRNA, TERC
Viral dark matter and virus–host interactions resolved from publicly available microbial genomes

Science.gov (United States)

Roux, Simon; Hallam, Steven J; Woyke, Tanja; Sullivan, Matthew B

2015-01-01

The ecological importance of viruses is now widely recognized, yet our limited knowledge of viral sequence space and virus–host interactions precludes accurate prediction of their roles and impacts. In this study, we mined publicly available bacterial and archaeal genomic data sets to identify 12,498 high-confidence viral genomes linked to their microbial hosts. These data augment public data sets 10-fold, provide first viral sequences for 13 new bacterial phyla including ecologically abundant phyla, and help taxonomically identify 7–38% of ‘unknown’ sequence space in viromes. Genome- and network-based classification was largely consistent with accepted viral taxonomy and suggested that (i) 264 new viral genera were identified (doubling known genera) and (ii) cross-taxon genomic recombination is limited. Further analyses provided empirical data on extrachromosomal prophages and coinfection prevalences, as well as evaluation of in silico virus–host linkage predictions. Together these findings illustrate the value of mining viral signal from microbial genomes. DOI: http://dx.doi.org/10.7554/eLife.08490.001 PMID:26200428
Prediction Error During Functional and Non-Functional Action Sequences

DEFF Research Database (Denmark)

Nielbo, Kristoffer Laigaard; Sørensen, Jesper

2013-01-01

recurrent networks were made and the results are presented in this article. The simulations show that non-functional action sequences do indeed increase prediction error, but that context representations, such as abstract goal information, can modulate the error signal considerably. It is also shown...... that the networks are sensitive to boundaries between sequences in both functional and non-functional actions....
Inferences about the global scenario of human T-cell lymphotropic virus type 1 infection using data mining of viral sequences

Directory of Open Access Journals (Sweden)

Thessika Hialla Almeida Araujo

2014-07-01

Full Text Available Human T-cell lymphotropic virus type 1 (HTLV-1) is mainly associated with two diseases: tropical spastic paraparesis/HTLV-1-associated myelopathy (TSP/HAM) and adult T-cell leukaemia/lymphoma. This retrovirus infects 5-10 million individuals throughout the world. Previously, we developed a database that annotates sequence data from GenBank and the present study aimed to describe the clinical, molecular and epidemiological scenarios of HTLV-1 infection through the stored sequences in this database. A total of 2,545 registered complete and partial sequences of HTLV-1 were collected and 1,967 (77.3%) of those sequences represented unique isolates. Among these isolates, 93% contained geographic origin information and only 39% were related to any clinical status. A total of 1,091 sequences contained information about the geographic origin and viral subtype and 93% of these sequences were identified as subtype “a”. Ethnicity data are very scarce. Regarding clinical status data, 29% of the sequences were generated from TSP/HAM and 67.8% from healthy carrier individuals. Although the data mining enabled some inferences about specific aspects of HTLV-1 infection to be made, due to the relative scarcity of data of available sequences, it was not possible to delineate a global scenario of HTLV-1 infection.

At the intersection of non-coding transcription, DNA repair, chromatin structure, and cellular senescence

Directory of Open Access Journals (Sweden)

Ryosuke Ohsawa

2013-07-01

Full Text Available It is well accepted that non-coding RNAs play a critical role in regulating gene expression. Recent paradigm-setting studies are now revealing that non-coding RNAs, other than microRNAs, also play intriguing roles in the maintenance of chromatin structure, in the DNA damage response, and in adult human stem cell aging. In this review, we will discuss the complex inter-dependent relationships among non-coding RNA transcription, maintenance of genomic stability, chromatin structure and adult stem cell senescence. DNA damage-induced non-coding RNAs transcribed in the vicinity of the DNA break regulate recruitment of the DNA damage machinery and DNA repair efficiency. We will discuss the correlation between non-coding RNAs and DNA damage repair efficiency and the potential role of changing chromatin structures around double-strand break sites. On the other hand, induction of non-coding RNA transcription from the repetitive Alu elements occurs during human stem cell aging and hinders efficient DNA repair causing entry into senescence. We will discuss how this fine balance between transcription and genomic instability may be regulated by the dramatic changes to chromatin structure that accompany cellular senescence.
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

Science.gov (United States)

Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

2015-01-01

Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930
Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

Directory of Open Access Journals (Sweden)

Kacy L Gordon

2015-05-01

Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.
Analysis of metagenomic data reveals common features of halophilic viral communities across continents.

Science.gov (United States)

Roux, Simon; Enault, Francois; Ravet, Viviane; Colombet, Jonathan; Bettarel, Yvan; Auguet, Jean-Christophe; Bouvier, Thierry; Lucas-Staat, Soizick; Vellet, Agnès; Prangishvili, David; Forterre, Patrick; Debroas, Didier; Sime-Ngando, Telesphore

2016-03-01

Microbial communities from hypersaline ponds, dominated by halophilic archaea, are considered specific to such extreme conditions. The associated viral communities have accordingly been shown to display specific features, such as similar morphologies among different sites. However, little is known about the genetic diversity of these halophilic viral communities across the Earth. Here, we studied viral communities in hypersaline ponds sampled on the coast of Senegal (8-36% salinity) using a metagenomic approach, and compared them with hypersaline viromes from Australia and Spain. The specificity of hyperhalophilic viruses could first be demonstrated at a community scale, salinity being a strong discriminating factor between communities. For the major viral group detected in all samples (Caudovirales), only a limited number of halophilic Caudovirales clades were highlighted. These clades gather viruses from different continents and display consistent genetic composition, indicating that they represent related lineages with a worldwide distribution. Non-tailed hyperhalophilic viruses display a greater rate of gene transfer and recombination, with uncharacterized genes conserved across different kinds of viruses and plasmids. Thus, hypersaline viral communities around the world appear to form a genetically consistent community that is likely to harbour new genes coding for enzymes specifically adapted to these environments. © 2015 Society for Applied Microbiology and John Wiley & Sons Ltd.
Detection of viral sequence fragments of HIV-1 subfamilies yet unknown

Directory of Open Access Journals (Sweden)

Stanke Mario

2011-04-01

Full Text Available Abstract Background Methods of determining whether or not any particular HIV-1 sequence stems - completely or in part - from some unknown HIV-1 subtype are important for the design of vaccines and molecular detection systems, as well as for epidemiological monitoring. Nevertheless, a single algorithm only, the Branching Index (BI), has been developed for this task so far. Moving along the genome of a query sequence in a sliding window, the BI computes a ratio quantifying how closely the query sequence clusters with a subtype clade. In its current version, however, the BI does not provide predicted boundaries of unknown fragments. Results We have developed Unknown Subtype Finder (USF), an algorithm based on a probabilistic model, which automatically determines which parts of an input sequence originate from a subtype yet unknown. The underlying model is based on a simple profile hidden Markov model (pHMM) for each known subtype and an additional pHMM for an unknown subtype. The emission probabilities of the latter are estimated using the emission frequencies of the known subtypes by means of a (position-wise) probabilistic model for the emergence of new subtypes. We have applied USF to SIV and HIV-1 sequences formerly classified as having emerged from an unknown subtype. Moreover, we have evaluated its performance on artificial HIV-1 recombinants and non-recombinant HIV-1 sequences. The results have been compared with the corresponding results of the BI. Conclusions Our results demonstrate that USF is suitable for detecting segments in HIV-1 sequences stemming from yet unknown subtypes. Comparing USF with the BI shows that our algorithm performs as well as the BI or better.
Bombyx mori nucleopolyhedrovirus ORF54, a viral desmoplakin gene, is associated with the infectivity of budded virions.

Science.gov (United States)

Zhang, Min-Juan; Tian, Cai-Hong; Fan, Xiao-Ying; Lou, Yi-Han; Cheng, Ruo-Lin; Zhang, Chuan-Xi

2012-07-01

Bombyx mori nucleopolyhedrovirus (BmNPV) ORF54 (Bm54), a member of the viral desmoplakin N-terminus superfamily, is homologous to Autographa californica nucleopolyhedrovirus (AcMNPV) ORF66, which is required for the efficient egress of nucleocapsids from the nucleus and occlusion body formation. In this paper, we generated a bacmid with the Bm54 gene deleted via homologous recombination in Escherichia coli and characterized the mutant virus using a transfection-infection assay and transmission electron microscopy analysis. Our results demonstrated that the cells transfected with viral DNA lacking Bm54 produced non-infectious budded viruses (BVs). Electron microscopy showed that although the deletion of Bm54 did not affect assembly and release of nucleocapsids, it severely affected polyhedron formation. In conclusion, deletion of Bm54 resulted in non-infectious BV and defective polyhedra. Although the sequences of Bm54 and Ac66 are very similar, the two genes function quite differently in the regulation of viral life cycle.
Non-Coding RNAs and Endometrial Cancer

Directory of Open Access Journals (Sweden)

Cristina Vallone

2018-03-01

Full Text Available Non-coding RNAs (ncRNAs) are involved in the regulation of cell metabolism and neoplastic transformation. Recent studies have tried to clarify the significance of these information carriers in the genesis and progression of various cancers and their use as biomarkers for the disease; possible targets for the inhibition of growth and invasion by the neoplastic cells have been suggested. The significance of ncRNAs in lung cancer, bladder cancer, kidney cancer, and melanoma has been amply investigated with important results. Recently, the role of long non-coding RNAs (lncRNAs) has also been included in cancer studies. Studies on the relation between endometrial cancer (EC) and ncRNAs, such as small ncRNAs or micro RNAs (miRNAs), transfer RNAs (tRNAs), ribosomal RNAs (rRNAs), antisense RNAs (asRNAs), small nuclear RNAs (snRNAs), Piwi-interacting RNAs (piRNAs), small nucleolar RNAs (snoRNAs), competing endogenous RNAs (ceRNAs), lncRNAs, and long intergenic ncRNAs (lincRNAs) have been published. The recent literature produced in the last three years was extracted from PubMed by two independent readers, which was then selected for the possible relation between ncRNAs, oncogenesis in general, and EC in particular.
Junk DNA and the long non-coding RNA twist in cancer genetics

NARCIS (Netherlands)

H. Ling (Hui); K. Vincent; M. Pichler; R. Fodde (Riccardo); I. Berindan-Neagoe (Ioana); F.J. Slack (Frank); G.A. Calin (George)

2015-01-01

The central dogma of molecular biology states that the flow of genetic information moves from DNA to RNA to protein. However, in the last decade this dogma has been challenged by new findings on non-coding RNAs (ncRNAs) such as microRNAs (miRNAs). More recently, long non-coding RNAs
Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.

Science.gov (United States)

Hiscock, D; Upton, C

2000-05-01

The Viral Genome DataBase (VGDB) contains detailed information on the genes and predicted protein sequences from 15 completely sequenced genomes of large (>100 kb) viruses (2847 genes). The stored data include DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a MySQL database with a user-friendly Java GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .
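Several of the per-gene parameters listed in this record (A + T%, nucleotide frequency, dinucleotide frequency) are simple composition statistics. The following is a minimal sketch of how they could be computed for one DNA sequence; the function name `composition_stats` and the returned dictionary layout are illustrative assumptions and are not taken from the VGDB software.

```python
from collections import Counter


def composition_stats(seq: str) -> dict:
    """Return A+T%, nucleotide and dinucleotide frequencies for a DNA string.

    Hypothetical helper for illustration; not part of VGDB.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        raise ValueError("sequence must not be empty")

    # Single-nucleotide counts, e.g. {"A": 2, "T": 1, ...}
    nucleotide_counts = Counter(seq)

    # Overlapping dinucleotides: positions (0,1), (1,2), ...
    dinucleotide_counts = Counter(seq[i:i + 2] for i in range(n - 1))
    n_dinucleotides = max(n - 1, 1)

    return {
        "at_percent": 100.0 * (nucleotide_counts["A"] + nucleotide_counts["T"]) / n,
        "nucleotide_freq": {b: c / n for b, c in nucleotide_counts.items()},
        "dinucleotide_freq": {
            d: c / n_dinucleotides for d, c in dinucleotide_counts.items()
        },
    }


if __name__ == "__main__":
    stats = composition_stats("ATGC")
    print(stats["at_percent"])
    print(stats["dinucleotide_freq"])
```

Dinucleotides are counted with overlap, which is the usual convention for composition profiles; a database would typically store these values per gene so that query results can be sorted by them.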
An expanding universe of the non-coding genome in cancer biology.

Science.gov (United States)

Xue, Bin; He, Lin

2014-06-01

Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
Non-viral gene therapy that targets motor neurons in vivo

Directory of Open Access Journals (Sweden)

Mary-Louise Rogers

2014-10-01

A major challenge in neurological gene therapy is safe delivery of transgenes to sufficient cell numbers from the circulation or periphery. This is particularly difficult for diseases involving spinal cord motor neurons such as amyotrophic lateral sclerosis (ALS). We have examined the feasibility of non-viral gene delivery to spinal motor neurons from intraperitoneal injections of plasmids carried by 'immunogene' nanoparticles targeted for axonal retrograde transport using antibodies. PEGylated polyethylenimine (PEI-PEG12) as DNA carrier was conjugated to an antibody (MLR2) to the neurotrophin receptor p75 (p75NTR). We used a plasmid (pVIVO2) designed for in vivo gene delivery that produces minimal immune responses, has improved nuclear entry into post-mitotic cells and also expresses green fluorescent protein (GFP). MLR2-PEI-PEG12 carried pVIVO2 and was specific for mouse motor neurons in mixed cultures containing astrocytes. While only 8% of motor neurons expressed GFP 72 h post transfection in vitro, when the immunogene was given intraperitoneally to neonatal C57BL/6J mice, GFP-specific motor neuron expression was observed in 25.4% of lumbar, 18.3% of thoracic and 17.0% of cervical motor neurons 72 h post transfection. PEI-PEG12 carrying pVIVO2 by itself did not transfect motor neurons in vivo, demonstrating the need for specificity via the p75NTR antibody MLR2. This is the first time that specific transfection of spinal motor neurons has been achieved from peripheral delivery of plasmid DNA as part of a non-viral gene delivery agent. These results stress the specificity and feasibility of immunogene delivery targeted for p75NTR-expressing motor neurons, but suggest that further improvements are required to increase the transfection efficiency of motor neurons in vivo.
Design of ACM system based on non-greedy punctured LDPC codes

Science.gov (United States)

Lu, Zijun; Jiang, Zihong; Zhou, Lin; He, Yucheng

2017-08-01

In this paper, an adaptive coded modulation (ACM) scheme based on rate-compatible LDPC (RC-LDPC) codes was designed. The RC-LDPC codes were constructed by a non-greedy puncturing method which showed good performance in the high code rate region. Moreover, an incremental redundancy scheme for the LDPC-based ACM system over the AWGN channel was proposed. With this scheme, code rates vary from 2/3 to 5/6 and the complexity of the ACM system is lowered. Simulations show that an increasingly evident coding gain can be obtained by the proposed ACM system with higher throughput.
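The rate range quoted in this record follows from the basic arithmetic of puncturing: if a mother code with k information bits and n coded bits has p coded bits withheld from transmission, the effective rate becomes k / (n - p). The sketch below illustrates only that arithmetic. It assumes the mother code sits at the lowest rate (2/3), and the block length (n = 1200, k = 800) and both function names are hypothetical; the record does not describe the authors' non-greedy selection of which bits to puncture, and none of that is modelled here.

```python
from fractions import Fraction


def punctured_rate(n: int, k: int, punctured: int) -> Fraction:
    """Effective code rate after withholding `punctured` of the n coded bits."""
    if not 0 <= punctured <= n - k:
        raise ValueError("punctured must lie between 0 and n - k")
    return Fraction(k, n - punctured)


def bits_to_puncture(n: int, k: int, target: Fraction) -> int:
    """Number of coded bits to puncture so that the rate equals `target`."""
    transmitted = Fraction(k) / target  # required number of transmitted bits
    if transmitted.denominator != 1:
        raise ValueError("target rate is not reachable with an integer bit count")
    punctured = n - transmitted.numerator
    if not 0 <= punctured <= n - k:
        raise ValueError("target rate is outside the range of this mother code")
    return punctured


if __name__ == "__main__":
    n, k = 1200, 800  # hypothetical rate-2/3 mother code
    p = bits_to_puncture(n, k, Fraction(5, 6))
    print(p, punctured_rate(n, k, p))
```

In an incremental redundancy setting the punctured bits are not discarded for good: they can be sent later, which moves the effective rate back down toward that of the mother code.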
Non-viral gene delivery strategies for cancer therapy, tissue engineering and regenerative medicine

Science.gov (United States)

Bhise, Nupura S.

Gene therapy involves the delivery of deoxyribonucleic acid (DNA) into cells to override or replace a malfunctioning gene for treating debilitating genetic diseases, including cancer and neurodegenerative diseases. In addition to its use as a therapeutic, it can also serve as a technology to enable regenerative medicine strategies. The central challenge of the gene therapy research arena is developing a safe and effective delivery agent. Since viral vectors have critical immunogenic and tumorigenic safety issues that limit their clinical use, recent efforts have focused on developing non-viral biomaterial-based delivery vectors. Cationic polymers are an attractive class of gene delivery vectors due to their structural versatility, ease of synthesis, biodegradability, ability to self-complex into nanoparticles with negatively charged DNA, capacity to carry large cargo, cellular uptake and endosomal escape capacity. In this thesis, we hypothesized that developing a biomaterial library of poly(beta-amino esters) (PBAE), a newer class of cationic polymers consisting of biodegradable ester groups, would allow investigating vector design parameters and formulating effective non-viral gene delivery strategies for cancer drug delivery, tissue engineering and stem cell engineering. Consequently, a high-throughput transfection assay was developed to screen the PBAE-based nanoparticles in hard-to-transfect fibroblast cell lines. To gain mechanistic insights into the nanoparticle formulation process, biophysical properties of the vectors were characterized in terms of molecular weight (MW), nanoparticle size, zeta potential and plasmid per particle count. We report a novel assay developed for quantifying the plasmid per nanoparticle count and studying its implications for co-delivery of multiple genes. The MW of the polymers ranged from 10 kDa to 100 kDa, nanoparticle size was about 150 nm, zeta potential was about 30 mV in sodium acetate buffer (25 mM, pH 5) and 30 to 100
From structure prediction to genomic screens for novel non-coding RNAs.

Directory of Open Access Journals (Sweden)

Jan Gorodkin

2011-08-01

Non-coding RNAs (ncRNAs) are receiving more and more attention not only as an abundant class of genes, but also as regulatory structural elements (some located in mRNAs). A key feature of RNA function is its structure. Computational methods were developed early for folding and prediction of RNA structure with the aim of assisting in functional analysis. With the discovery of more and more ncRNAs, it has become clear that a large fraction of these are highly structured. Interestingly, a large part of the structure is composed of regular Watson-Crick and GU wobble base pairs. This and the increased number of available genomes have made it possible to employ structure-based methods for genomic screens. The field has moved from folding prediction of single sequences to computational screens for ncRNAs in genomic sequence using the RNA structure as the main characteristic feature. Whereas early methods focused on energy-directed folding of single sequences, comparative analysis based on structure-preserving changes of base pairs has been efficient in improving accuracy, and today this constitutes a key component in genomic screens. Here, we cover the basic principles of RNA folding and touch upon some of the concepts in current methods that have been applied in genomic screens for de novo RNA structures in searches for novel ncRNA genes and regulatory RNA structure on mRNAs. We discuss the strengths and weaknesses of the different strategies and how they can complement each other.
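To make the notion of single-sequence folding over Watson-Crick and GU wobble pairs concrete, here is a minimal sketch of base-pair maximization in the style of the Nussinov dynamic program. This is a deliberately simpler stand-in and not the energy-directed folding the abstract refers to: it only counts pairs, ignores stacking and loop energies, and the minimum hairpin loop length of 3 and the function names are assumptions made for illustration.

```python
# Allowed pairs: Watson-Crick (A-U, G-C) plus the G-U wobble pair.
PAIRS = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}


def can_pair(a: str, b: str) -> bool:
    return (a, b) in PAIRS


def max_base_pairs(seq: str, min_loop: int = 3) -> int:
    """Maximum number of nested base pairs in an RNA sequence.

    dp[i][j] holds the best pair count for the subsequence seq[i..j].
    A pair (k, j) is only allowed if at least `min_loop` bases lie between.
    """
    seq = seq.upper()
    n = len(seq)
    if n == 0:
        return 0
    dp = [[0] * n for _ in range(n)]
    for span in range(min_loop + 1, n):
        for i in range(n - span):
            j = i + span
            best = dp[i][j - 1]  # case 1: base j stays unpaired
            for k in range(i, j - min_loop):  # case 2: base j pairs with k
                if can_pair(seq[k], seq[j]):
                    left = dp[i][k - 1] if k > i else 0
                    best = max(best, left + 1 + dp[k + 1][j - 1])
            dp[i][j] = best
    return dp[0][n - 1]


if __name__ == "__main__":
    # Hairpin with two G-C pairs and one G-U wobble pair closing an AAA loop.
    print(max_base_pairs("GGGAAAUCC"))
```

Comparative screens of the kind the abstract describes go further by asking whether pairs like these are preserved by compensating substitutions across aligned sequences, which a single-sequence recursion cannot capture.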
Low-Bandwidth and Non-Compute Intensive Remote Identification of Microbes from Raw Sequencing Reads

DEFF Research Database (Denmark)

Gautier, Laurent; Lund, Ole

2013-01-01

, allowing a fully automated processing of sequencing data and routine instant quality check of sequencing runs from desktop sequencers. A web access is available at http://tapir.cbs.dtu.dk. The source code for a python command-line client, a server, and supplementary data are available at http://bit.ly/1aURxkc....
Camps 2.0: exploring the sequence and structure space of prokaryotic, eukaryotic, and viral membrane proteins.

Science.gov (United States)

Neumann, Sindy; Hartmann, Holger; Martin-Galiano, Antonio J; Fuchs, Angelika; Frishman, Dmitrij

2012-03-01

Structural bioinformatics of membrane proteins is still in its infancy, and the picture of their fold space is only beginning to emerge. Because only a handful of three-dimensional structures are available, sequence comparison and structure prediction remain the main tools for investigating sequence-structure relationships in membrane protein families. Here we present a comprehensive analysis of the structural families corresponding to α-helical membrane proteins with at least three transmembrane helices. The new version of our CAMPS database (CAMPS 2.0) covers nearly 1300 eukaryotic, prokaryotic, and viral genomes. Using an advanced classification procedure, which is based on high-order hidden Markov models and considers both sequence similarity as well as the number of transmembrane helices and loop lengths, we identified 1353 structurally homogeneous clusters roughly corresponding to membrane protein folds. Only 53 clusters are associated with experimentally determined three-dimensional structures, and for these clusters CAMPS is in reasonable agreement with structure-based classification approaches such as SCOP and CATH. We therefore estimate that ∼1300 structures would need to be determined to provide a sufficient structural coverage of polytopic membrane proteins. CAMPS 2.0 is available at http://webclu.bio.wzw.tum.de/CAMPS2.0/. Copyright © 2011 Wiley Periodicals, Inc.
Divergent evolutionary rates in vertebrate and mammalian specific conserved non-coding elements (CNEs) in echolocating mammals.

Science.gov (United States)

Davies, Kalina T J; Tsagkogeorga, Georgia; Rossiter, Stephen J

2014-12-19

The majority of DNA contained within vertebrate genomes is non-coding, with a certain proportion of this thought to play regulatory roles during development. Conserved Non-coding Elements (CNEs) are an abundant group of putative regulatory sequences that are highly conserved across divergent groups and thus assumed to be under strong selective constraint. Many CNEs may contain regulatory factor binding sites, and their frequent spatial association with key developmental genes - such as those regulating sensory system development - suggests crucial roles in regulating gene expression and cellular patterning. Yet surprisingly little is known about the molecular evolution of CNEs across diverse mammalian taxa or their role in specific phenotypic adaptations. We examined 3,110 vertebrate-specific and ~82,000 mammalian-specific CNEs across 19 and 9 mammalian orders respectively, and tested for changes in the rate of evolution of CNEs located in the proximity of genes underlying the development or functioning of auditory systems. As we focused on CNEs putatively associated with genes underlying the development/functioning of auditory systems, we incorporated echolocating taxa in our dataset because of their highly specialised and derived auditory systems. Phylogenetic reconstructions of concatenated CNEs broadly recovered accepted mammal relationships despite high levels of sequence conservation. We found that CNE substitution rates were highest in rodents and lowest in primates, consistent with previous findings. Comparisons of CNE substitution rates from several genomic regions containing genes linked to auditory system development and hearing revealed differences between echolocating and non-echolocating taxa. Wider taxonomic sampling of four CNEs associated with the homeobox genes Hmx2 and Hmx3 - which are required for inner ear development - revealed family-wise variation across diverse bat species. Specifically within one family of echolocating bats that utilise
Recoding method that removes inhibitory sequences and improves HIV gene expression

Energy Technology Data Exchange (ETDEWEB)

Rabadan, Raul; Krasnitz, Michael; Robins, Harlan; Witten, Daniela; Levine, Arnold

2016-08-23

The invention relates to inhibitory nucleotide signal sequences or "INS" sequences in the genomes of lentiviruses. In particular the invention relates to the AGG motif present in all viral genomes. The AGG motif may have an inhibitory effect on a virus, for example by reducing the levels of, or maintaining low steady-state levels of, viral RNAs in host cells, and inducing and/or maintaining viral latency. In one aspect, the invention provides vaccines that contain, or are produced from, viral nucleic acids in which the AGG sequences have been mutated. In another aspect, the invention provides methods and compositions for affecting the function of the AGG motif, and methods for identifying other INS sequences in viral genomes.
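As a rough illustration of what locating and mutating AGG sequences can look like in practice, the sketch below finds every AGG occurrence in a sequence and replaces in-frame AGG codons with a synonymous arginine codon. It is not the method claimed in this record: the function names, the choice of CGC as the replacement codon, and the restriction to in-frame codons are all assumptions, and AGG occurrences that straddle two codons are left untouched.

```python
def find_motif(seq: str, motif: str = "AGG") -> list:
    """Return 0-based start positions of every (overlapping) motif occurrence."""
    seq = seq.upper()
    motif = motif.upper()
    width = len(motif)
    return [i for i in range(len(seq) - width + 1) if seq[i:i + width] == motif]


def recode_in_frame_agg(cds: str, replacement: str = "CGC") -> str:
    """Swap in-frame AGG codons for another arginine codon.

    AGG and CGC both encode arginine in the standard genetic code, so the
    protein sequence is unchanged. Out-of-frame AGG motifs are not handled.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    codons = [cds[i:i + 3] for i in range(0, len(cds), 3)]
    return "".join(replacement if codon == "AGG" else codon for codon in codons)


if __name__ == "__main__":
    cds = "ATGAGGTAA"  # toy coding sequence: Met-Arg-stop
    print(find_motif(cds))
    print(recode_in_frame_agg(cds))
```

Removing motifs that cross codon boundaries would require choosing synonymous codons for both neighbouring positions at once, which is why the sketch reports all positions but only rewrites the in-frame ones.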
A versatile family of degradable non-viral gene carriers based on hyperbranched poly(ester amine)s

NARCIS (Netherlands)

Zhong, Zhiyuan; Song, Y.; Engbersen, Johannes F.J.; Lok, Martin C.; Hennink, Wim E.; Feijen, Jan

2005-01-01

A variety of degradable hyperbranched poly(ester amine)s containing primary, secondary and tertiary amino groups, were synthesized and evaluated as non-viral gene carriers. The polymers were obtained in high yields through a Michael-type conjugate addition of diacrylate monomers with trifunctional
Episodic sequence memory is supported by a theta-gamma phase code

OpenAIRE

Heusser, Andrew C.; Poeppel, David; Ezzyat, Youssef; Davachi, Lila

2016-01-01

The meaning we derive from our experiences is not a simple static extraction of the elements, but is largely based on the order in which those elements occur. Models propose that sequence encoding is supported by interactions between high and low frequency oscillations, such that elements within an experience are represented by neural cell assemblies firing at higher frequencies (i.e. gamma) and sequential order is coded by the specific timing of firing with respect to a lower frequency oscil...

u-Constacyclic codes over F_p + uF_p and their applications of constructing new non-binary quantum codes

Science.gov (United States)

Gao, Jian; Wang, Yongkang

2018-01-01

Structural properties of u-constacyclic codes over the ring F_p + uF_p are given, where p is an odd prime and u^2 = 1. Under a special Gray map from F_p + uF_p to F_p^2, some new non-binary quantum codes are obtained by this class of constacyclic codes.
Long non-coding RNA expression profile in cervical cancer tissues

Science.gov (United States)

Zhu, Hua; Chen, Xiangjian; Hu, Yan; Shi, Zhengzheng; Zhou, Qing; Zheng, Jingjie; Wang, Yifeng

2017-01-01

Cervical cancer (CC), one of the most common types of cancer in the female population, presents an enormous challenge in diagnosis and treatment. Long non-coding (lnc)RNAs, non-coding (nc)RNAs with length >200 nucleotides, have been identified to be associated with multiple types of cancer, including CC. This class of nc transcripts serves an important role in tumor suppression and oncogenic signaling pathways. In the present study, the microarray method was used to obtain the expression profile of lncRNAs and protein-coding mRNAs and to compare the expression of lncRNAs between CC tissues and corresponding adjacent non-cancerous tissues in order to screen potential lncRNAs for associations with CC. Overall, 3,356 lncRNAs with significantly different expression patterns in CC tissues compared with adjacent non-cancerous tissues were identified, of which 1,857 were upregulated. These differentially expressed lncRNAs were additionally classified into 5 subgroups. Reverse transcription quantitative polymerase chain reactions were performed to validate the expression pattern of 5 randomly selected lncRNAs, and 2 lncRNAs were identified to have significantly different expression in CC samples compared with adjacent non-cancerous tissues. This finding suggests that those lncRNAs with different expression may serve important roles in the development of CC, and the expression data may provide information for additional study on the involvement of lncRNAs in CC. PMID:28789353
Viral indicators for fecal contamination - a one-year viral metagenomic study of treatment efficiency in Danish waste water treatment plants

DEFF Research Database (Denmark)

Hellmér, Maria; Stranddorf, Kasper; Seidel, Michael

2017-01-01

from two urban waste water treatment plants in Copenhagen. All samples are investigated for their viral content and the presence of pathogens by metagenomic sequencing and analyzed specifically for HAdV, JCPyV, norovirus GI and GII (NoV GI and GII) using quantitative (q)PCR. Preliminary qPCR results......, the number of identified pathogenic viral species decreases with treatment of the waste water. Further bioinformatic analyses will investigate the seasonal variations of viral composition within a sample as well as the effect of the treatment system. Updated qPCR and metagenomics data will be presented....... are therefore using metagenomics sequencing with the aim to map the viriome in different water sources. In addition we investigate the possibility to use Human Adenovirus (HAdV) or JC Polyomavirus (JCPyV) as indicator for human fecal contamination. Water has been sampled monthly throughout the treatment process...
Code-Switching to Know a TL Equivalent of an L1 Word: Request-Provision-Acknowledgement (RPA) Sequence

Science.gov (United States)

Lucero, Edgar

2011-01-01

This article focuses on the learner's use of Code-switching to learn the TL (Target Language) equivalent of an L1 word. The interactional pattern that this situation creates defines the Request-Provision-Acknowledgement (RPA) sequence. The article explains each of the turns of the sequence under the combination of the Ethnomethodological…
Effect of oligonucleotide primers in determining viral variability within hosts

Directory of Open Access Journals (Sweden)

Moya Andrés

2004-12-01

Background: Genetic variability in viral populations is usually estimated by means of polymerase chain reaction (PCR) based methods in which the relative abundance of each amplicon is assumed to be proportional to the frequency of the corresponding template in the initial sample. Although bias in template-to-product ratios has been described before, its relevance in describing viral genetic variability at the intrapatient level has not been fully assessed yet. Results: To investigate the role of oligonucleotide design in estimating viral variability within hosts, genetic diversity in hepatitis C virus (HCV) populations from eight infected patients was characterised by two parallel PCR amplifications performed with two slightly different sets of primers, followed by cloning and sequencing (mean = 89 cloned sequences per patient). Population genetics analyses of viral populations recovered by pairs of amplifications revealed that in seven patients statistically significant differences were detected between populations sampled with different sets of primers. Conclusions: Genetic variability analyses demonstrate that PCR selection due to the choice of primers, differing in their degree of degeneracy at some nucleotide positions, can totally or partially eclipse viral variants, hence yielding significantly different estimates of viral variability within a single patient and therefore eventually producing quite different qualitative and quantitative descriptions of viral populations within each host.
Effect of oligonucleotide primers in determining viral variability within hosts.

Science.gov (United States)

Bracho, Maria Alma; García-Robles, Inmaculada; Jiménez, Nuria; Torres-Puente, Manuela; Moya, Andrés; González-Candelas, Fernando

2004-12-09

Genetic variability in viral populations is usually estimated by means of polymerase chain reaction (PCR) based methods in which the relative abundance of each amplicon is assumed to be proportional to the frequency of the corresponding template in the initial sample. Although bias in template-to-product ratios has been described before, its relevance in describing viral genetic variability at the intrapatient level has not been fully assessed yet. To investigate the role of oligonucleotide design in estimating viral variability within hosts, genetic diversity in hepatitis C virus (HCV) populations from eight infected patients was characterised by two parallel PCR amplifications performed with two slightly different sets of primers, followed by cloning and sequencing (mean = 89 cloned sequences per patient). Population genetics analyses of viral populations recovered by pairs of amplifications revealed that in seven patients statistically significant differences were detected between populations sampled with different sets of primers. Genetic variability analyses demonstrate that PCR selection due to the choice of primers, differing in their degree of degeneracy at some nucleotide positions, can totally or partially eclipse viral variants, hence yielding significantly different estimates of viral variability within a single patient and therefore eventually producing quite different qualitative and quantitative descriptions of viral populations within each host.
Seroprevalence of some bovine viral respiratory diseases among non vaccinated cattle in Saudi Arabia

Directory of Open Access Journals (Sweden)

Mohamed Abd El Fatah Mahmoud

2013-02-01

Aim: Four viral pathogens, bovine viral diarrhea virus (BVDV), bovine herpes virus type 1 (BHV-1), bovine parainfluenza type 3 virus (PI-3V) and bovine respiratory syncytial virus (BRSV), are mainly associated with bovine respiratory diseases that cause major economic losses in the dairy cattle industry. This study aimed to document exposure of non-vaccinated cattle in Saudi Arabia to BVDV, BHV-1, PI-3V and BRSV in order to obtain epidemiological and immunological information. Materials and Methods: In the present study, 460 random serum samples were obtained from non-vaccinated cattle in five districts (Riyadh, Eastern Province, Jizan, Najran, Asir) of Saudi Arabia between January and March 2011. These samples were tested for the presence of antibodies against BVDV, BHV-1, BRSV and PI-3V by commercial indirect ELISA kits. Results: Seropositivity rates were 26% for BVD, 17.4% for BHV-1, 69.1% for PI-3V and 75.6% for BRSV in the sampled population. In addition, coinfections with more than one virus were considerably common among non-vaccinated dairy cattle. Conclusion: These results indicate that exposure to these agents is common within the study areas. Preventive and control measures against these infectious agents should therefore be adopted. [Vet World 2013; 6(1): 1-4]
Root cause investigation of a viral contamination incident occurred during master cell bank (MCB) testing and characterization--a case study.

Science.gov (United States)

Chen, Dayue; Nims, Raymond; Dusing, Sandra; Miller, Pamela; Luo, Wen; Quertinmont, Michelle; Parekh, Bhavin; Poorbaugh, Josh; Boose, Jeri Ann; Atkinson, E Morrey

2008-11-01

An adventitious agent contamination occurred during a routine 9 CFR bovine viral screening test at BioReliance for an Eli Lilly Chinese Hamster Ovary (CHO) cell-derived Master Cell Bank (MCB) intended for biological production. Scientists from the sponsor (Eli Lilly and Company) and the testing service company (BioReliance) jointly conducted a systematic investigation in an attempt to determine the root cause of the contamination. Our investigation resulted in the identification of the viral nature of the contaminant. Subsequent experiments indicated that the viral contaminant was a non-enveloped and non-hemadsorbing virus. Transmission electron microscopy (TEM) revealed that the viral contaminant was 25-30 nm in size and morphologically resembled viruses of the family Picornaviridae. The contaminant virus was readily inactivated when exposed to acidic pH, suggesting that the viral contaminant was a member of the rhinoviruses. Although incapable of infecting CHO cells, the viral contaminant replicated efficiently in Vero cells with a life cycle of approximately 16 h. Our investigation provided compelling data demonstrating that the viral contaminant did not originate from the MCB. Instead, it was introduced into the process during cell passaging and a possible entry point was proposed. We identified the viral contaminant as an equine rhinitis A virus using molecular cloning and DNA sequencing. Finally, our investigation led us to conclude that the source of the viral contaminant was the equine serum added to the cell growth medium in the 9 CFR bovine virus test.
Evolution of endogenous non-retroviral genes integrated into plant genomes

Directory of Open Access Journals (Sweden)

Hyosub Chu

2014-08-01

Numerous comparative genome analyses have revealed the wide extent of horizontal gene transfer (HGT) in living organisms, which contributes to their evolution and genetic diversity. Viruses play important roles in HGT. Endogenous viral elements (EVEs) are defined as viral DNA sequences present within the genomes of non-viral organisms. In eukaryotic cells, the majority of EVEs are derived from RNA viruses using reverse transcription. In contrast, endogenous non-retroviral elements (ENREs) are poorly studied. However, the increasing availability of genomic data and the rapid development of bioinformatics tools have enabled the identification of several ENREs in various eukaryotic organisms. To date, a small number of ENREs integrated into plant genomes have been identified. Of the known non-retroviruses, most identified ENREs are derived from double-strand (ds) RNA viruses, followed by single-strand (ss) DNA and ssRNA viruses. At least eight virus families have been identified. Of these, viruses in the family Partitiviridae are dominant, followed by viruses of the families Chrysoviridae and Geminiviridae. The ENREs have been found primarily in eudicots, followed by monocots. In this review, we briefly discuss the current view on non-retroviral sequences integrated into plant genomes that are associated with plant-virus evolution and their possible roles in antiviral resistance.
Complete Genome Sequence of Mulberry Vein Banding Associated Virus, a New Tospovirus Infecting Mulberry.

Directory of Open Access Journals (Sweden)

Jiaorong Meng

Mulberry vein banding associated virus (MVBaV), which infects mulberry plants with typical vein banding symptoms, had been identified as a tentative species of the genus Tospovirus based on the homology of its N gene sequence to those of tospoviruses. In this study, the complete sequence of the tripartite RNA genome of MVBaV was determined and analyzed. The L RNA has 8905 nucleotides (nt) and encodes the putative RNA-dependent RNA polymerase (RdRp) of 2877 amino acids (aa) in the viral complementary (vc) strand. The RdRp of MVBaV shares the highest aa sequence identity (85.9%) with that of Watermelon silver mottle virus (WSMoV), and contains conserved motifs shared with those of the species of the genus Tospovirus. The M RNA contains 4731 nt and codes in ambisense arrangement for the NSm protein of 309 aa in the sense strand and the Gn/Gc glycoprotein precursor (GP) of 1124 aa in the vc strand. The NSm and GP of MVBaV share the highest aa sequence identities with those of Capsicum chlorosis virus (CaCV) and Groundnut bud necrosis virus (GBNV) (83.2% and 84.3%, respectively). The S RNA is 3294 nt in length and contains two open reading frames (ORFs) in an ambisense coding strategy, encoding a 439-aa non-structural protein (NSs) and the 277-aa nucleocapsid protein (N), respectively. The NSs and N also share the highest aa sequence identity (71.1% and 74.4%, respectively) with those of CaCV. Phylogenetic analysis of the RdRp, NSm, GP, NSs, and N proteins showed that MVBaV is most closely related to CaCV and GBNV and that these proteins cluster with those of the WSMoV serogroup. MVBaV seems to be a species bridging the two subgroups within the WSMoV serogroup of tospoviruses in evolutionary terms, suggesting that MVBaV represents a distinct tospovirus. Analysis of the S RNA sequence uncovered the highly conserved 5'-/3'-ends and coding regions, and the variable region of the IGR with divergent patterns among MVBaV isolates.
Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

Directory of Open Access Journals (Sweden)

Pupko Tal

2008-10-01

Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR) sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM) that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers: This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.
Analysis of a new strain of Euphorbia mosaic virus with distinct replication specificity unveils a lineage of begomoviruses with short Rep sequences in the DNA-B intergenic region

Directory of Open Access Journals (Sweden)

Argüello-Astorga Gerardo R

2010-10-01

Full Text Available Abstract. Background: Euphorbia mosaic virus (EuMV) is a member of the SLCV clade, a lineage of New World begomoviruses that display distinctive features in their replication-associated protein (Rep) and virion-strand replication origin. The first entirely characterized EuMV isolate is native to the Yucatan Peninsula, Mexico; subsequently, EuMV was detected in weeds and pepper plants from another region of Mexico, and partial DNA-A sequences revealed significant differences in their putative replication specificity determinants with respect to EuMV-YP. This study aimed to investigate the replication compatibility between two EuMV isolates from the same country. Results: A new isolate of EuMV was obtained from pepper plants collected at Jalisco, Mexico. Full-length clones of both genomic components of EuMV-Jal were biolistically inoculated into plants of three different species, which developed symptoms indistinguishable from those induced by EuMV-YP. Pseudorecombination experiments with EuMV-Jal and EuMV-YP genomic components demonstrated that these viruses do not form infectious reassortants in Nicotiana benthamiana, presumably because of Rep-iteron incompatibility. Sequence analysis of the EuMV-Jal DNA-B intergenic region (IR) led to the unexpected discovery of a 35-nt-long sequence that is identical to a segment of the rep gene in the cognate viral DNA-A. Similar short rep sequences ranging from 35 to 51 nt in length were identified in all EuMV isolates and in three distinct viruses from South America related to EuMV. These short rep sequences in the DNA-B IR are positioned downstream of a ~160-nt non-coding domain highly similar to the CP promoter of begomoviruses belonging to the SLCV clade. Conclusions: EuMV strains are not compatible in replication, indicating that this begomovirus species probably is not a replicating lineage in nature. The genomic analysis of EuMV-Jal led to the discovery of a subgroup of SLCV clade viruses that contain in
Cryptographic pseudo-random sequences from the chaotic Hénon ...

Indian Academy of Sciences (India)

dimensional discrete-time Hénon map is proposed. Properties of the proposed sequences pertaining to linear complexity, linear complexity profile, correlation and auto-correlation are investigated. All these properties of the sequences suggest a ...
A Novel Type of Non-coding RNA, nc886, Implicated in Tumor Sensing and Suppression

Directory of Open Access Journals (Sweden)

Yong Sun Lee

2015-06-01

Full Text Available nc886 (=vtRNA2-1, pre-miR-886, or CBL3) is a newly identified non-coding RNA (ncRNA) that represses the activity of protein kinase R (PKR). nc886 is transcribed by RNA polymerase III (Pol III) and is intriguingly the first case of a Pol III gene whose expression is silenced by CpG DNA hypermethylation in several types of cancer. PKR is a sensor protein that recognizes evading viruses and induces apoptosis to eliminate infected cells. Like viral infection, nc886 silencing activates PKR and induces apoptosis. Thus, the significance of the nc886:PKR pathway in cancer is to sense and eliminate pre-malignant cells, which is analogous to PKR's role in cellular innate immunity. Beyond this tumor sensing role, nc886 plays a putative tumor suppressor role as supported by experimental evidence. Collectively, nc886 provides a novel example of how epigenetic silencing of a ncRNA contributes to tumorigenesis by controlling the activity of its protein ligand.
Patterns of oligonucleotide sequences in viral and host cell RNA identify mediators of the host innate immune system.

Directory of Open Access Journals (Sweden)

Benjamin D Greenbaum

Full Text Available The innate immune response provides a first line of defense against pathogens by targeting generic differential features that are present in foreign organisms but not in the host. These innate responses generate selection forces acting both in pathogens and hosts that further determine their co-evolution. Here we analyze the nucleic acid sequence fingerprints of these selection forces acting in parallel on both host innate immune genes and ssRNA viral genomes. We do this by identifying dinucleotide biases in the coding regions of innate immune response genes in plasmacytoid dendritic cells, and then use this signal to identify other significant host innate immune genes. The persistence of these biases in the orthologous groups of genes in humans and chickens is also examined. We then compare the significant motifs in highly expressed genes of the innate immune system to those in ssRNA viruses and study the evolution of these motifs in the H1N1 influenza genome. We argue that the significant under-represented motif pattern of CpG in an AU context--which is found in both the ssRNA viruses and innate genes, and has decreased throughout the history of H1N1 influenza replication in humans--is immunostimulatory and has been selected against during the co-evolution of viruses and host innate immune genes. This shows how differences in host immune biology can drive the evolution of viruses that jump into species with different immune priorities than the original host.
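The notion of an "under-represented" dinucleotide such as CpG can be made concrete with a simple observed/expected odds ratio. The sketch below is only an illustration under stated assumptions: it uses the common ratio f(XY) / (f(X) f(Y)), which is not necessarily the statistic used in the study above, and the function name `dinucleotide_odds_ratio` is hypothetical.

```python
from collections import Counter


def dinucleotide_odds_ratio(seq: str, dinuc: str = "CG") -> float:
    """Observed/expected ratio f(XY) / (f(X) * f(Y)) for one dinucleotide.

    Values well below 1 indicate under-representation. This is a generic
    measure chosen for illustration, not the method of the cited study.
    """
    # Treat DNA and RNA alphabets alike by mapping T to U.
    seq = seq.upper().replace("T", "U")
    dinuc = dinuc.upper().replace("T", "U")
    n = len(seq)
    if n < 2 or len(dinuc) != 2:
        raise ValueError("need a sequence of length >= 2 and a 2-letter motif")

    mono = Counter(seq)
    pairs = Counter(seq[i:i + 2] for i in range(n - 1))

    f_x = mono[dinuc[0]] / n
    f_y = mono[dinuc[1]] / n
    if f_x == 0 or f_y == 0:
        return float("nan")  # ratio undefined if either base is absent
    f_xy = pairs[dinuc] / (n - 1)
    return f_xy / (f_x * f_y)


if __name__ == "__main__":
    # Toy sequences, not real viral or host data.
    print(dinucleotide_odds_ratio("CAGCAGCAGCAG"))
    print(dinucleotide_odds_ratio("CGCGCGCG"))
```

Restricting the count to CpG sites flanked by A or U would be a natural extension for the "AU context" described in the abstract; that refinement is left out here to keep the sketch short.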
Sequence Coding and Search System for licensee event reports: user's guide. Volume 1, Revision 1

International Nuclear Information System (INIS)

Greene, N.M.; Mays, G.T.; Johnson, M.P.

1985-04-01

Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This system provides a structured format for detailed coding of component, system, and unit effects as well as personnel errors. The database contains all current LERs submitted by nuclear power plant utilities for events occurring since 1981 and is updated on a continual basis. This four volume report documents and describes SCSS in detail. Volume 1 is a User's Guide for searching the SCSS database. This volume contains updated material through February 1985 of the working version of ORNL/NSIC-223, Vol. 1
Engineering cotton (Gossypium hirsutum L.) for resistance to cotton leaf curl disease using viral truncated AC1 DNA sequences.

Science.gov (United States)

Hashmi, Jamil A; Zafar, Yusuf; Arshad, Muhammad; Mansoor, Shahid; Asad, Shaheen

2011-04-01

Several important biological processes are performed by distinct functional domains found on the replication-associated protein (Rep) encoded by AC1 of geminiviruses. Two truncated forms of the replicase (tAC1) gene, capable of expressing only the N-terminal 669 bp (5'AC1) or the C-terminal 783 bp (3'AC1) and cloned under transcriptional control of the CaMV35S promoter, were introduced into cotton (Gossypium hirsutum L.) using the LBA4404 strain of Agrobacterium tumefaciens, with the aim of using an interference strategy to impair cotton leaf curl virus (CLCuV) infection in transgenic cotton. Compared with the nontransformed control, transgenic cotton plants overexpressing either N-terminal (5'AC1) or C-terminal (3'AC1) sequences showed resistance to CLCuV through inhibited replication of viral genomic and β satellite DNA components. Molecular analysis by Northern blot hybridization revealed high transgene expression in early and late growth stages associated with inhibition of CLCuV replication. Of the eight T(1) transgenic lines tested, six had delayed and minor symptoms as compared to nontransformed control lines, which developed disease symptoms 2-3 weeks after whitefly-mediated viral delivery. Virus biological assay and growth of T(2) plants showed that transgenic cotton plants overexpressing 5'AC1 and 3'AC1 displayed high resistance levels of up to 72% and 81%, respectively, as compared to non-transformed control plants following inoculation with viruliferous whiteflies, and gave a significantly high cotton seed yield. Progeny analysis of these plants by polymerase chain reaction (PCR), Southern blotting and virus biological assay showed stable transgene integration, inheritance and cotton leaf curl disease (CLCuD) resistance in two of the eight transgenic lines having single or two transgene insertions. Transgenic cotton expressing a partial AC1 gene of CLCuV can be used as a virus resistance source in cotton breeding programs aiming to improve virus resistance in the cotton crop.
Novel viral genomes identified from six metagenomes reveal wide distribution of archaeal viruses and high viral diversity in terrestrial hot springs

DEFF Research Database (Denmark)

Islin, Sóley Ruth; Menzel, Peter; Krogh, Anders

2016-01-01

Limited by culture-dependent methods the number of viruses identified from thermophilic Archaea and Bacteria is still very small. In this study we retrieved viral sequences from six hot spring metagenomes isolated worldwide, revealing a wide distribution of four archaeal viral families....... Among the novel genomes, one belongs to a putative thermophilic virus infecting the bacterium Hydrogenobaculum, for which no virus has been reported in the literature. Moreover, a high viral diversity was observed in the metagenomes, especially among the Lipothrixviridae, as indicated by the large...
Nucleotide sequence of the melA gene, coding for alpha-galactosidase in Escherichia coli K-12.

OpenAIRE

Liljeström, P L; Liljeström, P

1987-01-01

Melibiose uptake and hydrolysis in E. coli are performed by the MelB and MelA proteins, respectively. We report the cloning and sequencing of the melA gene. The nucleotide sequence data showed that melA codes for a 450-amino-acid protein with a molecular weight of 50.6 kDa. The sequence data also supported the assumption that the mel locus forms an operon with melA in the proximal position. A comparison of MelA with alpha-galactosidase proteins from yeast and human origin showed that these prot...
A search for RNA insertions and NS3 gene duplication in the genome of cytopathic isolates of bovine viral diarrhea virus

Directory of Open Access Journals (Sweden)

V.L. Quadros

2006-07-01

Full Text Available Calves born persistently infected with non-cytopathic bovine viral diarrhea virus (ncpBVDV) frequently develop a fatal gastroenteric illness called mucosal disease. Both the original virus (ncpBVDV) and an antigenically identical but cytopathic virus (cpBVDV) can be isolated from animals affected by mucosal disease. Cytopathic BVDVs originate from their ncp counterparts by diverse genetic mechanisms, all leading to the expression of the non-structural polypeptide NS3 as a discrete protein. In contrast, ncpBVDVs express only the large precursor polypeptide, NS2-3, which contains the NS3 sequence within its carboxy-terminal half. We report here the investigation of the mechanism leading to NS3 expression in 41 cpBVDV isolates. An RT-PCR strategy was employed to detect RNA insertions within the NS2-3 gene and/or duplication of the NS3 gene, two common mechanisms of NS3 expression. RT-PCR amplification revealed insertions in the NS2-3 gene of three cp isolates, with the inserts being similar in size to that present in the cpBVDV NADL strain. Sequencing of one such insert revealed a 296-nucleotide sequence with a central core of 270 nucleotides coding for an amino acid sequence highly homologous (98%) to the NADL insert, a sequence corresponding to part of the cellular J-Domain gene. One cpBVDV isolate contained a duplication of the NS3 gene downstream from the original locus. In contrast, no detectable NS2-3 insertions or NS3 gene duplications were observed in the genome of 37 cp isolates. These results demonstrate that processing of NS2-3 without bulk mRNA insertions or NS3 gene duplications seems to be a frequent mechanism leading to NS3 expression and BVDV cytopathology.

Application of discrete Fourier inter-coefficient difference for assessing genetic sequence similarity.

Science.gov (United States)

King, Brian R; Aburdene, Maurice; Thompson, Alex; Warres, Zach

2014-01-01

Digital signal processing (DSP) techniques for biological sequence analysis continue to grow in popularity due to the inherent digital nature of these sequences. DSP methods have demonstrated early success for detection of coding regions in a gene. Recently, these methods are being used to establish DNA gene similarity. We present the inter-coefficient difference (ICD) transformation, a novel extension of the discrete Fourier transformation, which can be applied to any DNA sequence. The ICD method is a mathematical, alignment-free DNA comparison method that generates a genetic signature for any DNA sequence that is used to generate relative measures of similarity among DNA sequences. We demonstrate our method on a set of insulin genes obtained from an evolutionarily wide range of species, and on a set of avian influenza viral sequences, which represents a set of highly similar sequences. We compare phylogenetic trees generated using our technique against trees generated using traditional alignment techniques for similarity and demonstrate that the ICD method produces a highly accurate tree without requiring an alignment prior to establishing sequence similarity.
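As background to the DSP approach mentioned above, the sketch below shows one widely used way to turn a DNA string into numeric signals and inspect its Fourier spectrum: four binary indicator sequences (one per base) and the summed power spectrum, whose peak at frequency N/3 is the classic signal used for coding-region detection. It is an assumption-laden illustration, not the ICD transformation itself, whose definition is not given in the abstract; the function names are hypothetical.

```python
import cmath


def power_spectrum(seq: str) -> list[float]:
    """Summed DFT power of the four base-indicator sequences of `seq`.

    A direct O(N^2) DFT is used so the example needs only the standard
    library; a real analysis would use an FFT routine.
    """
    seq = seq.upper()
    n = len(seq)
    spectrum = [0.0] * n
    for base in "ACGT":
        # Indicator signal: 1 where this base occurs, 0 elsewhere.
        indicator = [1.0 if ch == base else 0.0 for ch in seq]
        for k in range(n):
            coeff = sum(
                x * cmath.exp(-2j * cmath.pi * k * t / n)
                for t, x in enumerate(indicator)
            )
            spectrum[k] += abs(coeff) ** 2
    return spectrum


def period3_peak_ratio(seq: str) -> float:
    """Power at frequency N/3 relative to the mean non-DC power."""
    n = len(seq)
    if n < 3 or n % 3 != 0:
        raise ValueError("sequence length must be a positive multiple of 3")
    spectrum = power_spectrum(seq)
    mean_power = sum(spectrum[1:]) / (n - 1)
    return spectrum[n // 3] / mean_power


if __name__ == "__main__":
    # Toy input with perfect codon-like periodicity, not a real gene.
    print(period3_peak_ratio("ATG" * 10))
```

An alignment-free comparison in the spirit of the abstract would then compare such spectral signatures between sequences; how the ICD method does this for sequences of different lengths is not stated here and is not reproduced.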
Classification of viral zoonosis through receptor pattern analysis.

Science.gov (United States)

Bae, Se-Eun; Son, Hyeon Seok

2011-04-13

Viral zoonosis, the transmission of a virus from its primary vertebrate reservoir species to humans, requires ubiquitous cellular proteins known as receptor proteins. Zoonosis can occur not only through direct transmission from vertebrates to humans, but also through intermediate reservoirs or other environmental factors. Viruses can be categorized according to genotype (ssDNA, dsDNA, ssRNA and dsRNA viruses). Among them, the RNA viruses exhibit particularly high mutation rates and are especially problematic for this reason. Most zoonotic viruses are RNA viruses that change their envelope proteins to facilitate binding to various receptors of host species. In this study, we sought to predict zoonotic propensity through the analysis of receptor characteristics. We hypothesized that the major barrier to interspecies virus transmission is that receptor sequences vary among species--in other words, that the specific amino acid sequence of the receptor determines the ability of the viral envelope protein to attach to the cell. We analysed host-cell receptor sequences for their hydrophobicity/hydrophilicity characteristics. We then analysed these properties for similarities among receptors of different species and used a statistical discriminant analysis to predict the likelihood of transmission among species. This study is an attempt to predict zoonosis through simple computational analysis of receptor sequence differences. Our method may be useful in predicting the zoonotic potential of newly discovered viral strains.
Sequence variation of the glycoprotein gene identifies three distinct lineages within field isolates of viral hemorrhagic septicemia virus, a fish rhabdovirus

Science.gov (United States)

Benmansour, A.; Bascuro, B.; Monnier, A.F.; Vende, P.; Winton, J.R.; de Kinkelin, P.

1997-01-01

To evaluate the genetic diversity of viral haemorrhagic septicaemia virus (VHSV), the sequences of the glycoprotein genes (G) of 11 North American and European isolates were determined. Comparison with the G protein of representative members of the family Rhabdoviridae suggested that VHSV was a different virus species from infectious haematopoietic necrosis virus (IHNV) and Hirame rhabdovirus (HIRRV). At a higher taxonomic level, VHSV, IHNV and HIRRV formed a group which was genetically closest to the genus Lyssavirus. Compared with each other, the G genes of VHSV displayed a dissimilar overall genetic diversity which correlated with differences in geographical origin. The multiple sequence alignment of the complete G protein showed that the divergent positions were not uniformly distributed along the sequence. A central region (amino acid positions 245-300) accumulated substitutions and appeared to be highly variable. The genetic heterogeneity within a single isolate was high, with an apparent internal mutation frequency of 1.2 x 10^-3 per nucleotide site, attesting to the quasispecies nature of the viral population. The phylogeny separated VHSV strains according to the major geographical area of isolation: genotype I for continental Europe, genotype II for the British Isles, and genotype III for North America. Isolates from continental Europe exhibited the highest genetic variability, with sub-groups correlated partially with the serological classification. Neither neutralizing polyclonal sera nor monoclonal antibodies were able to discriminate between the genotypes. The overall structure of the phylogenetic tree suggests that VHSV genetic diversity and evolution fit within the model of random change and positive selection operating on quasispecies.
Distinct patterns of HIV-1 evolution within metastatic tissues in patients with non-Hodgkins lymphoma.

Directory of Open Access Journals (Sweden)

Marco Salemi

2009-12-01

Full Text Available Despite highly active antiretroviral therapy (HAART), AIDS-related lymphoma (ARL) occurs at a significantly higher rate in patients infected with the Human Immunodeficiency Virus (HIV) than in the general population. HIV-infected macrophages are a known viral reservoir and have been shown to have lymphomagenic potential in SCID mice; therefore, there is an interest in determining if a viral component to lymphomagenesis also exists. We sequenced HIV-1 envelope gp120 clones obtained post mortem from several tumor and non-tumor tissues of two patients who died with AIDS-related Non-Hodgkin's lymphoma (ARL-NH). Similar results were found in both patients: 1) high-resolution phylogenetic analysis showed a significant degree of compartmentalization between lymphoma and non-lymphoma viral sub-populations, while viral sub-populations from lymph nodes appeared to be intermixed within sequences from tumor and non-tumor tissues; 2) a 100-fold increase in the effective HIV population size in tumor versus non-tumor tissues was associated with the emergence of lymphadenopathy and aggressive metastatic ARL; and 3) HIV gene flow among lymph nodes, normal and metastatic tissues was non-random. The different population dynamics between the viruses found in tumors versus the non-tumor associated viruses suggest that there is a significant relationship between HIV evolution and lymphoma pathogenesis. Moreover, the study indicates that HIV could be used as an effective marker to study the origin and dissemination of lymphomas in vivo.
Long non-coding RNA discovery across the genus anopheles reveals conserved secondary structures within and beyond the Gambiae complex.

Science.gov (United States)

Jenkins, Adam M; Waterhouse, Robert M; Muskavitch, Marc A T

2015-04-23

Long non-coding RNAs (lncRNAs) have been defined as mRNA-like transcripts longer than 200 nucleotides that lack significant protein-coding potential, and many of them constitute scaffolds for ribonucleoprotein complexes with critical roles in epigenetic regulation. Various lncRNAs have been implicated in the modulation of chromatin structure, transcriptional and post-transcriptional gene regulation, and regulation of genomic stability in mammals, Caenorhabditis elegans, and Drosophila melanogaster. The purpose of this study is to identify the lncRNA landscape in the malaria vector An. gambiae and assess the evolutionary conservation of lncRNAs and their secondary structures across the Anopheles genus. Using deep RNA sequencing of multiple Anopheles gambiae life stages, we have identified 2,949 lncRNAs and more than 300 previously unannotated putative protein-coding genes. The lncRNAs exhibit differential expression profiles across life stages and adult genders. We find that across the genus Anopheles, lncRNAs display much lower sequence conservation than protein-coding genes. Additionally, we find that lncRNA secondary structure is highly conserved within the Gambiae complex, but diverges rapidly across the rest of the genus Anopheles. This study offers one of the first lncRNA secondary structure analyses in vector insects. Our description of lncRNAs in An. gambiae offers the most comprehensive genome-wide insights to date into lncRNAs in this vector mosquito, and defines a set of potential targets for the development of vector-based interventions that may further curb the human malaria burden in disease-endemic countries.
The Genomic Code: Genome Evolution and Potential Applications

KAUST Repository

Bernardi, Giorgio

2016-01-25

The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.
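The "GC level" that the laws above refer to is simply the fraction of G and C bases in a stretch of sequence. A minimal sketch of computing it over fixed, non-overlapping windows follows; the function names and the windowing scheme are assumptions for illustration only, and isochore-scale work would use windows on the order of the ≥200 Kb mentioned in the abstract rather than the toy sizes shown.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G or C among unambiguous bases (A, C, G, T)."""
    counted = [base for base in seq.upper() if base in "ACGT"]
    if not counted:
        return float("nan")  # no unambiguous bases to count
    return sum(base in "GC" for base in counted) / len(counted)


def windowed_gc(seq: str, window: int) -> list[float]:
    """GC fraction of consecutive non-overlapping windows of length `window`.

    A trailing fragment shorter than `window` is ignored.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    return [
        gc_fraction(seq[start:start + window])
        for start in range(0, len(seq) - window + 1, window)
    ]


if __name__ == "__main__":
    # Toy sequence: one GC-rich window followed by one AT-rich window.
    print(windowed_gc("GGCCAATT", 4))
```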
Long Non-Coding RNA in Cancer

Directory of Open Access Journals (Sweden)

Damjan Glavač

2013-02-01

Full Text Available Long non-coding RNAs (lncRNAs) are pervasively transcribed in the genome and are emerging as new players in tumorigenesis due to their various functions in transcriptional, posttranscriptional and epigenetic mechanisms of gene regulation. LncRNAs are deregulated in a number of cancers, demonstrating both oncogenic and tumor suppressive roles, thus suggesting their aberrant expression may be a substantial contributor to cancer development. In this review, we will summarize their emerging role in human cancer and discuss their perspectives in diagnostics as potential biomarkers.
Phylogenetic footprinting of non-coding RNA: hammerhead ribozyme sequences in a satellite DNA family of Dolichopoda cave crickets (Orthoptera, Rhaphidophoridae)

Directory of Open Access Journals (Sweden)

Venanzetti Federica

2010-01-01

Full Text Available Abstract. Background: The great variety in sequence, length, complexity, and abundance of satellite DNA has made it difficult to ascribe any function to this genome component. Recent studies have shown that satellite DNA can be transcribed and be involved in regulation of chromatin structure and gene expression. Some satellite DNAs, such as the pDo500 sequence family in Dolichopoda cave crickets, have a catalytic hammerhead (HH) ribozyme structure and activity embedded within each repeat. Results: We assessed the phylogenetic footprints of the HH ribozyme within the pDo500 sequences from 38 different populations representing 12 species of Dolichopoda. The HH region was significantly more conserved than the non-hammerhead (NHH) region of the pDo500 repeat. In addition, stems were more conserved than loops. In stems, several compensatory mutations were detected that maintain base pairing. The core region of the HH ribozyme was affected by very few nucleotide substitutions and the cleavage position was altered only once among 198 sequences. RNA folding of the HH sequences revealed that a potentially active HH ribozyme can be found in most of the Dolichopoda populations and species. Conclusions: The phylogenetic footprints suggest that the HH region of the pDo500 sequence family is selected for function in Dolichopoda cave crickets. However, the functional role of HH ribozymes in eukaryotic organisms is unclear. The possible functions have been related to trans cleavage of an RNA target by a ribonucleoprotein and regulation of gene expression. Whether the HH ribozyme in Dolichopoda is involved in similar functions remains to be investigated. Future studies need to demonstrate how the observed nucleotide changes and evolutionary constraint have affected the catalytic efficiency of the hammerhead.
Flavivirus and Filovirus EvoPrinters: New alignment tools for the comparative analysis of viral evolution.

Directory of Open Access Journals (Sweden)

Thomas Brody

2017-06-01

Full Text Available Flavivirus and Filovirus infections are serious epidemic threats to human populations. Multi-genome comparative analysis of these evolving pathogens affords a view of their essential, conserved sequence elements as well as progressive evolutionary changes. While phylogenetic analysis has yielded important insights, the growing number of available genomic sequences makes comparisons between hundreds of viral strains challenging. We report here a new approach for the comparative analysis of these hemorrhagic fever viruses that can superimpose an unlimited number of one-on-one alignments to identify important features within genomes of interest. We have adapted EvoPrinter alignment algorithms for the rapid comparative analysis of Flavivirus or Filovirus sequences including Zika and Ebola strains. The user can input a full genome or partial viral sequence and then view either individual comparisons or generate color-coded readouts that superimpose hundreds of one-on-one alignments to identify unique or shared identity SNPs that reveal ancestral relationships between strains. The user can also opt to select a database genome in order to access a library of pre-aligned genomes of either 1,094 Flaviviruses or 460 Filoviruses for rapid comparative analysis with all database entries or a select subset. Using EvoPrinter search and alignment programs, we show the following: 1) superimposing alignment data from many related strains identifies lineage identity SNPs, which enable the assessment of sublineage complexity within viral outbreaks; 2) whole-genome SNP profile screens uncover novel Dengue2 and Zika recombinant strains and their parental lineages; 3) differential SNP profiling identifies host cell A-to-I hyper-editing within Ebola and Marburg viruses; and 4) hundreds of superimposed one-on-one Ebola genome alignments highlight ultra-conserved regulatory sequences, invariant amino acid codons and evolutionarily variable protein-encoding domains within a
Identification of an ICP27-responsive element in the coding region of a herpes simplex virus type 1 late gene.

Science.gov (United States)

Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A

2010-03-01

During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.
Short communication: identification of a novel HIV type 1 subtype H/J recombinant in Canada with discordant HIV viral load (RNA) values in three different commercial assays.

Science.gov (United States)

Kim, John E; Beckthold, Brenda; Chen, Zhaoxia; Mihowich, Jennifer; Malloch, Laurie; Gill, Michael John

2007-11-01

The presence of HIV-1 non-B subtypes is increasing worldwide. This poses challenges to commercial diagnostic and viral load (RNA) monitoring tests that are predominantly based on HIV-1 subtype B strains. Based on phylogenetic analysis of the gag, pol, and env gene regions, we describe the first HIV-1 H/J recombinant in Canada that presented divergent viral load values. DNA sequence analysis of the gag gene region further revealed that genetic diversity between this H/J recombinant and the primers and probes used in the bio-Merieux Nuclisens HIV-1 QT (Nuclisens) and Roche Amplicor Monitor HIV-1, v1.5 (Monitor) viral RNA assays can erroneously lead to undetectable viral load values. This observation appears to be more problematic in the Nuclisens assay. In light of increasing genetic diversity in HIV worldwide we recommend that DNA sequencing of HIV, especially in the gag gene region targeted by primers and probes used in molecular diagnostic and viral load tests, be incorporated into clinical monitoring practices.
Identification of sequences in herpes simplex virus type 1 ICP22 that influence RNA polymerase II modification and viral late gene expression.

Science.gov (United States)

Bastian, Thomas W; Rice, Stephen A

2009-01-01

Previous studies have shown that the herpes simplex virus type 1 (HSV-1) immediate-early protein ICP22 alters the phosphorylation of the host cell RNA polymerase II (Pol II) during viral infection. In this study, we have engineered several ICP22 plasmid and virus mutants in order to map the ICP22 sequences that are involved in this function. We identify a region in the C-terminal half of ICP22 (residues 240 to 340) that is critical for Pol II modification and further show that the N-terminal half of the protein (residues 1 to 239) is not required. However, immunofluorescence analysis indicates that the N-terminal half of ICP22 is needed for its localization to nuclear body structures. These results demonstrate that ICP22's effects on Pol II do not require that it accumulate in nuclear bodies. As ICP22 is known to enhance viral late gene expression during infection of certain cultured cells, including human embryonic lung (HEL) cells, we used our engineered viral mutants to map this function of ICP22. It was found that mutations in both the N- and C-terminal halves of ICP22 result in similar defects in viral late gene expression and growth in HEL cells, despite having distinctly different effects on Pol II. Thus, our results genetically uncouple ICP22's effects on Pol II from its effects on viral late gene expression. This suggests that these two functions of ICP22 may be due to distinct activities of the protein.
A Case for Dynamic Reverse-code Generation to Debug Non-deterministic Programs

Directory of Open Access Journals (Sweden)

Jooyong Yi

2013-09-01

Backtracking (i.e., reverse execution) helps the user of a debugger to naturally think backwards along the execution path of a program, and thinking backwards makes it easy to locate the origin of a bug. So far backtracking has been implemented mostly by state saving or by checkpointing. These implementations, however, inherently do not scale. Meanwhile, a more recent backtracking method based on reverse-code generation seems promising because executing reverse code can restore the previous states of a program without state saving. Two methods for generating reverse code can be found in the literature: (a) static reverse-code generation, which pre-generates reverse code through static analysis before starting a debugging session, and (b) dynamic reverse-code generation, which generates reverse code by applying dynamic analysis on the fly during a debugging session. In particular, we espoused the latter in our previous work to accommodate non-determinism of a program caused by, e.g., multi-threading. To demonstrate the usefulness of our dynamic reverse-code generation, this article presents a case study of various backtracking methods including ours. We compare the memory usage of various backtracking methods in a simple but nontrivial example, a bounded-buffer program. In the case of non-deterministic programs such as this bounded-buffer program, our dynamic reverse-code generation outperforms the existing backtracking methods in terms of memory efficiency.
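The contrast between state saving and reverse code can be illustrated with a toy bounded buffer. The sketch below is only an assumption-laden illustration and not the article's algorithm: the class `BoundedBuffer` and the helpers `run_with_snapshots`, `run_with_reverse_code` and `backtrack` are hypothetical names, and the "reverse code" here is simply one small undo closure recorded per executed operation (a `get` still has to remember the single value it removed).

```python
import copy


class BoundedBuffer:
    """Toy FIFO buffer with a fixed capacity (hypothetical example program)."""

    def __init__(self, capacity):
        self.items = []
        self.capacity = capacity

    def put(self, x):
        if len(self.items) >= self.capacity:
            raise OverflowError("buffer full")
        self.items.append(x)

    def get(self):
        if not self.items:
            raise IndexError("buffer empty")
        return self.items.pop(0)


def run_with_snapshots(buf, ops):
    """State saving: copy the whole buffer before every operation."""
    snapshots = []
    for name, arg in ops:
        snapshots.append(copy.deepcopy(buf.items))
        if name == "put":
            buf.put(arg)
        else:
            buf.get()
    return snapshots


def run_with_reverse_code(buf, ops):
    """Reverse code: record, for each executed operation, a closure that undoes it."""
    undo_log = []
    for name, arg in ops:
        if name == "put":
            buf.put(arg)
            # The inverse of appending is removing the last element.
            undo_log.append(lambda: buf.items.pop())
        else:
            removed = buf.get()
            # The inverse of a FIFO get is re-inserting the value at the front.
            undo_log.append(lambda v=removed: buf.items.insert(0, v))
    return undo_log


def backtrack(undo_log, steps):
    """Execute the most recent `steps` undo closures, newest first."""
    for _ in range(steps):
        undo_log.pop()()
```

With the operation sequence put 1, put 2, get, put 3 on a buffer of capacity 2, the snapshot variant stores four copies of the buffer contents, whereas the reverse-code variant stores four small closures and restores any earlier state by running them in reverse order.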
Sequence data and association statistics from 12,940 type 2 diabetes cases and controls.

Science.gov (United States)

Flannick, Jason; Fuchsberger, Christian; Mahajan, Anubha; Teslovich, Tanya M; Agarwala, Vineeta; Gaulton, Kyle J; Caulkins, Lizz; Koesterer, Ryan; Ma, Clement; Moutsianas, Loukas; McCarthy, Davis J; Rivas, Manuel A; Perry, John R B; Sim, Xueling; Blackwell, Thomas W; Robertson, Neil R; Rayner, N William; Cingolani, Pablo; Locke, Adam E; Tajes, Juan Fernandez; Highland, Heather M; Dupuis, Josee; Chines, Peter S; Lindgren, Cecilia M; Hartl, Christopher; Jackson, Anne U; Chen, Han; Huyghe, Jeroen R; van de Bunt, Martijn; Pearson, Richard D; Kumar, Ashish; Müller-Nurasyid, Martina; Grarup, Niels; Stringham, Heather M; Gamazon, Eric R; Lee, Jaehoon; Chen, Yuhui; Scott, Robert A; Below, Jennifer E; Chen, Peng; Huang, Jinyan; Go, Min Jin; Stitzel, Michael L; Pasko, Dorota; Parker, Stephen C J; Varga, Tibor V; Green, Todd; Beer, Nicola L; Day-Williams, Aaron G; Ferreira, Teresa; Fingerlin, Tasha; Horikoshi, Momoko; Hu, Cheng; Huh, Iksoo; Ikram, Mohammad Kamran; Kim, Bong-Jo; Kim, Yongkang; Kim, Young Jin; Kwon, Min-Seok; Lee, Juyoung; Lee, Selyeong; Lin, Keng-Han; Maxwell, Taylor J; Nagai, Yoshihiko; Wang, Xu; Welch, Ryan P; Yoon, Joon; Zhang, Weihua; Barzilai, Nir; Voight, Benjamin F; Han, Bok-Ghee; Jenkinson, Christopher P; Kuulasmaa, Teemu; Kuusisto, Johanna; Manning, Alisa; Ng, Maggie C Y; Palmer, Nicholette D; Balkau, Beverley; Stančáková, Alena; Abboud, Hanna E; Boeing, Heiner; Giedraitis, Vilmantas; Prabhakaran, Dorairaj; Gottesman, Omri; Scott, James; Carey, Jason; Kwan, Phoenix; Grant, George; Smith, Joshua D; Neale, Benjamin M; Purcell, Shaun; Butterworth, Adam S; Howson, Joanna M M; Lee, Heung Man; Lu, Yingchang; Kwak, Soo-Heon; Zhao, Wei; Danesh, John; Lam, Vincent K L; Park, Kyong Soo; Saleheen, Danish; So, Wing Yee; Tam, Claudia H T; Afzal, Uzma; Aguilar, David; Arya, Rector; Aung, Tin; Chan, Edmund; Navarro, Carmen; Cheng, Ching-Yu; Palli, Domenico; Correa, Adolfo; Curran, Joanne E; Rybin, Dennis; Farook, Vidya S; Fowler, Sharon P; Freedman, Barry I; 
Griswold, Michael; Hale, Daniel Esten; Hicks, Pamela J; Khor, Chiea-Chuen; Kumar, Satish; Lehne, Benjamin; Thuillier, Dorothée; Lim, Wei Yen; Liu, Jianjun; Loh, Marie; Musani, Solomon K; Puppala, Sobha; Scott, William R; Yengo, Loïc; Tan, Sian-Tsung; Taylor, Herman A; Thameem, Farook; Wilson, Gregory; Wong, Tien Yin; Njølstad, Pål Rasmus; Levy, Jonathan C; Mangino, Massimo; Bonnycastle, Lori L; Schwarzmayr, Thomas; Fadista, João; Surdulescu, Gabriela L; Herder, Christian; Groves, Christopher J; Wieland, Thomas; Bork-Jensen, Jette; Brandslund, Ivan; Christensen, Cramer; Koistinen, Heikki A; Doney, Alex S F; Kinnunen, Leena; Esko, Tõnu; Farmer, Andrew J; Hakaste, Liisa; Hodgkiss, Dylan; Kravic, Jasmina; Lyssenko, Valeri; Hollensted, Mette; Jørgensen, Marit E; Jørgensen, Torben; Ladenvall, Claes; Justesen, Johanne Marie; Käräjämäki, Annemari; Kriebel, Jennifer; Rathmann, Wolfgang; Lannfelt, Lars; Lauritzen, Torsten; Narisu, Narisu; Linneberg, Allan; Melander, Olle; Milani, Lili; Neville, Matt; Orho-Melander, Marju; Qi, Lu; Qi, Qibin; Roden, Michael; Rolandsson, Olov; Swift, Amy; Rosengren, Anders H; Stirrups, Kathleen; Wood, Andrew R; Mihailov, Evelin; Blancher, Christine; Carneiro, Mauricio O; Maguire, Jared; Poplin, Ryan; Shakir, Khalid; Fennell, Timothy; DePristo, Mark; de Angelis, Martin Hrabé; Deloukas, Panos; Gjesing, Anette P; Jun, Goo; Nilsson, Peter; Murphy, Jacquelyn; Onofrio, Robert; Thorand, Barbara; Hansen, Torben; Meisinger, Christa; Hu, Frank B; Isomaa, Bo; Karpe, Fredrik; Liang, Liming; Peters, Annette; Huth, Cornelia; O'Rahilly, Stephen P; Palmer, Colin N A; Pedersen, Oluf; Rauramaa, Rainer; Tuomilehto, Jaakko; Salomaa, Veikko; Watanabe, Richard M; Syvänen, Ann-Christine; Bergman, Richard N; Bharadwaj, Dwaipayan; Bottinger, Erwin P; Cho, Yoon Shin; Chandak, Giriraj R; Chan, Juliana Cn; Chia, Kee Seng; Daly, Mark J; Ebrahim, Shah B; Langenberg, Claudia; Elliott, Paul; Jablonski, Kathleen A; Lehman, Donna M; Jia, Weiping; Ma, Ronald C W; Pollin, Toni I; 
Sandhu, Manjinder; Tandon, Nikhil; Froguel, Philippe; Barroso, Inês; Teo, Yik Ying; Zeggini, Eleftheria; Loos, Ruth J F; Small, Kerrin S; Ried, Janina S; DeFronzo, Ralph A; Grallert, Harald; Glaser, Benjamin; Metspalu, Andres; Wareham, Nicholas J; Walker, Mark; Banks, Eric; Gieger, Christian; Ingelsson, Erik; Im, Hae Kyung; Illig, Thomas; Franks, Paul W; Buck, Gemma; Trakalo, Joseph; Buck, David; Prokopenko, Inga; Mägi, Reedik; Lind, Lars; Farjoun, Yossi; Owen, Katharine R; Gloyn, Anna L; Strauch, Konstantin; Tuomi, Tiinamaija; Kooner, Jaspal Singh; Lee, Jong-Young; Park, Taesung; Donnelly, Peter; Morris, Andrew D; Hattersley, Andrew T; Bowden, Donald W; Collins, Francis S; Atzmon, Gil; Chambers, John C; Spector, Timothy D; Laakso, Markku; Strom, Tim M; Bell, Graeme I; Blangero, John; Duggirala, Ravindranath; Tai, E Shyong; McVean, Gilean; Hanis, Craig L; Wilson, James G; Seielstad, Mark; Frayling, Timothy M; Meigs, James B; Cox, Nancy J; Sladek, Rob; Lander, Eric S; Gabriel, Stacey; Mohlke, Karen L; Meitinger, Thomas; Groop, Leif; Abecasis, Goncalo; Scott, Laura J; Morris, Andrew P; Kang, Hyun Min; Altshuler, David; Burtt, Noël P; Florez, Jose C; Boehnke, Michael; McCarthy, Mark I

2017-12-19

To investigate the genetic basis of type 2 diabetes (T2D) to high resolution, the GoT2D and T2D-GENES consortia catalogued variation from whole-genome sequencing of 2,657 European individuals and exome sequencing of 12,940 individuals of multiple ancestries. Over 27M SNPs, indels, and structural variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D.
CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC

International Nuclear Information System (INIS)

Congrains, Ada; Kamide, Kei; Katsuya, Tomohiro; Yasuda, Osamu; Oguro, Ryousuke; Yamamoto, Koichi; Ohishi, Mitsuru; Rakugi, Hiromi

2012-01-01

Highlights: ► ANRIL maps in the strongest susceptibility locus for cardiovascular disease. ► Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. ► The effects of ANRIL on gene expression are splicing variant specific. ► ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. -- Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have linked polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of this non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the involvement of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knockdown. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling by modulating the expression of genes involved in cell proliferation, apoptosis, extra-cellular matrix remodeling and inflammatory response, ultimately affecting the risk of cardiovascular disease and other pathologies.
Homologous SV40 RNA trans-splicing: Special case or prime example of viral RNA trans-splicing?

Directory of Open Access Journals (Sweden)

Sushmita Poddar

2014-06-01

To date, Simian Virus 40 (SV40) is the only proven example of a virus that recruits the mechanism of RNA trans-splicing to diversify its sequences and gene products. Thereby, two identical viral transcripts are efficiently joined by homologous trans-splicing, triggering the formation of a highly transforming 100 kDa super T antigen. Sequences of other viruses including HIV-1 and the human adenovirus type 5 were reported to be involved in heterologous trans-splicing towards cellular or viral sequences but the meaning of these events remains unclear. We computationally and experimentally investigated molecular features associated with viral RNA trans-splicing and identified a common pattern: Viral RNA trans-splicing occurs between strong cryptic or regular viral splice sites and strong regular or cryptic splice sites of the trans-splice partner sequences. The majority of these splice sites are supported by exonic splice enhancers. Splice sites that could compete with the trans-splicing sites for cis-splice reactions are weaker or nonexistent. Finally, all but one of the trans-splice reactions seem to be facilitated by one or more complementary binding domains of 11 to 16 nucleotides in length which, however, occur with a statistical probability close to one for the given length of the involved sequences. The chimeric RNAs generated via heterologous viral RNA trans-splicing either did not lead to fusion proteins or led to proteins of unknown function. Our data suggest that distinct viral RNAs are highly susceptible to trans-splicing and that heterologous viral trans-splicing, unlike homologous SV40 trans-splicing, represents a chance event.
Estradiol-Induced Transcriptional Regulation of Long Non-Coding RNA, HOTAIR.

Science.gov (United States)

Bhan, Arunoday; Mandal, Subhrangsu S

2016-01-01

HOTAIR (HOX antisense intergenic RNA) is a 2.2 kb long non-coding RNA (lncRNA), transcribed from the antisense strand of the homeobox C (HOXC) gene locus on chromosome 12. HOTAIR acts as a scaffolding lncRNA. It interacts with and guides various chromatin-modifying complexes such as PRC2 (polycomb-repressive complex 2) and LSD1 (lysine-specific demethylase 1) to target gene promoters, leading to gene silencing. Various studies have demonstrated that HOTAIR overexpression is associated with breast cancer. Recent studies from our laboratory demonstrate that HOTAIR is required for viability of breast cancer cells and is transcriptionally regulated by estradiol (E2) in vitro and in vivo. This chapter describes protocols for analysis of the HOTAIR promoter, cloning, transfection and dual luciferase assays, knockdown of protein synthesis by antisense oligonucleotides, and chromatin immunoprecipitation (ChIP) assay. These protocols are useful for studying the estrogen-mediated transcriptional regulation of lncRNA HOTAIR, as well as other protein coding genes and non-coding RNAs.
Simultaneous sequencing of coding and noncoding RNA reveals a human transcriptome dominated by a small number of highly expressed noncoding genes.

Science.gov (United States)

Boivin, Vincent; Deschamps-Francoeur, Gabrielle; Couture, Sonia; Nottingham, Ryan M; Bouchard-Bourelle, Philia; Lambowitz, Alan M; Scott, Michelle S; Abou-Elela, Sherif

2018-07-01

Comparing the abundance of one RNA molecule to another is crucial for understanding cellular functions but most sequencing techniques can target only specific subsets of RNA. In this study, we used a new fragmented ribodepleted TGIRT (FRT) sequencing method that uses a thermostable group II intron reverse transcriptase (TGIRT) to generate a portrait of the human transcriptome depicting the quantitative relationship of all classes of nonribosomal RNA longer than 60 nt. Comparison between different sequencing methods indicated that FRT is more accurate in ranking both mRNA and noncoding RNA than viral reverse transcriptase-based sequencing methods, even those that specifically target these species. Measurements of RNA abundance in different cell lines using this method correlate with biochemical estimates, confirming tRNA as the most abundant nonribosomal RNA biotype. However, the single most abundant transcript is 7SL RNA, a component of the signal recognition particle. Structured noncoding RNAs (sncRNAs) associated with the same biological process are expressed at similar levels, with the exception of RNAs with multiple functions like U1 snRNA. In general, sncRNAs forming RNPs are hundreds to thousands of times more abundant than their mRNA counterparts. Surprisingly, only 50 sncRNA genes produce half of the non-rRNA transcripts detected in two different cell lines. Together the results indicate that the human transcriptome is dominated by a small number of highly expressed sncRNAs specializing in functions related to translation and splicing. © 2018 Boivin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.
Formation of large viroplasms and virulence of Cauliflower mosaic virus in turnip plants depend on the N-terminal EKI sequence of viral protein TAV.

Directory of Open Access Journals (Sweden)

Angèle Geldreich

Cauliflower mosaic virus (CaMV) TAV protein (TransActivator/Viroplasmin) plays a pivotal role during the infection cycle since it activates translation reinitiation of viral polycistronic RNAs and suppresses RNA silencing. It is also the major component of cytoplasmic electron-dense inclusion bodies (EDIBs) called viroplasms that are particularly evident in cells infected by the virulent CaMV Cabb B-JI isolate. These EDIBs are considered as virion factories, vehicles for CaMV intracellular movement and reservoirs for CaMV transmission by aphids. In this study, focused on different TAV mutants in vivo, we demonstrate that three physically separated domains collectively participate in the formation of large EDIBs: the N-terminal EKI motif, a sequence of the MAV domain involved in translation reinitiation and a C-terminal region encompassing the zinc finger. Surprisingly, EKI mutant TAVm3, corresponding to a substitution of the EKI motif at amino acids 11-13 by three alanines (AAA), which completely abolished the formation of large viroplasms, was not lethal for CaMV but highly reduced its virulence without affecting the rate of systemic infection. Expression of TAVm3 in a viral context led to formation of small irregularly shaped inclusion bodies, mild symptoms and low levels of viral DNA and particle accumulation, despite the production of significant amounts of mature capsid proteins. Unexpectedly, for CaMV-TAVm3 the formation of the viral P2-containing electron-light inclusion body (ELIB), which is essential for CaMV aphid transmission, was also altered, thus suggesting an indirect role of the EKI tripeptide in CaMV plant-to-plant propagation. This important functional contribution of the EKI motif in CaMV biology can explain the strict conservation of this motif in the TAV sequences of all CaMV isolates.
RNA-dependent RNA polymerase of hepatitis C virus binds to its coding region RNA stem-loop structure, 5BSL3.2, and its negative strand.

Science.gov (United States)

Kanamori, Hiroshi; Yuhashi, Kazuhito; Ohnishi, Shin; Koike, Kazuhiko; Kodama, Tatsuhiko

2010-05-01

The hepatitis C virus NS5B RNA-dependent RNA polymerase (RdRp) is a key enzyme involved in viral replication. Interaction between NS5B RdRp and the viral RNA sequence is likely to be an important step in viral RNA replication. The C-terminal half of the NS5B-coding sequence, which contains the important cis-acting replication element, has been identified as an NS5B-binding sequence. In the present study, we confirm the specific binding of NS5B to one of the RNA stem-loop structures in the region, 5BSL3.2. In addition, we show that NS5B binds to the complementary strand of 5BSL3.2 (5BSL3.2N). The bulge structure of 5BSL3.2N was shown to be indispensable for tight binding to NS5B. In vitro RdRp activity was inhibited by 5BSL3.2N, indicating the importance of the RNA element in the polymerization by RdRp. These results suggest the involvement of the RNA stem-loop structure of the negative strand in the replication process.

Primate-specific spliced PMCHL RNAs are non-protein coding in human and macaque tissues

Directory of Open Access Journals (Sweden)

Delerue-Audegond Audrey

2008-12-01

Abstract Background Brain-expressed genes that were created in primate lineage represent obvious candidates to investigate molecular mechanisms that contributed to neural reorganization and emergence of new behavioural functions in Homo sapiens. PMCHL1 arose from retroposition of a pro-melanin-concentrating hormone (PMCH) antisense mRNA on the ancestral human chromosome 5p14 when platyrrhines and catarrhines diverged. Mutations before divergence of hylobatidae led to creation of new exons and finally PMCHL1 duplicated in an ancestor of hominids to generate PMCHL2 at the human chromosome 5q13. A complex pattern of spliced and unspliced PMCHL RNAs was found in human brain and testis. Results Several novel spliced PMCHL transcripts have been characterized in human testis and fetal brain, identifying an additional exon and novel splice sites. Sequencing of PMCHL genes in several non-human primates allowed phylogenetic analyses, which revealed that the initial retroposition event took place within an intron of the brain cadherin (CDH12) gene, soon after platyrrhine/catarrhine divergence, i.e. 30–35 Mya, and was concomitant with the insertion of an AluSg element. Sequence analysis of the spliced PMCHL transcripts identified only short ORFs of less than 300 bp, with low (VMCH-p8 and protein variants) or no evolutionary conservation. Western blot analyses of human and macaque tissues expressing PMCHL RNA failed to reveal any protein corresponding to VMCH-p8 and protein variants encoded by spliced transcripts. Conclusion Our present results improve our knowledge of the gene structure and the evolutionary history of the primate-specific chimeric PMCHL genes. These genes produce multiple spliced transcripts, bearing short, non-conserved and apparently non-translated ORFs that may function as mRNA-like non-coding RNAs.
Long Non-Coding RNAs: A Novel Paradigm for Toxicology.

Science.gov (United States)

Dempsey, Joseph L; Cui, Julia Yue

2017-01-01

Long non-coding RNAs (lncRNAs) are over 200 nucleotides in length and are transcribed from the mammalian genome in a tissue-specific and developmentally regulated pattern. There is growing recognition that lncRNAs are novel biomarkers and/or key regulators of toxicological responses in humans and animal models. Lacking protein-coding capacity, the numerous types of lncRNAs possess a myriad of transcriptional regulatory functions that include cis and trans gene expression, transcription factor activity, chromatin remodeling, imprinting, and enhancer up-regulation. LncRNAs also influence mRNA processing, post-transcriptional regulation, and protein trafficking. Dysregulation of lncRNAs has been implicated in various human health outcomes such as various cancers, Alzheimer's disease, cardiovascular disease, autoimmune diseases, as well as intermediary metabolism such as glucose, lipid, and bile acid homeostasis. Interestingly, emerging evidence in the literature over the past five years has shown that lncRNA regulation is impacted by exposures to various chemicals such as polycyclic aromatic hydrocarbons, benzene, cadmium, chlorpyrifos-methyl, bisphenol A, phthalates, phenols, and bile acids. Recent technological advancements, including next-generation sequencing technologies and novel computational algorithms, have enabled the profiling and functional characterizations of lncRNAs on a genomic scale. In this review, we summarize the biogenesis and general biological functions of lncRNAs, highlight the important roles of lncRNAs in human diseases and especially during the toxicological responses to various xenobiotics, evaluate current methods for identifying aberrant lncRNA expression and molecular target interactions, and discuss the potential to implement these tools to address fundamental questions in toxicology. © The Author 2016. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For Permissions, please e
Diagnostic and prognostic signatures from the small non-coding RNA transcriptome in prostate cancer

DEFF Research Database (Denmark)

Martens-Uzunova, E S; Jalava, S E; Dits, N F

2011-01-01

Prostate cancer (PCa) is the most frequent male malignancy and the second most common cause of cancer-related death in Western countries. Current clinical and pathological methods are limited in the prediction of postoperative outcome. It is becoming increasingly evident that small non-coding RNA...... signatures of 102 fresh-frozen patient samples during PCa progression by miRNA microarrays. Both platforms were cross-validated by quantitative reverse transcriptase-PCR. Besides the altered expression of several miRNAs, our deep sequencing analyses revealed strong differential expression of small nucleolar...... RNAs (snoRNAs) and transfer RNAs (tRNAs). From microarray analysis, we derived a miRNA diagnostic classifier that accurately distinguishes normal from cancer samples. Furthermore, we were able to construct a PCa prognostic predictor that independently forecasts postoperative outcome. Importantly...
Advanced GF(3^2) nonbinary LDPC coded modulation with non-uniform 9-QAM outperforming star 8-QAM.

Science.gov (United States)

Liu, Tao; Lin, Changyu; Djordjevic, Ivan B

2016-06-27

In this paper, we first describe a 9-symbol non-uniform signaling scheme based on Huffman code, in which different symbols are transmitted with different probabilities. By using the Huffman procedure, a prefix code is designed to approach the optimal performance. Then, we introduce an algorithm to determine the optimal signal constellation sets for our proposed non-uniform scheme with the criterion of maximizing the constellation figure of merit (CFM). The proposed non-uniform polarization multiplexed signaling 9-QAM scheme has the same spectral efficiency as the conventional 8-QAM. Additionally, we propose a specially designed GF(3^2) nonbinary quasi-cyclic LDPC code for the coded modulation system based on the 9-QAM non-uniform scheme. Further, we study the efficiency of our proposed non-uniform 9-QAM, combined with nonbinary LDPC coding, and demonstrate by Monte Carlo simulation that the proposed GF(3^2) nonbinary LDPC coded 9-QAM scheme outperforms nonbinary LDPC coded uniform 8-QAM by at least 0.8 dB.
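The Huffman procedure mentioned in this abstract can be sketched in a few lines. The example below is a minimal illustration under stated assumptions: the symbol probabilities (one symbol at 1/4, four at 1/8, four at 1/16) are chosen here only because they are dyadic and give an average of 3 bits per symbol, matching the stated spectral efficiency of 8-QAM; they are not taken from the abstract, and the function name `huffman_code_lengths` is hypothetical.

```python
import heapq
from fractions import Fraction


def huffman_code_lengths(probs):
    """Return the Huffman codeword length (in bits) for each symbol probability."""
    # Heap entries: (subtree probability, unique tie-breaker, symbol indices in subtree).
    heap = [(p, i, [i]) for i, p in enumerate(probs)]
    heapq.heapify(heap)
    lengths = [0] * len(probs)
    counter = len(probs)
    while len(heap) > 1:
        p1, _, s1 = heapq.heappop(heap)
        p2, _, s2 = heapq.heappop(heap)
        # Merging two subtrees pushes every symbol in them one level deeper.
        for s in s1 + s2:
            lengths[s] += 1
        heapq.heappush(heap, (p1 + p2, counter, s1 + s2))
        counter += 1
    return lengths


# Assumed (illustrative) probabilities for the 9 constellation points.
probs = [Fraction(1, 4)] + [Fraction(1, 8)] * 4 + [Fraction(1, 16)] * 4
lengths = huffman_code_lengths(probs)
average_bits = sum(p * l for p, l in zip(probs, lengths))
print(lengths, average_bits)
```

Because the assumed probabilities are powers of 1/2, each codeword length equals the negative base-2 logarithm of its symbol probability, so mapping a uniform bit stream through the prefix code transmits the symbols with exactly those probabilities.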
Non-Viral Transfection Methods Optimized for Gene Delivery to a Lung Cancer Cell Line

OpenAIRE

Salimzadeh, Loghman; Jaberipour, Mansooreh; Hosseini, Ahmad; Ghaderi, Abbas

2013-01-01

Background Mehr-80 is a newly established adherent human large cell lung cancer cell line that has not been transfected until now. This study aims to define the optimal transfection conditions and the effects of some critical elements for enhancing gene delivery to this cell line by utilizing different non-viral transfection procedures. Methods In the current study, calcium phosphate (CaP), DEAE-dextran, superfect, electroporation and lipofection transfection methods were used to optimize deliver...
ICRPfinder: a fast pattern design algorithm for coding sequences and its application in finding potential restriction enzyme recognition sites

Directory of Open Access Journals (Sweden)

Stafford Phillip

2009-09-01

Abstract Background Restriction enzymes can produce easily definable segments from DNA sequences by using a variety of cut patterns. There are, however, no software tools that can aid in gene building -- that is, modifying wild-type DNA sequences to express the same wild-type amino acid sequences but with enhanced codons, specific cut sites, unique post-translational modifications, and other engineered-in components for recombinant applications. A fast DNA pattern design algorithm, ICRPfinder, is provided in this paper and applied to find or create potential recognition sites in target coding sequences. Results ICRPfinder is applied to find or create restriction enzyme recognition sites by introducing silent mutations. The algorithm is shown to be capable of mapping existing cut-sites but, importantly, it can also generate specified new unique cut-sites within a specified region that are guaranteed not to be present elsewhere in the DNA sequence. Conclusion ICRPfinder is a powerful tool for finding or creating specific DNA patterns in a given target coding sequence. ICRPfinder finds or creates patterns, which can include restriction enzyme recognition sites, without changing the translated protein sequence. ICRPfinder is a browser-based JavaScript application and it can run on any platform, in on-line or off-line mode.
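The idea of creating a recognition site through silent mutations can be sketched as a brute-force search. This is not ICRPfinder's algorithm, which the abstract does not describe in detail; it is a minimal Python illustration in which `translate` and `silent_site_insertions` are hypothetical helper names, only the forward strand and exact (non-degenerate) sites are considered, and the standard genetic code is assumed.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base varies slowest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna):
    """Translate a coding sequence in frame 0, ignoring any trailing partial codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def silent_site_insertions(cds, site):
    """Find positions where overwriting `cds` with `site` keeps the protein unchanged.

    Returns a list of (position, number_of_base_changes, mutated_sequence),
    keeping only candidates in which the site occurs exactly once.
    """
    protein = translate(cds)
    hits = []
    for pos in range(len(cds) - len(site) + 1):
        if cds[pos:pos + len(site)] == site:
            continue  # the site already exists here; nothing to create
        candidate = cds[:pos] + site + cds[pos + len(site):]
        if translate(candidate) == protein and candidate.count(site) == 1:
            changes = sum(a != b for a, b in zip(cds, candidate))
            hits.append((pos, changes, candidate))
    return hits


# Example: create an EcoRI site (GAATTC) in a sequence encoding Met-Glu-Phe-Lys.
print(silent_site_insertions("ATGGAGTTTAAA", "GAATTC"))
```

In the example, the codons GAG (Glu) and TTT (Phe) are replaced by the synonymous codons GAA and TTC, which together spell the EcoRI site while leaving the translated peptide unchanged.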
SV40 Utilizes ATM Kinase Activity to Prevent Non-homologous End Joining of Broken Viral DNA Replication Products

Science.gov (United States)

Sowd, Gregory A.; Mody, Dviti; Eggold, Joshua; Cortez, David; Friedman, Katherine L.; Fanning, Ellen

2014-01-01

Simian virus 40 (SV40) and cellular DNA replication rely on host ATM and ATR DNA damage signaling kinases to facilitate DNA repair and elicit cell cycle arrest following DNA damage. During SV40 DNA replication, ATM kinase activity prevents concatemerization of the viral genome whereas ATR activity prevents accumulation of aberrant genomes resulting from breakage of a moving replication fork as it converges with a stalled fork. However, the repair pathways that ATM and ATR orchestrate to prevent these aberrant SV40 DNA replication products are unclear. Using two-dimensional gel electrophoresis and Southern blotting, we show that ATR kinase activity, but not DNA-PKcs kinase activity, facilitates some aspects of double strand break (DSB) repair when ATM is inhibited during SV40 infection. To clarify which repair factors associate with viral DNA replication centers, we examined the localization of DSB repair proteins in response to SV40 infection. Under normal conditions, viral replication centers exclusively associate with homology-directed repair (HDR) and do not colocalize with non-homologous end joining (NHEJ) factors. Following ATM inhibition, but not ATR inhibition, activated DNA-PKcs and KU70/80 accumulate at the viral replication centers while CtIP and BLM, proteins that initiate 5′ to 3′ end resection during HDR, become undetectable. Similar to what has been observed during cellular DSB repair in S phase, these data suggest that ATM kinase influences DSB repair pathway choice by preventing the recruitment of NHEJ factors to replicating viral DNA. These data may explain how ATM prevents concatemerization of the viral genome and promotes viral propagation. We suggest that inhibitors of DNA damage signaling and DNA repair could be used during infection to disrupt productive viral DNA replication. PMID:25474690
SV40 utilizes ATM kinase activity to prevent non-homologous end joining of broken viral DNA replication products.

Directory of Open Access Journals (Sweden)

Gregory A Sowd

2014-12-01

Simian virus 40 (SV40) and cellular DNA replication rely on host ATM and ATR DNA damage signaling kinases to facilitate DNA repair and elicit cell cycle arrest following DNA damage. During SV40 DNA replication, ATM kinase activity prevents concatemerization of the viral genome whereas ATR activity prevents accumulation of aberrant genomes resulting from breakage of a moving replication fork as it converges with a stalled fork. However, the repair pathways that ATM and ATR orchestrate to prevent these aberrant SV40 DNA replication products are unclear. Using two-dimensional gel electrophoresis and Southern blotting, we show that ATR kinase activity, but not DNA-PKcs kinase activity, facilitates some aspects of double strand break (DSB) repair when ATM is inhibited during SV40 infection. To clarify which repair factors associate with viral DNA replication centers, we examined the localization of DSB repair proteins in response to SV40 infection. Under normal conditions, viral replication centers exclusively associate with homology-directed repair (HDR) and do not colocalize with non-homologous end joining (NHEJ) factors. Following ATM inhibition, but not ATR inhibition, activated DNA-PKcs and KU70/80 accumulate at the viral replication centers while CtIP and BLM, proteins that initiate 5' to 3' end resection during HDR, become undetectable. Similar to what has been observed during cellular DSB repair in S phase, these data suggest that ATM kinase influences DSB repair pathway choice by preventing the recruitment of NHEJ factors to replicating viral DNA. These data may explain how ATM prevents concatemerization of the viral genome and promotes viral propagation. We suggest that inhibitors of DNA damage signaling and DNA repair could be used during infection to disrupt productive viral DNA replication.
CRISPR/Cas9-Mediated Knockin Application in Cell Therapy: A Non-viral Procedure for Bystander Treatment of Glioma in Mice

Directory of Open Access Journals (Sweden)

Oscar Meca-Cortés

2017-09-01

The use of non-viral procedures, together with CRISPR/Cas9 genome-editing technology, allows the insertion of single-copy therapeutic genes at pre-determined genomic sites, overcoming safety limitations resulting from random gene insertions of viral vectors with potential for genome damage. In this study, we demonstrate that combination of non-viral gene delivery and CRISPR/Cas9-mediated knockin via homology-directed repair can replace the use of viral vectors for the generation of genetically modified therapeutic cells. We custom-modified human adipose mesenchymal stem cells (hAMSCs), using electroporation as a transfection method and CRISPR/Cas9-mediated knockin for the introduction and stable expression of a 3 kb DNA fragment including the eGFP (selectable marker) and a variant of the herpes simplex virus 1 thymidine kinase (therapeutic gene) genes, under the control of the human elongation factor 1 alpha promoter, in exon 5 of the endogenous thymidine kinase 2 gene. Using a U87 glioma model in SCID mice, we show that the therapeutic capacity of the new CRISPR/Cas9-engineered hAMSCs is equivalent to that of therapeutic hAMSCs generated by introduction of the same therapeutic gene by transduction with a lentiviral vector previously published by our group. This strategy should be of general use to other applications requiring genetic modification of therapeutic cells. Keywords: CRISPR/Cas9, cell therapy, mesenchymal stem cells, bystander suicide therapy, glioblastoma, non-invasive bioluminescence imaging, CRISPR/Cas9 knockin
Peripheral immunophenotype and viral promoter variants during the asymptomatic phase of feline immunodeficiency virus infection.

Science.gov (United States)

Murphy, B; Hillman, C; McDonnel, S

2014-01-22

Feline immunodeficiency virus (FIV)-infected cats enter a clinically asymptomatic phase during chronic infection. Despite the lack of overt clinical disease, the asymptomatic phase is characterized by persistent immunologic impairment. In the peripheral blood obtained from cats experimentally infected with FIV-C for approximately 5 years, we identified a persistent inversion of the CD4/CD8 ratio. We cloned and sequenced the FIV-C long terminal repeat containing the viral promoter from cells infected with the inoculating virus and from in vivo-derived peripheral blood mononuclear cells and CD4 T cells isolated at multiple time points throughout the asymptomatic phase. Relative to the inoculating virus, viral sequences amplified from cells isolated from all of the infected animals demonstrated multiple single nucleotide mutations and a short deletion within the viral U3, R and U5 regions. A transcriptionally inactivating proviral mutation in the U3 promoter AP-1 site was identified at multiple time points from all of the infected animals but not within cell-associated viral RNA. In contrast, no mutations were identified within the sequence of the viral dUTPase gene amplified from PBMC isolated at approximately 5 years post-infection relative to the inoculating sequence. The possible implications of these mutations to viral pathogenesis are discussed. Copyright © 2013 Elsevier B.V. All rights reserved.
Python Radiative Transfer Emission code (PyRaTE): non-LTE spectral lines simulations

Science.gov (United States)

Tritsis, A.; Yorke, H.; Tassis, K.

2018-05-01

We describe PyRaTE, a new, non-local thermodynamic equilibrium (non-LTE) line radiative transfer code developed specifically for post-processing astrochemical simulations. Population densities are estimated using the escape probability method. When computing the escape probability, the optical depth is calculated in all directions, with density, molecular abundance, temperature and velocity variations all taken into account. A very easy-to-use interface, capable of importing the outputs of simulations performed with all major astrophysical codes, is also developed. The code is written in Python using an "embarrassingly parallel" strategy and can handle all geometries and projection angles. We benchmark the code by comparing our results with those from RADEX (van der Tak et al. 2007) and against analytical solutions and present case studies using hydrochemical simulations. The code will be released for public use.
Complete coding sequence of the human raf oncogene and the corresponding structure of the c-raf-1 gene

Energy Technology Data Exchange (ETDEWEB)

Bonner, T I; Oppermann, H; Seeburg, P; Kerby, S B; Gunnell, M A; Young, A C; Rapp, U R

1986-01-24

The complete 648 amino acid sequence of the human raf oncogene was deduced from the 2977 nucleotide sequence of a fetal liver cDNA. The cDNA has been used to obtain clones which extend the human c-raf-1 locus by an additional 18.9 kb at the 5' end and contain all the remaining coding exons.
The N-Terminal of Aquareovirus NS80 Is Required for Interacting with Viral Proteins and Viral Replication.

Directory of Open Access Journals (Sweden)

Jie Zhang

Reovirus replication and assembly occur within viral inclusion bodies that form in specific intracellular compartments of the cytoplasm in infected cells. A previous study indicated that aquareovirus NS80 is able to form inclusion bodies and can also retain viral proteins within its inclusions. To better understand how NS80 functions in viral replication and assembly, the functional regions of NS80 that associate with other viral proteins in aquareovirus replication were investigated in this study. Deletion mutational analysis and a rotavirus NSP5-based protein association platform were used to detect association regions. Immunofluorescence images indicated that different N-terminal regions of NS80 could associate with viral proteins VP1, VP4, VP6 and NS38. Further co-immunoprecipitation analysis confirmed the interaction of VP1, VP4, VP6 or NS38 with different regions covering the N-terminal amino acids (aa 1-471) of NS80, respectively. Moreover, removal of NS80 N-terminal sequences required for interaction with proteins VP1, VP4, VP6 or NS38 not only abolished the capacity of NS80 to support viral replication in NS80 shRNA-based replication complementation assays, but also inhibited the expression of aquareovirus proteins, suggesting that N-terminal regions of NS80 are necessary for viral replication. These results provide a basis for further understanding the role of NS80 in viral replication and assembly during aquareovirus infection.
Scrutinizing virus genome termini by high-throughput sequencing.

Directory of Open Access Journals (Sweden)

Shasha Li

Analysis of genomic terminal sequences has been a major step in studies on viral DNA replication and packaging mechanisms. However, traditional methods to study genome termini are challenging because the protocols are time-consuming and inefficient, and critical details are easily lost. Recent advances in next generation sequencing (NGS) have made it a powerful tool to study genome termini. In this study, using NGS we sequenced one iridovirus genome and twenty phage genomes and confirmed for the first time that the high frequency sequences (HFSs) found in the NGS reads are indeed the terminal sequences of viral genomes. Further, we established a criterion to distinguish the type of termini and the viral packaging mode. We also obtained additional terminal details such as terminal repeats, multi-termini, and asymmetric termini. With this approach, we were able to simultaneously detect details of the genome termini as well as obtain the complete sequence of bacteriophage genomes. Theoretically, this application can be further extended to analyze larger and more complicated genomes of plant and animal viruses. This study proposed a novel and efficient method for research on viral replication, packaging, terminase activity, transcription regulation, and metabolism of the host cell.
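The idea that genome termini show up as high-frequency read starts can be illustrated with a minimal Python sketch. It assumes reads have already been aligned to an assembled genome and reduced to their 0-based start positions; the function name `candidate_termini` and the `min_ratio` cutoff are hypothetical and are not the criterion established in the study.

```python
from collections import Counter


def candidate_termini(read_starts, genome_length, min_ratio=100.0):
    """Return (position, count) pairs whose read-start count is at least
    min_ratio times the mean number of read starts per genome position.

    read_starts   -- iterable of 0-based positions where aligned reads begin
    genome_length -- length of the assembled genome in bases
    min_ratio     -- hypothetical enrichment cutoff (an assumption, not the
                     criterion used in the study)
    """
    counts = Counter(read_starts)
    total = sum(counts.values())
    if total == 0 or genome_length <= 0:
        return []
    mean_per_position = total / genome_length
    return sorted(
        (pos, n) for pos, n in counts.items()
        if n / mean_per_position >= min_ratio
    )


# Toy data: one read starting at every position of a 1,000 bp genome,
# plus 500 extra reads that all start at position 0 (a fixed terminus).
starts = list(range(1000)) + [0] * 500
print(candidate_termini(starts, 1000))  # [(0, 501)]
```

In this toy case only position 0 passes the cutoff; a genome with randomly distributed read starts (no fixed terminus) would be expected to return an empty list under the same assumption.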
Plum Pox Virus 6K1 Protein Is Required for Viral Replication and Targets the Viral Replication Complex at the Early Stage of Infection.

Science.gov (United States)

Cui, Hongguang; Wang, Aiming

2016-05-15

The potyviral RNA genome encodes two polyproteins that are proteolytically processed by three viral protease domains into 11 mature proteins. Extensive molecular studies have identified functions for the majority of the viral proteins. For example, 6K2, one of the two smallest potyviral proteins, is an integral membrane protein and induces the endoplasmic reticulum (ER)-originated replication vesicles that target the chloroplast for robust viral replication. However, the functional role of 6K1, the other smallest protein, remains uncharacterized. In this study, we developed a series of recombinant full-length viral cDNA clones derived from a Canadian Plum pox virus (PPV) isolate. We found that deletion of any of the short motifs of 6K1 (each of which ranged from 5 to 13 amino acids), most of the 6K1 sequence (but with the conserved sequence of the cleavage sites being retained), or all of the 6K1 sequence in the PPV infectious clone abolished viral replication. The trans expression of 6K1 or the cis expression of a dislocated 6K1 failed to rescue the loss-of-replication phenotype, suggesting the temporal and spatial requirement of 6K1 for viral replication. Disruption of the N- or C-terminal cleavage site of 6K1, which prevented the release of 6K1 from the polyprotein, either partially or completely inhibited viral replication, suggesting the functional importance of the mature 6K1. We further found that green fluorescent protein-tagged 6K1 formed punctate inclusions at the viral early infection stage and colocalized with chloroplast-bound viral replicase elements 6K2 and NIb. Taken together, our results suggest that 6K1 is required for viral replication and is an important viral element of the viral replication complex at the early infection stage. Potyviruses account for more than 30% of known plant viruses and consist of many agriculturally important viruses. The genomes of potyviruses encode two polyproteins that are proteolytically processed into 11 mature
Law of Iterated Logarithm for NA Sequences with Non-Identical ...

Indian Academy of Sciences (India)

Based on a law of the iterated logarithm for sequences of independent random variables, an iterated logarithm theorem for NA sequences with non-identical distributions is obtained. The proof is based on a Kolmogorov-type exponential inequality.
Epstein-Barr virus viral load and serology in childhood non-Hodgkin's lymphoma and chronic inflammatory conditions in Uganda: implications for disease risk and characteristics.

Science.gov (United States)

Orem, Jackson; Sandin, Sven; Mbidde, Edward; Mangen, Fred Wabwire; Middeldorp, Jaap; Weiderpass, Elisabete

2014-10-01

Epstein-Barr virus (EBV) has been linked to malignancies and chronic inflammatory conditions. In this study, EBV detection was compared in children with non-Hodgkin's lymphoma and children with chronic inflammatory conditions, using samples and data from a case-control study carried out at the Mulago National Referral Hospital between 2004 and 2008. EBV viral load was measured in saliva, whole blood and white blood cells by real-time PCR. Serological values for IgG-VCA, EBNA1, and EAd-IgG were compared in non-Hodgkin's lymphoma and chronic inflammatory conditions; and in Burkitt's lymphoma and other subtypes of non-Hodgkin's lymphoma. Odds ratios (ORs) and corresponding 95% confidence intervals (CIs) were calculated. Of the 127 children included (87 males and 40 females; median age 7 years, range 2-17), 96 had non-Hodgkin's lymphoma (46 Burkitt's lymphoma and 50 other non-Hodgkin's lymphoma), 31 had chronic inflammatory conditions, and only 10% were HIV-positive. The most common clinical presentations for all disease categories considered were fever, night sweats, and weight loss. EBV viral load in whole blood was elevated in Burkitt's lymphoma compared to other non-Hodgkin's lymphoma (OR 6.67, 95% CI 1.32, 33.69; P-value = 0.04), but EBV viral loads in saliva and white blood cells were not different in any of the disease categories considered. A significant difference in EAd-IgG was observed when non-Hodgkin's lymphoma was compared with chronic inflammatory conditions (OR 0.19, 95% CI 0.07, 0.51; P-value = 0.001). When compared to chronic inflammatory conditions, EBV viral load was elevated in Burkitt's lymphoma, and EA IgG was higher in non-Hodgkin's lymphoma. This study supports an association between virological and serological markers of EBV and childhood non-Hodgkin's lymphoma, irrespective of subtype, in Uganda. © 2014 Wiley Periodicals, Inc.
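For readers unfamiliar with the odds ratios and 95% confidence intervals reported above, the following Python sketch shows the textbook unadjusted calculation from a 2×2 table using the Wald interval on the log scale. This is an assumption for illustration only: the abstract does not state which estimation method was used, and the counts below are invented, not study data.

```python
import math


def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio with a Wald confidence interval.

    a -- cases with the marker       b -- cases without the marker
    c -- controls with the marker    d -- controls without the marker
    z -- normal quantile (1.96 gives an approximate 95% interval)
    """
    if min(a, b, c, d) <= 0:
        raise ValueError("all four cell counts must be positive")
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) for a 2x2 table
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper


# Hypothetical counts, chosen only to exercise the function
or_, lo, hi = odds_ratio_ci(20, 10, 10, 20)
print(f"OR = {or_:.2f}, 95% CI {lo:.2f} to {hi:.2f}")
```

An interval that excludes 1 corresponds to a difference that is statistically significant at roughly the 5% level under this approximation.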
Chimeric classical swine fever (CSF)-Japanese encephalitis (JE) viral particles as a non-transmissible bivalent marker vaccine candidate against CSF and JE infections

Science.gov (United States)

A trans-complemented CSF-JE chimeric viral replicon was constructed using an infectious cDNA clone of the CSF virus (CSFV) Alfort/187 strain. The E2 gene of CSFV Alfort/187 strain was deleted and the resultant plasmid pA187delE2 was inserted by a fragment containing the region coding for a truncate...
Decoding the non-coding genome: elucidating genetic risk outside the coding genome.

Science.gov (United States)

Barr, C L; Misener, V L

2016-01-01

Current evidence emerging from genome-wide association studies indicates that the genetic underpinnings of complex traits are likely attributable to genetic variation that changes gene expression, rather than (or in combination with) variation that changes protein-coding sequences. This is particularly compelling with respect to psychiatric disorders, as genetic changes in regulatory regions may result in differential transcriptional responses to developmental cues and environmental/psychosocial stressors. Until recently, however, the link between transcriptional regulation and psychiatric genetic risk has been understudied. Multiple obstacles have contributed to the paucity of research in this area, including challenges in identifying the positions of remote (distal from the promoter) regulatory elements (e.g. enhancers) and their target genes and the underrepresentation of neural cell types and brain tissues in epigenome projects - the availability of high-quality brain tissues for epigenetic and transcriptome profiling, particularly for the adolescent and developing brain, has been limited. Further challenges have arisen in the prediction and testing of the functional impact of DNA variation with respect to multiple aspects of transcriptional control, including regulatory-element interaction (e.g. between enhancers and promoters), transcription factor binding and DNA methylation. Further, the brain has uncommon DNA-methylation marks with unique genomic distributions not found in other tissues - current evidence suggests the involvement of non-CG methylation and 5-hydroxymethylation in neurodevelopmental processes but much remains unknown. We review here knowledge gaps as well as both technological and resource obstacles that will need to be overcome in order to elucidate the involvement of brain-relevant gene-regulatory variants in genetic risk for psychiatric disorders. © 2015 John Wiley & Sons Ltd and International Behavioural and Neural Genetics Society.
Diversity in non-repetitive human sequences not found in the reference genome.

Science.gov (United States)

Kehr, Birte; Helgadottir, Anna; Melsted, Pall; Jonsson, Hakon; Helgason, Hannes; Jonasdottir, Adalbjörg; Jonasdottir, Aslaug; Sigurdsson, Asgeir; Gylfason, Arnaldur; Halldorsson, Gisli H; Kristmundsdottir, Snaedis; Thorgeirsson, Gudmundur; Olafsson, Isleifur; Holm, Hilma; Thorsteinsdottir, Unnur; Sulem, Patrick; Helgason, Agnar; Gudbjartsson, Daniel F; Halldorsson, Bjarni V; Stefansson, Kari

2017-04-01

Genomes usually contain some non-repetitive sequences that are missing from the reference genome and occur only in a population subset. Such non-repetitive, non-reference (NRNR) sequences have remained largely unexplored in terms of their characterization and downstream analyses. Here we describe 3,791 breakpoint-resolved NRNR sequence variants called using PopIns from whole-genome sequence data of 15,219 Icelanders. We found that over 95% of the 244 NRNR sequences that are 200 bp or longer are present in chimpanzees, indicating that they are ancestral. Furthermore, 149 variant loci are in linkage disequilibrium (r² > 0.8) with a genome-wide association study (GWAS) catalog marker, suggesting disease relevance. Additionally, we report an association (P = 3.8 × 10⁻⁸, odds ratio (OR) = 0.92) with myocardial infarction (23,360 cases, 300,771 controls) for a 766-bp NRNR sequence variant. Our results underline the importance of including variation of all complexity levels when searching for variants that associate with disease.
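The linkage disequilibrium threshold mentioned above uses the squared correlation r² between two biallelic loci. The sketch below computes it from haplotype and allele frequencies with the standard population-genetics definition; the frequencies are invented for illustration, and how the study itself estimated r² is not stated in the abstract.

```python
def ld_r_squared(p_ab, p_a, p_b):
    """Squared correlation (r^2) between two biallelic loci.

    p_ab -- frequency of haplotypes carrying allele A and allele B together
    p_a  -- frequency of allele A at the first locus
    p_b  -- frequency of allele B at the second locus
    """
    if not (0 < p_a < 1 and 0 < p_b < 1):
        raise ValueError("allele frequencies must lie strictly between 0 and 1")
    d = p_ab - p_a * p_b  # linkage disequilibrium coefficient D
    return d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))


# Hypothetical frequencies: alleles always inherited together, then independent
print(ld_r_squared(0.5, 0.5, 0.5))   # perfect LD
print(ld_r_squared(0.25, 0.5, 0.5))  # no LD
```

Under this definition, a variant pair would pass the r² > 0.8 threshold only when the two alleles are co-inherited on nearly all haplotypes.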

Codon size reduction as the origin of the triplet genetic code.

Directory of Open Access Journals (Sweden)

Pavel V Baranov

The genetic code appears to be optimized in its robustness to missense errors and frameshift errors. In addition, the genetic code is near-optimal in terms of its ability to carry information in addition to the sequences of encoded proteins. As evolution has no foresight, optimality of the modern genetic code suggests that it evolved from less optimal code variants. The length of codons in the genetic code is also optimal, as three is the minimal nucleotide combination that can encode the twenty standard amino acids. The apparent impossibility of transitions between codon sizes in a discontinuous manner during evolution has resulted in an unbending view that the genetic code was always triplet. Yet, recent experimental evidence on quadruplet decoding, as well as the discovery of organisms with ambiguous and dual decoding, suggest that the possibility of the evolution of triplet decoding from living systems with non-triplet decoding merits reconsideration and further exploration. To explore this possibility we designed a mathematical model of the evolution of primitive digital coding systems which can decode nucleotide sequences into protein sequences. These coding systems can evolve their nucleotide sequences via genetic events of Darwinian evolution, such as point-mutations. The replication rates of such coding systems depend on the accuracy of the generated protein sequences. Computer simulations based on our model show that decoding systems with codons of length greater than three spontaneously evolve into predominantly triplet decoding systems. Our findings suggest a plausible scenario for the evolution of the triplet genetic code in a continuous manner. This scenario suggests an explanation of how protein synthesis could be accomplished by means of long RNA-RNA interactions prior to the emergence of the complex decoding machinery, such as the ribosome, that is required for stabilization and discrimination of otherwise weak triplet codon
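The statement in the abstract above that three is the minimal codon length able to encode the twenty standard amino acids follows from simple counting: with four nucleotides, codons of length n give 4^n combinations. A short Python check, with the hypothetical helper name `min_codon_length`:

```python
def min_codon_length(n_nucleotides=4, n_meanings=20):
    """Smallest codon length whose number of distinct codons
    (n_nucleotides ** length) is at least n_meanings."""
    length = 1
    while n_nucleotides ** length < n_meanings:
        length += 1
    return length


for n in range(1, 5):
    print(n, 4 ** n)          # 4, 16, 64, 256 possible codons
print(min_codon_length())     # 3
```

Doublets give only 16 combinations, which is fewer than 20, whereas triplets give 64. This counting argument says nothing about the evolutionary model described in the abstract, which is not reproduced here.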
The Number, Organization, and Size of Polymorphic Membrane Protein Coding Sequences as well as the Most Conserved Pmp Protein Differ within and across Chlamydia Species.

Science.gov (United States)

Van Lent, Sarah; Creasy, Heather Huot; Myers, Garry S A; Vanrompay, Daisy

2016-01-01

Variation is a central trait of the polymorphic membrane protein (Pmp) family. The number of pmp coding sequences differs between Chlamydia species, but it is unknown whether the number of pmp coding sequences is constant within a Chlamydia species. The level of conservation of the Pmp proteins has previously only been determined for Chlamydia trachomatis. As different Pmp proteins might be indispensable for the pathogenesis of different Chlamydia species, this study investigated the conservation of Pmp proteins both within and across C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci. The pmp coding sequences were annotated in 16 C. trachomatis, 6 C. pneumoniae, 2 C. abortus, and 16 C. psittaci genomes. The number and organization of polymorphic membrane coding sequences differed within and across the analyzed Chlamydia species. The length of coding sequences of pmpA, pmpB, and pmpH was conserved among all analyzed genomes, while the length of pmpE/F and pmpG, and remarkably also of the subtype pmpD, differed among the analyzed genomes. PmpD, PmpA, PmpH, and PmpA were the most conserved Pmp in C. trachomatis, C. pneumoniae, C. abortus, and C. psittaci, respectively. PmpB was the most conserved Pmp across the 4 analyzed Chlamydia species. © 2016 S. Karger AG, Basel.
Yeast genome sequencing:

DEFF Research Database (Denmark)

Piskur, Jure; Langkjær, Rikke Breinhold

2004-01-01

For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...
A metagenomic survey of viral abundance and diversity in mosquitoes from Hubei province.

Directory of Open Access Journals (Sweden)

Chenyan Shi

Mosquitoes, as one of the most common and important vectors, have the potential to transmit or acquire many viruses through biting; however, the viral flora in mosquitoes and its impact on mosquito-borne disease transmission have not been well investigated and evaluated. In this study, the metagenomic technique has been successfully employed in analyzing the abundance and diversity of the viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with the 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed as unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque teno sus virus 1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquitoes, which plays an important role in active or passive viral transmission and evolution.
A Metagenomic Survey of Viral Abundance and Diversity in Mosquitoes from Hubei Province

Science.gov (United States)

Shi, Chenyan; Liu, Yi; Hu, Xiaomin; Xiong, Jinfeng; Zhang, Bo; Yuan, Zhiming

2015-01-01

Mosquitoes, as one of the most common and important vectors, have the potential to transmit or acquire many viruses through biting; however, the viral flora in mosquitoes and its impact on mosquito-borne disease transmission have not been well investigated and evaluated. In this study, the metagenomic technique has been successfully employed in analyzing the abundance and diversity of the viral community in three mosquito samples from Hubei, China. Among 92,304 reads produced through a run with the 454 GS FLX system, 39% have high similarities with viral sequences belonging to identified bacterial, fungal, animal, plant and insect viruses, and 0.02% were classed as unidentified viral sequences, demonstrating high abundance and diversity of viruses in mosquitoes. Furthermore, two novel viruses in subfamily Densovirinae and family Dicistroviridae were identified, and six torque teno sus virus 1 in family Anelloviridae, three porcine parvoviruses in subfamily Parvovirinae and a Culex tritaeniorhynchus rhabdovirus in family Rhabdoviridae were preliminarily characterized. The viral metagenomic analysis offered us a deep insight into the viral population of mosquitoes, which plays an important role in active or passive viral transmission and evolution. PMID:26030271
A Sequence-Independent, Unstructured Internal Ribosome Entry Site Is Responsible for Internal Expression of the Coat Protein of Turnip Crinkle Virus.

Science.gov (United States)

May, Jared; Johnson, Philip; Saleem, Huma; Simon, Anne E

2017-04-15

To maximize the coding potential of viral genomes, internal ribosome entry sites (IRES) can be used to bypass the traditional requirement of a 5' cap and some/all of the associated translation initiation factors. Although viral IRES typically contain higher-order RNA structure, an unstructured sequence of about 84 nucleotides (nt) immediately upstream of the Turnip crinkle virus (TCV) coat protein (CP) open reading frame (ORF) has been found to promote internal expression of the CP from the genomic RNA (gRNA) both in vitro and in vivo. An absence of extensive RNA structure was predicted using RNA folding algorithms and confirmed by selective 2'-hydroxyl acylation analyzed by primer extension (SHAPE) RNA structure probing. Analysis of the IRES region in vitro by use of both the TCV gRNA and reporter constructs did not reveal any sequence-specific elements but rather suggested that an overall lack of structure was an important feature for IRES activity. The CP IRES is A-rich, independent of orientation, and strongly conserved among viruses in the same genus. The IRES was dependent on eIF4G, but not eIF4E, for activity. Low levels of CP accumulated in vivo in the absence of detectable TCV subgenomic RNAs, strongly suggesting that the IRES was active in the gRNA in vivo. Since the TCV CP also serves as the viral silencing suppressor, early translation of the CP from the viral gRNA is likely important for countering host defenses. Cellular mRNA IRES also lack extensive RNA structures or sequence conservation, suggesting that this viral IRES and cellular IRES may have similar strategies for internal translation initiation. IMPORTANCE Cap-independent translation is a common strategy among positive-sense, single-stranded RNA viruses for bypassing the host cell requirement of a 5' cap structure. Viral IRES, in general, contain extensive secondary structure that is critical for activity. In contrast, we demonstrate that a region of viral RNA devoid of extensive secondary
Nucleic Acid-Based Approaches for Detection of Viral Hepatitis

Science.gov (United States)

Behzadi, Payam; Ranjbar, Reza; Alavian, Seyed Moayed

2014-01-01

Context: To determine suitable nucleic acid diagnostics for each viral hepatitis agent, an extensive search using related keywords was done in major medical libraries, and data were collected, categorized, and summarized in different sections. Results: Various types of molecular biology tools can be used to detect and quantify viral genomic elements and analyze the sequences. These molecular assays are suitable technologies for rapidly detecting viral agents with high accuracy, high sensitivity, and high specificity. Nonetheless, the application of each diagnostic method is completely dependent on the viral agent. Conclusions: Despite the rapidity, automation, accuracy, cost-effectiveness, high sensitivity, and high specificity of molecular techniques, each type of molecular technology has its own advantages and disadvantages. PMID:25789132
pEPito: a significantly improved non-viral episomal expression vector for mammalian cells

Directory of Open Access Journals (Sweden)

Ogris Manfred

2010-03-01

Background: The episomal replication of the prototype vector pEPI-1 depends on a transcription unit starting from the constitutively expressed Cytomegalovirus immediate early promoter (CMV-IEP) and directed into a 2000 bp long matrix attachment region sequence (MARS) derived from the human β-interferon gene. The original pEPI-1 vector contains two mammalian transcription units and a total of 305 CpG islands, which are located predominantly within the vector elements necessary for bacterial propagation and known to be counterproductive for persistent long-term transgene expression. Results: Here, we report the development of a novel vector, pEPito, which is derived from the pEPI-1 plasmid replicon but has considerably improved efficacy both in vitro and in vivo. The pEPito vector is significantly reduced in size and contains only one transcription unit and 60% fewer CpG motifs in comparison to pEPI-1. It exhibits major advantages compared to the original pEPI-1 plasmid, including higher transgene expression levels and increased colony-forming efficiencies in vitro, as well as more persistent transgene expression profiles in vivo. The performance of pEPito-based vectors was further improved by replacing the CMV-IEP with the human CMV enhancer/human elongation factor 1 alpha promoter (hCMV/EF1P) element that is known to be less affected by epigenetic silencing events. Conclusions: The novel vector pEPito can be considered suitable as an improved vector for biotechnological applications in vitro and for non-viral gene delivery in vivo.
A Herpesviral Immediate Early Protein Promotes Transcription Elongation of Viral Transcripts

Directory of Open Access Journals (Sweden)

Hannah L. Fox

2017-06-01

Herpes simplex virus 1 (HSV-1) genes are transcribed by cellular RNA polymerase II (RNA Pol II). While four viral immediate early proteins (ICP4, ICP0, ICP27, and ICP22) function in some capacity in viral transcription, the mechanism by which ICP22 functions remains unclear. We observed that the FACT complex (comprised of SSRP1 and Spt16) was relocalized in infected cells as a function of ICP22. ICP22 was also required for the association of FACT and the transcription elongation factors SPT5 and SPT6 with viral genomes. We further demonstrated that the FACT complex interacts with ICP22 throughout infection. We therefore hypothesized that ICP22 recruits cellular transcription elongation factors to viral genomes for efficient transcription elongation of viral genes. We reevaluated the phenotype of an ICP22 mutant virus by determining the abundance of all viral mRNAs throughout infection by transcriptome sequencing (RNA-seq). The accumulation of almost all viral mRNAs late in infection was reduced compared to the wild type, regardless of kinetic class. Using chromatin immunoprecipitation sequencing (ChIP-seq), we mapped the location of RNA Pol II on viral genes and found that RNA Pol II levels on the bodies of viral genes were reduced in the ICP22 mutant compared to wild-type virus. In contrast, the association of RNA Pol II with transcription start sites in the mutant was not reduced. Taken together, our results indicate that ICP22 plays a role in recruiting elongation factors like the FACT complex to the HSV-1 genome to allow for efficient viral transcription elongation late in viral infection and ultimately infectious virion production.
Compartmentalization of the gut viral reservoir in HIV-1 infected patients

Directory of Open Access Journals (Sweden)

Grant Tannika

2007-12-01

Background: Recently there has been increasing interest in and appreciation of the gut as both a viral reservoir and an important host-pathogen interface in human immunodeficiency virus type 1 (HIV-1) infection. The gut associated lymphoid tissue (GALT) is the largest lymphoid organ infected by HIV-1. In this study we examined whether different HIV-1 quasispecies are found in different parts of the gut of HIV-1 infected individuals. Results: Gut biopsies (esophagus, stomach, duodenum and colorectum) were obtained from eight HIV-1 infected preHAART (highly active antiretroviral therapy) patients. HIV-1 Nef and reverse transcriptase (RT) encoding sequences were obtained through nested PCR amplification from DNA isolated from the gut biopsy tissues. The PCR fragments were cloned and sequenced. The resulting sequences were subjected to various phylogenetic analyses. Expression of the nef gene and viral RNA in the different gut tissues was determined using real-time RT-PCR. Phylogenetic analysis of the Nef protein-encoding region revealed compartmentalization of viral replication in the gut within patients. Viral diversity in both the Nef and RT encoding regions varied in different parts of the gut. Moreover, increased nef gene expression (p ... Conclusion: Our results indicated that different HIV-1 quasispecies populate different parts of the gut, and that viral replication in the gut is compartmentalized. These observations underscore the importance of the gut as a host-pathogen interface in HIV-1 infection.
Instruction sequences and non-uniform complexity theory

NARCIS (Netherlands)

Bergstra, J.A.; Middelburg, C.A.

2008-01-01

We develop theory concerning non-uniform complexity in a setting in which the notion of single-pass instruction sequence considered in program algebra is the central notion. We define counterparts of the complexity classes P/poly and NP/poly and formulate a counterpart of the complexity theoretic
Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

Science.gov (United States)

O'Neill, F J; Gao, Y; Xu, X

1993-11-01

The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant
CVD-associated non-coding RNA, ANRIL, modulates expression of atherogenic pathways in VSMC

Energy Technology Data Exchange (ETDEWEB)

Congrains, Ada; Kamide, Kei [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Katsuya, Tomohiro [Clinical Gene Therapy, Osaka University Graduate School of Medicine (Japan); Yasuda, Osamu [Department of Cardiovascular Clinical and Translational Research, Kumamoto University Hospital (Japan); Oguro, Ryousuke; Yamamoto, Koichi [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Ohishi, Mitsuru, E-mail: ohishi@geriat.med.osaka-u.ac.jp [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan); Rakugi, Hiromi [Department of Geriatric Medicine and Nephrology, Osaka University Graduate School of Medicine (Japan)

2012-03-23

Highlights: ANRIL maps in the strongest susceptibility locus for cardiovascular disease. Silencing of ANRIL leads to altered expression of tissue remodeling-related genes. The effects of ANRIL on gene expression are splicing variant specific. ANRIL affects progression of cardiovascular disease by regulating proliferation and apoptosis pathways. Abstract: ANRIL is a newly discovered non-coding RNA lying on the strongest genetic susceptibility locus for cardiovascular disease (CVD) in the chromosome 9p21 region. Genome-wide association studies have linked polymorphisms in this locus with CVD and several other major diseases such as diabetes and cancer. The role of this non-coding RNA in atherosclerosis progression is still poorly understood. In this study, we investigated the involvement of ANRIL in the modulation of gene sets directly involved in atherosclerosis. We designed and tested siRNA sequences to selectively target two exons (exon 1 and exon 19) of the transcript and successfully knocked down expression of ANRIL in human aortic vascular smooth muscle cells (HuAoVSMC). We used a pathway-focused RT-PCR array to profile gene expression changes caused by ANRIL knockdown. Notably, the genes affected by each of the siRNAs were different, suggesting that different splicing variants of ANRIL might have distinct roles in cell physiology. Our results suggest that ANRIL splicing variants play a role in coordinating tissue remodeling by modulating the expression of genes involved in cell proliferation, apoptosis, extracellular matrix remodeling and inflammatory response, ultimately affecting the risk of cardiovascular disease and other pathologies.
Comprehensive reconstruction and visualization of non-coding regulatory networks in human

Directory of Open Access Journals (Sweden)

Vincenzo Bonnici

2014-12-01

Considerable research effort has gone into understanding the functional roles of non-coding RNAs (ncRNAs). Many studies have demonstrated their deregulation in cancer and other human disorders. ncRNAs are also present in extracellular human body fluids such as serum and plasma, giving them great potential as non-invasive biomarkers. However, non-coding RNAs were discovered relatively recently, and a comprehensive database including all of them is still missing. Reconstructing and visualizing the network of ncRNA interactions are important steps toward understanding their regulatory mechanisms in complex systems. This work presents ncRNA-DB, a NoSQL database that integrates ncRNA interaction data from a large number of well-established online repositories. The interactions involve RNA, DNA, proteins and diseases. ncRNA-DB is available at http://ncrnadb.scienze.univr.it/ncrnadb/. It is equipped with three interfaces: web based, command line and a Cytoscape app called ncINetView. By accessing only one resource, users can search for ncRNAs and their interactions, build a network annotated with all known ncRNAs and associated diseases, and use all visual and mining features available in Cytoscape.
Therapeutic targeting of non-coding RNAs in cancer

Czech Academy of Sciences Publication Activity Database

Slabý, O.; Laga, Richard; Sedláček, Ondřej

2017-01-01

Vol. 474, No. 24 (2017), pp. 4219-4251 ISSN 0264-6021 R&D Projects: GA MŠk(CZ) LQ1604; GA MŠk(CZ) ED1.1.00/02.0109 Institutional support: RVO:61389013 Keywords: non-coding RNA * RNA delivery * polymer carriers Subject RIV: EB - Genetics; Molecular Biology OECD field: Biochemical research methods Impact factor: 3.797, year: 2016
Identification of Known and Novel Recurrent Viral Sequences in Data from Multiple Patients and Multiple Cancers

DEFF Research Database (Denmark)

Friis-Nielsen, Jens; Kjartansdóttir, Kristín Rós; Mollerup, Sarah

2016-01-01

Virus discovery from high throughput sequencing data often follows a bottom-up approach where taxonomic annotation takes place prior to association to disease. Albeit effective in some cases, the approach fails to detect novel pathogens and remote variants not present in reference databases. We have developed a species independent pipeline that utilises sequence clustering for the identification of nucleotide sequences that co-occur across multiple sequencing data instances. We applied the workflow to 686 sequencing libraries from 252 cancer samples of different cancer and tissue types, 32 non-template controls, and 24 test samples. Recurrent sequences were statistically associated to biological, methodological or technical features with the aim to identify novel pathogens or plausible contaminants that may associate to a particular kit or method. We provide examples of identified...
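The co-occurrence step in this abstract can be illustrated with a short Python sketch. It is not the authors' pipeline: it assumes the sequences have already been clustered, and the function name `recurrent_clusters`, the `(library_id, cluster_id)` input shape and the `min_libraries` threshold are all hypothetical choices made for illustration.

```python
from collections import defaultdict


def recurrent_clusters(assignments, min_libraries=2):
    """Return {cluster_id: sorted library ids} for clusters seen in
    at least `min_libraries` distinct sequencing libraries.

    `assignments` is an iterable of (library_id, cluster_id) pairs,
    one pair per sequence (hypothetical input format).
    """
    libraries_by_cluster = defaultdict(set)
    for library_id, cluster_id in assignments:
        libraries_by_cluster[cluster_id].add(library_id)
    return {
        cluster: sorted(libraries)
        for cluster, libraries in libraries_by_cluster.items()
        if len(libraries) >= min_libraries
    }


# Toy data: cluster c1 recurs in three libraries, c2 and c3 do not recur.
toy = [
    ("lib1", "c1"), ("lib2", "c1"), ("lib2", "c2"),
    ("lib3", "c1"), ("lib3", "c3"), ("lib3", "c3"),
]
result = recurrent_clusters(toy)
```

A recurrent cluster found this way could then be tested for association with sample type, kit or method, which is the statistical step the abstract describes.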
Exploration of small RNA-seq data for small non-coding RNAs in Human Colorectal Cancer.

Science.gov (United States)

Koduru, Srinivas V; Tiwari, Amit K; Hazard, Sprague W; Mahajan, Milind; Ravnic, Dino J

2017-01-01

Background: Improved healthcare and recent breakthroughs in technology have substantially reduced cancer mortality rates worldwide. Recent advancements in next-generation sequencing (NGS) have allowed genomic analysis of the human transcriptome. Now, using NGS we can further look into small non-coding regions of RNAs (sncRNAs) such as microRNAs (miRNAs), Piwi-interacting-RNAs (piRNAs), long non-coding RNAs (lncRNAs), and small nuclear/nucleolar RNAs (sn/snoRNAs) among others. Recent studies looking at sncRNAs indicate their role in important biological processes such as cancer progression and predict their role as biomarkers for disease diagnosis, prognosis, and therapy. Results: In the present study, we data mined publicly available small RNA sequencing data from colorectal tissue samples of eight matched patients (benign, tumor, and metastasis) and remapped the data for various small RNA annotations. We identified aberrant expression of 13 miRNAs in tumor and metastasis specimens [tumor vs benign group (19 miRNAs) and metastasis vs benign group (38 miRNAs)] of which five were upregulated, and eight were downregulated, during disease progression. Pathway analysis of aberrantly expressed miRNAs showed that the majority of miRNAs involved in colon cancer were also involved in other cancers. Analysis of piRNAs revealed six to be over-expressed in the tumor vs benign cohort and 24 in the metastasis vs benign group. Only two piRNAs were shared between the two cohorts. Examining other types of small RNAs [sn/snoRNAs, mt_rRNA, miscRNA, nonsense mediated decay (NMD), and rRNAs] identified 15 sncRNAs in the tumor vs benign group and 104 in the metastasis vs benign group, with only four others being commonly expressed. Conclusion: In summary, our comprehensive analysis on publicly available small RNA-seq data identified multiple differentially expressed sncRNAs during colorectal cancer progression at different stages compared to normal colon tissue. We speculate that
Nuclear RNA sequencing of the mouse erythroid cell transcriptome.

Directory of Open Access Journals (Sweden)

Jennifer A Mitchell

In addition to protein coding genes a substantial proportion of mammalian genomes are transcribed. However, most transcriptome studies investigate steady-state mRNA levels, ignoring a considerable fraction of the transcribed genome. In addition, steady-state mRNA levels are influenced by both transcriptional and posttranscriptional mechanisms, and thus do not provide a clear picture of transcriptional output. Here, using deep sequencing of nuclear RNAs (nucRNA-Seq) in parallel with chromatin immunoprecipitation sequencing (ChIP-Seq) of active RNA polymerase II, we compared the nuclear transcriptome of mouse anemic spleen erythroid cells with polymerase occupancy on a genome-wide scale. We demonstrate that unspliced transcripts quantified by nucRNA-seq correlate with primary transcript frequencies measured by RNA FISH, but differ from steady-state mRNA levels measured by poly(A)-enriched RNA-seq. Highly expressed protein coding genes showed good correlation between RNAPII occupancy and transcriptional output; however, genome-wide we observed a poor correlation between transcriptional output and RNAPII association. This poor correlation is due to intergenic regions associated with RNAPII which correspond with transcription factor bound regulatory regions and a group of stable, nuclear-retained long non-coding transcripts. In conclusion, sequencing the nuclear transcriptome provides an opportunity to investigate the transcriptional landscape in a given cell type through quantification of unspliced primary transcripts and the identification of nuclear-retained long non-coding RNAs.
Constacyclic codes over the ring $\mathbb{F}_q+v\mathbb{F}_q+v^2\mathbb{F}_q$ and their applications of constructing new non-binary quantum codes

Science.gov (United States)

Ma, Fanghui; Gao, Jian; Fu, Fang-Wei

2018-06-01

Let $R=\mathbb{F}_q+v\mathbb{F}_q+v^2\mathbb{F}_q$ be a finite non-chain ring, where $q$ is an odd prime power and $v^3=v$. In this paper, we propose two methods of constructing quantum codes from $(\alpha+\beta v+\gamma v^2)$-constacyclic codes over $R$. The first one is obtained via the Gray map and the Calderbank-Shor-Steane construction from Euclidean dual-containing $(\alpha+\beta v+\gamma v^2)$-constacyclic codes over $R$. The second one is obtained via the Gray map and the Hermitian construction from Hermitian dual-containing $(\alpha+\beta v+\gamma v^2)$-constacyclic codes over $R$. As an application, some new non-binary quantum codes are obtained.
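A ring with $v^3=v$ and $q$ odd has three pairwise orthogonal idempotents, $1-v^2$, $(v^2+v)/2$ and $(v^2-v)/2$, that sum to 1; a decomposition of this kind is what typically makes a Gray map onto three copies of the base field natural. The abstract does not say which decomposition the authors use, so the Python sketch below only checks this elementary fact numerically. It assumes a prime $q=p$ for simplicity, and the representation of ring elements as coefficient triples and the helper names `mul` and `idempotents` are illustrative assumptions.

```python
def mul(a, b, p):
    """Multiply a0 + a1*v + a2*v^2 by b0 + b1*v + b2*v^2 modulo p,
    reducing with v^3 = v (and hence v^4 = v^2)."""
    a0, a1, a2 = a
    b0, b1, b2 = b
    c0 = a0 * b0
    c1 = a0 * b1 + a1 * b0 + a1 * b2 + a2 * b1   # v and v^3 terms
    c2 = a0 * b2 + a1 * b1 + a2 * b0 + a2 * b2   # v^2 and v^4 terms
    return (c0 % p, c1 % p, c2 % p)


def idempotents(p):
    """Return (1 - v^2, (v^2 + v)/2, (v^2 - v)/2) as coefficient triples mod p."""
    half = pow(2, -1, p)          # inverse of 2 mod p; needs odd p, Python 3.8+
    e1 = (1, 0, (-1) % p)
    e2 = (0, half, half)
    e3 = (0, (-half) % p, half)
    return e1, e2, e3


p = 5                              # illustrative odd prime
e1, e2, e3 = idempotents(p)
total = tuple(sum(coeffs) % p for coeffs in zip(e1, e2, e3))
```

With these idempotents every element of the ring splits uniquely into three components over the base field, which is the structural fact a coordinate-wise Gray map would rely on.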
Human viral pathogens are pervasive in wastewater treatment center aerosols.

Science.gov (United States)

Brisebois, Evelyne; Veillette, Marc; Dion-Dupont, Vanessa; Lavoie, Jacques; Corbeil, Jacques; Culley, Alexander; Duchaine, Caroline

2018-05-01

Wastewater treatment center (WTC) workers may be vulnerable to diseases caused by viruses, such as the common cold, influenza and gastro-intestinal infections. Although there is a substantial body of literature characterizing the microbial community found in wastewater, only a few studies have characterized the viral component of WTC aerosols, despite the fact that most diseases affecting WTC workers are of viral origin and that some of these viruses are transmitted through the air. In this study, we evaluated in four WTCs the presence of 11 viral pathogens of particular concern in this milieu and used a metagenomic approach to characterize the total viral community in the air of one of those WTCs. The presence of viruses in aerosols in different locations of individual WTCs was evaluated and the results obtained with four commonly used air samplers were compared. We detected four of the eleven viruses tested, including human adenovirus (hAdV), rotavirus, hepatitis A virus (HAV) and Herpes Simplex virus type 1 (HSV1). The results of the metagenomic assay uncovered very few viral RNA sequences in WTC aerosols, however sequences from human DNA viruses were in much greater relative abundance. Copyright © 2017. Published by Elsevier B.V.

Mutations in Cas9 Enhance the Rate of Acquisition of Viral Spacer Sequences during the CRISPR-Cas Immune Response.

Science.gov (United States)

Heler, Robert; Wright, Addison V; Vucelja, Marija; Bikard, David; Doudna, Jennifer A; Marraffini, Luciano A

2017-01-05

CRISPR loci and their associated (Cas) proteins encode a prokaryotic immune system that protects against viruses and plasmids. Upon infection, a low fraction of cells acquire short DNA sequences from the invader. These sequences (spacers) are integrated in between the repeats of the CRISPR locus and immunize the host against the matching invader. Spacers specify the targets of the CRISPR immune response through transcription into short RNA guides that direct Cas nucleases to the invading DNA molecules. Here we performed random mutagenesis of the RNA-guided Cas9 nuclease to look for variants that provide enhanced immunity against viral infection. We identified a mutation, I473F, that increases the rate of spacer acquisition by more than two orders of magnitude. Our results highlight the role of Cas9 during CRISPR immunization and provide a useful tool to study this rare process and develop it as a biotechnological application. Copyright © 2017 Elsevier Inc. All rights reserved.
Diversity of antisense and other non-coding RNAs in Archaea revealed by comparative small RNA sequencing in four Pyrobaculum species

Directory of Open Access Journals (Sweden)

David L Bernick

2012-07-01

A great diversity of small, non-coding RNA molecules with roles in gene regulation and RNA processing have been intensely studied in eukaryotic and bacterial model organisms, yet our knowledge of possible parallel roles for small RNAs in archaea is limited. We employed RNA-seq to identify novel small RNAs across multiple species of the hyperthermophilic genus Pyrobaculum, known for unusual RNA gene characteristics. By comparing transcriptional data collected in parallel among four species, we were able to identify conserved RNA genes fitting into known and novel families. Among our findings, we highlight three novel cis-antisense small RNAs encoded opposite to key regulatory (ferric uptake regulator), metabolic (triose-phosphate isomerase), and core transcriptional apparatus genes (transcription factor B). We also found a large increase in the number of conserved C/D box small RNA genes over what had been previously recognized; many of these genes are encoded antisense to protein coding genes. The conserved opposition to orthologous genes across the Pyrobaculum genus suggests similarities to other cis-antisense regulatory systems. Furthermore, the genus-specific nature of these small RNAs indicates they are relatively recent, stable adaptations.
DNA-Binding Properties of African Swine Fever Virus pA104R, a Histone-Like Protein Involved in Viral Replication and Transcription.

Science.gov (United States)

Frouco, Gonçalo; Freitas, Ferdinando B; Coelho, João; Leitão, Alexandre; Martins, Carlos; Ferreira, Fernando

2017-06-15

African swine fever virus (ASFV) codes for a putative histone-like protein (pA104R) with extensive sequence homology to bacterial proteins that are implicated in genome replication and packaging. Functional characterization of purified recombinant pA104R revealed that it binds to single-stranded DNA (ssDNA) and double-stranded DNA (dsDNA) over a wide range of temperatures, pH values, and salt concentrations and in an ATP-independent manner, with an estimated binding site size of about 14 to 16 nucleotides. Using site-directed mutagenesis, the arginine located in pA104R's DNA-binding domain, at position 69, was found to be relevant for efficient DNA-binding activity. Together, pA104R and ASFV topoisomerase II (pP1192R) display DNA-supercoiling activity, although none of the proteins by themselves do, indicating that the two cooperate in this process. In ASFV-infected cells, A104R transcripts were detected from 2 h postinfection (hpi) onward, reaching a maximum concentration around 16 hpi. pA104R was detected from 12 hpi onward, localizing with viral DNA replication sites and being found exclusively in the Triton-insoluble fraction. Small interfering RNA (siRNA) knockdown experiments revealed that pA104R plays a critical role in viral DNA replication and gene expression, with transfected cells showing lower viral progeny numbers (up to a reduction of 82.0%), lower copy numbers of viral genomes (-78.3%), and reduced transcription of a late viral gene (-47.6%). Taken together, our results strongly suggest that pA104R participates in the modulation of viral DNA topology, probably being involved in viral DNA replication, transcription, and packaging, emphasizing that ASFV mutants lacking the A104R gene could be used as a strategy to develop a vaccine against ASFV. IMPORTANCE Recently reintroduced in Europe, African swine fever virus (ASFV) causes a fatal disease in domestic pigs, causing high economic losses in affected countries, as no vaccine or treatment is currently
Exome sequencing and genetic testing for MODY.

Directory of Open Access Journals (Sweden)

Stefan Johansson

Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic potential when standard Sanger sequencing-based diagnostics is inconclusive. The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results. We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism. On average, we obtained 45X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively); thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes. Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.
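The filtering logic in this abstract (drop common, non-coding and synonymous variants, then restrict to a pre-defined gene panel) can be sketched as follows. This is a hedged illustration rather than the study's actual workflow: the `Variant` record, the consequence labels, the `filter_variants` name and the 1% allele-frequency cutoff are assumptions, since the abstract does not give the threshold or the annotation vocabulary used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    gene: str
    consequence: str          # hypothetical labels, e.g. "missense", "synonymous"
    allele_frequency: float   # population frequency, between 0.0 and 1.0


# Consequence classes retained here (assumed vocabulary).
KEPT_CONSEQUENCES = {"missense", "nonsense"}


def filter_variants(variants, gene_panel, max_allele_frequency=0.01):
    """Keep rare non-synonymous or nonsense variants in panel genes.

    The 1% frequency cutoff is an illustrative assumption.
    """
    return [
        v for v in variants
        if v.gene in gene_panel
        and v.consequence in KEPT_CONSEQUENCES
        and v.allele_frequency < max_allele_frequency
    ]


# Toy example using three gene names mentioned in the abstract.
panel = {"ABCC8", "HNF4A", "PPARG"}
calls = [
    Variant("ABCC8", "missense", 0.0002),    # kept
    Variant("HNF4A", "synonymous", 0.0001),  # dropped: synonymous
    Variant("PPARG", "nonsense", 0.2),       # dropped: common
    Variant("TTN", "missense", 0.0001),      # dropped: not in panel
    Variant("HNF4A", "intronic", 0.0001),    # dropped: non-coding
]
kept = filter_variants(calls, panel)
```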
Construction of Fixed Rate Non-Binary WOM Codes Based on Integer Programming

Science.gov (United States)

Fujino, Yoju; Wadayama, Tadashi

In this paper, we propose a construction of non-binary WOM (Write-Once-Memory) codes for WOM storages such as flash memories. The WOM codes discussed in this paper are fixed rate WOM codes where messages in a fixed alphabet of size $M$ can be sequentially written in the WOM storage at least $t^*$ times. In this paper, a WOM storage is modeled by a state transition graph. The proposed construction has the following two features. First, it includes a systematic method to determine the encoding regions in the state transition graph. Second, the proposed construction includes a labeling method for states by using integer programming. Several novel WOM codes for $q$-level flash memories with 2 cells are constructed by the proposed construction. They achieve the worst numbers of writes $t^*$ that meet the known upper bound in many cases. In addition, we constructed fixed rate non-binary WOM codes with the capability to reduce ICI (inter cell interference) of flash cells. One of the advantages of the proposed construction is its flexibility. It can be applied to various storage devices, to various dimensions (i.e., number of cells), and to various kinds of additional constraints.
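To make the state transition graph model concrete, the Python sketch below computes, by brute force, how many writes a given state labeling guarantees in the worst case for a small flash-like memory whose cell levels can only increase. It does not implement the paper's integer programming construction. The modelling choices are assumptions: each write requests a message different from the one currently stored, the memory starts with all cells at level 0, and the toy labeling (sum of levels modulo $M$) is picked purely for illustration.

```python
from functools import lru_cache
from itertools import product


def guaranteed_writes(q, n_cells, label, n_messages):
    """Worst-case number of writes supported by `label` on n_cells cells
    with levels 0..q-1, where levels may only increase.

    `label(state)` maps a tuple of cell levels to a message in
    range(n_messages). Assumption: every write asks for a message
    different from the currently stored one.
    """
    top = q - 1

    def successors(state):
        # All states reachable by raising cell levels (excluding staying put).
        ranges = [range(level, top + 1) for level in state]
        return (s for s in product(*ranges) if s != state)

    @lru_cache(maxsize=None)
    def writes(state):
        worst = None
        for message in range(n_messages):
            if message == label(state):
                continue
            # The encoder picks the best reachable state carrying `message`.
            best = 0
            for nxt in successors(state):
                if label(nxt) == message:
                    best = max(best, 1 + writes(nxt))
            # The adversary picks the message that minimises remaining writes.
            worst = best if worst is None else min(worst, best)
        return 0 if worst is None else worst

    return writes((0,) * n_cells)


# Toy labelings on 4-level cells, 2 cells: message = sum of levels mod M.
t_binary = guaranteed_writes(4, 2, lambda s: sum(s) % 2, 2)
t_ternary = guaranteed_writes(4, 2, lambda s: sum(s) % 3, 3)
```

Under these assumptions the toy labelings guarantee 6 writes for $M=2$ and 3 writes for $M=3$; a search over labelings, such as the integer programming approach in the abstract, would aim to maximise this quantity.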
An intact sequence-specific DNA-binding domain is required for human cytomegalovirus-mediated sequestration of p53 and may promote in vivo binding to the viral genome during infection

International Nuclear Information System (INIS)

Rosenke, Kyle; Samuel, Melanie A.; McDowell, Eric T.; Toerne, Melissa A.; Fortunato, Elizabeth A.

2006-01-01

The p53 protein is stabilized during infection of primary human fibroblasts with human cytomegalovirus (HCMV). However, the p53 in HCMV-infected cells is unable to activate its downstream targets. HCMV accomplishes this inactivation, at least in part, by sequestering p53 into viral replication centers within the cell's nucleus soon after they are established. In order to better understand the interplay between HCMV and p53 and the mechanism of sequestration, we constructed a panel of mutant p53-GFP fusion constructs for use in transfection/infection experiments. These mutants affected several post-translational modification sites and several sites within the central sequence-specific DNA-binding domain of the protein. Two categories of p53 sequestration were observed when the mutant constructs were transfected into primary fibroblasts and then infected at either high or low multiplicity. The first category, including all of the post-translational modification mutants, showed sequestration comparable to a wild-type (wt) control, while the second category, mutants affecting the DNA-binding core, were not specifically sequestered above control GFP levels. This suggested that the DNA-binding ability of the protein was required for sequestration. When the HCMV genome was analyzed for p53 consensus binding sites, 21 matches were found, which localized either to the promoters or the coding regions of viral proteins involved in DNA replication and processing as well as structural proteins. An analysis of in vivo binding to these identified sites via chromatin immunoprecipitation assays revealed differential binding to several of the sites over the course of infection
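A genome scan for p53 consensus sites like the one mentioned here can be sketched with a regular expression. The abstract does not state which consensus definition was used, so this example assumes the widely cited form of two copies of RRRCWWGYYY separated by a spacer of 0 to 13 bp (R = A/G, W = A/T, Y = C/T). The function name `find_p53_sites` and the toy sequence are hypothetical.

```python
import re

# One half-site: RRRCWWGYYY written with explicit base classes.
HALF_SITE = "[AG]{3}C[AT]{2}G[CT]{3}"
# Full site: two half-sites with a 0-13 bp spacer (assumed definition).
FULL_SITE = HALF_SITE + "[ACGT]{0,13}" + HALF_SITE
# A lookahead lets overlapping candidate sites be reported as well.
PATTERN = re.compile("(?=(" + FULL_SITE + "))")


def find_p53_sites(sequence):
    """Return (start, matched_text) for each candidate site, 0-based."""
    sequence = sequence.upper()
    return [(m.start(), m.group(1)) for m in PATTERN.finditer(sequence)]


# Toy sequence containing two adjacent half-sites with no spacer.
toy_genome = "TT" + "AGACATGCCT" + "AGACATGCCT" + "TT"
hits = find_p53_sites(toy_genome)
```

Because the degenerate half-site RRRCWWGYYY is its own reverse complement, scanning one strand is enough under this definition. Exact-consensus matching is strict; a real analysis might allow mismatches, which this sketch does not.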
Complete genome sequencing of the luminescent bacterium, Vibrio qinghaiensis sp. Q67 using PacBio technology

Science.gov (United States)

Gong, Liang; Wu, Yu; Jian, Qijie; Yin, Chunxiao; Li, Taotao; Gupta, Vijai Kumar; Duan, Xuewu; Jiang, Yueming

2018-01-01

Vibrio qinghaiensis sp.-Q67 (Vqin-Q67) is a freshwater luminescent bacterium that continuously emits blue-green light (485 nm). The bacterium has been widely used for detecting toxic contaminants. Here, we report the complete genome sequence of Vqin-Q67, obtained using third-generation PacBio sequencing technology. Continuous long reads were attained from three PacBio sequencing runs and reads >500 bp with a quality value of >0.75 were merged together into a single dataset. This resultant highly-contiguous de novo assembly has no genome gaps, and comprises two chromosomes with substantial genetic information, including protein-coding genes, non-coding RNA, transposon and gene islands. Our dataset can be useful as a comparative genome for evolution and speciation studies, as well as for the analysis of protein-coding gene families, the pathogenicity of different Vibrio species in fish, the evolution of non-coding RNA and transposon, and the regulation of gene expression in relation to the bioluminescence of Vqin-Q67.
Asymptotic behaviour of firmly non expansive sequences

International Nuclear Information System (INIS)

Rouhani, B.D.

1993-04-01

We introduce the notion of firmly non expansive sequences in a Banach space and present several results concerning their asymptotic behaviour extending previous results and giving an affirmative answer to an open question raised by S. Reich and I. Shafir. Applications to averaged mappings are also given. (author). 16 refs
ViCTree: An automated framework for taxonomic classification from protein sequences.

Science.gov (United States)

Modha, Sejal; Thanki, Anil; Cotmore, Susan F; Davison, Andrew J; Hughes, Joseph

2018-02-20

The increasing rate of submission of genetic sequences into public databases is providing a growing resource for classifying the organisms that these sequences represent. To aid viral classification, we have developed ViCTree, which automatically integrates the relevant sets of sequences in NCBI GenBank and transforms them into an interactive maximum likelihood phylogenetic tree that can be updated automatically. ViCTree incorporates ViCTreeView, which is a JavaScript-based visualisation tool that enables the tree to be explored interactively in the context of pairwise distance data. To demonstrate utility, ViCTree was applied to subfamily Densovirinae of family Parvoviridae. This led to the identification of six new species of insect virus. ViCTree is open-source and can be run on any Linux- or Unix-based computer or cluster. A tutorial, the documentation and the source code are available under a GPL3 license, and can be accessed at http://bioinformatics.cvr.ac.uk/victree_web/. Contact: sejal.modha@glasgow.ac.uk.
A metagenomic assessment of viral contamination on fresh parsley plants irrigated with fecally tainted river water.

Science.gov (United States)

Fernandez-Cassi, X; Timoneda, N; Gonzales-Gustavson, E; Abril, J F; Bofill-Mas, S; Girones, R

2017-09-18

Microbial food-borne diseases are still frequently reported despite the implementation of microbial quality legislation to improve food safety. Among all the microbial agents, viruses are the most important causative agents of food-borne outbreaks. The development and application of a new generation of sequencing techniques to test for viral contaminants in fresh produce is an unexplored field that allows for the study of the viral populations that might be transmitted by the fecal-oral route through the consumption of contaminated food. To advance this promising field, parsley was planted and grown under controlled conditions and irrigated using contaminated river water. Viruses polluting the irrigation water and the parsley leaves were studied by using metagenomics. To address possible contamination due to sample manipulation, library preparation, and other sources, parsley plants irrigated with nutritive solution were used as a negative control. In parallel, viruses present in the river water used for plant irrigation were analyzed using the same methodology. It was possible to assign viral taxons from 2.4 to 74.88% of the total reads sequenced depending on the sample. Most of the viral reads detected in the river water were related to the plant viral families Tymoviridae (66.13%) and Virgaviridae (14.45%) and the phage viral families Myoviridae (5.70%), Siphoviridae (5.06%), and Microviridae (2.89%). Less than 1% of the viral reads were related to viral families that infect humans, including members of the Adenoviridae, Reoviridae, Picornaviridae and Astroviridae families. On the surface of the parsley plants, most of the viral reads that were detected were assigned to the Dicistroviridae family (41.52%). Sequences related to important viral pathogens, such as the hepatitis E virus, several picornaviruses from species A and B as well as human sapoviruses and GIV noroviruses were detected. The high diversity of viral sequences found in the parsley plants
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

Directory of Open Access Journals (Sweden)

Glass John I

2010-07-01

repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions: We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements), a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches.
Reconstructing viral genomes from the environment using fosmid clones: the case of haloviruses.

Directory of Open Access Journals (Sweden)

Inmaculada Garcia-Heredia

BACKGROUND: Metaviriomes, the viral genomes present in an environment, have been studied by direct sequencing of the viral DNA or by cloning in small insert libraries. The short reads generated by both approaches make it very difficult to assemble and annotate such flexible genomic entities. Many environmental viruses belong to unknown groups or prey on uncultured and little known cellular lineages, and hence might not be present in databases. METHODOLOGY AND PRINCIPAL FINDINGS: Here we have used a different approach, the cloning of viral DNA into fosmids before sequencing, to obtain natural contigs that are close to the size of a viral genome. We have studied a relatively low diversity extreme environment: saturated NaCl brines, which simplifies the analysis and interpretation of the data. Forty-two different viral genomes were retrieved; some of these were almost complete and could be tentatively identified as head-tail phages (Caudovirales). CONCLUSIONS AND SIGNIFICANCE: We found a cluster of phage genomes that most likely infect Haloquadratum walsbyi, the square archaeon and major component of the community in these hypersaline habitats. The identity of the prey could be confirmed by the presence of CRISPR spacer sequences shared by the virus and one of the available strain genomes. Other viral clusters detected appeared to prey on the Nanohaloarchaea and on the bacterium Salinibacter ruber, covering most of the diversity of microbes found in this type of environment. This approach thus appears to be a viable alternative for describing metaviriomes in a much more detailed and reliable way than the more common approaches based on direct sequencing. An example of transfer of a CRISPR cluster including repeats and spacers was incidentally found, supporting the dynamic nature and frequent transfer of this peculiar prokaryotic mechanism of cell protection.
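The host-assignment idea in this abstract, where a CRISPR spacer in a host genome matches a viral contig, can be illustrated with a small exact-match search. This is only a sketch under simplifying assumptions: spacers are matched exactly on either strand, whereas real analyses usually tolerate mismatches, and the names `spacer_hits`, `sp1`, `sp2`, `v1` and `v2` are hypothetical.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def spacer_hits(spacers, contigs):
    """Return (spacer_name, contig_name) pairs where the spacer, or its
    reverse complement, occurs exactly in the contig."""
    hits = []
    for spacer_name, spacer in spacers.items():
        candidates = (spacer, reverse_complement(spacer))
        for contig_name, contig in contigs.items():
            if any(candidate in contig for candidate in candidates):
                hits.append((spacer_name, contig_name))
    return hits


# Toy data: sp1 matches contig v1 on the reverse strand, sp2 matches nothing.
toy_spacers = {"sp1": "AAACCGT", "sp2": "GATTACA"}
toy_contigs = {"v1": "TTACGGTTTGG", "v2": "CCCCCC"}
matches = spacer_hits(toy_spacers, toy_contigs)
```

A spacer hit links a viral contig to the host lineage that carries the spacer, which is the kind of evidence the abstract uses to confirm the prey.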
Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

Science.gov (United States)

Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

2016-05-11

Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.
Genetic population structure of marine viral haemorrhagic septicaemia virus (VHSV)

DEFF Research Database (Denmark)

Snow, M.; Bain, N.; Black, J.

2004-01-01

The nucleotide sequences of a specific region of the nucleoprotein gene were compared in order to investigate the genetic population structure of marine viral haemorrhagic septicaemia virus (VHSV). Analysis of the sequence from 128 isolates of diverse geographic and host origin renders this the most comprehensive molecular epidemiological study of marine VHSV conducted to date. Phylogenetic analysis of nucleoprotein gene sequences confirmed the existence of the 4 major genotypes previously identified based on N- and subsequent G-gene based analyses. The range of Genotype I included subgroups of isolates associated with rainbow trout aquaculture (Genotype Ia) and those from the Baltic marine environment (Genotype Ib) to emphasise the relatively close genetic relationship between these isolates. The existence of an additional genotype circulating within the Baltic Sea (Genotype II) was also...
Female-biased expression of long non-coding RNAs in domains that escape X-inactivation in mouse

Directory of Open Access Journals (Sweden)

Lu Lu

2010-11-01

Background: Sexual dimorphism in brain gene expression has been recognized in several animal species. However, the relevant regulatory mechanisms remain poorly understood. To investigate whether sex-biased gene expression in mammalian brain is globally regulated or locally regulated in diverse brain structures, and to study the genomic organisation of brain-expressed sex-biased genes, we performed a large scale gene expression analysis of distinct brain regions in adult male and female mice. Results: This study revealed spatial specificity in sex-biased transcription in the mouse brain, and identified 173 sex-biased genes in the striatum; 19 in the neocortex; 12 in the hippocampus and 31 in the eye. Genes located on sex chromosomes were consistently over-represented in all brain regions. Analysis on a subset of genes with sex-bias in more than one tissue revealed Y-encoded male-biased transcripts and X-encoded female-biased transcripts known to escape X-inactivation. In addition, we identified novel coding and non-coding X-linked genes with female-biased expression in multiple tissues. Interestingly, the chromosomal positions of all of the female-biased non-coding genes are in close proximity to protein-coding genes that escape X-inactivation. This defines X-chromosome domains each of which contains a coding and a non-coding female-biased gene. Lack of repressive chromatin marks in non-coding transcribed loci supports the possibility that they escape X-inactivation. Moreover, RNA-DNA combined FISH experiments confirmed the biallelic expression of one such novel domain. Conclusion: This study demonstrated that the number of genes with sex-biased expression varies between individual brain regions in mouse. The sex-biased genes identified are localized on many chromosomes. At the same time, sexually dimorphic gene expression that is common to several parts of the brain is mostly restricted to the sex chromosomes. Moreover, the study uncovered
Non-viral causes of liver cancer: does obesity led inflammation play a role?

Science.gov (United States)

Alzahrani, Badr; Iseli, Tristan J; Hebbard, Lionel W

2014-04-10

Liver cancer is the fifth most common cancer worldwide and the third most common cause of cancer mortality. Hepatocellular carcinoma (HCC) accounts for around 90% of primary liver cancers. Chronic infections with hepatitis B and hepatitis C viruses are two of the most common causes of liver cancer. However, there are non-viral factors that are associated with liver cancer development. Numerous population studies have revealed strong links between obesity and the development of liver cancer. Obesity can alter hepatic pathology and metabolism and promote inflammation, leading to nonalcoholic fatty liver disease (NAFLD) and the progression to the more severe form, non-alcoholic steatohepatitis (NASH). NASH is characterised by prominent steatosis and inflammation, and can lead to HCC. Here, we discuss the role of obesity in inflammation and the principal signalling mechanisms involved in HCC formation. Copyright © 2013 Elsevier Ireland Ltd. All rights reserved.
Targeted sequencing of large genomic regions with CATCH-Seq.

Directory of Open Access Journals (Sweden)

Kenneth Day

Current target enrichment systems for large-scale next-generation sequencing typically require synthetic oligonucleotides used as capture reagents to isolate sequences of interest. The majority of target enrichment reagents are focused on gene coding regions or promoters en masse. Here we introduce development of a customizable targeted capture system using biotinylated RNA probe baits transcribed from sheared bacterial artificial chromosome clone templates that enables capture of large, contiguous blocks of the genome for sequencing applications. This clone adapted template capture hybridization sequencing (CATCH-Seq) procedure can be used to capture both coding and non-coding regions of a gene, and resolve the boundaries of copy number variations within a genomic target site. Furthermore, libraries constructed with methylated adapters prior to solution hybridization also enable targeted bisulfite sequencing. We applied CATCH-Seq to diverse targets ranging in size from 125 kb to 3.5 Mb. Our approach provides a simple and cost effective alternative to other capture platforms because of template-based, enzymatic probe synthesis and the lack of oligonucleotide design costs. Given its similarity in procedure, CATCH-Seq can also be performed in parallel with commercial systems.
Contagious Content: Viral Video Ads Identification of Content Characteristics that Help Online Video Advertisements Go Viral

Directory of Open Access Journals (Sweden)

Yentl Knossenburg

2016-12-01

Why do some online video advertisements go viral while others remain unnoticed? What kind of video content keeps the viewer interested and motivated to share? Many companies have realized the need to innovate their marketing strategies and have embraced the newest ways of using technology, such as the Internet, to their advantage, as in the example of virality. Yet few marketers actually understand how, and academic literature on this topic is still in development. This study investigated which content characteristics distinguish successful from non-successful online viral video advertisements by analyzing 641 cases using Structural Equation Modeling. Results show that Engagement and Surprise are two main content characteristics that significantly increase the chance of online video advertisements to go viral.
Long Non-coding RNAs in Response to Genotoxic Stress

Institute of Scientific and Technical Information of China (English)

Xiaoman Li; Dong Pan; Baoquan Zhao; Burong Hu

2016-01-01

Long non-coding RNAs (lncRNAs) are increasingly involved in diverse biological processes. Upon DNA damage, the DNA damage response (DDR) elicits a complex signaling cascade, which includes the induction of lncRNAs. LncRNA-mediated DDR is involved in non-canonical and canonical manners. DNA-damage induced lncRNAs contribute to the regulation of cell cycle, apoptosis, and DNA repair, thereby playing a key role in maintaining genome stability. This review summarizes the emerging role of lncRNAs in DNA damage and repair.
Novel chaperonins are prevalent in the virioplankton and demonstrate links to viral biology and ecology.

Science.gov (United States)

Marine, Rachel L; Nasko, Daniel J; Wray, Jeffrey; Polson, Shawn W; Wommack, K Eric

2017-11-01

Chaperonins are protein-folding machinery found in all cellular life. Chaperonin genes have been documented within a few viruses, yet, surprisingly, analysis of metagenome sequence data indicated that chaperonin-carrying viruses are common and geographically widespread in marine ecosystems. Also unexpected was the discovery of viral chaperonin sequences related to thermosome proteins of archaea, indicating the presence of virioplankton populations infecting marine archaeal hosts. Virioplankton large subunit chaperonin sequences (GroELs) were divergent from bacterial sequences, indicating that viruses have carried this gene over long evolutionary time. Analysis of viral metagenome contigs indicated that: the order of large and small subunit genes was linked to the phylogeny of GroEL; both lytic and temperate phages may carry group I chaperonin genes; and viruses carrying a GroEL gene likely have large double-stranded DNA (dsDNA) genomes (>70 kb). Given these connections, it is likely that chaperonins are critical to the biology and ecology of virioplankton populations that carry these genes. Moreover, these discoveries raise the intriguing possibility that viral chaperonins may more broadly alter the structure and function of viral and cellular proteins in infected host cells.

Analysis of viral protein-2 encoding gene of avian encephalomyelitis virus from field specimens in Central Java region, Indonesia

Directory of Open Access Journals (Sweden)

Aris Haryanto

2016-01-01

Aim: Avian encephalomyelitis (AE) is a viral disease which can infect various types of poultry, especially chicken. In Indonesia, the incidence of AE infection in chicken has been reported since 2009, and it tends to increase from year to year. The objective of this study was to analyze the viral protein 2 (VP-2) encoding gene of AE virus (AEV) from various species of birds in field specimens by reverse transcription polymerase chain reaction (RT-PCR) amplification using specific nucleotide primers for confirmation of AE diagnosis. Materials and Methods: A total of 13 AEV samples were isolated from various species of poultry which were serologically diagnosed as infected by AEV from some areas in Central Java, Indonesia. The research stages consisted of virus sample collection from field specimens, extraction of AEV RNA, amplification of the VP-2 protein encoding gene by RT-PCR, separation of the RT-PCR product by agarose gel electrophoresis, DNA sequencing and data analysis. Results: Amplification products of the VP-2 encoding gene of AEV by RT-PCR from various types of poultry from field specimens showed a positive result for sample code 499/4/12, which generated a DNA fragment of 619 bp. Sensitivity testing of the RT-PCR amplification showed that the minimum concentration of RNA template is 127.75 ng/μl. The multiple alignment of the DNA sequencing product indicated that the positive sample with code 499/4/12 has 92% nucleotide homology with AEV accession number AV1775/07 and 85% nucleotide homology with accession number ZCHP2/0912695 from the GenBank database. Analysis of the VP-2 gene sequence found 46 nucleotide differences between isolate 499/4/12 and accession number AV1775/07, and 93 nucleotide differences from accession number ZCHP2/0912695. Conclusions: Analyses of the VP-2 encoding gene of AEV with the RT-PCR method from 13 samples from field specimens generated the DNA fragment in the size of 619 bp from one sample with
Development of a 3D non-linear implicit MHD code

International Nuclear Information System (INIS)

Nicolas, T.; Ichiguchi, K.

2016-06-01

This paper details the on-going development of a 3D non-linear implicit MHD code, which aims at making possible large scale simulations of the non-linear phase of the interchange mode. The goal of the paper is to explain the rationale behind the choices made along the development, and the technical difficulties encountered. At the present stage, the development of the code has not been completed yet. Most of the discussion is concerned with the first approach, which utilizes cartesian coordinates in the poloidal plane. This approach shows serious difficulties in writing the preconditioner, closely related to the choice of coordinates. A second approach, based on curvilinear coordinates, also faced significant difficulties, which are detailed. The third and last approach explored involves unstructured tetrahedral grids, and indicates the possibility to solve the problem. The issue of domain meshing is addressed. (author)
Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.

Science.gov (United States)

Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi

2016-03-01

Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding for octopamine and dopamine receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.
Transcriptional role of androgen receptor in the expression of long non-coding RNA Sox2OT in neurogenesis.

Directory of Open Access Journals (Sweden)

Valentina Tosetti

The complex architecture of adult brain derives from tightly regulated migration and differentiation of precursor cells generated during embryonic neurogenesis. Changes at transcriptional level of genes that regulate migration and differentiation may lead to neurodevelopmental disorders. Androgen receptor (AR) is a transcription factor that is already expressed during early embryonic days. However, the role of AR in the regulation of gene expression at early embryonic stage is yet to be determined. Long non-coding RNA (lncRNA) Sox2 overlapping transcript (Sox2OT) plays a crucial role in gene expression control during development but its transcriptional regulation is still to be clearly defined. Here, using Bicalutamide to pharmacologically inactivate AR, we investigated whether AR participates in the regulation of the transcription of the lncRNA Sox2OT at early embryonic stage. We identified a new DNA binding region upstream of Sox2 locus containing three androgen response elements (ARE), and found that AR binds such a sequence in embryonic neural stem cells and in mouse embryonic brain. Our data suggest that through this binding, AR can promote the RNA polymerase II dependent transcription of Sox2OT. Our findings also suggest that AR participates in embryonic neurogenesis through transcriptional control of the long non-coding RNA Sox2OT.
Non-Coding RNA in the Pathogenesis, Progression and Treatment of Hypertension

Directory of Open Access Journals (Sweden)

Christiana Leimena

2018-03-01

Hypertension is a complex, multifactorial disease that involves the coexistence of multiple risk factors, environmental factors and physiological systems. The complexities extend to the treatment and management of hypertension, which are still the pursuit of many researchers. In the last two decades, various genes have emerged as possible biomarkers and have become the target for investigations of specialized drug design based on its risk factors and the primary cause. Owing to the growing technology of microarrays and next-generation sequencing, the non-protein-coding RNAs (ncRNAs) have increasingly gained attention, and their status of redundancy has flipped to importance in normal cellular processes, as well as in disease progression. The ncRNA molecules make up a significant portion of the human genome, and their role in diseases continues to be uncovered. Specifically, the cellular role of these ncRNAs has played a part in the pathogenesis of hypertension and its progression to heart failure. This review explores the function of the ncRNAs, their types and biology, the current update of their association with hypertension pathology and the potential new therapeutic regime for hypertension.
CRISPR-Cas systems exploit viral DNA injection to establish and maintain adaptive immunity.

Science.gov (United States)

Modell, Joshua W; Jiang, Wenyan; Marraffini, Luciano A

2017-04-06

Clustered regularly interspaced short palindromic repeats (CRISPR)-Cas systems provide protection against viral and plasmid infection by capturing short DNA sequences from these invaders and integrating them into the CRISPR locus of the prokaryotic host. These sequences, known as spacers, are transcribed into short CRISPR RNA guides that specify the cleavage site of Cas nucleases in the genome of the invader. It is not known when spacer sequences are acquired during viral infection. Here, to investigate this, we tracked spacer acquisition in Staphylococcus aureus cells harbouring a type II CRISPR-Cas9 system after infection with the staphylococcal bacteriophage ϕ12. We found that new spacers were acquired immediately after infection preferentially from the cos site, the viral free DNA end that is first injected into the cell. Analysis of spacer acquisition after infection with mutant phages demonstrated that most spacers are acquired during DNA injection, but not during other stages of the viral cycle that produce free DNA ends, such as DNA replication or packaging. Finally, we showed that spacers acquired from early-injected genomic regions, which direct Cas9 cleavage of the viral DNA immediately after infection, provide better immunity than spacers acquired from late-injected regions. Our results reveal that CRISPR-Cas systems exploit the phage life cycle to generate a pattern of spacer acquisition that ensures a successful CRISPR immune response.
Stability of the resistance to the thiosemicarbazone derived from 5,6-dimethoxy-1-indanone, a non-nucleoside polymerase inhibitor of bovine viral diarrhea virus.

Science.gov (United States)

Castro, Eliana F; Campos, Rodolfo H; Cavallaro, Lucía V

2014-01-01

Bovine viral diarrhea virus (BVDV) is the prototype Pestivirus. BVDV infection is distributed worldwide and causes serious problems for the livestock industry. The thiosemicarbazone of 5,6-dimethoxy-1-indanone (TSC) is a non-nucleoside polymerase inhibitor (NNI) of BVDV. All TSC-resistant BVDV variants (BVDV-TSCr T1-5) present an N264D mutation in the NS5B gene (RdRp) whereas the variant BVDV-TSCr T1 also presents an NS5B A392E mutation. In the present study, we carried out twenty passages of BVDV-TSCr T1-5 in MDBK cells in the absence of TSC to evaluate the stability of the resistance. The viral populations obtained (BVDV R1-5) remained resistant to the antiviral compound and conserved the mutations in NS5B associated with this phenotype. Along the passages, BVDV R2, R3 and R5 presented a delay in the production of cytopathic effect that correlated with a decrease in cell apoptosis and intracellular accumulation of viral RNA. The complete genome sequences that encode for NS2 to NS5B, Npro and Erns were analyzed. Additional mutations were detected in the NS5B of BVDV R1, R3 and R4. In both BVDV R2 and R3, most of the mutations found were localized in NS5A, whereas in BVDV R5, the only mutation fixed was NS5A V177A. These results suggest that mutations in NS5A could alter BVDV cytopathogenicity. In conclusion, the stability of the resistance to TSC may be due to the fixation of different compensatory mutations in each BVDV-TSCr. During their replication in a TSC-free medium, some virus populations presented a kind of interaction with the host cell that resembled a persistent infection: decreased cytopathogenicity and viral genome synthesis. This is the first report on the stability of antiviral resistance and on the evolution of NNI-resistant BVDV variants. The results obtained for BVDV-TSCr could also be applied for other NNIs.
Nucleotide sequence of tomato ringspot virus RNA-2.

Science.gov (United States)

Rott, M E; Tremaine, J H; Rochon, D M

1991-07-01

The sequence of tomato ringspot virus (TomRSV) RNA-2 has been determined. It is 7273 nucleotides in length excluding the 3' poly(A) tail and contains a single long open reading frame (ORF) of 5646 nucleotides in the positive sense beginning at position 78 and terminating at position 5723. A second in-frame AUG at position 441 is in a more favourable context for initiation of translation and may act as a site for initiation of translation. The TomRSV RNA-2 3' noncoding region is 1550 nucleotides in length. The coat protein is located in the C-terminal region of the large polypeptide and shows significant but limited amino acid sequence similarity to the putative coat proteins of the nepoviruses tomato black ring (TBRV), Hungarian grapevine chrome mosaic (GCMV) and grapevine fanleaf (GFLV). Comparisons of the coding and non-coding regions of TomRSV RNA-2 and the RNA components of TBRV, GCMV, GFLV and the comovirus cowpea mosaic virus revealed significant similarity for over 300 amino acids between the coding region immediately to the N-terminal side of the putative coat proteins of TomRSV and GFLV; very little similarity could be detected among the non-coding regions of TomRSV and any of these viruses.
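The coordinates quoted in this abstract can be cross-checked with a few lines of arithmetic. The sketch below is only an illustration: it assumes the positions are 1-based and inclusive (an assumption that is consistent with the stated ORF and 3' noncoding lengths), and all variable and function names are hypothetical.

```python
# Coordinates quoted in the abstract, assumed to be 1-based and inclusive.
GENOME_LENGTH = 7273   # RNA-2 length, excluding the 3' poly(A) tail
ORF_START = 78         # first nucleotide of the long ORF
ORF_END = 5723         # last nucleotide of the long ORF
SECOND_AUG = 441       # second AUG, reported to be in frame


def span_length(start: int, end: int) -> int:
    """Length of a 1-based inclusive interval."""
    return end - start + 1


orf_length = span_length(ORF_START, ORF_END)    # nucleotides in the ORF
utr5_length = ORF_START - 1                     # nucleotides before the ORF
utr3_length = GENOME_LENGTH - ORF_END           # nucleotides after the ORF
second_aug_in_frame = (SECOND_AUG - ORF_START) % 3 == 0

print(orf_length, utr5_length, utr3_length, second_aug_in_frame)
# prints: 5646 77 1550 True
```

Under that coordinate assumption, the ORF length (5646 nt), the 3' noncoding length (1550 nt) and the in-frame position of the second AUG all agree with the figures given in the abstract.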
Optimizing multiple sequence alignments using a genetic algorithm based on three objectives: structural information, non-gaps percentage and totally conserved columns.

Science.gov (United States)

Ortuño, Francisco M; Valenzuela, Olga; Rojas, Fernando; Pomares, Hector; Florido, Javier P; Urquiza, Jose M; Rojas, Ignacio

2013-09-01

Multiple sequence alignments (MSAs) are widely used approaches in bioinformatics to carry out other tasks such as structure predictions, biological function analyses or phylogenetic modeling. However, current tools usually provide partially optimal alignments, as each one is focused on specific biological features. Thus, the same set of sequences can produce different alignments, above all when sequences are less similar. Consequently, researchers and biologists do not agree about which is the most suitable way to evaluate MSAs. Recent evaluations tend to use more complex scores including further biological features. Among them, 3D structures are increasingly being used to evaluate alignments. Because structures are more conserved in proteins than sequences, scores with structural information are better suited to evaluate more distant relationships between sequences. The proposed multiobjective algorithm, based on the non-dominated sorting genetic algorithm, aims to jointly optimize three objectives: STRIKE score, non-gaps percentage and totally conserved columns. It was significantly assessed on the BAliBASE benchmark according to the Kruskal-Wallis test (P algorithm also outperforms other aligners, such as ClustalW, Multiple Sequence Alignment Genetic Algorithm (MSA-GA), PRRP, DIALIGN, Hidden Markov Model Training (HMMT), Pattern-Induced Multi-sequence Alignment (PIMA), MULTIALIGN, Sequence Alignment Genetic Algorithm (SAGA), PILEUP, Rubber Band Technique Genetic Algorithm (RBT-GA) and Vertical Decomposition Genetic Algorithm (VDGA), according to the Wilcoxon signed-rank test (P 0.05) with the advantage of being able to use less structures. Structural information is included within the objective function to evaluate more accurately the obtained alignments. The source code is available at http://www.ugr.es/~fortuno/MOSAStrE/MO-SAStrE.zip.
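Two of the three objectives named in this abstract, non-gaps percentage and totally conserved columns, are simple enough to illustrate. The following is a minimal sketch under assumed definitions (share of non-gap cells, and gap-free columns with a single residue); the published tool may define them differently, and the STRIKE score is omitted because it requires structural data. Function names and the toy alignment are hypothetical.

```python
from typing import List


def non_gap_percentage(alignment: List[str], gap: str = "-") -> float:
    """Percentage of alignment cells that hold a residue rather than a gap."""
    total = sum(len(row) for row in alignment)
    gaps = sum(row.count(gap) for row in alignment)
    return 100.0 * (total - gaps) / total


def totally_conserved_columns(alignment: List[str], gap: str = "-") -> int:
    """Number of gap-free columns in which every sequence has the same residue."""
    count = 0
    for column in zip(*alignment):
        if gap not in column and len(set(column)) == 1:
            count += 1
    return count


# Toy alignment invented for illustration; rows must have equal length.
toy_alignment = [
    "MKV-LTA",
    "MKVALTA",
    "MRV-LSA",
]
print(non_gap_percentage(toy_alignment))
print(totally_conserved_columns(toy_alignment))  # 4 conserved columns
```

A multiobjective optimizer would treat such values as separate objectives to be maximized jointly rather than collapsing them into one weighted score.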
Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

Science.gov (United States)

Pujar, Shashikant; O'Leary, Nuala A; Farrell, Catherine M; Loveland, Jane E; Mudge, Jonathan M; Wallin, Craig; Girón, Carlos G; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; Martin, Fergal J; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Suner, Marie-Marthe; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bruford, Elspeth A; Bult, Carol J; Frankish, Adam; Murphy, Terence; Pruitt, Kim D

2018-01-04

The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.
ncRNA-class Web Tool: Non-coding RNA feature extraction and pre-miRNA classification web tool

KAUST Repository

Kleftogiannis, Dimitrios A.; Theofilatos, Konstantinos A.; Papadimitriou, Stergios; Tsakalidis, Athanasios K.; Likothanassis, Spiridon D.; Mavroudi, Seferina P.

2012-01-01

Until recently, it was commonly accepted that most genetic information is transacted by proteins. Recent evidence suggests that the majority of the genomes of mammals and other complex organisms are in fact transcribed into non-coding RNAs (ncRNAs), many of which are alternatively spliced and/or processed into smaller products. Non coding RNA genes analysis requires the calculation of several sequential, thermodynamical and structural features. Many independent tools have already been developed for the efficient calculation of such features but to the best of our knowledge there does not exist any integrative approach for this task. The most significant amount of existing work is related to the miRNA class of non-coding RNAs. MicroRNAs (miRNAs) are small non-coding RNAs that play a significant role in gene regulation and their prediction is a challenging bioinformatics problem. Non-coding RNA feature extraction and pre-miRNA classification Web Tool (ncRNA-class Web Tool) is a publicly available web tool ( http://150.140.142.24:82/Default.aspx ) which provides a user friendly and efficient environment for the effective calculation of a set of 58 sequential, thermodynamical and structural features of non-coding RNAs, plus a tool for the accurate prediction of miRNAs. © 2012 IFIP International Federation for Information Processing.
Confinement of Reinforced-Concrete Columns with Non-Code Compliant Confining Reinforcement plus Supplemental Pen-Binder

Directory of Open Access Journals (Sweden)

Anang Kristianto

2012-11-01

One of the important requirements for earthquake resistant building related to confinement is the use of seismic hooks in the hoop or confining reinforcement of reinforced-concrete column elements. However, installation of a confining reinforcement with a 135-degree hook is not easy. Therefore, in practice, many construction workers apply a confining reinforcement with a 90-degree hook (non-code compliant). Based on research and records of recent earthquakes in Indonesia, the use of a non-code compliant confining reinforcement for concrete columns produces structures with poor seismic performance. This paper presents a study that introduces an additional element that is expected to improve the effectiveness of concrete columns confined with a non-code compliant confining reinforcement. The additional element, named a pen-binder, is used to keep the non-code compliant confining reinforcement in place. The effectiveness of this element under pure axial concentric loading was investigated comprehensively. The specimens tested in this study were 18 concrete columns, with a cross-section of 170 mm x 170 mm and a height of 480 mm. The main test variables were the material type of the pen-binder, the angle of the hook, and the confining reinforcement configuration. The test results indicate that adding pen-binders can effectively improve the strength and ductility of the column specimens confined with a non-code compliant confining reinforcement.
Cytoplasmic translocation of polypyrimidine tract-binding protein and its binding to viral RNA during Japanese encephalitis virus infection inhibits virus replication.

Directory of Open Access Journals (Sweden)

Deepika Bhullar

Japanese encephalitis virus (JEV) has a single-stranded, positive-sense RNA genome containing a single open reading frame flanked by the 5'- and 3'-non-coding regions (NCRs). The virus genome replicates via a negative-sense RNA intermediate. The NCRs and their complementary sequences in the negative-sense RNA are the sites for assembly of the RNA replicase complex thereby regulating the RNA synthesis and virus replication. In this study, we show that the 55-kDa polypyrimidine tract-binding protein (PTB) interacts in vitro with both the 5'-NCR of the positive-sense genomic RNA--5NCR(+), and its complementary sequence in the negative-sense replication intermediate RNA--3NCR(-). The interaction of viral RNA with PTB was validated in infected cells by JEV RNA co-immunoprecipitation and JEV RNA-PTB colocalization experiments. Interestingly, we observed phosphorylation-coupled translocation of nuclear PTB to cytoplasmic foci that co-localized with JEV RNA early during JEV infection. Our studies employing the PTB silencing and over-expression in cultured cells established an inhibitory role of PTB in JEV replication. Using RNA-protein binding assay we show that PTB competitively inhibits association of JEV 3NCR(-) RNA with viral RNA-dependent RNA polymerase (NS5 protein), an event required for the synthesis of the plus-sense genomic RNA. cAMP is known to promote the Protein kinase A (PKA)-mediated PTB phosphorylation. We show that cells treated with a cAMP analogue had an enhanced level of phosphorylated PTB in the cytoplasm and a significantly suppressed JEV replication. Data presented here show a novel, cAMP-induced, PTB-mediated, innate host response that could effectively suppress JEV replication in mammalian cells.
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

Science.gov (United States)

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. Among the more important structures in a DNA sequence are repeat-related ones. They often have to be masked before protein coding regions along a DNA sequence are identified or redundant expressed sequence tags (ESTs) are sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cerevisiae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale on a PC.
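As a rough illustration of the recurrence-time idea in the abstract above, the sketch below records, for every k-mer in a sequence, the distance to its previous occurrence and tallies those distances in a histogram. The function name `recurrence_time_histogram`, the window length `k`, and the toy sequence are assumptions made for illustration; the authors' actual statistics and codon index are not reproduced here.

```python
from collections import Counter

def recurrence_time_histogram(seq, k=3):
    """Tally distances between successive occurrences of each k-mer.

    A recurrence time is taken here to be the gap (in bases) between one
    occurrence of a k-mer and its next occurrence. This is an illustrative
    definition, not necessarily the one used in the cited paper.
    """
    last_seen = {}         # k-mer -> start index of its latest occurrence
    histogram = Counter()  # recurrence time -> number of times observed
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in last_seen:
            histogram[i - last_seen[kmer]] += 1
        last_seen[kmer] = i
    return histogram

# Toy input (hypothetical): a perfect period-3 repeat.
toy = "ACG" * 5
hist = recurrence_time_histogram(toy, k=3)
print(dict(hist))
```

In a histogram of this kind, a strong peak at a given recurrence time points to a repeat or periodicity of that length, which is the sort of feature the abstract says the method is meant to expose.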
The Complete Sequence of a Human Parainfluenzavirus 4 Genome

Science.gov (United States)

Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

2009-01-01

Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536
The Complete Sequence of a Human Parainfluenzavirus 4 Genome

Directory of Open Access Journals (Sweden)

Carmen Yea

2009-06-01

Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized.
Description and applicability of the BEFEM-CODE

Energy Technology Data Exchange (ETDEWEB)

Groth, T.

1980-05-15

The BEFEM-CODE, developed for rock mechanics problems in hard rock with joints, is a simple FEM code constructed using triangular and quadrilateral elements. As an option, a joint element of the Goodman type may be used. The Cook-Pian type quadrilateral stress hybrid element was introduced into the version of the code used for the Naesliden project, to replace the constant stress quadrilateral elements. This hybrid element, derived with assumed stress distributions, simplifies the excavation process for use in non-linear models. The shear behavior of the Goodman 1976 joint element has been replaced by Goodman's 1968 formulation. This element makes it possible to take dilation into account, but it was not considered necessary to use dilation to simulate proper joint behavior in the Naesliden project. The code uses Barton's shear strength criteria. Excessive nodal forces due to failure and non-linearities in the joint elements are redistributed with stress transfer iterations. Convergence can be speeded up by dividing each excavation sequence into several loadsteps in which the stiffness matrix is recalculated.
Viral Diagnostics in Plants Using Next Generation Sequencing: Computational Analysis in Practice

Directory of Open Access Journals (Sweden)

Susan Jones

2017-10-01

Viruses cause significant yield and quality losses in a wide variety of cultivated crops. Hence, the detection and identification of viruses is a crucial facet of successful crop production and of great significance in terms of world food security. Whilst the adoption of molecular techniques such as RT-PCR has increased the speed and accuracy of viral diagnostics, such techniques only allow the detection of known viruses, i.e., each test is specific to one or a small number of related viruses. Therefore, unknown viruses can be missed and testing can be slow and expensive if molecular tests are unavailable. Methods for simultaneous detection of multiple viruses have been developed, and next-generation sequencing (NGS) is now a principal focus of this area, as it enables unbiased and hypothesis-free testing of plant samples. The development of NGS protocols capable of detecting multiple known and emergent viruses present in infected material is proving to be a major advance for crops, nuclear stocks or imported plants and germplasm, in which disease symptoms are absent, unspecific or only triggered by multiple viruses. Researchers want to answer the question “how many different viruses are present in this crop plant?” without knowing what they are looking for: RNA-sequencing (RNA-seq) of plant material allows this question to be addressed. As well as needing efficient nucleic acid extraction and enrichment protocols, virus detection using RNA-seq requires fast and robust bioinformatics methods to enable host sequence removal and virus classification. In this review recent studies that use RNA-seq for virus detection in a variety of crop plants are discussed with specific emphasis on the computational methods implemented. The main features of a number of specific bioinformatics workflows developed for virus detection from NGS data are also outlined and possible reasons why these have not yet been widely adopted are discussed. The review concludes by discussing the future
Viral Diagnostics in Plants Using Next Generation Sequencing: Computational Analysis in Practice.

Science.gov (United States)

Jones, Susan; Baizan-Edge, Amanda; MacFarlane, Stuart; Torrance, Lesley

2017-01-01

Viruses cause significant yield and quality losses in a wide variety of cultivated crops. Hence, the detection and identification of viruses is a crucial facet of successful crop production and of great significance in terms of world food security. Whilst the adoption of molecular techniques such as RT-PCR has increased the speed and accuracy of viral diagnostics, such techniques only allow the detection of known viruses, i.e., each test is specific to one or a small number of related viruses. Therefore, unknown viruses can be missed and testing can be slow and expensive if molecular tests are unavailable. Methods for simultaneous detection of multiple viruses have been developed, and next-generation sequencing (NGS) is now a principal focus of this area, as it enables unbiased and hypothesis-free testing of plant samples. The development of NGS protocols capable of detecting multiple known and emergent viruses present in infected material is proving to be a major advance for crops, nuclear stocks or imported plants and germplasm, in which disease symptoms are absent, unspecific or only triggered by multiple viruses. Researchers want to answer the question "how many different viruses are present in this crop plant?" without knowing what they are looking for: RNA-sequencing (RNA-seq) of plant material allows this question to be addressed. As well as needing efficient nucleic acid extraction and enrichment protocols, virus detection using RNA-seq requires fast and robust bioinformatics methods to enable host sequence removal and virus classification. In this review recent studies that use RNA-seq for virus detection in a variety of crop plants are discussed with specific emphasis on the computational methods implemented. The main features of a number of specific bioinformatics workflows developed for virus detection from NGS data are also outlined and possible reasons why these have not yet been widely adopted are discussed. The review concludes by discussing the future directions of this
Deep sequencing of foot-and-mouth disease virus reveals RNA sequences involved in genome packaging.

Science.gov (United States)

Logan, Grace; Newman, Joseph; Wright, Caroline F; Lasecka-Dykes, Lidia; Haydon, Daniel T; Cottam, Eleanor M; Tuthill, Tobias J

2017-10-18

Non-enveloped viruses protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. Packaging and capsid assembly in RNA viruses can involve interactions between capsid proteins and secondary structures in the viral genome as exemplified by the RNA bacteriophage MS2 and as proposed for other RNA viruses of plants, animals and human. In the picornavirus family of non-enveloped RNA viruses, the requirements for genome packaging remain poorly understood. Here we show a novel and simple approach to identify predicted RNA secondary structures involved in genome packaging in the picornavirus foot-and-mouth disease virus (FMDV). By interrogating deep sequencing data generated from both packaged and unpackaged populations of RNA we have determined multiple regions of the genome with constrained variation in the packaged population. Predicted secondary structures of these regions revealed stem loops with conservation of structure and a common motif at the loop. Disruption of these features resulted in attenuation of virus growth in cell culture due to a reduction in assembly of mature virions. This study provides evidence for the involvement of predicted RNA structures in picornavirus packaging and offers a readily transferable methodology for identifying packaging requirements in many other viruses. Importance In order to transmit their genetic material to a new host, non-enveloped viruses must protect their genomes by packaging them into an outer shell or capsid of virus-encoded proteins. For many non-enveloped RNA viruses the requirements for this critical part of the viral life cycle remain poorly understood. We have identified RNA sequences involved in genome packaging of the picornavirus foot-and-mouth disease virus. This virus causes an economically devastating disease of livestock affecting both the developed and developing world. The experimental methods developed to carry out this work are novel, simple and transferable to the

Circular viral DNA detection and junction sequence analysis from PBMC of SHIV-infected cynomolgus monkeys with undetectable virus plasma RNA

International Nuclear Information System (INIS)

Cara, Andrea; Maggiorella, Maria Teresa; Bona, Roberta; Sernicola, Leonardo; Baroncelli, Silvia; Negri, Donatella R.M.; Leone, Pasqualina; Fagrouch, Zahra; Heeney, Jonathan; Titti, Fausto; Cafaro, Aurelio; Ensoli, Barbara

2004-01-01

Extrachromosomal forms of human immunodeficiency virus (HIV)-1 can be detected in peripheral blood mononuclear cells (PBMC) from HIV-infected patients in the absence of detectable viral replication and are thought to be a sign of active but cryptic virus replication. No information, however, is available on whether these forms are also present in animal models for acquired immunodeficiency syndrome (AIDS) or on their relation to other methods of detecting virus replication. To this end, a polymerase chain reaction (PCR) approach was used to detect and analyze unintegrated circular 2-LTR-containing forms in PBMC of simian human immunodeficiency virus (SHIV)89.6P-infected cynomolgus monkeys with RNA levels ranging between 1.8 x 10^6 and less than 50 copies/ml of plasma. 2-LTR forms were detected in 96.5% of monkeys' samples above 50 copies/ml of plasma, whereas they were present in 75.8% of monkeys' samples below 50 copies/ml of plasma. Persistence of unintegrated viral DNA in monkeys with undetectable plasma RNA could indicate either stability in non-dividing cells or ongoing low levels of viral replication in dividing cells.
The sequence coding and search system: An approach for constructing and analyzing event sequences at commercial nuclear power plants

International Nuclear Information System (INIS)

Mays, G.T.

1989-04-01

The US Nuclear Regulatory Commission (NRC) has recognized the importance of the collection, assessment, and feedback of operating experience data from commercial nuclear power plants and has centralized these activities in the Office for Analysis and Evaluation of Operational Data (AEOD). Such data is essential for performing safety and reliability analyses, especially analyses of trends and patterns that identify undesirable changes in plant performance at the earliest opportunity, so that corrective measures can be implemented to preclude the occurrence of a more serious event. One of NRC's principal tools for collecting and evaluating operating experience data is the Sequence Coding and Search System (SCSS). The SCSS consists of a methodology for structuring event sequences and the requisite computer system to store and search the data. The source information for SCSS is the Licensee Event Report (LER), which is a legally required document. This paper describes the objective of SCSS, the information it contains, and the format and approach for constructing SCSS event sequences. Examples are presented demonstrating the use of SCSS to support the analysis of LER data. The SCSS contains over 30,000 LERs describing events from 1980 through the present. Insights gained from working with a complex data system from the initial developmental stage to the point of a mature operating system are highlighted.
LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.

Science.gov (United States)

Filatov, Gleb; Bauwens, Bruno; Kertész-Farkas, Attila

2018-05-07

Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Here, we present a new convolutional kernel function for protein sequences called the LZW-Kernel. It is based on code words identified with the Lempel-Ziv-Welch (LZW) universal text compressor. The LZW-Kernel is an alignment-free method, it is always symmetric, is positive, always provides 1.0 for self-similarity and it can directly be used with Support Vector Machines (SVMs) in classification problems, contrary to normalized compression distance (NCD), which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly plausible for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with BLAST scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel, and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed, three times faster than BLAST and several magnitudes faster than SW or LAK in our tests. LZW-Kernel is implemented as a standalone C code and is a free open-source program distributed under GPLv3 license and can be downloaded from https://github.com/kfattila/LZW-Kernel. akerteszfarkas@hse.ru. Supplementary data are available at Bioinformatics Online.
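To make the notion of LZW code words more concrete, the sketch below parses a sequence with a basic LZW-style dictionary build and then compares two sequences by the overlap of their dictionaries. This is not the kernel defined in the paper: the cosine-normalized set overlap, and the names `lzw_code_words` and `lzw_set_kernel`, are assumptions chosen only because they reproduce two properties mentioned in the abstract (symmetry and a self-similarity of 1.0).

```python
import math

def lzw_code_words(seq):
    """Return the phrase dictionary built by a basic LZW-style parse.

    The dictionary starts with the single symbols present in `seq` and
    grows by one phrase each time the current phrase cannot be extended.
    """
    dictionary = set(seq)  # single symbols seen in the sequence
    current = ""
    for symbol in seq:
        candidate = current + symbol
        if candidate in dictionary:
            current = candidate        # keep extending the known phrase
        else:
            dictionary.add(candidate)  # learn a new code word
            current = symbol           # restart from the current symbol
    return dictionary

def lzw_set_kernel(x, y):
    """Cosine-normalized overlap of two LZW dictionaries (illustrative only)."""
    wx, wy = lzw_code_words(x), lzw_code_words(y)
    if not wx or not wy:
        return 0.0
    return len(wx & wy) / math.sqrt(len(wx) * len(wy))

# Hypothetical toy sequences, not real protein data.
print(sorted(lzw_code_words("ABABAB")))
print(lzw_set_kernel("ABABAB", "ABABAB"))
print(lzw_set_kernel("ABABAB", "BABABA"))
```

Each sequence is parsed once and independently of the other, which is consistent with the one-pass, alignment-free character described in the abstract; the scoring function itself should be taken from the authors' published code rather than from this sketch.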
Transmission of single and multiple viral variants in primary HIV-1 subtype C infection.

Directory of Open Access Journals (Sweden)

Vladimir Novitsky

2011-02-01

To address whether sequences of viral gag and env quasispecies collected during the early post-acute period can be utilized to determine multiplicity of transmitted HIV's, recently developed approaches for analysis of viral evolution in acute HIV-1 infection [1,2] were applied. Specifically, phylogenetic reconstruction, inter- and intra-patient distribution of maximum and mean genetic distances, analysis of Poisson fitness, shape of highlighter plots, recombination analysis, and estimation of time to the most recent common ancestor (tMRCA) were utilized for resolving multiplicity of HIV-1 transmission in a set of viral quasispecies collected within 50 days post-seroconversion (p/s) in 25 HIV-infected individuals with estimated time of seroconversion. The decision on multiplicity of HIV infection was made based on the model's fit with, or failure to explain, the observed extent of viral sequence heterogeneity. The initial analysis was based on phylogeny, inter-patient distribution of maximum and mean distances, and Poisson fitness, and was able to resolve multiplicity of HIV transmission in 20 of 25 (80%) cases. Additional analysis involved distribution of individual viral distances, highlighter plots, recombination analysis, and estimation of tMRCA, and resolved 4 of the 5 remaining cases. Overall, transmission of a single viral variant was identified in 16 of 25 (64%) cases, and transmission of multiple variants was evident in 8 of 25 (32%) cases. In one case multiplicity of HIV-1 transmission could not be determined. In primary HIV-1 subtype C infection, samples collected within 50 days p/s and analyzed by a single-genome amplification/sequencing technique can provide reliable identification of transmission multiplicity in 24 of 25 (96%) cases. Observed transmission frequency of a single viral variant and multiple viral variants were within the ranges of 64% to 68%, and 32% to 36%, respectively.
Complete genome sequence of an isolate of Potato virus X (PVX) infecting Cape gooseberry (Physalis peruviana) in Colombia.

Science.gov (United States)

Gutiérrez, Pablo A; Alzate, Juan F; Montoya, Mauricio Marín

2015-06-01

Transcriptome analysis of a Cape gooseberry (Physalis peruviana) plant with leaf symptoms of a mild yellow mosaic typical of a viral disease revealed an infection with Potato virus X (PVX). The genome sequence of the PVX-Physalis isolate comprises 6435 nt and exhibits higher sequence similarity to members of the Eurasian group of PVX (~95 %) than to the American group (~77 %). Genome organization is similar to other PVX isolates with five open reading frames coding for proteins RdRp, TGBp1, TGBp2, TGBp3, and CP. 5' and 3' untranslated regions revealed all regulatory motifs typically found in PVX isolates. The PVX-Physalis genome is the only complete sequence available for a Potexvirus in Colombia and is a new addition to the restricted number of available sequences of PVX isolates infecting plant species different to potato.
Human Adenovirus Core Protein V Is Targeted by the Host SUMOylation Machinery To Limit Essential Viral Functions.

Science.gov (United States)

Freudenberger, Nora; Meyer, Tina; Groitl, Peter; Dobner, Thomas; Schreiner, Sabrina

2018-02-15

Human adenoviruses (HAdV) are nonenveloped viruses containing a linear, double-stranded DNA genome surrounded by an icosahedral capsid. To allow proper viral replication, the genome is imported through the nuclear pore complex associated with viral core proteins. Until now, the role of these incoming virion proteins during the early phase of infection was poorly understood. The core protein V is speculated to bridge the core and the surrounding capsid. It binds the genome in a sequence-independent manner and localizes in the nucleus of infected cells, accumulating at nucleoli. Here, we show that protein V contains conserved SUMO conjugation motifs (SCMs). Mutation of these consensus motifs resulted in reduced SUMOylation of the protein; thus, protein V represents a novel target of the host SUMOylation machinery. To understand the role of protein V SUMO posttranslational modification during productive HAdV infection, we generated a replication-competent HAdV with SCM mutations within the protein V coding sequence. Phenotypic analyses revealed that these SCM mutations are beneficial for adenoviral replication. Blocking protein V SUMOylation at specific sites shifts the onset of viral DNA replication to earlier time points during infection and promotes viral gene expression. Simultaneously, the altered kinetics within the viral life cycle are accompanied by more efficient proteasomal degradation of host determinants and increased virus progeny production than that observed during wild-type infection. Taken together, our studies show that protein V SUMOylation reduces virus growth; hence, protein V SUMOylation represents an important novel aspect of the host antiviral strategy to limit virus replication and thereby points to potential intervention strategies. IMPORTANCE Many decades of research have revealed that HAdV structural proteins promote viral entry and mainly physical stability of the viral genome in the capsid. Our work over the last years showed that this
Coding chaotic billiards: I-Non-Compact billiards on a negative curvature manifold

International Nuclear Information System (INIS)

Giannoni, M.J.; Ullmo, D.

1989-03-01

This paper presents a method for coding billiards. The main device is to use a proper surface of section, the bounce mapping, and foliate the reduced phase space into regions associated with a given code. The alphabet is merely the ensemble of the labels of the sides of the billiard. The procedure is applied here to non-compact polygonal billiards defined on a manifold of constant negative curvature, with all vertices at infinity. A simple grammar rule is necessary and sufficient to ensure existence and uniqueness of the coding.
Viral promoters can initiate expression of toxin genes introduced into Escherichia coli

Directory of Open Access Journals (Sweden)

Jacob Daniela

2005-06-01

Background: The expression of recombinant proteins in eukaryotic cells requires the fusion of the coding region to a promoter functional in the eukaryotic cell line. Viral promoters are very often used for this purpose. The preceding cloning procedures are usually performed in Escherichia coli, and it is therefore of interest whether the foreign promoter results in expression of the gene in bacteria. When molecules toxic for humans are to be expressed, this knowledge is indispensable for the specification of safety measures. Results: We selected five frequently used viral promoters and quantified their activity in E. coli with a reporter system. Only the promoter from the thymidine kinase gene from HSV1 showed no activity, while the polyhedrin promoter from baculovirus, the immediate early CMV promoter, the early SV40 promoter and the 5' LTR promoter from HIV-1 directed gene expression in E. coli. The determination of transcription start sites in the immediate early CMV promoter and the polyhedrin promoter confirmed the existence of bacterial -10 and -35 consensus sequences. The importance of this heterologous gene expression for safety considerations was further supported by analysing fusions between the aforementioned promoters and a promoter-less cytotoxin gene. Conclusion: According to our results, a high percentage of viral promoters have the ability to initiate gene expression in E. coli. The degree of such heterologous gene expression can be sufficient for the expression of toxin genes and must therefore be considered when defining safety measures for the handling of corresponding genetically modified organisms.
Detection of non-coding RNAs on the basis of predicted secondary structure formation free energy change

Directory of Open Access Journals (Sweden)

Uzilov Andrew V

2006-03-01

Background: Non-coding RNAs (ncRNAs) have a multitude of roles in the cell, many of which remain to be discovered. However, it is difficult to detect novel ncRNAs in biochemical screens. To advance biological knowledge, computational methods that can accurately detect ncRNAs in sequenced genomes are therefore desirable. The increasing number of genomic sequences provides a rich dataset for computational comparative sequence analysis and detection of novel ncRNAs. Results: Here, Dynalign, a program for predicting secondary structures common to two RNA sequences on the basis of minimizing folding free energy change, is utilized as a computational ncRNA detection tool. The Dynalign-computed optimal total free energy change, which scores the structural alignment and the free energy change of folding into a common structure for two RNA sequences, is shown to be an effective measure for distinguishing ncRNA from randomized sequences. To classify an input sequence pair as an ncRNA, its total free energy change can either be compared with the total free energy changes of a set of control sequence pairs, or be used in combination with sequence length and nucleotide frequencies as input to a classification support vector machine. The latter method is much faster, but slightly less sensitive at a given specificity. Additionally, the classification support vector machine method is shown to be sensitive and specific on genomic ncRNA screens of two different Escherichia coli and Salmonella typhi genome alignments, in which many ncRNAs are known. The Dynalign computational experiments are also compared with two other ncRNA detection programs, RNAz and QRNA. Conclusion: The Dynalign-based support vector machine method is more sensitive for known ncRNAs in the test genomic screens than RNAz and QRNA. Additionally, both Dynalign-based methods are more sensitive than RNAz and QRNA at low sequence pair identities. Dynalign can be used as a
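The abstract above mentions comparing the total free energy change of an input sequence pair with those of a set of control pairs. One common way to express such a comparison is a z-score, sketched below. Whether the authors use exactly this statistic is not stated in the abstract, and the function name `free_energy_z_score` and all numeric values are hypothetical.

```python
import statistics

def free_energy_z_score(observed, controls):
    """Standardize an observed free energy change against control values.

    A strongly negative score means the observed pair folds into a common
    structure with a lower (more favorable) free energy change than the
    controls, which would argue for a structured RNA.
    """
    mean = statistics.mean(controls)
    stdev = statistics.stdev(controls)  # sample standard deviation
    return (observed - mean) / stdev

# Hypothetical free energy changes (kcal/mol) for shuffled control pairs.
controls = [-40.0, -42.0, -38.0, -41.0, -39.0]
z = free_energy_z_score(-50.0, controls)
print(round(z, 3))
```

A comparison of this kind needs a separate structure prediction for every control pair, which fits the abstract's remark that the support vector machine route is much faster.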
Identification and Characterization of Bovine Viral Diarrhea Virus from Indonesian Cattle (IDENTIFIKASI DAN KARAKTERISASI VIRUS BOVINE VIRAL DIARRHEA DARI SAPI INDONESIA)

Directory of Open Access Journals (Sweden)

Muharam Saepulloh

2015-05-01

Bovine viral diarrhea virus (BVDV) causes an important viral disease and is a ubiquitous pathogen of cattle with worldwide economic importance; it is also prone to being misdiagnosed as other viruses. The goal of the current study was to identify and characterize BVDV by reverse transcriptase polymerase chain reaction (RT-PCR) followed by genome sequence analysis. Blood, feces, and semen samples were collected from 588 selected cattle suffering from diarrhea and respiratory manifestations. RT-PCR results showed that 69 (11.74%) samples were positive for BVDV. Further molecular characterization was conducted with only 17 PCR-positive samples. The results indicated that the 17 Indonesian BVDV isolates belonged to genotype 1 of BVDV (BVDV-1), based on sequence analysis and the phylogenetic relationship between Indonesian BVDV isolates and BVDV isolates worldwide. This finding is the first report of BVDV-1 circulating in Indonesian cattle.
Fibroscore for the non-invasive assessment of liver fibrosis in chronic viral hepatitis

International Nuclear Information System (INIS)

Ashraf, S.; Ahmed, S.A.

2012-01-01

Objective: To evaluate the predictive value of a set of laboratory markers for the assessment of liver fibrosis in chronic viral hepatitis patients. Study Design: Cross-sectional study. Place and Duration of Study: Baqai Medical University, Combined Military Hospital, Malir, Karachi, from November 2006 to May 2008. Methodology: Twenty laboratory parameters were measured in 100 treatment-naive chronic viral hepatitis patients who also had liver biopsy performed. Descriptive statistics, areas under the ROC curves, and multivariate logistic regression analysis identified a fibrosis panel, a set of the five most useful markers, for the assessment of stages of fibrosis, stage 0 to stage 4. The fibrosis index, FibroScore, consisted of bilirubin, gamma glutamyl transferase, hyaluronic acid, alpha 2 macroglobulin, and platelet evaluation. Results: A score of > 0.5 predicted stages 2, 3 and 4, with a sensitivity of 82%, and specificity of 92%. A score > 0.5 for stages 3 and 4 had a sensitivity of 85%, and specificity of 89%. At a score of > 0.80, for stages 3 and 4, the sensitivity was 70%, specificity was 97%, and PPV 87% (there was > 85% possibility of presence of stage 3 or 4). A score of < 0.20 predicted the absence of stages 2, 3, and 4 with a sensitivity of 91%, specificity of 86%, and NPV of 96%. Scores from 0.00 to 0.10 almost certainly ruled out the presence of stages 2-4 (NPV=98%). The areas under the ROC curve were: 0.808 for stage 2; 0.938 for stage 3; and 0.959 for stage 4. Conclusion: A combination of 5 markers is very useful in predicting various stages of liver fibrosis, and is helpful in the non-invasive assessment of liver fibrosis in chronic viral hepatitis patients. (author)
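For readers less familiar with the measures quoted above, the sketch below shows how sensitivity, specificity, PPV, and NPV follow from a 2 x 2 table of test results against biopsy findings. The function name `diagnostic_metrics` and the counts are hypothetical and are not taken from the study.

```python
def diagnostic_metrics(tp, fp, tn, fn):
    """Compute standard diagnostic accuracy measures from a 2 x 2 table.

    tp/fn: diseased patients with a positive/negative test result.
    tn/fp: non-diseased patients with a negative/positive test result.
    """
    return {
        "sensitivity": tp / (tp + fn),  # diseased patients correctly flagged
        "specificity": tn / (tn + fp),  # non-diseased patients correctly cleared
        "ppv": tp / (tp + fp),          # positive results that are true
        "npv": tn / (tn + fn),          # negative results that are true
    }

# Hypothetical counts for a score cut-off in a cohort of 100 patients.
metrics = diagnostic_metrics(tp=40, fp=5, tn=45, fn=10)
for name, value in metrics.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity depend only on the test and the cut-off, whereas PPV and NPV also depend on how common the fibrosis stage is in the cohort, so the predictive values reported for one population may not carry over to another.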
Screening for viral extraneous agents in live-attenuated avian vaccines by using a microbial microarray and sequencing

DEFF Research Database (Denmark)

Olesen, Majken Lindholm; Jørgensen, Lotte Leick; Blixenkrone-Møller, Merete

2018-01-01

The absence of extraneous agents (EA) in the raw material used for production and in finished products is one of the principal safety elements related to all medicinal products of biological origin, such as live-attenuated vaccines. The aim of this study was to investigate the applicability of the Lawrence Livermore Microbial detection array version 2 (LLMDAv2) combined with whole genome amplification and sequencing for screening for viral EAs in live-attenuated vaccines and specific pathogen-free (SPF) eggs. We detected positive microarray signals for avian endogenous retrovirus EAV-HP and several viruses belonging to the Alpharetrovirus genus in all analyzed vaccines and SPF eggs. We used a microarray probe mapping approach to evaluate the presence of intact retroviral genomes, which in addition to PCR analysis revealed that several of the positive microarray signals were most likely due to cross...
Analysis of host genetic diversity and viral entry as sources of between-host variation in viral load

Science.gov (United States)

Wargo, Andrew R.; Kell, Alison M.; Scott, Robert J.; Thorgaard, Gary H.; Kurath, Gael

2012-01-01

Little is known about the factors that drive the high levels of between-host variation in pathogen burden that are frequently observed in viral infections. Here, two factors thought to impact viral load variability, host genetic diversity and stochastic processes linked with viral entry into the host, were examined. This work was conducted with the aquatic vertebrate virus, Infectious hematopoietic necrosis virus (IHNV), in its natural host, rainbow trout. It was found that in controlled in vivo infections of IHNV, a suggestive trend of reduced between-fish viral load variation was observed in a clonal population of isogenic trout compared to a genetically diverse population of out-bred trout. However, this trend was not statistically significant for any of the four viral genotypes examined, and high levels of fish-to-fish variation persisted even in the isogenic trout population. A decrease in fish-to-fish viral load variation was also observed in virus injection challenges that bypassed the host entry step, compared to fish exposed to the virus through the natural water-borne immersion route of infection. This trend was significant for three of the four virus genotypes examined and suggests host entry may play a role in viral load variability. However, high levels of viral load variation also remained in the injection challenges. Together, these results indicate that although host genetic diversity and viral entry may play some role in between-fish viral load variation, they are not major factors. Other biological and non-biological parameters that may influence viral load variation are discussed.
Multiple viral infections in Agaricus bisporus - Characterisation of 18 unique RNA viruses and 8 ORFans identified by deep sequencing

OpenAIRE

Deakin, Gregory; Dobbs, Edward; Bennett, Julie M.; Jones, Ian M.; Grogan, Helen M.; Burton, Kerry S.

2017-01-01

Thirty unique non-host RNAs were sequenced in the cultivated fungus, Agaricus bisporus, comprising 18 viruses each encoding an RdRp domain with an additional 8 ORFans (non-host RNAs with no similarity to known sequences). Two viruses were multipartite with component RNAs showing correlative abundances and common 3′ motifs. The viruses, all positive sense single-stranded, were classified into diverse orders/families. Multiple infections of Agaricus may represent a diverse, dynamic and interact...
Facial Expression Recognition via Non-Negative Least-Squares Sparse Coding

Directory of Open Access Journals (Sweden)

Ying Chen

2014-05-01

Sparse coding is an active research subject in signal processing, computer vision, and pattern recognition. A novel method of facial expression recognition via non-negative least squares (NNLS) sparse coding is presented in this paper. The NNLS sparse coding is used to form a facial expression classifier. To test the performance of the presented method, local binary patterns (LBP) and raw pixels are extracted for facial feature representation. Facial expression recognition experiments are conducted on the Japanese Female Facial Expression (JAFFE) database. Experimental results indicate that the presented NNLS method outperforms other widely used methods, such as linear support vector machines (SVM), the sparse representation-based classifier (SRC), the nearest subspace classifier (NSC), K-nearest neighbor (KNN), and radial basis function neural networks (RBFNN), on facial expression recognition tasks.
Spectral entropy criteria for structural segmentation in genomic DNA sequences

International Nuclear Information System (INIS)

Chechetkin, V.R.; Lobzin, V.V.

2004-01-01

The spectral entropy is calculated with Fourier structure factors and characterizes the level of structural ordering in a sequence of symbols. It may efficiently be applied to the assessment and reconstruction of the modular structure in genomic DNA sequences. We present the relevant spectral entropy criteria for local and non-local structural segmentation in DNA sequences. The results are illustrated with model examples and an analysis of intervening exon-intron segments in protein-coding regions.
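As a rough illustration of the idea in the record above, the sketch below computes a Shannon entropy over the normalized Fourier power spectrum of a DNA string. The function name `spectral_entropy`, the choice to sum power over the four nucleotide indicator sequences, the restriction to harmonics 1..N/2, and the normalization are all assumptions made for this example; they are not taken from the paper, whose exact definitions and criteria are not given in the abstract.

```python
import cmath
import math


def spectral_entropy(seq, alphabet="ACGT"):
    """Shannon entropy of the normalized Fourier power spectrum of `seq`.

    Hypothetical helper for illustration only; not the authors' definition.
    """
    n = len(seq)
    power = []
    # Harmonics 1..n//2; the zero harmonic only reflects base composition.
    for k in range(1, n // 2 + 1):
        total = 0.0
        for symbol in alphabet:
            # Fourier amplitude of the 0/1 indicator sequence of this symbol.
            amp = sum(
                cmath.exp(-2j * math.pi * k * pos / n)
                for pos, s in enumerate(seq)
                if s == symbol
            )
            total += abs(amp) ** 2
        power.append(total)
    norm = sum(power)
    if norm <= 1e-9:
        # No measurable spectral power (e.g. a homopolymer): treat as fully ordered.
        return 0.0
    probs = [p / norm for p in power if p > 0]
    return -sum(p * math.log(p) for p in probs)


periodic = "ACGT" * 8                             # strongly ordered, period 4
irregular = "AACGTTTGCACGGATCCATAGGCTTAACGTGC"    # made-up irregular sequence
print(spectral_entropy(periodic), spectral_entropy(irregular))
```

Under these assumptions, a strictly periodic sequence concentrates its power in a few harmonics and yields a low entropy, while an irregular sequence spreads power across many harmonics and yields a higher value. Segmentation would then compare such values across windows, which this sketch does not attempt.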
nocoRNAc: Characterization of non-coding RNAs in prokaryotes

Directory of Open Access Journals (Sweden)

Nieselt Kay

2011-01-01

Background: Interest in non-coding RNAs (ncRNAs) has risen steadily over the past few years because of the wide spectrum of biological processes in which they are involved. This has led to the discovery of numerous ncRNA genes across many species. However, for most organisms the non-coding transcriptome remains largely unexplored. Various experimental techniques for the identification of ncRNA transcripts are available, but as these methods are costly and time-consuming, there is a need for computational methods that detect functional RNAs in complete genomes in order to suggest elements for further experiments. Several programs for the genome-wide prediction of functional RNAs have been developed, but most of them predict a genomic locus with no indication of whether the element is transcribed. Results: We present nocoRNAc, a program for the genome-wide prediction of ncRNA transcripts in bacteria. nocoRNAc incorporates various procedures for the detection of transcriptional features, which are then integrated with functional ncRNA loci to determine the transcript coordinates. We applied RNAz and nocoRNAc to the genome of Streptomyces coelicolor and detected more than 800 putative ncRNA transcripts, most of them located antisense to protein-coding regions. Using a custom-designed microarray, we profiled the expression of about 400 of these elements and found more than 300 to be transcribed; 38 of them are predicted novel ncRNA genes in intergenic regions. The expression patterns of many ncRNAs are as complex as those of the protein-coding genes; in particular, many antisense ncRNAs show a high expression correlation with their protein-coding partner. Conclusions: We have developed nocoRNAc, a framework that facilitates the automated characterization of functional ncRNAs. nocoRNAc increases the confidence of predicted ncRNA loci, especially if they contain transcribed ncRNAs. nocoRNAc is not restricted to
mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences.

Science.gov (United States)

Links, Matthew G; Chaban, Bonnie; Hemmingsen, Sean M; Muirhead, Kevin; Hill, Janet E

2013-08-15

Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly.
Chitosan-Graft-Polyethylenimine/DNA Nanoparticles as Novel Non-Viral Gene Delivery Vectors Targeting Osteoarthritis

Science.gov (United States)

Lv, Lulu; Zhao, Huiqing

2014-01-01

The development of safe and efficient gene carriers is the key to the clinical success of gene therapy. The present study was designed to develop and evaluate chitosan-graft-polyethylenimine (CP)/DNA nanoparticles as novel non-viral gene vectors for gene therapy of osteoarthritis. The CP/DNA nanoparticles were produced through complex coacervation of the cationic polymers with pEGFP after grafting chitosan (CS) with a low molecular weight (Mw) PEI (Mw = 1.8 kDa). Particle size and zeta potential were related to the weight ratio of CP:DNA: nanoparticle size decreased and surface charge increased as CP content increased. The buffering capacity of CP was significantly greater than that of CS. The transfection efficiency of CP/DNA nanoparticles was similar to that of Lipofectamine™ 2000 and significantly higher than that of CS/DNA and PEI (25 kDa)/DNA nanoparticles. The transfection efficiency of the CP/DNA nanoparticles was dependent on the weight ratio of CP:DNA (w/w). The average cell viability after treatment with CP/DNA nanoparticles was over 90% in both chondrocytes and synoviocytes, which was much higher than that of PEI (25 kDa)/DNA nanoparticles. The CP copolymers efficiently carried the pDNA into chondrocytes and synoviocytes, and the pDNA was detected entering the nucleus. These results suggest that CP/DNA nanoparticles, with improved transfection efficiency and low cytotoxicity, might be a safe and efficient non-viral vector for gene delivery to both chondrocytes and synoviocytes. PMID:24392152
Non-viral bone morphogenetic protein 2 transfection of rat dental pulp stem cells using calcium phosphate nanoparticles as carriers.

NARCIS (Netherlands)

Yang, X.; Walboomers, X.F.; Dolder, J. van den; Yang, F.; Bian, Z.; Fan, M.; Jansen, J.A.

2008-01-01

Calcium phosphate nanoparticles have shown potential as non-viral vectors for gene delivery. The aim of this study was to induce bone morphogenetic protein (Bmp)2 transfection in rat dental pulp stem cells using calcium phosphate nanoparticles as a gene vector and then to evaluate the efficiency and