WorldWideScience

Sample records for mrna coding sequence

  1. PATACSDB—the database of polyA translational attenuators in coding sequences

    Directory of Open Access Journals (Sweden)

    Malgorzata Habich

    2016-02-01

    Full Text Available Recent additions to the repertoire of gene expression regulatory mechanisms are polyadenylate (polyA tracks encoding for poly-lysine runs in protein sequences. Such tracks stall the translation apparatus and induce frameshifting independently of the effects of charged nascent poly-lysine sequence on the ribosome exit channel. As such, they substantially influence the stability of mRNA and the amount of protein produced from a given transcript. Single base changes in these regions are enough to exert a measurable response on both protein and mRNA abundance; this makes each of these sequences a potentially interesting case study for the effects of synonymous mutation, gene dosage balance and natural frameshifting. Here we present PATACSDB, a resource that contain a comprehensive list of polyA tracks from over 250 eukaryotic genomes. Our data is based on the Ensembl genomic database of coding sequences and filtered with algorithm of 12A-1 which selects sequences of polyA tracks with a minimal length of 12 A’s allowing for one mismatched base. The PATACSDB database is accessible at: http://sysbio.ibb.waw.pl/patacsdb. The source code is available at http://github.com/habich/PATACSDB, and it includes the scripts with which the database can be recreated.

  2. Full-length mRNA sequencing uncovers a widespread coupling between transcription initiation and mRNA processing.

    Science.gov (United States)

    Anvar, Seyed Yahya; Allard, Guy; Tseng, Elizabeth; Sheynkman, Gloria M; de Klerk, Eleonora; Vermaat, Martijn; Yin, Raymund H; Johansson, Hans E; Ariyurek, Yavuz; den Dunnen, Johan T; Turner, Stephen W; 't Hoen, Peter A C

    2018-03-29

    The multifaceted control of gene expression requires tight coordination of regulatory mechanisms at transcriptional and post-transcriptional level. Here, we studied the interdependence of transcription initiation, splicing and polyadenylation events on single mRNA molecules by full-length mRNA sequencing. In MCF-7 breast cancer cells, we find 2700 genes with interdependent alternative transcription initiation, splicing and polyadenylation events, both in proximal and distant parts of mRNA molecules, including examples of coupling between transcription start sites and polyadenylation sites. The analysis of three human primary tissues (brain, heart and liver) reveals similar patterns of interdependency between transcription initiation and mRNA processing events. We predict thousands of novel open reading frames from full-length mRNA sequences and obtained evidence for their translation by shotgun proteomics. The mapping database rescues 358 previously unassigned peptides and improves the assignment of others. By recognizing sample-specific amino-acid changes and novel splicing patterns, full-length mRNA sequencing improves proteogenomics analysis of MCF-7 cells. Our findings demonstrate that our understanding of transcriptome complexity is far from complete and provides a basis to reveal largely unresolved mechanisms that coordinate transcription initiation and mRNA processing.

  3. Protein Structure and the Sequential Structure of mRNA

    DEFF Research Database (Denmark)

    Brunak, Søren; Engelbrecht, Jacob

    1996-01-01

    entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment, By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets, These signals do not originate from......A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed, We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting...... protein, The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain, A complete search for GenBank nucleotide sequences coding for structural...

  4. Exploratory Bioinformatics Study of lncRNAs in Alzheimer’s Disease mRNA Sequences with Application to Drug Development

    Directory of Open Access Journals (Sweden)

    T. Holden

    2013-01-01

    Full Text Available Long noncoding RNA (lncRNA within mRNA sequences of Alzheimer’s disease genes, namely, APP, APOE, PSEN1, and PSEN2, has been analyzed using fractal dimension (FD computation and correlation analysis. We examined lncRNA by comparing mRNA FD to corresponding coding DNA sequences (CDSs FD. APP, APOE, and PSEN1 CDSs select slightly higher FDs compared to the mRNA, while PSEN2 CDSs FDs are lower. The correlation coefficient for these sequences is 0.969. A comparative study of differentially expressed MAPK signaling pathway lncRNAs in pancreatic cancer cells shows a correlation of 0.771. Selection of higher FD CDSs could indicate interaction of Alzheimer’s gene products APP, APOE, and PSEN1. Including hypocretin sequences (where all CDSs have higher fractal dimensions than mRNA in the APP, APOE, and PSEN1 sequence analyses improves correlation, but the inclusion of erythropoietin (where all CDSs have higher FD than mRNA would suppress correlation, suggesting that HCRT, a hypothalamus neurotransmitter related to the wake/sleep cycle, might be better when compared to EPO, a glycoprotein hormone, for targeting Alzheimer’s disease drug development. Fractal dimension and entropy correlation have provided supporting evidence, consistent with evolutionary studies, for using a zebrafish model together with a mouse model, in HCRT drug development.

  5. Human apolipoprotein B (apoB) mRNA: Identification of two distinct apoB mRNAs, an mRNA with the apoB-100 sequence and an apoB mRNA containing a premature in-frame translational stop codon, in both liver and intestine

    International Nuclear Information System (INIS)

    Higuchi, K.; Hospattankar, A.V.; Law, S.W.; Meglin, N.; Cortright, J.; Brewer, H.B. Jr.

    1988-01-01

    Human apolipoprotein B (apoB) is present in plasma as two separate isoproteins, designated apoB-100 (512 kDa) and apoB-48 (250 kDa). ApoB is encoded by a single gene on chromosome 2, and a single nuclear mRNA is edited and processed into two separate apoB mRNAs. A 14.1-kilobase apoB mRNA codes for apoB-100, and the second mRNA, which codes for apoB-48, contains a premature stop codon generated by a single base substitution of cytosine to uracil at nucleotide 6,538, which converts the translated CAA codon coding for the amino acid glutamine at residue 2,153 in apoB-100 to a premature in-frame stop codon (UAA). Two 30-base synthetic oligonucleotides, designated apoB-Stop and apoB-Gln, were synthesized containing the complementary sequence to the stop codon (UAA) and glutamine codon (CAA), respectively. The combined results from these studies establish that both human intestine and liver contain the two distinct apoB mRNAs, an mRNA that codes for apoB-100 and an apoB mRNA that contains the premature stop codon, which codes for apoB-48. The premature in-frame stop codon is not tissue specific and is present in both human liver and intestine

  6. Natural selection and algorithmic design of mRNA.

    Science.gov (United States)

    Cohen, Barry; Skiena, Steven

    2003-01-01

    Messenger RNA (mRNA) sequences serve as templates for proteins according to the triplet code, in which each of the 4(3) = 64 different codons (sequences of three consecutive nucleotide bases) in RNA either terminate transcription or map to one of the 20 different amino acids (or residues) which build up proteins. Because there are more codons than residues, there is inherent redundancy in the coding. Certain residues (e.g., tryptophan) have only a single corresponding codon, while other residues (e.g., arginine) have as many as six corresponding codons. This freedom implies that the number of possible RNA sequences coding for a given protein grows exponentially in the length of the protein. Thus nature has wide latitude to select among mRNA sequences which are informationally equivalent, but structurally and energetically divergent. In this paper, we explore how nature takes advantage of this freedom and how to algorithmically design structures more energetically favorable than have been built through natural selection. In particular: (1) Natural Selection--we perform the first large-scale computational experiment comparing the stability of mRNA sequences from a variety of organisms to random synonymous sequences which respect the codon preferences of the organism. This experiment was conducted on over 27,000 sequences from 34 microbial species with 36 genomic structures. We provide evidence that in all genomic structures highly stable sequences are disproportionately abundant, and in 19 of 36 cases highly unstable sequences are disproportionately abundant. This suggests that the stability of mRNA sequences is subject to natural selection. (2) Artificial Selection--motivated by these biological results, we examine the algorithmic problem of designing the most stable and unstable mRNA sequences which code for a target protein. We give a polynomial-time dynamic programming solution to the most stable sequence problem (MSSP), which is asymptotically no more complex

  7. Sequence, 'subtle' alternative splicing and expression of the CYYR1 (cysteine/tyrosine-rich 1) mRNA in human neuroendocrine tumors

    International Nuclear Information System (INIS)

    Vitale, Lorenza; Coppola, Domenico; Strippoli, Pierluigi; Frabetti, Flavia; Huntsman, Shane A; Canaider, Silvia; Casadei, Raffaella; Lenzi, Luca; Facchin, Federica; Carinci, Paolo; Zannotti, Maria

    2007-01-01

    CYYR1 is a recently identified gene located on human chromosome 21 whose product has no similarity to any known protein and is of unknown function. Analysis of expressed sequence tags (ESTs) have revealed high human CYYR1 expression in cells belonging to the diffuse neuroendocrine system (DNES). These cells may be the origin of neuroendocrine (NE) tumors. The aim of this study was to conduct an initial analysis of sequence, splicing and expression of the CYYR1 mRNA in human NE tumors. The CYYR1 mRNA coding sequence (CDS) was studied in 32 NE tumors by RT-PCR and sequence analysis. A subtle alternative splicing was identified generating two isoforms of CYYR1 mRNA differing in terms of the absence (CAG - isoform, the first described mRNA for CYYR1 locus) or the presence (CAG + isoform) of a CAG codon. When present, this specific codon determines the presence of an alanine residue, at the exon 3/exon 4 junction of the CYYR1 mRNA. The two mRNA isoform amounts were determined by quantitative relative RT-PCR in 29 NE tumors, 2 non-neuroendocrine tumors and 10 normal tissues. A bioinformatic analysis was performed to search for the existence of the two CYYR1 isoforms in other species. The CYYR1 CDS did not show differences compared to the reference sequence in any of the samples, with the exception of an NE tumor arising in the neck region. Sequence analysis of this tumor identified a change in the CDS 333 position (T instead of C), leading to the amino acid mutation P111S. NE tumor samples showed no significant difference in either CYYR1 CAG - or CAG + isoform expression compared to control tissues. CYYR1 CAG - isoform was significantly more expressed than CAG + isoform in NE tumors as well as in control samples investigated. Bioinformatic analysis revealed that only the genomic sequence of Pan troglodytes CYYR1 is consistent with the possible existence of the two described mRNA isoforms. A new 'subtle' splicing isoform (CAG + ) of CYYR1 mRNA, the sequence and

  8. Analysis and prediction of translation rate based on sequence and functional features of the mRNA.

    Directory of Open Access Journals (Sweden)

    Tao Huang

    Full Text Available Protein concentrations depend not only on the mRNA level, but also on the translation rate and the degradation rate. Prediction of mRNA's translation rate would provide valuable information for in-depth understanding of the translation mechanism and dynamic proteome. In this study, we developed a new computational model to predict the translation rate, featured by (1 integrating various sequence-derived and functional features, (2 applying the maximum relevance & minimum redundancy method and incremental feature selection to select features to optimize the prediction model, and (3 being able to predict the translation rate of RNA into high or low translation rate category. The prediction accuracies under rich and starvation condition were 68.8% and 70.0%, respectively, evaluated by jackknife cross-validation. It was found that the following features were correlated with translation rate: codon usage frequency, some gene ontology enrichment scores, number of RNA binding proteins known to bind its mRNA product, coding sequence length, protein abundance and 5'UTR free energy. These findings might provide useful information for understanding the mechanisms of translation and dynamic proteome. Our translation rate prediction model might become a high throughput tool for annotating the translation rate of mRNAs in large-scale.

  9. Discovery of Proteomic Code with mRNA Assisted Protein Folding

    Directory of Open Access Journals (Sweden)

    Jan C. Biro

    2008-12-01

    Full Text Available The 3x redundancy of the Genetic Code is usually explained as a necessity to increase the mutation-resistance of the genetic information. However recent bioinformatical observations indicate that the redundant Genetic Code contains more biological information than previously known and which is additional to the 64/20 definition of amino acids. It might define the physico-chemical and structural properties of amino acids, the codon boundaries, the amino acid co-locations (interactions in the coded proteins and the free folding energy of mRNAs. This additional information, which seems to be necessary to determine the 3D structure of coding nucleic acids as well as the coded proteins, is known as the Proteomic Code and mRNA Assisted Protein Folding.

  10. Relationship between mRNA secondary structure and sequence variability in Chloroplast genes: possible life history implications.

    Science.gov (United States)

    Krishnan, Neeraja M; Seligmann, Hervé; Rao, Basuthkar J

    2008-01-28

    Synonymous sites are freer to vary because of redundancy in genetic code. Messenger RNA secondary structure restricts this freedom, as revealed by previous findings in mitochondrial genes that mutations at third codon position nucleotides in helices are more selected against than those in loops. This motivated us to explore the constraints imposed by mRNA secondary structure on evolutionary variability at all codon positions in general, in chloroplast systems. We found that the evolutionary variability and intrinsic secondary structure stability of these sequences share an inverse relationship. Simulations of most likely single nucleotide evolution in Psilotum nudum and Nephroselmis olivacea mRNAs, indicate that helix-forming propensities of mutated mRNAs are greater than those of the natural mRNAs for short sequences and vice-versa for long sequences. Moreover, helix-forming propensity estimated by the percentage of total mRNA in helices increases gradually with mRNA length, saturating beyond 1000 nucleotides. Protection levels of functionally important sites vary across plants and proteins: r-strategists minimize mutation costs in large genes; K-strategists do the opposite. Mrna length presumably predisposes shorter mRNAs to evolve under different constraints than longer mRNAs. The positive correlation between secondary structure protection and functional importance of sites suggests that some sites might be conserved due to packing-protection constraints at the nucleic acid level in addition to protein level constraints. Consequently, nucleic acid secondary structure a priori biases mutations. The converse (exposure of conserved sites) apparently occurs in a smaller number of cases, indicating a different evolutionary adaptive strategy in these plants. The differences between the protection levels of functionally important sites for r- and K-strategists reflect their respective molecular adaptive strategies. These converge with increasing domestication levels of

  11. MicroRNA-200c modulates the expression of MUC4 and MUC16 by directly targeting their coding sequences in human pancreatic cancer.

    Directory of Open Access Journals (Sweden)

    Prakash Radhakrishnan

    Full Text Available Transmembrane mucins, MUC4 and MUC16 are associated with tumor progression and metastatic potential in human pancreatic adenocarcinoma. We discovered that miR-200c interacts with specific sequences within the coding sequence of MUC4 and MUC16 mRNAs, and evaluated the regulatory nature of this association. Pancreatic cancer cell lines S2.028 and T3M-4 transfected with miR-200c showed a 4.18 and 8.50 fold down regulation of MUC4 mRNA, and 4.68 and 4.82 fold down regulation of MUC16 mRNA compared to mock-transfected cells, respectively. A significant reduction of glycoprotein expression was also observed. These results indicate that miR-200c overexpression regulates MUC4 and MUC16 mucins in pancreatic cancer cells by directly targeting the mRNA coding sequence of each, resulting in reduced levels of MUC4 and MUC16 mRNA and protein. These data suggest that, in addition to regulating proteins that modulate EMT, miR-200c influences expression of cell surface mucins in pancreatic cancer.

  12. Tau mRNA 3'UTR-to-CDS ratio is increased in Alzheimer disease.

    Science.gov (United States)

    García-Escudero, Vega; Gargini, Ricardo; Martín-Maestro, Patricia; García, Esther; García-Escudero, Ramón; Avila, Jesús

    2017-08-10

    Neurons frequently show an imbalance in expression of the 3' untranslated region (3'UTR) relative to the coding DNA sequence (CDS) region of mature messenger RNAs (mRNA). The ratio varies among different cells or parts of the brain. The Map2 protein levels per cell depend on the 3'UTR-to-CDS ratio rather than the total mRNA amount, which suggests powerful regulation of protein expression by 3'UTR sequences. Here we found that MAPT (the microtubule-associated protein tau gene) 3'UTR levels are particularly high with respect to other genes; indeed, the 3'UTR-to-CDS ratio of MAPT is balanced in healthy brain in mouse and human. The tau protein accumulates in Alzheimer diseased brain. We nonetheless observed that the levels of RNA encoding MAPT/tau were diminished in these patients' brains. To explain this apparently contradictory result, we studied MAPT mRNA stoichiometry in coding and non-coding regions, and found that the 3'UTR-to-CDS ratio was higher in the hippocampus of Alzheimer disease patients, with higher tau protein but lower total mRNA levels. Our data indicate that changes in the 3'UTR-to-CDS ratio have a regulatory role in the disease. Future research should thus consider not only mRNA levels, but also the ratios between coding and non-coding regions. Copyright © 2017 Elsevier B.V. All rights reserved.

  13. Protein functional features are reflected in the patterns of mRNA translation speed.

    Science.gov (United States)

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.

  14. Evidence for a Complex Class of Nonadenylated mRNA in Drosophila

    Science.gov (United States)

    Zimmerman, J. Lynn; Fouts, David L.; Manning, Jerry E.

    1980-01-01

    The amount, by mass, of poly(A+) mRNA present in the polyribosomes of third-instar larvae of Drosophila melanogaster, and the relative contribution of the poly(A+) mRNA to the sequence complexity of total polysomal RNA, has been determined. Selective removal of poly(A+) mRNA from total polysomal RNA by use of either oligo-dT-cellulose, or poly(U)-sepharose affinity chromatography, revealed that only 0.15% of the mass of the polysomal RNA was present as poly(A+) mRNA. The present study shows that this RNA hybridized at saturation with 3.3% of the single-copy DNA in the Drosophila genome. After correction for asymmetric transcription and reactability of the DNA, 7.4% of the single-copy DNA in the Drosophila genome is represented in larval poly(A+) mRNA. This corresponds to 6.73 x 106 nucleotides of mRNA coding sequences, or approximately 5,384 diverse RNA sequences of average size 1,250 nucleotides. However, total polysomal RNA hybridizes at saturation to 10.9% of the single-copy DNA sequences. After correcting this value for asymmetric transcription and tracer DNA reactability, 24% of the single-copy DNA in Drosophila is represented in total polysomal RNA. This corresponds to 2.18 x 107 nucleotides of RNA coding sequences or 17,440 diverse RNA molecules of size 1,250 nucleotides. This value is 3.2 times greater than that observed for poly(A+) mRNA, and indicates that ≃69% of the polysomal RNA sequence complexity is contributed by nonadenylated RNA. Furthermore, if the number of different structural genes represented in total polysomal RNA is ≃1.7 x 104, then the number of genes expressed in third-instar larvae exceeds the number of chromomeres in Drosophila by about a factor of three. This numerology indicates that the number of chromomeres observed in polytene chromosomes does not reflect the number of structural gene sequences in the Drosophila genome. PMID:6777246

  15. Single-cell mRNA cytometry via sequence-specific nanoparticle clustering and trapping

    Science.gov (United States)

    Labib, Mahmoud; Mohamadi, Reza M.; Poudineh, Mahla; Ahmed, Sharif U.; Ivanov, Ivaylo; Huang, Ching-Lung; Moosavi, Maral; Sargent, Edward H.; Kelley, Shana O.

    2018-05-01

    Cell-to-cell variation in gene expression creates a need for techniques that can characterize expression at the level of individual cells. This is particularly true for rare circulating tumour cells, in which subtyping and drug resistance are of intense interest. Here we describe a method for cell analysis—single-cell mRNA cytometry—that enables the isolation of rare cells from whole blood as a function of target mRNA sequences. This approach uses two classes of magnetic particles that are labelled to selectively hybridize with different regions of the target mRNA. Hybridization leads to the formation of large magnetic clusters that remain localized within the cells of interest, thereby enabling the cells to be magnetically separated. Targeting specific intracellular mRNAs enablescirculating tumour cells to be distinguished from normal haematopoietic cells. No polymerase chain reaction amplification is required to determine RNA expression levels and genotype at the single-cell level, and minimal cell manipulation is required. To demonstrate this approach we use single-cell mRNA cytometry to detect clinically important sequences in prostate cancer specimens.

  16. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  17. Sequences within the 5' untranslated region regulate the levels of a kinetoplast DNA topoisomerase mRNA during the cell cycle.

    Science.gov (United States)

    Pasion, S G; Hines, J C; Ou, X; Mahmood, R; Ray, D S

    1996-12-01

    Gene expression in trypanosomatids appears to be regulated largely at the posttranscriptional level and involves maturation of mRNA precursors by trans splicing of a 39-nucleotide miniexon sequence to the 5' end of the mRNA and cleavage and polyadenylation at the 3' end of the mRNA. To initiate the identification of sequences involved in the periodic expression of DNA replication genes in trypanosomatids, we have mapped splice acceptor sites in the 5' flanking region of the TOP2 gene, which encodes the kinetoplast DNA topoisomerase, and have carried out deletion analysis of this region on a plasmid-encoded TOP2 gene. Block deletions within the 5' untranslated region (UTR) identified two regions (-608 to -388 and -387 to -186) responsible for periodic accumulation of the mRNA. Deletion of one or the other of these sequences had no effect on periodic expression of the mRNA, while deletion of both regions resulted in constitutive expression of the mRNA throughout the cell cycle. Subcloning of these sequences into the 5' UTR of a construct lacking both regions of the TOP2 5' UTR has shown that an octamer consensus sequence present in the 5' UTR of the TOP2, RPA1, and DHFR-TS mRNAs is required for normal cycling of the TOP2 mRNA. Mutation of the consensus octamer sequence in the TOP2 5' UTR in a plasmid construct containing only a single consensus octamer and that shows normal cycling of the plasmid-encoded TOP2 mRNA resulted in substantial reduction of the cycling of the mRNA level. These results imply a negative regulation of TOP2 mRNA during the cell cycle by a mechanism involving redundant elements containing one or more copies of a conserved octamer sequence within the 5' UTR of TOP2 mRNA.

  18. Purifying selection acts on coding and non-coding sequences of paralogous genes in Arabidopsis thaliana.

    Science.gov (United States)

    Hoffmann, Robert D; Palmgren, Michael

    2016-06-13

    Whole-genome duplications in the ancestors of many diverse species provided the genetic material for evolutionary novelty. Several models explain the retention of paralogous genes. However, how these models are reflected in the evolution of coding and non-coding sequences of paralogous genes is unknown. Here, we analyzed the coding and non-coding sequences of paralogous genes in Arabidopsis thaliana and compared these sequences with those of orthologous genes in Arabidopsis lyrata. Paralogs with lower expression than their duplicate had more nonsynonymous substitutions, were more likely to fractionate, and exhibited less similar expression patterns with their orthologs in the other species. Also, lower-expressed genes had greater tissue specificity. Orthologous conserved non-coding sequences in the promoters, introns, and 3' untranslated regions were less abundant at lower-expressed genes compared to their higher-expressed paralogs. A gene ontology (GO) term enrichment analysis showed that paralogs with similar expression levels were enriched in GO terms related to ribosomes, whereas paralogs with different expression levels were enriched in terms associated with stress responses. Loss of conserved non-coding sequences in one gene of a paralogous gene pair correlates with reduced expression levels that are more tissue specific. Together with increased mutation rates in the coding sequences, this suggests that similar forces of purifying selection act on coding and non-coding sequences. We propose that coding and non-coding sequences evolve concurrently following gene duplication.

  19. Sequence-engineered mRNA Without Chemical Nucleoside Modifications Enables an Effective Protein Therapy in Large Animals

    Science.gov (United States)

    Thess, Andreas; Grund, Stefanie; Mui, Barbara L; Hope, Michael J; Baumhof, Patrick; Fotin-Mleczek, Mariola; Schlake, Thomas

    2015-01-01

    Being a transient carrier of genetic information, mRNA could be a versatile, flexible, and safe means for protein therapies. While recent findings highlight the enormous therapeutic potential of mRNA, evidence that mRNA-based protein therapies are feasible beyond small animals such as mice is still lacking. Previous studies imply that mRNA therapeutics require chemical nucleoside modifications to obtain sufficient protein expression and avoid activation of the innate immune system. Here we show that chemically unmodified mRNA can achieve those goals as well by applying sequence-engineered molecules. Using erythropoietin (EPO) driven production of red blood cells as the biological model, engineered Epo mRNA elicited meaningful physiological responses from mice to nonhuman primates. Even in pigs of about 20 kg in weight, a single adequate dose of engineered mRNA encapsulated in lipid nanoparticles (LNPs) induced high systemic Epo levels and strong physiological effects. Our results demonstrate that sequence-engineered mRNA has the potential to revolutionize human protein therapies. PMID:26050989

  20. Involvement of the 5'-leader sequence in coupling the stability of a human H3 histone mRNA with DNA replication

    International Nuclear Information System (INIS)

    Morris, T.; Marashi, F.; Weber, L.; Hickey, E.; Greenspan, D.; Bonner, J.; Stein, J.; Stein, G.

    1986-01-01

    Two lines of evidence derived from fusion gene constructs indicate that sequences residing in the 5'-nontranslated region of a cell cycle-dependent human H3 histone mRNA are involved in the selective destabilization that occurs when DNA synthesis is terminated. The experimental approach was to construct chimeric genes in which fragments of the mRNA coding regions of the H3 histone gene were fused with fragments of genes not expressed in a cell cycle-dependent manner. After transfection in HeLa S3 cells with the recombinant plasmids, levels of fusion mRNAs were determined by S1 nuclease analysis prior to and following DNA synthesis inhibition. When the first 20 nucleotides of an H3 histone mRNA leader were replaced with 89 nucleotides of the leader from a Drosophila heat-shock (hsp70) mRNA, the fusion transcript remained stable during inhibition of DNA synthesis, in contrast to the rapid destabilization of the endogenous histone mRNA in these cells. In a reciprocal experiment, a histone-globin fusion gene was constructed that produced a transcript with the initial 20 nucleotides of the H3 histone mRNA substituted for the human β-globin mRNA leader. In HeLa cells treated with inhibitors of DNA synthesis and/or protein synthesis, cellular levels of this histone-globin fusion mRNA appeared to be regulated in a manner similar to endogenous histone mRNA levels. These results suggest that the first 20 nucleotides of the leader are sufficient to couple histone mRNA stability with DNA replication

  1. Some Algebraic Aspects of MorseCode Sequences

    OpenAIRE

    Johann Cigler

    2003-01-01

    Morse code sequences are very useful to give combinatorial interpretations of various properties of Fibonacci numbers. In this note we study some algebraic and combinatorial aspects of Morse code sequences and obtain several q-analogues of Fibonacci numbers and Fibonacci polynomials and their generalizations.

  2. Some Algebraic Aspects of MorseCode Sequences

    Directory of Open Access Journals (Sweden)

    Johann Cigler

    2003-06-01

    Full Text Available Morse code sequences are very useful to give combinatorial interpretations of various properties of Fibonacci numbers. In this note we study some algebraic and combinatorial aspects of Morse code sequences and obtain several q-analogues of Fibonacci numbers and Fibonacci polynomials and their generalizations.

  3. Nucleotide sequence of the gene coding for human factor VII, a vitamin K-dependent protein participating in blood coagulation

    International Nuclear Information System (INIS)

    O'Hara, P.J.; Grant, F.J.; Haldeman, B.A.; Gray, C.L.; Insley, M.Y.; Hagen, F.S.; Murray, M.J.

    1987-01-01

    Activated factor VII (factor VIIa) is a vitamin K-dependent plasma serine protease that participates in a cascade of reactions leading to the coagulation of blood. Two overlapping genomic clones containing sequences encoding human factor VII were isolated and characterized. The complete sequence of the gene was determined and found to span about 12.8 kilobases. The mRNA for factor VII as demonstrated by cDNA cloning is polyadenylylated at multiple sites but contains only one AAUAAA poly(A) signal sequence. The mRNA can undergo alternative splicing, forming one transcript containing eight segments as exons and another with an additional exon that encodes a larger prepro leader sequence. The latter transcript has no known counterpart in the other vitamin K-dependent proteins. The positions of the introns with respect to the amino acid sequence encoded by the eight essential exons of factor VII are the same as those present in factor IX, factor X, protein C, and the first three exons of prothrombin. These exons code for domains generally conserved among members of this gene family. The comparable introns in these genes, however, are dissimilar with respect to size and sequence, with the exception of intron C in factor VII and protein C. The gene for factor VII also contains five regions made up of tandem repeats of oligonucleotide monomer elements. More than a quarter of the intron sequences and more than a third of the 3' untranslated portion of the mRNA transcript consist of these minisatellite tandem repeats

  4. Translation of vph mRNA in Streptomyces lividans and Escherichia coli after removal of the 5' untranslated leader.

    Science.gov (United States)

    Wu, C J; Janssen, G R

    1996-10-01

    The Streptomyces vinaceus viomycin phosphotransferase (vph) mRNA contains an untranslated leader with a conventional Shine-Dalgarno homology. The vph leader was removed by ligation of the vph coding sequence to the transcriptional start site of a Streptomyces or an Escherichia coli promoter, such that transcription would initiate at the first position of the vph start codon. Analysis of mRNA demonstrated that transcription initiated primarily at the A of the vph AUG translational start codon in both Streptomyces lividans and E. coli; cells expressing the unleadered vph mRNA were resistant to viomycin indicating that the Shine-Dalgarno sequence, or other features contained within the leader, was not necessary for vph translation. Addition of four nucleotides (5'-AUGC-3') onto the 5' end of the unleadered vph mRNA resulted in translation initiation from the vph start codon and the AUG triplet contained within the added sequence. Translational fusions of vph sequence to a Tn5 neo reporter gene indicated that the first 16 codons of vph coding sequence were sufficient to specify the translational start site and reading frame for expression of neomycin resistance in both E. coli and S. lividans.

  5. Detecting non-coding selective pressure in coding regions

    Directory of Open Access Journals (Sweden)

    Blanchette Mathieu

    2007-02-01

    Full Text Available Abstract Background Comparative genomics approaches, where orthologous DNA regions are compared and inter-species conserved regions are identified, have proven extremely powerful for identifying non-coding regulatory regions located in intergenic or intronic regions. However, non-coding functional elements can also be located within coding region, as is common for exonic splicing enhancers, some transcription factor binding sites, and RNA secondary structure elements affecting mRNA stability, localization, or translation. Since these functional elements are located in regions that are themselves highly conserved because they are coding for a protein, they generally escaped detection by comparative genomics approaches. Results We introduce a comparative genomics approach for detecting non-coding functional elements located within coding regions. Codon evolution is modeled as a mixture of codon substitution models, where each component of the mixture describes the evolution of codons under a specific type of coding selective pressure. We show how to compute the posterior distribution of the entropy and parsimony scores under this null model of codon evolution. The method is applied to a set of growth hormone 1 orthologous mRNA sequences and a known exonic splicing elements is detected. The analysis of a set of CORTBP2 orthologous genes reveals a region of several hundred base pairs under strong non-coding selective pressure whose function remains unknown. Conclusion Non-coding functional elements, in particular those involved in post-transcriptional regulation, are likely to be much more prevalent than is currently known. With the numerous genome sequencing projects underway, comparative genomics approaches like that proposed here are likely to become increasingly powerful at detecting such elements.

  6. Genetic Code Analysis Toolkit: A novel tool to explore the coding properties of the genetic code and DNA sequences

    Science.gov (United States)

    Kraljić, K.; Strüngmann, L.; Fimmel, E.; Gumbel, M.

    2018-01-01

    The genetic code is degenerated and it is assumed that redundancy provides error detection and correction mechanisms in the translation process. However, the biological meaning of the code's structure is still under current research. This paper presents a Genetic Code Analysis Toolkit (GCAT) which provides workflows and algorithms for the analysis of the structure of nucleotide sequences. In particular, sets or sequences of codons can be transformed and tested for circularity, comma-freeness, dichotomic partitions and others. GCAT comes with a fertile editor custom-built to work with the genetic code and a batch mode for multi-sequence processing. With the ability to read FASTA files or load sequences from GenBank, the tool can be used for the mathematical and statistical analysis of existing sequence data. GCAT is Java-based and provides a plug-in concept for extensibility. Availability: Open source Homepage:http://www.gcat.bio/

  7. A nonsense mutation causing decreased levels of insulin receptor mRNA: Detection by a simplified technique for direct sequencing of genomic DNA amplified by the polymerase chain reaction

    International Nuclear Information System (INIS)

    Kadowaki, T.; Kadowaki, H.; Taylor, S.I.

    1990-01-01

    Mutations in the insulin receptor gene can render the cell resistant to the biological action of insulin. The authors have studied a patient with leprechaunism (leprechaun/Minn-1), a genetic syndrome associated with intrauterine growth retardation and extreme insulin resistance. Genomic DNA from the patient was amplified by the polymerase chain reaction catalyzed by Thermus aquaticus (Taq) DNA polymerase, and the amplified DNA was directly sequenced. A nonsense mutations was identified at codon 897 in exon 14 in the paternal allele of the patient's insulin receptor gene. Levels of insulin receptor mRNA are decreased to <10% of normal in Epstein-Barr virus-transformed lymphoblasts and cultured skin fibroblasts from this patient. Thus, this nonsense mutation appears to cause a decrease in the levels of insulin receptor mRNA. In addition, they have obtained indirect evidence that the patient's maternal allele of the insulin receptor gene contains a cis-acting dominant mutation that also decreases the level of mRNA, but by a different mechanism. The nucleotide sequence of the entire protein-coding domain and the sequences of the intron-exon boundaries for all 22 exons of the maternal allele were normal. Presumably, the mutation in the maternal allele maps elsewhere in the insulin receptor gene. Thus, they conclude that the patient is a compound heterozygote for two cis-acting dominant mutations in the insulin receptor gene: (i) a nonsense mutation in the paternal allel that reduces the level of insulin receptor mRNA and (ii) an as yet unidentified mutation in the maternal allele that either decreases the rate of transcription or decreases the stability of the mRNA

  8. Highly conserved non-coding sequences are associated with vertebrate development.

    Directory of Open Access Journals (Sweden)

    Adam Woolfe

    2005-01-01

    Full Text Available In addition to protein coding sequence, the human genome contains a significant amount of regulatory DNA, the identification of which is proving somewhat recalcitrant to both in silico and functional methods. An approach that has been used with some success is comparative sequence analysis, whereby equivalent genomic regions from different organisms are compared in order to identify both similarities and differences. In general, similarities in sequence between highly divergent organisms imply functional constraint. We have used a whole-genome comparison between humans and the pufferfish, Fugu rubripes, to identify nearly 1,400 highly conserved non-coding sequences. Given the evolutionary divergence between these species, it is likely that these sequences are found in, and furthermore are essential to, all vertebrates. Most, and possibly all, of these sequences are located in and around genes that act as developmental regulators. Some of these sequences are over 90% identical across more than 500 bases, being more highly conserved than coding sequence between these two species. Despite this, we cannot find any similar sequences in invertebrate genomes. In order to begin to functionally test this set of sequences, we have used a rapid in vivo assay system using zebrafish embryos that allows tissue-specific enhancer activity to be identified. Functional data is presented for highly conserved non-coding sequences associated with four unrelated developmental regulators (SOX21, PAX6, HLXB9, and SHH, in order to demonstrate the suitability of this screen to a wide range of genes and expression patterns. Of 25 sequence elements tested around these four genes, 23 show significant enhancer activity in one or more tissues. We have identified a set of non-coding sequences that are highly conserved throughout vertebrates. They are found in clusters across the human genome, principally around genes that are implicated in the regulation of development

  9. An Auto sequence Code to Integrate a Neutron Unfolding Code with thePC-MCA Accuspec

    International Nuclear Information System (INIS)

    Darsono

    2000-01-01

    In a neutron spectrometry using proton recoil method, the neutronunfolding code is needed to unfold the measured proton spectrum to become theneutron spectrum. The process of the unfolding neutron in the existingneutron spectrometry which was successfully installed last year was doneseparately. This manuscript reports that the auto sequence code to integratethe neutron unfolding code UNFSPEC.EXE with the software facility of thePC-MCA Accuspec has been made and run successfully so that the new neutronspectrometry become compact. The auto sequence code was written based on therules in application program facility of PC-MCA Accuspec and then it wascompiled using AC-EXE. Result of the test of the auto sequence code showedthat for binning width 20, 30, and 40 giving a little different spectrumshape. The binning width around 30 gives a better spectrum in mean of givingsmall error compared to the others. (author)

  10. Heterogeneity of rat tropoelastin mRNA revealed by cDNA cloning

    International Nuclear Information System (INIS)

    Pierce, R.A.; Deak, S.B.; Stolle, C.A.; Boyd, C.D.

    1990-01-01

    A λgt11 library constructed from poly(A+) RNA isolated from aortic tissue of neonatal rats was screened for rat tropoelastin cDNAs. The first, screen, utilizing a human tropoelastin cDNA clone, provided rat tropoelastin cDNAs spanning 2.3 kb of carboxy-terminal coding sequence and extended into the 3'-untranslated region. A subsequent screen using a 5' rat tropoelastin cDNA clone yielded clones extending into the amino-terminal signal sequence coding region. Sequence analysis of these clones has provided the complete derived amino acid sequence of rat tropoelastin and allowed alignment and comparison with published bovine cDNA sequence. While the overall structure of rat tropoelastin is similar to bovine sequence, numerous substitutions, deletions, and insertions demonstrated considerable heterogeneity between species. In particular, the pentapeptide repeat VPGVG, characteristic of all tropoelastins analyzed to date, is replaced in rat tropoelastin by a repeating pentapeptide, IPGVG. The hexapeptide repeat VGVAPG, the bovine elastin receptor binding peptide, is not encoded by rat tropoelastin cDNAs. Variations in coding sequence between rat tropoelastin CDNA clones were also found which may represent mRNA heterogeneity produced by alternative splicing of the rat tropoelastin pre-mRNA

  11. Genome-wide identification of coding and non-coding conserved sequence tags in human and mouse genomes

    Directory of Open Access Journals (Sweden)

    Maggi Giorgio P

    2008-06-01

    Full Text Available Abstract Background The accurate detection of genes and the identification of functional regions is still an open issue in the annotation of genomic sequences. This problem affects new genomes but also those of very well studied organisms such as human and mouse where, despite the great efforts, the inventory of genes and regulatory regions is far from complete. Comparative genomics is an effective approach to address this problem. Unfortunately it is limited by the computational requirements needed to perform genome-wide comparisons and by the problem of discriminating between conserved coding and non-coding sequences. This discrimination is often based (thus dependent on the availability of annotated proteins. Results In this paper we present the results of a comprehensive comparison of human and mouse genomes performed with a new high throughput grid-based system which allows the rapid detection of conserved sequences and accurate assessment of their coding potential. By detecting clusters of coding conserved sequences the system is also suitable to accurately identify potential gene loci. Following this analysis we created a collection of human-mouse conserved sequence tags and carefully compared our results to reliable annotations in order to benchmark the reliability of our classifications. Strikingly we were able to detect several potential gene loci supported by EST sequences but not corresponding to as yet annotated genes. Conclusion Here we present a new system which allows comprehensive comparison of genomes to detect conserved coding and non-coding sequences and the identification of potential gene loci. Our system does not require the availability of any annotated sequence thus is suitable for the analysis of new or poorly annotated genomes.

  12. On the optimal trimming of high-throughput mRNA sequence data

    Directory of Open Access Journals (Sweden)

    Matthew D MacManes

    2014-01-01

    Full Text Available The widespread and rapid adoption of high-throughput sequencing technologies has afforded researchers the opportunity to gain a deep understanding of genome level processes that underlie evolutionary change, and perhaps more importantly, the links between genotype and phenotype. In particular, researchers interested in functional biology and adaptation have used these technologies to sequence mRNA transcriptomes of specific tissues, which in turn are often compared to other tissues, or other individuals with different phenotypes. While these techniques are extremely powerful, careful attention to data quality is required. In particular, because high-throughput sequencing is more error-prone than traditional Sanger sequencing, quality trimming of sequence reads should be an important step in all data processing pipelines. While several software packages for quality trimming exist, no general guidelines for the specifics of trimming have been developed. Here, using empirically derived sequence data, I provide general recommendations regarding the optimal strength of trimming, specifically in mRNA-Seq studies. Although very aggressive quality trimming is common, this study suggests that a more gentle trimming, specifically of those nucleotides whose Phred score < 2 or < 5, is optimal for most studies across a wide variety of metrics.

  13. Performance Analysis for Cooperative Communication System with QC-LDPC Codes Constructed with Integer Sequences

    Directory of Open Access Journals (Sweden)

    Yan Zhang

    2015-01-01

    Full Text Available This paper presents four different integer sequences to construct quasi-cyclic low-density parity-check (QC-LDPC codes with mathematical theory. The paper introduces the procedure of the coding principle and coding. Four different integer sequences constructing QC-LDPC code are compared with LDPC codes by using PEG algorithm, array codes, and the Mackey codes, respectively. Then, the integer sequence QC-LDPC codes are used in coded cooperative communication. Simulation results show that the integer sequence constructed QC-LDPC codes are effective, and overall performance is better than that of other types of LDPC codes in the coded cooperative communication. The performance of Dayan integer sequence constructed QC-LDPC is the most excellent performance.

  14. Coding visual features extracted from video sequences.

    Science.gov (United States)

    Baroffio, Luca; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

    2014-05-01

    Visual features are successfully exploited in several applications (e.g., visual search, object recognition and tracking, etc.) due to their ability to efficiently represent image content. Several visual analysis tasks require features to be transmitted over a bandwidth-limited network, thus calling for coding techniques to reduce the required bit budget, while attaining a target level of efficiency. In this paper, we propose, for the first time, a coding architecture designed for local features (e.g., SIFT, SURF) extracted from video sequences. To achieve high coding efficiency, we exploit both spatial and temporal redundancy by means of intraframe and interframe coding modes. In addition, we propose a coding mode decision based on rate-distortion optimization. The proposed coding scheme can be conveniently adopted to implement the analyze-then-compress (ATC) paradigm in the context of visual sensor networks. That is, sets of visual features are extracted from video frames, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast to the traditional compress-then-analyze (CTA) paradigm, in which video sequences acquired at a node are compressed and then sent to a central unit for further processing. In this paper, we compare these coding paradigms using metrics that are routinely adopted to evaluate the suitability of visual features in the context of content-based retrieval, object recognition, and tracking. Experimental results demonstrate that, thanks to the significant coding gains achieved by the proposed coding scheme, ATC outperforms CTA with respect to all evaluation metrics.

  15. Nuclear RNA sequencing of the mouse erythroid cell transcriptome.

    Directory of Open Access Journals (Sweden)

    Jennifer A Mitchell

    Full Text Available In addition to protein coding genes a substantial proportion of mammalian genomes are transcribed. However, most transcriptome studies investigate steady-state mRNA levels, ignoring a considerable fraction of the transcribed genome. In addition, steady-state mRNA levels are influenced by both transcriptional and posttranscriptional mechanisms, and thus do not provide a clear picture of transcriptional output. Here, using deep sequencing of nuclear RNAs (nucRNA-Seq in parallel with chromatin immunoprecipitation sequencing (ChIP-Seq of active RNA polymerase II, we compared the nuclear transcriptome of mouse anemic spleen erythroid cells with polymerase occupancy on a genome-wide scale. We demonstrate that unspliced transcripts quantified by nucRNA-seq correlate with primary transcript frequencies measured by RNA FISH, but differ from steady-state mRNA levels measured by poly(A-enriched RNA-seq. Highly expressed protein coding genes showed good correlation between RNAPII occupancy and transcriptional output; however, genome-wide we observed a poor correlation between transcriptional output and RNAPII association. This poor correlation is due to intergenic regions associated with RNAPII which correspond with transcription factor bound regulatory regions and a group of stable, nuclear-retained long non-coding transcripts. In conclusion, sequencing the nuclear transcriptome provides an opportunity to investigate the transcriptional landscape in a given cell type through quantification of unspliced primary transcripts and the identification of nuclear-retained long non-coding RNAs.

  16. Machine-Checked Sequencer for Critical Embedded Code Generator

    Science.gov (United States)

    Izerrouken, Nassima; Pantel, Marc; Thirioux, Xavier

    This paper presents the development of a correct-by-construction block sequencer for GeneAuto a qualifiable (according to DO178B/ED12B recommendation) automatic code generator. It transforms Simulink models to MISRA C code for safety critical systems. Our approach which combines classical development process and formal specification and verification using proof-assistants, led to preliminary fruitful exchanges with certification authorities. We present parts of the classical user and tools requirements and derived formal specifications, implementation and verification for the correctness and termination of the block sequencer. This sequencer has been successfully applied to real-size industrial use cases from various transportation domain partners and led to requirement errors detection and a correct-by-construction implementation.

  17. Combined sequencing of mRNA and DNA from human embryonic stem cells

    Directory of Open Access Journals (Sweden)

    Florian Mertes

    2016-06-01

    Full Text Available Combined transcriptome and whole genome sequencing of the same ultra-low input sample down to single cells is a rapidly evolving approach for the analysis of rare cells. Besides stem cells, rare cells originating from tissues like tumor or biopsies, circulating tumor cells and cells from early embryonic development are under investigation. Herein we describe a universal method applicable for the analysis of minute amounts of sample material (150 to 200 cells derived from sub-colony structures from human embryonic stem cells. The protocol comprises the combined isolation and separate amplification of poly(A mRNA and whole genome DNA followed by next generation sequencing. Here we present a detailed description of the method developed and an overview of the results obtained for RNA and whole genome sequencing of human embryonic stem cells, sequencing data is available in the Gene Expression Omnibus (GEO database under accession number GSE69471.

  18. Regulatory Roles for Long ncRNA and mRNA

    International Nuclear Information System (INIS)

    Karapetyan, Armen R.; Buiting, Coen; Kuiper, Renske A.; Coolen, Marcel W.

    2013-01-01

    Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs) are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2) with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research

  19. Regulatory Roles for Long ncRNA and mRNA

    Energy Technology Data Exchange (ETDEWEB)

    Karapetyan, Armen R.; Buiting, Coen; Kuiper, Renske A.; Coolen, Marcel W., E-mail: M.Coolen@gen.umcn.nl [Department of Human Genetics, Nijmegen Centre for Molecular Life Sciences (NCMLS), Radboud University Nijmegen Medical Centre, P.O. Box 9101, Nijmegen 6500 HB (Netherlands)

    2013-04-26

    Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs) are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2) with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research.

  20. Algebraic solution of the synthesis problem for coded sequences

    International Nuclear Information System (INIS)

    Leukhin, Anatolii N

    2005-01-01

    The algebraic solution of a 'complex' problem of synthesis of phase-coded (PC) sequences with the zero level of side lobes of the cyclic autocorrelation function (ACF) is proposed. It is shown that the solution of the synthesis problem is connected with the existence of difference sets for a given code dimension. The problem of estimating the number of possible code combinations for a given code dimension is solved. It is pointed out that the problem of synthesis of PC sequences is related to the fundamental problems of discrete mathematics and, first of all, to a number of combinatorial problems, which can be solved, as the number factorisation problem, by algebraic methods by using the theory of Galois fields and groups. (fourth seminar to the memory of d.n. klyshko)

  1. Definition of the complete Schistosoma mansoni hemoglobinase mRNA sequence and gene expression in developing parasites.

    Science.gov (United States)

    el Meanawy, M A; Aji, T; Phillips, N F; Davis, R E; Salata, R A; Malhotra, I; McClain, D; Aikawa, M; Davis, A H

    1990-07-01

    Schistosoma mansoni uses a variety of proteases termed hemoglobinases to obtain nutrition from host globin. Previous reports have characterized cDNAs encoding 1 of these enzymes. However, these sequences did not define the primary structures of the mRNA and protein. The complete sequence of the 1390 base mRNA has now been determined. It encodes a 50 kDa primary translation product. In vitro translations coupled with immunoprecipitations and Western blots of parasite lysates allowed visualization of the 50 kDa form. Production of the 31 kDa mature hemoglobinase from the 50 kDa species involves removal of both NH2 and COOH terminal residues from the primary translation product. Expression of hemoglobinase mRNA and protein was examined during larval parasite development. Low levels were observed in young schistosomula. After 6-9 days in culture, high hemoglobinase levels were seen which correlated with the onset of red blood cell feeding. Immunoelectron microscopy was employed to examine hemoglobinase location and function. In adult worms the enzyme was associated with the gut lumen and gut epithelium. In cercariae, the protease was observed in the head gland, suggesting new roles for the protease.

  2. DNA watermarks in non-coding regulatory sequences

    Directory of Open Access Journals (Sweden)

    Pyka Martin

    2009-07-01

    Full Text Available Abstract Background DNA watermarks can be applied to identify the unauthorized use of genetically modified organisms. It has been shown that coding regions can be used to encrypt information into living organisms by using the DNA-Crypt algorithm. Yet, if the sequence of interest presents a non-coding DNA sequence, either the function of a resulting functional RNA molecule or a regulatory sequence, such as a promoter, could be affected. For our studies we used the small cytoplasmic RNA 1 in yeast and the lac promoter region of Escherichia coli. Findings The lac promoter was deactivated by the integrated watermark. In addition, the RNA molecules displayed altered configurations after introducing a watermark, but surprisingly were functionally intact, which has been verified by analyzing the growth characteristics of both wild type and watermarked scR1 transformed yeast cells. In a third approach we introduced a second overlapping watermark into the lac promoter, which did not affect the promoter activity. Conclusion Even though the watermarked RNA and one of the watermarked promoters did not show any significant differences compared to the wild type RNA and wild type promoter region, respectively, it cannot be generalized that other RNA molecules or regulatory sequences behave accordingly. Therefore, we do not recommend integrating watermark sequences into regulatory regions.

  3. Human α2-HS-glycoprotein: the A and B chains with a connecting sequence are encoded by a single mRNA transcript

    International Nuclear Information System (INIS)

    Lee, C.C.; Bowman, B.H.; Yang, F.

    1987-01-01

    The α 2 -HS-glycoprotein (AHSG) is a plasma protein reported to play roles in bone mineralization and in the immune response. It is composed of two subunits, the A and B chains. Recombinant plasmids containing human cDNA AHSG have been isolated by screening an adult human liver library with a mixed oligonucleotide probe. The cDNA clones containing AHSG inserts span approximately 1.5 kilobase pairs and include the entire AHSG coding sequence, demonstrating that the A and B chains are encoded by a single mRNA transcript. The cDNA sequence predicts an 18-amino-acid signal peptide, followed by the A-chain sequence of AHSG. A heretofore unseen connecting sequence of 40 amino acids was deduced between the A- and B-chain sequences. The connecting sequence demonstrates the unique amino acid doublets and collagen triplets found in the A and B chains; it is not homologous with other reported amino acid sequences. The connecting sequence may be cleaved in a posttranslational step by limited proteolysis before mature AHSG is released into the circulation or may vary in its presence because of alternative processing. The AHSG cDNA was utilized for mapping the AHSG gene to the 3q21→qter region of human chromosome 3. The availability of the AHSG cDNA clone will facilitate the analysis of its genetic control and gene expression during development and bone formation

  4. Golay sequences coded coherent optical OFDM for long-haul transmission

    Science.gov (United States)

    Qin, Cui; Ma, Xiangrong; Hua, Tao; Zhao, Jing; Yu, Huilong; Zhang, Jian

    2017-09-01

    We propose to use binary Golay sequences in coherent optical orthogonal frequency division multiplexing (CO-OFDM) to improve the long-haul transmission performance. The Golay sequences are generated by binary Reed-Muller codes, which have low peak-to-average power ratio and certain error correction capability. A low-complexity decoding algorithm for the Golay sequences is then proposed to recover the signal. Under same spectral efficiency, the QPSK modulated OFDM with binary Golay sequences coding with and without discrete Fourier transform (DFT) spreading (DFTS-QPSK-GOFDM and QPSK-GOFDM) are compared with the normal BPSK modulated OFDM with and without DFT spreading (DFTS-BPSK-OFDM and BPSK-OFDM) after long-haul transmission. At a 7% forward error correction code threshold (Q2 factor of 8.5 dB), it is shown that DFTS-QPSK-GOFDM outperforms DFTS-BPSK-OFDM by extending the transmission distance by 29% and 18%, in non-dispersion managed and dispersion managed links, respectively.

  5. Fast comparison of IS radar code sequences for lag profile inversion

    Directory of Open Access Journals (Sweden)

    M. S. Lehtinen

    2008-08-01

    Full Text Available A fast method for theoretically comparing the posteriori variances produced by different phase code sequences in incoherent scatter radar (ISR experiments is introduced. Alternating codes of types 1 and 2 are known to be optimal for selected range resolutions, but the code sets are inconveniently long for many purposes like ground clutter estimation and in cases where coherent echoes from lower ionospheric layers are to be analyzed in addition to standard F-layer spectra.

    The method is used in practice for searching binary code quads that have estimation accuracy almost equal to that of much longer alternating code sets. Though the code sequences can consist of as few as four different transmission envelopes, the lag profile estimation variances are near to the theoretical minimum. Thus the short code sequence is equally good as a full cycle of alternating codes with the same pulse length and bit length. The short code groups cannot be directly decoded, but the decoding is done in connection with more computationally expensive lag profile inversion in data analysis.

    The actual code searches as well as the analysis and real data results from the found short code searches are explained in other papers sent to the same issue of this journal. We also discuss interesting subtle differences found between the different alternating codes by this method. We assume that thermal noise dominates the incoherent scatter signal.

  6. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3' UTRs and coding sequences.

    Science.gov (United States)

    Šulc, Miroslav; Marín, Ray M; Robins, Harlan S; Vaníček, Jiří

    2015-07-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3' untranslated regions (3' UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3' UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA-mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA-mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  7. Single step production of Cas9 mRNA for zygote injection.

    Science.gov (United States)

    Redel, Bethany K; Beaton, Benjamin P; Spate, Lee D; Benne, Joshua A; Murphy, Stephanie L; O'Gorman, Chad W; Spate, Anna M; Prather, Randall S; Wells, Kevin D

    2018-03-01

    Production of Cas9 mRNA in vitro typically requires the addition of a 5´ cap and 3´ polyadenylation. A plasmid was constructed that harbored the T7 promoter followed by the EMCV IRES and a Cas9 coding region. We hypothesized that the use of the metastasis associated lung adenocarcinoma transcript 1 (Malat1) triplex structure downstream of an IRES/Cas9 expression cassette would make polyadenylation of in vitro produced mRNA unnecessary. A sequence from the mMalat1 gene was cloned downstream of the IRES/Cas9 cassette described above. An mRNA concentration curve was constructed with either commercially available Cas9 mRNA or the IRES/ Cas9/triplex, by injection into porcine zygotes. Blastocysts were genotyped to determine if differences existed in the percent of embryos modified. The concentration curve identified differences due to concentration and RNA type injected. Single step production of Cas9 mRNA provides an alternative source of Cas9 for use in zygote injections.

  8. Sequence Coding and Search System for licensee event reports: code listings. Volume 2

    International Nuclear Information System (INIS)

    Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

    1985-04-01

    Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This system provides a structured format for detailed coding of component, system, and unit effects as well as personnel errors. The database contains all current LERs submitted by nuclear power plant utilities for events occurring since 1981 and is updated on a continual basis. Volume 2 contains all valid and acceptable codes used for searching and encoding the LER data. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 2

  9. Alternative splicing of human elastin mRNA indicated by sequence analysis of cloned genomic and complementary DNA

    International Nuclear Information System (INIS)

    Indik, Z.; Yeh, H.; Ornstein-goldstein, N.; Sheppard, P.; Anderson, N.; Rosenbloom, J.C.; Peltonen, L.; Rosenbloom, J.

    1987-01-01

    Poly(A) + RNA, isolated from a single 7-mo fetal human aorta, was used to synthesize cDNA by the RNase H method, and the cDNA was inserted into λgt10. Recombinant phage containing elastin sequences were identified by hybridization with cloned, exon-containing fragments of the human elastin gene. Three clones containing inserts of 3.3, 2.7, and 2.3 kilobases were selected for further analysis. Three overlapping clones containing 17.8 kilobases of the human elastin gene were also isolated from genomic libraries. Complete sequence analysis of the six clones demonstrated that: (i) the cDNA encompassed the entire translated portion of the mRNA encoding 786 amino acids, including several unusual hydrophilic amino acid sequences not previously identified in porcine tropoelastin, (ii) exons encoding either hydrophobic or crosslinking domains in the protein alternated in the gene, and (iii) a great abundance of Alu repetitive sequences occurred throughout the introns. The data also indicated substantial alternative splicing of the mRNA. These results suggest the potential for significant variation in the precise molecular structure of the elastic fiber in the human population

  10. Regulatory Roles for Long ncRNA and mRNA

    Directory of Open Access Journals (Sweden)

    Marcel W. Coolen

    2013-04-01

    Full Text Available Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs are frequent events. The current dogma of RNA function describes mRNA to be responsible for the synthesis of proteins, whereas non-coding RNA can have regulatory or epigenetic functions. However, this distinction between protein coding and regulatory ability of transcripts may not be that strict. Here, we review the increasing body of evidence for the existence of multifunctional RNAs that have both protein-coding and trans-regulatory roles. Moreover, we demonstrate that coding transcripts bind to components of the Polycomb Repressor Complex 2 (PRC2 with similar affinities as non-coding transcripts, revealing potential epigenetic regulation by mRNAs. We hypothesize that studies on the regulatory ability of disease-associated mRNAs will form an important new field of research.

  11. Annotation of the protein coding regions of the equine genome

    DEFF Research Database (Denmark)

    Hestand, Matthew S.; Kalbfleisch, Theodore S.; Coleman, Stephen J.

    2015-01-01

    Current gene annotation of the horse genome is largely derived from in silico predictions and cross-species alignments. Only a small number of genes are annotated based on equine EST and mRNA sequences. To expand the number of equine genes annotated from equine experimental evidence, we sequenced m...... and appear to be small errors in the equine reference genome, since they are also identified as homozygous variants by genomic DNA resequencing of the reference horse. Taken together, we provide a resource of equine mRNA structures and protein coding variants that will enhance equine and cross...

  12. Integrated mRNA and microRNA transcriptome sequencing characterizes sequence variants and mRNA–microRNA regulatory network in nasopharyngeal carcinoma model systems

    Directory of Open Access Journals (Sweden)

    Carol Ying-Ying Szeto

    2014-01-01

    Full Text Available Nasopharyngeal carcinoma (NPC is a prevalent malignancy in Southeast Asia among the Chinese population. Aberrant regulation of transcripts has been implicated in many types of cancers including NPC. Herein, we characterized mRNA and miRNA transcriptomes by RNA sequencing (RNASeq of NPC model systems. Matched total mRNA and small RNA of undifferentiated Epstein–Barr virus (EBV-positive NPC xenograft X666 and its derived cell line C666, well-differentiated NPC cell line HK1, and the immortalized nasopharyngeal epithelial cell line NP460 were sequenced by Solexa technology. We found 2812 genes and 149 miRNAs (human and EBV to be differentially expressed in NP460, HK1, C666 and X666 with RNASeq; 533 miRNA–mRNA target pairs were inversely regulated in the three NPC cell lines compared to NP460. Integrated mRNA/miRNA expression profiling and pathway analysis show extracellular matrix organization, Beta-1 integrin cell surface interactions, and the PI3K/AKT, EGFR, ErbB, and Wnt pathways were potentially deregulated in NPC. Real-time quantitative PCR was performed on selected mRNA/miRNAs in order to validate their expression. Transcript sequence variants such as short insertions and deletions (INDEL, single nucleotide variant (SNV, and isomiRs were characterized in the NPC model systems. A novel TP53 transcript variant was identified in NP460, HK1, and C666. Detection of three previously reported novel EBV-encoded BART miRNAs and their isomiRs were also observed. Meta-analysis of a model system to a clinical system aids the choice of different cell lines in NPC studies. This comprehensive characterization of mRNA and miRNA transcriptomes in NPC cell lines and the xenograft provides insights on miRNA regulation of mRNA and valuable resources on transcript variation and regulation in NPC, which are potentially useful for mechanistic and preclinical studies.

  13. Auto-Regulatory RNA Editing Fine-Tunes mRNA Re-Coding and Complex Behaviour in Drosophila

    Science.gov (United States)

    Savva, Yiannis A.; Jepson, James E.C; Sahin, Asli; Sugden, Arthur U.; Dorsky, Jacquelyn S.; Alpert, Lauren; Lawrence, Charles; Reenan, Robert A.

    2014-01-01

    Auto-regulatory feedback loops are a common molecular strategy used to optimize protein function. In Drosophila many mRNAs involved in neuro-transmission are re-coded at the RNA level by the RNA editing enzyme dADAR, leading to the incorporation of amino acids that are not directly encoded by the genome. dADAR also re-codes its own transcript, but the consequences of this auto-regulation in vivo are unclear. Here we show that hard-wiring or abolishing endogenous dADAR auto-regulation dramatically remodels the landscape of re-coding events in a site-specific manner. These molecular phenotypes correlate with altered localization of dADAR within the nuclear compartment. Furthermore, auto-editing exhibits sexually dimorphic patterns of spatial regulation and can be modified by abiotic environmental factors. Finally, we demonstrate that modifying dAdar auto-editing affects adaptive complex behaviors. Our results reveal the in vivo relevance of auto-regulatory control over post-transcriptional mRNA re-coding events in fine-tuning brain function and organismal behavior. PMID:22531175

  14. Selection of mRNA 5'-untranslated region sequence with high translation efficiency through ribosome display

    International Nuclear Information System (INIS)

    Mie, Masayasu; Shimizu, Shun; Takahashi, Fumio; Kobatake, Eiry

    2008-01-01

    The 5'-untranslated region (5'-UTR) of mRNAs functions as a translation enhancer, promoting translation efficiency. Many in vitro translation systems exhibit a reduced efficiency in protein translation due to decreased translation initiation. The use of a 5'-UTR sequence with high translation efficiency greatly enhances protein production in these systems. In this study, we have developed an in vitro selection system that favors 5'-UTRs with high translation efficiency using a ribosome display technique. A 5'-UTR random library, comprised of 5'-UTRs tagged with a His-tag and Renilla luciferase (R-luc) fusion, were in vitro translated in rabbit reticulocytes. By limiting the translation period, only mRNAs with high translation efficiency were translated. During translation, mRNA, ribosome and translated R-luc with His-tag formed ternary complexes. They were collected with translated His-tag using Ni-particles. Extracted mRNA from ternary complex was amplified using RT-PCR and sequenced. Finally, 5'-UTR with high translation efficiency was obtained from random 5'-UTR library

  15. Combinatorial Control of mRNA Fates by RNA-Binding Proteins and Non-Coding RNAs

    Directory of Open Access Journals (Sweden)

    Valentina Iadevaia

    2015-09-01

    Full Text Available Post-transcriptional control of gene expression is mediated by RNA-binding proteins (RBPs and small non-coding RNAs (e.g., microRNAs that bind to distinct elements in their mRNA targets. Here, we review recent examples describing the synergistic and/or antagonistic effects mediated by RBPs and miRNAs to determine the localisation, stability and translation of mRNAs in mammalian cells. From these studies, it is becoming increasingly apparent that dynamic rearrangements of RNA-protein complexes could have profound implications in human cancer, in synaptic plasticity, and in cellular differentiation.

  16. pEVL: A Linear Plasmid for Generating mRNA IVT Templates With Extended Encoded Poly(A Sequences

    Directory of Open Access Journals (Sweden)

    Alexandra E Grier

    2016-01-01

    Full Text Available Increasing demand for large-scale synthesis of in vitro transcribed (IVT mRNA is being driven by the increasing use of mRNA for transient gene expression in cell engineering and therapeutic applications. An important determinant of IVT mRNA potency is the 3′ polyadenosine (poly(A tail, the length of which correlates with translational efficiency. However, present methods for generation of IVT mRNA rely on templates derived from circular plasmids or PCR products, in which homopolymeric tracts are unstable, thus limiting encoded poly(A tail lengths to ≃120 base pairs (bp. Here, we have developed a novel method for generation of extended poly(A tracts using a previously described linear plasmid system, pJazz. We find that linear plasmids can successfully propagate poly(A tracts up to ≃500 bp in length for IVT mRNA production. We then modified pJazz by removing extraneous restriction sites, adding a T7 promoter sequence upstream from an extended multiple cloning site, and adding a unique type-IIS restriction site downstream from the encoded poly(A tract to facilitate generation of IVT mRNA with precisely defined encoded poly(A tracts and 3′ termini. The resulting plasmid, designated pEVL, can be used to generate IVT mRNA with consistent defined lengths and terminal residue(s.

  17. Cloning and sequence analysis of cDNA coding for rat nucleolar protein C23

    International Nuclear Information System (INIS)

    Ghaffari, S.H.; Olson, M.O.J.

    1986-01-01

    Using synthetic oligonucleotides as primers and probes, the authors have isolated and sequenced cDNA clones encoding protein C23, a putative nucleolus organizer protein. Poly(A + ) RNA was isolated from rat Novikoff hepatoma cells and enriched in C23 mRNA by sucrose density gradient ultracentrifugation. Two deoxyoligonuleotides, a 48- and a 27-mer, were synthesized on the basis of amino acid sequence from the C-terminal half of protein C23 and cDNA sequence data from CHO cell protein. The 48-mer was used a primer for synthesis of cDNA which was then inserted into plasmid pUC9. Transformed bacterial colonies were screened by hybridization with 32 P labeled 27-mer. Two clones among 5000 gave a strong positive signal. Plasmid DNAs from these clones were purified and characterized by blotting and nucleotide sequence analysis. The length of C23 mRNA was estimated to be 3200 bases in a northern blot analysis. The sequence of a 267 b.p. insert shows high homology with the CHO cDNA with only 9 nucleotide differences and an identical amino acid sequence. These studies indicate that this region of the protein is highly conserved

  18. PACCMIT/PACCMIT-CDS: identifying microRNA targets in 3′ UTRs and coding sequences

    Science.gov (United States)

    Šulc, Miroslav; Marín, Ray M.; Robins, Harlan S.; Vaníček, Jiří

    2015-01-01

    The purpose of the proposed web server, publicly available at http://paccmit.epfl.ch, is to provide a user-friendly interface to two algorithms for predicting messenger RNA (mRNA) molecules regulated by microRNAs: (i) PACCMIT (Prediction of ACcessible and/or Conserved MIcroRNA Targets), which identifies primarily mRNA transcripts targeted in their 3′ untranslated regions (3′ UTRs), and (ii) PACCMIT-CDS, designed to find mRNAs targeted within their coding sequences (CDSs). While PACCMIT belongs among the accurate algorithms for predicting conserved microRNA targets in the 3′ UTRs, the main contribution of the web server is 2-fold: PACCMIT provides an accurate tool for predicting targets also of weakly conserved or non-conserved microRNAs, whereas PACCMIT-CDS addresses the lack of similar portals adapted specifically for targets in CDS. The web server asks the user for microRNAs and mRNAs to be analyzed, accesses the precomputed P-values for all microRNA–mRNA pairs from a database for all mRNAs and microRNAs in a given species, ranks the predicted microRNA–mRNA pairs, evaluates their significance according to the false discovery rate and finally displays the predictions in a tabular form. The results are also available for download in several standard formats. PMID:25948580

  19. Genomic analysis suggests that mRNA destabilization by the microprocessor is specialized for the auto-regulation of Dgcr8.

    Directory of Open Access Journals (Sweden)

    Archana Shenoy

    2009-09-01

    Full Text Available The Microprocessor, containing the RNA binding protein Dgcr8 and RNase III enzyme Drosha, is responsible for processing primary microRNAs to precursor microRNAs. The Microprocessor regulates its own levels by cleaving hairpins in the 5'UTR and coding region of the Dgcr8 mRNA, thereby destabilizing the mature transcript.To determine whether the Microprocessor has a broader role in directly regulating other coding mRNA levels, we integrated results from expression profiling and ultra high-throughput deep sequencing of small RNAs. Expression analysis of mRNAs in wild-type, Dgcr8 knockout, and Dicer knockout mouse embryonic stem (ES cells uncovered mRNAs that were specifically upregulated in the Dgcr8 null background. A number of these transcripts had evolutionarily conserved predicted hairpin targets for the Microprocessor. However, analysis of deep sequencing data of 18 to 200nt small RNAs in mouse ES, HeLa, and HepG2 indicates that exonic sequence reads that map in a pattern consistent with Microprocessor activity are unique to Dgcr8.We conclude that the Microprocessor's role in directly destabilizing coding mRNAs is likely specifically targeted to Dgcr8 itself, suggesting a specialized cellular mechanism for gene auto-regulation.

  20. Isolation of full-length putative rat lysophospholipase cDNA using improved methods for mRNA isolation and cDNA cloning

    International Nuclear Information System (INIS)

    Han, J.H.; Stratowa, C.; Rutter, W.J.

    1987-01-01

    The authors have cloned a full-length putative rat pancreatic lysophospholipase cDNA by an improved mRNA isolation method and cDNA cloning strategy using [ 32 P]-labelled nucleotides. These new methods allow the construction of a cDNA library from the adult rat pancreas in which the majority of recombinant clones contained complete sequences for the corresponding mRNAs. A previously recognized but unidentified long and relatively rare cDNA clone containing the entire sequence from the cap site at the 5' end to the poly(A) tail at the 3' end of the mRNA was isolated by single-step screening of the library. The size, amino acid composition, and the activity of the protein expressed in heterologous cells strongly suggest this mRNA codes for lysophospholipase

  1. Deep Sequencing Reveals Uncharted Isoform Heterogeneity of the Protein-Coding Transcriptome in Cerebral Ischemia.

    Science.gov (United States)

    Bhattarai, Sunil; Aly, Ahmed; Garcia, Kristy; Ruiz, Diandra; Pontarelli, Fabrizio; Dharap, Ashutosh

    2018-06-03

    Gene expression in cerebral ischemia has been a subject of intense investigations for several years. Studies utilizing probe-based high-throughput methodologies such as microarrays have contributed significantly to our existing knowledge but lacked the capacity to dissect the transcriptome in detail. Genome-wide RNA-sequencing (RNA-seq) enables comprehensive examinations of transcriptomes for attributes such as strandedness, alternative splicing, alternative transcription start/stop sites, and sequence composition, thus providing a very detailed account of gene expression. Leveraging this capability, we conducted an in-depth, genome-wide evaluation of the protein-coding transcriptome of the adult mouse cortex after transient focal ischemia at 6, 12, or 24 h of reperfusion using RNA-seq. We identified a total of 1007 transcripts at 6 h, 1878 transcripts at 12 h, and 1618 transcripts at 24 h of reperfusion that were significantly altered as compared to sham controls. With isoform-level resolution, we identified 23 splice variants arising from 23 genes that were novel mRNA isoforms. For a subset of genes, we detected reperfusion time-point-dependent splice isoform switching, indicating an expression and/or functional switch for these genes. Finally, for 286 genes across all three reperfusion time-points, we discovered multiple, distinct, simultaneously expressed and differentially altered isoforms per gene that were generated via alternative transcription start/stop sites. Of these, 165 isoforms derived from 109 genes were novel mRNAs. Together, our data unravel the protein-coding transcriptome of the cerebral cortex at an unprecedented depth to provide several new insights into the flexibility and complexity of stroke-related gene transcription and transcript organization.

  2. Cloning of cDNA sequences of a progestin-regulated mRNA from MCF7 human breast cancer cells

    Energy Technology Data Exchange (ETDEWEB)

    Chalbos, D; Westley, B; Alibert, C; Rochefort, H

    1986-01-24

    A cDNA clone corresponding to an mRNA regulated by the progestin R5020, has been isolated by differential screening of a cDNA library from the MCF7 breast cancer cell line, which contains estrogen and progesterone receptors. This probe hybridized with a single species of poly A + RNA of 8-kb molecular weight as shown by Northern blot analysis and could also be used to total RNA preparation. This recombinant cone hybridized specifically to an mRNA coding for a 250,000 daltons protein when translated in vitro. This protein was identical to the 250 kDa progestin-regulated protein that the authors previously described as shown by immunoprecipitation with specific rabbit polyclonal antibodies. Dose-response curve and specificity studies show that the accumulation of the Pg8 mRNA and that of the 250-kDa protein was increased by 5 to 30-fold following progestin treatment and that this effect was mediated by the progesterone receptor. Time course of induction indicated that the accumulation of mRNA was rapid and preceded that of the protein. This is the first report on a cloned cDNA probe of progestin-regulated mRNA in human cell lines.

  3. Staphylococcus aureus RNAIII binds to two distant regions of coa mRNA to arrest translation and promote mRNA degradation.

    Directory of Open Access Journals (Sweden)

    Clément Chevalier

    2010-03-01

    Full Text Available Staphylococcus aureus RNAIII is the intracellular effector of the quorum sensing system that temporally controls a large number of virulence factors including exoproteins and cell-wall-associated proteins. Staphylocoagulase is one major virulence factor, which promotes clotting of human plasma. Like the major cell surface protein A, the expression of staphylocoagulase is strongly repressed by the quorum sensing system at the post-exponential growth phase. Here we used a combination of approaches in vivo and in vitro to analyze the mechanism used by RNAIII to regulate the expression of staphylocoagulase. Our data show that RNAIII represses the synthesis of the protein through a direct binding with the mRNA. Structure mapping shows that two distant regions of RNAIII interact with coa mRNA and that the mRNA harbors a conserved signature as found in other RNAIII-target mRNAs. The resulting complex is composed of an imperfect duplex masking the Shine-Dalgarno sequence of coa mRNA and of a loop-loop interaction occurring downstream in the coding region. The imperfect duplex is sufficient to prevent the formation of the ribosomal initiation complex and to repress the expression of a reporter gene in vivo. In addition, the double-strand-specific endoribonuclease III cleaves the two regions of the mRNA bound to RNAIII that may contribute to the degradation of the repressed mRNA. This study validates another direct target of RNAIII that plays a role in virulence. It also illustrates the diversity of RNAIII-mRNA topologies and how these multiple RNAIII-mRNA interactions would mediate virulence regulation.

  4. Sub-grouping of Plasmodium falciparum 3D7 var genes based on sequence analysis of coding and non-coding regions

    DEFF Research Database (Denmark)

    Lavstsen, Thomas; Salanti, Ali; Jensen, Anja T R

    2003-01-01

    and organization of the 3D7 PfEMP1 repertoire was investigated on the basis of the complete genome sequence. METHODS: Using two tree-building methods we analysed the coding and non-coding sequences of 3D7 var and rif genes as well as var genes of other parasite strains. RESULTS: var genes can be sub...

  5. Blackout sequence modeling for Atucha-I with MARCH3 code

    International Nuclear Information System (INIS)

    Baron, J.; Bastianelli, B.

    1997-01-01

    The modeling of a blackout sequence in Atucha I nuclear power plant is presented in this paper, as a preliminary phase for a level II probabilistic safety assessment. Such sequence is analyzed with the code MARCH3 from STCP (Source Term Code Package), based on a specific model developed for Atucha, that takes into accounts it peculiarities. The analysis includes all the severe accident phases, from the initial transient (loss of heat sink), loss of coolant through the safety valves, core uncovered, heatup, metal-water reaction, melting and relocation, heatup and failure of the pressure vessel, core-concrete interaction in the reactor cavity, heatup and failure of the containment building (multi-compartmented) due to quasi-static overpressurization. The results obtained permit to visualize the time sequence of these events, as well as provide the basis for source term studies. (author) [es

  6. Isolation and characterization of human glycophorin A cDNA clones by a synthetic oligonucleotide approach: nucleotide sequence and mRNA structure

    International Nuclear Information System (INIS)

    Siebert, P.D.; Fukuda, M.

    1986-01-01

    In an effort to understand the relationships among and the regulation of human glycophorins, the authors have isolated and characterized several glycophorin A-specific cDNA clones obtained from a human erythroleukemic K562 cell cDNA library. This was accomplished by using mixed synthetic oligonucleotides, corresponding to various regions of the known amino acid sequence, to prime the synthesis of the cDNA as well as to screen the cDNA library. They also used synthetic oligonucleotides to sequence the largest of the glycophorin cDNAs. The nucleotide sequence obtained suggests the presence of a potential leader peptide, consistent with the membrane localization of this glycoprotein. Examination of the structure of glycophorin mRNA by blot hybridization revealed the existence of several electrophoretically distinct mRNAs numbering three or four, depending on the size of the glycophorin cDNA used as a hybridization probe. The smaller cDNA hybridized to three mRNAs of approximately 2.8, 1.7, and 1.0 kilobases. In contrast, the larger cDNA hybridized to an additional mRNA of approximately 0.6 kilobases. Further examination of the relationships between these multiple mRNAs by blot hybridization was conducted with the use of exact-sequence oligonucleotide probes constructed from various regions of the cDNA representing portions of the amino acid sequence of glycophorin A with or without known homology with glycophorin B. In total, the results obtained are consistent with the hypothesis that the three larger mRNAs represent glycophorin A gene transcripts and that the smallest (0.6 kilobase) mRNA may be specific for glycophorin B

  7. Identification of multiple mRNA and DNA sequences from small tissue samples isolated by laser-assisted microdissection.

    Science.gov (United States)

    Bernsen, M R; Dijkman, H B; de Vries, E; Figdor, C G; Ruiter, D J; Adema, G J; van Muijen, G N

    1998-10-01

    Molecular analysis of small tissue samples has become increasingly important in biomedical studies. Using a laser dissection microscope and modified nucleic acid isolation protocols, we demonstrate that multiple mRNA as well as DNA sequences can be identified from a single-cell sample. In addition, we show that the specificity of procurement of tissue samples is not compromised by smear contamination resulting from scraping of the microtome knife during sectioning of lesions. The procedures described herein thus allow for efficient RT-PCR or PCR analysis of multiple nucleic acid sequences from small tissue samples obtained by laser-assisted microdissection.

  8. The "periodic table" of the genetic code: A new way to look at the code and the decoding process.

    Science.gov (United States)

    Komar, Anton A

    2016-01-01

    Henri Grosjean and Eric Westhof recently presented an information-rich, alternative view of the genetic code, which takes into account current knowledge of the decoding process, including the complex nature of interactions between mRNA, tRNA and rRNA that take place during protein synthesis on the ribosome, and it also better reflects the evolution of the code. The new asymmetrical circular genetic code has a number of advantages over the traditional codon table and the previous circular diagrams (with a symmetrical/clockwise arrangement of the U, C, A, G bases). Most importantly, all sequence co-variances can be visualized and explained based on the internal logic of the thermodynamics of codon-anticodon interactions.

  9. Cytoplasmic protein binding to highly conserved sequences in the 3' untranslated region of mouse protamine 2 mRNA, a translationally regulated transcript of male germ cells

    International Nuclear Information System (INIS)

    Kwon, Y.K.; Hecht, N.B.

    1991-01-01

    The expression of the protamines, the predominant nuclear proteins of mammalian spermatozoa, is regulated translationally during male germ-cell development. The 3' untranslated region (UTR) of protamine 1 mRNA has been reported to control its time of translation. To understand the mechanisms controlling translation of the protamine mRNAs, we have sought to identify cis elements of the 3' UTR of protamine 2 mRNA that are recognized by cytoplasmic factors. From gel retardation assays, two sequence elements are shown to form specific RNA-protein complexes. Protein binding sites of the two complexes were determined by RNase T1 mapping, by blocking the putative binding sites with antisense oligonucleotides, and by competition assays. The sequences of these elements, located between nucleotides + 537 and + 572 in protamine 2 mRNA, are highly conserved among postmeiotic translationally regulated nuclear proteins of the mammalian testis. Two closely linked protein binding sites were detected. UV-crosslinking studies revealed that a protein of about 18 kDa binds to one of the conserved sequences. These data demonstrate specific protein binding to a highly conserved 3' UTR of translationally regulated testicular mRNA

  10. Systematic analysis of coding and noncoding DNA sequences using methods of statistical linguistics

    Science.gov (United States)

    Mantegna, R. N.; Buldyrev, S. V.; Goldberger, A. L.; Havlin, S.; Peng, C. K.; Simons, M.; Stanley, H. E.

    1995-01-01

    We compare the statistical properties of coding and noncoding regions in eukaryotic and viral DNA sequences by adapting two tests developed for the analysis of natural languages and symbolic sequences. The data set comprises all 30 sequences of length above 50 000 base pairs in GenBank Release No. 81.0, as well as the recently published sequences of C. elegans chromosome III (2.2 Mbp) and yeast chromosome XI (661 Kbp). We find that for the three chromosomes we studied the statistical properties of noncoding regions appear to be closer to those observed in natural languages than those of coding regions. In particular, (i) a n-tuple Zipf analysis of noncoding regions reveals a regime close to power-law behavior while the coding regions show logarithmic behavior over a wide interval, while (ii) an n-gram entropy measurement shows that the noncoding regions have a lower n-gram entropy (and hence a larger "n-gram redundancy") than the coding regions. In contrast to the three chromosomes, we find that for vertebrates such as primates and rodents and for viral DNA, the difference between the statistical properties of coding and noncoding regions is not pronounced and therefore the results of the analyses of the investigated sequences are less conclusive. After noting the intrinsic limitations of the n-gram redundancy analysis, we also briefly discuss the failure of the zeroth- and first-order Markovian models or simple nucleotide repeats to account fully for these "linguistic" features of DNA. Finally, we emphasize that our results by no means prove the existence of a "language" in noncoding DNA.

  11. Coding patient emotional cues and concerns in medical consultations: the Verona coding definitions of emotional sequences (VR-CoDES).

    NARCIS (Netherlands)

    Zimmermann, C.; Piccolo, L. del; Bensing, J.; Bergvik, S.; Haes, H. de; Eide, H.; Fletcher, I.; Goss, C.; Heaven, C.; Humphris, G.; Young-Mi, K.; Langewitz, W.; Meeuwesen, L.; Nuebling, M.; Rimondini, M.; Salmon, P.; Dulmen, S. van; Wissow, L.; Zandbelt, L.; Finset, A.

    2011-01-01

    Objective: To present the Verona Coding Definitions of Emotional Sequences (VR-CoDES CC), a consensus based system for coding patient expressions of emotional distress in medical consultations, defined as Cues or Concerns. Methods: The system was developed by an international group of communication

  12. Prevalence of transcription promoters within archaeal operons and coding sequences.

    Science.gov (United States)

    Koide, Tie; Reiss, David J; Bare, J Christopher; Pang, Wyming Lee; Facciotti, Marc T; Schmid, Amy K; Pan, Min; Marzolf, Bruz; Van, Phu T; Lo, Fang-Yin; Pratap, Abhishek; Deutsch, Eric W; Peterson, Amelia; Martin, Dan; Baliga, Nitin S

    2009-01-01

    Despite the knowledge of complex prokaryotic-transcription mechanisms, generalized rules, such as the simplified organization of genes into operons with well-defined promoters and terminators, have had a significant role in systems analysis of regulatory logic in both bacteria and archaea. Here, we have investigated the prevalence of alternate regulatory mechanisms through genome-wide characterization of transcript structures of approximately 64% of all genes, including putative non-coding RNAs in Halobacterium salinarum NRC-1. Our integrative analysis of transcriptome dynamics and protein-DNA interaction data sets showed widespread environment-dependent modulation of operon architectures, transcription initiation and termination inside coding sequences, and extensive overlap in 3' ends of transcripts for many convergently transcribed genes. A significant fraction of these alternate transcriptional events correlate to binding locations of 11 transcription factors and regulators (TFs) inside operons and annotated genes-events usually considered spurious or non-functional. Using experimental validation, we illustrate the prevalence of overlapping genomic signals in archaeal transcription, casting doubt on the general perception of rigid boundaries between coding sequences and regulatory elements.

  13. Cloning and tissue distribution of rat hear fatty acid binding protein mRNA: identical forms in heart and skeletal muscle

    International Nuclear Information System (INIS)

    Claffey, K.P.; Herrera, V.L.; Brecher, P.; Ruiz-Opazo, N.

    1987-01-01

    A fatty acid binding protein (FABP) as been identified and characterized in rat heart, but the function and regulation of this protein are unclear. In this study the cDNA for rat heart FABP was cloned from a λ gt11 library. Sequencing of the cDNA showed an open reading frame coding for a protein with 133 amino acids and a calculated size of 14,776 daltons. Several differences were found between the sequence determined from the cDNA and that reported previously by protein sequencing techniques. Northern blot analysis using rat heart FABP cDNA as a probe established the presence of an abundant mRNA in rat heart about 0.85 kilobases in length. This mRNA was detected, but was not abundant, in fetal heart tissue. Tissue distribution studies showed a similar mRNA species in red, but not white, skeletal muscle. In general, the mRNA tissue distribution was similar to that of the protein detected by Western immunoblot analysis, suggesting that heart FABP expression may be regulated at the transcriptional level. S1 nuclease mapping studies confirmed that the mRNA hybridized to rat heart FABP cDNA was identical in heart and red skeletal muscle throughout the entire open reading frame. The structural differences between heart FABP and other members of this multigene family may be related to the functional requirements of oxidative muscle for fatty acids as a fuel source

  14. Cloning and tissue distribution of rat hear fatty acid binding protein mRNA: identical forms in heart and skeletal muscle

    Energy Technology Data Exchange (ETDEWEB)

    Claffey, K.P.; Herrera, V.L.; Brecher, P.; Ruiz-Opazo, N.

    1987-12-01

    A fatty acid binding protein (FABP) as been identified and characterized in rat heart, but the function and regulation of this protein are unclear. In this study the cDNA for rat heart FABP was cloned from a lambda gt11 library. Sequencing of the cDNA showed an open reading frame coding for a protein with 133 amino acids and a calculated size of 14,776 daltons. Several differences were found between the sequence determined from the cDNA and that reported previously by protein sequencing techniques. Northern blot analysis using rat heart FABP cDNA as a probe established the presence of an abundant mRNA in rat heart about 0.85 kilobases in length. This mRNA was detected, but was not abundant, in fetal heart tissue. Tissue distribution studies showed a similar mRNA species in red, but not white, skeletal muscle. In general, the mRNA tissue distribution was similar to that of the protein detected by Western immunoblot analysis, suggesting that heart FABP expression may be regulated at the transcriptional level. S1 nuclease mapping studies confirmed that the mRNA hybridized to rat heart FABP cDNA was identical in heart and red skeletal muscle throughout the entire open reading frame. The structural differences between heart FABP and other members of this multigene family may be related to the functional requirements of oxidative muscle for fatty acids as a fuel source.

  15. Characterization of a major late herpes simplex virus type 1 mRNA.

    Science.gov (United States)

    Costa, R H; Devi, B G; Anderson, K P; Gaylord, B H; Wagner, E K

    1981-05-01

    A major, late 6-kilobase (6-kb) mRNa mapping in the large unique region of herpes simplex virus type 1 (HSV-1) was characterized by using two recombinant DNA clones, one containing EcoRI fragment G (0.190 to 0.30 map units) in lambda. WES.B (L. Enquist, M. Madden, P. Schiop-Stansly, and G. Vandl Woude, Science 203:541-544, 1979) and one containing HindIII fragment J (0.181 to 0.259 map units) in pBR322. This 6-kb mRNA had its 3' end to the left of 0.231 on the prototypical arrangement of the HSV-1 genome and was transcribed from right to left. It was bounded on both sides by regions containing a large number of distinct mRNA species, and its 3' end was partially colinear with a 1.5-kb mRNA which encoded a 35,000-dalton polypeptide. The 6-kb mRNA encoded a 155,000-dalton polypeptide which was shown to be the only one of this size detectable by hybrid-arrested translation encoded by late polyadenylated polyribosomal RNA. The S1 nuclease mapping experiments indicated that there were no introns in the coding sequence for this mRNA and that its 3' end mapped approximately 800 nucleotides to the left of the BglII site at 0.231, whereas its 5' end extended very close to the BamHI site at 0.266.

  16. Viperin mRNA is a novel target for the human RNase MRP/RNase P endoribonuclease.

    Science.gov (United States)

    Mattijssen, Sandy; Hinson, Ella R; Onnekink, Carla; Hermanns, Pia; Zabel, Bernhard; Cresswell, Peter; Pruijn, Ger J M

    2011-07-01

    RNase MRP is a conserved endoribonuclease, in humans consisting of a 267-nucleotide RNA associated with 7-10 proteins. Mutations in its RNA component lead to several autosomal recessive skeletal dysplasias, including cartilage-hair hypoplasia (CHH). Because the known substrates of mammalian RNase MRP, pre-ribosomal RNA, and RNA involved in mitochondrial DNA replication are not likely involved in CHH, we analyzed the effects of RNase MRP (and the structurally related RNase P) depletion on mRNAs using DNA microarrays. We confirmed the upregulation of the interferon-inducible viperin mRNA by RNAi experiments and this appeared to be independent of the interferon response. We detected two cleavage sites for RNase MRP/RNase P in the coding sequence of viperin mRNA. This is the first study providing direct evidence for the cleavage of a mRNA by RNase MRP/RNase P in human cells. Implications for the involvement in the pathophysiology of CHH are discussed.

  17. Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

    KAUST Repository

    Zhang, Zhang

    2010-11-08

    Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution and a large number of publically-available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge.Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences.Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms.Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.

  18. Modeling compositional dynamics based on GC and purine contents of protein-coding sequences

    KAUST Repository

    Zhang, Zhang; Yu, Jun

    2010-01-01

    Background: Understanding the compositional dynamics of genomes and their coding sequences is of great significance in gaining clues into molecular evolution and a large number of publically-available genome sequences have allowed us to quantitatively predict deviations of empirical data from their theoretical counterparts. However, the quantification of theoretical compositional variations for a wide diversity of genomes remains a major challenge.Results: To model the compositional dynamics of protein-coding sequences, we propose two simple models that take into account both mutation and selection effects, which act differently at the three codon positions, and use both GC and purine contents as compositional parameters. The two models concern the theoretical composition of nucleotides, codons, and amino acids, with no prerequisite of homologous sequences or their alignments. We evaluated the two models by quantifying theoretical compositions of a large collection of protein-coding sequences (including 46 of Archaea, 686 of Bacteria, and 826 of Eukarya), yielding consistent theoretical compositions across all the collected sequences.Conclusions: We show that the compositions of nucleotides, codons, and amino acids are largely determined by both GC and purine contents and suggest that deviations of the observed from the expected compositions may reflect compositional signatures that arise from a complex interplay between mutation and selection via DNA replication and repair mechanisms.Reviewers: This article was reviewed by Zhaolei Zhang (nominated by Mark Gerstein), Guruprasad Ananda (nominated by Kateryna Makova), and Daniel Haft. 2010 Zhang and Yu; licensee BioMed Central Ltd.

  19. Non-codingRNA sequence variations in human chronic lymphocytic leukemia and colorectal cancer.

    Science.gov (United States)

    Wojcik, Sylwia E; Rossi, Simona; Shimizu, Masayoshi; Nicoloso, Milena S; Cimmino, Amelia; Alder, Hansjuerg; Herlea, Vlad; Rassenti, Laura Z; Rai, Kanti R; Kipps, Thomas J; Keating, Michael J; Croce, Carlo M; Calin, George A

    2010-02-01

    Cancer is a genetic disease in which the interplay between alterations in protein-coding genes and non-coding RNAs (ncRNAs) plays a fundamental role. In recent years, the full coding component of the human genome was sequenced in various cancers, whereas such attempts related to ncRNAs are still fragmentary. We screened genomic DNAs for sequence variations in 148 microRNAs (miRNAs) and ultraconserved regions (UCRs) loci in patients with chronic lymphocytic leukemia (CLL) or colorectal cancer (CRC) by Sanger technique and further tried to elucidate the functional consequences of some of these variations. We found sequence variations in miRNAs in both sporadic and familial CLL cases, mutations of UCRs in CLLs and CRCs and, in certain instances, detected functional effects of these variations. Furthermore, by integrating our data with previously published data on miRNA sequence variations, we have created a catalog of DNA sequence variations in miRNAs/ultraconserved genes in human cancers. These findings argue that ncRNAs are targeted by both germ line and somatic mutations as well as by single-nucleotide polymorphisms with functional significance for human tumorigenesis. Sequence variations in ncRNA loci are frequent and some have functional and biological significance. Such information can be exploited to further investigate on a genome-wide scale the frequency of genetic variations in ncRNAs and their functional meaning, as well as for the development of new diagnostic and prognostic markers for leukemias and carcinomas.

  20. Optical orthogonal code-division multiple-access system - Part 2: Multibits/sequence-period OOCDMA

    Science.gov (United States)

    Kwon, Hyuck M.

    1994-08-01

    In a recently proposed optical orthogonal code division multiple-access (OOCDMA) system, one bit of user's data is transmitted per sequence-period, and a threshold is employed for the final bit decision. In this paper, a system that can transmit multibits per sequence-period is introduced, and avalanche photodiode (APD) noise, thermal noise, and interference, are included. This system, derived by exploiting orthogonal properties of the OOCDMA code sequence and using a maximum search (instead of a threshold) in the final decision, is log(sub 2) F times higher in throughput, where F is sequence-period. For example, four orders of magnitude are better in bit error probability at - 56 dBW received laser power, with F = 1000 chips, 10 'marks' in a sequence, and 10 users of 30 Mb/s data rate for one-bit/sequence-period and 270 Mb/s data rate for multibits/sequence-period system. Furthermore, an exact analysis is performed for the log(sub 2)F bits/sequence-period system with a hard-limiter placed before the receiver, and its performance is compared to the performance without hard-limiter, for the chip-synchronous case. The improvement from using a hard-limiter is significant in the log(sub 2)F bits/sequence-period OCCDMA system.

  1. Sequence and expression analyses of porcine ISG15 and ISG43 genes.

    Science.gov (United States)

    Huang, Jiangnan; Zhao, Shuhong; Zhu, Mengjin; Wu, Zhenfang; Yu, Mei

    2009-08-01

    The coding sequences of porcine interferon-stimulated gene 15 (ISG15) and the interferon-stimulated gene (ISG43) were cloned from swine spleen mRNA. The amino acid sequences deduced from porcine ISG15 and ISG43 genes coding sequence shared 24-75% and 29-83% similarity with ISG15s and ISG43s from other vertebrates, respectively. Structural analyses revealed that porcine ISG15 comprises two ubiquitin homologues motifs (UBQ) domain and a conserved C-terminal LRLRGG conjugating motif. Porcine ISG43 contains an ubiquitin-processing proteases-like domain. Phylogenetic analyses showed that porcine ISG15 and ISG43 were mostly related to rat ISG15 and cattle ISG43, respectively. Using quantitative real-time PCR assay, significant increased expression levels of porcine ISG15 and ISG43 genes were detected in porcine kidney endothelial cells (PK15) cells treated with poly I:C. We also observed the enhanced mRNA expression of three members of dsRNA pattern-recognition receptors (PRR), TLR3, DDX58 and IFIH1, which have been reported to act as critical receptors in inducing the mRNA expression of ISG15 and ISG43 genes. However, we did not detect any induced mRNA expression of IFNalpha and IFNbeta, suggesting that transcriptional activations of ISG15 and ISG43 were mediated through IFN-independent signaling pathway in the poly I:C treated PK15 cells. Association analyses in a Landrace pig population revealed that ISG15 c.347T>C (BstUI) polymorphism and the ISG43 c.953T>G (BccI) polymorphism were significantly associated with hematological parameters and immune-related traits.

  2. SEAPATH: A microcomputer code for evaluating physical security effectiveness using adversary sequence diagrams

    International Nuclear Information System (INIS)

    Darby, J.L.

    1986-01-01

    The Adversary Sequence Diagram (ASD) concept was developed by Sandia National Laboratories (SNL) to examine physical security system effectiveness. Sandia also developed a mainframe computer code, PANL, to analyze the ASD. The authors have developed a microcomputer code, SEAPATH, which also analyzes ASD's. The Authors are supporting SNL in software development of the SAVI code; SAVI utilizes the SEAPATH algorithm to identify and quantify paths

  3. The Evolution of Bony Vertebrate Enhancers at Odds with Their Coding Sequence Landscape.

    Science.gov (United States)

    Yousaf, Aisha; Sohail Raza, Muhammad; Ali Abbasi, Amir

    2015-08-06

    Enhancers lie at the heart of transcriptional and developmental gene regulation. Therefore, changes in enhancer sequences usually disrupt the target gene expression and result in disease phenotypes. Despite the well-established role of enhancers in development and disease, evolutionary sequence studies are lacking. The current study attempts to unravel the puzzle of bony vertebrates' conserved noncoding elements (CNE) enhancer evolution. Bayesian phylogenetics of enhancer sequences spotlights promising interordinal relationships among placental mammals, proposing a closer relationship between humans and laurasiatherians while placing rodents at the basal position. Clock-based estimates of enhancer evolution provided a dynamic picture of interspecific rate changes across the bony vertebrate lineage. Moreover, coelacanth in the study augmented our appreciation of the vertebrate cis-regulatory evolution during water-land transition. Intriguingly, we observed a pronounced upsurge in enhancer evolution in land-dwelling vertebrates. These novel findings triggered us to further investigate the evolutionary trend of coding as well as CNE nonenhancer repertoires, to highlight the relative evolutionary dynamics of diverse genomic landscapes. Surprisingly, the evolutionary rates of enhancer sequences were clearly at odds with those of the coding and the CNE nonenhancer sequences during vertebrate adaptation to land, with land vertebrates exhibiting significantly reduced rates of coding sequence evolution in comparison to their fast evolving regulatory landscape. The observed variation in tetrapod cis-regulatory elements caused the fine-tuning of associated gene regulatory networks. Therefore, the increased evolutionary rate of tetrapods' enhancer sequences might be responsible for the variation in developmental regulatory circuits during the process of vertebrate adaptation to land. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for

  4. SRComp: short read sequence compression using burstsort and Elias omega coding.

    Directory of Open Access Journals (Sweden)

    Jeremy John Selva

    Full Text Available Next-generation sequencing (NGS technologies permit the rapid production of vast amounts of data at low cost. Economical data storage and transmission hence becomes an increasingly important challenge for NGS experiments. In this paper, we introduce a new non-reference based read sequence compression tool called SRComp. It works by first employing a fast string-sorting algorithm called burstsort to sort read sequences in lexicographical order and then Elias omega-based integer coding to encode the sorted read sequences. SRComp has been benchmarked on four large NGS datasets, where experimental results show that it can run 5-35 times faster than current state-of-the-art read sequence compression tools such as BEETL and SCALCE, while retaining comparable compression efficiency for large collections of short read sequences. SRComp is a read sequence compression tool that is particularly valuable in certain applications where compression time is of major concern.

  5. Interactions between the HIV-1 Unspliced mRNA and Host mRNA Decay Machineries

    Directory of Open Access Journals (Sweden)

    Daniela Toro-Ascuy

    2016-11-01

    Full Text Available The human immunodeficiency virus type-1 (HIV-1 unspliced transcript is used both as mRNA for the synthesis of structural proteins and as the packaged genome. Given the presence of retained introns and instability AU-rich sequences, this viral transcript is normally retained and degraded in the nucleus of host cells unless the viral protein REV is present. As such, the stability of the HIV-1 unspliced mRNA must be particularly controlled in the nucleus and the cytoplasm in order to ensure proper levels of this viral mRNA for translation and viral particle formation. During its journey, the HIV-1 unspliced mRNA assembles into highly specific messenger ribonucleoproteins (mRNPs containing many different host proteins, amongst which are well-known regulators of cytoplasmic mRNA decay pathways such as up-frameshift suppressor 1 homolog (UPF1, Staufen double-stranded RNA binding protein 1/2 (STAU1/2, or components of miRNA-induced silencing complex (miRISC and processing bodies (PBs. More recently, the HIV-1 unspliced mRNA was shown to contain N6-methyladenosine (m6A, allowing the recruitment of YTH N6-methyladenosine RNA binding protein 2 (YTHDF2, an m6A reader host protein involved in mRNA decay. Interestingly, these host proteins involved in mRNA decay were shown to play positive roles in viral gene expression and viral particle assembly, suggesting that HIV-1 interacts with mRNA decay components to successfully accomplish viral replication. This review summarizes the state of the art in terms of the interactions between HIV-1 unspliced mRNA and components of different host mRNA decay machineries.

  6. Increased mRNA expression of a laminin-binding protein in human colon carcinoma: Complete sequence of a full-length cDNA encoding the protein

    International Nuclear Information System (INIS)

    Yow, Hsiukang; Wong, Jau Min; Chen, Hai Shiene; Lee, C.; Steele, G.D. Jr.; Chen, Lanbo

    1988-01-01

    Reliable markers to distinguish human colon carcinoma from normal colonic epithelium are needed particularly for poorly differentiated tumors where no useful marker is currently available. To search for markers the authors constructed cDNA libraries from human colon carcinoma cell lines and screened for clones that hybridize to a greater degree with mRNAs of colon carcinomas than with their normal counterparts. Here they report one such cDNA clone that hybridizes with a 1.2-kilobase (kb) mRNA, the level of which is ∼9-fold greater in colon carcinoma than in adjacent normal colonic epithelium. Blot hybridization of total RNA from a variety of human colon carcinoma cell lines shows that the level of this 1.2-kb mRNA in poorly differentiated colon carcinomas is as high as or higher than that in well-differentiated carcinomas. Molecular cloning and complete sequencing of cDNA corresponding to the full-length open reading frame of this 1.2-kb mRNA unexpectedly show it to contain all the partial cDNA sequence encoding 135 amino acid residues previously reported for a human laminin receptor. The deduced amino acid sequence suggests that this putative laminin-binding protein from human colon carcinomas consists of 295 amino acid residues with interesting features. There is an unusual C-terminal 70-amino acid segment, which is trypsin-resistant and highly negatively charged

  7. Detection and quantitative analysis of actin mRNA by in situ hybridization with an oligodeoxynucleotide probe

    International Nuclear Information System (INIS)

    Taneja, K.; Singer, R.

    1987-01-01

    In situ hybridization is a useful method for localizing specific nucleic acid sequences intracellularly and for studying regulation of gene expression. Recently synthetic oligonucleotides have been successfully used as probes in this technique. Since they can be made easily to specific nucleic acid regions, they may be the best approach for analysis of a gene family of highly conserved sequences. They have analyzed these probes for the development of an in situ hybridization method. Oligonucleotides were made to different regions of chick beta-actin mRNA and used for detection of these sequences in a culture of chicken fibroblasts and myoblasts. They found that synthetic DNAs have different efficiencies of hybridization, indicating that not all target sequences are equivalent. They have investigated in detail a particular probe to the actin mRNA coding region and have optimized hybridization parameters. When hybridization was quantitated it was found that an oligonucleotide end labelled with 35 S or 32 P was capable of detecting several thousand messages per cell with a signal-to-noise ratio of 10:1. In situ hybridization confirmed the specificity of the hybridization as well as the background level. Increase in the number of oligonucleotides used should increase the signal-to-noise ratio-proportionately. Under particular circumstances the specificity of oligonucleotides make them an important reagent for in situ hybridization

  8. Multiple Access Interference Reduction Using Received Response Code Sequence for DS-CDMA UWB System

    Science.gov (United States)

    Toh, Keat Beng; Tachikawa, Shin'ichi

    This paper proposes a combination of novel Received Response (RR) sequence at the transmitter and a Matched Filter-RAKE (MF-RAKE) combining scheme receiver system for the Direct Sequence-Code Division Multiple Access Ultra Wideband (DS-CDMA UWB) multipath channel model. This paper also demonstrates the effectiveness of the RR sequence in Multiple Access Interference (MAI) reduction for the DS-CDMA UWB system. It suggests that by using conventional binary code sequence such as the M sequence or the Gold sequence, there is a possibility of generating extra MAI in the UWB system. Therefore, it is quite difficult to collect the energy efficiently although the RAKE reception method is applied at the receiver. The main purpose of the proposed system is to overcome the performance degradation for UWB transmission due to the occurrence of MAI during multiple accessing in the DS-CDMA UWB system. The proposed system improves the system performance by improving the RAKE reception performance using the RR sequence which can reduce the MAI effect significantly. Simulation results verify that significant improvement can be obtained by the proposed system in the UWB multipath channel models.

  9. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing.

    Science.gov (United States)

    Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li

    2010-08-01

    Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resolution and sensitivity afforded by RNA-Seq, we were able to identify a substantial number of novel transcripts, new exons, untranslated regions, alternative upstream initiation codons and upstream open reading frames, which provide remarkable insight into the A. oryzae transcriptome. We were also able to assess the alternative mRNA isoforms in A. oryzae and found a large number of genes undergoing alternative splicing. Many genes and pathways that might be involved in higher levels of protein production in solid-state culture than in liquid culture were identified by comparing gene expression levels between different cultures. Our analysis indicated that the transcriptome of A. oryzae is much more complex than previously anticipated, and these results may provide a blueprint for further study of the A. oryzae transcriptome.

  10. Identification of evolutionarily conserved non-AUG-initiated N-terminal extensions in human coding sequences.

    LENUS (Irish Health Repository)

    Ivanov, Ivaylo P

    2011-05-01

    In eukaryotes, it is generally assumed that translation initiation occurs at the AUG codon closest to the messenger RNA 5\\' cap. However, in certain cases, initiation can occur at codons differing from AUG by a single nucleotide, especially the codons CUG, UUG, GUG, ACG, AUA and AUU. While non-AUG initiation has been experimentally verified for a handful of human genes, the full extent to which this phenomenon is utilized--both for increased coding capacity and potentially also for novel regulatory mechanisms--remains unclear. To address this issue, and hence to improve the quality of existing coding sequence annotations, we developed a methodology based on phylogenetic analysis of predicted 5\\' untranslated regions from orthologous genes. We use evolutionary signatures of protein-coding sequences as an indicator of translation initiation upstream of annotated coding sequences. Our search identified novel conserved potential non-AUG-initiated N-terminal extensions in 42 human genes including VANGL2, FGFR1, KCNN4, TRPV6, HDGF, CITED2, EIF4G3 and NTF3, and also affirmed the conservation of known non-AUG-initiated extensions in 17 other genes. In several instances, we have been able to obtain independent experimental evidence of the expression of non-AUG-initiated products from the previously published literature and ribosome profiling data.

  11. Cloning and characterization of DNA complementary to the canine distemper virus mRNA encoding matrix, phosphoprotein, and nucleocapsid protein

    International Nuclear Information System (INIS)

    Rozenblatt, S.; Eizenberg, O.; Englund, G.; Bellini, W.J.

    1985-01-01

    Double-stranded cDNA synthesized from total polyadenylate-containing mRNA, extracted from monkey kidney cells infected with canine distemper virus (CDV), has been cloned into the PstI site of Escherichia coli plasmid pBR322. Clones containing canine distemper virus DNA were identified by hybridization to a canine distemper virus-specific, 32 P-labeled cDNA. Four specific clones containing different classes of sequences have been identified. The cloned plasmids contain inserts of 800 (clone 44-80), 960 (clone 74-16), 1700 (clone 364), and 950 (clone 40-9) base pairs. The sizes of the mRNA species complementary to these inserts are 1500, 1850, 1850 and 2500 nucleotides, respectively, as determined by the Northern technique. Three of the cloned DNA fragments were further identified as the reverse transcripts of the mRNA coding for the matrix, phosphoprotein, and nucleocapsid protein of CDV

  12. Cloning and characterization of DNA complementary to the canine distemper virus mRNA encoding matrix, phosphoprotein, and nucleocapsid protein

    Energy Technology Data Exchange (ETDEWEB)

    Rozenblatt, S.; Eizenberg, O.; Englund, G.; Bellini, W.J.

    1985-02-01

    Double-stranded cDNA synthesized from total polyadenylate-containing mRNA, extracted from monkey kidney cells infected with canine distemper virus (CDV), has been cloned into the PstI site of Escherichia coli plasmid pBR322. Clones containing canine distemper virus DNA were identified by hybridization to a canine distemper virus-specific, /sup 32/P-labeled cDNA. Four specific clones containing different classes of sequences have been identified. The cloned plasmids contain inserts of 800 (clone 44-80), 960 (clone 74-16), 1700 (clone 364), and 950 (clone 40-9) base pairs. The sizes of the mRNA species complementary to these inserts are 1500, 1850, 1850 and 2500 nucleotides, respectively, as determined by the Northern technique. Three of the cloned DNA fragments were further identified as the reverse transcripts of the mRNA coding for the matrix, phosphoprotein, and nucleocapsid protein of CDV.

  13. [Transposition errors during learning to reproduce a sequence by the right- and the left-hand movements: simulation of positional and movement coding].

    Science.gov (United States)

    Liakhovetskiĭ, V A; Bobrova, E V; Skopin, G N

    2012-01-01

    Transposition errors during the reproduction of a hand movement sequence make it possible to receive important information on the internal representation of this sequence in the motor working memory. Analysis of such errors showed that learning to reproduce sequences of the left-hand movements improves the system of positional coding (coding ofpositions), while learning of the right-hand movements improves the system of vector coding (coding of movements). Learning of the right-hand movements after the left-hand performance involved the system of positional coding "imposed" by the left hand. Learning of the left-hand movements after the right-hand performance activated the system of vector coding. Transposition errors during learning to reproduce movement sequences can be explained by neural network using either vector coding or both vector and positional coding.

  14. Analysis of the AD sequence in Zion plant using the March 1.1 code

    International Nuclear Information System (INIS)

    Oriolo, F.; Paci, S.

    1985-01-01

    The analyses of the AD sequences for the Zion power plant, made at the Pisa University, in the framework of the participation in the Source Tern Working Group. After a short description of the plant and the sequence under analysis, the model used for the reference computation and the results obtained using the March 1.1 code are shown. Together with the reference computation a series of parametric tests have been also made, concerning some input code variables, in order to ascertain their influence on the transient trend. The results of these analyses are shown in Appendix

  15. Sequence Coding and Search System for licensee event reports: coder's manual. Volume 4

    International Nuclear Information System (INIS)

    Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

    1985-04-01

    Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This four volume report documents and describes SCSS in detail. Volume 3 and 4 provide a technical processor, new to SCSS, the information and methodology necessary to capture descriptive data from the LER and to codify that data into a structured format and serve as reference material for the more experienced technical processor, and contains information that is essential for the more advanced user who needs to be familiar with the intricate coding techniques in order to retrieve specific details in a sequence. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 4

  16. Sequence Coding and Search System for licensee event reports: coder's manual. Volume 3

    International Nuclear Information System (INIS)

    Gallaher, R.B.; Guymon, R.H.; Mays, G.T.; Poore, W.P.; Cagle, R.J.; Harrington, K.H.; Johnson, M.P.

    1985-04-01

    Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This four volume report documents and describes SCSS in detail. Volumes 3 and 4 provide a technical processor, new to SCSS, the information and methodology necessary to capture descriptive data from the LER and to codify that data into a structured format and serve as reference material for the more experienced technical processor, and contains information is essential for the more advanced user who needs to be familiar with the intricate coding techniques in order to retrieve specific details in a sequence. This volume contains updated material through amendment 1 to revision 1 of the working version of ORNL/NSIC-223, Vol. 3

  17. Sequence Coding and Search System Backfit Quality Assurance Program Plan

    International Nuclear Information System (INIS)

    Lovell, C.J.; Stepina, P.L.

    1985-03-01

    The Sequence Coding and Search System is a computer-based encoding system for events described in Licensee Event Reports. This data system contains LERs from 1981 to present. Backfit of the data system to include LERs prior to 1981 is required. This report documents the Quality Assurance Program Plan that EG and G Idaho, Inc. will follow while encoding 1980 LERs

  18. Modelling of blackout sequence at Atucha-1 using the MARCH3 code

    International Nuclear Information System (INIS)

    Baron, J.; Bastianelli, B.

    1997-01-01

    This paper presents the modelling of a complete blackout at the Atucha-1 NPP as preliminary phase for a Level II safety probabilistic analysis. The MARCH3 code of the STCP (Source Term Code Package) is used, based on a plant model made in accordance with particularities of the plant design. The analysis covers all the severe accident phases. The results allow to view the time sequence of the events, and provide the basis for source term studies. (author). 6 refs., 2 figs

  19. Comparative analysis of transcriptomes in aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing

    Directory of Open Access Journals (Sweden)

    Taketo Okada

    2016-12-01

    Full Text Available Ephedra plants are taxonomically classified as gymnosperms, and are medicinally important as the botanical origin of crude drugs and as bioresources that contain pharmacologically active chemicals. Here we show a comparative analysis of the transcriptomes of aerial stems and roots of Ephedra sinica based on high-throughput mRNA sequencing by RNA-Seq. De novo assembly of short cDNA sequence reads generated 23,358, 13,373, and 28,579 contigs longer than 200 bases from aerial stems, roots, or both aerial stems and roots, respectively. The presumed functions encoded by these contig sequences were annotated by BLAST (blastx. Subsequently, these contigs were classified based on gene ontology slims, Enzyme Commission numbers, and the InterPro database. Furthermore, comparative gene expression analysis was performed between aerial stems and roots. These transcriptome analyses revealed differences and similarities between the transcriptomes of aerial stems and roots in E. sinica. Deep transcriptome sequencing of Ephedra should open the door to molecular biological studies based on the entire transcriptome, tissue- or organ-specific transcriptomes, or targeted genes of interest.

  20. Complete coding sequence of Zika virus from Martinique outbreak in 2015

    Directory of Open Access Journals (Sweden)

    G. Piorkowski

    2016-05-01

    Full Text Available Zika virus is an Aedes-borne Flavivirus causing fever, arthralgia, myalgia rash, associated with Guillain–Barré syndrome and suspected to induce microcephaly in the fetus. We report here the complete coding sequence of the first characterized Caribbean Zika virus strain, isolated from a patient from Martinique in December, 2015.

  1. mRNA secondary structure at start AUG codon is a key limiting factor for human protein expression in Escherichia coli

    International Nuclear Information System (INIS)

    Zhang Weici; Xiao Weihua; Wei Haiming; Zhang Jian; Tian Zhigang

    2006-01-01

    Codon usage and thermodynamic optimization of the 5'-end of mRNA have been applied to improve the efficiency of human protein production in Escherichia coli. However, high level expression of human protein in E. coli is still a challenge that virtually depends upon each individual target genes. Using human interleukin 10 (huIL-10) and interferon α (huIFN-α) coding sequences, we systematically analyzed the influence of several major factors on expression of human protein in E. coli. The results from huIL-10 and reinforced by huIFN-α showed that exposing AUG initiator codon from base-paired structure within mRNA itself significantly improved the translation of target protein, which resulted in a 10-fold higher protein expression than the wild-type genes. It was also noted that translation process was not affected by the retained short-range stem-loop structure at Shine-Dalgarno (SD) sequences. On the other hand, codon-optimized constructs of huIL-10 showed unimproved levels of protein expression, on the contrary, led to a remarkable RNA degradation. Our study demonstrates that exposure of AUG initiator codon from long-range intra-strand secondary structure at 5'-end of mRNA may be used as a general strategy for human protein production in E. coli

  2. Source coherence impairments in a direct detection direct sequence optical code-division multiple-access system.

    Science.gov (United States)

    Fsaifes, Ihsan; Lepers, Catherine; Lourdiane, Mounia; Gallion, Philippe; Beugin, Vincent; Guignard, Philippe

    2007-02-01

    We demonstrate that direct sequence optical code- division multiple-access (DS-OCDMA) encoders and decoders using sampled fiber Bragg gratings (S-FBGs) behave as multipath interferometers. In that case, chip pulses of the prime sequence codes generated by spreading in time-coherent data pulses can result from multiple reflections in the interferometers that can superimpose within a chip time duration. We show that the autocorrelation function has to be considered as the sum of complex amplitudes of the combined chip as the laser source coherence time is much greater than the integration time of the photodetector. To reduce the sensitivity of the DS-OCDMA system to the coherence time of the laser source, we analyze the use of sparse and nonperiodic quadratic congruence and extended quadratic congruence codes.

  3. Source coherence impairments in a direct detection direct sequence optical code-division multiple-access system

    Science.gov (United States)

    Fsaifes, Ihsan; Lepers, Catherine; Lourdiane, Mounia; Gallion, Philippe; Beugin, Vincent; Guignard, Philippe

    2007-02-01

    We demonstrate that direct sequence optical code- division multiple-access (DS-OCDMA) encoders and decoders using sampled fiber Bragg gratings (S-FBGs) behave as multipath interferometers. In that case, chip pulses of the prime sequence codes generated by spreading in time-coherent data pulses can result from multiple reflections in the interferometers that can superimpose within a chip time duration. We show that the autocorrelation function has to be considered as the sum of complex amplitudes of the combined chip as the laser source coherence time is much greater than the integration time of the photodetector. To reduce the sensitivity of the DS-OCDMA system to the coherence time of the laser source, we analyze the use of sparse and nonperiodic quadratic congruence and extended quadratic congruence codes.

  4. IdentiCS – Identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

    Directory of Open Access Journals (Sweden)

    Zeng An-Ping

    2004-08-01

    Full Text Available Abstract Background A necessary step for a genome level analysis of the cellular metabolism is the in silico reconstruction of the metabolic network from genome sequences. The available methods are mainly based on the annotation of genome sequences including two successive steps, the prediction of coding sequences (CDS and their function assignment. The annotation process takes time. The available methods often encounter difficulties when dealing with unfinished error-containing genomic sequence. Results In this work a fast method is proposed to use unannotated genome sequence for predicting CDSs and for an in silico reconstruction of metabolic networks. Instead of using predicted genes or CDSs to query public databases, entries from public DNA or protein databases are used as queries to search a local database of the unannotated genome sequence to predict CDSs. Functions are assigned to the predicted CDSs simultaneously. The well-annotated genome of Salmonella typhimurium LT2 is used as an example to demonstrate the applicability of the method. 97.7% of the CDSs in the original annotation are correctly identified. The use of SWISS-PROT-TrEMBL databases resulted in an identification of 98.9% of CDSs that have EC-numbers in the published annotation. Furthermore, two versions of sequences of the bacterium Klebsiella pneumoniae with different genome coverage (3.9 and 7.9 fold, respectively are examined. The results suggest that a 3.9-fold coverage of the bacterial genome could be sufficiently used for the in silico reconstruction of the metabolic network. Compared to other gene finding methods such as CRITICA our method is more suitable for exploiting sequences of low genome coverage. Based on the new method, a program called IdentiCS (Identification of Coding Sequences from Unfinished Genome Sequences is delivered that combines the identification of CDSs with the reconstruction, comparison and visualization of metabolic networks (free to download

  5. Death of a dogma: eukaryotic mRNAs can code for more than one protein.

    Science.gov (United States)

    Mouilleron, Hélène; Delcourt, Vivian; Roucou, Xavier

    2016-01-08

    mRNAs carry the genetic information that is translated by ribosomes. The traditional view of a mature eukaryotic mRNA is a molecule with three main regions, the 5' UTR, the protein coding open reading frame (ORF) or coding sequence (CDS), and the 3' UTR. This concept assumes that ribosomes translate one ORF only, generally the longest one, and produce one protein. As a result, in the early days of genomics and bioinformatics, one CDS was associated with each protein-coding gene. This fundamental concept of a single CDS is being challenged by increasing experimental evidence indicating that annotated proteins are not the only proteins translated from mRNAs. In particular, mass spectrometry (MS)-based proteomics and ribosome profiling have detected productive translation of alternative open reading frames. In several cases, the alternative and annotated proteins interact. Thus, the expression of two or more proteins translated from the same mRNA may offer a mechanism to ensure the co-expression of proteins which have functional interactions. Translational mechanisms already described in eukaryotic cells indicate that the cellular machinery is able to translate different CDSs from a single viral or cellular mRNA. In addition to summarizing data showing that the protein coding potential of eukaryotic mRNAs has been underestimated, this review aims to challenge the single translated CDS dogma. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Nascent peptide-mediated translation elongation arrest coupled with mRNA degradation in the CGS1 gene of Arabidopsis

    Science.gov (United States)

    Onouchi, Hitoshi; Nagami, Yoko; Haraguchi, Yuhi; Nakamoto, Mari; Nishimura, Yoshiko; Sakurai, Ryoko; Nagao, Nobuhiro; Kawasaki, Daisuke; Kadokura, Yoshitomo; Naito, Satoshi

    2005-01-01

    Expression of the Arabidopsis CGS1 gene that codes for cystathionine γ-synthase is feedback regulated at the step of mRNA stability in response to S-adenosyl-L-methionine (AdoMet). A short stretch of amino acid sequence, called the MTO1 region, encoded by the first exon of CGS1 itself is involved in this regulation. Here, we demonstrate, using a cell-free system, that AdoMet induces temporal translation elongation arrest at the Ser-94 codon located immediately downstream of the MTO1 region, by analyzing a translation intermediate and performing primer extension inhibition (toeprint) analysis. This translation arrest precedes the formation of a degradation intermediate of CGS1 mRNA, which has its 5′ end points near the 5′ edge of the stalled ribosome. The position of ribosome stalling also suggests that the MTO1 region in nascent peptide resides in the ribosomal exit tunnel when translation elongation is temporarily arrested. In addition to the MTO1 region amino acid sequence, downstream Trp-93 is also important for the AdoMet-induced translation arrest. This is the first example of nascent peptide-mediated translation elongation arrest coupled with mRNA degradation in eukaryotes. Furthermore, our data suggest that the ribosome stalls at the step of translocation rather than at the step of peptidyl transfer. PMID:16027170

  7. Properties of Sequence Conservation in Upstream Regulatory and Protein Coding Sequences among Paralogs in Arabidopsis thaliana

    Science.gov (United States)

    Richardson, Dale N.; Wiehe, Thomas

    Whole genome duplication (WGD) has catalyzed the formation of new species, genes with novel functions, altered expression patterns, complexified signaling pathways and has provided organisms a level of genetic robustness. We studied the long-term evolution and interrelationships of 5’ upstream regulatory sequences (URSs), protein coding sequences (CDSs) and expression correlations (EC) of duplicated gene pairs in Arabidopsis. Three distinct methods revealed significant evolutionary conservation between paralogous URSs and were highly correlated with microarray-based expression correlation of the respective gene pairs. Positional information on exact matches between sequences unveiled the contribution of micro-chromosomal rearrangements on expression divergence. A three-way rank analysis of URS similarity, CDS divergence and EC uncovered specific gene functional biases. Transcription factor activity was associated with gene pairs exhibiting conserved URSs and divergent CDSs, whereas a broad array of metabolic enzymes was found to be associated with gene pairs showing diverged URSs but conserved CDSs.

  8. Licensee Event Report sequence coding and search procedure workshop

    International Nuclear Information System (INIS)

    Cottrell, W.B.; Gallaher, R.B.

    1981-01-01

    Since mid-1980, the Office for Analysis and Evaluation of Operational Data (AEOD) of the Nuclear Regulatory Commission (NRC) has been developing procedures for the systematic review and analysis of Licensee Event Reports (LERs). These procedures generally address several areas of concern, including identification of significant trends and patterns, event sequence of occurrences, component failures, and system and plant effects. The AEOD and NSIC conducted a workshop on the new coding procedure at the American Museum of Science and Energy in Oak Ridge, TN, on November 24, 1980

  9. [Influence of "prehistory" of sequential movements of the right and the left hand on reproduction: coding of positions, movements and sequence structure].

    Science.gov (United States)

    Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R

    2011-01-01

    The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.

  10. Two-terminal video coding.

    Science.gov (United States)

    Yang, Yang; Stanković, Vladimir; Xiong, Zixiang; Zhao, Wei

    2009-03-01

    Following recent works on the rate region of the quadratic Gaussian two-terminal source coding problem and limit-approaching code designs, this paper examines multiterminal source coding of two correlated, i.e., stereo, video sequences to save the sum rate over independent coding of both sequences. Two multiterminal video coding schemes are proposed. In the first scheme, the left sequence of the stereo pair is coded by H.264/AVC and used at the joint decoder to facilitate Wyner-Ziv coding of the right video sequence. The first I-frame of the right sequence is successively coded by H.264/AVC Intracoding and Wyner-Ziv coding. An efficient stereo matching algorithm based on loopy belief propagation is then adopted at the decoder to produce pixel-level disparity maps between the corresponding frames of the two decoded video sequences on the fly. Based on the disparity maps, side information for both motion vectors and motion-compensated residual frames of the right sequence are generated at the decoder before Wyner-Ziv encoding. In the second scheme, source splitting is employed on top of classic and Wyner-Ziv coding for compression of both I-frames to allow flexible rate allocation between the two sequences. Experiments with both schemes on stereo video sequences using H.264/AVC, LDPC codes for Slepian-Wolf coding of the motion vectors, and scalar quantization in conjunction with LDPC codes for Wyner-Ziv coding of the residual coefficients give a slightly lower sum rate than separate H.264/AVC coding of both sequences at the same video quality.

  11. Enhanced Protein Production in Escherichia coli by Optimization of Cloning Scars at the Vector-Coding Sequence Junction

    DEFF Research Database (Denmark)

    Mirzadeh, Kiavash; Martinez, Virginia; Toddo, Stephen

    2015-01-01

    are poorly expressed even when they are codon-optimized and expressed from vectors with powerful genetic elements. In this study, we show that poor expression can be caused by certain nucleotide sequences (e.g., cloning scars) at the junction between the vector and the coding sequence. Since these sequences...

  12. mRNA localization mechanisms in Trypanosoma cruzi.

    Directory of Open Access Journals (Sweden)

    Lysangela R Alves

    Full Text Available Asymmetric mRNA localization is a sophisticated tool for regulating and optimizing protein synthesis and maintaining cell polarity. Molecular mechanisms involved in the regulated localization of transcripts are widespread in higher eukaryotes and fungi, but not in protozoa. Trypanosomes are ancient eukaryotes that branched off early in eukaryote evolution. We hypothesized that these organisms would have basic mechanisms of mRNA localization. FISH assays with probes against transcripts coding for proteins with restricted distributions showed a discrete localization of the mRNAs in the cytoplasm. Moreover, cruzipain mRNA was found inside reservosomes suggesting new unexpected functions for this vacuolar organelle. Individual mRNAs were also mobilized to RNA granules in response to nutritional stress. The cytoplasmic distribution of these transcripts changed with cell differentiation, suggesting that localization mechanisms might be involved in the regulation of stage-specific protein expression. Transfection assays with reporter genes showed that, as in higher eukaryotes, 3'UTRs were responsible for guiding mRNAs to their final location. Our results strongly suggest that Trypanosoma cruzi have a core, basic mechanism of mRNA localization. This kind of controlled mRNA transport is ancient, dating back to early eukaryote evolution.

  13. Expression of coding (mRNA) and non-coding (microRNA) RNA in lung tissue and blood isolated from pigs suffering from bacterial pleuropneumonia

    DEFF Research Database (Denmark)

    Skovgaard, Kerstin; Schou, Kirstine Klitgaard; Wendt, Karin Tarp

    2010-01-01

    MicroRNAs are small non-coding RNA molecules (18-23 nt), that regulate the activity of other genes at the post-transcriptional level. Recently it has become evident that microRNA plays an important role in modulating and fine tuning innate and adaptive immune responses. Still, little is known about...... the impact of microRNAs in the development and pathogenesis of lung infections. Expression of microRNA known to be induced by bacterial (i.e., LPS) ligands and thus supposed to play a role in the regulation of antimicrobial defence, were studied in lung tissue and in blood from pigs experimentally infected...... with Actinobacillus pleuropneumoniae (AP). Expression differences of mRNA and microRNA were quantified at different time points (6h, 12h, 24h, 48h PI) using reverse transcription quantitative real-time PCR (Rotor-Gene and Fluidigm). Expression profiles of miRNA in blood of seven animals were further studied using mi...

  14. Laminar and Temporal Expression Dynamics of Coding and Noncoding RNAs in the Mouse Neocortex

    Directory of Open Access Journals (Sweden)

    Sofia Fertuzinhos

    2014-03-01

    Full Text Available The hallmark of the cerebral neocortex is its organization into six layers, each containing a characteristic set of cell types and synaptic connections. The transcriptional events involved in laminar development and function still remain elusive. Here, we employed deep sequencing of mRNA and small RNA species to gain insights into transcriptional differences among layers and their temporal dynamics during postnatal development of the mouse primary somatosensory neocortex. We identify a number of coding and noncoding transcripts with specific spatiotemporal expression and splicing patterns. We also identify signature trajectories and gene coexpression networks associated with distinct biological processes and transcriptional overlap between these processes. Finally, we provide data that allow the study of potential miRNA and mRNA interactions. Overall, this study provides an integrated view of the laminar and temporal expression dynamics of coding and noncoding transcripts in the mouse neocortex and a resource for studies of neurodevelopment and transcriptome.

  15. A glimpse at mRNA dynamics reveals cellular domains and rapid trafficking through granules

    NARCIS (Netherlands)

    Gemert, Alice Myriam Christi van

    2011-01-01

    mRNA transport and targeting are essential to gene expression regulation. Specific mRNA sequences can bind several proteins and together form RiboNucleoProtein particles (RNP). The various proteins within the RNP determine mRNA fate: translation, transport or decay. RNP composition varies with

  16. FOURTH SEMINAR TO THE MEMORY OF D.N. KLYSHKO: Algebraic solution of the synthesis problem for coded sequences

    Science.gov (United States)

    Leukhin, Anatolii N.

    2005-08-01

    The algebraic solution of a 'complex' problem of synthesis of phase-coded (PC) sequences with the zero level of side lobes of the cyclic autocorrelation function (ACF) is proposed. It is shown that the solution of the synthesis problem is connected with the existence of difference sets for a given code dimension. The problem of estimating the number of possible code combinations for a given code dimension is solved. It is pointed out that the problem of synthesis of PC sequences is related to the fundamental problems of discrete mathematics and, first of all, to a number of combinatorial problems, which can be solved, as the number factorisation problem, by algebraic methods by using the theory of Galois fields and groups.

  17. Direct quantification of human cytomegalovirus immediate-early and late mRNA levels in blood of lung transplant recipients by competitive nucleic acid sequence-based amplification

    NARCIS (Netherlands)

    Greijer, AE; Verschuuren, EAM; Harmsen, MC; Dekkers, CAJ; Adriaanse, HMA; The, TH; Middeldorp, JM

    The dynamics of active human cytomegalovirus (HCMV) infection was monitored by competitive nucleic acid sequence-based amplification (NASBA) assays for quantification of IE1 (UL123) and pp67 (UL65) mRNA expression levels In the blood of patients after lung transplantation. RNA was isolated from 339

  18. SinEx DB: a database for single exon coding sequences in mammalian genomes.

    Science.gov (United States)

    Jorquera, Roddy; Ortiz, Rodrigo; Ossandon, F; Cárdenas, Juan Pablo; Sepúlveda, Rene; González, Carolina; Holmes, David S

    2016-01-01

    Eukaryotic genes are typically interrupted by intragenic, noncoding sequences termed introns. However, some genes lack introns in their coding sequence (CDS) and are generally known as 'single exon genes' (SEGs). In this work, a SEG is defined as a nuclear, protein-coding gene that lacks introns in its CDS. Whereas, many public databases of Eukaryotic multi-exon genes are available, there are only two specialized databases for SEGs. The present work addresses the need for a more extensive and diverse database by creating SinEx DB, a publicly available, searchable database of predicted SEGs from 10 completely sequenced mammalian genomes including human. SinEx DB houses the DNA and protein sequence information of these SEGs and includes their functional predictions (KOG) and the relative distribution of these functions within species. The information is stored in a relational database built with My SQL Server 5.1.33 and the complete dataset of SEG sequences and their functional predictions are available for downloading. SinEx DB can be interrogated by: (i) a browsable phylogenetic schema, (ii) carrying out BLAST searches to the in-house SinEx DB of SEGs and (iii) via an advanced search mode in which the database can be searched by key words and any combination of searches by species and predicted functions. SinEx DB provides a rich source of information for advancing our understanding of the evolution and function of SEGs.Database URL: www.sinex.cl. © The Author(s) 2016. Published by Oxford University Press.

  19. Dwell-Time Distribution, Long Pausing and Arrest of Single-Ribosome Translation through the mRNA Duplex.

    Science.gov (United States)

    Xie, Ping

    2015-10-09

    Proteins in the cell are synthesized by a ribosome translating the genetic information encoded on the single-stranded messenger RNA (mRNA). It has been shown that the ribosome can also translate through the duplex region of the mRNA by unwinding the duplex. Here, based on our proposed model of the ribosome translation through the mRNA duplex we study theoretically the distribution of dwell times of the ribosome translation through the mRNA duplex under the effect of a pulling force externally applied to the ends of the mRNA to unzip the duplex. We provide quantitative explanations of the available single molecule experimental data on the distribution of dwell times with both short and long durations, on rescuing of the long paused ribosomes by raising the pulling force to unzip the duplex, on translational arrests induced by the mRNA duplex and Shine-Dalgarno(SD)-like sequence in the mRNA. The functional consequences of the pauses or arrests caused by the mRNA duplex and the SD sequence are discussed and compared with those obtained from other types of pausing, such as those induced by "hungry" codons or interactions of specific sequences in the nascent chain with the ribosomal exit tunnel.

  20. RevTrans: multiple alignment of coding DNA from aligned amino acid sequences

    DEFF Research Database (Denmark)

    Wernersson, Rasmus; Pedersen, Anders Gorm

    2003-01-01

    The simple fact that proteins are built from 20 amino acids while DNA only contains four different bases, means that the 'signal-to-noise ratio' in protein sequence alignments is much better than in alignments of DNA. Besides this information-theoretical advantage, protein alignments also benefit...... proteins. It is therefore preferable to align coding DNA at the amino acid level and it is for this purpose we have constructed the program RevTrans. RevTrans constructs a multiple DNA alignment by: (i) translating the DNA; (ii) aligning the resulting peptide sequences; and (iii) building a multiple DNA...

  1. Individual microRNAs (miRNAs) display distinct mRNA targeting "rules".

    Science.gov (United States)

    Wang, Wang-Xia; Wilfred, Bernard R; Xie, Kevin; Jennings, Mary H; Hu, Yanling Hu; Stromberg, Arnold J; Nelson, Peter T

    2010-01-01

    MicroRNAs (miRNAs) guide Argonaute (AGO)-containing microribonucleoprotein (miRNP) complexes to target mRNAs.It has been assumed that miRNAs behave similarly to each other with regard to mRNA target recognition. The usual assumptions, which are based on prior studies, are that miRNAs target preferentially sequences in the 3'UTR of mRNAs,guided by the 5' "seed" portion of the miRNAs. Here we isolated AGO- and miRNA-containing miRNPs from human H4 tumor cells by co-immunoprecipitation (co-IP) with anti-AGO antibody. Cells were transfected with miR-107, miR-124,miR-128, miR-320, or a negative control miRNA. Co-IPed RNAs were subjected to downstream high-density Affymetrix Human Gene 1.0 ST microarray analyses using an assay we validated previously-a "RIP-Chip" experimental design. RIP-Chip data provided a list of mRNAs recruited into the AGO-miRNP in correlation to each miRNA. These experimentally identified miRNA targets were analyzed for complementary six nucleotide "seed" sequences within the transfected miRNAs. We found that miR-124 targets tended to have sequences in the 3'UTR that would be recognized by the 5' seed of miR-124, as described in previous studies. By contrast, miR-107 targets tended to have 'seed' sequences in the mRNA open reading frame, but not the 3' UTR. Further, mRNA targets of miR-128 and miR-320 are less enriched for 6-mer seed sequences in comparison to miR-107 and miR-124. In sum, our data support the importance of the 5' seed in determining binding characteristics for some miRNAs; however, the "binding rules" are complex, and individual miRNAs can have distinct sequence determinants that lead to mRNA targeting.

  2. SHARAKU: an algorithm for aligning and clustering read mapping profiles of deep sequencing in non-coding RNA processing.

    Science.gov (United States)

    Tsuchiya, Mariko; Amano, Kojiro; Abe, Masaya; Seki, Misato; Hase, Sumitaka; Sato, Kengo; Sakakibara, Yasubumi

    2016-06-15

    Deep sequencing of the transcripts of regulatory non-coding RNA generates footprints of post-transcriptional processes. After obtaining sequence reads, the short reads are mapped to a reference genome, and specific mapping patterns can be detected called read mapping profiles, which are distinct from random non-functional degradation patterns. These patterns reflect the maturation processes that lead to the production of shorter RNA sequences. Recent next-generation sequencing studies have revealed not only the typical maturation process of miRNAs but also the various processing mechanisms of small RNAs derived from tRNAs and snoRNAs. We developed an algorithm termed SHARAKU to align two read mapping profiles of next-generation sequencing outputs for non-coding RNAs. In contrast with previous work, SHARAKU incorporates the primary and secondary sequence structures into an alignment of read mapping profiles to allow for the detection of common processing patterns. Using a benchmark simulated dataset, SHARAKU exhibited superior performance to previous methods for correctly clustering the read mapping profiles with respect to 5'-end processing and 3'-end processing from degradation patterns and in detecting similar processing patterns in deriving the shorter RNAs. Further, using experimental data of small RNA sequencing for the common marmoset brain, SHARAKU succeeded in identifying the significant clusters of read mapping profiles for similar processing patterns of small derived RNA families expressed in the brain. The source code of our program SHARAKU is available at http://www.dna.bio.keio.ac.jp/sharaku/, and the simulated dataset used in this work is available at the same link. Accession code: The sequence data from the whole RNA transcripts in the hippocampus of the left brain used in this work is available from the DNA DataBank of Japan (DDBJ) Sequence Read Archive (DRA) under the accession number DRA004502. yasu@bio.keio.ac.jp Supplementary data are available

  3. Naturally occurring BRCA2 alternative mRNA splicing events in clinically relevant samples

    DEFF Research Database (Denmark)

    Fackenthal, James D; Yoshimatsu, Toshio; Zhang, Bifeng

    2016-01-01

    patterns and thereby disrupt gene function. mRNA analyses are therefore among the tests used to interpret the clinical significance of some genetic variants. However, these could be confounded by the appearance of naturally occurring alternative transcripts unrelated to germline sequence variation...... to characterise the spectrum of naturally occurring BRCA2 mRNA alternate-splicing events. METHODS: mRNA was prepared from several blood and breast tissue-derived cells and cell lines by contributing ENIGMA laboratories. cDNA representing BRCA2 alternate splice sites was amplified and visualised using capillary...... or agarose gel electrophoresis, followed by sequencing. RESULTS: We demonstrate the existence of 24 different BRCA2 mRNA alternate-splicing events in lymphoblastoid cell lines and both breast cancer and non-cancerous breast cell lines. CONCLUSIONS: These naturally occurring alternate-splicing events...

  4. Browning in Annona cherimola fruit: role of polyphenol oxidase and characterization of a coding sequence of the enzyme.

    Science.gov (United States)

    Prieto, Humberto; Utz, Daniella; Castro, Alvaro; Aguirre, Carlos; González-Agüero, Mauricio; Valdés, Héctor; Cifuentes, Nicolas; Defilippi, Bruno G; Zamora, Pablo; Zúñiga, Gustavo; Campos-Vargas, Reinaldo

    2007-10-31

    Cherimoya (Annona cherimola Mill.) fruit is an attractive candidate for food processing applications as fresh cut. However, along with its desirable delicate taste, cherimoya shows a marked susceptibility to browning. This condition is mainly attributed to polyphenol oxidase activity (PPO). A general lack of knowledge regarding PPO and its role in the oxidative loss of quality in processed cherimoya fruit requires a better understanding of the mechanisms involved. The work carried out included the cloning of a full-length cDNA, an analysis of its properties in the deduced amino sequence, and linkage of its mRNA levels with enzyme activity in mature and ripe fruits after wounding. The results showed one gene different at the nucleotide level when compared with previously reported genes, but a well-conserved protein, either in functional and in structural terms. Cherimoya PPO gene (Ac-ppo, GenBank DQ990911) showed to be present apparently in one copy of the genome, and its transcripts could be significantly detected in leaves and less abundantly in flowers and fruits. Analysis of wounded matured and ripened fruits revealed an inductive behavior for mRNA levels in the flesh of mature cherimoya after 16 h. Although the highest enzymatic activity was observed on rind, a consistent PPO activity was detected on flesh samples. A lack of correlation between PPO mRNA level and PPO activity was observed, especially in flesh tissue. This is probably due to the presence of monophenolic substrates inducing a lag period, enzyme inhibitors and/or diphenolic substrates causing suicide inactivation, and proenzyme or latent isoforms of PPO. To our knowledge this is the first report of a complete PPO sequence in cherimoya. Furthermore, the gene is highly divergent from known nucleotide sequences but shows a well conserved protein in terms of its function, deduced structure, and physiological role.

  5. Whole transcriptome analysis of Acinetobacter baumannii assessed by RNA-sequencing reveals different mRNA expression profiles in biofilm compared to planktonic cells.

    Directory of Open Access Journals (Sweden)

    Soraya Rumbo-Feal

    Full Text Available Acinetobacterbaumannii has emerged as a dangerous opportunistic pathogen, with many strains able to form biofilms and thus cause persistent infections. The aim of the present study was to use high-throughput sequencing techniques to establish complete transcriptome profiles of planktonic (free-living and sessile (biofilm forms of A. baumannii ATCC 17978 and thereby identify differences in their gene expression patterns. Collections of mRNA from planktonic (both exponential and stationary phase cultures and sessile (biofilm cells were sequenced. Six mRNA libraries were prepared following the mRNA-Seq protocols from Illumina. Reads were obtained in a HiScanSQ platform and mapped against the complete genome to describe the complete mRNA transcriptomes of planktonic and sessile cells. The results showed that the gene expression pattern of A. baumannii biofilm cells was distinct from that of planktonic cells, including 1621 genes over-expressed in biofilms relative to stationary phase cells and 55 genes expressed only in biofilms. These differences suggested important changes in amino acid and fatty acid metabolism, motility, active transport, DNA-methylation, iron acquisition, transcriptional regulation, and quorum sensing, among other processes. Disruption or deletion of five of these genes caused a significant decrease in biofilm formation ability in the corresponding mutant strains. Among the genes over-expressed in biofilm cells were those in an operon involved in quorum sensing. One of them, encoding an acyl carrier protein, was shown to be involved in biofilm formation as demonstrated by the significant decrease in biofilm formation by the corresponding knockout strain. The present work serves as a basis for future studies examining the complex network systems that regulate bacterial biofilm formation and maintenance.

  6. Kinetic models of gene expression including non-coding RNAs

    Energy Technology Data Exchange (ETDEWEB)

    Zhdanov, Vladimir P., E-mail: zhdanov@catalysis.r

    2011-03-15

    In cells, genes are transcribed into mRNAs, and the latter are translated into proteins. Due to the feedbacks between these processes, the kinetics of gene expression may be complex even in the simplest genetic networks. The corresponding models have already been reviewed in the literature. A new avenue in this field is related to the recognition that the conventional scenario of gene expression is fully applicable only to prokaryotes whose genomes consist of tightly packed protein-coding sequences. In eukaryotic cells, in contrast, such sequences are relatively rare, and the rest of the genome includes numerous transcript units representing non-coding RNAs (ncRNAs). During the past decade, it has become clear that such RNAs play a crucial role in gene expression and accordingly influence a multitude of cellular processes both in the normal state and during diseases. The numerous biological functions of ncRNAs are based primarily on their abilities to silence genes via pairing with a target mRNA and subsequently preventing its translation or facilitating degradation of the mRNA-ncRNA complex. Many other abilities of ncRNAs have been discovered as well. Our review is focused on the available kinetic models describing the mRNA, ncRNA and protein interplay. In particular, we systematically present the simplest models without kinetic feedbacks, models containing feedbacks and predicting bistability and oscillations in simple genetic networks, and models describing the effect of ncRNAs on complex genetic networks. Mathematically, the presentation is based primarily on temporal mean-field kinetic equations. The stochastic and spatio-temporal effects are also briefly discussed.

  7. Coding and decoding libraries of sequence-defined functional copolymers synthesized via photoligation.

    Science.gov (United States)

    Zydziak, Nicolas; Konrad, Waldemar; Feist, Florian; Afonin, Sergii; Weidner, Steffen; Barner-Kowollik, Christopher

    2016-11-30

    Designing artificial macromolecules with absolute sequence order represents a considerable challenge. Here we report an advanced light-induced avenue to monodisperse sequence-defined functional linear macromolecules up to decamers via a unique photochemical approach. The versatility of the synthetic strategy-combining sequential and modular concepts-enables the synthesis of perfect macromolecules varying in chemical constitution and topology. Specific functions are placed at arbitrary positions along the chain via the successive addition of monomer units and blocks, leading to a library of functional homopolymers, alternating copolymers and block copolymers. The in-depth characterization of each sequence-defined chain confirms the precision nature of the macromolecules. Decoding of the functional information contained in the molecular structure is achieved via tandem mass spectrometry without recourse to their synthetic history, showing that the sequence information can be read. We submit that the presented photochemical strategy is a viable and advanced concept for coding individual monomer units along a macromolecular chain.

  8. The Pekin duck programmed death-ligand 1: cDNA cloning, genomic structure, molecular characterization and mRNA expression analysis.

    Science.gov (United States)

    Yao, Q; Fischer, K P; Tyrrell, D L; Gutfreund, K S

    2015-04-01

    Programmed death ligand-1 (PD-L1) plays an important role in the attenuation of adaptive immune responses in higher vertebrates. Here, we describe the identification of the Pekin duck PD-L1 orthologue (duPD-L1) and its gene structure. The duPD-L1 cDNA encodes a 311-amino acid protein that has an amino acid identity of 78% and 42% with chicken and human PD-L1, respectively. Mapping of the duPD-L1 cDNA with duck genomic sequences revealed an exonic structure of its coding sequence similar to those of other vertebrates but lacked a noncoding exon 1. Homology modelling of the duPD-L1 extracellular domain was compatible with the tandem IgV-like and IgC-like IgSF domain structure of human PD-L1 (PDB ID: 3BIS). Residues known to be important for receptor binding of human PD-L1 were mostly conserved in duPD-L1 within the N-terminus and the G sheet, and partially conserved within the F sheet but not within sheets C and C'. DuPD-L1 mRNA was constitutively expressed in all tissues examined with highest expression levels in lung and spleen and very low levels of expression in muscle, kidney and brain. Mitogen stimulation of duck peripheral blood mononuclear cells transiently increased duPD-L1 mRNA expression. Our observations demonstrate evolutionary conservation of the exonic structure of its coding sequence, the extracellular domain structure and residues implicated in receptor binding, but the role of the longer cytoplasmic tail in avian PD-L1 proteins remains to be determined. © 2014 John Wiley & Sons Ltd.

  9. MicroRNA-128 targets myostatin at coding domain sequence to regulate myoblasts in skeletal muscle development.

    Science.gov (United States)

    Shi, Lei; Zhou, Bo; Li, Pinghua; Schinckel, Allan P; Liang, Tingting; Wang, Han; Li, Huizhi; Fu, Lingling; Chu, Qingpo; Huang, Ruihua

    2015-09-01

    MicroRNAs (miRNAs or miRs) play a critical role in skeletal muscle development. In a previous study we observed that miR-128 was highly expressed in skeletal muscle. However, its function in regulating skeletal muscle development is not clear. Our hypothesis was that miR-128 is involved in the regulation of the proliferation and differentiation of skeletal myoblasts. In this study, through bioinformatics analyses, we demonstrate that miR-128 specifically targeted mRNA of myostatin (MSTN), a critical inhibitor of skeletal myogenesis, at coding domain sequence (CDS) region, resulting in down-regulating of myostatin post-transcription. Overexpression of miR-128 inhibited proliferation of mouse C2C12 myoblast cells but promoted myotube formation; whereas knockdown of miR-128 had completely opposite effects. In addition, ectopic miR-128 regulated the expression of myogenic factor 5 (Myf5), myogenin (MyoG), paired box (Pax) 3 and 7. Furthermore, an inverse relationship was found between the expression of miR-128 and MSTN protein expression in vivo and in vitro. Taken together, these results reveal that there is a novel pathway in skeletal muscle development in which miR-128 regulates myostatin at CDS region to inhibit proliferation but promote differentiation of myoblast cells. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. Methylmethane-sulphonate and X-ray-induced mutations in the Chinese hamster hprt gene: mRNA phenotyping using polymerase chain reactions

    International Nuclear Information System (INIS)

    Chaudhry, M.A.; Fox, Margaret

    1990-01-01

    Alterations in the hprt gene of Chinese hamster cells were determined in 71 spontaneous, methylmethane sulphonate (MMS)-and X-ray induced mutants, using the Southern blot hybridization technique. To investigate the possibility of small deletions, MMS-induced mutants were studied with proves derived from exons 3 and 9 but no evidence of specific deletion of these two exons was found. The polymerase chain reaction (PCR) was used to phenotype hprt transcripts in 48 MMS, X-ray and spontaneous Chinese hamster mutants by amplifying the coding region of their cDNA. Among 22 MMS-induced mutants the message was present in 16 instances. An analysis of 20 X-ray-induced mutants showed the presence of hprt mRNA in 11 of them with five having low levels of transcription. Among six spontaneous mutants, four were negative for mRNA on standard Northern blots and in one the message was only detected after PCR amplification. Direct DNA sequencing of 10 mutants revealed the presence of base substitutions in five of them while a 7 bp deletion was found in another. No mutations were found in another four mutants, suggesting the presence of mutation outside the coding region. (author)

  11. Comparisons between Arabidopsis thaliana and Drosophila melanogaster in relation to Coding and Noncoding Sequence Length and Gene Expression

    Directory of Open Access Journals (Sweden)

    Rachel Caldwell

    2015-01-01

    Full Text Available There is a continuing interest in the analysis of gene architecture and gene expression to determine the relationship that may exist. Advances in high-quality sequencing technologies and large-scale resource datasets have increased the understanding of relationships and cross-referencing of expression data to the large genome data. Although a negative correlation between expression level and gene (especially transcript length has been generally accepted, there have been some conflicting results arising from the literature concerning the impacts of different regions of genes, and the underlying reason is not well understood. The research aims to apply quantile regression techniques for statistical analysis of coding and noncoding sequence length and gene expression data in the plant, Arabidopsis thaliana, and fruit fly, Drosophila melanogaster, to determine if a relationship exists and if there is any variation or similarities between these species. The quantile regression analysis found that the coding sequence length and gene expression correlations varied, and similarities emerged for the noncoding sequence length (5′ and 3′ UTRs between animal and plant species. In conclusion, the information described in this study provides the basis for further exploration into gene regulation with regard to coding and noncoding sequence length.

  12. An integrated PCR colony hybridization approach to screen cDNA libraries for full-length coding sequences.

    Science.gov (United States)

    Pollier, Jacob; González-Guzmán, Miguel; Ardiles-Diaz, Wilson; Geelen, Danny; Goossens, Alain

    2011-01-01

    cDNA-Amplified Fragment Length Polymorphism (cDNA-AFLP) is a commonly used technique for genome-wide expression analysis that does not require prior sequence knowledge. Typically, quantitative expression data and sequence information are obtained for a large number of differentially expressed gene tags. However, most of the gene tags do not correspond to full-length (FL) coding sequences, which is a prerequisite for subsequent functional analysis. A medium-throughput screening strategy, based on integration of polymerase chain reaction (PCR) and colony hybridization, was developed that allows in parallel screening of a cDNA library for FL clones corresponding to incomplete cDNAs. The method was applied to screen for the FL open reading frames of a selection of 163 cDNA-AFLP tags from three different medicinal plants, leading to the identification of 109 (67%) FL clones. Furthermore, the protocol allows for the use of multiple probes in a single hybridization event, thus significantly increasing the throughput when screening for rare transcripts. The presented strategy offers an efficient method for the conversion of incomplete expressed sequence tags (ESTs), such as cDNA-AFLP tags, to FL-coding sequences.

  13. An Explicit Construction of a sequence of codes attaining the Tsfasman-Vladut-Zink Bound:The first steps

    DEFF Research Database (Denmark)

    Høholdt, Tom; Voss, Cornelia

    1997-01-01

    We present a sequence of codes attaining the Tsfasman-Vladut-Zink bound. The construction is based on the tower of Artin-Schreier extensions described by Garcia and Stichtenoth (1995). We also determine the dual codes. The first steps of the constructions are explicitly given as generator matrices...

  14. Coding sequence of human rho cDNAs clone 6 and clone 9

    Energy Technology Data Exchange (ETDEWEB)

    Chardin, P; Madaule, P; Tavitian, A

    1988-03-25

    The authors have isolated human cDNAs including the complete coding sequence for two rho proteins corresponding to the incomplete isolates previously described as clone 6 and clone 9. The deduced a.a. sequences, when compared to the a.a. sequence deduced from clone 12 cDNA, show that there are in human at least three highly homologous rho genes. They suggest that clone 12 be named rhoA, clone 6 : rhoB and clone 9 : rhoC. RhoA, B and C proteins display approx. 30% a.a. identity with ras proteins,. mainly clustered in four highly homologous internal regions corresponding to the GTP binding site; however at least one significant difference is found; the 3 rho proteins have an Alanine in position corresponding to ras Glycine 13, suggesting that rho and ras proteins might have slightly different biochemical properties.

  15. Application of Melcor code for the calculo of TMLB sequence in PWR with natural circulating into the vessel

    International Nuclear Information System (INIS)

    Marten-Fuertes, F.

    1995-01-01

    The use of computer codes to analyze the phenomena of severe accidents is very important to take decisions in Nuclear Safety. This paper presents the MELCOR code used to calculate the TMLB sequence of PWR with natural circulation into the vessels. The main goal of this code is its application for the PSA (probabilistic safety analysis)

  16. Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor

    International Nuclear Information System (INIS)

    Antalis, T.M.; Clark, M.A.; Barnes, T.; Lehrbach, P.R.; Devine, P.L.; Schevzov, G.; Goss, N.H.; Stephens, R.W.; Tolstoshev, P.

    1988-01-01

    Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A) + RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the λ P/sub L/ promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated M/sub r/ of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators

  17. Full-Length Venom Protein cDNA Sequences from Venom-Derived mRNA: Exploring Compositional Variation and Adaptive Multigene Evolution.

    Science.gov (United States)

    Modahl, Cassandra M; Mackessy, Stephen P

    2016-06-01

    Envenomation of humans by snakes is a complex and continuously evolving medical emergency, and treatment is made that much more difficult by the diverse biochemical composition of many venoms. Venomous snakes and their venoms also provide models for the study of molecular evolutionary processes leading to adaptation and genotype-phenotype relationships. To compare venom complexity and protein sequences, venom gland transcriptomes are assembled, which usually requires the sacrifice of snakes for tissue. However, toxin transcripts are also present in venoms, offering the possibility of obtaining cDNA sequences directly from venom. This study provides evidence that unknown full-length venom protein transcripts can be obtained from the venoms of multiple species from all major venomous snake families. These unknown venom protein cDNAs are obtained by the use of primers designed from conserved signal peptide sequences within each venom protein superfamily. This technique was used to assemble a partial venom gland transcriptome for the Middle American Rattlesnake (Crotalus simus tzabcan) by amplifying sequences for phospholipases A2, serine proteases, C-lectins, and metalloproteinases from within venom. Phospholipase A2 sequences were also recovered from the venoms of several rattlesnakes and an elapid snake (Pseudechis porphyriacus), and three-finger toxin sequences were recovered from multiple rear-fanged snake species, demonstrating that the three major clades of advanced snakes (Elapidae, Viperidae, Colubridae) have stable mRNA present in their venoms. These cDNA sequences from venom were then used to explore potential activities derived from protein sequence similarities and evolutionary histories within these large multigene superfamilies. Venom-derived sequences can also be used to aid in characterizing venoms that lack proteomic profiles and identify sequence characteristics indicating specific envenomation profiles. This approach, requiring only venom, provides

  18. Photodynamic antisense regulation of mRNA having a point mutation with psoralen-conjugated oligonucleotide.

    Science.gov (United States)

    Higuchi, Maiko; Yamayoshi, Asako; Kobori, Akio; Murakami, Akira

    2008-01-01

    Nucleic acid-based drugs, such as antisense oligonucleotide, ribozyme, and small interfering RNA, are specific compounds that inhibit gene expression at the post-transcriptional level. To develop more effective nucleic acid-based drugs, we focused on photo-reactive antisense oligonucleotides. We have optimized the structure of psoralen-conjugated oligonucleotide to improve their sequence selectivity and photo-crosslinking efficiency. Previously, we reported that photo reactive oligonucleotides containing 2'-O-psoralenyl-methoxyethyl adenosine (2'-Ps-eom) showed drastic photo-reactivity with a strictly sequence specific manner in vitro. In this report, we evaluated the binding ability toward intracellular target mRNA. The 2'-Ps-eom selectively photo-cross-linked to the target mRNA extracted from cells. The 2'-Ps-eom also cross-linked to target mRNA in cells. Furthermore, 2'-Ps-eom did not cross-link to mRNA having a mismatch base. These results suggest that 2'-Ps-eom is a powerful antisense molecule to inhibit the expression of mRNA having a point mutation.

  19. Accessibility of the Shine-Dalgarno sequence dictates N-terminal codon bias in E. coli

    OpenAIRE

    Shakhnovich, Eugene; Zhang, Wenli; Yan, Jin; Adkar, Bharat; Jacobs, William; Bhattacharyya, Sanchari; Adkar, Bharat

    2018-01-01

    Despite considerable efforts, no physical mechanism has been shown to explain N-terminal codon bias in prokaryotic genomes. Using a systematic study of synonymous substitutions in two endogenous E. coli genes, we show that interactions between the coding region and the upstream Shine-Dalgarno (SD) sequence modulate the efficiency of translation initiation, affecting both intracellular mRNA and protein levels due to the inherent coupling of transcription and translation in E. coli. We further ...

  20. Diagnosis of chronic myeloid and acute lymphocytic leukemias by detection of leukemia-specific mRNA sequences amplified in vitro

    International Nuclear Information System (INIS)

    Kawasaki, E.S.; Clark, S.S.; Coyne, M.Y.; Smith, S.D.; Champlin, R.; Witte, O.N.; McCormick, F.P.

    1988-01-01

    The Philadelphia chromosome is present in more than 95% of chronic myeloid leukemia patients and 13% of acute lymphocytic leukemia patients. The Philadelphia translocation, t(9;22), fuses the BCR and ABL genes resulting in the expression of leukemia-specific, chimeric BCR-ABL messenger RNAs. To facilitate diagnosis of these leukemias, the authors have developed a method of amplifying and detecting only the unique mRNA sequences, using an extension of the polymerase chain reaction technique. Diagnosis of chronic myeloid and acute lymphocytic leukemias by this procedure is rapid, much more sensitive than existing protocols, and independent of the presence or absence of an identifiable Philadelphia chromosome

  1. cDNA sequences of two inducible T-cell genes

    Energy Technology Data Exchange (ETDEWEB)

    Kwon, B.S. (Indiana Univ. School of Medicine, Indianapolis (USA) Guthrie Research Institute, Sayre, PA (USA)); Weissman, S.M. (Yale Univ., New Haven, CT (USA))

    1989-03-01

    The authors have previously described a set of human T-lymphocyte-specific cDNA clones isolated by a modified differential screening procedure. Apparent full-length cDNAs containing the sequences of 14 of the 16 initial isolates were sequenced and were found to represent five different species of mRNA; three of the five species were identical to previously reported cDNA sequences of preproenkephalin, T-cell-replacing factor, and a serine esterase, respectively. The other two species, 4-1BB and L2G25B, were inducible sequences found in mRNA from both a cytolytic T-lymphocyte and a helper T-lymphocyte clone and were not previously described in T-cell mRNA; these mRNA sequences encode peptides of 256 and 92 amino acids, respectively. Both peptides contain putative leader sequences. The protein encoded by 4-1BB also has a potential membrane anchor segment and other features also seen in known receptor proteins.

  2. Episodic sequence memory is supported by a theta-gamma phase code

    OpenAIRE

    Heusser, Andrew C.; Poeppel, David; Ezzyat, Youssef; Davachi, Lila

    2016-01-01

    The meaning we derive from our experiences is not a simple static extraction of the elements, but is largely based on the order in which those elements occur. Models propose that sequence encoding is supported by interactions between high and low frequency oscillations, such that elements within an experience are represented by neural cell assemblies firing at higher frequencies (i.e. gamma) and sequential order is coded by the specific timing of firing with respect to a lower frequency oscil...

  3. A Mechanistic Beta-Binomial Probability Model for mRNA Sequencing Data.

    Science.gov (United States)

    Smith, Gregory R; Birtwistle, Marc R

    2016-01-01

    A main application for mRNA sequencing (mRNAseq) is determining lists of differentially-expressed genes (DEGs) between two or more conditions. Several software packages exist to produce DEGs from mRNAseq data, but they typically yield different DEGs, sometimes markedly so. The underlying probability model used to describe mRNAseq data is central to deriving DEGs, and not surprisingly most softwares use different models and assumptions to analyze mRNAseq data. Here, we propose a mechanistic justification to model mRNAseq as a binomial process, with data from technical replicates given by a binomial distribution, and data from biological replicates well-described by a beta-binomial distribution. We demonstrate good agreement of this model with two large datasets. We show that an emergent feature of the beta-binomial distribution, given parameter regimes typical for mRNAseq experiments, is the well-known quadratic polynomial scaling of variance with the mean. The so-called dispersion parameter controls this scaling, and our analysis suggests that the dispersion parameter is a continually decreasing function of the mean, as opposed to current approaches that impose an asymptotic value to the dispersion parameter at moderate mean read counts. We show how this leads to current approaches overestimating variance for moderately to highly expressed genes, which inflates false negative rates. Describing mRNAseq data with a beta-binomial distribution thus may be preferred since its parameters are relatable to the mechanistic underpinnings of the technique and may improve the consistency of DEG analysis across softwares, particularly for moderately to highly expressed genes.

  4. Molecular evolution of adiponectin in Carnivora and its mRNA expression in relation to hepatic lipidosis.

    Science.gov (United States)

    Nieminen, Petteri; Rouvinen-Watt, Kirsti; Kapiainen, Suvi; Harris, Lora; Mustonen, Anne-Mari

    2010-09-15

    Adiponectin is a novel adipocyte-derived hormone with low circulating concentrations and/or mRNA expression in obesity and non-alcoholic fatty liver disease (NAFLD). The adiponectin mRNA of several Carnivora species was sequenced to enable further gene expression studies in this clade with potential experimental species to examine the connections of hypoadiponectinemia to hepatic lipidosis. In addition, adiponectin mRNA expression was studied in the retroperitoneal fat of the American mink (Neovison vison), as hepatic lipidosis with close similarities to NAFLD can be rapidly induced to the species by fasting. The mRNA expression was determined after overnight-7d of food deprivation and 28d of re-feeding and correlated to the liver fat %. The homologies between the determined carnivoran mRNA sequences and that of the domestic dog were 92.2-99.1%. As the mRNA expression was not affected by short-term fasting and did not correlate with the liver fat %, there seems to be no clear connection between adiponectin and the development of lipidosis in the American mink. In the future, the obtained sequences can be utilized in further studies of adiponectin expression in comparative endocrinology. Copyright (c) 2010 Elsevier Inc. All rights reserved.

  5. The RDE-10/RDE-11 complex triggers RNAi-induced mRNA degradation by association with target mRNA in C. elegans.

    Science.gov (United States)

    Yang, Huan; Zhang, Ying; Vallandingham, Jim; Li, Hua; Li, Hau; Florens, Laurence; Mak, Ho Yi

    2012-04-15

    The molecular mechanisms for target mRNA degradation in Caenorhabditis elegans undergoing RNAi are not fully understood. Using a combination of genetic, proteomic, and biochemical approaches, we report a divergent RDE-10/RDE-11 complex that is required for RNAi in C. elegans. Genetic analysis indicates that the RDE-10/RDE-11 complex acts in parallel to nuclear RNAi. Association of the complex with target mRNA is dependent on RDE-1 but not RRF-1, suggesting that target mRNA recognition depends on primary but not secondary siRNA. Furthermore, RDE-11 is required for mRNA degradation subsequent to target engagement. Deep sequencing reveals a fivefold decrease in secondary siRNA abundance in rde-10 and rde-11 mutant animals, while primary siRNA and microRNA biogenesis is normal. Therefore, the RDE-10/RDE-11 complex is critical for amplifying the exogenous RNAi response. Our work uncovers an essential output of the RNAi pathway in C. elegans.

  6. Cloning and expression of a cDNA coding for a human monocyte-derived plasminogen activator inhibitor.

    Science.gov (United States)

    Antalis, T M; Clark, M A; Barnes, T; Lehrbach, P R; Devine, P L; Schevzov, G; Goss, N H; Stephens, R W; Tolstoshev, P

    1988-02-01

    Human monocyte-derived plasminogen activator inhibitor (mPAI-2) was purified to homogeneity from the U937 cell line and partially sequenced. Oligonucleotide probes derived from this sequence were used to screen a cDNA library prepared from U937 cells. One positive clone was sequenced and contained most of the coding sequence as well as a long incomplete 3' untranslated region (1112 base pairs). This cDNA sequence was shown to encode mPAI-2 by hybrid-select translation. A cDNA clone encoding the remainder of the mPAI-2 mRNA was obtained by primer extension of U937 poly(A)+ RNA using a probe complementary to the mPAI-2 coding region. The coding sequence for mPAI-2 was placed under the control of the lambda PL promoter, and the protein expressed in Escherichia coli formed a complex with urokinase that could be detected immunologically. By nucleotide sequence analysis, mPAI-2 cDNA encodes a protein containing 415 amino acids with a predicted unglycosylated Mr of 46,543. The predicted amino acid sequence of mPAI-2 is very similar to placental PAI-2 (3 amino acid differences) and shows extensive homology with members of the serine protease inhibitor (serpin) superfamily. mPAI-2 was found to be more homologous to ovalbumin (37%) than the endothelial plasminogen activator inhibitor, PAI-1 (26%). Like ovalbumin, mPAI-2 appears to have no typical amino-terminal signal sequence. The 3' untranslated region of the mPAI-2 cDNA contains a putative regulatory sequence that has been associated with the inflammatory mediators.

  7. Code-Switching to Know a TL Equivalent of an L1 Word: Request-Provision-Acknowledgement (RPA) Sequence

    Science.gov (United States)

    Lucero, Edgar

    2011-01-01

    This article focuses on the learner's use of Code-switching to learn the TL (Target Language) equivalent of an L1 word. The interactional pattern that this situation creates defines the Request-Provision-Acknowledgement (RPA) sequence. The article explains each of the turns of the sequence under the combination of the Ethnomethodological…

  8. Sequence-based heuristics for faster annotation of non-coding RNA families.

    Science.gov (United States)

    Weinberg, Zasha; Ruzzo, Walter L

    2006-01-01

    Non-coding RNAs (ncRNAs) are functional RNA molecules that do not code for proteins. Covariance Models (CMs) are a useful statistical tool to find new members of an ncRNA gene family in a large genome database, using both sequence and, importantly, RNA secondary structure information. Unfortunately, CM searches are extremely slow. Previously, we created rigorous filters, which provably sacrifice none of a CM's accuracy, while making searches significantly faster for virtually all ncRNA families. However, these rigorous filters make searches slower than heuristics could be. In this paper we introduce profile HMM-based heuristic filters. We show that their accuracy is usually superior to heuristics based on BLAST. Moreover, we compared our heuristics with those used in tRNAscan-SE, whose heuristics incorporate a significant amount of work specific to tRNAs, where our heuristics are generic to any ncRNA. Performance was roughly comparable, so we expect that our heuristics provide a high-quality solution that--unlike family-specific solutions--can scale to hundreds of ncRNA families. The source code is available under GNU Public License at the supplementary web site.

  9. Detection of melatonin receptor mRNA in human muscle

    International Nuclear Information System (INIS)

    Li Lei

    2004-01-01

    To verify the expression of melatonin receptor mRNA in human, muscle, muscle beside vertebrae was collected to obtain total RNA and the mRNA of melatonin receptor was detected by RT-PCR method. The electrophoretic results of RT-PCR products by mt 1 and MT 2 primer were all positive and the sequence is corresponding with human melatonin receptor cDNA. It suggests that melatonin may act on the muscle beside vertebrae directly and regulate its growth and development. (authors)

  10. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting

    Science.gov (United States)

    Piazza, Carol Lyn; Smith, Dorie

    2018-01-01

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149

  11. Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

    Directory of Open Access Journals (Sweden)

    Pupko Tal

    2008-10-01

    Full Text Available Abstract Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.

  12. Accumulation of defence-related transcripts and cloning of a chitinase mRNA from pea leaves (Pisum sativum L.) inoculated with Ascochyta pisi Lib

    DEFF Research Database (Denmark)

    Vad, Knud; de Neergaard, Eigil; Madriz-Ordeñana, Kenneth

    1993-01-01

    The race specific resistance of pea to Ascochyta pisi Lib. was shown to be exhibited as a hypersensitive response associated with the production of polyphenolic substances in epidermal and mesophyll cells. The levels of transcripts representing a pathogenesis-related (PR) protein (chitinase......) and an enzyme of phytoalexin biosynthesis (chalcone synthase) were shown to accumulate more rapidly during the hypersensitive response than during lesion development in the compatible interaction. A full-length (1143 bp) cDNA sequence of a pea chitinase (EC 3.2.1.14) (coding for an approx. 34 500 Da protein......) was deduced by combining the overlapping sequences of three clones obtained following PCR amplification of cDNA prepared from mRNA isolated 24 h after inoculation of pea leaves with Ascochyta pisi. The combined sequences were identified as a class I chitinase corresponding to the basic A1-chitinase enzyme...

  13. Sequence Coding and Search System for licensee event reports: user's guide. Volume 1, Revision 1

    International Nuclear Information System (INIS)

    Greene, N.M.; Mays, G.T.; Johnson, M.P.

    1985-04-01

    Operating experience data from nuclear power plants are essential for safety and reliability analyses, especially analyses of trends and patterns. The licensee event reports (LERs) that are submitted to the Nuclear Regulatory Commission (NRC) by the nuclear power plant utilities contain much of this data. The NRC's Office for Analysis and Evaluation of Operational Data (AEOD) has developed, under contract with NSIC, a system for codifying the events reported in the LERs. The primary objective of the Sequence Coding and Search System (SCSS) is to reduce the descriptive text of the LERs to coded sequences that are both computer-readable and computer-searchable. This system provides a structured format for detailed coding of component, system, and unit effects as well as personnel errors. The database contains all current LERs submitted by nuclear power plant utilities for events occurring since 1981 and is updated on a continual basis. This four volume report documents and describes SCSS in detail. Volume 1 is a User's Guide for searching the SCSS database. This volume contains updated material through February 1985 of the working version of ORNL/NSIC-223, Vol. 1

  14. Changes in growth hormone (GH) messenger RNA (GH mRNA) expression in the rat anterior pituitary after single interferon (IFN) alpha administration

    International Nuclear Information System (INIS)

    Romanowski, W.; Braczkowski, R.; Nowakowska-Zajdel, E.; Muc-Wierzgon, M.; Zubelewicz-Szkodzinska, B.; Kosiewicz, J.; Korzonek, I.

    2006-01-01

    Introduction: Interferon a (IFN-a) is a cytokine with pleiotropic effects which, via different pathways, influences the secretion of certain cytokines and hormones. Growth hormone (GH) secreted from the pituitary has physiological effects on various target tissues. The question is how IFN-a administered in various types of disease influences GH secretion. This study investigated the acute effect of IFN-a on GH mRNA expression in the rat anterior pituitary. Objective: The aim of the study was to measure the cellular expression of GH mRNA by in situ hybridisation in the anterior pituitary after a single administration of IFN-a. Material and methods: Rats were administered an intraperitoneal injection of IFN-a or saline. The rat pituitaries were taken 2 and 4 hours after IFN/saline administration and kept frozen until in situ hybridisation histochemistry. A 31 - base 35S -labelled oligonucleotide probe complementary to part of the exonic mRNA sequence coding for GH mRNA was used. All control and experimental sections were hybridised in the same hybridisation reaction. Results: Acute administration of interferon a increased GH mRNA expression in the anterior pituitary in the 4-hour group in comparison with the control group, and there was no difference between the control group and the 2-hour rats. Conclusion: A single IFN-a administration was found to exert an influence on anterior pituitary GH mRNA expression. These observations may pave the way for presenting a possible new action of IFN-a. (author) GH mRNA, anterior pituitary, interferon

  15. cDNA cloning and nucleotide sequence comparison of Chinese hamster metallothionein I and II mRNAs

    Energy Technology Data Exchange (ETDEWEB)

    Griffith, B B; Walters, R A; Enger, M D; Hildebrand, C E; Griffith, J K

    1983-01-01

    Polyadenylated RNA was extracted from a cadmium resistant Chinese hamster (CHO) cell line, enriched for metal-induced, abundant RNA sequences and cloned as double-stranded cDNA in the plasmid pBR322. Two cDNA clones, pCHMT1 and pCHMT2, encoding two Chinese hamster isometallothioneins were identified, and the nucleotide sequence of each insert was determined. The two Chinese hamster metallothioneins show nucleotide sequence homologies of 80% in the protein coding region and approximately 35% in both the 5' and 3' untranslated regions. Interestingly, an 8 nucleotide sequence (TGTAAATA) has been conserved in sequence and position in the 3' untranslated regions of each metallothionein mRNA sequenced thus far. Estimated nucleotide substitution rates derived from interspecies comparisons were used to calculate a metallothionein gene duplication time of 45 to 120 million years ago. 39 references, 1 figure, 1 table.

  16. Nucleotide sequence of the melA gene, coding for alpha-galactosidase in Escherichia coli K-12.

    OpenAIRE

    Liljeström, P L; Liljeström, P

    1987-01-01

    Melibiose uptake and hydrolysis in E.coli is performed by the MelB and MelA proteins, respectively. We report the cloning and sequencing of the melA gene. The nucleotide sequence data showed that melA codes for a 450 amino acid long protein with a molecular weight of 50.6 kd. The sequence data also supported the assumption that the mel locus forms an operon with melA in proximal position. A comparison of MelA with alpha-galactosidase proteins from yeast and human origin showed that these prot...

  17. Complexity on Acute Myeloid Leukemia mRNA Transcript Variant

    Directory of Open Access Journals (Sweden)

    Carlo Cattani

    2011-01-01

    Full Text Available This paper deals with the sequence analysis of acute myeloid leukemia mRNA. Six transcript variants of mlf1 mRNA, with more than 2000 bps, are analyzed by focusing on the autocorrelation of each distribution. Through the correlation matrix, some patches and similarities are singled out and commented, with respect to similar distributions. The comparison of Kolmogorov fractal dimension will be also given in order to classify the six variants. The existence of a fractal shape, patterns, and symmetries are discussed as well.

  18. The structural analysis of protein sequences based on the quasi-amino acids code

    International Nuclear Information System (INIS)

    Ping, Zhu; Xu-Qing, Tang; Zhen-Yuan, Xu

    2009-01-01

    Proteomics is the study of proteins and their interactions in a cell. With the successful completion of the Human Genome Project, it comes the postgenome era when the proteomics technology is emerging. This paper studies protein molecule from the algebraic point of view. The algebraic system (Σ, +, *) is introduced, where Σ is the set of 64 codons. According to the characteristics of (Σ, +, *), a novel quasi-amino acids code classification method is introduced and the corresponding algebraic operation table over the set ZU of the 16 kinds of quasi-amino acids is established. The internal relation is revealed about quasi-amino acids. The results show that there exist some very close correlations between the properties of the quasi-amino acids and the codon. All these correlation relationships may play an important part in establishing the logic relationship between codons and the quasi-amino acids during the course of life origination. According to Ma F et al (2003 J. Anhui Agricultural University 30 439), the corresponding relation and the excellent properties about amino acids code are very difficult to observe. The present paper shows that (ZU, ⊕, ) is a field. Furthermore, the operational results display that the codon tga has different property from other stop codons. In fact, in the mitochondrion from human and ox genomic codon, tga is just tryptophane, is not the stop codon like in other genetic code, it is the case of the Chen W C et al (2002 Acta Biophysica Sinica 18(1) 87). The present theory avoids some inexplicable events of the 20 kinds of amino acids code, in other words it solves the problem of 'the 64 codon assignments of mRNA to amino acids is probably completely wrong' proposed by Yang (2006 Progress in Modern Biomedicine 6 3). (cross-disciplinary physics and related areas of science and technology)

  19. Hybridization-based reconstruction of small non-coding RNA transcripts from deep sequencing data.

    Science.gov (United States)

    Ragan, Chikako; Mowry, Bryan J; Bauer, Denis C

    2012-09-01

    Recent advances in RNA sequencing technology (RNA-Seq) enables comprehensive profiling of RNAs by producing millions of short sequence reads from size-fractionated RNA libraries. Although conventional tools for detecting and distinguishing non-coding RNAs (ncRNAs) from reference-genome data can be applied to sequence data, ncRNA detection can be improved by harnessing the full information content provided by this new technology. Here we present NorahDesk, the first unbiased and universally applicable method for small ncRNAs detection from RNA-Seq data. NorahDesk utilizes the coverage-distribution of small RNA sequence data as well as thermodynamic assessments of secondary structure to reliably predict and annotate ncRNA classes. Using publicly available mouse sequence data from brain, skeletal muscle, testis and ovary, we evaluated our method with an emphasis on the performance for microRNAs (miRNAs) and piwi-interacting small RNA (piRNA). We compared our method with Dario and mirDeep2 and found that NorahDesk produces longer transcripts with higher read coverage. This feature makes it the first method particularly suitable for the prediction of both known and novel piRNAs.

  20. Sequence-engineered mRNA Without Chemical Nucleoside Modifications Enables an Effective Protein Therapy in Large Animals

    OpenAIRE

    Thess, Andreas; Grund, Stefanie; Mui, Barbara L; Hope, Michael J; Baumhof, Patrick; Fotin-Mleczek, Mariola; Schlake, Thomas

    2015-01-01

    Being a transient carrier of genetic information, mRNA could be a versatile, flexible, and safe means for protein therapies. While recent findings highlight the enormous therapeutic potential of mRNA, evidence that mRNA-based protein therapies are feasible beyond small animals such as mice is still lacking. Previous studies imply that mRNA therapeutics require chemical nucleoside modifications to obtain sufficient protein expression and avoid activation of the innate immune system. Here we sh...

  1. CYP3A5 mRNA degradation by nonsense-mediated mRNA decay.

    Science.gov (United States)

    Busi, Florent; Cresteil, Thierry

    2005-09-01

    The total CYP3A5 mRNA level is significantly greater in carriers of the CYP3A5*1 allele than in CYP3A5*3 homozygotes. Most of the CYP3A5*3 mRNA includes an intronic sequence (exon 3B) containing premature termination codons (PTCs) between exons 3 and 4. Two models were used to investigate the degradation of CYP3A5 mRNA: a CYP3A5 minigene consisting of CYP3A5 exons and introns 3 to 6 transfected into MCF7 cells, and the endogenous CYP3A5 gene expressed in HepG2 cells. The 3'-untranslated region g.31611C>T mutation has no effect on CYP3A5 mRNA decay. Splice variants containing exon 3B were more unstable than wild-type (wt) CYP3A5 mRNA. Cycloheximide prevents the recognition of PTCs by ribosomes: in transfected MCF7 and HepG2 cells, cycloheximide slowed down the degradation of exon 3B-containing splice variants, suggesting the participation of nonsense-mediated decay (NMD). When PTCs were removed from pseudoexon 3B or when UPF1 small interfering RNA was used to impair the NMD mechanism, the decay of the splice variant was reduced, confirming the involvement of NMD in the degradation of CYP3A5 splice variants. Induction could represent a source of variability for CYP3A5 expression and could modify the proportion of splice variants. The extent of CYP3A5 induction was investigated after exposure to barbiturates or steroids: CYP3A4 was markedly induced in a pediatric population compared with untreated neonates. However, no effect could be detected in either the total CYP3A5 RNA, the proportion of splice variant RNA, or the protein level. Therefore, in these carriers, induction is unlikely to switch on the phenotypic CYP3A5 expression in carriers of CYP3A5*3/*3.

  2. Amino-terminal domain of the v-fms oncogene product includes a functional signal peptide that directs synthesis of a transforming glycoprotein in the absence of feline leukemia virus gag sequences

    International Nuclear Information System (INIS)

    Wheeler, E.F.; Roussel, M.F.; Hampe, A.; Walker, M.H.; Fried, V.A.; Look, A.T.; Rettenmier, C.W.; Sherr, C.J.

    1986-01-01

    The nucleotide sequence of a 5' segment of the human genomic c-fms proto-oncogene suggested that recombination between feline leukemia virus and feline c-fms sequences might have occurred in a region encoding the 5' untranslated portion of c-fms mRNA. The polyprotein precursor gP180/sup gag-fms/ encoded by the McDonough strain of feline sarcoma virus was therefore predicted to contain 34 v-fms-coded amino acids derived from sequences of the c-fms gene that are not ordinarily translated from the proto-oncogene mRNA. The (gP180/sup gag-fms/) polyprotein was cotranslationally cleaved near the gag-fms junction to remove its gag gene-coded portion. Determination of the amino-terminal sequence of the resulting v-fms-coded glycoprotein, gp120/sup v-fms/, showed that the site of proteolysis corresponded to a predicted signal peptidase cleavage site within the c-fms gene product. Together, these analyses suggested that the linked gag sequences may not be necessary for expression of a biologically active v-fms gene product. The gag-fms sequences of feline sarcoma virus strain McDonough and the v-fms sequences alone were inserted into a murine retroviral vector containing a neomycin resistance gene. The authors conclude that a cryptic hydrophobic signal peptide sequence in v-fms was unmasked by gag deletion, thereby allowing the correct orientation and transport of the v-fms was unmasked by gag deletion, thereby allowing the correct orientation and transport of the v-fms gene product within membranous organelles. It seems likely that the proteolytic cleavage of gP180/gag-fms/ is mediated by signal peptidase and that the amino termini of gp140/sup v-fms/ and the c-fms gene product are identical

  3. Amino-terminal domain of the v-fms oncogene product includes a functional signal peptide that directs synthesis of a transforming glycoprotein in the absence of feline leukemia virus gag sequences

    Energy Technology Data Exchange (ETDEWEB)

    Wheeler, E.F.; Roussel, M.F.; Hampe, A.; Walker, M.H.; Fried, V.A.; Look, A.T.; Rettenmier, C.W.; Sherr, C.J.

    1986-08-01

    The nucleotide sequence of a 5' segment of the human genomic c-fms proto-oncogene suggested that recombination between feline leukemia virus and feline c-fms sequences might have occurred in a region encoding the 5' untranslated portion of c-fms mRNA. The polyprotein precursor gP180/sup gag-fms/ encoded by the McDonough strain of feline sarcoma virus was therefore predicted to contain 34 v-fms-coded amino acids derived from sequences of the c-fms gene that are not ordinarily translated from the proto-oncogene mRNA. The (gP180/sup gag-fms/) polyprotein was cotranslationally cleaved near the gag-fms junction to remove its gag gene-coded portion. Determination of the amino-terminal sequence of the resulting v-fms-coded glycoprotein, gp120/sup v-fms/, showed that the site of proteolysis corresponded to a predicted signal peptidase cleavage site within the c-fms gene product. Together, these analyses suggested that the linked gag sequences may not be necessary for expression of a biologically active v-fms gene product. The gag-fms sequences of feline sarcoma virus strain McDonough and the v-fms sequences alone were inserted into a murine retroviral vector containing a neomycin resistance gene. The authors conclude that a cryptic hydrophobic signal peptide sequence in v-fms was unmasked by gag deletion, thereby allowing the correct orientation and transport of the v-fms was unmasked by gag deletion, thereby allowing the correct orientation and transport of the v-fms gene product within membranous organelles. It seems likely that the proteolytic cleavage of gP180/gag-fms/ is mediated by signal peptidase and that the amino termini of gp140/sup v-fms/ and the c-fms gene product are identical.

  4. Tissue-specific expression and regulation by 1,25(OH)2D3 of chick protein kinase inhibitor (PKI) mRNA.

    Science.gov (United States)

    Marchetto, G S; Henry, H L

    1997-02-01

    The heat-stable protein kinase inhibitor (PKI) protein is a specific and potent competitive inhibitor of the catalytic subunit of cAMP-dependent protein kinase (PKA). Previously, it has been shown that vitamin D status affects chick kidney PKI activity: a 5- to 10-fold increase in PKI activity was observed in kidneys of chronically vitamin D-deficient chicks and treatment with 1,25-dihydroxyvitamin D3 (1,25[OH]2D3) in cultured kidney cells resulted in a 95% decrease in PKI activity. The authors have recently cloned the cDNA for chick kidney PKI and have used the coding sequence to study the regulation of PKI mRNA. Northern analysis showed the expression of two PKI messages, which are 2.7 and 3.3 kb in size. These mRNAs are expressed in brain, muscle, testis, and kidney, but not in pancreas, liver, or intestine. PKI mRNA steady-state levels are downregulated by 47% in kidneys from vitamin D-replete chicks as compared to vitamin D-deficient chicks. PKI mRNA levels in brain, muscle, and testis are not affected by vitamin D status. Treatment of primary chick kidney cultures treated with 10(-7) M 1,25(OH)2D3 for 24h resulted in a 20-30% decrease in PKI mRNA. 1,25(OH)2D3 treatment does not affect the stability of PKI mRNA as determined by treatment of cell cultures with actinomycin D. This study shows that 1,25(OH)2D3 directly and tissue-specifically downregulates PKI mRNA in the chick kidney.

  5. ICRPfinder: a fast pattern design algorithm for coding sequences and its application in finding potential restriction enzyme recognition sites

    Directory of Open Access Journals (Sweden)

    Stafford Phillip

    2009-09-01

    Full Text Available Abstract Background Restriction enzymes can produce easily definable segments from DNA sequences by using a variety of cut patterns. There are, however, no software tools that can aid in gene building -- that is, modifying wild-type DNA sequences to express the same wild-type amino acid sequences but with enhanced codons, specific cut sites, unique post-translational modifications, and other engineered-in components for recombinant applications. A fast DNA pattern design algorithm, ICRPfinder, is provided in this paper and applied to find or create potential recognition sites in target coding sequences. Results ICRPfinder is applied to find or create restriction enzyme recognition sites by introducing silent mutations. The algorithm is shown capable of mapping existing cut-sites but importantly it also can generate specified new unique cut-sites within a specified region that are guaranteed not to be present elsewhere in the DNA sequence. Conclusion ICRPfinder is a powerful tool for finding or creating specific DNA patterns in a given target coding sequence. ICRPfinder finds or creates patterns, which can include restriction enzyme recognition sites, without changing the translated protein sequence. ICRPfinder is a browser-based JavaScript application and it can run on any platform, in on-line or off-line mode.

  6. Poly A tail length analysis of in vitro transcribed mRNA by LC-MS.

    Science.gov (United States)

    Beverly, Michael; Hagen, Caitlin; Slack, Olga

    2018-02-01

    The 3'-polyadenosine (poly A) tail of in vitro transcribed (IVT) mRNA was studied using liquid chromatography coupled to mass spectrometry (LC-MS). Poly A tails were cleaved from the mRNA using ribonuclease T1 followed by isolation with dT magnetic beads. Extracted tails were then analyzed by LC-MS which provided tail length information at single-nucleotide resolution. A 2100-nt mRNA with plasmid-encoded poly A tail lengths of either 27, 64, 100, or 117 nucleotides was used for these studies as enzymatically added poly A tails showed significant length heterogeneity. The number of As observed in the tails closely matched Sanger sequencing results of the DNA template, and even minor plasmid populations with sequence variations were detected. When the plasmid sequence contained a discreet number of poly As in the tail, analysis revealed a distribution that included tails longer than the encoded tail lengths. These observations were consistent with transcriptional slippage of T7 RNAP taking place within a poly A sequence. The type of RNAP did not alter the observed tail distribution, and comparison of T3, T7, and SP6 showed all three RNAPs produced equivalent tail length distributions. The addition of a sequence at the 3' end of the poly A tail did, however, produce narrower tail length distributions which supports a previously described model of slippage where the 3' end can be locked in place by having a G or C after the poly nucleotide region. Graphical abstract Determination of mRNA poly A tail length using magnetic beads and LC-MS.

  7. Primary structure of human pancreatic protease E determined by sequence analysis of the cloned mRNA

    International Nuclear Information System (INIS)

    Shen, W.; Fletcher, T.S.; Largman, C.

    1987-01-01

    Although protease E was isolated from human pancreas over 10 years ago, its amino acid sequence and relationship to the elastases have not been established. The authors report the isolation of a cDNA clone for human pancreatic protease E and determination of the nucleic acid sequence coding for the protein. The deduced amino acid sequence contains all of the features common to serine proteases. The substrate binding region is highly homologous to those of porcine and rat elastases 1, explaining the similar specificity for alanine reported for protease E and these elastases. However, the amino acid sequence outside the substrate binding region is less than 50% conserved, and there is a striking difference in the overall net charge for protease E (6-) and elastases 1 (8+). These findings confirm that protease E is a new member of the serine protease family. They have attempted to identify amino acid residues important for the interaction between elastases and elastin by examining the amino acid sequence differences between elastases and protease E. In addition to the large number of surface charge changes which are outside the substrate binding region, there are several changes which might be crucial for elastolysis: Leu-73/Arg-73; Arg-217A/Ala-217A; Arg-65A/Gln-65A; and the presence of two new cysteine residues (Cys-98 and Cys-99B) which computer modeling studies predict could form a new disulfide bond, not previously observed for serine proteases. They also present evidence which suggests that human pancreas does not synthesize a basic, alanine-specific elastase similar to porcine elastase 1

  8. CodonLogo: a sequence logo-based viewer for codon patterns.

    Science.gov (United States)

    Sharma, Virag; Murphy, David P; Provan, Gregory; Baranov, Pavel V

    2012-07-15

    Conserved patterns across a multiple sequence alignment can be visualized by generating sequence logos. Sequence logos show each column in the alignment as stacks of symbol(s) where the height of a stack is proportional to its informational content, whereas the height of each symbol within the stack is proportional to its frequency in the column. Sequence logos use symbols of either nucleotide or amino acid alphabets. However, certain regulatory signals in messenger RNA (mRNA) act as combinations of codons. Yet no tool is available for visualization of conserved codon patterns. We present the first application which allows visualization of conserved regions in a multiple sequence alignment in the context of codons. CodonLogo is based on WebLogo3 and uses the same heuristics but treats codons as inseparable units of a 64-letter alphabet. CodonLogo can discriminate patterns of codon conservation from patterns of nucleotide conservation that appear indistinguishable in standard sequence logos. The CodonLogo source code and its implementation (in a local version of the Galaxy Browser) are available at http://recode.ucc.ie/CodonLogo and through the Galaxy Tool Shed at http://toolshed.g2.bx.psu.edu/.

  9. Complete coding sequence of the human raf oncogene and the corresponding structure of the c-raf-1 gene

    Energy Technology Data Exchange (ETDEWEB)

    Bonner, T I; Oppermann, H; Seeburg, P; Kerby, S B; Gunnell, M A; Young, A C; Rapp, U R

    1986-01-24

    The complete 648 amino acid sequence of the human raf oncogene was deduced from the 2977 nucleotide sequence of a fetal liver cDNA. The cDNA has been used to obtain clones which extend the human c-raf-1 locus by an additional 18.9 kb at the 5' end and contain all the remaining coding exons.

  10. Allele-Selective Transcriptome Recruitment to Polysomes Primed for Translation: Protein-Coding and Noncoding RNAs, and RNA Isoforms.

    Directory of Open Access Journals (Sweden)

    Roshan Mascarenhas

    Full Text Available mRNA translation into proteins is highly regulated, but the role of mRNA isoforms, noncoding RNAs (ncRNAs, and genetic variants remains poorly understood. mRNA levels on polysomes have been shown to correlate well with expressed protein levels, pointing to polysomal loading as a critical factor. To study regulation and genetic factors of protein translation we measured levels and allelic ratios of mRNAs and ncRNAs (including microRNAs in lymphoblast cell lines (LCL and in polysomal fractions. We first used targeted assays to measure polysomal loading of mRNA alleles, confirming reported genetic effects on translation of OPRM1 and NAT1, and detecting no effect of rs1045642 (3435C>T in ABCB1 (MDR1 on polysomal loading while supporting previous results showing increased mRNA turnover of the 3435T allele. Use of high-throughput sequencing of complete transcript profiles (RNA-Seq in three LCLs revealed significant differences in polysomal loading of individual RNA classes and isoforms. Correlated polysomal distribution between protein-coding and non-coding RNAs suggests interactions between them. Allele-selective polysome recruitment revealed strong genetic influence for multiple RNAs, attributable either to differential expression of RNA isoforms or to differential loading onto polysomes, the latter defining a direct genetic effect on translation. Genes identified by different allelic RNA ratios between cytosol and polysomes were enriched with published expression quantitative trait loci (eQTLs affecting RNA functions, and associations with clinical phenotypes. Polysomal RNA-Seq combined with allelic ratio analysis provides a powerful approach to study polysomal RNA recruitment and regulatory variants affecting protein translation.

  11. The Number, Organization, and Size of Polymorphic Membrane Protein Coding Sequences as well as the Most Conserved Pmp Protein Differ within and across Chlamydia Species.

    Science.gov (United States)

    Van Lent, Sarah; Creasy, Heather Huot; Myers, Garry S A; Vanrompay, Daisy

    2016-01-01

    Variation is a central trait of the polymorphic membrane protein (Pmp) family. The number of pmp coding sequences differs between Chlamydia species, but it is unknown whether the number of pmp coding sequences is constant within a Chlamydia species. The level of conservation of the Pmp proteins has previously only been determined for Chlamydia trachomatis. As different Pmp proteins might be indispensible for the pathogenesis of different Chlamydia species, this study investigated the conservation of Pmp proteins both within and across C. trachomatis,C. pneumoniae,C. abortus, and C. psittaci. The pmp coding sequences were annotated in 16 C. trachomatis, 6 C. pneumoniae, 2 C. abortus, and 16 C. psittaci genomes. The number and organization of polymorphic membrane coding sequences differed within and across the analyzed Chlamydia species. The length of coding sequences of pmpA,pmpB, and pmpH was conserved among all analyzed genomes, while the length of pmpE/F and pmpG, and remarkably also of the subtype pmpD, differed among the analyzed genomes. PmpD, PmpA, PmpH, and PmpA were the most conserved Pmp in C. trachomatis,C. pneumoniae,C. abortus, and C. psittaci, respectively. PmpB was the most conserved Pmp across the 4 analyzed Chlamydia species. © 2016 S. Karger AG, Basel.

  12. Characterization of X chromosome inactivation using integrated analysis of whole-exome and mRNA sequencing.

    Directory of Open Access Journals (Sweden)

    Szabolcs Szelinger

    Full Text Available In females, X chromosome inactivation (XCI is an epigenetic, gene dosage compensatory mechanism by inactivation of one copy of X in cells. Random XCI of one of the parental chromosomes results in an approximately equal proportion of cells expressing alleles from either the maternally or paternally inherited active X, and is defined by the XCI ratio. Skewed XCI ratio is suggestive of non-random inactivation, which can play an important role in X-linked genetic conditions. Current methods rely on indirect, semi-quantitative DNA methylation-based assay to estimate XCI ratio. Here we report a direct approach to estimate XCI ratio by integrated, family-trio based whole-exome and mRNA sequencing using phase-by-transmission of alleles coupled with allele-specific expression analysis. We applied this method to in silico data and to a clinical patient with mild cognitive impairment but no clear diagnosis or understanding molecular mechanism underlying the phenotype. Simulation showed that phased and unphased heterozygous allele expression can be used to estimate XCI ratio. Segregation analysis of the patient's exome uncovered a de novo, interstitial, 1.7 Mb deletion on Xp22.31 that originated on the paternally inherited X and previously been associated with heterogeneous, neurological phenotype. Phased, allelic expression data suggested an 83∶20 moderately skewed XCI that favored the expression of the maternally inherited, cytogenetically normal X and suggested that the deleterious affect of the de novo event on the paternal copy may be offset by skewed XCI that favors expression of the wild-type X. This study shows the utility of integrated sequencing approach in XCI ratio estimation.

  13. Simultaneous sequencing of coding and noncoding RNA reveals a human transcriptome dominated by a small number of highly expressed noncoding genes.

    Science.gov (United States)

    Boivin, Vincent; Deschamps-Francoeur, Gabrielle; Couture, Sonia; Nottingham, Ryan M; Bouchard-Bourelle, Philia; Lambowitz, Alan M; Scott, Michelle S; Abou-Elela, Sherif

    2018-07-01

    Comparing the abundance of one RNA molecule to another is crucial for understanding cellular functions but most sequencing techniques can target only specific subsets of RNA. In this study, we used a new fragmented ribodepleted TGIRT sequencing method that uses a thermostable group II intron reverse transcriptase (TGIRT) to generate a portrait of the human transcriptome depicting the quantitative relationship of all classes of nonribosomal RNA longer than 60 nt. Comparison between different sequencing methods indicated that FRT is more accurate in ranking both mRNA and noncoding RNA than viral reverse transcriptase-based sequencing methods, even those that specifically target these species. Measurements of RNA abundance in different cell lines using this method correlate with biochemical estimates, confirming tRNA as the most abundant nonribosomal RNA biotype. However, the single most abundant transcript is 7SL RNA, a component of the signal recognition particle. S tructured n on c oding RNAs (sncRNAs) associated with the same biological process are expressed at similar levels, with the exception of RNAs with multiple functions like U1 snRNA. In general, sncRNAs forming RNPs are hundreds to thousands of times more abundant than their mRNA counterparts. Surprisingly, only 50 sncRNA genes produce half of the non-rRNA transcripts detected in two different cell lines. Together the results indicate that the human transcriptome is dominated by a small number of highly expressed sncRNAs specializing in functions related to translation and splicing. © 2018 Boivin et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  14. Clinical significance of LUNX mRNA, CK19 mRNA, CEA mRNA expression in detecting micrometastasis from lung cancer

    International Nuclear Information System (INIS)

    Zhu Guangying; Liu Delin; Chen Jie

    2003-01-01

    Objective: To evaluate the sensitivity, specificity and clinical significance of CK19 mRNA, CEA mRNA and LUNX mRNA for detecting micrometastasis by sampling the peripheral blood and regional lymph nodes of lung cancer patients. Methods: Reverse transcriptase chain reaction (RT-PCR) was used to detect LUNX mRNA, CK19 mRNA, CEA mRNA for micrometastasis by sampling the peripheral blood of 48 lung cancer patients and 44 regional lymph nodes of such patients treated by curative resection. Peripheral blood of 30 patients with pulmonary benign lesions and 10 normal healthy volunteers and lymph nodes of 6 patients with benign pulmonary diseases served as control. Results: 1) LUNX mRNA, CK19 mRNA, CEA mRNA were expressed in all (35/35) lung cancer tissues. 2) In the peripheral blood from 48 lung cancer patients, 30 (62.5%) were positive for LUNX mRNA, 24 (50.0%) positive for CK19 mRNA and 32(66.7%) positive for CEA mRNA. The positive detection rates of micrometastasis in 44 lymph nodes from lung cancer patients were 36.4% (16 out of 44) for LUNX mRNA, 27.3% (12 out of 44) for CK19 mRNA and 40.9% (18 out of 44) for CEA mRNA. 3) In the 30 blood samples from patients with pulmonary benign diseases, 2 (6.7%) expressed CK19 mRNA, but none expressed LUNX mRNA or CEA mRNA. All the 3 molecular markers were negative in the 10 blood samples from healthy volunteers. In 11 lymph nodes from patients with pulmonary benign lesions, none was positive for any of the three markers. 4) In 44 regional lymph nodes from lung cancer patients, 6 (13.6%) were positive for metastasis by histopathological examination, with a positive rate significantly lower than that of the RT-PCR (P<0.05). 5) The micrometastatic positive rate in the peripheral blood of 40 non-small cell lung cancer (NSCLC) patients was significantly related to TNM stage (P=0.01). Conclusions: LUNX mRNA, CK19 MRNA, CEA mRNA are all appropriate target genes for the detection of micrometastasis from lung cancer. LUNX mRNA and CEA mRNA

  15. Identifying mRNA targets of microRNA dysregulated in cancer: with application to clear cell Renal Cell Carcinoma

    Directory of Open Access Journals (Sweden)

    Liou Louis S

    2010-04-01

    Full Text Available Abstract Background MicroRNA regulate mRNA levels in a tissue specific way, either by inducing degradation of the transcript or by inhibiting translation or transcription. Putative mRNA targets of microRNA identified from seed sequence matches are available in many databases. However, such matches have a high false positive rate and cannot identify tissue specificity of regulation. Results We describe a simple method to identify direct mRNA targets of microRNA dysregulated in cancers from expression level measurements in patient matched tumor/normal samples. The word "direct" is used here in a strict sense to: a represent mRNA which have an exact seed sequence match to the microRNA in their 3'UTR, b the seed sequence match is strictly conserved across mouse, human, rat and dog genomes, c the mRNA and microRNA expression levels can distinguish tumor from normal with high significance and d the microRNA/mRNA expression levels are strongly and significantly anti-correlated in tumor and/or normal samples. We apply and validate the method using clear cell Renal Cell Carcinoma (ccRCC and matched normal kidney samples, limiting our analysis to mRNA targets which undergo degradation of the mRNA transcript because of a perfect seed sequence match. Dysregulated microRNA and mRNA are first identified by comparing their expression levels in tumor vs normal samples. Putative dysregulated microRNA/mRNA pairs are identified from these using seed sequence matches, requiring that the seed sequence be conserved in human/dog/rat/mouse genomes. These are further pruned by requiring a strong anti-correlation signature in tumor and/or normal samples. The method revealed many new regulations in ccRCC. For instance, loss of miR-149, miR-200c and mir-141 causes gain of function of oncogenes (KCNMA1, LOX, VEGFA and SEMA6A respectively and increased levels of miR-142-3p, miR-185, mir-34a, miR-224, miR-21 cause loss of function of tumor suppressors LRRC2, PTPN13, SFRP1

  16. Cloning and mRNA expression pattern analysis under low ...

    African Journals Online (AJOL)

    This research cloned endochitinase-antifreeze protein precursor (EAPP) gene of Dong-mu 70 rye (Secale cereale) by designing special primers according to Genbank's EAPP gene sequence, and analyzing the influence of low temperature stress on the expression of mRNA with RT-PCR. The results indicated that the ...

  17. Creatine kinase and alpha-actin mRNA levels decrease in diabetic rat hearts

    International Nuclear Information System (INIS)

    Popovich, B.; Barrieux, A.; Dillmann, W.H.

    1987-01-01

    Diabetic cardiomyopathy is associated with cardiac atrophy and isoenzyme redistribution. To determine if tissue specific changes occur in mRNAs coding for α-actin and creatine kinase (CK), they performed RNA blot analysis. Total ventricular RNA from control (C) and 4 wk old diabetic (D) rats were hybridized with 32 P cDNA probes for α-actin and CK. A tissue independent cDNA probe, CHOA was also used. Signal intensity was quantified by photodensitometry. D CK mRNA was 47 +/- 16% lower in D vs C. Insulin increases CK mRNA by 20% at 1.5 hs, and completely reverses the deficit after 4 wks. D α-actin mRNA is 66 +/- 18% lower in D vs C. Insulin normalized α-actin mRNA by 5 hs. CHOA mRNA is unchanged in D vs C, but D + insulin CHOA mRNA is 30 +/- 2% lower than C. In rats with diabetic cardiomyopathy, muscle specific CK and α-actin mRNAs are decreased. Insulin treatment reverses these changes

  18. 2'-O-methylation in mRNA disrupts tRNA decoding during translation elongation.

    Science.gov (United States)

    Choi, Junhong; Indrisiunaite, Gabriele; DeMirci, Hasan; Ieong, Ka-Weng; Wang, Jinfan; Petrov, Alexey; Prabhakar, Arjun; Rechavi, Gideon; Dominissini, Dan; He, Chuan; Ehrenberg, Måns; Puglisi, Joseph D

    2018-03-01

    Chemical modifications of mRNA may regulate many aspects of mRNA processing and protein synthesis. Recently, 2'-O-methylation of nucleotides was identified as a frequent modification in translated regions of human mRNA, showing enrichment in codons for certain amino acids. Here, using single-molecule, bulk kinetics and structural methods, we show that 2'-O-methylation within coding regions of mRNA disrupts key steps in codon reading during cognate tRNA selection. Our results suggest that 2'-O-methylation sterically perturbs interactions of ribosomal-monitoring bases (G530, A1492 and A1493) with cognate codon-anticodon helices, thereby inhibiting downstream GTP hydrolysis by elongation factor Tu (EF-Tu) and A-site tRNA accommodation, leading to excessive rejection of cognate aminoacylated tRNAs in initial selection and proofreading. Our current and prior findings highlight how chemical modifications of mRNA tune the dynamics of protein synthesis at different steps of translation elongation.

  19. Quantitative PCR--new diagnostic tool for quantifying specific mRNA and DNA molecules

    DEFF Research Database (Denmark)

    Schlemmer, B O; Sorensen, B S; Overgaard, J

    2004-01-01

    of a subset of ligands from the EGF system is increased in bladder cancer. Furthermore, measurement of the mRNA concentration gives important information such as the expression of these ligands correlated to the survival of the patients. In addition to the alterations at the mRNA level, changes also can occur...... at the DNA level in the EGF system. Thus, it has been demonstrated that the number of genes coding for the human epidermal growth factor receptor 2 (HER2) is increased in a number of breast tumors. It is now possible to treat breast cancer patients with a humanized antibody reacting with HER2...... of mRNA or DNA in biological samples. In this study quantitative PCR was used to investigate the role of the EGF (epidermal growth factor) system in cancer both for measurements of mRNA concentrations and for measurements of the number of copies of specific genes. It is shown that the mRNA expression...

  20. Application of the verona coding definitions of emotional sequences (VR-CoDES) on a pediatric data set.

    Science.gov (United States)

    Vatne, Torun M; Finset, Arnstein; Ørnes, Knut; Ruland, Cornelia M

    2010-09-01

    Adult patients present concerns as defined in the Verona Coding Definitions of Emotional Sequences (VR-CoDES), but we do not know how children express their concerns during medical consultations. This study aimed to evaluate the applicability of VR-CoDES to pediatric oncology consultations. Twenty-eight pediatric consultations were coded with the Verona Coding Definitions of Emotional Sequences (VR-CoDES), and the material was also qualitatively analyzed for descriptive purposes. Five consultations were randomly selected for reliability testing and descriptive statistics were computed. Perfect inter-rater reliability for concerns and moderate reliability for cues were obtained. Cues and/or concerns were present in over half of the consultations. Cues were more frequent than concerns, with the majority of cues being verbal hints to hidden concerns or non-verbal cues. Intensity of expressions, limitations in vocabulary, commonality of statements, and complexity of the setting complicated the use of VR-CoDES. Child-specific cues; use of the imperative, cues about past experiences, and use of onomatopoeia were observed. Children with cancer express concerns during medical consultations. VR-CoDES is a reliable tool for coding concerns in pediatric data sets. For future applications in pediatric settings an appendix should be developed to incorporate the child-specific traits. Copyright (c) 2010 Elsevier Ireland Ltd. All rights reserved.

  1. Sequence and RT-PCR expression analysis of two peroxidases from Arabidopsis thaliana belonging to a novel evolutionary branch of plant peroxidases.

    Science.gov (United States)

    Kjaersgård, I V; Jespersen, H M; Rasmussen, S K; Welinder, K G

    1997-03-01

    cDNA clones encoding two new Arabidopsis thaliana peroxidases, ATP 1a and ATP 2a, have been identified by searching the Arabidopsis database of expressed sequence tags (dbEST). They represent a novel branch of hitherto uncharacterized plant peroxidases which is only 35% identical in amino acid sequence to the well characterized group of basic plant peroxidases represented by the horseradish (Armoracia rusticana) isoperoxidases HRP C, HRP E5 and the similar Arabidopsis isoperoxidases ATP Ca, ATP Cb, and ATP Ea. However ATP 1a is 87% identical in amino acid sequence to a peroxidase encoded by an mRNA isolated from cotton (Gossypium hirsutum). As cotton and Arabidopsis belong to rather diverse families (Malvaceae and Crucifereae, respectively), in contrast with Arabidopsis and horseradish (both Crucifereae), the high degree of sequence identity indicates that this novel type of peroxidase, albeit of unknown function, is likely to be widespread in plant species. The atp 1 and atp 2 types of cDNA sequences were the most redundant among the 28 different isoperoxidases identified among about 200 peroxidase encoding ESTs. Interestingly, 8 out of totally 38 EST sequences coding for ATP 1 showed three identical nucleotide substitutions. This variant form is designated ATP 1b. Similarly, six out of totally 16 EST sequences coding for ATP 2 showed a number of deletions and nucleotide changes. This variant form is designated ATP 2b. The selected EST clones are full-length and contain coding regions of 993 nucleotides for atp 1a, and 984 nucleotides for atp 2a. These regions show 61% DNA sequence identity. The predicted mature proteins ATP 1a, and ATP 2a are 57% identical in sequence and contain the structurally and functionally important residues, characteristic of the plant peroxidase superfamily. However, they do show two differences of importance to peroxidase catalysis: (1) the asparagine residue linked with the active site distal histidine via hydrogen bonding is absent

  2. A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

    Directory of Open Access Journals (Sweden)

    Glass John I

    2010-07-01

    repeat may be disseminated by HGT and intra-genomic shuffling. Conclusions We describe novel features of PARCELs (Palindromic Amphipathic Repeat Coding ELements, a set of widely distributed repeat protein domains and coding sequences that were likely acquired through HGT by diverse unicellular microbes, further mobilized and diversified within genomes, and co-opted for expression in the membrane proteome of some taxa. Disseminated by multiple gene-centric vehicles, ORFs harboring these elements enhance accessory gene pools as part of the "mobilome" connecting genomes of various clades, in taxa sharing common niches.

  3. Isolation and characterization of human glycophorin A cDNAs using a synthetic oligonucleotide approach: nucleotide sequence, mRNA structure and regulation by 12-O-tetradecanoylphorbol 13-acetate (TPA)

    International Nuclear Information System (INIS)

    Siebert, P.D.; Fukuda, M.

    1986-01-01

    The authors have previously shown that treatment of human erythroleukemic K562 cells with the tumor-promoting phorbol ester, TPA, results in a diminished expression of glycophorin A at the level of protein biosynthesis and in vitro mRNA translation activity. To further examine the structure, relationships and expression of human glycophorins they have successfully isolated and sequenced several glycophorin A specific cDNA clones derived from K562 cells, by making extensive use of mixed and exact synthetic oligonucleotides as primers and radioactively labeled probes. The nucleotide sequence obtained from the largest glycophorin A cDNA suggests the presence of a hydrophobic leader-like peptide of at least 19 amino acids. Northern gel analysis using both whole cDNA-plasmid and synthetic oligonucleotide probes revealed the existence of multiple mRNAs, three of which they believe to be glycophorin A-specific, whereas a fourth and smaller mRNA appears to be glycophorin B-specific. Furthermore, the abundance of all four glycophorin mRNAs were found to be extensively reduced following treatment of K562 cells with TPA suggesting coordinate regulation, possibly at the level of gene transcription

  4. Excision of a viral reprogramming cassette by delivery of synthetic Cre mRNA

    Science.gov (United States)

    Loh, Yuin-Han; Yang, Jimmy Chen; De Los Angeles, Alejandro; Guo, Chunguang; Cherry, Anne; Rossi, Derrick J.; Park, In-Hyun; Daley, George Q.

    2012-01-01

    The generation of patient-specific induced pluripotent stem (iPS) cells provides an invaluable resource for cell therapy, in vitro modeling of human disease, and drug screening. To date, most human iPS cells have been generated with integrating retro- and lenti-viruses and are limited in their potential utility because residual transgene expression may alter their differentiation potential or induce malignant transformation. Alternatively, transgene-free methods using adenovirus and protein transduction are limited by low efficiency. This report describes a protocol for the generation of transgene-free human induced pluripotent stem cells using retroviral transfection of a single vector, which includes the coding sequences of human OCT4, SOX2, KLF4, and cMYC linked with picornaviral 2A plasmids. Moreover, after reprogramming has been achieved, this cassette can be removed using mRNA transfection of Cre recombinase. The method described herein to excise reprogramming factors with ease and efficiency facilitates the experimental generation and use of transgene-free human iPS cells. PMID:22605648

  5. Metformin-Induced Changes of the Coding Transcriptome and Non-Coding RNAs in the Livers of Non-Alcoholic Fatty Liver Disease Mice.

    Science.gov (United States)

    Guo, Jun; Zhou, Yuan; Cheng, Yafen; Fang, Weiwei; Hu, Gang; Wei, Jie; Lin, Yajun; Man, Yong; Guo, Lixin; Sun, Mingxiao; Cui, Qinghua; Li, Jian

    2018-01-01

    Recent studies have suggested that changes in non-coding mRNA play a key role in the progression of non-alcoholic fatty liver disease (NAFLD). Metformin is now recommended and effective for the treatment of NAFLD. We hope the current analyses of the non-coding mRNA transcriptome will provide a better presentation of the potential roles of mRNAs and long non-coding RNAs (lncRNAs) that underlie NAFLD and metformin intervention. The present study mainly analysed changes in the coding transcriptome and non-coding RNAs after the application of a five-week metformin intervention. Liver samples from three groups of mice were harvested for transcriptome profiling, which covered mRNA, lncRNA, microRNA (miRNA) and circular RNA (circRNA), using a microarray technique. A systematic alleviation of high-fat diet (HFD)-induced transcriptome alterations by metformin was observed. The metformin treatment largely reversed the correlations with diabetes-related pathways. Our analysis also suggested interaction networks between differentially expressed lncRNAs and known hepatic disease genes and interactions between circRNA and their disease-related miRNA partners. Eight HFD-responsive lncRNAs and three metformin-responsive lncRNAs were noted due to their widespread associations with disease genes. Moreover, seven miRNAs that interacted with multiple differentially expressed circRNAs were highlighted because they were likely to be associated with metabolic or liver diseases. The present study identified novel changes in the coding transcriptome and non-coding RNAs in the livers of NAFLD mice after metformin treatment that might shed light on the underlying mechanism by which metformin impedes the progression of NAFLD. © 2018 The Author(s). Published by S. Karger AG, Basel.

  6. A novel link between Sus1 and the cytoplasmic mRNA decay machinery suggests a broad role in mRNA metabolism

    Directory of Open Access Journals (Sweden)

    Llopis Ana

    2010-03-01

    Full Text Available Abstract Background Gene expression is achieved by the coordinated action of multiple factors to ensure a perfect synchrony from chromatin epigenetic regulation through to mRNA export. Sus1 is a conserved mRNA export/transcription factor and is a key player in coupling transcription initiation, elongation and mRNA export. In the nucleus, Sus1 is associated to the transcriptional co-activator SAGA and to the NPC associated complex termed TREX2/THSC. Through these associations, Sus1 mediates the nuclear dynamics of different gene loci and facilitate the export of the new transcripts. Results In this study, we have investigated whether the yeast Sus1 protein is linked to factors involved in mRNA degradation pathways. We provide evidence for genetic interactions between SUS1 and genes coding for components of P-bodies such as PAT1, LSM1, LSM6 and DHH1. We demonstrate that SUS1 deletion is synthetic lethal with 5'→3' decay machinery components LSM1 and PAT1 and has a strong genetic interaction with LSM6 and DHH1. Interestingly, Sus1 overexpression led to an accumulation of Sus1 in cytoplasmic granules, which can co-localise with components of P-bodies and stress granules. In addition, we have identified novel physical interactions between Sus1 and factors associated to P-bodies/stress granules. Finally, absence of LSM1 and PAT1 slightly promotes the Sus1-TREX2 association. Conclusions In this study, we found genetic and biochemical association between Sus1 and components responsible for cytoplasmic mRNA metabolism. Moreover, Sus1 accumulates in discrete cytoplasmic granules, which partially co-localise with P-bodies and stress granules under specific conditions. These interactions suggest a role for Sus1 in gene expression during cytoplasmic mRNA metabolism in addition to its nuclear function.

  7. Consensus coding sequence (CCDS) database: a standardized set of human and mouse protein-coding regions supported by expert curation.

    Science.gov (United States)

    Pujar, Shashikant; O'Leary, Nuala A; Farrell, Catherine M; Loveland, Jane E; Mudge, Jonathan M; Wallin, Craig; Girón, Carlos G; Diekhans, Mark; Barnes, If; Bennett, Ruth; Berry, Andrew E; Cox, Eric; Davidson, Claire; Goldfarb, Tamara; Gonzalez, Jose M; Hunt, Toby; Jackson, John; Joardar, Vinita; Kay, Mike P; Kodali, Vamsi K; Martin, Fergal J; McAndrews, Monica; McGarvey, Kelly M; Murphy, Michael; Rajput, Bhanu; Rangwala, Sanjida H; Riddick, Lillian D; Seal, Ruth L; Suner, Marie-Marthe; Webb, David; Zhu, Sophia; Aken, Bronwen L; Bruford, Elspeth A; Bult, Carol J; Frankish, Adam; Murphy, Terence; Pruitt, Kim D

    2018-01-04

    The Consensus Coding Sequence (CCDS) project provides a dataset of protein-coding regions that are identically annotated on the human and mouse reference genome assembly in genome annotations produced independently by NCBI and the Ensembl group at EMBL-EBI. This dataset is the product of an international collaboration that includes NCBI, Ensembl, HUGO Gene Nomenclature Committee, Mouse Genome Informatics and University of California, Santa Cruz. Identically annotated coding regions, which are generated using an automated pipeline and pass multiple quality assurance checks, are assigned a stable and tracked identifier (CCDS ID). Additionally, coordinated manual review by expert curators from the CCDS collaboration helps in maintaining the integrity and high quality of the dataset. The CCDS data are available through an interactive web page (https://www.ncbi.nlm.nih.gov/CCDS/CcdsBrowse.cgi) and an FTP site (ftp://ftp.ncbi.nlm.nih.gov/pub/CCDS/). In this paper, we outline the ongoing work, growth and stability of the CCDS dataset and provide updates on new collaboration members and new features added to the CCDS user interface. We also present expert curation scenarios, with specific examples highlighting the importance of an accurate reference genome assembly and the crucial role played by input from the research community. Published by Oxford University Press on behalf of Nucleic Acids Research 2017.

  8. Replication error deficient and proficient colorectal cancer gene expression differences caused by 3'UTR polyT sequence deletions

    DEFF Research Database (Denmark)

    Wilding, Jennifer L; McGowan, Simon; Liu, Ying

    2010-01-01

    , and have distinct pathologies. Regulatory sequences controlling all aspects of mRNA processing, especially including message stability, are found in the 3'UTR sequence of most genes. The relevant sequences are typically A/U-rich elements or U repeats. Microarray analysis of 14 RER+ (deficient) and 16 RER......- (proficient) colorectal cancer cell lines confirms a striking difference in expression profiles. Analysis of the incidence of mononucleotide repeat sequences in the 3'UTRs, 5'UTRs, and coding sequences of those genes most differentially expressed in RER+ versus RER- cell lines has shown that much...... of this differential expression can be explained by the occurrence of a massive enrichment of genes with 3'UTR T repeats longer than 11 base pairs in the most differentially expressed genes. This enrichment was confirmed by analysis of two published consensus sets of RER differentially expressed probesets for a large...

  9. The sequence coding and search system: An approach for constructing and analyzing event sequences at commercial nuclear power plants

    International Nuclear Information System (INIS)

    Mays, G.T.

    1989-04-01

    The US Nuclear Regulatory Commission (NRC) has recognized the importance of the collection, assessment, and feedstock of operating experience data from commercial nuclear power plants and has centralized these activities in the Office for Analysis and Evaluation of Operational Data (AEOD). Such data is essential for performing safety and reliability analyses, especially analyses of trends and patterns to identify undesirable changes in plant performance at the earliest opportunity to implement corrective measures to preclude the occurrences of a more serious event. One of NRC's principal tools for collecting and evaluating operating experience data is the Sequence Coding and Search System (SCSS). The SCSS consists of a methodology for structuring event sequences and the requisite computer system to store and search the data. The source information for SCSS is the Licensee Event Report (LER), which is a legally required document. This paper describes the objective SCSS, the information it contains, and the format and approach for constructuring SCSS event sequences. Examples are presented demonstrating the use SCSS to support the analysis of LER data. The SCSS contains over 30,000 LERs describing events from 1980 through the present. Insights gained from working with a complex data system from the initial developmental stage to the point of a mature operating system are highlighted

  10. LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.

    Science.gov (United States)

    Filatov, Gleb; Bauwens, Bruno; Kertész-Farkas, Attila

    2018-05-07

    Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Here, we present a new convolutional kernel function for protein sequences called the LZW-Kernel. It is based on code words identified with the Lempel-Ziv-Welch (LZW) universal text compressor. The LZW-Kernel is an alignment-free method, it is always symmetric, is positive, always provides 1.0 for self-similarity and it can directly be used with Support Vector Machines (SVMs) in classification problems, contrary to normalized compression distance (NCD), which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly plausible for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with BLAST scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel, and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed, three times faster than BLAST and several magnitudes faster than SW or LAK in our tests. LZW-Kernel is implemented as a standalone C code and is a free open-source program distributed under GPLv3 license and can be downloaded from https://github.com/kfattila/LZW-Kernel. akerteszfarkas@hse.ru. Supplementary data are available at Bioinformatics Online.

  11. The complete genome sequence of the Atlantic salmon paramyxovirus (ASPV)

    International Nuclear Information System (INIS)

    Nylund, Stian; Karlsen, Marius; Nylund, Are

    2008-01-01

    The complete RNA genome of the Atlantic salmon paramyxovirus (ASPV), isolated from Atlantic salmon suffering from proliferative gill inflammation (PGI), has been determined. The genome is 16,965 nucleotides in length and consists of six nonoverlapping genes in the order 3'- N - P/C/V - M - F - HN - L -5', coding for the nucleocapsid, phospho-, matrix, fusion, hemagglutinin-neuraminidase and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and trinucleotide intergenic regions similar to those of other Paramyxoviridae. The ASPV P-gene expression strategy is like that of the respiro- and morbilliviruses, which express the phosphoprotein from the primary transcript, and edit a portion of the mRNA to encode the accessory proteins V and W. It also encodes the C-protein by ribosomal choice of translation initiation. Pairwise comparisons of amino acid identities, and phylogenetic analysis of deduced ASPV protein sequences with homologous sequences from other Paramyxoviridae, show that ASPV has an affinity for the genus Respirovirus, but may represent a new genus within the subfamily Paramyxovirinae

  12. Inhibition of expression in Escherichia coli of a virulence regulator MglB of Francisella tularensis using external guide sequence technology.

    Directory of Open Access Journals (Sweden)

    Gaoping Xiao

    Full Text Available External guide sequences (EGSs have successfully been used to inhibit expression of target genes at the post-transcriptional level in both prokaryotes and eukaryotes. We previously reported that EGS accessible and cleavable sites in the target RNAs can rapidly be identified by screening random EGS (rEGS libraries. Here the method of screening rEGS libraries and a partial RNase T1 digestion assay were used to identify sites accessible to EGSs in the mRNA of a global virulence regulator MglB from Francisella tularensis, a Gram-negative pathogenic bacterium. Specific EGSs were subsequently designed and their activities in terms of the cleavage of mglB mRNA by RNase P were tested in vitro and in vivo. EGS73, EGS148, and EGS155 in both stem and M1 EGS constructs induced mglB mRNA cleavage in vitro. Expression of stem EGS73 and EGS155 in Escherichia coli resulted in significant reduction of the mglB mRNA level coded for the F. tularensis mglB gene inserted in those cells.

  13. AthMethPre: a web server for the prediction and query of mRNA m6A sites in Arabidopsis thaliana.

    Science.gov (United States)

    Xiang, Shunian; Yan, Zhangming; Liu, Ke; Zhang, Yaou; Sun, Zhirong

    2016-10-18

    N 6 -Methyladenosine (m 6 A) is the most prevalent and abundant modification in mRNA that has been linked to many key biological processes. High-throughput experiments have generated m 6 A-peaks across the transcriptome of A. thaliana, but the specific methylated sites were not assigned, which impedes the understanding of m 6 A functions in plants. Therefore, computational prediction of mRNA m 6 A sites becomes emergently important. Here, we present a method to predict the m 6 A sites for A. thaliana mRNA sequence(s). To predict the m 6 A sites of an mRNA sequence, we employed the support vector machine to build a classifier using the features of the positional flanking nucleotide sequence and position-independent k-mer nucleotide spectrum. Our method achieved good performance and was applied to a web server to provide service for the prediction of A. thaliana m 6 A sites. The server also provides a comprehensive database of predicted transcriptome-wide m 6 A sites and curated m 6 A-seq peaks from the literature for query and visualization. The AthMethPre web server is the first web server that provides a user-friendly tool for the prediction and query of A. thaliana mRNA m 6 A sites, which is freely accessible for public use at .

  14. PARN and TOE1 Constitute a 3′ End Maturation Module for Nuclear Non-coding RNAs

    Directory of Open Access Journals (Sweden)

    Ahyeon Son

    2018-04-01

    Full Text Available Summary: Poly(A-specific ribonuclease (PARN and target of EGR1 protein 1 (TOE1 are nuclear granule-associated deadenylases, whose mutations are linked to multiple human diseases. Here, we applied mTAIL-seq and RNA sequencing (RNA-seq to systematically identify the substrates of PARN and TOE1 and elucidate their molecular functions. We found that PARN and TOE1 do not modulate the length of mRNA poly(A tails. Rather, they promote the maturation of nuclear small non-coding RNAs (ncRNAs. PARN and TOE1 act redundantly on some ncRNAs, most prominently small Cajal body-specific RNAs (scaRNAs. scaRNAs are strongly downregulated when PARN and TOE1 are compromised together, leading to defects in small nuclear RNA (snRNA pseudouridylation. They also function redundantly in the biogenesis of telomerase RNA component (TERC, which shares sequence motifs found in H/ACA box scaRNAs. Our findings extend the knowledge of nuclear ncRNA biogenesis, and they provide insights into the pathology of PARN/TOE1-associated genetic disorders whose therapeutic treatments are currently unavailable. : By analyzing the 3′ termini of transcriptome, Son et al. reveal the targets of PARN and TOE1, two nuclear deadenylases with disease associations. Both deadenylases are involved in nuclear small non-coding RNA maturation, but not in mRNA deadenylation. Their combined activity is particularly important for biogenesis of scaRNAs and TERC. Keywords: PARN, TOE1, CAF1Z, deadenylase, 3′ end maturation, adenylation, deadenylation, scaRNA, TERC

  15. Two rare deletions upstream of the NRXN1 gene (2p16.3) affecting the non-coding mRNA AK127244 segregate with diverse psychopathological phenotypes in a family

    DEFF Research Database (Denmark)

    Duong, L. T. T.; Hoeffding, L. K.; Petersen, K. B.

    2015-01-01

    127244 in addition to the pathogenic 15q11.2 deletion in distinct family members. The two deletions upstream of the NRXN1 gene were found to segregate with psychiatric disorders in the family and further similar deletions have been observed in patients diagnosed with autism spectrum disorder. Thus, we...... susceptibility. In this study, we describe a family affected by a wide range of psychiatric disorders including early onset schizophrenia, schizophreniform disorder, and affective disorders. Microarray analysis identified two rare deletions immediately upstream of the NRXN1 gene affecting the non-coding mRNA AK...... suggest that non-coding regions upstream of the NRXN1 gene affecting AK127244 might (as NRXN1) contain susceptibility regions for a wide spectrum of neuropsychiatric disorders. (C) 2015 Elsevier Masson SAS. All rights reserved....

  16. Three new shRNA expression vectors targeting the CYP3A4 coding sequence to inhibit its expression

    Directory of Open Access Journals (Sweden)

    Siyun Xu

    2014-10-01

    Full Text Available RNA interference (RNAi is useful for selective gene silencing. Cytochrome P450 3A4 (CYP3A4, which metabolizes approximately 50% of drugs in clinical use, plays an important role in drug metabolism. In this study, we aimed to develop a short hairpin RNA (shRNA to modulate CYP3A4 expression. Three new shRNAs (S1, S2 and S3 were designed to target the coding sequence (CDS of CYP3A4, cloned into a shRNA expression vector, and tested in different cells. The mixture of three shRNAs produced optimal reduction (55% in CYP3A4 CDS-luciferase activity in both CHL and HEK293 cells. Endogenous CYP3A4 expression in HepG2 cells was decreased about 50% at both mRNA and protein level after transfection of the mixture of three shRNAs. In contrast, CYP3A5 gene expression was not altered by the shRNAs, supporting the selectivity of CYP3A4 shRNAs. In addition, HepG2 cells transfected with CYP3A4 shRNAs were less sensitive to Ginkgolic acids, whose toxic metabolites are produced by CYP3A4. These results demonstrate that vector-based shRNAs could modulate CYP3A4 expression in cells through their actions on CYP3A4 CDS, and CYP3A4 shRNAs may be utilized to define the role of CYP3A4 in drug metabolism and toxicity.

  17. mPUMA: a computational approach to microbiota analysis by de novo assembly of operational taxonomic units based on protein-coding barcode sequences.

    Science.gov (United States)

    Links, Matthew G; Chaban, Bonnie; Hemmingsen, Sean M; Muirhead, Kevin; Hill, Janet E

    2013-08-15

    Formation of operational taxonomic units (OTU) is a common approach to data aggregation in microbial ecology studies based on amplification and sequencing of individual gene targets. The de novo assembly of OTU sequences has been recently demonstrated as an alternative to widely used clustering methods, providing robust information from experimental data alone, without any reliance on an external reference database. Here we introduce mPUMA (microbial Profiling Using Metagenomic Assembly, http://mpuma.sourceforge.net), a software package for identification and analysis of protein-coding barcode sequence data. It was developed originally for Cpn60 universal target sequences (also known as GroEL or Hsp60). Using an unattended process that is independent of external reference sequences, mPUMA forms OTUs by DNA sequence assembly and is capable of tracking OTU abundance. mPUMA processes microbial profiles both in terms of the direct DNA sequence as well as in the translated amino acid sequence for protein coding barcodes. By forming OTUs and calculating abundance through an assembly approach, mPUMA is capable of generating inputs for several popular microbiota analysis tools. Using SFF data from sequencing of a synthetic community of Cpn60 sequences derived from the human vaginal microbiome, we demonstrate that mPUMA can faithfully reconstruct all expected OTU sequences and produce compositional profiles consistent with actual community structure. mPUMA enables analysis of microbial communities while empowering the discovery of novel organisms through OTU assembly.

  18. Sequencing illustrates the transcriptional response of Legionella pneumophila during infection and identifies seventy novel small non-coding RNAs.

    LENUS (Irish Health Repository)

    Weissenmayer, Barbara A

    2011-01-01

    Second generation sequencing has prompted a number of groups to re-interrogate the transcriptomes of several bacterial and archaeal species. One of the central findings has been the identification of complex networks of small non-coding RNAs that play central roles in transcriptional regulation in all growth conditions and for the pathogen\\'s interaction with and survival within host cells. Legionella pneumophila is a gram-negative facultative intracellular human pathogen with a distinct biphasic lifestyle. One of its primary environmental hosts in the free-living amoeba Acanthamoeba castellanii and its infection by L. pneumophila mimics that seen in human macrophages. Here we present analysis of strand specific sequencing of the transcriptional response of L. pneumophila during exponential and post-exponential broth growth and during the replicative and transmissive phase of infection inside A. castellanii. We extend previous microarray based studies as well as uncovering evidence of a complex regulatory architecture underpinned by numerous non-coding RNAs. Over seventy new non-coding RNAs could be identified; many of them appear to be strain specific and in configurations not previously reported. We discover a family of non-coding RNAs preferentially expressed during infection conditions and identify a second copy of 6S RNA in L. pneumophila. We show that the newly discovered putative 6S RNA as well as a number of other non-coding RNAs show evidence for antisense transcription. The nature and extent of the non-coding RNAs and their expression patterns suggests that these may well play central roles in the regulation of Legionella spp. specific traits and offer clues as to how L. pneumophila adapts to its intracellular niche. The expression profiles outlined in the study have been deposited into Genbank\\'s Gene Expression Omnibus (GEO) database under the series accession GSE27232.

  19. Cloning and sequence analysis of serine proteinase of Gloydius ussuriensis venom gland

    International Nuclear Information System (INIS)

    Sun Dejun; Liu Shanshan; Yang Chunwei; Zhao Yizhuo; Chang Shufang; Yan Weiqun

    2005-01-01

    Objective: To construct a cDNA library by using mRNA from Gloydius ussuriensis (G. Ussuriensis) venom gland, to clone and analyze serine proteinase gene from the cDNA library. Methods: Total RNA was isolated from venom gland of G. ussuriensis, mRNA was purified by using mRNA isolation Kit. The whole length cDNA was synthesized by means of smart cDNA synthesis strategy, and amplified by long distance PCR procedure, lately cDAN was cloned into vector pBluescrip-sk. The recombinant cDNA was transformed into E. coli DH5α. The cDNA of serine proteinase gene in the venom gland of G. ussuriensis was detected and amplified using the in situ hybridization. The cDNA fragment was inserted into pGEMT vector, cloned and its nucleotide sequence was determined. Results: The capacity of cDNA library of venom gland was above 2.3 x 10 6 . Its open reading frame was composed of 702 nucleotides and coded a protein pre-zymogen of 234 amino acids. It contained 12 cysteine residues. The sequence analysis indicated that the deduced amino acid sequence of the cDNA fragment shared high identity with the thrombin-like enzyme genes of other snakes in the GenBank. the query sequence exhibited strong amino acid sequence homology of 85% to the serine proteas of T. gramineus, thrombin-like serine proteinase I of D. acutus and serine protease catroxase II of C. atrox respectively. Based on the amino acid sequences of other thrombin-like enzymes, the catalytic residues and disulfide bridges of this thrombin-like enzyme were deduced as follows: catalytic residues, His 41 , Asp 86 , Ser 180 ; and six disulfide bridges Cys 7 -Cys 139 , Cys 26 -Cys 42 , Cys 74 -Cys 232 , Cys 118 -Cys 186 , Cys 150 -Cys 165 , Cys 176 -Cys 201 . Conclusion: The capacity of cDNA library of venom gland is above 2.3 x 10 6 , overtop the level of 10 5 capicity. The constructed cDNA library of G. ussuriensis venom gland would be helpful platform to detect new target genes and further gene manipulate. The cloned serine

  20. The functional half-life of an mRNA depends on the ribosome spacing in an early coding region

    DEFF Research Database (Denmark)

    Pedersen, Margit; Nissen, Søren; Mitarai, Namiko

    2011-01-01

    Bacterial mRNAs are translated by closely spaced ribosomes and degraded from the 5'-end, with half-lives of around 2 min at 37 °C in most cases. Ribosome-free or "naked" mRNA is known to be readily degraded, but the initial event that inactivates the mRNA functionally has not been fully described...

  1. Identification of a functionally distinct truncated BDNF mRNA splice variant and protein in Trachemys scripta elegans.

    Directory of Open Access Journals (Sweden)

    Ganesh Ambigapathy

    Full Text Available Brain-derived neurotrophic factor (BDNF has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein.

  2. Identification of a functionally distinct truncated BDNF mRNA splice variant and protein in Trachemys scripta elegans.

    Science.gov (United States)

    Ambigapathy, Ganesh; Zheng, Zhaoqing; Li, Wei; Keifer, Joyce

    2013-01-01

    Brain-derived neurotrophic factor (BDNF) has a diverse functional role and complex pattern of gene expression. Alternative splicing of mRNA transcripts leads to further diversity of mRNAs and protein isoforms. Here, we describe the regulation of BDNF mRNA transcripts in an in vitro model of eyeblink classical conditioning and a unique transcript that forms a functionally distinct truncated BDNF protein isoform. Nine different mRNA transcripts from the BDNF gene of the pond turtle Trachemys scripta elegans (tBDNF) are selectively regulated during classical conditioning: exon I mRNA transcripts show no change, exon II transcripts are downregulated, while exon III transcripts are upregulated. One unique transcript that codes from exon II, tBDNF2a, contains a 40 base pair deletion in the protein coding exon that generates a truncated tBDNF protein. The truncated transcript and protein are expressed in the naïve untrained state and are fully repressed during conditioning when full-length mature tBDNF is expressed, thereby having an alternate pattern of expression in conditioning. Truncated BDNF is not restricted to turtles as a truncated mRNA splice variant has been described for the human BDNF gene. Further studies are required to determine the ubiquity of truncated BDNF alternative splice variants across species and the mechanisms of regulation and function of this newly recognized BDNF protein.

  3. Self-complementary circular codes in coding theory.

    Science.gov (United States)

    Fimmel, Elena; Michel, Christian J; Starman, Martin; Strüngmann, Lutz

    2018-04-01

    Self-complementary circular codes are involved in pairing genetic processes. A maximal [Formula: see text] self-complementary circular code X of trinucleotides was identified in genes of bacteria, archaea, eukaryotes, plasmids and viruses (Michel in Life 7(20):1-16 2017, J Theor Biol 380:156-177, 2015; Arquès and Michel in J Theor Biol 182:45-58 1996). In this paper, self-complementary circular codes are investigated using the graph theory approach recently formulated in Fimmel et al. (Philos Trans R Soc A 374:20150058, 2016). A directed graph [Formula: see text] associated with any code X mirrors the properties of the code. In the present paper, we demonstrate a necessary condition for the self-complementarity of an arbitrary code X in terms of the graph theory. The same condition has been proven to be sufficient for codes which are circular and of large size [Formula: see text] trinucleotides, in particular for maximal circular codes ([Formula: see text] trinucleotides). For codes of small-size [Formula: see text] trinucleotides, some very rare counterexamples have been constructed. Furthermore, the length and the structure of the longest paths in the graphs associated with the self-complementary circular codes are investigated. It has been proven that the longest paths in such graphs determine the reading frame for the self-complementary circular codes. By applying this result, the reading frame in any arbitrary sequence of trinucleotides is retrieved after at most 15 nucleotides, i.e., 5 consecutive trinucleotides, from the circular code X identified in genes. Thus, an X motif of a length of at least 15 nucleotides in an arbitrary sequence of trinucleotides (not necessarily all of them belonging to X) uniquely defines the reading (correct) frame, an important criterion for analyzing the X motifs in genes in the future.

  4. The sequence coding and search system: an approach for constructing and analyzing event sequences at commercial nuclear power plants

    International Nuclear Information System (INIS)

    Mays, G.T.

    1990-01-01

    The U.S. Nuclear Regulatory Commission (NRC) has recognized the importance of the collection, assessment, and feedback of operating experience data from commercial nuclear power plants and has centralized these activities in the Office for Analysis and Evaluation of Operational Data (AEOD). Such data is essential for performing safety and reliability analyses, especially analyses of trends and patterns to identify undesirable changes in plant performance at the earliest opportunity to implement corrective measures to preclude the occurrence of a more serious event. One of NRC's principal tools for collecting and evaluating operating experience data is the Sequence Coding and Search System (SCSS). The SCSS consists of a methodology for structuring event sequences and the requisite computer system to store and search the data. The source information for SCSS is the Licensee Event Report (LER), which is a legally required document. This paper describes the objectives of SCSS, the information it contains, and the format and approach for constructing SCSS event sequences. Examples are presented demonstrating the use of SCSS to support the analysis of LER data. The SCSS contains over 30,000 LERs describing events from 1980 through the present. Insights gained from working with a complex data system from the initial developmental stage to the point of a mature operating system are highlighted. Considerable experience has been gained in the areas of evolving and changing data requirements, staffing requirements, and quality control and quality assurance procedures for addressing consistency, software/hardware considerations for developing and maintaining a complex system, documentation requirements, and end-user needs. Two other approaches for constructing and evaluating event sequences are examined including the Accident Precursor Program (ASP) where sequences having the potential for core damage are identified and analyzed, and the Significant Event Compilation Tree

  5. Impact of target mRNA structure on siRNA silencing efficiency: A large-scale study.

    Science.gov (United States)

    Gredell, Joseph A; Berger, Angela K; Walton, S Patrick

    2008-07-01

    The selection of active siRNAs is generally based on identifying siRNAs with certain sequence and structural properties. However, the efficiency of RNA interference has also been shown to depend on the structure of the target mRNA, primarily through studies using exogenous transcripts with well-defined secondary structures in the vicinity of the target sequence. While these studies provide a means for examining the impact of target sequence and structure independently, the predicted secondary structures for these transcripts are often not reflective of structures that form in full-length, native mRNAs where interactions can occur between relatively remote segments of the mRNAs. Here, using a combination of experimental results and analysis of a large dataset, we demonstrate that the accessibility of certain local target structures on the mRNA is an important determinant in the gene silencing ability of siRNAs. siRNAs targeting the enhanced green fluorescent protein were chosen using a minimal siRNA selection algorithm followed by classification based on the predicted minimum free energy structures of the target transcripts. Transfection into HeLa and HepG2 cells revealed that siRNAs targeting regions of the mRNA predicted to have unpaired 5'- and 3'-ends resulted in greater gene silencing than regions predicted to have other types of secondary structure. These results were confirmed by analysis of gene silencing data from previously published siRNAs, which showed that mRNA target regions unpaired at either the 5'-end or 3'-end were silenced, on average, approximately 10% more strongly than target regions unpaired in the center or primarily paired throughout. We found this effect to be independent of the structure of the siRNA guide strand. Taken together, these results suggest minimal requirements for nucleation of hybridization between the siRNA guide strand and mRNA and that both mRNA and guide strand structure should be considered when choosing candidate si

  6. Circular codes revisited: a statistical approach.

    Science.gov (United States)

    Gonzalez, D L; Giannerini, S; Rosa, R

    2011-04-21

    In 1996 Arquès and Michel [1996. A complementary circular code in the protein coding genes. J. Theor. Biol. 182, 45-58] discovered the existence of a common circular code in eukaryote and prokaryote genomes. Since then, circular code theory has provoked great interest and underwent a rapid development. In this paper we discuss some theoretical issues related to the synchronization properties of coding sequences and circular codes with particular emphasis on the problem of retrieval and maintenance of the reading frame. Motivated by the theoretical discussion, we adopt a rigorous statistical approach in order to try to answer different questions. First, we investigate the covering capability of the whole class of 216 self-complementary, C(3) maximal codes with respect to a large set of coding sequences. The results indicate that, on average, the code proposed by Arquès and Michel has the best covering capability but, still, there exists a great variability among sequences. Second, we focus on such code and explore the role played by the proportion of the bases by means of a hierarchy of permutation tests. The results show the existence of a sort of optimization mechanism such that coding sequences are tailored as to maximize or minimize the coverage of circular codes on specific reading frames. Such optimization clearly relates the function of circular codes with reading frame synchronization. Copyright © 2011 Elsevier Ltd. All rights reserved.

  7. Cloning, sequence analysis, and expression of the large subunit of the human lymphocyte activation antigen 4F2

    International Nuclear Information System (INIS)

    Lumadue, J.A.; Glick, A.B.; Ruddle, F.H.

    1987-01-01

    Among the earliest expressed antigens on the surface of activated human lymphocytes is the surface antigen 4F2. The authors have used DNA-mediated gene transfer and fluorescence-activated cell sorting to obtain cell lines that contain the gene encoding the large subunit of the human 4F2 antigen in a mouse L-cell background. Human DNAs cloned from these cell lines were subsequently used as hybridization probes to isolate a full-length cDNA clone expressing 4F2. Sequence analysis of the coding region has revealed an amino acid sequence of 529 residues. Hydrophobicity plotting has predicted a probable structure for the protein that includes an external carboxyl terminus, an internal leader sequence, a single hydrophobic transmembrane domain, and two possible membrane-associated domains. The 4F2 cDNA detects a single 1.8-kilobase mRNA in T-cell and B-cell lines. RNA gel blot analysis of RNA derived from quiescent and serum-stimulated Swiss 3T3 fibroblasts reveals a cell-cycle modulation of 4F2 gene expression: the mRNA is present in quiescent fibroblasts but increases 8-fold 24-36 hr after stimulation, at the time of maximal DNA synthesis

  8. Cloning, sequence analysis, and expression of the large subunit of the human lymphocyte activation antigen 4F2

    Energy Technology Data Exchange (ETDEWEB)

    Lumadue, J.A.; Glick, A.B.; Ruddle, F.H.

    1987-12-01

    Among the earliest expressed antigens on the surface of activated human lymphocytes is the surface antigen 4F2. The authors have used DNA-mediated gene transfer and fluorescence-activated cell sorting to obtain cell lines that contain the gene encoding the large subunit of the human 4F2 antigen in a mouse L-cell background. Human DNAs cloned from these cell lines were subsequently used as hybridization probes to isolate a full-length cDNA clone expressing 4F2. Sequence analysis of the coding region has revealed an amino acid sequence of 529 residues. Hydrophobicity plotting has predicted a probable structure for the protein that includes an external carboxyl terminus, an internal leader sequence, a single hydrophobic transmembrane domain, and two possible membrane-associated domains. The 4F2 cDNA detects a single 1.8-kilobase mRNA in T-cell and B-cell lines. RNA gel blot analysis of RNA derived from quiescent and serum-stimulated Swiss 3T3 fibroblasts reveals a cell-cycle modulation of 4F2 gene expression: the mRNA is present in quiescent fibroblasts but increases 8-fold 24-36 hr after stimulation, at the time of maximal DNA synthesis.

  9. Analysis of the 3’ untranslated regions of α-tubulin and S-crystallin mRNA and the identification of CPEB in dark- and light-adapted octopus retinas

    Science.gov (United States)

    Kelly, Shannan; Yamamoto, Hideki

    2008-01-01

    Purpose We previously reported the differential expression and translation of mRNA and protein in dark- and light-adapted octopus retinas, which may result from cytoplasmic polyadenylation element (CPE)–dependent mRNA masking and unmasking. Here we investigate the presence of CPEs in α-tubulin and S-crystallin mRNA and report the identification of cytoplasmic polyadenylation element binding protein (CPEB) in light- and dark-adapted octopus retinas. Methods 3’-RACE and sequencing were used to isolate and analyze the 3’-UTRs of α-tubulin and S-crystallin mRNA. Total retinal protein isolated from light- and dark-adapted octopus retinas was subjected to western blot analysis followed by CPEB antibody detection, PEP-171 inhibition of CPEB, and dephosphorylation of CPEB. Results The following CPE-like sequence was detected in the 3’-UTR of isolated long S-crystallin mRNA variants: UUUAACA. No CPE or CPE-like sequences were detected in the 3’-UTRs of α-tubulin mRNA or of the short S-crystallin mRNA variants. Western blot analysis detected CPEB as two putative bands migrating between 60-80 kDa, while a third band migrated below 30 kDa in dark- and light-adapted retinas. Conclusions The detection of CPEB and the identification of the putative CPE-like sequences in the S-crystallin 3’-UTR suggest that CPEB may be involved in the activation of masked S-crystallin mRNA, but not in the regulation of α-tubulin mRNA, resulting in increased S-crystallin protein synthesis in dark-adapted octopus retinas. PMID:18682811

  10. Serotype identification and VP1 coding sequence analysis of foot-and-mouth disease virus from outbreaks in Eastern and Northern Uganda in 2008/9

    DEFF Research Database (Denmark)

    Kasambula, L.; Belsham, Graham; Siegismund, H. R.

    2012-01-01

    regions, and the presence of FMDV RNA in these samples was determined using a standard diagnostic RT-PCR assay. From the total of 27 positive samples, the VP1 coding region was amplified and sequenced. Each of these sequences showed >99% identity to each other, and just five distinct sequences were...

  11. Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

    DEFF Research Database (Denmark)

    Balinda, Sheila; Siegismund, Hans; Muwanika, Vincent

    2010-01-01

    from both serotypes A and O. Conclusions Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2), a probable indication that hardly any FMD introductions of this serotype have occurred from outside...... the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A and O has been obtained. In addition to characterization using the VP1 coding region, analyses involving the non-structural protein coding...

  12. mRNA processing in yeast

    International Nuclear Information System (INIS)

    Stevens, A.

    1982-01-01

    Investigations in this laboratory center on basic enzymatic reactions of RNA. Still undefined are reactions involved in the conversion of precursors of mRA (pre-mRNA) to mRNA in eukaryotes. The pre-mRNA is called heterogeneous nuclear RNA and is 2 to 6 times larger than mRNA. The conversion, called splicing, involves a removal of internal sequences called introns by endoribonuclease action followed by a rejoining of the 3'- and 5'-end fragments, called exons, by ligating activity. It has not been possible yet to study the enzymes involved in vitro. Also undefined are reactions involved in the turnover or discarding of certain of the pre-mRNA molecules. Yeast is a simple eukaryote and may be expected to have the same, but perhaps simpler, processing reactions as the higher eukaryotes. Two enzymes involved in the processing of pre-mRNA and mRNA in yeast are under investigation. Both enzymes have been partially purified from ribonucleoprotein particles of yeast. The first is a unique decapping enzyme which cleaves [ 3 H]m 7 Gppp [ 14 C]RNA-poly (A) of yeast, yielding [ 3 H]m 7 GDP and is suggested by the finding that the diphosphate product, m 7 GpppA(G), and UDP-glucose are not hydrolyzed. The second enzyme is an endoribonuclease which converts both the [ 3 H] and [ 14 C] labels of [ 3 H]m 7 Gppp[ 14 C]RNA-poly(A) from an oligo(dT)-cellulose bound form to an unbound, acid-insoluble form. Results show that the stimulation involves an interaction of the labeled RNA with the small nuclear RNA. The inhibition of the enzyme by ethidium bromide and its stimulation by small nuclear RNA suggest that it may be a processing ribonuclease, requiring specific double-stranded features in its substrate. The characterization of the unique decapping enzyme and endoribonuclease may help to understand reactions involved in the processing of pre-mRNA and mRNA in eukaryotes

  13. An RNA Phage Lab: MS2 in Walter Fiers' laboratory of molecular biology in Ghent, from genetic code to gene and genome, 1963-1976.

    Science.gov (United States)

    Pierrel, Jérôme

    2012-01-01

    The importance of viruses as model organisms is well-established in molecular biology and Max Delbrück's phage group set standards in the DNA phage field. In this paper, I argue that RNA phages, discovered in the 1960s, were also instrumental in the making of molecular biology. As part of experimental systems, RNA phages stood for messenger RNA (mRNA), genes and genome. RNA was thought to mediate information transfers between DNA and proteins. Furthermore, RNA was more manageable at the bench than DNA due to the availability of specific RNases, enzymes used as chemical tools to analyse RNA. Finally, RNA phages provided scientists with a pure source of mRNA to investigate the genetic code, genes and even a genome sequence. This paper focuses on Walter Fiers' laboratory at Ghent University (Belgium) and their work on the RNA phage MS2. When setting up his Laboratory of Molecular Biology, Fiers planned a comprehensive study of the virus with a strong emphasis on the issue of structure. In his lab, RNA sequencing, now a little-known technique, evolved gradually from a means to solve the genetic code, to a tool for completing the first genome sequence. Thus, I follow the research pathway of Fiers and his 'RNA phage lab' with their evolving experimental system from 1960 to the late 1970s. This study illuminates two decisive shifts in post-war biology: the emergence of molecular biology as a discipline in the 1960s in Europe and of genomics in the 1990s.

  14. Regulatory roles for long ncRNA and mRNA

    NARCIS (Netherlands)

    Karapetyan, A.; Buiting, C.; Kuiper, R.A.; Coolen, M.W.

    2013-01-01

    Recent advances in high-throughput sequencing technology have identified the transcription of a much larger portion of the genome than previously anticipated. Especially in the context of cancer it has become clear that aberrant transcription of both protein-coding and long non-coding RNAs (lncRNAs)

  15. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Science.gov (United States)

    2012-01-01

    Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR) are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas. PMID:23256920

  16. CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences

    Directory of Open Access Journals (Sweden)

    Liu Chang

    2012-12-01

    Full Text Available Abstract Background The complete sequences of chloroplast genomes provide wealthy information regarding the evolutionary history of species. With the advance of next-generation sequencing technology, the number of completely sequenced chloroplast genomes is expected to increase exponentially, powerful computational tools annotating the genome sequences are in urgent need. Results We have developed a web server CPGAVAS. The server accepts a complete chloroplast genome sequence as input. First, it predicts protein-coding and rRNA genes based on the identification and mapping of the most similar, full-length protein, cDNA and rRNA sequences by integrating results from Blastx, Blastn, protein2genome and est2genome programs. Second, tRNA genes and inverted repeats (IR are identified using tRNAscan, ARAGORN and vmatch respectively. Third, it calculates the summary statistics for the annotated genome. Fourth, it generates a circular map ready for publication. Fifth, it can create a Sequin file for GenBank submission. Last, it allows the extractions of protein and mRNA sequences for given list of genes and species. The annotation results in GFF3 format can be edited using any compatible annotation editing tools. The edited annotations can then be uploaded to CPGAVAS for update and re-analyses repeatedly. Using known chloroplast genome sequences as test set, we show that CPGAVAS performs comparably to another application DOGMA, while having several superior functionalities. Conclusions CPGAVAS allows the semi-automatic and complete annotation of a chloroplast genome sequence, and the visualization, editing and analysis of the annotation results. It will become an indispensible tool for researchers studying chloroplast genomes. The software is freely accessible from http://www.herbalgenomics.org/cpgavas.

  17. Construction and Analysis of a Novel 2-D Optical Orthogonal Codes Based on Modified One-coincidence Sequence

    Science.gov (United States)

    Ji, Jianhua; Wang, Yanfen; Wang, Ke; Xu, Ming; Zhang, Zhipeng; Yang, Shuwen

    2013-09-01

    A new two-dimensional OOC (optical orthogonal codes) named PC/MOCS is constructed, using PC (prime code) for time spreading and MOCS (modified one-coincidence sequence) for wavelength hopping. Compared with PC/PC, the number of wavelengths for PC/MOCS is not limited to a prime number. Compared with PC/OCS, the length of MOCS need not be expanded to the same length of PC. PC/MOCS can be constructed flexibly, and also can use available wavelengths effectively. Theoretical analysis shows that PC/MOCS can reduce the bit error rate (BER) of OCDMA system, and can support more users than PC/PC and PC/OCS.

  18. Deciphering the genetic regulatory code using an inverse error control coding framework.

    Energy Technology Data Exchange (ETDEWEB)

    Rintoul, Mark Daniel; May, Elebeoba Eni; Brown, William Michael; Johnston, Anna Marie; Watson, Jean-Paul

    2005-03-01

    We have found that developing a computational framework for reconstructing error control codes for engineered data and ultimately for deciphering genetic regulatory coding sequences is a challenging and uncharted area that will require advances in computational technology for exact solutions. Although exact solutions are desired, computational approaches that yield plausible solutions would be considered sufficient as a proof of concept to the feasibility of reverse engineering error control codes and the possibility of developing a quantitative model for understanding and engineering genetic regulation. Such evidence would help move the idea of reconstructing error control codes for engineered and biological systems from the high risk high payoff realm into the highly probable high payoff domain. Additionally this work will impact biological sensor development and the ability to model and ultimately develop defense mechanisms against bioagents that can be engineered to cause catastrophic damage. Understanding how biological organisms are able to communicate their genetic message efficiently in the presence of noise can improve our current communication protocols, a continuing research interest. Towards this end, project goals include: (1) Develop parameter estimation methods for n for block codes and for n, k, and m for convolutional codes. Use methods to determine error control (EC) code parameters for gene regulatory sequence. (2) Develop an evolutionary computing computational framework for near-optimal solutions to the algebraic code reconstruction problem. Method will be tested on engineered and biological sequences.

  19. Whole-Exome Sequencing Identifies Rare and Low-Frequency Coding Variants Associated with LDL Cholesterol

    Science.gov (United States)

    Lange, Leslie A.; Hu, Youna; Zhang, He; Xue, Chenyi; Schmidt, Ellen M.; Tang, Zheng-Zheng; Bizon, Chris; Lange, Ethan M.; Smith, Joshua D.; Turner, Emily H.; Jun, Goo; Kang, Hyun Min; Peloso, Gina; Auer, Paul; Li, Kuo-ping; Flannick, Jason; Zhang, Ji; Fuchsberger, Christian; Gaulton, Kyle; Lindgren, Cecilia; Locke, Adam; Manning, Alisa; Sim, Xueling; Rivas, Manuel A.; Holmen, Oddgeir L.; Gottesman, Omri; Lu, Yingchang; Ruderfer, Douglas; Stahl, Eli A.; Duan, Qing; Li, Yun; Durda, Peter; Jiao, Shuo; Isaacs, Aaron; Hofman, Albert; Bis, Joshua C.; Correa, Adolfo; Griswold, Michael E.; Jakobsdottir, Johanna; Smith, Albert V.; Schreiner, Pamela J.; Feitosa, Mary F.; Zhang, Qunyuan; Huffman, Jennifer E.; Crosby, Jacy; Wassel, Christina L.; Do, Ron; Franceschini, Nora; Martin, Lisa W.; Robinson, Jennifer G.; Assimes, Themistocles L.; Crosslin, David R.; Rosenthal, Elisabeth A.; Tsai, Michael; Rieder, Mark J.; Farlow, Deborah N.; Folsom, Aaron R.; Lumley, Thomas; Fox, Ervin R.; Carlson, Christopher S.; Peters, Ulrike; Jackson, Rebecca D.; van Duijn, Cornelia M.; Uitterlinden, André G.; Levy, Daniel; Rotter, Jerome I.; Taylor, Herman A.; Gudnason, Vilmundur; Siscovick, David S.; Fornage, Myriam; Borecki, Ingrid B.; Hayward, Caroline; Rudan, Igor; Chen, Y. Eugene; Bottinger, Erwin P.; Loos, Ruth J.F.; Sætrom, Pål; Hveem, Kristian; Boehnke, Michael; Groop, Leif; McCarthy, Mark; Meitinger, Thomas; Ballantyne, Christie M.; Gabriel, Stacey B.; O’Donnell, Christopher J.; Post, Wendy S.; North, Kari E.; Reiner, Alexander P.; Boerwinkle, Eric; Psaty, Bruce M.; Altshuler, David; Kathiresan, Sekar; Lin, Dan-Yu; Jarvik, Gail P.; Cupples, L. Adrienne; Kooperberg, Charles; Wilson, James G.; Nickerson, Deborah A.; Abecasis, Goncalo R.; Rich, Stephen S.; Tracy, Russell P.; Willer, Cristen J.; Gabriel, Stacey B.; Altshuler, David M.; Abecasis, Gonçalo R.; Allayee, Hooman; Cresci, Sharon; Daly, Mark J.; de Bakker, Paul I.W.; DePristo, Mark A.; Do, Ron; Donnelly, Peter; Farlow, Deborah N.; Fennell, Tim; Garimella, Kiran; Hazen, Stanley L.; Hu, Youna; Jordan, Daniel M.; Jun, Goo; Kathiresan, Sekar; Kang, Hyun Min; Kiezun, Adam; Lettre, Guillaume; Li, Bingshan; Li, Mingyao; Newton-Cheh, Christopher H.; Padmanabhan, Sandosh; Peloso, Gina; Pulit, Sara; Rader, Daniel J.; Reich, David; Reilly, Muredach P.; Rivas, Manuel A.; Schwartz, Steve; Scott, Laura; Siscovick, David S.; Spertus, John A.; Stitziel, Nathaniel O.; Stoletzki, Nina; Sunyaev, Shamil R.; Voight, Benjamin F.; Willer, Cristen J.; Rich, Stephen S.; Akylbekova, Ermeg; Atwood, Larry D.; Ballantyne, Christie M.; Barbalic, Maja; Barr, R. Graham; Benjamin, Emelia J.; Bis, Joshua; Boerwinkle, Eric; Bowden, Donald W.; Brody, Jennifer; Budoff, Matthew; Burke, Greg; Buxbaum, Sarah; Carr, Jeff; Chen, Donna T.; Chen, Ida Y.; Chen, Wei-Min; Concannon, Pat; Crosby, Jacy; Cupples, L. Adrienne; D’Agostino, Ralph; DeStefano, Anita L.; Dreisbach, Albert; Dupuis, Josée; Durda, J. Peter; Ellis, Jaclyn; Folsom, Aaron R.; Fornage, Myriam; Fox, Caroline S.; Fox, Ervin; Funari, Vincent; Ganesh, Santhi K.; Gardin, Julius; Goff, David; Gordon, Ora; Grody, Wayne; Gross, Myron; Guo, Xiuqing; Hall, Ira M.; Heard-Costa, Nancy L.; Heckbert, Susan R.; Heintz, Nicholas; Herrington, David M.; Hickson, DeMarc; Huang, Jie; Hwang, Shih-Jen; Jacobs, David R.; Jenny, Nancy S.; Johnson, Andrew D.; Johnson, Craig W.; Kawut, Steven; Kronmal, Richard; Kurz, Raluca; Lange, Ethan M.; Lange, Leslie A.; Larson, Martin G.; Lawson, Mark; Lewis, Cora E.; Levy, Daniel; Li, Dalin; Lin, Honghuang; Liu, Chunyu; Liu, Jiankang; Liu, Kiang; Liu, Xiaoming; Liu, Yongmei; Longstreth, William T.; Loria, Cay; Lumley, Thomas; Lunetta, Kathryn; Mackey, Aaron J.; Mackey, Rachel; Manichaikul, Ani; Maxwell, Taylor; McKnight, Barbara; Meigs, James B.; Morrison, Alanna C.; Musani, Solomon K.; Mychaleckyj, Josyf C.; Nettleton, Jennifer A.; North, Kari; O’Donnell, Christopher J.; O’Leary, Daniel; Ong, Frank; Palmas, Walter; Pankow, James S.; Pankratz, Nathan D.; Paul, Shom; Perez, Marco; Person, Sharina D.; Polak, Joseph; Post, Wendy S.; Psaty, Bruce M.; Quinlan, Aaron R.; Raffel, Leslie J.; Ramachandran, Vasan S.; Reiner, Alexander P.; Rice, Kenneth; Rotter, Jerome I.; Sanders, Jill P.; Schreiner, Pamela; Seshadri, Sudha; Shea, Steve; Sidney, Stephen; Silverstein, Kevin; Smith, Nicholas L.; Sotoodehnia, Nona; Srinivasan, Asoke; Taylor, Herman A.; Taylor, Kent; Thomas, Fridtjof; Tracy, Russell P.; Tsai, Michael Y.; Volcik, Kelly A.; Wassel, Chrstina L.; Watson, Karol; Wei, Gina; White, Wendy; Wiggins, Kerri L.; Wilk, Jemma B.; Williams, O. Dale; Wilson, Gregory; Wilson, James G.; Wolf, Phillip; Zakai, Neil A.; Hardy, John; Meschia, James F.; Nalls, Michael; Singleton, Andrew; Worrall, Brad; Bamshad, Michael J.; Barnes, Kathleen C.; Abdulhamid, Ibrahim; Accurso, Frank; Anbar, Ran; Beaty, Terri; Bigham, Abigail; Black, Phillip; Bleecker, Eugene; Buckingham, Kati; Cairns, Anne Marie; Caplan, Daniel; Chatfield, Barbara; Chidekel, Aaron; Cho, Michael; Christiani, David C.; Crapo, James D.; Crouch, Julia; Daley, Denise; Dang, Anthony; Dang, Hong; De Paula, Alicia; DeCelie-Germana, Joan; Drumm, Allen DozorMitch; Dyson, Maynard; Emerson, Julia; Emond, Mary J.; Ferkol, Thomas; Fink, Robert; Foster, Cassandra; Froh, Deborah; Gao, Li; Gershan, William; Gibson, Ronald L.; Godwin, Elizabeth; Gondor, Magdalen; Gutierrez, Hector; Hansel, Nadia N.; Hassoun, Paul M.; Hiatt, Peter; Hokanson, John E.; Howenstine, Michelle; Hummer, Laura K.; Kanga, Jamshed; Kim, Yoonhee; Knowles, Michael R.; Konstan, Michael; Lahiri, Thomas; Laird, Nan; Lange, Christoph; Lin, Lin; Lin, Xihong; Louie, Tin L.; Lynch, David; Make, Barry; Martin, Thomas R.; Mathai, Steve C.; Mathias, Rasika A.; McNamara, John; McNamara, Sharon; Meyers, Deborah; Millard, Susan; Mogayzel, Peter; Moss, Richard; Murray, Tanda; Nielson, Dennis; Noyes, Blakeslee; O’Neal, Wanda; Orenstein, David; O’Sullivan, Brian; Pace, Rhonda; Pare, Peter; Parker, H. Worth; Passero, Mary Ann; Perkett, Elizabeth; Prestridge, Adrienne; Rafaels, Nicholas M.; Ramsey, Bonnie; Regan, Elizabeth; Ren, Clement; Retsch-Bogart, George; Rock, Michael; Rosen, Antony; Rosenfeld, Margaret; Ruczinski, Ingo; Sanford, Andrew; Schaeffer, David; Sell, Cindy; Sheehan, Daniel; Silverman, Edwin K.; Sin, Don; Spencer, Terry; Stonebraker, Jackie; Tabor, Holly K.; Varlotta, Laurie; Vergara, Candelaria I.; Weiss, Robert; Wigley, Fred; Wise, Robert A.; Wright, Fred A.; Wurfel, Mark M.; Zanni, Robert; Zou, Fei; Nickerson, Deborah A.; Rieder, Mark J.; Green, Phil; Shendure, Jay; Akey, Joshua M.; Bustamante, Carlos D.; Crosslin, David R.; Eichler, Evan E.; Fox, P. Keolu; Fu, Wenqing; Gordon, Adam; Gravel, Simon; Jarvik, Gail P.; Johnsen, Jill M.; Kan, Mengyuan; Kenny, Eimear E.; Kidd, Jeffrey M.; Lara-Garduno, Fremiet; Leal, Suzanne M.; Liu, Dajiang J.; McGee, Sean; O’Connor, Timothy D.; Paeper, Bryan; Robertson, Peggy D.; Smith, Joshua D.; Staples, Jeffrey C.; Tennessen, Jacob A.; Turner, Emily H.; Wang, Gao; Yi, Qian; Jackson, Rebecca; Peters, Ulrike; Carlson, Christopher S.; Anderson, Garnet; Anton-Culver, Hoda; Assimes, Themistocles L.; Auer, Paul L.; Beresford, Shirley; Bizon, Chris; Black, Henry; Brunner, Robert; Brzyski, Robert; Burwen, Dale; Caan, Bette; Carty, Cara L.; Chlebowski, Rowan; Cummings, Steven; Curb, J. David; Eaton, Charles B.; Ford, Leslie; Franceschini, Nora; Fullerton, Stephanie M.; Gass, Margery; Geller, Nancy; Heiss, Gerardo; Howard, Barbara V.; Hsu, Li; Hutter, Carolyn M.; Ioannidis, John; Jiao, Shuo; Johnson, Karen C.; Kooperberg, Charles; Kuller, Lewis; LaCroix, Andrea; Lakshminarayan, Kamakshi; Lane, Dorothy; Lasser, Norman; LeBlanc, Erin; Li, Kuo-Ping; Limacher, Marian; Lin, Dan-Yu; Logsdon, Benjamin A.; Ludlam, Shari; Manson, JoAnn E.; Margolis, Karen; Martin, Lisa; McGowan, Joan; Monda, Keri L.; Kotchen, Jane Morley; Nathan, Lauren; Ockene, Judith; O’Sullivan, Mary Jo; Phillips, Lawrence S.; Prentice, Ross L.; Robbins, John; Robinson, Jennifer G.; Rossouw, Jacques E.; Sangi-Haghpeykar, Haleh; Sarto, Gloria E.; Shumaker, Sally; Simon, Michael S.; Stefanick, Marcia L.; Stein, Evan; Tang, Hua; Taylor, Kira C.; Thomson, Cynthia A.; Thornton, Timothy A.; Van Horn, Linda; Vitolins, Mara; Wactawski-Wende, Jean; Wallace, Robert; Wassertheil-Smoller, Sylvia; Zeng, Donglin; Applebaum-Bowden, Deborah; Feolo, Michael; Gan, Weiniu; Paltoo, Dina N.; Sholinsky, Phyliss; Sturcke, Anne

    2014-01-01

    Elevated low-density lipoprotein cholesterol (LDL-C) is a treatable, heritable risk factor for cardiovascular disease. Genome-wide association studies (GWASs) have identified 157 variants associated with lipid levels but are not well suited to assess the impact of rare and low-frequency variants. To determine whether rare or low-frequency coding variants are associated with LDL-C, we exome sequenced 2,005 individuals, including 554 individuals selected for extreme LDL-C (>98th or <2nd percentile). Follow-up analyses included sequencing of 1,302 additional individuals and genotype-based analysis of 52,221 individuals. We observed significant evidence of association between LDL-C and the burden of rare or low-frequency variants in PNPLA5, encoding a phospholipase-domain-containing protein, and both known and previously unidentified variants in PCSK9, LDLR and APOB, three known lipid-related genes. The effect sizes for the burden of rare variants for each associated gene were substantially higher than those observed for individual SNPs identified from GWASs. We replicated the PNPLA5 signal in an independent large-scale sequencing study of 2,084 individuals. In conclusion, this large whole-exome-sequencing study for LDL-C identified a gene not known to be implicated in LDL-C and provides unique insight into the design and analysis of similar experiments. PMID:24507775

  20. Verona Coding Definitions of Emotional Sequences (VR-CoDES): Conceptual framework and future directions.

    Science.gov (United States)

    Piccolo, Lidia Del; Finset, Arnstein; Mellblom, Anneli V; Figueiredo-Braga, Margarida; Korsvold, Live; Zhou, Yuefang; Zimmermann, Christa; Humphris, Gerald

    2017-12-01

    To discuss the theoretical and empirical framework of VR-CoDES and potential future direction in research based on the coding system. The paper is based on selective review of papers relevant to the construction and application of VR-CoDES. VR-CoDES system is rooted in patient-centered and biopsychosocial model of healthcare consultations and on a functional approach to emotion theory. According to the VR-CoDES, emotional interaction is studied in terms of sequences consisting of an eliciting event, an emotional expression by the patient and the immediate response by the clinician. The rationale for the emphasis on sequences, on detailed classification of cues and concerns, and on the choices of explicit vs. non-explicit responses and providing vs. reducing room for further disclosure, as basic categories of the clinician responses, is described. Results from research on VR-CoDES may help raise awareness of emotional sequences. Future directions in applying VR-CoDES in research may include studies on predicting patient and clinician behavior within the consultation, qualitative analyses of longer sequences including several VR-CoDES triads, and studies of effects of emotional communication on health outcomes. VR-CoDES may be applied to develop interventions to promote good handling of patients' emotions in healthcare encounters. Copyright © 2017 Elsevier B.V. All rights reserved.

  1. Molecular cloning and complete nucleotide sequence of a human ventricular myosin light chain 1

    Energy Technology Data Exchange (ETDEWEB)

    Hoffmann, E; Shi, Q W; Floroff, M; Mickle, D A.G.; Wu, T W; Olley, P M; Jackowski, G

    1988-03-25

    Human ventricular plasmid library was constructed. The library was screened with the oligonucleotide probe (17-mer) corresponding to a conserve region of myosin light chain 1 near the carboxy terminal. Full length cDNA recombinant plasmid containing 1100 bp insert was isolated. RNA blot hybridization with this insert detected a message of approximately 1500 bp corresponding to the size of VLCl and mRNA. Complete nucleotide sequence of the coding region was determined in M13 subclones using dideoxy chain termination method. With the isolation of this clone (pCD HLVCl), the publication of the complete nucleotide sequence of HVLCl and the predicted secondary structure of this protein will aid in understanding of the biochemistry of myosin and its function in contraction, the evolution of myosin light genes and the genetic, developmental and physiological regulation of myosin genes.

  2. Complete genome sequence of Fer-de-Lance Virus reveals a novel gene in reptilian Paramyxoviruses

    Science.gov (United States)

    Kurath, G.; Batts, W.N.; Ahne, W.; Winton, J.R.

    2004-01-01

    The complete RNA genome sequence of the archetype reptilian paramyxovirus, Fer-de-Lance virus (FDLV), has been determined. The genome is 15,378 nucleotides in length and consists of seven nonoverlapping genes in the order 3??? N-U-P-M-F-HN-L 5???, coding for the nucleocapsid, unknown, phospho-, matrix, fusion, hemagglutinin-neuraminidase, and large polymerase proteins, respectively. The gene junctions contain highly conserved transcription start and stop signal sequences and tri-nucleotide intergenic regions similar to those of other Paramyxoviridae. The FDLV P gene expression strategy is like that of rubulaviruses, which express the accessory V protein from the primary transcript and edit a portion of the mRNA to encode P and I proteins. There is also an overlapping open reading frame potentially encoding a small basic protein in the P gene. The gene designated U (unknown), encodes a deduced protein of 19.4 kDa that has no counterpart in other paramyxoviruses and has no similarity with sequences in the National Center for Biotechnology Information database. Active transcription of the U gene in infected cells was demonstrated by Northern blot analysis, and bicistronic N-U mRNA was also evident. The genomes of two other snake paramyxovirus genotypes were also found to have U genes, with 11 to 16% nucleotide divergence from the FDLV U gene. Pairwise comparisons of amino acid identities and phylogenetic analyses of all deduced FDLV protein sequences with homologous sequences from other Paramyxoviridae indicate that FDLV represents a new genus within the subfamily Paramyxovirinae. We suggest the name Ferlavirus for the new genus, with FDLV as the type species.

  3. FATHEAD MINNOW VITELLOGENIN: CDNA SEQUENCE AND MRNA AND PROTEIN EXPRESSION AFTER 17 BETA-ESTRADIOL TREATMENT

    Science.gov (United States)

    In the present study, a sensitive ribonuclease protection assay (RPA) for VTG mRNA was developed for the fathead minnow (Pimephales promelas), a species proposed for routine endocrine-disrupting chemical (EDC) screening.

  4. cDNA sequence of human transforming gene hst and identification of the coding sequence required for transforming activity

    International Nuclear Information System (INIS)

    Taira, M.; Yoshida, T.; Miyagawa, K.; Sakamoto, H.; Terada, M.; Sugimura, T.

    1987-01-01

    The hst gene was originally identified as a transforming gene in DNAs from human stomach cancers and from a noncancerous portion of stomach mucosa by DNA-mediated transfection assay using NIH3T3 cells. cDNA clones of hst were isolated from the cDNA library constructed from poly(A) + RNA of a secondary transformant induced by the DNA from a stomach cancer. The sequence analysis of the hst cDNA revealed the presence of two open reading frames. When this cDNA was inserted into an expression vector containing the simian virus 40 promoter, it efficiently induced the transformation of NIH3T3 cells upon transfection. It was found that one of the reading frames, which coded for 206 amino acids, was responsible for the transforming activity

  5. In Silico Mining of Microsatellites in Coding Sequences of the Date Palm (Arecaceae Genome, Characterization, and Transferability

    Directory of Open Access Journals (Sweden)

    Frédérique Aberlenc-Bertossi

    2014-01-01

    Full Text Available Premise of the study: To complement existing sets of primarily dinucleotide microsatellite loci from noncoding sequences of date palm, we developed primers for tri- and hexanucleotide microsatellite loci identified within genes. Due to their conserved genomic locations, the primers should be useful in other palm taxa, and their utility was tested in seven other Phoenix species and in Chamaerops, Livistona, and Hyphaene. Methods and Results: Tandem repeat motifs of 3–6 bp were searched using a simple sequence repeat (SSR–pipeline package in coding portions of the date palm draft genome sequence. Fifteen loci produced highly consistent amplification, intraspecific polymorphisms, and stepwise mutation patterns. Conclusions: These microsatellite loci showed sufficient levels of variability and transferability to make them useful for population genetic, selection signature, and interspecific gene flow studies in Phoenix and other Coryphoideae genera.

  6. Identification of coding and non-coding mutational hotspots in cancer genomes.

    Science.gov (United States)

    Piraino, Scott W; Furney, Simon J

    2017-01-05

    The identification of mutations that play a causal role in tumour development, so called "driver" mutations, is of critical importance for understanding how cancers form and how they might be treated. Several large cancer sequencing projects have identified genes that are recurrently mutated in cancer patients, suggesting a role in tumourigenesis. While the landscape of coding drivers has been extensively studied and many of the most prominent driver genes are well characterised, comparatively less is known about the role of mutations in the non-coding regions of the genome in cancer development. The continuing fall in genome sequencing costs has resulted in a concomitant increase in the number of cancer whole genome sequences being produced, facilitating systematic interrogation of both the coding and non-coding regions of cancer genomes. To examine the mutational landscapes of tumour genomes we have developed a novel method to identify mutational hotspots in tumour genomes using both mutational data and information on evolutionary conservation. We have applied our methodology to over 1300 whole cancer genomes and show that it identifies prominent coding and non-coding regions that are known or highly suspected to play a role in cancer. Importantly, we applied our method to the entire genome, rather than relying on predefined annotations (e.g. promoter regions) and we highlight recurrently mutated regions that may have resulted from increased exposure to mutational processes rather than selection, some of which have been identified previously as targets of selection. Finally, we implicate several pan-cancer and cancer-specific candidate non-coding regions, which could be involved in tumourigenesis. We have developed a framework to identify mutational hotspots in cancer genomes, which is applicable to the entire genome. This framework identifies known and novel coding and non-coding mutional hotspots and can be used to differentiate candidate driver regions from

  7. Recruitment of Staufen2 Enhances Dendritic Localization of an Intron-Containing CaMKIIα mRNA

    Directory of Open Access Journals (Sweden)

    Raúl Ortiz

    2017-07-01

    Full Text Available Regulation of mRNA localization is a conserved cellular process observed in many types of cells and organisms. Asymmetrical mRNA distribution plays a particularly important role in the nervous system, where local translation of localized mRNA represents a key mechanism in synaptic plasticity. CaMKIIα is a very abundant mRNA detected in neurites, consistent with its crucial role at glutamatergic synapses. Here, we report the presence of CaMKIIα mRNA isoforms that contain intron i16 in dendrites, RNA granules, and synaptoneurosomes from primary neurons and brain. This subpopulation of unspliced mRNA preferentially localizes to distal dendrites in a synaptic-activity-dependent manner. Staufen2, a well-established marker of RNA transport in dendrites, interacts with intron i16 sequences and enhances its distal dendritic localization, pointing to the existence of intron-mediated mechanisms in the molecular pathways that modulate dendritic transport and localization of synaptic mRNAs.

  8. Analysis of a cDNA clone expressing a human autoimmune antigen: full-length sequence of the U2 small nuclear RNA-associated B antigen

    International Nuclear Information System (INIS)

    Habets, W.J.; Sillekens, P.T.G.; Hoet, M.H.; Schalken, J.A.; Roebroek, A.J.M.; Leunissen, J.A.M.; Van de Ven, W.J.M.; Van Venrooij, W.J.

    1987-01-01

    A U2 small nuclear RNA-associated protein, designated B'', was recently identified as the target antigen for autoimmune sera from certain patients with systemic lupus erythematosus and other rheumatic diseases. Such antibodies enabled them to isolate cDNA clone λHB''-1 from a phage λgt11 expression library. This clone appeared to code for the B'' protein as established by in vitro translation of hybrid-selected mRNA. The identity of clone λHB''-1 was further confirmed by partial peptide mapping and analysis of the reactivity of the recombinant antigen with monospecific and monoclonal antibodies. Analysis of the nucleotide sequence of the 1015-base-pair cDNA insert of clone λHB''-1 revealed a large open reading frame of 800 nucleotides containing the coding sequence for a polypeptide of 25,457 daltons. In vitro transcription of the λHB''-1 cDNA insert and subsequent translation resulted in a protein product with the molecular size of the B'' protein. These data demonstrate that clone λHB''-1 contains the complete coding sequence of this antigen. The deduced polypeptide sequence contains three very hydrophilic regions that might constitute RNA binding sites and/or antigenic determinants. These findings might have implications both for the understanding of the pathogenesis of rheumatic diseases as well as for the elucidation of the biological function of autoimmune antigens

  9. Rhythmic expression of Nocturnin mRNA in multiple tissues of the mouse

    Directory of Open Access Journals (Sweden)

    Green Carla B

    2001-05-01

    Full Text Available Abstract Background Nocturnin was originally identified by differential display as a circadian clock regulated gene with high expression at night in photoreceptors of the African clawed frog, Xenopus laevis. Although encoding a novel protein, the nocturnin cDNA had strong sequence similarity with a C-terminal domain of the yeast transcription factor CCR4, and with mouse and human ESTs. Since its original identification others have cloned mouse and human homologues of nocturnin/CCR4, and we have cloned a full-length cDNA from mouse retina, along with partial cDNAs from human, cow and chicken. The goal of this study was to determine the temporal pattern of nocturnin mRNA expression in multiple tissues of the mouse. Results cDNA sequence analysis revealed a high degree of conservation among vertebrate nocturnin/CCR4 homologues along with a possible homologue in Drosophila. Northern analysis of mRNA in C3H/He and C57/Bl6 mice revealed that the mNoc gene is expressed in a broad range of tissues, with greatest abundance in liver, kidney and testis. mNoc is also expressed in multiple brain regions including suprachiasmatic nucleus and pineal gland. Furthermore, mNoc exhibits circadian rhythmicity of mRNA abundance with peak levels at the time of light offset in the retina, spleen, heart, kidney and liver. Conclusion The widespread expression and rhythmicity of mNoc mRNA parallels the widespread expression of other circadian clock genes in mammalian tissues, and suggests that nocturnin plays an important role in clock function or as a circadian clock effector.

  10. Cloning, sequencing, and expression of cDNA for human β-glucuronidase

    International Nuclear Information System (INIS)

    Oshima, A.; Kyle, J.W.; Miller, R.D.

    1987-01-01

    The authors report here the cDNA sequence for human placental β-glucuronidase (β-D-glucuronoside glucuronosohydrolase, EC 3.2.1.31) and demonstrate expression of the human enzyme in transfected COS cells. They also sequenced a partial cDNA clone from human fibroblasts that contained a 153-base-pair deletion within the coding sequence and found a second type of cDNA clone from placenta that contained the same deletion. Nuclease S1 mapping studies demonstrated two types of mRNAs in human placenta that corresponded to the two types of cDNA clones isolated. The NH 2 -terminal amino acid sequence determined for human spleen β-glucuronidase agreed with that inferred from the DNA sequence of the two placental clones, beginning at amino acid 23, suggesting a cleaved signal sequence of 22 amino acids. When transfected into COS cells, plasmids containing either placental clone expressed an immunoprecipitable protein that contained N-linked oligosaccharides as evidenced by sensitivity to endoglycosidase F. However, only transfection with the clone containing the 153-base-pair segment led to expression of human β-glucuronidase activity. These studies provide the sequence for the full-length cDNA for human β-glucuronidase, demonstrate the existence of two populations of mRNA for β-glucuronidase in human placenta, only one of which specifies a catalytically active enzyme, and illustrate the importance of expression studies in verifying that a cDNA is functionally full-length

  11. The Extent of mRNA Editing Is Limited in Chicken Liver and Adipose, but Impacted by Tissular Context, Genotype, Age, and Feeding as Exemplified with a Conserved Edited Site in COG3

    Directory of Open Access Journals (Sweden)

    Pierre-François Roux

    2016-02-01

    Full Text Available RNA editing is a posttranscriptional process leading to differences between genomic DNA and transcript sequences, potentially enhancing transcriptome diversity. With recent advances in high-throughput sequencing, many efforts have been made to describe mRNA editing at the transcriptome scale, especially in mammals, yielding contradictory conclusions regarding the extent of this phenomenon. We show, by detailed description of the 25 studies focusing so far on mRNA editing at the whole-transcriptome scale, that systematic sequencing artifacts are considered in most studies whereas biological replication is often neglected and multi-alignment not properly evaluated, which ultimately impairs the legitimacy of results. We recently developed a rigorous strategy to identify mRNA editing using mRNA and genomic DNA sequencing, taking into account sequencing and mapping artifacts, and biological replicates. We applied this method to screen for mRNA editing in liver and white adipose tissue from eight chickens and confirm the small extent of mRNA recoding in this species. Among the 25 unique edited sites identified, three events were previously described in mammals, attesting that this phenomenon is conserved throughout evolution. Deeper investigations on five sites revealed the impact of tissular context, genotype, age, feeding conditions, and sex on mRNA editing levels. More specifically, this analysis highlighted that the editing level at the site located on COG3 was strongly regulated by four of these factors. By comprehensively characterizing the mRNA editing landscape in chickens, our results highlight how this phenomenon is limited and suggest regulation of editing levels by various genetic and environmental factors.

  12. Full-length cDNA sequences from Rhesus monkey placenta tissue: analysis and utility for comparative mapping

    Directory of Open Access Journals (Sweden)

    Lee Sang-Rae

    2010-07-01

    Full Text Available Abstract Background Rhesus monkeys (Macaca mulatta are widely-used as experimental animals in biomedical research and are closely related to other laboratory macaques, such as cynomolgus monkeys (Macaca fascicularis, and to humans, sharing a last common ancestor from about 25 million years ago. Although rhesus monkeys have been studied extensively under field and laboratory conditions, research has been limited by the lack of genetic resources. The present study generated placenta full-length cDNA libraries, characterized the resulting expressed sequence tags, and described their utility for comparative mapping with human RefSeq mRNA transcripts. Results From rhesus monkey placenta full-length cDNA libraries, 2000 full-length cDNA sequences were determined and 1835 rhesus placenta cDNA sequences longer than 100 bp were collected. These sequences were annotated based on homology to human genes. Homology search against human RefSeq mRNAs revealed that our collection included the sequences of 1462 putative rhesus monkey genes. Moreover, we identified 207 genes containing exon alterations in the coding region and the untranslated region of rhesus monkey transcripts, despite the highly conserved structure of the coding regions. Approximately 10% (187 of all full-length cDNA sequences did not represent any public human RefSeq mRNAs. Intriguingly, two rhesus monkey specific exons derived from the transposable elements of AluYRa2 (SINE family and MER11B (LTR family were also identified. Conclusion The 1835 rhesus monkey placenta full-length cDNA sequences described here could expand genomic resources and information of rhesus monkeys. This increased genomic information will greatly contribute to the development of evolutionary biology and biomedical research.

  13. Securing optical code-division multiple-access networks with a postswitching coding scheme of signature reconfiguration

    Science.gov (United States)

    Huang, Jen-Fa; Meng, Sheng-Hui; Lin, Ying-Chen

    2014-11-01

    The optical code-division multiple-access (OCDMA) technique is considered a good candidate for providing optical layer security. An enhanced OCDMA network security mechanism with a pseudonoise (PN) random digital signals type of maximal-length sequence (M-sequence) code switching to protect against eavesdropping is presented. Signature codes unique to individual OCDMA-network users are reconfigured according to the register state of the controlling electrical shift registers. Examples of signature reconfiguration following state switching of the controlling shift register for both the network user and the eavesdropper are numerically illustrated. Dynamically changing the PN state of the shift register to reconfigure the user signature sequence is shown; this hinders eavesdroppers' efforts to decode correct data sequences. The proposed scheme increases the probability of eavesdroppers committing errors in decoding and thereby substantially enhances the degree of an OCDMA network's confidentiality.

  14. Cloning and cDNA sequence of the dihydrolipoamide dehydrogenase component of human α-ketoacid dehydrogenase complexes

    International Nuclear Information System (INIS)

    Pons, G.; Raefsky-Estrin, C.; Carothers, D.J.; Pepin, R.A.; Javed, A.A.; Jesse, B.W.; Ganapathi, M.K.; Samols, D.; Patel, M.S.

    1988-01-01

    cDNA clones comprising the entire coding region for human dihydrolipoamide dehydrogenase have been isolated from a human liver cDNA library. The cDNA sequence of the largest clone consisted of 2082 base pairs and contained a 1527-base open reading frame that encodes a precursor dihydrolipoamide dehydrogenase of 509 amino acid residues. The first 35-amino acid residues of the open reading frame probably correspond to a typical mitochondrial import leader sequence. The predicted amino acid sequence of the mature protein, starting at the residue number 36 of the open reading frame, is almost identical (>98% homology) with the known partial amino acid sequence of the pig heart dihydrolipoamide dehydrogenase. The cDNA clone also contains a 3' untranslated region of 505 bases with an unusual polyadenylylation signal (TATAAA) and a short poly(A) track. By blot-hybridization analysis with the cDNA as probe, two mRNAs, 2.2 and 2.4 kilobases in size, have been detected in human tissues and fibroblasts, whereas only one mRNA (2.4 kilobases) was detected in rat tissues

  15. In silico Coding Sequence Analysis of Walnut GAI and PIP2 Genes and Comparison with Different Plant Species

    Directory of Open Access Journals (Sweden)

    Mahdi Mohseniazar

    2017-02-01

    Full Text Available Introduction: Dwarfism is one of the important traits in breeding of crops and horticulture plants. A dwarfing rootstock will produce trees with 15-50% of standard trees size. In modern intensive fruit tree orchards, dwarfing rootstocks are commonly used to reduce trees size, enabling high-density planting and easy management, thus achieving higher yield. Trees on dwarfing rootstocks can also exhibit other economically important traits, such as precocious flowering, increased yield and increased disease resistance. Dwarf rootstocks have been extensively studied and released in stone and pome fruits, because of presence of genetic materials and the simplicity of budding methods. Control of tree size using genetically dwarf rootstocks for achievement to higher density and mechanized orchard systems is now very important for walnut production in the world especially in Iran. Many different genes can be involved in appear of this. Mutations in GAI and PIP2 genes cause dwarf trait by two different mechanisms in some plant species. In this case, we study in silico analysis of GAI and PIP2 genes consist of conserved sequences and domains, exon and intron number, function of their proteins, targeting, secondary and tertiary structure, and post translational modification. Materials and methods: The GAI and PIP2 mRNA and protein sequences (FASTA format belonging to 17 monocotyledon and dicotyledon were downloaded from NCBI (http://www.ncbi.nlm.nih.gov accessed, on September 2014. Several online web services and software were used for analysis of GAI and PIP2 mRNA and Proteins in plants. Comparative and bioinformatics analyses of PIP2 and GAI proteins were performed online at two websites NCBI (http://www.ncbi.nih.gov and EXPASY (http://expasy.org/tools. Molecular Evolutionary Genetics Analysis (MEGA; version 4 program and CLUSTAL-W with default parameters were used for multiple alignments of sequences. The phylogenetic analysis of GAI and PIP2 protein was

  16. In silico Coding Sequence Analysis of Walnut GAI and PIP2 Genes and Comparison with Different Plant Species

    Directory of Open Access Journals (Sweden)

    Mahdi Mohseniazar

    2017-09-01

    Full Text Available Introduction: Dwarfism is one of the important traits in breeding of crops and horticulture plants. A dwarfing rootstock will produce trees with 15-50% of standard trees size. In modern intensive fruit tree orchards, dwarfing rootstocks are commonly used to reduce trees size, enabling high-density planting and easy management, thus achieving higher yield. Trees on dwarfing rootstocks can also exhibit other economically important traits, such as precocious flowering, increased yield and increased disease resistance. Dwarf rootstocks have been extensively studied and released in stone and pome fruits, because of presence of genetic materials and the simplicity of budding methods. Control of tree size using genetically dwarf rootstocks for achievement to higher density and mechanized orchard systems is now very important for walnut production in the world especially in Iran. Many different genes can be involved in appear of this. Mutations in GAI and PIP2 genes cause dwarf trait by two different mechanisms in some plant species. In this case, we study in silico analysis of GAI and PIP2 genes consist of conserved sequences and domains, exon and intron number, function of their proteins, targeting, secondary and tertiary structure, and post translational modification. Materials and methods: The GAI and PIP2 mRNA and protein sequences (FASTA format belonging to 17 monocotyledon and dicotyledon were downloaded from NCBI (http://www.ncbi.nlm.nih.gov accessed, on September 2014. Several online web services and software were used for analysis of GAI and PIP2 mRNA and Proteins in plants. Comparative and bioinformatics analyses of PIP2 and GAI proteins were performed online at two websites NCBI (http://www.ncbi.nih.gov and EXPASY (http://expasy.org/tools. Molecular Evolutionary Genetics Analysis (MEGA; version 4 program and CLUSTAL-W with default parameters were used for multiple alignments of sequences. The phylogenetic analysis of GAI and PIP2 protein was

  17. Classifying Coding DNA with Nucleotide Statistics

    Directory of Open Access Journals (Sweden)

    Nicolas Carels

    2009-10-01

    Full Text Available In this report, we compared the success rate of classification of coding sequences (CDS vs. introns by Codon Structure Factor (CSF and by a method that we called Universal Feature Method (UFM. UFM is based on the scoring of purine bias (Rrr and stop codon frequency. We show that the success rate of CDS/intron classification by UFM is higher than by CSF. UFM classifies ORFs as coding or non-coding through a score based on (i the stop codon distribution, (ii the product of purine probabilities in the three positions of nucleotide triplets, (iii the product of Cytosine (C, Guanine (G, and Adenine (A probabilities in the 1st, 2nd, and 3rd positions of triplets, respectively, (iv the probabilities of G in 1st and 2nd position of triplets and (v the distance of their GC3 vs. GC2 levels to the regression line of the universal correlation. More than 80% of CDSs (true positives of Homo sapiens (>250 bp, Drosophila melanogaster (>250 bp and Arabidopsis thaliana (>200 bp are successfully classified with a false positive rate lower or equal to 5%. The method releases coding sequences in their coding strand and coding frame, which allows their automatic translation into protein sequences with 95% confidence. The method is a natural consequence of the compositional bias of nucleotides in coding sequences.

  18. New tools to analyze overlapping coding regions.

    Science.gov (United States)

    Bayegan, Amir H; Garcia-Martin, Juan Antonio; Clote, Peter

    2016-12-13

    Retroviruses transcribe messenger RNA for the overlapping Gag and Gag-Pol polyproteins, by using a programmed -1 ribosomal frameshift which requires a slippery sequence and an immediate downstream stem-loop secondary structure, together called frameshift stimulating signal (FSS). It follows that the molecular evolution of this genomic region of HIV-1 is highly constrained, since the retroviral genome must contain a slippery sequence (sequence constraint), code appropriate peptides in reading frames 0 and 1 (coding requirements), and form a thermodynamically stable stem-loop secondary structure (structure requirement). We describe a unique computational tool, RNAsampleCDS, designed to compute the number of RNA sequences that code two (or more) peptides p,q in overlapping reading frames, that are identical (or have BLOSUM/PAM similarity that exceeds a user-specified value) to the input peptides p,q. RNAsampleCDS then samples a user-specified number of messenger RNAs that code such peptides; alternatively, RNAsampleCDS can exactly compute the position-specific scoring matrix and codon usage bias for all such RNA sequences. Our software allows the user to stipulate overlapping coding requirements for all 6 possible reading frames simultaneously, even allowing IUPAC constraints on RNA sequences and fixing GC-content. We generalize the notion of codon preference index (CPI) to overlapping reading frames, and use RNAsampleCDS to generate control sequences required in the computation of CPI. Moreover, by applying RNAsampleCDS, we are able to quantify the extent to which the overlapping coding requirement in HIV-1 [resp. HCV] contribute to the formation of the stem-loop [resp. double stem-loop] secondary structure known as the frameshift stimulating signal. Using our software, we confirm that certain experimentally determined deleterious HCV mutations occur in positions for which our software RNAsampleCDS and RNAiFold both indicate a single possible nucleotide. We

  19. Production of HIV-1 vif mRNA Is Modulated by Natural Nucleotide Variations and SLSA1 RNA Structure in SA1D2prox Genomic Region

    Directory of Open Access Journals (Sweden)

    Masako Nomaguchi

    2017-12-01

    Full Text Available Genomic RNA of HIV-1 contains localized structures critical for viral replication. Its structural analysis has demonstrated a stem-loop structure, SLSA1, in a nearby region of HIV-1 genomic splicing acceptor 1 (SA1. We have previously shown that the expression level of vif mRNA is considerably altered by some natural single-nucleotide variations (nSNVs clustering in SLSA1 structure. In this study, besides eleven nSNVs previously identified by us, we totally found nine new nSNVs in the SLSA1-containing sequence from SA1, splicing donor 2, and through to the start codon of Vif that significantly affect the vif mRNA level, and designated the sequence SA1D2prox (142 nucleotides for HIV-1 NL4-3. We then examined by extensive variant and mutagenesis analyses how SA1D2prox sequence and SLSA1 secondary structure are related to vif mRNA level. While the secondary structure and stability of SLSA1 was largely changed by nSNVs and artificial mutations introduced to restore the original NL4-3 form from altered ones by nSNVs, no clear association of the two SLSA1 properties with vif mRNA level was observed. In contrast, when naturally occurring SA1D2prox sequences that contain multiple nSNVs were examined, we attained significant inverse correlation between the vif level and SLSA1 stability. These results may suggest that SA1D2prox sequence adapts over time, and also that the altered SA1D2prox sequence, SLSA1 stability, and vif level are mutually related. In total, we show here that the entire SA1D2prox sequence and SLSA1 stability critically contribute to the modulation of vif mRNA level.

  20. Rfam: annotating families of non-coding RNA sequences.

    Science.gov (United States)

    Daub, Jennifer; Eberhardt, Ruth Y; Tate, John G; Burge, Sarah W

    2015-01-01

    The primary task of the Rfam database is to collate experimentally validated noncoding RNA (ncRNA) sequences from the published literature and facilitate the prediction and annotation of new homologues in novel nucleotide sequences. We group homologous ncRNA sequences into "families" and related families are further grouped into "clans." We collate and manually curate data cross-references for these families from other databases and external resources. Our Web site offers researchers a simple interface to Rfam and provides tools with which to annotate their own sequences using our covariance models (CMs), through our tools for searching, browsing, and downloading information on Rfam families. In this chapter, we will work through examples of annotating a query sequence, collating family information, and searching for data.

  1. Appendix: a solution hybridization assay to detect radioactive globin messenger RNA nucleotide sequences

    Energy Technology Data Exchange (ETDEWEB)

    Ross, J

    1976-09-15

    In view of the sensitivity and specificity of the solution hybridization assay for unlabeled globin mRNA a similar technique has been devised to detect radioactive globin mRNA sequences with unlabeled globin cDNA. Several properties of the hybridization reaction are presented since RNA kinetic experiments reported recently depend on the validity of this assay. Data on hybridization analysis of (/sup 3/H)RNA from mouse fetal liver or erythroleukemia cell cytoplasm are presented. These data indicate that the excess cDNA solution assay for radioactive globin mRNA detection is specific for globin mRNA sequences. It can be performed rapidly and is highly reproducible from experiment. It is at least 500-fold less sensitive than the assay for unlabeled globin mRNA, due to the RNAase backgrounds of 0.05 to 0.15 %. However, this limitation has not affected kinetic experiments with non-dividing fetal liver erythroid cells, which synthesize relatively large quantities of globin mRNA.

  2. The pokeweed leaf mRNA transcriptome and its regulation by jasmonic acid.

    Directory of Open Access Journals (Sweden)

    Kira C.M. Neller

    2016-03-01

    Full Text Available The American pokeweed plant, Phytolacca americana, is recognized for synthesizing pokeweed antiviral protein (PAP, a ribosome inactivating protein (RIP that inhibits the replication of several plant and animal viruses. The plant is also a heavy metal accumulator with applications in soil remediation. However, little is known about pokeweed stress responses, as large-scale sequencing projects have not been performed for this species. Here, we sequenced the mRNA transcriptome of pokeweed in the presence and absence of jasmonic acid (JA, a hormone mediating plant defense. Trinity-based de novo assembly of mRNA from leaf tissue and BLASTx homology searches against public sequence databases resulted in the annotation of 59 096 transcripts. Differential expression analysis identified JA-responsive genes that may be involved in defense against pathogen infection and herbivory. We confirmed the existence of several PAP isoforms and cloned a potentially novel isoform of PAP. Expression analysis indicated that PAP isoforms are differentially responsive to JA, perhaps indicating specialized roles within the plant. Finally, we identified 52 305 natural antisense transcript pairs, four of which comprised PAP isoforms, suggesting a novel form of RIP gene regulation. This transcriptome-wide study of a Phytolaccaceae family member provides a source of new genes that may be involved in stress tolerance in this plant. The sequences generated in our study have been deposited in the SRA database under project # SRP069141.

  3. Regulation of mRNA translation influences hypoxia tolerance

    International Nuclear Information System (INIS)

    Koritzinsky, M.; Wouters, B.G.; Koumenis, C.

    2003-01-01

    Hypoxia is a heterogenous but common characteristic of human tumours and poor oxygenation is associated with poor prognosis. We believe that the presence of viable hypoxic tumor cells reflects in part an adaptation and tolerance of these cells to oxygen deficiency. Since oxidative phosphorylation is compromized during hypoxia, adaptation may involve both the upregulation of glycolysis as well as downregulation of energy consumption. mRNA translation is one of the most energy costly cellular processes, and we and others have shown that global mRNA translation is rapidly inhibited during hypoxia. However, some mRNAs, including those coding for HIF-1 α and VEGF, remain efficiently translated during hypoxia. Clearly, the mechanisms responsible for the overall inhibition of translation during hypoxia does not compromize the translation of certain hypoxia-induced mRNA species. We therefore hypothesize that the inhibition of mRNA translation serves to promote hypoxia tolerance in two ways: i) through conservation of energy and ii) through differential gene expression involved in hypoxia adaptation. We have recently identified two pathways that are responsible for the global inhibition of translation during hypoxia. The phosphorylation of the eukaryotic initiation factor eIF2 α by the ER resident kinase PERK results in down-regulation of protein synthesis shortly after the onset of hypoxia. In addition, the initiation complex eIF4F is disrupted during long lasting hypoxic conditions. The identification of the molecular pathways responsible for the inhibition of overall translation during hypoxia has rendered it possible to investigate their importance for hypoxia tolerance. We have found that mouse embryo fibroblasts that are knockout for PERK and therefore not able to inhibit protein synthesis efficiently during oxygen deficiency are significantly less tolerant to hypoxia than their wildtype counterparts. We are currently also investigating the functional significance

  4. Peptide inhibitors of botulinum neurotoxin by mRNA display

    International Nuclear Information System (INIS)

    Yiadom, Kwabena P.A.B.; Muhie, Seid; Yang, David C.H.

    2005-01-01

    Botulinum neurotoxins (BoNTs) are extremely toxic. The metalloproteases associated with the toxins cleave proteins essential for neurotransmitter secretion. Inhibitors of the metalloprotease are currently sought to control the toxicity of BoNTs. Toward that goal, we produced a synthetic cDNA for the expression and purification of the metalloprotease of BoNT/A in Escherichia coli as a biotin-ubiquitin fusion protein, and constructed a combinatorial peptide library to screen for BoNT/A light chain inhibitors using mRNA display. A protease assay was developed using immobilized intact SNAP-25 as the substrate. The new peptide inhibitors showed a 10-fold increase in affinity to BoNT/A light chain than the parent peptide. Interestingly, the sequences of the new peptide inhibitors showed abundant hydrophobic residues but few hydrophilic residues. The results suggest that mRNA display may provide a general approach in developing peptide inhibitors of BoNTs

  5. Viewing multiple sequence alignments with the JavaScript Sequence Alignment Viewer (JSAV).

    Science.gov (United States)

    Martin, Andrew C R

    2014-01-01

    The JavaScript Sequence Alignment Viewer (JSAV) is designed as a simple-to-use JavaScript component for displaying sequence alignments on web pages. The display of sequences is highly configurable with options to allow alternative coloring schemes, sorting of sequences and 'dotifying' repeated amino acids. An option is also available to submit selected sequences to another web site, or to other JavaScript code. JSAV is implemented purely in JavaScript making use of the JQuery and JQuery-UI libraries. It does not use any HTML5-specific options to help with browser compatibility. The code is documented using JSDOC and is available from http://www.bioinf.org.uk/software/jsav/.

  6. A selective splicing variant of hepcidin mRNA in hepatocellular carcinoma cell lines

    International Nuclear Information System (INIS)

    Toki, Yasumichi; Sasaki, Katsunori; Tanaka, Hiroki; Yamamoto, Masayo; Hatayama, Mayumi; Ito, Satoshi; Ikuta, Katsuya; Shindo, Motohiro; Hasebe, Takumu; Nakajima, Shunsuke; Sawada, Koji; Fujiya, Mikihiro; Torimoto, Yoshihiro; Ohtake, Takaaki; Kohgo, Yutaka

    2016-01-01

    Hepcidin is a main regulator of iron metabolism, of which abnormal expression affects intestinal absorption and reticuloendothelial sequestration of iron by interacting with ferroportin. It is also noted that abnormal iron accumulation is one of the key factors to facilitate promotion and progression of cancer including hepatoma. By RT-PCR/agarose gel electrophoresis of hepcidin mRNA in a hepatocellular carcinoma cell line HLF, a smaller mRNA band was shown in addition to the wild-type hepcidin mRNA. From sequencing analysis, this additional band was a selective splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene, producing the transcript that encodes truncated peptide lacking 20 amino acids at the middle of preprohepcidin. In the present study, we used the digital PCR, because such a small amount of variant mRNA was difficult to quantitate by the conventional RT-PCR amplification. Among seven hepatoma-derived cell lines, six cell lines have significant copy numbers of this variant mRNA, but not in one cell line. In the transient transfection analysis of variant-type hepcidin cDNA, truncated preprohepcidin has a different character comparing with native preprohepcidin: its product is insensitive to digestion, and secreted into the medium as a whole preprohepcidin form without maturation. Loss or reduction of function of HAMP gene by aberrantly splicing may be a suitable phenomenon to obtain the proliferating advantage of hepatoma cells. - Highlights: • An aberrant splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene. • Absolute quantification of hepcidin mRNA by digital PCR amplification. • Hepatoma-derived cell lines have significant copies of variant-type hepcidin mRNA. • Truncated preprohepcidin is secreted from cells without posttranslational cleavage.

  7. A selective splicing variant of hepcidin mRNA in hepatocellular carcinoma cell lines

    Energy Technology Data Exchange (ETDEWEB)

    Toki, Yasumichi [Division of Gastroenterology and Hematology/Oncology, Department of Medicine, Asahikawa Medical University, Hokkaido 078-8510 (Japan); Sasaki, Katsunori, E-mail: k-sasaki@asahikawa-med.ac.jp [Department of Gastrointestinal Immunology and Regenerative Medicine, Asahikawa Medical University, Hokkaido 078-8510 (Japan); Tanaka, Hiroki [Department of Legal Medicine, Asahikawa Medical University, Hokkaido 078-8510 (Japan); Yamamoto, Masayo; Hatayama, Mayumi; Ito, Satoshi; Ikuta, Katsuya; Shindo, Motohiro; Hasebe, Takumu; Nakajima, Shunsuke; Sawada, Koji; Fujiya, Mikihiro [Division of Gastroenterology and Hematology/Oncology, Department of Medicine, Asahikawa Medical University, Hokkaido 078-8510 (Japan); Torimoto, Yoshihiro [Oncology Center, Asahikawa Medical University Hospital, Hokkaido 078-8510 (Japan); Ohtake, Takaaki; Kohgo, Yutaka [Department of Gastroenterology, International University of Health and Welfare Hospital, Tochigi 329-2763 (Japan)

    2016-08-05

    Hepcidin is a main regulator of iron metabolism, of which abnormal expression affects intestinal absorption and reticuloendothelial sequestration of iron by interacting with ferroportin. It is also noted that abnormal iron accumulation is one of the key factors to facilitate promotion and progression of cancer including hepatoma. By RT-PCR/agarose gel electrophoresis of hepcidin mRNA in a hepatocellular carcinoma cell line HLF, a smaller mRNA band was shown in addition to the wild-type hepcidin mRNA. From sequencing analysis, this additional band was a selective splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene, producing the transcript that encodes truncated peptide lacking 20 amino acids at the middle of preprohepcidin. In the present study, we used the digital PCR, because such a small amount of variant mRNA was difficult to quantitate by the conventional RT-PCR amplification. Among seven hepatoma-derived cell lines, six cell lines have significant copy numbers of this variant mRNA, but not in one cell line. In the transient transfection analysis of variant-type hepcidin cDNA, truncated preprohepcidin has a different character comparing with native preprohepcidin: its product is insensitive to digestion, and secreted into the medium as a whole preprohepcidin form without maturation. Loss or reduction of function of HAMP gene by aberrantly splicing may be a suitable phenomenon to obtain the proliferating advantage of hepatoma cells. - Highlights: • An aberrant splicing variant of hepcidin mRNA lacking exon 2 of HAMP gene. • Absolute quantification of hepcidin mRNA by digital PCR amplification. • Hepatoma-derived cell lines have significant copies of variant-type hepcidin mRNA. • Truncated preprohepcidin is secreted from cells without posttranslational cleavage.

  8. Ultrafast all-optical code-division multiple-access networks

    Science.gov (United States)

    Kwong, Wing C.; Prucnal, Paul R.; Liu, Yanming

    1992-12-01

    In optical code-division multiple access (CDMA), the architecture of optical encoders/decoders is another important factor that needs to be considered, besides the correlation properties of those already extensively studied optical codes. The architecture of optical encoders/decoders affects, for example, the amount of power loss and length of optical delays that are associated with code sequence generation and correlation, which, in turn, affect the power budget, size, and cost of an optical CDMA system. Various CDMA coding architectures are studied in the paper. In contrast to the encoders/decoders used in prime networks (i.e., prime encodes/decoders), which generate, select, and correlate code sequences by a parallel combination of fiber-optic delay-lines, and in 2n networks (i.e., 2n encoders/decoders), which generate and correlate code sequences by a serial combination of 2 X 2 passive couplers and fiber delays with sequence selection performed in a parallel fashion, the modified 2n encoders/decoders generate, select, and correlate code sequences by a serial combination of directional couplers and delays. The power and delay- length requirements of the modified 2n encoders/decoders are compared to that of the prime and 2n encoders/decoders. A 100 Mbit/s optical CDMA experiment in free space demonstrating the feasibility of the all-serial coding architecture using a serial combination of 50/50 beam splitters and retroreflectors at 10 Tchip/s (i.e., 100,000 chip/bit) with 100 fs laser pulses is reported.

  9. Application of MELCOR Code to a French PWR 900 MWe Severe Accident Sequence and Evaluation of Models Performance Focusing on In-Vessel Thermal Hydraulic Results

    International Nuclear Information System (INIS)

    De Rosa, Felice

    2006-01-01

    In the ambit of the Severe Accident Network of Excellence Project (SARNET), funded by the European Union, 6. FISA (Fission Safety) Programme, one of the main tasks is the development and validation of the European Accident Source Term Evaluation Code (ASTEC Code). One of the reference codes used to compare ASTEC results, coming from experimental and Reactor Plant applications, is MELCOR. ENEA is a SARNET member and also an ASTEC and MELCOR user. During the first 18 months of this project, we performed a series of MELCOR and ASTEC calculations referring to a French PWR 900 MWe and to the accident sequence of 'Loss of Steam Generator (SG) Feedwater' (known as H2 sequence in the French classification). H2 is an accident sequence substantially equivalent to a Station Blackout scenario, like a TMLB accident, with the only difference that in H2 sequence the scram is forced to occur with a delay of 28 seconds. The main events during the accident sequence are a loss of normal and auxiliary SG feedwater (0 s), followed by a scram when the water level in SG is equal or less than 0.7 m (after 28 seconds). There is also a main coolant pumps trip when ΔTsat < 10 deg. C, a total opening of the three relief valves when Tric (core maximal outlet temperature) is above 603 K (330 deg. C) and accumulators isolation when primary pressure goes below 1.5 MPa (15 bar). Among many other points, it is worth noting that this was the first time that a MELCOR 1.8.5 input deck was available for a French PWR 900. The main ENEA effort in this period was devoted to prepare the MELCOR input deck using the code version v.1.8.5 (build QZ Oct 2000 with the latest patch 185003 Oct 2001). The input deck, completely new, was prepared taking into account structure, data and same conditions as those found inside ASTEC input decks. The main goal of the work presented in this paper is to put in evidence where and when MELCOR provides good enough results and why, in some cases mainly referring to its

  10. The PRC2-binding long non-coding RNAs in human and mouse genomes are associated with predictive sequence features

    Science.gov (United States)

    Tu, Shiqi; Yuan, Guo-Cheng; Shao, Zhen

    2017-01-01

    Recently, long non-coding RNAs (lncRNAs) have emerged as an important class of molecules involved in many cellular processes. One of their primary functions is to shape epigenetic landscape through interactions with chromatin modifying proteins. However, mechanisms contributing to the specificity of such interactions remain poorly understood. Here we took the human and mouse lncRNAs that were experimentally determined to have physical interactions with Polycomb repressive complex 2 (PRC2), and systematically investigated the sequence features of these lncRNAs by developing a new computational pipeline for sequences composition analysis, in which each sequence is considered as a series of transitions between adjacent nucleotides. Through that, PRC2-binding lncRNAs were found to be associated with a set of distinctive and evolutionarily conserved sequence features, which can be utilized to distinguish them from the others with considerable accuracy. We further identified fragments of PRC2-binding lncRNAs that are enriched with these sequence features, and found they show strong PRC2-binding signals and are more highly conserved across species than the other parts, implying their functional importance.

  11. Genome-wide identification and functional prediction of nitrogen-responsive intergenic and intronic long non-coding RNAs in maize (Zea mays L.).

    Science.gov (United States)

    Lv, Yuanda; Liang, Zhikai; Ge, Min; Qi, Weicong; Zhang, Tifu; Lin, Feng; Peng, Zhaohua; Zhao, Han

    2016-05-11

    Nitrogen (N) is an essential and often limiting nutrient to plant growth and development. Previous studies have shown that the mRNA expressions of numerous genes are regulated by nitrogen supplies; however, little is known about the expressed non-coding elements, for example long non-coding RNAs (lncRNAs) that control the response of maize (Zea mays L.) to nitrogen. LncRNAs are a class of non-coding RNAs larger than 200 bp, which have emerged as key regulators in gene expression. In this study, we surveyed the intergenic/intronic lncRNAs in maize B73 leaves at the V7 stage under conditions of N-deficiency and N-sufficiency using ribosomal RNA depletion and ultra-deep total RNA sequencing approaches. By integration with mRNA expression profiles and physiological evaluations, 7245 lncRNAs and 637 nitrogen-responsive lncRNAs were identified that exhibited unique expression patterns. Co-expression network analysis showed that the nitrogen-responsive lncRNAs were enriched mainly in one of the three co-expressed modules. The genes in the enriched module are mainly involved in NADH dehydrogenase activity, oxidative phosphorylation and the nitrogen compounds metabolic process. We identified a large number of lncRNAs in maize and illustrated their potential regulatory roles in response to N stress. The results lay the foundation for further in-depth understanding of the molecular mechanisms of lncRNAs' role in response to nitrogen stresses.

  12. DNA barcode goes two-dimensions: DNA QR code web server.

    Science.gov (United States)

    Liu, Chang; Shi, Linchun; Xu, Xiaolan; Li, Huan; Xing, Hang; Liang, Dong; Jiang, Kun; Pang, Xiaohui; Song, Jingyuan; Chen, Shilin

    2012-01-01

    The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR) code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.

  13. DNA barcode goes two-dimensions: DNA QR code web server.

    Directory of Open Access Journals (Sweden)

    Chang Liu

    Full Text Available The DNA barcoding technology uses a standard region of DNA sequence for species identification and discovery. At present, "DNA barcode" actually refers to DNA sequences, which are not amenable to information storage, recognition, and retrieval. Our aim is to identify the best symbology that can represent DNA barcode sequences in practical applications. A comprehensive set of sequences for five DNA barcode markers ITS2, rbcL, matK, psbA-trnH, and CO1 was used as the test data. Fifty-three different types of one-dimensional and ten two-dimensional barcode symbologies were compared based on different criteria, such as coding capacity, compression efficiency, and error detection ability. The quick response (QR code was found to have the largest coding capacity and relatively high compression ratio. To facilitate the further usage of QR code-based DNA barcodes, a web server was developed and is accessible at http://qrfordna.dnsalias.org. The web server allows users to retrieve the QR code for a species of interests, convert a DNA sequence to and from a QR code, and perform species identification based on local and global sequence similarities. In summary, the first comprehensive evaluation of various barcode symbologies has been carried out. The QR code has been found to be the most appropriate symbology for DNA barcode sequences. A web server has also been constructed to allow biologists to utilize QR codes in practical DNA barcoding applications.

  14. Improving performance of DS-CDMA systems using chaotic complex Bernoulli spreading codes

    Science.gov (United States)

    Farzan Sabahi, Mohammad; Dehghanfard, Ali

    2014-12-01

    The most important goal of spreading spectrum communication system is to protect communication signals against interference and exploitation of information by unintended listeners. In fact, low probability of detection and low probability of intercept are two important parameters to increase the performance of the system. In Direct Sequence Code Division Multiple Access (DS-CDMA) systems, these properties are achieved by multiplying the data information in spreading sequences. Chaotic sequences, with their particular properties, have numerous applications in constructing spreading codes. Using one-dimensional Bernoulli chaotic sequence as spreading code is proposed in literature previously. The main feature of this sequence is its negative auto-correlation at lag of 1, which with proper design, leads to increase in efficiency of the communication system based on these codes. On the other hand, employing the complex chaotic sequences as spreading sequence also has been discussed in several papers. In this paper, use of two-dimensional Bernoulli chaotic sequences is proposed as spreading codes. The performance of a multi-user synchronous and asynchronous DS-CDMA system will be evaluated by applying these sequences under Additive White Gaussian Noise (AWGN) and fading channel. Simulation results indicate improvement of the performance in comparison with conventional spreading codes like Gold codes as well as similar complex chaotic spreading sequences. Similar to one-dimensional Bernoulli chaotic sequences, the proposed sequences also have negative auto-correlation. Besides, construction of complex sequences with lower average cross-correlation is possible with the proposed method.

  15. Matrin 3 binds and stabilizes mRNA.

    Directory of Open Access Journals (Sweden)

    Maayan Salton

    Full Text Available Matrin 3 (MATR3 is a highly conserved, inner nuclear matrix protein with two zinc finger domains and two RNA recognition motifs (RRM, whose function is largely unknown. Recently we found MATR3 to be phosphorylated by the protein kinase ATM, which activates the cellular response to double strand breaks in the DNA. Here, we show that MATR3 interacts in an RNA-dependent manner with several proteins with established roles in RNA processing, and maintains its interaction with RNA via its RRM2 domain. Deep sequencing of the bound RNA (RIP-seq identified several small noncoding RNA species. Using microarray analysis to explore MATR3's role in transcription, we identified 77 transcripts whose amounts depended on the presence of MATR3. We validated this finding with nine transcripts which were also bound to the MATR3 complex. Finally, we demonstrated the importance of MATR3 for maintaining the stability of several of these mRNA species and conclude that it has a role in mRNA stabilization. The data suggest that the cellular level of MATR3, known to be highly regulated, modulates the stability of a group of gene transcripts.

  16. Heritability in the efficiency of nonsense-mediated mRNA decay in humans.

    LENUS (Irish Health Repository)

    Seoighe, Cathal

    2010-01-01

    BACKGROUND: In eukaryotes mRNA transcripts of protein-coding genes in which an intron has been retained in the coding region normally result in premature stop codons and are therefore degraded through the nonsense-mediated mRNA decay (NMD) pathway. There is evidence in the form of selective pressure for in-frame stop codons in introns and a depletion of length three introns that this is an important and conserved quality-control mechanism. Yet recent reports have revealed that the efficiency of NMD varies across tissues and between individuals, with important clinical consequences. PRINCIPAL FINDINGS: Using previously published Affymetrix exon microarray data from cell lines genotyped as part of the International HapMap project, we investigated whether there are heritable, inter-individual differences in the abundance of intron-containing transcripts, potentially reflecting differences in the efficiency of NMD. We identified intronic probesets using EST data and report evidence of heritability in the extent of intron expression in 56 HapMap trios. We also used a genome-wide association approach to identify genetic markers associated with intron expression. Among the top candidates was a SNP in the DCP1A gene, which forms part of the decapping complex, involved in NMD. CONCLUSIONS: While we caution that some of the apparent inter-individual difference in intron expression may be attributable to different handling or treatments of cell lines, we hypothesize that there is significant polymorphism in the process of NMD, resulting in heritable differences in the abundance of intronic mRNA. Part of this phenotype is likely to be due to a polymorphism in a decapping enzyme on human chromosome 3.

  17. Heritability in the efficiency of nonsense-mediated mRNA decay in humans

    KAUST Repository

    Seoighe, Cathal

    2010-07-21

    Background: In eukaryotes mRNA transcripts of protein-coding genes in which an intron has been retained in the coding region normally result in premature stop codons and are therefore degraded through the nonsense-mediated mRNA decay (NMD) pathway. There is evidence in the form of selective pressure for in-frame stop codons in introns and a depletion of length three introns that this is an important and conserved quality-control mechanism. Yet recent reports have revealed that the efficiency of NMD varies across tissues and between individuals, with important clinical consequences. Principal Findings: Using previously published Affymetrix exon microarray data from cell lines genotyped as part of the International HapMap project, we investigated whether there are heritable, inter-individual differences in the abundance of intron-containing transcripts, potentially reflecting differences in the efficiency of NMD. We identified intronic probesets using EST data and report evidence of heritability in the extent of intron expression in 56 HapMap trios. We also used a genome-wide association approach to identify genetic markers associated with intron expression. Among the top candidates was a SNP in the DCP1A gene, which forms part of the decapping complex, involved in NMD. Conclusions: While we caution that some of the apparent inter-individual difference in intron expression may be attributable to different handling or treatments of cell lines, we hypothesize that there is significant polymorphism in the process of NMD, resulting in heritable differences in the abundance of intronic mRNA. Part of this phenotype is likely to be due to a polymorphism in a decapping enzyme on human chromosome 3. © 2010 Seoighe, Gehring.

  18. Genomic DNA sequence and cytosine methylation changes of adult rice leaves after seeds space flight

    Science.gov (United States)

    Shi, Jinming

    In this study, cytosine methylation on CCGG site and genomic DNA sequence changes of adult leaves of rice after seeds space flight were detected by methylation-sensitive amplification polymorphism (MSAP) and Amplified fragment length polymorphism (AFLP) technique respectively. Rice seeds were planted in the trial field after 4 days space flight on the shenzhou-6 Spaceship of China. Adult leaves of space-treated rice including 8 plants chosen randomly and 2 plants with phenotypic mutation were used for AFLP and MSAP analysis. Polymorphism of both DNA sequence and cytosine methylation were detected. For MSAP analysis, the average polymorphic frequency of the on-ground controls, space-treated plants and mutants are 1.3%, 3.1% and 11% respectively. For AFLP analysis, the average polymorphic frequencies are 1.4%, 2.9%and 8%respectively. Total 27 and 22 polymorphic fragments were cloned sequenced from MSAP and AFLP analysis respectively. Nine of the 27 fragments from MSAP analysis show homology to coding sequence. For the 22 polymorphic fragments from AFLP analysis, no one shows homology to mRNA sequence and eight fragments show homology to repeat region or retrotransposon sequence. These results suggest that although both genomic DNA sequence and cytosine methylation status can be effected by space flight, the genomic region homology to the fragments from genome DNA and cytosine methylation analysis were different.

  19. Deep mRNA sequencing of the Tritonia diomedea brain transcriptome provides access to gene homologues for neuronal excitability, synaptic transmission and peptidergic signalling.

    Directory of Open Access Journals (Sweden)

    Adriano Senatore

    Full Text Available The sea slug Tritonia diomedea (Mollusca, Gastropoda, Nudibranchia, has a simple and highly accessible nervous system, making it useful for studying neuronal and synaptic mechanisms underlying behavior. Although many important contributions have been made using Tritonia, until now, a lack of genetic information has impeded exploration at the molecular level.We performed Illumina sequencing of central nervous system mRNAs from Tritonia, generating 133.1 million 100 base pair, paired-end reads. De novo reconstruction of the RNA-Seq data yielded a total of 185,546 contigs, which partitioned into 123,154 non-redundant gene clusters (unigenes. BLAST comparison with RefSeq and Swiss-Prot protein databases, as well as mRNA data from other invertebrates (gastropod molluscs: Aplysia californica, Lymnaea stagnalis and Biomphalaria glabrata; cnidarian: Nematostella vectensis revealed that up to 76,292 unigenes in the Tritonia transcriptome have putative homologues in other databases, 18,246 of which are below a more stringent E-value cut-off of 1x10-6. In silico prediction of secreted proteins from the Tritonia transcriptome shotgun assembly (TSA produced a database of 579 unique sequences of secreted proteins, which also exhibited markedly higher expression levels compared to other genes in the TSA.Our efforts greatly expand the availability of gene sequences available for Tritonia diomedea. We were able to extract full length protein sequences for most queried genes, including those involved in electrical excitability, synaptic vesicle release and neurotransmission, thus confirming that the transcriptome will serve as a useful tool for probing the molecular correlates of behavior in this species. We also generated a neurosecretome database that will serve as a useful tool for probing peptidergic signalling systems in the Tritonia brain.

  20. A possible contribution of mRNA secondary structure to translation initiation efficiency in Lactococcus lactis

    NARCIS (Netherlands)

    Guchte, Maarten van de; Lende, Ted van der; Kok, Jan; Venema, Gerard

    1991-01-01

    Gene expression signals derived from Lactococcus lactis were linked to lacZ-fused genes with different 5'-nucleotide sequences. Computer predictions of mRNA secondary structure were combined with lacZ expression studies to direct base-substitutions that could possibly influence gene expression.

  1. Translation initiation in bacterial polysomes through ribosome loading on a standby site on a highly translated mRNA

    Science.gov (United States)

    Andreeva, Irena

    2018-01-01

    During translation, consecutive ribosomes load on an mRNA and form a polysome. The first ribosome binds to a single-stranded mRNA region and moves toward the start codon, unwinding potential mRNA structures on the way. In contrast, the following ribosomes can dock at the start codon only when the first ribosome has vacated the initiation site. Here we show that loading of the second ribosome on a natural 38-nt-long 5′ untranslated region of lpp mRNA, which codes for the outer membrane lipoprotein from Escherichia coli, takes place before the leading ribosome has moved away from the start codon. The rapid formation of this standby complex depends on the presence of ribosomal proteins S1/S2 in the leading ribosome. The early recruitment of the second ribosome to the standby site before translation by the leading ribosome and the tight coupling between translation elongation by the first ribosome and the accommodation of the second ribosome can contribute to high translational efficiency of the lpp mRNA. PMID:29632209

  2. Interface requirements to couple thermal hydraulics codes to severe accident codes: ICARE/CATHARE

    Energy Technology Data Exchange (ETDEWEB)

    Camous, F.; Jacq, F.; Chatelard, P. [IPSN/DRS/SEMAR CE-Cadarache, St Paul Lez Durance (France)] [and others

    1997-07-01

    In order to describe with the same code the whole sequence of severe LWR accidents, up to the vessel failure, the Institute of Protection and Nuclear Safety has performed a coupling of the severe accident code ICARE2 to the thermalhydraulics code CATHARE2. The resulting code, ICARE/CATHARE, is designed to be as pertinent as possible in all the phases of the accident. This paper is mainly devoted to the description of the ICARE2-CATHARE2 coupling.

  3. Adaptive decoding of convolutional codes

    Science.gov (United States)

    Hueske, K.; Geldmacher, J.; Götze, J.

    2007-06-01

    Convolutional codes, which are frequently used as error correction codes in digital transmission systems, are generally decoded using the Viterbi Decoder. On the one hand the Viterbi Decoder is an optimum maximum likelihood decoder, i.e. the most probable transmitted code sequence is obtained. On the other hand the mathematical complexity of the algorithm only depends on the used code, not on the number of transmission errors. To reduce the complexity of the decoding process for good transmission conditions, an alternative syndrome based decoder is presented. The reduction of complexity is realized by two different approaches, the syndrome zero sequence deactivation and the path metric equalization. The two approaches enable an easy adaptation of the decoding complexity for different transmission conditions, which results in a trade-off between decoding complexity and error correction performance.

  4. Endogenous ribosomal frameshift signals operate as mRNA destabilizing elements through at least two molecular pathways in yeast.

    Science.gov (United States)

    Belew, Ashton T; Advani, Vivek M; Dinman, Jonathan D

    2011-04-01

    Although first discovered in viruses, previous studies have identified operational -1 ribosomal frameshifting (-1 RF) signals in eukaryotic genomic sequences, and suggested a role in mRNA stability. Here, four yeast -1 RF signals are shown to promote significant mRNA destabilization through the nonsense mediated mRNA decay pathway (NMD), and genetic evidence is presented suggesting that they may also operate through the no-go decay pathway (NGD) as well. Yeast EST2 mRNA is highly unstable and contains up to five -1 RF signals. Ablation of the -1 RF signals or of NMD stabilizes this mRNA, and changes in -1 RF efficiency have opposing effects on the steady-state abundance of the EST2 mRNA. These results demonstrate that endogenous -1 RF signals function as mRNA destabilizing elements through at least two molecular pathways in yeast. Consistent with current evolutionary theory, phylogenetic analyses suggest that -1 RF signals are rapidly evolving cis-acting regulatory elements. Identification of high confidence -1 RF signals in ∼10% of genes in all eukaryotic genomes surveyed suggests that -1 RF is a broadly used post-transcriptional regulator of gene expression.

  5. Syndrome-source-coding and its universal generalization. [error correcting codes for data compression

    Science.gov (United States)

    Ancheta, T. C., Jr.

    1976-01-01

    A method of using error-correcting codes to obtain data compression, called syndrome-source-coding, is described in which the source sequence is treated as an error pattern whose syndrome forms the compressed data. It is shown that syndrome-source-coding can achieve arbitrarily small distortion with the number of compressed digits per source digit arbitrarily close to the entropy of a binary memoryless source. A 'universal' generalization of syndrome-source-coding is formulated which provides robustly effective distortionless coding of source ensembles. Two examples are given, comparing the performance of noiseless universal syndrome-source-coding to (1) run-length coding and (2) Lynch-Davisson-Schalkwijk-Cover universal coding for an ensemble of binary memoryless sources.

  6. Application of Quaternion in improving the quality of global sequence alignment scores for an ambiguous sequence target in Streptococcus pneumoniae DNA

    Science.gov (United States)

    Lestari, D.; Bustamam, A.; Novianti, T.; Ardaneswari, G.

    2017-07-01

    DNA sequence can be defined as a succession of letters, representing the order of nucleotides within DNA, using a permutation of four DNA base codes including adenine (A), guanine (G), cytosine (C), and thymine (T). The precise code of the sequences is determined using DNA sequencing methods and technologies, which have been developed since the 1970s and currently become highly developed, advanced and highly throughput sequencing technologies. So far, DNA sequencing has greatly accelerated biological and medical research and discovery. However, in some cases DNA sequencing could produce any ambiguous and not clear enough sequencing results that make them quite difficult to be determined whether these codes are A, T, G, or C. To solve these problems, in this study we can introduce other representation of DNA codes namely Quaternion Q = (PA, PT, PG, PC), where PA, PT, PG, PC are the probability of A, T, G, C bases that could appear in Q and PA + PT + PG + PC = 1. Furthermore, using Quaternion representations we are able to construct the improved scoring matrix for global sequence alignment processes, by applying a dot product method. Moreover, this scoring matrix produces better and higher quality of the match and mismatch score between two DNA base codes. In implementation, we applied the Needleman-Wunsch global sequence alignment algorithm using Octave, to analyze our target sequence which contains some ambiguous sequence data. The subject sequences are the DNA sequences of Streptococcus pneumoniae families obtained from the Genebank, meanwhile the target DNA sequence are received from our collaborator database. As the results we found the Quaternion representations improve the quality of the sequence alignment score and we can conclude that DNA sequence target has maximum similarity with Streptococcus pneumoniae.

  7. Gold nanoparticle-based beacon to detect STAT5b mRNA expression in living cells: a case optimized by bioinformatics screen.

    Science.gov (United States)

    Deng, Dawei; Li, Yang; Xue, Jianpeng; Wang, Jie; Ai, Guanhua; Li, Xin; Gu, Yueqing

    2015-01-01

    Messenger RNA (mRNA), a single-strand ribonucleic acid with functional gene information is usually abnormally expressed in cancer cells and has become a promising biomarker for the study of tumor progress. Hairpin DNA-coated gold nanoparticle (hDAuNP) beacon containing a bare gold nanoparticle (AuNP) as fluorescence quencher and thiol-terminated fluorescently labeled stem-loop-stem oligonucleotide sequences attached by Au-S bond is currently a new nanoscale biodiagnostic platform capable of mRNA detection, in which the design of the loop region sequence is crucial for hybridizing with the target mRNA. Hence, in this study, to improve the sensitivity and selectivity of hDAuNP beacon simultaneously, the loop region of hairpin DNA was screened by bioinformatics strategy. Here, signal transducer and activator of transcription 5b (STAT5b) mRNA was selected and used as a practical example. The results from the combined characterizations using optical techniques, flow cytometry assay, and cell microscopic imaging showed that after optimization, the as-prepared hDAuNP beacon had higher selectivity and sensitivity for the detection of STAT5b mRNA in living cells, as compared with our previous beacon. Thus, the bioinformatics method may be a promising new strategy for assisting in the designing of the hDAuNP beacon, extending its application in the detection of mRNA expression and the resultant mRNA-based biological processes and disease pathogenesis.

  8. 5'-Terminal AUGs in Escherichia coli mRNAs with Shine-Dalgarno Sequences: Identification and Analysis of Their Roles in Non-Canonical Translation Initiation.

    Directory of Open Access Journals (Sweden)

    Heather J Beck

    Full Text Available Analysis of the Escherichia coli transcriptome identified a unique subset of messenger RNAs (mRNAs that contain a conventional untranslated leader and Shine-Dalgarno (SD sequence upstream of the gene's start codon while also containing an AUG triplet at the mRNA's 5'- terminus (5'-uAUG. Fusion of the coding sequence specified by the 5'-terminal putative AUG start codon to a lacZ reporter gene, as well as primer extension inhibition assays, reveal that the majority of the 5'-terminal upstream open reading frames (5'-uORFs tested support some level of lacZ translation, indicating that these mRNAs can function both as leaderless and canonical SD-leadered mRNAs. Although some of the uORFs were expressed at low levels, others were expressed at levels close to that of the respective downstream genes and as high as the naturally leaderless cI mRNA of bacteriophage λ. These 5'-terminal uORFs potentially encode peptides of varying lengths, but their functions, if any, are unknown. In an effort to determine whether expression from the 5'-terminal uORFs impact expression of the immediately downstream cistron, we examined expression from the downstream coding sequence after mutations were introduced that inhibit efficient 5'-uORF translation. These mutations were found to affect expression from the downstream cistrons to varying degrees, suggesting that some 5'-uORFs may play roles in downstream regulation. Since the 5'-uAUGs found on these conventionally leadered mRNAs can function to bind ribosomes and initiate translation, this indicates that canonical mRNAs containing 5'-uAUGs should be examined for their potential to function also as leaderless mRNAs.

  9. Flexible manipulation of terahertz wave reflection using polarization insensitive coding metasurfaces.

    Science.gov (United States)

    Jiu-Sheng, Li; Ze-Jiang, Zhao; Jian-Quan, Yao

    2017-11-27

    In order to extend to 3-bit encoding, we propose notched-wheel structures as polarization insensitive coding metasurfaces to control terahertz wave reflection and suppress backward scattering. By using a coding sequence of "00110011…" along x-axis direction and 16 × 16 random coding sequence, we investigate the polarization insensitive properties of the coding metasurfaces. By designing the coding sequences of the basic coding elements, the terahertz wave reflection can be flexibly manipulated. Additionally, radar cross section (RCS) reduction in the backward direction is less than -10dB in a wide band. The present approach can offer application for novel terahertz manipulation devices.

  10. Is a genome a codeword of an error-correcting code?

    Directory of Open Access Journals (Sweden)

    Luzinete C B Faria

    Full Text Available Since a genome is a discrete sequence, the elements of which belong to a set of four letters, the question as to whether or not there is an error-correcting code underlying DNA sequences is unavoidable. The most common approach to answering this question is to propose a methodology to verify the existence of such a code. However, none of the methodologies proposed so far, although quite clever, has achieved that goal. In a recent work, we showed that DNA sequences can be identified as codewords in a class of cyclic error-correcting codes known as Hamming codes. In this paper, we show that a complete intron-exon gene, and even a plasmid genome, can be identified as a Hamming code codeword as well. Although this does not constitute a definitive proof that there is an error-correcting code underlying DNA sequences, it is the first evidence in this direction.

  11. Codes and curves

    CERN Document Server

    Walker, Judy L

    2000-01-01

    When information is transmitted, errors are likely to occur. Coding theory examines efficient ways of packaging data so that these errors can be detected, or even corrected. The traditional tools of coding theory have come from combinatorics and group theory. Lately, however, coding theorists have added techniques from algebraic geometry to their toolboxes. In particular, by re-interpreting the Reed-Solomon codes, one can see how to define new codes based on divisors on algebraic curves. For instance, using modular curves over finite fields, Tsfasman, Vladut, and Zink showed that one can define a sequence of codes with asymptotically better parameters than any previously known codes. This monograph is based on a series of lectures the author gave as part of the IAS/PCMI program on arithmetic algebraic geometry. Here, the reader is introduced to the exciting field of algebraic geometric coding theory. Presenting the material in the same conversational tone of the lectures, the author covers linear codes, inclu...

  12. Adaptive decoding of convolutional codes

    Directory of Open Access Journals (Sweden)

    K. Hueske

    2007-06-01

    Full Text Available Convolutional codes, which are frequently used as error correction codes in digital transmission systems, are generally decoded using the Viterbi Decoder. On the one hand the Viterbi Decoder is an optimum maximum likelihood decoder, i.e. the most probable transmitted code sequence is obtained. On the other hand the mathematical complexity of the algorithm only depends on the used code, not on the number of transmission errors. To reduce the complexity of the decoding process for good transmission conditions, an alternative syndrome based decoder is presented. The reduction of complexity is realized by two different approaches, the syndrome zero sequence deactivation and the path metric equalization. The two approaches enable an easy adaptation of the decoding complexity for different transmission conditions, which results in a trade-off between decoding complexity and error correction performance.

  13. Comparison of the frequency of functional SH3 domains with different limited sets of amino acids using mRNA display.

    Directory of Open Access Journals (Sweden)

    Junko Tanaka

    Full Text Available Although modern proteins consist of 20 different amino acids, it has been proposed that primordial proteins consisted of a small set of amino acids, and additional amino acids have gradually been recruited into the genetic code. This hypothesis has recently been supported by comparative genome sequence analysis, but no direct experimental approach has been reported. Here, we utilized a novel experimental approach to test a hypothesis that native-like globular proteins might be easily simplified by a set of putative primitive amino acids with retention of its structure and function than by a set of putative new amino acids. We performed in vitro selection of a functional SH3 domain as a model from partially randomized libraries with different sets of amino acids using mRNA display. Consequently, a library rich in putative primitive amino acids included a larger number of functional SH3 sequences than a library rich in putative new amino acids. Further, the functional SH3 sequences were enriched from the primitive library slightly earlier than from a randomized library with the full set of amino acids, while the function and structure of the selected SH3 proteins with the primitive alphabet were comparable with those from the 20 amino acid alphabet. Application of this approach to various combinations of codons in protein sequences may be useful not only for clarifying the precise order of the amino acid expansion in the early stages of protein evolution but also for efficiently creating novel functional proteins in the laboratory.

  14. Regulation of mRNA Levels by Decay-Promoting Introns that Recruit the Exosome Specificity Factor Mmi1

    Directory of Open Access Journals (Sweden)

    Cornelia Kilchert

    2015-12-01

    Full Text Available In eukaryotic cells, inefficient splicing is surprisingly common and leads to the degradation of transcripts with retained introns. How pre-mRNAs are committed to nuclear decay is unknown. Here, we uncover a mechanism by which specific intron-containing transcripts are targeted for nuclear degradation in fission yeast. Sequence elements within these “decay-promoting” introns co-transcriptionally recruit the exosome specificity factor Mmi1, which induces degradation of the unspliced precursor and leads to a reduction in the levels of the spliced mRNA. This mechanism negatively regulates levels of the RNA helicase DDX5/Dbp2 to promote cell survival in response to stress. In contrast, fast removal of decay-promoting introns by co-transcriptional splicing precludes Mmi1 recruitment and relieves negative expression regulation. We propose that decay-promoting introns facilitate the regulation of gene expression. Based on the identification of multiple additional Mmi1 targets, including mRNAs, long non-coding RNAs, and sn/snoRNAs, we suggest a general role in RNA regulation for Mmi1 through transcript degradation.

  15. Permissive effect of dexamethasone on the increase of proenkephalin mRNA induced by depolarization of chromaffin cells

    International Nuclear Information System (INIS)

    Naranjo, J.R.; Mocchetti, I.; Schwartz, J.P.; Costa, E.

    1986-01-01

    In cultured bovine chromaffin cells, changes in the dynamic state of enkephalin stores elicited experimentally were studied by measuring cellular proenkephalin mRNA, as well as enkephalin precursors and authentic enkephalin content of cells and culture media. In parallel, tyrosine hydroxylase mRNA and catecholamine cell content were also determined. Low concentrations (0.5-100 pM) of dexamethasone increased the cell contents of proenkephalin mRNA and enkephalin-containing peptides. High concentrations of the hormone(1 μM) were required to increase the cell contents of tyrosine hydroxylase mRNA and catecholamines. Depolarization of the cells with 10 μM veratridine resulted in a depletion of enkephalin and catecholamine stores after 24 hr. The enkephalin, but not the catecholamine, content was restored by 48 hr. An increase in proenkephalin mRNA content might account for the recovery; this increase was curtailed by tetrodotoxin and enhanced by 10 pM dexamethasone. Tyrosine hydroxylase mRNA content was not significantly modified by depolarization, even in the presence of 1 μM dexamethasone. Aldosterone, progesterone, testosterone, or estradiol (1 μM) failed to change proenkephalin mRNA. Hence, dexamethasone appears to exert a specific permissive action on the stimulation of the proenkephalin gene elicited by depolarization. Though the catecholamines and enkephalins are localized in the same chromaffin granules and are coreleased by depolarization, the genes coding for the processes that are rate limiting in the production of these neuromodulators can be differentially regulated

  16. The Beads of Translation: Using Beads to Translate mRNA into a Polypeptide Bracelet

    Science.gov (United States)

    Dunlap, Dacey; Patrick, Patricia

    2012-01-01

    During this activity, by making beaded bracelets that represent the steps of translation, students simulate the creation of an amino acid chain. They are given an mRNA sequence that they translate into a corresponding polypeptide chain (beads). This activity focuses on the events and sites of translation. The activity provides students with a…

  17. Noncoding sequence classification based on wavelet transform analysis: part I

    Science.gov (United States)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.

  18. Phylogenetic analyses of the polyprotein coding sequences of serotype O foot-and-mouth disease viruses in East Africa: evidence for interserotypic recombination

    Directory of Open Access Journals (Sweden)

    Balinda Sheila N

    2010-08-01

    Full Text Available Abstract Background Foot-and-mouth disease (FMD is endemic in East Africa with the majority of the reported outbreaks attributed to serotype O virus. In this study, phylogenetic analyses of the polyprotein coding region of serotype O FMD viruses from Kenya and Uganda has been undertaken to infer evolutionary relationships and processes responsible for the generation and maintenance of diversity within this serotype. FMD virus RNA was obtained from six samples following virus isolation in cell culture and in one case by direct extraction from an oropharyngeal sample. Following RT-PCR, the single long open reading frame, encoding the polyprotein, was sequenced. Results Phylogenetic comparisons of the VP1 coding region showed that the recent East African viruses belong to one lineage within the EA-2 topotype while an older Kenyan strain, K/52/1992 is a representative of the topotype EA-1. Evolutionary relationships between the coding regions for the leader protease (L, the capsid region and almost the entire coding region are monophyletic except for the K/52/1992 which is distinct. Furthermore, phylogenetic relationships for the P2 and P3 regions suggest that the K/52/1992 is a probable recombinant between serotypes A and O. A bootscan analysis of K/52/1992 with East African FMD serotype A viruses (A21/KEN/1964 and A23/KEN/1965 and serotype O viral isolate (K/117/1999 revealed that the P2 region is probably derived from a serotype A strain while the P3 region appears to be a mosaic derived from both serotypes A and O. Conclusions Sequences of the VP1 coding region from recent serotype O FMDVs from Kenya and Uganda are all representatives of a specific East African lineage (topotype EA-2, a probable indication that hardly any FMD introductions of this serotype have occurred from outside the region in the recent past. Furthermore, evidence for interserotypic recombination, within the non-structural protein coding regions, between FMDVs of serotypes A

  19. The 0.3-kb fragment containing the R-U5-5'leader sequence of Friend murine leukemia virus influences the level of protein expression from spliced mRNA.

    Science.gov (United States)

    Choo, Yeng Cheng; Seki, Yohei; Machinaga, Akihito; Ogita, Nobuo; Takase-Yoden, Sayaka

    2013-04-19

    A neuropathogenic variant of Friend murine leukemia virus (Fr-MLV) clone A8 induces spongiform neurodegeneration when infected into neonatal rats. Studies with chimeras constructed from the A8 virus and the non-neuropathogenic Fr-MLV clone 57 identified a 0.3-kb KpnI-AatII fragment containing a R-U5-5'leader sequence as an important determinant for inducing spongiosis, in addition to the env gene of A8 as the primary determinant. This 0.3-kb fragment contains a 17-nucleotide difference between the A8 and 57 sequences. We previously showed that the 0.3-kb fragment influences expression levels of Env protein in both cultured cells and rat brain, but the corresponding molecular mechanisms are not well understood. Studies with expression vectors constructed from the full-length proviral genome of Fr-MLV that incorporated the luciferase (luc) gene instead of the env gene found that the vector containing the A8-0.3-kb fragment yielded a larger amount of spliced luc-mRNA and showed higher expression of luciferase when compared to the vector containing the 57-0.3-kb fragment. The amount of total transcripts from the vectors, the poly (A) tail length of their mRNAs, and the nuclear-cytoplasm distribution of luc-mRNA in transfected cells were also evaluated. The 0.3-kb fragment did not influence transcription efficiency, mRNA polyadenylation or nuclear export of luc-mRNA. Mutational analyses were carried out to determine the importance of nucleotides that differ between the A8 and 57 sequences within the 0.3-kb fragment. In particular, seven nucleotides upstream of the 5'splice site (5'ss) were found to be important in regulating the level of protein expression from spliced messages. Interestingly, these nucleotides reside within the stem-loop structure that has been speculated to limit the recognition of 5'ss. The 0.3-kb fragment containing the R-U5-5'leader sequence of Fr-MLV influences the level of protein expression from the spliced-mRNA by regulating the splicing

  20. Elevation of D4 dopamine receptor mRNA in postmortem schizophrenic brain.

    Science.gov (United States)

    Stefanis, N C; Bresnick, J N; Kerwin, R W; Schofield, W N; McAllister, G

    1998-01-01

    The D4 dopamine (DA) receptor has been proposed to be a target for the development of a novel antipsychotic drug based on its pharmacological and distribution profile. There is much interest in whether D4 DA receptor levels are altered in schizophrenia, but the lack of an available receptor subtype-specific radioligand made this difficult to quantitate. In this study, we examined whether D4 mRNA levels are altered in different brain regions of schizophrenics compared to controls. Ribonuclease protection assays were carried out on total RNA samples isolated postmortem from frontal cortex and caudate brain regions of schizophrenics and matched controls. 32P-labelled RNA probes to the D4 DA receptor and to the housekeeping gene, glyceraldehyde-3-phosphate dehydrogenase (G3PDH), were hybridised with the RNA samples, digested with ribonucleases to remove unhybridised probe, and separated on 6% sequencing gels. Densitometer analysis on the subsequent autoradiogams was used to calculate the relative optical density of D4 mRNA compared to G3PDH mRNA. Statistical analysis of the data revealed a 3-fold higher level (P<0.011) of D4 mRNA in the frontal cortex of schizophrenics compared to controls. No increase was seen in caudate. D4 receptors could play a role in mediating dopaminergic activity in frontal cortex, an activity which may be malfunctioning in schizophrenia.

  1. mRNA Cancer Vaccines-Messages that Prevail.

    Science.gov (United States)

    Grunwitz, Christian; Kranz, Lena M

    2017-01-01

    During the last decade, mRNA became increasingly recognized as a versatile tool for the development of new innovative therapeutics. Especially for vaccine development, mRNA is of outstanding interest and numerous clinical trials have been initiated. Strikingly, all of these studies have proven that large-scale GMP production of mRNA is feasible and concordantly report a favorable safety profile of mRNA vaccines. Induction of T-cell immunity is a multi-faceted process comprising antigen acquisition, antigen processing and presentation, as well as immune stimulation. The effectiveness of mRNA vaccines is critically dependent on making the antigen(s) of interest available to professional antigen-presenting cells, especially DCs. Efficient delivery of mRNA into DCs in vivo remains a major challenge in the mRNA vaccine field. This review summarizes the principles of mRNA vaccines and highlights the importance of in vivo mRNA delivery and recent advances in harnessing their therapeutic potential.

  2. Performance Analysis of Direct-Sequence Code-Division Multiple-Access Communications with Asymmetric Quadrature Phase-Shift-Keying Modulation

    Science.gov (United States)

    Wang, C.-W.; Stark, W.

    2005-01-01

    This article considers a quaternary direct-sequence code-division multiple-access (DS-CDMA) communication system with asymmetric quadrature phase-shift-keying (AQPSK) modulation for unequal error protection (UEP) capability. Both time synchronous and asynchronous cases are investigated. An expression for the probability distribution of the multiple-access interference is derived. The exact bit-error performance and the approximate performance using a Gaussian approximation and random signature sequences are evaluated by extending the techniques used for uniform quadrature phase-shift-keying (QPSK) and binary phase-shift-keying (BPSK) DS-CDMA systems. Finally, a general system model with unequal user power and the near-far problem is considered and analyzed. The results show that, for a system with UEP capability, the less protected data bits are more sensitive to the near-far effect that occurs in a multiple-access environment than are the more protected bits.

  3. Principles of mRNA transport in yeast.

    Science.gov (United States)

    Heym, Roland Gerhard; Niessing, Dierk

    2012-06-01

    mRNA localization and localized translation is a common mechanism by which cellular asymmetry is achieved. In higher eukaryotes the mRNA transport machinery is required for such diverse processes as stem cell division and neuronal plasticity. Because mRNA localization in metazoans is highly complex, studies at the molecular level have proven to be cumbersome. However, active mRNA transport has also been reported in fungi including Saccharomyces cerevisiae, Ustilago maydis and Candida albicans, in which these events are less difficult to study. Amongst them, budding yeast S. cerevisiae has yielded mechanistic insights that exceed our understanding of other mRNA localization events to date. In contrast to most reviews, we refrain here from summarizing mRNA localization events from different organisms. Instead we give an in-depth account of ASH1 mRNA localization in budding yeast. This approach is particularly suited to providing a more holistic view of the interconnection between the individual steps of mRNA localization, from transcriptional events to cytoplasmic mRNA transport and localized translation. Because of our advanced mechanistic understanding of mRNA localization in yeast, the present review may also be informative for scientists working, for example, on mRNA localization in embryogenesis or in neurons.

  4. Endoplasmic reticulum stress increases AT1R mRNA expression via TIA-1-dependent mechanism.

    Science.gov (United States)

    Backlund, Michael; Paukku, Kirsi; Kontula, Kimmo K; Lehtonen, Jukka Y A

    2016-04-20

    As the formation of ribonucleoprotein complexes is a major mechanism of angiotensin II type 1 receptor (AT1R) regulation, we sought to identify novel AT1R mRNA binding proteins. By affinity purification and mass spectroscopy, we identified TIA-1. This interaction was confirmed by colocalization of AT1R mRNA and TIA-1 by FISH and immunofluorescence microscopy. In immunoprecipitates of endogenous TIA- 1, reverse transcription-PCR amplified AT1R mRNA. TIA-1 has two binding sites within AT1R 3'-UTR. The binding site proximal to the coding region is glyceraldehyde-3-phosphate dehydrogenase (GAPDH)-dependent whereas the distal binding site is not. TIA-1 functions as a part of endoplasmic reticulum (ER) stress response leading to stress granule (SG) formation and translational silencing. We and others have shown that AT1R expression is increased by ER stress-inducing factors. In unstressed cells, TIA-1 binds to AT1R mRNA and decreases AT1R protein expression. Fluorescence microscopy shows that ER stress induced by thapsigargin leads to the transfer of TIA-1 to SGs. In FISH analysis AT1R mRNA remains in the cytoplasm and no longer colocalizes with TIA-1. Thus, release of TIA-1-mediated suppression by ER stress increases AT1R protein expression. In conclusion, AT1R mRNA is regulated by TIA-1 in a ER stress-dependent manner. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  5. Discovery of precursor and mature microRNAs and their putative gene targets using high-throughput sequencing in pineapple (Ananas comosus var. comosus).

    Science.gov (United States)

    Yusuf, Noor Hydayaty Md; Ong, Wen Dee; Redwan, Raimi Mohamed; Latip, Mariam Abd; Kumar, S Vijay

    2015-10-15

    MicroRNAs (miRNAs) are a class of small, endogenous non-coding RNAs that negatively regulate gene expression, resulting in the silencing of target mRNA transcripts through mRNA cleavage or translational inhibition. MiRNAs play significant roles in various biological and physiological processes in plants. However, the miRNA-mediated gene regulatory network in pineapple, the model tropical non-climacteric fruit, remains largely unexplored. Here, we report a complete list of pineapple mature miRNAs obtained from high-throughput small RNA sequencing and precursor miRNAs (pre-miRNAs) obtained from ESTs. Two small RNA libraries were constructed from pineapple fruits and leaves, respectively, using Illumina's Solexa technology. Sequence similarity analysis using miRBase revealed 579,179 reads homologous to 153 miRNAs from 41 miRNA families. In addition, a pineapple fruit transcriptome library consisting of approximately 30,000 EST contigs constructed using Solexa sequencing was used for the discovery of pre-miRNAs. In all, four pre-miRNAs were identified (MIR156, MIR399, MIR444 and MIR2673). Furthermore, the same pineapple transcriptome was used to dissect the function of the miRNAs in pineapple by predicting their putative targets in conjunction with their regulatory networks. In total, 23 metabolic pathways were found to be regulated by miRNAs in pineapple. The use of high-throughput sequencing in pineapples to unveil the presence of miRNAs and their regulatory pathways provides insight into the repertoire of miRNA regulation used exclusively in this non-climacteric model plant. Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Increased IL-10 mRNA and IL-23 mRNA expression in multiple sclerosis: interferon-beta treatment increases IL-10 mRNA expression while reducing IL-23 mRNA expression

    DEFF Research Database (Denmark)

    Krakauer, M.; Sorensen, P.; Khademi, M.

    2008-01-01

    volunteers served to confirm initial findings. mRNA was analyzed by real-time reverse transcriptase polymerase chain reaction (PCR). RESULTS: We found elevated expression of interleukin (IL)-23 and IL-10 in untreated MS patients. IFN-beta therapy increased IL-10 and decreased IL-23 expression independently...... of the regulatory cytokine IL-10. The elevated IL-23 mRNA levels in MS patients are noteworthy in view of the newly discovered IL-23-driven Th17 T-cell subset, which is crucial in animal models of MS. Since IFN-beta therapy resulted in decreased IL-23 mRNA levels, the Th17 axis could be another target of IFN...

  7. Nucleolin Mediates MicroRNA-directed CSF-1 mRNA Deadenylation but Increases Translation of CSF-1 mRNA*

    Science.gov (United States)

    Woo, Ho-Hyung; Baker, Terri; Laszlo, Csaba; Chambers, Setsuko K.

    2013-01-01

    CSF-1 mRNA 3′UTR contains multiple unique motifs, including a common microRNA (miRNA) target in close proximity to a noncanonical G-quadruplex and AU-rich elements (AREs). Using a luciferase reporter system fused to CSF-1 mRNA 3′UTR, disruption of the miRNA target region, G-quadruplex, and AREs together dramatically increased reporter RNA levels, suggesting important roles for these cis-acting regulatory elements in the down-regulation of CSF-1 mRNA. We find that nucleolin, which binds both G-quadruplex and AREs, enhances deadenylation of CSF-1 mRNA, promoting CSF-1 mRNA decay, while having the capacity to increase translation of CSF-1 mRNA. Through interaction with the CSF-1 3′UTR miRNA common target, we find that miR-130a and miR-301a inhibit CSF-1 expression by enhancing mRNA decay. Silencing of nucleolin prevents the miRNA-directed mRNA decay, indicating a requirement for nucleolin in miRNA activity on CSF-1 mRNA. Downstream effects followed by miR-130a and miR-301a inhibition of directed cellular motility of ovarian cancer cells were found to be dependent on nucleolin. The paradoxical effects of nucleolin on miRNA-directed CSF-1 mRNA deadenylation and on translational activation were explored further. The nucleolin protein contains four acidic stretches, four RNA recognition motifs (RRMs), and nine RGG repeats. All three domains in nucleolin regulate CSF-1 mRNA and protein levels. RRMs increase CSF-1 mRNA, whereas the acidic and RGG domains decrease CSF-1 protein levels. This suggests that nucleolin has the capacity to differentially regulate both CSF-1 RNA and protein levels. Our finding that nucleolin interacts with Ago2 indirectly via RNA and with poly(A)-binding protein C (PABPC) directly suggests a nucleolin-Ago2-PABPC complex formation on mRNA. This complex is in keeping with our suggestion that nucleolin may work with PABPC as a double-edged sword on both mRNA deadenylation and translational activation. Our findings underscore the complexity of

  8. First comparative characterization of three distinct ferritin subunits from a teleost: Evidence for immune-responsive mRNA expression and iron depriving activity of seahorse (Hippocampus abdominalis) ferritins.

    Science.gov (United States)

    Oh, Minyoung; Umasuthan, Navaneethaiyer; Elvitigala, Don Anushka Sandaruwan; Wan, Qiang; Jo, Eunyoung; Ko, Jiyeon; Noh, Gyeong Eon; Shin, Sangok; Rho, Sum; Lee, Jehee

    2016-02-01

    Ferritins play an indispensable role in iron homeostasis through their iron-withholding function in living beings. In the current study, cDNA sequences of three distinct ferritin subunits, including a ferritin H, a ferritin M, and a ferritin L, were identified from big belly seahorse, Hippocampus abdominalis, and molecularly characterized. Complete coding sequences (CDS) of seahorse ferritin H (HaFerH), ferritin M (HaFerM), and ferritin L (HaFerL) subunits were comprised of 531, 528, and 522 base pairs (bp), respectively, which encode polypeptides of 177, 176, and 174 amino acids, respectively, with molecular masses of ∼20-21 kDa. Our in silico analyses demonstrate that these three ferritin subunits exhibit the typical characteristics of ferritin superfamily members including iron regulatory elements, domain signatures, and reactive centers. The coding sequences of HaFerH, M, and L were cloned and the corresponding proteins were overexpressed in a bacterial system. Recombinantly expressed HaFer proteins demonstrated detectable in vivo iron sequestrating (ferroxidase) activity, consistent with their putative iron binding capability. Quantification of the basal expression of these three HaFer sequences in selected tissues demonstrated a gene-specific ubiquitous spatial distribution pattern, with abundance of mRNA in HaFerM in the liver and predominant expression of HaFerH and HaFerL in blood. Interestingly, the basal expression of all three ferritin genes was found to be significantly modulated against pathogenic stress mounted by lipopolysaccharides (LPS), poly I:C, Streptococcus iniae, and Edwardsiella tarda. Collectively, our findings suggest that the three HaFer subunits may be involved in iron (II) homeostasis in big belly seahorse and that they are important in its host defense mechanisms. Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Eukaryotic initiation factor 3 (eIF3) and 5’ mRNA leader sequences as agents of translational regulation in Arabidopsis. Final report

    Energy Technology Data Exchange (ETDEWEB)

    von Arnim, Albrecht G. [Univ. of Tennessee, Knoxville, TN (United States)

    2015-02-04

    Protein synthesis, or translation, consumes a sizable fraction of the cell’s energy budget, estimated at 5% and up to 50% in differentiated and growing cells, respectively. Plants also invest significant energy and biomass to construct and maintain the translation apparatus. Translation is regulated by a variety of external stimuli. Compared to transcriptional control, attributes of translational control include reduced sensitivity to stochastic fluctuation, a finer gauge of control, and more rapid responsiveness to environmental stimuli. Yet, our murky understanding of translational control allows few generalizations. Consequently, translational regulation is underutilized in the context of transgene regulation, although synthetic biologists are now beginning to appropriate RNA-level gene regulation into their regulatory circuits. We also know little about how translational control contributes to the diversity of plant form and function. This project explored how an emerging regulatory mRNA sequence element, upstream open reading frames (uORFs), is integrated with the general translation initiation machinery to permit translational regulation on specific mRNAs.

  10. Functional interrogation of non-coding DNA through CRISPR genome editing.

    Science.gov (United States)

    Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

    2017-05-15

    Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Molecular characterization of long direct repeat (LDR) sequences expressing a stable mRNA encoding for a 35-amino-acid cell-killing peptide and a cis-encoded small antisense RNA in Escherichia coli.

    Science.gov (United States)

    Kawano, Mitsuoki; Oshima, Taku; Kasai, Hiroaki; Mori, Hirotada

    2002-07-01

    Genome sequence analyses of Escherichia coli K-12 revealed four copies of long repetitive elements. These sequences are designated as long direct repeat (LDR) sequences. Three of the repeats (LDR-A, -B, -C), each approximately 500 bp in length, are located as tandem repeats at 27.4 min on the genetic map. Another copy (LDR-D), 450 bp in length and nearly identical to LDR-A, -B and -C, is located at 79.7 min, a position that is directly opposite the position of LDR-A, -B and -C. In this study, we demonstrate that LDR-D encodes a 35-amino-acid peptide, LdrD, the overexpression of which causes rapid cell killing and nucleoid condensation of the host cell. Northern blot and primer extension analysis showed constitutive transcription of a stable mRNA (approximately 370 nucleotides) encoding LdrD and an unstable cis-encoded antisense RNA (approximately 60 nucleotides), which functions as a trans-acting regulator of ldrD translation. We propose that LDR encodes a toxin-antitoxin module. LDR-homologous sequences are not pre-sent on any known plasmids but are conserved in Salmonella and other enterobacterial species.

  12. SINEUPs are modular antisense long-non coding RNAs that increase synthesis of target proteins in cells

    Directory of Open Access Journals (Sweden)

    Silvia eZucchelli

    2015-05-01

    Full Text Available Despite recent efforts in discovering novel long non-coding RNAs (lncRNAs and unveiling their functions in a wide range of biological processes their applications as biotechnological or therapeutic tools are still at their infancy. We have recently shown that AS Uchl1, a natural lncRNA antisense to the Parkinson’s disease-associated gene Ubiquitin carboxyl-terminal esterase L1 (Uchl1, is able to increase UchL1 protein synthesis at post-transcriptional level. Its activity requires two RNA elements: an embedded inverted SINEB2 sequence to increase translation and the overlapping region to target its sense mRNA. This functional organization is shared with several mouse lncRNAs antisense to protein coding genes. The potential use of AS Uchl1-derived lncRNAs as enhancers of target mRNA translation remains unexplored. Here we define AS Uchl1 as the representative member of a new functional class of natural and synthetic antisense lncRNAs that activate translation. We named this class of RNAs SINEUPs for their requirement of the inverted SINEB2 sequence to UP-regulate translation in a gene-specific manner. The overlapping region is indicated as the Binding Doman (BD while the embedded inverted SINEB2 element is the Effector Domain (ED. By swapping BD, synthetic SINEUPs are designed targeting mRNAs of interest. SINEUPs function in an array of cell lines and can be efficiently directed towards N-terminally tagged proteins. Their biological activity is retained in a miniaturized version within the range of small RNAs length. Its modular structure was exploited to successfully design synthetic SINEUPs targeting endogenous Parkinson’s disease-associated DJ-1 and proved to be active in different neuronal cell lines.In summary, SINEUPs represent the first scalable tool to increase synthesis of proteins of interest. We propose SINEUPs as reagents for molecular biology experiments, in protein manufacturing as well as in therapy of haploinsufficiencies.

  13. Metal resistance sequences and transgenic plants

    Science.gov (United States)

    Meagher, Richard Brian; Summers, Anne O.; Rugh, Clayton L.

    1999-10-12

    The present invention provides nucleic acid sequences encoding a metal ion resistance protein, which are expressible in plant cells. The metal resistance protein provides for the enzymatic reduction of metal ions including but not limited to divalent Cu, divalent mercury, trivalent gold, divalent cadmium, lead ions and monovalent silver ions. Transgenic plants which express these coding sequences exhibit increased resistance to metal ions in the environment as compared with plants which have not been so genetically modified. Transgenic plants with improved resistance to organometals including alkylmercury compounds, among others, are provided by the further inclusion of plant-expressible organometal lyase coding sequences, as specifically exemplified by the plant-expressible merB coding sequence. Furthermore, these transgenic plants which have been genetically modified to express the metal resistance coding sequences of the present invention can participate in the bioremediation of metal contamination via the enzymatic reduction of metal ions. Transgenic plants resistant to organometals can further mediate remediation of organic metal compounds, for example, alkylmetal compounds including but not limited to methyl mercury, methyl lead compounds, methyl cadmium and methyl arsenic compounds, in the environment by causing the freeing of mercuric or other metal ions and the reduction of the ionic mercury or other metal ions to the less toxic elemental mercury or other metals.

  14. SR proteins are NXF1 adaptors that link alternative RNA processing to mRNA export.

    Science.gov (United States)

    Müller-McNicoll, Michaela; Botti, Valentina; de Jesus Domingues, Antonio M; Brandl, Holger; Schwich, Oliver D; Steiner, Michaela C; Curk, Tomaz; Poser, Ina; Zarnack, Kathi; Neugebauer, Karla M

    2016-03-01

    Nuclear export factor 1 (NXF1) exports mRNA to the cytoplasm after recruitment to mRNA by specific adaptor proteins. How and why cells use numerous different export adaptors is poorly understood. Here we critically evaluate members of the SR protein family (SRSF1-7) for their potential to act as NXF1 adaptors that couple pre-mRNA processing to mRNA export. Consistent with this proposal, >1000 endogenous mRNAs required individual SR proteins for nuclear export in vivo. To address the mechanism, transcriptome-wide RNA-binding profiles of NXF1 and SRSF1-7 were determined in parallel by individual-nucleotide-resolution UV cross-linking and immunoprecipitation (iCLIP). Quantitative comparisons of RNA-binding sites showed that NXF1 and SR proteins bind mRNA targets at adjacent sites, indicative of cobinding. SRSF3 emerged as the most potent NXF1 adaptor, conferring sequence specificity to RNA binding by NXF1 in last exons. Interestingly, SRSF3 and SRSF7 were shown to bind different sites in last exons and regulate 3' untranslated region length in an opposing manner. Both SRSF3 and SRSF7 promoted NXF1 recruitment to mRNA. Thus, SRSF3 and SRSF7 couple alternative splicing and polyadenylation to NXF1-mediated mRNA export, thereby controlling the cytoplasmic abundance of transcripts with alternative 3' ends. © 2016 Müller-McNicoll et al.; Published by Cold Spring Harbor Laboratory Press.

  15. Biased Gene Conversion and GC-Content Evolution in the Coding Sequences of Reptiles and Vertebrates

    Science.gov (United States)

    Figuet, Emeric; Ballenghien, Marion; Romiguier, Jonathan; Galtier, Nicolas

    2015-01-01

    Mammalian and avian genomes are characterized by a substantial spatial heterogeneity of GC-content, which is often interpreted as reflecting the effect of local GC-biased gene conversion (gBGC), a meiotic repair bias that favors G and C over A and T alleles in high-recombining genomic regions. Surprisingly, the first fully sequenced nonavian sauropsid (i.e., reptile), the green anole Anolis carolinensis, revealed a highly homogeneous genomic GC-content landscape, suggesting the possibility that gBGC might not be at work in this lineage. Here, we analyze GC-content evolution at third-codon positions (GC3) in 44 vertebrates species, including eight newly sequenced transcriptomes, with a specific focus on nonavian sauropsids. We report that reptiles, including the green anole, have a genome-wide distribution of GC3 similar to that of mammals and birds, and we infer a strong GC3-heterogeneity to be already present in the tetrapod ancestor. We further show that the dynamic of coding sequence GC-content is largely governed by karyotypic features in vertebrates, notably in the green anole, in agreement with the gBGC hypothesis. The discrepancy between third-codon positions and noncoding DNA regarding GC-content dynamics in the green anole could not be explained by the activity of transposable elements or selection on codon usage. This analysis highlights the unique value of third-codon positions as an insertion/deletion-free marker of nucleotide substitution biases that ultimately affect the evolution of proteins. PMID:25527834

  16. De novo ORFs in Drosophila are important to organismal fitness and evolved rapidly from previously non-coding sequences.

    Directory of Open Access Journals (Sweden)

    Josephine A Reinhardt

    Full Text Available How non-coding DNA gives rise to new protein-coding genes (de novo genes is not well understood. Recent work has revealed the origins and functions of a few de novo genes, but common principles governing the evolution or biological roles of these genes are unknown. To better define these principles, we performed a parallel analysis of the evolution and function of six putatively protein-coding de novo genes described in Drosophila melanogaster. Reconstruction of the transcriptional history of de novo genes shows that two de novo genes emerged from novel long non-coding RNAs that arose at least 5 MY prior to evolution of an open reading frame. In contrast, four other de novo genes evolved a translated open reading frame and transcription within the same evolutionary interval suggesting that nascent open reading frames (proto-ORFs, while not required, can contribute to the emergence of a new de novo gene. However, none of the genes arose from proto-ORFs that existed long before expression evolved. Sequence and structural evolution of de novo genes was rapid compared to nearby genes and the structural complexity of de novo genes steadily increases over evolutionary time. Despite the fact that these genes are transcribed at a higher level in males than females, and are most strongly expressed in testes, RNAi experiments show that most of these genes are essential in both sexes during metamorphosis. This lethality suggests that protein coding de novo genes in Drosophila quickly become functionally important.

  17. Gold nanoparticle-based beacon to detect STAT5b mRNA expression in living cells: a case optimized by bioinformatics screen

    Directory of Open Access Journals (Sweden)

    Deng D

    2015-04-01

    Full Text Available Dawei Deng,* Yang Li,* Jianpeng Xue, Jie Wang, Guanhua Ai, Xin Li, Yueqing GuDepartment of Biomedical Engineering, China Pharmaceutical University, Nanjing, People’s Republic of China*These authors contributed equally to this workAbstract: Messenger RNA (mRNA, a single-strand ribonucleic acid with functional gene information is usually abnormally expressed in cancer cells and has become a promising biomarker for the study of tumor progress. Hairpin DNA-coated gold nanoparticle (hDAuNP beacon containing a bare gold nanoparticle (AuNP as fluorescence quencher and thiol-terminated fluorescently labeled stem–loop–stem oligonucleotide sequences attached by Au–S bond is currently a new nanoscale biodiagnostic platform capable of mRNA detection, in which the design of the loop region sequence is crucial for hybridizing with the target mRNA. Hence, in this study, to improve the sensitivity and selectivity of hDAuNP beacon simultaneously, the loop region of hairpin DNA was screened by bioinformatics strategy. Here, signal transducer and activator of transcription 5b (STAT5b mRNA was selected and used as a practical example. The results from the combined characterizations using optical techniques, flow cytometry assay, and cell microscopic imaging showed that after optimization, the as-prepared hDAuNP beacon had higher selectivity and sensitivity for the detection of STAT5b mRNA in living cells, as compared with our previous beacon. Thus, the bioinformatics method may be a promising new strategy for assisting in the designing of the hDAuNP beacon, extending its application in the detection of mRNA expression and the resultant mRNA-based biological processes and disease pathogenesis.Keywords: molecular beacon, bioinformatics, gold nanoparticle, STAT5b mRNA, visual detection

  18. High-Throughput Mapping of Single-Neuron Projections by Sequencing of Barcoded RNA.

    Science.gov (United States)

    Kebschull, Justus M; Garcia da Silva, Pedro; Reid, Ashlan P; Peikon, Ian D; Albeanu, Dinu F; Zador, Anthony M

    2016-09-07

    Neurons transmit information to distant brain regions via long-range axonal projections. In the mouse, area-to-area connections have only been systematically mapped using bulk labeling techniques, which obscure the diverse projections of intermingled single neurons. Here we describe MAPseq (Multiplexed Analysis of Projections by Sequencing), a technique that can map the projections of thousands or even millions of single neurons by labeling large sets of neurons with random RNA sequences ("barcodes"). Axons are filled with barcode mRNA, each putative projection area is dissected, and the barcode mRNA is extracted and sequenced. Applying MAPseq to the locus coeruleus (LC), we find that individual LC neurons have preferred cortical targets. By recasting neuroanatomy, which is traditionally viewed as a problem of microscopy, as a problem of sequencing, MAPseq harnesses advances in sequencing technology to permit high-throughput interrogation of brain circuits. Copyright © 2016 Elsevier Inc. All rights reserved.

  19. Novel sequence variations in LAMA2 and SGCG genes modulating cis-acting regulatory elements and RNA secondary structure

    Directory of Open Access Journals (Sweden)

    Olfa Siala

    2010-01-01

    Full Text Available In this study, we detected new sequence variations in LAMA2 and SGCG genes in 5 ethnic populations, and analysed their effect on enhancer composition and mRNA structure. PCR amplification and DNA sequencing were performed and followed by bioinformatics analyses using ESEfinder as well as MFOLD software. We found 3 novel sequence variations in the LAMA2 (c.3174+22_23insAT and c.6085 +12delA and SGCG (c.*102A/C genes. These variations were present in 210 tested healthy controls from Tunisian, Moroccan, Algerian, Lebanese and French populations suggesting that they represent novel polymorphisms within LAMA2 and SGCG genes sequences. ESEfinder showed that the c.*102A/C substitution created a new exon splicing enhancer in the 3'UTR of SGCG genes, whereas the c.6085 +12delA deletion was situated in the base pairing region between LAMA2 mRNA and the U1snRNA spliceosomal components. The RNA structure analyses showed that both variations modulated RNA secondary structure. Our results are suggestive of correlations between mRNA folding and the recruitment of spliceosomal components mediating splicing, including SR proteins. The contribution of common sequence variations to mRNA structural and functional diversity will contribute to a better study of gene expression.

  20. Organ-Specific and Age-Dependent Expression of Insulin-like Growth Factor-I (IGF-I) mRNA Variants: IGF-IA and IB mRNAs in the Mouse

    OpenAIRE

    Ohtsuki, Takashi; Otsuki, Mariko; Murakami, Yousuke; Maekawa, Tetsuya; Yamamoto, Takashi; Akasaka, Koji; Takeuchi, Sakae; Takahashi, Sumio

    2005-01-01

    Insulin-like growth factor-I (IGF-I) gene generates several IGF-I mRNA variants by alternative splicing. Two promoters are present in mouse IGF-I gene. Each promoter encodes two IGF-I mRNA variants (IGF-IA and IGF-IB mRNAs). Variants differ by the presence (IGF-IB) or absence (IGF-IA) of a 52-bp insert in the E domain-coding region. Functional differences among IGF-I mRNAs, and regulatory mechanisms for alternative splicing of IGF-I mRNA are not yet known. We analyzed the expression of mouse ...

  1. No significant regulation of bicoid mRNA by Pumilio or Nanos in the early Drosophila embryo.

    Science.gov (United States)

    Wharton, Tammy H; Nomie, Krystle J; Wharton, Robin P

    2018-01-01

    Drosophila Pumilio (Pum) is a founding member of the conserved Puf domain class of RNA-binding translational regulators. Pum binds with high specificity, contacting eight nucleotides, one with each of the repeats in its RNA-binding domain. In general, Pum is thought to block translation in collaboration with Nanos (Nos), which exhibits no binding specificity in isolation but is recruited jointly to regulatory sequences containing a Pum binding site in the 3'-UTRs of target mRNAs. Unlike Pum, which is ubiquitous in the early embryo, Nos is tightly restricted to the posterior, ensuring that repression of its best-characterized target, maternal hunchback (hb) mRNA, takes place exclusively in the posterior. An exceptional case of Nos-independent regulation by Pum has been described-repression of maternal bicoid (bcd) mRNA at the anterior pole of the early embryo, dependent on both Pum and conserved Pum binding sites in the 3'-UTR of the mRNA. We have re-investigated regulation of bcd in the early embryo; our experiments reveal no evidence of a role for Pum or its conserved binding sites in regulation of the perdurance of bcd mRNA or protein. Instead, we find that Pum and Nos control the accumulation of bcd mRNA in testes.

  2. Statistical properties and fractals of nucleotide clusters in DNA sequences

    International Nuclear Information System (INIS)

    Sun Tingting; Zhang Linxi; Chen Jin; Jiang Zhouting

    2004-01-01

    Statistical properties of nucleotide clusters in DNA sequences and their fractals are investigated in this paper. The average size of nucleotide clusters in non-coding sequence is larger than that in coding sequence. We investigate the cluster-size distribution P(S) for human chromosomes 21 and 22, and the results are different from previous works. The cluster-size distribution P(S 1 +S 2 ) with the total size of sequential Pu-cluster and Py-cluster S 1 +S 2 is studied. We observe that P(S 1 +S 2 ) follows an exponential decay both in coding and non-coding sequences. However, we get different results for human chromosomes 21 and 22. The probability distribution P(S 1 ,S 2 ) of nucleotide clusters with the size of sequential Pu-cluster and Py-cluster S 1 and S 2 respectively, is also examined. In the meantime, some of the linear correlations are obtained in the double logarithmic plots of the fluctuation F(l) versus nucleotide cluster distance l along the DNA chain. The power spectrums of nucleotide clusters are also discussed, and it is concluded that the curves are flat and hardly changed and the 1/3 frequency is neither observed in coding sequence nor in non-coding sequence. These investigations can provide some insights into the nucleotide clusters of DNA sequences

  3. VDR mRNA overexpression is associated with worse prognostic factors in papillary thyroid carcinoma

    Directory of Open Access Journals (Sweden)

    June Young Choi

    2017-03-01

    Full Text Available The purpose of this study was to assess the relationship between vitamin D receptor gene (VDR expression and prognostic factors in papillary thyroid cancer (PTC. mRNA sequencing and somatic mutation data from The Cancer Genome Atlas (TCGA were analyzed. VDR mRNA expression was compared to clinicopathologic variables by linear regression. Tree-based classification was applied to find cutoff and patients were split into low and high VDR group. Logistic regression, Kaplan–Meier analysis, differentially expressed gene (DEG test and pathway analysis were performed to assess the differences between two VDR groups. VDR mRNA expression was elevated in PTC than that in normal thyroid tissue. VDR expressions were high in classic and tall-cell variant PTC and lateral neck node metastasis was present. High VDR group was also associated with classic and tall cell subtype, AJCC stage IV and lower recurrence-free survival. DEG test reveals that 545 genes were upregulated in high VDR group. Thyroid cancer-related pathways were enriched in high VDR group in pathway analyses. VDR mRNA overexpression was correlated with worse prognostic factors such as subtypes of papillary thyroid carcinoma that are known to be worse prognosis, lateral neck node metastasis, advanced stage and recurrence-free survival.

  4. Capsid coding sequences of foot-and-mouth disease viruses are determinants of pathogenicity in pigs

    DEFF Research Database (Denmark)

    Lohse, Louise; Jackson, Terry; Bøtner, Anette

    2012-01-01

    The surface exposed capsid proteins, VP1, VP2 and VP3, of foot-and-mouth disease virus (FMDV) determine its antigenicity and the ability of the virus to interact with host-cell receptors. Hence, modification of these structural proteins may alter the properties of the virus. In the present study we...... compared the pathogenicity of different FMDVs in young pigs. In total 32 pigs, 7-weeks-old, were exposed to virus, either by direct inoculation or through contact with inoculated pigs, using cell culture adapted (O1K B64), chimeric (O1K/A-TUR and O1K/O-UKG) or field strain (O-UKG/34/2001) viruses. The O1K...... coding sequences are determinants of FMDV pathogenicity in pigs....

  5. A two-locus global DNA barcode for land plants: the coding rbcL gene complements the non-coding trnH-psbA spacer region.

    Science.gov (United States)

    Kress, W John; Erickson, David L

    2007-06-06

    A useful DNA barcode requires sufficient sequence variation to distinguish between species and ease of application across a broad range of taxa. Discovery of a DNA barcode for land plants has been limited by intrinsically lower rates of sequence evolution in plant genomes than that observed in animals. This low rate has complicated the trade-off in finding a locus that is universal and readily sequenced and has sufficiently high sequence divergence at the species-level. Here, a global plant DNA barcode system is evaluated by comparing universal application and degree of sequence divergence for nine putative barcode loci, including coding and non-coding regions, singly and in pairs across a phylogenetically diverse set of 48 genera (two species per genus). No single locus could discriminate among species in a pair in more than 79% of genera, whereas discrimination increased to nearly 88% when the non-coding trnH-psbA spacer was paired with one of three coding loci, including rbcL. In silico trials were conducted in which DNA sequences from GenBank were used to further evaluate the discriminatory power of a subset of these loci. These trials supported the earlier observation that trnH-psbA coupled with rbcL can correctly identify and discriminate among related species. A combination of the non-coding trnH-psbA spacer region and a portion of the coding rbcL gene is recommended as a two-locus global land plant barcode that provides the necessary universality and species discrimination.

  6. Filter paper collection of Plasmodium falciparum mRNA for detecting low-density gametocytes

    Directory of Open Access Journals (Sweden)

    Jones Sophie

    2012-08-01

    Full Text Available Abstract Background Accurate sampling of sub-microscopic gametocytes is necessary for epidemiological studies to identify the infectious reservoir of Plasmodium falciparum. Detection of gametocyte mRNA achieves sensitive detection, but requires careful handling of samples. Filter papers can be used for collecting RNA samples, but rigorous testing of their capacity to withstand adverse storage conditions has not been fully explored. Methods Three gametocyte dilutions: 10/μL, 1.0/μL and 0.1/μL were spotted onto Whatman™ 903 Protein Saver Cards, FTA Classic Cards and 3MM filter papers that were stored under frozen, cold chain or tropical conditions for up to 13 weeks . RNA was extracted, then detected by quantitative nucleic acid sequence-based amplification (QT-NASBA and reverse-transcriptase PCR (RT-PCR. Results Successful gametocyte detection was more frequently observed from the Whatman 903 Protein Saver Card compared to the Whatman FTA Classic Card, by both techniques (p  Conclusions This study indicates the Whatman 903 Protein Saver Card is better for Pfs25 mRNA sampling compared to the Whatman FTA Classic Card, and that the Whatman 3MM filter paper may prove to be a satisfactory cheaper option for Pfs25 mRNA sampling. When appropriately dried, filter papers provide a useful approach to Pfs25 mRNA sampling, especially in settings where storage in RNA-protecting buffer is not possible.

  7. Presence of albumin mRNA precursors in nuclei of analbuminemic rat liver lacking cytoplasmic albumin mRNA.

    OpenAIRE

    Esumi, H; Takahashi, Y; Sekiya, T; Sato, S; Nagase, S; Sugimura, T

    1982-01-01

    Analbuminemic rats, which lack serum albumin, were previously found to have no albumin mRNA in the cytoplasm of the liver. In the present study, the existence of nuclear albumin mRNA precursors in the liver of analbuminemic rats was examined by RNA X cDNA hybridization kinetics. Albumin mRNA precursors were present in the nuclei of analbuminemic rat liver at almost normal levels, despite the absence of albumin mRNA from the cytoplasm. Nuclear RNA of analbuminemic rat liver was subjected to el...

  8. Correlation of mRNA Profiles, miRNA Profiles, and Functional Immune Response in Rainbow Trout (Oncorrhynkus Mykiss) Infected With Viral Hemorrhagic Septicemia Virus (VHSV) and in Fish Vaccinated With a DNA Vaccine Against VHSV

    DEFF Research Database (Denmark)

    Bela-Ong, Dennis; Schyth, Brian Dall; Jørgensen, Hanne

    2011-01-01

    and are incorporated into the RNA-Induced Silencing Complex (RISC), which target specific mRNA sequences, causing either mRNA degradation or translation repression. This results in altered mRNA and protein profiles characteristic of a particular cellular phenotype or physiological state. By targeting immune relevant m...

  9. Phylogenomic Resolution of the Phylogeny of Laurasiatherian Mammals: Exploring Phylogenetic Signals within Coding and Noncoding Sequences.

    Science.gov (United States)

    Chen, Meng-Yun; Liang, Dan; Zhang, Peng

    2017-08-01

    The interordinal relationships of Laurasiatherian mammals are currently one of the most controversial questions in mammalian phylogenetics. Previous studies mainly relied on coding sequences (CDS) and seldom used noncoding sequences. Here, by data mining public genome data, we compiled an intron data set of 3,638 genes (all introns from a protein-coding gene are considered as a gene) (19,055,073 bp) and a CDS data set of 10,259 genes (20,994,285 bp), covering all major lineages of Laurasiatheria (except Pholidota). We found that the intron data contained stronger and more congruent phylogenetic signals than the CDS data. In agreement with this observation, concatenation and species-tree analyses of the intron data set yielded well-resolved and identical phylogenies, whereas the CDS data set produced weakly supported and incongruent results. Further analyses showed that the phylogeny inferred from the intron data is highly robust to data subsampling and change in outgroup, but the CDS data produced unstable results under the same conditions. Interestingly, gene tree statistical results showed that the most frequently observed gene tree topologies for the CDS and intron data are identical, suggesting that the major phylogenetic signal within the CDS data is actually congruent with that within the intron data. Our final result of Laurasiatheria phylogeny is (Eulipotyphla,((Chiroptera, Perissodactyla),(Carnivora, Cetartiodactyla))), favoring a close relationship between Chiroptera and Perissodactyla. Our study 1) provides a well-supported phylogenetic framework for Laurasiatheria, representing a step towards ending the long-standing "hard" polytomy and 2) argues that intron within genome data is a promising data resource for resolving rapid radiation events across the tree of life. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  10. Dis3- and exosome subunit-responsive 3′ mRNA instability elements

    International Nuclear Information System (INIS)

    Kiss, Daniel L.; Hou, Dezhi; Gross, Robert H.; Andrulis, Erik D.

    2012-01-01

    Highlights: ► Successful use of a novel RNA-specific bioinformatic tool, RNA SCOPE. ► Identified novel 3′ UTR cis-acting element that destabilizes a reporter mRNA. ► Show exosome subunits are required for cis-acting element-mediated mRNA instability. ► Define precise sequence requirements of novel cis-acting element. ► Show that microarray-defined exosome subunit-regulated mRNAs have novel element. -- Abstract: Eukaryotic RNA turnover is regulated in part by the exosome, a nuclear and cytoplasmic complex of ribonucleases (RNases) and RNA-binding proteins. The major RNase of the complex is thought to be Dis3, a multi-functional 3′–5′ exoribonuclease and endoribonuclease. Although it is known that Dis3 and core exosome subunits are recruited to transcriptionally active genes and to messenger RNA (mRNA) substrates, this recruitment is thought to occur indirectly. We sought to discover cis-acting elements that recruit Dis3 or other exosome subunits. Using a bioinformatic tool called RNA SCOPE to screen the 3′ untranslated regions of up-regulated transcripts from our published Dis3 depletion-derived transcriptomic data set, we identified several motifs as candidate instability elements. Secondary screening using a luciferase reporter system revealed that one cassette—harboring four elements—destabilized the reporter transcript. RNAi-based depletion of Dis3, Rrp6, Rrp4, Rrp40, or Rrp46 diminished the efficacy of cassette-mediated destabilization. Truncation analysis of the cassette showed that two exosome subunit-sensitive elements (ESSEs) destabilized the reporter. Point-directed mutagenesis of ESSE abrogated the destabilization effect. An examination of the transcriptomic data from exosome subunit depletion-based microarrays revealed that mRNAs with ESSEs are found in every up-regulated mRNA data set but are underrepresented or missing from the down-regulated data sets. Taken together, our findings imply a potentially novel mechanism of mRNA

  11. Survey of the transcriptome of Aspergillus oryzae via massively parallel mRNA sequencing

    OpenAIRE

    Wang, Bin; Guo, Guangwu; Wang, Chao; Lin, Ying; Wang, Xiaoning; Zhao, Mouming; Guo, Yong; He, Minghui; Zhang, Yong; Pan, Li

    2010-01-01

    Aspergillus oryzae, an important filamentous fungus used in food fermentation and the enzyme industry, has been shown through genome sequencing and various other tools to have prominent features in its genomic composition. However, the functional complexity of the A. oryzae transcriptome has not yet been fully elucidated. Here, we applied direct high-throughput paired-end RNA-sequencing (RNA-Seq) to the transcriptome of A. oryzae under four different culture conditions. With the high resoluti...

  12. Report number codes

    Energy Technology Data Exchange (ETDEWEB)

    Nelson, R.N. (ed.)

    1985-05-01

    This publication lists all report number codes processed by the Office of Scientific and Technical Information. The report codes are substantially based on the American National Standards Institute, Standard Technical Report Number (STRN)-Format and Creation Z39.23-1983. The Standard Technical Report Number (STRN) provides one of the primary methods of identifying a specific technical report. The STRN consists of two parts: The report code and the sequential number. The report code identifies the issuing organization, a specific program, or a type of document. The sequential number, which is assigned in sequence by each report issuing entity, is not included in this publication. Part I of this compilation is alphabetized by report codes followed by issuing installations. Part II lists the issuing organization followed by the assigned report code(s). In both Parts I and II, the names of issuing organizations appear for the most part in the form used at the time the reports were issued. However, for some of the more prolific installations which have had name changes, all entries have been merged under the current name.

  13. Report number codes

    International Nuclear Information System (INIS)

    Nelson, R.N.

    1985-05-01

    This publication lists all report number codes processed by the Office of Scientific and Technical Information. The report codes are substantially based on the American National Standards Institute, Standard Technical Report Number (STRN)-Format and Creation Z39.23-1983. The Standard Technical Report Number (STRN) provides one of the primary methods of identifying a specific technical report. The STRN consists of two parts: The report code and the sequential number. The report code identifies the issuing organization, a specific program, or a type of document. The sequential number, which is assigned in sequence by each report issuing entity, is not included in this publication. Part I of this compilation is alphabetized by report codes followed by issuing installations. Part II lists the issuing organization followed by the assigned report code(s). In both Parts I and II, the names of issuing organizations appear for the most part in the form used at the time the reports were issued. However, for some of the more prolific installations which have had name changes, all entries have been merged under the current name

  14. mRNA fragments in in vitro culture media are associated with bovine preimplantation embryonic development.

    Science.gov (United States)

    Kropp, Jenna; Khatib, Hasan

    2015-01-01

    In vitro production (IVP) systems have been used to bypass problems of fertilization and early embryonic development. However, embryos produced by IVP are commonly selected for implantation based on morphological assessment, which is not a strong indicator of establishment and maintenance of pregnancy. Thus, there is a need to identify additional indicators of embryonic developmental potential. Previous studies have identified microRNA expression in in vitro culture media to be indicative of embryo quality in both bovine and human embryos. Like microRNAs, mRNAs have been shown to be secreted from cells into the extracellular environment, but it is unknown whether or not these RNAs are secreted by embryos. Thus, the objective of the present study was to determine whether mRNAs are secreted into in vitro culture media and if their expression in the media is indicative of embryo quality. In vitro culture medium was generated and collected from both blastocyst and degenerate (those which fail to develop from the morula to blastocyst stage) embryos. Small-RNA sequencing revealed that many mRNA fragments were present in the culture media. A total of 17 mRNA fragments were differentially expressed between blastocyst and degenerate conditioned media. Differential expression was confirmed by quantitative real-time PCR for fragments of mRNA POSTN and VSNL-1, in four additional biological replicates of media. To better understand the mechanisms of mRNA secretion into the media, the expression of a predicted RNA binding protein of POSTN, PUM2, was knocked down using an antisense oligonucleotide gapmer. Supplementation of a PUM2 gapmer significantly reduced blastocyst development and decreased secretion of POSTN mRNA into the media. Overall, differential mRNA expression in the media was repeatable and sets the framework for future study of mRNA biomarkers in in vitro culture media to improve predictability of reproductive performance.

  15. Comparative genomics beyond sequence-based alignments

    DEFF Research Database (Denmark)

    Þórarinsson, Elfar; Yao, Zizhen; Wiklund, Eric D.

    2008-01-01

    Recent computational scans for non-coding RNAs (ncRNAs) in multiple organisms have relied on existing multiple sequence alignments. However, as sequence similarity drops, a key signal of RNA structure--frequent compensating base changes--is increasingly likely to cause sequence-based alignment me...

  16. Generic detection of poleroviruses using an RT-PCR assay targeting the RdRp coding sequence.

    Science.gov (United States)

    Lotos, Leonidas; Efthimiou, Konstantinos; Maliogka, Varvara I; Katis, Nikolaos I

    2014-03-01

    In this study a two-step RT-PCR assay was developed for the generic detection of poleroviruses. The RdRp coding region was selected as the primers' target, since it differs significantly from that of other members in the family Luteoviridae and its sequence can be more informative than other regions in the viral genome. Species specific RT-PCR assays targeting the same region were also developed for the detection of the six most widespread poleroviral species (Beet mild yellowing virus, Beet western yellows virus, Cucurbit aphid-borne virus, Carrot red leaf virus, Potato leafroll virus and Turnip yellows virus) in Greece and the collection of isolates. These isolates along with other characterized ones were used for the evaluation of the generic PCR's detection range. The developed assay efficiently amplified a 593bp RdRp fragment from 46 isolates of 10 different Polerovirus species. Phylogenetic analysis using the generic PCR's amplicon sequence showed that although it cannot accurately infer evolutionary relationships within the genus it can differentiate poleroviruses at the species level. Overall, the described generic assay could be applied for the reliable detection of Polerovirus infections and, in combination with the specific PCRs, for the identification of new and uncharacterized species in the genus. Copyright © 2013 Elsevier B.V. All rights reserved.

  17. Large scale comparative codon-pair context analysis unveils general rules that fine-tune evolution of mRNA primary structure.

    Directory of Open Access Journals (Sweden)

    Gabriela Moura

    Full Text Available BACKGROUND: Codon usage and codon-pair context are important gene primary structure features that influence mRNA decoding fidelity. In order to identify general rules that shape codon-pair context and minimize mRNA decoding error, we have carried out a large scale comparative codon-pair context analysis of 119 fully sequenced genomes. METHODOLOGIES/PRINCIPAL FINDINGS: We have developed mathematical and software tools for large scale comparative codon-pair context analysis. These methodologies unveiled general and species specific codon-pair context rules that govern evolution of mRNAs in the 3 domains of life. We show that evolution of bacterial and archeal mRNA primary structure is mainly dependent on constraints imposed by the translational machinery, while in eukaryotes DNA methylation and tri-nucleotide repeats impose strong biases on codon-pair context. CONCLUSIONS: The data highlight fundamental differences between prokaryotic and eukaryotic mRNA decoding rules, which are partially independent of codon usage.

  18. Annotating non-coding regions of the genome.

    Science.gov (United States)

    Alexander, Roger P; Fang, Gang; Rozowsky, Joel; Snyder, Michael; Gerstein, Mark B

    2010-08-01

    Most of the human genome consists of non-protein-coding DNA. Recently, progress has been made in annotating these non-coding regions through the interpretation of functional genomics experiments and comparative sequence analysis. One can conceptualize functional genomics analysis as involving a sequence of steps: turning the output of an experiment into a 'signal' at each base pair of the genome; smoothing this signal and segmenting it into small blocks of initial annotation; and then clustering these small blocks into larger derived annotations and networks. Finally, one can relate functional genomics annotations to conserved units and measures of conservation derived from comparative sequence analysis.

  19. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence

    OpenAIRE

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    Background: There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. Methods: All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinform...

  20. Cloning, sequencing, and expression of dnaK-operon proteins from the thermophilic bacterium Thermus thermophilus.

    Science.gov (United States)

    Osipiuk, J; Joachimiak, A

    1997-09-12

    We propose that the dnaK operon of Thermus thermophilus HB8 is composed of three functionally linked genes: dnaK, grpE, and dnaJ. The dnaK and dnaJ gene products are most closely related to their cyanobacterial homologs. The DnaK protein sequence places T. thermophilus in the plastid Hsp70 subfamily. In contrast, the grpE translated sequence is most similar to GrpE from Clostridium acetobutylicum, a Gram-positive anaerobic bacterium. A single promoter region, with homology to the Escherichia coli consensus promoter sequences recognized by the sigma70 and sigma32 transcription factors, precedes the postulated operon. This promoter is heat-shock inducible. The dnaK mRNA level increased more than 30 times upon 10 min of heat shock (from 70 degrees C to 85 degrees C). A strong transcription terminating sequence was found between the dnaK and grpE genes. The individual genes were cloned into pET expression vectors and the thermophilic proteins were overproduced at high levels in E. coli and purified to homogeneity. The recombinant T. thermophilus DnaK protein was shown to have a weak ATP-hydrolytic activity, with an optimum at 90 degrees C. The ATPase was stimulated by the presence of GrpE and DnaJ. Another open reading frame, coding for ClpB heat-shock protein, was found downstream of the dnaK operon.

  1. Codon size reduction as the origin of the triplet genetic code.

    Directory of Open Access Journals (Sweden)

    Pavel V Baranov

    Full Text Available The genetic code appears to be optimized in its robustness to missense errors and frameshift errors. In addition, the genetic code is near-optimal in terms of its ability to carry information in addition to the sequences of encoded proteins. As evolution has no foresight, optimality of the modern genetic code suggests that it evolved from less optimal code variants. The length of codons in the genetic code is also optimal, as three is the minimal nucleotide combination that can encode the twenty standard amino acids. The apparent impossibility of transitions between codon sizes in a discontinuous manner during evolution has resulted in an unbending view that the genetic code was always triplet. Yet, recent experimental evidence on quadruplet decoding, as well as the discovery of organisms with ambiguous and dual decoding, suggest that the possibility of the evolution of triplet decoding from living systems with non-triplet decoding merits reconsideration and further exploration. To explore this possibility we designed a mathematical model of the evolution of primitive digital coding systems which can decode nucleotide sequences into protein sequences. These coding systems can evolve their nucleotide sequences via genetic events of Darwinian evolution, such as point-mutations. The replication rates of such coding systems depend on the accuracy of the generated protein sequences. Computer simulations based on our model show that decoding systems with codons of length greater than three spontaneously evolve into predominantly triplet decoding systems. Our findings suggest a plausible scenario for the evolution of the triplet genetic code in a continuous manner. This scenario suggests an explanation of how protein synthesis could be accomplished by means of long RNA-RNA interactions prior to the emergence of the complex decoding machinery, such as the ribosome, that is required for stabilization and discrimination of otherwise weak triplet codon

  2. Identification of an ICP27-responsive element in the coding region of a herpes simplex virus type 1 late gene.

    Science.gov (United States)

    Sedlackova, Lenka; Perkins, Keith D; Meyer, Julia; Strain, Anna K; Goldman, Oksana; Rice, Stephen A

    2010-03-01

    During productive herpes simplex virus type 1 (HSV-1) infection, a subset of viral delayed-early (DE) and late (L) genes require the immediate-early (IE) protein ICP27 for their expression. However, the cis-acting regulatory sequences in DE and L genes that mediate their specific induction by ICP27 are unknown. One viral L gene that is highly dependent on ICP27 is that encoding glycoprotein C (gC). We previously demonstrated that this gene is posttranscriptionally transactivated by ICP27 in a plasmid cotransfection assay. Based on our past results, we hypothesized that the gC gene possesses a cis-acting inhibitory sequence and that ICP27 overcomes the effects of this sequence to enable efficient gC expression. To test this model, we systematically deleted sequences from the body of the gC gene and tested the resulting constructs for expression. In so doing, we identified a 258-bp "silencing element" (SE) in the 5' portion of the gC coding region. When present, the SE inhibits gC mRNA accumulation from a transiently transfected gC gene, unless ICP27 is present. Moreover, the SE can be transferred to another HSV-1 gene, where it inhibits mRNA accumulation in the absence of ICP27 and confers high-level expression in the presence of ICP27. Thus, for the first time, an ICP27-responsive sequence has been identified in a physiologically relevant ICP27 target gene. To see if the SE functions during viral infection, we engineered HSV-1 recombinants that lack the SE, either in a wild-type (WT) or ICP27-null genetic background. In an ICP27-null background, deletion of the SE led to ICP27-independent expression of the gC gene, demonstrating that the SE functions during viral infection. Surprisingly, the ICP27-independent gC expression seen with the mutant occurred even in the absence of viral DNA synthesis, indicating that the SE helps to regulate the tight DNA replication-dependent expression of gC.

  3. A retinoic acid-inducible mRNA from F9 teratocarcinoma cells encodes a novel protease inhibitor homologue.

    Science.gov (United States)

    Wang, S Y; Gudas, L J

    1990-09-15

    We have previously isolated several cDNA clones specific for mRNA species that increase in abundance during the retinoic acid-associated differentiation of F9 teratocarcinoma stem cells. One of these mRNAs, J6, encodes a approximately 40 kDa protein as assayed by hybrid selection and in vitro translation (Wang, S.-Y., LaRosa, G., and Gudas, L. J. (1985) Dev. Biol. 107, 75-86). The time course of J6 mRNA expression is similar to those of both laminin B1 and collagen IV (alpha 1) messages following retinoic acid addition. To address the functional role of this protein, we have isolated a full-length cDNA clone complementary to this approximately 40-kDa protein mRNA. Sequence analysis reveals an open reading frame of 406 amino acids (Mr 45,652). The carboxyl-terminal portion of this predicted protein contains a region that is homologous to the reactive sites found among members of the serpin (serine protease inhibitor) family. The predicted reactive site (P1-P1') of this J6 protein is Arg-Ser, which is the same as that of antithrombin III. Like ovalbumin and human monocyte-derived plasminogen activator inhibitor (mPAI-2), which are members of the serpin gene family, the J6 protein appears to have no typical amino-terminal signal sequence.

  4. Accident sequence quantification with KIRAP

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Tae Un; Han, Sang Hoon; Kim, Kil You; Yang, Jun Eon; Jeong, Won Dae; Chang, Seung Cheol; Sung, Tae Yong; Kang, Dae Il; Park, Jin Hee; Lee, Yoon Hwan; Hwang, Mi Jeong

    1997-01-01

    The tasks of probabilistic safety assessment(PSA) consists of the identification of initiating events, the construction of event tree for each initiating event, construction of fault trees for event tree logics, the analysis of reliability data and finally the accident sequence quantification. In the PSA, the accident sequence quantification is to calculate the core damage frequency, importance analysis and uncertainty analysis. Accident sequence quantification requires to understand the whole model of the PSA because it has to combine all event tree and fault tree models, and requires the excellent computer code because it takes long computation time. Advanced Research Group of Korea Atomic Energy Research Institute(KAERI) has developed PSA workstation KIRAP(Korea Integrated Reliability Analysis Code Package) for the PSA work. This report describes the procedures to perform accident sequence quantification, the method to use KIRAP`s cut set generator, and method to perform the accident sequence quantification with KIRAP. (author). 6 refs.

  5. Accident sequence quantification with KIRAP

    International Nuclear Information System (INIS)

    Kim, Tae Un; Han, Sang Hoon; Kim, Kil You; Yang, Jun Eon; Jeong, Won Dae; Chang, Seung Cheol; Sung, Tae Yong; Kang, Dae Il; Park, Jin Hee; Lee, Yoon Hwan; Hwang, Mi Jeong.

    1997-01-01

    The tasks of probabilistic safety assessment(PSA) consists of the identification of initiating events, the construction of event tree for each initiating event, construction of fault trees for event tree logics, the analysis of reliability data and finally the accident sequence quantification. In the PSA, the accident sequence quantification is to calculate the core damage frequency, importance analysis and uncertainty analysis. Accident sequence quantification requires to understand the whole model of the PSA because it has to combine all event tree and fault tree models, and requires the excellent computer code because it takes long computation time. Advanced Research Group of Korea Atomic Energy Research Institute(KAERI) has developed PSA workstation KIRAP(Korea Integrated Reliability Analysis Code Package) for the PSA work. This report describes the procedures to perform accident sequence quantification, the method to use KIRAP's cut set generator, and method to perform the accident sequence quantification with KIRAP. (author). 6 refs

  6. DNA Barcoding through Quaternary LDPC Codes.

    Science.gov (United States)

    Tapia, Elizabeth; Spetale, Flavio; Krsticevic, Flavia; Angelone, Laura; Bulacio, Pilar

    2015-01-01

    For many parallel applications of Next-Generation Sequencing (NGS) technologies short barcodes able to accurately multiplex a large number of samples are demanded. To address these competitive requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to provide a fine-scale with regard to size of barcodes (BCH) or have intrinsic poor error correcting abilities (Hamming). Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC) codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, using quaternary LDPC codes may be particularly advantageous due to the lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10(-2) per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate in the order of 10(-9) at the expense of a rate of read losses just in the order of 10(-6).

  7. DNA Barcoding through Quaternary LDPC Codes.

    Directory of Open Access Journals (Sweden)

    Elizabeth Tapia

    Full Text Available For many parallel applications of Next-Generation Sequencing (NGS technologies short barcodes able to accurately multiplex a large number of samples are demanded. To address these competitive requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to provide a fine-scale with regard to size of barcodes (BCH or have intrinsic poor error correcting abilities (Hamming. Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, using quaternary LDPC codes may be particularly advantageous due to the lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10(-2 per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate in the order of 10(-9 at the expense of a rate of read losses just in the order of 10(-6.

  8. Site-Specific Covalent Conjugation of Modified mRNA by tRNA Guanine Transglycosylase.

    Science.gov (United States)

    Ehret, Fabian; Zhou, Cun Yu; Alexander, Seth C; Zhang, Dongyang; Devaraj, Neal K

    2018-03-05

    Modified mRNA (mod-mRNA) has recently been widely studied as the form of RNA useful for therapeutic applications due to its high stability and lowered immune response. Herein, we extend the scope of the recently established RNA-TAG (transglycosylation at guanosine) methodology, a novel approach for genetically encoded site-specific labeling of large mRNA transcripts, by employing mod-mRNA as substrate. As a proof of concept, we covalently attached a fluorescent probe to mCherry encoding mod-mRNA transcripts bearing 5-methylcytidine and/or pseudouridine substitutions with high labeling efficiencies. To provide a versatile labeling methodology with a wide range of possible applications, we employed a two-step strategy for functionalization of the mod-mRNA to highlight the therapeutic potential of this new methodology. We envision that this novel and facile labeling methodology of mod-RNA will have great potential in decorating both coding and noncoding therapeutic RNAs with a variety of diagnostic and functional moieties.

  9. The Coding of Biological Information: From Nucleotide Sequence to Protein Recognition

    Science.gov (United States)

    Štambuk, Nikola

    The paper reviews the classic results of Swanson, Dayhoff, Grantham, Blalock and Root-Bernstein, which link genetic code nucleotide patterns to the protein structure, evolution and molecular recognition. Symbolic representation of the binary addresses defining particular nucleotide and amino acid properties is discussed, with consideration of: structure and metric of the code, direct correspondence between amino acid and nucleotide information, and molecular recognition of the interacting protein motifs coded by the complementary DNA and RNA strands.

  10. Design of Long Period Pseudo-Random Sequences from the Addition of -Sequences over

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of -sequences with pairwise-prime linear spans (AMPLS. Using -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to , a signal set is constructed.

  11. Complete cDNA sequence coding for human docking protein

    Energy Technology Data Exchange (ETDEWEB)

    Hortsch, M; Labeit, S; Meyer, D I

    1988-01-11

    Docking protein (DP, or SRP receptor) is a rough endoplasmic reticulum (ER)-associated protein essential for the targeting and translocation of nascent polypeptides across this membrane. It specifically interacts with a cytoplasmic ribonucleoprotein complex, the signal recognition particle (SRP). The nucleotide sequence of cDNA encoding the entire human DP and its deduced amino acid sequence are given.

  12. Older persons' worries expressed during home care visits: exploring the content of cues and concerns identified by the Verona coding definitions of emotional sequences.

    NARCIS (Netherlands)

    Hafskjold, L.; Eide, T.; Holmström, I.K.; Sundling, V.; Dulmen, S. van; Eide, H.

    2016-01-01

    Objective: Little is known about how older persons in home care express their concerns. Emotional cues and concerns can be identified by the Verona coding definitions of emotional sequences (VR-CoDES), but the method gives no insight into what causes the distress and the emotions involved. The aims

  13. Complete chloroplast genome sequence of a major allogamous forage species, perennial ryegrass (Lolium perenne L.).

    Science.gov (United States)

    Diekmann, Kerstin; Hodkinson, Trevor R; Wolfe, Kenneth H; van den Bekerom, Rob; Dix, Philip J; Barth, Susanne

    2009-06-01

    Lolium perenne L. (perennial ryegrass) is globally one of the most important forage and grassland crops. We sequenced the chloroplast (cp) genome of Lolium perenne cultivar Cashel. The L. perenne cp genome is 135 282 bp with a typical quadripartite structure. It contains genes for 76 unique proteins, 30 tRNAs and four rRNAs. As in other grasses, the genes accD, ycf1 and ycf2 are absent. The genome is of average size within its subfamily Pooideae and of medium size within the Poaceae. Genome size differences are mainly due to length variations in non-coding regions. However, considerable length differences of 1-27 codons in comparison of L. perenne to other Poaceae and 1-68 codons among all Poaceae were also detected. Within the cp genome of this outcrossing cultivar, 10 insertion/deletion polymorphisms and 40 single nucleotide polymorphisms were detected. Two of the polymorphisms involve tiny inversions within hairpin structures. By comparing the genome sequence with RT-PCR products of transcripts for 33 genes, 31 mRNA editing sites were identified, five of them unique to Lolium. The cp genome sequence of L. perenne is available under Accession number AM777385 at the European Molecular Biology Laboratory, National Center for Biotechnology Information and DNA DataBank of Japan.

  14. A STUDY ON DETERMINING THE REFERENCE SPREADING SEQUENCES FOR A DS/CDMACOMMUNICATION SYSTEM

    Directory of Open Access Journals (Sweden)

    Cebrail ÇİFTLİKLİ

    2002-02-01

    Full Text Available In a direct sequence/code division multiple access (DS/CDMA system, the role of the spreading sequences (codes is crucial since the multiple access interference (MAI is the main performance limitation. In this study, we propose an accurate criterion which enables the determination of the reference spreading codes which yield lower bit error rates (BER's in a given code set for a DS/CDMA system using despreading sequences weighted by stepping chip waveforms. The numerical results show that the spreading codes determined by the proposed criterion are the most suitable codes for using as references.

  15. Self-amplifying mRNA vaccines.

    Science.gov (United States)

    Brito, Luis A; Kommareddy, Sushma; Maione, Domenico; Uematsu, Yasushi; Giovani, Cinzia; Berlanda Scorza, Francesco; Otten, Gillis R; Yu, Dong; Mandl, Christian W; Mason, Peter W; Dormitzer, Philip R; Ulmer, Jeffrey B; Geall, Andrew J

    2015-01-01

    This chapter provides a brief introduction to nucleic acid-based vaccines and recent research in developing self-amplifying mRNA vaccines. These vaccines promise the flexibility of plasmid DNA vaccines with enhanced immunogenicity and safety. The key to realizing the full potential of these vaccines is efficient delivery of nucleic acid to the cytoplasm of a cell, where it can amplify and express the encoded antigenic protein. The hydrophilicity and strong net negative charge of RNA impedes cellular uptake. To overcome this limitation, electrostatic complexation with cationic lipids or polymers and physical delivery using electroporation or ballistic particles to improve cellular uptake has been evaluated. This chapter highlights the rapid progress made in using nonviral delivery systems for RNA-based vaccines. Initial preclinical testing of self-amplifying mRNA vaccines has shown nonviral delivery to be capable of producing potent and robust innate and adaptive immune responses in small animals and nonhuman primates. Historically, the prospect of developing mRNA vaccines was uncertain due to concerns of mRNA instability and the feasibility of large-scale manufacturing. Today, these issues are no longer perceived as barriers in the widespread implementation of the technology. Currently, nonamplifying mRNA vaccines are under investigation in human clinical trials and can be produced at a sufficient quantity and quality to meet regulatory requirements. If the encouraging preclinical data with self-amplifying mRNA vaccines are matched by equivalently positive immunogenicity, potency, and tolerability in human trials, this platform could establish nucleic acid vaccines as a versatile new tool for human immunization. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Codes on the Klein quartic, ideals, and decoding

    DEFF Research Database (Denmark)

    Hansen, Johan P.

    1987-01-01

    descriptions as left ideals in the group-algebra GF(2^{3})[G]. This description allows for easy decoding. For instance, in the case of the single error correcting code of length21and dimension16with minimal distance3. decoding is obtained by multiplication with an idempotent in the group algebra.......A sequence of codes with particular symmetries and with large rates compared to their minimal distances is constructed over the field GF(2^{3}). In the sequence there is, for instance, a code of length 21 and dimension10with minimal distance9, and a code of length21and dimension16with minimal...... distance3. The codes are constructed from algebraic geometry using the dictionary between coding theory and algebraic curves over finite fields established by Goppa. The curve used in the present work is the Klein quartic. This curve has the maximal number of rational points over GF(2^{3})allowed by Serre...

  17. A mild form of SLC29A3 disorder: a frameshift deletion leads to the paradoxical translation of an otherwise noncoding mRNA splice variant.

    Directory of Open Access Journals (Sweden)

    Alexandre Bolze

    Full Text Available We investigated two siblings with granulomatous histiocytosis prominent in the nasal area, mimicking rhinoscleroma and Rosai-Dorfman syndrome. Genome-wide linkage analysis and whole-exome sequencing identified a homozygous frameshift deletion in SLC29A3, which encodes human equilibrative nucleoside transporter-3 (hENT3. Germline mutations in SLC29A3 have been reported in rare patients with a wide range of overlapping clinical features and inherited disorders including H syndrome, pigmented hypertrichosis with insulin-dependent diabetes, and Faisalabad histiocytosis. With the exception of insulin-dependent diabetes and mild finger and toe contractures in one sibling, the two patients with nasal granulomatous histiocytosis studied here displayed none of the many SLC29A3-associated phenotypes. This mild clinical phenotype probably results from a remarkable genetic mechanism. The SLC29A3 frameshift deletion prevents the expression of the normally coding transcripts. It instead leads to the translation, expression, and function of an otherwise noncoding, out-of-frame mRNA splice variant lacking exon 3 that is eliminated by nonsense-mediated mRNA decay (NMD in healthy individuals. The mutated isoform differs from the wild-type hENT3 by the modification of 20 residues in exon 2 and the removal of another 28 amino acids in exon 3, which include the second transmembrane domain. As a result, this new isoform displays some functional activity. This mechanism probably accounts for the narrow and mild clinical phenotype of the patients. This study highlights the 'rescue' role played by a normally noncoding mRNA splice variant of SLC29A3, uncovering a new mechanism by which frameshift mutations can be hypomorphic.

  18. [Correlation of codon biases and potential secondary structures with mRNA translation efficiency in unicellular organisms].

    Science.gov (United States)

    Vladimirov, N V; Likhoshvaĭ, V A; Matushkin, Iu G

    2007-01-01

    Gene expression is known to correlate with degree of codon bias in many unicellular organisms. However, such correlation is absent in some organisms. Recently we demonstrated that inverted complementary repeats within coding DNA sequence must be considered for proper estimation of translation efficiency, since they may form secondary structures that obstruct ribosome movement. We have developed a program for estimation of potential coding DNA sequence expression in defined unicellular organism using its genome sequence. The program computes elongation efficiency index. Computation is based on estimation of coding DNA sequence elongation efficiency, taking into account three key factors: codon bias, average number of inverted complementary repeats, and free energy of potential stem-loop structures formed by the repeats. The influence of these factors on translation is numerically estimated. An optimal proportion of these factors is computed for each organism individually. Quantitative translational characteristics of 384 unicellular organisms (351 bacteria, 28 archaea, 5 eukaryota) have been computed using their annotated genomes from NCBI GenBank. Five potential evolutionary strategies of translational optimization have been determined among studied organisms. A considerable difference of preferred translational strategies between Bacteria and Archaea has been revealed. Significant correlations between elongation efficiency index and gene expression levels have been shown for two organisms (S. cerevisiae and H. pylori) using available microarray data. The proposed method allows to estimate numerically the coding DNA sequence translation efficiency and to optimize nucleotide composition of heterologous genes in unicellular organisms. http://www.mgs.bionet.nsc.ru/mgs/programs/eei-calculator/.

  19. Generation of pseudo-random sequences for spread spectrum systems

    Science.gov (United States)

    Moser, R.; Stover, J.

    1985-05-01

    The characteristics of pseudo random radio signal sequences (PRS) are explored. The randomness of the PSR is a matter of artificially altering the sequence of binary digits broadcast. Autocorrelations of the two sequences shifted in time, if high, determine if the signals are the same and thus allow for position identification. Cross-correlation can also be calculated between sequences. Correlations closest to zero are obtained with large volume of prime numbers in the sequences. Techniques for selecting optimal and maximal lengths for the sequences are reviewed. If the correlations are near zero in the sequences, then signal channels can accommodate multiple users. Finally, Gold codes are discussed as a technique for maximizing the code lengths.

  20. Uniform, optimal signal processing of mapped deep-sequencing data.

    Science.gov (United States)

    Kumar, Vibhor; Muratani, Masafumi; Rayan, Nirmala Arul; Kraus, Petra; Lufkin, Thomas; Ng, Huck Hui; Prabhakar, Shyam

    2013-07-01

    Despite their apparent diversity, many problems in the analysis of high-throughput sequencing data are merely special cases of two general problems, signal detection and signal estimation. Here we adapt formally optimal solutions from signal processing theory to analyze signals of DNA sequence reads mapped to a genome. We describe DFilter, a detection algorithm that identifies regulatory features in ChIP-seq, DNase-seq and FAIRE-seq data more accurately than assay-specific algorithms. We also describe EFilter, an estimation algorithm that accurately predicts mRNA levels from as few as 1-2 histone profiles (R ∼0.9). Notably, the presence of regulatory motifs in promoters correlates more with histone modifications than with mRNA levels, suggesting that histone profiles are more predictive of cis-regulatory mechanisms. We show by applying DFilter and EFilter to embryonic forebrain ChIP-seq data that regulatory protein identification and functional annotation are feasible despite tissue heterogeneity. The mathematical formalism underlying our tools facilitates integrative analysis of data from virtually any sequencing-based functional profile.

  1. An upper bound on the number of errors corrected by a convolutional code

    DEFF Research Database (Denmark)

    Justesen, Jørn

    2000-01-01

    The number of errors that a convolutional codes can correct in a segment of the encoded sequence is upper bounded by the number of distinct syndrome sequences of the relevant length.......The number of errors that a convolutional codes can correct in a segment of the encoded sequence is upper bounded by the number of distinct syndrome sequences of the relevant length....

  2. Sequence variants of KHDRBS1 as high penetrance susceptibility risks for primary ovarian insufficiency by mis-regulating mRNA alternative splicing.

    Science.gov (United States)

    Wang, Binbin; Li, Lin; Zhu, Ying; Zhang, Wei; Wang, Xi; Chen, Beili; Li, Tengyan; Pan, Hong; Wang, Jing; Kee, Kehkooi; Cao, Yunxia

    2017-10-01

    Does a novel heterozygous KHDRBS1 variant, identified using whole-exome sequencing (WES) in two patients with primary ovarian insufficiency (POI) in a pedigree, cause defects in mRNA alternative splicing? The heterozygous variant of KHDRBS1 was confirmed to cause defects in alternative splicing of many genes involved in DNA replication and repair. Studies in mice revealed that Khdrbs1 deficient females are subfertile, which manifests as delayed sexual maturity and significantly reduced numbers of secondary and pre-antral follicles. No mutation of KHDRBS1, however, has been reported in patients with POI. This genetic and functional study used WES to find putative mutations in a POI pedigree. Altogether, 215 idiopathic POI patients and 400 healthy controls were screened for KHDRBS1 mutations. Two POI patients were subjected to WES to identify sequence variants. Mutational analysis of the KHDRBS1 gene in 215 idiopathic POI patients and 400 healthy controls were performed. RNA-sequencing was carried out to find the mis-regulation of gene expression due to KHDRBS1 mutation. Bioinformatics was used to analyze the change in alternative splicing events. We identified a heterozygous mutation (c.460A > G, p.M154V) in KHDRBS1 in two patients. Further mutational analysis of 215 idiopathic POI patients with the KHDRBS1 gene found one heterozygous mutation (c.263C > T, p.P88L). We failed to find these two mutations in 400 healthy control women. Using RNA-sequencing, we found that the KGN cells expressing the M154V KHDRBS1 mutant had different expression of 66 genes compared with wild-type (WT) cells. Furthermore, 145 genes were alternatively spliced in M154V cells, and these genes were enriched for DNA replication and repair function, revealing a potential underlying mechanism of the pathology that leads to POI. Although the in vitro assays demonstrated the effect of the KHDRBS1 variant on alternative splicing, further studies are needed to validate the in vivo effects on germ

  3. Differential trypanosome surface coat regulation by a CCCH protein that co-associates with procyclin mRNA cis-elements.

    Directory of Open Access Journals (Sweden)

    Pegine Walrad

    2009-02-01

    Full Text Available The genome of Trypanosoma brucei is unusual in being regulated almost entirely at the post-transcriptional level. In terms of regulation, the best-studied genes are procyclins, which encode a family of major surface GPI-anchored glycoproteins (EP1, EP2, EP3, GPEET that show differential expression in the parasite's tsetse-fly vector. Although procyclin mRNA cis-regulatory sequences have provided the paradigm for post-transcriptional control in kinetoplastid parasites, trans-acting regulators of procyclin mRNAs are unidentified, despite intensive effort over 15 years. Here we identify the developmental regulator, TbZFP3, a CCCH-class predicted RNA binding protein, as an isoform-specific regulator of Procyclin surface coat expression in trypanosomes. We demonstrate (i that endogenous TbZFP3 shows sequence-specific co-precipitation of EP1 and GPEET, but not EP2 and EP3, procyclin mRNA isoforms, (ii that ectopic overexpression of TbZFP3 does not perturb the mRNA abundance of procyclin transcripts, but rather that (iii their protein expression is regulated in an isoform-specific manner, as evidenced by mass spectrometric analysis of the Procyclin expression signature in the transgenic cell lines. The TbZFP3 mRNA-protein complex (TbZFP3mRNP is identified as a trans-regulator of differential surface protein expression in trypanosomes. Moreover, its sequence-specific interactions with procyclin mRNAs are compatible with long-established predictions for Procyclin regulation. Combined with the known association of TbZFP3 with the translational apparatus, this study provides a long-sought missing link between surface protein cis-regulatory signals and the gene expression machinery in trypanosomes.

  4. Annotating pathogenic non-coding variants in genic regions.

    Science.gov (United States)

    Gelfman, Sahar; Wang, Quanli; McSweeney, K Melodi; Ren, Zhong; La Carpia, Francesca; Halvorsen, Matt; Schoch, Kelly; Ratzon, Fanni; Heinzen, Erin L; Boland, Michael J; Petrovski, Slavé; Goldstein, David B

    2017-08-09

    Identifying the underlying causes of disease requires accurate interpretation of genetic variants. Current methods ineffectively capture pathogenic non-coding variants in genic regions, resulting in overlooking synonymous and intronic variants when searching for disease risk. Here we present the Transcript-inferred Pathogenicity (TraP) score, which uses sequence context alterations to reliably identify non-coding variation that causes disease. High TraP scores single out extremely rare variants with lower minor allele frequencies than missense variants. TraP accurately distinguishes known pathogenic and benign variants in synonymous (AUC = 0.88) and intronic (AUC = 0.83) public datasets, dismissing benign variants with exceptionally high specificity. TraP analysis of 843 exomes from epilepsy family trios identifies synonymous variants in known epilepsy genes, thus pinpointing risk factors of disease from non-coding sequence data. TraP outperforms leading methods in identifying non-coding variants that are pathogenic and is therefore a valuable tool for use in gene discovery and the interpretation of personal genomes.While non-coding synonymous and intronic variants are often not under strong selective constraint, they can be pathogenic through affecting splicing or transcription. Here, the authors develop a score that uses sequence context alterations to predict pathogenicity of synonymous and non-coding genetic variants, and provide a web server of pre-computed scores.

  5. On Coding Non-Contiguous Letter Combinations

    Directory of Open Access Journals (Sweden)

    Frédéric eDandurand

    2011-06-01

    Full Text Available Starting from the hypothesis that printed word identification initially involves the parallel mapping of visual features onto location-specific letter identities, we analyze the type of information that would be involved in optimally mapping this location-specific orthographic code onto a location-invariant lexical code. We assume that some intermediate level of coding exists between individual letters and whole words, and that this involves the representation of letter combinations. We then investigate the nature of this intermediate level of coding given the constraints of optimality. This intermediate level of coding is expected to compress data while retaining as much information as possible about word identity. Information conveyed by letters is a function of how much they constrain word identity and how visible they are. Optimization of this coding is a combination of minimizing resources (using the most compact representations and maximizing information. We show that in a large proportion of cases, non-contiguous letter sequences contain more information than contiguous sequences, while at the same time requiring less precise coding. Moreover, we found that the best predictor of human performance in orthographic priming experiments was within-word ranking of conditional probabilities, rather than average conditional probabilities. We conclude that from an optimality perspective, readers learn to select certain contiguous and non-contiguous letter combinations as information that provides the best cue to word identity.

  6. Cloning the human lysozyme cDNA: Inverted Alu repeat in the mRNA and in situ hybridization for macrophages and Paneth cells

    International Nuclear Information System (INIS)

    Chung, L.P.; Keshav, S.; Gordon, S.

    1988-01-01

    Lysozyme is a major secretory product of human and rodent macrophages and a useful marker for myelomonocytic cells. Based on the known human lysozyme amino acid sequence, oligonucleotides were synthesized and used as probes to screen a phorbol 12-myristate 13-acetate-treated U937 cDNA library. A full-length human lysozyme cDNA clone, pHL-2, was obtained and characterized. Sequence analysis shows that human lysozyme, like chicken lysozyme, has in 18-amino-acid-long signal peptide, but unlike the chicken lysozyme cDNA, the human lysozyme cDNA has a >1-kilobase-long 3' nontranslated sequence. Interestingly, within this 3' region, an inverted repeat of the Alu family of repetitive sequences was discovered. In RNA blot analyses, DNA probes prepared from pHL-2 can be used to detect lysozyme mRNA not only from human but also from mouse and rat. Moreover, by in situ hybridization, complementary RNA transcripts have been used as probes to detect lysozyme mRNA in mouse macrophages and Paneth cells. This human lysozyme cDNA clone is therefore likely to be a useful molecular probe for studying macrophage distribution and gene expression

  7. A hairpin within YAP mRNA 3′UTR functions in regulation at post-transcription level

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Yuen; Wang, Yuan; Feng, Jinyan; Feng, Guoxing; Zheng, Minying; Yang, Zhe; Xiao, Zelin; Lu, Zhanping [State Key Laboratory of Medicinal Chemical Biology, Department of Cancer Research, College of Life Sciences, Nankai University, Tianjin 300071 (China); Ye, Lihong [State Key Laboratory of Medicinal Chemical Biology, Department of Biochemistry, College of Life Sciences, Nankai University, Tianjin 300071 (China); Zhang, Xiaodong, E-mail: zhangxd@nankai.edu.cn [State Key Laboratory of Medicinal Chemical Biology, Department of Cancer Research, College of Life Sciences, Nankai University, Tianjin 300071 (China)

    2015-04-03

    The central dogma of gene expression is that DNA is transcribed into messenger RNAs, which in turn serve as the template for protein synthesis. Recently, it has been reported that mRNAs display regulatory roles that rely on their ability to compete for microRNA binding, independent of their protein-coding function. However, the regulatory mechanism of mRNAs remains poorly understood. Here, we report that a hairpin within YAP mRNA 3′untranslated region (3′UTR) functions in regulation at post-transcription level through generating endogenous siRNAs (esiRNAs). Bioinformatics analysis for secondary structure showed that YAP mRNA displayed a hairpin structure (termed standard hairpin, S-hairpin) within its 3′UTR. Surprisingly, we observed that the overexpression of S-hairpin derived from YAP 3′UTR (YAP-sh) increased the luciferase reporter activities of transcriptional factor NF-κB and AP-1 in 293T cells. Moreover, we identified that a fragment from YAP-sh, an esiRNA, was able to target mRNA 3′UTR of NF2 (a member of Hippo-signaling pathway) and YAP mRNA 3′UTR itself in hepatoma cells. Thus, we conclude that the YAP-sh within YAP mRNA 3′UTR may serve as a novel regulatory element, which functions in regulation at post-transcription level. Our finding provides new insights into the mechanism of mRNAs in regulatory function. - Highlights: • An S-hairpin within YAP mRNA 3′UTR possesses regulatory function. • YAP-sh acts as a regulatory element for YAP at post-transcription level. • YAP-sh-3p20, an esiRNA derived from YAP-sh, targets mRNAs of YAP and NF2. • YAP-sh-3p20 depresses the proliferation of HepG2 cells in vitro.

  8. A hairpin within YAP mRNA 3′UTR functions in regulation at post-transcription level

    International Nuclear Information System (INIS)

    Gao, Yuen; Wang, Yuan; Feng, Jinyan; Feng, Guoxing; Zheng, Minying; Yang, Zhe; Xiao, Zelin; Lu, Zhanping; Ye, Lihong; Zhang, Xiaodong

    2015-01-01

    The central dogma of gene expression is that DNA is transcribed into messenger RNAs, which in turn serve as the template for protein synthesis. Recently, it has been reported that mRNAs display regulatory roles that rely on their ability to compete for microRNA binding, independent of their protein-coding function. However, the regulatory mechanism of mRNAs remains poorly understood. Here, we report that a hairpin within YAP mRNA 3′untranslated region (3′UTR) functions in regulation at post-transcription level through generating endogenous siRNAs (esiRNAs). Bioinformatics analysis for secondary structure showed that YAP mRNA displayed a hairpin structure (termed standard hairpin, S-hairpin) within its 3′UTR. Surprisingly, we observed that the overexpression of S-hairpin derived from YAP 3′UTR (YAP-sh) increased the luciferase reporter activities of transcriptional factor NF-κB and AP-1 in 293T cells. Moreover, we identified that a fragment from YAP-sh, an esiRNA, was able to target mRNA 3′UTR of NF2 (a member of Hippo-signaling pathway) and YAP mRNA 3′UTR itself in hepatoma cells. Thus, we conclude that the YAP-sh within YAP mRNA 3′UTR may serve as a novel regulatory element, which functions in regulation at post-transcription level. Our finding provides new insights into the mechanism of mRNAs in regulatory function. - Highlights: • An S-hairpin within YAP mRNA 3′UTR possesses regulatory function. • YAP-sh acts as a regulatory element for YAP at post-transcription level. • YAP-sh-3p20, an esiRNA derived from YAP-sh, targets mRNAs of YAP and NF2. • YAP-sh-3p20 depresses the proliferation of HepG2 cells in vitro

  9. Yak response to high-altitude hypoxic stress by altering mRNA expression and DNA methylation of hypoxia-inducible factors.

    Science.gov (United States)

    Xiong, Xianrong; Fu, Mei; Lan, Daoliang; Li, Jian; Zi, Xiangdong; Zhong, Jincheng

    2015-01-01

    Hypoxia-inducible factors (HIFs) are oxygen-dependent transcriptional activators, which play crucial roles in tumor angiogenesis and mammalian development, and regulate the transcription of genes involved in oxygen homeostasis in response to hypoxia. However, information on HIF-1α and HIF-2α in yak (Bos grunniens) is scarce. The complete coding region of yak HIF-2α was cloned, its mRNA expression in several tissues were determined, and the expression levels were compared with those of closely related low-altitude cattle (Bos taurus), and the methylation status of promoter regions were analyzed to better understand the roles of HIF-1α and HIF-2α in domesticated yak. The yak HIF-2α cDNA was cloned and sequenced in the present work reveals the evolutionary conservation through multiple sequence alignment, although 15 bases changed, resulting in 8 amino acid substitutions in the translated proteins in cattle. The tissue-specific expression results showed that HIF-1α is ubiquitously expressed, whereas HIF-2α expression is limited to endothelial tissues (kidney, heart, lung, spleen, and liver) and blood in yak. Both HIF-1α and HIF-2α expressions were higher in yak tissues than in cattle. The HIF-1α expression level is much higher in yak than cattle in these organs, except for the lung (P hypoxic stress response mechanism and may assist current medical research to understand hypoxia-related diseases.

  10. Detection of siRNA Mediated Target mRNA Cleavage Activities in Human Cells by a Novel Stem-Loop Array RT-PCR Analysis

    Science.gov (United States)

    2016-09-07

    sequences of the target mRNA, and a double stranded stem at the 5′ end that forms a stem -loop to function as a forceps to stabilize the secondary...E-mjournal homepage: www.elsevier.com/locate/bbrepDetection of siRNA-mediated target mRNA cleavage activities in human cells by a novel stem -loop...challenges for the accurate and efficient detection and verification of cleavage sites on target mRNAs. Here we used a sensitive stem -loop array reverse

  11. Molecular characterization of branchial aquaporin 1aa and effects of seawater acclimation, emersion or ammonia exposure on its mRNA expression in the gills, gut, kidney and skin of the freshwater climbing perch, Anabas testudineus.

    Directory of Open Access Journals (Sweden)

    Yuen K Ip

    Full Text Available We obtained a full cDNA coding sequence of aquaporin 1aa (aqp1aa from the gills of the freshwater climbing perch, Anabas testudineus, which had the highest expression in the gills and skin, suggesting an important role of Aqp1aa in these organs. Since seawater acclimation had no significant effects on the branchial and intestinal aqp1aa mRNA expression, and since the mRNA expression of aqp1aa in the gut was extremely low, it can be deduced that Aqp1aa, despite being a water channel, did not play a significant osmoregulatory role in A. testudineus. However, terrestrial exposure led to significant increases in the mRNA expression of aqp1aa in the gills and skin of A. testudineus. Since terrestrial exposure would lead to evaporative water loss, these results further support the proposition that Aqp1aa did not function predominantly for the permeation of water through the gills and skin. Rather, increased aqp1aa mRNA expression might be necessary to facilitate increased ammonia excretion during emersion, because A. testudineus is known to utilize amino acids as energy sources for locomotor activity with increased ammonia production on land. Furthermore, ammonia exposure resulted in significant decreases in mRNA expression of aqp1aa in the gills and skin of A. testudineus, presumably to reduce ammonia influx during ammonia loading. This corroborates previous reports on AQP1 being able to facilitate ammonia permeation. However, a molecular characterization of Aqp1aa from A. testudineus revealed that its intrinsic aquapore might not facilitate NH3 transport. Hence, ammonia probably permeated the central fifth pore of the Aqp1aa tetramer as suggested previously. Taken together, our results indicate that Aqp1aa might have a greater physiological role in ammonia excretion than in osmoregulation in A. testudineus.

  12. MiR-200a is involved in rat epididymal development by targeting β-catenin mRNA

    Institute of Scientific and Technical Information of China (English)

    Xiaojiang Wu; Botao Zhao; Wei Li; Yue Chen; Ruqiang Liang; Lin Li; Youxin Jin; Kangcheng Ruan

    2012-01-01

    The expression of 350 microRNAs (miRNAs) in epididymis of rat from postnatal development to adult (from postnatal days 7-70) was profiled with home-made miRNA microarray.Among them,48 miRNAs changed significantly, in which the expression of miR-200a increased obviously with time,in a good agreement with that obtained from northern blot analysis.The real-time quantitative-polymerase chain reaction result indicated that temporal expression of rat β-catenin was exactly inversed to that of miR-200a during rat epididymal development,implying that miR-200a might also target β-catenin mRNA in rat epididymis as reported by Saydam et al.in humans.The bioinformatic analysis indicated that 3' untranslated region of rat β-catenin mRNA did contain a putative binding site for miR-200a.Meanwhile,it was found that the sequence of this binding site was different from that of human β-catenin mRNA with a deletion of two adjacent nucleotides (U and C).But the results of luciferase targeting assay in HEK 293T cells and the overexpression of miR-200a in rat NRK cells demonstrated that miR-200a did target rat β-catenin mRNA and cause the suppression of its expression.All these results show that miR-200a should be involved in rat epididymal development by targeting β-catenin mRNA of rat and suppressing its expression.

  13. Final report: FASEB Summer Research Conference on ''Post-transcriptional control of gene expression: Effectors of mRNA decay'' [agenda and attendees list

    Energy Technology Data Exchange (ETDEWEB)

    Maquat, Lynne

    2002-12-01

    The goal of this meeting was to provide an interactive forum for scientists working on prokaryotic and eukaryotic mRNA decay. A special seminar presented by a leader in the field of mRNA decay in S. cerevisiae focused on what is known and what needs to be determined, not only for yeast but for other organisms. The large attendance (110 participants) reflects the awareness that mRNA decay is a key player in gene regulation in a way that is affected by the many steps that precede mRNA formation. Sessions were held on the following topics: mRNA transport and mRNP; multicomponent eukaryotic nucleases; nonsense-mediated mRNA decay and nonsense-associated altered splicing; Cis-acting sequences/Trans-acting factors of mRNA decay; translational accuracy; multicomponent bacterial nucleases; interplay between mRNA polyadenylation, translation and decay in prokaryotes and prokaryotic organelles; and RNA interference and other RNA mediators of gene expression. In addition to the talks and two poster sessions, there were three round tables: (1) Does translation occur in the nucleus? (2) Differences and similarities in the mechanisms of mRNA decay in different eukaryotes, and (3) RNA surveillance in bacteria?

  14. Linear network error correction coding

    CERN Document Server

    Guang, Xuan

    2014-01-01

    There are two main approaches in the theory of network error correction coding. In this SpringerBrief, the authors summarize some of the most important contributions following the classic approach, which represents messages by sequences?similar to algebraic coding,?and also briefly discuss the main results following the?other approach,?that uses the theory of rank metric codes for network error correction of representing messages by subspaces. This book starts by establishing the basic linear network error correction (LNEC) model and then characterizes two equivalent descriptions. Distances an

  15. Filter paper collection of Plasmodium falciparum mRNA for detecting low-density gametocytes.

    Science.gov (United States)

    Jones, Sophie; Sutherland, Colin J; Hermsen, Cornelus; Arens, Theo; Teelen, Karina; Hallett, Rachel; Corran, Patrick; van der Vegte-Bolmer, Marga; Sauerwein, Robert; Drakeley, Chris J; Bousema, Teun

    2012-08-08

    Accurate sampling of sub-microscopic gametocytes is necessary for epidemiological studies to identify the infectious reservoir of Plasmodium falciparum. Detection of gametocyte mRNA achieves sensitive detection, but requires careful handling of samples. Filter papers can be used for collecting RNA samples, but rigorous testing of their capacity to withstand adverse storage conditions has not been fully explored. Three gametocyte dilutions: 10/μL, 1.0/μL and 0.1/μL were spotted onto Whatman™ 903 Protein Saver Cards, FTA Classic Cards and 3MM filter papers that were stored under frozen, cold chain or tropical conditions for up to 13 weeks . RNA was extracted, then detected by quantitative nucleic acid sequence-based amplification (QT-NASBA) and reverse-transcriptase PCR (RT-PCR). Successful gametocyte detection was more frequently observed from the Whatman 903 Protein Saver Card compared to the Whatman FTA Classic Card, by both techniques (pFTA Classic Card but not the 903 Protein Saver Card or Whatman 3MM filter paper. The sensitivity of gametocyte detection was decreased when papers were stored at high humidity. This study indicates the Whatman 903 Protein Saver Card is better for Pfs25 mRNA sampling compared to the Whatman FTA Classic Card, and that the Whatman 3MM filter paper may prove to be a satisfactory cheaper option for Pfs25 mRNA sampling. When appropriately dried, filter papers provide a useful approach to Pfs25 mRNA sampling, especially in settings where storage in RNA-protecting buffer is not possible.

  16. The Genomic Code: Genome Evolution and Potential Applications

    KAUST Repository

    Bernardi, Giorgio

    2016-01-25

    The genome of metazoans is organized according to a genomic code which comprises three laws: 1) Compositional correlations hold between contiguous coding and non-coding sequences, as well as among the three codon positions of protein-coding genes; these correlations are the consequence of the fact that the genomes under consideration consist of fairly homogeneous, long (≥200Kb) sequences, the isochores; 2) Although isochores are defined on the basis of purely compositional properties, GC levels of isochores are correlated with all tested structural and functional properties of the genome; 3) GC levels of isochores are correlated with chromosome architecture from interphase to metaphase; in the case of interphase the correlation concerns isochores and the three-dimensional “topological associated domains” (TADs); in the case of mitotic chromosomes, the correlation concerns isochores and chromosomal bands. Finally, the genomic code is the fourth and last pillar of molecular biology, the first three pillars being 1) the double helix structure of DNA; 2) the regulation of gene expression in prokaryotes; and 3) the genetic code.

  17. Inaugural Genomics Automation Congress and the coming deluge of sequencing data.

    Science.gov (United States)

    Creighton, Chad J

    2010-10-01

    Presentations at Select Biosciences's first 'Genomics Automation Congress' (Boston, MA, USA) in 2010 focused on next-generation sequencing and the platforms and methodology around them. The meeting provided an overview of sequencing technologies, both new and emerging. Speakers shared their recent work on applying sequencing to profile cells for various levels of biomolecular complexity, including DNA sequences, DNA copy, DNA methylation, mRNA and microRNA. With sequencing time and costs continuing to drop dramatically, a virtual explosion of very large sequencing datasets is at hand, which will probably present challenges and opportunities for high-level data analysis and interpretation, as well as for information technology infrastructure.

  18. Simultaneous chromatic dispersion and PMD compensation by using coded-OFDM and girth-10 LDPC codes.

    Science.gov (United States)

    Djordjevic, Ivan B; Xu, Lei; Wang, Ting

    2008-07-07

    Low-density parity-check (LDPC)-coded orthogonal frequency division multiplexing (OFDM) is studied as an efficient coded modulation scheme suitable for simultaneous chromatic dispersion and polarization mode dispersion (PMD) compensation. We show that, for aggregate rate of 10 Gb/s, accumulated dispersion over 6500 km of SMF and differential group delay of 100 ps can be simultaneously compensated with penalty within 1.5 dB (with respect to the back-to-back configuration) when training sequence based channel estimation and girth-10 LDPC codes of rate 0.8 are employed.

  19. Structure of the gene encoding VGF, a nervous system-specific mRNA that is rapidly and selectively induced by nerve growth factor in PC12 cells.

    Science.gov (United States)

    Salton, S R; Fischberg, D J; Dong, K W

    1991-05-01

    Nerve growth factor (NGF) plays a critical role in the development and survival of neurons in the peripheral nervous system. Following treatment with NGF but not epidermal growth factor, rat pheochromocytoma (PC12) cells undergo neural differentiation. We have cloned a nervous system-specific mRNA, NGF33.1, that is rapidly and relatively selectively induced by treatment of PC12 cells with NGF and basic fibroblast growth factor in comparison with epidermal growth factor. Analysis of the nucleic acid and predicted amino acid sequences of the NGF33.1 cDNA clone suggested that this clone corresponded to the NGF-inducible mRNA called VGF (A. Levi, J. D. Eldridge, and B. M. Paterson, Science 229:393-395, 1985; R. Possenti, J. D. Eldridge, B. M. Paterson, A. Grasso, and A. Levi, EMBO J. 8:2217-2223, 1989). We have used the NGF33.1 cDNA clone to isolate and characterize the VGF gene, and in this paper we report the complete sequence of the VGF gene, including 853 bases of 5' flank revealed TATAA and CCAAT elements, several GC boxes, and a consensus cyclic AMP response element-binding protein binding site. The VGF promoter contains sequences homologous to other NGF-inducible, neuronal promoters. We further show that VGF mRNA is induced in PC12 cells to a greater extent by depolarization and by phorbol-12-myristate-13-acetate treatment than by 8-bromo-cyclic AMP treatment. By Northern (RNA) and RNase protection analysis, VGF mRNA is detectable in embryonic and postnatal central and peripheral nervous tissues but not in a number of nonneural tissues. In the cascade of events which ultimately leads to the neural differentiation of NGF-treated PC12 cells, the VGF gene encodes the most rapidly and selectively regulated, nervous-system specific mRNA yet identified.

  20. Low-level lasers on microRNA and uncoupling protein 2 mRNA levels in human breast cancer cells

    Science.gov (United States)

    Canuto, K. S.; Teixeira, A. F.; Rodrigues, J. A.; Paoli, F.; Nogueira, E. M.; Mencalha, A. L.; Fonseca, A. S.

    2017-06-01

    MicroRNA is short non-coding RNA and is a mediator of post-transcriptional regulation of gene expression. In addition, uncoupling proteins (UCPs) regulate thermogenesis, metabolic and energy balance, and decrease reactive oxygen species production. Both microRNA and UCP2 expression can be altered in cancer cells. At low power, laser wavelength, frequency, fluence and emission mode deternube photobiological responses, which are the basis of low-level laser therapy. There are few studies on miRNA and UCP mRNA levels after low-level laser exposure on cancer cells. In this work, we evaluate the micrRNA (mir-106b and mir-15a) and UCP2 mRNA levels in human breast cancer cells exposed to low-level lasers. MDA-MB-231 human breast cancer cells were exposed to low-level red and infrared lasers, total RNA was extracted for cDNA synthesis and mRNA levels by real time quantitative polymerase chain reaction were evaluated. Data show that mir-106b and mir-15a relative levels are not altered, but UCP2 mRNA relative levels are increased in MDA-MB-231 human breast cancer cells exposed to low-level red and infrared lasers at fluences used in therapeutic protocols.

  1. On the Organizational Dynamics of the Genetic Code

    KAUST Repository

    Zhang, Zhang

    2011-06-07

    The organization of the canonical genetic code needs to be thoroughly illuminated. Here we reorder the four nucleotides—adenine, thymine, guanine and cytosine—according to their emergence in evolution, and apply the organizational rules to devising an algebraic representation for the canonical genetic code. Under a framework of the devised code, we quantify codon and amino acid usages from a large collection of 917 prokaryotic genome sequences, and associate the usages with its intrinsic structure and classification schemes as well as amino acid physicochemical properties. Our results show that the algebraic representation of the code is structurally equivalent to a content-centric organization of the code and that codon and amino acid usages under different classification schemes were correlated closely with GC content, implying a set of rules governing composition dynamics across a wide variety of prokaryotic genome sequences. These results also indicate that codons and amino acids are not randomly allocated in the code, where the six-fold degenerate codons and their amino acids have important balancing roles for error minimization. Therefore, the content-centric code is of great usefulness in deciphering its hitherto unknown regularities as well as the dynamics of nucleotide, codon, and amino acid compositions.

  2. On the Organizational Dynamics of the Genetic Code

    KAUST Repository

    Zhang, Zhang; Yu, Jun

    2011-01-01

    The organization of the canonical genetic code needs to be thoroughly illuminated. Here we reorder the four nucleotides—adenine, thymine, guanine and cytosine—according to their emergence in evolution, and apply the organizational rules to devising an algebraic representation for the canonical genetic code. Under a framework of the devised code, we quantify codon and amino acid usages from a large collection of 917 prokaryotic genome sequences, and associate the usages with its intrinsic structure and classification schemes as well as amino acid physicochemical properties. Our results show that the algebraic representation of the code is structurally equivalent to a content-centric organization of the code and that codon and amino acid usages under different classification schemes were correlated closely with GC content, implying a set of rules governing composition dynamics across a wide variety of prokaryotic genome sequences. These results also indicate that codons and amino acids are not randomly allocated in the code, where the six-fold degenerate codons and their amino acids have important balancing roles for error minimization. Therefore, the content-centric code is of great usefulness in deciphering its hitherto unknown regularities as well as the dynamics of nucleotide, codon, and amino acid compositions.

  3. Flexibility of the genetic code with respect to DNA structure

    DEFF Research Database (Denmark)

    Baisnée, P. F.; Baldi, Pierre; Brunak, Søren

    2001-01-01

    Motivation. The primary function of DNA is to carry genetic information through the genetic code. DNA, however, contains a variety of other signals related, for instance, to reading frame, codon bias, pairwise codon bias, splice sites and transcription regulation, nucleosome positioning and DNA...... structure. Here we study the relationship between the genetic code and DNA structure and address two questions. First, to which degree does the degeneracy of the genetic code and the acceptable amino acid substitution patterns allow for the superimposition of DNA structural signals to protein coding...... sequences? Second, is the origin or evolution of the genetic code likely to have been constrained by DNA structure? Results. We develop an index for code flexibility with respect to DNA structure. Using five different di- or tri-nucleotide models of sequence-dependent DNA structure, we show...

  4. Differential effects of simple repeating DNA sequences on gene expression from the SV40 early promoter.

    Science.gov (United States)

    Amirhaeri, S; Wohlrab, F; Wells, R D

    1995-02-17

    The influence of simple repeat sequences, cloned into different positions relative to the SV40 early promoter/enhancer, on the transient expression of the chloramphenicol acetyltransferase (CAT) gene was investigated. Insertion of (G)29.(C)29 in either orientation into the 5'-untranslated region of the CAT gene reduced expression in CV-1 cells 50-100 fold when compared with controls with random sequence inserts. Analysis of CAT-specific mRNA levels demonstrated that the effect was due to a reduction of CAT mRNA production rather than to posttranscriptional events. In contrast, insertion of the same insert in either orientation upstream of the promoter-enhancer or downstream of the gene stimulated gene expression 2-3-fold. These effects could be reversed by cotransfection of a competitor plasmid carrying (G)25.(C)25 sequences. The results suggest that a G.C-binding transcription factor modulates gene expression in this system and that promoter strength can be regulated by providing protein-binding sites in trans. Although constructs containing longer tracts of alternating (C-G), (T-G), or (A-T) sequences inhibited CAT expression when inserted in the 5'-untranslated region of the CAT gene, the amount of CAT mRNA was unaffected. Hence, these inhibitions must be due to posttranscriptional events, presumably at the level of translation. These effects of microsatellite sequences on gene expression are discussed with respect to recent data on related simple repeat sequences which cause several human genetic diseases.

  5. A human-specific de novo protein-coding gene associated with human brain functions.

    Directory of Open Access Journals (Sweden)

    Chuan-Yun Li

    2010-03-01

    Full Text Available To understand whether any human-specific new genes may be associated with human brain functions, we computationally screened the genetic vulnerable factors identified through Genome-Wide Association Studies and linkage analyses of nicotine addiction and found one human-specific de novo protein-coding gene, FLJ33706 (alternative gene symbol C20orf203. Cross-species analysis revealed interesting evolutionary paths of how this gene had originated from noncoding DNA sequences: insertion of repeat elements especially Alu contributed to the formation of the first coding exon and six standard splice junctions on the branch leading to humans and chimpanzees, and two subsequent substitutions in the human lineage escaped two stop codons and created an open reading frame of 194 amino acids. We experimentally verified FLJ33706's mRNA and protein expression in the brain. Real-Time PCR in multiple tissues demonstrated that FLJ33706 was most abundantly expressed in brain. Human polymorphism data suggested that FLJ33706 encodes a protein under purifying selection. A specifically designed antibody detected its protein expression across human cortex, cerebellum and midbrain. Immunohistochemistry study in normal human brain cortex revealed the localization of FLJ33706 protein in neurons. Elevated expressions of FLJ33706 were detected in Alzheimer's brain samples, suggesting the role of this novel gene in human-specific pathogenesis of Alzheimer's disease. FLJ33706 provided the strongest evidence so far that human-specific de novo genes can have protein-coding potential and differential protein expression, and be involved in human brain functions.

  6. [Effects of lipopolysaccharides extracted from Porphyromonas endodontalis on the expression of IL-1beta mRNA and IL-6 mRNA in osteoblasts].

    Science.gov (United States)

    Yang, Di; Li, Ren; Qiu, Li-Hong; Li, Chen

    2009-04-01

    To quantify the IL-1 beta mRNA and IL-6 mRNA expression induced by lipopolysaccharides (LPS)extracted from Porphyromonas endodontalis(P.e) in osteoblasts, and to relate P.e-LPS to bone absorption pathogenesis in lesions of chronical apical periodontitis. MG63 was treated with different concentrations of P.e-LPS(0-50 microg/mL) for different hours(0-24h). The expression of IL-1 beta mRNA and IL-6 mRNA was detected by reverse transcription polymerase chain reaction (RT-PCR).Statistical analysis was performed using one- way ANOVA and Dunnett t test with SPSS11.0 software package. The level of IL-1 beta mRNA and IL-6 mRNA increased significantly after treatment with P.e-LPS at more than 5 microg/mL (P<0.01)and for more than 1 hour (P<0.01), which indicated that P.e-LPS induced osteoblasts to express IL-1 beta mRNA and IL-6 mRNA in dose and time dependent manners. P.e-LPS may promote bone resorption in lesions of chronical apical periodontitis by inducing IL-1 beta mRNA and IL-6 mRNA expression in osteoblasts.

  7. Epigenetic mechanisms involved in differential MDR1 mRNA expression between gastric and colon cancer cell lines and rationales for clinical chemotherapy

    Directory of Open Access Journals (Sweden)

    Kim Kyung-Jong

    2008-08-01

    Full Text Available Abstract Background The membrane transporters such as P-glycoprotein (Pgp, the MDR1 gene product, are one of causes of treatment failure in cancer patients. In this study, the epigenetic mechanisms involved in differential MDR1 mRNA expression were compared between 10 gastric and 9 colon cancer cell lines. Methods The MDR1 mRNA levels were determined using PCR and real-time PCR assays after reverse transcription. Cytotoxicity was performed using the MTT assay. Methylation status was explored by quantification PCR-based methylation and bisulfite DNA sequencing analyses. Results The MDR1 mRNA levels obtained by 35 cycles of RT-PCR in gastric cancer cells were just comparable to those obtained by 22 cycles of RT-PCR in colon cancer cells. Real-time RT-PCR analysis revealed that MDR1 mRNA was not detected in the 10 gastric cancer cell lines but variable MDR1 mRNA levels in 7 of 9 colon cancer cell lines except the SNU-C5 and HT-29 cells. MTT assay showed that Pgp inhibitors such as cyclosporine A, verapamil and PSC833 sensitized Colo320HSR (colon, highest MDR1 expression but not SNU-668 (gastric, highest and SNU-C5 (gastric, no expression to paclitaxel. Quantification PCR-based methylation analysis revealed that 90% of gastric cancer cells, and 33% of colon cancer cells were methylated, which were completely matched with the results obtained by bisulfite DNA sequencing analysis. 5-aza-2'-deoxcytidine (5AC, a DNA methyltransferase inhibitor increased the MDR1 mRNA levels in 60% of gastric cells, and in 11% of colon cancer cells. Trichostatin A (TSA, histone deacetylase inhibitor increased the MDR1 mRNA levels in 70% of gastric cancer cells and 55% of colon cancer cells. The combined treatment of 5AC with TSA increased the MDR1 mRNA levels additively in 20% of gastric cancer cells, but synergistically in 40% of gastric and 11% of colon cancer cells. Conclusion These results indicate that the MDR1 mRNA levels in gastric cancer cells are significantly

  8. Presence and Expression of Microbial Genes Regulating Soil Nitrogen Dynamics Along the Tanana River Successional Sequence

    Science.gov (United States)

    Boone, R. D.; Rogers, S. L.

    2004-12-01

    We report on work to assess the functional gene sequences for soil microbiota that control nitrogen cycle pathways along the successional sequence (willow, alder, poplar, white spruce, black spruce) on the Tanana River floodplain, Interior Alaska. Microbial DNA and mRNA were extracted from soils (0-10 cm depth) for amoA (ammonium monooxygenase), nifH (nitrogenase reductase), napA (nitrate reductase), and nirS and nirK (nitrite reductase) genes. Gene presence was determined by amplification of a conserved sequence of each gene employing sequence specific oligonucleotide primers and Polymerase Chain Reaction (PCR). Expression of the genes was measured via nested reverse transcriptase PCR amplification of the extracted mRNA. Amplified PCR products were visualized on agarose electrophoresis gels. All five successional stages show evidence for the presence and expression of microbial genes that regulate N fixation (free-living), nitrification, and nitrate reduction. We detected (1) nifH, napA, and nirK presence and amoA expression (mRNA production) for all five successional stages and (2) nirS and amoA presence and nifH, nirK, and napA expression for early successional stages (willow, alder, poplar). The results highlight that the existing body of previous process-level work has not sufficiently considered the microbial potential for a nitrate economy and free-living N fixation along the complete floodplain successional sequence.

  9. Complementary DNA and derived amino acid sequence of the α subunit of human complement protein C8: evidence for the existence of a separate α subunit messenger RNA

    International Nuclear Information System (INIS)

    Rao, A.G.; Howard, O.M.Z.; Ng, S.C.; Whitehead, A.S.; Colten, H.R.; Sodetz, J.M.

    1987-01-01

    The entire amino acid sequence of the α subunit (M/sub r/ 64,000) of the eight component of complement (C8) was determined by characterizing cDNA clones isolated from a human liver cDNA library. Two clones with overlapping inserts of net length 2.44 kilobases (kb) were isolated and found to contain the entire α coding region [1659 base pairs (bp)]. The 5' end consists of an untranslated region and a leader sequence of 30 amino acids. This sequence contains an apparent initiation Met, signal peptide, and propeptide which ends with an arginine-rich sequence that is characteristic of proteolytic processing sites found in the pro form of protein precursors. The 3' untranslated region contains two polyadenylation signals and a poly(A)sequence. RNA blot analysis of total cellular RNA from the human hepatoma cell line HepG2 revealed a message size of ∼2.5 kb. Features of the 5' and 3' sequences and the message size suggest that a separate mRNA codes for α and argues against the occurrence of a single-chain precursor form of the disulfide-linked α-λ subunit found in mature C8. Analysis of the derived amino acid sequence revealed several membrane surface seeking domains and a possible transmembrane domain. Analysis of the carbohydrate composition indicates 1 or 2 asparagine-linked but no O-linked oligosaccharide chains, a result consistent with predictions from the amino acid sequence. Most significantly, it exhibits a striking overall homology to human C9, with values of 24% on the basis of identity and 46% when conserved substitutions are allowed. As described in an accompanying report this homology also extends to the β subunit of C8

  10. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing.

    Directory of Open Access Journals (Sweden)

    Jonas Binladen

    2007-02-01

    Full Text Available The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR reactions and subsequent sequencing runs have been unable to combine template DNA from multiple individuals, as homologous sequences cannot be subsequently assigned to their original sources.We use conventional PCR with 5'-nucleotide tagged primers to generate homologous DNA amplification products from multiple specimens, followed by sequencing through the high-throughput Genome Sequence 20 DNA Sequencing System (GS20, Roche/454 Life Sciences. Each DNA sequence is subsequently traced back to its individual source through 5'tag-analysis.We demonstrate that this new approach enables the assignment of virtually all the generated DNA sequences to the correct source once sequencing anomalies are accounted for (miss-assignment rate<0.4%. Therefore, the method enables accurate sequencing and assignment of homologous DNA sequences from multiple sources in single high-throughput GS20 run. We observe a bias in the distribution of the differently tagged primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution of the sequences as sorted by the second nucleotide of the dinucleotide tags. As the results are based on a single GS20 run, the general applicability of the approach requires confirmation. However, our experiments demonstrate that 5'primer tagging is a useful method in which the sequencing power of the GS20 can be applied to PCR-based assays of multiple homologous PCR products. The new approach will be of value to a broad range of research areas, such as those of comparative genomics, complete mitochondrial

  11. Species delimitation and phylogenetic reconstruction of the sinipercids (Perciformes: Sinipercidae) based on target enrichment of thousands of nuclear coding sequences.

    Science.gov (United States)

    Song, Shuli; Zhao, Jinliang; Li, Chenhong

    2017-06-01

    The sinipercids are freshwater fishes endemic to East Asia, mainly in China. Phylogenetic studies on the sinipercids have made great progress in the last decades, but interspecific relationships and evolutionary history of the sinipercids remain unresolved. Lack of distinctive morphological characters leads to problems in validating of some species, such as Siniperca loona. Moreover, genetic data are needed to delimitate species pairs with explicit hypothesis testing, such as in S. chuatsi vs. S. kneri and Coreoperca whiteheadi vs. C. liui. Here we reconstructed phylogeny of the sinipercids with an unprecedented scale of data, 16,943 loci of single-copy coding sequence data from nine sinipercid species, eight putative sister taxa and two outgroups. Targeted sequences were collected using gene enrichment and Illumina sequencing, yielding thousands of protein coding sequences and single nucleotide polymorphisms (SNPs) data. Maximum likelihood and coalescent species tree analyses resulted in identical and highly supported trees. We confirmed that the centrarchids are sister to the sinipercids. A monophyletic Sinipercidae with two genera, Siniperca and Coreoperca was also supported. Different from most previous studies, S. scherzeri was found as the most basal taxon to other species of Siniperca, which consists of two clades: a clade having S. roulei sister to S. chuatsi and S. kneri, and a clade consisting S. loona sister to S. obscura and S. undulata. We found that both S. loona and C. liui are valid species using Bayes factor delimitation (BFD ∗ ) based on SNPs data. Species delimitation also provided decisive support for S. chuatsi and S. kneri being two distinct species. We calibrated a chronogram of the sinipercids based on 100 loci and three fossil calibration points using BEAST, and reconstructed ancestral ranges of the sinipercids using Lagrange Analysis (DEC model) and Statistical Dispersal-Vicariance Analysis (S-DIVA) implemented in RASP. Divergence time

  12. Novel overlapping coding sequences in Chlamydia trachomatis

    DEFF Research Database (Denmark)

    Jensen, Klaus Thorleif; Petersen, Lise; Falk, Søren

    2006-01-01

    that are in agreement with the primary annotation. Forty two genes from the primary annotation are not predicted by EasyGene. The majority of these genes are listed as hypothetical in the primary annotation. The 15 novel predicted genes all overlap with genes on the complementary strand. We find homologues of several...... of the novel genes in C. trachomatis Serovar A and Chlamydia muridarum. Several of the genes have typical gene-like and protein-like features. Furthermore, we confirm transcriptional activity from 10 of the putative genes. The combined evidence suggests that at least seven of the 15 are protein coding genes...

  13. Low levels of PRB3 mRNA are associated with dopamine-agonist resistance and tumor recurrence in prolactinomas.

    Science.gov (United States)

    Wang, Fei; Gao, Hua; Li, Chuzhong; Bai, Jiwei; Lu, Runchun; Cao, Lei; Wu, Yongtu; Hong, Lichuan; Wu, Yonggang; Lan, Xiaolei; Zhang, Yazhuo

    2014-01-01

    Prolactinomas, or prolactin-secreting adenomas, constitute the most common type of hyperfunctioning pituitary adenoma. Dopamine agonists are used as first-line medication for prolactinomas, but the tumors are resistant to the therapy in 5-18 % of patients. To explore potential mechanisms of resistance to bromocriptine (a dopamine agonist), we analyzed six responsive prolactinomas and six resistant prolactinomas by whole-exome sequencing. We identified ten genes with sequence variants that were differentially found in the two groups of tumors. The expression of these genes was then quantified by real-time reverse-transcription PCR (RT-qPCR) in the 12 prolactinomas and in six normal pituitary glands. The mRNA levels of one of the genes, PRB3, were about fourfold lower in resistant prolactinomas than in the responsive tumors (p = 0.02). Furthermore, low PRB3 expression was also associated with tumor recurrence. Our results suggest that low levels of PRB3 mRNA may have a role in dopamine-agonist resistance and tumor recurrence of prolactinomas.

  14. Computer simulation of replacement sequences in copper

    International Nuclear Information System (INIS)

    Schiffgens, J.O.; Schwartz, D.W.; Ariyasu, R.G.; Cascadden, S.E.

    1978-01-01

    Results of computer simulations of , , and replacement sequences in copper are presented, including displacement thresholds, focusing energies, energy losses per replacement, and replacement sequence lengths. These parameters are tabulated for six interatomic potentials and shown to vary in a systematic way with potential stiffness and range. Comparisons of results from calculations made with ADDES, a quasi-dynamical code, and COMENT, a dynamical code, show excellent agreement, demonstrating that the former can be calibrated and used satisfactorily in the analysis of low energy displacement cascades. Upper limits on , , and replacement sequences were found to be approximately 10, approximately 30, and approximately 14 replacements, respectively. (author)

  15. Tentative mapping of transcription-induced interchromosomal interaction using chimeric EST and mRNA data.

    Directory of Open Access Journals (Sweden)

    Per Unneberg

    Full Text Available Recent studies on chromosome conformation show that chromosomes colocalize in the nucleus, bringing together active genes in transcription factories. This spatial proximity of actively transcribing genes could provide a means for RNA interaction at the transcript level. We have screened public databases for chimeric EST and mRNA sequences with the intent of mapping transcription-induced interchromosomal interactions. We suggest that chimeric transcripts may be the result of close encounters of active genes, either as functional products or "noise" in the transcription process, and that they could be used as probes for chromosome interactions. We have found a total of 5,614 chimeric ESTs and 587 chimeric mRNAs that meet our selection criteria. Due to their higher quality, the mRNA findings are of particular interest and we hope that they may serve as food for thought for specialists in diverse areas of molecular biology.

  16. The nucleotide sequences of two leghemoglobin genes from soybean

    DEFF Research Database (Denmark)

    Wiborg, O; Hyldig-Nielsen, J J; Jensen, E O

    1982-01-01

    We present the complete nucleotide sequences of two leghemoglobin genes isolated from soybean DNA. Both genes contain three intervening sequences in identical positions. Comparison of the coding sequences with known amino-acid sequences of soybean leghemoglobins suggest that the two genes...

  17. The Vulnerability Assessment Code for Physical Protection System

    International Nuclear Information System (INIS)

    Jang, Sung Soon; Yoo, Ho Sik

    2007-01-01

    To neutralize the increasing terror threats, nuclear facilities have strong physical protection system (PPS). PPS includes detectors, door locks, fences, regular guard patrols, and a hot line to a nearest military force. To design an efficient PPS and to fully operate it, vulnerability assessment process is required. Evaluating PPS of a nuclear facility is complicate process and, hence, several assessment codes have been developed. The estimation of adversary sequence interruption (EASI) code analyzes vulnerability along a single intrusion path. To evaluate many paths to a valuable asset in an actual facility, the systematic analysis of vulnerability to intrusion (SAVI) code was developed. KAERI improved SAVI and made the Korean analysis of vulnerability to intrusion (KAVI) code. Existing codes (SAVI and KAVI) have limitations in representing the distance of a facility because they use the simplified model of a PPS called adversary sequence diagram. In adversary sequence diagram the position of doors, sensors and fences is described just as the locating area. Thus, the distance between elements is inaccurate and we cannot reflect the range effect of sensors. In this abstract, we suggest accurate and intuitive vulnerability assessment based on raster map modeling of PPS. The raster map of PPS accurately represents the relative position of elements and, thus, the range effect of sensor can be easily incorporable. Most importantly, the raster map is easy to understand

  18. Motion Detection in Ultrasound Image-Sequences Using Tensor Voting

    Science.gov (United States)

    Inba, Masafumi; Yanagida, Hirotaka; Tamura, Yasutaka

    2008-05-01

    Motion detection in ultrasound image sequences using tensor voting is described. We have been developing an ultrasound imaging system adopting a combination of coded excitation and synthetic aperture focusing techniques. In our method, frame rate of the system at distance of 150 mm reaches 5000 frame/s. Sparse array and short duration coded ultrasound signals are used for high-speed data acquisition. However, many artifacts appear in the reconstructed image sequences because of the incompleteness of the transmitted code. To reduce the artifacts, we have examined the application of tensor voting to the imaging method which adopts both coded excitation and synthetic aperture techniques. In this study, the basis of applying tensor voting and the motion detection method to ultrasound images is derived. It was confirmed that velocity detection and feature enhancement are possible using tensor voting in the time and space of simulated ultrasound three-dimensional image sequences.

  19. In vitro detection of mdr1 mRNA in murine leukemia cells with 111In-labeled oligonucleotide

    International Nuclear Information System (INIS)

    Bai Jingming; Yokoyama, Kunihiko; Kinuya, Seigo; Michigishi, Takatoshi; Tonami, Norihisa; Shiba, Kazuhiro; Matsushita, Ryo; Nomura, Masaaki

    2004-01-01

    The feasibility of intracellular mdr1 mRNA expression detection with radiolabeled antisense oligonucleotide (ODN) was investigated in the murine leukemia cell line, P388/S, and its subclonal, adriamycin-resistant cell line, P388/R. The expression level of mdr1 mRNA was analyzed by reverse transcription-polymerase chain reaction (RT-PCR). Existence of the multidrug resistance (MDR) phenomenon was assessed via cellular uptake of 99m Tc-sestamibi (MIBI), a known substrate for P-glycoprotein. A 15-mer phosphorothioate antisense ODN complementary to the sequences located at -1 to 14 of mdr1 mRNA and its corresponding sense ODN were conjugated with the cyclic anhydride of diethylene triamine penta-acetic acid (cDTPA) via an amino group linked to the terminal phosphate at the 5' end at pH 8-9. The DTPA-ODN complexes at concentrations of 0.1-17.4 μMwere reacted with 111 InCl 3 at pH 5 for 1 h. The hybridization affinity of labeled ODN was evaluated with size-exclusion high-performance liquid chromatography following incubation with the complementary sequence. Cellular uptake of labeled ODN was examined in vitro. Furthermore, enhancing effects of synthetic lipid carriers (Transfast) on transmembrane delivery of ODN were assessed. P388/R cells displayed intense mdr1 mRNA expression in comparison with P388/S cells. 99m Tc-MIBI uptake in P388/S cells was higher than that in P388/R cells. Specific radioactivity up to 1,634 MBq/nmol was achieved via elevation of added radioactivity relative to ODN molar amount. The hybridization affinity of antisense 111 In-ODN was preserved at approximately 85% irrespective of specific activity. Cellular uptake of antisense 111 In-ODN did not differ from that of sense 111 In-ODN in either P388/S cells or P388/R cells. However, lipid carrier incorporation significantly increased transmembrane delivery of 111 In-ODN; moreover, specific uptake of antisense 111 In-ODN was demonstrated in P388/R cells. Radiolabeling of ODN at high specific

  20. Coding chaotic billiards. Pt. 3

    International Nuclear Information System (INIS)

    Ullmo, D.; Giannoni, M.J.

    1993-01-01

    Non-tiling compact billiard defined on the pseudosphere is studied 'a la Morse coding'. As for most bounded systems, the coding is non exact. However, two sets of approximate grammar rules can be obtained, one specifying forbidden codes, and the other allowed ones. In-between some sequences remain in the 'unknown' zone, but their relative amount can be reduced to zero if one lets the length of the approximate grammar rules goes to infinity. The relationship between these approximate grammar rules and the 'pruning front' introduced by Cvitanovic et al. is discussed. (authors). 13 refs., 10 figs., 1 tab

  1. The use of coded PCR primers enables high-throughput sequencing of multiple homolog amplification products by 454 parallel sequencing

    DEFF Research Database (Denmark)

    Binladen, Jonas; Gilbert, M Thomas P; Bollback, Jonathan P

    2007-01-01

    BACKGROUND: The invention of the Genome Sequence 20 DNA Sequencing System (454 parallel sequencing platform) has enabled the rapid and high-volume production of sequence data. Until now, however, individual emulsion PCR (emPCR) reactions and subsequent sequencing runs have been unable to combine...... primers that is dependent on the 5' nucleotide of the tag. In particular, primers 5' labelled with a cytosine are heavily overrepresented among the final sequences, while those 5' labelled with a thymine are strongly underrepresented. A weaker bias also exists with regards to the distribution...

  2. Partial sequence homogenization in the 5S multigene families may generate sequence chimeras and spurious results in phylogenetic reconstructions.

    Science.gov (United States)

    Galián, José A; Rosato, Marcela; Rosselló, Josep A

    2014-03-01

    Multigene families have provided opportunities for evolutionary biologists to assess molecular evolution processes and phylogenetic reconstructions at deep and shallow systematic levels. However, the use of these markers is not free of technical and analytical challenges. Many evolutionary studies that used the nuclear 5S rDNA gene family rarely used contiguous 5S coding sequences due to the routine use of head-to-tail polymerase chain reaction primers that are anchored to the coding region. Moreover, the 5S coding sequences have been concatenated with independent, adjacent gene units in many studies, creating simulated chimeric genes as the raw data for evolutionary analysis. This practice is based on the tacitly assumed, but rarely tested, hypothesis that strict intra-locus concerted evolution processes are operating in 5S rDNA genes, without any empirical evidence as to whether it holds for the recovered data. The potential pitfalls of analysing the patterns of molecular evolution and reconstructing phylogenies based on these chimeric genes have not been assessed to date. Here, we compared the sequence integrity and phylogenetic behavior of entire versus concatenated 5S coding regions from a real data set obtained from closely related plant species (Medicago, Fabaceae). Our results suggest that within arrays sequence homogenization is partially operating in the 5S coding region, which is traditionally assumed to be highly conserved. Consequently, concatenating 5S genes increases haplotype diversity, generating novel chimeric genotypes that most likely do not exist within the genome. In addition, the patterns of gene evolution are distorted, leading to incorrect haplotype relationships in some evolutionary reconstructions.

  3. GnRH mRNA levels in male three-spined sticklebacks, Gasterosteus aculeatus, under different reproductive conditions.

    Science.gov (United States)

    Shao, Yi Ta; Tseng, Yung Che; Chang, Chia-Hao; Yan, Hong Young; Hwang, Pung Pung; Borg, Bertil

    2015-02-01

    In vertebrates, reproduction is regulated by the brain-pituitary-gonad (BPG) axis, where the gonadotropin-releasing hormone (GnRH) is one of the key components. However, very little is known about the possible role of GnRH in the environmental and feedback control of fish reproduction. To investigate this, full-length gnrh2 (chicken GnRH II) and gnrh3 (salmon GnRH) sequences of male three-spined sticklebacks (Gasterosteus aculeatus), which are clustered with the taxa of the same GnRH type as other Euteleostei, were cloned and annotated. gnrh1 is absent in this species. The mRNA levels of gnrh2 and gnrh3 in the sticklebacks' brain were measured under breeding and post-breeding conditions as well as in castrated and sham-operated breeding fish and castrated/sham-operated fish kept under long-day (LD 16:8) and short-day (LD 8:16) conditions. Fully breeding males had considerably higher mRNA levels of gnrh2 and gnrh3 in the thalamus (Th) and in the telencephalon and preoptic area (T+POA), respectively, than post-breeding males. Sham-operated breeding males have higher gnrh3 mRNA levels than the corresponding castrated males. Moreover, higher gnrh2 mRNA levels in the Th and higher gnrh3 mRNA levels in the T+POA and hypothalamus (HypTh) were also found in long-day sham-operated males than in sham-operated fish kept under an inhibitory short day photoperiod. Nevertheless, gnrh2 and gnrh3 mRNA levels were not up-regulated in castrated males kept under long-day photoperiod, which suggests that positive feedbacks on the brain-pituitary-gonad axis are necessary for this response. Copyright © 2014 Elsevier Inc. All rights reserved.

  4. High Brain Ammonia Tolerance and Down-Regulation of Na+:K+:2Cl- Cotransporter 1b mRNA and Protein Expression in the Brain of the Swamp Eel, Monopterus albus, Exposed to Environmental Ammonia or Terrestrial Conditions

    Science.gov (United States)

    Ip, Yuen K.; Hou, Zhisheng; Chen, Xiu L.; Ong, Jasmine L. Y.; Chng, You R.; Ching, Biyun; Hiong, Kum C.; Chew, Shit F.

    2013-01-01

    Na+:K+:2Cl- cotransporter 1 (NKCC1) has been implicated in mediating ischemia-, trauma- or ammonia-induced astrocyte swelling/brain edema in mammals. This study aimed to determine the effects of ammonia or terrestrial exposure on ammonia concentrations in the plasma and brain, and the mRNA expression and protein abundance of nkcc/Nkcc in the brain, of the swamp eel Monopterus albus . Ammonia exposure led to a greater increase in the ammonia concentration in the brain of M. albus than terrestrial exposure. The brain ammonia concentration of M. albus reached 4.5 µmol g-1 and 2.7 µmol g-1 after 6 days of exposure to 50 mmol l-1 NH4Cl and terrestrial conditions, respectively. The full cDNA coding sequence of nkcc1b from M. albus brain comprised 3276 bp and coded for 1092 amino acids with an estimated molecular mass of 119.6 kDa. A molecular characterization indicated that it could be activated through phosphorylation and/or glycosylation by osmotic and/or oxidative stresses. Ammonia exposure for 1 day or 6 days led to significant decreases in the nkcc1b mRNA expression and Nkcc1b protein abundance in the brain of M. albus. In comparison, a significant decrease in nkcc1b mRNA expression was observed in the brain of M. albus only after 6 days of terrestrial exposure, but both 1 day and 6 days of terrestrial exposure resulted in significant decreases in the protein abundance of Nkcc1b. These results are novel because it has been established in mammals that ammonia up-regulates NKCC1 expression in astrocytes and NKCC1 plays an important role in ammonia-induced astrocyte swelling and brain edema. By contrast, our results indicate for the first time that M. albus is able to down-regulate the mRNA and protein expression of nkcc1b/Nkcc1b in the brain when confronted with ammonia toxicity, which could be one of the contributing factors to its extraordinarily high brain ammonia tolerance. PMID:24069137

  5. Search for antisense copies of beta-globin mRNA in anemic mouse spleen

    Directory of Open Access Journals (Sweden)

    Taylor John M

    2001-03-01

    Full Text Available Abstract Background Previous studies by Volloch and coworkers have reported that during the expression of high levels of β-globin mRNA in the spleen of anemic mice, they could also detect small but significant levels of an antisense (AS globin RNA species, which they postulated might have somehow arisen by RNA-directed RNA synthesis. For two reasons we undertook to confirm and possibly extend these studies. First, previous studies in our lab have focussed on what is an unequivocal example of host RNA-directed RNA polymerase activity on the RNA genome of human hepatitis delta virus. Second, if AS globin species do exist they could in turn form double-stranded RNA species which might induce post-transcriptional gene silencing, a phenomenon somehow provoked in eukaryotic cells by AS RNA sequences. Results We reexamined critical aspects of the previous globin studies. We used intraperitoneal injections of phenylhydrazine to induce anemia in mice, as demonstrated by the appearance and ultimate disappearance of splenomegaly. While a 30-fold increase in globin mRNA was detected in the spleen, the relative amount of putative AS RNA could be no more than 0.004%. Conclusions Contrary to earlier reports, induction of a major increase in globin transcripts in the mouse spleen was not associated with a detectable level of antisense RNA to globin mRNA.

  6. Performance enhancement of successive interference cancellation scheme based on spectral amplitude coding for optical code-division multiple-access systems using Hadamard codes

    Science.gov (United States)

    Eltaif, Tawfig; Shalaby, Hossam M. H.; Shaari, Sahbudin; Hamarsheh, Mohammad M. N.

    2009-04-01

    A successive interference cancellation scheme is applied to optical code-division multiple-access (OCDMA) systems with spectral amplitude coding (SAC). A detailed analysis of this system, with Hadamard codes used as signature sequences, is presented. The system can easily remove the effect of the strongest signal at each stage of the cancellation process. In addition, simulation of the prose system is performed in order to validate the theoretical results. The system shows a small bit error rate at a large number of active users compared to the SAC OCDMA system. Our results reveal that the proposed system is efficient in eliminating the effect of the multiple-user interference and in the enhancement of the overall performance.

  7. mRNA related to insulin family in human placenta

    International Nuclear Information System (INIS)

    Younes, M.A.; D'Agostino, J.B.; Frazier, M.L.; Besch, P.K.

    1986-01-01

    The authors have previously reported that human term placenta contains mRNA displaying sequence homology to a rat preproinsulin I cDNA clone (p119). When placental poly(A + ) RNA was analyzed for homology to p119 by RNA/DNA blot hybridization, prominent hybridization was observed which was found by densitometric analysis to be three-fold higher than control. To further characterize this insulin-like message, a cDNA library was generated (approx.7000 transformants) using normal term cesarean-sectioned tissue to prepare placental poly(A + ) RNA templates. Five hundred transformants were initially screened by colony hybridization using a 32 P-labeled rat preproinsulin I cDNA as probe. Of the ten initial positives obtained, three were found to be true positives based on Southern hybridization analyses of the recombinant plasmids. Using Taq I digested pBr322 as a size marker, the cDNAs were found to be approximately 300 bp in length. Preliminary DNA sequencing using the Sanger dideoxy chain termination method has revealed that one of these clones displays significant homology to the 5' region of human insulin-like growth factors I and II

  8. mRNA related to insulin family in human placenta

    Energy Technology Data Exchange (ETDEWEB)

    Younes, M.A.; D' Agostino, J.B.; Frazier, M.L.; Besch, P.K.

    1986-03-01

    The authors have previously reported that human term placenta contains mRNA displaying sequence homology to a rat preproinsulin I cDNA clone (p119). When placental poly(A/sup +/) RNA was analyzed for homology to p119 by RNA/DNA blot hybridization, prominent hybridization was observed which was found by densitometric analysis to be three-fold higher than control. To further characterize this insulin-like message, a cDNA library was generated (approx.7000 transformants) using normal term cesarean-sectioned tissue to prepare placental poly(A/sup +/) RNA templates. Five hundred transformants were initially screened by colony hybridization using a /sup 32/P-labeled rat preproinsulin I cDNA as probe. Of the ten initial positives obtained, three were found to be true positives based on Southern hybridization analyses of the recombinant plasmids. Using Taq I digested pBr322 as a size marker, the cDNAs were found to be approximately 300 bp in length. Preliminary DNA sequencing using the Sanger dideoxy chain termination method has revealed that one of these clones displays significant homology to the 5' region of human insulin-like growth factors I and II.

  9. Integrating microRNA and mRNA expression profiles in response to radiation-induced injury in rat lung

    International Nuclear Information System (INIS)

    Xie, Ling; Zhou, Jundong; Zhang, Shuyu; Chen, Qing; Lai, Rensheng; Ding, Weiqun; Song, ChuanJun; Meng, XingJun; Wu, Jinchang

    2014-01-01

    Exposure to radiation provokes cellular responses, which are likely regulated by gene expression networks. MicroRNAs are small non-coding RNAs, which regulate gene expression by promoting mRNA degradation or inhibiting protein translation. The expression patterns of both mRNA and miRNA during the radiation-induced lung injury (RILI) remain less characterized and the role of miRNAs in the regulation of this process has not been studied. The present study sought to evaluate miRNA and mRNA expression profiles in the rat lung after irradiation. Male Wistar rats were subjected to single dose irradiation with 20 Gy using 6 MV x-rays to the right lung. (A dose rate of 5 Gy/min was applied). Rats were sacrificed at 3, 12 and 26 weeks after irradiation, and morphological changes in the lung were examined by haematoxylin and eosin. The miRNA and mRNA expression profiles were evaluated by microarrays and followed by quantitative RT-PCR analysis. A cDNA microarray analysis found 2183 transcripts being up-regulated and 2917 transcripts down-regulated (P ≤ 0.05, ≥2.0 fold change) in the lung tissues after irradiation. Likewise, a miRNAs microarray analysis indicated 15 miRNA species being up-regulated and 8 down-regulated (P ≤ 0.05). Subsequent bioinformatics anal -yses of the differentially expressed mRNA and miRNAs revealed that alterations in mRNA expression following irradiation were negatively correlated with miRNAs expression. Our results provide evidence indicating that irradiation induces alterations of mRNA and miRNA expression in rat lung and that there is a negative correlation of mRNA and miRNA expression levels after irradiation. These findings significantly advance our understanding of the regulatory mechanisms underlying the pathophysiology of radiation-induced lung injury. In summary, RILI does not develop gradually in a linear process. In fact, different cell types interact via cytokines in a very complex network. Furthermore, this study suggests that

  10. Identification of a cytochrome P450 gene in the earthworm Eisenia fetida and its mRNA expression under enrofloxacin stress.

    Science.gov (United States)

    Li, Yinsheng; Zhao, Chun; Lu, Xiaoxu; Ai, Xiaojie; Qiu, Jiangping

    2018-04-15

    Cytochrome P450 (CYP450) enzymes are a family of hemoproteins primarily responsible for detoxification functions. Earthworms have been used as a bioindicator of soil pollution in numerous studies, but no CYP450 gene has so far been cloned. RT-PCR and RACE-PCR were employed to construct and sequence the CYP450 gene DNA from the extracted mRNA in the earthworm Eisenia fetida. The cloned gene (EW1) has an open reading frame of 477bp. The 3'-terminal region contained both the consensus and the signature sequences characteristic of CYP450. It was closely related to the CYP450 gene from the flatworm genus Opisthorchis felineus with 87% homology. The predicted structure of the putative protein was 97% homologous to human CYP450 family 27. This gene has been deposited in GenBank (accession no. KM881474). Earthworms (E. fetida) were then exposed to 1, 10, 100, and 500mgkg -1 enrofloxacin in soils to explore the mRNA expression by real time qPCR. The effect of enrofloxacin on mRNA expression levels of EW1 exhibited a marked hormesis pattern across the enrofloxacin dose range tested. This is believed to be the first reported CYP450 gene in earthworms, with reference value for molecular studies on detoxification processes in earthworms. Copyright © 2017 Elsevier Inc. All rights reserved.

  11. Coding Local and Global Binary Visual Features Extracted From Video Sequences

    Science.gov (United States)

    Baroffio, Luca; Canclini, Antonio; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

    2015-11-01

    Binary local features represent an effective alternative to real-valued descriptors, leading to comparable results for many visual analysis tasks, while being characterized by significantly lower computational complexity and memory requirements. When dealing with large collections, a more compact representation based on global features is often preferred, which can be obtained from local features by means of, e.g., the Bag-of-Visual-Word (BoVW) model. Several applications, including for example visual sensor networks and mobile augmented reality, require visual features to be transmitted over a bandwidth-limited network, thus calling for coding techniques that aim at reducing the required bit budget, while attaining a target level of efficiency. In this paper we investigate a coding scheme tailored to both local and global binary features, which aims at exploiting both spatial and temporal redundancy by means of intra- and inter-frame coding. In this respect, the proposed coding scheme can be conveniently adopted to support the Analyze-Then-Compress (ATC) paradigm. That is, visual features are extracted from the acquired content, encoded at remote nodes, and finally transmitted to a central controller that performs visual analysis. This is in contrast with the traditional approach, in which visual content is acquired at a node, compressed and then sent to a central unit for further processing, according to the Compress-Then-Analyze (CTA) paradigm. In this paper we experimentally compare ATC and CTA by means of rate-efficiency curves in the context of two different visual analysis tasks: homography estimation and content-based retrieval. Our results show that the novel ATC paradigm based on the proposed coding primitives can be competitive with CTA, especially in bandwidth limited scenarios.

  12. Identification of microRNAs and their targets in Finger millet by high throughput sequencing.

    Science.gov (United States)

    Usha, S; Jyothi, M N; Sharadamma, N; Dixit, Rekha; Devaraj, V R; Nagesh Babu, R

    2015-12-15

    MicroRNAs are short non-coding RNAs which play an important role in regulating gene expression by mRNA cleavage or by translational repression. The majority of identified miRNAs were evolutionarily conserved; however, others expressed in a species-specific manner. Finger millet is an important cereal crop; nonetheless, no practical information is available on microRNAs to date. In this study, we have identified 95 conserved microRNAs belonging to 39 families and 3 novel microRNAs by high throughput sequencing. For the identified conserved and novel miRNAs a total of 507 targets were predicted. 11 miRNAs were validated and tissue specificity was determined by stem loop RT-qPCR, Northern blot. GO analyses revealed targets of miRNA were involved in wide range of regulatory functions. This study implies large number of known and novel miRNAs found in Finger millet which may play important role in growth and development. Copyright © 2015 Elsevier B.V. All rights reserved.

  13. RNA-DNA sequence differences spell genetic code ambiguities

    DEFF Research Database (Denmark)

    Bentin, Thomas; Nielsen, Michael L

    2013-01-01

    A recent paper in Science by Li et al. 2011(1) reports widespread sequence differences in the human transcriptome between RNAs and their encoding genes termed RNA-DNA differences (RDDs). The findings could add a new layer of complexity to gene expression but the study has been criticized. ...

  14. The Coding and Effector Transfer of Movement Sequences

    Science.gov (United States)

    Kovacs, Attila J.; Muhlbauer, Thomas; Shea, Charles H.

    2009-01-01

    Three experiments utilizing a 14-element arm movement sequence were designed to determine if reinstating the visual-spatial coordinates, which require movements to the same spatial locations utilized during acquisition, results in better effector transfer than reinstating the motor coordinates, which require the same pattern of homologous muscle…

  15. Whole-Exome Sequencing of 2,000 Danish Individuals and the Role of Rare Coding Variants in Type 2 Diabetes

    DEFF Research Database (Denmark)

    Lohmueller, Kirk E.; Sparsø, Thomas; Li, Qibin

    2013-01-01

    number of genes. We applied a series of gene-based tests to detect such susceptibility genes. However, no gene showed a significant association with disease risk after we corrected for the number of genes analyzed. Thus, we could reject a model for the genetic architecture of type 2 diabetes where rare......It has been hypothesized that, in aggregate, rare variants in coding regions of genes explain a substantial fraction of the heritability of common diseases. We sequenced the exomes of 1,000 Danish cases with common forms of type 2 diabetes (including body mass index > 27.5 kg/m2 and hypertension...

  16. Optimized Method for Generating and Acquiring GPS Gold Codes

    Directory of Open Access Journals (Sweden)

    Khaled Rouabah

    2015-01-01

    Full Text Available We propose a simpler and faster Gold codes generator, which can be efficiently initialized to any desired code, with a minimum delay. Its principle consists of generating only one sequence (code number 1 from which we can produce all the other different signal codes. This is realized by simply shifting this sequence by different delays that are judiciously determined by using the bicorrelation function characteristics. This is in contrast to the classical Linear Feedback Shift Register (LFSR based Gold codes generator that requires, in addition to the shift process, a significant number of logic XOR gates and a phase selector to change the code. The presence of all these logic XOR gates in classical LFSR based Gold codes generator provokes the consumption of an additional time in the generation and acquisition processes. In addition to its simplicity and its rapidity, the proposed architecture, due to the total absence of XOR gates, has fewer resources than the conventional Gold generator and can thus be produced at lower cost. The Digital Signal Processing (DSP implementations have shown that the proposed architecture presents a solution for acquiring Global Positioning System (GPS satellites signals optimally and in a parallel way.

  17. Identification of a Flavivirus Sequence in a Marine Arthropod.

    Directory of Open Access Journals (Sweden)

    Michael J Conway

    Full Text Available Phylogenetic analysis has yet to uncover the early origins of flaviviruses. In this study, I mined a database of expressed sequence tags in order to discover novel flavivirus sequences. Flavivirus sequences were identified in a pool of mRNA extracted from the sea spider Endeis spinosa (Pycnogonida, Pantopoda. Reconstruction of the translated sequences and BLAST analysis matched the sequence to the flavivirus NS5 gene. Additional sequences corresponding to envelope and the NS5 MTase domain were also identified. Phylogenetic analysis of homologous NS5 sequences revealed that Endeis spinosa NS5 (ESNS5 is likely related to classical insect-specific flaviviruses. It is unclear if ESNS5 represents genetic material from an active viral infection or an integrated viral genome. These data raise the possibility that classical insect-specific flaviviruses and perhaps medically relevant flaviviruses, evolved from progenitors that infected marine arthropods.

  18. Molecular cloning and distribution of oxytocin/vasopressin-like mRNA in the blue swimming crab, Portunus pelagicus, and its inhibitory effect on ovarian steroid release.

    Science.gov (United States)

    Saetan, Jirawat; Kruangkum, Thanapong; Phanthong, Phetcharat; Tipbunjong, Chittipong; Udomuksorn, Wandee; Sobhon, Prasert; Sretarugsa, Prapee

    2018-04-01

    This study was aimed to characterize the full length of mRNA of oxytocin/vasopressin (OT/VP)-like mRNA in female Portunus pelagicus (PpelOT/VP-like mRNA) using a partial PpelOT/VP-like sequence obtained previously in our transcriptome analysis (Saetan, 2014) to construct the primers. The PpelOT/VP-like mRNA was 626 bp long and it encoded the preprohormones containing 158 amino acids. This preprohormone consisted of a signal peptide, an active nonapeptide (CFITNCPPG) followed by the dibasic cleavage site (GKR), and the neurophysin domain. Sequence alignment of the PpelOT/VP-like peptide with those of other animals revealed strong molecular conservation. Phylogenetic analysis of encoded proteins revealed that the PpelOT/VP-like peptide was clustered within the group of crustacean OT/VP-like peptide. Analysis by RT-PCR revealed the expression of mRNA transcripts in the eyestalk, brain, ventral nerve cord (VNC), ovary, intestine and gill. The in situ hybridization demonstrated the cellular localizations of the transcripts in the central nervous system (CNS) and ovary tissues. In the eyestalk, the mRNA expression was observed in the neuronal clusters 1-5 but not in the sinus gland complex. In the brain and the VNC, the transcripts were detected in all neuronal clusters but not in the glial cell. In the ovary, the transcripts were found in all stages of oocytes (Oc1, Oc2, Oc3, and Oc4). In addition, synthetic PpelOT/VP-like peptide could inhibit steroid release from the ovary. The knowledge gained from this study will provide more understanding on neuro-endocrinological controls in this crab species. Copyright © 2018 Elsevier Inc. All rights reserved.

  19. SequenceL: Automated Parallel Algorithms Derived from CSP-NT Computational Laws

    Science.gov (United States)

    Cooke, Daniel; Rushton, Nelson

    2013-01-01

    With the introduction of new parallel architectures like the cell and multicore chips from IBM, Intel, AMD, and ARM, as well as the petascale processing available for highend computing, a larger number of programmers will need to write parallel codes. Adding the parallel control structure to the sequence, selection, and iterative control constructs increases the complexity of code development, which often results in increased development costs and decreased reliability. SequenceL is a high-level programming language that is, a programming language that is closer to a human s way of thinking than to a machine s. Historically, high-level languages have resulted in decreased development costs and increased reliability, at the expense of performance. In recent applications at JSC and in industry, SequenceL has demonstrated the usual advantages of high-level programming in terms of low cost and high reliability. SequenceL programs, however, have run at speeds typically comparable with, and in many cases faster than, their counterparts written in C and C++ when run on single-core processors. Moreover, SequenceL is able to generate parallel executables automatically for multicore hardware, gaining parallel speedups without any extra effort from the programmer beyond what is required to write the sequen tial/singlecore code. A SequenceL-to-C++ translator has been developed that automatically renders readable multithreaded C++ from a combination of a SequenceL program and sample data input. The SequenceL language is based on two fundamental computational laws, Consume-Simplify- Produce (CSP) and Normalize-Trans - pose (NT), which enable it to automate the creation of parallel algorithms from high-level code that has no annotations of parallelism whatsoever. In our anecdotal experience, SequenceL development has been in every case less costly than development of the same algorithm in sequential (that is, single-core, single process) C or C++, and an order of magnitude less

  20. ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS

    Directory of Open Access Journals (Sweden)

    Alves-Ferreira Marcelo

    2008-09-01

    Full Text Available Abstract Background Genome survey sequences (GSS offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454-reads of an Escheria coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties

  1. Differential expression of the human thymosin-β4 gene in lymphocytes, macrophages, and granulocytes

    International Nuclear Information System (INIS)

    Gondo, H.; Kudo, J.; White, J.W.; Barr, C.; Selvanayagam, P.; Saunders, G.F.

    1987-01-01

    A cDNA clone encoding human thymosin-β 4 was isolated from a cDNA library prepared from peripheral blood leukocytes of a patient with acute lymphocytic leukemia. This clone contained the entire coding sequence of 43 amino acid residues of thymosin-β 4 and had an initiation codon and two termination codons. The amino acid and nucleotide sequences in the coding region were well conserved between rat and human. No signal peptide was found in the deduced protein sequence. Human thymosin-β 4 mRNA, approximately 830 nucleotides in length, was about 30 nucleotides larger than rat thymosin-β 4 mRNA. Expression of the human thymosin-β 4 gene in various primary myeloid and lymphoid malignant cells and in a few human hemopoietic cell lines was studied. Northern blot analyses of different neoplastic B lymphocytes revealed that steady state levels of thymosin-β 4 mRNA varied as a function of differentiation stage. Thymosin-β 4 mRNA levels were decreased in myeloma cells as are class II human leukocyte antigen, Fc receptor, and complement receptor, suggesting a relationship between thymosin-β 4 and the immune response. Treatment of THP-1 cells, a human monocytic cell line, with recombinant human interferon-γ reduced the levels of thymosin-β 4 mRNA. The pattern of thymosin-β 4 gene expression suggests that it may play a fundamental role in the host defense mechanism

  2. Sequence features and phylogenetic analysis of the stress protein Hsp90α in chinook salmon Oncorhynchus tshawytscha, a poikilothermic vertebrate

    Science.gov (United States)

    Palmisano, Aldo N.; Winton, James R.; Dickhoff, Walton W.

    1999-01-01

    We cloned and sequenced a chinook salmon Hsp90 cDNA; sequence analysis shows it to be Hsp90??. Phylogenetic analysis supports the hypothesis that ?? and ?? paralogs of Hsp90 arose as a result of a gene duplication event and that they diverged early in the evolution of vertebrates, before tetrapods separated from the teleost lineage. Among several differences distinguishing poikilothermic Hsp90?? sequences from their bird and mammal orthologs, the teleost versions specifically lack a characteristic QTQDQP phosphorylation site near the N-terminus. We used the cDNA to develop an RNA (Northern) blot to quantify cellular Hsp90 mRNA levels. Chinook salmon embryonic (CHSE-214) cells responded to heat shock with a rapid rise in Hsp90 mRNA through 4 h, followed by a gradual decline over the next 20 h. Hsp90 mRNA level may be useful as a stress indicator, especially in a laboratory setting or in response to acute heat stress.

  3. An automated annotation tool for genomic DNA sequences using

    Indian Academy of Sciences (India)

    Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated ...

  4. Advanced colorectal adenoma related gene expression signature may predict prognostic for colorectal cancer patients with adenoma-carcinoma sequence.

    Science.gov (United States)

    Li, Bing; Shi, Xiao-Yu; Liao, Dai-Xiang; Cao, Bang-Rong; Luo, Cheng-Hua; Cheng, Shu-Jun

    2015-01-01

    There are still no absolute parameters predicting progression of adenoma into cancer. The present study aimed to characterize functional differences on the multistep carcinogenetic process from the adenoma-carcinoma sequence. All samples were collected and mRNA expression profiling was performed by using Agilent Microarray high-throughput gene-chip technology. Then, the characteristics of mRNA expression profiles of adenoma-carcinoma sequence were described with bioinformatics software, and we analyzed the relationship between gene expression profiles of adenoma-adenocarcinoma sequence and clinical prognosis of colorectal cancer. The mRNA expressions of adenoma-carcinoma sequence were significantly different between high-grade intraepithelial neoplasia group and adenocarcinoma group. The biological process of gene ontology function enrichment analysis on differentially expressed genes between high-grade intraepithelial neoplasia group and adenocarcinoma group showed that genes enriched in the extracellular structure organization, skeletal system development, biological adhesion and itself regulated growth regulation, with the P value after FDR correction of less than 0.05. In addition, IPR-related protein mainly focused on the insulin-like growth factor binding proteins. The variable trends of gene expression profiles for adenoma-carcinoma sequence were mainly concentrated in high-grade intraepithelial neoplasia and adenocarcinoma. The differentially expressed genes are significantly correlated between high-grade intraepithelial neoplasia group and adenocarcinoma group. Bioinformatics analysis is an effective way to study the gene expression profiles in the adenoma-carcinoma sequence, and may provide an effective tool to involve colorectal cancer research strategy into colorectal adenoma or advanced adenoma.

  5. Genomic sequence around butterfly wing development genes: annotation and comparative analysis.

    Directory of Open Access Journals (Sweden)

    Inês C Conceição

    Full Text Available BACKGROUND: Analysis of genomic sequence allows characterization of genome content and organization, and access beyond gene-coding regions for identification of functional elements. BAC libraries, where relatively large genomic regions are made readily available, are especially useful for species without a fully sequenced genome and can increase genomic coverage of phylogenetic and biological diversity. For example, no butterfly genome is yet available despite the unique genetic and biological properties of this group, such as diversified wing color patterns. The evolution and development of these patterns is being studied in a few target species, including Bicyclus anynana, where a whole-genome BAC library allows targeted access to large genomic regions. METHODOLOGY/PRINCIPAL FINDINGS: We characterize ∼1.3 Mb of genomic sequence around 11 selected genes expressed in B. anynana developing wings. Extensive manual curation of in silico predictions, also making use of a large dataset of expressed genes for this species, identified repetitive elements and protein coding sequence, and highlighted an expansion of Alcohol dehydrogenase genes. Comparative analysis with orthologous regions of the lepidopteran reference genome allowed assessment of conservation of fine-scale synteny (with detection of new inversions and translocations and of DNA sequence (with detection of high levels of conservation of non-coding regions around some, but not all, developmental genes. CONCLUSIONS: The general properties and organization of the available B. anynana genomic sequence are similar to the lepidopteran reference, despite the more than 140 MY divergence. Our results lay the groundwork for further studies of new interesting findings in relation to both coding and non-coding sequence: 1 the Alcohol dehydrogenase expansion with higher similarity between the five tandemly-repeated B. anynana paralogs than with the corresponding B. mori orthologs, and 2 the high

  6. Sequencing and characterization of the guppy (Poecilia reticulata transcriptome

    Directory of Open Access Journals (Sweden)

    Rodd F Helen

    2011-04-01

    Full Text Available Abstract Background Next-generation sequencing is providing researchers with a relatively fast and affordable option for developing genomic resources for organisms that are not among the traditional genetic models. Here we present a de novo assembly of the guppy (Poecilia reticulata transcriptome using 454 sequence reads, and we evaluate potential uses of this transcriptome, including detection of sex-specific transcripts and deployment as a reference for gene expression analysis in guppies and a related species. Guppies have been model organisms in ecology, evolutionary biology, and animal behaviour for over 100 years. An annotated transcriptome and other genomic tools will facilitate understanding the genetic and molecular bases of adaptation and variation in a vertebrate species with a uniquely well known natural history. Results We generated approximately 336 Mbp of mRNA sequence data from male brain, male body, female brain, and female body. The resulting 1,162,670 reads assembled into 54,921 contigs, creating a reference transcriptome for the guppy with an average read depth of 28×. We annotated nearly 40% of this reference transcriptome by searching protein and gene ontology databases. Using this annotated transcriptome database, we identified candidate genes of interest to the guppy research community, putative single nucleotide polymorphisms (SNPs, and male-specific expressed genes. We also showed that our reference transcriptome can be used for RNA-sequencing-based analysis of differential gene expression. We identified transcripts that, in juveniles, are regulated differently in the presence and absence of an important predator, Rivulus hartii, including two genes implicated in stress response. For each sample in the RNA-seq study, >50% of high-quality reads mapped to unique sequences in the reference database with high confidence. In addition, we evaluated the use of the guppy reference transcriptome for gene expression analyses in

  7. Coding Local and Global Binary Visual Features Extracted From Video Sequences.

    Science.gov (United States)

    Baroffio, Luca; Canclini, Antonio; Cesana, Matteo; Redondi, Alessandro; Tagliasacchi, Marco; Tubaro, Stefano

    2015-11-01

    Binary local features represent an effective alternative to real-valued descriptors, leading to comparable results for many visual analysis tasks while being characterized by significantly lower computational complexity and memory requirements. When dealing with large collections, a more compact representation based on global features is often preferred, which can be obtained from local features by means of, e.g., the bag-of-visual word model. Several applications, including, for example, visual sensor networks and mobile augmented reality, require visual features to be transmitted over a bandwidth-limited network, thus calling for coding techniques that aim at reducing the required bit budget while attaining a target level of efficiency. In this paper, we investigate a coding scheme tailored to both local and global binary features, which aims at exploiting both spatial and temporal redundancy by means of intra- and inter-frame coding. In this respect, the proposed coding scheme can conveniently be adopted to support the analyze-then-compress (ATC) paradigm. That is, visual features are extracted from the acquired content, encoded at remote nodes, and finally transmitted to a central controller that performs the visual analysis. This is in contrast with the traditional approach, in which visual content is acquired at a node, compressed and then sent to a central unit for further processing, according to the compress-then-analyze (CTA) paradigm. In this paper, we experimentally compare the ATC and the CTA by means of rate-efficiency curves in the context of two different visual analysis tasks: 1) homography estimation and 2) content-based retrieval. Our results show that the novel ATC paradigm based on the proposed coding primitives can be competitive with the CTA, especially in bandwidth limited scenarios.

  8. Expression profiles of mRNA and long noncoding RNA in the ovaries of letrozole-induced polycystic ovary syndrome rat model through deep sequencing.

    Science.gov (United States)

    Fu, Lu-Lu; Xu, Ying; Li, Dan-Dan; Dai, Xiao-Wei; Xu, Xin; Zhang, Jing-Shun; Ming, Hao; Zhang, Xue-Ying; Zhang, Guo-Qing; Ma, Ya-Lan; Zheng, Lian-Wen

    2018-05-30

    Polycystic ovary syndrome (PCOS) is one of the most common endocrine disorders in reproductive-aged women. However, the exact pathophysiology of PCOS remains largely unclear. We performed deep sequencing to investigate the mRNA and long noncoding RNA (lncRNA) expression profiles in the ovarian tissues of letrozole-induced PCOS rat model and control rats. A total of 2147 mRNAs and 158 lncRNAs were differentially expressed between the PCOS models and control. Gene ontology analysis indicated that differentially expressed mRNAs were associated with biological adhesion, reproduction, and metabolic process. Pathway analysis results indicated that these aberrantly expressed mRNAs were related to several specific signaling pathways, including insulin resistance, steroid hormone biosynthesis, PPAR signaling pathway, cell adhesion molecules, autoimmune thyroid disease, and AMPK signaling pathway. The relative expression levels of mRNAs and lncRNAs were validated through qRT-PCR. LncRNA-miRNA-mRNA network was constructed to explore ceRNAs involved in the PCOS model and were also verified by qRTPCR experiment. These findings may provide insight into the pathogenesis of PCOS and clues to find key diagnostic and therapeutic roles of lncRNA in PCOS. Copyright © 2018 Elsevier B.V. All rights reserved.

  9. How Changes in Anti-SD Sequences Would Affect SD Sequences in Escherichia coli and Bacillus subtilis.

    Science.gov (United States)

    Abolbaghaei, Akram; Silke, Jordan R; Xia, Xuhua

    2017-05-05

    The 3' end of the small ribosomal RNAs (ssu rRNA) in bacteria is directly involved in the selection and binding of mRNA transcripts during translation initiation via well-documented interactions between a Shine-Dalgarno (SD) sequence located upstream of the initiation codon and an anti-SD (aSD) sequence at the 3' end of the ssu rRNA. Consequently, the 3' end of ssu rRNA (3'TAIL) is strongly conserved among bacterial species because a change in the region may impact the translation of many protein-coding genes. Escherichia coli and Bacillus subtilis differ in their 3' ends of ssu rRNA, being GAUC ACCUCCUUA 3' in E. coli and GAUC ACCUCCUU UCU3' or GAUC ACCUCCUU UCUA3' in B. subtilis Such differences in 3'TAIL lead to species-specific SDs (designated SD Ec for E. coli and SD Bs for B. subtilis ) that can form strong and well-positioned SD/aSD pairing in one species but not in the other. Selection mediated by the species-specific 3'TAIL is expected to favor SD Bs against SD Ec in B. subtilis , but favor SD Ec against SD Bs in E. coli Among well-positioned SDs, SD Ec is used more in E. coli than in B. subtilis , and SD Bs more in B. subtilis than in E. coli Highly expressed genes and genes of high translation efficiency tend to have longer SDs than lowly expressed genes and genes with low translation efficiency in both species, but more so in B. subtilis than in E. coli Both species overuse SDs matching the bolded part of the 3'TAIL shown above. The 3'TAIL difference contributes to the host specificity of phages. Copyright © 2017 Abolbaghaei et al.

  10. An RNA-Seq strategy to detect the complete coding and non-coding transcriptome including full-length imprinted macro ncRNAs.

    Directory of Open Access Journals (Sweden)

    Ru Huang

    Full Text Available Imprinted macro non-protein-coding (nc RNAs are cis-repressor transcripts that silence multiple genes in at least three imprinted gene clusters in the mouse genome. Similar macro or long ncRNAs are abundant in the mammalian genome. Here we present the full coding and non-coding transcriptome of two mouse tissues: differentiated ES cells and fetal head using an optimized RNA-Seq strategy. The data produced is highly reproducible in different sequencing locations and is able to detect the full length of imprinted macro ncRNAs such as Airn and Kcnq1ot1, whose length ranges between 80-118 kb. Transcripts show a more uniform read coverage when RNA is fragmented with RNA hydrolysis compared with cDNA fragmentation by shearing. Irrespective of the fragmentation method, all coding and non-coding transcripts longer than 8 kb show a gradual loss of sequencing tags towards the 3' end. Comparisons to published RNA-Seq datasets show that the strategy presented here is more efficient in detecting known functional imprinted macro ncRNAs and also indicate that standardization of RNA preparation protocols would increase the comparability of the transcriptome between different RNA-Seq datasets.

  11. Low-Complexity Multiple Description Coding of Video Based on 3D Block Transforms

    Directory of Open Access Journals (Sweden)

    Andrey Norkin

    2007-02-01

    Full Text Available The paper presents a multiple description (MD video coder based on three-dimensional (3D transforms. Two balanced descriptions are created from a video sequence. In the encoder, video sequence is represented in a form of coarse sequence approximation (shaper included in both descriptions and residual sequence (details which is split between two descriptions. The shaper is obtained by block-wise pruned 3D-DCT. The residual sequence is coded by 3D-DCT or hybrid, LOT+DCT, 3D-transform. The coding scheme is targeted to mobile devices. It has low computational complexity and improved robustness of transmission over unreliable networks. The coder is able to work at very low redundancies. The coding scheme is simple, yet it outperforms some MD coders based on motion-compensated prediction, especially in the low-redundancy region. The margin is up to 3 dB for reconstruction from one description.

  12. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    Science.gov (United States)

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  13. Cloning and sequencing of cDNA encoding human DNA topoisomerase II and localization of the gene to chromosome region 17q21-22

    International Nuclear Information System (INIS)

    Tsai-Pflugfelder, M.; Liu, L.F.; Liu, A.A.; Tewey, K.M.; Whang-Peng, J.; Knutsen, T.; Huebner, K.; Croce, C.M.; Wang, J.C.

    1988-01-01

    Two overlapping cDNA clones encoding human DNA topoisomerase II were identified by two independent methods. In one, a human cDNA library in phage λ was screened by hybridization with a mixed oligonucleotide probe encoding a stretch of seven amino acids found in yeast and Drosophila DNA topoisomerase II; in the other, a different human cDNA library in a λgt11 expression vector was screened for the expression of antigenic determinants that are recognized by rabbit antibodies specific to human DNA topoisomerase II. The entire coding sequences of the human DNA topoisomerase II gene were determined from these and several additional clones, identified through the use of the cloned human TOP2 gene sequences as probes. Hybridization between the cloned sequences and mRNA and genomic DNA indicates that the human enzyme is encoded by a single-copy gene. The location of the gene was mapped to chromosome 17q21-22 by in situ hybridization of a cloned fragment to metaphase chromosomes and by hybridization analysis with a panel of mouse-human hybrid cell lines, each retaining a subset of human chromosomes

  14. Translation Initiation from Conserved Non-AUG Codons Provides Additional Layers of Regulation and Coding Capacity

    Directory of Open Access Journals (Sweden)

    Ivaylo P. Ivanov

    2017-06-01

    Full Text Available Neurospora crassa cpc-1 and Saccharomyces cerevisiae GCN4 are homologs specifying transcription activators that drive the transcriptional response to amino acid limitation. The cpc-1 mRNA contains two upstream open reading frames (uORFs in its >700-nucleotide (nt 5′ leader, and its expression is controlled at the level of translation in response to amino acid starvation. We used N. crassa cell extracts and obtained data indicating that cpc-1 uORF1 and uORF2 are functionally analogous to GCN4 uORF1 and uORF4, respectively, in controlling translation. We also found that the 5′ region upstream of the main coding sequence of the cpc-1 mRNA extends for more than 700 nucleotides without any in-frame stop codon. For 100 cpc-1 homologs from Pezizomycotina and from selected Basidiomycota, 5′ conserved extensions of the CPC1 reading frame are also observed. Multiple non-AUG near-cognate codons (NCCs in the CPC1 reading frame upstream of uORF2, some deeply conserved, could potentially initiate translation. At least four NCCs initiated translation in vitro. In vivo data were consistent with initiation at NCCs to produce N-terminally extended N. crassa CPC1 isoforms. The pivotal role played by CPC1, combined with its translational regulation by uORFs and NCC utilization, underscores the emerging significance of noncanonical initiation events in controlling gene expression.

  15. Transcriptome Sequencing, De Novo Assembly and Differential Gene Expression Analysis of the Early Development of Acipenser baeri.

    Directory of Open Access Journals (Sweden)

    Wei Song

    Full Text Available The molecular mechanisms that drive the development of the endangered fossil fish species Acipenser baeri are difficult to study due to the lack of genomic data. Recent advances in sequencing technologies and the reducing cost of sequencing offer exclusive opportunities for exploring important molecular mechanisms underlying specific biological processes. This manuscript describes the large scale sequencing and analyses of mRNA from Acipenser baeri collected at five development time points using the Illumina Hiseq2000 platform. The sequencing reads were de novo assembled and clustered into 278167 unigenes, of which 57346 (20.62% had 45837 known homologues proteins in Uniprot protein databases while 11509 proteins matched with at least one sequence of assembled unigenes. The remaining 79.38% of unigenes could stand for non-coding unigenes or unigenes specific to A. baeri. A number of 43062 unigenes were annotated into functional categories via Gene Ontology (GO annotation whereas 29526 unigenes were associated with 329 pathways by mapping to KEGG database. Subsequently, 3479 differentially expressed genes were scanned within developmental stages and clustered into 50 gene expression profiles. Genes preferentially expressed at each stage were also identified. Through GO and KEGG pathway enrichment analysis, relevant physiological variations during the early development of A. baeri could be better cognized. Accordingly, the present study gives insights into the transcriptome profile of the early development of A. baeri, and the information contained in this large scale transcriptome will provide substantial references for A. baeri developmental biology and promote its aquaculture research.

  16. Gene cloning and mRNA expression of glutamate dehydrogenase in the liver, brain and intestine of the swamp eel, Monopterus albus, exposed to freshwater, terrestrial conditions, environmental ammonia or salinity stress

    Directory of Open Access Journals (Sweden)

    C Y Toh

    2011-12-01

    Full Text Available The swamp eel, Monopterus albus, is an obligatory air-breathing teleost which can survive long period of emersion, has high environmental and tissue ammonia tolerance, and acclimate from fresh to brackish water. This study was undertaken to clone and sequence gdh expressed in the liver, intestine and brain of M. albus, to verify whether more than one form of gdh were expressed, and to examine the gdh mRNA expressions in these three organs in fish exposed to various adverse conditions using quantitative real-time PCR. Only one gdh gene sequence, consisted of a 133 bp 5’ UTR, a CDS region spanning 1629 bp and a 3’ UTR of approximately 717 bp, was obtained from the liver, intestine and brain of M. albus. The translated Gdh amino acid sequence from the liver of M. albus had 542 residues and was confirmed to be Gdh1a. It had sequence identity of >90% with Oncorhynchus mykiss Gdh1a, Salmo salar Gdh1a1, Bostrychus sinensis Gdh1a and Tribolodon hakonensis Gdh1a, and formed a monophyletic clade with B. sinensis Gdh1a, Tetraodon nigroviridis Gdh1a, Chaenocephalus aceratus Gdh1a, Salmo salar Gdh1a1 and Gdh1a2 and O. mykiss Gdh1a. An increase in mRNA expression of gdh1a could be essential for increased glutamate production in support of increases in glutamine synthesis under certain environmental condition. Indeed, exposure of M. albus to 1 day of terrestrial conditions or 75 mmol l-1 NH4Cl, but not brackish water, resulted in a significant increase in gdh1a mRNA expression in the liver. However, exposure to brackish water, but not terrestrial conditions or 75 mmol l-1 NH4Cl, lead to a significant increase in the intestinal mRNA expression of gdh1a. By contrast, all the three experimental conditions had no significant effects on the mRNA expression of gdh1a in the brain of M. albus. Our results indicate for the first time that gdh mRNA expression was differentially up-regulated in the liver and intestine of M. albus, in responses to ammonia toxicity and

  17. Implementation of LT codes based on chaos

    International Nuclear Information System (INIS)

    Zhou Qian; Li Liang; Chen Zengqiang; Zhao Jiaxiang

    2008-01-01

    Fountain codes provide an efficient way to transfer information over erasure channels like the Internet. LT codes are the first codes fully realizing the digital fountain concept. They are asymptotically optimal rateless erasure codes with highly efficient encoding and decoding algorithms. In theory, for each encoding symbol of LT codes, its degree is randomly chosen according to a predetermined degree distribution, and its neighbours used to generate that encoding symbol are chosen uniformly at random. Practical implementation of LT codes usually realizes the randomness through pseudo-randomness number generator like linear congruential method. This paper applies the pseudo-randomness of chaotic sequence in the implementation of LT codes. Two Kent chaotic maps are used to determine the degree and neighbour(s) of each encoding symbol. It is shown that the implemented LT codes based on chaos perform better than the LT codes implemented by the traditional pseudo-randomness number generator. (general)

  18. A New Video Coding Algorithm Using 3D-Subband Coding and Lattice Vector Quantization

    Energy Technology Data Exchange (ETDEWEB)

    Choi, J.H. [Taejon Junior College, Taejon (Korea, Republic of); Lee, K.Y. [Sung Kyun Kwan University, Suwon (Korea, Republic of)

    1997-12-01

    In this paper, we propose an efficient motion adaptive 3-dimensional (3D) video coding algorithm using 3D subband coding (3D-SBC) and lattice vector quantization (LVQ) for low bit rate. Instead of splitting input video sequences into the fixed number of subbands along the temporal axes, we decompose them into temporal subbands of variable size according to motions in frames. Each spatio-temporally splitted 7 subbands are partitioned by quad tree technique and coded with lattice vector quantization(LVQ). The simulation results show 0.1{approx}4.3dB gain over H.261 in peak signal to noise ratio(PSNR) at low bit rate (64Kbps). (author). 13 refs., 13 figs., 4 tabs.

  19. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Directory of Open Access Journals (Sweden)

    Graner Andreas

    2008-10-01

    Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular

  20. Spread-spectrum communication using binary spatiotemporal chaotic codes

    International Nuclear Information System (INIS)

    Wang Xingang; Zhan Meng; Gong Xiaofeng; Lai, C.H.; Lai, Y.-C.

    2005-01-01

    We propose a scheme to generate binary code for baseband spread-spectrum communication by using a chain of coupled chaotic maps. We compare the performances of this type of spatiotemporal chaotic code with those of a conventional code used frequently in digital communication, the Gold code, and demonstrate that our code is comparable or even superior to the Gold code in several key aspects: security, bit error rate, code generation speed, and the number of possible code sequences. As the field of communicating with chaos faces doubts in terms of performance comparison with conventional digital communication schemes, our work gives a clear message that communicating with chaos can be advantageous and it deserves further attention from the nonlinear science community

  1. Utility of QR codes in biological collections.

    Science.gov (United States)

    Diazgranados, Mauricio; Funk, Vicki A

    2013-01-01

    The popularity of QR codes for encoding information such as URIs has increased exponentially in step with the technological advances and availability of smartphones, digital tablets, and other electronic devices. We propose using QR codes on specimens in biological collections to facilitate linking vouchers' electronic information with their associated collections. QR codes can efficiently provide such links for connecting collections, photographs, maps, ecosystem notes, citations, and even GenBank sequences. QR codes have numerous advantages over barcodes, including their small size, superior security mechanisms, increased complexity and quantity of information, and low implementation cost. The scope of this paper is to initiate an academic discussion about using QR codes on specimens in biological collections.

  2. Utility of QR codes in biological collections

    Directory of Open Access Journals (Sweden)

    Mauricio Diazgranados

    2013-07-01

    Full Text Available The popularity of QR codes for encoding information such as URIs has increased exponentially in step with the technological advances and availability of smartphones, digital tablets, and other electronic devices. We propose using QR codes on specimens in biological collections to facilitate linking vouchers’ electronic information with their associated collections. QR codes can efficiently provide such links for connecting collections, photographs, maps, ecosystem notes, citations, and even GenBank sequences. QR codes have numerous advantages over barcodes, including their small size, superior security mechanisms, increased complexity and quantity of information, and low implementation cost. The scope of this paper is to initiate an academic discussion about using QR codes on specimens in biological collections.

  3. Targeted sequencing of large genomic regions with CATCH-Seq.

    Directory of Open Access Journals (Sweden)

    Kenneth Day

    Full Text Available Current target enrichment systems for large-scale next-generation sequencing typically require synthetic oligonucleotides used as capture reagents to isolate sequences of interest. The majority of target enrichment reagents are focused on gene coding regions or promoters en masse. Here we introduce development of a customizable targeted capture system using biotinylated RNA probe baits transcribed from sheared bacterial artificial chromosome clone templates that enables capture of large, contiguous blocks of the genome for sequencing applications. This clone adapted template capture hybridization sequencing (CATCH-Seq procedure can be used to capture both coding and non-coding regions of a gene, and resolve the boundaries of copy number variations within a genomic target site. Furthermore, libraries constructed with methylated adapters prior to solution hybridization also enable targeted bisulfite sequencing. We applied CATCH-Seq to diverse targets ranging in size from 125 kb to 3.5 Mb. Our approach provides a simple and cost effective alternative to other capture platforms because of template-based, enzymatic probe synthesis and the lack of oligonucleotide design costs. Given its similarity in procedure, CATCH-Seq can also be performed in parallel with commercial systems.

  4. Tuning protein expression using synonymous codon libraries targeted to the 5' mRNA coding region

    DEFF Research Database (Denmark)

    Goltermann, Lise; Borch Jensen, Martin; Bentin, Thomas

    2011-01-01

    intermediate expression levels of green fluorescent protein in Escherichia coli. At least in one case, no apparent effect on protein stability was observed, pointing to RNA level effects as the principal reason for the observed expression differences. Targeting a synonymous codon library to the 5' coding...

  5. Altered expression of asparagine synthetase mRNA in human leukemic and carcinoma cell lines

    Energy Technology Data Exchange (ETDEWEB)

    Goodwin, L.O.; Guzowski, D.E.; Millan, C.A. [North Shore Univ. Hospital/Cornell Univ. Medical College, Manhasset, NY (United States)] [and others

    1994-09-01

    Asparagine synthetase (AS) is the enzyme responsible for the ATP-dependant conversion of aspartic acid to asparagine. The AS gene is expressed constitutively in most mammalian cells, including cells of the lymphoid lineage, as a 2 kb mRNA. In some leukemic phenotypes, AS expression is abrogated, resulting in no detectable enzyme activity. These cells are rendered sensitive to killing by L-asparaginase, which destroys extracellular asparagine. Prolonged treatment of leukemic cells with this agent can lead to resistance and the reappearance of AS activity, suggesting derepression of the AS gene, which has been shown to be regulated by intracellular levels of asparagine. Modulation of AS expression by asparagine employs cis and trans-acting elements involved in transcriptional and translational regulation. We have cloned and sequenced the human AS gene and surrounding sequence elements as well as the full-length cDNA. Using probes specific to the third and fourth exons of AS, we have identified an additional higher molecular weight mRNA (2.7 kb) in Northern blots derived from a chronic myelogenous leukemia and a colon carcinoma but not in normal lymphocytic or other human cell lines. We speculate that elements present in the cancer-derived mRNAs may be involved in the derepression of AS activity. This hypothesis is being evaluated by RNase protection assays using RNA isolated from a variety of human cell lines to characterize and elucidate the nature of this additional AS encoded message.

  6. The Role of the Y-Chromosome in the Establishment of Murine Hybrid Dysgenesis and in the Analysis of the Nucleotide Sequence Organization, Genetic Transmission and Evolution of Repeated Sequences.

    Science.gov (United States)

    Nallaseth, Ferez Soli

    The Y-chromosome presents a unique cytogenetic framework for the evolution of nucleotide sequences. Alignment of nine Y-chromosomal fragments in their increasing Y-specific/non Y-specific (male/female) sequence divergence ratios was directly and inversely related to their interspersion on these two respective genomic fractions. Sequence analysis confirmed a direct relationship between divergence ratios and the Alu, LINE-1, Satellite and their derivative oligonucleotide contents. Thus their relocation on the Y-chromosome is followed by sequence divergence rather than the well documented concerted evolution of these non-coding progenitor repeated sequences. Five of the nine Y-chromosomal fragments are non-pseudoautosomal and transcribed into heterogeneous PolyA^+ RNA and thus can be retrotransposed. Evolutionary and computer analysis identified homologous oligonucleotide tracts in several human loci suggesting common and random mechanistic origins. Dysgenic genomes represent the accelerated evolution driving sequence divergence (McClintock, 1984). Sex reversal and sterility characterizing dysgenesis occurs in C57BL/6JY ^{rm Pos} but not in 129/SvY^{rm Pos} derivative strains. High frequency, random, multi-locus deletion products of the feral Y^{ rm Pos}-chromosome are generated in the germlines of F1(C57BL/6J X 129/SvY^{ rm Pos})(male) and C57BL/6JY ^{rm Pos}(male) but not in 129/SvY^{rm Pos}(male). Equal, 10^{-1}, 10^ {-2}, and 0 copies (relative to males) of Y^{rm Pos}-specific deletion products respectively characterize C57BL/6JY ^{rm Pos} (HC), (LC), (T) and (F) females. The testes determining loci of inactive Y^{rm Pos}-chromosomes in C57BL/6JY^{rm Pos} HC females are the preferentially deleted/rearranged Y ^{rm Pos}-sequences. Disruption of regulation of plasma testosterone and hepatic MUP-A mRNA levels, TRD of a 4.7 Kbp EcoR1 fragment suggest disruption of autosomal/X-chromosomal sequences. These data and the highly repeated progenitor (Alu, GATA, LINE-1

  7. Nuclear imprisonment of host cellular mRNA by nsp1β protein of porcine reproductive and respiratory syndrome virus

    International Nuclear Information System (INIS)

    Han, Mingyuan; Ke, Hanzhong; Zhang, Qingzhan; Yoo, Dongwan

    2017-01-01

    Positive-strand RNA genomes function as mRNA for viral protein synthesis which is fully reliant on host cell translation machinery. Competing with cellular protein translation apparatus needs to ensure the production of viral proteins, but this also stifles host innate defense. In the present study, we showed that porcine reproductive and respiratory syndrome virus (PRRSV), whose replication takes place in the cytoplasm, imprisoned host cell mRNA in the nucleus, which suggests a novel mechanism to enhance translation of PRRSV genome. PRRSV nonstructural protein (nsp) 1β was identified as the nuclear protein playing the role for host mRNA nuclear retention and subversion of host protein synthesis. A SAP (SAF-A/B, Acinus, and PIAS) motif was identified in nsp1β with the consensus sequence of 126 -LQxxLxxxGL- 135 . In situ hybridization unveiled that SAP mutants were unable to cause nuclear retention of host cell mRNAs and did not suppress host protein synthesis. In addition, these SAP mutants reverted PRRSV-nsp1β-mediated suppression of interferon (IFN) production, IFN signaling, and TNF-α production pathway. Using reverse genetics, a series of SAP mutant PRRS viruses, vK124A, vL126A, vG134A, and vL135A were generated. No mRNA nuclear retention was observed during vL126A and vL135A infections. Importantly, vL126A and vL135A did not suppress IFN production. For other arteriviruses, mRNA nuclear accumulation was also observed for LDV-nsp1β and SHFV-nsp1β. EAV-nsp1 was exceptional and did not block the host mRNA nuclear export. - Highlights: •PRRS virus blocks host mRNA nuclear export to the cytoplasm. •PRRSV nsp1β is the viral protein responsible for host mRNA nuclear retention. •SAP domain in nsp1β is essential for host mRNA nuclear retention and type I interferon suppression. •Mutation in the SAP domain of nsp1β causes the loss of function. •Host mRNA nuclear retention by nsp1β is common in the family Arteriviridae, except equine arteritis virus.

  8. Nuclear imprisonment of host cellular mRNA by nsp1β protein of porcine reproductive and respiratory syndrome virus

    Energy Technology Data Exchange (ETDEWEB)

    Han, Mingyuan, E-mail: hanming@umich.edu; Ke, Hanzhong; Zhang, Qingzhan; Yoo, Dongwan, E-mail: dyoo@illinois.edu

    2017-05-15

    Positive-strand RNA genomes function as mRNA for viral protein synthesis which is fully reliant on host cell translation machinery. Competing with cellular protein translation apparatus needs to ensure the production of viral proteins, but this also stifles host innate defense. In the present study, we showed that porcine reproductive and respiratory syndrome virus (PRRSV), whose replication takes place in the cytoplasm, imprisoned host cell mRNA in the nucleus, which suggests a novel mechanism to enhance translation of PRRSV genome. PRRSV nonstructural protein (nsp) 1β was identified as the nuclear protein playing the role for host mRNA nuclear retention and subversion of host protein synthesis. A SAP (SAF-A/B, Acinus, and PIAS) motif was identified in nsp1β with the consensus sequence of {sub 126}-LQxxLxxxGL-{sub 135}. In situ hybridization unveiled that SAP mutants were unable to cause nuclear retention of host cell mRNAs and did not suppress host protein synthesis. In addition, these SAP mutants reverted PRRSV-nsp1β-mediated suppression of interferon (IFN) production, IFN signaling, and TNF-α production pathway. Using reverse genetics, a series of SAP mutant PRRS viruses, vK124A, vL126A, vG134A, and vL135A were generated. No mRNA nuclear retention was observed during vL126A and vL135A infections. Importantly, vL126A and vL135A did not suppress IFN production. For other arteriviruses, mRNA nuclear accumulation was also observed for LDV-nsp1β and SHFV-nsp1β. EAV-nsp1 was exceptional and did not block the host mRNA nuclear export. - Highlights: •PRRS virus blocks host mRNA nuclear export to the cytoplasm. •PRRSV nsp1β is the viral protein responsible for host mRNA nuclear retention. •SAP domain in nsp1β is essential for host mRNA nuclear retention and type I interferon suppression. •Mutation in the SAP domain of nsp1β causes the loss of function. •Host mRNA nuclear retention by nsp1β is common in the family Arteriviridae, except equine

  9. Validation of the Serpent 2-DYNSUB code sequence using the Special Power Excursion Reactor Test III (SPERT III)

    International Nuclear Information System (INIS)

    Knebel, Miriam; Mercatali, Luigi; Sanchez, Victor; Stieglitz, Robert; Macian-Juan, Rafael

    2016-01-01

    Highlights: • Full few-group cross section tables created by Monte Carlo lattice code Serpent 2. • Serpent 2 group constant methodology verified for HFP static and transient cases. • Serpent 2-DYNSUB tool chainvalidated using SPERT III REA experiments. • Serpent 2-DYNSUB tool chain suitable to model RIAs in PWRs. - Abstract: The Special Power Excursion Reactor Test III (SPERT III) is studied using the Serpent 2-DYNSUB code sequence in order to validate it for modeling reactivity insertion accidents (RIA) in PWRs. The SPERT III E-core was a thermal research reactor constructed to analyze reactor dynamics. Its configuration resembles a commercial PWR on terms of fuel type, choice of moderator, coolant flow and system pressure. The initial conditions of the rod ejection accident experiments (REA) performed cover cold startup, hot startup, hot standby and operating power scenarios. Eight of these experiments were analyzed in detail. Firstly, multi-dimensional nodal diffusion cross section tables were created for the three-dimensional reactor simulator DYNSUB employing the Monte Carlo neutron transport code Serpent 2. In a second step, DYNSUB stationary simulations were compared to Monte Carlo reference three-dimensional full scale solutions obtained with Serpent 2 (cold startup conditions) and Serpent 2/SUBCHANFLOW (operating power conditions) with a good agreement being observed. The latter tool is an internal coupling of Serpent 2 and the sub-channel thermal-hydraulics code SUBCHANFLOW. Finally, DYNSUB was utilized to study the eight selected transient experiments. Results were found to match measurements well. As the selected experiments cover much of the possible transient (delayed super-critical, prompt super-critical and super-prompt critical excursion) and initial conditions (cold and hot as well as zero, little and full power reactor states) one expects in commercial PWRs, the obtained results give confidence that the Serpent 2-DYNSUB tool chain is

  10. Promoter Analysis Reveals Globally Differential Regulation of Human Long Non-Coding RNA and Protein-Coding Genes

    KAUST Repository

    Alam, Tanvir

    2014-10-02

    Transcriptional regulation of protein-coding genes is increasingly well-understood on a global scale, yet no comparable information exists for long non-coding RNA (lncRNA) genes, which were recently recognized to be as numerous as protein-coding genes in mammalian genomes. We performed a genome-wide comparative analysis of the promoters of human lncRNA and protein-coding genes, finding global differences in specific genetic and epigenetic features relevant to transcriptional regulation. These two groups of genes are hence subject to separate transcriptional regulatory programs, including distinct transcription factor (TF) proteins that significantly favor lncRNA, rather than coding-gene, promoters. We report a specific signature of promoter-proximal transcriptional regulation of lncRNA genes, including several distinct transcription factor binding sites (TFBS). Experimental DNase I hypersensitive site profiles are consistent with active configurations of these lncRNA TFBS sets in diverse human cell types. TFBS ChIP-seq datasets confirm the binding events that we predicted using computational approaches for a subset of factors. For several TFs known to be directly regulated by lncRNAs, we find that their putative TFBSs are enriched at lncRNA promoters, suggesting that the TFs and the lncRNAs may participate in a bidirectional feedback loop regulatory network. Accordingly, cells may be able to modulate lncRNA expression levels independently of mRNA levels via distinct regulatory pathways. Our results also raise the possibility that, given the historical reliance on protein-coding gene catalogs to define the chromatin states of active promoters, a revision of these chromatin signature profiles to incorporate expressed lncRNA genes is warranted in the future.

  11. OFFSCALE: A PC input processor for the SCALE code system. The CSASIN processor for the criticality sequences

    International Nuclear Information System (INIS)

    Bowman, S.M.

    1994-11-01

    OFFSCALE is a suite of personal computer input processor programs developed at Oak Ridge National Laboratory to provide an easy-to-use interface for modules in the SCALE-4 code system. CSASIN (formerly known as OFFSCALE) is a program in the OFFSCALE suite that serves as a user-friendly interface for the Criticality Safety Analysis Sequences (CSAS) available in SCALE-4. It is designed to assist a SCALE-4 user in preparing an input file for execution of criticality safety problems. Output from CSASIN generates an input file that may be used to execute the CSAS control module in SCALE-4. CSASIN features a pulldown menu system that accesses sophisticated data entry screens. The program allows the user to quickly set up a CSAS input file and perform data checking. This capability increases productivity and decreases the chance of user error

  12. Full-length sequencing and identification of novel polymorphisms in ...

    Indian Academy of Sciences (India)

    The aim of this work was to sequence the entirecoding region of ACACA gene in Valle del Belice sheep breed to identify polymorphic sites. A total of 51 coding exons of ACACA gene were sequenced in 32 individuals of Valle del Belice sheep breed. Sequencing analysis and alignment of obtained sequences showed the ...

  13. Building the sequence map of the human pan-genome

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Li, Yingrui; Zheng, Hancheng

    2010-01-01

    analysis of predicted genes indicated that the novel sequences contain potentially functional coding regions. We estimate that a complete human pan-genome would contain approximately 19-40 Mb of novel sequence not present in the extant reference genome. The extensive amount of novel sequence contributing...

  14. Sequence embedding for fast construction of guide trees for multiple sequence alignment

    LENUS (Irish Health Repository)

    Blackshields, Gordon

    2010-05-14

    Abstract Background The most widely used multiple sequence alignment methods require sequences to be clustered as an initial step. Most sequence clustering methods require a full distance matrix to be computed between all pairs of sequences. This requires memory and time proportional to N 2 for N sequences. When N grows larger than 10,000 or so, this becomes increasingly prohibitive and can form a significant barrier to carrying out very large multiple alignments. Results In this paper, we have tested variations on a class of embedding methods that have been designed for clustering large numbers of complex objects where the individual distance calculations are expensive. These methods involve embedding the sequences in a space where the similarities within a set of sequences can be closely approximated without having to compute all pair-wise distances. Conclusions We show how this approach greatly reduces computation time and memory requirements for clustering large numbers of sequences and demonstrate the quality of the clusterings by benchmarking them as guide trees for multiple alignment. Source code is available for download from http:\\/\\/www.clustal.org\\/mbed.tgz.

  15. Alternative Polyadenylation and Nonsense-Mediated Decay Coordinately Regulate the Human HFE mRNA Levels

    Science.gov (United States)

    Martins, Rute; Proença, Daniela; Silva, Bruno; Barbosa, Cristina; Silva, Ana Luísa; Faustino, Paula; Romão, Luísa

    2012-01-01

    Nonsense-mediated decay (NMD) is an mRNA surveillance pathway that selectively recognizes and degrades defective mRNAs carrying premature translation-termination codons. However, several studies have shown that NMD also targets physiological transcripts that encode full-length proteins, modulating their expression. Indeed, some features of physiological mRNAs can render them NMD-sensitive. Human HFE is a MHC class I protein mainly expressed in the liver that, when mutated, can cause hereditary hemochromatosis, a common genetic disorder of iron metabolism. The HFE gene structure comprises seven exons; although the sixth exon is 1056 base pairs (bp) long, only the first 41 bp encode for amino acids. Thus, the remaining downstream 1015 bp sequence corresponds to the HFE 3′ untranslated region (UTR), along with exon seven. Therefore, this 3′ UTR encompasses an exon/exon junction, a feature that can make the corresponding physiological transcript NMD-sensitive. Here, we demonstrate that in UPF1-depleted or in cycloheximide-treated HeLa and HepG2 cells the HFE transcripts are clearly upregulated, meaning that the physiological HFE mRNA is in fact an NMD-target. This role of NMD in controlling the HFE expression levels was further confirmed in HeLa cells transiently expressing the HFE human gene. Besides, we show, by 3′-RACE analysis in several human tissues that HFE mRNA expression results from alternative cleavage and polyadenylation at four different sites – two were previously described and two are novel polyadenylation sites: one located at exon six, which confers NMD-resistance to the corresponding transcripts, and another located at exon seven. In addition, we show that the amount of HFE mRNA isoforms resulting from cleavage and polyadenylation at exon seven, although present in both cell lines, is higher in HepG2 cells. These results reveal that NMD and alternative polyadenylation may act coordinately to control HFE mRNA levels, possibly varying its

  16. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  17. Molecular characterization of three Rhesus glycoproteins from the gills of the African lungfish, Protopterus annectens, and effects of aestivation on their mRNA expression levels and protein abundance.

    Directory of Open Access Journals (Sweden)

    You R Chng

    Full Text Available African lungfishes are ammonotelic in water. They can aestivate for long periods on land during drought. During aestivation, the gills are covered with dried mucus and ammonia excretion ceases. In fishes, ammonia excretion through the gills involves Rhesus glycoproteins (RhGP/Rhgp. This study aimed to obtain the complete cDNA coding sequences of rhgp from the gills of Protopterus annectens, and to determine their branchial mRNA and protein expression levels during the induction, maintenance and arousal phases of aestivation. Three isoforms of rhgp (rhag, rhbg and rhcg were obtained in the gills of P. annectens. Their complete cDNA coding sequences ranged between 1311 and 1398 bp, coding for 436 to 465 amino acids with estimated molecular masses between 46.8 and 50.9 kDa. Dendrogramic analyses indicated that Rhag was grouped closer to fishes, while Rhbg and Rhcg were grouped closer to tetrapods. During the induction phase, the protein abundance of Rhag, but not its transcript level, was down-regulated in the gills, suggesting that there could be a decrease in the release of ammonia from the erythrocytes to the plasma. Furthermore, the branchial transcript levels of rhbg and rhcg decreased significantly, in preparation for the subsequent shutdown of gill functions. During the maintenance phase, the branchial expression levels of rhag/Rhag, rhbg/Rhbg and rhcg/Rhcg decreased significantly, indicating that their transcription and translation were down-regulated. This could be part of an overall mechanism to shut down branchial functions and save metabolic energy used for transcription and translation. It could also be regarded as an adaptive response to stop ammonia excretion. During the arousal phase, it is essential for the lungfish to regain the ability to excrete ammonia. Indeed, the protein abundance of Rhag, Rhbg and Rhcg recovered to the corresponding control levels after 1 day or 3 days of recovery from 6 months of aestivation.

  18. Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution.

    Science.gov (United States)

    2004-12-09

    We present here a draft genome sequence of the red jungle fowl, Gallus gallus. Because the chicken is a modern descendant of the dinosaurs and the first non-mammalian amniote to have its genome sequenced, the draft sequence of its genome--composed of approximately one billion base pairs of sequence and an estimated 20,000-23,000 genes--provides a new perspective on vertebrate genome evolution, while also improving the annotation of mammalian genomes. For example, the evolutionary distance between chicken and human provides high specificity in detecting functional elements, both non-coding and coding. Notably, many conserved non-coding sequences are far from genes and cannot be assigned to defined functional classes. In coding regions the evolutionary dynamics of protein domains and orthologous groups illustrate processes that distinguish the lineages leading to birds and mammals. The distinctive properties of avian microchromosomes, together with the inferred patterns of conserved synteny, provide additional insights into vertebrate chromosome architecture.

  19. ARMOUR - A Rice miRNA: mRNA Interaction Resource.

    Science.gov (United States)

    Sanan-Mishra, Neeti; Tripathi, Anita; Goswami, Kavita; Shukla, Rohit N; Vasudevan, Madavan; Goswami, Hitesh

    2018-01-01

    ARMOUR was developed as A Rice miRNA:mRNA interaction resource. This informative and interactive database includes the experimentally validated expression profiles of miRNAs under different developmental and abiotic stress conditions across seven Indian rice cultivars. This comprehensive database covers 689 known and 1664 predicted novel miRNAs and their expression profiles in more than 38 different tissues or conditions along with their predicted/known target transcripts. The understanding of miRNA:mRNA interactome in regulation of functional cellular machinery is supported by the sequence information of the mature and hairpin structures. ARMOUR provides flexibility to users in querying the database using multiple ways like known gene identifiers, gene ontology identifiers, KEGG identifiers and also allows on the fly fold change analysis and sequence search query with inbuilt BLAST algorithm. ARMOUR database provides a cohesive platform for novel and mature miRNAs and their expression in different experimental conditions and allows searching for their interacting mRNA targets, GO annotation and their involvement in various biological pathways. The ARMOUR database includes a provision for adding more experimental data from users, with an aim to develop it as a platform for sharing and comparing experimental data contributed by research groups working on rice.

  20. T-lymphocyte cytokine mRNA expression in cystic echinococcosis.

    Science.gov (United States)

    Fauser, S; Kern, P

    1997-04-01

    In the present study we investigated cytokine mRNA expression by peripheral blood mononuclear cells (PBMC) from patients with cystic echinococcosis (CE) after stimulation with different antigens. By using reverse transcriptase polymerase chain reaction (RT-PCR) we could demonstrate that restimulation with crude Echinococcus granulosus antigen (Eg-Ag) induced or enhanced Th2 cytokine mRNA expression, especially IL-5 (by using antigen from sheep cyst fluid) in 23 out of 26 investigated CE patients and IL-10 (by using antigen from camel cyst fluid) in 10 out of 10 investigated CE patients. In contrast, IL-5 mRNA expression was absent in PBMC of healthy controls after Eg-Ag stimulation. To determine the specificity of this reaction we stimulated PBMC from 11 CE patients with crude Echinococcus multilocularis antigen (Em-Ag) and PBMC from 8 CE patients with Toxocara canis antigen (Tc-Ag). We found that the PBMC of patients showed a similar mRNA cytokine pattern on stimulation with Em-Ag when compared with Eg-Ag stimulation. The cytokine mRNA pattern on stimulation with Tc-Ag, however, resembled the cytokine mRNA pattern of unstimulated PBMC. Furthermore, the stimulation of PBMC with crude Mycobacterium tuberculosis antigen (H37Ra) and purified protein derivative (PPD) of M. tuberculosis revealed distinct IL-5 mRNA expression in all investigated CE patients, whereas in healthy controls IL-5 mRNA expression was very weak or totally absent. Thus, our results indicate an induction of Th2 cytokine mRNA expression in CE patients, which is frequently observed in parasite infections. Interestingly, this response persists after stimulation with tuberculosis antigens, which normally induce Th1 response.

  1. Non-coding RNA in Deinococcus radiodurans

    International Nuclear Information System (INIS)

    Chen Zhongzhong; Wang Liangyan; Lin Jun; Tian Bing; Hua Yuejin

    2006-01-01

    Researches on DNA damage and repair pathways of Deinococcus radiodurans show its extreme resistance to ionizing radiation, ultraviolet radiation and reactive oxygen species. Non-coding (ncRNA) RNAs are involved in a variety of processes such as transcriptional regulations, RNA processing and modification, mRNA translation, protein transportation and stability. The conserved secondary structures of intergenic regions of Deinococcus radiodurans R1 were predicted using Stochastic Context Free Grammar (SCFG) scan strategy. Results showed that 28 ncRNA families were present in the non-coding regions of the genome of Deinococcus radiodurans R1. Among these families, IRE is the largest family, followed by Histone3, tRNA, SECIS. DicF, ctRNA-pGA1 and tmRNA are one discovered in bacteria. Results from the comparison with other organisms showed that these ncRNA can be applied to the study of biological function of Deinococcus radiodurans and supply reference for the further study of DNA damage and repair mechanisms of this bacterium. (authors)

  2. Multiple tag labeling method for DNA sequencing

    Science.gov (United States)

    Mathies, R.A.; Huang, X.C.; Quesada, M.A.

    1995-07-25

    A DNA sequencing method is described which uses single lane or channel electrophoresis. Sequencing fragments are separated in the lane and detected using a laser-excited, confocal fluorescence scanner. Each set of DNA sequencing fragments is separated in the same lane and then distinguished using a binary coding scheme employing only two different fluorescent labels. Also described is a method of using radioisotope labels. 5 figs.

  3. Association of Amine-Receptor DNA Sequence Variants with Associative Learning in the Honeybee.

    Science.gov (United States)

    Lagisz, Malgorzata; Mercer, Alison R; de Mouzon, Charlotte; Santos, Luana L S; Nakagawa, Shinichi

    2016-03-01

    Octopamine- and dopamine-based neuromodulatory systems play a critical role in learning and learning-related behaviour in insects. To further our understanding of these systems and resulting phenotypes, we quantified DNA sequence variations at six loci coding octopamine-and dopamine-receptors and their association with aversive and appetitive learning traits in a population of honeybees. We identified 79 polymorphic sequence markers (mostly SNPs and a few insertions/deletions) located within or close to six candidate genes. Intriguingly, we found that levels of sequence variation in the protein-coding regions studied were low, indicating that sequence variation in the coding regions of receptor genes critical to learning and memory is strongly selected against. Non-coding and upstream regions of the same genes, however, were less conserved and sequence variations in these regions were weakly associated with between-individual differences in learning-related traits. While these associations do not directly imply a specific molecular mechanism, they suggest that the cross-talk between dopamine and octopamine signalling pathways may influence olfactory learning and memory in the honeybee.

  4. Expression of calmodulin mRNA in rat olfactory neuroepithelium.

    Science.gov (United States)

    Biffo, S; Goren, T; Khew-Goodall, Y S; Miara, J; Margolis, F L

    1991-04-01

    A calmodulin (CaM) cDNA was isolated by differential hybridization screening of a lambda gt10 library prepared from rat olfactory mucosa. This cDNA fragment, containing most of the open reading frame of the rat CaMI gene, was subcloned and used to characterize steady-state expression of CaM mRNA in rat olfactory neuroepithelium and bulb. Within the bulb mitral cells are the primary neuronal population expressing CaM mRNA. The major CaM mRNA expressed in the olfactory mucosa is 1.7 kb with smaller contributions from mRNAs of 4.0 and 1.4 kb. CaM mRNA was primarily associated with the olfactory neurons and, despite the cellular complexity of the tissue and the known involvement of CaM in diverse cellular processes, was only minimally evident in sustentacular cells, gland cells or respiratory epithelium. Following bulbectomy CaM mRNA declines in the olfactory neuroepithelium as does olfactory marker protein (OMP) mRNA. In contrast to the latter, CaM mRNA makes a partial recovery by one month after surgery. These results, coupled with those from in situ hybridization, indicate that CaM mRNA is expressed in both mature and immature olfactory neurons. The program regulating CaM gene expression in olfactory neurons is distinct from those controlling expression of B50/GAP43 in immature, or OMP in mature, neurons respectively.

  5. RNA Editing in Plant Mitochondria

    Science.gov (United States)

    Hiesel, Rudolf; Wissinger, Bernd; Schuster, Wolfgang; Brennicke, Axel

    1989-12-01

    Comparative sequence analysis of genomic and complementary DNA clones from several mitochondrial genes in the higher plant Oenothera revealed nucleotide sequence divergences between the genomic and the messenger RNA-derived sequences. These sequence alterations could be most easily explained by specific post-transcriptional nucleotide modifications. Most of the nucleotide exchanges in coding regions lead to altered codons in the mRNA that specify amino acids better conserved in evolution than those encoded by the genomic DNA. Several instances show that the genomic arginine codon CGG is edited in the mRNA to the tryptophan codon TGG in amino acid positions that are highly conserved as tryptophan in the homologous proteins of other species. This editing suggests that the standard genetic code is used in plant mitochondria and resolves the frequent coincidence of CGG codons and tryptophan in different plant species. The apparently frequent and non-species-specific equivalency of CGG and TGG codons in particular suggests that RNA editing is a common feature of all higher plant mitochondria.

  6. Genetic analysis of tumorigenesis: XXXII. Localization of constitutionally amplified KRAS sequences to Chinese hamster chromosomes X and Y by in situ hybridization.

    Science.gov (United States)

    Stenman, G; Anisowicz, A; Sager, R

    1988-11-01

    The KRAS gene is constitutionally amplified in the Chinese hamster. We have mapped the amplified sequences by in situ hybridization to two major sites on the X and Y chromosomes, Xq4 and Yp2. No autosomal site was detected despite a search under relaxed hybridization conditions. KRAS DNA is amplified about 50-fold compared to a human cell line known to have a diploid number of KRAS sequences, whereas mRNA expression is 5- to 10-fold lower than in normal human cells. While mRNA expression levels do not necessarily parallel gene copy number, the low expression level strongly suggests that the amplified sequences are transcriptionally silent. It is suggested that the amplified sequences arose from the original KRAS gene on chromosome 8 and that the KRAS sequences on the Y chromosome arose by X-Y recombination.

  7. A bar coding system for environmental projects

    International Nuclear Information System (INIS)

    Barber, R.B.; Hunt, B.J.; Burgess, G.M.

    1988-01-01

    This paper presents BeCode systems, a bar coding system which provides both nuclear and commercial clients with a data capture and custody management program that is accurate, timely, and beneficial to all levels of project operations. Using bar code identifiers is an essentially paperless and error-free method which provides more efficient delivery of data through its menu card-driven structure, which speeds collection of essential data for uploading to a compatible device. The effects of this sequence include real-time information for operator analysis, management review, audits, planning, scheduling, and cost control

  8. Stimulation of S14 mRNA and lipogenesis in brown fat by hypothyroidism, cold exposure, and cafeteria feeding: evidence supporting a general role for S14 in lipogenesis and lipogenesis in the maintenance of thermogenesis

    Energy Technology Data Exchange (ETDEWEB)

    Freake, H.C.; Oppenheimer, J.H.

    1987-05-01

    In liver, thyroid hormone rapidly induces S14 mRNA, which encodes a small acidic protein. This sequence is abundantly expressed only in lipogenic tissues and is thought to have some function in fat metabolism. In the euthyroid rat, we measured 20-fold higher levels of S14 mRNA in interscapular brown adipose tissue than liver. Furthermore, whereas in liver or epididymal fat, hypothyroidism resulted in an 80% fall in S14 mRNA, in brown fat the level of this sequence increased a further 3-fold. In all three tissues, the expression of S14 mRNA correlated well with lipogenesis, as assessed by /sup 3/H/sub 2/O incorporation. Physiological activation of brown fat by chronic cold exposure or cafeteria feeding increased the concentration of S14 mRNA in this tissue and again this was accompanied by a greater rate of fatty acid synthesis. Overall, in liver and white and brown adipose tissue, S14 mRNA and lipogenesis were well correlated and strongly suggest a function of the S14 protein related to fat synthesis. These studies suggest that the S14 protein and lipogenesis may be important for thyroid hormone-induced and brown adipose tissue thermogenesis and that stimulation of these functions in hypothyroid brown fat is a consequence of decreased thyroid hormone-induced thermogenesis elsewhere.

  9. Stimulation of S14 mRNA and lipogenesis in brown fat by hypothyroidism, cold exposure, and cafeteria feeding: evidence supporting a general role for S14 in lipogenesis and lipogenesis in the maintenance of thermogenesis

    International Nuclear Information System (INIS)

    Freake, H.C.; Oppenheimer, J.H.

    1987-01-01

    In liver, thyroid hormone rapidly induces S14 mRNA, which encodes a small acidic protein. This sequence is abundantly expressed only in lipogenic tissues and is thought to have some function in fat metabolism. In the euthyroid rat, we measured 20-fold higher levels of S14 mRNA in interscapular brown adipose tissue than liver. Furthermore, whereas in liver or epididymal fat, hypothyroidism resulted in an 80% fall in S14 mRNA, in brown fat the level of this sequence increased a further 3-fold. In all three tissues, the expression of S14 mRNA correlated well with lipogenesis, as assessed by 3 H 2 O incorporation. Physiological activation of brown fat by chronic cold exposure or cafeteria feeding increased the concentration of S14 mRNA in this tissue and again this was accompanied by a greater rate of fatty acid synthesis. Overall, in liver and white and brown adipose tissue, S14 mRNA and lipogenesis were well correlated and strongly suggest a function of the S14 protein related to fat synthesis. These studies suggest that the S14 protein and lipogenesis may be important for thyroid hormone-induced and brown adipose tissue thermogenesis and that stimulation of these functions in hypothyroid brown fat is a consequence of decreased thyroid hormone-induced thermogenesis elsewhere

  10. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Science.gov (United States)

    Zhang, Ai-bing; Feng, Jie; Ward, Robert D; Wan, Ping; Gao, Qiang; Wu, Jun; Zhao, Wei-zhong

    2012-01-01

    Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI) region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS) genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF) to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish) and two representing non-coding ITS barcodes (rust fungi and brown algae). Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ) and Maximum likelihood (ML) methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI) of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40%) for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37%) for 1094 brown algae queries, both using ITS barcodes.

  11. A new method for species identification via protein-coding and non-coding DNA barcodes by combining machine learning with bioinformatic methods.

    Directory of Open Access Journals (Sweden)

    Ai-bing Zhang

    Full Text Available Species identification via DNA barcodes is contributing greatly to current bioinventory efforts. The initial, and widely accepted, proposal was to use the protein-coding cytochrome c oxidase subunit I (COI region as the standard barcode for animals, but recently non-coding internal transcribed spacer (ITS genes have been proposed as candidate barcodes for both animals and plants. However, achieving a robust alignment for non-coding regions can be problematic. Here we propose two new methods (DV-RBF and FJ-RBF to address this issue for species assignment by both coding and non-coding sequences that take advantage of the power of machine learning and bioinformatics. We demonstrate the value of the new methods with four empirical datasets, two representing typical protein-coding COI barcode datasets (neotropical bats and marine fish and two representing non-coding ITS barcodes (rust fungi and brown algae. Using two random sub-sampling approaches, we demonstrate that the new methods significantly outperformed existing Neighbor-joining (NJ and Maximum likelihood (ML methods for both coding and non-coding barcodes when there was complete species coverage in the reference dataset. The new methods also out-performed NJ and ML methods for non-coding sequences in circumstances of potentially incomplete species coverage, although then the NJ and ML methods performed slightly better than the new methods for protein-coding barcodes. A 100% success rate of species identification was achieved with the two new methods for 4,122 bat queries and 5,134 fish queries using COI barcodes, with 95% confidence intervals (CI of 99.75-100%. The new methods also obtained a 96.29% success rate (95%CI: 91.62-98.40% for 484 rust fungi queries and a 98.50% success rate (95%CI: 96.60-99.37% for 1094 brown algae queries, both using ITS barcodes.

  12. The presence of five nifH-like sequences in Clostridium pasteurianum: sequence divergence and transcription properties.

    OpenAIRE

    Wang, S Z; Chen, J S; Johnson, J L

    1988-01-01

    The nifH gene encodes the iron protein (component II) of the nitrogenase complex. We have previously shown the presence in Clostridium pasteurianum of two nifH-like sequences in addition to the nifH1 gene which codes for a protein identical to the isolated iron protein. In the present study, we report that there are at least five nifH-like sequences in C. pasteurianum. DNA sequencing data indicate that the six nifH (nifH1) and nifH-like (nifH2, nifH3, nifH4, nifH5 and nifH6) sequences are not...

  13. Whole-genome analysis of mRNA decay in Plasmodium falciparum reveals a global lengthening of mRNA half-life during the intra-erythrocytic development cycle.

    Science.gov (United States)

    Shock, Jennifer L; Fischer, Kael F; DeRisi, Joseph L

    2007-01-01

    The rate of mRNA decay is an essential element of post-transcriptional regulation in all organisms. Previously, studies in several organisms found that the specific half-life of each mRNA is precisely related to its physiologic role, and plays an important role in determining levels of gene expression. We used a genome-wide approach to characterize mRNA decay in Plasmodium falciparum. We found that, globally, rates of mRNA decay increase dramatically during the asexual intra-erythrocytic developmental cycle. During the ring stage of the cycle, the average mRNA half-life was 9.5 min, but this was extended to an average of 65 min during the late schizont stage of development. Thus, a major determinant of mRNA decay rate appears to be linked to the stage of intra-erythrocytic development. Furthermore, we found specific variations in decay patterns superimposed upon the dominant trend of progressive half-life lengthening. These variations in decay pattern were frequently enriched for genes with specific cellular functions or processes. Elucidation of Plasmodium mRNA decay rates provides a key element for deciphering mechanisms of genetic control in this parasite, by complementing and extending previous mRNA abundance studies. Our results indicate that progressive stage-dependent decreases in mRNA decay rate function are a major determinant of mRNA accumulation during the schizont stage of intra-erythrocytic development. This type of genome-wide change in mRNA decay rate has not been observed in any other organism to date, and indicates that post-transcriptional regulation may be the dominant mechanism of gene regulation in P. falciparum.

  14. Ddx19 links mRNA nuclear export with progression of transcription and replication and suppresses genomic instability upon DNA damage in proliferating cells.

    Science.gov (United States)

    Hodroj, Dana; Serhal, Kamar; Maiorano, Domenico

    2017-09-03

    The DEAD-box Helicase 19 (Ddx19) gene codes for an RNA helicase involved in both mRNA (mRNA) export from the nucleus into the cytoplasm and in mRNA translation. In unperturbed cells, Ddx19 localizes in the cytoplasm and at the cytoplasmic face of the nuclear pore. Here we review recent findings related to an additional Ddx19 function in the nucleus in resolving RNA:DNA hybrids (R-loops) generated during collision between transcription and replication, and upon DNA damage. Activation of a DNA damage response pathway dependent upon the ATR kinase, a major regulator of replication fork progression, stimulates translocation of the Ddx19 protein from the cytoplasm into the nucleus. Only nuclear Ddx19 is competent to resolve R-loops, and down regulation of Ddx19 expression induces DNA double strand breaks only in proliferating cells. Overall these observations put forward Ddx19 as an important novel mediator of the crosstalk between transcription and replication.

  15. Hepatoma-derived growth factor and nucleolin exist in the same ribonucleoprotein complex

    Directory of Open Access Journals (Sweden)

    Bremer Stephanie

    2013-01-01

    Full Text Available Abstract Background Hepatoma-derived growth factor (HDGF is a protein which is highly expressed in a variety of tumours. HDGF has mitogenic, angiogenic, neurotrophic and antiapoptotic activity but the molecular mechanisms by which it exerts these activities are largely unknown nor has its biological function in tumours been elucidated. Mass spectrometry was performed to analyse the HDGFStrep-tag interactome. By Pull–down-experiments using different protein and nucleic acid constructs the interaction of HDGF and nucleolin was investigated further. Results A number of HDGFStrep-tag copurifying proteins were identified which interact with RNA or are involved in the cellular DNA repair machinery. The most abundant protein, however, copurifying with HDGF in this approach was nucleolin. Therefore we focus on the characterization of the interaction of HDGF and nucleolin in this study. We show that expression of a cytosolic variant of HDGF causes a redistribution of nucleolin into the cytoplasm. Furthermore, formation of HDGF/nucleolin complexes depends on bcl-2 mRNA. Overexpression of full length bcl-2 mRNA increases the number of HDGF/nucleolin complexes whereas expression of only the bcl-2 coding sequence abolishes interaction completely. Further examination reveals that the coding sequence of bcl-2 mRNA together with either the 5′ or 3′ UTR is sufficient for formation of HDGF/nucleolin complexes. When bcl-2 coding sequence within the full length cDNA is replaced by a sequence coding for secretory alkaline phosphatase complex formation is not enhanced. Conclusion The results provide evidence for the existence of HDGF and nucleolin containing nucleoprotein complexes which formation depends on the presence of specific mRNAs. The nature of these RNAs and other components of the complexes should be investigated in future.

  16. Cytokine and acute phase protein mRNA expression in liver tissue from pigs with severe sepsis caused by intravenous inoculation of Staphylococcus aureus

    DEFF Research Database (Denmark)

    Nielsen, Ole Lerberg; Olsen, Helle Gerda; Iburg, Tine

    2010-01-01

    elevated at 36 and 48 h. Microabscesses were found in the livers from pigs killed at 12 h only. The livers from pigs killed at 48 h also showed light, diffuse fibrin exudation (vascular leakage). Real-time PCR showed a decreased hepatic expression of mRNA coding for albumin and increased hepatic expression...... of IL-6, IL-8, IL-1β, and CRP. N o increase could be detected in the IL-1α or TNFα liver-mRNA levels. IL-6, IL-8 and IL-1β expression peaked at 24 hours (2-5 fold compared to the control group). In conclusion, the increased liver cytokine mRNA levels indicate a local hepatic, non-infectious inflammatory...

  17. Molecular cloning, sequence characterization and expression pattern of Rab18 gene from watermelon (Citrullus lanatus).

    Science.gov (United States)

    Xinli, Xiao; Lei, Peng

    2015-03-04

    The complete mRNA sequence of watermelon Rab18 gene was amplified through the rapid amplification of cDNA ends (RACE) method. The full-length mRNA was 1010 bp containing a 645 bp open reading frame, which encodes a protein of 214 amino acids. Sequence analysis revealed that watermelon Rab18 protein shares high homology with the Rab18 of cucumber (99%), muskmelon (98%), Morus notabilis (90%), tomato (89%), wine grape (89%) and potato (88%). Phylogenetic analysis revealed that watermelon Rab18 gene has a closer genetic relationship with Rab18 gene of cucumber and muskmelon. Tissue expression profile analysis indicated that watermelon Rab18 gene was highly expressed in root, stem and leaf, moderately expressed in flower and weakly expressed in fruit.

  18. Human tissue factor: cDNA sequence and chromosome localization of the gene

    International Nuclear Information System (INIS)

    Scarpati, E.M.; Wen, D.; Broze, G.J. Jr.; Miletich, J.P.; Flandermeyer, R.R.; Siegel, N.R.; Sadler, J.E.

    1987-01-01

    A human placenta cDNA library in λgt11 was screened for the expression of tissue factor antigens with rabbit polyclonal anti-human tissue factor immunoglobulin G. Among 4 million recombinant clones screened, one positive, λHTF8, expressed a protein that shared epitopes with authentic human brain tissue factor. The 1.1-kilobase cDNA insert of λHTF8 encoded a peptide that contained the amino-terminal protein sequence of human brain tissue factor. Northern blotting identified a major mRNA species of 2.2 kilobases and a minor species of ∼ 3.2 kilobases in poly(A) + RNA of placenta. Only 2.2-kilobase mRNA was detected in human brain and in the human monocytic U937 cell line. In U937 cells, the quantity of tissue factor mRNA was increased several fold by exposure of the cells to phorbol 12-myristate 13-acetate. Additional cDNA clones were selected by hybridization with the cDNA insert of λHTF8. These overlapping isolates span 2177 base pairs of the tissue factor cDNA sequence that includes a 5'-noncoding region of 75 base pairs, an open reading frame of 885 base pairs, a stop codon, a 3'-noncoding region of 1141 base pairs, and a poly(a) tail. The open reading frame encodes a 33-kilodalton protein of 295 amino acids. The predicted sequence includes a signal peptide of 32 or 34 amino acids, a probable extracellular factor VII binding domain of 217 or 219 amino acids, a transmembrane segment of 23 acids, and a cytoplasmic tail of 21 amino acids. There are three potential glycosylation sites with the sequence Asn-X-Thr/Ser. The 3'-noncoding region contains an inverted Alu family repetitive sequence. The tissue factor gene was localized to chromosome 1 by hybridization of the cDNA insert of λHTF8 to flow-sorted human chromosomes

  19. Capsid coding sequences of foot-and-mouth disease viruses are determinants of pathogenicity in pigs.

    Science.gov (United States)

    Lohse, Louise; Jackson, Terry; Bøtner, Anette; Belsham, Graham J

    2012-05-24

    The surface exposed capsid proteins, VP1, VP2 and VP3, of foot-and-mouth disease virus (FMDV) determine its antigenicity and the ability of the virus to interact with host-cell receptors. Hence, modification of these structural proteins may alter the properties of the virus.In the present study we compared the pathogenicity of different FMDVs in young pigs. In total 32 pigs, 7-weeks-old, were exposed to virus, either by direct inoculation or through contact with inoculated pigs, using cell culture adapted (O1K B64), chimeric (O1K/A-TUR and O1K/O-UKG) or field strain (O-UKG/34/2001) viruses. The O1K B64 virus and the two chimeric viruses are identical to each other except for the capsid coding region.Animals exposed to O1K B64 did not exhibit signs of disease, while pigs exposed to each of the other viruses showed typical clinical signs of foot-and-mouth disease (FMD). All pigs infected with the O1K/O-UKG chimera or the field strain (O-UKG/34/2001) developed fulminant disease. Furthermore, 3 of 4 in-contact pigs exposed to the O1K/O-UKG virus died in the acute phase of infection, likely from myocardial infection. However, in the group exposed to the O1K/A-TUR chimeric virus, only 1 pig showed symptoms of disease within the time frame of the experiment (10 days). All pigs that developed clinical disease showed a high level of viral RNA in serum and infected pigs that survived the acute phase of infection developed a serotype specific antibody response. It is concluded that the capsid coding sequences are determinants of FMDV pathogenicity in pigs.

  20. A bacterial genetic screen identifies functional coding sequences of the insect mariner transposable element Famar1 amplified from the genome of the earwig, Forficula auricularia.

    Science.gov (United States)

    Barry, Elizabeth G; Witherspoon, David J; Lampe, David J

    2004-02-01

    Transposons of the mariner family are widespread in animal genomes and have apparently infected them by horizontal transfer. Most species carry only old defective copies of particular mariner transposons that have diverged greatly from their active horizontally transferred ancestor, while a few contain young, very similar, and active copies. We report here the use of a whole-genome screen in bacteria to isolate somewhat diverged Famar1 copies from the European earwig, Forficula auricularia, that encode functional transposases. Functional and nonfunctional coding sequences of Famar1 and nonfunctional copies of Ammar1 from the European honey bee, Apis mellifera, were sequenced to examine their molecular evolution. No selection for sequence conservation was detected in any clade of a tree derived from these sequences, not even on branches leading to functional copies. This agrees with the current model for mariner transposon evolution that expects neutral evolution within particular hosts, with selection for function occurring only upon horizontal transfer to a new host. Our results further suggest that mariners are not finely tuned genetic entities and that a greater amount of sequence diversification than had previously been appreciated can occur in functional copies in a single host lineage. Finally, this method of isolating active copies can be used to isolate other novel active transposons without resorting to reconstruction of ancestral sequences.

  1. High-Resolution Analysis of Coronavirus Gene Expression by RNA Sequencing and Ribosome Profiling.

    Science.gov (United States)

    Irigoyen, Nerea; Firth, Andrew E; Jones, Joshua D; Chung, Betty Y-W; Siddell, Stuart G; Brierley, Ian

    2016-02-01

    Members of the family Coronaviridae have the largest genomes of all RNA viruses, typically in the region of 30 kilobases. Several coronaviruses, such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV) and Middle East respiratory syndrome-related coronavirus (MERS-CoV), are of medical importance, with high mortality rates and, in the case of SARS-CoV, significant pandemic potential. Other coronaviruses, such as Porcine epidemic diarrhea virus and Avian coronavirus, are important livestock pathogens. Ribosome profiling is a technique which exploits the capacity of the translating ribosome to protect around 30 nucleotides of mRNA from ribonuclease digestion. Ribosome-protected mRNA fragments are purified, subjected to deep sequencing and mapped back to the transcriptome to give a global "snap-shot" of translation. Parallel RNA sequencing allows normalization by transcript abundance. Here we apply ribosome profiling to cells infected with Murine coronavirus, mouse hepatitis virus, strain A59 (MHV-A59), a model coronavirus in the same genus as SARS-CoV and MERS-CoV. The data obtained allowed us to study the kinetics of virus transcription and translation with exquisite precision. We studied the timecourse of positive and negative-sense genomic and subgenomic viral RNA production and the relative translation efficiencies of the different virus ORFs. Virus mRNAs were not found to be translated more efficiently than host mRNAs; rather, virus translation dominates host translation at later time points due to high levels of virus transcripts. Triplet phasing of the profiling data allowed precise determination of translated reading frames and revealed several translated short open reading frames upstream of, or embedded within, known virus protein-coding regions. Ribosome pause sites were identified in the virus replicase polyprotein pp1a ORF and investigated experimentally. Contrary to expectations, ribosomes were not found to pause at the ribosomal

  2. High-Resolution Analysis of Coronavirus Gene Expression by RNA Sequencing and Ribosome Profiling.

    Directory of Open Access Journals (Sweden)

    Nerea Irigoyen

    2016-02-01

    Full Text Available Members of the family Coronaviridae have the largest genomes of all RNA viruses, typically in the region of 30 kilobases. Several coronaviruses, such as Severe acute respiratory syndrome-related coronavirus (SARS-CoV and Middle East respiratory syndrome-related coronavirus (MERS-CoV, are of medical importance, with high mortality rates and, in the case of SARS-CoV, significant pandemic potential. Other coronaviruses, such as Porcine epidemic diarrhea virus and Avian coronavirus, are important livestock pathogens. Ribosome profiling is a technique which exploits the capacity of the translating ribosome to protect around 30 nucleotides of mRNA from ribonuclease digestion. Ribosome-protected mRNA fragments are purified, subjected to deep sequencing and mapped back to the transcriptome to give a global "snap-shot" of translation. Parallel RNA sequencing allows normalization by transcript abundance. Here we apply ribosome profiling to cells infected with Murine coronavirus, mouse hepatitis virus, strain A59 (MHV-A59, a model coronavirus in the same genus as SARS-CoV and MERS-CoV. The data obtained allowed us to study the kinetics of virus transcription and translation with exquisite precision. We studied the timecourse of positive and negative-sense genomic and subgenomic viral RNA production and the relative translation efficiencies of the different virus ORFs. Virus mRNAs were not found to be translated more efficiently than host mRNAs; rather, virus translation dominates host translation at later time points due to high levels of virus transcripts. Triplet phasing of the profiling data allowed precise determination of translated reading frames and revealed several translated short open reading frames upstream of, or embedded within, known virus protein-coding regions. Ribosome pause sites were identified in the virus replicase polyprotein pp1a ORF and investigated experimentally. Contrary to expectations, ribosomes were not found to pause at the

  3. Clofibrate-induced increases in peroxisomal proteins: effect on synthesis, degradation, and mRNA activity

    International Nuclear Information System (INIS)

    Mortensen, R.M.

    1983-01-01

    The effect of clofibrate on the polypeptide composition of peroxisomes was determined. A simple method was developed for the isolation of peroxisomes with a purity of 90-95% using sedimentation in a metrizamide gradient. The specific activities of HD did not change with clofibrate treatment so that the increases in enzyme activities are solely due to increases in protein amounts. The hepatic concentration of HD increased 63 times. The HD synthesis rate, as measured by the incorporation of [ 3 H]leucine, increased 74 times, so that the increase in the synthesis was sufficient to account for the increase in protein. Clofibrate caused no discernible change in the degradation rate of HD labeled with [ 14 C]bicarbonate. The half-life of HD was approximately 2 days. The translatable mRBA coding for HD increased 55 times. This value is not significantly different from the increase in HD protein or in HD synthesis. This observation was also true for several other peroxisomal proteins. Therefore, clofibrate causes an increase in the mRNA activity, which increases the synthesis of HD leading to an accumulation of protein and enzyme activity. The kinetics of the clofibrate-induced changes in HD synthesis rate, protein level, and enzymatic activity was analyzed using a simple model which included the half-lives of the drug, mRNA, and protein. The best fit of the model to the data gave an mRNA half-life of 10 hours and a protein half-life of 1.8 days, with no significant change by clofibrate

  4. RANDNA: a random DNA sequence generator.

    Science.gov (United States)

    Piva, Francesco; Principato, Giovanni

    2006-01-01

    Monte Carlo simulations are useful to verify the significance of data. Genomic regularities, such as the nucleotide correlations or the not uniform distribution of the motifs throughout genomic or mature mRNA sequences, exist and their significance can be checked by means of the Monte Carlo test. The test needs good quality random sequences in order to work, moreover they should have the same nucleotide distribution as the sequences in which the regularities have been found. Random DNA sequences are also useful to estimate the background score of an alignment, that is a threshold below which the resulting score is merely due to chance. We have developed RANDNA, a free software which allows to produce random DNA or RNA sequences setting both their length and the percentage of nucleotide composition. Sequences having the same nucleotide distribution of exonic, intronic or intergenic sequences can be generated. Its graphic interface makes it possible to easily set the parameters that characterize the sequences being produced and saved in a text format file. The pseudo-random number generator function of Borland Delphi 6 is used, since it guarantees a good randomness, a long cycle length and a high speed. We have checked the quality of sequences generated by the software, by means of well-known tests, both by themselves and versus genuine random sequences. We show the good quality of the generated sequences. The software, complete with examples and documentation, is freely available to users from: http://www.introni.it/en/software.

  5. Sequence data and association statistics from 12,940 type 2 diabetes cases and controls

    DEFF Research Database (Denmark)

    Jason, Flannick; Fuchsberger, Christian; Mahajan, Anubha

    2017-01-01

    variants were identified, including 99% of low-frequency (minor allele frequency [MAF] 0.1-5%) non-coding variants in the whole-genome sequenced individuals and 99.7% of low-frequency coding variants in the whole-exome sequenced individuals. Each variant was tested for association with T2D in the sequenced...... individuals, and, to increase power, most were tested in larger numbers of individuals (>80% of low-frequency coding variants in ~82 K Europeans via the exome chip, and ~90% of low-frequency non-coding variants in ~44 K Europeans via genotype imputation). The variants, genotypes, and association statistics...... from these analyses provide the largest reference to date of human genetic information relevant to T2D, for use in activities such as T2D-focused genotype imputation, functional characterization of variants or genes, and other novel analyses to detect associations between sequence variation and T2D....

  6. What Information is Stored in DNA: Does it Contain Digital Error Correcting Codes?

    Science.gov (United States)

    Liebovitch, Larry

    1998-03-01

    The longest term correlations in living systems are the information stored in DNA which reflects the evolutionary history of an organism. The 4 bases (A,T,G,C) encode sequences of amino acids as well as locations of binding sites for proteins that regulate DNA. The fidelity of this important information is maintained by ANALOG error check mechanisms. When a single strand of DNA is replicated the complementary base is inserted in the new strand. Sometimes the wrong base is inserted that sticks out disrupting the phosphate backbone. The new base is not yet methylated, so repair enzymes, that slide along the DNA, can tear out the wrong base and replace it with the right one. The bases in DNA form a sequence of 4 different symbols and so the information is encoded in a DIGITAL form. All the digital codes in our society (ISBN book numbers, UPC product codes, bank account numbers, airline ticket numbers) use error checking code, where some digits are functions of other digits to maintain the fidelity of transmitted informaiton. Does DNA also utitlize a DIGITAL error chekcing code to maintain the fidelity of its information and increase the accuracy of replication? That is, are some bases in DNA functions of other bases upstream or downstream? This raises the interesting mathematical problem: How does one determine whether some symbols in a sequence of symbols are a function of other symbols. It also bears on the issue of determining algorithmic complexity: What is the function that generates the shortest algorithm for reproducing the symbol sequence. The error checking codes most used in our technology are linear block codes. We developed an efficient method to test for the presence of such codes in DNA. We coded the 4 bases as (0,1,2,3) and used Gaussian elimination, modified for modulus 4, to test if some bases are linear combinations of other bases. We used this method to analyze the base sequence in the genes from the lac operon and cytochrome C. We did not find

  7. Profiling of Long Non-coding RNAs and mRNAs by RNA-Sequencing in the Hippocampi of Adult Mice Following Propofol Sedation.

    Science.gov (United States)

    Fan, Jun; Zhou, Quan; Li, Yan; Song, Xiuling; Hu, Jijie; Qin, Zaisheng; Tang, Jing; Tao, Tao

    2018-01-01

    Propofol is a frequently used intravenous anesthetic agent. The impairment caused by propofol on the neural system, especially the hippocampus, has been widely reported. However, the molecular mechanism underlying the effects of propofol on learning and memory functions in the hippocampus is still unclear. In the present study we performed lncRNA and mRNA analysis in the hippocampi of adult mice, after propofol sedation, through RNA-Sequencing (RNA-Seq). A total of 146 differentially expressed lncRNAs and 1103 mRNAs were identified. Bioinformatics analysis, including gene ontology (GO) analysis, pathway analysis and network analysis, were done for the identified dysregulated genes. Pathway analysis indicated that the FoxO signaling pathway played an important role in the effects of propofol on the hippocampus. Finally, four lncRNAs and three proteins were selected from the FoxO-related network for further validation. The up-regulation of lncE230001N04Rik and the down-regulation of lncRP23-430H21.1 and lncB230206L02Rik showed the same fold change tendencies but changes in Gm26532 were not statistically significant in the RNA-Seq results, following propofol sedation. The FoxO pathway-related proteins, PI3K and AKT, are up-regulated in propofol-exposed group. FoxO3a is down-regulated at both mRNA and protein levels. Our study reveals that propofol sedation can influence the expression of lncRNAs and mRNAs in the hippocampus, and bioinformatics analysis have identified key biological processes and pathways associated with propofol sedation. Cumulatively, our results provide a framework for further study on the role of lncRNAs in propofol-induced or -related neurotoxicity, particularly with regards to hippocampus-related dysfunction.

  8. Profiling of Long Non-coding RNAs and mRNAs by RNA-Sequencing in the Hippocampi of Adult Mice Following Propofol Sedation

    Directory of Open Access Journals (Sweden)

    Jun Fan

    2018-03-01

    Full Text Available Propofol is a frequently used intravenous anesthetic agent. The impairment caused by propofol on the neural system, especially the hippocampus, has been widely reported. However, the molecular mechanism underlying the effects of propofol on learning and memory functions in the hippocampus is still unclear. In the present study we performed lncRNA and mRNA analysis in the hippocampi of adult mice, after propofol sedation, through RNA-Sequencing (RNA-Seq. A total of 146 differentially expressed lncRNAs and 1103 mRNAs were identified. Bioinformatics analysis, including gene ontology (GO analysis, pathway analysis and network analysis, were done for the identified dysregulated genes. Pathway analysis indicated that the FoxO signaling pathway played an important role in the effects of propofol on the hippocampus. Finally, four lncRNAs and three proteins were selected from the FoxO-related network for further validation. The up-regulation of lncE230001N04Rik and the down-regulation of lncRP23-430H21.1 and lncB230206L02Rik showed the same fold change tendencies but changes in Gm26532 were not statistically significant in the RNA-Seq results, following propofol sedation. The FoxO pathway-related proteins, PI3K and AKT, are up-regulated in propofol-exposed group. FoxO3a is down-regulated at both mRNA and protein levels. Our study reveals that propofol sedation can influence the expression of lncRNAs and mRNAs in the hippocampus, and bioinformatics analysis have identified key biological processes and pathways associated with propofol sedation. Cumulatively, our results provide a framework for further study on the role of lncRNAs in propofol-induced or -related neurotoxicity, particularly with regards to hippocampus-related dysfunction.

  9. Two duplicated chicken-type lysozyme genes in disc abalone Haliotis discus discus: molecular aspects in relevance to structure, genomic organization, mRNA expression and bacteriolytic function.

    Science.gov (United States)

    Umasuthan, Navaneethaiyer; Bathige, S D N K; Kasthuri, Saranya Revathy; Wan, Qiang; Whang, Ilson; Lee, Jehee

    2013-08-01

    Lysozymes are crucial antibacterial proteins that are associated with catalytic cleavage of peptidoglycan and subsequent bacteriolysis. The present study describes the identification of two lysozyme genes from disc abalone Haliotis discus discus and their characterization at sequence-, genomic-, transcriptional- and functional-levels. Two cDNAs and BAC clones bearing lysozyme genes were isolated from abalone transcriptome and BAC genomic libraries, respectively and sequences were determined. Corresponding deduced amino acid sequences harbored a chicken-type lysozyme (LysC) family profile and exhibited conserved characteristics of LysC family members including active residues (Glu and Asp) and GS(S/T)DYGIFQINS motif suggested that they are LysC counterparts in disc abalone and designated as abLysC1 and abLysC2. While abLysC1 represented the homolog recently reported in Ezo abalone [1], abLysC2 shared significant identity with LysC homologs. Unlike other vertebrate LysCs, coding sequence of abLysCs were distributed within five exons interrupted by four introns. Both abLysCs revealed a broader mRNA distribution with highest levels in mantle (abLysC1) and hepatopancreas (abLysC2) suggesting their likely main role in defense and digestion, respectively. Investigation of temporal transcriptional profiles post-LPS and -pathogen challenges revealed induced-responses of abLysCs in gills and hemocytes. The in vitro muramidase activity of purified recombinant (r) abLysCs proteins was evaluated, and findings indicated that they are active in acidic pH range (3.5-6.5) and over a broad temperature range (20-60 °C) and influenced by ionic strength. When the antibacterial spectra of (r)abLysCs were examined, they displayed differential activities against both Gram positive and Gram negative strains providing evidence for their involvement in bacteriolytic function in abalone physiology. Copyright © 2013 Elsevier Ltd. All rights reserved.

  10. Optimal interference code based on machine learning

    Science.gov (United States)

    Qian, Ye; Chen, Qian; Hu, Xiaobo; Cao, Ercong; Qian, Weixian; Gu, Guohua

    2016-10-01

    In this paper, we analyze the characteristics of pseudo-random code, by the case of m sequence. Depending on the description of coding theory, we introduce the jamming methods. We simulate the interference effect or probability model by the means of MATLAB to consolidate. In accordance with the length of decoding time the adversary spends, we find out the optimal formula and optimal coefficients based on machine learning, then we get the new optimal interference code. First, when it comes to the phase of recognition, this study judges the effect of interference by the way of simulating the length of time over the decoding period of laser seeker. Then, we use laser active deception jamming simulate interference process in the tracking phase in the next block. In this study we choose the method of laser active deception jamming. In order to improve the performance of the interference, this paper simulates the model by MATLAB software. We find out the least number of pulse intervals which must be received, then we can make the conclusion that the precise interval number of the laser pointer for m sequence encoding. In order to find the shortest space, we make the choice of the greatest common divisor method. Then, combining with the coding regularity that has been found before, we restore pulse interval of pseudo-random code, which has been already received. Finally, we can control the time period of laser interference, get the optimal interference code, and also increase the probability of interference as well.

  11. Long Non-Coding RNAs: A Novel Paradigm for Toxicology.

    Science.gov (United States)

    Dempsey, Joseph L; Cui, Julia Yue

    2017-01-01

    Long non-coding RNAs (lncRNAs) are over 200 nucleotides in length and are transcribed from the mammalian genome in a tissue-specific and developmentally regulated pattern. There is growing recognition that lncRNAs are novel biomarkers and/or key regulators of toxicological responses in humans and animal models. Lacking protein-coding capacity, the numerous types of lncRNAs possess a myriad of transcriptional regulatory functions that include cis and trans gene expression, transcription factor activity, chromatin remodeling, imprinting, and enhancer up-regulation. LncRNAs also influence mRNA processing, post-transcriptional regulation, and protein trafficking. Dysregulation of lncRNAs has been implicated in various human health outcomes such as various cancers, Alzheimer's disease, cardiovascular disease, autoimmune diseases, as well as intermediary metabolism such as glucose, lipid, and bile acid homeostasis. Interestingly, emerging evidence in the literature over the past five years has shown that lncRNA regulation is impacted by exposures to various chemicals such as polycyclic aromatic hydrocarbons, benzene, cadmium, chlorpyrifos-methyl, bisphenol A, phthalates, phenols, and bile acids. Recent technological advancements, including next-generation sequencing technologies and novel computational algorithms, have enabled the profiling and functional characterizations of lncRNAs on a genomic scale. In this review, we summarize the biogenesis and general biological functions of lncRNAs, highlight the important roles of lncRNAs in human diseases and especially during the toxicological responses to various xenobiotics, evaluate current methods for identifying aberrant lncRNA expression and molecular target interactions, and discuss the potential to implement these tools to address fundamental questions in toxicology. © The Author 2016. Published by Oxford University Press on behalf of the Society of Toxicology. All rights reserved. For Permissions, please e

  12. Sequence History Update Tool

    Science.gov (United States)

    Khanampompan, Teerapat; Gladden, Roy; Fisher, Forest; DelGuercio, Chris

    2008-01-01

    The Sequence History Update Tool performs Web-based sequence statistics archiving for Mars Reconnaissance Orbiter (MRO). Using a single UNIX command, the software takes advantage of sequencing conventions to automatically extract the needed statistics from multiple files. This information is then used to populate a PHP database, which is then seamlessly formatted into a dynamic Web page. This tool replaces a previous tedious and error-prone process of manually editing HTML code to construct a Web-based table. Because the tool manages all of the statistics gathering and file delivery to and from multiple data sources spread across multiple servers, there is also a considerable time and effort savings. With the use of The Sequence History Update Tool what previously took minutes is now done in less than 30 seconds, and now provides a more accurate archival record of the sequence commanding for MRO.

  13. Apple ring rot-responsive putative microRNAs revealed by high-throughput sequencing in Malus × domestica Borkh.

    Science.gov (United States)

    Yu, Xin-Yi; Du, Bei-Bei; Gao, Zhi-Hong; Zhang, Shi-Jie; Tu, Xu-Tong; Chen, Xiao-Yun; Zhang, Zhen; Qu, Shen-Chun

    2014-08-01

    MicroRNAs (miRNAs) are small non-coding RNAs, which silence target mRNA via cleavage or translational inhibition to function in regulating gene expression. MiRNAs act as important regulators of plant development and stress response. For understanding the role of miRNAs responsive to apple ring rot stress, we identified disease-responsive miRNAs using high-throughput sequencing in Malus × domestica Borkh.. Four small RNA libraries were constructed from two control strains in M. domestica, crabapple (CKHu) and Fuji Naga-fu No. 6 (CKFu), and two disease stress strains, crabapple (DSHu) and Fuji Naga-fu No. 6 (DSFu). A total of 59 miRNA families were identified and five miRNAs might be responsive to apple ring rot infection and validated via qRT-PCR. Furthermore, we predicted 76 target genes which were regulated by conserved miRNAs potentially. Our study demonstrated that miRNAs was responsive to apple ring rot infection and may have important implications on apple disease resistance.

  14. Uniform Circular Antenna Array Applications in Coded DS-CDMA Mobile Communication Systems

    National Research Council Canada - National Science Library

    Seow, Tian

    2003-01-01

    ...) has greatly increased. This thesis examines the use of an equally spaced circular adaptive antenna array at the mobile station for a typical coded direct sequence code division multiple access (DS-CDMA...

  15. Modified Three-Dimensional Multicarrier Optical Prime Codes

    Directory of Open Access Journals (Sweden)

    Rajesh Yadav

    2016-01-01

    Full Text Available We propose a mathematical model for novel three-dimensional multicarrier optical codes in terms of wavelength/time/space based on the prime sequence algorithm. The proposed model has been extensively simulated on MATLAB for prime numbers (P to analyze the performance of code in terms of autocorrelation and cross-correlation. The simulated outcome resembles the mathematical model and gives better results over other methods available in the literature as far as autocorrelation and cross-correlation are concerned. The proposed 3D optical codes are more efficient in terms of cardinality, improved security, and providing quality of services.

  16. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Science.gov (United States)

    Yea, Carmen; Cheung, Rose; Collins, Carol; Adachi, Dena; Nishikawa, John; Tellier, Raymond

    2009-01-01

    Although the human parainfluenza virus 4 (HPIV4) has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada). The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97%) with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized. PMID:21994536

  17. The Complete Sequence of a Human Parainfluenzavirus 4 Genome

    Directory of Open Access Journals (Sweden)

    Carmen Yea

    2009-06-01

    Full Text Available Although the human parainfluenza virus 4 (HPIV4 has been known for a long time, its genome, alone among the human paramyxoviruses, has not been completely sequenced to date. In this study we obtained the first complete genomic sequence of HPIV4 from a clinical isolate named SKPIV4 obtained at the Hospital for Sick Children in Toronto (Ontario, Canada. The coding regions for the N, P/V, M, F and HN proteins show very high identities (95% to 97% with previously available partial sequences for HPIV4B. The sequence for the L protein and the non-coding regions represent new information. A surprising feature of the genome is its length, more than 17 kb, making it the longest genome within the genus Rubulavirus, although the length is well within the known range of 15 kb to 19 kb for the subfamily Paramyxovirinae. The availability of a complete genomic sequence will facilitate investigations on a respiratory virus that is still not completely characterized.

  18. Whole Blood mRNA Expression-Based Prognosis of Metastatic Renal Cell Carcinoma.

    Science.gov (United States)

    Giridhar, Karthik V; Sosa, Carlos P; Hillman, David W; Sanhueza, Cristobal; Dalpiaz, Candace L; Costello, Brian A; Quevedo, Fernando J; Pitot, Henry C; Dronca, Roxana S; Ertz, Donna; Cheville, John C; Donkena, Krishna Vanaja; Kohli, Manish

    2017-11-03

    The Memorial Sloan Kettering Cancer Center (MSKCC) prognostic score is based on clinical parameters. We analyzed whole blood mRNA expression in metastatic clear cell renal cell carcinoma (mCCRCC) patients and compared it to the MSKCC score for predicting overall survival. In a discovery set of 19 patients with mRCC, we performed whole transcriptome RNA sequencing and selected eighteen candidate genes for further evaluation based on associations with overall survival and statistical significance. In an independent validation of set of 47 patients with mCCRCC, transcript expression of the 18 candidate genes were quantified using a customized NanoString probeset. Cox regression multivariate analysis confirmed that two of the candidate genes were significantly associated with overall survival. Higher expression of BAG1 [hazard ratio (HR) of 0.14, p < 0.0001, 95% confidence interval (CI) 0.04-0.36] and NOP56 (HR 0.13, p < 0.0001, 95% CI 0.05-0.34) were associated with better prognosis. A prognostic model incorporating expression of BAG1 and NOP56 into the MSKCC score improved prognostication significantly over a model using the MSKCC prognostic score only ( p < 0.0001). Prognostic value of using whole blood mRNA gene profiling in mCCRCC is feasible and should be prospectively confirmed in larger studies.

  19. Summary description of the scale modular code system

    International Nuclear Information System (INIS)

    Parks, C.V.

    1987-12-01

    SCALE - a modular code system for Standardized Computer Analyses for Licensing Evaluation - has been developed at Oak Ridge National Laboratory at the request of the US Nuclear Regulatory Commission staff. The SCALE system utilizes well-established computer codes and methods within standard analytic sequences that allow simplified free-form input, automate the data processing and coupling between codes, and provide accurate and reliable results. System development has been directed at criticality safety, shielding, and heat transfer analysis of spent fuel transport and/or storage casks. However, only a few of the sequences (and none of the individual functional modules) are restricted to cask applications. This report will provide a background on the history of the SCALE development and review the components and their function within the system. The available data libraries are also discussed, together with the automated features that standardize the data processing and systems analysis. 83 refs., 32 figs., 11 tabs

  20. ASTEC V2. Overview of code development and application at GRS

    International Nuclear Information System (INIS)

    Reinke, N.; Nowack, H.; Sonnenkalb, M.

    2011-01-01

    The integral code ASTEC (Accident Source Term Evaluation Code) commonly developed since 1996 by the French IRSN and the German GRS is a fast running programme, which allows the calculation of entire sequences of severe accidents (SA) in light water reactors from the initiating event up to the release of fission products into the environment, thereby covering all important in-vessel and containment phenomena. Thus, the main ASTEC application fields are intended to be accident sequence studies, uncertainty and sensitivity studies, probabilistic safety analysis level 2 as well as support to experiments. The modular structure of ASTEC allows running each module independently and separately, e.g. for separate effects analyses as well as a combination of multiple modules for coupled effects testing and integral analyses. Subject of this paper is an overview of the new V2 series of the ASTEC code system and presentation of exemplary results for the application to severe accidents sequences at PWRs. (orig.)

  1. Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags

    Science.gov (United States)

    de Souza, Sandro J.; Camargo, Anamaria A.; Briones, Marcelo R. S.; Costa, Fernando F.; Nagai, Maria Aparecida; Verjovski-Almeida, Sergio; Zago, Marco A.; Andrade, Luis Eduardo C.; Carrer, Helaine; El-Dorry, Hamza F. A.; Espreafico, Enilza M.; Habr-Gama, Angelita; Giannella-Neto, Daniel; Goldman, Gustavo H.; Gruber, Arthur; Hackel, Christine; Kimura, Edna T.; Maciel, Rui M. B.; Marie, Suely K. N.; Martins, Elizabeth A. L.; Nóbrega, Marina P.; Paçó-Larson, Maria Luisa; Pardini, Maria Inês M. C.; Pereira, Gonçalo G.; Pesquero, João Bosco; Rodrigues, Vanderlei; Rogatto, Silvia R.; da Silva, Ismael D. C. G.; Sogayar, Mari C.; de Fátima Sonati, Maria; Tajara, Eloiza H.; Valentini, Sandro R.; Acencio, Marcio; Alberto, Fernando L.; Amaral, Maria Elisabete J.; Aneas, Ivy; Bengtson, Mário Henrique; Carraro, Dirce M.; Carvalho, Alex F.; Carvalho, Lúcia Helena; Cerutti, Janete M.; Corrêa, Maria Lucia C.; Costa, Maria Cristina R.; Curcio, Cyntia; Gushiken, Tsieko; Ho, Paulo L.; Kimura, Elza; Leite, Luciana C. C.; Maia, Gustavo; Majumder, Paromita; Marins, Mozart; Matsukuma, Adriana; Melo, Analy S. A.; Mestriner, Carlos Alberto; Miracca, Elisabete C.; Miranda, Daniela C.; Nascimento, Ana Lucia T. O.; Nóbrega, Francisco G.; Ojopi, Élida P. B.; Pandolfi, José Rodrigo C.; Pessoa, Luciana Gilbert; Rahal, Paula; Rainho, Claudia A.; da Ro's, Nancy; de Sá, Renata G.; Sales, Magaly M.; da Silva, Neusa P.; Silva, Tereza C.; da Silva, Wilson; Simão, Daniel F.; Sousa, Josane F.; Stecconi, Daniella; Tsukumo, Fernando; Valente, Valéria; Zalcberg, Heloisa; Brentani, Ricardo R.; Reis, Luis F. L.; Dias-Neto, Emmanuel; Simpson, Andrew J. G.

    2000-01-01

    Transcribed sequences in the human genome can be identified with confidence only by alignment with sequences derived from cDNAs synthesized from naturally occurring mRNAs. We constructed a set of 250,000 cDNAs that represent partial expressed gene sequences and that are biased toward the central coding regions of the resulting transcripts. They are termed ORF expressed sequence tags (ORESTES). The 250,000 ORESTES were assembled into 81,429 contigs. Of these, 1,181 (1.45%) were found to match sequences in chromosome 22 with at least one ORESTES contig for 162 (65.6%) of the 247 known genes, for 67 (44.6%) of the 150 related genes, and for 45 of the 148 (30.4%) EST-predicted genes on this chromosome. Using a set of stringent criteria to validate our sequences, we identified a further 219 previously unannotated transcribed sequences on chromosome 22. Of these, 171 were in fact also defined by EST or full length cDNA sequences available in GenBank but not utilized in the initial annotation of the first human chromosome sequence. Thus despite representing less than 15% of all expressed human sequences in the public databases at the time of the present analysis, ORESTES sequences defined 48 transcribed sequences on chromosome 22 not defined by other sequences. All of the transcribed sequences defined by ORESTES coincided with DNA regions predicted as encoding exons by genscan. (http://genes.mit.edu/GENSCAN.html). PMID:11070084

  2. Estrogens and growth factors induce the mRNA of the 52K-pro-cathepsin-D secreted by breast cancer cells

    Energy Technology Data Exchange (ETDEWEB)

    Cavailles, V; Augereau, P; Garcia, M; Rochefort, H

    1988-03-25

    The estrogen-induced 52K protein secreted by human breast cancer cells is a lysosomal protease recently identified as a pro-cathepsin D by sequencing several cDNA clones isolated from MCF/sub 7/ cells. Using one of these clones, the authors detected, in MCF/sub 7/ cells a 2.2 kb mRNA whose level was rapidly increased 4- to 10-fold by estradiol, but not by other classes of steroids. Other mitogens, such as epidermal growth factor and insulin, also induced the 2.2 kb mRNA in a dose-dependent manner. Induction with epidermal growth factor was as rapid but was 2- to 3-fold lower than with estradiol. Antiestrogens had no effect on the 52K-cathepsin-D mRNA in MCF/sub 7/ cells, but became estrogen agonists in two antiestrogen-resistant sublines R/sub 27/ and LY2. The use of transcription and translation inhibitors and nuclear run-on experiments indicate that estradiol enhances transcription of the 52K-cathepsin-D gene in MCF/sub 7/ cells.

  3. Design of Long Period Pseudo-Random Sequences from the Addition of m -Sequences over 𝔽 p

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of m -sequences with pairwise-prime linear spans (AMPLS. Using m -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to 𝔽 2 , a signal set ( ( 2 n − 1 ( 2 m − 1 , ( 2 n + 1 ( 2 m + 1 , ( 2 ( n + 1 / 2 + 1 ( 2 ( m + 1 / 2 + 1 is constructed.

  4. Long non-coding RNAs: Mechanism of action and functional utility

    OpenAIRE

    Bhat, Shakil Ahmad; Ahmad, Syed Mudasir; Mumtaz, Peerzada Tajamul; Malik, Abrar Ahad; Dar, Mashooq Ahmad; Urwat, Uneeb; Shah, Riaz Ahmad; Ganai, Nazir Ahmad

    2016-01-01

    Recent RNA sequencing studies have revealed that most of the human genome is transcribed, but very little of the total transcriptomes has the ability to encode proteins. Long non-coding RNAs (lncRNAs) are non-coding transcripts longer than 200 nucleotides. Members of the non-coding genome include microRNA (miRNA), small regulatory RNAs and other short RNAs. Most of long non-coding RNA (lncRNAs) are poorly annotated. Recent recognition about lncRNAs highlights their effects in many biological ...

  5. [Impacts of the formula of Suoquanwan(SQW) on expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney of rat polyuria model of Yang-deficiency].

    Science.gov (United States)

    Cao, Hong-Ying; Wu, Qing-He; Huang, Ping; He, Jin-Yang

    2009-06-01

    To observe the impacts of the formula of Suoquanwan (SQW) on the expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney of rat polyuria model of Yang-deficiency. The model rats were induced by adenine (250 mg/kg) for 4 weeks, then treated respectively with SQW or dDAVP. The expression of AQP-2 mRNA and AVPR-V2 mRNA in kidney of Yang-deficiency model by realtime fluorescence quantitative PCR method were investigated. In model rats, the expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney decreased, dDAVP and SQW high dose could increased the expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney. The others had no influence on the expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney. SQW can increase the expression of AQP-2 mRNA and AVPR-V2 mRNA in the kidney of rat polyuria model of Yang-deficiency.

  6. Retrotransposons and non-protein coding RNAs

    DEFF Research Database (Denmark)

    Mourier, Tobias; Willerslev, Eske

    2009-01-01

    does not merely represent spurious transcription. We review examples of functional RNAs transcribed from retrotransposons, and address the collection of non-protein coding RNAs derived from transposable element sequences, including numerous human microRNAs and the neuronal BC RNAs. Finally, we review...

  7. The role of mRNA translation in the adaptation to hypoxia

    International Nuclear Information System (INIS)

    Koritzinsky, M.; Wouters, B.G.; Koumenis, C.

    2003-01-01

    Hypoxia commonly occurs in human tumours and is associated with a poor prognosis. We and others have shown that global mRNA translation is rapidly inhibited during hypoxia. However, some mRNAs, such as those coding for HIF-1 α and VEGF, remain efficiently translated. We therefore hypothesize that the inhibition of mRNA translation serves to promote hypoxia tolerance in two ways: i) through conservation of energy and ii) through differential gene expression involved in hypoxia adaptation. We are investigating the mechanisms responsible for the down regulation of protein synthesis during hypoxia, and how specific mRNAs maintain their ability to be translated under such conditions. Our goal is to understand the significance of these regulatory mechanisms for hypoxia tolerance in vitro and tumor growth in vivo. We have previously shown that one mechanism responsible for inhibiting protein synthesis during hypoxia is the activation of PERK, which inhibits the essential translation factor eIF2 α . Here we show that PERK-/- MEFs are not able to inhibit protein synthesis efficiently during hypoxia and are significantly less tolerant to hypoxia than wt cells. We also show that other mechanisms are important for sustained low protein synthesis during chronic hypoxia. We demonstrate that the eIF4F complex is disrupted during prolonged hypoxia, and that this is mediated by 4E-BP1 and 4E-T. eIF4F is essential for translation which is dependent upon the 5'mRNA cap-structure. These studies therefore indicate a switch from the inhibition of all translation through eIF2 α during acute hypoxia, to the inhibition of only cap-dependent translation during chronic hypoxia. This model predicts the differential induction of genes that can be translated cap-independently during chronic hypoxia, which is consistent with the observed differential translation of HIF-1 α and VEGF. The functional significance of the disruption of the eIF4F complex during hypoxia is currently being addressed

  8. Inferring microRNA regulation of mRNA with partially ordered samples of paired expression data and exogenous prediction algorithms.

    Directory of Open Access Journals (Sweden)

    Brian Godsey

    Full Text Available MicroRNAs (miRs are known to play an important role in mRNA regulation, often by binding to complementary sequences in "target" mRNAs. Recently, several methods have been developed by which existing sequence-based target predictions can be combined with miR and mRNA expression data to infer true miR-mRNA targeting relationships. It has been shown that the combination of these two approaches gives more reliable results than either by itself. While a few such algorithms give excellent results, none fully addresses expression data sets with a natural ordering of the samples. If the samples in an experiment can be ordered or partially ordered by their expected similarity to one another, such as for time-series or studies of development processes, stages, or types, (e.g. cell type, disease, growth, aging, there are unique opportunities to infer miR-mRNA interactions that may be specific to the underlying processes, and existing methods do not exploit this. We propose an algorithm which specifically addresses [partially] ordered expression data and takes advantage of sample similarities based on the ordering structure. This is done within a Bayesian framework which specifies posterior distributions and therefore statistical significance for each model parameter and latent variable. We apply our model to a previously published expression data set of paired miR and mRNA arrays in five partially ordered conditions, with biological replicates, related to multiple myeloma, and we show how considering potential orderings can improve the inference of miR-mRNA interactions, as measured by existing knowledge about the involved transcripts.

  9. Kangaroo – A pattern-matching program for biological sequences

    Directory of Open Access Journals (Sweden)

    Betel Doron

    2002-07-01

    Full Text Available Abstract Background Biologists are often interested in performing a simple database search to identify proteins or genes that contain a well-defined sequence pattern. Many databases do not provide straightforward or readily available query tools to perform simple searches, such as identifying transcription binding sites, protein motifs, or repetitive DNA sequences. However, in many cases simple pattern-matching searches can reveal a wealth of information. We present in this paper a regular expression pattern-matching tool that was used to identify short repetitive DNA sequences in human coding regions for the purpose of identifying potential mutation sites in mismatch repair deficient cells. Results Kangaroo is a web-based regular expression pattern-matching program that can search for patterns in DNA, protein, or coding region sequences in ten different organisms. The program is implemented to facilitate a wide range of queries with no restriction on the length or complexity of the query expression. The program is accessible on the web at http://bioinfo.mshri.on.ca/kangaroo/ and the source code is freely distributed at http://sourceforge.net/projects/slritools/. Conclusion A low-level simple pattern-matching application can prove to be a useful tool in many research settings. For example, Kangaroo was used to identify potential genetic targets in a human colorectal cancer variant that is characterized by a high frequency of mutations in coding regions containing mononucleotide repeats.

  10. Sequencing the GRHL3 Coding Region Reveals Rare Truncating Mutations and a Common Susceptibility Variant for Nonsyndromic Cleft Palate

    Science.gov (United States)

    Mangold, Elisabeth; Böhmer, Anne C.; Ishorst, Nina; Hoebel, Ann-Kathrin; Gültepe, Pinar; Schuenke, Hannah; Klamt, Johanna; Hofmann, Andrea; Gölz, Lina; Raff, Ruth; Tessmann, Peter; Nowak, Stefanie; Reutter, Heiko; Hemprich, Alexander; Kreusch, Thomas; Kramer, Franz-Josef; Braumann, Bert; Reich, Rudolf; Schmidt, Gül; Jäger, Andreas; Reiter, Rudolf; Brosch, Sibylle; Stavusis, Janis; Ishida, Miho; Seselgyte, Rimante; Moore, Gudrun E.; Nöthen, Markus M.; Borck, Guntram; Aldhorae, Khalid A.; Lace, Baiba; Stanier, Philip; Knapp, Michael; Ludwig, Kerstin U.

    2016-01-01

    Nonsyndromic cleft lip with/without cleft palate (nsCL/P) and nonsyndromic cleft palate only (nsCPO) are the most frequent subphenotypes of orofacial clefts. A common syndromic form of orofacial clefting is Van der Woude syndrome (VWS) where individuals have CL/P or CPO, often but not always associated with lower lip pits. Recently, ∼5% of VWS-affected individuals were identified with mutations in the grainy head-like 3 gene (GRHL3). To investigate GRHL3 in nonsyndromic clefting, we sequenced its coding region in 576 Europeans with nsCL/P and 96 with nsCPO. Most strikingly, nsCPO-affected individuals had a higher minor allele frequency for rs41268753 (0.099) than control subjects (0.049; p = 1.24 × 10−2). This association was replicated in nsCPO/control cohorts from Latvia, Yemen, and the UK (pcombined = 2.63 × 10−5; ORallelic = 2.46 [95% CI 1.6–3.7]) and reached genome-wide significance in combination with imputed data from a GWAS in nsCPO triads (p = 2.73 × 10−9). Notably, rs41268753 is not associated with nsCL/P (p = 0.45). rs41268753 encodes the highly conserved p.Thr454Met (c.1361C>T) (GERP = 5.3), which prediction programs denote as deleterious, has a CADD score of 29.6, and increases protein binding capacity in silico. Sequencing also revealed four novel truncating GRHL3 mutations including two that were de novo in four families, where all nine individuals harboring mutations had nsCPO. This is important for genetic counseling: given that VWS is rare compared to nsCPO, our data suggest that dominant GRHL3 mutations are more likely to cause nonsyndromic than syndromic CPO. Thus, with rare dominant mutations and a common risk variant in the coding region, we have identified an important contribution for GRHL3 in nsCPO. PMID:27018475

  11. Molecular cloning and construction of the coding region for human acetylcholinesterase reveals a G + C-rich attenuating structure

    International Nuclear Information System (INIS)

    Soreq, H.; Ben-Aziz, R.; Prody, C.A.; Seidman, S.; Gnatt, A.; Neville, L.; Lieman-Hurwitz, J.; Lev-Lehman, E.; Ginzberg, D.; Lapidot-Lifson, Y.; Zakut, H.

    1990-01-01

    To study the primary structure of human acetylcholinesterase and its gene expression and amplification, cDNA libraries from human tissues expressing oocyte-translatable AcChoEase mRNA were constructed and screened with labeled oligodeoxynucleotide probes. Several cDNA clones were isolated that encoded a polypeptide with ≥50% identically aligned amino acids to Torpedo AcChoEase and human butyrylcholinesterase. However, these cDNA clones were all truncated within a 300-nucleotide-long G + C-rich region with a predicted pattern of secondary structure having a high Gibbs free energy downstream from the expected 5' end of the coding region. Screening of a genomic DNA library revealed the missing 5' domain. When ligated to the cDNA and constructed into a transcription vector, this sequence encoded a synthetic mRNA translated in microinjected oocytes into catalytically active AcChoEase with marked preference for acetylthiocholine over butyrylthiocholine as a substrate, susceptibility to inhibition by the AcChoEase inhibitor BW284C51, and resistance to the AcChoEase inhibitor tetraisopropylpyrophosphoramide. Blot hybridization of genomic DNA from different individuals carrying amplified AcChoEase genes revealed variable intensities and restriction patterns with probes from the regions upstream and downstream from the predicted G + C-rich structure. Thus, the human AcChoEase gene includes a putative G + C-rich attenuator domain and is subject to structural alterations in cases of AcChoEase gene amplification

  12. Abstract feature codes: The building blocks of the implicit learning system.

    Science.gov (United States)

    Eberhardt, Katharina; Esser, Sarah; Haider, Hilde

    2017-07-01

    According to the Theory of Event Coding (TEC; Hommel, Müsseler, Aschersleben, & Prinz, 2001), action and perception are represented in a shared format in the cognitive system by means of feature codes. In implicit sequence learning research, it is still common to make a conceptual difference between independent motor and perceptual sequences. This supposedly independent learning takes place in encapsulated modules (Keele, Ivry, Mayr, Hazeltine, & Heuer 2003) that process information along single dimensions. These dimensions have remained underspecified so far. It is especially not clear whether stimulus and response characteristics are processed in separate modules. Here, we suggest that feature dimensions as they are described in the TEC should be viewed as the basic content of modules of implicit learning. This means that the modules process all stimulus and response information related to certain feature dimensions of the perceptual environment. In 3 experiments, we investigated by means of a serial reaction time task the nature of the basic units of implicit learning. As a test case, we used stimulus location sequence learning. The results show that a stimulus location sequence and a response location sequence cannot be learned without interference (Experiment 2) unless one of the sequences can be coded via an alternative, nonspatial dimension (Experiment 3). These results support the notion that spatial location is one module of the implicit learning system and, consequently, that there are no separate processing units for stimulus versus response locations. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  13. Mechanisms controlling mRNA processing and translation : decoding the regulatory layers defining gene expression through RNA sequencing

    NARCIS (Netherlands)

    Klerk, Eleonora de

    2015-01-01

    The work described in this thesis focuses on the mechanisms that give rise to alternative mRNAs and their alternative translation into proteins. Each of the described studies has been based on a specific set of high-throughput RNA sequencing technologies. An overview of the available RNA sequencing

  14. Role of a redox-based methylation switch in mRNA life cycle ( pre- & post- transcriptional maturation and protein turnover : Implications in neurological disorders

    Directory of Open Access Journals (Sweden)

    MALAV SUCHIN TRIVEDI

    2012-06-01

    Full Text Available Homeostatic synaptic scaling in response to neuronal stimulus or activation, as well as due to changes in cellular niche, is an important phenomenon for memory consolidation, retrieval, and other similar cognitive functions. Neurological disorders and cognitive disabilities in autism, Rett syndrome, schizophrenia, dementia etc., are strongly correlated to alterations in protein expression (both synaptic and cytoplasmic. This correlation suggests that efficient temporal regulation of synaptic protein expression is important for synaptic plasticity. In addition, equilibrium between mRNA processing, protein translation and protein turnover is a critical sensor/trigger for recording synaptic information, normal cognition and behavior. Thus a regulatory switch, controlling the lifespan, maturation and processing of mRNA, might influence cognition and adaptive behavior. Here, we propose a two part novel hypothesis that methylation might act as this suggested coordinating switch to critically regulate mRNA maturation at 1.The pre-transcription level, by regulating precursor-RNA (pre-RNA processing into mRNA, via other non-coding RNAs and their influence on splicing phenomenon, and 2. the post-transcription level by modulating the regulatory functions of ribonucleoproteins (RNP and RNA binding proteins (RNABP in mRNA translation, dendritic translocation as well as protein synthesis and synaptic turnover. DNA methylation changes are well recognized and highly correlated to gene expression levels as well as, learning and memory; however, RNA methylation changes are recently characterized and yet their functional implications are not established. This review article provides some insight on the intriguing consequences of changes in methylation levels on mRNA life-cycle. We also suggest that, since methylation is under the control of glutathione antioxidant levels, the redox status of neurons might be the central regulatory switch for methylation

  15. The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.

    Science.gov (United States)

    Ponce de Leon, Miguel; de Miranda, Antonio Basilio; Alvarez-Valin, Fernando; Carels, Nicolas

    2014-01-01

    For this report, we analyzed protein secondary structures in relation to the statistics of three nucleotide codon positions. The purpose of this investigation was to find which properties of the ribosome, tRNA or protein level, could explain the purine bias (Rrr) as it is observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on ribosomal machinery. The physicochemical constraints on proteins mainly come from the hydropathy and molecular weight (MW) of secondary structures as well as the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations, such as (i) the cost of amino acid synthesis, which is in favor of a higher level of guanine in the first codon position, (ii) the constructive contribution of hydropathy alternation in proteins, (iii) the spatial organization of secondary structure in proteins according to solvent accessibility, (iv) the spatial organization of secondary structure according to amino acid hydropathy, (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy, (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis, and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of secondary protein structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within the coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA by selection according to the codon positions. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional

  16. ARMOUR – A Rice miRNA: mRNA Interaction Resource

    Directory of Open Access Journals (Sweden)

    Neeti Sanan-Mishra

    2018-05-01

    Full Text Available ARMOUR was developed as ARice miRNA:mRNA interaction resource. This informative and interactive database includes the experimentally validated expression profiles of miRNAs under different developmental and abiotic stress conditions across seven Indian rice cultivars. This comprehensive database covers 689 known and 1664 predicted novel miRNAs and their expression profiles in more than 38 different tissues or conditions along with their predicted/known target transcripts. The understanding of miRNA:mRNA interactome in regulation of functional cellular machinery is supported by the sequence information of the mature and hairpin structures. ARMOUR provides flexibility to users in querying the database using multiple ways like known gene identifiers, gene ontology identifiers, KEGG identifiers and also allows on the fly fold change analysis and sequence search query with inbuilt BLAST algorithm. ARMOUR database provides a cohesive platform for novel and mature miRNAs and their expression in different experimental conditions and allows searching for their interacting mRNA targets, GO annotation and their involvement in various biological pathways. The ARMOUR database includes a provision for adding more experimental data from users, with an aim to develop it as a platform for sharing and comparing experimental data contributed by research groups working on rice.

  17. Evidence for a novel coding sequence overlapping the 5'-terminal ~90 codons of the Gill-associated and Yellow head okavirus envelope glycoprotein gene

    Directory of Open Access Journals (Sweden)

    Atkins John F

    2009-12-01

    Full Text Available Abstract The genus Okavirus (order Nidovirales includes a number of viruses that infect crustaceans, causing major losses in the shrimp industry. These viruses have a linear positive-sense ssRNA genome of ~26-27 kb, encoding a large replicase polyprotein that is expressed from the genomic RNA, and several additional proteins that are expressed from a nested set of 3'-coterminal subgenomic RNAs. In this brief report, we describe the bioinformatic discovery of a new, apparently coding, ORF that overlaps the 5' end of the envelope glycoprotein encoding sequence, ORF3, in the +2 reading frame. The new ORF has a strong coding signature and, in fact, is more conserved at the amino acid level than the overlapping region of ORF3. We propose that translation of the new ORF initiates at a conserved AUG codon separated by just 2 nt from the ORF3 AUG initiation codon, resulting in a novel 86 amino acid protein.

  18. Exome sequencing and genetic testing for MODY.

    Directory of Open Access Journals (Sweden)

    Stefan Johansson

    Full Text Available Genetic testing for monogenic diabetes is important for patient care. Given the extensive genetic and clinical heterogeneity of diabetes, exome sequencing might provide additional diagnostic potential when standard Sanger sequencing-based diagnostics is inconclusive.The aim of the study was to examine the performance of exome sequencing for a molecular diagnosis of MODY in patients who have undergone conventional diagnostic sequencing of candidate genes with negative results.We performed exome enrichment followed by high-throughput sequencing in nine patients with suspected MODY. They were Sanger sequencing-negative for mutations in the HNF1A, HNF4A, GCK, HNF1B and INS genes. We excluded common, non-coding and synonymous gene variants, and performed in-depth analysis on filtered sequence variants in a pre-defined set of 111 genes implicated in glucose metabolism.On average, we obtained 45 X median coverage of the entire targeted exome and found 199 rare coding variants per individual. We identified 0-4 rare non-synonymous and nonsense variants per individual in our a priori list of 111 candidate genes. Three of the variants were considered pathogenic (in ABCC8, HNF4A and PPARG, respectively, thus exome sequencing led to a genetic diagnosis in at least three of the nine patients. Approximately 91% of known heterozygous SNPs in the target exomes were detected, but we also found low coverage in some key diabetes genes using our current exome sequencing approach. Novel variants in the genes ARAP1, GLIS3, MADD, NOTCH2 and WFS1 need further investigation to reveal their possible role in diabetes.Our results demonstrate that exome sequencing can improve molecular diagnostics of MODY when used as a complement to Sanger sequencing. However, improvements will be needed, especially concerning coverage, before the full potential of exome sequencing can be realized.

  19. mRNA Traffic Control Reviewed: N6-Methyladenosine (m6 A) Takes the Driver's Seat.

    Science.gov (United States)

    Visvanathan, Abhirami; Somasundaram, Kumaravel

    2018-01-01

    Messenger RNA is a flexible tool box that plays a key role in the dynamic regulation of gene expression. RNA modifications variegate the message conveyed by the mRNA. Similar to DNA and histone modifications, mRNA modifications are reversible and play a key role in the regulation of molecular events. Our understanding about the landscape of RNA modifications is still rudimentary in contrast to DNA and histone modifications. The major obstacle has been the lack of sensitive detection methods since they are non-editing events. However, with the advent of next-generation sequencing techniques, RNA modifications are being identified precisely at single nucleotide resolution. In recent years, methylation at the N6 position of adenine (m 6 A) has gained the attention of RNA biologists. The m 6 A modification has a set of writers (methylases), erasers (demethylases), and readers. Here, we provide a summary of interesting facts, conflicting findings, and recent advances in the technical and functional aspects of the m 6 A epitranscriptome. © 2017 WILEY Periodicals, Inc.

  20. The rhesus macaque is three times as diverse but more closely equivalent in damaging coding variation as compared to the human

    Directory of Open Access Journals (Sweden)

    Yuan Qiaoping

    2012-06-01

    Full Text Available Abstract Background As a model organism in biomedicine, the rhesus macaque (Macaca mulatta is the most widely used nonhuman primate. Although a draft genome sequence was completed in 2007, there has been no systematic genome-wide comparison of genetic variation of this species to humans. Comparative analysis of functional and nonfunctional diversity in this highly abundant and adaptable non-human primate could inform its use as a model for human biology, and could reveal how variation in population history and size alters patterns and levels of sequence variation in primates. Results We sequenced the mRNA transcriptome and H3K4me3-marked DNA regions in hippocampus from 14 humans and 14 rhesus macaques. Using equivalent methodology and sampling spaces, we identified 462,802 macaque SNPs, most of which were novel and disproportionately located in the functionally important genomic regions we had targeted in the sequencing. At least one SNP was identified in each of 16,797 annotated macaque genes. Accuracy of macaque SNP identification was conservatively estimated to be >90%. Comparative analyses using SNPs equivalently identified in the two species revealed that rhesus macaque has approximately three times higher SNP density and average nucleotide diversity as compared to the human. Based on this level of diversity, the effective population size of the rhesus macaque is approximately 80,000 which contrasts with an effective population size of less than 10,000 for humans. Across five categories of genomic regions, intergenic regions had the highest SNP density and average nucleotide diversity and CDS (coding sequences the lowest, in both humans and macaques. Although there are more coding SNPs (cSNPs per individual in macaques than in humans, the ratio of dN/dS is significantly lower in the macaque. Furthermore, the number of damaging nonsynonymous cSNPs (have damaging effects on protein functions from PolyPhen-2 prediction in the macaque is more