WorldWideScience

Sample records for short sequence elements

  1. CORE-SINEs: eukaryotic short interspersed retroposing elements with common sequence motifs.

    Science.gov (United States)

    Gilbert, N; Labuda, D

    1999-03-16

    A 65-bp "core" sequence is dispersed in hundreds of thousands copies in the human genome. This sequence was found to constitute the central segment of a group of short interspersed elements (SINEs), referred to as mammalian-wide interspersed repeats, that proliferated before the radiation of placental mammals. Here, we propose that the core identifies an ancient tRNA-like SINE element, which survived in different lineages such as mammals, reptiles, birds, and fish, as well as mollusks, presumably for >550 million years. This element gave rise to a number of sequence families (CORE-SINEs), including mammalian-wide interspersed repeats, whose distinct 3' ends are shared with different families of long interspersed elements (LINEs). The evolutionary success of the generic CORE-SINE element can be related to the recruitment of the internal promoter from highly transcribed host RNA as well as to its capacity to adapt to changing retropositional opportunities by sequence exchange with actively amplifying LINEs. It reinforces the notion that the very existence of SINEs depends on the cohabitation with both LINEs and the host genome.

  2. Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    Science.gov (United States)

    Kanhayuwa, Lakkhana; Coutts, Robert H A

    2016-01-01

    Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  3. Short Interspersed Nuclear Element (SINE Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

    Directory of Open Access Journals (Sweden)

    Lakkhana Kanhayuwa

    Full Text Available Novel families of short interspersed nuclear element (SINE sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.

  4. Prediction and phylogenetic analysis of mammalian short interspersed elements (SINEs).

    Science.gov (United States)

    Rogozin, I B; Mayorov, V I; Lavrentieva, M V; Milanesi, L; Adkison, L R

    2000-09-01

    The presence of repetitive elements can create serious problems for sequence analysis, especially in the case of homology searches in nucleotide sequence databases. Repetitive elements should be treated carefully by using special programs and databases. In this paper, various aspects of SINE (short interspersed repetitive element) identification, analysis and evolution are discussed.

  5. Characterization of short interspersed elements (SINEs) in a red alga, Porphyra yezoensis.

    Science.gov (United States)

    Zhang, Wenbo; Lin, Xiaofei; Peddigari, Suresh; Takechi, Katsuaki; Takano, Hiroyoshi; Takio, Susumu

    2007-02-01

    Short interspersed element (SINE)-like sequences referred to as PySN1 and PySN2 were identified in a red alga, Porphyra yezoensis. Both elements contained an internal promoter with motifs (A box and B box) recognized by RNA polymerase III, and target site duplications at both ends. Genomic Southern blot analysis revealed that both elements were widely and abundantly distributed on the genome. 3' and 5' RACE suggested that PySN1 was expressed as a chimera transcript with flanking SINE-unrelated sequences and possessed the poly-A tail at the same position near the 3' end of PySN1.

  6. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  7. Gene conversion as a secondary mechanism of short interspersed element (SINE) evolution

    Energy Technology Data Exchange (ETDEWEB)

    Kass, D.H. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology; Batzer, M.A. [Lawrence Livermore National Lab., CA (United States); Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States). Dept. of Biochemistry and Molecular Biology]|[Alton Ochsner Medical Foundation, New Orleans, LA (United States). Lab. of Molecular Genetics

    1995-01-01

    The Alu repetitive family of short interspersed elements (SINEs) in primates can be subdivided into distinct subfamilies by specific diagnostic nucleotide changes. The older subfamilies are generally very abundant, while the younger subfamilies have fewer copies. Some of the youngest Alu elements are absent in the orthologous loci of nonhuman primates, indicative of recent retroposition events, the primary mode of SINE evolutions. PCR analysis of one young Alu subfamily (Sb2) member found in the low-density lipoprotein receptor gene apparently revealed the presence of this element in the green monkey, orangutan, gorilla, and chimpanzee genomes, as well as the human genome. However, sequence analysis of these genomes revealed a highly mutated, older, primate-specific Alu element was present at this position in the nonhuman primates. Comparison of the flanking DNA sequences upstream of this Alu insertion corresponded to evolution expected for standard primate phylogeny, but comparison of the Alu repeat sequences revealed that the human element departed from this phylogeny. The change in the human sequence apparently occurred by a gene conversion event only within the Alu element itself, converting it from one of the oldest to one of the youngest Alu subfamilies. Although gene conversions of Alu elements are clearly very rare, this finding shows that such events can occur and contribute to specific cases of SINE subfamily evolution.

  8. BLEACHING EUCALYPTUS PULPS WITH SHORT SEQUENCES

    Directory of Open Access Journals (Sweden)

    Flaviana Reis Milagres

    2011-03-01

    Full Text Available Eucalyptus spp kraft pulp, due to its high content of hexenuronic acids, is quite easy to bleach. Therefore, investigations have been made attempting to decrease the number of stages in the bleaching process in order to minimize capital costs. This study focused on the evaluation of short ECF (Elemental Chlorine Free and TCF (Totally Chlorine Free sequences for bleaching oxygen delignified Eucalyptus spp kraft pulp to 90% ISO brightness: PMoDP (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, PMoD/P (Molybdenum catalyzed acid peroxide, chlorine dioxide and hydrogen peroxide, without washing PMoD(PO (Molybdenum catalyzed acid peroxide, chlorine dioxide and pressurized peroxide, D(EPODP (chlorine dioxide, extraction oxidative with oxygen and peroxide, chlorine dioxide and hydrogen peroxide, PMoQ(PO (Molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide, and XPMoQ(PO (Enzyme, molybdenum catalyzed acid peroxide, DTPA and pressurized peroxide. Uncommon pulp treatments, such as molybdenum catalyzed acid peroxide (PMo and xylanase (X bleaching stages, were used. Among the ECF alternatives, the two-stage PMoD/P sequence proved highly cost-effective without affecting pulp quality in relation to the traditional D(EPODP sequence and produced better quality effluent in relation to the reference. However, a four stage sequence, XPMoQ(PO, was required to achieve full brightness using the TCF technology. This sequence was highly cost-effective although it only produced pulp of acceptable quality.

  9. Short interspersed elements (SINEs) of the Geomyoidea superfamily rodents.

    Science.gov (United States)

    Gogolevsky, Konstantin P; Kramerov, Dmitri A

    2006-05-24

    A new short interspersed element (SINE) was isolated from the genome of desert kangaroo rat (Dipodomys deserti) using single-primer PCR. This SINE consists of two monomers: the left monomer (IDL) resembles rodent ID element and other tRNAAla(CGC)-derived SINEs, whereas the right one (Geo) shows no similarity with known SINE sequences. PCR and hybridization analyses demonstrated that IDL-Geo SINE is restricted to the rodent superfamily Geomyoidea (families Geomyidea and Heteromyidea). Isolation and analysis of IDL-Geo from California pocket mouse (Chaetodipus californicus) and Botta's pocket gopher (Thomomys bottae) revealed some species-specific features of this SINE family. The structure and evolution of known dimeric SINEs are discussed.

  10. Short read sequence typing (SRST: multi-locus sequence types from short reads

    Directory of Open Access Journals (Sweden)

    Inouye Michael

    2012-07-01

    Full Text Available Abstract Background Multi-locus sequence typing (MLST has become the gold standard for population analyses of bacterial pathogens. This method focuses on the sequences of a small number of loci (usually seven to divide the population and is simple, robust and facilitates comparison of results between laboratories and over time. Over the last decade, researchers and population health specialists have invested substantial effort in building up public MLST databases for nearly 100 different bacterial species, and these databases contain a wealth of important information linked to MLST sequence types such as time and place of isolation, host or niche, serotype and even clinical or drug resistance profiles. Recent advances in sequencing technology mean it is increasingly feasible to perform bacterial population analysis at the whole genome level. This offers massive gains in resolving power and genetic profiling compared to MLST, and will eventually replace MLST for bacterial typing and population analysis. However given the wealth of data currently available in MLST databases, it is crucial to maintain backwards compatibility with MLST schemes so that new genome analyses can be understood in their proper historical context. Results We present a software tool, SRST, for quick and accurate retrieval of sequence types from short read sets, using inputs easily downloaded from public databases. SRST uses read mapping and an allele assignment score incorporating sequence coverage and variability, to determine the most likely allele at each MLST locus. Analysis of over 3,500 loci in more than 500 publicly accessible Illumina read sets showed SRST to be highly accurate at allele assignment. SRST output is compatible with common analysis tools such as eBURST, Clonal Frame or PhyloViz, allowing easy comparison between novel genome data and MLST data. Alignment, fastq and pileup files can also be generated for novel alleles. Conclusions SRST is a novel

  11. RUDI, a short interspersed element of the V-SINE superfamily widespread in molluscan genomes.

    Science.gov (United States)

    Luchetti, Andrea; Šatović, Eva; Mantovani, Barbara; Plohl, Miroslav

    2016-06-01

    Short interspersed elements (SINEs) are non-autonomous retrotransposons that are widespread in eukaryotic genomes. They exhibit a chimeric sequence structure consisting of a small RNA-related head, an anonymous body and an AT-rich tail. Although their turnover and de novo emergence is rapid, some SINE elements found in distantly related species retain similarity in certain core segments (or highly conserved domains, HCD). We have characterized a new SINE element named RUDI in the bivalve molluscs Ruditapes decussatus and R. philippinarum and found this element to be widely distributed in the genomes of a number of mollusc species. An unexpected structural feature of RUDI is the HCD domain type V, which was first found in non-amniote vertebrate SINEs and in the SINE from one cnidarian species. In addition to the V domain, the overall sequence conservation pattern of RUDI elements resembles that found in ancient AmnSINE (~310 Myr old) and Au SINE (~320 Myr old) families, suggesting that RUDI might be among the most ancient SINE families. Sequence conservation suggests a monophyletic origin of RUDI. Nucleotide variability and phylogenetic analyses suggest long-term vertical inheritance combined with at least one horizontal transfer event as the most parsimonious explanation for the observed taxonomic distribution.

  12. In situ detection of a heat-shock regulatory element binding protein using a soluble short synthetic enhancer sequence

    Energy Technology Data Exchange (ETDEWEB)

    Harel-Bellan, A; Brini, A T; Farrar, W L [National Cancer Institute, Frederick, MD (USA); Ferris, D K [Program Resources, Inc., Frederick, MD (USA); Robin, P [Institut Gustave Roussy, Villejuif (France)

    1989-06-12

    In various studies, enhancer binding proteins have been successfully absorbed out by competing sequences inserted into plasmids, resulting in the inhibition of the plasmid expression. Theoretically, such a result could be achieved using synthetic enhancer sequences not inserted into plasmids. In this study, a double stranded DNA sequence corresponding to the human heat shock regulatory element was chemically synthesized. By in vitro retardation assays, the synthetic sequence was shown to bind specifically a protein in extracts from the human T cell line Jurkat. When the synthetic enhancer was electroporated into Jurkat cells, not only the enhancer was shown to remain undegraded into the cells for up to 2 days, but also its was shown to bind intracellularly a protein. The binding was specific and was modulated upon heat shock. Furthermore, the binding protein was shown to be of the expected molecular weight by UV crosslinking. However, when the synthetic enhancer element was co-electroporated with an HSP 70-CAT reporter construct, the expression of the reporter plasmid was consistently enhanced in the presence of the exogenous synthetic enhancer.

  13. SAAS: Short Amino Acid Sequence - A Promising Protein Secondary Structure Prediction Method of Single Sequence

    Directory of Open Access Journals (Sweden)

    Zhou Yuan Wu

    2013-07-01

    Full Text Available In statistical methods of predicting protein secondary structure, many researchers focus on single amino acid frequencies in α-helices, β-sheets, and so on, or the impact near amino acids on an amino acid forming a secondary structure. But the paper considers a short sequence of amino acids (3, 4, 5 or 6 amino acids as integer, and statistics short sequence's probability forming secondary structure. Also, many researchers select low homologous sequences as statistical database. But this paper select whole PDB database. In this paper we propose a strategy to predict protein secondary structure using simple statistical method. Numerical computation shows that, short amino acids sequence as integer to statistics, which can easy see trend of short sequence forming secondary structure, and it will work well to select large statistical database (whole PDB database without considering homologous, and Q3 accuracy is ca. 74% using this paper proposed simple statistical method, but accuracy of others statistical methods is less than 70%.

  14. Sequence composition and gene content of the short arm of rye (Secale cereale chromosome 1.

    Directory of Open Access Journals (Sweden)

    Silvia Fluch

    Full Text Available BACKGROUND: The purpose of the study is to elucidate the sequence composition of the short arm of rye chromosome 1 (Secale cereale with special focus on its gene content, because this portion of the rye genome is an integrated part of several hundreds of bread wheat varieties worldwide. METHODOLOGY/PRINCIPAL FINDINGS: Multiple Displacement Amplification of 1RS DNA, obtained from flow sorted 1RS chromosomes, using 1RS ditelosomic wheat-rye addition line, and subsequent Roche 454FLX sequencing of this DNA yielded 195,313,589 bp sequence information. This quantity of sequence information resulted in 0.43× sequence coverage of the 1RS chromosome arm, permitting the identification of genes with estimated probability of 95%. A detailed analysis revealed that more than 5% of the 1RS sequence consisted of gene space, identifying at least 3,121 gene loci representing 1,882 different gene functions. Repetitive elements comprised about 72% of the 1RS sequence, Gypsy/Sabrina (13.3% being the most abundant. More than four thousand simple sequence repeat (SSR sites mostly located in gene related sequence reads were identified for possible marker development. The existence of chloroplast insertions in 1RS has been verified by identifying chimeric chloroplast-genomic sequence reads. Synteny analysis of 1RS to the full genomes of Oryza sativa and Brachypodium distachyon revealed that about half of the genes of 1RS correspond to the distal end of the short arm of rice chromosome 5 and the proximal region of the long arm of Brachypodium distachyon chromosome 2. Comparison of the gene content of 1RS to 1HS barley chromosome arm revealed high conservation of genes related to chromosome 5 of rice. CONCLUSIONS: The present study revealed the gene content and potential gene functions on this chromosome arm and demonstrated numerous sequence elements like SSRs and gene-related sequences, which can be utilised for future research as well as in breeding of wheat and rye.

  15. A New Approach to Sequence Analysis Exemplified by Identification of cis-Elements in Abscisic Acid Inducible Promoters

    DEFF Research Database (Denmark)

    Busk, Peter Kamp; Hallin, Peter Fischer; Salomon, Jesper

    -regulatory elements. We have developed a method for identifying short, conserved motifs in biological sequences such as proteins, DNA and RNA5. This method was used for analysis of approximately 2000 Arabidopsis thaliana promoters that have been shown by DNA array analysis to be induced by abscisic acid6....... These promoters were compared to 28000 promoters that are not induced by abscisic acid. The analysis identified previously described ABA-inducible promoter elements such as ABRE, CE3 and CRT1 but also new cis-elements were found. Furthermore, the list of DNA elements could be used to predict ABA...

  16. Molecular characterization, genomic distribution and evolutionary dynamics of Short INterspersed Elements in the termite genome.

    Science.gov (United States)

    Luchetti, Andrea; Mantovani, Barbara

    2011-02-01

    Short INterspersed Elements (SINEs) in invertebrates, and especially in animal inbred genomes such that of termites, are poorly known; in this paper we characterize three new SINE families (Talub, Taluc and Talud) through the analyses of 341 sequences, either isolated from the Reticulitermes lucifugus genome or drawn from EST Genbank collection. We further add new data to the only isopteran element known so far, Talua. These SINEs are tRNA-derived elements, with an average length ranging from 258 to 372 bp. The tails are made up by poly(A) or microsatellite motifs. Their copy number varies from 7.9 × 10(3) to 10(5) copies, well within the range observed for other metazoan genomes. Species distribution, age and target site duplication analysis indicate Talud as the oldest, possibly inactive SINE originated before the onset of Isoptera (~150 Myr ago). Taluc underwent to substantial sequence changes throughout the evolution of termites and data suggest it was silenced and then re-activated in the R. lucifugus lineage. Moreover, Taluc shares a conserved sequence block with other unrelated SINEs, as observed for some vertebrate and cephalopod elements. The study of genomic environment showed that insertions are mainly surrounded by microsatellites and other SINEs, indicating a biased accumulation within non-coding regions. The evolutionary dynamics of Talu~ elements is explained through selective mechanisms acting in an inbred genome; in this respect, the study of termites' SINEs activity may provide an interesting framework to address the (co)evolution of mobile elements and the host genome.

  17. Close Sequence Comparisons are Sufficient to Identify Humancis-Regulatory Elements

    Energy Technology Data Exchange (ETDEWEB)

    Prabhakar, Shyam; Poulin, Francis; Shoukry, Malak; Afzal, Veena; Rubin, Edward M.; Couronne, Olivier; Pennacchio, Len A.

    2005-12-01

    Cross-species DNA sequence comparison is the primary method used to identify functional noncoding elements in human and other large genomes. However, little is known about the relative merits of evolutionarily close and distant sequence comparisons, due to the lack of a universal metric for sequence conservation, and also the paucity of empirically defined benchmark sets of cis-regulatory elements. To address this problem, we developed a general-purpose algorithm (Gumby) that detects slowly-evolving regions in primate, mammalian and more distant comparisons without requiring adjustment of parameters, and ranks conserved elements by P-value using Karlin-Altschul statistics. We benchmarked Gumby predictions against previously identified cis-regulatory elements at diverse genomic loci, and also tested numerous extremely conserved human-rodent sequences for transcriptional enhancer activity using reporter-gene assays in transgenic mice. Human regulatory elements were identified with acceptable sensitivity and specificity by comparison with 1-5 other eutherian mammals or 6 other simian primates. More distant comparisons (marsupial, avian, amphibian and fish) failed to identify many of the empirically defined functional noncoding elements. We derived an intuitive relationship between ancient and recent noncoding sequence conservation from whole genome comparative analysis, which explains some of these findings. Lastly, we determined that, in addition to strength of conservation, genomic location and/or density of surrounding conserved elements must also be considered in selecting candidate enhancers for testing at embryonic time points.

  18. Epigenetic regulation of transcription and possible functions of mammalian short interspersed elements, SINEs.

    Science.gov (United States)

    Ichiyanagi, Kenji

    2013-01-01

    Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, the SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms were originated as a host defense system against proliferation of parasitic elements, this review discusses a possibility that the same mechanisms are also used to regulate the SINE-derived functions.

  19. Targeted assembly of short sequence reads.

    Directory of Open Access Journals (Sweden)

    René L Warren

    Full Text Available As next-generation sequence (NGS production continues to increase, analysis is becoming a significant bottleneck. However, in situations where information is required only for specific sequence variants, it is not necessary to assemble or align whole genome data sets in their entirety. Rather, NGS data sets can be mined for the presence of sequence variants of interest by localized assembly, which is a faster, easier, and more accurate approach. We present TASR, a streamlined assembler that interrogates very large NGS data sets for the presence of specific variants by only considering reads within the sequence space of input target sequences provided by the user. The NGS data set is searched for reads with an exact match to all possible short words within the target sequence, and these reads are then assembled stringently to generate a consensus of the target and flanking sequence. Typically, variants of a particular locus are provided as different target sequences, and the presence of the variant in the data set being interrogated is revealed by a successful assembly outcome. However, TASR can also be used to find unknown sequences that flank a given target. We demonstrate that TASR has utility in finding or confirming genomic mutations, polymorphisms, fusions and integration events. Targeted assembly is a powerful method for interrogating large data sets for the presence of sequence variants of interest. TASR is a fast, flexible and easy to use tool for targeted assembly.

  20. SINE_scan: an efficient tool to discover short interspersed nuclear elements (SINEs) in large-scale genomic datasets.

    Science.gov (United States)

    Mao, Hongliang; Wang, Hao

    2017-03-01

    Short Interspersed Nuclear Elements (SINEs) are transposable elements (TEs) that amplify through a copy-and-paste mode via RNA intermediates. The computational identification of new SINEs are challenging because of their weak structural signals and rapid diversification in sequences. Here we report SINE_Scan, a highly efficient program to predict SINE elements in genomic DNA sequences. SINE_Scan integrates hallmark of SINE transposition, copy number and structural signals to identify a SINE element. SINE_Scan outperforms the previously published de novo SINE discovery program. It shows high sensitivity and specificity in 19 plant and animal genome assemblies, of which sizes vary from 120 Mb to 3.5 Gb. It identifies numerous new families and substantially increases the estimation of the abundance of SINEs in these genomes. The code of SINE_Scan is freely available at http://github.com/maohlzj/SINE_Scan , implemented in PERL and supported on Linux. wangh8@fudan.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  1. Identification and characterisation of Short Interspersed Nuclear Elements in the olive tree (Olea europaea L.) genome.

    Science.gov (United States)

    Barghini, Elena; Mascagni, Flavia; Natali, Lucia; Giordani, Tommaso; Cavallini, Andrea

    2017-02-01

    Short Interspersed Nuclear Elements (SINEs) are nonautonomous retrotransposons in the genome of most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, SINE identification has been carried out only in a limited number of plant species. This lack of information is apparent especially in non-model plants whose genome has not been sequenced yet. The aim of this work was to produce a specific bioinformatics pipeline for analysing second generation sequence reads of a non-model species and identifying SINEs. We have identified, for the first time, 227 putative SINEs of the olive tree (Olea europaea), that constitute one of the few sets of such sequences in dicotyledonous species. The identified SINEs ranged from 140 to 362 bp in length and were characterised with regard to the occurrence of the tRNA domain in their sequence. The majority of identified elements resulted in single copy or very lowly repeated, often in association with genic sequences. Analysis of sequence similarity allowed us to identify two major groups of SINEs showing different abundances in the olive tree genome, the former with sequence similarity to SINEs of Scrophulariaceae and Solanaceae and the latter to SINEs of Salicaceae. A comparison of sequence conservation between olive SINEs and LTR retrotransposon families suggested that SINE expansion in the genome occurred especially in very ancient times, before LTR retrotransposon expansion, and presumably before the separation of the rosids (to which Oleaceae belong) from the Asterids. Besides providing data on olive SINEs, our results demonstrate the suitability of the pipeline employed for SINE identification. Applying this pipeline will favour further structural and functional analyses on these relatively unknown elements to be performed also in other plant species, even in the absence of a reference genome, and will allow establishing general evolutionary patterns for this kind of repeats in

  2. Enrichment of short interspersed transposable elements to embryonic stem cell-specific hypomethylated gene regions.

    Science.gov (United States)

    Muramoto, Hiroki; Yagi, Shintaro; Hirabayashi, Keiji; Sato, Shinya; Ohgane, Jun; Tanaka, Satoshi; Shiota, Kunio

    2010-08-01

    Embryonic stem cells (ESCs) have a distinctive epigenome, which includes their genome-wide DNA methylation modification status, as represented by the ESC-specific hypomethylation of tissue-dependent and differentially methylated regions (T-DMRs) of Pou5f1 and Nanog. Here, we conducted a genome-wide investigation of sequence characteristics associated with T-DMRs that were differentially methylated between ESCs and somatic cells, by focusing on transposable elements including short interspersed elements (SINEs), long interspersed elements (LINEs) and long terminal repeats (LTRs). We found that hypomethylated T-DMRs were predominantly present in SINE-rich/LINE-poor genomic loci. The enrichment for SINEs spread over 300 kb in cis and there existed SINE-rich genomic domains spreading continuously over 1 Mb, which contained multiple hypomethylated T-DMRs. The characterization of sequence information showed that the enriched SINEs were relatively CpG rich and belonged to specific subfamilies. A subset of the enriched SINEs were hypomethylated T-DMRs in ESCs at Dppa3 gene locus, although SINEs are overall methylated in both ESCs and the liver. In conclusion, we propose that SINE enrichment is the genomic property of regions harboring hypomethylated T-DMRs in ESCs, which is a novel aspect of the ESC-specific epigenomic information.

  3. A short TE gradient-echo sequence using asymmetric sampling

    International Nuclear Information System (INIS)

    Fujita, Norihiko; Harada, Kohshi; Sakurai, Kosuke; Nakanishi, Katsuyuki; Kim, Shyogen; Kozuka, Takahiro

    1990-01-01

    We have developed a gradient-echo pulse sequence with a short TE less than 4 msec using a data set of asymmetric off-center sampling with a broad bandwidth. The use of such a short TE significantly reduces T 2 * dephasing effect even in a two-dimensional mode, and by collecting an off-center echo, motion-induced phase dispersion is also considerably decreased. High immunity of this sequence to these dephasing effects permits clear visualization of anatomical details near the skull base where large local field inhomogeneities and rapid blood flow such as in the internal carotid artery are present. (author)

  4. De novo assembly of human genomes with massively parallel short read sequencing

    DEFF Research Database (Denmark)

    Li, Ruiqiang; Zhu, Hongmei; Ruan, Jue

    2010-01-01

    genomes from short read sequences. We successfully assembled both the Asian and African human genome sequences, achieving an N50 contig size of 7.4 and 5.9 kilobases (kb) and scaffold of 446.3 and 61.9 kb, respectively. The development of this de novo short read assembly method creates new opportunities...... for building reference sequences and carrying out accurate analyses of unexplored genomes in a cost-effective way....

  5. ISRNA: an integrative online toolkit for short reads from high-throughput sequencing data.

    Science.gov (United States)

    Luo, Guan-Zheng; Yang, Wei; Ma, Ying-Ke; Wang, Xiu-Jie

    2014-02-01

    Integrative Short Reads NAvigator (ISRNA) is an online toolkit for analyzing high-throughput small RNA sequencing data. Besides the high-speed genome mapping function, ISRNA provides statistics for genomic location, length distribution and nucleotide composition bias analysis of sequence reads. Number of reads mapped to known microRNAs and other classes of short non-coding RNAs, coverage of short reads on genes, expression abundance of sequence reads as well as some other analysis functions are also supported. The versatile search functions enable users to select sequence reads according to their sub-sequences, expression abundance, genomic location, relationship to genes, etc. A specialized genome browser is integrated to visualize the genomic distribution of short reads. ISRNA also supports management and comparison among multiple datasets. ISRNA is implemented in Java/C++/Perl/MySQL and can be freely accessed at http://omicslab.genetics.ac.cn/ISRNA/.

  6. Evolutionary modes of emergence of short interspersed nuclear element (SINE) families in grasses.

    Science.gov (United States)

    Kögler, Anja; Schmidt, Thomas; Wenke, Torsten

    2017-11-01

    Short interspersed nuclear elements (SINEs) are non-autonomous transposable elements which are propagated by retrotransposition and constitute an inherent part of the genome of most eukaryotic species. Knowledge of heterogeneous and highly abundant SINEs is crucial for de novo (or improvement of) annotation of whole genome sequences. We scanned Poaceae genome sequences of six important cereals (Oryza sativa, Triticum aestivum, Hordeum vulgare, Panicum virgatum, Sorghum bicolor, Zea mays) and Brachypodium distachyon to examine the diversity and evolution of SINE populations. We comparatively analyzed the structural features, distribution, evolutionary relation and abundance of 32 SINE families and subfamilies within grasses, comprising 11 052 individual copies. The investigation of activity profiles within the Poaceae provides insights into their species-specific diversification and amplification. We found that Poaceae SINEs (PoaS) fall into two length categories: simple SINEs of up to 180 bp and dimeric SINEs larger than 240 bp. Detailed analysis at the nucleotide level revealed that multimerization of related and unrelated SINE copies is an important evolutionary mechanism of SINE formation. We conclude that PoaS families diversify by massive reshuffling between SINE families, likely caused by insertion of truncated copies, and provide a model for this evolutionary scenario. Twenty-eight of 32 PoaS families and subfamilies show significant conservation, in particular either in the 5' or 3' regions, across Poaceae species and share large sequence stretches with one or more other PoaS families. © 2017 The Authors The Plant Journal © 2017 John Wiley & Sons Ltd.

  7. Cell type-specific termination of transcription by transposable element sequences.

    Science.gov (United States)

    Conley, Andrew B; Jordan, I King

    2012-09-30

    Transposable elements (TEs) encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS) genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3' UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are evolutionarily young. The extent of transcription

  8. 30 CFR 75.601-3 - Short circuit protection; dual element fuses; current ratings; maximum values.

    Science.gov (United States)

    2010-07-01

    ... 30 Mineral Resources 1 2010-07-01 2010-07-01 false Short circuit protection; dual element fuses... Trailing Cables § 75.601-3 Short circuit protection; dual element fuses; current ratings; maximum values. Dual element fuses having adequate current-interrupting capacity shall meet the requirements for short...

  9. Salt-bridging effects on short amphiphilic helical structure and introducing sequence-based short beta-turn motifs.

    Science.gov (United States)

    Guarracino, Danielle A; Gentile, Kayla; Grossman, Alec; Li, Evan; Refai, Nader; Mohnot, Joy; King, Daniel

    2018-02-01

    Determining the minimal sequence necessary to induce protein folding is beneficial in understanding the role of protein-protein interactions in biological systems, as their three-dimensional structures often dictate their activity. Proteins are generally comprised of discrete secondary structures, from α-helices to β-turns and larger β-sheets, each of which is influenced by its primary structure. Manipulating the sequence of short, moderately helical peptides can help elucidate the influences on folding. We created two new scaffolds based on a modestly helical eight-residue peptide, PT3, we previously published. Using circular dichroism (CD) spectroscopy and changing the possible salt-bridging residues to new combinations of Lys, Arg, Glu, and Asp, we found that our most helical improvements came from the Arg-Glu combination, whereas the Lys-Asp was not significantly different from the Lys-Glu of the parent scaffold, PT3. The marked 3 10 -helical contributions in PT3 were lessened in the Arg-Glu-containing peptide with the beginning of cooperative unfolding seen through a thermal denaturation. However, a unique and unexpected signature was seen for the denaturation of the Lys-Asp peptide which could help elucidate the stages of folding between the 3 10 and α-helix. In addition, we developed a short six-residue peptide with β-turn/sheet CD signature, again to help study minimal sequences needed for folding. Overall, the results indicate that improvements made to short peptide scaffolds by fine-tuning the salt-bridging residues can enhance scaffold structure. Likewise, with the results from the new, short β-turn motif, these can help impact future peptidomimetic designs in creating biologically useful, short, structured β-sheet-forming peptides.

  10. Identification and insertion polymorphisms of short interspersed nuclear elements (SINEs) in Brassica genomes

    International Nuclear Information System (INIS)

    Nouroz, F.; Naveed, M.

    2018-01-01

    The non-LTR retrotransposons (retroposons) are abundant in plant genomes including members of Brassicaceae. Of the retroposons, long interspersed nuclear elements (LINEs) are more copious followed by short interspersed nuclear elements (SINEs) in sequenced eukaryotic genomes. The SINEs are short elements and ranged from 100-500 bps flanked by variable sized target site duplications, 5' tRNA region with polymerase III promoter, internal tRNA unrelated region, 3' LINEs derived region and a poly adenosine tail. Different computational approaches were used for the identification and characterization of SINEs, while PCR was used to detect the SINEs insertion polymorphisms in various Brassica genotypes. Ten previously unidentified families of SINEs were identified and characterized from Brassica genomes. The structural features of these SINEs were studied in detail, which showed typical SINE features displaying small sizes, target site duplications, head regions, internal regions (body) of variable sizes and a poly (A) tail at the 3' terminus. The elements from various families ranged from 206-558 bp, where BoSINE2 family displayed smallest SINE element (206 bp), while larger members belonged to BoSINE9 family (524-558 bp). The distribution and abundance of SINEs in various Brassica species and genotypes (40) at a particular site/locus were investigated by SINEs based PCR markers. Various SINE insertion polymorphisms were detected from different genotypes, where higher PCR bands amplified the SINE insertions, while lower bands amplified the pre-insertion sites (flanking regions). The analysis of Brassica SINEs copy numbers from 10 identified families revealed that around 860 and 1712 copies of SINEs were calculated from B. rapa and B. oleracea Whole-genome shotgun contigs (WGS) respectively. Analysis of insertion sites of Brassica SINEs revealed that the members from all 10 SINE families had shown an insertion preference in AT rich regions. The present

  11. Cell type-specific termination of transcription by transposable element sequences

    Directory of Open Access Journals (Sweden)

    Conley Andrew B

    2012-09-01

    Full Text Available Abstract Background Transposable elements (TEs encode sequences necessary for their own transposition, including signals required for the termination of transcription. TE sequences within the introns of human genes show an antisense orientation bias, which has been proposed to reflect selection against TE sequences in the sense orientation owing to their ability to terminate the transcription of host gene transcripts. While there is evidence in support of this model for some elements, the extent to which TE sequences actually terminate transcription of human gene across the genome remains an open question. Results Using high-throughput sequencing data, we have characterized over 9,000 distinct TE-derived sequences that provide transcription termination sites for 5,747 human genes across eight different cell types. Rarefaction curve analysis suggests that there may be twice as many TE-derived termination sites (TE-TTS genome-wide among all human cell types. The local chromatin environment for these TE-TTS is similar to that seen for 3′ UTR canonical TTS and distinct from the chromatin environment of other intragenic TE sequences. However, those TE-TTS located within the introns of human genes were found to be far more cell type-specific than the canonical TTS. TE-TTS were much more likely to be found in the sense orientation than other intragenic TE sequences of the same TE family and TE-TTS in the sense orientation terminate transcription more efficiently than those found in the antisense orientation. Alu sequences were found to provide a large number of relatively weak TTS, whereas LTR elements provided a smaller number of much stronger TTS. Conclusions TE sequences provide numerous termination sites to human genes, and TE-derived TTS are particularly cell type-specific. Thus, TE sequences provide a powerful mechanism for the diversification of transcriptional profiles between cell types and among evolutionary lineages, since most TE-TTS are

  12. BoS: a large and diverse family of short interspersed elements (SINEs) in Brassica oleracea.

    Science.gov (United States)

    Zhang, Xiaoyu; Wessler, Susan R

    2005-05-01

    Short interspersed elements (SINEs) are nonautonomous non-LTR retrotransposons that populate eukaryotic genomes. Numerous SINE families have been identified in animals, whereas only a few have been described in plants. Here we describe a new family of SINEs, named BoS, that is widespread in Brassicaceae and present at approximately 2000 copies in Brassica oleracea. In addition to sharing a modular structure and target site preference with previously described SINEs, BoS elements have several unusual features. First, the head regions of BoS RNAs can adopt a distinct hairpin-like secondary structure. Second, with 15 distinct subfamilies, BoS represents one of the most diverse SINE families described to date. Third, several of the subfamilies have a mosaic structure that has arisen through the exchange of sequences between existing subfamilies, possibly during retrotransposition. Analysis of BoS subfamilies indicate that they were active during various time periods through the evolution of Brassicaceae and that active elements may still reside in some Brassica species. As such, BoS elements may be a valuable tool as phylogenetic makers for resolving outstanding issues in the evolution of species in the Brassicaceae family.

  13. Comparative genome sequencing of drosophila pseudoobscura: Chromosomal, gene and cis-element evolution

    Energy Technology Data Exchange (ETDEWEB)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.; Hradecky, Pavel; Letovsky, Stan; Nielsen, Rasmus; Thornton, Kevin; Todd, Melissa J.; Chen, Rui; Meisel, Richard P.; Couronne, Olivier; Hua, Sujun; Smith, Mark A.; Bussemaker, Harmen J.; van Batenburg, Marinus F.; Howells, Sally L.; Scherer, Steven E.; Sodergren, Erica; Matthews, Beverly B.; Crosby, Madeline A.; Schroeder, Andrew J.; Ortiz-Barrientos, Daniel; Rives, Catherine M.; Metzker, Michael L.; Muzny, Donna M.; Scott, Graham; Steffen, David; Wheeler, David A.; Worley, Kim C.; Havlak, Paul; Durbin, K. James; Egan, Amy; Gill, Rachel; Hume, Jennifer; Morgan, Margaret B.; Miner, George; Hamilton, Cerissa; Huang, Yanmei; Waldron, Lenee; Verduzco, Daniel; Blankenburg, Kerstin P.; Dubchak, Inna; Noor, Mohamed A.F.; Anderson, Wyatt; White, Kevin P.; Clark, Andrew G.; Schaeffer, Stephen W.; Gelbart, William; Weinstock, George M.; Gibbs, Richard A.

    2004-04-01

    The genome sequence of a second fruit fly, D. pseudoobscura, presents an opportunity for comparative analysis of a primary model organism D. melanogaster. The vast majority of Drosophila genes have remained on the same arm, but within each arm gene order has been extensively reshuffled leading to the identification of approximately 1300 syntenic blocks. A repetitive sequence is found in the D. pseudoobscura genome at many junctions between adjacent syntenic blocks. Analysis of this novel repetitive element family suggests that recombination between offset elements may have given rise to many paracentric inversions, thereby contributing to the shuffling of gene order in the D. pseudoobscura lineage. Based on sequence similarity and synteny, 10,516 putative orthologs have been identified as a core gene set conserved over 35 My since divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome wide average consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than control sequences between the species but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a picture of repeat mediated chromosomal rearrangement, and high co-adaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence between these species of Drosophila.

  14. SeqEntropy: genome-wide assessment of repeats for short read sequencing.

    Directory of Open Access Journals (Sweden)

    Hsueh-Ting Chu

    Full Text Available BACKGROUND: Recent studies on genome assembly from short-read sequencing data reported the limitation of this technology to reconstruct the entire genome even at very high depth coverage. We investigated the limitation from the perspective of information theory to evaluate the effect of repeats on short-read genome assembly using idealized (error-free reads at different lengths. METHODOLOGY/PRINCIPAL FINDINGS: We define a metric H(k to be the entropy of sequencing reads at a read length k and use the relative loss of entropy ΔH(k to measure the impact of repeats for the reconstruction of whole-genome from sequences of length k. In our experiments, we found that entropy loss correlates well with de-novo assembly coverage of a genome, and a score of ΔH(k>1% indicates a severe loss in genome reconstruction fidelity. The minimal read lengths to achieve ΔH(k<1% are different for various organisms and are independent of the genome size. For example, in order to meet the threshold of ΔH(k<1%, a read length of 60 bp is needed for the sequencing of human genome (3.2 10(9 bp and 320 bp for the sequencing of fruit fly (1.8×10(8 bp. We also calculated the ΔH(k scores for 2725 prokaryotic chromosomes and plasmids at several read lengths. Our results indicate that the levels of repeats in different genomes are diverse and the entropy of sequencing reads provides a measurement for the repeat structures. CONCLUSIONS/SIGNIFICANCE: The proposed entropy-based measurement, which can be calculated in seconds to minutes in most cases, provides a rapid quantitative evaluation on the limitation of idealized short-read genome sequencing. Moreover, the calculation can be parallelized to scale up to large euakryotic genomes. This approach may be useful to tune the sequencing parameters to achieve better genome assemblies when a closely related genome is already available.

  15. Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

    Science.gov (United States)

    Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

    2015-05-01

    Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.

  16. Targeted identification of short interspersed nuclear element families shows their widespread existence and extreme heterogeneity in plant genomes.

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-09-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5' truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3' end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families.

  17. Combined evidence annotation of transposable elements in genome sequences.

    Directory of Open Access Journals (Sweden)

    Hadi Quesneville

    2005-07-01

    Full Text Available Transposable elements (TEs are mobile, repetitive sequences that make up significant fractions of metazoan genomes. Despite their near ubiquity and importance in genome and chromosome biology, most efforts to annotate TEs in genome sequences rely on the results of a single computational program, RepeatMasker. In contrast, recent advances in gene annotation indicate that high-quality gene models can be produced from combining multiple independent sources of computational evidence. To elevate the quality of TE annotations to a level comparable to that of gene models, we have developed a combined evidence-model TE annotation pipeline, analogous to systems used for gene annotation, by integrating results from multiple homology-based and de novo TE identification methods. As proof of principle, we have annotated "TE models" in Drosophila melanogaster Release 4 genomic sequences using the combined computational evidence derived from RepeatMasker, BLASTER, TBLASTX, all-by-all BLASTN, RECON, TE-HMM and the previous Release 3.1 annotation. Our system is designed for use with the Apollo genome annotation tool, allowing automatic results to be curated manually to produce reliable annotations. The euchromatic TE fraction of D. melanogaster is now estimated at 5.3% (cf. 3.86% in Release 3.1, and we found a substantially higher number of TEs (n = 6,013 than previously identified (n = 1,572. Most of the new TEs derive from small fragments of a few hundred nucleotides long and highly abundant families not previously annotated (e.g., INE-1. We also estimated that 518 TE copies (8.6% are inserted into at least one other TE, forming a nest of elements. The pipeline allows rapid and thorough annotation of even the most complex TE models, including highly deleted and/or nested elements such as those often found in heterochromatic sequences. Our pipeline can be easily adapted to other genome sequences, such as those of the D. melanogaster heterochromatin or other

  18. A short summary on finite element modelling of fatigue crack closure

    Energy Technology Data Exchange (ETDEWEB)

    Singh, Konjengbam Darunkumar [Indian Institute of Technology, Guwahati (India); Parry, Matthew Roger [Airbus Operations Ltd, Bristol(United Kingdom); Sinclair, Ian [University of Southampton, Southampton (United Kingdom)

    2011-12-15

    This paper presents a short summary pertaining to the finite element modelling of fatigue crack closure. Several key issues related to finite element modelling of fatigue crack closure are highlighted: element type, mesh refinement, stabilization of crack closure, crack-tip node release scheme, constitutive model, specimen geometry, stress-states (i.e., plane stress, plane strain), crack closure monitoring. Reviews are presented for both straight and deflected cracks.

  19. Distribution, Diversity, and Long-Term Retention of Grass Short Interspersed Nuclear Elements (SINEs).

    Science.gov (United States)

    Mao, Hongliang; Wang, Hao

    2017-08-01

    Instances of highly conserved plant short interspersed nuclear element (SINE) families and their enrichment near genes have been well documented, but little is known about the general patterns of such conservation and enrichment and underlying mechanisms. Here, we perform a comprehensive investigation of the structure, distribution, and evolution of SINEs in the grass family by analyzing 14 grass and 5 other flowering plant genomes using comparative genomics methods. We identify 61 SINE families composed of 29,572 copies, in which 46 families are first described. We find that comparing with other grass TEs, grass SINEs show much higher level of conservation in terms of genomic retention: The origin of at least 26% families can be traced to early grass diversification and these families are among most abundant SINE families in 86% species. We find that these families show much higher level of enrichment near protein coding genes than families of relatively recent origin (51%:28%), and that 40% of all grass SINEs are near gene and the percentage is higher than other types of grass TEs. The pattern of enrichment suggests that differential removal of SINE copies in gene-poor regions plays an important role in shaping the genomic distribution of these elements. We also identify a sequence motif located at 3' SINE end which is shared in 17 families. In short, this study provides insights into structure and evolution of SINEs in the grass family. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Sauria SINEs: Novel short interspersed retroposable elements that are widespread in reptile genomes.

    Science.gov (United States)

    Piskurek, Oliver; Austin, Christopher C; Okada, Norihiro

    2006-05-01

    SINEs are short interspersed retrotransposable elements that invade new genomic sites. Their retrotransposition depends on reverse transcriptase and endonuclease activities encoded by partner LINEs (long interspersed elements). Recent genomic research has demonstrated that retroposons account for at least 40% of the human genome. Hitherto, more than 30 families of SINEs have been characterized in mammalian genomes, comprising approximately 4600 extant species; the distribution and extent of SINEs in reptilian genomes, however, are poorly documented. With more than 7400 species of lizards and snakes, Squamata constitutes the largest and most diverse group of living reptiles. We have discovered and characterized a novel SINE family, Sauria SINEs, whose members are widely distributed among genomes of lizards, snakes, and tuataras. Sauria SINEs comprise a 5' tRNA-related region, a tRNA-unrelated region, and a 3' tail region (containing short tandem repeats) derived from LINEs. We distinguished eight Sauria SINE subfamilies in genomes of four major squamate lineages and investigated their evolutionary relationships. Our data illustrate the overall efficacy of Sauria SINEs as novel retrotransposable markers for elucidation of squamate evolutionary history. We show that all Sauria SINEs share an identical 3' sequence with Bov-B LINEs and propose that they utilize the enzymatic machinery of Bov-B LINEs for their own retrotransposition. This finding, along with the ubiquity of Bov-B LINEs previously demonstrated in squamate genomes, suggests that these LINEs have been an active partner of Sauria SINEs since this SINE family was generated more than 200 million years ago.

  1. TMS Over the Cerebellum Interferes with Short-term Memory of Visual Sequences.

    Science.gov (United States)

    Ferrari, C; Cattaneo, Z; Oldrati, V; Casiraghi, L; Castelli, F; D'Angelo, E; Vecchi, T

    2018-04-30

    Growing evidence suggests that the cerebellum is not only involved in motor functions, but it significantly contributes to sensory and cognitive processing as well. In particular, it has been hypothesized that the cerebellum identifies recurrent serial events and recognizes their violations. Here we used transcranial magnetic stimulation (TMS) to shed light on the role of the cerebellum in short-term memory of visual sequences. In two experiments, we found that TMS over the right cerebellar hemisphere impaired participants' ability to recognize the correct order of appearance of geometrical stimuli varying in shape and/or size. In turn, cerebellar TMS did not affect recognition of highly familiar short sequences of letters or numbers. Overall, our data suggest that the cerebellum is involved in memorizing the order in which (concatenated) stimuli appear, this process being important for sequence learning.

  2. Studying a free fall experiment using short sequences of images

    International Nuclear Information System (INIS)

    Vera, Francisco; Romanque, Cristian

    2008-01-01

    We discuss a new alternative for obtaining position and time coordinates from a video of a free fall experiment. In our approach, after converting the video to a short sequence of images, the images are analyzed using a web page application developed by the author. The main advantage of the setup explained in this work, is that it is simple to use, no software license fees are necessary, and can be scaled-up to be used by a big number of students in introductory physics courses. The steps involved in the full analysis of a falling object are: we grab a short digital video of the experiment and convert it to a sequence of images, then, using a web page that includes all the necessary javascript, the student can easily click on the object of interest to obtain the (x,y,t) coordinates, finally, the student analyze motion using a spreadsheet.

  3. The application of the high throughput sequencing technology in the transposable elements.

    Science.gov (United States)

    Liu, Zhen; Xu, Jian-hong

    2015-09-01

    High throughput sequencing technology has dramatically improved the efficiency of DNA sequencing, and decreased the costs to a great extent. Meanwhile, this technology usually has advantages of better specificity, higher sensitivity and accuracy. Therefore, it has been applied to the research on genetic variations, transcriptomics and epigenomics. Recently, this technology has been widely employed in the studies of transposable elements and has achieved fruitful results. In this review, we summarize the application of high throughput sequencing technology in the fields of transposable elements, including the estimation of transposon content, preference of target sites and distribution, insertion polymorphism and population frequency, identification of rare copies, transposon horizontal transfers as well as transposon tagging. We also briefly introduce the major common sequencing strategies and algorithms, their advantages and disadvantages, and the corresponding solutions. Finally, we envision the developing trends of high throughput sequencing technology, especially the third generation sequencing technology, and its application in transposon studies in the future, hopefully providing a comprehensive understanding and reference for related scientific researchers.

  4. Characterizing novel endogenous retroviruses from genetic variation inferred from short sequence reads

    DEFF Research Database (Denmark)

    Mourier, Tobias; Mollerup, Sarah; Vinner, Lasse

    2015-01-01

    From Illumina sequencing of DNA from brain and liver tissue from the lion, Panthera leo, and tumor samples from the pike-perch, Sander lucioperca, we obtained two assembled sequence contigs with similarity to known retroviruses. Phylogenetic analyses suggest that the pike-perch retrovirus belongs...... to the epsilonretroviruses, and the lion retrovirus to the gammaretroviruses. To determine if these novel retroviral sequences originate from an endogenous retrovirus or from a recently integrated exogenous retrovirus, we assessed the genetic diversity of the parental sequences from which the short Illumina reads...

  5. Targeted Identification of Short Interspersed Nuclear Element Families Shows Their Widespread Existence and Extreme Heterogeneity in Plant Genomes[W

    Science.gov (United States)

    Wenke, Torsten; Döbel, Thomas; Sörensen, Thomas Rosleff; Junghans, Holger; Weisshaar, Bernd; Schmidt, Thomas

    2011-01-01

    Short interspersed nuclear elements (SINEs) are non-long terminal repeat retrotransposons that are highly abundant, heterogeneous, and mostly not annotated in eukaryotic genomes. We developed a tool designated SINE-Finder for the targeted discovery of tRNA-derived SINEs. We analyzed sequence data of 16 plant genomes, including 13 angiosperms and three gymnosperms and identified 17,829 full-length and truncated SINEs falling into 31 families showing the widespread occurrence of SINEs in higher plants. The investigation focused on potato (Solanum tuberosum), resulting in the detection of seven different SolS SINE families consisting of 1489 full-length and 870 5′ truncated copies. Consensus sequences of full-length members range in size from 106 to 244 bp depending on the SINE family. SolS SINEs populated related species and evolved separately, which led to some distinct subfamilies. Solanaceae SINEs are dispersed along chromosomes and distributed without clustering but with preferred integration into short A-rich motifs. They emerged more than 23 million years ago and were species specifically amplified during the radiation of potato, tomato (Solanum lycopersicum), and tobacco (Nicotiana tabacum). We show that tobacco TS retrotransposons are composite SINEs consisting of the 3′ end of a long interspersed nuclear element integrated downstream of a nonhomologous SINE family followed by successfully colonization of the genome. We propose an evolutionary scenario for the formation of TS as a spontaneous event, which could be typical for the emergence of SINE families. PMID:21908723

  6. SRComp: short read sequence compression using burstsort and Elias omega coding.

    Directory of Open Access Journals (Sweden)

    Jeremy John Selva

    Full Text Available Next-generation sequencing (NGS technologies permit the rapid production of vast amounts of data at low cost. Economical data storage and transmission hence becomes an increasingly important challenge for NGS experiments. In this paper, we introduce a new non-reference based read sequence compression tool called SRComp. It works by first employing a fast string-sorting algorithm called burstsort to sort read sequences in lexicographical order and then Elias omega-based integer coding to encode the sorted read sequences. SRComp has been benchmarked on four large NGS datasets, where experimental results show that it can run 5-35 times faster than current state-of-the-art read sequence compression tools such as BEETL and SCALCE, while retaining comparable compression efficiency for large collections of short read sequences. SRComp is a read sequence compression tool that is particularly valuable in certain applications where compression time is of major concern.

  7. Optimization of short amino acid sequences classifier

    Science.gov (United States)

    Barcz, Aleksy; Szymański, Zbigniew

    This article describes processing methods used for short amino acid sequences classification. The data processed are 9-symbols string representations of amino acid sequences, divided into 49 data sets - each one containing samples labeled as reacting or not with given enzyme. The goal of the classification is to determine for a single enzyme, whether an amino acid sequence would react with it or not. Each data set is processed separately. Feature selection is performed to reduce the number of dimensions for each data set. The method used for feature selection consists of two phases. During the first phase, significant positions are selected using Classification and Regression Trees. Afterwards, symbols appearing at the selected positions are substituted with numeric values of amino acid properties taken from the AAindex database. In the second phase the new set of features is reduced using a correlation-based ranking formula and Gram-Schmidt orthogonalization. Finally, the preprocessed data is used for training LS-SVM classifiers. SPDE, an evolutionary algorithm, is used to obtain optimal hyperparameters for the LS-SVM classifier, such as error penalty parameter C and kernel-specific hyperparameters. A simple score penalty is used to adapt the SPDE algorithm to the task of selecting classifiers with best performance measures values.

  8. Short- and long-term evolutionary dynamics of bacterial insertion sequences: insights from Wolbachia endosymbionts.

    Science.gov (United States)

    Cerveau, Nicolas; Leclercq, Sébastien; Leroy, Elodie; Bouchon, Didier; Cordaux, Richard

    2011-01-01

    Transposable elements (TE) are one of the major driving forces of genome evolution, raising the question of the long-term dynamics underlying their evolutionary success. Long-term TE evolution can readily be reconstructed in eukaryotes, thanks to many degraded copies constituting genomic fossil records of past TE proliferations. By contrast, bacterial genomes usually experience high sequence turnover and short TE retention times, thereby obscuring ancient TE evolutionary patterns. We found that Wolbachia bacterial genomes contain 52-171 insertion sequence (IS) TEs. IS account for 11% of Wolbachia wRi, which is one of the highest IS genomic coverage reported in prokaryotes to date. We show that many IS groups are currently expanding in various Wolbachia genomes and that IS horizontal transfers are frequent among strains, which can explain the apparent synchronicity of these IS proliferations. Remarkably, >70% of Wolbachia IS are nonfunctional. They constitute an unusual bacterial IS genomic fossil record providing direct empirical evidence for a long-term IS evolutionary dynamics following successive periods of intense transpositional activity. Our results show that comprehensive IS annotations have the potential to provide new insights into prokaryote TE evolution and, more generally, prokaryote genome evolution. Indeed, the identification of an important IS genomic fossil record in Wolbachia demonstrates that IS elements are not always of recent origin, contrary to the conventional view of TE evolution in prokaryote genomes. Our results also raise the question whether the abundance of IS fossils is specific to Wolbachia or it may be a general, albeit overlooked, feature of prokaryote genomes.

  9. Short interspersed nuclear elements (SINEs) are abundant in Solanaceae and have a family-specific impact on gene structure and genome organization.

    Science.gov (United States)

    Seibt, Kathrin M; Wenke, Torsten; Muders, Katja; Truberg, Bernd; Schmidt, Thomas

    2016-05-01

    Short interspersed nuclear elements (SINEs) are highly abundant non-autonomous retrotransposons that are widespread in plants. They are short in size, non-coding, show high sequence diversity, and are therefore mostly not or not correctly annotated in plant genome sequences. Hence, comparative studies on genomic SINE populations are rare. To explore the structural organization and impact of SINEs, we comparatively investigated the genome sequences of the Solanaceae species potato (Solanum tuberosum), tomato (Solanum lycopersicum), wild tomato (Solanum pennellii), and two pepper cultivars (Capsicum annuum). Based on 8.5 Gbp sequence data, we annotated 82 983 SINE copies belonging to 10 families and subfamilies on a base pair level. Solanaceae SINEs are dispersed over all chromosomes with enrichments in distal regions. Depending on the genome assemblies and gene predictions, 30% of all SINE copies are associated with genes, particularly frequent in introns and untranslated regions (UTRs). The close association with genes is family specific. More than 10% of all genes annotated in the Solanaceae species investigated contain at least one SINE insertion, and we found genes harbouring up to 16 SINE copies. We demonstrate the involvement of SINEs in gene and genome evolution including the donation of splice sites, start and stop codons and exons to genes, enlargement of introns and UTRs, generation of tandem-like duplications and transduction of adjacent sequence regions. © 2016 The Authors The Plant Journal © 2016 John Wiley & Sons Ltd.

  10. Mapping autonomously replicating sequence elements in a 73-kb ...

    Indian Academy of Sciences (India)

    Autonomously replicating sequence (ARS) elements are the genetic determinants of replication origin function in yeasts. They can be easily identified as the plasmids containing them transform yeast cells at a high frequency. As the first step towards identifying all potential replication origins in a 73-kb region of the long arm ...

  11. Long-read sequencing and de novo assembly of a Chinese genome

    Science.gov (United States)

    Short-read sequencing has enabled the de novo assembly of several individual human genomes, but with inherent limitations in characterizing repeat elements. Here we sequence a Chinese individual HX1 by single-molecule real-time (SMRT) long-read sequencing, construct a physical map by NanoChannel arr...

  12. GREAM: A Web Server to Short-List Potentially Important Genomic Repeat Elements Based on Over-/Under-Representation in Specific Chromosomal Locations, Such as the Gene Neighborhoods, within or across 17 Mammalian Species.

    Directory of Open Access Journals (Sweden)

    Darshan Shimoga Chandrashekar

    Full Text Available Genome-wide repeat sequences, such as LINEs, SINEs and LTRs share a considerable part of the mammalian nuclear genomes. These repeat elements seem to be important for multiple functions including the regulation of transcription initiation, alternative splicing and DNA methylation. But it is not possible to study all repeats and, hence, it would help to short-list before exploring their potential functional significance via experimental studies and/or detailed in silico analyses.We developed the 'Genomic Repeat Element Analyzer for Mammals' (GREAM for analysis, screening and selection of potentially important mammalian genomic repeats. This web-server offers many novel utilities. For example, this is the only tool that can reveal a categorized list of specific types of transposons, retro-transposons and other genome-wide repetitive elements that are statistically over-/under-represented in regions around a set of genes, such as those expressed differentially in a disease condition. The output displays the position and frequency of identified elements within the specified regions. In addition, GREAM offers two other types of analyses of genomic repeat sequences: a enrichment within chromosomal region(s of interest, and b comparative distribution across the neighborhood of orthologous genes. GREAM successfully short-listed a repeat element (MER20 known to contain functional motifs. In other case studies, we could use GREAM to short-list repetitive elements in the azoospermia factor a (AZFa region of the human Y chromosome and those around the genes associated with rat liver injury. GREAM could also identify five over-represented repeats around some of the human and mouse transcription factor coding genes that had conserved expression patterns across the two species.GREAM has been developed to provide an impetus to research on the role of repetitive sequences in mammalian genomes by offering easy selection of more interesting repeats in various

  13. The role of heterologous chloroplast sequence elements in transgene integration and expression.

    Science.gov (United States)

    Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

    2010-04-01

    Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5' untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5' UTR and 3' UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5' UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5' UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation.

  14. Interference by clustered regularly interspaced short palindromic repeat (CRISPR) RNA is governed by a seed sequence

    NARCIS (Netherlands)

    Semenova, E.V.; Jore, M.M.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

    2011-01-01

    Prokaryotic clustered regularly interspaced short palindromic repeat (CRISPR)/Cas (CRISPR-associated sequences) systems provide adaptive immunity against viruses when a spacer sequence of small CRISPR RNA (crRNA) matches a protospacer sequence in the viral genome. Viruses that escape CRISPR/Cas

  15. Emergence of Sequence Type 779 Methicillin-Resistant Staphylococcus aureus Harboring a Novel Pseudo Staphylococcal Cassette Chromosome mec (SCCmec)-SCC-SCCCRISPR Composite Element in Irish Hospitals

    Science.gov (United States)

    Kinnevey, Peter M.; Shore, Anna C.; Brennan, Grainne I.; Sullivan, Derek J.; Ehricht, Ralf; Monecke, Stefan; Slickers, Peter

    2013-01-01

    Methicillin-resistant Staphylococcus aureus (MRSA) has been a major cause of nosocomial infection in Irish hospitals for 4 decades, and replacement of predominant MRSA clones has occurred several times. An MRSA isolate recovered in 2006 as part of a larger study of sporadic MRSA exhibited a rare spa (t878) and multilocus sequence (ST779) type and was nontypeable by PCR- and DNA microarray-based staphylococcal cassette chromosome mec (SCCmec) element typing. Whole-genome sequencing revealed the presence of a novel 51-kb composite island (CI) element with three distinct domains, each flanked by direct repeat and inverted repeat sequences, including (i) a pseudo SCCmec element (16.3 kb) carrying mecA with a novel mec class region, a fusidic acid resistance gene (fusC), and two copper resistance genes (copB and copC) but lacking ccr genes; (ii) an SCC element (17.5 kb) carrying a novel ccrAB4 allele; and (iii) an SCC element (17.4 kb) carrying a novel ccrC allele and a clustered regularly interspaced short palindromic repeat (CRISPR) region. The novel CI was subsequently identified by PCR in an additional 13 t878/ST779 MRSA isolates, six from bloodstream infections, recovered between 2006 and 2011 in 11 hospitals. Analysis of open reading frames (ORFs) carried by the CI showed amino acid sequence similarity of 44 to 100% to ORFs from S. aureus and coagulase-negative staphylococci (CoNS). These findings provide further evidence of genetic transfer between S. aureus and CoNS and show how this contributes to the emergence of novel SCCmec elements and MRSA strains. Ongoing surveillance of this MRSA strain is warranted and will require updating of currently used SCCmec typing methods. PMID:23147725

  16. Emergence of sequence type 779 methicillin-resistant Staphylococcus aureus harboring a novel pseudo staphylococcal cassette chromosome mec (SCCmec)-SCC-SCCCRISPR composite element in Irish hospitals.

    LENUS (Irish Health Repository)

    Kinnevey, Peter M

    2013-01-01

    Methicillin-resistant Staphylococcus aureus (MRSA) has been a major cause of nosocomial infection in Irish hospitals for 4 decades, and replacement of predominant MRSA clones has occurred several times. An MRSA isolate recovered in 2006 as part of a larger study of sporadic MRSA exhibited a rare spa (t878) and multilocus sequence (ST779) type and was nontypeable by PCR- and DNA microarray-based staphylococcal cassette chromosome mec (SCCmec) element typing. Whole-genome sequencing revealed the presence of a novel 51-kb composite island (CI) element with three distinct domains, each flanked by direct repeat and inverted repeat sequences, including (i) a pseudo SCCmec element (16.3 kb) carrying mecA with a novel mec class region, a fusidic acid resistance gene (fusC), and two copper resistance genes (copB and copC) but lacking ccr genes; (ii) an SCC element (17.5 kb) carrying a novel ccrAB4 allele; and (iii) an SCC element (17.4 kb) carrying a novel ccrC allele and a clustered regularly interspaced short palindromic repeat (CRISPR) region. The novel CI was subsequently identified by PCR in an additional 13 t878\\/ST779 MRSA isolates, six from bloodstream infections, recovered between 2006 and 2011 in 11 hospitals. Analysis of open reading frames (ORFs) carried by the CI showed amino acid sequence similarity of 44 to 100% to ORFs from S. aureus and coagulase-negative staphylococci (CoNS). These findings provide further evidence of genetic transfer between S. aureus and CoNS and show how this contributes to the emergence of novel SCCmec elements and MRSA strains. Ongoing surveillance of this MRSA strain is warranted and will require updating of currently used SCCmec typing methods.

  17. Resonant magnetoelectric response of composite cantilevers: Theory of short vs. open circuit operation and layer sequence effects

    Directory of Open Access Journals (Sweden)

    Matthias C. Krantz

    2015-11-01

    Full Text Available The magnetoelectric effect in layered composite cantilevers consisting of strain coupled layers of magnetostrictive (MS, piezoelectric (PE, and substrate materials is investigated for magnetic field excitation at bending resonance. Analytic theories are derived for the transverse magnetoelectric (ME response in short and open circuit operation for three different layer sequences and results presented and discussed for the FeCoBSi-AlN-Si and the FeCoBSi-PZT-Si composite systems. Response optimized PE-MS layer thickness ratios are found to greatly change with operation mode shifting from near equal MS and PE layer thicknesses in the open circuit mode to near vanishing PE layer thicknesses in short circuit operation for all layer sequences. In addition the substrate layer thickness is found to differently affect the open and short circuit ME response producing shifts and reversal between ME response maxima depending on layer sequence. The observed rich ME response behavior for different layer thicknesses, sequences, operating modes, and PE materials can be explained by common neutral plane effects and different elastic compliance effects in short and open circuit operation.

  18. Characterization of Campylobacter jejuni applying flaA short variable region sequencing, multilocus sequencing and Fourier transform infrared spectroscopy

    DEFF Research Database (Denmark)

    Josefsen, Mathilde Hartmann; Bonnichsen, Lise; Larsson, Jonas

    flaA short variable region sequencing and phenetic Fourier transform infrared (FTIR) spectroscopy was applied on a collection of 102 Campylobacter jejuni isolated from continuous sampling of organic, free range geese and chickens. FTIR has been shown to serve as a valuable tool in typing...

  19. Comparative genome sequencing of Drosophila pseudoobscura: Chromosomal, gene, and cis-element evolution

    DEFF Research Database (Denmark)

    Richards, Stephen; Liu, Yue; Bettencourt, Brian R.

    2005-01-01

    years (Myr) since the pseudoobscura/melanogaster divergence. Genes expressed in the testes had higher amino acid sequence divergence than the genome-wide average, consistent with the rapid evolution of sex-specific proteins. Cis-regulatory sequences are more conserved than random and nearby sequences......We have sequenced the genome of a second Drosophila species, Drosophila pseudoobscura, and compared this to the genome sequence of Drosophila melanogaster, a primary model organism. Throughout evolution the vast majority of Drosophila genes have remained on the same chromosome arm, but within each...... between the species-but the difference is slight, suggesting that the evolution of cis-regulatory elements is flexible. Overall, a pattern of repeat-mediated chromosomal rearrangement, and high coadaptation of both male genes and cis-regulatory sequences emerges as important themes of genome divergence...

  20. MATAM: reconstruction of phylogenetic marker genes from short sequencing reads in metagenomes.

    Science.gov (United States)

    Pericard, Pierre; Dufresne, Yoann; Couderc, Loïc; Blanquart, Samuel; Touzet, Hélène

    2018-02-15

    Advances in the sequencing of uncultured environmental samples, dubbed metagenomics, raise a growing need for accurate taxonomic assignment. Accurate identification of organisms present within a community is essential to understanding even the most elementary ecosystems. However, current high-throughput sequencing technologies generate short reads which partially cover full-length marker genes and this poses difficult bioinformatic challenges for taxonomy identification at high resolution. We designed MATAM, a software dedicated to the fast and accurate targeted assembly of short reads sequenced from a genomic marker of interest. The method implements a stepwise process based on construction and analysis of a read overlap graph. It is applied to the assembly of 16S rRNA markers and is validated on simulated, synthetic and genuine metagenomes. We show that MATAM outperforms other available methods in terms of low error rates and recovered fractions and is suitable to provide improved assemblies for precise taxonomic assignments. https://github.com/bonsai-team/matam. pierre.pericard@gmail.com or helene.touzet@univ-lille1.fr. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  1. [Short interspersed repetitive sequences (SINEs) and their use as a phylogenetic tool].

    Science.gov (United States)

    Kramerov, D A; Vasetskiĭ, N S

    2009-01-01

    The data on one of the most common repetitive elements of eukaryotic genomes, short interspersed elements (SINEs), are reviewed. Their structure, origin, and functioning in the genome are discussed. The variation and abundance of these neutral genomic markers makes them a convenient and reliable tool for phylogenetic analysis. The main methods of such analysis are presented, and the potential and limitations of this approach are discussed using specific examples.

  2. Researches on Mathematical Relationship of Five Elements of Containing Notes and Fibonacci Sequence Modulo 5

    Directory of Open Access Journals (Sweden)

    Zhaoxue Chen

    2015-01-01

    Full Text Available Considering the five periods and six qi’s theory in TCM almost shares a common basis of stem-branch system with the five elements of containing notes, studying the principle or mathematical structure behind the five elements of containing notes can surely bring a novel view for the five periods and six qi’s researches. By analyzing typical mathematical rules included in He tu, Luo shu, and stem-branch theory in TCM as well as the Fibonacci sequence especially widely existent in the biological world, novel researches are performed on mathematical relationship between the five elements of containing notes and the Fibonacci sequence modulo 5. Enlightened by elementary Yin or Yang number grouping principle of He tu, Luo shu, the 12534 and 31542 key number series of Fibonacci sequence modulo 5 are obtained. And three new arrangements about the five elements of containing notes are then introduced, which have shown close relationship with the two obtained key subsequences of the Fibonacci sequence modulo 5. The novel discovery is quite helpful to recover the scientific secret of the five periods and six qi’s theory in TCM as well as that of whole traditional Chinese culture system, but more data is needed to elucidate the TCM theory further.

  3. Study of the fast inversion recovery pulse sequence. With reference to fast fluid attenuated inversion recovery and fast short TI inversion recovery pulse sequence

    International Nuclear Information System (INIS)

    Tsuchihashi, Toshio; Maki, Toshio; Suzuki, Takeshi

    1997-01-01

    The fast inversion recovery (fast IR) pulse sequence was evaluated. We compared the fast fluid attenuated inversion recovery (fast FLAIR) pulse sequence in which inversion time (TI) was established as equal to the water null point for the purpose of the water-suppressed T 2 -weighted image, with the fast short TI inversion recovery (fast STIR) pulse sequence in which TI was established as equal to the fat null point for purpose of fat suppression. In the fast FLAIR pulse sequence, the water null point was increased by making TR longer. In the FLAIR pulse sequence, the longitudinal magnetization contrast is determined by TI. If TI is increased, T 2 -weighted contrast improves in the same way as increasing TR for the SE pulse sequence. Therefore, images should be taken with long TR and long TI, which are longer than TR and longer than the water null point. On the other hand, the fat null point is not affected by TR in the fast STIR pulse sequence. However, effective TE was affected by variation of the null point. This increased in proportion to the increase in effective TE. Our evaluation indicated that the fast STIR pulse sequence can control the extensive signals from fat in a short time. (author)

  4. PCR and magnetic bead-mediated target capture for the isolation of short interspersed nucleotide elements in fishes.

    Science.gov (United States)

    Liu, Dong; Zhu, Guoli; Tang, Wenqiao; Yang, Jinquan; Guo, Hongyi

    2012-01-01

    Short interspersed nucleotide elements (SINEs), a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR). Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR). The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180-360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNA(Leu)-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.

  5. PCR and Magnetic Bead-Mediated Target Capture for the Isolation of Short Interspersed Nucleotide Elements in Fishes

    Directory of Open Access Journals (Sweden)

    Dong Liu

    2012-02-01

    Full Text Available Short interspersed nucleotide elements (SINEs, a type of retrotransposon, are widely distributed in various genomes with multiple copies arranged in different orientations, and cause changes to genes and genomes during evolutionary history. This can provide the basis for determining genome diversity, genetic variation and molecular phylogeny, etc. SINE DNA is transcribed into RNA by polymerase III from an internal promoter, which is composed of two conserved boxes, box A and box B. Here we present an approach to isolate novel SINEs based on these promoter elements. Box A of a SINE is obtained via PCR with only one primer identical to box B (B-PCR. Box B and its downstream sequence are acquired by PCR with one primer corresponding to box A (A-PCR. The SINE clone produced by A-PCR is selected as a template to label a probe with biotin. The full-length SINEs are isolated from the genomic pool through complex capture using the biotinylated probe bound to magnetic particles. Using this approach, a novel SINE family, Cn-SINE, from the genomes of Coilia nasus, was isolated. The members are 180–360 bp long. Sequence homology suggests that Cn-SINEs evolved from a leucine tRNA gene. This is the first report of a tRNALeu-related SINE obtained without the use of a genomic library or inverse PCR. These results provide new insights into the origin of SINEs.

  6. Intricate interactions between the bloom-forming cyanobacterium Microcystis aeruginosa and foreign genetic elements, revealed by diversified clustered regularly interspaced short palindromic repeat (CRISPR) signatures.

    Science.gov (United States)

    Kuno, Sotaro; Yoshida, Takashi; Kaneko, Takakazu; Sako, Yoshihiko

    2012-08-01

    Clustered regularly interspaced short palindromic repeats (CRISPR) confer sequence-dependent, adaptive resistance in prokaryotes against viruses and plasmids via incorporation of short sequences, called spacers, derived from foreign genetic elements. CRISPR loci are thus considered to provide records of past infections. To describe the host-parasite (i.e., cyanophages and plasmids) interactions involving the bloom-forming freshwater cyanobacterium Microcystis aeruginosa, we investigated CRISPR in four M. aeruginosa strains and in two previously sequenced genomes. The number of spacers in each locus was larger than the average among prokaryotes. All spacers were strain specific, except for a string of 11 spacers shared in two closely related strains, suggesting diversification of the loci. Using CRISPR repeat-based PCR, 24 CRISPR genotypes were identified in a natural cyanobacterial community. Among 995 unique spacers obtained, only 10 sequences showed similarity to M. aeruginosa phage Ma-LMM01. Of these, six spacers showed only silent or conservative nucleotide mutations compared to Ma-LMM01 sequences, suggesting a strategy by the cyanophage to avert CRISPR immunity dependent on nucleotide identity. These results imply that host-phage interactions can be divided into M. aeruginosa-cyanophage combinations rather than pandemics of population-wide infectious cyanophages. Spacer similarity also showed frequent exposure of M. aeruginosa to small cryptic plasmids that were observed only in a few strains. Thus, the diversification of CRISPR implies that M. aeruginosa has been challenged by diverse communities (almost entirely uncharacterized) of cyanophages and plasmids.

  7. Assessment of importance of elements for systems that condition depends on the sequence of elements failures

    International Nuclear Information System (INIS)

    Povyakalo, A.A.

    1996-01-01

    This paper proposes new general formulas for calculation of indices of elements importance for systems whose condition depends on sequence of elements failures. These systems have been called as systems with memory of failures (M-systems). Techniques existing for assessment of importance of elements are based on the Bool's models of system reliability, for which it is significant to suggest, that in every period of time system state depends only on a combination of states of elements at that very moment of time. These systems have been called as combinational systems (C-systems). Reliability of M-systems at any moment of operating time is a functional having distributions of elements time before failure as its arguments. Bool's models and methods of assessment of element importance, based on these models, are not appropriate for these systems. Pereguda and Povyakalo proposed the new techniques for assessment of elements importance for PO-SS systems that includes Protection Object (PO) and Safety System (PO). PO-SS system is an example of M-system. That technique is used at this paper as a basis for more general consideration. It has been shown that technique proposed for assessment of elements importance for M-systems has well-known Birnbaum's method as its particular case. Also the system with double protection is considered as an example

  8. Keeping it together: Semantic coherence stabilizes phonological sequences in short-term memory.

    Science.gov (United States)

    Savill, Nicola; Ellis, Rachel; Brooke, Emma; Koa, Tiffany; Ferguson, Suzie; Rojas-Rodriguez, Elena; Arnold, Dominic; Smallwood, Jonathan; Jefferies, Elizabeth

    2018-04-01

    Our ability to hold a sequence of speech sounds in mind, in the correct configuration, supports many aspects of communication, but the contribution of conceptual information to this basic phonological capacity remains controversial. Previous research has shown modest and inconsistent benefits of meaning on phonological stability in short-term memory, but these studies were based on sets of unrelated words. Using a novel design, we examined the immediate recall of sentence-like sequences with coherent meaning, alongside both standard word lists and mixed lists containing words and nonwords. We found, and replicated, substantial effects of coherent meaning on phoneme-level accuracy: The phonemes of both words and nonwords within conceptually coherent sequences were more likely to be produced together and in the correct order. Since nonwords do not exist as items in long-term memory, the semantic enhancement of phoneme-level recall for both item types cannot be explained by a lexically based item reconstruction process employed at the point of retrieval ("redintegration"). Instead, our data show, for naturalistic input, that when meaning emerges from the combination of words, the phonological traces that support language are reinforced by a semantic-binding process that has been largely overlooked by past short-term memory research.

  9. Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats

    OpenAIRE

    Gymrek, Melissa

    2016-01-01

    This was presented as a BitesizeBio Webinar entitled "Tools for analyzing genetic variants from sequencing data Case study: short tandem repeats"Accompanying scripts can be accessed on github:https://github.com/mgymrek/mgymrek-bitesizebio-webinar 

  10. Hybridization Capture Using Short PCR Products Enriches Small Genomes by Capturing Flanking Sequences (CapFlank)

    DEFF Research Database (Denmark)

    Tsangaras, Kyriakos; Wales, Nathan; Sicheritz-Pontén, Thomas

    2014-01-01

    , a non-negligible fraction of the resulting sequence reads are not homologous to the bait. We demonstrate that during capture, the bait-hybridized library molecules add additional flanking library sequences iteratively, such that baits limited to targeting relatively short regions (e.g. few hundred...... nucleotides) can result in enrichment across entire mitochondrial and bacterial genomes. Our findings suggest that some of the off-target sequences derived in capture experiments are non-randomly enriched, and that CapFlank will facilitate targeted enrichment of large contiguous sequences with minimal prior...

  11. Choice of reference sequence and assembler for alignment of Listeria monocytogenes short-read sequence data greatly influences rates of error in SNP analyses.

    Directory of Open Access Journals (Sweden)

    Arthur W Pightling

    Full Text Available The wide availability of whole-genome sequencing (WGS and an abundance of open-source software have made detection of single-nucleotide polymorphisms (SNPs in bacterial genomes an increasingly accessible and effective tool for comparative analyses. Thus, ensuring that real nucleotide differences between genomes (i.e., true SNPs are detected at high rates and that the influences of errors (such as false positive SNPs, ambiguously called sites, and gaps are mitigated is of utmost importance. The choices researchers make regarding the generation and analysis of WGS data can greatly influence the accuracy of short-read sequence alignments and, therefore, the efficacy of such experiments. We studied the effects of some of these choices, including: i depth of sequencing coverage, ii choice of reference-guided short-read sequence assembler, iii choice of reference genome, and iv whether to perform read-quality filtering and trimming, on our ability to detect true SNPs and on the frequencies of errors. We performed benchmarking experiments, during which we assembled simulated and real Listeria monocytogenes strain 08-5578 short-read sequence datasets of varying quality with four commonly used assemblers (BWA, MOSAIK, Novoalign, and SMALT, using reference genomes of varying genetic distances, and with or without read pre-processing (i.e., quality filtering and trimming. We found that assemblies of at least 50-fold coverage provided the most accurate results. In addition, MOSAIK yielded the fewest errors when reads were aligned to a nearly identical reference genome, while using SMALT to align reads against a reference sequence that is ∼0.82% distant from 08-5578 at the nucleotide level resulted in the detection of the greatest numbers of true SNPs and the fewest errors. Finally, we show that whether read pre-processing improves SNP detection depends upon the choice of reference sequence and assembler. In total, this study demonstrates that researchers

  12. Stem loop sequences specific to transposable element IS605 are found linked to lipoprotein genes in Borrelia plasmids.

    Directory of Open Access Journals (Sweden)

    Nicholas Delihas

    Full Text Available BACKGROUND: Plasmids of Borrelia species are dynamic structures that contain a large number of repetitive genes, gene fragments, and gene fusions. In addition, the transposable element IS605/200 family, as well as degenerate forms of this IS element, are prevalent. In Helicobacter pylori, flanking regions of the IS605 transposase gene contain sequences that fold into identical small stem loops. These function in transposition at the single-stranded DNA level. METHODOLOGY/PRINCIPAL FINDINGS: In work reported here, bioinformatics techniques were used to scan Borrelia plasmid genomes for IS605 transposable element specific stem loop sequences. Two variant stem loop motifs are found in the left and right flanking regions of the transposase gene. Both motifs appear to have dispersed in plasmid genomes and are found "free-standing" and phylogenetically conserved without the associated IS605 transposase gene or the adjacent flanking sequence. Importantly, IS605 specific stem loop sequences are also found at the 3' ends of lipoprotein genes (PFam12 and PFam60, however the left and right sequences appear to develop their own evolutionary patterns. The lipoprotein gene-linked left stem loop sequences maintain the IS605 stem loop motif in orthologs but only at the RNA level. These show mutations whereby variants fold into phylogenetically conserved RNA-type stem loops that contain the wobble non-Watson-Crick G-U base-pairing. The right flanking sequence is associated with the family lipoprotein-1 genes. A comparison of homologs shows that the IS605 stem loop motif rapidly dissipates, but a more elaborate secondary structure appears to develop in its place. CONCLUSIONS/SIGNIFICANCE: Stem loop sequences specific to the transposable element IS605 are present in plasmid regions devoid of a transposase gene and significantly, are found linked to lipoprotein genes in Borrelia plasmids. These sequences are evolutionarily conserved and/or structurally developed in

  13. Correction of echo shift in reconstruction processing for ultra-short TE pulse sequence

    International Nuclear Information System (INIS)

    Takizawa, Masahiro; Ootsuka, Takehiro; Abe, Takayuki; Takahashi, Tetsuhiko

    2010-01-01

    An ultra-short echo time (TE) pulse sequence is composed of a radial sampling that acquires echo signals radially in the K-space and a half-echo acquisition that acquires only half of the echo signal. The shift in the position of the echo signal (echo shift) caused by the timing errors in the gradient magnetic field pulses affects the image quality in the radial sampling with the half-echo acquisition. To improve image quality, we have developed a signal correction algorithm that detects and eliminates this echo shift during reconstruction by performing a pre-scan within 10 seconds. The results showed that image quality is improved under oblique and/or off-centering conditions that frequently cause image distortion due to hardware error. In conclusion, we have developed a robust ultra-short TE pulse sequence that allows wide latitude in the scan parameters, including oblique and off-centering conditions. (author)

  14. Short interspersed elements (SINEs) of squamate reptiles (Squam1 and Squam2): structure and phylogenetic significance.

    Science.gov (United States)

    Grechko, Vernata V; Kosushkin, Sergei A; Borodulina, Olga R; Butaeva, Fatima G; Darevsky, Ilya S

    2011-05-15

    Short interspersed elements (SINEs) are important nuclear molecular markers of the evolution of many eukaryotes. However, the SINEs of squamate reptile genomes have been little studied. We first identified two families of SINEs, termed Squam1 and Squam2, in the DNA of meadow lizard Darevskia praticola (Lacertidae) by performing DNA hybridization and PCR. Later, the same families of retrotransposons were found using the same methods in members of another 25 lizard families (from Iguania, Scincomorpha, Gekkota, Varanoidea, and Diploglossa infraorders) and two snake families, but their abundances in these taxa varied greatly. Both SINEs were Squamata-specific and were absent from mammals, birds, crocodiles, turtles, amphibians, and fish. Squam1 possessed some characteristics common to tRNA-related SINEs from fish and mammals, while Squam2 belonged to the tRNA(Ala) group of SINEs and had a more unusual and divergent structure. Squam2-related sequences were found in several unannotated GenBank sequences of squamate reptiles. Squam1 abundance in the Polychrotidae, Agamidae, Leiolepididae, Chamaeleonidae, Scincidae, Lacertidae, Gekkonidae, Varanidae, Helodermatidae, and two snake families were 10(2) -10(4) times higher than those in other taxa (Corytophanidae, Iguanidae, Anguidae, Cordylidae, Gerrhosauridae, Pygopodidae, and Eublepharidae). A less dramatic degree of copy number variation was observed for Squam2 in different taxa. Several Squam1 copies from Lacertidae, Chamaeleonidae, Gekkonidae, Varanidae, and Colubridae were sequenced and found to have evident orthologous features, as well as taxa-specific autapomorphies. Squam1 from Lacertidae and Chamaeleonidae could be divided into several subgroups based on sequence differences. Possible applications of these SINEs as Squamata phylogeny markers are discussed. Copyright © 2010 Wiley-Liss, Inc., A Wiley Company.

  15. Accurate estimation of short read mapping quality for next-generation genome sequencing

    Science.gov (United States)

    Ruffalo, Matthew; Koyutürk, Mehmet; Ray, Soumya; LaFramboise, Thomas

    2012-01-01

    Motivation: Several software tools specialize in the alignment of short next-generation sequencing reads to a reference sequence. Some of these tools report a mapping quality score for each alignment—in principle, this quality score tells researchers the likelihood that the alignment is correct. However, the reported mapping quality often correlates weakly with actual accuracy and the qualities of many mappings are underestimated, encouraging the researchers to discard correct mappings. Further, these low-quality mappings tend to correlate with variations in the genome (both single nucleotide and structural), and such mappings are important in accurately identifying genomic variants. Approach: We develop a machine learning tool, LoQuM (LOgistic regression tool for calibrating the Quality of short read mappings, to assign reliable mapping quality scores to mappings of Illumina reads returned by any alignment tool. LoQuM uses statistics on the read (base quality scores reported by the sequencer) and the alignment (number of matches, mismatches and deletions, mapping quality score returned by the alignment tool, if available, and number of mappings) as features for classification and uses simulated reads to learn a logistic regression model that relates these features to actual mapping quality. Results: We test the predictions of LoQuM on an independent dataset generated by the ART short read simulation software and observe that LoQuM can ‘resurrect’ many mappings that are assigned zero quality scores by the alignment tools and are therefore likely to be discarded by researchers. We also observe that the recalibration of mapping quality scores greatly enhances the precision of called single nucleotide polymorphisms. Availability: LoQuM is available as open source at http://compbio.case.edu/loqum/. Contact: matthew.ruffalo@case.edu. PMID:22962451

  16. Short interspersed transposable elements (SINEs) are excluded from imprinted regions in the human genome.

    Science.gov (United States)

    Greally, John M

    2002-01-08

    To test whether regions undergoing genomic imprinting have unique genomic characteristics, imprinted and nonimprinted human loci were compared for nucleotide and retroelement composition. Maternally and paternally expressed subgroups of imprinted genes were found to differ in terms of guanine and cytosine, CpG, and retroelement content, indicating a segregation into distinct genomic compartments. Imprinted regions have been normally permissive to L1 long interspersed transposable element retroposition during mammalian evolution but universally and significantly lack short interspersed transposable elements (SINEs). The primate-specific Alu SINEs, as well as the more ancient mammalian-wide interspersed repeat SINEs, are found at significantly low densities in imprinted regions. The latter paleogenomic signature indicates that the sequence characteristics of currently imprinted regions existed before the mammalian radiation. Transitions from imprinted to nonimprinted genomic regions in cis are characterized by a sharp inflection in SINE content, demonstrating that this genomic characteristic can help predict the presence and extent of regions undergoing imprinting. During primate evolution, SINE accumulation in imprinted regions occurred at a decreased rate compared with control loci. The constraint on SINE accumulation in imprinted regions may be mediated by an active selection process. This selection could be because of SINEs attracting and spreading methylation, as has been found at other loci. Methylation-induced silencing could lead to deleterious consequences at imprinted loci, where inactivation of one allele is already established, and expression is often essential for embryonic growth and survival.

  17. The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads.

    Science.gov (United States)

    Wang, Zhiwen; Hobson, Neil; Galindo, Leonardo; Zhu, Shilin; Shi, Daihu; McDill, Joshua; Yang, Linfeng; Hawkins, Simon; Neutelings, Godfrey; Datla, Raju; Lambert, Georgina; Galbraith, David W; Grassa, Christopher J; Geraldes, Armando; Cronk, Quentin C; Cullis, Christopher; Dash, Prasanta K; Kumar, Polumetla A; Cloutier, Sylvie; Sharpe, Andrew G; Wong, Gane K-S; Wang, Jun; Deyholos, Michael K

    2012-11-01

    Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp to 10 kb were sequenced using an Illumina genome analyzer. A de novo assembly, comprised exclusively of deep-coverage (approximately 94× raw, approximately 69× filtered) short-sequence reads (44-100 bp), produced a set of scaffolds with N(50) =694 kb, including contigs with N(50)=20.1 kb. The contig assembly contained 302 Mb of non-redundant sequence representing an estimated 81% genome coverage. Up to 96% of published flax ESTs aligned to the whole-genome shotgun scaffolds. However, comparisons with independently sequenced BACs and fosmids showed some mis-assembly of regions at the genome scale. A total of 43384 protein-coding genes were predicted in the whole-genome shotgun assembly, and up to 93% of published flax ESTs, and 86% of A. thaliana genes aligned to these predicted genes, indicating excellent coverage and accuracy at the gene level. Analysis of the synonymous substitution rates (K(s) ) observed within duplicate gene pairs was consistent with a recent (5-9 MYA) whole-genome duplication in flax. Within the predicted proteome, we observed enrichment of many conserved domains (Pfam-A) that may contribute to the unique properties of this crop, including agglutinin proteins. Together these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.

  18. Annotation and sequence diversity of transposable elements in common bean (Phaseolus vulgaris

    Directory of Open Access Journals (Sweden)

    Scott eJackson

    2014-07-01

    Full Text Available Common bean (Phaseolus vulgaris is an important legume crop grown and consumed worldwide. With the availability of the common bean genome sequence, the next challenge is to annotate the genome and characterize functional DNA elements. Transposable elements (TEs are the most abundant component of plant genomes and can dramatically affect genome evolution and genetic variation. Thus, it is pivotal to identify TEs in the common bean genome. In this study, we performed a genome-wide transposon annotation in common bean using a combination of homology and sequence structure-based methods. We developed a 2.12-Mb transposon database which includes 791 representative transposon sequences and is available upon request or from www.phytozome.org. Of note, nearly all transposons in the database are previously unrecognized TEs. More than 5,000 transposon-related expressed sequence tags (ESTs were detected which indicates that some transposons may be transcriptionally active. Two Ty1-copia retrotransposon families were found to encode the envelope-like protein which has rarely been identified in plant genomes. Also, we identified an extra open reading frame (ORF termed ORF2 from 15 Ty3-gypsy families that was located between the ORF encoding the retrotransposase and the 3’LTR. The ORF2 was in opposite transcriptional orientation to retrotransposase. Sequence homology searches and phylogenetic analysis suggested that the ORF2 may have an ancient origin, but its function is not clear. This transposon data provides a useful resource for understanding the genome organization and evolution and may be used to identify active TEs for developing transposon-tagging system in common bean and other related genomes.

  19. Massive contribution of transposable elements to mammalian regulatory sequences.

    Science.gov (United States)

    Rayan, Nirmala Arul; Del Rosario, Ricardo C H; Prabhakar, Shyam

    2016-09-01

    Barbara McClintock discovered the existence of transposable elements (TEs) in the late 1940s and initially proposed that they contributed to the gene regulatory program of higher organisms. This controversial idea gained acceptance only much later in the 1990s, when the first examples of TE-derived promoter sequences were uncovered. It is now known that half of the human genome is recognizably derived from TEs. It is thus important to understand the scope and nature of their contribution to gene regulation. Here, we provide a timeline of major discoveries in this area and discuss how transposons have revolutionized our understanding of mammalian genomes, with a special emphasis on the massive contribution of TEs to primate evolution. Our analysis of primate-specific functional elements supports a simple model for the rate at which new functional elements arise in unique and TE-derived DNA. Finally, we discuss some of the challenges and unresolved questions in the field, which need to be addressed in order to fully characterize the impact of TEs on gene regulation, evolution and disease processes. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. Binding among Select Episodic Elements Is Altered via Active Short-Term Retrieval

    Science.gov (United States)

    Bridge, Donna J.; Voss, Joel L.

    2015-01-01

    Of the many elements that comprise an episode, are any disproportionately bound to the others? We tested whether active short-term retrieval selectively increases binding. Individual objects from multiobject displays were retrieved after brief delays. Memory was later tested for the other objects. Cueing with actively retrieved objects facilitated…

  1. Determination of short-lived trace elements in environmental samples by neutron activation analysis

    International Nuclear Information System (INIS)

    Wardani, S.; Sihombing, E.; Hamzah, A.; Rochidi; Hery, P.S.; Hartaman, S.; Iman, J.

    1998-01-01

    Concentration of a short-lived trace elements in environmental samples were determined by neutron activation analysis, a counting loss often occur due to the high counting rate. A Pile-Up Rejecter (PUR) electric circuit was installed in counting a short-lived trace elements by a γ-ray spectrometer in order to correct a counting loss. The samples were irradiated for 30∼60 seconds at neutron flux of 3.5 x 10 12 n.cm -2 .s -1 , then the samples cooled for 120 second and counted for 180 second using this system. The nuclides concentration in the varieties environmental samples have a difference analysis result, was more accurate and precise, which the measured result would be 30 % more higher by PUR system than the result would be counted using a conventional γ-ray spectrometry method

  2. The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads

    DEFF Research Database (Denmark)

    Wang, Zhiwen; Hobson, Neil; Galindo, Leonardo

    2012-01-01

    Flax (Linum usitatissimum) is an ancient crop that is widely cultivated as a source of fiber, oil and medicinally relevant compounds. To accelerate crop improvement, we performed whole-genome shotgun sequencing of the nuclear genome of flax. Seven paired-end libraries ranging in size from 300 bp...... these results show that de novo assembly, based solely on whole-genome shotgun short-sequence reads, is an efficient means of obtaining nearly complete genome sequence information for some plant species....

  3. Location analysis for the estrogen receptor-? reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    OpenAIRE

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-? (ER?)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ER?-bound loci and quantify the incidence of ERE sequences under two stringencies of detection:

  4. The Role of Heterologous Chloroplast Sequence Elements in Transgene Integration and Expression1[W][OA

    Science.gov (United States)

    Ruhlman, Tracey; Verma, Dheeraj; Samson, Nalapalli; Daniell, Henry

    2010-01-01

    Heterologous regulatory elements and flanking sequences have been used in chloroplast transformation of several crop species, but their roles and mechanisms have not yet been investigated. Nucleotide sequence identity in the photosystem II protein D1 (psbA) upstream region is 59% across all taxa; similar variation was consistent across all genes and taxa examined. Secondary structure and predicted Gibbs free energy values of the psbA 5′ untranslated region (UTR) among different families reflected this variation. Therefore, chloroplast transformation vectors were made for tobacco (Nicotiana tabacum) and lettuce (Lactuca sativa), with endogenous (Nt-Nt, Ls-Ls) or heterologous (Nt-Ls, Ls-Nt) psbA promoter, 5′ UTR and 3′ UTR, regulating expression of the anthrax protective antigen (PA) or human proinsulin (Pins) fused with the cholera toxin B-subunit (CTB). Unique lettuce flanking sequences were completely eliminated during homologous recombination in the transplastomic tobacco genomes but not unique tobacco sequences. Nt-Ls or Ls-Nt transplastomic lines showed reduction of 80% PA and 97% CTB-Pins expression when compared with endogenous psbA regulatory elements, which accumulated up to 29.6% total soluble protein PA and 72.0% total leaf protein CTB-Pins, 2-fold higher than Rubisco. Transgene transcripts were reduced by 84% in Ls-Nt-CTB-Pins and by 72% in Nt-Ls-PA lines. Transcripts containing endogenous 5′ UTR were stabilized in nonpolysomal fractions. Stromal RNA-binding proteins were preferentially associated with endogenous psbA 5′ UTR. A rapid and reproducible regeneration system was developed for lettuce commercial cultivars by optimizing plant growth regulators. These findings underscore the need for sequencing complete crop chloroplast genomes, utilization of endogenous regulatory elements and flanking sequences, as well as optimization of plant growth regulators for efficient chloroplast transformation. PMID:20130101

  5. The mobile genetic element Alu in the human genome

    Energy Technology Data Exchange (ETDEWEB)

    Novick, G.E. [Florida International Univ., Miami, FL (United States); Batzer, M.A.; Deininger, P.L. [Louisiana State Univ. Medical Center, New Orleans, LA (United States)] [and others

    1996-01-01

    Genetic material has been traditionally envisioned as relatively static with the exception of occasional, often deleterious mutations. The sequence DNA-to-RNA-to-protein represented for many years the central dogma relating gene structure and function. Recently, the field of molecular genetics has provided revolutionary information on the dynamic role of repetitive elements in the function of the genetic material and the evolution of humans and other organisms. Alu sequences represent the largest family of short interspersed repetitive elements (SINEs) in humans, being present in an excess of 500,000 copies per haploid genome. Alu elements, as well as the other repetitive elements, were once considered to be useless. Today, the biology of Alu transposable elements is being widely examined in order to determine the molecular basis of a growing number of identified diseases and to provide new directions in genome mapping and biomedical research. 66 refs., 5 figs.

  6. Unusual loop-sequence flexibility of the proximal RNA replication element in EMCV.

    Directory of Open Access Journals (Sweden)

    Jan Zoll

    Full Text Available Picornaviruses contain stable RNA structures at the 5' and 3' ends of the RNA genome, OriL and OriR involved in viral RNA replication. The OriL RNA element found at the 5' end of the enterovirus genome folds into a cloverleaf-like configuration. In vivo SELEX experiments revealed that functioning of the poliovirus cloverleaf depends on a specific structure in this RNA element. Little is known about the OriL of cardioviruses. Here, we investigated structural aspects and requirements of the apical loop of proximal stem-loop SL-A of mengovirus, a strain of EMCV. Using NMR spectroscopy, we showed that the mengovirus SL-A apical loop consists of an octaloop. In vivo SELEX experiments demonstrated that a large number of random sequences are tolerated in the apical octaloop that support virus replication. Mutants in which the SL-A loop size and the length of the upper part of the stem were varied showed that both stem-length and stability of the octaloop are important determinants for viral RNA replication and virus reproduction. Together, these data show that stem-loop A plays an important role in virus replication. The high degree of sequence flexibility and the lack of selective pressure on the octaloop argue against a role in sequence specific RNA-protein or RNA-RNA interactions in which octaloop nucleotides are involved.

  7. Use of short tandem repeat sequences to study Mycobacterium leprae in leprosy patients in Malawi and India.

    Directory of Open Access Journals (Sweden)

    Saroj K Young

    2008-04-01

    Full Text Available Inadequate understanding of the transmission of Mycobacterium leprae makes it difficult to predict the impact of leprosy control interventions. Genotypic tests that allow tracking of individual bacterial strains would strengthen epidemiological studies and contribute to our understanding of the disease.Genotyping assays based on variation in the copy number of short tandem repeat sequences were applied to biopsies collected in population-based epidemiological studies of leprosy in northern Malawi, and from members of multi-case households in Hyderabad, India. In the Malawi series, considerable genotypic variability was observed between patients, and also within patients, when isolates were collected at different times or from different tissues. Less within-patient variability was observed when isolates were collected from similar tissues at the same time. Less genotypic variability was noted amongst the closely related Indian patients than in the Malawi series.Lineages of M. leprae undergo changes in their pattern of short tandem repeat sequences over time. Genetic divergence is particularly likely between bacilli inhabiting different (e.g., skin and nerve tissues. Such variability makes short tandem repeat sequences unsuitable as a general tool for population-based strain typing of M. leprae, or for distinguishing relapse from reinfection. Careful use of these markers may provide insights into the development of disease within individuals and for tracking of short transmission chains.

  8. Detection of short repeated genomic sequences on metaphase chromosomes using padlock probes and target primed rolling circle DNA synthesis

    Directory of Open Access Journals (Sweden)

    Stougaard Magnus

    2007-11-01

    Full Text Available Abstract Background In situ detection of short sequence elements in genomic DNA requires short probes with high molecular resolution and powerful specific signal amplification. Padlock probes can differentiate single base variations. Ligated padlock probes can be amplified in situ by rolling circle DNA synthesis and detected by fluorescence microscopy, thus enhancing PRINS type reactions, where localized DNA synthesis reports on the position of hybridization targets, to potentially reveal the binding of single oligonucleotide-size probe molecules. Such a system has been presented for the detection of mitochondrial DNA in fixed cells, whereas attempts to apply rolling circle detection to metaphase chromosomes have previously failed, according to the literature. Methods Synchronized cultured cells were fixed with methanol/acetic acid to prepare chromosome spreads in teflon-coated diagnostic well-slides. Apart from the slide format and the chromosome spreading everything was done essentially according to standard protocols. Hybridization targets were detected in situ with padlock probes, which were ligated and amplified using target primed rolling circle DNA synthesis, and detected by fluorescence labeling. Results An optimized protocol for the spreading of condensed metaphase chromosomes in teflon-coated diagnostic well-slides was developed. Applying this protocol we generated specimens for target primed rolling circle DNA synthesis of padlock probes recognizing a 40 nucleotide sequence in the male specific repetitive satellite I sequence (DYZ1 on the Y-chromosome and a 32 nucleotide sequence in the repetitive kringle IV domain in the apolipoprotein(a gene positioned on the long arm of chromosome 6. These targets were detected with good efficiency, but the efficiency on other target sites was unsatisfactory. Conclusion Our aim was to test the applicability of the method used on mitochondrial DNA to the analysis of nuclear genomes, in particular as

  9. Determination of short half-life elements in biological, foodstuff, and environmental samples qualitatively by neutron activation analysis

    International Nuclear Information System (INIS)

    Syukria Kurniawati; Muhayatun Santoso; Diah Dwiana Lestiani

    2010-01-01

    NAA applications at routine operation power of 15 MW at Multipurpose Reactor GA Siwabessy (MPR-GAS) for sample matrices analysis have been widely applied. However, the results are not optimum for some matrices especially for short half-live elements. Preliminary study of short half-life elements determination in biological, foodstuff, and environmental samples using 1 MW power have been conducted to solve this problem. The samples were irradiated in rabbit system of MPR-GAS for 5 minutes, counted for 200 seconds by HPGe detector, and the spectrum were analyzed further using software Genie 2000 and Bandung NAA Utility. Analysis under 1 MW power on biological and foodstuff samples were capable to detect eight elements: Al, Br, CI, Ca, I, K, Mg, Ti, and Na for biological samples; Al, Br, CI, Ca, I, K, Mg, Mn, and Na for foodstuff samples, while at 15 MW power only three elements (CI, K, Na) were detected. At 1 MW power the counting process is more optimum due to smaller radiation exposure and dead time. For the environmental samples, the number of elements detected by 1 MW and 15 MW powers did not differ significantly. Generally, the results on the three types of samples showed that the elements of short half-life are better detected at 1 MW than that of 15 MW power. Further research needs to be done to obtain the optimum analytical conditions for irradiation and counting time determination. (author)

  10. Fine de novo sequencing of a fungal genome using only SOLiD short read data: verification on Aspergillus oryzae RIB40.

    Directory of Open Access Journals (Sweden)

    Myco Umemura

    Full Text Available The development of next-generation sequencing (NGS technologies has dramatically increased the throughput, speed, and efficiency of genome sequencing. The short read data generated from NGS platforms, such as SOLiD and Illumina, are quite useful for mapping analysis. However, the SOLiD read data with lengths of <60 bp have been considered to be too short for de novo genome sequencing. Here, to investigate whether de novo sequencing of fungal genomes is possible using only SOLiD short read sequence data, we performed de novo assembly of the Aspergillus oryzae RIB40 genome using only SOLiD read data of 50 bp generated from mate-paired libraries with 2.8- or 1.9-kb insert sizes. The assembled scaffolds showed an N50 value of 1.6 Mb, a 22-fold increase than those obtained using only SOLiD short read in other published reports. In addition, almost 99% of the reference genome was accurately aligned by the assembled scaffold fragments in long lengths. The sequences of secondary metabolite biosynthetic genes and clusters, whose products are of considerable interest in fungal studies due to their potential medicinal, agricultural, and cosmetic properties, were also highly reconstructed in the assembled scaffolds. Based on these findings, we concluded that de novo genome sequencing using only SOLiD short reads is feasible and practical for molecular biological study of fungi. We also investigated the effect of filtering low quality data, library insert size, and k-mer size on the assembly performance, and recommend for the assembly use of mild filtered read data where the N50 was not so degraded and the library has an insert size of ∼2.0 kb, and k-mer size 33.

  11. Survey of transposable elements in sugarcane expressed sequence tags (ESTs

    Directory of Open Access Journals (Sweden)

    Rossi Magdalena

    2001-01-01

    Full Text Available The sugarcane expressed sequence tag (SUCEST project has produced a large number of cDNA sequences from several plant tissues submitted or not to different conditions of stress. In this paper we report the result of a search for transposable elements (TEs revealing a surprising amount of expressed TEs homologues. Of the 260,781 sequences grouped in 81,223 fragment assembly program (Phrap clusters, a total of 276 clones showed homology to previously reported TEs using a stringent cut-off value of e-50 or better. Homologous clones to Copia/Ty1 and Gypsy/Ty3 groups of long terminal repeat (LTR retrotransposons were found but no non-LTR retroelements were identified. All major transposon families were represented in sugarcane including Activator (Ac, Mutator (MuDR, Suppressor-mutator (En/Spm and Mariner. In order to compare the TE diversity in grasses genomes, we carried out a search for TEs described in sugarcane related species O.sativa, Z. mays and S. bicolor. We also present preliminary results showing the potential use of TEs insertion pattern polymorphism as molecular markers for cultivar identification.

  12. Systematic Dissection of Sequence Elements Controlling σ70 Promoters Using a Genomically-Encoded Multiplexed Reporter Assay in E. coli.

    Science.gov (United States)

    Urtecho, Guillaume; Tripp, Arielle D; Insigne, Kimberly; Kim, Hwangbeom; Kosuri, Sriram

    2018-02-01

    Promoters are the key drivers of gene expression and are largely responsible for the regulation of cellular responses to time and environment. In E. coli , decades of studies have revealed most, if not all, of the sequence elements necessary to encode promoter function. Despite our knowledge of these motifs, it is still not possible to predict the strength and regulation of a promoter from primary sequence alone. Here we develop a novel multiplexed assay to study promoter function in E. coli by building a site-specific genomic recombination-mediated cassette exchange (RMCE) system that allows for the facile construction and testing of large libraries of genetic designs integrated into precise genomic locations. We build and test a library of 10,898 σ70 promoter variants consisting of all combinations of a set of eight -35 elements, eight -10 elements, three UP elements, eight spacers, and eight backgrounds. We find that the -35 and -10 sequence elements can explain approximately 74% of the variance in promoter strength within our dataset using a simple log-linear statistical model. Neural network models can explain greater than 95% of the variance in our dataset, and show the increased power is due to nonlinear interactions of other elements such as the spacer, background, and UP elements.

  13. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  14. Genome-wide analysis of short interspersed nuclear elements SINES revealed high sequence conservation, gene association and retrotranspositional activity in wheat.

    Science.gov (United States)

    Ben-David, Smadar; Yaakov, Beery; Kashkush, Khalil

    2013-10-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-LTR retroelements that are present in most eukaryotic species. While SINEs have been intensively investigated in humans and other animal systems, they are poorly studied in plants, especially in wheat (Triticum aestivum). We used quantitative PCR of various wheat species to determine the copy number of a wheat SINE family, termed Au SINE, combined with computer-assisted analyses of the publicly available 454 pyrosequencing database of T. aestivum. In addition, we utilized site-specific PCR on 57 Au SINE insertions, transposon methylation display and transposon display on newly formed wheat polyploids to assess retrotranspositional activity, epigenetic status and genetic rearrangements in Au SINE, respectively. We retrieved 3706 different insertions of Au SINE from the 454 pyrosequencing database of T. aestivum, and found that most of the elements are inserted in A/T-rich regions, while approximately 38% of the insertions are associated with transcribed regions, including known wheat genes. We observed typical retrotransposition of Au SINE in the second generation of a newly formed wheat allohexaploid, and massive hypermethylation in CCGG sites surrounding Au SINE in the third generation. Finally, we observed huge differences in the copy numbers in diploid Triticum and Aegilops species, and a significant increase in the copy numbers in natural wheat polyploids, but no significant increase in the copy number of Au SINE in the first four generations for two of three newly formed allopolyploid species used in this study. Our data indicate that SINEs may play a prominent role in the genomic evolution of wheat through stress-induced activation. © 2013 Ben-Gurion University The Plant Journal © 2013 John Wiley & Sons Ltd.

  15. Behaviour of short-lived fission products within operating UO2 fuel elements

    International Nuclear Information System (INIS)

    Hastings, I.J.; Hunt, C.E.L.; Lipsett, J.J.

    1983-01-01

    We have carried out experiments using a ''sweep gas'' technique to determine the behaviour of short-lived fission products within operating, intact UO 2 fuel elements. The Zircaloy-4-clad elements were 500 mm long and contained fuel of density 10.65-10.71 Mg/m 3 . A He-2% H 2 carrier gas swept gaseous or volatile fission products out of the operating fuel element past a gamma spectrometer for measurement. In tests at linear powers of 45 and 60 kW/m to maximum burnups of 70 MW.h/kg U, the species measured directly at the spectrometer were generally the short-lived xenons and kryptons. We did not observe iodine or bromine during normal operation. However, we have deduced the behaviour of I-133 and I-135 from the decay of Xe-133 and Xe-135 during reactor shutdowns. Plots of R/B (released/born) against lambda (decay constant) or effective lambda for all isotopes observed at 45 and 60 kW/m show that a line of slope -0.5, corresponding with diffusion kinetics, is a good fit to the measured xenon and krypton data. Our inferred release of iodine fits the same line. From this we can extrapolate to an R/B for I-131 of about 5x10 -3 . The ANS 5.4 release correlation gives calculated results in good agreement with our measurements. (author)

  16. Easy and accurate reconstruction of whole HIV genomes from short-read sequence data with shiver

    Science.gov (United States)

    Blanquart, François; Golubchik, Tanya; Gall, Astrid; Bakker, Margreet; Bezemer, Daniela; Croucher, Nicholas J; Hall, Matthew; Hillebregt, Mariska; Ratmann, Oliver; Albert, Jan; Bannert, Norbert; Fellay, Jacques; Fransen, Katrien; Gourlay, Annabelle; Grabowski, M Kate; Gunsenheimer-Bartmeyer, Barbara; Günthard, Huldrych F; Kivelä, Pia; Kouyos, Roger; Laeyendecker, Oliver; Liitsola, Kirsi; Meyer, Laurence; Porter, Kholoud; Ristola, Matti; van Sighem, Ard; Cornelissen, Marion; Kellam, Paul; Reiss, Peter

    2018-01-01

    Abstract Studying the evolution of viruses and their molecular epidemiology relies on accurate viral sequence data, so that small differences between similar viruses can be meaningfully interpreted. Despite its higher throughput and more detailed minority variant data, next-generation sequencing has yet to be widely adopted for HIV. The difficulty of accurately reconstructing the consensus sequence of a quasispecies from reads (short fragments of DNA) in the presence of large between- and within-host diversity, including frequent indels, may have presented a barrier. In particular, mapping (aligning) reads to a reference sequence leads to biased loss of information; this bias can distort epidemiological and evolutionary conclusions. De novo assembly avoids this bias by aligning the reads to themselves, producing a set of sequences called contigs. However contigs provide only a partial summary of the reads, misassembly may result in their having an incorrect structure, and no information is available at parts of the genome where contigs could not be assembled. To address these problems we developed the tool shiver to pre-process reads for quality and contamination, then map them to a reference tailored to the sample using corrected contigs supplemented with the user’s choice of existing reference sequences. Run with two commands per sample, it can easily be used for large heterogeneous data sets. We used shiver to reconstruct the consensus sequence and minority variant information from paired-end short-read whole-genome data produced with the Illumina platform, for sixty-five existing publicly available samples and fifty new samples. We show the systematic superiority of mapping to shiver’s constructed reference compared with mapping the same reads to the closest of 3,249 real references: median values of 13 bases called differently and more accurately, 0 bases called differently and less accurately, and 205 bases of missing sequence recovered. We also

  17. Short-read reading-frame predictors are not created equal: sequence error causes loss of signal

    Directory of Open Access Journals (Sweden)

    Trimble William L

    2012-07-01

    Full Text Available Abstract Background Gene prediction algorithms (or gene callers are an essential tool for analyzing shotgun nucleic acid sequence data. Gene prediction is a ubiquitous step in sequence analysis pipelines; it reduces the volume of data by identifying the most likely reading frame for a fragment, permitting the out-of-frame translations to be ignored. In this study we evaluate five widely used ab initio gene-calling algorithms—FragGeneScan, MetaGeneAnnotator, MetaGeneMark, Orphelia, and Prodigal—for accuracy on short (75–1000 bp fragments containing sequence error from previously published artificial data and “real” metagenomic datasets. Results While gene prediction tools have similar accuracies predicting genes on error-free fragments, in the presence of sequencing errors considerable differences between tools become evident. For error-containing short reads, FragGeneScan finds more prokaryotic coding regions than does MetaGeneAnnotator, MetaGeneMark, Orphelia, or Prodigal. This improved detection of genes in error-containing fragments, however, comes at the cost of much lower (50% specificity and overprediction of genes in noncoding regions. Conclusions Ab initio gene callers offer a significant reduction in the computational burden of annotating individual nucleic acid reads and are used in many metagenomic annotation systems. For predicting reading frames on raw reads, we find the hidden Markov model approach in FragGeneScan is more sensitive than other gene prediction tools, while Prodigal, MGA, and MGM are better suited for higher-quality sequences such as assembled contigs.

  18. A sensitive short read homology search tool for paired-end read sequencing data.

    Science.gov (United States)

    Techa-Angkoon, Prapaporn; Sun, Yanni; Lei, Jikai

    2017-10-16

    Homology search is still a significant step in functional analysis for genomic data. Profile Hidden Markov Model-based homology search has been widely used in protein domain analysis in many different species. In particular, with the fast accumulation of transcriptomic data of non-model species and metagenomic data, profile homology search is widely adopted in integrated pipelines for functional analysis. While the state-of-the-art tool HMMER has achieved high sensitivity and accuracy in domain annotation, the sensitivity of HMMER on short reads declines rapidly. The low sensitivity on short read homology search can lead to inaccurate domain composition and abundance computation. Our experimental results showed that half of the reads were missed by HMMER for a RNA-Seq dataset. Thus, there is a need for better methods to improve the homology search performance for short reads. We introduce a profile homology search tool named Short-Pair that is designed for short paired-end reads. By using an approximate Bayesian approach employing distribution of fragment lengths and alignment scores, Short-Pair can retrieve the missing end and determine true domains. In particular, Short-Pair increases the accuracy in aligning short reads that are part of remote homologs. We applied Short-Pair to a RNA-Seq dataset and a metagenomic dataset and quantified its sensitivity and accuracy on homology search. The experimental results show that Short-Pair can achieve better overall performance than the state-of-the-art methodology of profile homology search. Short-Pair is best used for next-generation sequencing (NGS) data that lack reference genomes. It provides a complementary paired-end read homology search tool to HMMER. The source code is freely available at https://sourceforge.net/projects/short-pair/ .

  19. TE-Locate: A Tool to Locate and Group Transposable Element Occurrences Using Paired-End Next-Generation Sequencing Data

    OpenAIRE

    Platzer, Alexander; Nizhynska, Viktoria; Long, Quan

    2012-01-01

    Transposable elements (TEs) are common mobile DNA elements present in nearly all genomes. Since the movement of TEs within a genome can sometimes have phenotypic consequences, an accurate report of TE actions is desirable. To this end, we developed TE-Locate, a computational tool that uses paired-end reads to identify the novel locations of known TEs. TE-Locate can utilize either a database of TE sequences, or annotated TEs within the reference sequence of interest. This makes TE-Locate usefu...

  20. GenHtr: a tool for comparative assessment of genetic heterogeneity in microbial genomes generated by massive short-read sequencing

    Directory of Open Access Journals (Sweden)

    Yu GongXin

    2010-10-01

    Full Text Available Abstract Background Microevolution is the study of short-term changes of alleles within a population and their effects on the phenotype of organisms. The result of the below-species-level evolution is heterogeneity, where populations consist of subpopulations with a large number of structural variations. Heterogeneity analysis is thus essential to our understanding of how selective and neutral forces shape bacterial populations over a short period of time. The Solexa Genome Analyzer, a next-generation sequencing platform, allows millions of short sequencing reads to be obtained with great accuracy, allowing for the ability to study the dynamics of the bacterial population at the whole genome level. The tool referred to as GenHtr was developed for genome-wide heterogeneity analysis. Results For particular bacterial strains, GenHtr relies on a set of Solexa short reads on given bacteria pathogens and their isogenic reference genome to identify heterogeneity sites, the chromosomal positions with multiple variants of genes in the bacterial population, and variations that occur in large gene families. GenHtr accomplishes this by building and comparatively analyzing genome-wide heterogeneity genotypes for both the newly sequenced genomes (using massive short-read sequencing and their isogenic reference (using simulated data. As proof of the concept, this approach was applied to SRX007711, the Solexa sequencing data for a newly sequenced Staphylococcus aureus subsp. USA300 cell line, and demonstrated that it could predict such multiple variants. They include multiple variants of genes critical in pathogenesis, e.g. genes encoding a LysR family transcriptional regulator, 23 S ribosomal RNA, and DNA mismatch repair protein MutS. The heterogeneity results in non-synonymous and nonsense mutations, leading to truncated proteins for both LysR and MutS. Conclusion GenHtr was developed for genome-wide heterogeneity analysis. Although it is much more time

  1. Configurational statistics of a polymer chain with random sequence of elements

    International Nuclear Information System (INIS)

    Obukhov, S.P.

    1984-10-01

    It is shown that for a disordered polymer chain the upper critical dimension is d c =3. At d≤3 the effect of randomness increases on large scales due to the space correlations of attractive and repulsive monomers, but it can also be screened by repulsive two- or three-body interaction. The renorm group equations indicate that near the theta point it can be the large dispersion of sizes of polymers which differ only in sequences of elements. (orig.)

  2. Consideration of reinforcement mechanism in the short fiber mixing granular materials by granular element simulations

    Science.gov (United States)

    Mori, Kentaro; Kaneko, Kenji; Hashizume, Yutaka

    2017-06-01

    The short fiber mixing method is well known as one of the method to improve the strength of gran- ular soils in geotechnical engineering. Mechanical properties of the short fiber mixing granular materials are influenced by many factors, such as the mixture ratio of the short fiber, the material of short fiber, the length, and the orientation. In particular, the mixture ratio of the short fibers is very important in mixture design. In the past study, we understood that the strength is reduced by too much short fiber mixing by a series of tri-axial compression experiments. Namely, there is "optimum mixture ratio" in the short fiber mixing granular soils. In this study, to consider the mechanism of occurrence of the optimum mixture ratio, we carried out the numerical experiments by granular element method. As the results, we can understand that the strength decrease when too much grain-fiber contact points exist, because a friction coefficient is smaller than the grain-grain contact points.

  3. TE-Locate: A Tool to Locate and Group Transposable Element Occurrences Using Paired-End Next-Generation Sequencing Data.

    Science.gov (United States)

    Platzer, Alexander; Nizhynska, Viktoria; Long, Quan

    2012-09-12

    Transposable elements (TEs) are common mobile DNA elements present in nearly all genomes. Since the movement of TEs within a genome can sometimes have phenotypic consequences, an accurate report of TE actions is desirable. To this end, we developed TE-Locate, a computational tool that uses paired-end reads to identify the novel locations of known TEs. TE-Locate can utilize either a database of TE sequences, or annotated TEs within the reference sequence of interest. This makes TE-Locate useful in the search for any mobile sequence, including retrotransposed gene copies. One major concern is to act on the correct hierarchy level, thereby avoiding an incorrect calling of a single insertion as multiple events of TEs with high sequence similarity. We used the (super)family level, but TE-Locate can also use any other level, right down to the individual transposable element. As an example of analysis with TE-Locate, we used the Swedish population in the 1,001 Arabidopsis genomes project, and presented the biological insights gained from the novel TEs, inducing the association between different TE superfamilies. The program is freely available, and the URL is provided in the end of the paper.

  4. TE-Locate: A Tool to Locate and Group Transposable Element Occurrences Using Paired-End Next-Generation Sequencing Data

    Directory of Open Access Journals (Sweden)

    Quan Long

    2012-09-01

    Full Text Available Transposable elements (TEs are common mobile DNA elements present in nearly all genomes. Since the movement of TEs within a genome can sometimes have phenotypic consequences, an accurate report of TE actions is desirable. To this end, we developed TE-Locate, a computational tool that uses paired-end reads to identify the novel locations of known TEs. TE-Locate can utilize either a database of TE sequences, or annotated TEs within the reference sequence of interest. This makes TE-Locate useful in the search for any mobile sequence, including retrotransposed gene copies. One major concern is to act on the correct hierarchy level, thereby avoiding an incorrect calling of a single insertion as multiple events of TEs with high sequence similarity. We used the (superfamily level, but TE-Locate can also use any other level, right down to the individual transposable element. As an example of analysis with TE-Locate, we used the Swedish population in the 1,001 Arabidopsis genomes project, and presented the biological insights gained from the novel TEs, inducing the association between different TE superfamilies. The program is freely available, and the URL is provided in the end of the paper.

  5. Techniques for Presenting the Short Story in the Advanced ESL Classroom.

    Science.gov (United States)

    Williamson, Julia

    1989-01-01

    Suggestions are made for introducing students, especially those at the college level, to American short stories. An anthology of stories, chronologically presented, is noted as a useful text. Three approaches for presentation include historical sequencing, grouping according to salient elements of fiction, and grouping by theme. Pre-reading…

  6. Computational complexity of algorithms for sequence comparison, short-read assembly and genome alignment.

    Science.gov (United States)

    Baichoo, Shakuntala; Ouzounis, Christos A

    A multitude of algorithms for sequence comparison, short-read assembly and whole-genome alignment have been developed in the general context of molecular biology, to support technology development for high-throughput sequencing, numerous applications in genome biology and fundamental research on comparative genomics. The computational complexity of these algorithms has been previously reported in original research papers, yet this often neglected property has not been reviewed previously in a systematic manner and for a wider audience. We provide a review of space and time complexity of key sequence analysis algorithms and highlight their properties in a comprehensive manner, in order to identify potential opportunities for further research in algorithm or data structure optimization. The complexity aspect is poised to become pivotal as we will be facing challenges related to the continuous increase of genomic data on unprecedented scales and complexity in the foreseeable future, when robust biological simulation at the cell level and above becomes a reality. Copyright © 2017 Elsevier B.V. All rights reserved.

  7. Short interspersed elements (SINEs) in plants: origin, classification, and use as phylogenetic markers.

    Science.gov (United States)

    Deragon, Jean-Marc; Zhang, Xiaoyu

    2006-12-01

    Short interspersed elements (SINEs) are a class of dispersed mobile sequences that use RNA as an intermediate in an amplification process called retroposition. The presence-absence of a SINE at a given locus has been used as a meaningful classification criterion to evaluate phylogenetic relations among species. We review here recent developments in the characterisation of plant SINEs and their use as molecular makers to retrace phylogenetic relations among wild and cultivated Oryza and Brassica species. In Brassicaceae, further use of SINE markers is limited by our partial knowledge of endogenous SINE families (their origin and evolution histories) and by the absence of a clear classification. To solve this problem, phylogenetic relations among all known Brassicaceae SINEs were analyzed and a new classification, grouping SINEs in 15 different families, is proposed. The relative age and size of each Brassicaceae SINE family was evaluated and new phylogenetically supported subfamilies were described. We also present evidence suggesting that new potentially active SINEs recently emerged in Brassica oleracea from the shuffling of preexisting SINE portions. Finally, the comparative evolution history of SINE families present in Arabidopsis thaliana and Brassica oleracea revealed that SINEs were in general more active in the Brassica lineage. The importance of these new data for the use of Brassicaceae SINEs as molecular markers in future applications is discussed.

  8. Transcriptional activity of transposable elements in coelacanth.

    Science.gov (United States)

    Forconi, Mariko; Chalopin, Domitille; Barucca, Marco; Biscotti, Maria Assunta; De Moro, Gianluca; Galiana, Delphine; Gerdol, Marco; Pallavicini, Alberto; Canapa, Adriana; Olmo, Ettore; Volff, Jean-Nicolas

    2014-09-01

    The morphological stasis of coelacanths has long suggested a slow evolutionary rate. General genomic stasis might also imply a decrease of transposable elements activity. To evaluate the potential activity of transposable elements (TEs) in "living fossil" species, transcriptomic data of Latimeria chalumnae and its Indonesian congener Latimeria menadoensis were compared through the RNA-sequencing mapping procedures in three different organs (liver, testis, and muscle). The analysis of coelacanth transcriptomes highlights a significant percentage of transcribed TEs in both species. Major contributors are LINE retrotransposons, especially from the CR1 family. Furthermore, some particular elements such as a LF-SINE and a LINE2 sequences seem to be more expressed than other elements. The amount of TEs expressed in testis suggests possible transposition burst in incoming generations. Moreover, significant amount of TEs in liver and muscle transcriptomes were also observed. Analyses of elements displaying marked organ-specific expression gave us the opportunity to highlight exaptation cases, that is, the recruitment of TEs as new cellular genes, but also to identify a new Latimeria-specific family of Short Interspersed Nuclear Elements called CoeG-SINEs. Overall, transcriptome results do not seem to be in line with a slow-evolving genome with poor TE activity. © 2013 Wiley Periodicals, Inc.

  9. Rapid Multiplex Small DNA Sequencing on the MinION Nanopore Sequencing Platform

    Directory of Open Access Journals (Sweden)

    Shan Wei

    2018-05-01

    Full Text Available Real-time sequencing of short DNA reads has a wide variety of clinical and research applications including screening for mutations, target sequences and aneuploidy. We recently demonstrated that MinION, a nanopore-based DNA sequencing device the size of a USB drive, could be used for short-read DNA sequencing. In this study, an ultra-rapid multiplex library preparation and sequencing method for the MinION is presented and applied to accurately test normal diploid and aneuploidy samples’ genomic DNA in under three hours, including library preparation and sequencing. This novel method shows great promise as a clinical diagnostic test for applications requiring rapid short-read DNA sequencing.

  10. Location analysis for the estrogen receptor-α reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements

    Science.gov (United States)

    Mason, Christopher E.; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M.; Kallen, Roland G.; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B.

    2010-01-01

    Location analysis for estrogen receptor-α (ERα)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERα-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that ∼50% of all ERα-bound loci do not have a discernable ERE and show that most ERα-bound EREs are not perfect consensus EREs. Approximately one-third of all ERα-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERα-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERα binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers. PMID:20047966

  11. Location analysis for the estrogen receptor-alpha reveals binding to diverse ERE sequences and widespread binding within repetitive DNA elements.

    Science.gov (United States)

    Mason, Christopher E; Shu, Feng-Jue; Wang, Cheng; Session, Ryan M; Kallen, Roland G; Sidell, Neil; Yu, Tianwei; Liu, Mei Hui; Cheung, Edwin; Kallen, Caleb B

    2010-04-01

    Location analysis for estrogen receptor-alpha (ERalpha)-bound cis-regulatory elements was determined in MCF7 cells using chromatin immunoprecipitation (ChIP)-on-chip. Here, we present the estrogen response element (ERE) sequences that were identified at ERalpha-bound loci and quantify the incidence of ERE sequences under two stringencies of detection: ERE sequence. We demonstrate that approximately 50% of all ERalpha-bound loci do not have a discernable ERE and show that most ERalpha-bound EREs are not perfect consensus EREs. Approximately one-third of all ERalpha-bound ERE sequences reside within repetitive DNA sequences, most commonly of the AluS family. In addition, the 3-bp spacer between the inverted ERE half-sites, rather than being random nucleotides, is C(A/T)G-enriched at bona fide receptor targets. Diverse ERalpha-bound loci were validated using electrophoretic mobility shift assay and ChIP-polymerase chain reaction (PCR). The functional significance of receptor-bound loci was demonstrated using luciferase reporter assays which proved that repetitive element ERE sequences contribute to enhancer function. ChIP-PCR demonstrated estrogen-dependent recruitment of the coactivator SRC3 to these loci in vivo. Our data demonstrate that ERalpha binds to widely variant EREs with less sequence specificity than had previously been suspected and that binding at repetitive and nonrepetitive genomic targets is favored by specific trinucleotide spacers.

  12. HBVRegDB: Annotation, comparison, detection and visualization of regulatory elements in hepatitis B virus sequences

    Directory of Open Access Journals (Sweden)

    Firth Andrew E

    2007-12-01

    Full Text Available Abstract Background The many Hepadnaviridae sequences available have widely varied functional annotation. The genomes are very compact (~3.2 kb but contain multiple layers of functional regulatory elements in addition to coding regions. Key regions are subject to purifying selection, as mutations in these regions will produce non-functional viruses. Results These genomic sequences have been organized into a structured database to facilitate research at the molecular level. HBVRegDB is a comparative genomic analysis tool with an integrated underlying sequence database. The database contains genomic sequence data from representative viruses. In addition to INSDC and RefSeq annotation, HBVRegDB also contains expert and systematically calculated annotations (e.g. promoters and comparative genome analysis results (e.g. blastn, tblastx. It also contains analyses based on curated HBV alignments. Information about conserved regions – including primary conservation (e.g. CDS-Plotcon and RNA secondary structure predictions (e.g. Alidot – is integrated into the database. A large amount of data is graphically presented using the GBrowse (Generic Genome Browser adapted for analysis of viral genomes. Flexible query access is provided based on any annotated genomic feature. Novel regulatory motifs can be found by analysing the annotated sequences. Conclusion HBVRegDB serves as a knowledge database and as a comparative genomic analysis tool for molecular biologists investigating HBV. It is publicly available and complementary to other viral and HBV focused datasets and tools http://hbvregdb.otago.ac.nz. The availability of multiple and highly annotated sequences of viral genomes in one database combined with comparative analysis tools facilitates detection of novel genomic elements.

  13. Diversification, evolution and methylation of short interspersed nuclear element families in sugar beet and related Amaranthaceae species.

    Science.gov (United States)

    Schwichtenberg, Katrin; Wenke, Torsten; Zakrzewski, Falk; Seibt, Kathrin M; Minoche, André; Dohm, Juliane C; Weisshaar, Bernd; Himmelbauer, Heinz; Schmidt, Thomas

    2016-01-01

    Short interspersed nuclear elements (SINEs) are non-autonomous non-long terminal repeat retrotransposons which are widely distributed in eukaryotic organisms. While SINEs have been intensively studied in animals, only limited information is available about plant SINEs. We analysed 22 SINE families from seven genomes of the Amaranthaceae family and identified 34 806 SINEs, including 19 549 full-length copies. With the focus on sugar beet (Beta vulgaris), we performed a comparative analysis of the diversity, genomic and chromosomal organization and the methylation of SINEs to provide a detailed insight into the evolution and age of Amaranthaceae SINEs. The lengths of consensus sequences of SINEs range from 113 nucleotides (nt) up to 224 nt. The SINEs show dispersed distribution on all chromosomes but were found with higher incidence in subterminal euchromatic chromosome regions. The methylation of SINEs is increased compared with their flanking regions, and the strongest effect is visible for cytosines in the CHH context, indicating an involvement of asymmetric methylation in the silencing of SINEs. © 2015 The Authors The Plant Journal © 2015 John Wiley & Sons Ltd.

  14. A novel family of sequence-specific endoribonucleases associated with the clustered regularly interspaced short palindromic repeats.

    Science.gov (United States)

    Beloglazova, Natalia; Brown, Greg; Zimmerman, Matthew D; Proudfoot, Michael; Makarova, Kira S; Kudritska, Marina; Kochinyan, Samvel; Wang, Shuren; Chruszcz, Maksymilian; Minor, Wladek; Koonin, Eugene V; Edwards, Aled M; Savchenko, Alexei; Yakunin, Alexander F

    2008-07-18

    Clustered regularly interspaced short palindromic repeats (CRISPRs) together with the associated CAS proteins protect microbial cells from invasion by foreign genetic elements using presently unknown molecular mechanisms. All CRISPR systems contain proteins of the CAS2 family, suggesting that these uncharacterized proteins play a central role in this process. Here we show that the CAS2 proteins represent a novel family of endoribonucleases. Six purified CAS2 proteins from diverse organisms cleaved single-stranded RNAs preferentially within U-rich regions. A representative CAS2 enzyme, SSO1404 from Sulfolobus solfataricus, cleaved the phosphodiester linkage on the 3'-side and generated 5'-phosphate- and 3'-hydroxyl-terminated oligonucleotides. The crystal structure of SSO1404 was solved at 1.6A resolution revealing the first ribonuclease with a ferredoxin-like fold. Mutagenesis of SSO1404 identified six residues (Tyr-9, Asp-10, Arg-17, Arg-19, Arg-31, and Phe-37) that are important for enzymatic activity and suggested that Asp-10 might be the principal catalytic residue. Thus, CAS2 proteins are sequence-specific endoribonucleases, and we propose that their role in the CRISPR-mediated anti-phage defense might involve degradation of phage or cellular mRNAs.

  15. A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design.

    Science.gov (United States)

    Smith, Robin P; Riesenfeld, Samantha J; Holloway, Alisha K; Li, Qiang; Murphy, Karl K; Feliciano, Natalie M; Orecchia, Lorenzo; Oksenberg, Nir; Pollard, Katherine S; Ahituv, Nadav

    2013-07-18

    Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.

  16. Assessing the 5S ribosomal RNA heterogeneity in Arabidopsis thaliana using short RNA next generation sequencing data.

    Science.gov (United States)

    Szymanski, Maciej; Karlowski, Wojciech M

    2016-01-01

    In eukaryotes, ribosomal 5S rRNAs are products of multigene families organized within clusters of tandemly repeated units. Accumulation of genomic data obtained from a variety of organisms demonstrated that the potential 5S rRNA coding sequences show a large number of variants, often incompatible with folding into a correct secondary structure. Here, we present results of an analysis of a large set of short RNA sequences generated by the next generation sequencing techniques, to address the problem of heterogeneity of the 5S rRNA transcripts in Arabidopsis and identification of potentially functional rRNA-derived fragments.

  17. Effects of High Intensity White Noise on Short-Term Memory for Position in a List and Sequence

    Science.gov (United States)

    Daee, Safar; Wilding, J. M.

    1977-01-01

    Seven experiments are described investigating the effecy of high intensity white noise during the visual presentation of words on a number of short-term memory tasks. Examines results relative to position learning and sequence learning. (Editor/RK)

  18. Molecular genetics and epigenetics of CACTA elements

    KAUST Repository

    Fedoroff, Nina V.

    2013-08-21

    The CACTA transposons, so named for a highly conserved motif at element ends, comprise one of the most abundant superfamilies of Class 2 (cut-and-paste) plant transposons. CACTA transposons characteristically include subterminal sequences of several hundred nucleotides containing closely spaced direct and inverted repeats of a short, conserved sequence of 14-15 bp. The Supressor-mutator (Spm) transposon, identified and subjected to detailed genetic analysis by Barbara McClintock, remains the paradigmatic element of the CACTA family. The Spm transposon encodes two proteins required for transposition, the transposase (TnpD) and a regulatory protein (TnpA) that binds to the subterminal repeats. Spm expression is subject to both genetic and epigenetic regulation. The Spm-encoded TnpA serves as an activator of the epigenetically inactivated, methylated Spm, stimulating both transient and heritable activation of the transposon. TnpA also serves as a negative regulator of the demethylated active element promoter and is required, in addition to the TnpD, for transposition. © Springer Science+Business Media, New York 2013.

  19. Phylogenetic relationships among East African haplochromine fish as revealed by short interspersed elements (SINEs).

    Science.gov (United States)

    Terai, Yohey; Takezaki, Naoko; Mayer, Werner E; Tichy, Herbert; Takahata, Naoyuki; Klein, Jan; Okada, Norihiro

    2004-01-01

    Genomic DNA libraries were prepared from two endemic species of Lake Victoria haplochromine (cichlid) fish and used to isolate and characterize a set of short interspersed elements (SINEs). The distribution and sequences of the SINEs were used to infer phylogenetic relationships among East African haplochromines. The SINE-based classification divides the fish into four groups, which, in order of their divergence from a stem lineage, are the endemic Lake Tanganyika flock (group 1); fish of the nonendemic, monotypic, widely distributed genus Astatoreochromis (group 2); the endemic Lake Malawi flock (group 3); and group 4, which contains fish from widely dispersed East African localities including Lakes Victoria, Edward, George, Albert, and Rukwa, as well as many rivers. The group 4 haplochromines are characterized by a subset of polymorphic SINEs, each of which is present in some individuals and absent in others of the same population at a given locality, the same morphologically defined species, and the same mtDNA-defined haplogroup. SINE-defined group 4 contains six of the seven previously described mtDNA haplogroups. One of the polymorphic SINEs appears to be fixed in the endemic Lake Victoria flock; four others display the presence-or-absence polymorphism within the species of this flock. These findings have implications for the origin of Lake Victoria cichlids and for their founding population sizes.

  20. Genome dynamics of short oligonucleotides: the example of bacterial DNA uptake enhancing sequences.

    Directory of Open Access Journals (Sweden)

    Mohammed Bakkali

    Full Text Available Among the many bacteria naturally competent for transformation by DNA uptake-a phenomenon with significant clinical and financial implications- Pasteurellaceae and Neisseriaceae species preferentially take up DNA containing specific short sequences. The genomic overrepresentation of these DNA uptake enhancing sequences (DUES causes preferential uptake of conspecific DNA, but the function(s behind this overrepresentation and its evolution are still a matter for discovery. Here I analyze DUES genome dynamics and evolution and test the validity of the results to other selectively constrained oligonucleotides. I use statistical methods and computer simulations to examine DUESs accumulation in Haemophilus influenzae and Neisseria gonorrhoeae genomes. I analyze DUESs sequence and nucleotide frequencies, as well as those of all their mismatched forms, and prove the dependence of DUESs genomic overrepresentation on their preferential uptake by quantifying and correlating both characteristics. I then argue that mutation, uptake bias, and weak selection against DUESs in less constrained parts of the genome combined are sufficient enough to cause DUESs accumulation in susceptible parts of the genome with no need for other DUES function. The distribution of overrepresentation values across sequences with different mismatch loads compared to the DUES suggests a gradual yet not linear molecular drive of DNA sequences depending on their similarity to the DUES. Other genomically overrepresented sequences, both pro- and eukaryotic, show similar distribution of frequencies suggesting that the molecular drive reported above applies to other frequent oligonucleotides. Rare oligonucleotides, however, seem to be gradually drawn to genomic underrepresentation, thus, suggesting a molecular drag. To my knowledge this work provides the first clear evidence of the gradual evolution of selectively constrained oligonucleotides, including repeated, palindromic and protein

  1. The Genomic Architecture of Novel Simulium damnosum Wolbachia Prophage Sequence Elements and Implications for Onchocerciasis Epidemiology

    Directory of Open Access Journals (Sweden)

    James L. Crainey

    2017-05-01

    Full Text Available Research interest in Wolbachia is growing as new discoveries and technical advancements reveal the public health importance of both naturally occurring and artificial infections. Improved understanding of the Wolbachia bacteriophages (WOs WOcauB2 and WOcauB3 [belonging to a sub-group of four WOs encoding serine recombinases group 1 (sr1WOs], has enhanced the prospect of novel tools for the genetic manipulation of Wolbachia. The basic biology of sr1WOs, including host range and mode of genomic integration is, however, still poorly understood. Very few sr1WOs have been described, with two such elements putatively resulting from integrations at the same Wolbachia genome loci, about 2 kb downstream from the FtsZ cell-division gene. Here, we characterize the DNA sequence flanking the FtsZ gene of wDam, a genetically distinct line of Wolbachia isolated from the West African onchocerciasis vector Simulium squamosum E. Using Roche 454 shot-gun and Sanger sequencing, we have resolved >32 kb of WO prophage sequence into three contigs representing three distinct prophage elements. Spanning ≥36 distinct WO open reading frame gene sequences, these prophage elements correspond roughly to three different WO modules: a serine recombinase and replication module (sr1RRM, a head and base-plate module and a tail module. The sr1RRM module contains replication genes and a Holliday junction recombinase and is unique to the sr1 group WOs. In the extreme terminal of the tail module there is a SpvB protein homolog—believed to have insecticidal properties and proposed to have a role in how Wolbachia parasitize their insect hosts. We propose that these wDam prophage modules all derive from a single WO genome, which we have named here sr1WOdamA1. The best-match database sequence for all of our sr1WOdamA1-predicted gene sequences was annotated as of Wolbachia or Wolbachia phage sourced from an arthropod. Clear evidence of exchange between sr1WOdamA1 and other Wolbachia

  2. Repetitive elements may comprise over two-thirds of the human genome.

    Directory of Open Access Journals (Sweden)

    A P Jason de Koning

    2011-12-01

    Full Text Available Transposable elements (TEs are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo "clouds". We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%-69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM, to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp. Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed "element-specific" P-clouds (ESPs to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed.

  3. Dispersed repetitive sequences in eukaryotic genomes and their possible biological significance

    International Nuclear Information System (INIS)

    Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.

    1983-01-01

    In this paper is described the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized an ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B + RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transpositions or do they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table

  4. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes

    NARCIS (Netherlands)

    Al-Attar, S.; Westra, E.R.; Oost, van der J.; Brouns, S.J.J.

    2011-01-01

    Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences

  5. A Short Interspersed Nuclear Element (SINE)-Based Real-Time PCR Approach to Detect and Quantify Porcine Component in Meat Products.

    Science.gov (United States)

    Zhang, Chi; Fang, Xin; Qiu, Haopu; Li, Ning

    2015-01-01

    Real-time PCR amplification of mitochondria gene could not be used for DNA quantification, and that of single copy DNA did not allow an ideal sensitivity. Moreover, cross-reactions among similar species were commonly observed in the published methods amplifying repetitive sequence, which hindered their further application. The purpose of this study was to establish a short interspersed nuclear element (SINE)-based real-time PCR approach having high specificity for species detection that could be used in DNA quantification. After massive screening of candidate Sus scrofa SINEs, one optimal combination of primers and probe was selected, which had no cross-reaction with other common meat species. LOD of the method was 44 fg DNA/reaction. Further, quantification tests showed this approach was practical in DNA estimation without tissue variance. Thus, this study provided a new tool for qualitative detection of porcine component, which could be promising in the QC of meat products.

  6. Short interspersed elements (SINEs) are a major source of canine genomic diversity.

    Science.gov (United States)

    Wang, Wei; Kirkness, Ewen F

    2005-12-01

    SINEs are retrotransposons that have enjoyed remarkable reproductive success during the course of mammalian evolution, and have played a major role in shaping mammalian genomes. Previously, an analysis of survey-sequence data from an individual dog (a poodle) indicated that canine genomes harbor a high frequency of alleles that differ only by the absence or presence of a SINEC_Cf repeat. Comparison of this survey-sequence data with a draft genome sequence of a distinct dog (a boxer) has confirmed this prediction, and revealed the chromosomal coordinates for >10,000 loci that are bimorphic for SINEC_Cf insertions. Analysis of SINE insertion sites from the genomes of nine additional dogs indicates that 3%-5% are absent from either the poodle or boxer genome sequences--suggesting that an additional 10,000 bimorphic loci could be readily identified in the general dog population. We describe a methodology that can be used to identify these loci, and could be adapted to exploit these bimorphic loci for genotyping purposes. Approximately half of all annotated canine genes contain SINEC_Cf repeats, and these elements are occasionally transcribed. When transcribed in the antisense orientation, they provide splice acceptor sites that can result in incorporation of novel exons. The high frequency of bimorphic SINE insertions in the dog population is predicted to provide numerous examples of allele-specific transcription patterns that will be valuable for the study of differential gene expression among multiple dog breeds.

  7. Screening of SHOX gene sequence variants in Saudi Arabian children with idiopathic short stature.

    Science.gov (United States)

    Alharthi, Abdulla A; El-Hallous, Ehab I; Talaat, Iman M; Alghamdi, Hamed A; Almalki, Matar I; Gaber, Ahmed

    2017-10-01

    Short stature affects approximately 2%-3% of children, representing one of the most frequent disorders for which clinical attention is sought during childhood. Despite assumed genetic heterogeneity, mutations or deletions in the short stature homeobox-containing gene ( SHOX ) are frequently detected in subjects with short stature. Idiopathic short stature (ISS) refers to patients with short stature for various unknown reasons. The goal of this study was to screen all the exons of SHOX to identify related mutations. We screened all the exons of SHOX for mutations analysis in 105 ISS children patients (57 girls and 48 boys) living in Taif governorate, KSA using a direct DNA sequencing method. Height, arm span, and sitting height were recorded, and subischial leg length was calculated. A total of 30 of 105 ISS patients (28%) contained six polymorphic variants in exons 1, 2, 4, and 6. One mutation was found in the DNA domain binding region of exon 4. Three of these polymorphic variants were novel, while the others were reported previously. There were no significant differences in anthropometric measures in ISS patients with and without identifiable polymorphic variants in SHOX . In Saudi Arabia ISS patients, rather than SHOX , it is possible that new genes are involved in longitudinal growth. Additional molecular analysis is required to diagnose and understand the etiology of this disease.

  8. Isolation and amino acid sequence of a short-chain neurotoxin from an Australian elapid snake, Pseudechis australis.

    OpenAIRE

    Takasaki, C; Tamiya, N

    1985-01-01

    A short-chain neurotoxin Pseudechis australis a (toxin Pa a) was isolated from the venom of an Australian elapid snake Pseudechis australis (king brown snake) by sequential chromatography on CM-cellulose, Sephadex G-50 and CM-cellulose columns. Toxin Pa a has an LD50 (intravenous) value of 76 micrograms/kg body wt. in mice and consists of 62 amino acid residues. The amino acid sequence of Pa a shows considerable homology with those of short-chain neurotoxins of elapid snakes, especially of tr...

  9. A SINE in the genome of the cephalochordate amphioxus is an Alu element

    Science.gov (United States)

    Holland, Linda Z.

    2006-01-01

    Transposable elements of about 300 bp, termed “short interspersed nucleotide elements or SINEs are common in eukaryotes. However, Alu elements, SINEs containing restriction sites for the AluI enzyme, have been known only from primates. Here I report the first SINE found in the genome of the cephalochordate, amphioxus. It is an Alu element of 375 bp that does not share substantial identity with any genomic sequences in vertebrates. It was identified because it was located in the FoxD regulatory region in a cosmid derived from one individual, but absent from the two FoxD alleles of BACs from a second individual. However, searches of sequences of BACs and genomic traces from this second individual gave an estimate of 50-100 copies in the amphioxus genome. The finding of an Alu element in amphioxus raises the question of whether Alu elements in amphioxus and primates arose by convergent evolution or by inheritance from a common ancestor. Genome-wide analyses of transposable elements in amphioxus and other chordates such as tunicates, agnathans and cartilaginous fishes could well provide the answer. PMID:16733535

  10. Transposable Elements: No More 'Junk DNA'

    Directory of Open Access Journals (Sweden)

    Yun-Ji Kim

    2012-12-01

    Full Text Available Since the advent of whole-genome sequencing, transposable elements (TEs, just thought to be 'junk' DNA, have been noticed because of their numerous copies in various eukaryotic genomes. Many studies about TEs have been conducted to discover their functions in their host genomes. Based on the results of those studies, it has been generally accepted that they have a function to cause genomic and genetic variations. However, their infinite functions are not fully elucidated. Through various mechanisms, including de novo TE insertions, TE insertion-mediated deletions, and recombination events, they manipulate their host genomes. In this review, we focus on Alu, L1, human endogenous retrovirus, and short interspersed element/variable number of tandem repeats/Alu (SVA elements and discuss how they have affected primate genomes, especially the human and chimpanzee genomes, since their divergence.

  11. Short-distance matrix elements for $D$-meson mixing for 2+1 lattice QCD

    Energy Technology Data Exchange (ETDEWEB)

    Chang, Chia Cheng [Univ. of Illinois, Champaign, IL (United States)

    2015-01-01

    We study the short-distance hadronic matrix elements for D-meson mixing with partially quenched Nf = 2+1 lattice QCD. We use a large set of the MIMD Lattice Computation Collaboration's gauge configurations with a2 tadpole-improved staggered sea quarks and tadpole-improved Lüscher-Weisz gluons. We use the a2 tadpole-improved action for valence light quarks and the Sheikoleslami-Wohlert action with the Fermilab interpretation for the valence charm quark. Our calculation covers the complete set of five operators needed to constrain new physics models for D-meson mixing. We match our matrix elements to the MS-NDR scheme evaluated at 3 GeV. We report values for the Beneke-Buchalla-Greub-Lenz-Nierste choice of evanescent operators.

  12. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences

    OpenAIRE

    Lescot, Magali; Déhais, Patrice; Thijs, Gert; Marchal, Kathleen; Moreau, Yves; Van de Peer, Yves; Rouzé, Pierre; Rombauts, Stephane

    2002-01-01

    PlantCARE is a database of plant cis-acting regulatory elements, enhancers and repressors. Regulatory elements are represented by positional matrices, consensus sequences and individual sites on particular promoter sequences. Links to the EMBL, TRANSFAC and MEDLINE databases are provided when available. Data about the transcription sites are extracted mainly from the literature, supplemented with an increasing number of in silico predicted data. Apart from a general description for specific t...

  13. Sequence elements correlating with circulating viral load in genotype 1b hepatitis C virus infection

    International Nuclear Information System (INIS)

    Watanabe, Hideki; Nagayama, Kazuyoshi; Enomoto, Nobuyuki; Itakura, Jun; Tanabe, Yoko; Hamano, Kosei; Izumi, Namiki; Sato, Chifumi; Watanabe, Mamoru

    2003-01-01

    The correlation between hepatitis C virus (HCV) genomic sequences and circulating HCV RNA levels was assessed to investigate the genetic elements affecting viral load. The interferon sensitivity-determining region (ISDR) sequence and the serum viral load were strongly correlated in 226 patients examined. Analysis of the entire HCV genome from six patients (three with a high and the others with a low viral load) with similar ISDR sequences identified several candidate residues associated with viral load. The amino acid (aa) sequences of these candidate residues and flanking regions in 67 additional patients revealed that only the residue at aa 962 varied significantly between the HCV patients with low and high serum loads (P 0.042). At this position, alanine was observed more frequently in the patients with a high viral load. In conclusion, our results strongly suggest that serum HCV RNA loads are inversely correlated with amino acid substitutions in the ISDR, and aa 962 was identified as a possible second determinant of serum HCV RNA load

  14. Short-circuit protection of LLC resonant converter using voltages across resonant tank elements

    Directory of Open Access Journals (Sweden)

    Denys Igorovych Zaikin

    2015-06-01

    Full Text Available This paper describes two methods for the short-circuit protection of the LLC resonant converter. One of them uses the voltage across the capacitor and the other uses the voltage across the inductor of the resonant tank. These voltages can be processed (integrated or differentiated to recover the resonant tank current. The two circuits illustrated in the described methods make it possible to develop a robust LLC converter design and to avoid using lossy current measurement elements, such as a shunt resistor or current transformer. The methods also allow measuring resonant tank current without breaking high-current paths and connecting the measuring circuit in parallel with the inductor or capacitor of the resonant tank. Practical implementations of these indirect current measurements have been experimentally tested for the short-circuit protection of the 1600 W LLC converter.

  15. Retroposition of the AFC family of SINEs (short interspersed repetitive elements) before and during the adaptive radiation of cichlid fishes in Lake Malawi and related inferences about phylogeny.

    Science.gov (United States)

    Takahashi, K; Nishida, M; Yuma, M; Okada, N

    2001-01-01

    Lake Malawi is home to more than 450 species of endemic cichlids, which provide a spectacular example of adaptive radiation. To clarify the phylogenetic relationships among these fish, we examined the presence and absence of SINEs (short interspersed repetitive elements) at orthologous loci. We identified six loci at which a SINE sequence had apparently been specifically inserted by retroposition in the common ancestor of all the investigated species of endemic cichlids in Lake Malawi. At another locus, unique sharing of a SINE sequence was evident among all the investigated species of endemic non-Mbuna cichlids with the exception of Rhamphochromis sp. The relationships were in good agreement with those deduced in previous studies with various different markers, demonstrating that the SINE method is useful for the elucidation of phylogenetic relationships among cichlids in Lake Malawi. We also characterized a locus that exhibited transspecies polymorphism with respect to the presence or absence of the SINE sequence among non-Mbuna species. This result suggests that incomplete lineage sorting and/or interspecific hybridization might have occurred or be occurring among the species in this group, which might potentially cause misinterpretation of phylogenetic data, in particular when a single-locus marker, such as a sequence in the mitochondrial DNA, is used for analysis.

  16. Molecular reconstruction of extinct LINE-1 elements and their interaction with nonautonomous elements.

    Science.gov (United States)

    Wagstaff, Bradley J; Kroutter, Emily N; Derbes, Rebecca S; Belancio, Victoria P; Roy-Engel, Astrid M

    2013-01-01

    Non-long terminal repeat retroelements continue to impact the human genome through cis-activity of long interspersed element-1 (LINE-1 or L1) and trans-mobilization of Alu. Current activity is dominated by modern subfamilies of these elements, leaving behind an evolutionary graveyard of extinct Alu and L1 subfamilies. Because Alu is a nonautonomous element that relies on L1 to retrotranspose, there is the possibility that competition between these elements has driven selection and antagonistic coevolution between Alu and L1. Through analysis of synonymous versus nonsynonymous codon evolution across L1 subfamilies, we find that the C-terminal ORF2 cys domain experienced a dramatic increase in amino acid substitution rate in the transition from L1PA5 to L1PA4 subfamilies. This observation coincides with the previously reported rapid evolution of ORF1 during the same transition period. Ancestral Alu sequences have been previously reconstructed, as their short size and ubiquity have made it relatively easy to retrieve consensus sequences from the human genome. In contrast, creating constructs of extinct L1 copies is a more laborious task. Here, we report our efforts to recreate and evaluate the retrotransposition capabilities of two ancestral L1 elements, L1PA4 and L1PA8 that were active ~18 and ~40 Ma, respectively. Relative to the modern L1PA1 subfamily, we find that both elements are similarly active in a cell culture retrotransposition assay in HeLa, and both are able to efficiently trans-mobilize Alu elements from several subfamilies. Although we observe some variation in Alu subfamily retrotransposition efficiency, any coevolution that may have occurred between LINEs and SINEs is not evident from these data. Population dynamics and stochastic variation in the number of active source elements likely play an important role in individual LINE or SINE subfamily amplification. If coevolution also contributes to changing retrotransposition rates and the progression of

  17. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).

  18. Enhanced production of recombinant proteins with Corynebacterium glutamicum by deletion of insertion sequences (IS elements).

    Science.gov (United States)

    Choi, Jae Woong; Yim, Sung Sun; Kim, Min Jeong; Jeong, Ki Jun

    2015-12-29

    In most bacteria, various jumping genetic elements including insertion sequences elements (IS elements) cause a variety of genetic rearrangements resulting in harmful effects such as genome and recombinant plasmid instability. The genetic stability of a plasmid in a host is critical for high-level production of recombinant proteins, and in this regard, the development of an IS element-free strain could be a useful strategy for the enhanced production of recombinant proteins. Corynebacterium glutamicum, which is a workhorse in the industrial-scale production of various biomolecules including recombinant proteins, also has several IS elements, and it is necessary to identify the critical IS elements and to develop IS element deleted strain. From the cultivation of C. glutamicum harboring a plasmid for green fluorescent protein (GFP) gene expression, non-fluorescent clones were isolated by FACS (fluorescent activated cell sorting). All the isolated clones had insertions of IS elements in the GFP coding region, and two major IS elements (ISCg1 and ISCg2 families) were identified. By co-cultivating cells harboring either the isolated IS element-inserted plasmid or intact plasmid, it was clearly confirmed that cells harboring the IS element-inserted plasmids became dominant during the cultivation due to their growth advantage over cells containing intact plasmids, which can cause a significant reduction in recombinant protein production during cultivation. To minimize the harmful effects of IS elements on the expression of heterologous genes in C. glutamicum, two IS element free C. glutamicum strains were developed in which each major IS element was deleted, and enhanced productivity in the engineered C. glutamicum strain was successfully demonstrated with three models: GFP, poly(3-hydroxybutyrate) [P(3HB)] and γ-aminobutyrate (GABA). Our findings clearly indicate that the hopping of IS elements could be detrimental to the production of recombinant proteins in C

  19. Distinct Element Method modelling of fold-related fractures in a multilayer sequence

    Science.gov (United States)

    Kaserer, Klemens; Schöpfer, Martin P. J.; Grasemann, Bernhard

    2017-04-01

    Natural fractures have a significant impact on the performance of hydrocarbon systems/reservoirs. In a multilayer sequence, both the fracture density within the individual layers and the type of fracture intersection with bedding contacts are key parameters controlling fluid pathways. In the present study the influence of layer stacking and interlayer friction on fracture density and connectivity within a folded sequence is systematically investigated using 2D Distinct Element Method modelling. Our numerical approach permits forward modelling of both fracture nucleation/propagation/arrest and (contemporaneous) frictional slip along bedding planes in a robust and mechanically sound manner. Folding of the multilayer sequence is achieved by enforcing constant curvature folding by means of a velocity boundary condition at the model base, while a constant overburden pressure is maintained at the model top. The modelling reveals that with high bedding plane friction the multilayer stack behaves mechanically as a single layer so that the neutral surface develops in centre of the sequence and fracture spacing is controlled by the total thickness of the folded sequence. In contrast, low bedding plane friction leads to decoupling of the individual layers (flexural slip folding) so that a neutral surface develops in the centre of each layer and fracture spacing is controlled by the thickness of the individual layers. The low interfacial friction models illustrate that stepping of fractures across bedding planes is a common process, which can however have two contrasting origins: The mechanical properties of the interface cause fracture stepping during fracture propagation. Originally through-going fractures are later offset by interfacial slip during folding. A combination of these two different origins may lead to (apparently) inconsistent fracture offsets across bedding planes within a flexural slip fold.

  20. Analysis of transposable elements in the genome of Asparagus officinalis from high coverage sequence data.

    Science.gov (United States)

    Li, Shu-Fen; Gao, Wu-Jun; Zhao, Xin-Peng; Dong, Tian-Yu; Deng, Chuan-Liang; Lu, Long-Dou

    2014-01-01

    Asparagus officinalis is an economically and nutritionally important vegetable crop that is widely cultivated and is used as a model dioecious species to study plant sex determination and sex chromosome evolution. To improve our understanding of its genome composition, especially with respect to transposable elements (TEs), which make up the majority of the genome, we performed Illumina HiSeq2000 sequencing of both male and female asparagus genomes followed by bioinformatics analysis. We generated 17 Gb of sequence (12×coverage) and assembled them into 163,406 scaffolds with a total cumulated length of 400 Mbp, which represent about 30% of asparagus genome. Overall, TEs masked about 53% of the A. officinalis assembly. Majority of the identified TEs belonged to LTR retrotransposons, which constitute about 28% of genomic DNA, with Ty1/copia elements being more diverse and accumulated to higher copy numbers than Ty3/gypsy. Compared with LTR retrotransposons, non-LTR retrotransposons and DNA transposons were relatively rare. In addition, comparison of the abundance of the TE groups between male and female genomes showed that the overall TE composition was highly similar, with only slight differences in the abundance of several TE groups, which is consistent with the relatively recent origin of asparagus sex chromosomes. This study greatly improves our knowledge of the repetitive sequence construction of asparagus, which facilitates the identification of TEs responsible for the early evolution of plant sex chromosomes and is helpful for further studies on this dioecious plant.

  1. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    Science.gov (United States)

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  2. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

    Directory of Open Access Journals (Sweden)

    Kacy L Gordon

    2015-05-01

    Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2 from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.

  3. Transduplication resulted in the incorporation of two protein-coding sequences into the Turmoil-1 transposable element of C. elegans

    Directory of Open Access Journals (Sweden)

    Pupko Tal

    2008-10-01

    Full Text Available Abstract Transposable elements may acquire unrelated gene fragments into their sequences in a process called transduplication. Transduplication of protein-coding genes is common in plants, but is unknown of in animals. Here, we report that the Turmoil-1 transposable element in C. elegans has incorporated two protein-coding sequences into its inverted terminal repeat (ITR sequences. The ITRs of Turmoil-1 contain a conserved RNA recognition motif (RRM that originated from the rsp-2 gene and a fragment from the protein-coding region of the cpg-3 gene. We further report that an open reading frame specific to C. elegans may have been created as a result of a Turmoil-1 insertion. Mutations at the 5' splice site of this open reading frame may have reactivated the transduplicated RRM motif. Reviewers This article was reviewed by Dan Graur and William Martin. For the full reviews, please go to the Reviewers' Reports section.

  4. De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.

    Science.gov (United States)

    Nowrousian, Minou; Stajich, Jason E; Chu, Meiling; Engh, Ines; Espagne, Eric; Halliday, Karen; Kamerewerd, Jens; Kempken, Frank; Knab, Birgit; Kuo, Hsiao-Che; Osiewacz, Heinz D; Pöggeler, Stefanie; Read, Nick D; Seiler, Stephan; Smith, Kristina M; Zickler, Denise; Kück, Ulrich; Freitag, Michael

    2010-04-08

    Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb) with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data can be used for

  5. De novo assembly of a 40 Mb eukaryotic genome from short sequence reads: Sordaria macrospora, a model organism for fungal morphogenesis.

    Directory of Open Access Journals (Sweden)

    Minou Nowrousian

    2010-04-01

    Full Text Available Filamentous fungi are of great importance in ecology, agriculture, medicine, and biotechnology. Thus, it is not surprising that genomes for more than 100 filamentous fungi have been sequenced, most of them by Sanger sequencing. While next-generation sequencing techniques have revolutionized genome resequencing, e.g. for strain comparisons, genetic mapping, or transcriptome and ChIP analyses, de novo assembly of eukaryotic genomes still presents significant hurdles, because of their large size and stretches of repetitive sequences. Filamentous fungi contain few repetitive regions in their 30-90 Mb genomes and thus are suitable candidates to test de novo genome assembly from short sequence reads. Here, we present a high-quality draft sequence of the Sordaria macrospora genome that was obtained by a combination of Illumina/Solexa and Roche/454 sequencing. Paired-end Solexa sequencing of genomic DNA to 85-fold coverage and an additional 10-fold coverage by single-end 454 sequencing resulted in approximately 4 Gb of DNA sequence. Reads were assembled to a 40 Mb draft version (N50 of 117 kb with the Velvet assembler. Comparative analysis with Neurospora genomes increased the N50 to 498 kb. The S. macrospora genome contains even fewer repeat regions than its closest sequenced relative, Neurospora crassa. Comparison with genomes of other fungi showed that S. macrospora, a model organism for morphogenesis and meiosis, harbors duplications of several genes involved in self/nonself-recognition. Furthermore, S. macrospora contains more polyketide biosynthesis genes than N. crassa. Phylogenetic analyses suggest that some of these genes may have been acquired by horizontal gene transfer from a distantly related ascomycete group. Our study shows that, for typical filamentous fungi, de novo assembly of genomes from short sequence reads alone is feasible, that a mixture of Solexa and 454 sequencing substantially improves the assembly, and that the resulting data

  6. Spatially conserved regulatory elements identified within human and mouse Cd247 gene using high-throughput sequencing data from the ENCODE project

    DEFF Research Database (Denmark)

    Pundhir, Sachin; Hannibal, Tine Dahlbæk; Bang-Berthelsen, Claus Heiner

    2014-01-01

    . In this study, we have utilized the wealth of high-throughput sequencing data produced during the Encyclopedia of DNA Elements (ENCODE) project to identify spatially conserved regulatory elements within the Cd247 gene from human and mouse. We show the presence of two transcription factor binding sites...

  7. Defining the Sequence Elements and Candidate Genes for the Coloboma Mutation.

    Directory of Open Access Journals (Sweden)

    Elizabeth A. Robb

    Full Text Available The chicken coloboma mutation exhibits features similar to human congenital developmental malformations such as ocular coloboma, cleft-palate, dwarfism, and polydactyly. The coloboma-associated region and encoded genes were investigated using advanced genomic, genetic, and gene expression technologies. Initially, the mutation was linked to a 990 kb region encoding 11 genes; the application of the genetic and genomic tools led to a reduction of the linked region to 176 kb and the elimination of 7 genes. Furthermore, bioinformatics analyses of capture array-next generation sequence data identified genetic elements including SNPs, insertions, deletions, gaps, chromosomal rearrangements, and miRNA binding sites within the introgressed causative region relative to the reference genome sequence. Coloboma-specific variants within exons, UTRs, and splice sites were studied for their contribution to the mutant phenotype. Our compiled results suggest three genes for future studies. The three candidate genes, SLC30A5 (a zinc transporter, CENPH (a centromere protein, and CDK7 (a cyclin-dependent kinase, are differentially expressed (compared to normal embryos at stages and in tissues affected by the coloboma mutation. Of these genes, two (SLC30A5 and CENPH are considered high-priority candidate based upon studies in other vertebrate model systems.

  8. Determination of k0-factors of short-lived nuclides and application of k0-NAA to selected trace elements

    International Nuclear Information System (INIS)

    Acharya, R.; Holzbecher, J.; Chatt, A.

    2012-01-01

    As part of the standardization program of k 0 -based NAA (k 0 -NAA) methods at the Dalhousie University SLOWPOKE-2 reactor (DUSR) facility, the k 0 -factors of 15 analytically important short-lived nuclides (half-life 197 Au). The elemental standards used were prepared mostly from their primary standard solutions. The samples were irradiated in both inner and outer pneumatic sites of the DUSR facility and counted using an HPGe-detector coupled to an ORTEC’s digital gamma-ray spectrometer. The k 0 -factors determined using both inner and outer irradiation sites were found to be within ±5% with respect to either recommended or literature values in most cases. The Z-score values at 95% confidence level were found to be in the range of ±0.03–1.6. The k 0 -NAA method was applied to three different NIST standard reference materials (SRMs) and concentrations of six elements, namely Ag, F, Hf, Rb, Sc, and Se were determined using their short-lived nuclides. The concentrations of these elements were also determined by relative NAA method for comparison purposes.

  9. Behaviour of short-lived iodines in operating UO2 fuel elements

    International Nuclear Information System (INIS)

    Lipsett, J.J.; Hastings, I.J.; Hunt, C.E.L.

    1984-11-01

    Sweep gas experiments have been done to determine the behaviour of short-lived fission products within operating UO 2 fuel elements at linear powers of 45, 54, and 60 KW/m, and to burnups of 70, 80, and 50 MWh/kgU respectively. Although radioiodine transport was not observed directly during normal operation, equilibrium gap inventories for I-131 were deduced from the shutdown decay behaviour of the fission gases. These inventories were a strong function of fuel power and ranged from 10 GBq (0.27 Ci) to 100 GBq (2.7 Ci) over the range tested. We conclude that the iodine inventory was adsorbed onto the fuel and/or sheath surfaces with a volatile fraction of less than 10 -2 and a charcoal-filter-penetrating fraction of less than 2x10 -4

  10. Short interspersed element (SINE) depletion and long interspersed element (LINE) abundance are not features universally required for imprinting.

    Science.gov (United States)

    Cowley, Michael; de Burca, Anna; McCole, Ruth B; Chahal, Mandeep; Saadat, Ghazal; Oakey, Rebecca J; Schulz, Reiner

    2011-04-20

    Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.

  11. Short interspersed element (SINE depletion and long interspersed element (LINE abundance are not features universally required for imprinting.

    Directory of Open Access Journals (Sweden)

    Michael Cowley

    2011-04-01

    Full Text Available Genomic imprinting is a form of gene dosage regulation in which a gene is expressed from only one of the alleles, in a manner dependent on the parent of origin. The mechanisms governing imprinted gene expression have been investigated in detail and have greatly contributed to our understanding of genome regulation in general. Both DNA sequence features, such as CpG islands, and epigenetic features, such as DNA methylation and non-coding RNAs, play important roles in achieving imprinted expression. However, the relative importance of these factors varies depending on the locus in question. Defining the minimal features that are absolutely required for imprinting would help us to understand how imprinting has evolved mechanistically. Imprinted retrogenes are a subset of imprinted loci that are relatively simple in their genomic organisation, being distinct from large imprinting clusters, and have the potential to be used as tools to address this question. Here, we compare the repeat element content of imprinted retrogene loci with non-imprinted controls that have a similar locus organisation. We observe no significant differences that are conserved between mouse and human, suggesting that the paucity of SINEs and relative abundance of LINEs at imprinted loci reported by others is not a sequence feature universally required for imprinting.

  12. Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

    Science.gov (United States)

    Oggioni, M R; Claverys, J P

    1999-10-01

    A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.

  13. Discrete meso-element simulation of the failure behavior of short-fiber composites under dynamic loading

    International Nuclear Information System (INIS)

    Liu Wenyan; Tang, Z.P.; Liu Yunxin

    2000-01-01

    In recent years, more attention has been paid to a better understanding of the failure behavior and mechanism of heterogeneous materials at the meso-scale level. In this paper, the crack initiation and development in epoxy composites reinforced with short steel fibers under dynamic loading were simulated and analyzed with the 2D Discrete Meso-Element Dynamic Method. Results show that the damage process depends greatly on the binding property between matrix and fibers

  14. Identification of ISMyo2, a novel insertion sequence element of IS21 family and its diagnostic potential for detection of Mycobacterium yongonense.

    Science.gov (United States)

    Kim, Byoung-Jun; Kim, Kijeong; Kim, Bo-Ram; Kook, Yoon-Hoh; Kim, Bum-Joon

    2015-10-15

    Mycobacterium yongonense, as a novel member of the M. avium complex (MAC), was recently reported to be isolated from human specimens in South Korea and Italy. Due to its close relatedness to other MAC members, particularly M. intracellulare in taxonomic aspects, the development of a novel diagnostic method for its specific detection is necessary for clinical or epidemiologic purposes. Using the Mycobacterium yongonense genome information, we have identified a novel IS-element, ISMyo2. Targeting the ISMyo2 sequence, we developed a real-time PCR method and applied the technique to Mycobacterial genomic DNA. To identify proper nucleic acid targets for the diagnosis, comparisons of all insertion sequence (IS) elements of 3 M. intracellulare and 3 M. yongonense strains, whose complete genome sequences we reported recently, led to the selection of a novel target gene, the M. yongonense-specific IS element, ISMyo2 (2,387 bp), belonging to the IS21 family. Next, we developed a real-time PCR method using SYBR green I for M. yongonense-specific detection targeting ISMyo2, producing a 338-bp amplicon. When this assay was applied to 28 Mycobacterium reference strains and 63 MAC clinical isolates, it produced amplicons in only the 6 M. yongonense strains, showing a sensitivity of 100 fg of genomic DNA, suggesting its feasibility as a diagnostic method for M. yongonense strains. We identified a novel ISMyo2 IS element belonging to the IS21 family specific to M. yongonense strains via genome analysis, and a real-time PCR method based on its sequences was developed.

  15. Compression and radiation of high-power short rf pulses. II. A novel antenna array design with combined compressor/radiator elements

    KAUST Repository

    Sirenko, Kostyantyn

    2011-01-01

    The paper discusses the radiation of compressed high power short RF pulses using two different types of antennas: (i) A simple monopole antenna and (ii) a novel array design, where each of the elements is constructed by combining a compressor and a radiator. The studies on the monopole antenna demonstrate the possibility of a high power short RF pulse\\'s efficient radiation even using simple antennas. The studies on the novel array design demonstrate that a reduced size array with lower pulse distortion and power decay can be constructed by assembling the array from elements each of which integrates a compressor and a radiator. This design idea can be used with any type of antenna array; in this work it is applied to a phased array.

  16. Representation of individual elements of a complex call sequence in primary auditory cortex

    Directory of Open Access Journals (Sweden)

    Mark Nelson Wallace

    2013-10-01

    Full Text Available Conspecific communication calls can be rhythmic or contain extended, discontinuous series of either constant or frequency modulated harmonic tones and noise bursts separated by brief periods of silence. In the guinea pig, rhythmic calls can produce isomorphic responses within the primary auditory cortex (AI where single units respond to every call element. Other calls such as the chutter comprise a series of short irregular syllables that vary in their spectral content and are more like human speech. These calls can also evoke isomorphic responses, but may only do so in fields in the auditory belt and not in AI. Here we present evidence that cells in AI treat the individual elements within a syllable as separate auditory objects and respond selectively to one or a subset of them. We used a single chutter exemplar to compare single/multi-unit responses in the low-frequency portion of AI - AI(LF and the low-frequency part of the thalamic medial geniculate body - MGB(LF in urethane anaesthetised guinea pigs. Both thalamic and cortical cells responded with brief increases in firing rate to one, or more, of the 8 main elements present in the chutter call. Almost none of the units responded to all 8 elements. While there were many different combinations of responses to between one and five of the elements, MBG(LF and AI(LF neurons exhibited the same specific types of response combinations. Nearby units in the upper layers of the cortex tended to respond to similar combinations of elements while the deep layers were less responsive. Thus the responses from a number of AI units would need to be combined in order to represent the entire chutter call. Our results don’t rule out the possibility of constructive convergence but there was no evidence that a convergence of inputs within AI led to a complete representation of all eight elements.

  17. Draft Genome Sequence of Lactobacillus delbrueckii Strain #22 Isolated from a Patient with Short Bowel Syndrome and Previous d-Lactic Acidosis and Encephalopathy.

    Science.gov (United States)

    Domann, Eugen; Fischer, Florence; Glowatzki, Fabian; Fritzenwanker, Moritz; Hain, Torsten; Zechel-Gran, Silke; Giffhorn-Katz, Susanne; Neubauer, Bernd A

    2016-07-28

    d-Lactic acidosis with associated encephalopathy caused by overgrowth of intestinal lactic acid bacteria is a rarely diagnosed neurological complication of patients with short bowel syndrome. Here, we report the draft genome sequence of Lactobacillus delbrueckii strain #22 isolated from a patient with short bowel syndrome and previous d-lactic acidosis/encephalopathy. Copyright © 2016 Domann et al.

  18. Musicians' and nonmusicians' short-term memory for verbal and musical sequences: comparing phonological similarity and pitch proximity.

    Science.gov (United States)

    Williamson, Victoria J; Baddeley, Alan D; Hitch, Graham J

    2010-03-01

    Language-music comparative studies have highlighted the potential for shared resources or neural overlap in auditory short-term memory. However, there is a lack of behavioral methodologies for comparing verbal and musical serial recall. We developed a visual grid response that allowed both musicians and nonmusicians to perform serial recall of letter and tone sequences. The new method was used to compare the phonological similarity effect with the impact of an operationalized musical equivalent-pitch proximity. Over the course of three experiments, we found that short-term memory for tones had several similarities to verbal memory, including limited capacity and a significant effect of pitch proximity in nonmusicians. Despite being vulnerable to phonological similarity when recalling letters, however, musicians showed no effect of pitch proximity, a result that we suggest might reflect strategy differences. Overall, the findings support a limited degree of correspondence in the way that verbal and musical sounds are processed in auditory short-term memory.

  19. Exploiting BAC-end sequences for the mining, characterization and utility of new short sequences repeat (SSR) markers in Citrus.

    Science.gov (United States)

    Biswas, Manosh Kumar; Chai, Lijun; Mayer, Christoph; Xu, Qiang; Guo, Wenwu; Deng, Xiuxin

    2012-05-01

    The aim of this study was to develop a large set of microsatellite markers based on publicly available BAC-end sequences (BESs), and to evaluate their transferability, discriminating capacity of genotypes and mapping ability in Citrus. A set of 1,281 simple sequence repeat (SSR) markers were developed from the 46,339 Citrus clementina BAC-end sequences (BES), of them 20.67% contained SSR longer than 20 bp, corresponding to roughly one perfect SSR per 2.04 kb. The most abundant motifs were di-nucleotide (16.82%) repeats. Among all repeat motifs (TA/AT)n is the most abundant (8.38%), followed by (AG/CT)n (4.51%). Most of the BES-SSR are located in the non-coding region, but 1.3% of BES-SSRs were found to be associated with transposable element (TE). A total of 400 novel SSR primer pairs were synthesized and their transferability and polymorphism tested on a set of 16 Citrus and Citrus relative's species. Among these 333 (83.25%) were successfully amplified and 260 (65.00%) showed cross-species transferability with Poncirus trifoliata and Fortunella sp. These cross-species transferable markers could be useful for cultivar identification, for genomic study of Citrus, Poncirus and Fortunella sp. Utility of the developed SSR marker was demonstrated by identifying a set of 118 markers each for construction of linkage map of Citrus reticulata and Poncirus trifoliata. Genetic diversity and phylogenetic relationship among 40 Citrus and its related species were conducted with the aid of 25 randomly selected SSR primer pairs and results revealed that citrus genomic SSRs are superior to genic SSR for genetic diversity and germplasm characterization of Citrus spp.

  20. Sequence Capture and Phylogenetic Utility of Genomic Ultraconserved Elements Obtained from Pinned Insect Specimens.

    Directory of Open Access Journals (Sweden)

    Bonnie B Blaimer

    Full Text Available Obtaining sequence data from historical museum specimens has been a growing research interest, invigorated by next-generation sequencing methods that allow inputs of highly degraded DNA. We applied a target enrichment and next-generation sequencing protocol to generate ultraconserved elements (UCEs from 51 large carpenter bee specimens (genus Xylocopa, representing 25 species with specimen ages ranging from 2-121 years. We measured the correlation between specimen age and DNA yield (pre- and post-library preparation DNA concentration and several UCE sequence capture statistics (raw read count, UCE reads on target, UCE mean contig length and UCE locus count with linear regression models. We performed piecewise regression to test for specific breakpoints in the relationship of specimen age and DNA yield and sequence capture variables. Additionally, we compared UCE data from newer and older specimens of the same species and reconstructed their phylogeny in order to confirm the validity of our data. We recovered 6-972 UCE loci from samples with pre-library DNA concentrations ranging from 0.06-9.8 ng/μL. All investigated DNA yield and sequence capture variables were significantly but only moderately negatively correlated with specimen age. Specimens of age 20 years or less had significantly higher pre- and post-library concentrations, UCE contig lengths, and locus counts compared to specimens older than 20 years. We found breakpoints in our data indicating a decrease of the initial detrimental effect of specimen age on pre- and post-library DNA concentration and UCE contig length starting around 21-39 years after preservation. Our phylogenetic results confirmed the integrity of our data, giving preliminary insights into relationships within Xylocopa. We consider the effect of additional factors not measured in this study on our age-related sequence capture results, such as DNA fragmentation and preservation method, and discuss the promise of the UCE

  1. [Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

    Science.gov (United States)

    Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

    2015-04-01

    This study was aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that the CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group with those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences had the same group with corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to homologize with part sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.

  2. Survey sequencing and comparative analysis of the elephant shark (Callorhinchus milii genome.

    Directory of Open Access Journals (Sweden)

    Byrappa Venkatesh

    2007-04-01

    Full Text Available Owing to their phylogenetic position, cartilaginous fishes (sharks, rays, skates, and chimaeras provide a critical reference for our understanding of vertebrate genome evolution. The relatively small genome of the elephant shark, Callorhinchus milii, a chimaera, makes it an attractive model cartilaginous fish genome for whole-genome sequencing and comparative analysis. Here, the authors describe survey sequencing (1.4x coverage and comparative analysis of the elephant shark genome, one of the first cartilaginous fish genomes to be sequenced to this depth. Repetitive sequences, represented mainly by a novel family of short interspersed element-like and long interspersed element-like sequences, account for about 28% of the elephant shark genome. Fragments of approximately 15,000 elephant shark genes reveal specific examples of genes that have been lost differentially during the evolution of tetrapod and teleost fish lineages. Interestingly, the degree of conserved synteny and conserved sequences between the human and elephant shark genomes are higher than that between human and teleost fish genomes. Elephant shark contains putative four Hox clusters indicating that, unlike teleost fish genomes, the elephant shark genome has not experienced an additional whole-genome duplication. These findings underscore the importance of the elephant shark as a critical reference vertebrate genome for comparative analysis of the human and other vertebrate genomes. This study also demonstrates that a survey-sequencing approach can be applied productively for comparative analysis of distantly related vertebrate genomes.

  3. Complete Genome Sequence of Sporisorium scitamineum and Biotrophic Interaction Transcriptome with Sugarcane.

    Directory of Open Access Journals (Sweden)

    Lucas M Taniguti

    Full Text Available Sporisorium scitamineum is a biotrophic fungus responsible for the sugarcane smut, a worldwide spread disease. This study provides the complete sequence of individual chromosomes of S. scitamineum from telomere to telomere achieved by a combination of PacBio long reads and Illumina short reads sequence data, as well as a draft sequence of a second fungal strain. Comparative analysis to previous available sequences of another strain detected few polymorphisms among the three genomes. The novel complete sequence described herein allowed us to identify and annotate extended subtelomeric regions, repetitive elements and the mitochondrial DNA sequence. The genome comprises 19,979,571 bases, 6,677 genes encoding proteins, 111 tRNAs and 3 assembled copies of rDNA, out of our estimated number of copies as 130. Chromosomal reorganizations were detected when comparing to sequences of S. reilianum, the closest smut relative, potentially influenced by repeats of transposable elements. Repetitive elements may have also directed the linkage of the two mating-type loci. The fungal transcriptome profiling from in vitro and from interaction with sugarcane at two time points (early infection and whip emergence revealed that 13.5% of the genes were differentially expressed in planta and particular to each developmental stage. Among them are plant cell wall degrading enzymes, proteases, lipases, chitin modification and lignin degradation enzymes, sugar transporters and transcriptional factors. The fungus also modulates transcription of genes related to surviving against reactive oxygen species and other toxic metabolites produced by the plant. Previously described effectors in smut/plant interactions were detected but some new candidates are proposed. Ten genomic islands harboring some of the candidate genes unique to S. scitamineum were expressed only in planta. RNAseq data was also used to reassure gene predictions.

  4. Quantitative analysis of polycomb response elements (PREs at identical genomic locations distinguishes contributions of PRE sequence and genomic environment

    Directory of Open Access Journals (Sweden)

    Okulski Helena

    2011-03-01

    Full Text Available Abstract Background Polycomb/Trithorax response elements (PREs are cis-regulatory elements essential for the regulation of several hundred developmentally important genes. However, the precise sequence requirements for PRE function are not fully understood, and it is also unclear whether these elements all function in a similar manner. Drosophila PRE reporter assays typically rely on random integration by P-element insertion, but PREs are extremely sensitive to genomic position. Results We adapted the ΦC31 site-specific integration tool to enable systematic quantitative comparison of PREs and sequence variants at identical genomic locations. In this adaptation, a miniwhite (mw reporter in combination with eye-pigment analysis gives a quantitative readout of PRE function. We compared the Hox PRE Frontabdominal-7 (Fab-7 with a PRE from the vestigial (vg gene at four landing sites. The analysis revealed that the Fab-7 and vg PREs have fundamentally different properties, both in terms of their interaction with the genomic environment at each site and their inherent silencing abilities. Furthermore, we used the ΦC31 tool to examine the effect of deletions and mutations in the vg PRE, identifying a 106 bp region containing a previously predicted motif (GTGT that is essential for silencing. Conclusions This analysis showed that different PREs have quantifiably different properties, and that changes in as few as four base pairs have profound effects on PRE function, thus illustrating the power and sensitivity of ΦC31 site-specific integration as a tool for the rapid and quantitative dissection of elements of PRE design.

  5. IS1111 insertion sequences of Coxiella burnetii: characterization and use for repetitive element PCR-based differentiation of Coxiella burnetii isolates

    Directory of Open Access Journals (Sweden)

    Massung Robert F

    2007-10-01

    Full Text Available Abstract Background Coxiella burnetii contains the IS1111 transposase which is present 20 times in the Nine Mile phase I (9Mi/I genome. A single PCR primer that binds to each IS element, and primers specific to a region ~500-bp upstream of each of the 20 IS1111 elements were designed. The amplified products were characterized and used to develop a repetitive element PCR genotyping method. Results Isolates Nine Mile phase II, Nine Mile RSA 514, Nine Mile Baca, Scottish, Ohio, Australian QD, Henzerling phase I, Henzerling phase II, M44, KAV, PAV, Q238, Q195 and WAV were tested by PCR and compared to 9Mi/I. Sequencing was used to determine the exact differences in isolates which lacked specific IS elements or produced PCR products of differing size. From this data, an algorithm was created utilizing four primer pairs that allows for differentiation of unknown isolates into five genomic groups. Additional isolates (Priscilla Q177, Idaho Q, Qiyi, Poker Cat, Q229 and Q172 and nine veterinary samples were characterized using the algorithm which resulted in their placement into three distinct genomic groups. Conclusion Through this study significant differences, including missing elements and sequence alterations within and near IS element coding regions, were found between the isolates tested. Further, a method for differentiation of C. burnetii isolates into one of five genomic groups was created. This algorithm may ultimately help to determine the relatedness between known and unknown isolates of C. burnetii.

  6. [Possibilities in the differential diagnosis of brain neoplasms using the long and short time sequences of proton magnetic resonance spectroscopy

    NARCIS (Netherlands)

    Gajewicz, W.; Goraj, B.M.

    2004-01-01

    Currently to perform proton magnetic resonance spectroscopy (1H MRS) with single voxel spectroscopy (SVS) technique long and/or short echo time sequences are used in order to provide complementary information. PURPOSE: The aim of the study was to compare the usefulness of STEAM (time echo, TE, 20

  7. [Clustered regularly interspaced short palindromic repeats: structure, function and application--a review].

    Science.gov (United States)

    Cui, Yujun; Li, Yanjun; Yan, Yanfeng; Yang, Ruifu

    2008-11-01

    CRISPRs (Clustered Regularly Interspaced Short Palindromic Repeats), the basis of spoligotyping technology, can provide prokaryotes with heritable adaptive immunity against phages' invasion. Studies on CRISPR loci and their associated elements, including various CAS (CRISPR-associated) proteins and leader sequences, are still in its infant period. We introduce the brief history', structure, function, bioinformatics research and application of this amazing immunity system in prokaryotic organism for inspiring more scientists to find their interest in this developing topic.

  8. Transposable elements in cancer.

    Science.gov (United States)

    Burns, Kathleen H

    2017-07-01

    Transposable elements give rise to interspersed repeats, sequences that comprise most of our genomes. These mobile DNAs have been historically underappreciated - both because they have been presumed to be unimportant, and because their high copy number and variability pose unique technical challenges. Neither impediment now seems steadfast. Interest in the human mobilome has never been greater, and methods enabling its study are maturing at a fast pace. This Review describes the activity of transposable elements in human cancers, particularly long interspersed element-1 (LINE-1). LINE-1 sequences are self-propagating, protein-coding retrotransposons, and their activity results in somatically acquired insertions in cancer genomes. Altered expression of transposable elements and animation of genomic LINE-1 sequences appear to be hallmarks of cancer, and can be responsible for driving mutations in tumorigenesis.

  9. Efficient and controllable thermal ablation induced by short-pulsed HIFU sequence assisted with perfluorohexane nanodroplets.

    Science.gov (United States)

    Chang, Nan; Lu, Shukuan; Qin, Dui; Xu, Tianqi; Han, Meng; Wang, Supin; Wan, Mingxi

    2018-07-01

    A HIFU sequence with extremely short pulse duration and high pulse repetition frequency can achieve thermal ablation at a low acoustic power using inertial cavitation. Because of its cavitation-dependent property, the therapeutic outcome is unreliable when the treatment zone lacks cavitation nuclei. To overcome this intrinsic limitation, we introduced perfluorocarbon nanodroplets as extra cavitation nuclei into short-pulsed HIFU-mediated thermal ablation. Two types of nanodroplets were used with perfluorohexane (PFH) as the core material coated with bovine serum albumin (BSA) or an anionic fluorosurfactant (FS) to demonstrate the feasibility of this study. The thermal ablation process was recorded by high-speed photography. The inertial cavitation activity during the ablation was revealed by sonoluminescence (SL). The high-speed photography results show that the thermal ablation volume increased by ∼643% and 596% with BSA-PFH and FS-PFH, respectively, than the short-pulsed HIFU alone at an acoustic power of 19.5 W. Using nanodroplets, much larger ablation volumes were created even at a much lower acoustic power. Meanwhile, the treatment time for ablating a desired volume significantly reduced in the presence of nanodroplets. Moreover, by adjusting the treatment time, lesion migration towards the HIFU transducer could also be avoided. The SL results show that the thermal lesion shape was significantly dependent on the inertial cavitation in this short-pulsed HIFU-mediated thermal ablation. The inertial cavitation activity became more predictable by using nanodroplets. Therefore, the introduction of PFH nanodroplets as extra cavitation nuclei made the short-pulsed HIFU thermal ablation more efficient by increasing the ablation volume and speed, and more controllable by reducing the acoustic power and preventing lesion migration. Copyright © 2018. Published by Elsevier B.V.

  10. Using TESS to predict transcription factor binding sites in DNA sequence.

    Science.gov (United States)

    Schug, Jonathan

    2008-03-01

    This unit describes how to use the Transcription Element Search System (TESS). This Web site predicts transcription factor binding sites (TFBS) in DNA sequence using two different kinds of models of sites, strings and positional weight matrices. The binding of transcription factors to DNA is a major part of the control of gene expression. Transcription factors exhibit sequence-specific binding; they form stronger bonds to some DNA sequences than to others. Identification of a good binding site in the promoter for a gene suggests the possibility that the corresponding factor may play a role in the regulation of that gene. However, the sequences transcription factors recognize are typically short and allow for some amount of mismatch. Because of this, binding sites for a factor can typically be found at random every few hundred to a thousand base pairs. TESS has features to help sort through and evaluate the significance of predicted sites.

  11. Compression and radiation of high-power short rf pulses. II. A novel antenna array design with combined compressor/radiator elements

    KAUST Repository

    Sirenko, Kostyantyn; Pazynin, Vadim L.; Sirenko, Yu K.; Bagci, Hakan

    2011-01-01

    The paper discusses the radiation of compressed high power short RF pulses using two different types of antennas: (i) A simple monopole antenna and (ii) a novel array design, where each of the elements is constructed by combining a compressor and a

  12. Heads or tails: L1 insertion-associated 5' homopolymeric sequences

    Directory of Open Access Journals (Sweden)

    Meyer Thomas J

    2010-02-01

    Full Text Available Abstract Background L1s are one of the most successful autonomous mobile elements in primate genomes. These elements comprise as much as 17% of primate genomes with the majority of insertions occurring via target primed reverse transcription (TPRT. Twin priming, a variant of TPRT, can result in unusual DNA sequence architecture. These insertions appear to be inverted, truncated L1s flanked by target site duplications. Results We report on loci with sequence architecture consistent with variants of the twin priming mechanism and introduce dual priming, a mechanism that could generate similar sequence characteristics. These insertions take the form of truncated L1s with hallmarks of classical TPRT insertions but having a poly(T simple repeat at the 5' end of the insertion. We identified loci using computational analyses of the human, chimpanzee, orangutan, rhesus macaque and marmoset genomes. Insertion site characteristics for all putative loci were experimentally verified. Conclusions The 39 loci that passed our computational and experimental screens probably represent inversion-deletion events which resulted in a 5' inverted poly(A tail. Based on our observations of these loci and their local sequence properties, we conclude that they most probably represent twin priming events with unusually short non-inverted portions. We postulate that dual priming could, theoretically, produce the same patterns. The resulting homopolymeric stretches associated with these insertion events may promote genomic instability and create potential target sites for future retrotransposition events.

  13. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    Science.gov (United States)

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  14. Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

    Science.gov (United States)

    Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

    2003-09-01

    Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.

  15. Survey of clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) systems in multiple sequenced strains of Klebsiella pneumoniae.

    Science.gov (United States)

    Ostria-Hernández, Martha Lorena; Sánchez-Vallejo, Carlos Javier; Ibarra, J Antonio; Castro-Escarpulli, Graciela

    2015-08-04

    In recent years the emergence of multidrug resistant Klebsiella pneumoniae strains has been an increasingly common event. This opportunistic species is one of the five main bacterial pathogens that cause hospital infections worldwide and multidrug resistance has been associated with the presence of high molecular weight plasmids. Plasmids are generally acquired through horizontal transfer and therefore is possible that systems that prevent the entry of foreign genetic material are inactive or absent. One of these systems is CRISPR/Cas. However, little is known regarding the clustered regularly interspaced short palindromic repeats and their associated Cas proteins (CRISPR/Cas) system in K. pneumoniae. The adaptive immune system CRISPR/Cas has been shown to limit the entry of foreign genetic elements into bacterial organisms and in some bacteria it has been shown to be involved in regulation of virulence genes. Thus in this work we used bioinformatics tools to determine the presence or absence of CRISPR/Cas systems in available K. pneumoniae genomes. The complete CRISPR/Cas system was identified in two out of the eight complete K. pneumoniae genomes sequences and in four out of the 44 available draft genomes sequences. The cas genes in these strains comprises eight cas genes similar to those found in Escherichia coli, suggesting they belong to the type I-E group, although their arrangement is slightly different. As for the CRISPR sequences, the average lengths of the direct repeats and spacers were 29 and 33 bp, respectively. BLAST searches demonstrated that 38 of the 116 spacer sequences (33%) are significantly similar to either plasmid, phage or genome sequences, while the remaining 78 sequences (67%) showed no significant similarity to other sequences. The region where the CRISPR/Cas systems were located is the same in all the Klebsiella genomes containing it, it has a syntenic architecture, and is located among genes encoding for proteins likely involved in

  16. Deep sequencing of cardiac microRNA-mRNA interactomes in clinical and experimental cardiomyopathy.

    Science.gov (United States)

    Matkovich, Scot J; Dorn, Gerald W

    2015-01-01

    MicroRNAs are a family of short (~21 nucleotide) noncoding RNAs that serve key roles in cellular growth and differentiation and the response of the heart to stress stimuli. As the sequence-specific recognition element of RNA-induced silencing complexes (RISCs), microRNAs bind mRNAs and prevent their translation via mechanisms that may include transcript degradation and/or prevention of ribosome binding. Short microRNA sequences and the ability of microRNAs to bind to mRNA sites having only partial/imperfect sequence complementarity complicate purely computational analyses of microRNA-mRNA interactomes. Furthermore, computational microRNA target prediction programs typically ignore biological context, and therefore the principal determinants of microRNA-mRNA binding: the presence and quantity of each. To address these deficiencies we describe an empirical method, developed via studies of stressed and failing hearts, to determine disease-induced changes in microRNAs, mRNAs, and the mRNAs targeted to the RISC, without cross-linking mRNAs to RISC proteins. Deep sequencing methods are used to determine RNA abundances, delivering unbiased, quantitative RNA data limited only by their annotation in the genome of interest. We describe the laboratory bench steps required to perform these experiments, experimental design strategies to achieve an appropriate number of sequencing reads per biological replicate, and computer-based processing tools and procedures to convert large raw sequencing data files into gene expression measures useful for differential expression analyses.

  17. Silencing of the PiAvr3a effector-encoding gene from Phytophthora infestans by transcriptional fusion to a short interspersed element.

    Science.gov (United States)

    Vetukuri, Ramesh R; Tian, Zhendong; Avrova, Anna O; Savenkov, Eugene I; Dixelius, Christina; Whisson, Stephen C

    2011-12-01

    Phytophthora infestans is the notorious oomycete causing late blight of potato and tomato. A large proportion of the P. infestans genome is composed of transposable elements, the activity of which may be controlled by RNA silencing. Accumulation of small RNAs is one of the hallmarks of RNA silencing. Here we demonstrate the presence of small RNAs corresponding to the sequence of a short interspersed retrotransposable element (SINE) suggesting that small RNAs might be involved in silencing of SINEs in P. infestans. This notion was exploited to develop novel tools for gene silencing in P. infestans by engineering transcriptional fusions of the PiAvr3a gene, encoding an RXLR avirulence effector, to the infSINEm retroelement. Transgenic P. infestans lines expressing either 5'-infSINEm::PiAvr3a-3' or 5'-PiAvr3a::SINEm-3' chimeric transcripts initially exhibited partial silencing of PiAvr3a. Over time, PiAvr3a either recovered wild type transcript levels in some lines, or became fully silenced in others. Introduction of an inverted repeat construct was also successful in yielding P. infestans transgenic lines silenced for PiAvr3a. In contrast, constructs expressing antisense or aberrant RNA transcripts failed to initiate silencing of PiAvr3a. Lines exhibiting the most effective silencing of PiAvr3a were either weakly or non-pathogenic on susceptible potato cv. Bintje. This study expands the repertoire of reverse genetics tools available for P. infestans research, and provides insights into a possible mode of variation in effector expression through spread of silencing from adjacent retroelements. Crown Copyright © 2011. Published by Elsevier Ltd. All rights reserved.

  18. Review of even element super-heavy nuclei and search for element 120

    Energy Technology Data Exchange (ETDEWEB)

    Hofmann, S. [GSI Helmholtzzentrum fuer Schwerionenforschung, Darmstadt (Germany); Goethe-Universitaet Frankfurt, Institut fuer Physik, Frankfurt (Germany); Heinz, S.; Mann, R.; Maurer, J.; Barth, W.; Burkhard, H.G.; Dahl, L.; Kindler, B.; Kojouharov, I.; Lang, R.; Lommel, B.; Runke, J.; Scheidenberger, C.; Schoett, H.J.; Tinschert, K. [GSI Helmholtzzentrum fuer Schwerionenforschung, Darmstadt (Germany); Muenzenberg, G. [GSI Helmholtzzentrum fuer Schwerionenforschung, Darmstadt (Germany); Manipal University, Manipal Centre for Natural Sciences, Manipal, Karnataka (India); Antalic, S.; Saro, S. [Comenius University, Department of Nuclear Physics and Biophysics, Bratislava (Slovakia); Eberhardt, K.; Thoerle-Pospiech, P.; Trautmann, N. [Johannes Gutenberg-Universitaet Mainz, Mainz (Germany); Grzywacz, R. [Oak Ridge National Laboratory, Oak Ridge, TN (United States); University of Tennessee, Knoxville, TN (United States); Hamilton, J.H. [Vanderbuilt University, Department of Physics and Astronomy, Nashville, TN (United States); Henderson, R.A.; Kenneally, J.M.; Moody, K.J.; Shaughnessy, D.A.; Stoyer, M.A. [Lawrence Livermore National Laboratory, Livermore, CA (United States); Miernik, K. [Oak Ridge National Laboratory, Oak Ridge, TN (United States); University of Warsaw, Warsaw (Poland); Miller, D. [University of Tennessee, Knoxville, TN (United States); Morita, K. [RIKEN Nishina Center for Accelerator-Based Science, Wako, Saitama (Japan); Nishio, K. [Japan Atomic Energy Agency, Tokai, Ibaraki (Japan); Popeko, A.G.; Yeremin, A.V. [Joint Institute for Nuclear Research, Dubna (Russian Federation); Roberto, J.B.; Rykaczewski, K.P. [Oak Ridge National Laboratory, Oak Ridge, TN (United States); Uusitalo, J. [University of Jyvaeskylae, Department of Physics, Jyvaeskylae (Finland)

    2016-06-15

    The reaction {sup 54}Cr + {sup 248}Cm was investigated at the velocity filter SHIP at GSI, Darmstadt, with the intention to study production and decay properties of isotopes of element 120. Three correlated signals were measured, which occurred within a period of 279ms. The heights of the signals correspond with the expectations for a decay sequence starting with an isotope of element 120. However, a complete decay chain cannot be established, since a signal from the implantation of the evaporation residue cannot be identified unambiguously. Measured properties of the event chain are discussed in detail. The result is compared with theoretical predictions. Previously measured decay properties of even element super-heavy nuclei were compiled in order to find arguments for an assignment from the systematics of experimental data. In the course of this review, a few tentatively assigned data could be corrected. New interpretations are given for results which could not be assigned definitely in previous studies. The discussion revealed that the cross-section for production of element 120 could be high enough so that a successful experiment seems possible with presently available techniques. However, a continuation of the experiment at SHIP for a necessary confirmation of the results obtained in a relatively short irradiation of five weeks is not possible at GSI presently. Therefore, we decided to publish the results of the measurement and of the review as they exist now. In the summary and outlook section we also present concepts for the continuation of research in the field of super-heavy nuclei. (orig.)

  19. Hardware Accelerated Sequence Alignment with Traceback

    Directory of Open Access Journals (Sweden)

    Scott Lloyd

    2009-01-01

    in a timely manner. Known methods to accelerate alignment on reconfigurable hardware only address sequence comparison, limit the sequence length, or exhibit memory and I/O bottlenecks. A space-efficient, global sequence alignment algorithm and architecture is presented that accelerates the forward scan and traceback in hardware without memory and I/O limitations. With 256 processing elements in FPGA technology, a performance gain over 300 times that of a desktop computer is demonstrated on sequence lengths of 16000. For greater performance, the architecture is scalable to more processing elements.

  20. Cardiac cine imaging at 3 Tesla: initial experience with a 32-element body-array coil.

    Science.gov (United States)

    Fenchel, Michael; Deshpande, Vibhas S; Nael, Kambiz; Finn, J Paul; Miller, Stephan; Ruehm, Stefan; Laub, Gerhard

    2006-08-01

    We sought to assess the feasibility of cardiac cine imaging and evaluate image quality at 3 T using a body-array coil with 32 coil elements. Eight healthy volunteers (3 men; median age 29 years) were examined on a 3-T magnetic resonance scanner (Magnetom Trio, Siemens Medical Solutions) using a 32-element phased-array coil (prototype from In vivo Corp.). Gradient-recalled-echo (GRE) cine (GRAPPAx3), GRE cine with tagging lines, steady-state-free-precession (SSFP) cine (GRAPPAx3 and x4), and SSFP cine(TSENSEx4 andx6) images were acquired in short-axis and 4-chamber view. Reference images with identical scan parameters were acquired using the total-imaging-matrix (Tim) coil system with a total of 12 coil elements. Images were assessed by 2 observers in a consensus reading with regard to image quality, noise and presence of artifacts. Furthermore, signal-to-noise values were determined in phantom measurements. In phantom measurements signal-to-noise values were increased by 115-155% for the various cine sequences using the 32-element coil. Scoring of image quality yielded statistically significant increased image quality with the SSFP-GRAPPAx4, SSFP-TSENSEx4, and SSFP-TSENSEx6 sequence using the 32-element coil (P < 0.05). Similarly, scoring of image noise yielded a statistically significant lower noise rating with the SSFP-GRAPPAx4, GRE-GRAPPAx3, SSFP-TSENSEx4, and SSFP-TSENSEx6 sequence using the 32-element coil (P < 0.05). This study shows that cardiac cine imaging at 3 T using a 32-element body-array coil is feasible in healthy volunteers. Using a large number of coil elements with a favorable sensitivity profile supports faster image acquisition, with high diagnostic image quality even for high parallel imaging factors.

  1. Method and apparatus for biological sequence comparison

    Science.gov (United States)

    Marr, T.G.; Chang, W.I.

    1997-12-23

    A method and apparatus are disclosed for comparing biological sequences from a known source of sequences, with a subject (query) sequence. The apparatus takes as input a set of target similarity levels (such as evolutionary distances in units of PAM), and finds all fragments of known sequences that are similar to the subject sequence at each target similarity level, and are long enough to be statistically significant. The invention device filters out fragments from the known sequences that are too short, or have a lower average similarity to the subject sequence than is required by each target similarity level. The subject sequence is then compared only to the remaining known sequences to find the best matches. The filtering member divides the subject sequence into overlapping blocks, each block being sufficiently large to contain a minimum-length alignment from a known sequence. For each block, the filter member compares the block with every possible short fragment in the known sequences and determines a best match for each comparison. The determined set of short fragment best matches for the block provide an upper threshold on alignment values. Regions of a certain length from the known sequences that have a mean alignment value upper threshold greater than a target unit score are concatenated to form a union. The current block is compared to the union and provides an indication of best local alignment with the subject sequence. 5 figs.

  2. Gift from statistical learning: Visual statistical learning enhances memory for sequence elements and impairs memory for items that disrupt regularities.

    Science.gov (United States)

    Otsuka, Sachio; Saiki, Jun

    2016-02-01

    Prior studies have shown that visual statistical learning (VSL) enhances familiarity (a type of memory) of sequences. How do statistical regularities influence the processing of each triplet element and inserted distractors that disrupt the regularity? Given that increased attention to triplets induced by VSL and inhibition of unattended triplets, we predicted that VSL would promote memory for each triplet constituent, and degrade memory for inserted stimuli. Across the first two experiments, we found that objects from structured sequences were more likely to be remembered than objects from random sequences, and that letters (Experiment 1) or objects (Experiment 2) inserted into structured sequences were less likely to be remembered than those inserted into random sequences. In the subsequent two experiments, we examined an alternative account for our results, whereby the difference in memory for inserted items between structured and random conditions is due to individuation of items within random sequences. Our findings replicated even when control letters (Experiment 3A) or objects (Experiment 3B) were presented before or after, rather than inserted into, random sequences. Our findings suggest that statistical learning enhances memory for each item in a regular set and impairs memory for items that disrupt the regularity. Copyright © 2015 Elsevier B.V. All rights reserved.

  3. In silico Analysis of 3′-End-Processing Signals in Aspergillus oryzae Using Expressed Sequence Tags and Genomic Sequencing Data

    Science.gov (United States)

    Tanaka, Mizuki; Sakai, Yoshifumi; Yamada, Osamu; Shintani, Takahiro; Gomi, Katsuya

    2011-01-01

    To investigate 3′-end-processing signals in Aspergillus oryzae, we created a nucleotide sequence data set of the 3′-untranslated region (3′ UTR) plus 100 nucleotides (nt) sequence downstream of the poly(A) site using A. oryzae expressed sequence tags and genomic sequencing data. This data set comprised 1065 sequences derived from 1042 unique genes. The average 3′ UTR length in A. oryzae was 241 nt, which is greater than that in yeast but similar to that in plants. The 3′ UTR and 100 nt sequence downstream of the poly(A) site is notably U-rich, while the region located 15–30 nt upstream of the poly(A) site is markedly A-rich. The most frequently found hexanucleotide in this A-rich region is AAUGAA, although this sequence accounts for only 6% of all transcripts. These data suggested that A. oryzae has no highly conserved sequence element equivalent to AAUAAA, a mammalian polyadenylation signal. We identified that putative 3′-end-processing signals in A. oryzae, while less well conserved than those in mammals, comprised four sequence elements: the furthest upstream U-rich element, A-rich sequence, cleavage site, and downstream U-rich element flanking the cleavage site. Although these putative 3′-end-processing signals are similar to those in yeast and plants, some notable differences exist between them. PMID:21586533

  4. [Topographic mapping of retinal function with a scanning laser ophthalmoscope and multifocal electroretinography using short M-sequences].

    Science.gov (United States)

    Rudolph, G; Bechmann, M; Berninger, T; Kutschbach, E; Held, U; Tornow, R P; Kalpadakis, P; Zol'nikova, I V; Shamshinova, A M

    2001-01-01

    A new method of multifocal electroretinography making use of scanning laser ophthalmoscope with a wavelength of 630 nm (SLO-m-ERG), evoking short spatial visual stimuli on the retina, is proposed. Algorithm of presenting the visual stimuli and analysis of distribution of local electroretinograms on the surface of the retina is based on short m-sequences. Mathematical cross correlation analysis shows a three-dimensional distribution of bioelectrical activity of the retina in the central visual field. In normal subjects the cone bioelectrical activity is the maximum in the macular area (corresponding to the density of cone distribution) and absent in the blind spot. The method detects the slightest pathological changes in the retina under control of the site of stimulation and ophthalmoscopic picture of the fundus oculi. The site of the pathological process correlates with the topography of changes in bioelectrical activity of the examined retinal area in diseases of the macular area and pigmented retinitis detectable by ophthalmoscopy.

  5. Spike-Based Bayesian-Hebbian Learning of Temporal Sequences.

    Directory of Open Access Journals (Sweden)

    Philip J Tully

    2016-05-01

    Full Text Available Many cognitive and motor functions are enabled by the temporal representation and processing of stimuli, but it remains an open issue how neocortical microcircuits can reliably encode and replay such sequences of information. To better understand this, a modular attractor memory network is proposed in which meta-stable sequential attractor transitions are learned through changes to synaptic weights and intrinsic excitabilities via the spike-based Bayesian Confidence Propagation Neural Network (BCPNN learning rule. We find that the formation of distributed memories, embodied by increased periods of firing in pools of excitatory neurons, together with asymmetrical associations between these distinct network states, can be acquired through plasticity. The model's feasibility is demonstrated using simulations of adaptive exponential integrate-and-fire model neurons (AdEx. We show that the learning and speed of sequence replay depends on a confluence of biophysically relevant parameters including stimulus duration, level of background noise, ratio of synaptic currents, and strengths of short-term depression and adaptation. Moreover, sequence elements are shown to flexibly participate multiple times in the sequence, suggesting that spiking attractor networks of this type can support an efficient combinatorial code. The model provides a principled approach towards understanding how multiple interacting plasticity mechanisms can coordinate hetero-associative learning in unison.

  6. Function and Regulation of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR / CRISPR Associated (Cas Systems

    Directory of Open Access Journals (Sweden)

    Peter C. Fineran

    2012-10-01

    Full Text Available Phages are the most abundant biological entities on earth and pose a constant challenge to their bacterial hosts. Thus, bacteria have evolved numerous ‘innate’ mechanisms of defense against phage, such as abortive infection or restriction/modification systems. In contrast, the clustered regularly interspaced short palindromic repeats (CRISPR systems provide acquired, yet heritable, sequence-specific ‘adaptive’ immunity against phage and other horizontally-acquired elements, such as plasmids. Resistance is acquired following viral infection or plasmid uptake when a short sequence of the foreign genome is added to the CRISPR array. CRISPRs are then transcribed and processed, generally by CRISPR associated (Cas proteins, into short interfering RNAs (crRNAs, which form part of a ribonucleoprotein complex. This complex guides the crRNA to the complementary invading nucleic acid and targets this for degradation. Recently, there have been rapid advances in our understanding of CRISPR/Cas systems. In this review, we will present the current model(s of the molecular events involved in both the acquisition of immunity and interference stages and will also address recent progress in our knowledge of the regulation of CRISPR/Cas systems.

  7. Function and regulation of clustered regularly interspaced short palindromic repeats (CRISPR) / CRISPR associated (Cas) systems.

    Science.gov (United States)

    Richter, Corinna; Chang, James T; Fineran, Peter C

    2012-10-19

    Phages are the most abundant biological entities on earth and pose a constant challenge to their bacterial hosts. Thus, bacteria have evolved numerous 'innate' mechanisms of defense against phage, such as abortive infection or restriction/modification systems. In contrast, the clustered regularly interspaced short palindromic repeats (CRISPR) systems provide acquired, yet heritable, sequence-specific 'adaptive' immunity against phage and other horizontally-acquired elements, such as plasmids. Resistance is acquired following viral infection or plasmid uptake when a short sequence of the foreign genome is added to the CRISPR array. CRISPRs are then transcribed and processed, generally by CRISPR associated (Cas) proteins, into short interfering RNAs (crRNAs), which form part of a ribonucleoprotein complex. This complex guides the crRNA to the complementary invading nucleic acid and targets this for degradation. Recently, there have been rapid advances in our understanding of CRISPR/Cas systems. In this review, we will present the current model(s) of the molecular events involved in both the acquisition of immunity and interference stages and will also address recent progress in our knowledge of the regulation of CRISPR/Cas systems.

  8. The novel as short story

    Directory of Open Access Journals (Sweden)

    Kirk Schlueter

    2013-06-01

    Full Text Available In recent history, the novel has been thought of and defined primarily as a long prose narrative. However, this has not been the case historically, as the original meaning of "novel" was for "a piece of news" or "a short story or novella." Returning to this original definition, I propose a new way of viewing the work known contemporarily as the novel as a collection, or sequence, of united short stories rather than a single indivisible work, with the component short stories or novellas comprising the sequence renamed as "novels." A brief examination of several classic works traditionally considered novels serves to illustrate how this change in definition will affect reading.

  9. The prophage sequences of Lactobacillus plantarum strain WCFS1

    International Nuclear Information System (INIS)

    Ventura, Marco; Canchaya, Carlos; Kleerebezem, Michiel; Vos, Willem M. de; Siezen, Roland J.; Bruessow, Harald

    2003-01-01

    The Lactobacillus plantarum commensal WCFS1 contains four prophage elements in its genome. Lp1 and Lp2 are two about 40-kb-long uninducible prophages that share closely related DNA packaging, head and tail genes defining a second lineage of pac-site Siphoviridae in L. plantarum, distinct from L. plantarum phage phig1e, but related to Bacillus phage SPP1 and Lactococcus phage TP901-1. Northern analysis revealed transcribed prophage genes exclusively near both attachment sites. Comparative genomics identified candidate lysogenic conversion genes (LCG) downstream of the lysis cassette and within the lysogeny module. Notable are genes with sequence similarities to putative LCG from Streptococcus pyogenes prophages and to a Bacillus plasmid. Both prophages harbored tRNA genes. R-Lp3 and R-Lp4 represent short prophage remnants; R-Lp3 abuts Lp2 and displays sequence links to cos-site Siphoviridae

  10. MRI in multiple sclerosis of the spinal cord: evaluation of fast short-tan inversion-recovery and spin-echo sequences

    International Nuclear Information System (INIS)

    Dietemann, J.L.; Thibaut-Menard, A.; Neugroschl, C.; Gillis, C.; Abu Eid, M.; Bogorin, A.; Warter, J.M.; Tranchant, C.

    2000-01-01

    We compared the sensitivity of T2-weighted spin-echo (FSE) and fast short-tau inversion-recovery (fSTIR) sequences in detection of multiple sclerosis of the spinal cord in 100 consecutive patients with clinically confirmed multiple sclerosis (MS); 86 patients underwent also brain MRI. In all, 310 focal lesions were detected on fSTIR and 212 on T2-weighted FSE, spinal cord lesions were seen better on fSTIR images, with a higher contrast between the lesion and the normal spinal cord. In 24 patients in whom cord plaques were shown with both sequences, the cranial study was normal or inconclusive. Assessment of spinal plaques can be particularly important when MRI of the brain is inconclusive, and in there situations fSTIR can be helpful. (orig.)

  11. Whole Genome Sequencing Identifies a Missense Mutation in HES7 Associated with Short Tails in Asian Domestic Cats.

    Science.gov (United States)

    Xu, Xiao; Sun, Xin; Hu, Xue-Song; Zhuang, Yan; Liu, Yue-Chen; Meng, Hao; Miao, Lin; Yu, He; Luo, Shu-Jin

    2016-08-25

    Domestic cats exhibit abundant variations in tail morphology and serve as an excellent model to study the development and evolution of vertebrate tails. Cats with shortened and kinked tails were first recorded in the Malayan archipelago by Charles Darwin in 1868 and remain quite common today in Southeast and East Asia. To elucidate the genetic basis of short tails in Asian cats, we built a pedigree of 13 cats segregating at the trait with a founder from southern China and performed linkage mapping based on whole genome sequencing data from the pedigree. The short-tailed trait was mapped to a 5.6 Mb region of Chr E1, within which the substitution c. 5T > C in the somite segmentation-related gene HES7 was identified as the causal mutation resulting in a missense change (p.V2A). Validation in 245 unrelated cats confirmed the correlation between HES7-c. 5T > C and Chinese short-tailed feral cats as well as the Japanese Bobtail breed, indicating a common genetic basis of the two. In addition, some of our sampled kinked-tailed cats could not be explained by either HES7 or the Manx-related T-box, suggesting at least three independent events in the evolution of domestic cats giving rise to short-tailed traits.

  12. Isotopic and trace element constraints on the genesis of a boninitic sequence in the Thetford Mines ophiolitic complex, Quebec, Canada

    International Nuclear Information System (INIS)

    Olive, V.; Hebert, R.; Loubet, M.

    1997-01-01

    The Mont Ham Massif (part of the Thetford Mines ophiolite, south Quebec) represents a magmatic sequence made up of tholeiitic and boninitic derived products. A geochemical study confirms the multicomponent mixing models that have been classically advanced for the source of boninites, with slab-derived components added to the main refractory harzburgitic peridotite. An isochron diagram of the boninitic rocks is interpreted as a mixing trend between two components: (i) a light rare earth element (LREE) enriched component (A), interpreted as slab-derived fluid-melts equilibrated with sedimentary materials (ε Nd = -3, 147 Sm/ 144 Nd = 0.140), and (ii) a LREE-depleted component (B) (0.21 147 Sm/ 144 Nd Nd = 9). A multicomponent source is also necessary to explain the Nd-isotope and trace element composition of the tholeiites, which are explained by the melting of a more fertile, Iherzolitic mantle and (or) mid-ocean ridge basalt source (component C), characterized by a large-ion lithophile element depicted pattern and an lapetus mantle Nd isotopic composition (ε Nd = 9), mixed in adequate proportions with the two previously infered slab-derived components (A and B). The genesis of the boninites of Mont Ham is not significantly different from those of boninites located in the Pacific. An intraoceanic subduction zone appears to be an appropriate geodynamic environment for the Mont Ham ophiolitic sequence. (author)

  13. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

    Directory of Open Access Journals (Sweden)

    Graner Andreas

    2008-10-01

    Full Text Available Abstract Background Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR index can be generated to map repetitive regions in genomic sequences. Results We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences regions in uncharacterised genomic sequences. The restriction that a particular

  14. An evaluation of Comparative Genome Sequencing (CGS by comparing two previously-sequenced bacterial genomes

    Directory of Open Access Journals (Sweden)

    Herring Christopher D

    2007-08-01

    Full Text Available Abstract Background With the development of new technology, it has recently become practical to resequence the genome of a bacterium after experimental manipulation. It is critical though to know the accuracy of the technique used, and to establish confidence that all of the mutations were detected. Results In order to evaluate the accuracy of genome resequencing using the microarray-based Comparative Genome Sequencing service provided by Nimblegen Systems Inc., we resequenced the E. coli strain W3110 Kohara using MG1655 as a reference, both of which have been completely sequenced using traditional sequencing methods. CGS detected 7 of 8 small sequence differences, one large deletion, and 9 of 12 IS element insertions present in W3110, but did not detect a large chromosomal inversion. In addition, we confirmed that CGS also detected 2 SNPs, one deletion and 7 IS element insertions that are not present in the genome sequence, which we attribute to changes that occurred after the creation of the W3110 lambda clone library. The false positive rate for SNPs was one per 244 Kb of genome sequence. Conclusion CGS is an effective way to detect multiple mutations present in one bacterium relative to another, and while highly cost-effective, is prone to certain errors. Mutations occurring in repeated sequences or in sequences with a high degree of secondary structure may go undetected. It is also critical to follow up on regions of interest in which SNPs were not called because they often indicate deletions or IS element insertions.

  15. Weak disorder in Fibonacci sequences

    Energy Technology Data Exchange (ETDEWEB)

    Ben-Naim, E [Theoretical Division and Center for Nonlinear Studies, Los Alamos National Laboratory, Los Alamos, NM 87545 (United States); Krapivsky, P L [Department of Physics and Center for Molecular Cybernetics, Boston University, Boston, MA 02215 (United States)

    2006-05-19

    We study how weak disorder affects the growth of the Fibonacci series. We introduce a family of stochastic sequences that grow by the normal Fibonacci recursion with probability 1 - {epsilon}, but follow a different recursion rule with a small probability {epsilon}. We focus on the weak disorder limit and obtain the Lyapunov exponent that characterizes the typical growth of the sequence elements, using perturbation theory. The limiting distribution for the ratio of consecutive sequence elements is obtained as well. A number of variations to the basic Fibonacci recursion including shift, doubling and copying are considered. (letter to the editor)

  16. Design of Long Period Pseudo-Random Sequences from the Addition of -Sequences over

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of -sequences with pairwise-prime linear spans (AMPLS. Using -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to , a signal set is constructed.

  17. Multimodal sequence learning.

    Science.gov (United States)

    Kemény, Ferenc; Meier, Beat

    2016-02-01

    While sequence learning research models complex phenomena, previous studies have mostly focused on unimodal sequences. The goal of the current experiment is to put implicit sequence learning into a multimodal context: to test whether it can operate across different modalities. We used the Task Sequence Learning paradigm to test whether sequence learning varies across modalities, and whether participants are able to learn multimodal sequences. Our results show that implicit sequence learning is very similar regardless of the source modality. However, the presence of correlated task and response sequences was required for learning to take place. The experiment provides new evidence for implicit sequence learning of abstract conceptual representations. In general, the results suggest that correlated sequences are necessary for implicit sequence learning to occur. Moreover, they show that elements from different modalities can be automatically integrated into one unitary multimodal sequence. Copyright © 2015 Elsevier B.V. All rights reserved.

  18. Cyclic AMP regulation of the human glycoprotein hormone α-subunit gene is mediated by an 18-base-pair element

    International Nuclear Information System (INIS)

    Silver, B.J.; Bokar, J.A.; Virgin, J.B.; Vallen, E.A.; Milsted, A.; Nilson, J.H.

    1987-01-01

    cAMP regulates transcription of the gene encoding the α-subunit of human chorionic gonadotropin (hCG) in the choriocarcinoma cells (BeWo). To define the sequences required for regulation by cAMP, the authors inserted fragments from the 5' flanking region of the α-subunit gene into a test vector containing the simian virus 40 early promoter (devoid of its enhancer) linked to the bacterial chloramphenicol acetyltransferase (CAT) gene. Results from transient expression assays in BeWo cells indicated that a 1500-base-pair (bp) fragment conferred cAMP responsiveness on the CAT gene regardless of position or orientation of the insert relative to the viral promoter. A subfragment extending from position -169 to position -100 had the same effect on cAMP-induced expression. Furthermore, the entire stimulatory effect could be achieved with an 18-bp synthetic oligodeoxynucleotide corresponding to a direct repeat between position -146 and -111. In the absence of cAMP, the α-subunit 5' flanking sequence also enhanced transcription from the simian virus 40 early promoter. They localized this enhancer activity to the same -169/-100 fragment containing the cAMP response element. The 18-bp element alone, however, had no effect on basal expression. Thus, this short DNA sequence serves as a cAMP response element and also functions independently of other promoter-regulatory elements located in the 5' flanking sequence of the α-subunit gene

  19. Optically intraconnected computer employing dynamically reconfigurable holographic optical element

    Science.gov (United States)

    Bergman, Larry A. (Inventor)

    1992-01-01

    An optically intraconnected computer and a reconfigurable holographic optical element employed therein. The basic computer comprises a memory for holding a sequence of instructions to be executed; logic for accessing the instructions in sequence; logic for determining for each the instruction the function to be performed and the effective address thereof; a plurality of individual elements on a common support substrate optimized to perform certain logical sequences employed in executing the instructions; and, element selection logic connected to the logic determining the function to be performed for each the instruction for determining the class of each function and for causing the instruction to be executed by those the elements which perform those associated the logical sequences affecting the instruction execution in an optimum manner. In the optically intraconnected version, the element selection logic is adapted for transmitting and switching signals to the elements optically.

  20. Alignment of Short Reads: A Crucial Step for Application of Next-Generation Sequencing Data in Precision Medicine

    Directory of Open Access Journals (Sweden)

    Hao Ye

    2015-11-01

    Full Text Available Precision medicine or personalized medicine has been proposed as a modernized and promising medical strategy. Genetic variants of patients are the key information for implementation of precision medicine. Next-generation sequencing (NGS is an emerging technology for deciphering genetic variants. Alignment of raw reads to a reference genome is one of the key steps in NGS data analysis. Many algorithms have been developed for alignment of short read sequences since 2008. Users have to make a decision on which alignment algorithm to use in their studies. Selection of the right alignment algorithm determines not only the alignment algorithm but also the set of suitable parameters to be used by the algorithm. Understanding these algorithms helps in selecting the appropriate alignment algorithm for different applications in precision medicine. Here, we review current available algorithms and their major strategies such as seed-and-extend and q-gram filter. We also discuss the challenges in current alignment algorithms, including alignment in multiple repeated regions, long reads alignment and alignment facilitated with known genetic variants.

  1. Using nanopore sequencing to get complete genomes from complex samples

    DEFF Research Database (Denmark)

    Kirkegaard, Rasmus Hansen; Karst, Søren Michael; Nielsen, Per Halkjær

    The advantages of “next generation sequencing” has come at the cost of genome finishing. The dominant sequencing technology provides short reads of 150-300 bp, which has made genome assembly very difficult as the reads do not span important repeat regions. Genomes have thus been added...... to the databases as fragmented assemblies and not as finished contigs that resemble the chromosomes in which the DNA is organised within the cells. This is especially troublesome for genomes derived from complex metagenome sequencing. Databases with incomplete genomes can lead to false conclusions about...... the absence of genes and functional predictions of the organisms. Furthermore, it is common that repetitive elements and marker genes such as the 16S rRNA gene are missing completely from these genome bins. Using nanopore long reads, we demonstrate that it is possible to span these regions and make complete...

  2. ACGT-containing abscisic acid response element (ABRE) and coupling element 3 (CE3) are functionally equivalent.

    Science.gov (United States)

    Hobo, T; Asada, M; Kowyama, Y; Hattori, T

    1999-09-01

    ACGT-containing ABA response elements (ABREs) have been functionally identified in the promoters of various genes. In addition, single copies of ABRE have been found to require a cis-acting, coupling element to achieve ABA induction. A coupling element 3 (CE3) sequence, originally identified as such in the barley HVA1 promoter, is found approximately 30 bp downstream of motif A (ACGT-containing ABRE) in the promoter of the Osem gene. The relationship between these two elements was further defined by linker-scan analyses of a 55 bp fragment of the Osem promoter, which is sufficient for ABA-responsiveness and VP1 activation. The analyses revealed that both motif A and CE3 sequence were required not only for ABA-responsiveness but also for VP1 activation. Since the sequences of motif A and CE3 were found to be similar, motif-exchange experiments were carried out. The experiments demonstrated that motif A and CE3 were interchangeable by each other with respect to both ABA and VP1 regulation. In addition, both sequences were shown to be recognized by a VP1-interacting, ABA-responsive bZIP factor TRAB1. These results indicate that ACGT-containing ABREs and CE3 are functionally equivalent cis-acting elements. Furthermore, TRAB1 was shown to bind two other non-ACGT ABREs. Based on these results, all these ABREs including CE3 are proposed to be categorized into a single class of cis-acting elements.

  3. Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

    Science.gov (United States)

    Benko, Sabina; Fantes, Judy A; Amiel, Jeanne; Kleinjan, Dirk-Jan; Thomas, Sophie; Ramsay, Jacqueline; Jamshidi, Negar; Essafi, Abdelkader; Heaney, Simon; Gordon, Christopher T; McBride, David; Golzio, Christelle; Fisher, Malcolm; Perry, Paul; Abadie, Véronique; Ayuso, Carmen; Holder-Espinasse, Muriel; Kilpatrick, Nicky; Lees, Melissa M; Picard, Arnaud; Temple, I Karen; Thomas, Paul; Vazquez, Marie-Paule; Vekemans, Michel; Roest Crollius, Hugues; Hastie, Nicholas D; Munnich, Arnold; Etchevers, Heather C; Pelet, Anna; Farlie, Peter G; Fitzpatrick, David R; Lyonnet, Stanislas

    2009-03-01

    Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.

  4. [Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

    Science.gov (United States)

    Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

    2009-11-01

    Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to genome-widely analyze the CRISPR in extreme halophilic archaea, of which the whole genome sequences are available at present time. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.

  5. TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads.

    Science.gov (United States)

    Novák, Petr; Ávila Robledillo, Laura; Koblížková, Andrea; Vrbová, Iva; Neumann, Pavel; Macas, Jirí

    2017-07-07

    Satellite DNA is one of the major classes of repetitive DNA, characterized by tandemly arranged repeat copies that form contiguous arrays up to megabases in length. This type of genomic organization makes satellite DNA difficult to assemble, which hampers characterization of satellite sequences by computational analysis of genomic contigs. Here, we present tandem repeat analyzer (TAREAN), a novel computational pipeline that circumvents this problem by detecting satellite repeats directly from unassembled short reads. The pipeline first employs graph-based sequence clustering to identify groups of reads that represent repetitive elements. Putative satellite repeats are subsequently detected by the presence of circular structures in their cluster graphs. Consensus sequences of repeat monomers are then reconstructed from the most frequent k-mers obtained by decomposing read sequences from corresponding clusters. The pipeline performance was successfully validated by analyzing low-pass genome sequencing data from five plant species where satellite DNA was previously experimentally characterized. Moreover, novel satellite repeats were predicted for the genome of Vicia faba and three of these repeats were verified by detecting their sequences on metaphase chromosomes using fluorescence in situ hybridization. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. Detecting the (quasi-) two-body decays of /τ leptons in short-baseline neutrino oscillation experiments

    Science.gov (United States)

    Asratyan, A.; Balatz, M.; Boehnlein, D.; Childres, S.; Davidenko, G.; Dolgolenko, A.; Dzyubenko, G.; Kaftanov, V.; Kubantsev, M.; Reay, N. W.; Musser, J.; Rosenfeld, C.; Stanton, N. R.; Thun, R.; Tzanakos, G. S.; Verebryusov, V.; Vishnyakov, V.

    1999-05-01

    Novel detector schemes are proposed for the short-baseline neutrino experiments of next generation, aimed at exploring the large- Δm 2 domain of ν μ→ν τ oscillations in the appearance mode. These schemes emphasize good spectrometry for charged particles and for electromagnetic showers and efficient reconstruction of π0→ γγ decays. The basic elements are a sequence of relatively thin emulsion targets, immersed in magnetic field and interspersed with electronic trackers, and a fine-grained electromagnetic calorimeter built of lead glass. These elements act as an integral whole in reconstructing the electromagnetic showers. This conceptual scheme shows good performance in identifying the τ (quasi-) two-body decays by their characteristic kinematics and in selecting the electronic decays of the τ.

  7. Detecting the (quasi-) two-body decays of τ leptons in short-baseline neutrino oscillation experiments

    International Nuclear Information System (INIS)

    Asratyan, A.; Balatz, M.; Boehnlein, D.; Childres, S.; Davidenko, G.; Dolgolenko, A.; Dzyubenko, G.; Kaftanov, V.; Kubantsev, M.; Reay, N.W.; Musser, J.; Rosenfeld, C.; Stanton, N.R.; Thun, R.; Tzanakos, G.S.; Verebryusov, V.; Vishnyakov, V.

    1999-01-01

    Novel detector schemes are proposed for the short-baseline neutrino experiments of next generation, aimed at exploring the large-Δm 2 domain of ν μ →ν τ oscillations in the appearance mode. These schemes emphasize good spectrometry for charged particles and for electromagnetic showers and efficient reconstruction of π 0 →γγ decays. The basic elements are a sequence of relatively thin emulsion targets, immersed in magnetic field and interspersed with electronic trackers, and a fine-grained electromagnetic calorimeter built of lead glass. These elements act as an integral whole in reconstructing the electromagnetic showers. This conceptual scheme shows good performance in identifying the τ (quasi-) two-body decays by their characteristic kinematics and in selecting the electronic decays of the τ

  8. Insight into microevolution of Yersinia pestis by clustered regularly interspaced short palindromic repeats.

    Directory of Open Access Journals (Sweden)

    Yujun Cui

    Full Text Available BACKGROUND: Yersinia pestis, the pathogen of plague, has greatly influenced human history on a global scale. Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR, an element participating in immunity against phages' invasion, is composed of short repeated sequences separated by unique spacers and provides the basis of the spoligotyping technology. In the present research, three CRISPR loci were analyzed in 125 strains of Y. pestis from 26 natural plague foci of China, the former Soviet Union and Mongolia were analyzed, for validating CRISPR-based genotyping method and better understanding adaptive microevolution of Y. pestis. METHODOLOGY/PRINCIPAL FINDINGS: Using PCR amplification, sequencing and online data processing, a high degree of genetic diversity was revealed in all three CRISPR elements. The distribution of spacers and their arrays in Y. pestis strains is strongly region and focus-specific, allowing the construction of a hypothetic evolutionary model of Y. pestis. This model suggests transmission route of microtus strains that encircled Takla Makan Desert and ZhunGer Basin. Starting from Tadjikistan, one branch passed through the Kunlun Mountains, and moved to the Qinghai-Tibet Plateau. Another branch went north via the Pamirs Plateau, the Tianshan Mountains, the Altai Mountains and the Inner Mongolian Plateau. Other Y. pestis lineages might be originated from certain areas along those routes. CONCLUSIONS/SIGNIFICANCE: CRISPR can provide important information for genotyping and evolutionary research of bacteria, which will help to trace the source of outbreaks. The resulting data will make possible the development of very low cost and high-resolution assays for the systematic typing of any new isolate.

  9. Roles of repetitive sequences

    Energy Technology Data Exchange (ETDEWEB)

    Bell, G.I.

    1991-12-31

    The DNA of higher eukaryotes contains many repetitive sequences. The study of repetitive sequences is important, not only because many have important biological function, but also because they provide information on genome organization, evolution and dynamics. In this paper, I will first discuss some generic effects that repetitive sequences will have upon genome dynamics and evolution. In particular, it will be shown that repetitive sequences foster recombination among, and turnover of, the elements of a genome. I will then consider some examples of repetitive sequences, notably minisatellite sequences and telomere sequences as examples of tandem repeats, without and with respectively known function, and Alu sequences as an example of interspersed repeats. Some other examples will also be considered in less detail.

  10. MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads (Metagenomics Informatics Challenges Workshop: 10K Genomes at a Time)

    Energy Technology Data Exchange (ETDEWEB)

    Sakakibara, Yasumbumi

    2011-10-13

    Keio University's Yasumbumi Sakakibara on "MetaVelvet: An Extension of Velvet Assembler to de novo Metagenome Assembly from Short Sequence Reads" at the Metagenomics Informatics Challenges Workshop held at the DOE JGI on October 12-13, 2011.

  11. Accelerated Evolution of Conserved Noncoding Sequences in theHuman Genome

    Energy Technology Data Exchange (ETDEWEB)

    Prambhakar, Shyam; Noonan, James P.; Paabo, Svante; Rubin, EdwardM.

    2006-07-06

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detect"cryptic" functional elements, which are too weakly conserved amongmammals to distinguish from nonfunctional DNA. To address this problem,we explored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  12. The new element 112. Short note

    International Nuclear Information System (INIS)

    Hofmann, S.; Ninov, V.; Hessberger, F.P.; Armbruster, P.; Folger, H.; Muenzenberg, G.; Schoett, H.J.; Popeko, A.G.; Yeremin, A.V.; Saro, S.; Janik, R.; Leino, M.

    1996-02-01

    The new element 112 was produced and identified unambigiously in an experiment at SHIP, GSI Darmstadt. Two decay chains of the isotope 277 112 were observed in irradiations of 208 Pb targets with 70 Zn projectiles of 344 MeV kinetic energy. The isotope decays by emission of α particles with a half-life of (240 -90 +430 ) μs. Two different α energies of (11.649±20) keV and (11.454±20) keV were measured for the two observed decays. The cross-section measured in three weeks of irradiations is (1.0 -0.4 +18 ) pb. (orig.)

  13. The contribution of alu elements to mutagenic DNA double-strand break repair.

    Science.gov (United States)

    Morales, Maria E; White, Travis B; Streva, Vincent A; DeFreece, Cecily B; Hedges, Dale J; Deininger, Prescott L

    2015-03-01

    Alu elements make up the largest family of human mobile elements, numbering 1.1 million copies and comprising 11% of the human genome. As a consequence of evolution and genetic drift, Alu elements of various sequence divergence exist throughout the human genome. Alu/Alu recombination has been shown to cause approximately 0.5% of new human genetic diseases and contribute to extensive genomic structural variation. To begin understanding the molecular mechanisms leading to these rearrangements in mammalian cells, we constructed Alu/Alu recombination reporter cell lines containing Alu elements ranging in sequence divergence from 0%-30% that allow detection of both Alu/Alu recombination and large non-homologous end joining (NHEJ) deletions that range from 1.0 to 1.9 kb in size. Introduction of as little as 0.7% sequence divergence between Alu elements resulted in a significant reduction in recombination, which indicates even small degrees of sequence divergence reduce the efficiency of homology-directed DNA double-strand break (DSB) repair. Further reduction in recombination was observed in a sequence divergence-dependent manner for diverged Alu/Alu recombination constructs with up to 10% sequence divergence. With greater levels of sequence divergence (15%-30%), we observed a significant increase in DSB repair due to a shift from Alu/Alu recombination to variable-length NHEJ which removes sequence between the two Alu elements. This increase in NHEJ deletions depends on the presence of Alu sequence homeology (similar but not identical sequences). Analysis of recombination products revealed that Alu/Alu recombination junctions occur more frequently in the first 100 bp of the Alu element within our reporter assay, just as they do in genomic Alu/Alu recombination events. This is the first extensive study characterizing the influence of Alu element sequence divergence on DNA repair, which will inform predictions regarding the effect of Alu element sequence divergence on both

  14. Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

    Science.gov (United States)

    Pelsy, F.; Merdinoglu, D.

    2002-09-01

    A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

  15. Structural and functional analysis of mouse Msx1 gene promoter: sequence conservation with human MSX1 promoter points at potential regulatory elements.

    Science.gov (United States)

    Gonzalez, S M; Ferland, L H; Robert, B; Abdelhay, E

    1998-06-01

    Vertebrate Msx genes are related to one of the most divergent homeobox genes of Drosophila, the muscle segment homeobox (msh) gene, and are expressed in a well-defined pattern at sites of tissue interactions. This pattern of expression is conserved in vertebrates as diverse as quail, zebrafish, and mouse in a range of sites including neural crest, appendages, and craniofacial structures. In the present work, we performed structural and functional analyses in order to identify potential cis-acting elements that may be regulating Msx1 gene expression. To this end, a 4.9-kb segment of the 5'-flanking region was sequenced and analyzed for transcription-factor binding sites. Four regions showing a high concentration of these sites were identified. Transfection assays with fragments of regulatory sequences driving the expression of the bacterial lacZ reporter gene showed that a region of 4 kb upstream of the transcription start site contains positive and negative elements responsible for controlling gene expression. Interestingly, a fragment of 130 bp seems to contain the minimal elements necessary for gene expression, as its removal completely abolishes gene expression in cultured cells. These results are reinforced by comparison of this region with the human Msx1 gene promoter, which shows extensive conservation, including many consensus binding sites, suggesting a regulatory role for them.

  16. IMPROVING THE SKILL AND THE INTEREST OF WRITING ADVERTISEMENTS AND POSTERS THROUGH ESA SEQUENCE

    Directory of Open Access Journals (Sweden)

    Fatma Yuniarti

    2016-04-01

    Full Text Available The action reserach aims at improving the students’ writing skill especially to write advertsements and posters. Both are the short functional texts to be learned at the first semester of the ninth grade. According to the data on pre cycle, the students of class IXA Junior High School Swadhipa Natar, South Lampung got difficulties in writing advertisements and posters. A treatment was necessary to help the students overcome their problem. To consider the related literature, the writer decided to implement ESA sequence in the class. The elements of teaching in ESA Sequence are Engage (to arouse the students’ interests, Study (learn the language focus, and Activate (use the language freely and communicatively.The data were taken from the test of the linguistic competence mastery, the students writing, and the questionnaire. The result shows ESA Sequence can improve the students’ ability in writing advertisements and posters.Key words : ESA (Engange Study Activate, advertisement, poster.

  17. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Directory of Open Access Journals (Sweden)

    Amanda Malvessi Cattani

    Full Text Available Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  18. Repetitive Elements in Mycoplasma hyopneumoniae Transcriptional Regulation.

    Science.gov (United States)

    Cattani, Amanda Malvessi; Siqueira, Franciele Maboni; Guedes, Rafael Lucas Muniz; Schrank, Irene Silveira

    2016-01-01

    Transcriptional regulation, a multiple-step process, is still poorly understood in the important pig pathogen Mycoplasma hyopneumoniae. Basic motifs like promoters and terminators have already been described, but no other cis-regulatory elements have been found. DNA repeat sequences have been shown to be an interesting potential source of cis-regulatory elements. In this work, a genome-wide search for tandem and palindromic repetitive elements was performed in the intergenic regions of all coding sequences from M. hyopneumoniae strain 7448. Computational analysis demonstrated the presence of 144 tandem repeats and 1,171 palindromic elements. The DNA repeat sequences were distributed within the 5' upstream regions of 86% of transcriptional units of M. hyopneumoniae strain 7448. Comparative analysis between distinct repetitive sequences found in related mycoplasma genomes demonstrated different percentages of conservation among pathogenic and nonpathogenic strains. qPCR assays revealed differential expression among genes showing variable numbers of repetitive elements. In addition, repeats found in 206 genes already described to be differentially regulated under different culture conditions of M. hyopneumoniae strain 232 showed almost 80% conservation in relation to M. hyopneumoniae strain 7448 repeats. Altogether, these findings suggest a potential regulatory role of tandem and palindromic DNA repeats in the M. hyopneumoniae transcriptional profile.

  19. Does tonality boost short-term memory in congenital amusia?

    Science.gov (United States)

    Albouy, Philippe; Schulze, Katrin; Caclin, Anne; Tillmann, Barbara

    2013-11-06

    Congenital amusia is a neuro-developmental disorder of music perception and production. Recent findings have demonstrated that this deficit is linked to an impaired short-term memory for tone sequences. As it has been shown before that non-musicians' implicit knowledge of musical regularities can improve short-term memory for tone information, the present study investigated if this type of implicit knowledge could also influence amusics' short-term memory performance. Congenital amusics and their matched controls, who were non-musicians, had to indicate whether sequences of five tones, presented in pairs, were the same or different; half of the pairs respected musical regularities (tonal sequences) and the other half did not (atonal sequences). As previously reported for non-musician participants, the control participants showed better performance (as measured with d') for tonal sequences than for atonal ones. While this improvement was not observed in amusics, both control and amusic participants showed faster response times for tonal sequences than for atonal sequences. These findings suggest that some implicit processing of tonal structures is potentially preserved in congenital amusia. This observation is encouraging as it strengthens the perspective to exploit implicit knowledge to help reducing pitch perception and memory deficits in amusia. © 2013 Elsevier B.V. All rights reserved.

  20. Car sequencing is NP-hard: a short proof

    OpenAIRE

    B Estellon; F Gardi

    2013-01-01

    In this note, a new proof is given that the car sequencing (CS) problem is NP-hard. Established from the Hamiltonian Path problem, the reduction is direct while closing some gaps remaining in the previous NP-hardness results. Since CS is studied in many operational research courses, this result and its proof are particularly interesting for teaching purposes.

  1. Multiple regulatory mechanisms of hepatocyte growth factor expression in malignant cells with a short poly(dA) sequence in the HGF gene promoter.

    Science.gov (United States)

    Sakai, Kazuko; Takeda, Masayuki; Okamoto, Isamu; Nakagawa, Kazuhiko; Nishio, Kazuto

    2015-01-01

    Hepatocyte growth factor (HGF) expression is a poor prognostic factor in various types of cancer. Expression levels of HGF have been reported to be regulated by shorter poly(dA) sequences in the promoter region. In the present study, the poly(dA) mononucleotide tract in various types of human cancer cell lines was examined and compared with the HGF expression levels in those cells. Short deoxyadenosine repeat sequences were detected in five of the 55 cell lines used in the present study. The H69, IM95, CCK-81, Sui73 and H28 cells exhibited a truncated poly(dA) sequence in which the number of poly(dA) repeats was reduced by ≥5 bp. Two of the cell lines exhibited high HGF expression, determined by reverse transcription quantitative polymerase chain reaction and enzyme-linked immunosorbent assay. The CCK-81, Sui73 and H28 cells with shorter poly(dA) sequences exhibited low HGF expression. The cause of the suppression of HGF expression in the CCK-81, Sui73 and H28 cells was clarified by two approaches, suppression by methylation and single nucleotide polymorphisms in the HGF gene. Exposure to 5-Aza-dC, an inhibitor of DNA methyltransferase 1, induced an increased expression of HGF in the CCK-81 cells, but not in the other cells. Single-nucleotide polymorphism (SNP) rs72525097 in intron 1 was detected in the Sui73 and H28 cells. Taken together, it was found that the defect of poly(dA) in the HGF promoter was present in various types of cancer, including lung, stomach, colorectal, pancreas and mesothelioma. The present study proposes the negative regulation mechanisms by methylation and SNP in intron 1 of HGF for HGF expression in cancer cells with short poly(dA).

  2. IMPROVING THE SKILL AND THE INTEREST OF WRITING ADVERTISEMENTS AND POSTERS THROUGH ESA SEQUENCE

    Directory of Open Access Journals (Sweden)

    author Fatma Yuniarti

    2015-01-01

    Full Text Available The action reserach aims at improving the students’ writing skill especially to write advertsements and posters. Both are the short functional texts to be learned at the second semester. According to the data on pre cycle, the students of second semester got difficulties to write advertisements and posters. A treatment was necessary to help the students overcome their problem. To consider the related literature, the writer decided to implement ESA sequence (Harmer 2001 in the class. The elements of teaching in ESA Sequence are Engage (to arouse the students’ interests, Study (learn the language focus, and Activate (use the language freely and communicatively. The data were taken from the test of the linguistic competence mastery, the students writing, and the questionnaire. The students’ linguistic competence got increased as shown by the score (58 in pre-cycle, 66 in cycle 1, and 70 in cycle 2. The students’ ability to write the short functional texts also get improved as indicated by the average score on writing tasks (53 in pre-cycle, 63 in cycle 1, 72 in cycle 2. The interest also gets better as shown by the score of the questionnaire (22,3 in pre-cycle, 33,5 in cycle 1, and 37 in cycle 2. It means ESA Sequence can improve the studets’ ability to write advertisements and posters.Key words : advertisement, ESA (Engange Study Activate, poster

  3. Deletion of ultraconserved elements yields viable mice

    Energy Technology Data Exchange (ETDEWEB)

    Ahituv, Nadav; Zhu, Yiwen; Visel, Axel; Holt, Amy; Afzal, Veena; Pennacchio, Len A.; Rubin, Edward M.

    2007-07-15

    Ultraconserved elements have been suggested to retainextended perfect sequence identity between the human, mouse, and ratgenomes due to essential functional properties. To investigate thenecessities of these elements in vivo, we removed four non-codingultraconserved elements (ranging in length from 222 to 731 base pairs)from the mouse genome. To maximize the likelihood of observing aphenotype, we chose to delete elements that function as enhancers in amouse transgenic assay and that are near genes that exhibit markedphenotypes both when completely inactivated in the mouse as well as whentheir expression is altered due to other genomic modifications.Remarkably, all four resulting lines of mice lacking these ultraconservedelements were viable and fertile, and failed to reveal any criticalabnormalities when assayed for a variety of phenotypes including growth,longevity, pathology and metabolism. In addition more targeted screens,informed by the abnormalities observed in mice where genes in proximityto the investigated elements had been altered, also failed to revealnotable abnormalities. These results, while not inclusive of all thepossible phenotypic impact of the deleted sequences, indicate thatextreme sequence constraint does not necessarily reflect crucialfunctions required for viability.

  4. Short term reproducibility of a high contrast 3-D isotropic optic nerve imaging sequence in healthy controls

    Science.gov (United States)

    Harrigan, Robert L.; Smith, Alex K.; Mawn, Louise A.; Smith, Seth A.; Landman, Bennett A.

    2016-03-01

    The optic nerve (ON) plays a crucial role in human vision transporting all visual information from the retina to the brain for higher order processing. There are many diseases that affect the ON structure such as optic neuritis, anterior ischemic optic neuropathy and multiple sclerosis. Because the ON is the sole pathway for visual information from the retina to areas of higher level processing, measures of ON damage have been shown to correlate well with visual deficits. Increased intracranial pressure has been shown to correlate with the size of the cerebrospinal fluid (CSF) surrounding the ON. These measures are generally taken at an arbitrary point along the nerve and do not account for changes along the length of the ON. We propose a high contrast and high-resolution 3-D acquired isotropic imaging sequence optimized for ON imaging. We have acquired scan-rescan data using the optimized sequence and a current standard of care protocol for 10 subjects. We show that this sequence has superior contrast-to-noise ratio to the current standard of care while achieving a factor of 11 higher resolution. We apply a previously published automatic pipeline to segment the ON and CSF sheath and measure the size of each individually. We show that these measures of ON size have lower short- term reproducibility than the population variance and the variability along the length of the nerve. We find that the proposed imaging protocol is (1) useful in detecting population differences and local changes and (2) a promising tool for investigating biomarkers related to structural changes of the ON.

  5. Detection of Weakly Conserved Ancestral Mammalian RegulatorySequences by Primate Comparisons

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Qian-fei; Prabhakar, Shyam; Chanan, Sumita; Cheng,Jan-Fang; Rubin, Edward M.; Boffelli, Dario

    2006-06-01

    Genomic comparisons between human and distant, non-primatemammals are commonly used to identify cis-regulatory elements based onconstrained sequence evolution. However, these methods fail to detectcryptic functional elements, which are too weakly conserved among mammalsto distinguish from nonfunctional DNA. To address this problem, weexplored the potential of deep intra-primate sequence comparisons. Wesequenced the orthologs of 558 kb of human genomic sequence, coveringmultiple loci involved in cholesterol homeostasis, in 6 nonhumanprimates. Our analysis identified 6 noncoding DNA elements displayingsignificant conservation among primates, but undetectable in more distantcomparisons. In vitro and in vivo tests revealed that at least three ofthese 6 elements have regulatory function. Notably, the mouse orthologsof these three functional human sequences had regulatory activity despitetheir lack of significant sequence conservation, indicating that they arecryptic ancestral cis-regulatory elements. These regulatory elementscould still be detected in a smaller set of three primate speciesincluding human, rhesus and marmoset. Since the human and rhesus genomesequences are already available, and the marmoset genome is activelybeing sequenced, the primate-specific conservation analysis describedhere can be applied in the near future on a whole-genome scale, tocomplement the annotation provided by more distant speciescomparisons.

  6. An AU-rich element in the 3{prime} untranslated region of the spinach chloroplast petD gene participates in sequence-specific RNA-protein complex formation

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Qiuyun; Adams, C.C.; Usack, L. [Cornell Univ., Ithaca, NY (United States)] [and others

    1995-04-01

    In chloroplasts, the 3{prime} untranslated regions of most mRNAs contain a stem-loop-forming inverted repeat (IR) sequence that is required for mRNA stability and correct 3{prime}-end formation. The IR regions of several mRNAs are also known to bind chloroplast proteins, as judged from in vitro gel mobility shift and UV cross-linking assays, and these RNA-protein interactions may be involved in the regulation of chloroplast mRNA processing and/or stability. Here we describe in detail the RNA and protein components that are involved in 3{prime} IR-containing RNA (3{prime} IR-RNA)-protein complex formation for the spinach chloroplast petD gene, which encodes subunit IV of the cytochrome b{sub 6}/f complex. We show that the complex contains 55-, 41-, and 29-kDa RNA-binding proteins (ribonucleoproteins [RNPs]). These proteins together protect a 90-nucleotide segment of RNA from RNase T{sub 1} digestion; this RNA contains the IR and downstream flanking sequences. Competition experiments using 3{prime} IR-RNAs from the psbA or rbcL gene demonstrate that the RNPs have a strong specificity for the petD sequence. Site-directed mutagenesis was carried out to define the RNA sequence elements required for complex formation. These studies identified an 8-nucleotide AU-rich sequence downstream of the IR; mutations within this sequence had moderate to severe effects on RNA-protein complex formation. Although other similar sequences are present in the petD 3{prime} untranslated region, only a single copy, which we have termed box II, appears to be essential for in vivo protein binding. In addition, the IR itself is necessary for optimal complex formation. These two sequence elements together with an RNP complex may direct correct 3{prime}-end processing and/or influence the stability of petD mRNA in chloroplasts. 48 refs., 9 figs., 2 tabs.

  7. Massively parallel sequencing of forensic STRs

    DEFF Research Database (Denmark)

    Parson, Walther; Ballard, David; Budowle, Bruce

    2016-01-01

    The DNA Commission of the International Society for Forensic Genetics (ISFG) is reviewing factors that need to be considered ahead of the adoption by the forensic community of short tandem repeat (STR) genotyping by massively parallel sequencing (MPS) technologies. MPS produces sequence data that...

  8. libgapmis: extending short-read alignments.

    Science.gov (United States)

    Alachiotis, Nikolaos; Berger, Simon; Flouri, Tomáš; Pissis, Solon P; Stamatakis, Alexandros

    2013-01-01

    A wide variety of short-read alignment programmes have been published recently to tackle the problem of mapping millions of short reads to a reference genome, focusing on different aspects of the procedure such as time and memory efficiency, sensitivity, and accuracy. These tools allow for a small number of mismatches in the alignment; however, their ability to allow for gaps varies greatly, with many performing poorly or not allowing them at all. The seed-and-extend strategy is applied in most short-read alignment programmes. After aligning a substring of the reference sequence against the high-quality prefix of a short read--the seed--an important problem is to find the best possible alignment between a substring of the reference sequence succeeding and the remaining suffix of low quality of the read--extend. The fact that the reads are rather short and that the gap occurrence frequency observed in various studies is rather low suggest that aligning (parts of) those reads with a single gap is in fact desirable. In this article, we present libgapmis, a library for extending pairwise short-read alignments. Apart from the standard CPU version, it includes ultrafast SSE- and GPU-based implementations. libgapmis is based on an algorithm computing a modified version of the traditional dynamic-programming matrix for sequence alignment. Extensive experimental results demonstrate that the functions of the CPU version provided in this library accelerate the computations by a factor of 20 compared to other programmes. The analogous SSE- and GPU-based implementations accelerate the computations by a factor of 6 and 11, respectively, compared to the CPU version. The library also provides the user the flexibility to split the read into fragments, based on the observed gap occurrence frequency and the length of the read, thereby allowing for a variable, but bounded, number of gaps in the alignment. We present libgapmis, a library for extending pairwise short-read alignments. We

  9. Deep-sequencing protocols influence the results obtained in small-RNA sequencing.

    Directory of Open Access Journals (Sweden)

    Joern Toedling

    Full Text Available Second-generation sequencing is a powerful method for identifying and quantifying small-RNA components of cells. However, little attention has been paid to the effects of the choice of sequencing platform and library preparation protocol on the results obtained. We present a thorough comparison of small-RNA sequencing libraries generated from the same embryonic stem cell lines, using different sequencing platforms, which represent the three major second-generation sequencing technologies, and protocols. We have analysed and compared the expression of microRNAs, as well as populations of small RNAs derived from repetitive elements. Despite the fact that different libraries display a good correlation between sequencing platforms, qualitative and quantitative variations in the results were found, depending on the protocol used. Thus, when comparing libraries from different biological samples, it is strongly recommended to use the same sequencing platform and protocol in order to ensure the biological relevance of the comparisons.

  10. Characterizing leader sequences of CRISPR loci

    DEFF Research Database (Denmark)

    Alkhnbashi, Omer; Shah, Shiraz Ali; Garrett, Roger Antony

    2016-01-01

    The CRISPR-Cas system is an adaptive immune system in many archaea and bacteria, which provides resistance against invading genetic elements. The first phase of CRISPR-Cas immunity is called adaptation, in which small DNA fragments are excised from genetic elements and are inserted into a CRISPR...... array generally adjacent to its so called leader sequence at one end of the array. It has been shown that transcription initiation and adaptation signals of the CRISPR array are located within the leader. However, apart from promoters, there is very little knowledge of sequence or structural motifs...... sequences by focusing on the consensus repeat of the adjacent CRISPR array and weak upstream conservation signals. We applied our tool to the analysis of a comprehensive genomic database and identified several characteristic properties of leader sequences specific to archaea and bacteria, ranging from...

  11. Adaptive Basis Selection for Exponential Family Smoothing Splines with Application in Joint Modeling of Multiple Sequencing Samples

    OpenAIRE

    Ma, Ping; Zhang, Nan; Huang, Jianhua Z.; Zhong, Wenxuan

    2017-01-01

    Second-generation sequencing technologies have replaced array-based technologies and become the default method for genomics and epigenomics analysis. Second-generation sequencing technologies sequence tens of millions of DNA/cDNA fragments in parallel. After the resulting sequences (short reads) are mapped to the genome, one gets a sequence of short read counts along the genome. Effective extraction of signals in these short read counts is the key to the success of sequencing technologies. No...

  12. Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

    Science.gov (United States)

    Tetreault, Hannah M.; Ungerer, Mark C.

    2016-01-01

    The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667

  13. Sequence-to-Sequence Prediction of Vehicle Trajectory via LSTM Encoder-Decoder Architecture

    OpenAIRE

    Park, Seong Hyeon; Kim, ByeongDo; Kang, Chang Mook; Chung, Chung Choo; Choi, Jun Won

    2018-01-01

    In this paper, we propose a deep learning based vehicle trajectory prediction technique which can generate the future trajectory sequence of surrounding vehicles in real time. We employ the encoder-decoder architecture which analyzes the pattern underlying in the past trajectory using the long short-term memory (LSTM) based encoder and generates the future trajectory sequence using the LSTM based decoder. This structure produces the $K$ most likely trajectory candidates over occupancy grid ma...

  14. Some critical remarks on a sequence of events interpreted to possibly originate from a decay chain of an element 120 isotope

    Energy Technology Data Exchange (ETDEWEB)

    Hessberger, F.P. [GSI - Helmholtzzentrum fuer Schwerionenforschung GmbH, Darmstadt (Germany); Helmholtz-Institut Mainz, Mainz (Germany); Ackermann, D. [GANIL, Caen (France)

    2017-06-15

    A sequence of three events observed in an irradiation of {sup 248}Cm with {sup 54}Cr at the velocity filter SHIP of the GSI - Helmholtzzentrum fuer Schwerionenforschung GmbH, 64291 Darmstadt, Germany, had been interpreted as a decay chain consisting of three α particles. On the basis of measured energies, a possible assignment to the decay of an isotope of element 120 was discussed, although it was stated that a definite assignment could not be made. A critical analysis of the data, however, shows that the reported events do not have the properties of a decay chain consisting of three α particles and (probably being terminated by) a spontaneous fission event, but that this is rather a random sequence of events. (orig.)

  15. Three new insertion sequence elements ISLdl2, ISLdl3, and ISLdl4 in Lactobacillus delbrueckii: isolation, molecular characterization, and potential use for strain identification.

    Science.gov (United States)

    Ravin, Victor; Alatossava, Tapani

    2003-05-01

    A group of new insertion sequence (IS) elements, ISLdl2, ISLdl3, and ISLdl4, from Lactobacillus delbrueckii subsp. lactis ATCC 15808 was isolated, characterized, and used for strain identification together with ISLdl1, recently characterized as an L. delbrueckii IS element belonging to the ISL3 family. ISLdl2 was 1367 bp in size and had a 24 bp IR and an 8 bp DR. The single ORF of ISLdl2 encoded a protein of 392 aa similar to transposases of the IS256 family. ISLdl3 had a single ORF encoding a protein of 343 aa similar to transposases of the IS30 family. Finally, ISLdl4 had a single ORF encoding a protein of 406 aa and displayed homology to the transposases of the IS110 family. ISLdl4 was only slight different from ISL4 (Accession No. AY040213). ISLdl1, ISLdl2, and ISLdl4 were present in all of the 10 L. delbrueckii subsp. lactis and subsp. delbrueckii strains tested, as well as in three of the 11 L. delbrueckii subsp. bulgaricus strains tested. ISLdl3 was present only in four closely related strains of L. delbrueckii subsp. lactis. These IS elements were not observed in Lactobacillus rhamnosus, Lactobacillus acidophilus, Lactobacillus helveticus, or Lactobacillus plantarum. A cluster of IS elements, ISLdl1, ISLdl2, ISLdl3, ISLdl4, and ISL6, was observed in L. delbrueckii subsp. lactis strain ATCC 15808. Within this cluster, ISLdl4 was inserted into ISLdl1 between the left IR and the start codon of ORF455, encoding a putative transposase. Most of the integration sites of the IS elements were strain-specific. We have observed that IS elements can migrate from one strain to another as integral parts of bacterial DNA by using phage LL-H as a vehicle. We demonstrate for the first time that inverse PCR and vectorette PCR methods with primers based on sequences of the IS elements could be used for identification of L. delbrueckii strains.

  16. Structural and functional analysis of an enhancer GPEI having a phorbol 12-O-tetradecanoate 13-acetate responsive element-like sequence found in the rat glutathione transferase P gene.

    Science.gov (United States)

    Okuda, A; Imagawa, M; Maeda, Y; Sakai, M; Muramatsu, M

    1989-10-05

    We have recently identified a typical enhancer, termed GPEI, located about 2.5 kilobases upstream from the transcription initiation site of the rat glutathione transferase P gene. Analyses of 5' and 3' deletion mutants revealed that the cis-acting sequence of GPEI contained the phorbol 12-O-tetradecanoate 13-acetate responsive element (TRE)-like sequence in it. For the maximal activity, however, GPEI required an adjacent upstream sequence of about 19 base pairs in addition to the TRE-like sequence. With the DNA binding gel-shift assay, we could detect protein(s) that specifically binds to the TRE-like sequence of GPEI fragment, which was possibly c-jun.c-fos complex or a similar protein complex. The sequence immediately upstream of the TRE-like sequence did not have any activity by itself, but augmented the latter activity by about 5-fold.

  17. Rare earth elements in the banded iron formation of the Griqualand West sequence, northern Cape Province, South Africa

    International Nuclear Information System (INIS)

    Horstmann, U.E.; Haelbich, I.W.; Cornell, D.H.

    1990-01-01

    The Proterozoic banded iron-formations (BIF) of the Griqualand West sequence of the Transvaal Supergroup in the northern Cape Province of South Africa have been investigated for their rare earth elements (REE) contents. Twenty three REE analyses were completed using an ICP-AES method. Despite diagenetic and metamorphic processes, it can be concluded from the so far available REE data that the conspicuous differences in REE patterns to those reported from elsewhere indicate the BIF of the Transvaal Supergroup to have originated in relative restricted parts or basins of the Precambrian ocean. 7 refs., 1 fig

  18. BarraCUDA - a fast short read sequence aligner using graphics processing units

    Directory of Open Access Journals (Sweden)

    Klus Petr

    2012-01-01

    Full Text Available Abstract Background With the maturation of next-generation DNA sequencing (NGS technologies, the throughput of DNA sequencing reads has soared to over 600 gigabases from a single instrument run. General purpose computing on graphics processing units (GPGPU, extracts the computing power from hundreds of parallel stream processors within graphics processing cores and provides a cost-effective and energy efficient alternative to traditional high-performance computing (HPC clusters. In this article, we describe the implementation of BarraCUDA, a GPGPU sequence alignment software that is based on BWA, to accelerate the alignment of sequencing reads generated by these instruments to a reference DNA sequence. Findings Using the NVIDIA Compute Unified Device Architecture (CUDA software development environment, we ported the most computational-intensive alignment component of BWA to GPU to take advantage of the massive parallelism. As a result, BarraCUDA offers a magnitude of performance boost in alignment throughput when compared to a CPU core while delivering the same level of alignment fidelity. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the alignment throughput. Conclusions BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the alignment of millions of sequencing reads generated by NGS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology. BarraCUDA is currently available from http://seqbarracuda.sf.net

  19. BarraCUDA - a fast short read sequence aligner using graphics processing units

    LENUS (Irish Health Repository)

    Klus, Petr

    2012-01-13

    Abstract Background With the maturation of next-generation DNA sequencing (NGS) technologies, the throughput of DNA sequencing reads has soared to over 600 gigabases from a single instrument run. General purpose computing on graphics processing units (GPGPU), extracts the computing power from hundreds of parallel stream processors within graphics processing cores and provides a cost-effective and energy efficient alternative to traditional high-performance computing (HPC) clusters. In this article, we describe the implementation of BarraCUDA, a GPGPU sequence alignment software that is based on BWA, to accelerate the alignment of sequencing reads generated by these instruments to a reference DNA sequence. Findings Using the NVIDIA Compute Unified Device Architecture (CUDA) software development environment, we ported the most computational-intensive alignment component of BWA to GPU to take advantage of the massive parallelism. As a result, BarraCUDA offers a magnitude of performance boost in alignment throughput when compared to a CPU core while delivering the same level of alignment fidelity. The software is also capable of supporting multiple CUDA devices in parallel to further accelerate the alignment throughput. Conclusions BarraCUDA is designed to take advantage of the parallelism of GPU to accelerate the alignment of millions of sequencing reads generated by NGS instruments. By doing this, we could, at least in part streamline the current bioinformatics pipeline such that the wider scientific community could benefit from the sequencing technology. BarraCUDA is currently available from http:\\/\\/seqbarracuda.sf.net

  20. Alu Mobile Elements: From Junk DNA to Genomic Gems

    Directory of Open Access Journals (Sweden)

    Sami Dridi

    2012-01-01

    Full Text Available Alus, the short interspersed repeated sequences (SINEs, are retrotransposons that litter the human genomes and have long been considered junk DNA. However, recent findings that these mobile elements are transcribed, both as distinct RNA polymerase III transcripts and as a part of RNA polymerase II transcripts, suggest biological functions and refute the notion that Alus are biologically unimportant. Indeed, Alu RNAs have been shown to control mRNA processing at several levels, to have complex regulatory functions such as transcriptional repression and modulating alternative splicing and to cause a host of human genetic diseases. Alu RNAs embedded in Pol II transcripts can promote evolution and proteome diversity, which further indicates that these mobile retroelements are in fact genomic gems rather than genomic junks.

  1. eShadow: A tool for comparing closely related sequences

    Energy Technology Data Exchange (ETDEWEB)

    Ovcharenko, Ivan; Boffelli, Dario; Loots, Gabriela G.

    2004-01-15

    Primate sequence comparisons are difficult to interpret due to the high degree of sequence similarity shared between such closely related species. Recently, a novel method, phylogenetic shadowing, has been pioneered for predicting functional elements in the human genome through the analysis of multiple primate sequence alignments. We have expanded this theoretical approach to create a computational tool, eShadow, for the identification of elements under selective pressure in multiple sequence alignments of closely related genomes, such as in comparisons of human to primate or mouse to rat DNA. This tool integrates two different statistical methods and allows for the dynamic visualization of the resulting conservation profile. eShadow also includes a versatile optimization module capable of training the underlying Hidden Markov Model to differentially predict functional sequences. This module grants the tool high flexibility in the analysis of multiple sequence alignments and in comparing sequences with different divergence rates. Here, we describe the eShadow comparative tool and its potential uses for analyzing both multiple nucleotide and protein alignments to predict putative functional elements. The eShadow tool is publicly available at http://eshadow.dcode.org/

  2. Analysis of a new strain of Euphorbia mosaic virus with distinct replication specificity unveils a lineage of begomoviruses with short Rep sequences in the DNA-B intergenic region

    Directory of Open Access Journals (Sweden)

    Argüello-Astorga Gerardo R

    2010-10-01

    Full Text Available Abstract Background Euphorbia mosaic virus (EuMV is a member of the SLCV clade, a lineage of New World begomoviruses that display distinctive features in their replication-associated protein (Rep and virion-strand replication origin. The first entirely characterized EuMV isolate is native from Yucatan Peninsula, Mexico; subsequently, EuMV was detected in weeds and pepper plants from another region of Mexico, and partial DNA-A sequences revealed significant differences in their putative replication specificity determinants with respect to EuMV-YP. This study was aimed to investigate the replication compatibility between two EuMV isolates from the same country. Results A new isolate of EuMV was obtained from pepper plants collected at Jalisco, Mexico. Full-length clones of both genomic components of EuMV-Jal were biolistically inoculated into plants of three different species, which developed symptoms indistinguishable from those induced by EuMV-YP. Pseudorecombination experiments with EuMV-Jal and EuMV-YP genomic components demonstrated that these viruses do not form infectious reassortants in Nicotiana benthamiana, presumably because of Rep-iteron incompatibility. Sequence analysis of the EuMV-Jal DNA-B intergenic region (IR led to the unexpected discovery of a 35-nt-long sequence that is identical to a segment of the rep gene in the cognate viral DNA-A. Similar short rep sequences ranging from 35- to 51-nt in length were identified in all EuMV isolates and in three distinct viruses from South America related to EuMV. These short rep sequences in the DNA-B IR are positioned downstream to a ~160-nt non-coding domain highly similar to the CP promoter of begomoviruses belonging to the SLCV clade. Conclusions EuMV strains are not compatible in replication, indicating that this begomovirus species probably is not a replicating lineage in nature. The genomic analysis of EuMV-Jal led to the discovery of a subgroup of SLCV clade viruses that contain in

  3. Disruption of a Transcriptional Repressor by an Insertion Sequence Element Integration Leads to Activation of a Novel Silent Cellobiose Transporter in Lactococcus lactis MG1363.

    Science.gov (United States)

    Solopova, Ana; Kok, Jan; Kuipers, Oscar P

    2017-12-01

    Lactococcus lactis subsp. cremoris strains typically carry many dairy niche-specific adaptations. During adaptation to the milk environment these former plant strains have acquired various pseudogenes and insertion sequence elements indicative of ongoing genome decay and frequent transposition events in their genomes. Here we describe the reactivation of a silenced plant sugar utilization cluster in an L. lactis MG1363 derivative lacking the two main cellobiose transporters, PtcBA-CelB and PtcBAC, upon applying selection pressure to utilize cellobiose. A disruption of the transcriptional repressor gene llmg_1239 by an insertion sequence (IS) element allows expression of the otherwise silent novel cellobiose transporter Llmg_1244 and leads to growth of mutant strains on cellobiose. Llmg_1239 was labeled CclR, for c ellobiose cl uster r epressor. IMPORTANCE Insertion sequences (ISs) play an important role in the evolution of lactococci and other bacteria. They facilitate DNA rearrangements and are responsible for creation of new genetic variants with selective advantages under certain environmental conditions. L. lactis MG1363 possesses 71 copies in a total of 11 different types of IS elements. This study describes yet another example of an IS-mediated adaptive evolution. An integration of IS 981 or IS 905 into a gene coding for a transcriptional repressor led to activation of the repressed gene cluster coding for a plant sugar utilization pathway. The expression of the gene cluster allowed assembly of a novel cellobiose-specific transporter and led to cell growth on cellobiose. Copyright © 2017 American Society for Microbiology.

  4. Two estrogen response element sequences near the PCNA gene are not responsible for its estrogen-enhanced expression in MCF7 cells.

    Science.gov (United States)

    Wang, Cheng; Yu, Jie; Kallen, Caleb B

    2008-01-01

    The proliferating cell nuclear antigen (PCNA) is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE) sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2) enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2. Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays. We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.

  5. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed; Mansour, Essam; Kalnis, Panos

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern

  6. GapMis: a tool for pairwise sequence alignment with a single gap.

    Science.gov (United States)

    Flouri, Tomás; Frousios, Kimon; Iliopoulos, Costas S; Park, Kunsoo; Pissis, Solon P; Tischler, German

    2013-08-01

    Pairwise sequence alignment has received a new motivation due to the advent of recent patents in next-generation sequencing technologies, particularly so for the application of re-sequencing---the assembly of a genome directed by a reference sequence. After the fast alignment between a factor of the reference sequence and a high-quality fragment of a short read by a short-read alignment programme, an important problem is to find the alignment between a relatively short succeeding factor of the reference sequence and the remaining low-quality part of the read allowing a number of mismatches and the insertion of a single gap in the alignment. We present GapMis, a tool for pairwise sequence alignment with a single gap. It is based on a simple algorithm, which computes a different version of the traditional dynamic programming matrix. The presented experimental results demonstrate that GapMis is more suitable and efficient than most popular tools for this task.

  7. Gene expression promoted by the SV40 DNA targeting sequence and the hypoxia-responsive element under normoxia and hypoxia

    Directory of Open Access Journals (Sweden)

    C.B. Sacramento

    2010-08-01

    Full Text Available The main objective of the present study was to find suitable DNA-targeting sequences (DTS for the construction of plasmid vectors to be used to treat ischemic diseases. The well-known Simian virus 40 nuclear DTS (SV40-DTS and hypoxia-responsive element (HRE sequences were used to construct plasmid vectors to express the human vascular endothelial growth factor gene (hVEGF. The rate of plasmid nuclear transport and consequent gene expression under normoxia (20% O2 and hypoxia (less than 5% O2 were determined. Plasmids containing the SV40-DTS or HRE sequences were constructed and used to transfect the A293T cell line (a human embryonic kidney cell line in vitro and mouse skeletal muscle cells in vivo. Plasmid transport to the nucleus was monitored by real-time PCR, and the expression level of the hVEGF gene was measured by ELISA. The in vitro nuclear transport efficiency of the SV40-DTS plasmid was about 50% lower under hypoxia, while the HRE plasmid was about 50% higher under hypoxia. Quantitation of reporter gene expression in vitro and in vivo, under hypoxia and normoxia, confirmed that the SV40-DTS plasmid functioned better under normoxia, while the HRE plasmid was superior under hypoxia. These results indicate that the efficiency of gene expression by plasmids containing DNA binding sequences is affected by the concentration of oxygen in the medium.

  8. The CRISPR Spacer Space Is Dominated by Sequences from Species-Specific Mobilomes.

    Science.gov (United States)

    Shmakov, Sergey A; Sitnik, Vassilii; Makarova, Kira S; Wolf, Yuri I; Severinov, Konstantin V; Koonin, Eugene V

    2017-09-19

    Clustered regularly interspaced short palindromic repeats and CRISPR-associated protein (CRISPR-Cas) systems store the memory of past encounters with foreign DNA in unique spacers that are inserted between direct repeats in CRISPR arrays. For only a small fraction of the spacers, homologous sequences, called protospacers, are detectable in viral, plasmid, and microbial genomes. The rest of the spacers remain the CRISPR "dark matter." We performed a comprehensive analysis of the spacers from all CRISPR- cas loci identified in bacterial and archaeal genomes, and we found that, depending on the CRISPR-Cas subtype and the prokaryotic phylum, protospacers were detectable for 1% to about 19% of the spacers (~7% global average). Among the detected protospacers, the majority, typically 80 to 90%, originated from viral genomes, including proviruses, and among the rest, the most common source was genes that are integrated into microbial chromosomes but are involved in plasmid conjugation or replication. Thus, almost all spacers with identifiable protospacers target mobile genetic elements (MGE). The GC content, as well as dinucleotide and tetranucleotide compositions, of microbial genomes, their spacer complements, and the cognate viral genomes showed a nearly perfect correlation and were almost identical. Given the near absence of self-targeting spacers, these findings are most compatible with the possibility that the spacers, including the dark matter, are derived almost completely from the species-specific microbial mobilomes. IMPORTANCE The principal function of CRISPR-Cas systems is thought to be protection of bacteria and archaea against viruses and other parasitic genetic elements. The CRISPR defense function is mediated by sequences from parasitic elements, known as spacers, that are inserted into CRISPR arrays and then transcribed and employed as guides to identify and inactivate the cognate parasitic genomes. However, only a small fraction of the CRISPR spacers

  9. Distribution of Ds-like sequences in genomes of cereals

    International Nuclear Information System (INIS)

    Vershinin, A.V.; Salina, E.A.; Shumnii, V.K.; Svitashev, S.K.

    1986-01-01

    It has been suggested that insertions of Ds-elements may alter the effectiveness of transcription or translation of the genetic loci and the normal processing of introns and exons, and that they may impair coding frames, etc. The object of the present study was to determine the frequency of occurence of DNA sequences similar to the Ds-controlling elements of mazie (Ds-like sequences) among other representatives of cereals. The conservative feature of the primary structure of transposons from different eukaryotic species served as a basis in this investigation. By means of the ''nick-translation'' reaction with the aid of DNA-polymerase I (alpha- 32 P) dCTP or TTP was introduced into the Ds-element. The specific radioactivity of the preparations obtained was 5 x 10 7 to 1 x 10 8 cpm/gamma. From the results obtained, it is suggested that the genomes of cereals examined contain a collection of Ds-like sequences. The Ds-element may have a significant effect on gene expression in the presence of Ac-like or other sequences, which undergo transposition

  10. Yeast genome sequencing:

    DEFF Research Database (Denmark)

    Piskur, Jure; Langkjær, Rikke Breinhold

    2004-01-01

    For decades, unicellular yeasts have been general models to help understand the eukaryotic cell and also our own biology. Recently, over a dozen yeast genomes have been sequenced, providing the basis to resolve several complex biological questions. Analysis of the novel sequence data has shown...... of closely related species helps in gene annotation and to answer how many genes there really are within the genomes. Analysis of non-coding regions among closely related species has provided an example of how to determine novel gene regulatory sequences, which were previously difficult to analyse because...... they are short and degenerate and occupy different positions. Comparative genomics helps to understand the origin of yeasts and points out crucial molecular events in yeast evolutionary history, such as whole-genome duplication and horizontal gene transfer(s). In addition, the accumulating sequence data provide...

  11. gEVE: a genome-based endogenous viral element database provides comprehensive viral protein-coding sequences in mammalian genomes.

    Science.gov (United States)

    Nakagawa, So; Takahashi, Mahoko Ueda

    2016-01-01

    In mammals, approximately 10% of genome sequences correspond to endogenous viral elements (EVEs), which are derived from ancient viral infections of germ cells. Although most EVEs have been inactivated, some open reading frames (ORFs) of EVEs obtained functions in the hosts. However, EVE ORFs usually remain unannotated in the genomes, and no databases are available for EVE ORFs. To investigate the function and evolution of EVEs in mammalian genomes, we developed EVE ORF databases for 20 genomes of 19 mammalian species. A total of 736,771 non-overlapping EVE ORFs were identified and archived in a database named gEVE (http://geve.med.u-tokai.ac.jp). The gEVE database provides nucleotide and amino acid sequences, genomic loci and functional annotations of EVE ORFs for all 20 genomes. In analyzing RNA-seq data with the gEVE database, we successfully identified the expressed EVE genes, suggesting that the gEVE database facilitates studies of the genomic analyses of various mammalian species.Database URL: http://geve.med.u-tokai.ac.jp. © The Author(s) 2016. Published by Oxford University Press.

  12. Surveying DNA Elements within Functional Genes of Heterocyst-Forming Cyanobacteria.

    Directory of Open Access Journals (Sweden)

    Jason A Hilton

    Full Text Available Some cyanobacteria are capable of differentiating a variety of cell types in response to environmental factors. For instance, in low nitrogen conditions, some cyanobacteria form heterocysts, which are specialized for N2 fixation. Many heterocyst-forming cyanobacteria have DNA elements interrupting key N2 fixation genes, elements that are excised during heterocyst differentiation. While the mechanism for the excision of the element has been well-studied, many questions remain regarding the introduction of the elements into the cyanobacterial lineage and whether they have been retained ever since or have been lost and reintroduced. To examine the evolutionary relationships and possible function of DNA sequences that interrupt genes of heterocyst-forming cyanobacteria, we identified and compared 101 interruption element sequences within genes from 38 heterocyst-forming cyanobacterial genomes. The interruption element lengths ranged from about 1 kb (the minimum able to encode the recombinase responsible for element excision, up to nearly 1 Mb. The recombinase gene sequences served as genetic markers that were common across the interruption elements and were used to track element evolution. Elements were found that interrupted 22 different orthologs, only five of which had been previously observed to be interrupted by an element. Most of the newly identified interrupted orthologs encode proteins that have been shown to have heterocyst-specific activity. However, the presence of interruption elements within genes with no known role in N2 fixation, as well as in three non-heterocyst-forming cyanobacteria, indicates that the processes that trigger the excision of elements may not be limited to heterocyst development or that the elements move randomly within genomes. This comprehensive analysis provides the framework to study the history and behavior of these unique sequences, and offers new insight regarding the frequency and persistence of interruption

  13. Surveying DNA Elements within Functional Genes of Heterocyst-Forming Cyanobacteria.

    Science.gov (United States)

    Hilton, Jason A; Meeks, John C; Zehr, Jonathan P

    2016-01-01

    Some cyanobacteria are capable of differentiating a variety of cell types in response to environmental factors. For instance, in low nitrogen conditions, some cyanobacteria form heterocysts, which are specialized for N2 fixation. Many heterocyst-forming cyanobacteria have DNA elements interrupting key N2 fixation genes, elements that are excised during heterocyst differentiation. While the mechanism for the excision of the element has been well-studied, many questions remain regarding the introduction of the elements into the cyanobacterial lineage and whether they have been retained ever since or have been lost and reintroduced. To examine the evolutionary relationships and possible function of DNA sequences that interrupt genes of heterocyst-forming cyanobacteria, we identified and compared 101 interruption element sequences within genes from 38 heterocyst-forming cyanobacterial genomes. The interruption element lengths ranged from about 1 kb (the minimum able to encode the recombinase responsible for element excision), up to nearly 1 Mb. The recombinase gene sequences served as genetic markers that were common across the interruption elements and were used to track element evolution. Elements were found that interrupted 22 different orthologs, only five of which had been previously observed to be interrupted by an element. Most of the newly identified interrupted orthologs encode proteins that have been shown to have heterocyst-specific activity. However, the presence of interruption elements within genes with no known role in N2 fixation, as well as in three non-heterocyst-forming cyanobacteria, indicates that the processes that trigger the excision of elements may not be limited to heterocyst development or that the elements move randomly within genomes. This comprehensive analysis provides the framework to study the history and behavior of these unique sequences, and offers new insight regarding the frequency and persistence of interruption elements in

  14. The hormone response element mimic sequence of GAS5 lncRNA is sufficient to induce apoptosis in breast cancer cells

    Science.gov (United States)

    Pickard, Mark R.; Williams, Gwyn T.

    2016-01-01

    Growth arrest-specific 5 (GAS5) lncRNA promotes apoptosis, and its expression is down-regulated in breast cancer. GAS5 lncRNA is a decoy of glucocorticoid/related receptors; a stem-loop sequence constitutes the GAS5 hormone response element mimic (HREM), which is essential for the regulation of breast cancer cell apoptosis. This preclinical study aimed to determine if the GAS5 HREM sequence alone promotes the apoptosis of breast cancer cells. Nucleofection of hormone-sensitive and –insensitive breast cancer cell lines with a GAS5 HREM DNA oligonucleotide increased both basal and ultraviolet-C-induced apoptosis, and decreased culture viability and clonogenic growth, similar to GAS5 lncRNA. The HREM oligonucleotide demonstrated similar sequence specificity to the native HREM for its functional activity and had no effect on endogenous GAS5 lncRNA levels. Certain chemically modified HREM oligonucleotides, notably DNA and RNA phosphorothioates, retained pro-apoptotic. activity. Crucially the HREM oligonucleotide could overcome apoptosis resistance secondary to deficient endogenous GAS5 lncRNA levels. Thus, the GAS5 lncRNA HREM sequence alone is sufficient to induce apoptosis in breast cancer cells, including triple-negative breast cancer cells. These findings further suggest that emerging knowledge of structure/function relationships in the field of lncRNA biology can be exploited for the development of entirely novel, oligonucleotide mimic-based, cancer therapies. PMID:26862727

  15. Sorting signed permutations by short operations.

    Science.gov (United States)

    Galvão, Gustavo Rodrigues; Lee, Orlando; Dias, Zanoni

    2015-01-01

    During evolution, global mutations may alter the order and the orientation of the genes in a genome. Such mutations are referred to as rearrangement events, or simply operations. In unichromosomal genomes, the most common operations are reversals, which are responsible for reversing the order and orientation of a sequence of genes, and transpositions, which are responsible for switching the location of two contiguous portions of a genome. The problem of computing the minimum sequence of operations that transforms one genome into another - which is equivalent to the problem of sorting a permutation into the identity permutation - is a well-studied problem that finds application in comparative genomics. There are a number of works concerning this problem in the literature, but they generally do not take into account the length of the operations (i.e. the number of genes affected by the operations). Since it has been observed that short operations are prevalent in the evolution of some species, algorithms that efficiently solve this problem in the special case of short operations are of interest. In this paper, we investigate the problem of sorting a signed permutation by short operations. More precisely, we study four flavors of this problem: (i) the problem of sorting a signed permutation by reversals of length at most 2; (ii) the problem of sorting a signed permutation by reversals of length at most 3; (iii) the problem of sorting a signed permutation by reversals and transpositions of length at most 2; and (iv) the problem of sorting a signed permutation by reversals and transpositions of length at most 3. We present polynomial-time solutions for problems (i) and (iii), a 5-approximation for problem (ii), and a 3-approximation for problem (iv). Moreover, we show that the expected approximation ratio of the 5-approximation algorithm is not greater than 3 for random signed permutations with more than 12 elements. Finally, we present experimental results that show

  16. The synthetic elements

    Energy Technology Data Exchange (ETDEWEB)

    Hoffman, D.C.

    1990-05-01

    Prior to 1940, the heaviest element known was uranium, discovered in 1789. Since that time the elements 93 through 109 have been synthesized and identified and the elements 43, 61, 85, and 87 which were missing form the periodic tables of the 1930's have been discovered. The techniques and problems involved in these discoveries and the placement of the transuranium elements in the periodic table will be discussed. The production and positive identification of elements heavier than Md (Z=101), which have very short half-lives and can only be produced an atom-at-a-time, are very difficult and there have been controversies concerning their discovery. Some of the new methods which have been developed and used in these studies will be described. The prospects for production of still heavier elements will be considered.

  17. The synthetic elements

    International Nuclear Information System (INIS)

    Hoffman, D.C.

    1990-05-01

    Prior to 1940, the heaviest element known was uranium, discovered in 1789. Since that time the elements 93 through 109 have been synthesized and identified and the elements 43, 61, 85, and 87 which were missing form the periodic tables of the 1930's have been discovered. The techniques and problems involved in these discoveries and the placement of the transuranium elements in the periodic table will be discussed. The production and positive identification of elements heavier than Md (Z=101), which have very short half-lives and can only be produced an atom-at-a-time, are very difficult and there have been controversies concerning their discovery. Some of the new methods which have been developed and used in these studies will be described. The prospects for production of still heavier elements will be considered

  18. The recurrence sequences via Sylvester matrices

    Science.gov (United States)

    Karaduman, Erdal; Deveci, Ömür

    2017-07-01

    In this work, we define the Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by using the Slyvester matrices which are obtained from the characteristic polynomials of the Pell and Jacobsthal sequences and then, we study the sequences defined modulo m. Also, we obtain the cyclic groups and the semigroups from the generating matrices of these sequences when read modulo m and then, we derive the relationships among the orders of the cyclic groups and the periods of the sequences. Furthermore, we redefine Pell-Jacobsthal-Slyvester sequence and the Jacobsthal-Pell-Slyvester sequence by means of the elements of the groups and then, we examine them in the finite groups.

  19. Classification of x-ray spectra of 2-3 transitions in the Ne-like and Na-like isoelectronic sequences of the elements from krypton to molybdenum

    International Nuclear Information System (INIS)

    Gordon, H.; Hobby, M.G.; Peacock, N.J.; Cowan, R.D.

    1979-01-01

    Plasmas produced by the laser irradiation of solid targets and in a plasma focus device have been employed as sources of x-ray spectra of the elements Kr(Z = 36)-Mo(Z = 42). The Ne-like isoelectronic sequence has been investigated by comparing observed wavelengths with ab initio atomic structure calculations and isoelectronic interpolation. Previous identifications in Ne-like ions have been extended to krypton, rubidium and strontium. The Na-like satellite structure of these elements has also been studied and a detailed classification of these satellites is presented. (author)

  20. Unveiling Mycoplasma hyopneumoniae Promoters: Sequence Definition and Genomic Distribution

    Science.gov (United States)

    Weber, Shana de Souto; Sant'Anna, Fernando Hayashi; Schrank, Irene Silveira

    2012-01-01

    Several Mycoplasma species have had their genome completely sequenced, including four strains of the swine pathogen Mycoplasma hyopneumoniae. Nevertheless, little is known about the nucleotide sequences that control transcriptional initiation in these microorganisms. Therefore, with the objective of investigating the promoter sequences of M. hyopneumoniae, 23 transcriptional start sites (TSSs) of distinct genes were mapped. A pattern that resembles the σ70 promoter −10 element was found upstream of the TSSs. However, no −35 element was distinguished. Instead, an AT-rich periodic signal was identified. About half of the experimentally defined promoters contained the motif 5′-TRTGn-3′, which was identical to the −16 element usually found in Gram-positive bacteria. The defined promoters were utilized to build position-specific scoring matrices in order to scan putative promoters upstream of all coding sequences (CDSs) in the M. hyopneumoniae genome. Two hundred and one signals were found associated with 169 CDSs. Most of these sequences were located within 100 nucleotides of the start codons. This study has shown that the number of promoter-like sequences in the M. hyopneumoniae genome is more frequent than expected by chance, indicating that most of the sequences detected are probably biologically functional. PMID:22334569

  1. DNA demethylases target promoter transposable elements to positively regulate stress responsive genes in Arabidopsis.

    Science.gov (United States)

    Le, Tuan-Ngoc; Schumann, Ulrike; Smith, Neil A; Tiwari, Sameer; Au, Phil Chi Khang; Zhu, Qian-Hao; Taylor, Jennifer M; Kazan, Kemal; Llewellyn, Danny J; Zhang, Ren; Dennis, Elizabeth S; Wang, Ming-Bo

    2014-09-17

    DNA demethylases regulate DNA methylation levels in eukaryotes. Arabidopsis encodes four DNA demethylases, DEMETER (DME), REPRESSOR OF SILENCING 1 (ROS1), DEMETER-LIKE 2 (DML2), and DML3. While DME is involved in maternal specific gene expression during seed development, the biological function of the remaining DNA demethylases remains unclear. We show that ROS1, DML2, and DML3 play a role in fungal disease resistance in Arabidopsis. A triple DNA demethylase mutant, rdd (ros1 dml2 dml3), shows increased susceptibility to the fungal pathogen Fusarium oxysporum. We identify 348 genes differentially expressed in rdd relative to wild type, and a significant proportion of these genes are downregulated in rdd and have functions in stress response, suggesting that DNA demethylases maintain or positively regulate the expression of stress response genes required for F. oxysporum resistance. The rdd-downregulated stress response genes are enriched for short transposable element sequences in their promoters. Many of these transposable elements and their surrounding sequences show localized DNA methylation changes in rdd, and a general reduction in CHH methylation, suggesting that RNA-directed DNA methylation (RdDM), responsible for CHH methylation, may participate in DNA demethylase-mediated regulation of stress response genes. Many of the rdd-downregulated stress response genes are downregulated in the RdDM mutants nrpd1 and nrpe1, and the RdDM mutants nrpe1 and ago4 show enhanced susceptibility to F. oxysporum infection. Our results suggest that a primary function of DNA demethylases in plants is to regulate the expression of stress response genes by targeting promoter transposable element sequences.

  2. kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets

    Science.gov (United States)

    Fletez-Brant, Christopher; Lee, Dongwon; McCallion, Andrew S.; Beer, Michael A.

    2013-01-01

    Massively parallel sequencing technologies have made the generation of genomic data sets a routine component of many biological investigations. For example, Chromatin immunoprecipitation followed by sequence assays detect genomic regions bound (directly or indirectly) by specific factors, and DNase-seq identifies regions of open chromatin. A major bottleneck in the interpretation of these data is the identification of the underlying DNA sequence code that defines, and ultimately facilitates prediction of, these transcription factor (TF) bound or open chromatin regions. We have recently developed a novel computational methodology, which uses a support vector machine (SVM) with kmer sequence features (kmer-SVM) to identify predictive combinations of short transcription factor-binding sites, which determine the tissue specificity of these genomic assays (Lee, Karchin and Beer, Discriminative prediction of mammalian enhancers from DNA sequence. Genome Res. 2011; 21:2167–80). This regulatory information can (i) give confidence in genomic experiments by recovering previously known binding sites, and (ii) reveal novel sequence features for subsequent experimental testing of cooperative mechanisms. Here, we describe the development and implementation of a web server to allow the broader research community to independently apply our kmer-SVM to analyze and interpret their genomic datasets. We analyze five recently published data sets and demonstrate how this tool identifies accessory factors and repressive sequence elements. kmer-SVM is available at http://kmersvm.beerlab.org. PMID:23771147

  3. Chunk concatenation evolves with practice and sleep-related enhancement consolidation in a complex arm movement sequence

    Directory of Open Access Journals (Sweden)

    Blischke Klaus

    2016-06-01

    Full Text Available This paper addresses the notion of chunk concatenation being associated with sleep-related enhancement consolidation of motor sequence memory, thereby essentially contributing to improvements in sequence execution speed. To this end, element movement times of a multi-joint arm movement sequence incorporated in a recent study by Malangré et al. (2014 were reanalyzed. As sequence elements differed with respect to movement distance, element movement times had to be purged from differences solely due to varying trajectory lengths. This was done by dividing each element movement time per subject and trial block by the respective “reference movement time” collected from subjects who had extensively practiced each sequence element in isolation. Any differences in these “relative element movement times” were supposed to reflect element-specific “production costs” imposed solely by the sequence context. Across all subjects non-idiosyncratic, lasting sequence segmentation was shown, and four possible concatenation points (i.e. transition points between successive chunks within the original arm movement sequence were identified. Based on theoretical suppositions derived from previous work with the discrete sequence production task and the dual processor model (Abrahamse et al., 2013, significantly larger improvements in transition speed occurring at these four concatenation points as compared to the five fastest transition positions within the sequence (associated with mere element execution were assumed to indicate increased chunk concatenation. As a result, chunk concatenation was shown to proceed during acquisition with physical practice, and, most importantly, to significantly progress some more during retention following a night of sleep, but not during a waking interval.

  4. Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

    Directory of Open Access Journals (Sweden)

    Guo Xiang

    2008-12-01

    Full Text Available Abstract Background Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know

  5. Analysis of trace elements in the shells of short-necked clam Ruditapes philippinarum (Mollusca: Bivalvia) with respect to reconstruction of individual life history

    International Nuclear Information System (INIS)

    Arakawa, Jumpei; Sakamoto, Wataru

    1998-01-01

    Strontium (Sr) concentration in the shells of short-necked clams collected at different locations (Shirahama, warm area and Maizuru, cold area, Japan) was analyzed by two methods, PIXE and EPMA. The Sr concentration of external surface of shell umbo, which was made during short term at early benthic phase, was analyzed by PIXE, and was ranged from 1000 to 3500 ppm for individuals. The Sr concentration of clams collected at Shirahama showed positive correlation with shell length (SL) in individuals with SL < 31 mm, whereas clams collected at Maizuru did not show significant correlation. This result may be caused from the difference of the spawning seasons between two areas. The Sr concentration of cross section of shell umbo, which develops thicker continuously during their life to form faint stratum structure, was analyzed by EPMA along the line across the stratum structure. Some surges and long term waving patterns of the Sr concentration were observed. These results suggest that the life histories of individual clams could be recorded in the shell umbo cross sections as variations of trace elements and analyses of trace elements could clarify the histories of individual clams. (author)

  6. Two estrogen response element sequences near the PCNA gene are not responsible for its estrogen-enhanced expression in MCF7 cells.

    Directory of Open Access Journals (Sweden)

    Cheng Wang

    Full Text Available The proliferating cell nuclear antigen (PCNA is an essential component of DNA replication, cell cycle regulation, and epigenetic inheritance. High expression of PCNA is associated with poor prognosis in patients with breast cancer. The 5'-region of the PCNA gene contains two computationally-detected estrogen response element (ERE sequences, one of which is evolutionarily conserved. Both of these sequences are of undocumented cis-regulatory function. We recently demonstrated that estradiol (E2 enhances PCNA mRNA expression in MCF7 breast cancer cells. MCF7 cells proliferate in response to E2.Here, we demonstrate that E2 rapidly enhanced PCNA mRNA and protein expression in a process that requires ERalpha as well as de novo protein synthesis. One of the two upstream ERE sequences was specifically bound by ERalpha-containing protein complexes, in vitro, in gel shift analysis. Yet, each ERE sequence, when cloned as a single copy, or when engineered as two tandem copies of the ERE-containing sequence, was not capable of activating a luciferase reporter construct in response to E2. In MCF7 cells, neither ERE-containing genomic region demonstrated E2-dependent recruitment of ERalpha by sensitive ChIP-PCR assays.We conclude that E2 enhances PCNA gene expression by an indirect process and that computational detection of EREs, even when evolutionarily conserved and when near E2-responsive genes, requires biochemical validation.

  7. Polyadenylated Sequencing Primers Enable Complete Readability of PCR Amplicons Analyzed by Dideoxynucleotide Sequencing

    Directory of Open Access Journals (Sweden)

    Martin Beránek

    2012-01-01

    Full Text Available Dideoxynucleotide DNA sequencing is one of the principal procedures in molecular biology. Loss of an initial part of nucleotides behind the 3' end of the sequencing primer limits the readability of sequenced amplicons. We present a method which extends the readability by using sequencing primers modified by polyadenylated tails attached to their 5' ends. Performing a polymerase chain reaction, we amplified eight amplicons of six human genes (AMELX, APOE, HFE, MBL2, SERPINA1 and TGFB1 ranging from 106 bp to 680 bp. Polyadenylation of the sequencing primers minimized the loss of bases in all amplicons. Complete sequences of shorter products (AMELX 106 bp, SERPINA1 121 bp, HFE 208 bp, APOE 244 bp, MBL2 317 bp were obtained. In addition, in the case of TGFB1 products (366 bp, 432 bp, and 680 bp, respectively, the lengths of sequencing readings were significantly longer if adenylated primers were used. Thus, single strand dideoxynucleotide sequencing with adenylated primers enables complete or near complete readability of short PCR amplicons.

  8. Unified Deep Learning Architecture for Modeling Biology Sequence.

    Science.gov (United States)

    Wu, Hongjie; Cao, Chengyuan; Xia, Xiaoyan; Lu, Qiang

    2017-10-09

    Prediction of the spatial structure or function of biological macromolecules based on their sequence remains an important challenge in bioinformatics. When modeling biological sequences using traditional sequencing models, characteristics, such as long-range interactions between basic units, the complicated and variable output of labeled structures, and the variable length of biological sequences, usually lead to different solutions on a case-by-case basis. This study proposed the use of bidirectional recurrent neural networks based on long short-term memory or a gated recurrent unit to capture long-range interactions by designing the optional reshape operator to adapt to the diversity of the output labels and implementing a training algorithm to support the training of sequence models capable of processing variable-length sequences. Additionally, the merge and pooling operators enhanced the ability to capture short-range interactions between basic units of biological sequences. The proposed deep-learning model and its training algorithm might be capable of solving currently known biological sequence-modeling problems through the use of a unified framework. We validated our model on one of the most difficult biological sequence-modeling problems currently known, with our results indicating the ability of the model to obtain predictions of protein residue interactions that exceeded the accuracy of current popular approaches by 10% based on multiple benchmarks.

  9. Isotopes a very short introduction

    CERN Document Server

    Ellam, Rob

    2016-01-01

    An isotope is a variant form of a chemical element, containing a different number of neutrons in its nucleus. Most elements exist as several isotopes. Many are stable while others are radioactive, and some may only exist fleetingly before decaying into other elements. In this Very Short Introduction, Rob Ellam explains how isotopes have proved enormously important across all the sciences and in archaeology. Radioactive isotopes may be familiar from their use in nuclear weapons, nuclear power, and in medicine, as well as in carbon dating. They have been central to establishing the age of the Earth and the origins of the solar system. Combining previous and new research, Ellam provides an overview of the nature of stable and radioactive isotopes, and considers their wide range of modern applications. ABOUT THE SERIES: The Very Short Introductions series from Oxford University Press contains hundreds of titles in almost every subject area. These pocket-sized books are the perfect way to get ahead in a new subjec...

  10. Sequence conservation and combinatorial complexity of Drosophila neural precursor cell enhancers

    Directory of Open Access Journals (Sweden)

    Kuzin Alexander

    2008-08-01

    Full Text Available Abstract Background The presence of highly conserved sequences within cis-regulatory regions can serve as a valuable starting point for elucidating the basis of enhancer function. This study focuses on regulation of gene expression during the early events of Drosophila neural development. We describe the use of EvoPrinter and cis-Decoder, a suite of interrelated phylogenetic footprinting and alignment programs, to characterize highly conserved sequences that are shared among co-regulating enhancers. Results Analysis of in vivo characterized enhancers that drive neural precursor gene expression has revealed that they contain clusters of highly conserved sequence blocks (CSBs made up of shorter shared sequence elements which are present in different combinations and orientations within the different co-regulating enhancers; these elements contain either known consensus transcription factor binding sites or consist of novel sequences that have not been functionally characterized. The CSBs of co-regulated enhancers share a large number of sequence elements, suggesting that a diverse repertoire of transcription factors may interact in a highly combinatorial fashion to coordinately regulate gene expression. We have used information gained from our comparative analysis to discover an enhancer that directs expression of the nervy gene in neural precursor cells of the CNS and PNS. Conclusion The combined use EvoPrinter and cis-Decoder has yielded important insights into the combinatorial appearance of fundamental sequence elements required for neural enhancer function. Each of the 30 enhancers examined conformed to a pattern of highly conserved blocks of sequences containing shared constituent elements. These data establish a basis for further analysis and understanding of neural enhancer function.

  11. Representation of the quantum Fourier transform on multilevel basic elements by a sequence of selective rotation operators

    Science.gov (United States)

    Ermilov, A. S.; Zobov, V. E.

    2007-12-01

    To experimentally realize quantum computations on d-level basic elements (qudits) at d > 2, it is necessary to develop schemes for the technical realization of elementary logical operators. We have found sequences of selective rotation operators that represent the operators of the quantum Fourier transform (Walsh-Hadamard matrices) for d = 3-10. For the prime numbers 3, 5, and 7, the well-known method of linear algebra is applied, whereas, for the factorable numbers 6, 9, and 10, the representation of virtual spins is used (which we previously applied for d = 4, 8). Selective rotations can be realized, for example, by means of pulses of an RF magnetic field for systems of quadrupole nuclei or laser pulses for atoms and ions in traps.

  12. Phylogeny based discovery of regulatory elements

    Directory of Open Access Journals (Sweden)

    Cohen Barak A

    2006-05-01

    Full Text Available Abstract Background Algorithms that locate evolutionarily conserved sequences have become powerful tools for finding functional DNA elements, including transcription factor binding sites; however, most methods do not take advantage of an explicit model for the constrained evolution of functional DNA sequences. Results We developed a probabilistic framework that combines an HKY85 model, which assigns probabilities to different base substitutions between species, and weight matrix models of transcription factor binding sites, which describe the probabilities of observing particular nucleotides at specific positions in the binding site. The method incorporates the phylogenies of the species under consideration and takes into account the position specific variation of transcription factor binding sites. Using our framework we assessed the suitability of alignments of genomic sequences from commonly used species as substrates for comparative genomic approaches to regulatory motif finding. We then applied this technique to Saccharomyces cerevisiae and related species by examining all possible six base pair DNA sequences (hexamers and identifying sequences that are conserved in a significant number of promoters. By combining similar conserved hexamers we reconstructed known cis-regulatory motifs and made predictions of previously unidentified motifs. We tested one prediction experimentally, finding it to be a regulatory element involved in the transcriptional response to glucose. Conclusion The experimental validation of a regulatory element prediction missed by other large-scale motif finding studies demonstrates that our approach is a useful addition to the current suite of tools for finding regulatory motifs.

  13. A Gaijin-like miniature inverted repeat transposable element is mobilized in rice during cell differentiation

    Directory of Open Access Journals (Sweden)

    Dong Hai-Tao

    2012-04-01

    Full Text Available Abstract Background Miniature inverted repeat transposable element (MITE is one type of transposable element (TE, which is largely found in eukaryotic genomes and involved in a wide variety of biological events. However, only few MITEs were proved to be currently active and their physiological function remains largely unknown. Results We found that the amplicon discrepancy of a gene locus LOC_Os01g0420 in different rice cultivar genomes was resulted from the existence of a member of Gaijin-like MITEs (mGing. This result indicated that mGing transposition was occurred at this gene locus. By using a modified transposon display (TD analysis, the active transpositions of mGing were detected in rice Jiahua No. 1 genome under three conditions: in seedlings germinated from the seeds received a high dose γ-ray irradiation, in plantlets regenerated from anther-derived calli and from scutellum-derived calli, and were confirmed by PCR validation and sequencing. Sequence analysis revealed that single nucleotide polymorphisms (SNPs or short additional DNA sequences at transposition sites post mGing transposition. It suggested that sequence modification was possibly taken place during mGing transposition. Furthermore, cell re-differentiation experiment showed that active transpositions of both mGing and mPing (another well studied MITE were identified only in regenerated plantlets. Conclusions It is for the first time that mGing active transposition was demonstrated under γ-ray irradiation or in cell re-differentiation process in rice. This newly identified active MITE will provide a foundation for further analysis of the roles of MITEs in biological process.

  14. "I know your name, but not your number"--Patients with verbal short-term memory deficits are impaired in learning sequences of digits.

    Science.gov (United States)

    Bormann, Tobias; Seyboth, Margret; Umarova, Roza; Weiller, Cornelius

    2015-06-01

    Studies on verbal learning in patients with impaired verbal short-term memory (vSTM) have revealed dissociations among types of verbal information. Patients with impaired vSTM are able to learn lists of known words but fail to acquire new word forms. This suggests that vSTM is involved in new word learning. The present study assessed both new word learning and the learning of digit sequences in two patients with impaired vSTM. In two experiments, participants were required to learn people's names, ages and professions, or their four digit 'phone numbers'. The STM patients were impaired on learning unknown family names and phone numbers, but managed to acquire other verbal information. In contrast, a patient with a severe verbal episodic memory impairment was impaired across information types. These results indicate verbal STM involvement in the learning of digit sequences. Copyright © 2015 Elsevier Ltd. All rights reserved.

  15. Photoelectric elements of the eclipsing binary XY Ceti

    International Nuclear Information System (INIS)

    Srivastava, R.K.; Padalia, T.D.

    1975-01-01

    The absolute elements of the system XY Ceti have been obtained on the basis of the spectroscopic elements given by Popper (1971) and the photoelectric elements derived previously. The colours of the components have been obtained. Both components are found to lie fairly on the Main Sequence. The primary component of the system, however, is slightly more evolved as it shows a tendency to drift away from the Main Sequence. The spectral classes now assigned are A5V (primary) and A7V (secondary). The values of Roche constants indicate that the system is a detached one. (Auth.)

  16. Genomic Sequence of a Ranavirus Isolated from Short-Finned Eel (Anguilla australis)

    DEFF Research Database (Denmark)

    Subramaniam, Kuttichantran; Toffan, Anna; Cappellozza, Elisabetta

    2016-01-01

    The short-finned eel ranavirus (SERV) was isolated from short-finned eel imported to Italy from New Zealand. Phylogenomic analyses revealed that SERV is a unique member of the genus Ranavirus, family Iridoviridae, branching at the base of the tree near other fish ranaviruses....

  17. 'Sleeping reactor' irradiations. The use of a shut-down reactor for the determination of elements with short-lived activation products

    International Nuclear Information System (INIS)

    Jerde, E.A.; Oak Ridge National Laboratory, TN; Glasgow, D.C.

    1999-01-01

    Neutron activation analysis utilizing the High Flux Isotope Reactor (HFIR) immediately following SCRAM is a workable solution to obtaining data for ultra-short lived species, principally Al, Ti, Mg, and V. Neutrons are produced in the HFIR core within the beryllium reflector due to gamma-ray bombardment from the spent fuel elements. This neutron flux is not constant, varying by over two orders of magnitude during the first 24 hours. The problems associated with irradiation in a changing neutron flux are removed through the use of a specially tailored activation equation. This activation equation is applicable to any irradiation at HFIR in the firs 24 hours after SCRAM since the fuel elements are identical from cycle to cycle, and the gamma-emitting nuclides responsible for the neutrons reach saturation during the fuel cycle. Reference material tests demonstrate that this method is successful, and detection limit estimates reveal that it should be applicable to materials of widely ranging mass and composition. (author)

  18. Design of Long Period Pseudo-Random Sequences from the Addition of m -Sequences over 𝔽 p

    Directory of Open Access Journals (Sweden)

    Ren Jian

    2004-01-01

    Full Text Available Pseudo-random sequence with good correlation property and large linear span is widely used in code division multiple access (CDMA communication systems and cryptology for reliable and secure information transmission. In this paper, sequences with long period, large complexity, balance statistics, and low cross-correlation property are constructed from the addition of m -sequences with pairwise-prime linear spans (AMPLS. Using m -sequences as building blocks, the proposed method proved to be an efficient and flexible approach to construct long period pseudo-random sequences with desirable properties from short period sequences. Applying the proposed method to 𝔽 2 , a signal set ( ( 2 n − 1 ( 2 m − 1 , ( 2 n + 1 ( 2 m + 1 , ( 2 ( n + 1 / 2 + 1 ( 2 ( m + 1 / 2 + 1 is constructed.

  19. Short interspersed DNA elements and miRNAs: a novel hidden gene regulation layer in zebrafish?

    Science.gov (United States)

    Scarpato, Margherita; Angelini, Claudia; Cocca, Ennio; Pallotta, Maria M; Morescalchi, Maria A; Capriglione, Teresa

    2015-09-01

    In this study, we investigated by in silico analysis the possible correlation between microRNAs (miRNAs) and Anamnia V-SINEs (a superfamily of short interspersed nuclear elements), which belong to those retroposon families that have been preserved in vertebrate genomes for millions of years and are actively transcribed because they are embedded in the 3' untranslated region (UTR) of several genes. We report the results of the analysis of the genomic distribution of these mobile elements in zebrafish (Danio rerio) and discuss their involvement in generating miRNA gene loci. The computational study showed that the genes predicted to bear V-SINEs can be targeted by miRNAs with a very high hybridization E-value. Gene ontology analysis indicates that these genes are mainly involved in metabolic, membrane, and cytoplasmic signaling pathways. Nearly all the miRNAs that were predicted to target the V-SINEs of these genes, i.e., miR-338, miR-9, miR-181, miR-724, miR-735, and miR-204, have been validated in similar regulatory roles in mammals. The large number of genes bearing a V-SINE involved in metabolic and cellular processes suggests that V-SINEs may play a role in modulating cell responses to different stimuli and in preserving the metabolic balance during cell proliferation and differentiation. Although they need experimental validation, these preliminary results suggest that in the genome of D. rerio, as in other TE families in vertebrates, the preservation of V-SINE retroposons may also have been favored by their putative role in gene network modulation.

  20. Short-range order in alloys of nickel with the elements of group VIII of the periodic table

    International Nuclear Information System (INIS)

    Khwaja, F.A.

    1981-08-01

    Experimental measurements of the diffuse X-ray scattering intensity were performed on alloys of Ni with Rh and Os. The atomic short-range order (SRO) parameters αsub(i) and the size-effect parameters βsub(i) were calculated from these measurements. It is established that SRO and size-effect exist in Ni-Rh and Ni-Os alloys analogously as in a few other alloys of Ni with the elements of group VIII of the periodic table. The experimental data was interpreted theoretically by calculating the interaction energies from the pseudo-potentials and the effective valencies of the individual components of the systems studied. It was found that theoretically calculated values of the interaction energies for these alloys are inconsistent with the experimentally determined sign of the SRO parameter. (author)

  1. Google matrix analysis of DNA sequences.

    Science.gov (United States)

    Kandiah, Vivek; Shepelyansky, Dima L

    2013-01-01

    For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  2. Google matrix analysis of DNA sequences.

    Directory of Open Access Journals (Sweden)

    Vivek Kandiah

    Full Text Available For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW. At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  3. Dynamic SPR monitoring of yeast nuclear protein binding to a cis-regulatory element

    International Nuclear Information System (INIS)

    Mao, Grace; Brody, James P.

    2007-01-01

    Gene expression is controlled by protein complexes binding to short specific sequences of DNA, called cis-regulatory elements. Expression of most eukaryotic genes is controlled by dozens of these elements. Comprehensive identification and monitoring of these elements is a major goal of genomics. In pursuit of this goal, we are developing a surface plasmon resonance (SPR) based assay to identify and monitor cis-regulatory elements. To test whether we could reliably monitor protein binding to a regulatory element, we immobilized a 16 bp region of Saccharomyces cerevisiae chromosome 5 onto a gold surface. This 16 bp region of DNA is known to bind several proteins and thought to control expression of the gene RNR1, which varies through the cell cycle. We synchronized yeast cell cultures, and then sampled these cultures at a regular interval. These samples were processed to purify nuclear lysate, which was then exposed to the sensor. We found that nuclear protein binds this particular element of DNA at a significantly higher rate (as compared to unsynchronized cells) during G1 phase. Other time points show levels of DNA-nuclear protein binding similar to the unsynchronized control. We also measured the apparent association complex of the binding to be 0.014 s -1 . We conclude that (1) SPR-based assays can monitor DNA-nuclear protein binding and that (2) for this particular cis-regulatory element, maximum DNA-nuclear protein binding occurs during G1 phase

  4. Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum

    Science.gov (United States)

    2011-01-01

    Background Parasitoid insects manipulate their hosts' physiology by injecting various factors into their host upon parasitization. Transcriptomic approaches provide a powerful approach to study insect host-parasitoid interactions at the molecular level. In order to investigate the effects of parasitization by an ichneumonid wasp (Diadegma semiclausum) on the host (Plutella xylostella), the larval transcriptome profile was analyzed using a short-read deep sequencing method (Illumina). Symbiotic polydnaviruses (PDVs) associated with ichneumonid parasitoids, known as ichnoviruses, play significant roles in host immune suppression and developmental regulation. In the current study, D. semiclausum ichnovirus (DsIV) genes expressed in P. xylostella were identified and their sequences compared with other reported PDVs. Five of these genes encode proteins of unknown identity, that have not previously been reported. Results De novo assembly of cDNA sequence data generated 172,660 contigs between 100 and 10000 bp in length; with 35% of > 200 bp in length. Parasitization had significant impacts on expression levels of 928 identified insect host transcripts. Gene ontology data illustrated that the majority of the differentially expressed genes are involved in binding, catalytic activity, and metabolic and cellular processes. In addition, the results show that transcription levels of antimicrobial peptides, such as gloverin, cecropin E and lysozyme, were up-regulated after parasitism. Expression of ichnovirus genes were detected in parasitized larvae with 19 unique sequences identified from five PDV gene families including vankyrin, viral innexin, repeat elements, a cysteine-rich motif, and polar residue rich protein. Vankyrin 1 and repeat element 1 genes showed the highest transcription levels among the DsIV genes. Conclusion This study provides detailed information on differential expression of P. xylostella larval genes following parasitization, DsIV genes expressed in the

  5. Novel porcine repetitive elements

    Directory of Open Access Journals (Sweden)

    Nonneman Dan J

    2006-12-01

    Full Text Available Abstract Background Repetitive elements comprise ~45% of mammalian genomes and are increasingly known to impact genomic function by contributing to the genomic architecture, by direct regulation of gene expression and by affecting genomic size, diversity and evolution. The ubiquity and increasingly understood importance of repetitive elements contribute to the need to identify and annotate them. We set out to identify previously uncharacterized repetitive DNA in the porcine genome. Once found, we characterized the prevalence of these repeats in other mammals. Results We discovered 27 repetitive elements in 220 BACs covering 1% of the porcine genome (Comparative Vertebrate Sequencing Initiative; CVSI. These repeats varied in length from 55 to 1059 nucleotides. To estimate copy numbers, we went to an independent source of data, the BAC-end sequences (Wellcome Trust Sanger Institute, covering approximately 15% of the porcine genome. Copy numbers in BAC-ends were less than one hundred for 6 repeat elements, between 100 and 1000 for 16 and between 1,000 and 10,000 for 5. Several of the repeat elements were found in the bovine genome and we have identified two with orthologous sites, indicating that these elements were present in their common ancestor. None of the repeat elements were found in primate, rodent or dog genomes. We were unable to identify any of the replication machinery common to active transposable elements in these newly identified repeats. Conclusion The presence of both orthologous and non-orthologous sites indicates that some sites existed prior to speciation and some were generated later. The identification of low to moderate copy number repetitive DNA that is specific to artiodactyls will be critical in the assembly of livestock genomes and studies of comparative genomics.

  6. DNA Fingerprinting of Lactobacillus crispatus Strain CTV-05 by Repetitive Element Sequence-Based PCR Analysis in a Pilot Study of Vaginal Colonization

    OpenAIRE

    Antonio, May A. D.; Hillier, Sharon L.

    2003-01-01

    Lactobacillus crispatus is one of the predominant hydrogen peroxide (H2O2)-producing species found in the vagina and is under development as a probiotic for the treatment of bacterial vaginosis. In this study, we assessed whether DNA fingerprinting by repetitive element sequence-based PCR (rep-PCR) can be used to distinguish the capsule strain of L. crispatus (CTV-05) from other endogenous strains as well as other species of vaginal lactobacilli. Vaginal and rectal lactobacilli were identifie...

  7. Why do probabilistic finite element analysis ?

    CERN Document Server

    Thacker, Ben H

    2008-01-01

    The intention of this book is to provide an introduction to performing probabilistic finite element analysis. As a short guideline, the objective is to inform the reader of the use, benefits and issues associated with performing probabilistic finite element analysis without excessive theory or mathematical detail.

  8. Conceptual problems with remote element synthesis

    Indian Academy of Sciences (India)

    The notion of remote element synthesis has recently been modified to explain the presence of nucleogenetic isotopic anomalies and decay products of short-lived nuclides by injection of a small amount of exotic nucleogenetic material. Even with this modification, remote element synthesis seems inconsistent with the ...

  9. Intra-species sequence comparisons for annotating genomes

    Energy Technology Data Exchange (ETDEWEB)

    Boffelli, Dario; Weer, Claire V.; Weng, Li; Lewis, Keith D.; Shoukry, Malak I.; Pachter, Lior; Keys, David N.; Rubin, Edward M.

    2004-07-15

    Analysis of sequence variation among members of a single species offers a potential approach to identify functional DNA elements responsible for biological features unique to that species. Due to its high rate of allelic polymorphism and ease of genetic manipulability, we chose the sea squirt, Ciona intestinalis, to explore intra-species sequence comparisons for genome annotation. A large number of C. intestinalis specimens were collected from four continents and a set of genomic intervals amplified, resequenced and analyzed to determine the mutation rates at each nucleotide in the sequence. We found that regions with low mutation rates efficiently demarcated functionally constrained sequences: these include a set of noncoding elements, which we showed in C intestinalis transgenic assays to act as tissue-specific enhancers, as well as the location of coding sequences. This illustrates that comparisons of multiple members of a species can be used for genome annotation, suggesting a path for the annotation of the sequenced genomes of organisms occupying uncharacterized phylogenetic branches of the animal kingdom and raises the possibility that the resequencing of a large number of Homo sapiens individuals might be used to annotate the human genome and identify sequences defining traits unique to our species. The sequence data from this study has been submitted to GenBank under accession nos. AY667278-AY667407.

  10. Sphene and zircon in the Highland Range volcanic sequence (Miocene, southern Nevada, USA): Elemental partitioning, phase relations, and influence on evolution of silicic magma

    Science.gov (United States)

    Colombini, L.L.; Miller, C.F.; Gualda, G.A.R.; Wooden, J.L.; Miller, J.S.

    2011-01-01

    Sphene is prominent in Miocene plutonic rocks ranging from diorite to granite in southern Nevada, USA, but it is restricted to rhyolites in coeval volcanic sequences. In the Highland Range volcanic sequence, sphene appears as a phenocryst only in the most evolved rocks (72-77 mass% SiO2; matrix glass 77-78 mass% SiO2). Zr-in-sphene temperatures of crystallization are mostly restricted to 715 and 755??C, in contrast to zircon (710-920??C, Ti-in-zircon thermometry). Sphene rim/glass Kds for rare earth elements are extremely high (La 120, Sm 1200, Gd 1300, Lu 240). Rare earth elements, especially the middle REE (MREE), decrease from centers to rims of sphene phenocrysts along with Zr, demonstrating the effect of progressive sphene fractionation. Whole rocks and glasses have MREE-depleted, U-shaped REE patterns as a consequence of sphene fractionation. Within the co-genetic, sphene-rich Searchlight pluton, only evolved leucogranites show comparable MREE depletion. These results indicate that sphene saturation in intruded and extruded magmas occurred only in highly evolved melts: abundant sphene in less silicic plutonic rocks represents a late-stage 'bloom' in fractionated interstitial melt. ?? 2011 Springer-Verlag.

  11. Abundant and diverse clustered regularly interspaced short palindromic repeat spacers in Clostridium difficile strains and prophages target multiple phage types within this pathogen.

    Science.gov (United States)

    Hargreaves, Katherine R; Flores, Cesar O; Lawley, Trevor D; Clokie, Martha R J

    2014-08-26

    Clostridium difficile is an important human-pathogenic bacterium causing antibiotic-associated nosocomial infections worldwide. Mobile genetic elements and bacteriophages have helped shape C. difficile genome evolution. In many bacteria, phage infection may be controlled by a form of bacterial immunity called the clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR/Cas) system. This uses acquired short nucleotide sequences (spacers) to target homologous sequences (protospacers) in phage genomes. C. difficile carries multiple CRISPR arrays, and in this paper we examine the relationships between the host- and phage-carried elements of the system. We detected multiple matches between spacers and regions in 31 C. difficile phage and prophage genomes. A subset of the spacers was located in prophage-carried CRISPR arrays. The CRISPR spacer profiles generated suggest that related phages would have similar host ranges. Furthermore, we show that C. difficile strains of the same ribotype could either have similar or divergent CRISPR contents. Both synonymous and nonsynonymous mutations in the protospacer sequences were identified, as well as differences in the protospacer adjacent motif (PAM), which could explain how phages escape this system. This paper illustrates how the distribution and diversity of CRISPR spacers in C. difficile, and its prophages, could modulate phage predation for this pathogen and impact upon its evolution and pathogenicity. Clostridium difficile is a significant bacterial human pathogen which undergoes continual genome evolution, resulting in the emergence of new virulent strains. Phages are major facilitators of genome evolution in other bacterial species, and we use sequence analysis-based approaches in order to examine whether the CRISPR/Cas system could control these interactions across divergent C. difficile strains. The presence of spacer sequences in prophages that are homologous to phage genomes raises an

  12. Sequence elements controlling expression of Barley stripe mosaic virus subgenomic RNAs in vivo

    International Nuclear Information System (INIS)

    Johnson, Jennifer A.; Bragg, Jennifer N.; Lawrence, Diane M.; Jackson, Andrew O.

    2003-01-01

    Barley stripe mosaic virus (BSMV) contains three positive-sense, single-stranded genomic RNAs, designated α, β, and γ, that encode seven major proteins and one minor translational readthrough protein. Three proteins (αa, βa, and γa) are translated directly from the genomic RNAs and the remaining proteins encoded on RNAβ and RNAγ are expressed via three subgenomic messenger RNAs (sgRNAs). sgRNAβ1 directs synthesis of the triple gene block 1 (TGB1) protein. The TGB2 protein, the TGB2' minor translational readthrough protein, and the TGB3 protein are expressed from sgRNAβ2, which is present in considerably lower abundance than sgRNAβ1. A third sgRNA, sgRNAγ, is required for expression of the γb protein. We have used deletion analyses and site-specific mutations to define the boundaries of promoter regions that are critical for expression of the BSMV sgRNAs in infected protoplasts. The results reveal that the sgRNAβ1 promoter encompasses positions -29 to -2 relative to its transcription start site and is adjacent to a cis-acting element required for RNAβ replication that maps from -107 to -74 relative to the sgRNAβ1 start site. The core sgRNAβ2 promoter includes residues -32 to -17 relative to the sgRNAβ2 transcriptional start site, although maximal activity requires an upstream hexanucleotide sequence residing from positions -64 to -59. The sgRNAγ promoter maps from -21 to +2 relative to its transcription start site and therefore partially overlaps the γa gene. The sgRNAβ1, β2, and γ promoters also differ substantially in sequence, but have similarities to the putative homologous promoters of other Hordeiviruses. These differences are postulated to affect competition for the viral polymerase, coordination of the temporal expression and abundance of the TGB proteins, and constitutive expression of the γb protein

  13. De Novo Assembly of Human Herpes Virus Type 1 (HHV-1) Genome, Mining of Non-Canonical Structures and Detection of Novel Drug-Resistance Mutations Using Short- and Long-Read Next Generation Sequencing Technologies.

    Science.gov (United States)

    Karamitros, Timokratis; Harrison, Ian; Piorkowska, Renata; Katzourakis, Aris; Magiorkinis, Gkikas; Mbisa, Jean Lutamyo

    2016-01-01

    Human herpesvirus type 1 (HHV-1) has a large double-stranded DNA genome of approximately 152 kbp that is structurally complex and GC-rich. This makes the assembly of HHV-1 whole genomes from short-read sequencing data technically challenging. To improve the assembly of HHV-1 genomes we have employed a hybrid genome assembly protocol using data from two sequencing technologies: the short-read Roche 454 and the long-read Oxford Nanopore MinION sequencers. We sequenced 18 HHV-1 cell culture-isolated clinical specimens collected from immunocompromised patients undergoing antiviral therapy. The susceptibility of the samples to several antivirals was determined by plaque reduction assay. Hybrid genome assembly resulted in a decrease in the number of contigs in 6 out of 7 samples and an increase in N(G)50 and N(G)75 of all 7 samples sequenced by both technologies. The approach also enhanced the detection of non-canonical contigs including a rearrangement between the unique (UL) and repeat (T/IRL) sequence regions of one sample that was not detectable by assembly of 454 reads alone. We detected several known and novel resistance-associated mutations in UL23 and UL30 genes. Genome-wide genetic variability ranged from genomes will be useful in determining genetic determinants of drug resistance, virulence, pathogenesis and viral evolution. The numerous, complex repeat regions of the HHV-1 genome currently remain a barrier towards this goal.

  14. Investigating Effects of Screen Layout Elements on Interface and Screen Design Aesthetics

    OpenAIRE

    Altaboli, Ahamed; Lin, Yingzi

    2011-01-01

    A recent study suggested the use of the screen layout elements of balance, unity, and sequence as a part of a computational model of interface aesthetics. It is argued that these three elements are the most contributed terms in the model. In the current study, a controlled experiment was designed and conducted to systematically investigate effects of these three elements (balance, unity, and sequence) on the perceived interface aesthetics. Results showed that the three elements have signific...

  15. Questioning short-term memory and its measurement: Why digit span measures long-term associative learning.

    Science.gov (United States)

    Jones, Gary; Macken, Bill

    2015-11-01

    Traditional accounts of verbal short-term memory explain differences in performance for different types of verbal material by reference to inherent characteristics of the verbal items making up memory sequences. The role of previous experience with sequences of different types is ostensibly controlled for either by deliberate exclusion or by presenting multiple trials constructed from different random permutations. We cast doubt on this general approach in a detailed analysis of the basis for the robust finding that short-term memory for digit sequences is superior to that for other sequences of verbal material. Specifically, we show across four experiments that this advantage is not due to inherent characteristics of digits as verbal items, nor are individual digits within sequences better remembered than other types of individual verbal items. Rather, the advantage for digit sequences stems from the increased frequency, compared to other verbal material, with which digits appear in random sequences in natural language, and furthermore, relatively frequent digit sequences support better short-term serial recall than less frequent ones. We also provide corpus-based computational support for the argument that performance in a short-term memory setting is a function of basic associative learning processes operating on the linguistic experience of the rememberer. The experimental and computational results raise questions not only about the role played by measurement of digit span in cognition generally, but also about the way in which long-term memory processes impact on short-term memory functioning. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.

  16. A simple, flexible and efficient PCR-fusion/Gateway cloning procedure for gene fusion, site-directed mutagenesis, short sequence insertion and domain deletions and swaps

    Directory of Open Access Journals (Sweden)

    Etchells J Peter

    2009-10-01

    Full Text Available Abstract Background The progress and completion of various plant genome sequencing projects has paved the way for diverse functional genomic studies that involve cloning, modification and subsequent expression of target genes. This requires flexible and efficient procedures for generating binary vectors containing: gene fusions, variants from site-directed mutagenesis, addition of protein tags together with domain swaps and deletions. Furthermore, efficient cloning procedures, ideally high throughput, are essential for pyramiding of multiple gene constructs. Results Here, we present a simple, flexible and efficient PCR-fusion/Gateway cloning procedure for construction of binary vectors for a range of gene fusions or variants with single or multiple nucleotide substitutions, short sequence insertions, domain deletions and swaps. Results from selected applications of the procedure which include ORF fusion, introduction of Cys>Ser mutations, insertion of StrepII tag sequence and domain swaps for Arabidopsis secondary cell wall AtCesA genes are demonstrated. Conclusion The PCR-fusion/Gateway cloning procedure described provides an elegant, simple and efficient solution for a wide range of diverse and complicated cloning tasks. Through streamlined cloning of sets of gene fusions and modification variants into binary vectors for systematic functional studies of gene families, our method allows for efficient utilization of the growing sequence and expression data.

  17. Petroleum system elements within the Late Cretaceous and Early Paleogene sediments of Nigeria's inland basins: An integrated sequence stratigraphic approach

    Science.gov (United States)

    Dim, Chidozie Izuchukwu Princeton; Onuoha, K. Mosto; Okeugo, Chukwudike Gabriel; Ozumba, Bertram Maduka

    2017-06-01

    Sequence stratigraphic studies have been carried out using subsurface well and 2D seismic data in the Late Cretaceous and Early Paleogene sediments of Anambra and proximal onshore section of Niger Delta Basin in the Southeastern Nigeria. The aim was to establish the stratigraphic framework for better understanding of the reservoir, source and seal rock presence and distribution in the basin. Thirteen stratigraphic bounding surfaces (consisting of six maximum flooding surfaces - MFSs and seven sequence boundaries - SBs) were recognized and calibrated using a newly modified chronostratigraphic chart. Stratigraphic surfaces were matched with corresponding foraminiferal and palynological biozones, aiding correlation across wells in this study. Well log sequence stratigraphic correlation reveals that stratal packages within the basin are segmented into six depositional sequences occurring from Late Cretaceous to Early Paleogene age. Generated gross depositional environment maps at various MFSs show that sediment packages deposited within shelfal to deep marine settings, reflect continuous rise and fall of sea levels within a regressive cycle. Each of these sequences consist of three system tracts (lowstand system tract - LST, transgressive system tract - TST and highstand system tract - HST) that are associated with mainly progradational and retrogradational sediment stacking patterns. Well correlation reveals that the sand and shale units of the LSTs, HSTs and TSTs, that constitute the reservoir and source/seal packages respectively are laterally continuous and thicken basinwards, due to structural influences. Result from interpretation of seismic section reveals the presence of hanging wall, footwall, horst block and collapsed crest structures. These structural features generally aid migration and offer entrapment mechanism for hydrocarbon accumulation. The combination of these reservoirs, sources, seals and trap elements form a good petroleum system that is viable

  18. Mixed Sequence Reader: A Program for Analyzing DNA Sequences with Heterozygous Base Calling

    Science.gov (United States)

    Chang, Chun-Tien; Tsai, Chi-Neu; Tang, Chuan Yi; Chen, Chun-Houh; Lian, Jang-Hau; Hu, Chi-Yu; Tsai, Chia-Lung; Chao, Angel; Lai, Chyong-Huey; Wang, Tzu-Hao; Lee, Yun-Shien

    2012-01-01

    The direct sequencing of PCR products generates heterozygous base-calling fluorescence chromatograms that are useful for identifying single-nucleotide polymorphisms (SNPs), insertion-deletions (indels), short tandem repeats (STRs), and paralogous genes. Indels and STRs can be easily detected using the currently available Indelligent or ShiftDetector programs, which do not search reference sequences. However, the detection of other genomic variants remains a challenge due to the lack of appropriate tools for heterozygous base-calling fluorescence chromatogram data analysis. In this study, we developed a free web-based program, Mixed Sequence Reader (MSR), which can directly analyze heterozygous base-calling fluorescence chromatogram data in .abi file format using comparisons with reference sequences. The heterozygous sequences are identified as two distinct sequences and aligned with reference sequences. Our results showed that MSR may be used to (i) physically locate indel and STR sequences and determine STR copy number by searching NCBI reference sequences; (ii) predict combinations of microsatellite patterns using the Federal Bureau of Investigation Combined DNA Index System (CODIS); (iii) determine human papilloma virus (HPV) genotypes by searching current viral databases in cases of double infections; (iv) estimate the copy number of paralogous genes, such as β-defensin 4 (DEFB4) and its paralog HSPDP3. PMID:22778697

  19. GIF Video Sentiment Detection Using Semantic Sequence

    Directory of Open Access Journals (Sweden)

    Dazhen Lin

    2017-01-01

    Full Text Available With the development of social media, an increasing number of people use short videos in social media applications to express their opinions and sentiments. However, sentiment detection of short videos is a very challenging task because of the semantic gap problem and sequence based sentiment understanding problem. In this context, we propose a SentiPair Sequence based GIF video sentiment detection approach with two contributions. First, we propose a Synset Forest method to extract sentiment related semantic concepts from WordNet to build a robust SentiPair label set. This approach considers the semantic gap between label words and selects a robust label subset which is related to sentiment. Secondly, we propose a SentiPair Sequence based GIF video sentiment detection approach that learns the semantic sequence to understand the sentiment from GIF videos. Our experiment results on GSO-2016 (GIF Sentiment Ontology data show that our approach not only outperforms four state-of-the-art classification methods but also shows better performance than the state-of-the-art middle level sentiment ontology features, Adjective Noun Pairs (ANPs.

  20. Cross-species functionality of pararetroviral elements driving ribosome shunting.

    Directory of Open Access Journals (Sweden)

    Mikhail M Pooggin

    Full Text Available BACKGROUND: Cauliflower mosaic virus (CaMV and Rice tungro bacilliform virus (RTBV belong to distinct genera of pararetroviruses infecting dicot and monocot plants, respectively. In both viruses, polycistronic translation of pregenomic (pg RNA is initiated by shunting ribosomes that bypass a large region of the pgRNA leader with several short (sORFs and a stable stem-loop structure. The shunt requires translation of a 5'-proximal sORF terminating near the stem. In CaMV, mutations knocking out this sORF nearly abolish shunting and virus viability. METHODOLOGY/PRINCIPAL FINDINGS: Here we show that two distant regions of the CaMV leader that form a minimal shunt configuration comprising the sORF, a bottom part of the stem, and a shunt landing sequence can be replaced by heterologous sequences that form a structurally similar configuration in RTBV without any dramatic effect on shunt-mediated translation and CaMV infectivity. The CaMV-RTBV chimeric leader sequence was largely stable over five viral passages in turnip plants: a few alterations that did eventually occur in the virus progenies are indicative of fine tuning of the chimeric sequence during adaptation to a new host. CONCLUSIONS/SIGNIFICANCE: Our findings demonstrate cross-species functionality of pararetroviral cis-elements driving ribosome shunting and evolutionary conservation of the shunt mechanism. We are grateful to Matthias Müller and Sandra Pauli for technical assistance. This work was initiated at Friedrich Miescher Institute (Basel, Switzerland. We thank Prof. Thomas Boller for hosting the group at the Institute of Botany.

  1. Short-term memory stores organized by information domain.

    Science.gov (United States)

    Noyce, Abigail L; Cestero, Nishmar; Shinn-Cunningham, Barbara G; Somers, David C

    2016-04-01

    Vision and audition have complementary affinities, with vision excelling in spatial resolution and audition excelling in temporal resolution. Here, we investigated the relationships among the visual and auditory modalities and spatial and temporal short-term memory (STM) using change detection tasks. We created short sequences of visual or auditory items, such that each item within a sequence arose at a unique spatial location at a unique time. On each trial, two successive sequences were presented; subjects attended to either space (the sequence of locations) or time (the sequence of inter item intervals) and reported whether the patterns of locations or intervals were identical. Each subject completed blocks of unimodal trials (both sequences presented in the same modality) and crossmodal trials (Sequence 1 visual, Sequence 2 auditory, or vice versa) for both spatial and temporal tasks. We found a strong interaction between modality and task: Spatial performance was best on unimodal visual trials, whereas temporal performance was best on unimodal auditory trials. The order of modalities on crossmodal trials also mattered, suggesting that perceptual fidelity at encoding is critical to STM. Critically, no cost was attributable to crossmodal comparison: In both tasks, performance on crossmodal trials was as good as or better than on the weaker unimodal trials. STM representations of space and time can guide change detection in either the visual or the auditory modality, suggesting that the temporal or spatial organization of STM may supersede sensory-specific organization.

  2. Short-term memory stores organized by information domain

    Science.gov (United States)

    Noyce, Abigail L.; Cestero, Nishmar; Shinn-Cunningham, Barbara G.; Somers, David C.

    2016-01-01

    Vision and audition have complementary affinities, with vision excelling in spatial resolution and audition excelling in temporal resolution. Here, we investigate the relationships among visual and auditory modalities and spatial and temporal short-term memory (STM) using change detection tasks. We created short sequences of visual or auditory items, such that each item within a sequence arose at a unique spatial location at a unique time. On each trial, two successive sequences were presented; subjects attended to either space (the sequence of locations), or time (the sequence of inter-item intervals), and reported whether the patterns of locations or intervals were identical. Each subject completed blocks of unimodal trials (both sequences presented in the same modality) and crossmodal trials (sequence 1 visual and sequence 2 auditory, or vice versa) for both spatial and temporal tasks. We found a strong interaction between modality and task: spatial performance was best on unimodal visual trials, while temporal performance was best on unimodal auditory trials. The order of modalities on crossmodal trials also mattered, suggesting that perceptual fidelity at encoding is critical to STM. Critically, there was no cost attributable to crossmodal comparison: in both tasks, performance on crossmodal trials was as good or better than on the weaker unimodal trials. STM representations of space and time can guide change detection in either the visual or the auditory modality, suggesting that temporal or spatial organization of STM may supersede sensory-specific organization. PMID:26791231

  3. The genomic landscape shaped by selection on transposable elements across 18 mouse strains.

    Science.gov (United States)

    Nellåker, Christoffer; Keane, Thomas M; Yalcin, Binnaz; Wong, Kim; Agam, Avigail; Belgard, T Grant; Flint, Jonathan; Adams, David J; Frankel, Wayne N; Ponting, Chris P

    2012-06-15

    Transposable element (TE)-derived sequence dominates the landscape of mammalian genomes and can modulate gene function by dysregulating transcription and translation. Our current knowledge of TEs in laboratory mouse strains is limited primarily to those present in the C57BL/6J reference genome, with most mouse TEs being drawn from three distinct classes, namely short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs) and the endogenous retrovirus (ERV) superfamily. Despite their high prevalence, the different genomic and gene properties controlling whether TEs are preferentially purged from, or are retained by, genetic drift or positive selection in mammalian genomes remain poorly defined. Using whole genome sequencing data from 13 classical laboratory and 4 wild-derived mouse inbred strains, we developed a comprehensive catalogue of 103,798 polymorphic TE variants. We employ this extensive data set to characterize TE variants across the Mus lineage, and to infer neutral and selective processes that have acted over 2 million years. Our results indicate that the majority of TE variants are introduced though the male germline and that only a minority of TE variants exert detectable changes in gene expression. However, among genes with differential expression across the strains there are twice as many TE variants identified as being putative causal variants as expected. Most TE variants that cause gene expression changes appear to be purged rapidly by purifying selection. Our findings demonstrate that past TE insertions have often been highly deleterious, and help to prioritize TE variants according to their likely contribution to gene expression or phenotype variation.

  4. Facilitating genome navigation : survey sequencing and dense radiation-hybrid gene mapping

    NARCIS (Netherlands)

    Hitte, C; Madeoy, J; Kirkness, EF; Priat, C; Lorentzen, TD; Senger, F; Thomas, D; Derrien, T; Ramirez, C; Scott, C; Evanno, G; Pullar, B; Cadieu, E; Oza, [No Value; Lourgant, K; Jaffe, DB; Tacher, S; Dreano, S; Berkova, N; Andre, C; Deloukas, P; Fraser, C; Lindblad-Toh, K; Ostrander, EA; Galibert, F

    Accurate and comprehensive sequence coverage for large genomes has been restricted to only a few species of specific interest. Lower sequence coverage (survey sequencing) of related species can yield a wealth of information about gene content and putative regulatory elements. But survey sequences

  5. Mammalian small nucleolar RNAs are mobile genetic elements.

    Directory of Open Access Journals (Sweden)

    Michel J Weber

    2006-12-01

    Full Text Available Small nucleolar RNAs (snoRNAs of the H/ACA box and C/D box categories guide the pseudouridylation and the 2'-O-ribose methylation of ribosomal RNAs by forming short duplexes with their target. Similarly, small Cajal body-specific RNAs (scaRNAs guide modifications of spliceosomal RNAs. The vast majority of vertebrate sno/scaRNAs are located in introns of genes transcribed by RNA polymerase II and processed by exonucleolytic trimming after splicing. A bioinformatic search for orthologues of human sno/scaRNAs in sequenced mammalian genomes reveals the presence of species- or lineage-specific sno/scaRNA retroposons (sno/scaRTs characterized by an A-rich tail and an approximately 14-bp target site duplication that corresponds to their insertion site, as determined by interspecific genomic alignments. Three classes of snoRTs are defined based on the extent of intron and exon sequences from the snoRNA parental host gene they contain. SnoRTs frequently insert in gene introns in the sense orientation at genomic hot spots shared with other genetic mobile elements. Previously characterized human snoRNAs are encoded in retroposons whose parental copies can be identified by phylogenic analysis, showing that snoRTs can be faithfully processed. These results identify snoRNAs as a new family of mobile genetic elements. The insertion of new snoRNA copies might constitute a safeguard mechanism by which the biological activity of snoRNAs is maintained in spite of the risk of mutations in the parental copy. I furthermore propose that retroposition followed by genetic drift is a mechanism that increased snoRNA diversity during vertebrate evolution to eventually acquire new RNA-modification functions.

  6. Probabilistic topic modeling for the analysis and classification of genomic sequences

    Science.gov (United States)

    2015-01-01

    Background Studies on genomic sequences for classification and taxonomic identification have a leading role in the biomedical field and in the analysis of biodiversity. These studies are focusing on the so-called barcode genes, representing a well defined region of the whole genome. Recently, alignment-free techniques are gaining more importance because they are able to overcome the drawbacks of sequence alignment techniques. In this paper a new alignment-free method for DNA sequences clustering and classification is proposed. The method is based on k-mers representation and text mining techniques. Methods The presented method is based on Probabilistic Topic Modeling, a statistical technique originally proposed for text documents. Probabilistic topic models are able to find in a document corpus the topics (recurrent themes) characterizing classes of documents. This technique, applied on DNA sequences representing the documents, exploits the frequency of fixed-length k-mers and builds a generative model for a training group of sequences. This generative model, obtained through the Latent Dirichlet Allocation (LDA) algorithm, is then used to classify a large set of genomic sequences. Results and conclusions We performed classification of over 7000 16S DNA barcode sequences taken from Ribosomal Database Project (RDP) repository, training probabilistic topic models. The proposed method is compared to the RDP tool and Support Vector Machine (SVM) classification algorithm in a extensive set of trials using both complete sequences and short sequence snippets (from 400 bp to 25 bp). Our method reaches very similar results to RDP classifier and SVM for complete sequences. The most interesting results are obtained when short sequence snippets are considered. In these conditions the proposed method outperforms RDP and SVM with ultra short sequences and it exhibits a smooth decrease of performance, at every taxonomic level, when the sequence length is decreased. PMID:25916734

  7. Short segment search method for phylogenetic analysis using nested sliding windows

    Science.gov (United States)

    Iskandar, A. A.; Bustamam, A.; Trimarsanto, H.

    2017-10-01

    To analyze phylogenetics in Bioinformatics, coding DNA sequences (CDS) segment is needed for maximal accuracy. However, analysis by CDS cost a lot of time and money, so a short representative segment by CDS, which is envelope protein segment or non-structural 3 (NS3) segment is necessary. After sliding window is implemented, a better short segment than envelope protein segment and NS3 is found. This paper will discuss a mathematical method to analyze sequences using nested sliding window to find a short segment which is representative for the whole genome. The result shows that our method can find a short segment which more representative about 6.57% in topological view to CDS segment than an Envelope segment or NS3 segment.

  8. Fast high-resolution MR imaging using the snapshot-FLASH MR sequence

    International Nuclear Information System (INIS)

    Matthaei, D.; Haase, A.; Henrich, D.; Duhmke, E.

    1990-01-01

    Snapshot, fast low-angle short (FLASH) MR imaging using an accelerated FLASH-MR sequence provides MR images with measuring times far below 1 second. The short TE of this sequence prevents susceptibility artifacts in gradient-echo imaging. In this paper variations of the sequence are shown that provide high resolution images with T1-weighted IR, T2-weighted SE, and chemical shift (CHESS) contrast sequences. METHODS AND MATERIALS: A whole-body 2-T system (Bruker-Medizintechnik) were used in combination with a 60-cm gradient system (providing gradient strength of 5 mT/m) to study healthy volunteers. The measuring time for a 256 x 256 image matrix was 800 msec. This sequence has been used in combination with T1-weighted IR, T2-weighted SE, and CHESS variations

  9. ISVASE: identification of sequence variant associated with splicing event using RNA-seq data.

    Science.gov (United States)

    Aljohi, Hasan Awad; Liu, Wanfei; Lin, Qiang; Yu, Jun; Hu, Songnian

    2017-06-28

    Exon recognition and splicing precisely and efficiently by spliceosome is the key to generate mature mRNAs. About one third or a half of disease-related mutations affect RNA splicing. Software PVAAS has been developed to identify variants associated with aberrant splicing by directly using RNA-seq data. However, it bases on the assumption that annotated splicing site is normal splicing, which is not true in fact. We develop the ISVASE, a tool for specifically identifying sequence variants associated with splicing events (SVASE) by using RNA-seq data. Comparing with PVAAS, our tool has several advantages, such as multi-pass stringent rule-dependent filters and statistical filters, only using split-reads, independent sequence variant identification in each part of splicing (junction), sequence variant detection for both of known and novel splicing event, additional exon-exon junction shift event detection if known splicing events provided, splicing signal evaluation, known DNA mutation and/or RNA editing data supported, higher precision and consistency, and short running time. Using a realistic RNA-seq dataset, we performed a case study to illustrate the functionality and effectiveness of our method. Moreover, the output of SVASEs can be used for downstream analysis such as splicing regulatory element study and sequence variant functional analysis. ISVASE is useful for researchers interested in sequence variants (DNA mutation and/or RNA editing) associated with splicing events. The package is freely available at https://sourceforge.net/projects/isvase/ .

  10. Long-term earthquake forecasts based on the epidemic-type aftershock sequence (ETAS model for short-term clustering

    Directory of Open Access Journals (Sweden)

    Jiancang Zhuang

    2012-07-01

    Full Text Available Based on the ETAS (epidemic-type aftershock sequence model, which is used for describing the features of short-term clustering of earthquake occurrence, this paper presents some theories and techniques related to evaluating the probability distribution of the maximum magnitude in a given space-time window, where the Gutenberg-Richter law for earthquake magnitude distribution cannot be directly applied. It is seen that the distribution of the maximum magnitude in a given space-time volume is determined in the longterm by the background seismicity rate and the magnitude distribution of the largest events in each earthquake cluster. The techniques introduced were applied to the seismicity in the Japan region in the period from 1926 to 2009. It was found that the regions most likely to have big earthquakes are along the Tohoku (northeastern Japan Arc and the Kuril Arc, both with much higher probabilities than the offshore Nankai and Tokai regions.

  11. Repeat-aware modeling and correction of short read errors.

    Science.gov (United States)

    Yang, Xiao; Aluru, Srinivas; Dorman, Karin S

    2011-02-15

    High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id = redeem". We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors

  12. Characterization of Erwinia amylovora strains from different host plants using repetitive-sequences PCR analysis, and restriction fragment length polymorphism and short-sequence DNA repeats of plasmid pEA29.

    Science.gov (United States)

    Barionovi, D; Giorgi, S; Stoeger, A R; Ruppitsch, W; Scortichini, M

    2006-05-01

    The three main aims of the study were the assessment of the genetic relationship between a deviating Erwinia amylovora strain isolated from Amelanchier sp. (Maloideae) grown in Canada and other strains from Maloideae and Rosoideae, the investigation of the variability of the PstI fragment of the pEA29 plasmid using restriction fragment length polymorphism (RFLP) analysis and the determination of the number of short-sequence DNA repeats (SSR) by DNA sequence analysis in representative strains. Ninety-three strains obtained from 12 plant genera and different geographical locations were examined by repetitive-sequences PCR using Enterobacterial Repetitive Intergenic Consensus, BOX and Repetitive Extragenic Palindromic primer sets. Upon the unweighted pair group method with arithmetic mean analysis, a deviating strain from Amelanchier sp. was analysed using amplified ribosomal DNA restriction analysis (ARDRA) analysis and the sequencing of the 16S rDNA gene. This strain showed 99% similarity to other E. amylovora strains in the 16S gene and the same banding pattern with ARDRA. The RFLP analysis of pEA29 plasmid using MspI and Sau3A restriction enzymes showed a higher variability than that previously observed and no clear-cut grouping of the strains was possible. The number of SSR units reiterated two to 12 times. The strains obtained from pear orchards showing for the first time symptoms of fire blight had a low number of SSR units. The strains from Maloideae exhibit a wider genetic variability than previously thought. The RFLP analysis of a fragment of the pEA29 plasmid would not seem a reliable method for typing E. amylovora strains. A low number of SSR units was observed with first epidemics of fire blight. The current detection techniques are mainly based on the genetic similarities observed within the strains from the cultivated tree-fruit crops. For a more reliable detection of the fire blight pathogen also in wild and ornamentals Rosaceous plants the genetic

  13. Short Note DNA sequences from the Little Brown Bustard Eupodotis ...

    African Journals Online (AJOL)

    Taxonomic classification of birds based exclusively on morphology and plumage traits has often been found to be inconsistent with true evolutionary history when tested with molecular phylogenies based on neutrally evolving markers. Here we present cytochrome-b gene sequences for the poorly known Little Brown ...

  14. Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

    Science.gov (United States)

    Šatović, Eva; Plohl, Miroslav

    2017-10-01

    Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.

  15. A low-memory algorithm for finding short product representations in finite groups

    NARCIS (Netherlands)

    Bisson, G.; Sutherland, A.V.

    2012-01-01

    We describe a space-efficient algorithm for solving a generalization of the subset sum problem in a finite group G, using a Pollard-¿ approach. Given an element z and a sequence of elements S, our algorithm attempts to find a subsequence of S whose product in G is equal to z. For a random sequence S

  16. A low-memory algorithm for finding short product representations in finite groups

    NARCIS (Netherlands)

    Bisson, G.; Sutherland, A.V.

    2011-01-01

    We describe a space-efficient algorithm for solving a generalization of the subset sum problem in a finite group G, using a Pollard-rho approach. Given an element z and a sequence of elements S, our algorithm attempts to find a subsequence of S whose product in G is equal to z. For a random sequence

  17. Short echo time, fast gradient-echo imaging

    International Nuclear Information System (INIS)

    Haacke, E.M.; Lenz, G.W.

    1987-01-01

    Present fast-gradient-echoes schemes can acquire volume data rapidly and are flexible in T1 or T1/T2 contrast behavior. However, sequences used to date employ echo time (TE) values of about 15 ms +- 5 and, because of in vivo field inhomogeneities (short T2), they suffer badly from signal loss near sinuses and tissue boundaries. The authors implemented sequences with TE = 4-6 ms and found significant improvement in image quality, especially at high fields. Examples with long TEs vs. short TEs are given in the knee, spine, head, and orbits. Further advantages include (1) faster repetition times (15 ms), (2) higher-quality spin-density or T1-weighted images, and (3) reduction of blood motion artifacts

  18. FDSTools: A software package for analysis of massively parallel sequencing data with the ability to recognise and correct STR stutter and other PCR or sequencing noise.

    Science.gov (United States)

    Hoogenboom, Jerry; van der Gaag, Kristiaan J; de Leeuw, Rick H; Sijen, Titia; de Knijff, Peter; Laros, Jeroen F J

    2017-03-01

    Massively parallel sequencing (MPS) is on the advent of a broad scale application in forensic research and casework. The improved capabilities to analyse evidentiary traces representing unbalanced mixtures is often mentioned as one of the major advantages of this technique. However, most of the available software packages that analyse forensic short tandem repeat (STR) sequencing data are not well suited for high throughput analysis of such mixed traces. The largest challenge is the presence of stutter artefacts in STR amplifications, which are not readily discerned from minor contributions. FDSTools is an open-source software solution developed for this purpose. The level of stutter formation is influenced by various aspects of the sequence, such as the length of the longest uninterrupted stretch occurring in an STR. When MPS is used, STRs are evaluated as sequence variants that each have particular stutter characteristics which can be precisely determined. FDSTools uses a database of reference samples to determine stutter and other systemic PCR or sequencing artefacts for each individual allele. In addition, stutter models are created for each repeating element in order to predict stutter artefacts for alleles that are not included in the reference set. This information is subsequently used to recognise and compensate for the noise in a sequence profile. The result is a better representation of the true composition of a sample. Using Promega Powerseq™ Auto System data from 450 reference samples and 31 two-person mixtures, we show that the FDSTools correction module decreases stutter ratios above 20% to below 3%. Consequently, much lower levels of contributions in the mixed traces are detected. FDSTools contains modules to visualise the data in an interactive format allowing users to filter data with their own preferred thresholds. Copyright © 2016 The Authors. Published by Elsevier B.V. All rights reserved.

  19. [Identification of a repetitive sequence element for DNA fingerprinting in Phytophthora sojae].

    Science.gov (United States)

    Yin, Lihua; Wang, Qinhu; Ning, Feng; Zhu, Xiaoying; Zuo, Yuhu; Shan, Weixing

    2010-04-01

    Establishment of DNA fingerprinting in Phytophthora sojae and an analysis of genetic relationship of Heilongjiang and Xinjiang populations. Bioinformatics tools were used to search repetitive sequences in P. sojae and Southern blot analysis was employed for DNA fingerprinting analysis of P. sojae populations from Heilongjiang and Xinjiang using the identified repetitive sequence. A moderately repetitive sequence was identified and designated as PS1227. Southern blot analysis indicated 34 distinct bands ranging in size from 1.5 kb-23 kb, of which 21 were polymorphic among 49 isolates examined. Analysis of single-zoospore progenies showed that the PS1227 fingerprint pattern was mitotically stable. DNA fingerprinting showed that the P. sojae isolates HP4002, SY6 and GJ0105 of Heilongjiang are genetically identical to DW303, 71228 and 71222 of Xinjiang, respectively. A moderately repetitive sequence designated PS1227 which will be useful for epidemiology and population biology studies of P. sojae was obtained, and a PS1227-based DNA fingerprinting analysis provided molecular evidence that P. sojae in Xinjiang was likely introduced from Heilongjiang.

  20. Fracture propagation through a layered shale and limestone sequence at Nash Point, South Wales: Implications on the development of fracture networks in layered sequences

    Science.gov (United States)

    Forbes Inskip, N.; Meredith, P. G.; Gudmundsson, A.

    2017-12-01

    While considerable effort has been expended on the study of fracture propagation in rocks in recent years, our understanding of how fractures propagate through sedimentary rocks composed of layers with different mechanical and elastic properties remains poor. Yet the mechanical layering is a key parameter controlling the propagation of fractures in sedimentary sequences. Here we report measurements of the contrasting properties of the Lower Lias at Nash Point, South Wales, which comprises a sequence of interbedded shale and limestone layers, and how those properties influence fracture propagation. The static Young's modulus (Estat) of both rock types has been measured parallel and normal to bedding. The shale is highly anisotropic, with Estat varying from 2.4 GPa, in the bedding-normal orientation, to 7.9 GPa, in the bedding-parallel orientation, yielding an anisotropy of 107%. By contrast the limestone has a very low anisotropy of 8%, with Estat values varying from 28.5 GPa, in the bedding-normal orientation, to 26.3 GPa in the bedding-parallel orientation. It follows that for a vertical fracture propagating in this sequence the modulus contrast is by a factor of about 12. This is important because the contrast in elastic properties is a key factor in controlling whether fractures arrest, deflect, or propagate across interfaces between layers in a sequence. Preliminary numerical modelling results (using a finite element modelling software) of induced fractures at Nash Point demonstrate a rotation of the maximum principal compressive stress across interfaces but also the concentration of tensile stress within the more competent (high Estat) limestone layers. The tensile strength (σT), using the Brazil-disk technique, and fracture toughness (KIc), using the semi-circular bend methodology, of both rock types have been measured. Measurements were made in the three principal orientations relative to bedding, Arrester, Divider, and Short-Transverse, and also at 15

  1. Discovery of rare, diagnostic AluYb8/9 elements in diverse human populations.

    Science.gov (United States)

    Feusier, Julie; Witherspoon, David J; Scott Watkins, W; Goubert, Clément; Sasani, Thomas A; Jorde, Lynn B

    2017-01-01

    Polymorphic human Alu elements are excellent tools for assessing population structure, and new retrotransposition events can contribute to disease. Next-generation sequencing has greatly increased the potential to discover Alu elements in human populations, and various sequencing and bioinformatics methods have been designed to tackle the problem of detecting these highly repetitive elements. However, current techniques for Alu discovery may miss rare, polymorphic Alu elements. Combining multiple discovery approaches may provide a better profile of the polymorphic Alu mobilome. Alu Yb8/9 elements have been a focus of our recent studies as they are young subfamilies (~2.3 million years old) that contribute ~30% of recent polymorphic Alu retrotransposition events. Here, we update our ME-Scan methods for detecting Alu elements and apply these methods to discover new insertions in a large set of individuals with diverse ancestral backgrounds. We identified 5,288 putative Alu insertion events, including several hundred novel Alu Yb8/9 elements from 213 individuals from 18 diverse human populations. Hundreds of these loci were specific to continental populations, and 23 non-reference population-specific loci were validated by PCR. We provide high-quality sequence information for 68 rare Alu Yb8/9 elements, of which 11 have hallmarks of an active source element. Our subfamily distribution of rare Alu Yb8/9 elements is consistent with previous datasets, and may be representative of rare loci. We also find that while ME-Scan and low-coverage, whole-genome sequencing (WGS) detect different Alu elements in 41 1000 Genomes individuals, the two methods yield similar population structure results. Current in-silico methods for Alu discovery may miss rare, polymorphic Alu elements. Therefore, using multiple techniques can provide a more accurate profile of Alu elements in individuals and populations. We improved our false-negative rate as an indicator of sample quality for future

  2. RNA-Mediated Gene Duplication and Retroposons: Retrogenes, LINEs, SINEs, and Sequence Specificity

    Science.gov (United States)

    2013-01-01

    A substantial number of “retrogenes” that are derived from the mRNA of various intron-containing genes have been reported. A class of mammalian retroposons, long interspersed element-1 (LINE1, L1), has been shown to be involved in the reverse transcription of retrogenes (or processed pseudogenes) and non-autonomous short interspersed elements (SINEs). The 3′-end sequences of various SINEs originated from a corresponding LINE. As the 3′-untranslated regions of several LINEs are essential for retroposition, these LINEs presumably require “stringent” recognition of the 3′-end sequence of the RNA template. However, the 3′-ends of mammalian L1s do not exhibit any similarity to SINEs, except for the presence of 3′-poly(A) repeats. Since the 3′-poly(A) repeats of L1 and Alu SINE are critical for their retroposition, L1 probably recognizes the poly(A) repeats, thereby mobilizing not only Alu SINE but also cytosolic mRNA. Many flowering plants only harbor L1-clade LINEs and a significant number of SINEs with poly(A) repeats, but no homology to the LINEs. Moreover, processed pseudogenes have also been found in flowering plants. I propose that the ancestral L1-clade LINE in the common ancestor of green plants may have recognized a specific RNA template, with stringent recognition then becoming relaxed during the course of plant evolution. PMID:23984183

  3. Structural elements design manual

    CERN Document Server

    Draycott, Trevor

    2012-01-01

    Gives clear explanations of the logical design sequence for structural elements. The Structural Engineer says: `The book explains, in simple terms, and with many examples, Code of Practice methods for sizing structural sections in timber, concrete,masonry and steel. It is the combination into one book of section sizing methods in each of these materials that makes this text so useful....Students will find this an essential support text to the Codes of Practice in their study of element sizing'.

  4. Identification of optimum sequencing depth especially for de novo genome assembly of small genomes using next generation sequencing data.

    Science.gov (United States)

    Desai, Aarti; Marwah, Veer Singh; Yadav, Akshay; Jha, Vineet; Dhaygude, Kishor; Bangar, Ujwala; Kulkarni, Vivek; Jere, Abhay

    2013-01-01

    Next Generation Sequencing (NGS) is a disruptive technology that has found widespread acceptance in the life sciences research community. The high throughput and low cost of sequencing has encouraged researchers to undertake ambitious genomic projects, especially in de novo genome sequencing. Currently, NGS systems generate sequence data as short reads and de novo genome assembly using these short reads is computationally very intensive. Due to lower cost of sequencing and higher throughput, NGS systems now provide the ability to sequence genomes at high depth. However, currently no report is available highlighting the impact of high sequence depth on genome assembly using real data sets and multiple assembly algorithms. Recently, some studies have evaluated the impact of sequence coverage, error rate and average read length on genome assembly using multiple assembly algorithms, however, these evaluations were performed using simulated datasets. One limitation of using simulated datasets is that variables such as error rates, read length and coverage which are known to impact genome assembly are carefully controlled. Hence, this study was undertaken to identify the minimum depth of sequencing required for de novo assembly for different sized genomes using graph based assembly algorithms and real datasets. Illumina reads for E.coli (4.6 MB) S.kudriavzevii (11.18 MB) and C.elegans (100 MB) were assembled using SOAPdenovo, Velvet, ABySS, Meraculous and IDBA-UD. Our analysis shows that 50X is the optimum read depth for assembling these genomes using all assemblers except Meraculous which requires 100X read depth. Moreover, our analysis shows that de novo assembly from 50X read data requires only 6-40 GB RAM depending on the genome size and assembly algorithm used. We believe that this information can be extremely valuable for researchers in designing experiments and multiplexing which will enable optimum utilization of sequencing as well as analysis resources.

  5. Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Raquel S Linheiro

    Full Text Available Transposable elements are mobile DNA sequences that integrate into host genomes using diverse mechanisms with varying degrees of target site specificity. While the target site preferences of some engineered transposable elements are well studied, the natural target preferences of most transposable elements are poorly characterized. Using population genomic resequencing data from 166 strains of Drosophila melanogaster, we identified over 8,000 new insertion sites not present in the reference genome sequence that we used to decode the natural target preferences of 22 families of transposable element in this species. We found that terminal inverted repeat transposon and long terminal repeat retrotransposon families present clade-specific target site duplications and target site sequence motifs. Additionally, we found that the sequence motifs at transposable element target sites are always palindromes that extend beyond the target site duplication. Our results demonstrate the utility of population genomics data for high-throughput inference of transposable element targeting preferences in the wild and establish general rules for terminal inverted repeat transposon and long terminal repeat retrotransposon target site selection in eukaryotic genomes.

  6. Short-range correlations with pseudopotentials

    International Nuclear Information System (INIS)

    Osman, A.

    1976-01-01

    Short-range correlations in nuclei are considered on an unitary-model operator approach. Short-range pseudopotentials have been added to achieve healing in the correlated wave functions. With the introduction of the pseudopotentials, correlated basis wave functions are constructed. The matrix element for effective interaction in nuclei is developed. The required pseudopotentials have been calculated for the Hamda-Johnston, Yale and Reid potentials and for the nuclear nucleon-nucleon potential A calculated by us according to meson exchange between nucleons. (Osman, A.)

  7. A unified architecture of transcriptional regulatory elements

    DEFF Research Database (Denmark)

    Andersson, Robin; Sandelin, Albin Gustav; Danko, Charles G.

    2015-01-01

    Gene expression is precisely controlled in time and space through the integration of signals that act at gene promoters and gene-distal enhancers. Classically, promoters and enhancers are considered separate classes of regulatory elements, often distinguished by histone modifications. However...... and enhancers are considered a single class of functional element, with a unified architecture for transcription initiation. The context of interacting regulatory elements and the surrounding sequences determine local transcriptional output as well as the enhancer and promoter activities of individual elements....

  8. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Directory of Open Access Journals (Sweden)

    Yu-Chih Tsai

    2016-02-01

    Full Text Available Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation.

  9. Resolving the Complexity of Human Skin Metagenomes Using Single-Molecule Sequencing

    Science.gov (United States)

    Tsai, Yu-Chih; Deming, Clayton; Segre, Julia A.; Kong, Heidi H.; Korlach, Jonas

    2016-01-01

    ABSTRACT Deep metagenomic shotgun sequencing has emerged as a powerful tool to interrogate composition and function of complex microbial communities. Computational approaches to assemble genome fragments have been demonstrated to be an effective tool for de novo reconstruction of genomes from these communities. However, the resultant “genomes” are typically fragmented and incomplete due to the limited ability of short-read sequence data to assemble complex or low-coverage regions. Here, we use single-molecule, real-time (SMRT) sequencing to reconstruct a high-quality, closed genome of a previously uncharacterized Corynebacterium simulans and its companion bacteriophage from a skin metagenomic sample. Considerable improvement in assembly quality occurs in hybrid approaches incorporating short-read data, with even relatively small amounts of long-read data being sufficient to improve metagenome reconstruction. Using short-read data to evaluate strain variation of this C. simulans in its skin community at single-nucleotide resolution, we observed a dominant C. simulans strain with moderate allelic heterozygosity throughout the population. We demonstrate the utility of SMRT sequencing and hybrid approaches in metagenome quantitation, reconstruction, and annotation. PMID:26861018

  10. Dielectric strength test to protection elements for live lines works

    Directory of Open Access Journals (Sweden)

    Carlos Eduardo Pinto-Salamanca

    2017-06-01

    Full Text Available This paper presents the design and assembly of a system of tests of sustained voltage to elements and equipment used in live line maneuvers through tests on gloves and dielectric rods, as these are the first points of contact to ensure safe operations. It means an advance for the creation of a laboratory certified in this type of tests at Universidad Pedagógica y Tecnológica de Colombia (UPTC Faculty of Duitama, considering that currently there are not laboratories that provide this service in Boyacá and Casanare. Dielectric strength tests were performed on personal protection elements and equipment under the parameters of ASTM D120, ASTM F496, ISO 60903, ASTM-F711 and IEEE 978, developing an assembly for testing gloves and dielectric rods with voltage levels up to 15 kV. The results validate the proposed system to outlook of circuit design and implementation, where tests were performed to establish dielectric capacities, in operating under open circuit conditions, with resistive load or short circuit. The compliance with the regulations established under the test sequences of safety parameters for the system and the follow-up to the tests was verified through the use of a management system for the generation of concepts of approval or rejection of the tested elements.

  11. Transposon fingerprinting using low coverage whole genome shotgun sequencing in cacao (Theobroma cacao L.) and related species.

    Science.gov (United States)

    Sveinsson, Saemundur; Gill, Navdeep; Kane, Nolan C; Cronk, Quentin

    2013-07-24

    Transposable elements (TEs) and other repetitive elements are a large and dynamically evolving part of eukaryotic genomes, especially in plants where they can account for a significant proportion of genome size. Their dynamic nature gives them the potential for use in identifying and characterizing crop germplasm. However, their repetitive nature makes them challenging to study using conventional methods of molecular biology. Next generation sequencing and new computational tools have greatly facilitated the investigation of TE variation within species and among closely related species. (i) We generated low-coverage Illumina whole genome shotgun sequencing reads for multiple individuals of cacao (Theobroma cacao) and related species. These reads were analysed using both an alignment/mapping approach and a de novo (graph based clustering) approach. (ii) A standard set of ultra-conserved orthologous sequences (UCOS) standardized TE data between samples and provided phylogenetic information on the relatedness of samples. (iii) The mapping approach proved highly effective within the reference species but underestimated TE abundance in interspecific comparisons relative to the de novo methods. (iv) Individual T. cacao accessions have unique patterns of TE abundance indicating that the TE composition of the genome is evolving actively within this species. (v) LTR/Gypsy elements are the most abundant, comprising c.10% of the genome. (vi) Within T. cacao the retroelement families show an order of magnitude greater sequence variability than the DNA transposon families. (vii) Theobroma grandiflorum has a similar TE composition to T. cacao, but the related genus Herrania is rather different, with LTRs making up a lower proportion of the genome, perhaps because of a massive presence (c. 20%) of distinctive low complexity satellite-like repeats in this genome. (i) Short read alignment/mapping to reference TE contigs provides a simple and effective method of investigating

  12. Coval: improving alignment quality and variant calling accuracy for next-generation sequencing data.

    Directory of Open Access Journals (Sweden)

    Shunichi Kosugi

    Full Text Available Accurate identification of DNA polymorphisms using next-generation sequencing technology is challenging because of a high rate of sequencing error and incorrect mapping of reads to reference genomes. Currently available short read aligners and DNA variant callers suffer from these problems. We developed the Coval software to improve the quality of short read alignments. Coval is designed to minimize the incidence of spurious alignment of short reads, by filtering mismatched reads that remained in alignments after local realignment and error correction of mismatched reads. The error correction is executed based on the base quality and allele frequency at the non-reference positions for an individual or pooled sample. We demonstrated the utility of Coval by applying it to simulated genomes and experimentally obtained short-read data of rice, nematode, and mouse. Moreover, we found an unexpectedly large number of incorrectly mapped reads in 'targeted' alignments, where the whole genome sequencing reads had been aligned to a local genomic segment, and showed that Coval effectively eliminated such spurious alignments. We conclude that Coval significantly improves the quality of short-read sequence alignments, thereby increasing the calling accuracy of currently available tools for SNP and indel identification. Coval is available at http://sourceforge.net/projects/coval105/.

  13. Insertion sequences as variability generators in the Mycoplasma hyopneumoniae and M. synoviae genomes

    Directory of Open Access Journals (Sweden)

    Elgion Lúcio Silva Loreto

    2007-01-01

    Full Text Available We have analyzed the sequenced genomes of three strains of Mycoplasma hyopneumoniae and one strain of M. synoviae, and have found three and two different transposable element families, respectively in each species. In M. hyopneumoniae, the Insertion Sequences of the IS4 family is represented by ISMHp1, a putatively active element. The IS3 family is represented by several degenerated sequences. A third element called tMH was found, which shows some characteristics reminiscent of retrotransposons. In M. synoviae, three different possibly active IS4 elements are present (ISMHp1-like; ISMs1 and IS1634-like elements. The IS30 family is represented by the degenerated IS1630-like element. The IS1634-like element is shown to be involved in chromosomal rearrangements and horizontal gene transfer (HGT. The ISMHp1-like element is shown to relate to the HGT of a 25-kb region from M. gallisepticum to M. synoviae. The fractions of these genomes that correspond to mobile elements varied from 1.35 to 3.13% in M. hyopneumonia strains and was 2.08% in M. synoviae. Although these species possess reduced genomes, they maintain mobile elements, perhaps as a mechanism for genetic variability production.

  14. An Evolutionary Machine Learning Framework for Big Data Sequence Mining

    Science.gov (United States)

    Kamath, Uday Krishna

    2014-01-01

    Sequence classification is an important problem in many real-world applications. Unlike other machine learning data, there are no "explicit" features or signals in sequence data that can help traditional machine learning algorithms learn and predict from the data. Sequence data exhibits inter-relationships in the elements that are…

  15. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  16. Structure of a conjugative element in Streptococcus pneumoniae

    Energy Technology Data Exchange (ETDEWEB)

    Vijayakumar, M.N.; Priebe, S.D.; Guild, W.R.

    1986-06-01

    The authors have cloned and mapped a 69-kilobase (kb) region of the chromosome of Streptococcus pneumoniae DP1322, which carries the conjugative Omega(cat-tet) insertion from S. pneumoniae BM6001. This element proved to be 65.5 kb in size. Location of the junctions was facilitated by cloning a preferred target region from the wild-type strain Rx1 recipient genome. This target site was preferred by both the BM6001 element and the cat-erm-tet element from Streptococcus agalactiae B109. Within the BM6001 element cat and tet were separated by 30 kb, and cat was flanked by two copies of a sequence that was also present in the recipient strain Rx1 DNA. Another sequence at least 2.4 kb in size was found inside the BM6001 element and at two places in the Rx1 genome. Its role is unknown. The ends of the BM6001 element appear to be the same as those of the B109 element, both as seen after transfer to S. pneumoniae and as mapped by others in pDP5 after transposition in Streptococcus faecalis. No homology is seen between the ends of the BM6001 element and no evidence found suggesting that it ever circularizes.

  17. A MapReduce Framework for DNA Sequencing Data Processing

    Directory of Open Access Journals (Sweden)

    Samy Ghoneimy

    2016-12-01

    Full Text Available Genomics and Next Generation Sequencers (NGS like Illumina Hiseq produce data in the order of ‎‎200 billion base pairs in a single one-week run for a 60x human genome coverage, which ‎requires modern high-throughput experimental technologies that can ‎only be tackled with high performance computing (HPC and specialized software algorithms called ‎‎“short read aligners”. This paper focuses on the implementation of the DNA sequencing as a set of MapReduce programs that will accept a DNA data set as a FASTQ file and finally generate a VCF (variant call format file, which has variants for a given DNA data set. In this paper MapReduce/Hadoop along with Burrows-Wheeler Aligner (BWA, Sequence Alignment/Map (SAM ‎tools, are fully utilized to provide various utilities for manipulating alignments, including sorting, merging, indexing, ‎and generating alignments. The Map-Sort-Reduce process is designed to be suited for a Hadoop framework in ‎which each cluster is a traditional N-node Hadoop cluster to utilize all of the Hadoop features like HDFS, program ‎management and fault tolerance. The Map step performs multiple instances of the short read alignment algorithm ‎‎(BoWTie that run in parallel in Hadoop. The ordered list of the sequence reads are used as input tuples and the ‎output tuples are the alignments of the short reads. In the Reduce step many parallel instances of the Short ‎Oligonucleotide Analysis Package for SNP (SOAPsnp algorithm run in the cluster. Input tuples are sorted ‎alignments for a partition and the output tuples are SNP calls. Results are stored via HDFS, and then archived in ‎SOAPsnp format. ‎ The proposed framework enables extremely fast discovering somatic mutations, inferring population genetical ‎parameters, and performing association tests directly based on sequencing data without explicit genotyping or ‎linkage-based imputation. It also demonstrate that this method achieves comparable

  18. Distinct electrophysiological indices of maintenance in auditory and visual short-term memory.

    Science.gov (United States)

    Lefebvre, Christine; Vachon, François; Grimault, Stephan; Thibault, Jennifer; Guimond, Synthia; Peretz, Isabelle; Zatorre, Robert J; Jolicœur, Pierre

    2013-11-01

    We compared the electrophysiological correlates for the maintenance of non-musical tones sequences in auditory short-term memory (ASTM) to those for the short-term maintenance of sequences of coloured disks held in visual short-term memory (VSTM). The visual stimuli yielded a sustained posterior contralateral negativity (SPCN), suggesting that the maintenance of sequences of coloured stimuli engaged structures similar to those involved in the maintenance of simultaneous visual displays. On the other hand, maintenance of acoustic sequences produced a sustained negativity at fronto-central sites. This component is named the Sustained Anterior Negativity (SAN). The amplitude of the SAN increased with increasing load in ASTM and predicted individual differences in the performance. There was no SAN in a control condition with the same auditory stimuli but no memory task, nor one associated with visual memory. These results suggest that the SAN is an index of brain activity related to the maintenance of representations in ASTM that is distinct from the maintenance of representations in VSTM. © 2013 Elsevier Ltd. All rights reserved.

  19. High throughput sequencing and proteomics to identify immunogenic proteins of a new pathogen: the dirty genome approach.

    Science.gov (United States)

    Greub, Gilbert; Kebbi-Beghdadi, Carole; Bertelli, Claire; Collyn, François; Riederer, Beat M; Yersin, Camille; Croxatto, Antony; Raoult, Didier

    2009-12-23

    With the availability of new generation sequencing technologies, bacterial genome projects have undergone a major boost. Still, chromosome completion needs a costly and time-consuming gap closure, especially when containing highly repetitive elements. However, incomplete genome data may be sufficiently informative to derive the pursued information. For emerging pathogens, i.e. newly identified pathogens, lack of release of genome data during gap closure stage is clearly medically counterproductive. We thus investigated the feasibility of a dirty genome approach, i.e. the release of unfinished genome sequences to develop serological diagnostic tools. We showed that almost the whole genome sequence of the emerging pathogen Parachlamydia acanthamoebae was retrieved even with relatively short reads from Genome Sequencer 20 and Solexa. The bacterial proteome was analyzed to select immunogenic proteins, which were then expressed and used to elaborate the first steps of an ELISA. This work constitutes the proof of principle for a dirty genome approach, i.e. the use of unfinished genome sequences of pathogenic bacteria, coupled with proteomics to rapidly identify new immunogenic proteins useful to develop in the future specific diagnostic tests such as ELISA, immunohistochemistry and direct antigen detection. Although applied here to an emerging pathogen, this combined dirty genome sequencing/proteomic approach may be used for any pathogen for which better diagnostics are needed. These genome sequences may also be very useful to develop DNA based diagnostic tests. All these diagnostic tools will allow further evaluations of the pathogenic potential of this obligate intracellular bacterium.

  20. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences.

    Science.gov (United States)

    Chen, Zhuo; Xu, Shixia; Zhou, Kaiya; Yang, Guang

    2011-10-27

    A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales), and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Mondontidae), and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Mondontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae), whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving cetacean phylogenetic relationships in the future.

  1. Whale phylogeny and rapid radiation events revealed using novel retroposed elements and their flanking sequences

    Directory of Open Access Journals (Sweden)

    Zhou Kaiya

    2011-10-01

    Full Text Available Abstract Background A diversity of hypotheses have been proposed based on both morphological and molecular data to reveal phylogenetic relationships within the order Cetacea (dolphins, porpoises, and whales, and great progress has been made in the past two decades. However, there is still some controversy concerning relationships among certain cetacean taxa such as river dolphins and delphinoid species, which needs to be further addressed with more markers in an effort to address unresolved portions of the phylogeny. Results An analysis of additional SINE insertions and SINE-flanking sequences supported the monophyly of the order Cetacea as well as Odontocete, Delphinoidea (Delphinidae + Phocoenidae + Mondontidae, and Delphinidae. A sister relationship between Delphinidae and Phocoenidae + Mondontidae was supported, and members of classical river dolphins and the genera Tursiops and Stenella were found to be paraphyletic. Estimates of divergence times revealed rapid divergences of basal Odontocete lineages in the Oligocene and Early Miocene, and a recent rapid diversification of Delphinidae in the Middle-Late Miocene and Pliocene within a narrow time frame. Conclusions Several novel SINEs were found to differentiate Delphinidae from the other two families (Monodontidae and Phocoenidae, whereas the sister grouping of the latter two families with exclusion of Delphinidae was further revealed using the SINE-flanking sequences. Interestingly, some anomalous PCR amplification patterns of SINE insertions were detected, which can be explained as the result of potential ancestral SINE polymorphisms and incomplete lineage sorting. Although a few loci were potentially anomalous, this study demonstrated that the SINE-based approach is a powerful tool in phylogenetic studies. Identifying additional SINE elements that resolve the relationships in the superfamily Delphinoidea and family Delphinidae will be important steps forward in completely resolving

  2. JVM: Java Visual Mapping tool for next generation sequencing read.

    Science.gov (United States)

    Yang, Ye; Liu, Juan

    2015-01-01

    We developed a program JVM (Java Visual Mapping) for mapping next generation sequencing read to reference sequence. The program is implemented in Java and is designed to deal with millions of short read generated by sequence alignment using the Illumina sequencing technology. It employs seed index strategy and octal encoding operations for sequence alignments. JVM is useful for DNA-Seq, RNA-Seq when dealing with single-end resequencing. JVM is a desktop application, which supports reads capacity from 1 MB to 10 GB.

  3. Chemical experiments with superheavy elements.

    Science.gov (United States)

    Türler, Andreas

    2010-01-01

    Unnoticed by many chemists, the Periodic Table of the Elements has been extended significantly in the last couple of years and the 7th period has very recently been completed with eka-Rn (element 118) currently being the heaviest element whose synthesis has been reported. These 'superheavy' elements (also called transactinides with atomic number > or = 104 (Rf)) have been artificially synthesized in fusion reactions at accelerators in minute quantities of a few single atoms. In addition, all isotopes of the transactinide elements are radioactive and decay with rather short half-lives. Nevertheless, it has been possible in some cases to investigate experimentally chemical properties of transactinide elements and even synthesize simple compounds. The experimental investigation of superheavy elements is especially intriguing, since theoretical calculations predict significant deviations from periodic trends due to the influence of strong relativistic effects. In this contribution first experiments with hassium (Hs, atomic number 108), copernicium (Cn, atomic number 112) and element 114 (eka-Pb) are reviewed.

  4. Similarity as an organising principle in short-term memory.

    Science.gov (United States)

    LeCompte, D C; Watkins, M J

    1993-03-01

    The role of stimulus similarity as an organising principle in short-term memory was explored in a series of seven experiments. Each experiment involved the presentation of a short sequence of items that were drawn from two distinct physical classes and arranged such that item class changed after every second item. Following presentation, one item was re-presented as a probe for the 'target' item that had directly followed it in the sequence. Memory for the sequence was considered organised by class if probability of recall was higher when the probe and target were from the same class than when they were from different classes. Such organisation was found when one class was auditory and the other was visual (spoken vs. written words, and sounds vs. pictures). It was also found when both classes were auditory (words spoken in a male voice vs. words spoken in a female voice) and when both classes were visual (digits shown in one location vs. digits shown in another). It is concluded that short-term memory can be organised on the basis of sensory modality and on the basis of certain features within both the auditory and visual modalities.

  5. SoyTEdb: a comprehensive database of transposable elements in the soybean genome

    Directory of Open Access Journals (Sweden)

    Zhu Liucun

    2010-02-01

    Full Text Available Abstract Background Transposable elements are the most abundant components of all characterized genomes of higher eukaryotes. It has been documented that these elements not only contribute to the shaping and reshaping of their host genomes, but also play significant roles in regulating gene expression, altering gene function, and creating new genes. Thus, complete identification of transposable elements in sequenced genomes and construction of comprehensive transposable element databases are essential for accurate annotation of genes and other genomic components, for investigation of potential functional interaction between transposable elements and genes, and for study of genome evolution. The recent availability of the soybean genome sequence has provided an unprecedented opportunity for discovery, and structural and functional characterization of transposable elements in this economically important legume crop. Description Using a combination of structure-based and homology-based approaches, a total of 32,552 retrotransposons (Class I and 6,029 DNA transposons (Class II with clear boundaries and insertion sites were structurally annotated and clearly categorized, and a soybean transposable element database, SoyTEdb, was established. These transposable elements have been anchored in and integrated with the soybean physical map and genetic map, and are browsable and visualizable at any scale along the 20 soybean chromosomes, along with predicted genes and other sequence annotations. BLAST search and other infrastracture tools were implemented to facilitate annotation of transposable elements or fragments from soybean and other related legume species. The majority (> 95% of these elements (particularly a few hundred low-copy-number families are first described in this study. Conclusion SoyTEdb provides resources and information related to transposable elements in the soybean genome, representing the most comprehensive and the largest manually

  6. The organization structure and regulatory elements of Chlamydomonas histone genes reveal features linking plant and animal genes.

    Science.gov (United States)

    Fabry, S; Müller, K; Lindauer, A; Park, P B; Cornelius, T; Schmitt, R

    1995-09-01

    The genome of the green alga Chlamydomonas reinhardtii contains approximately 15 gene clusters of the nucleosomal (or core) histone H2A, H2B, H3 and H4 genes and at least one histone H1 gene. Seven non-allelic histone gene loci were isolated from a genomic library, physically mapped, and the nucleotide sequences of three isotypes of each core histone gene species and one linked H1 gene determined. The core histone genes are organized in clusters of H2A-H2B and H3-H4 pairs, in which each gene pair shows outwardly divergent transcription from a short (< 300 bp) intercistronic region. These intercistronic regions contain typically conserved promoter elements, namely a TATA-box and the three motifs TGGCCAG-G(G/C)-CGAG, CGTTGACC and CGGTTG. Different from the genes of higher plants, but like those of animals and the related alga Volvox, the 3' untranslated regions contain no poly A signal, but a palindromic sequence (3' palindrome) essential for mRNA processing is present. One single H1 gene was found in close linkage to a H2A-H2B pair. The H1 upstream region contains the octameric promoter element GGTTGACC (also found upstream of the core histone genes) and two specific sequence motifs that are shared only with the Volvox H1 promoters. This suggests differential transcription of the H1 and the core histone genes. The H1 gene is interrupted by two introns. Unlike Volvox H3 genes, the three sequenced H3 isoforms are intron-free. Primer-directed PCR of genomic DNA demonstrated, however, that at least 8 of the about 15 H3 genes do contain one intron at a conserved position. In synchronized C. reinhardtii cells, H4 mRNA levels (representative of all core histone mRNAs) peak during cell division, suggesting strict replication-dependent gene control. The derived peptide sequences place C. reinhardtii core histones closer to plants than to animals, except that the H2A histones are more animal-like. The peptide sequence of histone H1 is closely related to the V. carteri VH1-II

  7. Combinatorial events of insertion sequences and ICE in Gram-negative bacteria.

    Science.gov (United States)

    Toleman, Mark A; Walsh, Timothy R

    2011-09-01

    The emergence of antibiotic and antimicrobial resistance in Gram-negative bacteria is incremental and linked to genetic elements that function in a so-called 'one-ended transposition' manner, including ISEcp1, ISCR elements and Tn3-like transposons. The power of these elements lies in their inability to consistently recognize one of their own terminal sequences, while recognizing more genetically distant surrogate sequences. This has the effect of mobilizing the DNA sequence found adjacent to their initial location. In general, resistance in Gram-negatives is closely linked to a few one-off events. These include the capture of the class 1 integron by a Tn5090-like transposon; the formation of the 3' conserved segment (3'-CS); and the fusion of the ISCR1 element to the 3'-CS. The structures formed by these rare events have been massively amplified and disseminated in Gram-negative bacteria, but hitherto, are rarely found in Gram-positives. Such events dominate current resistance gene acquisition and are instrumental in the construction of large resistance gene islands on chromosomes and plasmids. Similar combinatorial events appear to have occurred between conjugative plasmids and phages constructing hybrid elements called integrative and conjugative elements or conjugative transposons. These elements are beginning to be closely linked to some of the more powerful resistance mechanisms such as the extended spectrum β-lactamases, metallo- and AmpC type β-lactamases. Antibiotic resistance in Gram-negative bacteria is dominated by unusual combinatorial mistakes of Insertion sequences and gene fusions which have been selected and amplified by antibiotic pressure enabling the formation of extended resistance islands. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  8. Transformations of visual memory induced by implied motions of pattern elements.

    Science.gov (United States)

    Finke, R A; Freyd, J J

    1985-10-01

    Four experiments measured distortions in short-term visual memory induced by displays depicting independent translations of the elements of a pattern. In each experiment, observers saw a sequence of 4 dot patterns and were instructed to remember the third pattern and to compare it with the fourth. The first three patterns depicted translations of the dots in consistent, but separate directions. Error rates and reaction times for rejecting the fourth pattern as different from the third were substantially higher when the dots in that pattern were displaced slightly forward, in the same directions as the implied motions, compared with when the dots were displaced in the opposite, backward directions. These effects showed little variation across interstimulus intervals ranging from 250 to 2,000 ms, and did not depend on whether the displays gave rise to visual apparent motion. However, they were eliminated when the dots in the fourth pattern were displaced by larger amounts in each direction, corresponding to the dot positions in the next and previous patterns in the same inducing sequence. These findings extend our initial report of the phenomenon of "representational momentum" (Freyd & Finke, 1984a), and help to rule out alternatives to the proposal that visual memories tend to undergo, at least to some extent, the transformations implied by a prior sequence of observed events.

  9. Both positive and negative regulatory elements mediate expression of a photoregulated CAB gene from Nicotiana plumbaginifolia.

    Science.gov (United States)

    Castresana, C; Garcia-Luque, I; Alonso, E; Malik, V S; Cashmore, A R

    1988-01-01

    We have analyzed promoter regulatory elements from a photoregulated CAB gene (Cab-E) isolated from Nicotiana plumbaginifolia. These studies have been performed by introducing chimeric gene constructs into tobacco cells via Agrobacterium tumefaciens-mediated transformation. Expression studies on the regenerated transgenic plants have allowed us to characterize three positive and one negative cis-acting elements that influence photoregulated expression of the Cab-E gene. Within the upstream sequences we have identified two positive regulatory elements (PRE1 and PRE2) which confer maximum levels of photoregulated expression. These sequences contain multiple repeated elements related to the sequence-ACCGGCCCACTT-. We have also identified within the upstream region a negative regulatory element (NRE) extremely rich in AT sequences, which reduces the level of gene expression in the light. We have defined a light regulatory element (LRE) within the promoter region extending from -396 to -186 bp which confers photoregulated expression when fused to a constitutive nopaline synthase ('nos') promoter. Within this region there is a 132-bp element, extending from -368 to -234 bp, which on deletion from the Cab-E promoter reduces gene expression from high levels to undetectable levels. Finally, we have demonstrated for a full length Cab-E promoter conferring high levels of photoregulated expression, that sequences proximal to the Cab-E TATA box are not replaceable by corresponding sequences from a 'nos' promoter. This contrasts with the apparent equivalence of these Cab-E and 'nos' TATA box-proximal sequences in truncated promoters conferring low levels of photoregulated expression. Images PMID:2901343

  10. Short notice inspections

    International Nuclear Information System (INIS)

    Pouchkarev, V.

    1998-01-01

    For 30 years the IAEA safeguards system have evolved and have been strengthened by the regular introduction of new methods and techniques, improving both its effectiveness and efficiency. The member States of the IAEA have indicated their willingness to accept new obligations and associated technical measure that greatly strengthen the nuclear safeguards system. One element of this is the extent to which the IAEA inspectors have physical access to relevant locations for the purpose of providing independent verification of the exclusively peaceful intent of a State nuclear program. The Protocol to Safeguards granted new legal authority with respect to information on, and short notice inspector access to, all buildings on a nuclear site and administrative agreements that improve the process of designating inspectors and IAEA access to modern means of communication. This report is a short description of unannounced or short notice inspections as measures on which the new strengthened and cost efficient system will be based

  11. Mapping sequences by parts

    Directory of Open Access Journals (Sweden)

    Guziolowski Carito

    2007-09-01

    Full Text Available Abstract Background: We present the N-map method, a pairwise and asymmetrical approach which allows us to compare sequences by taking into account evolutionary events that produce shuffled, reversed or repeated elements. Basically, the optimal N-map of a sequence s over a sequence t is the best way of partitioning the first sequence into N parts and placing them, possibly complementary reversed, over the second sequence in order to maximize the sum of their gapless alignment scores. Results: We introduce an algorithm computing an optimal N-map with time complexity O (|s| × |t| × N using O (|s| × |t| × N memory space. Among all the numbers of parts taken in a reasonable range, we select the value N for which the optimal N-map has the most significant score. To evaluate this significance, we study the empirical distributions of the scores of optimal N-maps and show that they can be approximated by normal distributions with a reasonable accuracy. We test the functionality of the approach over random sequences on which we apply artificial evolutionary events. Practical Application: The method is illustrated with four case studies of pairs of sequences involving non-standard evolutionary events.

  12. CACTA-superfamily transposable element is inserted in MYB transcription factor gene of soybean line producing variegated seeds.

    Science.gov (United States)

    Yan, Fan; Di, Shaokang; Takahashi, Ryoji

    2015-08-01

    The R gene of soybean, presumably encoding a MYB transcription factor, controls seed coat color. The gene consists of multiple alleles, R (black), r-m (black spots and (or) concentric streaks on brown seed), and r (brown seed). This study was conducted to determine the structure of the MYB transcription factor gene in a near-isogenic line (NIL) having r-m allele. PCR amplification of a fragment of the candidate gene Glyma.09G235100 generated a fragment of about 1 kb in the soybean cultivar Clark, whereas a fragment of about 14 kb in addition to fragments of 1 and 1.4 kb were produced in L72-2040, a Clark 63 NIL with the r-m allele. Clark 63 is a NIL of Clark with the rxp and Rps1 alleles. A DNA fragment of 13 060 bp was inserted in the intron of Glyma.09G235100 in L72-2040. The fragment had the CACTA motif at both ends, imperfect terminal inverted repeats (TIR), inverse repetition of short sequence motifs close to the 5' and 3' ends, and a duplication of three nucleotides at the site of integration, indicating that it belongs to a CACTA-superfamily transposable element. We designated the element as Tgm11. Overall nucleotide sequence, motifs of TIR, and subterminal repeats were similar to those of Tgm1 and Tgs1, suggesting that these elements comprise a family.

  13. SCREEN FOR DOMINANT BEHAVIORAL MUTATIONS CAUSED BY GENOMIC INSERTION OF P-ELEMENT TRANSPOSONS IN DROSOPHILA: AN EXAMINATION OF THE INTEGRATION OF VIRAL VECTOR SEQUENCES

    OpenAIRE

    FOX, LYLE E.; GREEN, DAVID; YAN, ZIYING; ENGELHARDT, JOHN F.; WU, CHUN-FANG

    2007-01-01

    Here we report the development of a high-throughput screen to assess dominant mutation rates caused by P-element transposition within the Drosophila genome that is suitable for assessing the undesirable effects of integrating foreign regulatory sequences (viral cargo) into a host genome. Three different behavioral paradigms were used: sensitivity to mechanical stress, response to heat stress, and ability to fly. The results, from our screen of 35,000 flies, indicate that mutations caused by t...

  14. Renormalon ambiguities in NRQCD operator matrix elements

    International Nuclear Information System (INIS)

    Bodwin, G.T.; Chen, Y.

    1999-01-01

    We analyze the renormalon ambiguities that appear in factorization formulas in QCD. Our analysis contains a simple argument that the ambiguities in the short-distance coefficients and operator matrix elements are artifacts of dimensional-regularization factorization schemes and are absent in cutoff schemes. We also present a method for computing the renormalon ambiguities in operator matrix elements and apply it to a computation of the ambiguities in the matrix elements that appear in the NRQCD factorization formulas for the annihilation decays of S-wave quarkonia. Our results, combined with those of Braaten and Chen for the short-distance coefficients, provide an explicit demonstration that the ambiguities cancel in the physical decay rates. In addition, we analyze the renormalon ambiguities in the Gremm-Kapustin relation and in various definitions of the heavy-quark mass. copyright 1999 The American Physical Society

  15. Screening for sequence-specific RNA-BPs by comprehensive UV crosslinking

    Directory of Open Access Journals (Sweden)

    Le Meuth-Metzinger Valerie

    2002-06-01

    Full Text Available Abstract Background Specific cis-elements and the associated trans-acting factors have been implicated in the post-transcriptional regulation of gene expression. In the era of genome wide analyses identifying novel trans-acting factors and cis-regulatory elements is a step towards understanding coordinated gene expression. UV-crosslink analysis is a standard method used to identify RNA-binding proteins. Uridine is traditionally used to radiolabel substrate RNAs, however, proteins binding to cis-elments particularly uridine poor will be weakly or not detected. We evaluate here the possibility of using UV-crosslinking with RNA substrates radiolabeled with each of the four ribonucleotides as an approach for screening for novel sequence specific RNA-binding proteins. Results The radiolabeled RNA substrates were derived from the 3'UTRs of the cloned Eg and c-mos Xenopus laevis maternal mRNAs. Specific, but not identical, uv-crosslinking signals were obtained, some of which corresponded to already identified proteins. A signal for a novel 90 kDa protein was observed with the c-mos 3'UTR radiolabeled with both CTP and GTP but not with UTP. The binding site of the 90 kDa RNA-binding protein was localised to a 59-nucleotide portion of the c-mos 3'UTR. Conclusion That the 90 kDa signal was detected with RNAs radiolabeled with CTP or GTP but not UTP illustrates the advantage of radiolabeling all four nucleotides in a UV-crosslink based screen. This method can be used for both long and short RNAs and does not require knowledge of the cis-acting sequence. It should be amenable to high throughput screening for RNA binding proteins.

  16. Evaluation of short repetition time, partial flip angle, gradient recalled echo pulse sequences in cervical spine imaging

    International Nuclear Information System (INIS)

    Enzmann, D.; Rubin, J.B.

    1987-01-01

    A short repetition time (TR), partial flip angle, gradient recalled echo pulse sequence (GRASS) was prospectively studied to optimize it for the diagnosis of cervical disk and cord disease in 98 patients. Changes in signal-to-noise ratio (SNR) and contrast were measured as the following parameters were varied: flip angle (3 0 to 18 0 ), TR (22-60 msec), and echo time (TE) (12.5-25 msec). Flip angle was the single most important parameter. For disk disease, cerebrospinal fluid (CSF) SNR peaked at an 8 0 flip angle in the axial view but at a 4 0 flip angle in the sagittal view. In the sagittal view, disk-CSF contrast decreased progressively from a flip angle of 3 0 , while in the axial view it peaked at 10 0 . For cord lesions the findings were similar except that lesion-cord contrast could be increased by lengthening both TR and TE. No one combination of parameters proved greatly superior for either disk disease or cord disease. The selection of parameters required balancing of several factors that often had opposing effects

  17. Separating metagenomic short reads into genomes via clustering

    Directory of Open Access Journals (Sweden)

    Tanaseichuk Olga

    2012-09-01

    Full Text Available Abstract Background The metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads. Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels. Results In this work, we present a two-phase heuristic algorithm for separating short paired-end reads from different genomes in a metagenomic dataset. We use the observation that most of the l-mers belong to unique genomes when l is sufficiently large. The first phase of the algorithm results in clusters of l-mers each of which belongs to one genome. During the second phase, clusters are merged based on l-mer repeat information. These final clusters are used to assign reads. The algorithm could handle very short reads and sequencing errors. It is initially designed for genomes with similar abundance levels and then

  18. Malazy, a degenerate, species-specific transposable element in Cercospora zeae-maydis.

    Science.gov (United States)

    Shim, Won-Bo; Dunkle, Larry D

    2005-01-01

    Two fungal pathogens, Cercospora zeae-maydis Groups I and II, cause gray leaf spot of maize. During the sequencing of a cosmid library from C. zeae-maydis Group I, we discovered a sequence with high similarity to Maggy, a transposable element from Magnaporthe grisea. The element from C. zeae-maydis, named Malazy, contained 194-base-pair terminal repeats and sequences with high similarity to reverse transcriptase and integrase, components of the POL gene in the gypsy-like retrotransposons in fungi. Sequences with similarity to other POL gene components, protease and ribonuclease, were not detected in Malazy. A single copy of the element was detected by PCR and Southern analyses in all six North American isolates of C. zeae-maydis Group I but was not detected in the four isolates of C. zeae-maydis Group II from three continents or in phylogenetically related species. Fragments of the core domains of reverse transcriptase and integrase contained a high frequency of stop codons that were conserved in all six isolates of Group I. Additional C:G to T:A transitions in occasional isolates usually were silent mutations, while two resulted in isolate-specific stop codons. The absence of Malazy from related species suggests that it was acquired after the divergence of C. zeae-maydis Groups I and II. The high frequency of stop codons and the presence of a single copy of the element suggest that it was inactivated soon after it was acquired. Because the element is inactive and because reading frames for other genes were not found in sequences flanking the element, Malazy does not appear to be the cause of differences leading to speciation or genetic diversity between C. zeae-maydis Groups I and II.

  19. Cis-acting elements in the promoter region of the human aldolase C gene.

    Science.gov (United States)

    Buono, P; de Conciliis, L; Olivetta, E; Izzo, P; Salvatore, F

    1993-08-16

    We investigated the cis-acting sequences involved in the expression of the human aldolase C gene by transient transfections into human neuroblastoma cells (SKNBE). We demonstrate that 420 bp of the 5'-flanking DNA direct at high efficiency the transcription of the CAT reporter gene. A deletion between -420 bp and -164 bp causes a 60% decrease of CAT activity. Gel shift and DNase I footprinting analyses revealed four protected elements: A, B, C and D. Competition analyses indicate that Sp1 or factors sharing a similar sequence specificity bind to elements A and B, but not to elements C and D. Sequence analysis shows a half palindromic ERE motif (GGTCA), in elements B and D. Region D binds a transactivating factor which appears also essential to stabilize the initiation complex.

  20. Far-UV-induced dimeric photoproducts in short oligonucleotides: sequence effects

    International Nuclear Information System (INIS)

    Douki, T.; Zalizniak, T.; Cadet, J.

    1997-01-01

    Cyclobutane pyrimidine dimers and pyrimidine (6-4)pyrimidone adducts represent the two major classes of far-UV-induced DNA photoproducts. Because of the lack of appropriate detection methods for each individual photoproduct, little is known about the effect of the sequence on their formaiton. In the present work, the photoproduct distribution obtained upon exposure of a series of dinucleoside monophosphate to 254 nm light was determined. (author)

  1. Subtyping Salmonella enterica serovar enteritidis isolates from different sources by using sequence typing based on virulence genes and clustered regularly interspaced short palindromic repeats (CRISPRs).

    Science.gov (United States)

    Liu, Fenyun; Kariyawasam, Subhashinie; Jayarao, Bhushan M; Barrangou, Rodolphe; Gerner-Smidt, Peter; Ribot, Efrain M; Knabel, Stephen J; Dudley, Edward G

    2011-07-01

    Salmonella enterica subsp. enterica serovar Enteritidis is a major cause of food-borne salmonellosis in the United States. Two major food vehicles for S. Enteritidis are contaminated eggs and chicken meat. Improved subtyping methods are needed to accurately track specific strains of S. Enteritidis related to human salmonellosis throughout the chicken and egg food system. A sequence typing scheme based on virulence genes (fimH and sseL) and clustered regularly interspaced short palindromic repeats (CRISPRs)-CRISPR-including multi-virulence-locus sequence typing (designated CRISPR-MVLST)-was used to characterize 35 human clinical isolates, 46 chicken isolates, 24 egg isolates, and 63 hen house environment isolates of S. Enteritidis. A total of 27 sequence types (STs) were identified among the 167 isolates. CRISPR-MVLST identified three persistent and predominate STs circulating among U.S. human clinical isolates and chicken, egg, and hen house environmental isolates in Pennsylvania, and an ST that was found only in eggs and humans. It also identified a potential environment-specific sequence type. Moreover, cluster analysis based on fimH and sseL identified a number of clusters, of which several were found in more than one outbreak, as well as 11 singletons. Further research is needed to determine if CRISPR-MVLST might help identify the ecological origins of S. Enteritidis strains that contaminate chickens and eggs.

  2. Establishment of screening technique for mutant cell and analysis of base sequence in the mutation

    International Nuclear Information System (INIS)

    Sofuni, Toshio; Nomi, Takehiko; Yamada, Masami; Masumura, Kenichi

    2000-01-01

    This research project aimed to establish an easy and quick detection method for radiation-induced mutation using molecular-biological techniques and an effective analyzing method for the molecular changes in base sequence. In this year, Spi mutants derived from γ-radiation exposed mouse were analyzed by PCR method and DNA sequence method. Male transgenic mice were exposed to γ-ray at 5,10, 50 Gy and the transgene was taken out from the genome DNA from the spleen in vivo packaging method. Spi mutant plaques were obtained by infecting the recovered phage to E. coli. Sequence analysis for the mutants was made using ALFred DNA sequencer and SequiTherm TM Long-Red Cycle sequencing kit. Sequence analysis was carried out for 41 of 50 independent Spi mutants obtained. The deletions were classified into 4 groups; Group 1 included 15 mutants that were characterized with a large deletion (43 bp-10 kb) with a short homologous sequence. Group 2 included 11 mutants of a large deletion having no homologous sequence at the connecting region. Group 3 included 11 mutants having a short deletion of less than 20 bp, which occurred in the non-repetitive sequence of gam gene and possibly caused by oxidative breakage of DNA or recombination of DNA fragment produced by the breakage. Group 4 included 4 mutants having deletions as short as 20 bp or less in the repetitive sequence of gam gene, resulting in an alteration of the reading frame. Thus, the synthesis of Gam protein was terminated by the appearance of TGA between code 13 and 14 of redB gene, leading to inactivation of gam gene and redBA gene. These results indicated that most of Spi mutants had a deletion in red/gam region and the deletions in more than half mutants occurred in homologous sequences as short as 8 bp. (M.N.)

  3. MR imaging of the orbit and eye using inversion recovery sequences

    International Nuclear Information System (INIS)

    Smith, F.W.; Parekh, S.; Forrester, J.; Redpath, T.W.

    1986-01-01

    Most centers performing MR imaging use spin-echo sequences to produce images; however, there are many advantages to using short TI inversion-recovery sequences for examination of the orbits. By selecting a TI similar to the relaxation time of any structure, the signal from this can be suppressed, thereby enhancing the signal from other structures. Using a sequence of TR = 1,000 msec and TI of less than 200 msec, the signal from fat is suppressed, improving image quality adjacent to the surface coil and providing better contrast between orbital structures and fat. The use of this short TI sequence for the examination of the eye in patients with opaque lenses is an accurate method of diagnosis since the sequence enhances the signal from both long T1 and T2 lesions. Eighty-five patients with orbital or ocular pathology have been studied, and the results demonstrate the usefulness of this technique for diagnosis

  4. Functional noncoding sequences derived from SINEs in the mammalian genome.

    Science.gov (United States)

    Nishihara, Hidenori; Smit, Arian F A; Okada, Norihiro

    2006-07-01

    Recent comparative analyses of mammalian sequences have revealed that a large number of nonprotein-coding genomic regions are under strong selective constraint. Here, we report that some of these loci have been derived from a newly defined family of ancient SINEs (short interspersed repetitive elements). This is a surprising result, as SINEs and other transposable elements are commonly thought to be genomic parasites. We named the ancient SINE family AmnSINE1, for Amniota SINE1, because we found it to be present in mammals as well as in birds, and some copies predate the mammalian-bird split 310 million years ago (Mya). AmnSINE1 has a chimeric structure of a 5S rRNA and a tRNA-derived SINE, and is related to five tRNA-derived SINE families that we characterized here in the coelacanth, dogfish shark, hagfish, and amphioxus genomes. All of the newly described SINE families have a common central domain that is also shared by zebrafish SINE3, and we collectively name them the DeuSINE (Deuterostomia SINE) superfamily. Notably, of the approximately 1000 still identifiable copies of AmnSINE1 in the human genome, 105 correspond to loci phylogenetically highly conserved among mammalian orthologs. The conservation is strongest over the central domain. Thus, AmnSINE1 appears to be the best example of a transposable element of which a significant fraction of the copies have acquired genomic functionality.

  5. In silico discovery of transcription regulatory elements in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Le Roch Karine G

    2008-02-01

    Full Text Available Abstract Background With the sequence of the Plasmodium falciparum genome and several global mRNA and protein life cycle expression profiling projects now completed, elucidating the underlying networks of transcriptional control important for the progression of the parasite life cycle is highly pertinent to the development of new anti-malarials. To date, relatively little is known regarding the specific mechanisms the parasite employs to regulate gene expression at the mRNA level, with studies of the P. falciparum genome sequence having revealed few cis-regulatory elements and associated transcription factors. Although it is possible the parasite may evoke mechanisms of transcriptional control drastically different from those used by other eukaryotic organisms, the extreme AT-rich nature of P. falciparum intergenic regions (~90% AT presents significant challenges to in silico cis-regulatory element discovery. Results We have developed an algorithm called Gene Enrichment Motif Searching (GEMS that uses a hypergeometric-based scoring function and a position-weight matrix optimization routine to identify with high-confidence regulatory elements in the nucleotide-biased and repeat sequence-rich P. falciparum genome. When applied to promoter regions of genes contained within 21 co-expression gene clusters generated from P. falciparum life cycle microarray data using the semi-supervised clustering algorithm Ontology-based Pattern Identification, GEMS identified 34 putative cis-regulatory elements associated with a variety of parasite processes including sexual development, cell invasion, antigenic variation and protein biosynthesis. Among these candidates were novel motifs, as well as many of the elements for which biological experimental evidence already exists in the Plasmodium literature. To provide evidence for the biological relevance of a cell invasion-related element predicted by GEMS, reporter gene and electrophoretic mobility shift assays

  6. ϕ-statistically quasi Cauchy sequences

    Directory of Open Access Journals (Sweden)

    Bipan Hazarika

    2016-04-01

    Full Text Available Let P denote the space whose elements are finite sets of distinct positive integers. Given any element σ of P, we denote by p(σ the sequence {pn(σ} such that pn(σ=1 for n ∈ σ and pn(σ=0 otherwise. Further Ps={σ∈P:∑n=1∞pn(σ≤s}, i.e. Ps is the set of those σ whose support has cardinality at most s. Let (ϕn be a non-decreasing sequence of positive integers such that nϕn+1≤(n+1ϕn for all n∈N and the class of all sequences (ϕn is denoted by Φ. Let E⊆N. The number δϕ(E=lims→∞1ϕs|{k∈σ,σ∈Ps:k∈E}| is said to be the ϕ-density of E. A sequence (xn of points in R is ϕ-statistically convergent (or Sϕ-convergent to a real number ℓ for every ε > 0 if the set {n∈N:|xn−ℓ|≥ɛ} has ϕ-density zero. We introduce ϕ-statistically ward continuity of a real function. A real function is ϕ-statistically ward continuous if it preserves ϕ-statistically quasi Cauchy sequences where a sequence (xn is called to be ϕ-statistically quasi Cauchy (or Sϕ-quasi Cauchy when (Δxn=(xn+1−xn is ϕ-statistically convergent to 0. i.e. a sequence (xn of points in R is called ϕ-statistically quasi Cauchy (or Sϕ-quasi Cauchy for every ε > 0 if {n∈N:|xn+1−xn|≥ɛ} has ϕ-density zero. Also we introduce the concept of ϕ-statistically ward compactness and obtain results related to ϕ-statistically ward continuity, ϕ-statistically ward compactness, statistically ward continuity, ward continuity, ward compactness, ordinary compactness, uniform continuity, ordinary continuity, δ-ward continuity, and slowly oscillating continuity.

  7. Structural Features of the Seneca Valley Virus Internal Ribosome Entry Site (IRES) Element: a Picornavirus with a Pestivirus-Like IRES

    DEFF Research Database (Denmark)

    Willcocks, Margaret M.; Locker, Nicolas; Gomwalk, Zarmwa

    2011-01-01

    The RNA genome of Seneca Valley virus (SVV), a recently identified picornavirus, contains an internal ribosome entry site (IRES) element which has structural and functional similarity to that from classical swine fever virus (CSFV) and hepatitis C virus, members of the FLAVIVIRIDAE: The SVV IRES...... has an absolute requirement for the presence of a short region of virus-coding sequence to allow it to function either in cells or in rabbit reticulocyte lysate. The IRES activity does not require the translation initiation factor eIF4A or intact eIF4G. The predicted secondary structure indicates...

  8. Animal vocal sequences: not the Markov chains we thought they were.

    Science.gov (United States)

    Kershenbaum, Arik; Bowles, Ann E; Freeberg, Todd M; Jin, Dezhe Z; Lameira, Adriano R; Bohn, Kirsten

    2014-10-07

    Many animals produce vocal sequences that appear complex. Most researchers assume that these sequences are well characterized as Markov chains (i.e. that the probability of a particular vocal element can be calculated from the history of only a finite number of preceding elements). However, this assumption has never been explicitly tested. Furthermore, it is unclear how language could evolve in a single step from a Markovian origin, as is frequently assumed, as no intermediate forms have been found between animal communication and human language. Here, we assess whether animal taxa produce vocal sequences that are better described by Markov chains, or by non-Markovian dynamics such as the 'renewal process' (RP), characterized by a strong tendency to repeat elements. We examined vocal sequences of seven taxa: Bengalese finches Lonchura striata domestica, Carolina chickadees Poecile carolinensis, free-tailed bats Tadarida brasiliensis, rock hyraxes Procavia capensis, pilot whales Globicephala macrorhynchus, killer whales Orcinus orca and orangutans Pongo spp. The vocal systems of most of these species are more consistent with a non-Markovian RP than with the Markovian models traditionally assumed. Our data suggest that non-Markovian vocal sequences may be more common than Markov sequences, which must be taken into account when evaluating alternative hypotheses for the evolution of signalling complexity, and perhaps human language origins. © 2014 The Author(s) Published by the Royal Society. All rights reserved.

  9. Advances in chemical investigations of the heaviest elements

    Directory of Open Access Journals (Sweden)

    Türler Andreas

    2016-01-01

    Full Text Available Although somewhat in the shadow of the discoveries of new elements, experimental chemical investigations of the heaviest elements have made tremendous progress in the last decades. Indeed, it was possible to experimentally determine thermochemical properties of heavy transactinide elements such as copernicium or flerovium. But will it be possible to chemically study all currently known elements of the periodic table up to element 118? While it is experimentally feasible to work with single atoms, the short half-lives of even the longest currently known isotopes of elements 115 through 118 call for new experimental approaches.

  10. Lactobacillus buchneri genotyping on the basis of clustered regularly interspaced short palindromic repeat (CRISPR) locus diversity.

    Science.gov (United States)

    Briner, Alexandra E; Barrangou, Rodolphe

    2014-02-01

    Clustered regularly interspaced short palindromic repeats (CRISPR) in combination with associated sequences (cas) constitute the CRISPR-Cas immune system, which uptakes DNA from invasive genetic elements as novel "spacers" that provide a genetic record of immunization events. We investigated the potential of CRISPR-based genotyping of Lactobacillus buchneri, a species relevant for commercial silage, bioethanol, and vegetable fermentations. Upon investigating the occurrence and diversity of CRISPR-Cas systems in Lactobacillus buchneri genomes, we observed a ubiquitous occurrence of CRISPR arrays containing a 36-nucleotide (nt) type II-A CRISPR locus adjacent to four cas genes, including the universal cas1 and cas2 genes and the type II signature gene cas9. Comparative analysis of CRISPR spacer content in 26 L. buchneri pickle fermentation isolates associated with spoilage revealed 10 unique locus genotypes that contained between 9 and 29 variable spacers. We observed a set of conserved spacers at the ancestral end, reflecting a common origin, as well as leader-end polymorphisms, reflecting recent divergence. Some of these spacers showed perfect identity with phage sequences, and many spacers showed homology to Lactobacillus plasmid sequences. Following a comparative analysis of sequences immediately flanking protospacers that matched CRISPR spacers, we identified a novel putative protospacer-adjacent motif (PAM), 5'-AAAA-3'. Overall, these findings suggest that type II-A CRISPR-Cas systems are valuable for genotyping of L. buchneri.

  11. Clustered regularly interspaced short palindromic repeats (CRISPRs): the hallmark of an ingenious antiviral defense mechanism in prokaryotes.

    Science.gov (United States)

    Al-Attar, Sinan; Westra, Edze R; van der Oost, John; Brouns, Stan J J

    2011-04-01

    Many prokaryotes contain the recently discovered defense system against mobile genetic elements. This defense system contains a unique type of repetitive DNA stretches, termed Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRs). CRISPRs consist of identical repeated DNA sequences (repeats), interspaced by highly variable sequences referred to as spacers. The spacers originate from either phages or plasmids and comprise the prokaryotes' 'immunological memory'. CRISPR-associated (cas) genes encode conserved proteins that together with CRISPRs make-up the CRISPR/Cas system, responsible for defending the prokaryotic cell against invaders. CRISPR-mediated resistance has been proposed to involve three stages: (i) CRISPR-Adaptation, the invader DNA is encountered by the CRISPR/Cas machinery and an invader-derived short DNA fragment is incorporated in the CRISPR array. (ii) CRISPR-Expression, the CRISPR array is transcribed and the transcript is processed by Cas proteins. (iii) CRISPR-Interference, the invaders' nucleic acid is recognized by complementarity to the crRNA and neutralized. An application of the CRISPR/Cas system is the immunization of industry-relevant prokaryotes (or eukaryotes) against mobile-genetic invasion. In addition, the high variability of the CRISPR spacer content can be exploited for phylogenetic and evolutionary studies. Despite impressive progress during the last couple of years, the elucidation of several fundamental details will be a major challenge in future research.

  12. Clinical evaluation of further-developed MRCP sequences in comparison with standard MRCP sequences

    International Nuclear Information System (INIS)

    Hundt, W.; Scheidler, J.; Reiser, M.; Petsch, R.

    2002-01-01

    The purpose of this study was the comparison of technically improved single-shot magnetic resonance cholangiopancreatography (MRCP) sequences with standard single-shot rapid acquisition with relaxation enhancement (RARE) and half-Fourier acquired single-shot turbo spin-echo (HASTE) sequences in evaluating the normal and abnormal biliary duct system. The bile duct system of 45 patients was prospectively investigated on a 1.5-T MRI system. The investigation was performed with RARE and HASTE MR cholangiography sequences with standard and high spatial resolutions, and with a delayed-echo half-Fourier RARE (HASTE) sequence. Findings of the improved MRCP sequences were compared with the standard MRCP sequences. The level of confidence in assessing the diagnosis was divided into five groups. The Wilcoxon signed-rank test at a level of p<0.05 was applied. In 15 patients no pathology was found. The MRCP showed stenoses of the bile duct system in 10 patients and choledocholithiasis and cholecystolithiasis in 16 patients. In 12 patients a dilatation of the bile duct system was found. Comparison of the low- and high spatial resolution sequences and the short and long TE times of the half-Fourier RARE (HASTE) sequence revealed no statistically significant differences regarding accuracy of the examination. The diagnostic confidence level in assessing normal or pathological findings for the high-resolution RARE and half-Fourier RARE (HASTE) was significantly better than for the standard sequences. For the delayed-echo half-Fourier RARE (HASTE) sequence no statistically significant difference was seen. The high-resolution RARE and half-Fourier RARE (HASTE) sequences had a higher confidence level, but there was no significant difference in diagnosis in terms of detection and assessment of pathological changes in the biliary duct system compared with standard sequences. (orig.)

  13. Molecular and bioinformatic analysis of the FB-NOF transposable element.

    Science.gov (United States)

    Badal, Martí; Portela, Anna; Xamena, Noel; Cabré, Oriol

    2006-04-12

    The Drosophila melanogaster transposable element FB-NOF is known to play a role in genome plasticity through the generation of all sort of genomic rearrangements. Moreover, several insertional mutants due to FB mobilizations have been reported. Its structure and sequence, however, have been poorly studied mainly as a consequence of the long, complex and repetitive sequence of FB inverted repeats. This repetitive region is composed of several 154 bp blocks, each with five almost identical repeats. In this paper, we report the sequencing process of 2 kb long FB inverted repeats of a complete FB-NOF element, with high precision and reliability. This achievement has been possible using a new map of the FB repetitive region, which identifies unambiguously each repeat with new features that can be used as landmarks. With this new vision of the element, a list of FB-NOF in the D. melanogaster genomic clones has been done, improving previous works that used only bioinformatic algorithms. The availability of many FB and FB-NOF sequences allowed an analysis of the FB insertion sequences that showed no sequence specificity, but a preference for A/T rich sequences. The position of NOF into FB is also studied, revealing that it is always located after a second repeat in a random block. With the results of this analysis, we propose a model of transposition in which NOF jumps from FB to FB, using an unidentified transposase enzyme that should specifically recognize the second repeat end of the FB blocks.

  14. Long-read sequencing data analysis for yeasts.

    Science.gov (United States)

    Yue, Jia-Xing; Liti, Gianni

    2018-06-01

    Long-read sequencing technologies have become increasingly popular due to their strengths in resolving complex genomic regions. As a leading model organism with small genome size and great biotechnological importance, the budding yeast Saccharomyces cerevisiae has many isolates currently being sequenced with long reads. However, analyzing long-read sequencing data to produce high-quality genome assembly and annotation remains challenging. Here, we present a modular computational framework named long-read sequencing data analysis for yeasts (LRSDAY), the first one-stop solution that streamlines this process. Starting from the raw sequencing reads, LRSDAY can produce chromosome-level genome assembly and comprehensive genome annotation in a highly automated manner with minimal manual intervention, which is not possible using any alternative tool available to date. The annotated genomic features include centromeres, protein-coding genes, tRNAs, transposable elements (TEs), and telomere-associated elements. Although tailored for S. cerevisiae, we designed LRSDAY to be highly modular and customizable, making it adaptable to virtually any eukaryotic organism. When applying LRSDAY to an S. cerevisiae strain, it takes ∼41 h to generate a complete and well-annotated genome from ∼100× Pacific Biosciences (PacBio) running the basic workflow with four threads. Basic experience working within the Linux command-line environment is recommended for carrying out the analysis using LRSDAY.

  15. Enrichment allows identification of diverse, rare elements in metagenomic resistome-virulome sequencing.

    Science.gov (United States)

    Noyes, Noelle R; Weinroth, Maggie E; Parker, Jennifer K; Dean, Chris J; Lakin, Steven M; Raymond, Robert A; Rovira, Pablo; Doster, Enrique; Abdo, Zaid; Martin, Jennifer N; Jones, Kenneth L; Ruiz, Jaime; Boucher, Christina A; Belk, Keith E; Morley, Paul S

    2017-10-17

    Shotgun metagenomic sequencing is increasingly utilized as a tool to evaluate ecological-level dynamics of antimicrobial resistance and virulence, in conjunction with microbiome analysis. Interest in use of this method for environmental surveillance of antimicrobial resistance and pathogenic microorganisms is also increasing. In published metagenomic datasets, the total of all resistance- and virulence-related sequences accounts for enrichment system that incorporates unique molecular indices to count DNA molecules and correct for enrichment bias. The use of the bait-capture and enrichment system significantly increased on-target sequencing of the resistome-virulome, enabling detection of an additional 1441 gene accessions and revealing a low-abundance portion of the resistome-virulome that was more diverse and compositionally different than that detected by more traditional metagenomic assays. The low-abundance portion of the resistome-virulome also contained resistance genes with public health importance, such as extended-spectrum betalactamases, that were not detected using traditional shotgun metagenomic sequencing. In addition, the use of the bait-capture and enrichment system enabled identification of rare resistance gene haplotypes that were used to discriminate between sample origins. These results demonstrate that the rare resistome-virulome contains valuable and unique information that can be utilized for both surveillance and population genetic investigations of resistance. Access to the rare resistome-virulome using the bait-capture and enrichment system validated in this study can greatly advance our understanding of microbiome-resistome dynamics.

  16. Episodic sequence memory is supported by a theta-gamma phase code.

    Science.gov (United States)

    Heusser, Andrew C; Poeppel, David; Ezzyat, Youssef; Davachi, Lila

    2016-10-01

    The meaning we derive from our experiences is not a simple static extraction of the elements but is largely based on the order in which those elements occur. Models propose that sequence encoding is supported by interactions between high- and low-frequency oscillations, such that elements within an experience are represented by neural cell assemblies firing at higher frequencies (gamma) and sequential order is encoded by the specific timing of firing with respect to a lower frequency oscillation (theta). During episodic sequence memory formation in humans, we provide evidence that items in different sequence positions exhibit greater gamma power along distinct phases of a theta oscillation. Furthermore, this segregation is related to successful temporal order memory. Our results provide compelling evidence that memory for order, a core component of an episodic memory, capitalizes on the ubiquitous physiological mechanism of theta-gamma phase-amplitude coupling.

  17. High throughput sequencing and proteomics to identify immunogenic proteins of a new pathogen: the dirty genome approach.

    Directory of Open Access Journals (Sweden)

    Gilbert Greub

    Full Text Available BACKGROUND: With the availability of new generation sequencing technologies, bacterial genome projects have undergone a major boost. Still, chromosome completion needs a costly and time-consuming gap closure, especially when containing highly repetitive elements. However, incomplete genome data may be sufficiently informative to derive the pursued information. For emerging pathogens, i.e. newly identified pathogens, lack of release of genome data during gap closure stage is clearly medically counterproductive. METHODS/PRINCIPAL FINDINGS: We thus investigated the feasibility of a dirty genome approach, i.e. the release of unfinished genome sequences to develop serological diagnostic tools. We showed that almost the whole genome sequence of the emerging pathogen Parachlamydia acanthamoebae was retrieved even with relatively short reads from Genome Sequencer 20 and Solexa. The bacterial proteome was analyzed to select immunogenic proteins, which were then expressed and used to elaborate the first steps of an ELISA. CONCLUSIONS/SIGNIFICANCE: This work constitutes the proof of principle for a dirty genome approach, i.e. the use of unfinished genome sequences of pathogenic bacteria, coupled with proteomics to rapidly identify new immunogenic proteins useful to develop in the future specific diagnostic tests such as ELISA, immunohistochemistry and direct antigen detection. Although applied here to an emerging pathogen, this combined dirty genome sequencing/proteomic approach may be used for any pathogen for which better diagnostics are needed. These genome sequences may also be very useful to develop DNA based diagnostic tests. All these diagnostic tools will allow further evaluations of the pathogenic potential of this obligate intracellular bacterium.

  18. Identification and characterization of cell-specific enhancer elements for the mouse ETF/Tead2 gene.

    Science.gov (United States)

    Tanoue, Y; Yasunami, M; Suzuki, K; Ohkubo, H

    2001-12-21

    We have identified and characterized by transient transfection assays the cell-specific 117-bp enhancer sequence in the first intron of the mouse ETF (Embryonic TEA domain-containing factor)/Tead2 gene required for transcriptional activation in ETF/Tead2 gene-expressing cells, such as P19 cells. The 117-bp enhancer contains one GC-rich sequence (5'-GGGGCGGGG-3'), termed the GC box, and two tandemly repeated GA-rich sequences (5'-GGGGGAGGGG-3'), termed the proximal and distal GA elements. Further analyses, including transfection studies and electrophoretic mobility shift assays using a series of deletion and mutation constructs, indicated that Sp1, a putative activator, may be required to predominate over its competition with another unknown putative repressor, termed the GA element-binding factor, for binding to both the GC box, which overlapped with the proximal GA element, and the distal GA element in the 117-bp sequence in order to achieve a full enhancer activity. We also discuss a possible mechanism underlying the cell-specific enhancer activity of the 117-bp sequence.

  19. Comparison of SHOX and associated elements duplications distribution between patients (Lėri-Weill dyschondrosteosis/idiopathic short stature) and population sample.

    Science.gov (United States)

    Hirschfeldova, Katerina; Solc, Roman

    2017-09-05

    The effect of heterozygous duplications of SHOX and associated elements on Lėri-Weill dyschondrosteosis (LWD) and idiopathic short stature (ISS) development is less distinct when compared to reciprocal deletions. The aim of our study was to compare frequency and distribution of duplications within SHOX and associated elements between population sample and LWD (ISS) patients. A preliminary analysis conducted on Czech population sample of 250 individuals compared to our previously reported sample of 352 ISS/LWD Czech patients indicated that rather than the difference in frequency of duplications it is the difference in their distribution. Particularly, there was an increased frequency of duplications residing to the CNE-9 enhancer in our LWD/ISS sample. To see whether the obtained data are consistent across published studies we made a literature survey to get published cases with SHOX or associated elements duplication and formed the merged LWD, the merged ISS, and the merged population samples. Relative frequency of particular region duplication in each of those merged samples were calculated. There was a significant difference in the relative frequency of CNE-9 enhancer duplications (11 vs. 3) and complete SHOX (exon1-6b) duplications (4 vs. 24) (p-value 0.0139 and p-value 0.000014, respectively) between the merged LWD sample and the merged population sample. We thus propose that partial SHOX duplications and small duplications encompassing CNE-9 enhancer could be highly penetrant alleles associated with ISS and LWD development. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. The effect of short-range spatial variability on soil sampling uncertainty

    Energy Technology Data Exchange (ETDEWEB)

    Perk, Marcel van der [Department of Physical Geography, Utrecht University, P.O. Box 80115, 3508 TC Utrecht (Netherlands)], E-mail: m.vanderperk@geo.uu.nl; De Zorzi, Paolo; Barbizzi, Sabrina; Belli, Maria [Agenzia per la Protezione dell' Ambiente e per i Servizi Tecnici (APAT), Servizio Laboratori, Misure ed Attivita di Campo, Via di Castel Romano, 100-00128 Roma (Italy); Fajgelj, Ales; Sansone, Umberto [International Atomic Energy Agency (IAEA), Agency' s Laboratories Seibersdorf, A-1400 Vienna (Austria); Jeran, Zvonka; Jacimovic, Radojko [Jozef Stefan Institute, Jamova 39, 1000 Ljubljana (Slovenia)

    2008-11-15

    This paper aims to quantify the soil sampling uncertainty arising from the short-range spatial variability of elemental concentrations in the topsoils of agricultural, semi-natural, and contaminated environments. For the agricultural site, the relative standard sampling uncertainty ranges between 1% and 5.5%. For the semi-natural area, the sampling uncertainties are 2-4 times larger than in the agricultural area. The contaminated site exhibited significant short-range spatial variability in elemental composition, which resulted in sampling uncertainties of 20-30%.

  1. The effect of short-range spatial variability on soil sampling uncertainty.

    Science.gov (United States)

    Van der Perk, Marcel; de Zorzi, Paolo; Barbizzi, Sabrina; Belli, Maria; Fajgelj, Ales; Sansone, Umberto; Jeran, Zvonka; Jaćimović, Radojko

    2008-11-01

    This paper aims to quantify the soil sampling uncertainty arising from the short-range spatial variability of elemental concentrations in the topsoils of agricultural, semi-natural, and contaminated environments. For the agricultural site, the relative standard sampling uncertainty ranges between 1% and 5.5%. For the semi-natural area, the sampling uncertainties are 2-4 times larger than in the agricultural area. The contaminated site exhibited significant short-range spatial variability in elemental composition, which resulted in sampling uncertainties of 20-30%.

  2. Analysis of the effect of the Electron-Beam welding sequence for a fixed manufacturing route using finite element simulations applied to ITER vacuum vessel manufacture

    Energy Technology Data Exchange (ETDEWEB)

    Martín-Menéndez, Cristina, E-mail: cristina@natec-ingenieros.com [Numerical Analysis Technologies, S.L. Marqués de San Esteban No. 52, 33206 Gijón (Spain); Rodríguez, Eduardo [Department of Mechanical Engineering, University of Oviedo, Campus de Gijón, 33203 Gijón (Spain); Ottolini, Marco [Ansaldo Nucleare S.p.A., Corso Perrone 25, 16152 Genova (Italy); Caixas, Joan [F4E, c/Josep Pla, n.2, Torres Diagonal Litoral, Edificio B3, E-08019 Barcelona (Spain); Guirao, Julio [Numerical Analysis Technologies, S.L. Marqués de San Esteban No. 52, 33206 Gijón (Spain)

    2016-03-15

    Highlights: • The simulation methodology employed in this paper is able to adapt inside a complex manufacturing route. • The effect of the sequence is lower in a highly constrained assembly than in a lowly constrained one. • The most relevant influence on the distortions is the jigs design, instead of the welding sequence. • The welding distortion analysis should be used as a guidance to design and improve the manufacturing strategy. - Abstract: The ITER Vacuum Vessel Sectors have very tight tolerances and high density of welding. Therefore, prediction and reduction of welding distortion are critical to allow the final assembly with the other Vacuum Vessel Sectors without the production of a full scale prototype. In this paper, the effect of the welding sequence in the distortions inside a fixed manufacturing route and in a highly constrained assembly is studied in the poloidal segment named inboard (PS1). This is one of the four poloidal segments (PS) assembled for the sector. Moreover, some restrictions and limitations in the welding sequence related to the manufacturing process are explained. The results obtained show that the effect of the sequence is lower in a highly constrained assembly than in a low constrained one. A prototype manufactured by AMW consortium (PS1 mock-up) is used in order to validate the finite element method welding simulation employed. The obtained results confirmed that for Electron-Beam welds, both the welding simulation and the mock-up show a low value of distortions.

  3. Determination of essential and trace elements in milk and measurement of short-lived nuclides using FIMS

    International Nuclear Information System (INIS)

    Demiralp, R.; Kalayoglu, S.; Unseren, E.; Grass, F.; Boeck, H.

    1988-01-01

    In the experiments, Gueluem, Sek and Pinar brand of bottled milks and Pinar milk powder which are commercially available were used. As standards IAEA Milk powder A-11, NBS-Orchard Leaves 1571 and for Cu single standard was used. Samples and standards were irradiated together in the central thimble of I.T.U. TRIGA Mark-II reactor for 1-8 hrs and for 60 sec in the fast pneumatic tube of TRIGA Reactor. Depending on the nuclear characteristics of the isotopes to be analyzed, they are counted at different counting times. The activities were measured with a high-purity Ge detector coupled to Canberra 90 model multichannel analyzer in the ITU. In order to determine short half-life nuclides a very fast irradiation and measuring system (FIMS) has been used. When the average values of the 16 elements are considered, it is observed that the amount of Na, As, Al, Mn, Zn, Rb , Co in the milk powder is greater than that of milk, where as in Pinar milk, which is a durable kind of milk, the amount of Na, K, Br, Al , As, Zn, Co is larger than that in daily milk. When daily products of different brands are compared, it was found that the quantity of Fe, Cr, Mg is higher while Cl, Sb, Zn, Rb, Co is less than that in Gueluem milk. The quantity of Cl is about 10 times as much and Mg 7 times less than in Sek milk. It was not possible to investigate how the seasons and the regions from where the milk was collected affect the quantity of the elements. It will be useful to continue the study in this field

  4. A DNA sequence element that advances replication origin activation time in Saccharomyces cerevisiae.

    Science.gov (United States)

    Pohl, Thomas J; Kolor, Katherine; Fangman, Walton L; Brewer, Bonita J; Raghuraman, M K

    2013-11-06

    Eukaryotic origins of DNA replication undergo activation at various times in S-phase, allowing the genome to be duplicated in a temporally staggered fashion. In the budding yeast Saccharomyces cerevisiae, the activation times of individual origins are not intrinsic to those origins but are instead governed by surrounding sequences. Currently, there are two examples of DNA sequences that are known to advance origin activation time, centromeres and forkhead transcription factor binding sites. By combining deletion and linker scanning mutational analysis with two-dimensional gel electrophoresis to measure fork direction in the context of a two-origin plasmid, we have identified and characterized a 19- to 23-bp and a larger 584-bp DNA sequence that are capable of advancing origin activation time.

  5. Application of short-time activation analysis in the sciences

    International Nuclear Information System (INIS)

    Grass, F.

    1991-01-01

    Short-time activation analysis has proved to be a valuable tool in nearly all fields of science. To take full advantage of this technique, it is favorable to use a fast transfer system and a high resolution high rate gamma-spectroscopy system for short lived gamma-emitters and a Cherenkov detector for the determination of hard beta-emitters. It is then possible to utilize sub-minute nuclides Li-8 (740 ms), B-12 (20 ms), F-20 (11.1 s), Y-89m (16 s), and Pb-207m (800 ms) for the determination of these elements. Besides these sub-minute nuclides which constitute the only possibility for neutron activation analysis of these elements there are a number of other elements which form longer lived nuclides on short irradiation. The analysis of the halogenides F, Cl, Br, I in waste water of a sewage incineration plant can be achieved with a single 20 s irradiation and two consecutive measurement of 20 and 600 s using Cl-38m, F-20, Br-79m as well as the longer lived Cl-38, Br-80, I-128

  6. Characterization of Rous sarcoma virus-related sequences in the Japanese quail.

    Science.gov (United States)

    Chambers, J A; Cywinski, A; Chen, P J; Taylor, J M

    1986-08-01

    We detected sequences related to the avian retrovirus Rous sarcoma virus within the genome of the Japanese quail, a species previously considered to be free of endogenous avian leukosis virus elements. Using low-stringency conditions of hybridization, we screened a quail genomic library for clones containing retrovirus-related information. Of five clones so selected, one, lambda Q48, contained sequence information related to the gag, pol, and env genes of Rous sarcoma virus arranged in a contiguous fashion and spanning a distance of approximately 5.8 kilobases. This organization is consistent with the presence of an endogenous retroviral element within the Japanese quail genome. Use of this element as a high-stringency probe on Southern blots of genomic digests of several quail DNA demonstrated hybridization to a series of high-molecular-weight bands. By slot hybridization to quail DNA with a cloned probe, it was deduced that there were approximately 300 copies per diploid cell. In addition, the quail element also hybridized at low stringency to the DNA of the White Leghorn chicken and at high stringency to the DNAs of several species of jungle fowl and both true and ruffed pheasants. Limited nucleotide sequencing analysis of lambda Q48 revealed homologies of 65, 52, and 46% compared with the sequence of Rous sarcoma virus strain Prague C for the endonuclease domain of pol, the pol-env junction, and the 3'-terminal region of env, respectively. Comparisons at the amino acid level were also significant, thus confirming the retrovirus relatedness of the cloned quail element.

  7. Sequence diversity and copy number variation of Mutator-like transposases in wheat

    Directory of Open Access Journals (Sweden)

    Nobuaki Asakura

    2008-01-01

    Full Text Available Partial transposase-coding sequences of Mutator-like elements (MULEs were isolated from a wild einkorn wheat, Triticum urartu, by degenerate PCR. The isolated sequences were classified into a MuDR or Class I clade and divided into two distinct subclasses (subclass I and subclass II. The average pair-wise identity between members of both subclasses was 58.8% at the nucleotide sequence level. Sequence diversity of subclass I was larger than that of subclass II. DNA gel blot analysis showed that subclass I was present as low copy number elements in the genomes of all Triticum and Aegilops accessions surveyed, while subclass II was present as high copy number elements. These two subclasses seemed uncapable of recognizing each other for transposition. The number of copies of subclass II elements was much higher in Aegilops with the S, Sl and D genomes and polyploid Triticum species than in diploid Triticum with the A genome, indicating that active transposition occurred in S, Sl and D genomes before polyploidization. DNA gel blot analysis of six species selected from three subfamilies of Poaceae demonstrated that only the tribe Triticeae possessed both subclasses. These results suggest that the differentiation of these two subclasses occurred before or immediately after the establishment of the tribe Triticeae.

  8. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed; Briganti, Leonardo; Tramontano, Anna; Via, Allegra

    2011-01-01

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length

  9. Deep Sequencing Reveals the Complete Genome and Evidence for Transcriptional Activity of the First Virus-Like Sequences Identified in Aristotelia chilensis (Maqui Berry

    Directory of Open Access Journals (Sweden)

    Javier Villacreses

    2015-04-01

    Full Text Available Here, we report the genome sequence and evidence for transcriptional activity of a virus-like element in the native Chilean berry tree Aristotelia chilensis. We propose to name the endogenous sequence as Aristotelia chilensis Virus 1 (AcV1. High-throughput sequencing of the genome of this tree uncovered an endogenous viral element, with a size of 7122 bp, corresponding to the complete genome of AcV1. Its sequence contains three open reading frames (ORFs: ORFs 1 and 2 shares 66%–73% amino acid similarity with members of the Caulimoviridae virus family, especially the Petunia vein clearing virus (PVCV, Petuvirus genus. ORF1 encodes a movement protein (MP; ORF2 a Reverse Transcriptase (RT and a Ribonuclease H (RNase H domain; and ORF3 showed no amino acid sequence similarity with any other known virus proteins. Analogous to other known endogenous pararetrovirus sequences (EPRVs, AcV1 is integrated in the genome of Maqui Berry and showed low viral transcriptional activity, which was detected by deep sequencing technology (DNA and RNA-seq. Phylogenetic analysis of AcV1 and other pararetroviruses revealed a closer resemblance with Petuvirus. Overall, our data suggests that AcV1 could be a new member of Caulimoviridae family, genus Petuvirus, and the first evidence of this kind of virus in a fruit plant.

  10. Implicitly Defined Neural Networks for Sequence Labeling

    Science.gov (United States)

    2017-07-31

    ularity has soared for the Long Short - Term Memory (LSTM) (Hochreiter and Schmidhuber, 1997) and vari- ants such as Gated Recurrent Unit (GRU) (Cho et...610. Sepp Hochreiter and Jürgen Schmidhuber. 1997. Long short - term memory . Neural computation 9(8):1735– 1780. Zhiheng Huang, Wei Xu, and Kai Yu. 2015...network are coupled together, in order to improve perfor- mance on complex, long -range dependencies in either direction of a sequence. We contrast our

  11. Rare earth elements during diagenesis of abyssal sediments: analogies with a transuranic element americium

    International Nuclear Information System (INIS)

    Boust, D.

    1987-03-01

    One of the possibilities for the storage of high-level radioactive wastes consists in burying them into abyssal sediments, the sediments being supposed to barrier out radionuclides migration. The objective of the work was to estimate the efficiency of sediment barrier with respect to americium. As there is no americium in abyssal sediments, an indirect approach was used: the behaviour of the rare earth elements, the best natural analogs of americium. They were analysed in a 15 m long core, from the Cap Verde abyssal plateau. The terrigenous phase derived from the African continent was modified by short-term processes (1-1000 years); the intermediate rare earth elements were dissolved. Mineral coatings, enriched in rare earth appeared. After burial, the evolution continued at a much slower rate (10 5 - 10 6 years). The rare elements of the mineral coatings derived from the dissolution of the terrigenous phase and from an additional source, deeper in the sediment column. The fluxes of rare earth elements from sediment to water column were estimated. In suboxic sediments, the dissolved particulate equilibrium was related to redox conditions. The short-term reactivity of americium was studied in laboratory experiments. Simple americium migration models showed that the sediments barrier was totally efficient with respect to americium. In the conditions, neptunium 237 a daughter product of americium 241 could induce fluxes of 10 16 atoms per year per ton of stored waste (10 -8 Ci y-1), during millions years, towards the water column [fr

  12. Approaches for in silico finishing of microbial genome sequences

    Directory of Open Access Journals (Sweden)

    Frederico Schmitt Kremer

    Full Text Available Abstract The introduction of next-generation sequencing (NGS had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as “drafts”, incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases tools that are available to facilitate genome finishing.

  13. Approaches for in silico finishing of microbial genome sequences.

    Science.gov (United States)

    Kremer, Frederico Schmitt; McBride, Alan John Alexander; Pinto, Luciano da Silva

    The introduction of next-generation sequencing (NGS) had a significant effect on the availability of genomic information, leading to an increase in the number of sequenced genomes from a large spectrum of organisms. Unfortunately, due to the limitations implied by the short-read sequencing platforms, most of these newly sequenced genomes remained as "drafts", incomplete representations of the whole genetic content. The previous genome sequencing studies indicated that finishing a genome sequenced by NGS, even bacteria, may require additional sequencing to fill the gaps, making the entire process very expensive. As such, several in silico approaches have been developed to optimize the genome assemblies and facilitate the finishing process. The present review aims to explore some free (open source, in many cases) tools that are available to facilitate genome finishing.

  14. Noncoding sequence classification based on wavelet transform analysis: part I

    Science.gov (United States)

    Paredes, O.; Strojnik, M.; Romo-Vázquez, R.; Vélez Pérez, H.; Ranta, R.; Garcia-Torales, G.; Scholl, M. K.; Morales, J. A.

    2017-09-01

    DNA sequences in human genome can be divided into the coding and noncoding ones. Coding sequences are those that are read during the transcription. The identification of coding sequences has been widely reported in literature due to its much-studied periodicity. Noncoding sequences represent the majority of the human genome. They play an important role in gene regulation and differentiation among the cells. However, noncoding sequences do not exhibit periodicities that correlate to their functions. The ENCODE (Encyclopedia of DNA elements) and Epigenomic Roadmap Project projects have cataloged the human noncoding sequences into specific functions. We study characteristics of noncoding sequences with wavelet analysis of genomic signals.

  15. Short circuit detection in the winding and operation of superconducting magnets

    International Nuclear Information System (INIS)

    Walstrom, P.L.

    1982-01-01

    Three categories of shorts will be discussed: (1) shorts to the metallic bobbin or other structural elements, (2) shorts between turns caused by instrumentation wires that are deliberately connected to a turn at the end (e.g., voltage taps) and that short out to another turn but are not completely severed in the process, and (3) short circuits between turns caused by direct contact due to insulation failure by chips of metal bridging turns and by instrumentation wires that bridge turns but are severed in the process of shorting

  16. SEQUENCING OF FLAX LIS-1 INSERTION SITE IN THE ALBIDUM GENOTYPE

    Directory of Open Access Journals (Sweden)

    Jana Žiarovská

    2012-12-01

    Full Text Available The paper presents a methodology of identifying the insertion site of LIS-1-1 (Linum Insertion Sequence 1 element in flax Albidum variety when growing under the in vitro combined with environmental stress conditions. Abiotic stress was induced by a reduced nutrient content in a growth medium. The LIS-1 insertion site amplification was reaLIS-1ed using the forward LIS-L: 5'-GGG CAG TTT AAC TGT AAC GAA - 3 'and revers LIS-R: 5'-GCT TGG ATT TAG ACT TGG CAA C - 3' primers by PCR. PCR product was sequenced by direct sequencing method to proove the nucleotide sequence for matching with database LIS-1 sequence. A comparison has been matched with the sequence of the amplified segment in the database for all nucleotides except the 11-position in the 5'-3 ' direction, where instead of the three adenine pair is a couple in the Albidum variety. Changes caused by mobile elements or insertion sequences result in common flax in variability that can be used for the purposes of development of effective marker identification or environment based markers development.

  17. Modeling of Prepregs during Automated Draping Sequences

    DEFF Research Database (Denmark)

    Krogh, Christian; Glud, Jens Ammitzbøll; Jakobsen, Johnny

    2017-01-01

    algorithm used to generate target points on the mold which are used as input to a draping sequence planner. The draping sequence planner prescribes the displacement history for each gripper in the drape tool and these displacements are then applied to each gripper in a transient model of the draping...... sequence. The model is based on a transient finite element analysis with the material’s constitutive behavior currently being approximated as linear elastic orthotropic. In-plane tensile and bias-extension tests as well as bending tests are conducted and used as input for the model. The virtual draping...

  18. The diploid genome sequence of an Asian individual

    DEFF Research Database (Denmark)

    Wang, Jun; Wang, Wei; Li, Ruiqiang

    2008-01-01

    Here we present the first diploid genome sequence of an Asian individual. The genome was sequenced to 36-fold average coverage using massively parallel sequencing technology. We aligned the short reads onto the NCBI human reference genome to 99.97% coverage, and guided by the reference genome, we...... used uniquely mapped reads to assemble a high-quality consensus sequence for 92% of the Asian individual's genome. We identified approximately 3 million single-nucleotide polymorphisms (SNPs) inside this region, of which 13.6% were not in the dbSNP database. Genotyping analysis showed that SNP...... identification had high accuracy and consistency, indicating the high sequence quality of this assembly. We also carried out heterozygote phasing and haplotype prediction against HapMap CHB and JPT haplotypes (Chinese and Japanese, respectively), sequence comparison with the two available individual genomes (J...

  19. Impact of elevated CO2 and nitrogen fertilization on foliar elemental composition in a short rotation poplar plantation

    International Nuclear Information System (INIS)

    Marinari, Sara; Calfapietra, Carlo; De Angelis, Paolo; Mugnozza, Giuseppe Scarascia; Grego, Stefano

    2007-01-01

    The experiment was carried out on a short rotation coppice culture of poplars (POP-EUROFACE, Central Italy), growing in a free air carbon dioxide enriched atmosphere (FACE). The specific objective of this work was to study whether elevated CO 2 and fertilization (two CO 2 treatments, elevated CO 2 and control, two N fertilization treatments, fertilized and unfertilized), as well as the interaction between treatments caused an unbalanced nutritional status of leaves in three poplar species (P. x euramericana, P. nigra and P. alba). Finally, we discuss the ecological implications of a possible change in foliar nutrients concentration. CO 2 enrichment reduced foliar nitrogen and increased the concentration of magnesium; whereas nitrogen fertilization had opposite effects on leaf nitrogen and magnesium concentrations. Moreover, the interaction between elevated CO 2 and N fertilization amplified some element unbalances such as the K/N-ratio. - CO 2 enrichment reduced foliar nitrogen and increased the magnesium concentration in poplar

  20. Possible interaction between B1 retrotransposon-containing sequences and β(major) globin gene transcriptional activation during MEL cell erythroid differentiation.

    Science.gov (United States)

    Vizirianakis, Ioannis S; Tezias, Sotirios S; Amanatiadou, Elsa P; Tsiftsoglou, Asterios S

    2012-01-01

    Repetitive sequences consist of >50% of mammalian genomic DNAs and among these SINEs (short interspersed nuclear elements), e.g. B1 elements, account for 8% of the mouse genome. In an effort to delineate the molecular mechanism(s) involved in the blockade of the in vitro differentiation program of MEL (murine erythroleukaemia) cells by treatment with methylation inhibitors, we detected a DNA region of 559 bp in chromosome 7 located downstream of the 3'-end of the β(major) globin gene (designated B1-559) with unique characteristics. We have fully characterized this B1-559 region that includes a B1 element, several repeats of ATG initiation codons and consensus DNA-binding sites for erythroid-specific transcription factors NF-E2 (nuclear factor-erythroid-derived 2), GATA-1 and EKLF (erythroid Krüppel-like factor). Fragments derived from B1-559 incubated with nuclear extracts form protein complexes in both undifferentiated and differentiated MEL cells. Transient reporter-gene experiments in MEL and human erythroleukaemia K-562 cells with recombinant constructs containing B1-559 fragments linked to HS-2 (hypersensitive site-2) sequences of human β-globin gene LCR (locus control region) indicated potential cooperation upon erythropoiesis and globin gene expression. The possible interaction between the B1-559 region and β(major) globin gene transcriptional activation upon execution of erythroid MEL cell differentiation programme is discussed. © The Author(s) Journal compilation © 2012 Portland Press Limited

  1. Identifying structural variants using linked-read sequencing data.

    Science.gov (United States)

    Elyanow, Rebecca; Wu, Hsin-Ta; Raphael, Benjamin J

    2017-11-03

    Structural variation, including large deletions, duplications, inversions, translocations, and other rearrangements, is common in human and cancer genomes. A number of methods have been developed to identify structural variants from Illumina short-read sequencing data. However, reliable identification of structural variants remains challenging because many variants have breakpoints in repetitive regions of the genome and thus are difficult to identify with short reads. The recently developed linked-read sequencing technology from 10X Genomics combines a novel barcoding strategy with Illumina sequencing. This technology labels all reads that originate from a small number (~5-10) DNA molecules ~50Kbp in length with the same molecular barcode. These barcoded reads contain long-range sequence information that is advantageous for identification of structural variants. We present Novel Adjacency Identification with Barcoded Reads (NAIBR), an algorithm to identify structural variants in linked-read sequencing data. NAIBR predicts novel adjacencies in a individual genome resulting from structural variants using a probabilistic model that combines multiple signals in barcoded reads. We show that NAIBR outperforms several existing methods for structural variant identification - including two recent methods that also analyze linked-reads - on simulated sequencing data and 10X whole-genome sequencing data from the NA12878 human genome and the HCC1954 breast cancer cell line. Several of the novel somatic structural variants identified in HCC1954 overlap known cancer genes. Software is available at compbio.cs.brown.edu/software. braphael@princeton.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  2. RNA-Seq analysis and gene discovery of Andrias davidianus using Illumina short read sequencing.

    Directory of Open Access Journals (Sweden)

    Fenggang Li

    Full Text Available The Chinese giant salamander, Andrias davidianus, is an important species in the course of evolution; however, there is insufficient genomic data in public databases for understanding its immunologic mechanisms. High-throughput transcriptome sequencing is necessary to generate an enormous number of transcript sequences from A. davidianus for gene discovery. In this study, we generated more than 40 million reads from samples of spleen and skin tissue using the Illumina paired-end sequencing technology. De novo assembly yielded 87,297 transcripts with a mean length of 734 base pairs (bp. Based on the sequence similarities, searching with known proteins, 38,916 genes were identified. Gene enrichment analysis determined that 981 transcripts were assigned to the immune system. Tissue-specific expression analysis indicated that 443 of transcripts were specifically expressed in the spleen and skin. Among these transcripts, 147 transcripts were found to be involved in immune responses and inflammatory reactions, such as fucolectin, β-defensins and lymphotoxin beta. Eight tissue-specific genes were selected for validation using real time reverse transcription quantitative PCR (qRT-PCR. The results showed that these genes were significantly more expressed in spleen and skin than in other tissues, suggesting that these genes have vital roles in the immune response. This work provides a comprehensive genomic sequence resource for A. davidianus and lays the foundation for future research on the immunologic and disease resistance mechanisms of A. davidianus and other amphibians.

  3. Filling the gap between sequence and function: a bioinformatics approach

    NARCIS (Netherlands)

    Bargsten, J.W.

    2014-01-01

    The research presented in this thesis focuses on deriving function from sequence information, with the emphasis on plant sequence data. Unravelling the impact of genomic elements, in most cases genes, on the phenotype of an organism is a major challenge in biological research and modern plant

  4. Wakefield excitation in plasma resonator by a sequence of relativistic electron bunches

    International Nuclear Information System (INIS)

    Kiselev, V.A.; Linnik, A.F.; Mirny, V.I.; Onishchenko, I.N.; Uskov, V.V.

    2008-01-01

    Wakefield excitation in a plasma resonator by a sequence of relativistic electron bunches with the purpose to increase excited field amplitude in comparison to waveguide case is experimentally investigated. A sequence of short electron bunches is produced by the linear resonant accelerator. Plasma resonator is formed at the beam-plasma discharge in rectangular metal waveguide filled with gas and closed by metal foil at entrance and movable short-circuited plunger at exit. Measurements of wakefield amplitude are performed showing considerably higher wakefield amplitude for resonator case

  5. Detection of secondary structure elements in proteins by hydrophobic cluster analysis.

    Science.gov (United States)

    Woodcock, S; Mornon, J P; Henrissat, B

    1992-10-01

    Hydrophobic cluster analysis (HCA) is a protein sequence comparison method based on alpha-helical representations of the sequences where the size, shape and orientation of the clusters of hydrophobic residues are primarily compared. The effectiveness of HCA has been suggested to originate from its potential ability to focus on the residues forming the hydrophobic core of globular proteins. We have addressed the robustness of the bidimensional representation used for HCA in its ability to detect the regular secondary structure elements of proteins. Various parameters have been studied such as those governing cluster size and limits, the hydrophobic residues constituting the clusters as well as the potential shift of the cluster positions with respect to the position of the regular secondary structure elements. The following results have been found to support the alpha-helical bidimensional representation used in HCA: (i) there is a positive correlation (clearly above background noise) between the hydrophobic clusters and the regular secondary structure elements in proteins; (ii) the hydrophobic clusters are centred on the regular secondary structure elements; (iii) the pitch of the helical representation which gives the best correspondence is that of an alpha-helix. The correspondence between hydrophobic clusters and regular secondary structure elements suggests a way to implement variable gap penalties during the automatic alignment of protein sequences.

  6. Estimating the short-circuit impedance

    DEFF Research Database (Denmark)

    Nielsen, Arne Hejde; Pedersen, Knud Ole Helgesen; Poulsen, Niels Kjølstad

    1997-01-01

    A method for establishing a complex value of the short-circuit impedance from naturally occurring variations in voltage and current is discussed. It is the symmetrical three phase impedance at the fundamental grid frequency there is looked for. The positive sequence components in voltage...... and current are derived each period, and the short-circuit impedance is estimated from variations in these components created by load changes in the grid. Due to the noisy and dynamic grid with high harmonic distortion it is necessary to threat the calculated values statistical. This is done recursively...... through a RLS-algorithm. The algorithms have been tested and implemented on a PC at a 132 kV substation supplying a rolling mill. Knowing the short-circuit impedance gives the rolling mill an opportunity to adjust the arc furnace operation to keep flicker below a certain level. Therefore, the PC performs...

  7. Frequency characteristics of coordinate sequences of linear recurrences over Galois rings

    Science.gov (United States)

    Kamlovskii, O. V.

    2013-12-01

    We consider some properties of the coordinate sequences of linear recurrences over Galois rings which characterize the possibility of regarding them as pseudo-random sequences. We study the periodicity properties, linear complexity and frequency characteristics of these sequences. Up to now, these parameters have been studied mainly in the case when the linear recurring sequence has maximal possible period. We investigate the coordinate sequences of linear recurrences of not necessarily maximal period. We obtain sharpened and generalized estimates for the number of elements and r-patterns on the cycles and intervals of these sequences.

  8. Frequency characteristics of coordinate sequences of linear recurrences over Galois rings

    International Nuclear Information System (INIS)

    Certification Research Center, Moscow (Russian Federation))" data-affiliation=" (LLC Certification Research Center, Moscow (Russian Federation))" >Kamlovskii, O V

    2013-01-01

    We consider some properties of the coordinate sequences of linear recurrences over Galois rings which characterize the possibility of regarding them as pseudo-random sequences. We study the periodicity properties, linear complexity and frequency characteristics of these sequences. Up to now, these parameters have been studied mainly in the case when the linear recurring sequence has maximal possible period. We investigate the coordinate sequences of linear recurrences of not necessarily maximal period. We obtain sharpened and generalized estimates for the number of elements and r-patterns on the cycles and intervals of these sequences

  9. Detecting authorized and unauthorized genetically modified organisms containing vip3A by real-time PCR and next-generation sequencing.

    Science.gov (United States)

    Liang, Chanjuan; van Dijk, Jeroen P; Scholtens, Ingrid M J; Staats, Martijn; Prins, Theo W; Voorhuijzen, Marleen M; da Silva, Andrea M; Arisi, Ana Carolina Maisonnave; den Dunnen, Johan T; Kok, Esther J

    2014-04-01

    The growing number of biotech crops with novel genetic elements increasingly complicates the detection of genetically modified organisms (GMOs) in food and feed samples using conventional screening methods. Unauthorized GMOs (UGMOs) in food and feed are currently identified through combining GMO element screening with sequencing the DNA flanking these elements. In this study, a specific and sensitive qPCR assay was developed for vip3A element detection based on the vip3Aa20 coding sequences of the recently marketed MIR162 maize and COT102 cotton. Furthermore, SiteFinding-PCR in combination with Sanger, Illumina or Pacific BioSciences (PacBio) sequencing was performed targeting the flanking DNA of the vip3Aa20 element in MIR162. De novo assembly and Basic Local Alignment Search Tool searches were used to mimic UGMO identification. PacBio data resulted in relatively long contigs in the upstream (1,326 nucleotides (nt); 95 % identity) and downstream (1,135 nt; 92 % identity) regions, whereas Illumina data resulted in two smaller contigs of 858 and 1,038 nt with higher sequence identity (>99 % identity). Both approaches outperformed Sanger sequencing, underlining the potential for next-generation sequencing in UGMO identification.

  10. A SINE-derived element constitutes a unique modular enhancer for mammalian diencephalic Fgf8.

    Directory of Open Access Journals (Sweden)

    Akiko Nakanishi

    Full Text Available Transposable elements, including short interspersed repetitive elements (SINEs, comprise nearly half the mammalian genome. Moreover, they are a major source of conserved non-coding elements (CNEs, which play important functional roles in regulating development-related genes, such as enhancing and silencing, serving for the diversification of morphological and physiological features among species. We previously reported a novel SINE family, AmnSINE1, as part of mammalian-specific CNEs. One AmnSINE1 locus, named AS071, showed an enhancer property in the developing mouse diencephalon. Indeed, AS071 appears to recapitulate the expression of diencephalic fibroblast growth factor 8 (Fgf8. Here we established three independent lines of AS071-transgenic mice and performed detailed expression profiling of AS071-enhanced lacZ in comparison with that of Fgf8 across embryonic stages. We demonstrate that AS071 is a distal enhancer that directs Fgf8 expression in the developing diencephalon. Furthermore, enhancer assays with constructs encoding partially deleted AS071 sequence revealed a unique modular organization in which AS071 contains at least three functionally distinct sub-elements that cooperatively direct the enhancer activity in three diencephalic domains, namely the dorsal midline and the lateral wall of the diencephalon, and the ventral midline of the hypothalamus. Interestingly, the AmnSINE1-derived sub-element was found to specify the enhancer activity to the ventral midline of the hypothalamus. To our knowledge, this is the first discovery of an enhancer element that could be separated into respective sub-elements that determine regional specificity and/or the core enhancing activity. These results potentiate our understanding of the evolution of retroposon-derived cis-regulatory elements as well as the basis for future studies of the molecular mechanism underlying the determination of domain-specificity of an enhancer.

  11. Temporal Clustering and Sequencing in Short-Term Memory and Episodic Memory

    Science.gov (United States)

    Farrell, Simon

    2012-01-01

    A model of short-term memory and episodic memory is presented, with the core assumptions that (a) people parse their continuous experience into episodic clusters and (b) items are clustered together in memory as episodes by binding information within an episode to a common temporal context. Along with the additional assumption that information…

  12. Striking structural dynamism and nucleotide sequence variation of the transposon Galileo in the genome of Drosophila mojavensis.

    Science.gov (United States)

    Marzo, Mar; Bello, Xabier; Puig, Marta; Maside, Xulio; Ruiz, Alfredo

    2013-02-04

    Galileo is a transposable element responsible for the generation of three chromosomal inversions in natural populations of Drosophila buzzatii. Although the most characteristic feature of Galileo is the long internally-repetitive terminal inverted repeats (TIRs), which resemble the Drosophila Foldback element, its transposase-coding sequence has led to its classification as a member of the P-element superfamily (Class II, subclass 1, TIR order). Furthermore, Galileo has a wide distribution in the genus Drosophila, since it has been found in 6 of the 12 Drosophila sequenced genomes. Among these species, D. mojavensis, the one closest to D. buzzatii, presented the highest diversity in sequence and structure of Galileo elements. In the present work, we carried out a thorough search and annotation of all the Galileo copies present in the D. mojavensis sequenced genome. In our set of 170 Galileo copies we have detected 5 Galileo subfamilies (C, D, E, F, and X) with different structures ranging from nearly complete, to only 2 TIR or solo TIR copies. Finally, we have explored the structural and length variation of the Galileo copies that point out the relatively frequent rearrangements within and between Galileo elements. Different mechanisms responsible for these rearrangements are discussed. Although Galileo is a transposable element with an ancient history in the D. mojavensis genome, our data indicate a recent transpositional activity. Furthermore, the dynamism in sequence and structure, mainly affecting the TIRs, suggests an active exchange of sequences among the copies. This exchange could lead to new subfamilies of the transposon, which could be crucial for the long-term survival of the element in the genome.

  13. High-Specificity Targeted Functional Profiling in Microbial Communities with ShortBRED.

    Directory of Open Access Journals (Sweden)

    James Kaminski

    2015-12-01

    Full Text Available Profiling microbial community function from metagenomic sequencing data remains a computationally challenging problem. Mapping millions of DNA reads from such samples to reference protein databases requires long run-times, and short read lengths can result in spurious hits to unrelated proteins (loss of specificity. We developed ShortBRED (Short, Better Representative Extract Dataset to address these challenges, facilitating fast, accurate functional profiling of metagenomic samples. ShortBRED consists of two components: (i a method that reduces reference proteins of interest to short, highly representative amino acid sequences ("markers" and (ii a search step that maps reads to these markers to quantify the relative abundance of their associated proteins. After evaluating ShortBRED on synthetic data, we applied it to profile antibiotic resistance protein families in the gut microbiomes of individuals from the United States, China, Malawi, and Venezuela. Our results support antibiotic resistance as a core function in the human gut microbiome, with tetracycline-resistant ribosomal protection proteins and Class A beta-lactamases being the most widely distributed resistance mechanisms worldwide. ShortBRED markers are applicable to other homology-based search tasks, which we demonstrate here by identifying phylogenetic signatures of antibiotic resistance across more than 3,000 microbial isolate genomes. ShortBRED can be applied to profile a wide variety of protein families of interest; the software, source code, and documentation are available for download at http://huttenhower.sph.harvard.edu/shortbred.

  14. Insertion sequences enrichment in extreme Red sea brine pool vent

    KAUST Repository

    Elbehery, Ali H. A.

    2016-12-03

    Mobile genetic elements are major agents of genome diversification and evolution. Limited studies addressed their characteristics, including abundance, and role in extreme habitats. One of the rare natural habitats exposed to multiple-extreme conditions, including high temperature, salinity and concentration of heavy metals, are the Red Sea brine pools. We assessed the abundance and distribution of different mobile genetic elements in four Red Sea brine pools including the world’s largest known multiple-extreme deep-sea environment, the Red Sea Atlantis II Deep. We report a gradient in the abundance of mobile genetic elements, dramatically increasing in the harshest environment of the pool. Additionally, we identified a strong association between the abundance of insertion sequences and extreme conditions, being highest in the harshest and deepest layer of the Red Sea Atlantis II Deep. Our comparative analyses of mobile genetic elements in secluded, extreme and relatively non-extreme environments, suggest that insertion sequences predominantly contribute to polyextremophiles genome plasticity.

  15. Damper mechanism for nuclear reactor control elements

    International Nuclear Information System (INIS)

    Taft, W.E.

    1976-01-01

    A damper mechanism which provides a nuclear reactor control element decelerating function at the end of the scram stroke is described. The total damping function is produced by the combination of two assemblies, which operate in sequence. First, a tapered dashram assembly decelerates the control element to a lower velocity, after which a spring hydraulic damper assembly takes over to complete the final damping. 3 claims, 2 figures

  16. Direct calculation of off-diagonal matrix elements

    International Nuclear Information System (INIS)

    Killingbeck, J P; Jolicard, G

    2011-01-01

    Gauss elimination is used in a sequence of calculations which give the squares of the off-diagonal matrix elements of x between quartic oscillator eigenstates, in a modification of the original sum rule approach of Tipping et al to the problem. New and more flexible methods are then devised and tested and are shown to permit the isolation and calculation of individual squared matrix elements of x and x 2 .

  17. Specificity determinants for the abscisic acid response element.

    Science.gov (United States)

    Sarkar, Aditya Kumar; Lahiri, Ansuman

    2013-01-01

    Abscisic acid (ABA) response elements (ABREs) are a group of cis-acting DNA elements that have been identified from promoter analysis of many ABA-regulated genes in plants. We are interested in understanding the mechanism of binding specificity between ABREs and a class of bZIP transcription factors known as ABRE binding factors (ABFs). In this work, we have modeled the homodimeric structure of the bZIP domain of ABRE binding factor 1 from Arabidopsis thaliana (AtABF1) and studied its interaction with ACGT core motif-containing ABRE sequences. We have also examined the variation in the stability of the protein-DNA complex upon mutating ABRE sequences using the protein design algorithm FoldX. The high throughput free energy calculations successfully predicted the ability of ABF1 to bind to alternative core motifs like GCGT or AAGT and also rationalized the role of the flanking sequences in determining the specificity of the protein-DNA interaction.

  18. Analysing breast tissue composition with MRI using currently available short, simple sequences

    International Nuclear Information System (INIS)

    Chau, A.C.M.; Hua, J.; Taylor, D.B.

    2016-01-01

    Aim: To determine the most robust commonly available magnetic resonance imaging (MRI) sequence to quantify breast tissue composition at 1.5 T. Materials and methods: Two-dimensional (2D) T1-weighted, Dixon fat, Dixon water and SPAIR images were obtained from five participants and a breast phantom using a 1.5 T Siemens Aera MRI system. Manual segmentation of the breasts was performed, and an in-house computer program was used to generate signal intensity histograms. Relative trough depth and relative peak separation were used to determine the robustness of the images for quantifying the two breast tissues. Total breast volumes and percentage breast densities calculated using the four sequences were compared. Results: Dixon fat histograms had consistently low relative trough depth and relative peak separation compared to those obtained using other sequences. There was no significant difference in total breast volumes and percentage breast densities of the participants or breast phantom using Dixon fat and 2D T1-weighted histograms. Dixon water and SPAIR histograms were not suitable for quantifying breast tissue composition. Conclusion: Dixon fat images are the most robust for the quantification of breast tissue composition using a signal intensity histogram. - Highlights: • Signal intensity histogram analysis can determine robustness of images for quantification of breast tissue composition. • Dixon fat images are the most robust. • The characteristics of the signal intensity histograms from Dixon water and SPAIR images make quantification unsuitable.

  19. Investigating Effects of Screen Layout Elements on Interface and Screen Design Aesthetics

    Directory of Open Access Journals (Sweden)

    Ahamed Altaboli

    2011-01-01

    Full Text Available A recent study suggested the use of the screen layout elements of balance, unity, and sequence as a part of a computational model of interface aesthetics. It is argued that these three elements are the most contributed terms in the model. In the current study, a controlled experiment was designed and conducted to systematically investigate effects of these three elements (balance, unity, and sequence on the perceived interface aesthetics. Results showed that the three elements have significant effects on the perceived interface aesthetics. Significant interactions were also found among the three elements. A regression model relating the perceived visual aesthetics to the three elements was constructed. When validating the model using standard questionnaire scores of real web pages, high correlations were found between the values computed by the model and scores of questionnaire items related to visual layout of the web pages, indicating that layout-based measures are good at assessing the classical dimension of website aesthetics.

  20. Lacunary ideal convergence of multiple sequences

    Directory of Open Access Journals (Sweden)

    Bipan Hazarika

    2016-01-01

    Full Text Available An ideal I is a family of subsets of N×N which is closed under taking finite unions and subsets of its elements. In this article, the concept of lacunary ideal convergence of double sequences has been introduced. Also the relation between lacunary ideal convergent and lacunary Cauchy double sequences has been established. Furthermore, the notions of lacunary ideal limit point and lacunary ideal cluster points have been introduced and find the relation between these two notions. Finally, we have studied the properties such as solidity, monotonic.

  1. In silico Analysis of osr40c1 Promoter Sequence Isolated from Indica Variety Pokkali

    Directory of Open Access Journals (Sweden)

    W.S.I. de Silva

    2017-07-01

    Full Text Available The promoter region of a drought and abscisic acid (ABA inducible gene, osr40c1, was isolated from a salt-tolerant indica rice variety Pokkali, which is 670 bp upstream of the putative translation start codon. In silico promoter analysis of resulted sequence showed that at least 15 types of putative motifs were distributed within the sequence, including two types of common promoter elements, TATA and CAAT boxes. Additionally, several putative cis-acing regulatory elements which may be involved in regulation of osr40c1 expression under different conditions were found in the 5′-upstream region of osr40c1. These are ABA-responsive element, light-responsive elements (ATCT-motif, Box I, G-box, GT1-motif, Gap-box and Sp1, myeloblastosis oncogene response element (CCAAT-box, auxin responsive element (TGA-element, gibberellin-responsive element (GARE-motif and fungal-elicitor responsive elements (Box E and Box-W1. A putative regulatory element, required for endosperm-specific pattern of gene expression designated as Skn-1 motif, was also detected in the Pokkali osr40c1 promoter region. In conclusion, the bioinformatic analysis of osr40c1 promoter region isolated from indica rice variety Pokkali led to the identification of several important stress-responsive cis-acting regulatory elements, and therefore, the isolated promoter sequence could be employed in rice genetic transformation to mediate expression of abiotic stress induced genes.

  2. Memory and learning with rapid audiovisual sequences

    Science.gov (United States)

    Keller, Arielle S.; Sekuler, Robert

    2015-01-01

    We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed. PMID:26575193

  3. Memory and learning with rapid audiovisual sequences.

    Science.gov (United States)

    Keller, Arielle S; Sekuler, Robert

    2015-01-01

    We examined short-term memory for sequences of visual stimuli embedded in varying multisensory contexts. In two experiments, subjects judged the structure of the visual sequences while disregarding concurrent, but task-irrelevant auditory sequences. Stimuli were eight-item sequences in which varying luminances and frequencies were presented concurrently and rapidly (at 8 Hz). Subjects judged whether the final four items in a visual sequence identically replicated the first four items. Luminances and frequencies in each sequence were either perceptually correlated (Congruent) or were unrelated to one another (Incongruent). Experiment 1 showed that, despite encouragement to ignore the auditory stream, subjects' categorization of visual sequences was strongly influenced by the accompanying auditory sequences. Moreover, this influence tracked the similarity between a stimulus's separate audio and visual sequences, demonstrating that task-irrelevant auditory sequences underwent a considerable degree of processing. Using a variant of Hebb's repetition design, Experiment 2 compared musically trained subjects and subjects who had little or no musical training on the same task as used in Experiment 1. Test sequences included some that intermittently and randomly recurred, which produced better performance than sequences that were generated anew for each trial. The auditory component of a recurring audiovisual sequence influenced musically trained subjects more than it did other subjects. This result demonstrates that stimulus-selective, task-irrelevant learning of sequences can occur even when such learning is an incidental by-product of the task being performed.

  4. A first report and complete genome sequence of alfalfa enamovirus from Sudan

    Science.gov (United States)

    A full genome sequence of a viral pathogen, provisionally named alfalfa enamovirus 2 (AEV-2), was reconstructed from short reads obtained by Illumina RNA sequencing of alfalfa sample originating from Sudan. Ambiguous nucleotides in the resultant consensus assembly and identity of the predicted virus...

  5. Hybridization Capture Using Short PCR Products Enriches Small Genomes by Capturing Flanking Sequences (CapFlank)

    DEFF Research Database (Denmark)

    Tsangaras, Kyriakos; Wales, Nathan; Sicheritz-Pontén, Thomas

    2014-01-01

    nucleotides) can result in enrichment across entire mitochondrial and bacterial genomes. Our findings suggest that some of the off-target sequences derived in capture experiments are non-randomly enriched, and that CapFlank will facilitate targeted enrichment of large contiguous sequences with minimal prior...

  6. Lasers with intra-cavity phase elements

    Science.gov (United States)

    Gulses, A. Alkan; Kurtz, Russell; Islas, Gabriel; Anisimov, Igor

    2018-02-01

    Conventional laser resonators yield multimodal output, especially at high powers and short cavity lengths. Since highorder modes exhibit large divergence, it is desirable to suppress them to improve laser quality. Traditionally, such modal discriminations can be achieved by simple apertures that provide absorptive loss for large diameter modes, while allowing the lower orders, such as the fundamental Gaussian, to pass through. However, modal discrimination may not be sufficient for short-cavity lasers, resulting in multimodal operation as well as power loss and overheating in the absorptive part of the aperture. In research to improve laser mode control with minimal energy loss, systematic experiments have been executed using phase-only elements. These were composed of an intra-cavity step function and a diffractive out-coupler made of a computer-generated hologram. The platform was a 15-cm long solid-state laser that employs a neodymium-doped yttrium orthovanadate crystal rod, producing 1064 nm multimodal laser output. The intra-cavity phase elements (PEs) were shown to be highly effective in obtaining beams with reduced M-squared values and increased output powers, yielding improved values of radiance. The utilization of more sophisticated diffractive elements is promising for more difficult laser systems.

  7. ReadDepth: a parallel R package for detecting copy number alterations from short sequencing reads.

    Directory of Open Access Journals (Sweden)

    Christopher A Miller

    2011-01-01

    Full Text Available Copy number alterations are important contributors to many genetic diseases, including cancer. We present the readDepth package for R, which can detect these aberrations by measuring the depth of coverage obtained by massively parallel sequencing of the genome. In addition to achieving higher accuracy than existing packages, our tool runs much faster by utilizing multi-core architectures to parallelize the processing of these large data sets. In contrast to other published methods, readDepth does not require the sequencing of a reference sample, and uses a robust statistical model that accounts for overdispersed data. It includes a method for effectively increasing the resolution obtained from low-coverage experiments by utilizing breakpoint information from paired end sequencing to do positional refinement. We also demonstrate a method for inferring copy number using reads generated by whole-genome bisulfite sequencing, thus enabling integrative study of epigenomic and copy number alterations. Finally, we apply this tool to two genomes, showing that it performs well on genomes sequenced to both low and high coverage. The readDepth package runs on Linux and MacOSX, is released under the Apache 2.0 license, and is available at http://code.google.com/p/readdepth/.

  8. Short RNA guides cleavage by eukaryotic RNase III.

    Directory of Open Access Journals (Sweden)

    Bruno Lamontagne

    Full Text Available In eukaryotes, short RNAs guide a variety of enzymatic activities that range from RNA editing to translation repression. It is hypothesized that pre-existing proteins evolved to bind and use guide RNA during evolution. However, the capacity of modern proteins to adopt new RNA guides has never been demonstrated. Here we show that Rnt1p, the yeast orthologue of the bacterial dsRNA-specific RNase III, can bind short RNA transcripts and use them as guides for sequence-specific cleavage. Target cleavage occurred at a constant distance from the Rnt1p binding site, leaving the guide RNA intact for subsequent cleavage. Our results indicate that RNase III may trigger sequence-specific RNA degradation independent of the RNAi machinery, and they open the road for a new generation of precise RNA silencing tools that do not trigger a dsRNA-mediated immune response.

  9. Molecular genetics and epigenetics of CACTA elements

    KAUST Repository

    Fedoroff, Nina V.

    2013-01-01

    The CACTA transposons, so named for a highly conserved motif at element ends, comprise one of the most abundant superfamilies of Class 2 (cut-and-paste) plant transposons. CACTA transposons characteristically include subterminal sequences of several

  10. Draft Genome Sequences of Four Hospital-Associated Pseudomonas putida Isolates.

    Science.gov (United States)

    Mustapha, Mustapha M; Marsh, Jane W; Ezeonwuka, Chinelo D; Pasculle, Anthony W; Pacey, Marissa P; Querry, Ashley M; Muto, Carlene A; Harrison, Lee H

    2016-09-29

    We present here the draft genome sequences of four Pseudomonas putida isolates belonging to a single clone suspected for nosocomial transmission between patients and a bronchoscope in a tertiary hospital. The four genome sequences belong to a single lineage but contain differences in their mobile genetic elements. Copyright © 2016 Mustapha et al.

  11. Chemistry of the superheavy elements.

    Science.gov (United States)

    Schädel, Matthias

    2015-03-13

    The quest for superheavy elements (SHEs) is driven by the desire to find and explore one of the extreme limits of existence of matter. These elements exist solely due to their nuclear shell stabilization. All 15 presently 'known' SHEs (11 are officially 'discovered' and named) up to element 118 are short-lived and are man-made atom-at-a-time in heavy ion induced nuclear reactions. They are identical to the transactinide elements located in the seventh period of the periodic table beginning with rutherfordium (element 104), dubnium (element 105) and seaborgium (element 106) in groups 4, 5 and 6, respectively. Their chemical properties are often surprising and unexpected from simple extrapolations. After hassium (element 108), chemistry has now reached copernicium (element 112) and flerovium (element 114). For the later ones, the focus is on questions of their metallic or possibly noble gas-like character originating from interplay of most pronounced relativistic effects and electron-shell effects. SHEs provide unique opportunities to get insights into the influence of strong relativistic effects on the atomic electrons and to probe 'relativistically' influenced chemical properties and the architecture of the periodic table at its farthest reach. In addition, they establish a test bench to challenge the validity and predictive power of modern fully relativistic quantum chemical models. © 2015 The Author(s) Published by the Royal Society. All rights reserved.

  12. Applications and Case Studies of the Next-Generation Sequencing Technologies in Food, Nutrition and Agriculture.

    Science.gov (United States)

    Next-generation sequencing technologies are able to produce high-throughput short sequence reads in a cost-effective fashion. The emergence of these technologies has not only facilitated genome sequencing but also changed the landscape of life sciences. Here I survey their major applications ranging...

  13. [Influence of "prehistory" of sequential movements of the right and the left hand on reproduction: coding of positions, movements and sequence structure].

    Science.gov (United States)

    Bobrova, E V; Liakhovetskiĭ, V A; Borshchevskaia, E R

    2011-01-01

    The dependence of errors during reproduction of a sequence of hand movements without visual feedback on the previous right- and left-hand performance ("prehistory") and on positions in space of sequence elements (random or ordered by the explicit rule) was analyzed. It was shown that the preceding information about the ordered positions of the sequence elements was used during right-hand movements, whereas left-hand movements were performed with involvement of the information about the random sequence. The data testify to a central mechanism of the analysis of spatial structure of sequence elements. This mechanism activates movement coding specific for the left hemisphere (vector coding) in case of an ordered sequence structure and positional coding specific for the right hemisphere in case of a random sequence structure.

  14. ReRep: Computational detection of repetitive sequences in genome survey sequences (GSS

    Directory of Open Access Journals (Sweden)

    Alves-Ferreira Marcelo

    2008-09-01

    Full Text Available Abstract Background Genome survey sequences (GSS offer a preliminary global view of a genome since, unlike ESTs, they cover coding as well as non-coding DNA and include repetitive regions of the genome. A more precise estimation of the nature, quantity and variability of repetitive sequences very early in a genome sequencing project is of considerable importance, as such data strongly influence the estimation of genome coverage, library quality and progress in scaffold construction. Also, the elimination of repetitive sequences from the initial assembly process is important to avoid errors and unnecessary complexity. Repetitive sequences are also of interest in a variety of other studies, for instance as molecular markers. Results We designed and implemented a straightforward pipeline called ReRep, which combines bioinformatics tools for identifying repetitive structures in a GSS dataset. In a case study, we first applied the pipeline to a set of 970 GSSs, sequenced in our laboratory from the human pathogen Leishmania braziliensis, the causative agent of leishmaniosis, an important public health problem in Brazil. We also verified the applicability of ReRep to new sequencing technologies using a set of 454-reads of an Escheria coli. The behaviour of several parameters in the algorithm is evaluated and suggestions are made for tuning of the analysis. Conclusion The ReRep approach for identification of repetitive elements in GSS datasets proved to be straightforward and efficient. Several potential repetitive sequences were found in a L. braziliensis GSS dataset generated in our laboratory, and further validated by the analysis of a more complete genomic dataset from the EMBL and Sanger Centre databases. ReRep also identified most of the E. coli K12 repeats prior to assembly in an example dataset obtained by automated sequencing using 454 technology. The parameters controlling the algorithm behaved consistently and may be tuned to the properties

  15. How high is visual short-term memory capacity for object layout?

    Science.gov (United States)

    Sanocki, Thomas; Sellers, Eric; Mittelstadt, Jeff; Sulman, Noah

    2010-05-01

    Previous research measuring visual short-term memory (VSTM) suggests that the capacity for representing the layout of objects is fairly high. In four experiments, we further explored the capacity of VSTM for layout of objects, using the change detection method. In Experiment 1, participants retained most of the elements in displays of 4 to 8 elements. In Experiments 2 and 3, with up to 20 elements, participants retained many of them, reaching a capacity of 13.4 stimulus elements. In Experiment 4, participants retained much of a complex naturalistic scene. In most cases, increasing display size caused only modest reductions in performance, consistent with the idea of configural, variable-resolution grouping. The results indicate that participants can retain a substantial amount of scene layout information (objects and locations) in short-term memory. We propose that this is a case of remote visual understanding, where observers' ability to integrate information from a scene is paramount.

  16. Element segregation behavior of aluminum-copper alloy ZL205A

    Directory of Open Access Journals (Sweden)

    Fan Li

    2014-11-01

    Full Text Available In aluminum-copper alloy, the segregation has a severe bad effect on the alloying degree, strength and corrosion resistance. A deeper understanding of element segregation behavior will have a great significance on the prevention of segregation. In the study, the element segregation behavior of ZL205A aluminum-copper alloy was investigated by examining isothermally solidified samples using scanning electron microscopy and energy dispersive spectroscopy. The calculated results of segregation coefficients show that Cu and Mn are negative segregation elements; while Ti, V and Zr are positive segregation elements. The sequence of element segregation degree from the greatest to the least in ZL205A alloy is Cu, Mn, V, Ti, Zr and Al. The density of residual liquid is expected to increase with a decrease in the quenching temperature ranging from 630 ºC to 550 ºC. The calculated results confirm that the quenching temperature has an insignificant effect on the liquid density; and the variation of density is mainly due to element segregation. Consequently, segregations of Al, Cu and Mn lead to an increase in density, but Ti, V and Zr present the opposite effect. The contribution of each element to the variation of the liquid density was analyzed. The sequence of contributions of alloying elements to the variation of total liquid density is Cu﹥Al﹥Mn﹥V﹥Ti﹥Zr.

  17. Transposable elements and circular DNAs

    KAUST Repository

    Mourier, Tobias

    2016-09-26

    Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.

  18. Transposable elements and circular DNAs

    KAUST Repository

    Mourier, Tobias

    2016-01-01

    Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.

  19. Regulatable elements in the high-level waste management program

    International Nuclear Information System (INIS)

    Oakley, D.

    1979-01-01

    Regulatable elements of a deep geological nuclear waste isolation system are those characteristics of a candidate system which need to be specified to achieve control of its performance. This report identifies the regulatable elements with respect to waste form, repository design, site suitability, and the modeling and decision analysis processes. Regulatable elements in each section are listed and described briefly as they affect the short-term and long-term performance of a deep geological repository

  20. Music Learning with Long Short Term Memory Networks

    OpenAIRE

    Colombo, Florian François

    2015-01-01

    Humans are able to learn and compose complex, yet beautiful, pieces of music as seen in e.g. the highly complicated works of J.S. Bach. However, how our brain is able to store and produce these very long temporal sequences is still an open question. Long short-term memory (LSTM) artificial neural networks have been shown to be efficient in sequence learning tasks thanks to their inherent ability to bridge long time lags between input events and their target signals. Here, I investigate the po...

  1. In Vivo Enhancer Analysis Chromosome 16 Conserved NoncodingSequences

    Energy Technology Data Exchange (ETDEWEB)

    Pennacchio, Len A.; Ahituv, Nadav; Moses, Alan M.; Nobrega,Marcelo; Prabhakar, Shyam; Shoukry, Malak; Minovitsky, Simon; Visel,Axel; Dubchak, Inna; Holt, Amy; Lewis, Keith D.; Plajzer-Frick, Ingrid; Akiyama, Jennifer; De Val, Sarah; Afzal, Veena; Black, Brian L.; Couronne, Olivier; Eisen, Michael B.; Rubin, Edward M.

    2006-02-01

    The identification of enhancers with predicted specificitiesin vertebrate genomes remains a significant challenge that is hampered bya lack of experimentally validated training sets. In this study, weleveraged extreme evolutionary sequence conservation as a filter toidentify putative gene regulatory elements and characterized the in vivoenhancer activity of human-fish conserved and ultraconserved1 noncodingelements on human chromosome 16 as well as such elements from elsewherein the genome. We initially tested 165 of these extremely conservedsequences in a transgenic mouse enhancer assay and observed that 48percent (79/165) functioned reproducibly as tissue-specific enhancers ofgene expression at embryonic day 11.5. While driving expression in abroad range of anatomical structures in the embryo, the majority of the79 enhancers drove expression in various regions of the developingnervous system. Studying a set of DNA elements that specifically droveforebrain expression, we identified DNA signatures specifically enrichedin these elements and used these parameters to rank all ~;3,400human-fugu conserved noncoding elements in the human genome. The testingof the top predictions in transgenic mice resulted in a three-foldenrichment for sequences with forebrain enhancer activity. These datadramatically expand the catalogue of in vivo-characterized human geneenhancers and illustrate the future utility of such training sets for avariety of iological applications including decoding the regulatoryvocabulary of the human genome.

  2. Seedling lethality in Nicotiana plumbaginifolia conferred by Ds transposable element insertion into a plant-specific gene.

    Science.gov (United States)

    Majira, Amel; Domin, Monique; Grandjean, Olivier; Gofron, Krystyna; Houba-Hérin, Nicole

    2002-10-01

    A seedling lethal mutant of Nicotiana plumbaginifolia (sdl-1) was isolated by transposon tagging using a maize Dissociation (Ds) element. The insertion mutation was produced by direct co-transformation of protoplasts with two plasmids: one containing Ds and a second with an Ac transposase gene. sdl-1 seedlings exhibit several phenotypes: swollen organs, short hypocotyls in light and dark conditions, and enlarged and multinucleated cells, that altogether suggest cell growth defects. Mutant cells are able to proliferate under in vitro culture conditions. Genomic DNA sequences bordering the transposon were used to recover cDNA from the normal allele. Complementation of the mutant phenotype with the cDNA confirmed that the transposon had caused the mutation. The Ds element was inserted into the first exon of the open reading frame and the homozygous mutant lacked detectable transcript. Phenocopies of the mutant were obtained by an antisense approach. SDL-1 encodes a novel protein found in several plant genomes but apparently missingfrom animal and fungal genomes; the protein is highly conserved and has a potential plastid targeting motif.

  3. Casimir elements of epsilon Lie algebras

    International Nuclear Information System (INIS)

    Scheunert, M.

    1982-10-01

    The classical framework for investigating the Casimir elements of a Lie algebra is generalized to the case of an epsilon Lie algebra L. We construct the standard L-module isomorphism of the epsilon-symmetric algebra of L onto its enveloping algebra and we introduce the Harish-Chandra homomorphism. In case the generators of L can be written in a canonical two-index form, we construct the associated standard sequence of Casimir elements and derive a formula for their eigenvalues in an arbitrary highest weight module. (orig.)

  4. Preferred Hosts for Short-Period Exoplanets

    Science.gov (United States)

    Kohler, Susanna

    2015-12-01

    In an effort to learn more about how planets form around their host stars, a team of scientists has analyzed the population of Kepler-discovered exoplanet candidates, looking for trends in where theyre found.Planetary OccurrenceSince its launch in 2009, Kepler has found thousands of candidate exoplanets around a variety of star types. Especially intriguing is the large population of super-Earths and mini-Neptunes planets with masses between that of Earth and Neptune that have short orbital periods. How did they come to exist so close to their host star? Did they form in situ, or migrate inwards, or some combination of both processes?To constrain these formation mechanisms, a team of scientists led by Gijs Mulders (University of Arizona and NASAs NExSS coalition) analyzed the population of Kepler planet candidates that have orbital periods between 2 and 50 days.Mulders and collaborators used statistical reconstructions to find the average number of planets, within this orbital range, around each star in the Kepler field. They then determined how this planet occurrence rate changed for different spectral types and therefore the masses of the host stars: do low-mass M-dwarf stars host more or fewer planets than higher-mass, main-sequence F, G, or K stars?Challenging ModelsAuthors estimates for the occurrence rate for short-period planets of different radii around M-dwarfs (purple) and around F, G, and K-type stars (blue). [Mulders et al. 2015]The team found that M dwarfs, compared to F, G, or K stars, host about half as many large planets with orbital periods of P 50 days. But, surprisingly, they host significantly more small planets, racking up an average of 3.5 times the number of planets in the size range of 12.8 Earth-radii.Could it be that M dwarfs have a lower total mass of planets, but that mass is distributed into more, smaller planets? Apparently not: the authors show that the mass of heavy elements trapped in short-orbital-period planets is higher for M

  5. Transcription of Gypsy Elements in a Y-Chromosome Male Fertility Gene of Drosophila Hydei

    Science.gov (United States)

    Hochstenbach, R.; Harhangi, H.; Schouren, K.; Bindels, P.; Suijkerbuijk, R.; Hennig, W.

    1996-01-01

    We have found that defective gypsy retrotransposons are a major constituent of the lampbrush loop pair Nooses in the short arm of the Y chromosome of Drosophila hydei. The loop pair is formed by male fertility gene Q during the primary spermatocyte stage of spermatogenesis, each loop being a single transcription unit with an estimated length of 260 kb. Using fluorescent in situ hybridization, we show that throughout the loop transcripts gypsy elements are interspersed with blocks of a tandemly repetitive Y-specific DNA sequence, ay1. Nooses transcripts containing both sequence types show a wide size range on Northern blots, do not migrate to the cytoplasm, and are degraded just before the first meiotic division. Only one strand of ay1 and only the coding strand of gypsy can be detected in the loop transcripts. However, as cloned genomic DNA fragments also display opposite orientations of ay1 and gypsy, such DNA sections cannot be part of the Nooses. Hence, they are most likely derived from the flanking heterochromatin. The direction of transcription of ay1 and gypsy thus appears to be of a functional significance. PMID:8852843

  6. Proceedings of transuranium elements

    International Nuclear Information System (INIS)

    Anon.

    1992-01-01

    The identification of the first synthetic elements was established by chemical evidence. Conclusive proof of the synthesis of the first artificial element, technetium, was published in 1937 by Perrier and Segre. An essential aspect of their achievement was the prediction of the chemical properties of element 43, which had been missing from the periodic table and which was expected to have properties similar to those of manganese and rhenium. The discovery of other artificial elements, astatine and francium, was facilitated in 1939-1940 by the prediction of their chemical properties. A little more than 50 years ago, in the spring of 1940, Edwin McMillan and Philip Abelson synthesized element 93, neptunium, and confirmed its uniqueness by chemical means. On August 30, 1940, Glenn Seaborg, Arthur Wahl, and the late Joseph Kennedy began their neutron irradiations of uranium nitrate hexahydrate. A few months later they synthesized element 94, later named plutonium, by observing the alpha particles emitted from uranium oxide targets that had been bombarded with deuterons. Shortly thereafter they proved that is was the second transuranium element by establishing its unique oxidation-reduction behavior. The symposium honored the scientists and engineers whose vision and dedication led to the discovery of the transuranium elements and to the understanding of the influence of 5f electrons on their electronic structure and bonding. This volume represents a record of papers presented at the symposium

  7. PlantCARE, a plant cis-acting regulatory element database

    OpenAIRE

    Rombauts, Stephane; Déhais, Patrice; Van Montagu, Marc; Rouzé, Pierre

    1999-01-01

    PlantCARE is a database of plant cis- acting regulatory elements, enhancers and repressors. Besides the transcription motifs found on a sequence, it also offers a link to the EMBL entry that contains the full gene sequence as well as a description of the conditions in which a motif becomes functional. The information on these sites is given by matrices, consensus and individual site sequences on particular genes, depending on the available information. PlantCARE is a relational database avail...

  8. SAMMate: a GUI tool for processing short read alignments in SAM/BAM format

    Directory of Open Access Journals (Sweden)

    Flemington Erik

    2011-01-01

    Full Text Available Abstract Background Next Generation Sequencing (NGS technology generates tens of millions of short reads for each DNA/RNA sample. A key step in NGS data analysis is the short read alignment of the generated sequences to a reference genome. Although storing alignment information in the Sequence Alignment/Map (SAM or Binary SAM (BAM format is now standard, biomedical researchers still have difficulty accessing this information. Results We have developed a Graphical User Interface (GUI software tool named SAMMate. SAMMate allows biomedical researchers to quickly process SAM/BAM files and is compatible with both single-end and paired-end sequencing technologies. SAMMate also automates some standard procedures in DNA-seq and RNA-seq data analysis. Using either standard or customized annotation files, SAMMate allows users to accurately calculate the short read coverage of genomic intervals. In particular, for RNA-seq data SAMMate can accurately calculate the gene expression abundance scores for customized genomic intervals using short reads originating from both exons and exon-exon junctions. Furthermore, SAMMate can quickly calculate a whole-genome signal map at base-wise resolution allowing researchers to solve an array of bioinformatics problems. Finally, SAMMate can export both a wiggle file for alignment visualization in the UCSC genome browser and an alignment statistics report. The biological impact of these features is demonstrated via several case studies that predict miRNA targets using short read alignment information files. Conclusions With just a few mouse clicks, SAMMate will provide biomedical researchers easy access to important alignment information stored in SAM/BAM files. Our software is constantly updated and will greatly facilitate the downstream analysis of NGS data. Both the source code and the GUI executable are freely available under the GNU General Public License at http://sammate.sourceforge.net.

  9. Pigmentation and temporal effects on trace elements in hair

    International Nuclear Information System (INIS)

    Aufreiter, S.; Hancock, R.G.V.

    1990-01-01

    Variations in trace element concentration in the head and facial hair of five individuals with ever-increasing amounts of white hair were examined. Hair was collected from the scalp, cheeks, and chin from one donor on a regular basis since 1984. Samples were separated into white and pigmented fractions, and analyzed at SLOWPOKE-Toronto by INAA for the short-lived, isotope-producing elements Br, Ca, Cl, Cu, I, Mg, Mn, Na, S, and Zn. Temporal concentration variations of these elements over time, and variations of the elemental concentrations in pigmented and white hair were established

  10. Abundance, distribution and potential impact of transposable elements in the genome of Mycosphaerella fijiensis.

    Science.gov (United States)

    Santana, Mateus F; Silva, José C F; Batista, Aline D; Ribeiro, Lílian E; da Silva, Gilvan F; de Araújo, Elza F; de Queiroz, Marisa V

    2012-12-22

    Mycosphaerella fijiensis is a ascomycete that causes Black Sigatoka in bananas. Recently, the M. fijiensis genome was sequenced. Repetitive sequences are ubiquitous components of fungal genomes. In most genomic analyses, repetitive sequences are associated with transposable elements (TEs). TEs are dispersed repetitive DNA sequences found in a host genome. These elements have the ability to move from one location to another within the genome, and their insertion can cause a wide spectrum of mutations in their hosts. Some of the deleterious effects of TEs may be due to ectopic recombination among TEs of the same family. In addition, some transposons are physically linked to genes and can control their expression. To prevent possible damage caused by the presence of TEs in the genome, some fungi possess TE-silencing mechanisms, such as RIP (Repeat Induced Point mutation). In this study, the abundance, distribution and potential impact of TEs in the genome of M. fijiensis were investigated. A total of 613 LTR-Gypsy and 27 LTR-Copia complete elements of the class I were detected. Among the class II elements, a total of 28 Mariner, five Mutator and one Harbinger complete elements were identified. The results of this study indicate that transposons were and are important ectopic recombination sites. A distribution analysis of a transposable element from each class of the M. fijiensis isolates revealed variable hybridization profiles, indicating the activity of these elements. Several genes encoding proteins involved in important metabolic pathways and with potential correlation to pathogenicity systems were identified upstream and downstream of transposable elements. A comparison of the sequences from different transposon groups suggested the action of the RIP silencing mechanism in the genome of this microorganism. The analysis of TEs in M. fijiensis suggests that TEs play an important role in the evolution of this organism because the activity of these elements, as well

  11. Abundance, distribution and potential impact of transposable elements in the genome of Mycosphaerella fijiensis

    Directory of Open Access Journals (Sweden)

    Santana Mateus F

    2012-12-01

    Full Text Available Abstract Background Mycosphaerella fijiensis is a ascomycete that causes Black Sigatoka in bananas. Recently, the M. fijiensis genome was sequenced. Repetitive sequences are ubiquitous components of fungal genomes. In most genomic analyses, repetitive sequences are associated with transposable elements (TEs. TEs are dispersed repetitive DNA sequences found in a host genome. These elements have the ability to move from one location to another within the genome, and their insertion can cause a wide spectrum of mutations in their hosts. Some of the deleterious effects of TEs may be due to ectopic recombination among TEs of the same family. In addition, some transposons are physically linked to genes and can control their expression. To prevent possible damage caused by the presence of TEs in the genome, some fungi possess TE-silencing mechanisms, such as RIP (Repeat Induced Point mutation. In this study, the abundance, distribution and potential impact of TEs in the genome of M. fijiensis were investigated. Results A total of 613 LTR-Gypsy and 27 LTR-Copia complete elements of the class I were detected. Among the class II elements, a total of 28 Mariner, five Mutator and one Harbinger complete elements were identified. The results of this study indicate that transposons were and are important ectopic recombination sites. A distribution analysis of a transposable element from each class of the M. fijiensis isolates revealed variable hybridization profiles, indicating the activity of these elements. Several genes encoding proteins involved in important metabolic pathways and with potential correlation to pathogenicity systems were identified upstream and downstream of transposable elements. A comparison of the sequences from different transposon groups suggested the action of the RIP silencing mechanism in the genome of this microorganism. Conclusions The analysis of TEs in M. fijiensis suggests that TEs play an important role in the evolution of

  12. The ant genomes have been invaded by several types of mariner transposable elements

    Science.gov (United States)

    Lorite, Pedro; Maside, Xulio; Sanllorente, Olivia; Torres, María I.; Periquet, Georges; Palomeque, Teresa

    2012-12-01

    To date, only three types of full-length mariner elements have been described in ants, each one in a different genus of the Myrmicinae subfamily: Sinvmar was isolated from various Solenopsis species, Myrmar from Myrmica ruginodis, and Mboumar from Messor bouvieri. In this study, we report the coexistence of three mariner elements ( Tnigmar- Si, Tnigmar- Mr, and Tnigmar- Mb) in the genome of a single species, Tapinoma nigerrimum (subfamily Dolichoderinae). Molecular evolutionary analyses of the nucleotide sequence data revealed a general agreement between the evolutionary history of most the elements and the ant species that harbour them, and suggest that they are at the vertical inactivation stage of the so-called Mariner Life Cycle. In contrast, significantly reduced levels of synonymous divergence between Mboumar and Tnigmar- Mb and between Myrmar and Botmar (a mariner element isolated from Bombus terrestris), relative to those observed between their hosts, suggest that these elements arrived to the species that host them by horizontal transfer, long after the species' split. The horizontal transfer events for the two pairs of elements could be roughly dated within the last 2 million years and about 14 million years, respectively. As would be expected under this scenario, the coding sequences of the youngest elements, Tnigmar- Mb and Mboumar, are intact and, thus, potentially functional. Each mariner element has a different chromosomal distribution pattern according to their stage within the Mariner Life Cycle. Finally, a new defective transposable element ( Azteca) has also been found inserted into the Tnigmar- Mr sequences showing that the ant genomes have been invaded by at least four different types of mariner elements.

  13. Automated rapid chemistry in heavy element research

    International Nuclear Information System (INIS)

    Schaedel, M.

    1994-01-01

    With the increasingly short half-lives of the heavy element isotopes in the transition region from the heaviest actinides to the transactinide elements the demand for automated rapid chemistry techniques is also increasing. Separation times of significantly less than one minute, high chemical yields, high repetition rates, and an adequate detection system are prerequisites for many successful experiments in this field. The development of techniques for separations in the gas phase and in the aqueous phase for applications of chemical or nuclear studies of the heaviest elements are briefly outlined. Typical examples of results obtained with automated techniques are presented for studies up to element 105, especially those obtained with the Automated Rapid Chemistry Apparatus, ARCA. The prospects to investigate the properties of even heavier elements with chemical techniques are discussed

  14. Whole genome complete resequencing of Bacillus subtilis natto by combining long reads with high-quality short reads.

    Directory of Open Access Journals (Sweden)

    Mayumi Kamada

    Full Text Available De novo microbial genome sequencing reached a turning point with third-generation sequencing (TGS platforms, and several microbial genomes have been improved by TGS long reads. Bacillus subtilis natto is closely related to the laboratory standard strain B. subtilis Marburg 168, and it has a function in the production of the traditional Japanese fermented food "natto." The B. subtilis natto BEST195 genome was previously sequenced with short reads, but it included some incomplete regions. We resequenced the BEST195 genome using a PacBio RS sequencer, and we successfully obtained a complete genome sequence from one scaffold without any gaps, and we also applied Illumina MiSeq short reads to enhance quality. Compared with the previous BEST195 draft genome and Marburg 168 genome, we found that incomplete regions in the previous genome sequence were attributed to GC-bias and repetitive sequences, and we also identified some novel genes that are found only in the new genome.

  15. Property - preserving convergent sequences of invariant sets for linear discrete - time systems

    NARCIS (Netherlands)

    Athanasopoulos, N.; Lazar, M.; Bitsoris, G.

    2014-01-01

    Abstract: New sequences of monotonically increasing sets are introduced, for linear discrete-time systems subject to input and state constraints. The elements of the set sequences are controlled invariant and admissible regions of stabilizability. They are generated from the iterative application of

  16. Isolation and molecular characterization of dTnp1, a mobile and defective transposable element of Nicotiana plumbaginifolia.

    Science.gov (United States)

    Meyer, C; Pouteau, S; Rouzé, P; Caboche, M

    1994-01-01

    By Northern blot analysis of nitrate reductase-deficient mutants of Nicotiana plumbaginifolia, we identified a mutant (mutant D65), obtained after gamma-ray irradiation of protoplasts, which contained an insertion sequence in the nitrate reductase (NR) mRNA. This insertion sequence was localized by polymerase chain reaction (PCR) in the first exon of NR and was also shown to be present in the NR gene. The mutant gene contained a 565 bp insertion sequence that exhibits the sequence characteristics of a transposable element, which was thus named dTnp1. The dTnp1 element has 14 bp terminal inverted repeats and is flanked by an 8-bp target site duplication generated upon transposition. These inverted repeats have significant sequence homology with those of other transposable elements. Judging by its size and the absence of a long open reading frame, dTnp1 appears to represent a defective, although mobile, transposable element. The octamer motif TTTAGGCC was found several times in direct orientation near the 5' and 3' ends of dTnp1 together with a perfect palindrome located after the 5' inverted repeat. Southern blot analysis using an internal probe of dTnp1 suggested that this element occurs as a single copy in the genome of N. plumbaginifolia. It is also present in N. tabacum, but absent in tomato or petunia. The dTnp1 element is therefore of potential use for gene tagging in Nicotiana species.

  17. DNA Nucleotide Sequence Restricted by the RI Endonuclease

    Science.gov (United States)

    Hedgpeth, Joe; Goodman, Howard M.; Boyer, Herbert W.

    1972-01-01

    The sequence of DNA base pairs adjacent to the phosphodiester bonds cleaved by the RI restriction endonuclease in unmodified DNA from coliphage λ has been determined. The 5′-terminal nucleotide labeled with 32P and oligonucleotides up to the heptamer were analyzed from a pancreatic DNase digest. The following sequence of nucleotides adjacent to the RI break made in λ DNA was deduced from these data and from the 3′-dinucleotide sequence and nearest-neighbor analysis obtained from repair synthesis with the DNA polymerase of Rous sarcoma virus [Formula: see text] The RI endonuclease cleavage of the phosphodiester bonds (indicated by arrows) generates 5′-phosphoryls and short cohesive termini of four nucleotides, pApApTpT. The most striking feature of the sequence is its symmetry. PMID:4343974

  18. Retrieval-Induced Inhibition in Short-Term Memory.

    Science.gov (United States)

    Kang, Min-Suk; Choi, Joongrul

    2015-07-01

    We used a visual illusion called motion repulsion as a model system for investigating competition between two mental representations. Subjects were asked to remember two random-dot-motion displays presented in sequence and then to report the motion directions for each. Remembered motion directions were shifted away from the actual motion directions, an effect similar to the motion repulsion observed during perception. More important, the item retrieved second showed greater repulsion than the item retrieved first. This suggests that earlier retrieval exerted greater inhibition on the other item being held in short-term memory. This retrieval-induced motion repulsion could be explained neither by reduced cognitive resources for maintaining short-term memory nor by continued inhibition between short-term memory representations. These results indicate that retrieval of memory representations inhibits other representations in short-term memory. We discuss mechanisms of retrieval-induced inhibition and their implications for the structure of memory. © The Author(s) 2015.

  19. Identification of antimicrobial resistance genes in multidrug-resistant clinical Bacteroides fragilis isolates by whole genome shotgun sequencing

    DEFF Research Database (Denmark)

    Sydenham, Thomas Vognbjerg; Sóki, József; Hasman, Henrik

    2015-01-01

    Bacteroides fragilis constitutes the most frequent anaerobic bacterium causing bacteremia in humans. The genetic background for antimicrobial resistance in B. fragilis is diverse with some genes requiring insertion sequence (IS) elements inserted upstream for increased expression. To evaluate whole...... genome shotgun sequencing as a method for predicting antimicrobial resistance properties, one meropenem resistant and five multidrug-resistant blood culture isolates were sequenced and antimicrobial resistance genes and IS elements identified using ResFinder 2.1 (http...

  20. The Salmon Smai Family of Short Interspersed Repetitive Elements (Sines): Interspecific and Intraspecific Variation of the Insertion of Sines in the Genomes of Chum and Pink Salmon

    OpenAIRE

    Takasaki, N.; Yamaki, T.; Hamada, M.; Park, L.; Okada, N.

    1997-01-01

    The genomes of chum salmon and pink salmon contain a family of short interspersed repetitive elements (SINEs), designated the salmon SmaI family. It is restricted to these two species, a distribution that suggests that this SINE family might have been generated in their common ancestor. When insertions of the SmaI SINEs at 10 orthologous loci of these species were analyzed, however, it was found that there were no shared insertion sites between chum and pink salmon. Furthermore, at six loci w...

  1. Neurotoxic Doses of Chronic Methamphetamine  Trigger Retrotransposition of the Identifier Element  in Rat Dorsal Dentate Gyrus

    Directory of Open Access Journals (Sweden)

    Anna Moszczynska

    2017-03-01

    Full Text Available Short interspersed elements (SINEs are typically silenced by DNA hypermethylation in somatic cells, but can retrotranspose in proliferating cells during adult neurogenesis. Hypomethylation caused by disease pathology or genotoxic stress leads to genomic instability of SINEs. The goal of the present investigation was to determine whether neurotoxic doses of binge or chronic methamphetamine (METH trigger retrotransposition of the identifier (ID element, a member of the rat SINE family, in the dentate gyrus genomic DNA. Adult male Sprague‐Dawley rats were treated with saline or high doses of binge or chronic METH and sacrificed at three different time points thereafter. DNA methylation analysis, immunohistochemistry and next‐generation sequencing (NGS were performed on the dorsal dentate gyrus samples. Binge METH triggered hypomethylation, while chronic METH triggered hypermethylation of the CpG‐2 site. Both METH regimens were associated with increased intensities in poly(A‐binding protein 1 (PABP1, a SINE regulatory protein‐like immunohistochemical staining in the dentate gyrus. The amplification of several ID element sequences was significantly higher in the chronic METH group than in the control group a week after METH, and they mapped to genes coding for proteins regulating cell growth and proliferation, transcription, protein function as well as for a variety of transporters. The results suggest that chronic METH induces ID element retrotransposition in the dorsal dentate gyrus and may affect hippocampal neurogenesis.

  2. Investigation of faulted tunnel models by combined photoelasticity and finite element analysis

    International Nuclear Information System (INIS)

    Ladkany, S.G.; Huang, Yuping

    1994-01-01

    Models of square and circular tunnels with short faults cutting through their surfaces are investigated by photoelasticity. These models, when duplicated by finite element analysis can predict the stress states of square or circular faulted tunnels adequately. Finite element analysis, using gap elements, may be used to investigate full size faulted tunnel system

  3. Management of High-Throughput DNA Sequencing Projects: Alpheus.

    Science.gov (United States)

    Miller, Neil A; Kingsmore, Stephen F; Farmer, Andrew; Langley, Raymond J; Mudge, Joann; Crow, John A; Gonzalez, Alvaro J; Schilkey, Faye D; Kim, Ryan J; van Velkinburgh, Jennifer; May, Gregory D; Black, C Forrest; Myers, M Kathy; Utsey, John P; Frost, Nicholas S; Sugarbaker, David J; Bueno, Raphael; Gullans, Stephen R; Baxter, Susan M; Day, Steve W; Retzel, Ernest F

    2008-12-26

    High-throughput DNA sequencing has enabled systems biology to begin to address areas in health, agricultural and basic biological research. Concomitant with the opportunities is an absolute necessity to manage significant volumes of high-dimensional and inter-related data and analysis. Alpheus is an analysis pipeline, database and visualization software for use with massively parallel DNA sequencing technologies that feature multi-gigabase throughput characterized by relatively short reads, such as Illumina-Solexa (sequencing-by-synthesis), Roche-454 (pyrosequencing) and Applied Biosystem's SOLiD (sequencing-by-ligation). Alpheus enables alignment to reference sequence(s), detection of variants and enumeration of sequence abundance, including expression levels in transcriptome sequence. Alpheus is able to detect several types of variants, including non-synonymous and synonymous single nucleotide polymorphisms (SNPs), insertions/deletions (indels), premature stop codons, and splice isoforms. Variant detection is aided by the ability to filter variant calls based on consistency, expected allele frequency, sequence quality, coverage, and variant type in order to minimize false positives while maximizing the identification of true positives. Alpheus also enables comparisons of genes with variants between cases and controls or bulk segregant pools. Sequence-based differential expression comparisons can be developed, with data export to SAS JMP Genomics for statistical analysis.

  4. Characterization of human MMTV-like (HML) elements similar to a sequence that was highly expressed in a human breast cancer: further definition of the HML-6 group.

    Science.gov (United States)

    Yin, H; Medstrand, P; Kristofferson, A; Dietrich, U; Aman, P; Blomberg, J

    1999-03-30

    Previously, we found a retroviral sequence, HML-6.2BC1, to be expressed at high levels in a multifocal ductal breast cancer from a 41-year-old woman who also developed ovarian carcinoma. The sequence of a human genomic clone (HML-6.28) selected by high-stringency hybridization with HML-6.2BC1 is reported here. It was 99% identical to HML-6.2BC1 and gave the same restriction fragments as total DNA. HML-6.28 is a 4.7-kb provirus with a 5'LTR, truncated in RT. Data from two similar genomic clones and sequences found in GenBank are also reported. Overlaps between them gave a rather complete picture of the HML-6.2BC1-like human endogenous retroviral elements. Work with somatic cell hybrids and FISH localized HML-6.28 to chromosome 6, band p21, close to the MHC region. The causal role of HML-6.28 in breast cancer remains unclear. Nevertheless, the ca. 20 Myr old HML-6 sequences enabled the definition of common and unique features of type A, B, and D (ABD) retroviruses. In Gag, HML-6 has no intervening sequences between matrix and capsid proteins, unlike extant exogenous ABD viruses, possibly an ancestral feature. Alignment of the dUTPase showed it to be present in all ABD viruses, but gave a phylogenetic tree different from trees made from other ABD genes, indicating a distinct phylogeny of dUTPase. A conserved 24-mer sequence in the amino terminus of some ABD envelope genes suggested a conserved function. Copyright 1999 Academic Press.

  5. Impact of Negative Sequence Current Injection by Wind Power Plants

    DEFF Research Database (Denmark)

    Chaudhary, Sanjay; Göksu, Ömer; Teodorescu, Remus

    2013-01-01

    This paper presents an analysis of the impact from negative sequence current injection by wind power plants in power systems under steady-state and short-term unbalanced conditions, including faults. The separate positive and negative sequence current control capability of the grid-side converters...... of full scale converter type wind turbines may be utilized to alter voltage imbalance at the point of connection and further into the grid, in turn changing the resultant negative sequence current flow in the grid. The effects of such control actions have been analyzed and discussed through theoretical...

  6. Sequence dependent aggregation of peptides and fibril formation

    Science.gov (United States)

    Hung, Nguyen Ba; Le, Duy-Manh; Hoang, Trinh X.

    2017-09-01

    Deciphering the links between amino acid sequence and amyloid fibril formation is key for understanding protein misfolding diseases. Here we use Monte Carlo simulations to study the aggregation of short peptides in a coarse-grained model with hydrophobic-polar (HP) amino acid sequences and correlated side chain orientations for hydrophobic contacts. A significant heterogeneity is observed in the aggregate structures and in the thermodynamics of aggregation for systems of different HP sequences and different numbers of peptides. Fibril-like ordered aggregates are found for several sequences that contain the common HPH pattern, while other sequences may form helix bundles or disordered aggregates. A wide variation of the aggregation transition temperatures among sequences, even among those of the same hydrophobic fraction, indicates that not all sequences undergo aggregation at a presumable physiological temperature. The transition is found to be the most cooperative for sequences forming fibril-like structures. For a fibril-prone sequence, it is shown that fibril formation follows the nucleation and growth mechanism. Interestingly, a binary mixture of peptides of an aggregation-prone and a non-aggregation-prone sequence shows the association and conversion of the latter to the fibrillar structure. Our study highlights the role of a sequence in selecting fibril-like aggregates and also the impact of a structural template on fibril formation by peptides of unrelated sequences.

  7. Developing expressed sequence tag libraries and the discovery of simple sequence repeat markers for two species of raspberry (Rubus L.)

    Science.gov (United States)

    Background: Due to a relatively high level of codominant inheritance and transferability within and among taxonomic groups, simple sequence repeat (SSR) markers are important elements in comparative mapping and delineation of genomic regions associated with traits of economic importance. Expressed S...

  8. Short-Range Temporal Interactions in Sleep; Hippocampal Spike Avalanches Support a Large Milieu of Sequential Activity Including Replay.

    Directory of Open Access Journals (Sweden)

    J Matthew Mahoney

    Full Text Available Hippocampal neural systems consolidate multiple complex behaviors into memory. However, the temporal structure of neural firing supporting complex memory consolidation is unknown. Replay of hippocampal place cells during sleep supports the view that a simple repetitive behavior modifies sleep firing dynamics, but does not explain how multiple episodes could be integrated into associative networks for recollection during future cognition. Here we decode sequential firing structure within spike avalanches of all pyramidal cells recorded in sleeping rats after running in a circular track. We find that short sequences that combine into multiple long sequences capture the majority of the sequential structure during sleep, including replay of hippocampal place cells. The ensemble, however, is not optimized for maximally producing the behavior-enriched episode. Thus behavioral programming of sequential correlations occurs at the level of short-range interactions, not whole behavioral sequences and these short sequences are assembled into a large and complex milieu that could support complex memory consolidation.

  9. B chromosome in the beetle Coprophanaeus cyanescens (Scarabaeidae: emphasis in the organization of repetitive DNA sequences

    Directory of Open Access Journals (Sweden)

    Gomes de Oliveira Sarah

    2012-11-01

    Full Text Available Abstract Background To contribute to the knowledge of coleopteran cytogenetics, especially with respect to the genomic content of B chromosomes, we analyzed the composition and organization of repetitive DNA sequences in the Coprophanaeus cyanescens karyotype. We used conventional staining and the application of fluorescence in situ hybridization (FISH mapping using as probes C0t-1 DNA fraction, the 18S and 5S rRNA genes, and the LOA-like non-LTR transposable element (TE. Results The conventional analysis detected 3 individuals (among 50 analyzed carrying one small metacentric and mitotically unstable B chromosome. The FISH analysis revealed a pericentromeric block of C0t-1 DNA in the B chromosome but no 18S or 5S rDNA clusters in this extra element. Using the LOA-like TE probe, the FISH analysis revealed large pericentromeric blocks in eight autosomal bivalents and in the B chromosome, and a pericentromeric block extending to the short arm in one autosomal pair. No positive hybridization signal was observed for the LOA-like element in the sex chromosomes. Conclusions The results indicate that the origin of the B chromosome is associated with the autosomal elements, as demonstrated by the hybridization with C0t-1 DNA and the LOA-like TE. The present study is the first report on the cytogenetic mapping of a TE in coleopteran chromosomes. These TEs could have been involved in the origin and evolution of the B chromosome in C. cyanescens.

  10. Low-pass sequencing for microbial comparative genomics

    Directory of Open Access Journals (Sweden)

    Kennedy Sean

    2004-01-01

    Full Text Available Abstract Background We studied four extremely halophilic archaea by low-pass shotgun sequencing: (1 the metabolically versatile Haloarcula marismortui; (2 the non-pigmented Natrialba asiatica; (3 the psychrophile Halorubrum lacusprofundi and (4 the Dead Sea isolate Halobaculum gomorrense. Approximately one thousand single pass genomic sequences per genome were obtained. The data were analyzed by comparative genomic analyses using the completed Halobacterium sp. NRC-1 genome as a reference. Low-pass shotgun sequencing is a simple, inexpensive, and rapid approach that can readily be performed on any cultured microbe. Results As expected, the four archaeal halophiles analyzed exhibit both bacterial and eukaryotic characteristics as well as uniquely archaeal traits. All five halophiles exhibit greater than sixty percent GC content and low isoelectric points (pI for their predicted proteins. Multiple insertion sequence (IS elements, often involved in genome rearrangements, were identified in H. lacusprofundi and H. marismortui. The core biological functions that govern cellular and genetic mechanisms of H. sp. NRC-1 appear to be conserved in these four other halophiles. Multiple TATA box binding protein (TBP and transcription factor IIB (TFB homologs were identified from most of the four shotgunned halophiles. The reconstructed molecular tree of all five halophiles shows a large divergence between these species, but with the closest relationship being between H. sp. NRC-1 and H. lacusprofundi. Conclusion Despite the diverse habitats of these species, all five halophiles share (1 high GC content and (2 low protein isoelectric points, which are characteristics associated with environmental exposure to UV radiation and hypersalinity, respectively. Identification of multiple IS elements in the genome of H. lacusprofundi and H. marismortui suggest that genome structure and dynamic genome reorganization might be similar to that previously observed in the

  11. Mean Orbital Elements for Geosynchronous Orbit - II - Orbital inclination, longitude of ascending node, mean longitude

    Directory of Open Access Journals (Sweden)

    Kyu-Hong Choi

    1990-06-01

    Full Text Available The osculating orbital elements include the mean, secular, long period, and short period terms. The iterative algorithm used for conversion of osculating orbital elements to mean orbital elements is described. The mean orbital elements of Wc, Ws, and L are obtained.

  12. Cloning and sequencing of phenol oxidase 1 (pox1) gene from ...

    African Journals Online (AJOL)

    The gene (pox1) encoding a phenol oxidase 1 from Pleurotus ostreatus was sequenced and the corresponding pox1-cDNA was also synthesized, cloned and sequenced. The isolated gene is flanked by an upstream region called the promoter (399 bp) prior to the start codon (ATG). The putative metalresponsive elements ...

  13. Quantitative statistical analysis of cis-regulatory sequences in ABA/VP1- and CBF/DREB1-regulated genes of Arabidopsis.

    Science.gov (United States)

    Suzuki, Masaharu; Ketterling, Matthew G; McCarty, Donald R

    2005-09-01

    We have developed a simple quantitative computational approach for objective analysis of cis-regulatory sequences in promoters of coregulated genes. The program, designated MotifFinder, identifies oligo sequences that are overrepresented in promoters of coregulated genes. We used this approach to analyze promoter sequences of Viviparous1 (VP1)/abscisic acid (ABA)-regulated genes and cold-regulated genes, respectively, of Arabidopsis (Arabidopsis thaliana). We detected significantly enriched sequences in up-regulated genes but not in down-regulated genes. This result suggests that gene activation but not repression is mediated by specific and common sequence elements in promoters. The enriched motifs include several known cis-regulatory sequences as well as previously unidentified motifs. With respect to known cis-elements, we dissected the flanking nucleotides of the core sequences of Sph element, ABA response elements (ABREs), and the C repeat/dehydration-responsive element. This analysis identified the motif variants that may correlate with qualitative and quantitative differences in gene expression. While both VP1 and cold responses are mediated in part by ABA signaling via ABREs, these responses correlate with unique ABRE variants distinguished by nucleotides flanking the ACGT core. ABRE and Sph motifs are tightly associated uniquely in the coregulated set of genes showing a strict dependence on VP1 and ABA signaling. Finally, analysis of distribution of the enriched sequences revealed a striking concentration of enriched motifs in a proximal 200-base region of VP1/ABA and cold-regulated promoters. Overall, each class of coregulated genes possesses a discrete set of the enriched motifs with unique distributions in their promoters that may account for the specificity of gene regulation.

  14. Identification of failure sequences sensitive to human error

    International Nuclear Information System (INIS)

    1987-06-01

    This report prepared by the participants of the technical committee meeting on ''Identification of Failure Sequences Sensitive to Human Error'' addresses the subjects discussed during the meeting and the conclusions reached by the committee. Chapter 1 reviews the INSAG recommendations and the main elements of the IAEA Programme in the area of human element. In Chapter 2 the role of human actions in nuclear power plants safety from insights of operational experience is reviewed. Chapter 3 is concerned with the relationship between probabilistic safety assessment and human performance associated with severe accident sequences. Chapter 4 addresses the role of simulators in view of training for accident conditions. Chapter 5 presents the conclusions and future trends. The seven papers presented by members of this technical committee are also included in this technical document. A separate abstract was prepared for each of these papers

  15. Stability of short wavelength tearing and twisting modes

    International Nuclear Information System (INIS)

    Waelbroeck, F.L.

    1998-01-01

    The stability and mutual interaction of tearing and twisting modes in a torus is governed by matrices that generalize the well-known Δ' stability index. The diagonal elements of these matrices determine the intrinsic stability of modes that reconnect the magnetic field at a single resonant surface. The off-diagonal elements indicate the strength of the coupling between the different modes. The author shows how the elements of these matrices can be evaluated, in the limit of short wavelength, from the free energy driving radially extended ballooning modes. The author applies the results by calculating the tearing and twisting Δ' for a model high-beta equilibrium with circular flux surfaces

  16. Dynamic Epigenetic Control of Highly Conserved Noncoding Elements

    KAUST Repository

    Seridi, Loqmane

    2014-10-07

    Background Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown. Results To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains. Conclusion HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.

  17. Dynamic Epigenetic Control of Highly Conserved Noncoding Elements

    KAUST Repository

    Seridi, Loqmane; Ryu, Tae Woo; Ravasi, Timothy

    2014-01-01

    Background Many noncoding genomic loci have remained constant over long evolutionary periods, suggesting that they are exposed to strong selective pressures. The molecular functions of these elements have been partially elucidated, but the fundamental reason for their extreme conservation is still unknown. Results To gain new insights into the extreme selection of highly conserved noncoding elements (HCNEs), we used a systematic analysis of multi-omic data to study the epigenetic regulation of such elements during the development of Drosophila melanogaster. At the sequence level, HCNEs are GC-rich and have a characteristic oligomeric composition. They have higher levels of stable nucleosome occupancy than their flanking regions, and lower levels of mononucleosomes and H3.3, suggesting that these regions reside in compact chromatin. Furthermore, these regions showed remarkable modulations in histone modification and the expression levels of adjacent genes during development. Although HCNEs are primarily initiated late in replication, about 10% were related to early replication origins. Finally, HCNEs showed strong enrichment within lamina-associated domains. Conclusion HCNEs have distinct and protective sequence properties, undergo dynamic epigenetic regulation, and appear to be associated with the structural components of the chromatin, replication origins, and nuclear matrix. These observations indicate that such elements are likely to have essential cellular functions, and offer insights into their epigenetic properties.

  18. Nonlinear correlations in the hydrophobicity and average flexibility along the glycolytic enzymes sequences

    Energy Technology Data Exchange (ETDEWEB)

    Ciorsac, Alecu, E-mail: aleciorsac@yahoo.co [Politehnica University of Timisoara, Department of Physical Education and Sport, 2 P-ta Victoriei, 300006, Timisoara (Romania); Craciun, Dana, E-mail: craciundana@gmail.co [Teacher Training Department, West University of Timisoara, 4 Boulevard V. Pirvan, Timisoara, 300223 (Romania); Ostafe, Vasile, E-mail: vostafe@cbg.uvt.r [Department of Chemistry, West University of Timisoara, 16 Pestallozi, 300115, Timisoara (Romania); Laboratory of Advanced Researches in Environmental Protection, Nicholas Georgescu-Roegen Interdisciplinary Research and Formation Platform, 4 Oituz, Timisoara, 300086 (Romania); Isvoran, Adriana, E-mail: aisvoran@cbg.uvt.r [Department of Chemistry, West University of Timisoara, 16 Pestallozi, 300115, Timisoara (Romania); Laboratory of Advanced Researches in Environmental Protection, Nicholas Georgescu-Roegen Interdisciplinary Research and Formation Platform, 4 Oituz, Timisoara, 300086 (Romania)

    2011-04-15

    Research highlights: lights: We focus our study on the glycolytic enzymes. We reveal correlation of hydrophobicity and flexibility along their chains. We also reveal fractal aspects of the glycolytic enzymes structures and surfaces. The glycolytic enzyme sequences are not random. Creation of fractal structures requires the operation of nonlinear dynamics. - Abstract: Nonlinear methods widely used for time series analysis were applied to glycolytic enzyme sequences to derive information concerning the correlation of hydrophobicity and average flexibility along their chains. The 20 sequences of different types of the 10 human glycolytic enzymes were considered as spatial series and were analyzed by spectral analysis, detrended fluctuations analysis and Hurst coefficient calculation. The results agreed that there are both short range and long range correlations of hydrophobicity and average flexibility within investigated sequences, the short range correlations being stronger and indicating that local interactions are the most important for the protein folding. This correlation is also reflected by the fractal nature of the structures of investigated proteins.

  19. Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis

    DEFF Research Database (Denmark)

    Carlton, Jane M.; Hirt, Robert P.; Silva, Joana C.

    2007-01-01

    We describe the genome sequence of the protist Trichomonas vaginalis, a sexually transmitted human pathogen. Repeats and transposable elements comprise about two-thirds of the approximately 160-megabase genome, reflecting a recent massive expansion of genetic material. This expansion...... environment. The genome sequence predicts previously unknown functions for the hydrogenosome, which support a common evolutionary origin of this unusual organelle with mitochondria....

  20. Efficacy of Pulsed-Field Gel Electrophoresis and Repetitive Element Sequence-Based PCR in Typing of Salmonella Isolates from Assam, India.

    Science.gov (United States)

    Gogoi, Purnima; Borah, Probodh; Hussain, Iftikar; Das, Leena; Hazarika, Girin; Tamuly, Shantanu; Barkalita, Luit Moni

    2018-05-01

    A total of 12 Salmonella isolates belonging to different serovars, viz , Salmonella enterica serovar Enteritidis ( n = 4), Salmonella enterica serovar Weltevreden ( n = 4), Salmonella enterica serovar Newport ( n = 1), Salmonella enterica serovar Litchifield ( n = 1), and untypeable strains ( n = 2) were isolated from 332 diarrheic fecal samples collected from animals, birds, and humans. Of the two molecular typing methods applied, viz , repetitive element sequence-based PCR (REP-PCR) and pulsed-field gel electrophoresis (PFGE), PFGE could clearly differentiate the strains belonging to different serovars as well as differentiate between strains of the same serovar with respect to their source of isolation, whereas REP-PCR could not differentiate between strains of the same serovar. Thus, it can be suggested that PFGE is more useful and appropriate for molecular typing of Salmonella isolates during epidemiological investigations than REP-PCR. Copyright © 2018 American Society for Microbiology.

  1. Fat suppression in MR imaging with binomial pulse sequences

    International Nuclear Information System (INIS)

    Baudovin, C.J.; Bryant, D.J.; Bydder, G.M.; Young, I.R.

    1989-01-01

    This paper reports on a study to develop pulse sequences allowing suppression of fat signal on MR images without eliminating signal from other tissues with short T1. They have developed such a technique involving selective excitation of protons in water, based on a binomial pulse sequence. Imaging is performed at 0.15 T. Careful shimming is performed to maximize separation of fat and water peaks. A spin-echo 1,500/80 sequence is used, employing 90 degrees pulse with transit frequency optimized for water with null excitation of 20 H offset, followed by a section-selective 180 degrees pulse. With use of the binomial sequence for imagining, reduction in fat signal is seen on images of the pelvis and legs of volunteers. Patient studies show dramatic improvement in visualization of prostatic carcinoma compared with standard sequences

  2. Identification of an estrogen response element in the 3'-flanking region of the murine c-fos protooncogene.

    Science.gov (United States)

    Hyder, S M; Stancel, G M; Nawaz, Z; McDonnell, D P; Loose-Mitchell, D S

    1992-09-05

    We have used transient transfection assays with reporter plasmids expressing chloramphenicol acetyltransferase, linked to regions of mouse c-fos, to identify a specific estrogen response element (ERE) in this protooncogene. This element is located in the untranslated 3'-flanking region of the c-fos gene, 5 kilobases (kb) downstream from the c-fos promoter and 1.5 kb downstream of the poly(A) signal. This element confers estrogen responsiveness to chloramphenicol acetyltransferase reporters linked to both the herpes simplex virus thymidine kinase promoter and the homologous c-fos promoter. Deletion analysis localized the response element to a 200-base pair fragment which contains the element GGTCACCACAGCC that resembles the consensus ERE sequence GGTCACAGTGACC originally identified in Xenopus vitellogenin A2 gene. A synthetic 36-base pair oligodeoxynucleotide containing this c-fos sequence conferred estrogen inducibility to the thymidine kinase promoter. The corresponding sequence also induced reporter activity when present in the c-fos gene fragment 3 kb from the thymidine kinase promoter. Gel-shift experiments demonstrated that synthetic oligonucleotides containing either the consensus ERE or the c-fos element bind human estrogen receptor obtained from a yeast expression system. However, the mobility of the shifted band is faster for the fos-ERE-complex than the consensus ERE complex suggesting that the three-dimensional structure of the protein-DNA complexes is different or that other factors are differentially involved in the two reactions. When the 5'-GGTCA sequence present in the c-fos ERE is mutated to 5'-TTTCA, transcriptional activation and receptor binding activities are both lost. Mutation of the CAGCC-3' element corresponding to the second half-site of the c-fos sequence also led to the loss of receptor binding activity, suggesting that both half-sites of this element are involved in this function. The estrogen induction mediated by either the c-fos or

  3. A Type System for Required/Excluded Elements in CLS

    Directory of Open Access Journals (Sweden)

    Mariangiola Dezani-Ciancaglini

    2009-11-01

    Full Text Available The calculus of looping sequences is a formalism for describing the evolution of biological systems by means of term rewriting rules. We enrich this calculus with a type discipline to guarantee the soundness of reduction rules with respect to some biological properties deriving from the requirement of certain elements, and the repellency of others. As an example, we model a toy system where the repellency of a certain element is captured by our type system and forbids another element to exit a compartment.

  4. Complete Sequence of a F33:A-:B- Conjugative Plasmid Carrying the oqxAB, fosA3 and blaCTX-M-55 Elements from a Foodborne Escherichia coli Strain

    Directory of Open Access Journals (Sweden)

    Marcus Ho-yin Wong

    2016-10-01

    Full Text Available This study reports the complete sequence of pE80, a conjugative IncFII plasmid recovered from an E. coli strain isolated from chicken meat. This plasmid harbors multiple resistance determinants including oqxAB, fosA3, blaCTX-M-55 and blaTEM-1, and is a close variant of the recently reported p42-2 element, which was recovered from E. coli of veterinary source. Recovery of pE80 constitutes evidence that evolution or genetic re-arrangement of IncFII type plasmids residing in animal-borne organisms is an active event, which involves acquisition and integration of foreign resistance elements into the plasmid backbone. Dissemination of these plasmids may further compromise the effectiveness of current antimicrobial strategies.

  5. Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes.

    Science.gov (United States)

    Rius, Nuria; Guillén, Yolanda; Delprat, Alejandra; Kapusta, Aurélie; Feschotte, Cédric; Ruiz, Alfredo

    2016-05-10

    Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light into the understanding of genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genome. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7,5 % of D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the genome size and TE content positive correlation. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy.

  6. Clustered Regularly Interspaced Short Palindromic Repeat (CRISPR) RNAs in the Porphyromonas gingivalis CRISPR-Cas I-C System.

    Science.gov (United States)

    Burmistrz, Michal; Rodriguez Martinez, Jose Ignacio; Krochmal, Daniel; Staniec, Dominika; Pyrc, Krzysztof

    2017-12-01

    The CRISPR-Cas (clustered regularly interspaced short palindromic repeat-CRISPR-associated protein) system is unique to prokaryotes and provides the majority of bacteria and archaea with immunity against nucleic acids of foreign origin. CRISPR RNAs (crRNAs) are the key element of this system, since they are responsible for its selectivity and effectiveness. Typical crRNAs consist of a spacer sequence flanked with 5' and 3' handles originating from repeat sequences that are important for recognition of these small RNAs by the Cas machinery. In this investigation, we studied the type I-C CRISPR-Cas system in Porphyromonas gingivalis , a human pathogen associated with periodontitis, rheumatoid arthritis, cardiovascular disease, and aspiration pneumonia. We demonstrated the importance of the 5' handle for crRNA recognition by the effector complex and consequently activity, as well as secondary trimming of the 3' handle, which was not affected by modifications of the repeat sequence. IMPORTANCE Porphyromonas gingivalis , a clinically relevant Gram-negative, anaerobic bacterium, is one of the major etiologic agents of periodontitis and has been linked with the development of other clinical conditions, including rheumatoid arthritis, cardiovascular disease, and aspiration pneumonia. The presented results on the biogenesis and functions of crRNAs expand our understanding of CRISPR-Cas cellular defenses in P. gingivalis and of horizontal gene transfer in bacteria. Copyright © 2017 American Society for Microbiology.

  7. A systematic identification of Kolobok superfamily transposons in Trichomonas vaginalis and sequence analysis on related transposases

    Institute of Scientific and Technical Information of China (English)

    Qingshu Meng; Kaifu Chen; Lina Ma; Songnian Hu; Jun Yu

    2011-01-01

    Transposons are sequence elements widely distributed among genomes of all three kingdoms of life, providing genomic changes and playing significant roles in genome evolution. Trichomonas vaginalis is an excellent model system for transposon study since its genome ( ~ 160 Mb) has been sequenced and is composed of ~65% transposons and other repetitive elements. In this study, we primarily report the identification of Kolobok-type transposons (termed tvBac) in T. vaginalis and the results of transposase sequence analysis. We categorized 24 novel subfamilies of the Kolobok element, including one autonomous subfamily and 23 non-autonomous subfamilies. We also identified a novel H2CH motif in tvBac transposases based on multiple sequence alignment. In addition, we supposed that tvBac and Mutator transposons may have evolved independently from a common ancestor according to our phylogenetic analysis. Our results provide basic information for the understanding of the function and evolution of tvBac transposons in particular and other related transposon families in general.

  8. Applications of nanotechnology, next generation sequencing and microarrays in biomedical research.

    Science.gov (United States)

    Elingaramil, Sauli; Li, Xiaolong; He, Nongyue

    2013-07-01

    Next-generation sequencing technologies, microarrays and advances in bio nanotechnology have had an enormous impact on research within a short time frame. This impact appears certain to increase further as many biomedical institutions are now acquiring these prevailing new technologies. Beyond conventional sampling of genome content, wide-ranging applications are rapidly evolving for next-generation sequencing, microarrays and nanotechnology. To date, these technologies have been applied in a variety of contexts, including whole-genome sequencing, targeted re sequencing and discovery of transcription factor binding sites, noncoding RNA expression profiling and molecular diagnostics. This paper thus discusses current applications of nanotechnology, next-generation sequencing technologies and microarrays in biomedical research and highlights the transforming potential these technologies offer.

  9. Coupling of smooth particle hydrodynamics with the finite element method

    International Nuclear Information System (INIS)

    Attaway, S.W.; Heinstein, M.W.; Swegle, J.W.

    1994-01-01

    A gridless technique called smooth particle hydrodynamics (SPH) has been coupled with the transient dynamics finite element code ppercase[pronto]. In this paper, a new weighted residual derivation for the SPH method will be presented, and the methods used to embed SPH within ppercase[pronto] will be outlined. Example SPH ppercase[pronto] calculations will also be presented. One major difficulty associated with the Lagrangian finite element method is modeling materials with no shear strength; for example, gases, fluids and explosive biproducts. Typically, these materials can be modeled for only a short time with a Lagrangian finite element code. Large distortions cause tangling of the mesh, which will eventually lead to numerical difficulties, such as negative element area or ''bow tie'' elements. Remeshing will allow the problem to continue for a short while, but the large distortions can prevent a complete analysis. SPH is a gridless Lagrangian technique. Requiring no mesh, SPH has the potential to model material fracture, large shear flows and penetration. SPH computes the strain rate and the stress divergence based on the nearest neighbors of a particle, which are determined using an efficient particle-sorting technique. Embedding the SPH method within ppercase[pronto] allows part of the problem to be modeled with quadrilateral finite elements, while other parts are modeled with the gridless SPH method. SPH elements are coupled to the quadrilateral elements through a contact-like algorithm. ((orig.))

  10. Phylogenomics of Phrynosomatid Lizards: Conflicting Signals from Sequence Capture versus Restriction Site Associated DNA Sequencing

    Science.gov (United States)

    Leaché, Adam D.; Chavez, Andreas S.; Jones, Leonard N.; Grummer, Jared A.; Gottscho, Andrew D.; Linkem, Charles W.

    2015-01-01

    Sequence capture and restriction site associated DNA sequencing (RADseq) are popular methods for obtaining large numbers of loci for phylogenetic analysis. These methods are typically used to collect data at different evolutionary timescales; sequence capture is primarily used for obtaining conserved loci, whereas RADseq is designed for discovering single nucleotide polymorphisms (SNPs) suitable for population genetic or phylogeographic analyses. Phylogenetic questions that span both “recent” and “deep” timescales could benefit from either type of data, but studies that directly compare the two approaches are lacking. We compared phylogenies estimated from sequence capture and double digest RADseq (ddRADseq) data for North American phrynosomatid lizards, a species-rich and diverse group containing nine genera that began diversifying approximately 55 Ma. Sequence capture resulted in 584 loci that provided a consistent and strong phylogeny using concatenation and species tree inference. However, the phylogeny estimated from the ddRADseq data was sensitive to the bioinformatics steps used for determining homology, detecting paralogs, and filtering missing data. The topological conflicts among the SNP trees were not restricted to any particular timescale, but instead were associated with short internal branches. Species tree analysis of the largest SNP assembly, which also included the most missing data, supported a topology that matched the sequence capture tree. This preferred phylogeny provides strong support for the paraphyly of the earless lizard genera Holbrookia and Cophosaurus, suggesting that the earless morphology either evolved twice or evolved once and was subsequently lost in Callisaurus. PMID:25663487

  11. The role of the STIR sequence in magnetic resonance imaging examination of bone tumours

    International Nuclear Information System (INIS)

    Golfieri, R.; Baddeley, H.; Pringle, J.S.; Souhami, R.

    1990-01-01

    Sixty patients with primary bone tumours were evaluated with magnetic resonance imaging (MRI) at 0.5 T with both conventional spin-echo (SE) and short inversion time inversion recovery (STIR) sequences. The STIR sequence with T 1 of 120-130 ms in all cases suppressed the high signal from fatty bone marrow, giving a clear depiction of tumour extent, in both its intramedullary and soft-tissue components, and is superior to conventional SE images. The high sensitivity (100% of our cases) of this technique is counterbalanced by its lack of specificity: on STIR sequences both tumour and peritumorous oedema give an increase of signal intensity, limiting assessment of tumour extent. Peritumoral oedema, only present in this series in malignant neoplasms, may however be differentiated on the basis of the configuration of the abnormal areas, and by comparing STIR images with short repetition time/echo time sequence results. (author)

  12. A Note on Sequence Prediction over Large Alphabets

    Directory of Open Access Journals (Sweden)

    Travis Gagie

    2012-02-01

    Full Text Available Building on results from data compression, we prove nearly tight bounds on how well sequences of length n can be predicted in terms of the size σ of the alphabet and the length k of the context considered when making predictions. We compare the performance achievable by an adaptive predictor with no advance knowledge of the sequence, to the performance achievable by the optimal static predictor using a table listing the frequency of each (k + 1-tuple in the sequence. We show that, if the elements of the sequence are chosen uniformly at random, then an adaptive predictor can compete in the expected case if k ≤ logσ n – 3 – ε, for a constant ε > 0, but not if k ≥ logσ n.

  13. Applications and challenges of next-generation sequencing in Brassica species.

    Science.gov (United States)

    Wei, Lijuan; Xiao, Meili; Hayward, Alice; Fu, Donghui

    2013-12-01

    Next-generation sequencing (NGS) produces numerous (often millions) short DNA sequence reads, typically varying between 25 and 400 bp in length, at a relatively low cost and in a short time. This revolutionary technology is being increasingly applied in whole-genome, transcriptome, epigenome and small RNA sequencing, molecular marker and gene discovery, comparative and evolutionary genomics, and association studies. The Brassica genus comprises some of the most agro-economically important crops, providing abundant vegetables, condiments, fodder, oil and medicinal products. Many Brassica species have undergone the process of polyploidization, which makes their genomes exceptionally complex and can create difficulties in genomics research. NGS injects new vigor into Brassica research, yet also faces specific challenges in the analysis of complex crop genomes and traits. In this article, we review the advantages and limitations of different NGS technologies and their applications and challenges, using Brassica as an advanced model system for agronomically important, polyploid crops. Specifically, we focus on the use of NGS for genome resequencing, transcriptome sequencing, development of single-nucleotide polymorphism markers, and identification of novel microRNAs and their targets. We present trends and advances in NGS technology in relation to Brassica crop improvement, with wide application for sophisticated genomics research into agronomically important polyploid crops.

  14. 1H NMR studies of plastocyanin from Scenedesmus obliquus: Complete sequence-specific assignment, secondary structure analysis, and global fold

    International Nuclear Information System (INIS)

    Moore, J.M.; Chazin, W.J.; Wright, P.E.; Powls, R.

    1988-01-01

    Two-dimensional 1 H NMR methods have been used to make sequence-specific resonance assignments for the 97 amino acid residues of the plastocyanin from the green alga Scenedesmus obliquus. Assignments were obtained for all backbone protons and the majority of the side-chain protons. Spin system identification relied heavily on the observation of relayed connectivities to the backbone amide proton. Sequence-specific assignments were made by using the sequential assignment procedure. During this process, an extra valine residue was identified that had not been detected in the original amino acid sequence. Elements of regular secondary structure were identified from characteristic NOE connectivities between backbone protons, coupling constant values, and the observation of slowly exchanging amide protons. The protein in solution contains eight β-strands, one short segment of helix, five reverse turns, and five loops. The β-strands may be arranged into two βsheets on the basis of extensive cross-strand NOE connectivities. The chain-folding topology determined from the NMR experiments is that of a Greek key β-barrel and is similar to that observed for French bean plastocyanin in solution and poplar plastocyanin in the crystalline state. While the overall structures are similar, several differences in local structure between the S. obliquus and higher plant plastocyanins have been identified

  15. Transcription factor trapping by RNA in gene regulatory elements.

    Science.gov (United States)

    Sigova, Alla A; Abraham, Brian J; Ji, Xiong; Molinie, Benoit; Hannett, Nancy M; Guo, Yang Eric; Jangi, Mohini; Giallourakis, Cosmas C; Sharp, Phillip A; Young, Richard A

    2015-11-20

    Transcription factors (TFs) bind specific sequences in promoter-proximal and -distal DNA elements to regulate gene transcription. RNA is transcribed from both of these DNA elements, and some DNA binding TFs bind RNA. Hence, RNA transcribed from regulatory elements may contribute to stable TF occupancy at these sites. We show that the ubiquitously expressed TF Yin-Yang 1 (YY1) binds to both gene regulatory elements and their associated RNA species across the entire genome. Reduced transcription of regulatory elements diminishes YY1 occupancy, whereas artificial tethering of RNA enhances YY1 occupancy at these elements. We propose that RNA makes a modest but important contribution to the maintenance of certain TFs at gene regulatory elements and suggest that transcription of regulatory elements produces a positive-feedback loop that contributes to the stability of gene expression programs. Copyright © 2015, American Association for the Advancement of Science.

  16. International experience in conditioning spent fuel elements

    International Nuclear Information System (INIS)

    Ashton, P.

    1991-04-01

    The purpose of this report is to compile and present in a clear form international experience (USA, Canada, Sweden, FRG, UK, Japan, Switzerland) gained to date in conditioning spent fuel elements. The term conditioning is here taken to mean the handling and packaging of spent fuel elements for short- or long-term storage or final disposal. Plants of a varying nature fall within this scope, both in terms of the type of fuel element treated and the plant purpose eg. experimental or production plant. Emphasis is given to plants which bear some similarity to the concept developed in Germany for direct disposal of spent fuel elements. Worldwide, however, relatively few conditioning plants are in existence or have been conceived. Hence additional plants have been included where aspects of the experience gained are also of relevance eg. plants developed for the consolidation of spent fuel elements. (orig./HP) [de

  17. Reverse transcriptase sequences from mulberry LTR retrotransposons: characterization analysis

    Directory of Open Access Journals (Sweden)

    Ma Bi

    2017-10-01

    Full Text Available Copia and Gypsy play important roles in structural, functional and evolutionary dynamics of plant genomes. In this study, a total of 106 and 101, Copia and Gypsy reverse transcriptase (rt were amplified respectively in the Morus notabilis genome using degenerate primers. All sequences exhibited high levels of heterogeneity, were rich in AT and possessed higher sequence divergence of Copia rt in comparison to Gypsy rt. Two reasons are likely to account for this phenomenon: a these elements often experience deletions or fragmentation by illegitimate or unequal homologous recombination in the transposition process; b strong purifying selective pressure drives the evolution of these elements through “selective silencing” with random mutation and eventual deletion from the host genome. Interestingly, mulberry rt clustered with other rt from distantly related taxa according to the phylogenetic analysis. This phenomenon did not result from horizontal transposable element transfer. Results obtained from fluorescence in situ hybridization revealed that most of the hybridization signals were preferentially concentrated in pericentromeric and distal regions of chromosomes, and these elements may play important roles in the regions in which they are found. Results of this study support the continued pursuit of further functional studies of Copia and Gypsy in the mulberry genome.

  18. Long Aftershock Sequences within Continents and Implications for Earthquake Hazard Assessment

    Science.gov (United States)

    Stein, S. A.; Liu, M.

    2014-12-01

    Recent seismicity in the Tangshan region in North China has prompted concern about a repetition of the 1976 M7.8 earthquake that destroyed the city, killing more than 242,000 people. However, the decay of seismicity there implies that the recent earthquakes are probably aftershocks of the 1976 event. This 37-year sequence is an example of the phenomenon that aftershock sequences within continents are often significantly longer than the typical 10 years at plate boundaries. The long sequence of aftershocks in continents is consistent with a simple friction-based model predicting that the length of aftershock sequences varies inversely with the rate at which faults are loaded. Hence the slowly-deforming continents tend to have aftershock sequences significantly longer than at rapidly-loaded plate boundaries. This effect has two consequences for hazard assessment. First, within the heavily populated continents that are typically within plate interiors, assessments of earthquake hazards rely significantly on the assumption that the locations of small earthquakes shown by the short historical record reflect continuing deformation that will cause future large earthquakes. This assumption would lead to overestimation of the hazard in presently active areas and underestimation elsewhere, if some of these small events are aftershocks. Second, successful attempts to remove aftershocks from catalogs used for hazard assessment would underestimate the hazard, because much of the hazard is due to the aftershocks, and the declustering algorithms implicitly assume short aftershock sequences and thus do not remove long-duration ones.

  19. Chemistry of the heaviest elements

    International Nuclear Information System (INIS)

    Hoffman, D.C.

    1996-01-01

    Studies of the chemical properties of the elements at the uppermost end of the periodic table are discussed. Some historical perspective is given, but major emphasis is on recent studies. Isotopes of these elements are short-lived and, therefore, must be studied near the site of production. They must be produced with charged-particle beams at accelerators rather than via neutron capture. The use of radioactive heavy actinide targets is often required and the number of atoms produced is so small that any chemistry to be performed must be done on an ''atom-at-a-time'' basis. Furthermore, a knowledge of their nuclear properties is required in order to identify and detect them. To date, both gas and aqueous phase properties of elements as heavy as element 104 (rutherfordium) and element 105 (hahnium) have been investigated, even though their longest-lived known isotopes have half-lives of only 65 and 35 seconds, respectively. The experimental results show that their chemical properties cannot be simply extrapolated from the known properties of their lighter homologs in the periodic table, emphasizing the importance of obtaining additional experimental information for the heaviest elements to compare with predictions and help assess the influence of relativistic effects. The feasibility of the extension of chemical studies to still heavier elements is also discussed. (orig.)

  20. Origin of very-short orbital-period binary systems

    International Nuclear Information System (INIS)

    Miyaji, S.

    1983-01-01

    Recent observations of four close binaries have established that there is a group of very-short orbital-period (VSOP) binaries whose orbital periods are less than 60 minutes. The VSOP binaries consist of both X-ray close binaries and cataclysmic variables. Their orbital periods are too short to have a main-sequence companion. However, four binaries, none of which belongs to any globular cluster, are too abundant to be explained by the capturing mechanism of a white dwarf. Therefore it seemed to be worthwhile to present an evolutionary scenario from an original binary system which can be applied for all VSOP binaries. (Auth.)