repeats flank genomic: Topics by WorldWideScience.org

Sample records for repeats flank genomic

Flanking Variation Influences Rates of Stutter in Simple Repeats

Directory of Open Access Journals (Sweden)

August E. Woerner

2017-11-01

It has been posited that the longest uninterrupted stretch (LUS) of tandem repeats, as defined by the number of exactly matching repeating motif units, is a better predictor of rates of stutter than the parental allele length (PAL). While there are cases where this hypothesis is likely correct, such as the 9.3 allele in the TH01 locus, there can be situations where it may not apply as well. For example, the PAL may capture flanking indel variations while remaining insensitive to polymorphisms in the repeat, and these haplotypic changes may impact the stutter rate. To address this, rates of stutter were contrasted against the LUS as well as the PAL on different flanking haplotypic backgrounds. This study shows that rates of stutter can vary substantially depending on the flanking haplotype, and while there are cases where the LUS is a better predictor of stutter than the PAL, examples to the contrary are apparent in commonly assayed forensic markers. Further, flanking variation that is 7 bp from the repeat region can impact rates of stutter. These findings suggest that non-proximal effects, such as DNA secondary structure, may be impacting the rates of stutter in common forensic short tandem repeat markers.
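
The LUS definition quoted in this abstract (the number of exactly matching, consecutive motif units) can be illustrated with a minimal Python sketch. The function name `longest_uninterrupted_stretch` and the example allele are hypothetical and are not taken from the study; the sketch also assumes the motif is given in a fixed phase and does not overlap itself.

```python
import re


def longest_uninterrupted_stretch(allele, motif):
    """Return the largest number of consecutive, exactly matching motif copies.

    Assumption: `motif` is supplied in a fixed phase and is not
    self-overlapping, so non-overlapping regex matching finds every run.
    """
    # Each match is one uninterrupted run of whole motif copies.
    runs = re.findall(r"(?:%s)+" % re.escape(motif), allele)
    return max((len(run) // len(motif) for run in runs), default=0)


# Made-up allele: six AATG units, an interrupting partial unit, three more units.
example_allele = "AATG" * 6 + "ATG" + "AATG" * 3
lus = longest_uninterrupted_stretch(example_allele, "AATG")
```

Here the interruption splits the repeat into runs of six and three units, so the LUS is the longer run, whereas an allele-length measure would count across the interruption.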
[Bioinformatics Analysis of Clustered Regularly Interspaced Short Palindromic Repeats in the Genomes of Shigella].

Science.gov (United States)

Wang, Pengfei; Wang, Yingfang; Duan, Guangcai; Xue, Zerun; Wang, Linlin; Guo, Xiangjiao; Yang, Haiyan; Xi, Yuanlin

2015-04-01

This study aimed to explore the features of clustered regularly interspaced short palindromic repeats (CRISPR) structures in Shigella by using bioinformatics. We used bioinformatics methods, including BLAST, alignment and RNA structure prediction, to analyze the CRISPR structures of Shigella genomes. The results showed that CRISPRs existed in the four groups of Shigella, and the flanking sequences of upstream CRISPRs could be classified into the same group as those of the downstream. We also found some relatively conserved palindromic motifs in the leader sequences. Repeat sequences fell into the same group as their corresponding flanking sequences, and could be classified into two different types by their RNA secondary structures, which contain "stem" and "ring". Some spacers were found to be homologous to partial sequences of plasmids or phages. The study indicated that there were correlations between repeat sequences and flanking sequences, and the repeats might act as a kind of recognition mechanism to mediate the interaction between foreign genetic elements and Cas proteins.
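
The repeat, spacer and flanking-sequence layout discussed in this abstract can be sketched in a few lines of Python. This is only a toy illustration and not the authors' pipeline: it assumes the repeat consensus is already known and occurs as exact copies, and the function name `split_crispr_array` and the sequences are made up.

```python
def split_crispr_array(array_seq, repeat):
    """Split a CRISPR array on exact copies of `repeat`.

    Returns (leader_side_flank, spacers, trailer_side_flank).
    Assumption: repeats are exact copies; real arrays often carry
    degenerate repeats and would need approximate matching.
    """
    parts = array_seq.split(repeat)
    if len(parts) < 2:
        # The repeat does not occur, so there is no array to split.
        return array_seq, [], ""
    # The first and last pieces are the flanks; pieces in between are spacers.
    return parts[0], [p for p in parts[1:-1] if p], parts[-1]


# Made-up array: flank, repeat, spacer, repeat, spacer, repeat, flank.
repeat = "GTTCCC"
array_seq = "TTTT" + repeat + "AAAT" + repeat + "CCGA" + repeat + "GGGG"
leader, spacers, trailer = split_crispr_array(array_seq, repeat)
```

The extracted spacers could then be compared against plasmid or phage sequences with a tool such as BLAST, which is the kind of comparison the abstract describes.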
[Comparative analysis of clustered regularly interspaced short palindromic repeats (CRISPRs) loci in the genomes of halophilic archaea].

Science.gov (United States)

Zhang, Fan; Zhang, Bing; Xiang, Hua; Hu, Songnian

2009-11-01

Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) is a widespread system that provides acquired resistance against phages in bacteria and archaea. Here we aim to analyze, genome-wide, the CRISPRs in extreme halophilic archaea for which whole genome sequences are currently available. We used bioinformatics methods including alignment, conservation analysis, GC content and RNA structure prediction to analyze the CRISPR structures of 7 haloarchaeal genomes. We identified the CRISPR structures in 5 halophilic archaea and revealed a conserved palindromic motif in the flanking regions of these CRISPR structures. In addition, we found that the repeat sequences of large CRISPR structures in halophilic archaea were greatly conserved, and two types of predicted RNA secondary structures derived from the repeat sequences were likely determined by the fourth base of the repeat sequence. Our results support the proposal that the leader sequence may function as a recognition site by having palindromic structures in flanking regions, and the stem-loop secondary structure formed by repeat sequences may function in mediating the interaction between foreign genetic elements and CAS-encoded proteins.
Hybridization Capture Using Short PCR Products Enriches Small Genomes by Capturing Flanking Sequences (CapFlank)

DEFF Research Database (Denmark)

Tsangaras, Kyriakos; Wales, Nathan; Sicheritz-Pontén, Thomas

2014-01-01

, a non-negligible fraction of the resulting sequence reads are not homologous to the bait. We demonstrate that during capture, the bait-hybridized library molecules add additional flanking library sequences iteratively, such that baits limited to targeting relatively short regions (e.g. few hundred...... nucleotides) can result in enrichment across entire mitochondrial and bacterial genomes. Our findings suggest that some of the off-target sequences derived in capture experiments are non-randomly enriched, and that CapFlank will facilitate targeted enrichment of large contiguous sequences with minimal prior...
Evolutionary genomics of miniature inverted-repeat transposable elements (MITEs) in Brassica.

Science.gov (United States)

Nouroz, Faisal; Noreen, Shumaila; Heslop-Harrison, J S

2015-12-01

Miniature inverted-repeat transposable elements (MITEs) are truncated derivatives of autonomous DNA transposons, and are dispersed abundantly in most eukaryotic genomes. We aimed to characterize various MITE families in Brassica in terms of their presence, sequence characteristics and evolutionary activity. Dot plot analyses involving comparison of homoeologous bacterial artificial chromosome (BAC) sequences allowed identification of 15 novel families of mobile MITEs. Of these, 5 were Stowaway-like with TA Target Site Duplications (TSDs), 4 Tourist-like with TAA/TTA TSDs, 5 Mutator-like with 9-10 bp TSDs and 1 novel MITE (BoXMITE1) flanked by 3 bp TSDs. Our data suggested that there are about 30,000 MITE-related sequences in Brassica rapa and B. oleracea genomes. In situ hybridization showed one abundant family was dispersed in the A-genome, while another was located near 45S rDNA sites. PCR analysis using primers flanking sequences of MITE elements detected MITE insertion polymorphisms between and within the three Brassica (AA, BB, CC) genomes, with many insertions being specific to single genomes and others showing evidence of more recent evolutionary insertions. Our BAC sequence comparison strategy enables identification of evolutionarily active MITEs with no prior knowledge of MITE sequences. The details of MITE families reported in Brassica enable their identification, characterization and annotation. Insertion polymorphisms of MITEs and their transposition activity indicate an important mechanism of genome evolution and diversification. MITE families derived from known Mariner, Harbinger and Mutator DNA transposons were discovered, as well as some novel structures. The identification of Brassica MITEs will have broad applications in Brassica genomics, breeding, hybridization and phylogeny through their use as DNA markers.
Identification of genomic insertion and flanking sequence of G2-EPSPS and GAT transgenes in soybean using whole genome sequencing method

Directory of Open Access Journals (Sweden)

Bingfu Guo

2016-07-01

Molecular characterization of sequences flanking exogenous fragment insertions is essential for safety assessment and labeling of genetically modified organisms (GMO). In this study, the T-DNA insertion sites and flanking sequences were identified in two newly developed transgenic glyphosate-tolerant soybeans, GE-J16 and ZH10-6, using a whole genome sequencing (WGS) method. About 21 Gb of sequence data (~21× coverage) for each line was generated on the Illumina HiSeq 2500 platform. The junction reads mapped to the boundary of T-DNA and flanking sequences in these two events were identified by comparing all sequencing reads with the soybean reference genome and the sequence of the transgenic vector. The putative insertion loci and flanking sequences were further confirmed by PCR amplification, Sanger sequencing, and co-segregation analysis. All these analyses supported that exogenous T-DNA fragments were integrated in positions of Chr19: 50543767-50543792 and Chr17: 7980527-7980541 in these two transgenic lines. Identification of the genomic insertion site of the G2-EPSPS and GAT transgenes will facilitate the use of their glyphosate-tolerant traits in soybean breeding programs. These results also demonstrated that WGS is a cost-effective and rapid method of identifying sites of T-DNA insertions and flanking sequences in soybean.
Microsatellites grant more stable flanking genes

Directory of Open Access Journals (Sweden)

Joukhadar Reem

2012-10-01

Background: Microsatellites, or simple sequence repeats (SSRs), are DNA sequences that include tandem copies of specific sequences no longer than six bases. SSRs are ubiquitous in all genomes and highly mutable. Presentation of the hypothesis: Results from previous studies suggest that flanking regions of SSRs exhibit high stability in a wide range of organisms. We hypothesized that the ability of SSRs to discard weak DNA polymerases could be responsible for this unusual stability. When the weak polymerases are being decayed over SSRs, the flanking sequences would have higher opportunity to be replicated by more stable DNA polymerases. We present evidence of the molecular basis of our hypothesis. Testing the hypothesis: The hypothesis could be tested by examining the activity of DNA polymerase during and after a number of PCRs. The PCR reactions should be run with the same SSR locus possessing differences in the SSR length. The hypothesis could also be tested by comparing the mutational rate of a transferred gene between two transformations. The first one has a naked T-DNA (transferred DNA), while the second one has the same T-DNA flanked with two SSRs. Implications of the hypothesis: In any transformation experiment, flanking the T-DNA fragment with SSR sequences would result in more stably transferred genes. This process would decrease the unpredictable risks that may occur because of the mutational pressure on this foreign segment.
Accurate typing of short tandem repeats from genome-wide sequencing data and its applications.

Science.gov (United States)

Fungtammasan, Arkarachai; Ananda, Guruprasad; Hile, Suzanne E; Su, Marcia Shu-Wei; Sun, Chen; Harris, Robert; Medvedev, Paul; Eckert, Kristin; Makova, Kateryna D

2015-05-01

Short tandem repeats (STRs) are implicated in dozens of human genetic diseases and contribute significantly to genome variation and instability. Yet profiling STRs from short-read sequencing data is challenging because of their high sequencing error rates. Here, we developed STR-FM, short tandem repeat profiling using flank-based mapping, a computational pipeline that can detect the full spectrum of STR alleles from short-read data, can adapt to emerging read-mapping algorithms, and can be applied to heterogeneous genetic samples (e.g., tumors, viruses, and genomes of organelles). We used STR-FM to study STR error rates and patterns in publicly available human and in-house generated ultradeep plasmid sequencing data sets. We discovered that STRs sequenced with a PCR-free protocol have up to ninefold fewer errors than those sequenced with a PCR-containing protocol. We constructed an error correction model for genotyping STRs that can distinguish heterozygous alleles containing STRs with consecutive repeat numbers. Applying our model and pipeline to Illumina sequencing data with 100-bp reads, we could confidently genotype several disease-related long trinucleotide STRs. Utilizing this pipeline, for the first time we determined the genome-wide STR germline mutation rate from a deeply sequenced human pedigree. Additionally, we built a tool that recommends minimal sequencing depth for accurate STR genotyping, depending on repeat length and sequencing read length. The required read depth increases with STR length and is lower for a PCR-free protocol. This suite of tools addresses the pressing challenges surrounding STR genotyping, and thus is of wide interest to researchers investigating disease-related STRs and STR evolution. © 2015 Fungtammasan et al.; Published by Cold Spring Harbor Laboratory Press.
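
As a rough illustration of the flank-based idea named in this abstract, the sketch below finds a perfect tandem repeat inside a read and returns the sequences on either side of it, which could then be mapped to a reference instead of the repeat itself. This is not the STR-FM pipeline: the function name `find_str_with_flanks`, the regular expression and the minimum-copy threshold are assumptions made for illustration.

```python
import re


def find_str_with_flanks(read, motif_len=2, min_copies=4):
    """Return (left_flank, motif, copies, right_flank) for the first perfect
    tandem repeat with the given motif length, or None if there is none."""
    # One motif of motif_len bases followed by at least (min_copies - 1) more copies.
    pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (motif_len, min_copies - 1))
    match = pattern.search(read)
    if match is None:
        return None
    motif = match.group(1)
    copies = len(match.group(0)) // motif_len
    # Everything outside the repeat is treated as flanking sequence.
    return read[:match.start()], motif, copies, read[match.end():]


# Made-up read: 4 bp flank, five AC units, 4 bp flank.
result = find_str_with_flanks("GGTT" + "AC" * 5 + "GGTT")
```

Because the leftmost match wins, the reported motif phase depends on the surrounding bases; a fuller implementation would normalize the motif and tolerate sequencing errors.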
Analysis of CR1 Repeats in the Zebra Finch Genome

Directory of Open Access Journals (Sweden)

George E. Liu

2013-06-01

Most bird species have smaller genomes and fewer repeats than mammals. The Chicken Repeat 1 (CR1) repeat is one of the most abundant families of repeats, ranging from ~133,000 to ~187,000 copies accounting for ~50 to ~80% of the interspersed repeats in the zebra finch and chicken genomes, respectively. CR1 repeats are believed to have arisen from the retrotransposition of a small number of master elements, which gave rise to multiple CR1 subfamilies in the chicken. In this study, we performed a global assessment of the divergence distributions, phylogenies, and consensus sequences of CR1 repeats in the zebra finch genome. We identified and validated 34 CR1 subfamilies and further analyzed the correlation between these subfamilies. We also discovered 4 novel lineage-specific CR1 subfamilies in the zebra finch when compared to the chicken genome. We built various evolutionary trees of these subfamilies and concluded that CR1 repeats may play an important role in reshaping the structure of bird genomes.
Analysis of gene order data supports vertical inheritance of the leukotoxin operon and genome rearrangements in the 5' flanking region in genus Mannheimia

DEFF Research Database (Denmark)

Larsen, Jesper; Kuhnert, Peter; Frey, Joachim

2007-01-01

subclades, thus reaffirming the hypothesis of vertical inheritance of the leukotoxin operon. The presence of individual 5' flanking regions in M. haemolytica + M. glucosida and M. granulomatis reflects later genome rearrangements within each subclade. The evolution of the novel 5' flanking region in M...
Towards accurate de novo assembly for genomes with repeats

NARCIS (Netherlands)

Bucur, Doina

2017-01-01

De novo genome assemblers designed for short k-mer length or using short raw reads are unlikely to recover complex features of the underlying genome, such as repeats hundreds of bases long. We implement a stochastic machine-learning method which obtains accurate assemblies with repeats and
Structural organization of glycophorin A and B genes: Glycophorin B gene evolved by homologous recombination at Alu repeat sequences

International Nuclear Information System (INIS)

Kudo, Shinichi; Fukuda, Minoru

1989-01-01

Glycophorins A (GPA) and B (GPB) are two major sialoglycoproteins of the human erythrocyte membrane. Here the authors present a comparison of the genomic structures of GPA and GPB developed by analyzing DNA clones isolated from a K562 genomic library. Nucleotide sequences of exon-intron junctions and 5' and 3' flanking sequences revealed that the GPA and GPB genes consist of 7 and 5 exons, respectively, and both genes have >95% identical sequence from the 5' flanking region to the region ∼1 kilobase downstream from the exon encoding the transmembrane regions. In this homologous part of the genes, GPB lacks one exon due to a point mutation at the 5' splicing site of the third intron, which inactivates the 5' cleavage event of splicing and leads to ligation of the second to the fourth exon. Following these very homologous sequences, the genomic sequences for GPA and GPB diverge significantly and no homology can be detected in their 3' end sequences. The analysis of the Alu sequences and their flanking direct repeat sequences suggests that an ancestral genomic structure has been maintained in the GPA gene, whereas the GPB gene has arisen from the acquisition of 3' sequences different from those of the GPA gene by homologous recombination at the Alu repeats during or after gene duplication.
Preliminary Genomic Characterization of Ten Hardwood Tree Species from Multiplexed Low Coverage Whole Genome Sequencing.

Directory of Open Access Journals (Sweden)

Margaret Staton

Forest health issues are on the rise in the United States, resulting from introduction of alien pests and diseases, coupled with abiotic stresses related to climate change. Increasingly, forest scientists are finding genetic/genomic resources valuable in addressing forest health issues. For a set of ten ecologically and economically important native hardwood tree species representing a broad phylogenetic spectrum, we used low coverage whole genome sequencing from multiplex Illumina paired ends to economically profile their genomic content. For six species, the genome content was further analyzed by flow cytometry in order to determine the nuclear genome size. Sequencing yielded a depth of 0.8X to 7.5X, from which in silico analysis yielded preliminary estimates of gene and repetitive sequence content in the genome for each species. Thousands of genomic SSRs were identified, with a clear predisposition toward dinucleotide repeats and AT-rich repeat motifs. Flanking primers were designed for SSR loci for all ten species, ranging from 891 loci in sugar maple to 18,167 in redbay. In summary, we have demonstrated that useful preliminary genome information including repeat content, gene content and useful SSR markers can be obtained at low cost and time input from a single lane of Illumina multiplex sequence.
SeqEntropy: genome-wide assessment of repeats for short read sequencing.

Directory of Open Access Journals (Sweden)

Hsueh-Ting Chu

BACKGROUND: Recent studies on genome assembly from short-read sequencing data reported the limitation of this technology to reconstruct the entire genome even at very high depth coverage. We investigated the limitation from the perspective of information theory to evaluate the effect of repeats on short-read genome assembly using idealized (error-free) reads at different lengths. METHODOLOGY/PRINCIPAL FINDINGS: We define a metric H(k) to be the entropy of sequencing reads at a read length k and use the relative loss of entropy ΔH(k) to measure the impact of repeats for the reconstruction of whole-genome from sequences of length k. In our experiments, we found that entropy loss correlates well with de-novo assembly coverage of a genome, and a score of ΔH(k) > 1% indicates a severe loss in genome reconstruction fidelity. The minimal read lengths to achieve ΔH(k) < 1% are different for various organisms and are independent of the genome size. For example, in order to meet the threshold of ΔH(k) < 1%, a read length of 60 bp is needed for the sequencing of human genome (3.2×10^9 bp) and 320 bp for the sequencing of fruit fly (1.8×10^8 bp). We also calculated the ΔH(k) scores for 2725 prokaryotic chromosomes and plasmids at several read lengths. Our results indicate that the levels of repeats in different genomes are diverse and the entropy of sequencing reads provides a measurement for the repeat structures. CONCLUSIONS/SIGNIFICANCE: The proposed entropy-based measurement, which can be calculated in seconds to minutes in most cases, provides a rapid quantitative evaluation on the limitation of idealized short-read genome sequencing. Moreover, the calculation can be parallelized to scale up to large eukaryotic genomes. This approach may be useful to tune the sequencing parameters to achieve better genome assemblies when a closely related genome is already available.
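
The abstract does not give the formula for H(k) or ΔH(k), so the following Python sketch is only one plausible reading: it takes H(k) to be the Shannon entropy of the distribution of error-free reads of length k drawn from every position of a single strand, and ΔH(k) to be the shortfall relative to the maximum entropy that would be reached if every read were unique. The function names and this exact definition are assumptions, not the SeqEntropy implementation.

```python
import math
from collections import Counter


def read_entropy(genome, k):
    """Shannon entropy (bits) of all length-k substrings of `genome`."""
    counts = Counter(genome[i:i + k] for i in range(len(genome) - k + 1))
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def relative_entropy_loss(genome, k):
    """Assumed form of the relative loss: (H_max - H(k)) / H_max, where
    H_max = log2(number of read positions) holds when all reads are unique."""
    n_reads = len(genome) - k + 1
    if n_reads < 2:
        raise ValueError("need at least two read positions")
    h_max = math.log2(n_reads)
    return (h_max - read_entropy(genome, k)) / h_max


# Toy sequence containing a repeated block: short reads collide, longer ones do not.
toy = "ACGTACGT"
loss_short = relative_entropy_loss(toy, 2)
loss_long = relative_entropy_loss(toy, 5)
```

Under this reading, repeated k-mers lower the entropy, and increasing k until every read is unique drives the loss toward zero, which matches the qualitative behaviour the abstract reports.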
Discrepancy variation of dinucleotide microsatellite repeats in eukaryotic genomes

Directory of Open Access Journals (Sweden)

HUAN GAO

2009-01-01

To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared: primates, mammalia (excluding primates and rodentia), rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi. The data used in the analysis are from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. The results also suggest that the variability of SSRs in the genomes of low-ranking species seems to be greater than that of high-ranking species, excluding primates and fungi.
Genome wide characterization of simple sequence repeats in watermelon genome and their application in comparative mapping and genetic diversity analysis.

Science.gov (United States)

Zhu, Huayu; Song, Pengyao; Koo, Dal-Hoe; Guo, Luqin; Li, Yanman; Sun, Shouru; Weng, Yiqun; Yang, Luming

2016-08-05

Microsatellite markers are one of the most informative and versatile DNA-based markers used in plant genetic research, but their development has traditionally been difficult and costly. The whole genome sequencing with next-generation sequencing (NGS) technologies provides large amounts of sequence data to develop numerous microsatellite markers at whole genome scale. SSR markers have great advantage in cross-species comparisons and allow investigation of karyotype and genome evolution through highly efficient computation approaches such as in silico PCR. Here we described genome wide development and characterization of SSR markers in the watermelon (Citrullus lanatus) genome, which were then used in comparative analysis with two other important crop species in the Cucurbitaceae family: cucumber (Cucumis sativus L.) and melon (Cucumis melo L.). We further applied these markers in evaluating the genetic diversity and population structure in watermelon germplasm collections. A total of 39,523 microsatellite loci were identified from the watermelon draft genome with an overall density of 111 SSRs/Mbp, and 32,869 SSR primers were designed with suitable flanking sequences. The dinucleotide SSRs were the most common type representing 34.09 % of the total SSR loci and the AT-rich motifs were the most abundant in all nucleotide repeat types. In silico PCR analysis identified 832 and 925 SSR markers with each having a single amplicon in the cucumber and melon draft genome, respectively. Comparative analysis with these cross-species SSR markers revealed complicated mosaic patterns of syntenic blocks among the genomes of three species. In addition, genetic diversity analysis of 134 watermelon accessions with 32 highly informative SSR loci placed these lines into two groups with all accessions of C. lanatus var. citroides and three accessions of C. colocynthis clustered in one group and all accessions of C. lanatus var. lanatus and the remaining accessions of C. colocynthis
The CRISPRdb database and tools to display CRISPRs and to generate dictionaries of spacers and repeats

Directory of Open Access Journals (Sweden)

Vergnaud Gilles

2007-05-01

Background: In Archaea and Bacteria, the repeated elements called CRISPRs for "clustered regularly interspaced short palindromic repeats" are believed to participate in the defence against viruses. Short sequences called spacers are stored in-between repeated elements. In the current model, motifs comprising spacers and repeats may target an invading DNA and lead to its degradation through a proposed mechanism similar to RNA interference. Analysis of intra-species polymorphism shows that new motifs (one spacer and one repeated element) are added in a polarised fashion. Although their principal characteristics have been described, a lot remains to be discovered on the way CRISPRs are created and evolve. As new genome sequences become available it appears necessary to develop automated scanning tools to make available CRISPRs related information and to facilitate additional investigations. Description: We have produced a program, CRISPRFinder, which identifies CRISPRs and extracts the repeated and unique sequences. Using this software, a database is constructed which is automatically updated monthly from newly released genome sequences. Additional tools were created to allow the alignment of flanking sequences in search for similarities between different loci and to build dictionaries of unique sequences. To date, almost six hundred CRISPRs have been identified in 475 published genomes. Two Archaea out of thirty-seven and about half of Bacteria do not possess a CRISPR. Fine analysis of repeated sequences strongly supports the current view that new motifs are added at one end of the CRISPR adjacent to the putative promoter. Conclusion: It is hoped that availability of a public database, regularly updated and which can be queried on the web will help in further dissecting and understanding CRISPR structure and flanking sequences evolution. Subsequent analyses of the intra-species CRISPR polymorphism will be facilitated by CRISPRFinder and the
FR-like EBNA1 binding repeats in the human genome

International Nuclear Information System (INIS)

D'Herouel, Aymeric Fouquier; Birgersdotter, Anna; Werner, Maria

2010-01-01

Epstein-Barr virus (EBV) is widely spread in the human population. EBV nuclear antigen 1 (EBNA1) is a transcription factor that activates viral genes and is necessary for viral replication and partitioning, which binds the EBV genome cooperatively. We identify similar EBNA1 repeat binding sites in the human genome using a nearest-neighbor positional weight matrix. Previously experimentally verified EBNA1 sites in the human genome are successfully recovered by our approach. Most importantly, 40 novel regions are identified in the human genome, constituted of tandemly repeated binding sites for EBNA1. Genes located in the vicinity of these regions are presented as possible targets for EBNA1-mediated regulation. Among these, four are discussed in more detail: IQCB1, IMPG1, IRF2BP2 and TPO. Incorporating the cooperative actions of EBNA1 is essential when identifying regulatory regions in the human genome and we believe the findings presented here are highly valuable for the understanding of EBV-induced phenotypic changes.
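
To make the weight-matrix search mentioned in this abstract more concrete, here is a minimal Python sketch of scanning a sequence with a plain position weight matrix of log-odds scores. It deliberately omits the nearest-neighbor (dinucleotide) dependencies and the cooperative binding the authors describe. The function names, the pseudocount, the uniform background and the example sites are all assumptions for illustration and are not EBNA1 binding data.

```python
import math


def build_pwm(sites, pseudocount=1.0, background=0.25):
    """Build per-position log2 odds scores from equal-length example sites."""
    pwm = []
    total = len(sites) + 4 * pseudocount
    for pos in range(len(sites[0])):
        column = [site[pos] for site in sites]
        pwm.append({
            base: math.log2(((column.count(base) + pseudocount) / total) / background)
            for base in "ACGT"
        })
    return pwm


def scan(sequence, pwm, threshold):
    """Return (start, score) for every window scoring at or above threshold.

    Assumption: `sequence` contains only A, C, G and T.
    """
    width = len(pwm)
    hits = []
    for start in range(len(sequence) - width + 1):
        score = sum(pwm[j][sequence[start + j]] for j in range(width))
        if score >= threshold:
            hits.append((start, score))
    return hits


# Made-up training sites and target sequence.
pwm = build_pwm(["GATA", "GATA", "GATT"])
hits = scan("CCGATACC", pwm, threshold=4.0)
```

Tandemly repeated sites of the kind reported in the abstract would show up in such a scan as several hits at regular spacing within a short window.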
Chlamydomonas chloroplasts can use short dispersed repeats and multiple pathways to repair a double-strand break in the genome.

Science.gov (United States)

Odom, Obed W; Baek, Kwang-Hyun; Dani, Radhika N; Herrin, David L

2008-03-01

Certain group I introns insert into intronless DNA via an endonuclease that creates a double-strand break (DSB). There are two models for intron homing in phage: synthesis-dependent strand annealing (SDSA) and double-strand break repair (DSBR). The Cr.psbA4 intron homes efficiently from a plasmid into the chloroplast psbA gene in Chlamydomonas, but little is known about the mechanism. Analysis of co-transformants selected using a spectinomycin-resistant 16S gene (16S(spec)) provided evidence for both pathways. We also examined the consequences of the donor DNA having only one-sided or no homology with the psbA gene. When there was no homology with the donor DNA, deletions of up to 5 kb involving direct repeats that flank the psbA gene were obtained. Remarkably, repeats as short as 15 bp were used for this repair, which is consistent with the single-strand annealing (SSA) pathway. When the donor had one-sided homology, the DSB in most co-transformants was repaired using two DNAs, the donor and the 16S(spec) plasmid, which, coincidentally, contained a region that is repeated upstream of psbA. DSB repair using two separate DNAs provides further evidence for the SDSA pathway. These data show that the chloroplast can repair a DSB using short dispersed repeats located proximally, distally, or even on separate molecules relative to the DSB. They also provide a rationale for the extensive repertoire of repeated sequences in this genome.

Repeated extragenic sequences in prokaryotic genomes: a proposal for the origin and dynamics of the RUP element in Streptococcus pneumoniae.

Science.gov (United States)

Oggioni, M R; Claverys, J P

1999-10-01

A survey of all Streptococcus pneumoniae GenBank/EMBL DNA sequence entries and of the public domain sequence (representing more than 90% of the genome) of an S. pneumoniae type 4 strain allowed identification of 108 copies of a 107-bp-long highly repeated intergenic element called RUP (for repeat unit of pneumococcus). Several features of the element, revealed in this study, led to the proposal that RUP is an insertion sequence (IS)-derivative that could still be mobile. Among these features are: (1) a highly significant homology between the terminal inverted repeats (IRs) of RUPs and of IS630-Spn1, a new putative IS of S. pneumoniae; and (2) insertion at a TA dinucleotide, a characteristic target of several members of the IS630 family. Trans-mobilization of RUP is therefore proposed to be mediated by the transposase of IS630-Spn1. To account for the observation that RUPs are distributed among four subtypes which exhibit different degrees of sequence homogeneity, a scenario is invoked based on successive stages of RUP mobility and non-mobility, depending on whether an active transposase is present or absent. In the latter situation, an active transposase could be reintroduced into the species through natural transformation. Examination of sequences flanking RUP revealed a preferential association with ISs. It also provided evidence that RUPs promote sequence rearrangements, thereby contributing to genome flexibility. The possibility that RUP preferentially targets transforming DNA of foreign origin and subsequently favours disruption/rearrangement of exogenous sequences is discussed.
Complete chloroplast genome of Trachelium caeruleum: extensive rearrangements are associated with repeats and tRNAs

Energy Technology Data Exchange (ETDEWEB)

Haberle, Rosemarie C.; Fourcade, Matthew L.; Boore, Jeffrey L.; Jansen, Robert K.

2006-01-09

Chloroplast genome structure, gene order and content are highly conserved in land plants. We sequenced the complete chloroplast genome sequence of Trachelium caeruleum (Campanulaceae), a member of an angiosperm family known for highly rearranged chloroplast genomes. The total genome size is 162,321 bp with an IR of 27,273 bp, LSC of 100,113 bp and SSC of 7,661 bp. The genome encodes 115 unique genes, with 19 duplicated in the IR, a tRNA (trnI-CAU) duplicated once in the LSC and a protein coding gene (psbJ) duplicated twice, for a total of 137 genes. Four genes (ycf15, rpl23, infA and accD) are truncated and likely nonfunctional; three others (clpP, ycf1 and ycf2) are so highly diverged that they may now be pseudogenes. The most conspicuous feature of the Trachelium genome is the presence of eighteen internally unrearranged blocks of genes that have been inverted or relocated within the genome, relative to the typical gene order of most angiosperm chloroplast genomes. Recombination between repeats or tRNAs has been suggested as two means of chloroplast genome rearrangements. We compared the relative number of repeats in Trachelium to eight other angiosperm chloroplast genomes, and evaluated the location of repeats and tRNAs in relation to rearrangements. Trachelium has the highest number and largest repeats, which are concentrated near inversion endpoints or other rearrangements. tRNAs occur at many but not all inversion endpoints. There is likely no single mechanism responsible for the remarkable number of alterations in this genome, but both repeats and tRNAs are clearly associated with these rearrangements. Land plant chloroplast genomes are highly conserved in structure, gene order and content. The chloroplast genomes of ferns, the gymnosperm Ginkgo, and most angiosperms are nearly collinear, reflecting the gene order in lineages that diverged from lycopsids and the ancestral chloroplast gene order over 350 million years ago (Raubeson and Jansen, 1992). Although earlier mapping studies
Recombination-dependent replication and gene conversion homogenize repeat sequences and diversify plastid genome structure.

Science.gov (United States)

Ruhlman, Tracey A; Zhang, Jin; Blazier, John C; Sabir, Jamal S M; Jansen, Robert K

2017-04-01

There is a misinterpretation in the literature regarding the variable orientation of the small single copy region of plastid genomes (plastomes). The common phenomenon of small and large single copy inversion, hypothesized to occur through intramolecular recombination between inverted repeats (IR) in a circular, single unit-genome, in fact, more likely occurs through recombination-dependent replication (RDR) of linear plastome templates. If RDR can be primed through both intra- and intermolecular recombination, then this mechanism could not only create inversion isomers of so-called single copy regions, but also an array of alternative sequence arrangements. We used Illumina paired-end and PacBio single-molecule real-time (SMRT) sequences to characterize repeat structure in the plastome of Monsonia emarginata (Geraniaceae). We used OrgConv and inspected nucleotide alignments to infer ancestral nucleotides and identify gene conversion among repeats and mapped long (>1 kb) SMRT reads against the unit-genome assembly to identify alternative sequence arrangements. Although M. emarginata lacks the canonical IR, we found that large repeats (>1 kilobase; kb) represent ∼22% of the plastome nucleotide content. Among the largest repeats (>2 kb), we identified GC-biased gene conversion and mapping filtered, long SMRT reads to the M. emarginata unit-genome assembly revealed alternative, substoichiometric sequence arrangements. We offer a model based on RDR and gene conversion between long repeated sequences in the M. emarginata plastome and provide support that both intra-and intermolecular recombination between large repeats, particularly in repeat-rich plastomes, varies unit-genome structure while homogenizing the nucleotide sequence of repeats. © 2017 Botanical Society of America.
Interstitial telomere-like repeats in the Arabidopsis thaliana genome.

Science.gov (United States)

Uchida, Wakana; Matsunaga, Sachihiro; Sugiyama, Ryuji; Kawano, Shigeyuki

2002-02-01

Eukaryotic chromosomal ends are protected by telomeres, which are thought to play an important role in ensuring the complete replication of chromosomes. On the other hand, non-functional telomere-like repeats in the interchromosomal regions (interstitial telomeric repeats; ITRs) have been reported in several eukaryotes. In this study, we identified eight ITRs in the Arabidopsis thaliana genome, each consisting of complete and degenerate 300- to 1200-bp sequences. The ITRs were grouped into three classes (class IA-B, class II, and class IIIA-E) based on the degeneracy of the telomeric repeats in ITRs. The telomeric repeats of the two ITRs in class I were conserved for the most part, whereas the single ITR in class II, and the five ITRs in class III were relatively degenerated. In addition, degenerate ITRs were surrounded by common sequences that shared 70-100% homology to each other; these are named ITR-adjacent sequences (IAS). Although the genomic regions around ITRs in class I lacked IAS, those around ITRs in class II contained IAS (IASa), and those around five ITRs in class III had nine types of IAS (IASb, c, d, e, f, g, h, i, and j). Ten IAS types in classes II and III showed no significant homology to each other. The chromosomal locations of ITRs and IAS were not category-related, but most of them were adjacent to, or part of, a centromere. These results show that the A. thaliana genome has undergone chromosomal rearrangements, such as end-fusions and segmental duplications.
Genome-wide analysis of tandem repeats in plants and green algae

Science.gov (United States)

Zhixin Zhao; Cheng Guo; Sreeskandarajan Sutharzan; Pei Li; Craig Echt; Jie Zhang; Chun Liang

2014-01-01

Tandem repeats (TRs) extensively exist in the genomes of prokaryotes and eukaryotes. Based on the sequenced genomes and gene annotations of 31 plant and algal species in Phytozome version 8.0 (http://www.phytozome.net/), we examined TRs in a genome-wide scale, characterized their distributions and motif features, and explored their putative biological functions. Among...
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.

2010-07-12

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
Read length and repeat resolution: Exploring prokaryote genomes using next-generation sequencing technologies

KAUST Repository

Cahill, Matt J.; Köser, Claudio U.; Ross, Nicholas E.; Archer, John A.C.

2010-01-01

Background: There are a growing number of next-generation sequencing technologies. At present, the most cost-effective options also produce the shortest reads. However, even for prokaryotes, there is uncertainty concerning the utility of these technologies for the de novo assembly of complete genomes. This reflects an expectation that short reads will be unable to resolve small, but presumably abundant, repeats. Methodology/Principal Findings: Using a simple model of repeat assembly, we develop and test a technique that, for any read length, can estimate the occurrence of unresolvable repeats in a genome, and thus predict the number of gaps that would need to be closed to produce a complete sequence. We apply this technique to 818 prokaryote genome sequences. This provides a quantitative assessment of the relative performance of various lengths. Notably, unpaired reads of only 150nt can reconstruct approximately 50% of the analysed genomes with fewer than 96 repeat-induced gaps. Nonetheless, there is considerable variation amongst prokaryotes. Some genomes can be assembled to near contiguity using very short reads while others require much longer reads. Conclusions: Given the diversity of prokaryote genomes, a sequencing strategy should be tailored to the organism under study. Our results will provide researchers with a practical resource to guide the selection of the appropriate read length. 2010 Cahill et al.
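The relationship between read length and repeat resolution described above can be illustrated with a rough proxy. The sketch below is not the model used by Cahill et al.; it assumes, for illustration only, that an exact repeat of length read_len - 1 or longer cannot be bridged by an unpaired read, and it counts the distinct (read_len - 1)-mers that occur more than once on the forward strand. The function name `repeated_kmers` is hypothetical.

```python
from collections import Counter


def repeated_kmers(genome, read_len):
    """Count distinct k-mers (k = read_len - 1) that occur more than once.

    Assumption for illustration: a read must contain the whole repeat plus
    flanking sequence to resolve it, so repeats of length >= read_len - 1
    are treated as candidates for unresolvable repeats. Only the forward
    strand is scanned.
    """
    k = read_len - 1
    if k < 1 or k > len(genome):
        return 0
    genome = genome.upper()
    counts = Counter(genome[i:i + k] for i in range(len(genome) - k + 1))
    return sum(1 for c in counts.values() if c > 1)


# Toy sequence containing the exact repeat "ACGT" twice.
toy = "AAACGTCCCACGTGGG"
for read_len in (4, 5, 6):
    print(read_len, repeated_kmers(toy, read_len))
```

On the toy sequence the count falls to zero once the read length exceeds the repeat length by two, which mirrors the qualitative point of the abstract: longer reads leave fewer repeat-induced gaps.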
The primary structures of two yeast enolase genes. Homology between the 5' noncoding flanking regions of yeast enolase and glyceraldehyde-3-phosphate dehydrogenase genes.

Science.gov (United States)

Holland, M J; Holland, J P; Thill, G P; Jackson, K A

1981-02-10

Segments of yeast genomic DNA containing two enolase structural genes have been isolated by subculture cloning procedures using a cDNA hybridization probe synthesized from purified yeast enolase mRNA. Based on restriction endonuclease and transcriptional maps of these two segments of yeast DNA, each hybrid plasmid contains a region of extensive nucleotide sequence homology which forms hybrids with the cDNA probe. The DNA sequences which flank this homologous region in the two hybrid plasmids are nonhomologous, indicating that these sequences are nontandemly repeated in the yeast genome. The complete nucleotide sequence of the coding as well as the flanking noncoding regions of these genes has been determined. The amino acid sequence predicted from one reading frame of both structural genes is extremely similar to that determined for yeast enolase (Chin, C. C. Q., Brewer, J. M., Eckard, E., and Wold, F. (1981) J. Biol. Chem. 256, 1370-1376), confirming that these isolated structural genes encode yeast enolase. The nucleotide sequences of the coding regions of the genes are approximately 95% homologous, and neither gene contains an intervening sequence. Codon utilization in the enolase genes follows the same biased pattern previously described for two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes (Holland, J. P., and Holland, M. J. (1980) J. Biol. Chem. 255, 2596-2605). DNA blotting analysis confirmed that the isolated segments of yeast DNA are colinear with yeast genomic DNA and that there are two nontandemly repeated enolase genes per haploid yeast genome. The noncoding portions of the two enolase genes adjacent to the initiation and termination codons are approximately 70% homologous and contain sequences thought to be involved in the synthesis and processing of messenger RNA. Finally there are regions of extensive homology between the two enolase structural genes and two yeast glyceraldehyde-3-phosphate dehydrogenase structural genes within the 5
Alu repeats as markers for forensic DNA analyses

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Kass, D.H. [Louisiana State Univ., New Orleans, LA (United States)] [and others

1994-01-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 98.9% nucleotide identity with the HS subfamily consensus sequence, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5' and 3' unique flanking DNA sequences from each HS Alu that allow the locus to be assayed for the presence or absence of the Alu repeat. The dimorphic HS Alu sequences probably inserted in the human genome after the radiation of modern humans (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project. HS Alu family member insertions differ from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) in that polymorphisms due to Alu insertions arise as a result of a unique event which has occurred only one time in the human population and spread through the population from that point. Therefore, individuals that share HS Alu repeats inherited these elements from a common ancestor. Most VNTR and RFLP polymorphisms may arise multiple times in parallel within a population.
Evaluation of Mammalian Interspersed Repeats to investigate the goat genome

Directory of Open Access Journals (Sweden)

P. Mariani

2010-01-01

Full Text Available Among the repeated sequences present in most eukaryotic genomes, SINEs (Short Interspersed Nuclear Elements) are widely used to investigate evolution in the mammalian order (Buchanan et al., 1999). One family of these repetitive sequences, the MIR (Mammalian Interspersed Repeats; Jurka et al., 1995), is ubiquitous in all mammals. MIR elements are tRNA-derived SINEs and are identifiable by a conserved core region of about 70 nucleotides.
The Ecological Genomics of Fungi: Repeated Elements in Filamentous Fungi with a Focus on Wood-Decay Fungi

Energy Technology Data Exchange (ETDEWEB)

Murat, Claude [INRA, Nancy, France; Payen, Thibaut [INRA, Nancy, France; Petitpierre, Denis [INRA, Nancy, France; Labbe, Jessy L [ORNL

2013-01-01

In the last decade, the genomes of several dozen filamentous fungi have been sequenced. Interestingly, vast diversity in genome size was observed (Fig. 2.1) with 14-fold differences between the 9 Mb of the human pathogenic dandruff fungus (Malassezia globosa; Xu, Saunders, et al., 2007) and the 125 Mb of the ectomycorrhizal black truffle of Périgord (Tuber melanosporum; Martin, Kohler, et al., 2010). Recently, Raffaele and Kamoun (2012) highlighted that the genomes of several lineages of filamentous plant pathogens have been shaped by repeat-driven expansion. Indeed, repeated elements are ubiquitous in all prokaryote and eukaryote genomes; however, their frequencies can vary from just a minor percentage of the genome to more than 60 percent of the genome. Repeated elements can be classified in two major types: satellite DNA and transposable elements. In this chapter, the different types of repeated elements and how these elements can impact genome and gene repertoire will be described. Also, an intriguing link between the richness and diversity of transposable elements and the ecological niche will be highlighted.
Distribution and evolution of repeated sequences in genomes of Triatominae (Hemiptera-Reduviidae) inferred from genomic in situ hybridization.

Directory of Open Access Journals (Sweden)

Sebastian Pita

Full Text Available The subfamily Triatominae, vectors of Chagas disease, comprises 140 species characterized by a highly homogeneous chromosome number. We analyzed the chromosomal distribution and evolution of repeated sequences in Triatominae genomes by Genomic in situ Hybridization (GISH) using Triatoma delpontei and Triatoma infestans genomic DNAs as probes. Hybridizations were performed on their own chromosomes and on nine species included in six genera from the two main tribes: Triatomini and Rhodniini. Genomic probes clearly generate two different hybridization patterns, dispersed or accumulated in specific regions or chromosomes. The three used probes generate the same hybridization pattern in each species. However, these patterns are species-specific. In closely related species, the probes strongly hybridized in the autosomal heterochromatic regions, resembling C-banding and DAPI patterns. However, in more distant species these co-localizations are not observed. The heterochromatic Y chromosome is composed of highly repeated sequences, a feature conserved among 10 species of the Triatomini tribe, suggesting that it is an ancestral character for this group. However, the Y chromosome in the Rhodniini tribe is markedly different, supporting the early evolutionary dichotomy between both tribes. In some species, sex chromosomes and autosomes shared repeated sequences, suggesting meiotic chromatin exchanges among these heterologous chromosomes. Our GISH analyses enabled us to acquire not only reliable information about the distribution of autosomal repeated sequences but also an insight into sex chromosome evolution in Triatominae. Furthermore, the differentiation obtained by GISH might be a valuable marker to establish phylogenetic relationships and to test the controversial origin of the Triatominae subfamily.
Evolutionary force of AT-rich repeats to trap genomic and episomal DNAs into the rice genome: lessons from endogenous pararetrovirus.

Science.gov (United States)

Liu, Ruifang; Koyanagi, Kanako O; Chen, Sunlu; Kishima, Yuji

2012-12-01

In plant genomes, the incorporation of DNA segments is not a common method of artificial gene transfer. Nevertheless, various segments of pararetroviruses have been found in plant genomes in recent decades. The rice genome contains a number of segments of endogenous rice tungro bacilliform virus-like sequences (ERTBVs), many of which are present between AT dinucleotide repeats (ATrs). Comparison of genomic sequences between two closely related rice subspecies, japonica and indica, allowed us to verify the preferential insertion of ERTBVs into ATrs. In addition to ERTBVs, the comparative analyses showed that ATrs occasionally incorporate repeat sequences including transposable elements, and a wide range of other sequences. Besides the known genomic sequences, the insertion sequences also represented DNAs of unclear origins together with ERTBVs, suggesting that ATrs have integrated episomal DNAs that would have been suspended in the nucleus. Such insertion DNAs might be trapped by ATrs in the genome in a host-dependent manner. Conversely, other simple mono- and dinucleotide sequence repeats (SSR) were less frequently involved in insertion events relative to ATrs. Therefore, ATrs could be regarded as hot spots of double-strand breaks that induce non-homologous end joining. The insertions within ATrs occasionally generated new gene-related sequences or involved structural modifications of existing genes. Likewise, in a comparison between Arabidopsis thaliana and Arabidopsis lyrata, the insertions preferred ATrs to other SSRs. Therefore ATrs in plant genomes could be considered as genomic dumping sites that have trapped various DNA molecules and may have exerted a powerful evolutionary force. © 2012 The Authors. The Plant Journal © 2012 Blackwell Publishing Ltd.
PSSRdb: a relational database of polymorphic simple sequence repeats extracted from prokaryotic genomes.

Science.gov (United States)

Kumar, Pankaj; Chaitanya, Pasumarthy S; Nagarajaram, Hampapathalu A

2011-01-01

PSSRdb (Polymorphic Simple Sequence Repeats database) (http://www.cdfd.org.in/PSSRdb/) is a relational database of polymorphic simple sequence repeats (PSSRs) extracted from 85 different species of prokaryotes. Simple sequence repeats (SSRs) are tandem repeats of nucleotide motifs 1-6 bp in size and are highly polymorphic. SSR mutations in and around coding regions affect transcription and translation of genes. Such changes underpin the phase variation and antigenic variation seen in some bacteria. Although SSR-mediated phase variation and antigenic variation have been well studied in some bacteria, many other prokaryotic species have yet to be investigated for SSR-mediated adaptive and other evolutionary advantages. As part of our ongoing studies on SSR polymorphism in prokaryotes, we compared the genome sequences of various strains and isolates available for 85 different species of prokaryotes, extracted a number of SSRs showing length variations, and created a relational database called PSSRdb. This database gives useful information such as the location of PSSRs in genomes, length variation across genomes, the regions harboring PSSRs, etc. The information provided in this database is very useful for further research and analysis of SSRs in prokaryotes.
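The SSR definition given above (tandem repeats of 1-6 bp motifs) can be sketched in a few lines. This is a minimal illustration, not the extraction pipeline behind PSSRdb; the function name `find_ssrs` and the default threshold of three repeat units are assumptions chosen for the example, and only perfect repeats over the letters A, C, G and T are reported.

```python
import re


def find_ssrs(seq, min_units=3, max_motif=6):
    """Return (start, motif, units) for perfect tandem repeats.

    Assumptions for illustration: motifs are 1..max_motif bp, repeats are
    exact (no interruptions), and a run needs at least min_units copies.
    """
    seq = seq.upper()
    # Lazy group picks the shortest motif; \1{n,} demands n further copies.
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_units - 1)
    )
    hits = []
    for match in pattern.finditer(seq):
        motif = match.group(1)
        units = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, units))
    return hits


print(find_ssrs("GGCACACACACATT"))  # one (CA)5 run starting at index 2
```

Comparing the unit counts returned for the same locus in different strains would be one way to flag a length-polymorphic SSR, which is the kind of variation the database catalogues.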
Tandem repeat regions within the Burkholderia pseudomallei genome and their application for high resolution genotyping

Directory of Open Access Journals (Sweden)

Harvey Steven P

2007-03-01

Full Text Available Background: The facultative, intracellular bacterium Burkholderia pseudomallei is the causative agent of melioidosis, a serious infectious disease of humans and animals. We identified and categorized tandem repeat arrays and their distribution throughout the genome of B. pseudomallei strain K96243 in order to develop a genetic typing method for B. pseudomallei. We then screened 104 of the potentially polymorphic loci across a diverse panel of 31 isolates including B. pseudomallei, B. mallei and B. thailandensis in order to identify loci with varying degrees of polymorphism. A subset of these tandem repeat arrays were subsequently developed into a multiple-locus VNTR analysis to examine 66 B. pseudomallei and 21 B. mallei isolates from around the world, as well as 95 lineages from a serial transfer experiment encompassing ~18,000 generations. Results: B. pseudomallei contains a preponderance of tandem repeat loci throughout its genome, many of which are duplicated elsewhere in the genome. The majority of these loci are composed of repeat motif lengths of 6 to 9 bp with 4 to 10 repeat units and are predominantly located in intergenic regions of the genome. Across geographically diverse B. pseudomallei and B. mallei isolates, the 32 VNTR loci displayed between 7 and 28 alleles, with Nei's diversity values ranging from 0.47 to 0.94. Mutation rates for these loci are comparable (>10^-5 per locus per generation) to that of the most diverse tandemly repeated regions found in other less diverse bacteria. Conclusion: The frequency, location and duplicate nature of tandemly repeated regions within the B. pseudomallei genome indicate that these tandem repeat regions may play a role in generating and maintaining adaptive genomic variation. Multiple-locus VNTR analysis revealed extensive diversity within the global isolate set containing B. pseudomallei and B. mallei, and it detected genotypic differences within clonal lineages of both species that were
Human Xq28 Inversion Polymorphism: From Sex Linkage to Genomics--A Genetic Mother Lode

Science.gov (United States)

Kirby, Cait S.; Kolber, Natalie; Salih Almohaidi, Asmaa M.; Bierwert, Lou Ann; Saunders, Lori; Williams, Steven; Merritt, Robert

2016-01-01

An inversion polymorphism of the filamin and emerin genes at the tip of the long arm of the human X-chromosome serves as the basis of an investigative laboratory in which students learn something new about their own genomes. Long, nearly identical inverted repeats flanking the filamin and emerin genes illustrate how repetitive elements can lead to…
Genome-wide tracking of unmethylated DNA Alu repeats in normal and cancer cells

DEFF Research Database (Denmark)

Rodriguez, Jairo; Vives, Laura; Jordà, Mireia

2008-01-01

Methylation of cytosine is the most frequent epigenetic modification of DNA in mammalian cells. In humans, most of the methylated cytosines are found in CpG-rich sequences within tandem and interspersed repeats that make up to 45% of the human genome, with Alu repeats being the most common family....
Organization and Evolution of Subtelomeric Satellite Repeats in the Potato Genome

Czech Academy of Sciences Publication Activity Database

Torres, A.T.; Gong, Z.; Iovene, M.; Hirsch, C.D.; Buell, C.R.; Bryan, G.J.; Novák, Petr; Macas, Jiří; Jiang, J.

2011-01-01

Vol. 1, July 2011 (2011), pp. 85-92 ISSN 2160-1836 R&D Projects: GA MŠk(CZ) LH11058 Institutional research plan: CEZ:AV0Z50510513 Keywords: Satellite sequences * Potato genome * Repeats Subject RIV: EB - Genetics; Molecular Biology
The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats.

Directory of Open Access Journals (Sweden)

Andrew J Alverson

2011-01-01

Full Text Available The mitochondrial genomes of seed plants are exceptionally fluid in size, structure, and sequence content, with the accumulation and activity of repetitive sequences underlying much of this variation. We report the first fully sequenced mitochondrial genome of a legume, Vigna radiata (mung bean), and show that despite its unexceptional size (401,262 nt), the genome is unusually depauperate in repetitive DNA and "promiscuous" sequences from the chloroplast and nuclear genomes. Although Vigna lacks the large, recombinationally active repeats typical of most other seed plants, a PCR survey of its modest repertoire of short (38-297 nt) repeats nevertheless revealed evidence for recombination across all of them. A set of novel control assays showed, however, that these results could instead reflect, in part or entirely, artifacts of PCR-mediated recombination. Consequently, we recommend that other methods, especially high-depth genome sequencing, be used instead of PCR to infer patterns of plant mitochondrial recombination. The average-sized but repeat- and feature-poor mitochondrial genome of Vigna makes it ever more difficult to generalize about the factors shaping the size and sequence content of plant mitochondrial genomes.

RECG maintains plastid and mitochondrial genome stability by suppressing extensive recombination between short dispersed repeats.

Directory of Open Access Journals (Sweden)

Masaki Odahara

2015-03-01

Full Text Available Maintenance of plastid and mitochondrial genome stability is crucial for photosynthesis and respiration, respectively. Recently, we have reported that RECA1 maintains mitochondrial genome stability by suppressing gross rearrangements induced by aberrant recombination between short dispersed repeats in the moss Physcomitrella patens. In this study, we studied a newly identified P. patens homolog of bacterial RecG helicase, RECG, some of which is localized in both plastid and mitochondrial nucleoids. RECG partially complements recG deficiency in Escherichia coli cells. A knockout (KO) mutation of RECG caused characteristic phenotypes including growth delay and developmental and mitochondrial defects, which are similar to those of the RECA1 KO mutant. The RECG KO cells showed heterogeneity in these phenotypes. Analyses of RECG KO plants showed that the mitochondrial genome was destabilized due to recombination between 8-79 bp repeats, and the pattern of recombination partly differed from that observed in the RECA1 KO mutants. The mitochondrial DNA (mtDNA) instability was greater in severe phenotypic RECG KO cells than in mild phenotypic ones. This result suggests that mitochondrial genomic instability is responsible for the defective phenotypes of RECG KO plants. Some of the induced recombination caused efficient genomic rearrangements in RECG KO mitochondria. Such loci were sometimes associated with a decrease in the levels of normal mtDNA and a significant decrease in the number of transcripts derived from the loci. In addition, the RECG KO mutation caused remarkable plastid abnormalities and induced recombination between short repeats (12-63 bp) in the plastid DNA. These results suggest that RECG plays a role in the maintenance of both plastid and mitochondrial genome stability by suppressing aberrant recombination between dispersed short repeats; this role is crucial for plastid and mitochondrial functions.
Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey

Directory of Open Access Journals (Sweden)

Varala Kranthi

2007-05-01

Full Text Available Background: Extensive computational and database tools are available to mine genomic and genetic databases for model organisms, but little genomic data is available for many species of ecological or agricultural significance, especially those with large genomes. Genome surveys using conventional sequencing techniques are powerful, particularly for detecting sequences present in many copies per genome. However, these methods are time-consuming and have potential drawbacks. High throughput 454 sequencing provides an alternative method by which much information can be gained quickly and cheaply from high-coverage surveys of genomic DNA. Results: We sequenced 78 million base-pairs of randomly sheared soybean DNA which passed our quality criteria. Computational analysis of the survey sequences provided global information on the abundant repetitive sequences in soybean. The sequence was used to determine the copy number across regions of large genomic clones or contigs and discover higher-order structures within satellite repeats. We have created an annotated, online database of sequences present in multiple copies in the soybean genome. The low bias of pyrosequencing against repeat sequences is demonstrated by the overall composition of the survey data, which matches well with past estimates of repetitive DNA content obtained by DNA re-association kinetics (Cot analysis). Conclusion: This approach provides a potential aid to conventional or shotgun genome assembly, by allowing rapid assessment of copy number in any clone or clone-end sequence. In addition, we show that partial sequencing can provide access to partial protein-coding sequences.
Isolation and characterization of repeat elements of the oak genome and their application in population analysis

International Nuclear Information System (INIS)

Fluch, S.; Burg, K.

1998-01-01

Four minisatellite sequence elements have been identified and isolated from the genome of the oak species Quercus petraea and Quercus robur. Minisatellites 1 and 2 are putative members of repeat families, while minisatellites 3 and 4 show repeat length variation among individuals of test populations. A 590 base pair (bp) long element has also been identified which reveals individual-specific autoradiographic patterns when used as probe in Southern hybridisations of genomic oak DNA. (author)
Two new miniature inverted-repeat transposable elements in the genome of the clam Donax trunculus.

Science.gov (United States)

Šatović, Eva; Plohl, Miroslav

2017-10-01

Repetitive sequences are important components of eukaryotic genomes that drive their evolution. Among them are different types of mobile elements that share the ability to spread throughout the genome and form interspersed repeats. To broaden the generally scarce knowledge on bivalves at the genome level, in the clam Donax trunculus we described two new non-autonomous DNA transposons, miniature inverted-repeat transposable elements (MITEs), named DTC M1 and DTC M2. Like other MITEs, they are characterized by their small size, their A + T richness, and the presence of terminal inverted repeats (TIRs). DTC M1 and DTC M2 are 261 and 286 bp long, respectively, and in addition to TIRs, both of them contain a long imperfect palindrome sequence in their central parts. These elements are present in complete and truncated versions within the genome of the clam D. trunculus. The two new MITEs share only structural similarity, but lack any nucleotide sequence similarity to each other. In a search for related elements in databases, blast search revealed within the Crassostrea gigas genome a larger element sharing sequence similarity only to DTC M1 in its TIR sequences. The lack of sequence similarity with any previously published mobile elements indicates that DTC M1 and DTC M2 elements may be unique to D. trunculus.
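Terminal inverted repeats, one of the MITE hallmarks mentioned above, can be checked computationally: the 5' end of the element should match the reverse complement of its 3' end. The sketch below is only an illustration under simple assumptions (exact matching, sequences containing only A, C, G and T); the helper names `reverse_complement` and `tir_length` are hypothetical and do not come from the study.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string containing only A, C, G, T."""
    comp = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(comp[base] for base in reversed(seq.upper()))


def tir_length(element):
    """Length of the perfect terminal inverted repeat of an element.

    Compares the element with its own reverse complement from the 5' end
    inward and stops at the first mismatch or at the midpoint.
    """
    element = element.upper()
    rc = reverse_complement(element)
    n = 0
    while n < len(element) // 2 and element[n] == rc[n]:
        n += 1
    return n


# Toy element: 6 bp TIRs (GGCCAT ... ATGGCC) around a 5 bp core.
print(tir_length("GGCCAT" + "TTTTT" + "ATGGCC"))
```

Real TIRs are often imperfect, so a practical search would tolerate a few mismatches; the exact-match rule here is kept only to make the idea easy to follow.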
Hyb-Seq: Combining Target Enrichment and Genome Skimming for Plant Phylogenomics

Directory of Open Access Journals (Sweden)

Kevin Weitemier

2014-08-01

Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics.
The candidate phylum Poribacteria by single-cell genomics: new insights into phylogeny, cell-compartmentation, eukaryote-like repeat proteins, and other genomic features.

Directory of Open Access Journals (Sweden)

Janine Kamke

The candidate phylum Poribacteria is one of the most dominant and widespread members of the microbial communities residing within marine sponges. Cell compartmentalization had been postulated along with their discovery about a decade ago and their phylogenetic association to the Planctomycetes, Verrucomicrobia, Chlamydiae superphylum was proposed soon thereafter. In the present study we revised these features based on genomic data obtained from six poribacterial single cells. We propose that Poribacteria form a distinct monophyletic phylum contiguous to the PVC superphylum together with other candidate phyla. Our genomic analyses supported the possibility of cell compartmentalization in the form of bacterial microcompartments. Further analyses of eukaryote-like protein domains stressed the importance of such proteins with features including tetratricopeptide repeats, leucine-rich repeats as well as low-density lipoprotein receptor repeats, the latter of which are reported here for the first time from a sponge symbiont. Finally, examining the most abundant protein domain family on poribacterial genomes revealed diverse phyH family proteins, some of which may be related to dissolved organic phosphorus uptake.
Efficient engineering of a bacteriophage genome using the type I-E CRISPR-Cas system.

Science.gov (United States)

Kiro, Ruth; Shitrit, Dror; Qimron, Udi

2014-01-01

The clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) system has recently been used to engineer genomes of various organisms, but surprisingly, not those of bacteriophages (phages). Here we present a method to genetically engineer the Escherichia coli phage T7 using the type I-E CRISPR-Cas system. The T7 phage genome is edited by homologous recombination with a DNA sequence flanked by sequences homologous to the desired location. Non-edited genomes are targeted by the CRISPR-Cas system, thus enabling isolation of the desired recombinant phages. This method broadens CRISPR-Cas-based editing to phages and uses a CRISPR-Cas type other than type II. The method may be adjusted to genetically engineer any bacteriophage genome.
A Boy with an LCR3/4-Flanked 10q22.3q23.2 Microdeletion and Uncommon Phenotypic Features

Science.gov (United States)

Petrova, E.; Neuner, C.; Haaf, T.; Schmid, M.; Wirbelauer, J.; Jurkutat, A.; Wermke, K.; Nanda, I.; Kunstmann, E.

2014-01-01

The recurrent 10q22.3q23.2 deletion with breakpoints within low copy repeats 3 and 4 is a rare genomic disorder, reported in only 13 patients to date. The phenotype is rather uncharacteristic, which makes a clinical diagnosis difficult. A phenotypic feature described in almost all patients is a delay in speech development, albeit systematic studies are still pending. In this study, we report on a boy with an LCR3/4-flanked 10q22.3q23.2 deletion exhibiting an age-appropriate language development evaluated by a standardized test at an age of 2 years and 3 months. The boy was born with a cleft palate – a feature not present in any of the patients described before. Previously reported cases are reviewed, and the role of the BMPR1A gene is discussed. The phenotype of patients with an LCR3/4-flanked 10q22.3q23.2 deletion can be rather variable, so counseling the families regarding the prognosis of an affected child should be done with caution. Long-term studies of affected children are needed to delineate the natural history of this rare disorder. PMID:24550761
Local chromatin structure of heterochromatin regulates repeated DNA stability, nucleolus structure, and genome integrity

Energy Technology Data Exchange (ETDEWEB)

Peng, Jamy C. [Univ. of California, Berkeley, CA (United States)

2007-01-01

Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) that binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those in
Alu repeats as markers for human population genetics

Energy Technology Data Exchange (ETDEWEB)

Batzer, M.A.; Alegria-Hartman, M. [Lawrence Livermore National Lab., CA (United States); Bazan, H. [Louisiana State Univ., New Orleans, LA (United States). Medical Center] [and others

1993-09-01

The Human-Specific (HS) subfamily of Alu sequences is comprised of a group of 500 nearly identical members which are almost exclusively restricted to the human genome. Individual subfamily members share an average of 97.9% nucleotide identity with each other and an average of 98.9% nucleotide identity with the HS subfamily consensus sequence. HS Alu family members are thought to be derived from a single source "master" gene, and have an average age of 2.8 million years. We have developed a Polymerase Chain Reaction (PCR) based assay using primers complementary to the 5′ and 3′ unique flanking DNA sequences from each HS Alu that allows the locus to be assayed for the presence or absence of an Alu repeat. Individual HS Alu sequences were found to be either monomorphic or dimorphic for the presence or absence of each repeat. The monomorphic HS Alu family members inserted in the human genome after the human/great ape divergence (which is thought to have occurred 4-6 million years ago), but before the radiation of modern man. The dimorphic HS Alu sequences inserted in the human genome after the radiation of modern man (within the last 200,000-one million years) and represent a unique source of information for human population genetics and forensic DNA analyses. These sites can be developed into Dimorphic Alu Sequence Tagged Sites (DASTS) for the Human Genome Project as well. HS Alu family member insertion dimorphism differs from other types of polymorphism (e.g. Variable Number of Tandem Repeat [VNTR] or Restriction Fragment Length Polymorphism [RFLP]) because individuals share HS Alu family member insertions based upon identity by descent from a common ancestor as a result of a single event which occurred one time within the human population. The VNTR and RFLP polymorphisms may arise multiple times within a population and are identical by state only.
Survey and analysis of simple sequence repeats in the Laccaria bicolor genome, with development of microsatellite markers

Energy Technology Data Exchange (ETDEWEB)

Labbe, Jessy L [ORNL; Murat, Claude [INRA, Nancy, France; Morin, Emmanuelle [INRA, Nancy, France; Le Tacon, F [UMR, France; Martin, Francis [INRA, Nancy, France

2011-01-01

It is becoming clear that simple sequence repeats (SSRs) play a significant role in fungal genome organization, and they are a large source of genetic markers for population genetics and meiotic maps. We identified SSRs in the Laccaria bicolor genome by in silico survey and analyzed their distribution in the different genomic regions. We also compared the abundance and distribution of SSRs in L. bicolor with those of the following fungal genomes: Phanerochaete chrysosporium, Coprinopsis cinerea, Ustilago maydis, Cryptococcus neoformans, Aspergillus nidulans, Magnaporthe grisea, Neurospora crassa and Saccharomyces cerevisiae. Using the MISA computer program, we detected 277,062 SSRs in the L. bicolor genome representing 8% of the assembled genomic sequence. Among the analyzed basidiomycetes, L. bicolor exhibited the highest SSR density although no correlation between relative abundance and the genome sizes was observed. In most genomes the short motifs (mono- to trinucleotides) were more abundant than the longer repeated SSRs. Generally, in each organism, the occurrence, relative abundance, and relative density of SSRs decreased as the repeat unit increased. Furthermore, each organism had its own common and longest SSRs. In the L. bicolor genome, most of the SSRs were located in intergenic regions (73.3%) and the highest SSR density was observed in transposable elements (TEs; 6,706 SSRs/Mb). However, 81% of the protein-coding genes contained SSRs in their exons, suggesting that SSR polymorphism may alter gene phenotypes. Within a L. bicolor offspring, sequence polymorphism of 78 SSRs was mainly detected in non-TE intergenic regions. Unlike previously developed microsatellite markers, these new ones are spread throughout the genome; these markers could have immediate applications in population genetics.
Hyb-Seq: Combining target enrichment and genome skimming for plant phylogenomics

Science.gov (United States)

Weitemier, Kevin; Straub, Shannon C. K.; Cronn, Richard C.; Fishbein, Mark; Schmickl, Roswitha; McDonnell, Angela; Liston, Aaron

2014-01-01

• Premise of the study: Hyb-Seq, the combination of target enrichment and genome skimming, allows simultaneous data collection for low-copy nuclear genes and high-copy genomic targets for plant systematics and evolution studies. • Methods and Results: Genome and transcriptome assemblies for milkweed (Asclepias syriaca) were used to design enrichment probes for 3385 exons from 768 genes (>1.6 Mbp) followed by Illumina sequencing of enriched libraries. Hyb-Seq of 12 individuals (10 Asclepias species and two related genera) resulted in at least partial assembly of 92.6% of exons and 99.7% of genes and an average assembly length >2 Mbp. Importantly, complete plastomes and nuclear ribosomal DNA cistrons were assembled using off-target reads. Phylogenomic analyses demonstrated signal conflict between genomes. • Conclusions: The Hyb-Seq approach enables targeted sequencing of thousands of low-copy nuclear exons and flanking regions, as well as genome skimming of high-copy repeats and organellar genomes, to efficiently produce genome-scale data sets for phylogenomics. PMID:25225629
Large scale analysis of small repeats via mining of the human genome

NARCIS (Netherlands)

van den Berg, I.; Bosnacki, D.; Hilbers, P.A.J.

2009-01-01

Small repetitive sequences, called tandem repeats, are abundant throughout the human genome, both in coding and in non-coding regions. Their role is still mostly unknown, but at least 20 of those repetitive sequences have been related to neurodegenerative disorders. The mutational process that is
Mutations in Cytosine-5 tRNA Methyltransferases Impact Mobile Element Expression and Genome Stability at Specific DNA Repeats

Directory of Open Access Journals (Sweden)

Bianca Genenncher

2018-02-01

The maintenance of eukaryotic genome stability is ensured by the interplay of transcriptional as well as post-transcriptional mechanisms that control recombination of repeat regions and the expression and mobility of transposable elements. We report here that mutations in two (cytosine-5) RNA methyltransferases, Dnmt2 and NSun2, impact the accumulation of mobile element-derived sequences and DNA repeat integrity in Drosophila. Loss of Dnmt2 function caused moderate effects under standard conditions, while heat shock exacerbated these effects. In contrast, NSun2 function affected mobile element expression and genome integrity in a heat shock-independent fashion. Reduced tRNA stability in both RCMT mutants indicated that tRNA-dependent processes affected mobile element expression and DNA repeat stability. Importantly, further experiments indicated that complex formation with RNA could also contribute to the impact of RCMT function on gene expression control. These results thus uncover a link between tRNA modification enzymes, the expression of repeat DNA, and genomic integrity.
Genome-Wide Characterization of Simple Sequence Repeat (SSR) Loci in Chinese Jujube and Jujube SSR Primer Transferability

Science.gov (United States)

Xiao, Jing; Zhao, Jin; Liu, Mengjun; Liu, Ping; Dai, Li; Zhao, Zhihui

2015-01-01

Chinese jujube (Ziziphus jujuba), an economically important species in the Rhamnaceae family, is a popular fruit tree in Asia. Here, we surveyed and characterized simple sequence repeats (SSRs) in the jujube genome. A total of 436,676 SSR loci were identified, with an average distance of 0.93 Kb between the loci. A large proportion of the SSRs included mononucleotide, dinucleotide and trinucleotide repeat motifs, which accounted for 64.87%, 24.40%, and 8.74% of all repeats, respectively. Among the mononucleotide repeats, A/T was the most common, whereas AT/TA was the most common dinucleotide repeat. A total of 30,565 primer pairs were successfully designed and screened using a series of criteria. Moreover, 725 of 1,000 randomly selected primer pairs were effective among 6 cultivars, and 511 of these primer pairs were polymorphic. Sequencing the amplicons of two SSRs across three jujube cultivars revealed variations in the repeats. Transferability tests of the jujube SSR primers showed that 35 of 64 SSRs could be transferred across family boundaries. Using jujube SSR primers, clustering analysis results from 15 species were highly consistent with the Angiosperm Phylogeny Group (APGIII) System. The genome-wide characterization of SSRs in Chinese jujube is very valuable for whole-genome characterization and marker-assisted selection in jujube breeding. In addition, the transferability of jujube SSR primers could provide a solid foundation for their further utilization. PMID:26000739
Identification and characterization of short tandem repeats in the Tibetan macaque genome based on resequencing data.

Science.gov (United States)

Liu, San-Xu; Hou, Wei; Zhang, Xue-Yan; Peng, Chang-Jun; Yue, Bi-Song; Fan, Zhen-Xin; Li, Jing

2018-07-18

The Tibetan macaque, which is endemic to China, is currently listed as a Near Endangered primate species by the International Union for Conservation of Nature (IUCN). Short tandem repeats (STRs) refer to repetitive elements of genome sequence that range in length from 1-6 bp. They are found in many organisms and are widely applied in population genetic studies. To clarify the distribution characteristics of genome-wide STRs and understand their variation among Tibetan macaques, we conducted a genome-wide survey of STRs with next-generation sequencing of five macaque samples. A total of 1 077 790 perfect STRs were mined from our assembly, with an N50 of 4 966 bp. Mono-nucleotide repeats were the most abundant, followed by tetra- and di-nucleotide repeats. Analysis of GC content and repeats showed consistent results with other macaques. Furthermore, using STR analysis software (lobSTR), we found that the proportion of base pair deletions in the STRs was greater than that of insertions in the five Tibetan macaque individuals (Pgenome showed good amplification efficiency and could be used to study population genetics in Tibetan macaques. The neighbor-joining tree classified the five macaques into two different branches according to their geographical origin, indicating high genetic differentiation between the Huangshan and Sichuan populations. We elucidated the distribution characteristics of STRs in the Tibetan macaque genome and provided an effective method for screening polymorphic STRs. Our results also lay a foundation for future genetic variation studies of macaques.
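To make the notion of a perfect STR (an uninterrupted run of a 1-6 bp motif) concrete, here is a minimal regular-expression sketch. It is not the pipeline used in the study; the function name, the `min_repeats` threshold and the example sequence are assumptions chosen for illustration.

```python
import re


def find_perfect_strs(seq, min_unit=1, max_unit=6, min_repeats=3):
    """Return (start, motif, copies) for each perfect tandem repeat found.

    The lazy quantifier makes the regex try the shortest motif first,
    so a run such as ACACACAC is reported with motif AC, not ACAC.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        copies = len(match.group(0)) // len(motif)
        hits.append((match.start(), motif, copies))
    return hits


# Hypothetical example: four copies of AC flanked by unrelated bases.
print(find_perfect_strs("GGACACACACTT"))
```

Dedicated tools apply motif-specific minimum copy numbers and handle imperfect repeats, which this sketch deliberately ignores.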
D20S16 is a complex interspersed repeated sequence: Genetic and physical analysis of the locus

Energy Technology Data Exchange (ETDEWEB)

Bowden, D.W.; Krawchuk, M.D.; Howard, T.D. [Wake Forest Univ., Winston-Salem, NC (United States)] [and others

1995-01-20

The genomic structure of the D20S16 locus has been evaluated using genetic and physical methods. D20S16, originally detected with the probe CRI-L1214, is a highly informative, complex restriction fragment length polymorphism consisting of two separate allelic systems. The allelic systems have the characteristics of conventional VNTR polymorphisms and are separated by recombination (θ = 0.02, Z_max = 74.82), as demonstrated in family studies. Most of these recombination events are meiotic crossovers and are maternal in origin, but two, including deletion of the locus in a cell line from a CEPH family member, occur without evidence for exchange of flanking markers. DNA sequence analysis suggests that the basis of the polymorphism is variable numbers of a 98-bp sequence tandemly repeated with 87 to 90% sequence similarity between repeats. The 98-bp repeat is a dimer of 49 bp sequence with 45 to 98% identity between the elements. In addition, nonpolymorphic genomic sequences adjacent to the polymorphic 98-bp repeat tracts are also repeated but are not polymorphic, i.e., show no individual to individual variation. Restriction enzyme mapping of cosmids containing the CRI-L1214 sequence suggests that there are multiple interspersed repeats of the CRI-L1214 sequence on chromosome 20. The results of dual-color fluorescence in situ hybridization experiments with interphase nuclei are also consistent with multiple repeats of an interspersed sequence on chromosome 20. 23 refs., 6 figs.
Genotyping and Molecular Identification of Date Palm Cultivars Using Inter-Simple Sequence Repeat (ISSR) Markers.

Science.gov (United States)

Ayesh, Basim M

2017-01-01

Molecular markers are credible for the discrimination of genotypes and estimation of the extent of genetic diversity and relatedness in a set of genotypes. Inter-simple sequence repeat (ISSR) markers rapidly reveal high polymorphic fingerprints and have been used frequently to determine the genetic diversity among date palm cultivars. This chapter describes the application of ISSR markers for genotyping of date palm cultivars. The application involves extraction of genomic DNA from the target cultivars with reliable quality and quantity. Subsequently the extracted DNA serves as a template for amplification of genomic regions flanked by inverted simple sequence repeats using a single primer. The similarity of each pair of samples is measured by calculating the number of mono- and polymorphic bands revealed by gel electrophoresis. Matrices constructed for similarity and genetic distance are used to build a phylogenetic tree and cluster analysis, to determine the molecular relatedness of cultivars. In the protocol described, 3 out of 9 tested primers consistently amplified 31 loci in 6 date palm cultivars, of which 28 loci were polymorphic.
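The pairwise similarity step described in this abstract can be sketched as follows. The abstract does not name the coefficient used, so the Jaccard coefficient here is an assumption, and the cultivar names and band scores are invented for illustration only.

```python
def jaccard_similarity(bands_a, bands_b):
    """Jaccard similarity of two presence/absence band profiles (1 = band)."""
    shared = sum(1 for a, b in zip(bands_a, bands_b) if a and b)
    present = sum(1 for a, b in zip(bands_a, bands_b) if a or b)
    return shared / present if present else 0.0


def similarity_matrix(profiles):
    """Map every (sample, sample) pair to its Jaccard similarity."""
    names = list(profiles)
    return {
        (x, y): jaccard_similarity(profiles[x], profiles[y])
        for x in names
        for y in names
    }


# Hypothetical band scores for three cultivars at four ISSR loci.
profiles = {
    "cultivar_1": [1, 1, 0, 1],
    "cultivar_2": [1, 0, 0, 1],
    "cultivar_3": [0, 0, 1, 0],
}
matrix = similarity_matrix(profiles)
# Genetic distance is commonly taken as 1 - similarity.
print(1 - matrix[("cultivar_1", "cultivar_2")])
```

A distance matrix built this way could then be passed to a clustering routine to produce the kind of tree the abstract mentions.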
Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats

Directory of Open Access Journals (Sweden)

Graner Andreas

2008-10-01

Background: Barley has one of the largest and most complex genomes of all economically important food crops. The rise of new short read sequencing technologies such as Illumina/Solexa permits such large genomes to be effectively sampled at relatively low cost. Based on the corresponding sequence reads a Mathematically Defined Repeat (MDR) index can be generated to map repetitive regions in genomic sequences. Results: We have generated 574 Mbp of Illumina/Solexa sequences from barley total genomic DNA, representing about 10% of a genome equivalent. From these sequences we generated an MDR index which was then used to identify and mark repetitive regions in the barley genome. Comparison of the MDR plots with expert repeat annotation drawing on the information already available for known repetitive elements revealed a significant correspondence between the two methods. MDR-based annotation allowed for the identification of dozens of novel repeat sequences, though, which were not recognised by hand-annotation. The MDR data was also used to identify gene-containing regions by masking of repetitive sequences in eight de-novo sequenced bacterial artificial chromosome (BAC) clones. For half of the identified candidate gene islands indeed gene sequences could be identified. MDR data were only of limited use, when mapped on genomic sequences from the closely related species Triticum monococcum as only a fraction of the repetitive sequences was recognised. Conclusion: An MDR index for barley, which was obtained by whole-genome Illumina/Solexa sequencing, proved as efficient in repeat identification as manual expert annotation. Circumventing the labour-intensive step of producing a specific repeat library for expert annotation, an MDR index provides an elegant and efficient resource for the identification of repetitive and low-copy (i.e. potentially gene-containing sequences) regions in uncharacterised genomic sequences. The restriction that a particular
Implementing reverse genetics in Rosaceae: analysis of T-DNA flanking sequences of insertional mutant lines in the diploid strawberry, Fragaria vesca.

Science.gov (United States)

Oosumi, Teruko; Ruiz-Rojas, Juan Jairo; Veilleux, Richard E; Dickerman, Allan; Shulaev, Vladimir

2010-09-01

Reverse genetics is used for functional genomics research in model plants. To establish a model system for the systematic reverse genetics research in the Rosaceae family, we analyzed genomic DNA flanking the T-DNA insertions in 191 transgenic plants of the diploid strawberry, Fragaria vesca. One hundred and seventy-six T-DNA flanking sequences were amplified from the right border (RB) and 37 from the left border (LB) by thermal asymmetric interlaced PCR. Analysis of the T-DNA nick positions revealed that T-DNA was most frequently nicked at the cleavage sites. Analysis of 11 T-DNA integration sites indicated that T-DNA was integrated into the F. vesca genome by illegitimate recombination, as reported in other model plants: Arabidopsis, rice and tobacco. First, deletion of DNA was found at T-DNA integration target sites in all transgenic plants tested. Second, microsimilarities of a few base pairs between the left and/or right ends of the T-DNA and genomic sites were found in all transgenic plants tested. Finally, filler DNA was identified in four break-points. Out of 191 transgenic plants, T-DNA flanking sequences of 79 plants (41%) showed significant similarity to genes, elements or proteins of other plant species and 67 (35%) of the sequences are still unknown strawberry gene fragments. T-DNA flanking sequences of 126 plants (66%) showed homology to plant ESTs. This is the first report of T-DNA integration in a sizeable population of a rosaceous species. We have shown in this paper that T-DNA integration in strawberry is not random but directed by sequence microsimilarities in the host genome.

A novel rat genomic simple repeat DNA with RNA-homology shows triplex (H-DNA)-like structure and tissue-specific RNA expression

International Nuclear Information System (INIS)

Dey, Indranil; Rath, Pramod C.

2005-01-01

Mammalian genome contains a wide variety of repetitive DNA sequences of relatively unknown function. We report a novel 227 bp simple repeat DNA (3.3 DNA) with a d{(GA)7A(AG)7} dinucleotide mirror repeat from the rat (Rattus norvegicus) genome. 3.3 DNA showed 75-85% homology with several eukaryotic mRNAs due to (GA/CU)n dinucleotide repeats by nBlast search and a dispersed distribution in the rat genome by Southern blot hybridization with [32P]3.3 DNA. The d{(GA)7A(AG)7} mirror repeat formed a triplex (H-DNA)-like structure in vitro. Two large RNAs of 9.1 and 7.5 kb were detected by [32P]3.3 DNA in rat brain by Northern blot hybridization indicating expression of such simple sequence repeats at RNA level in vivo. Further, several cDNAs were isolated from a rat cDNA library by [32P]3.3 DNA probe. Three such cDNAs showed tissue-specific RNA expression in rat. pRT 4.1 cDNA showed strong expression of a 2.39 kb RNA in brain and spleen, pRT 5.5 cDNA showed strong expression of a 2.8 kb RNA in brain and a 3.9 kb RNA in lungs, and pRT 11.4 cDNA showed weak expression of a 2.4 kb RNA in lungs. Thus, genomic simple sequence repeats containing d(GA/CT)n dinucleotides are transcriptionally expressed and regulated in rat tissues. Such d(GA/CT)n dinucleotide repeats may form structural elements (e.g., triplex) which may be sites for functional regulation of genomic coding sequences as well as RNAs. This may be a general function of such transcriptionally active simple sequence repeats widely dispersed in mammalian genome.
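A mirror repeat reads the same in both directions on a single strand, which distinguishes it from an inverted repeat (where the reverse complement matches). The short sketch below checks this property for the (GA)7 A (AG)7 core named in the abstract; the function name is hypothetical and only the 29-base core, not the full 227 bp element, is modelled.

```python
def is_mirror_repeat(seq: str) -> bool:
    """True if the sequence equals its own reverse on the same strand."""
    s = seq.upper()
    return len(s) > 0 and s == s[::-1]


# Core of the repeat as written in the abstract: (GA)7 A (AG)7.
core = "GA" * 7 + "A" + "AG" * 7
print(len(core), is_mirror_repeat(core))
```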
Clustered Regularly Interspaced Short Palindromic Repeats (CRISPRi) plasmids | Office of Cancer Genomics

Science.gov (United States)

CTD2 researchers at the University of California in San Francisco developed a modified Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)/dCas9 system. Catalytically inactive dCas9 enables modular and programmable RNA-guided genome regulation in eukaryotes.
Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome

NARCIS (Netherlands)

Sharp, Andrew J.; Hansen, Sierra; Selzer, Rebecca R.; Cheng, Ze; Regan, Regina; Hurst, Jane A.; Stewart, Helen; Price, Sue M.; Blair, Edward; Hennekam, Raoul C.; Fitzpatrick, Carrie A.; Segraves, Rick; Richmond, Todd A.; Guiver, Cheryl; Albertson, Donna G.; Pinkel, Daniel; Eis, Peggy S.; Schwartz, Stuart; Knight, Samantha J. L.; Eichler, Evan E.

2006-01-01

Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic
Identification, characterization, and utilization of genome-wide simple sequence repeats to identify a QTL for acidity in apple

Science.gov (United States)

2012-01-01

Background: Apple is an economically important fruit crop worldwide. Developing a genetic linkage map is a critical step towards mapping and cloning of genes responsible for important horticultural traits in apple. To facilitate linkage map construction, we surveyed and characterized the distribution and frequency of perfect microsatellites in assembled contig sequences of the apple genome. Results: A total of 28,538 SSRs have been identified in the apple genome, with an overall density of 40.8 SSRs per Mb. Di-nucleotide repeats are the most frequent microsatellites in the apple genome, accounting for 71.9% of all microsatellites. AT/TA repeats are the most frequent in genomic regions, accounting for 38.3% of all the G-SSRs, while AG/GA dimers prevail in transcribed sequences, and account for 59.4% of all EST-SSRs. A total set of 310 SSRs is selected to amplify eight apple genotypes. Of these, 245 (79.0%) are found to be polymorphic among cultivars and wild species tested. AG/GA motifs in genomic regions have detected more alleles and higher PIC values than AT/TA or AC/CA motifs. Moreover, AG/GA repeats are more variable than any other dimers in apple, and should be preferentially selected for studies, such as genetic diversity and linkage map construction. A total of 54 newly developed apple SSRs have been genetically mapped. Interestingly, clustering of markers with distorted segregation is observed on linkage groups 1, 2, 10, 15, and 16. A QTL responsible for malic acid content of apple fruits is detected on linkage group 8, and accounts for ~13.5% of the observed phenotypic variation. Conclusions: This study demonstrates that di-nucleotide repeats are prevalent in the apple genome and that AT/TA and AG/GA repeats are the most frequent in genomic and transcribed sequences of apple, respectively. All SSR motifs identified in this study as well as those newly mapped SSRs will serve as valuable resources for pursuing apple genetic studies, aiding the apple breeding
DNA dynamics is likely to be a factor in the genomic nucleotide repeats expansions related to diseases.

Directory of Open Access Journals (Sweden)

Boian S Alexandrov

Trinucleotide repeat sequences (TRS) represent a common type of genomic DNA motif whose expansion is associated with a large number of human diseases. The driving molecular mechanisms of the TRS ongoing dynamic expansion across generations and within tissues and its influence on genomic DNA functions are not well understood. Here we report results for a novel and notable collective breathing behavior of genomic DNA of tandem TRS, leading to propensity for large local DNA transient openings at physiological temperature. Our Langevin molecular dynamics (LMD) and Markov Chain Monte Carlo (MCMC) simulations demonstrate that the patterns of openings of various TRSs depend specifically on their length. The collective propensity for DNA strand separation of repeated sequences serves as a precursor for outsized intermediate bubble states independently of the G/C-content. We report that repeats have the potential to interfere with the binding of transcription factors to their consensus sequence by altered DNA breathing dynamics in proximity of the binding sites. These observations might influence ongoing attempts to use LMD and MCMC simulations for TRS-related modeling of genomic DNA functionality in elucidating the common denominators of the dynamic TRS expansion mutation with potential therapeutic applications.
High-resolution comparative mapping among man, cattle and mouse suggests a role for repeat sequences in mammalian genome evolution

Directory of Open Access Journals (Sweden)

Rodolphe François

2006-08-01

Full Text Available Abstract Background Comparative mapping provides new insights into the evolutionary history of genomes. In particular, recent studies in mammals have suggested a role for segmental duplication in genome evolution. In some species such as Drosophila or maize, transposable elements (TEs) have been shown to be involved in chromosomal rearrangements. In this work, we have explored the presence of interspersed repeats in regions of chromosomal rearrangements, using an updated high-resolution integrated comparative map among cattle, man and mouse. Results The bovine, human and mouse comparative autosomal map has been constructed using data from bovine genetic and physical maps and from FISH-mapping studies. We confirm most previous results but also reveal some discrepancies. A total of 211 conserved segments have been identified between cattle and man, of which 33 are new segments and 72 correspond to extended, previously known segments. The resulting map covers 91% and 90% of the human and bovine genomes, respectively. Analysis of breakpoint regions revealed a high density of species-specific interspersed repeats in the human and mouse genomes. Conclusion Analysis of the breakpoint regions has revealed specific repeat density patterns, suggesting that TEs may have played a significant role in chromosome evolution and genome plasticity. However, we cannot rule out that repeats and breakpoints accumulate independently in the same few regions where modifications are better tolerated. Likewise, we cannot ascertain whether increased TE density is the cause or the consequence of chromosome rearrangements. Nevertheless, the identification of high density repeat clusters combined with a well-documented repeat phylogeny should highlight probable breakpoints, and permit their precise dating. Combining new statistical models taking the present information into account should help reconstruct ancestral karyotypes.
Distribution and Evolution of Yersinia Leucine-Rich Repeat Proteins

Science.gov (United States)

Hu, Yueming; Huang, He; Hui, Xinjie; Cheng, Xi; White, Aaron P.

2016-01-01

Leucine-rich repeat (LRR) proteins are widely distributed in bacteria, playing important roles in various protein-protein interaction processes. In Yersinia, the well-characterized type III secreted effector YopM also belongs to the LRR protein family and is encoded by virulence plasmids. However, little has been known about other LRR members encoded by Yersinia genomes or their evolution. In this study, the Yersinia LRR proteins were comprehensively screened, categorized, and compared. The LRR proteins encoded by chromosomes (LRR1 proteins) appeared to be more similar to each other and different from those encoded by plasmids (LRR2 proteins) with regard to repeat-unit length, amino acid composition profile, and gene expression regulation circuits. LRR1 proteins were also different from LRR2 proteins in that the LRR1 proteins contained an E3 ligase domain (NEL domain) in the C-terminal region or an NEL domain-encoding nucleotide relic in flanking genomic sequences. The LRR1 protein-encoding genes (LRR1 genes) varied dramatically and were categorized into 4 subgroups (a to d), with the LRR1a to -c genes evolving from the same ancestor and LRR1d genes evolving from another ancestor. The consensus and ancestor repeat-unit sequences were inferred for different LRR1 protein subgroups by use of a maximum parsimony modeling strategy. Structural modeling disclosed very similar repeat-unit structures between LRR1 and LRR2 proteins despite the different unit lengths and amino acid compositions. Structural constraints may serve as the driving force to explain the observed mutations in the LRR regions. This study suggests that there may be functional variation and lays the foundation for future experiments investigating the functions of the chromosomally encoded LRR proteins of Yersinia. PMID:27217422
Relaxation of the south flank after the 7.2-magnitude Kalapana earthquake, Kilauea Volcano, Hawaii

Science.gov (United States)

Dvorak, John J.; Klein, Fred W.; Swanson, Donald A.

1994-01-01

An M = 7.2 earthquake on 29 November 1975 caused the south flank of Kilauea Volcano, Hawaii, to move seaward several meters: a catastrophic release of compression of the south flank caused by earlier injections of magma into the adjacent segment of a rift zone. The focal mechanisms of the mainshock, the largest foreshock, and the largest aftershock suggest seaward movement of the upper block. The rate of aftershocks decreased in a familiar hyperbolic decay, reaching the pre-1975 rate of seismicity by the mid-1980s. Repeated rift-zone intrusions and eruptions after 1975, which occurred within 25 km of the summit area, compressed the adjacent portion of the south flank, apparently masking continued seaward displacement of the south flank. This is evident along a trilateration line that continued to extend, suggesting seaward displacement, immediately after the M = 7.2 earthquake, but then was compressed during a series of intrusions and eruptions that began in September 1977. Farther to the east, trilateration measurements show that the portion of the south flank above the aftershock zone, but beyond the area of compression caused by the rift-zone intrusions and eruptions, continued to move seaward at a decreasing rate until the mid-1980s, mimicking the decay in aftershock rate. Along the same portion of the south flank, the pattern of vertical surface displacements can be explained by continued seaward movement of the south flank and development of two eruptive fissures along the east rift zone, each of which extended from a depth of ∼3 km to the surface. The aftershock rate and continued seaward movement of the south flank are reminiscent of crustal response to other large earthquakes, such as the 1966 M = 6 Parkfield earthquake and the 1983 M = 6.5 Coalinga earthquake.
Comparative Genomics of Carp Herpesviruses

Science.gov (United States)

Kurobe, Tomofumi; Gatherer, Derek; Cunningham, Charles; Korf, Ian; Fukuda, Hideo; Hedrick, Ronald P.; Waltzek, Thomas B.

2013-01-01

Three alloherpesviruses are known to cause disease in cyprinid fish: cyprinid herpesviruses 1 and 3 (CyHV1 and CyHV3) in common carp and koi and cyprinid herpesvirus 2 (CyHV2) in goldfish. We have determined the genome sequences of CyHV1 and CyHV2 and compared them with the published CyHV3 sequence. The CyHV1 and CyHV2 genomes are 291,144 and 290,304 bp, respectively, in size, and thus the CyHV3 genome, at 295,146 bp, remains the largest recorded among the herpesviruses. Each of the three genomes consists of a unique region flanked at each terminus by a sizeable direct repeat. The CyHV1, CyHV2, and CyHV3 genomes are predicted to contain 137, 150, and 155 unique, functional protein-coding genes, respectively, of which six, four, and eight, respectively, are duplicated in the terminal repeat. The three viruses share 120 orthologous genes in a largely colinear arrangement, of which up to 55 are also conserved in the other member of the genus Cyprinivirus, anguillid herpesvirus 1. Twelve genes are conserved convincingly in all sequenced alloherpesviruses, and two others are conserved marginally. The reference CyHV3 strain has been reported to contain five fragmented genes that are presumably nonfunctional. The CyHV2 strain has two fragmented genes, and the CyHV1 strain has none. CyHV1, CyHV2, and CyHV3 have five, six, and five families of paralogous genes, respectively. One family unique to CyHV1 is related to cellular JUNB, which encodes a transcription factor involved in oncogenesis. To our knowledge, this is the first time that JUNB-related sequences have been reported in a herpesvirus. PMID:23269803
Genome-wide identification and validation of simple sequence repeats (SSRs) from Asparagus officinalis.

Science.gov (United States)

Li, Shufen; Zhang, Guojun; Li, Xu; Wang, Lianjun; Yuan, Jinhong; Deng, Chuanliang; Gao, Wujun

2016-06-01

Garden asparagus (Asparagus officinalis), an important vegetable cultivated worldwide, can also serve as a model dioecious plant species in the study of sex determination and sex chromosome evolution. However, limited DNA marker resources have been developed and used for this species. To expand these resources, we examined the DNA sequences for simple sequence repeats (SSRs) in 163,406 scaffolds representing approximately 400 Mbp of the A. officinalis genome. A total of 87,576 SSRs were identified in 59,565 scaffolds. The most abundant SSR repeats were trinucleotide and tetranucleotide, accounting for 29.2 and 29.1% of the total SSRs, respectively, followed by di-, penta-, hexa-, hepta-, and octanucleotides. The AG motif was most common among dinucleotides and was also the most frequent motif in the entire A. officinalis genome, representing 14.7% of all SSRs. A total of 41,917 SSR primer pairs were designed to amplify SSRs. Twenty-two genomic SSR markers were tested in 39 asparagus accessions belonging to ten cultivars and one accession of Asparagus setaceus for determination of genetic diversity. The intra-species polymorphism information content (PIC) values of the 22 genomic SSR markers were intermediate, with an average of 0.41. The genetic diversity between the ten A. officinalis cultivars was low, and the UPGMA dendrogram was largely unrelated to cultivars. It is here suggested that the sex of individuals is an important factor influencing the clustering results. The results reported here provide new information about the organization of microsatellites in the A. officinalis genome and lay a foundation for further genetic studies and breeding applications of A. officinalis and related species. Copyright © 2016 Elsevier Ltd. All rights reserved.
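The PIC values quoted above are commonly computed from allele frequencies with the formula of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over pairs of 2 p_i^2 p_j^2. The abstract does not say which estimator was used, so the following is only a minimal sketch under that assumption; the function names and the example allele sizes are hypothetical.

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(alleles):
    """Return the relative frequency of each distinct allele call."""
    counts = Counter(alleles)
    total = sum(counts.values())
    return [n / total for n in counts.values()]


def pic(freqs):
    """Polymorphism information content (assumed Botstein et al. form)."""
    # Expected homozygosity: sum of squared allele frequencies.
    homozygosity = sum(p * p for p in freqs)
    # Correction for uninformative matings between identical heterozygotes.
    correction = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - correction


# Hypothetical fragment sizes (bp) scored at one SSR marker.
calls = [152, 152, 156, 156, 152, 156, 152, 156]
print(round(pic(allele_frequencies(calls)), 3))
```

A marker with a single allele gives a PIC of 0, and the value rises as more alleles occur at balanced frequencies, which is why averages such as 0.41 are described as intermediate.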
Assembling large genomes: analysis of the stick insect (Clitarchus hookeri) genome reveals a high repeat content and sex-biased genes associated with reproduction.

Science.gov (United States)

Wu, Chen; Twort, Victoria G; Crowhurst, Ross N; Newcomb, Richard D; Buckley, Thomas R

2017-11-16

Stick insects (Phasmatodea) have a high incidence of parthenogenesis and other alternative reproductive strategies, yet the genetic basis of reproduction is poorly understood. Phasmatodea includes nearly 3000 species, yet only the genome of Timema cristinae has been published to date. Clitarchus hookeri is a geographical parthenogenetic stick insect distributed across New Zealand. Sexual reproduction dominates in northern habitats but is replaced by parthenogenesis in the south. Here, we present a de novo genome assembly of a female C. hookeri and use it to detect candidate genes associated with gamete production and development in females and males. We also explore the factors underlying large genome size in stick insects. The C. hookeri genome assembly was 4.2 Gb, similar to the flow cytometry estimate, making it the second largest insect genome sequenced and assembled to date. Like the large genome of Locusta migratoria, the genome of C. hookeri is also highly repetitive and the predicted gene models are much longer than those from most other sequenced insect genomes, largely due to longer introns. Miniature inverted repeat transposable elements (MITEs), absent in the much smaller T. cristinae genome, are the most abundant repeat type in the C. hookeri genome assembly. Mapping RNA-Seq reads from female and male gonadal transcriptomes onto the genome assembly resulted in the identification of 39,940 gene loci, 15.8% and 37.6% of which showed female-biased and male-biased expression, respectively. The genes that were over-expressed in females were mostly associated with molecular transportation, developmental process, oocyte growth and reproductive process, whereas the male-biased genes were enriched in rhythmic process, molecular transducer activity and synapse. Several genes involved in the juvenile hormone synthesis pathway were also identified. The evolution of large insect genomes such as L. migratoria and C. hookeri genomes is most likely due to the
Cis-acting regulatory sequences promote high-frequency gene conversion between repeated sequences in mammalian cells.

Science.gov (United States)

Raynard, Steven J; Baker, Mark D

2004-01-01

In mammalian cells, little is known about the nature of recombination-prone regions of the genome. Previously, we reported that the immunoglobulin heavy chain (IgH) mu locus behaved as a hotspot for mitotic, intrachromosomal gene conversion (GC) between repeated mu constant (Cmu) regions in mouse hybridoma cells. To investigate whether elements within the mu gene regulatory region were required for hotspot activity, gene targeting was used to delete a 9.1 kb segment encompassing the mu gene promoter (Pmu), enhancer (Emu) and switch region (Smu) from the locus. In these cell lines, GC between the Cmu repeats was significantly reduced, indicating that this 'recombination-enhancing sequence' (RES) is necessary for GC hotspot activity at the IgH locus. Importantly, the RES fragment stimulated GC when appended to the same Cmu repeats integrated at ectopic genomic sites. We also show that deletion of Emu and flanking matrix attachment regions (MARs) from the RES abolishes GC hotspot activity at the IgH locus. However, no stimulation of ectopic GC was observed with the Emu/MARs fragment alone. Finally, we provide evidence that no correlation exists between the level of transcription and GC promoted by the RES. We suggest a model whereby Emu/MARs enhances mitotic GC at the endogenous IgH mu locus by effecting chromatin modifications in adjacent DNA.
Fusion primer and nested integrated PCR (FPNI-PCR): a new high-efficiency strategy for rapid chromosome walking or flanking sequence cloning

Directory of Open Access Journals (Sweden)

Wang Zhen

2011-11-01

Full Text Available Abstract Background The advent of genomics-based technologies has revolutionized many fields of biological enquiry. However, chromosome walking or flanking sequence cloning is still a necessary and important procedure for determining gene structure. Such methods are used to identify T-DNA insertion sites and so are especially relevant for organisms where large T-DNA insertion libraries have been created, such as rice and Arabidopsis. The currently available methods for flanking sequence cloning, including the popular TAIL-PCR technique, are relatively laborious and slow. Results Here, we report a simple and effective fusion primer and nested integrated PCR method (FPNI-PCR) for the identification and cloning of unknown genomic regions flanked by known sequences. In brief, a set of universal primers was designed that consisted of various 15-16 base arbitrary degenerate oligonucleotides. These arbitrary degenerate primers were fused to the 3' end of an adaptor oligonucleotide which provided a known sequence without degenerate nucleotides, thereby forming the fusion primers (FPs). These fusion primers are employed in the first step of an integrated nested PCR strategy which defines the overall FPNI-PCR protocol. In order to demonstrate the efficacy of this novel strategy, we have successfully used it to isolate multiple genomic sequences, namely 21 orthologs of genes in various species of Rosaceae, 4 MYB genes of Rosa rugosa, 3 promoters of transcription factors of Petunia hybrida, 4 flanking sequences of T-DNA insertion sites in transgenic tobacco lines, and 6 specific genes from the sequenced genomes of rice and Arabidopsis. Conclusions The successful amplification of target products through FPNI-PCR verified that this novel strategy is an effective, low cost and simple procedure. Furthermore, FPNI-PCR represents a more sensitive, rapid and accurate technique than the established TAIL-PCR and hiTAIL-PCR procedures.
Comparative genomics of community-acquired ST59 methicillin-resistant Staphylococcus aureus in Taiwan: novel mobile resistance structures with IS1216V.

Directory of Open Access Journals (Sweden)

Wei-Chun Hung

Full Text Available Methicillin-resistant Staphylococcus aureus (MRSA) with ST59/SCCmecV and Panton-Valentine leukocidin gene is a major community-acquired MRSA (CA-MRSA) lineage in Taiwan and has been multidrug-resistant since its initial isolation. In this study, we studied the acquisition mechanism of multidrug resistance in an ST59 CA-MRSA strain (PM1) by comparative genomics. PM1's non-β-lactam resistance was encoded by two unique genetic traits. One was a 21,832-bp composite mobile element structure (MES(PM1)), which was flanked by direct repeats of enterococcal IS1216V and was inserted into the chromosomal sasK gene; the target sequence (att) was 8 bp long and was duplicated at both ends of MES(PM1). MES(PM1) consisted of two regions: the 5'-end side 12.4-kb region carrying Tn551 (with ermB) and Tn5405-like (with aph[3']-IIIa and aadE), similar to an Enterococcus faecalis plasmid, and the 3'-end side 6,587-bp region (MES(cat)) that carries cat and is flanked by inverted repeats of IS1216V. MES(cat) possessed att duplication at both ends and two additional copies of IS1216V inside. MES(PM1) represents the first enterococcal IS1216V-mediated composite transposon to have emerged in MRSA. IS1216V-mediated deletion likely occurred in IS1216V-rich MES(PM1), resulting in distinct resistance patterns in PM1-derivative strains. Another structure was a 6,025-bp tet-carrying element (MES(tet)) on a 25,961-bp novel mosaic penicillinase plasmid (pPM1); MES(tet) was flanked by direct repeats of IS431, but with no target sequence repeats. Moreover, the PM1 genome was deficient in a copy of the restriction and modification genes (hsdM and hsdS), which might have contributed to the acquisition of enterococcal multidrug resistance.
Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome.

Science.gov (United States)

Sharp, Andrew J; Hansen, Sierra; Selzer, Rebecca R; Cheng, Ze; Regan, Regina; Hurst, Jane A; Stewart, Helen; Price, Sue M; Blair, Edward; Hennekam, Raoul C; Fitzpatrick, Carrie A; Segraves, Rick; Richmond, Todd A; Guiver, Cheryl; Albertson, Donna G; Pinkel, Daniel; Eis, Peggy S; Schwartz, Stuart; Knight, Samantha J L; Eichler, Evan E

2006-09-01

Genomic disorders are characterized by the presence of flanking segmental duplications that predispose these regions to recurrent rearrangement. Based on the duplication architecture of the genome, we investigated 130 regions that we hypothesized as candidates for previously undescribed genomic disorders. We tested 290 individuals with mental retardation by BAC array comparative genomic hybridization and identified 16 pathogenic rearrangements, including de novo microdeletions of 17q21.31 found in four individuals. Using oligonucleotide arrays, we refined the breakpoints of this microdeletion, defining a 478-kb critical region containing six genes that were deleted in all four individuals. We mapped the breakpoints of this deletion and of four other pathogenic rearrangements in 1q21.1, 15q13, 15q24 and 17q12 to flanking segmental duplications, suggesting that these are also sites of recurrent rearrangement. In common with the 17q21.31 deletion, these breakpoint regions are sites of copy number polymorphism in controls, indicating that these may be inherently unstable genomic regions.
Expansion of inverted repeat does not decrease substitution rates in Pelargonium plastid genomes.

Science.gov (United States)

Weng, Mao-Lun; Ruhlman, Tracey A; Jansen, Robert K

2017-04-01

For species with minor inverted repeat (IR) boundary changes in the plastid genome (plastome), nucleotide substitution rates were previously shown to be lower in the IR than the single copy regions (SC). However, the impact of large-scale IR expansion/contraction on plastid nucleotide substitution rates among closely related species remains unclear. We included plastomes from 22 Pelargonium species, including eight newly sequenced genomes, and used both pairwise and model-based comparisons to investigate the impact of the IR on sequence evolution in plastids. Ten types of plastome organization with different inversions or IR boundary changes were identified in Pelargonium. Inclusion in the IR was not sufficient to explain the variation of nucleotide substitution rates. Instead, the rate heterogeneity in Pelargonium plastomes was a mixture of locus-specific, lineage-specific and IR-dependent effects. Our study of Pelargonium plastomes that vary in IR length and gene content demonstrates that the evolutionary consequences of retaining these repeats are more complicated than previously suggested. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.
Flank pain

Science.gov (United States)

... how to do these exercises at home. Nonsteroidal anti-inflammatory drugs (NSAIDs) and physical therapy may be prescribed for flank pain caused by spinal arthritis. Antibiotics are used to treat most kidney infections. You ...
Scarless and sequential gene modification in Pseudomonas using PCR product flanked by short homology regions

Directory of Open Access Journals (Sweden)

Liang Rubing

2010-08-01

Full Text Available Abstract Background The lambda Red recombination system has been used to inactivate chromosomal genes in various bacteria and fungi. The procedure consists of electroporating a polymerase chain reaction (PCR) fragment containing an antibiotic cassette flanked by homology regions to the target locus into a strain that can express the lambda Red proteins (Gam, Bet, Exo). Results Here a scarless gene modification strategy based on the Red recombination system has been developed to modify Pseudomonas genome DNA via sequential deletion of multiple targets. This process was mediated by plasmid pRKaraRed encoding the Red proteins regulated by the PBAD promoter, which was functional in P. aeruginosa as well as in other bacteria. First the target gene was replaced with the sacB-bla cassette flanked by short homology regions (50 bp), and then this marker gene cassette could be replaced by the PCR fragment flanking itself, generating a target-deleted genome without any remnants and with no change to the surrounding region. Twenty genes involved in the synthesis and regulation pathways of the phenazine derivative pyocyanin were modified, including one single-point mutation and deletion of two large operons. The recombination efficiencies ranged from 88% to 98%. Multiple-gene modification was also achieved, generating a triple-gene deletion strain PCA (PAO1, ΔphzHΔphzMΔphzS), which could produce another phenazine derivative, phenazine-1-carboxylic acid (PCA), efficiently and exclusively. Conclusions This lambda Red-based technique can be used to generate scarless and sequential gene modification mutants of P. aeruginosa efficiently, using one-step PCR product flanked by short homology regions. Single-point mutation and scarless deletion of genes can be achieved easily in less than three days. This method may give a new way to construct genetically modified P. aeruginosa strains more efficiently and advance the regulatory network study of this organism.
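The two-step replacement described above can be illustrated by assembling the two linear fragments in silico. This is a rough sketch under stated assumptions: the genome is treated as a linear string, the function names are hypothetical, and the cassette is a placeholder string rather than the real sacB-bla sequence. Only the 50 bp arm length comes from the abstract.

```python
ARM = 50  # homology arm length in bp, as stated in the abstract


def homology_arms(genome, start, end, arm=ARM):
    """Return the arm-bp regions directly upstream and downstream of genome[start:end]."""
    if start < arm or end + arm > len(genome) or start >= end:
        raise ValueError("target too close to sequence ends or empty")
    return genome[start - arm:start], genome[end:end + arm]


def marker_fragment(genome, start, end, cassette, arm=ARM):
    """Step 1 fragment: marker cassette flanked by both homology arms."""
    up, down = homology_arms(genome, start, end, arm)
    return up + cassette + down


def scarless_fragment(genome, start, end, arm=ARM):
    """Step 2 fragment: the two arms joined directly, leaving no marker behind."""
    up, down = homology_arms(genome, start, end, arm)
    return up + down


def deleted_genome(genome, start, end):
    """Expected genome after both steps: the target removed, neighbours untouched."""
    return genome[:start] + genome[end:]


# Toy example with a made-up 7 bp target inside a synthetic sequence.
genome = "A" * 60 + "GATTACA" + "C" * 60
step1 = marker_fragment(genome, 60, 67, cassette="<sacB-bla>")
step2 = scarless_fragment(genome, 60, 67)
print(len(step1), len(step2), step2 in deleted_genome(genome, 60, 67))
```

The final check confirms that the step 2 fragment matches the junction of the expected deletion exactly, which is the sense in which the modification is scarless.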
Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes

Science.gov (United States)

Thybert, David; Roller, Maša; Navarro, Fábio C.P.; Fiddes, Ian; Streeter, Ian; Feig, Christine; Martin-Galvez, David; Kolmogorov, Mikhail; Janoušek, Václav; Akanni, Wasiu; Aken, Bronwen; Aldridge, Sarah; Chakrapani, Varshith; Chow, William; Clarke, Laura; Cummins, Carla; Doran, Anthony; Dunn, Matthew; Goodstadt, Leo; Howe, Kerstin; Howell, Matthew; Josselin, Ambre-Aurore; Karn, Robert C.; Laukaitis, Christina M.; Jingtao, Lilue; Martin, Fergal; Muffato, Matthieu; Nachtweide, Stefanie; Quail, Michael A.; Sisu, Cristina; Stanke, Mario; Stefflova, Klara; Van Oosterhout, Cock; Veyrunes, Frederic; Ward, Ben; Yang, Fengtang; Yazdanifar, Golbahar; Zadissa, Amonida; Adams, David J.; Brazma, Alvis; Gerstein, Mark; Paten, Benedict; Pham, Son; Keane, Thomas M.; Odom, Duncan T.; Flicek, Paul

2018-01-01

Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli, which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology. PMID:29563166
Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats

Directory of Open Access Journals (Sweden)

Adler Joël

2007-12-01

Full Text Available Abstract Background Many parasitic organisms, eukaryotes as well as bacteria, possess surface antigens with amino acid repeats. Making up the interface between host and pathogen, such repetitive proteins may be virulence factors involved in immune evasion or cytoadherence. They find immunological applications in serodiagnostics and vaccine development. Here we use proteins which contain perfect repeats as a basis for comparative genomics between parasitic and free-living organisms. Results We have developed Reptile (http://reptile.unibe.ch), a program for proteome-wide probabilistic description of perfect repeats in proteins. Parasite proteomes exhibited a large variance regarding the proportion of repeat-containing proteins. Interestingly, there was a good correlation between the percentage of highly repetitive proteins and mean protein length in parasite proteomes, but not at all in the proteomes of free-living eukaryotes. Reptile combined with programs for the prediction of transmembrane domains and GPI-anchoring resulted in an effective tool for in silico identification of potential surface antigens and virulence factors from parasites. Conclusion Systemic surveys for perfect amino acid repeats allowed basic comparisons between free-living and parasitic organisms that were directly applicable to predict proteins of serological and parasitological importance. An on-line tool is available at http://genomics.unibe.ch/dora.

Reverse time migration of prism waves for salt flank delineation

KAUST Repository

Dai, Wei; Schuster, Gerard T.

2013-01-01

In this paper, we present a new reverse time migration method for imaging salt flanks with prism wave reflections. It consists of four steps: (1) migrating the seismic data with conventional RTM to give the RTM image; (2) using the RTM image as a reflectivity model to simulate source-side reflections with the Born approximation; (3) zero-lag correlation of the source-side reflection wavefields and receiver-side wavefields to produce the prism wave migration image; and (4) repeating steps 2 and 3 for the receiver-side reflections. An advantage of this method is that there is no need to pick the horizontal reflectors prior to migration of the prism waves. It also separately images the vertical structures at a different step to reduce crosstalk interference. The disadvantage of prism wave migration algorithm is that its computational cost is twice that of conventional RTM. The empirical results with a salt model suggest that prism wave migration can be an effective method for salt flank delineation in the absence of diving waves.
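Steps 3 and 4 above rely on the zero-lag cross-correlation imaging condition. As a hedged illustration, the two prism wave images can be written as below. The notation is an assumption, not taken from the paper: S and R denote the source-side and receiver-side wavefields, and the delta-prefixed quantities denote the reflections simulated by Born modelling with the conventional RTM image as the reflectivity model.

```latex
% Assumed notation: S, R = source- and receiver-side wavefields;
% \delta S, \delta R = Born-modelled reflections generated from the RTM image.
\begin{align}
  I_{\mathrm{src}}(\mathbf{x}) &= \sum_{t} \delta S(\mathbf{x}, t)\, R(\mathbf{x}, t)
    && \text{(step 3: source-side reflections)} \\
  I_{\mathrm{rcv}}(\mathbf{x}) &= \sum_{t} S(\mathbf{x}, t)\, \delta R(\mathbf{x}, t)
    && \text{(step 4: receiver-side reflections)}
\end{align}
```

Each image needs one Born modelling pass in addition to the wavefield propagation of conventional RTM, which is consistent with the abstract's statement that the cost is roughly doubled.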
Reverse time migration of prism waves for salt flank delineation

KAUST Repository

Dai, Wei

2013-09-22

In this paper, we present a new reverse time migration method for imaging salt flanks with prism wave reflections. It consists of four steps: (1) migrating the seismic data with conventional RTM to give the RTM image; (2) using the RTM image as a reflectivity model to simulate source-side reflections with the Born approximation; (3) zero-lag correlation of the source-side reflection wavefields and receiver-side wavefields to produce the prism wave migration image; and (4) repeating steps 2 and 3 for the receiver-side reflections. An advantage of this method is that there is no need to pick the horizontal reflectors prior to migration of the prism waves. It also separately images the vertical structures at a different step to reduce crosstalk interference. The disadvantage of prism wave migration algorithm is that its computational cost is twice that of conventional RTM. The empirical results with a salt model suggest that prism wave migration can be an effective method for salt flank delineation in the absence of diving waves.
Genome-Wide Analysis of Simple Sequence Repeats in Bitter Gourd (Momordica charantia)

Directory of Open Access Journals (Sweden)

Junjie Cui

2017-06-01

Full Text Available Bitter gourd (Momordica charantia) is widely cultivated as a vegetable and medicinal herb in many Asian and African countries. After the sequencing of the cucumber (Cucumis sativus), watermelon (Citrullus lanatus), and melon (Cucumis melo) genomes, bitter gourd became the fourth cucurbit species whose whole genome was sequenced. However, a comprehensive analysis of simple sequence repeats (SSRs) in bitter gourd, including a comparison with the three aforementioned cucurbit species, has not yet been published. Here, we identified a total of 188,091 and 167,160 SSR motifs in the genomes of the bitter gourd lines ‘Dali-11’ and ‘OHB3-1,’ respectively. Subsequently, the SSR content, motif lengths, and classified motif types were characterized for the bitter gourd genomes and compared among all the cucurbit genomes. Lastly, a large set of 138,727 unique in silico SSR primer pairs were designed for bitter gourd. Among these, 71 primers were selected, all of which successfully amplified SSRs from the two bitter gourd lines ‘Dali-11’ and ‘K44’. To further examine the utilization of unique SSR primers, 21 SSR markers were used to genotype a collection of 211 bitter gourd lines from all over the world. A model-based clustering method and phylogenetic analysis indicated a clear separation among the geographic groups. The genomic SSR markers developed in this study have considerable potential value in advancing bitter gourd research.
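Genome-wide SSR surveys like the one above start by scanning sequence for perfect tandem repeats of short motifs. The abstract does not name the tool or thresholds used, so the following is only a minimal regex-based sketch. The minimum repeat counts in `MIN_REPEATS` and the function name are assumptions chosen for illustration.

```python
import re

# Assumed minimum number of tandem copies per motif length (mono- to hexanucleotide).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for unit_len, min_rep in sorted(min_repeats.items()):
        # A motif of unit_len bases followed by at least min_rep - 1 exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "AGAG" is reported under the dinucleotide "AG").
            if any(
                motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
                if unit_len % k == 0
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return sorted(hits)


# Synthetic test sequence: an (AG)8 and an (ATC)5 repeat separated by spacers.
example = "CCT" + "AG" * 8 + "CCGG" + "ATC" * 5 + "GG"
print(find_ssrs(example))
```

Flanking sequence on either side of each reported start position is what primer design would then draw on; changing the thresholds changes the counts substantially, so totals from different studies are only comparable when the criteria match.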
The chloroplast genome sequence of the green alga Leptosira terrestris: multiple losses of the inverted repeat and extensive genome rearrangements within the Trebouxiophyceae

Directory of Open Access Journals (Sweden)

Turmel Monique

2007-07-01

Full Text Available Abstract Background In the Chlorophyta – the green algal phylum comprising the classes Prasinophyceae, Ulvophyceae, Trebouxiophyceae and Chlorophyceae – the chloroplast genome displays a highly variable architecture. While chlorophycean chloroplast DNAs (cpDNAs) deviate considerably from the ancestral pattern described for the prasinophyte Nephroselmis olivacea, the degree of remodelling sustained by the two ulvophyte cpDNAs completely sequenced to date is intermediate relative to those observed for chlorophycean and trebouxiophyte cpDNAs. Chlorella vulgaris (Chlorellales) is currently the only photosynthetic trebouxiophyte whose complete cpDNA sequence has been reported. To gain insights into the evolutionary trends of the chloroplast genome in the Trebouxiophyceae, we sequenced cpDNA from the filamentous alga Leptosira terrestris (Ctenocladales). Results The 195,081-bp Leptosira chloroplast genome resembles the 150,613-bp Chlorella genome in lacking a large inverted repeat (IR) but differs greatly in gene order. Six of the conserved genes present in Chlorella cpDNA are missing from the Leptosira gene repertoire. The 106 conserved genes, four introns and 11 free standing open reading frames (ORFs) account for 48.3% of the genome sequence. This is the lowest gene density yet observed among chlorophyte cpDNAs. Contrary to the situation in Chlorella but similar to that in the chlorophycean Scenedesmus obliquus, the gene distribution is highly biased over the two DNA strands in Leptosira. Nine genes, compared to only three in Chlorella, have significantly expanded coding regions relative to their homologues in ancestral-type green algal cpDNAs. As observed in chlorophycean genomes, the rpoB gene is fragmented into two ORFs. Short repeats account for 5.1% of the Leptosira genome sequence and are present mainly in intergenic regions. Conclusion Our results highlight the great plasticity of the chloroplast genome in the Trebouxiophyceae and indicate
Repeat associated mechanisms of genome evolution and function revealed by the Mus caroli and Mus pahari genomes.

Science.gov (United States)

Thybert, David; Roller, Maša; Navarro, Fábio C P; Fiddes, Ian; Streeter, Ian; Feig, Christine; Martin-Galvez, David; Kolmogorov, Mikhail; Janoušek, Václav; Akanni, Wasiu; Aken, Bronwen; Aldridge, Sarah; Chakrapani, Varshith; Chow, William; Clarke, Laura; Cummins, Carla; Doran, Anthony; Dunn, Matthew; Goodstadt, Leo; Howe, Kerstin; Howell, Matthew; Josselin, Ambre-Aurore; Karn, Robert C; Laukaitis, Christina M; Jingtao, Lilue; Martin, Fergal; Muffato, Matthieu; Nachtweide, Stefanie; Quail, Michael A; Sisu, Cristina; Stanke, Mario; Stefflova, Klara; Van Oosterhout, Cock; Veyrunes, Frederic; Ward, Ben; Yang, Fengtang; Yazdanifar, Golbahar; Zadissa, Amonida; Adams, David J; Brazma, Alvis; Gerstein, Mark; Paten, Benedict; Pham, Son; Keane, Thomas M; Odom, Duncan T; Flicek, Paul

2018-04-01

Understanding the mechanisms driving lineage-specific evolution in both primates and rodents has been hindered by the lack of sister clades with a similar phylogenetic structure having high-quality genome assemblies. Here, we have created chromosome-level assemblies of the Mus caroli and Mus pahari genomes. Together with the Mus musculus and Rattus norvegicus genomes, this set of rodent genomes is similar in divergence times to the Hominidae (human-chimpanzee-gorilla-orangutan). By comparing the evolutionary dynamics between the Muridae and Hominidae, we identified punctate events of chromosome reshuffling that shaped the ancestral karyotype of Mus musculus and Mus caroli between 3 and 6 million yr ago, but that are absent in the Hominidae. Hominidae show between four- and sevenfold lower rates of nucleotide change and feature turnover in both neutral and functional sequences, suggesting an underlying coherence to the Muridae acceleration. Our system of matched, high-quality genome assemblies revealed how specific classes of repeats can play lineage-specific roles in related species. Recent LINE activity has remodeled protein-coding loci to a greater extent across the Muridae than the Hominidae, with functional consequences at the species level such as reproductive isolation. Furthermore, we charted a Muridae-specific retrotransposon expansion at unprecedented resolution, revealing how a single nucleotide mutation transformed a specific SINE element into an active CTCF binding site carrier specifically in Mus caroli , which resulted in thousands of novel, species-specific CTCF binding sites. Our results show that the comparison of matched phylogenetic sets of genomes will be an increasingly powerful strategy for understanding mammalian biology. © 2018 Thybert et al.; Published by Cold Spring Harbor Laboratory Press.
Genomic tools for behavioural ecologists to understand repeatable individual differences in behaviour.

Science.gov (United States)

Bengston, Sarah E; Dahan, Romain A; Donaldson, Zoe; Phelps, Steven M; van Oers, Kees; Sih, Andrew; Bell, Alison M

2018-06-01

Behaviour is a key interface between an animal's genome and its environment. Repeatable individual differences in behaviour have been extensively documented in animals, but the molecular underpinnings of behavioural variation among individuals within natural populations remain largely unknown. Here, we offer a critical review of when molecular techniques may yield new insights, and we provide specific guidance on how and whether the latest tools available are appropriate given different resources, system and organismal constraints, and experimental designs. Integrating molecular genetic techniques with other strategies to study the proximal causes of behaviour provides opportunities to expand rapidly into new avenues of exploration. Such endeavours will enable us to better understand how repeatable individual differences in behaviour have evolved, how they are expressed and how they can be maintained within natural populations of animals.
Identification of genes containing expanded purine repeats in the human genome and their apparent protective role against cancer.

Science.gov (United States)

Singh, Himanshu Narayan; Rajeswari, Moganty R

2016-01-01

Purine repeat sequences present in a gene are unique in that they have a high propensity to form unusual DNA triple-helix structures. Friedreich's ataxia is the only human disease that is well known to be associated with DNA triplexes formed by purine repeats. The purpose of this study was to identify the expanded purine repeats (EPRs) in the human genome and to examine their correlation with cancer pathogenesis. We developed the "PuRepeatFinder.pl" algorithm to identify non-overlapping EPRs without pyrimidine interruptions in the human genome and customized it to search for repeat lengths n ≥ 200. A total of 1158 EPRs were identified in the genome, which followed a Wakeby distribution. Two hundred and ninety-six EPRs were found in genic regions of 282 genes (EPR-genes). EPR-genes were clustered by cellular function, and a large number of them were found to be enzymes or enzyme modulators. Meta-analysis of the 282 EPR-genes identified only 63 EPR-genes in association with cancer, mostly breast, lung, and blood cancers. Protein-protein interaction network analysis of all 282 EPR-genes identified proteins including those in cadherins and VEGF. The two observations, that EPRs can induce mutations under malignant conditions and that some EPR-gene products were identified in vital cell signaling-mediated pathways, together suggest a crucial role of EPRs in carcinogenesis. The new link between EPR-genes and their functionally interacting proteins adds a new dimension to the present understanding of cancer pathogenesis and can help in planning therapeutic strategies. Validation of the present results using techniques such as NGS is required to establish the role of the EPR-genes in cancer pathology.
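The search criterion stated in this abstract (uninterrupted purine runs with n ≥ 200) can be illustrated with a short Python sketch. This is not the authors' PuRepeatFinder.pl script, whose internals the abstract does not describe; the function name `find_purine_runs` and the single-strand scan are assumptions made purely for illustration.

```python
import re

def find_purine_runs(seq, min_len=200):
    """Return (start, end, length) for each maximal run of A/G bases.

    Assumption: only the given strand is scanned. A pyrimidine run on this
    strand (a purine run on the complementary strand) is not reported.
    """
    # Greedy matching makes each hit a maximal, non-overlapping run.
    pattern = re.compile(r"[AG]{%d,}" % min_len)
    return [(m.start(), m.end(), m.end() - m.start())
            for m in pattern.finditer(seq.upper())]

# Toy example with a lowered threshold so the run is easy to see.
toy = "CT" + "AG" * 5 + "C" + "AAA" + "T"
print(find_purine_runs(toy, min_len=5))
```

Any pyrimidine (C or T) ends a run, which matches the "without pyrimidine interruptions" condition in the abstract.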
The structures of bovine herpesvirus 1 virion and concatemeric DNA: implications for cleavage and packaging of herpesvirus genomes

International Nuclear Information System (INIS)

Schynts, Frederic; McVoy, Michael A.; Meurens, Francois; Detry, Bruno; Epstein, Alberto L.; Thiry, Etienne

2003-01-01

Herpesvirus genomes are often characterized by the presence of direct and inverted repeats that delineate their grouping into six structural classes. Class D genomes consist of a long (L) segment and a short (S) segment. The latter is flanked by large inverted repeats. DNA replication produces concatemers of head-to-tail linked genomes that are cleaved into unit genomes during the process of packaging DNA into capsids. Packaged class D genomes are an equimolar mixture of two isomers in which S is in either of two orientations, presumably a consequence of homologous recombination between the inverted repeats. The L segment remains predominantly fixed in a prototype (P) orientation; however, low levels of genomes having inverted L (IL) segments have been reported for some class D herpesviruses. Inefficient formation of class D IL genomes has been attributed to infrequent L segment inversion, but recent detection of frequent inverted L segments in equine herpesvirus 1 concatemers [Virology 229 (1997) 415-420] suggests that the defect may be at the level of cleavage and packaging rather than inversion. In this study, the structures of virion and concatemeric DNA of another class D herpesvirus, bovine herpesvirus 1, were determined. Virion DNA contained low levels of IL genomes, whereas concatemeric DNA contained significant amounts of L segments in both P and IL orientations. However, concatemeric termini exhibited a preponderance of L termini derived from P isomers which was comparable to the preponderance of P genomes found in virion DNA. Thus, the defect in formation of IL genomes appears to lie at the level of concatemer cleavage. These results have important implications for the mechanisms by which herpesvirus DNA cleavage and packaging occur.
Large-Scale Isolation of Microsatellites from Chinese Mitten Crab Eriocheir sinensis via a Solexa Genomic Survey

Directory of Open Access Journals (Sweden)

Qun Wang

2012-12-01

Full Text Available Microsatellites are simple sequence repeats with a high degree of polymorphism in the genome; they are used as DNA markers in many molecular genetic studies. Using traditional methods such as the magnetic beads enrichment method, only a few microsatellite markers have been isolated from the Chinese mitten crab Eriocheir sinensis, as the crab genome sequence information is unavailable. Here, we have identified a large number of microsatellites from the Chinese mitten crab by taking advantage of Solexa genomic surveying. A total of 141,737 SSR (simple sequence repeat) motifs were identified via analysis of 883 Mb of the crab genomic DNA information, including mono-, di-, tri-, tetra-, penta- and hexa-nucleotide repeat motifs. The number of di-nucleotide repeat motifs was 82,979, making this the most abundant type of repeat motif (58.54%); the second most abundant were the tri-nucleotide repeats (42,657, 30.11%). Among di-nucleotide repeats, the most frequent repeats were AC motifs, accounting for 67.55% of the total number. AGG motifs were the most frequent (59.32%) of the tri-nucleotide motifs. A total of 15,125 microsatellite loci had a flanking sequence suitable for setting the primer of a polymerase chain reaction (PCR). To verify the identified SSRs, a subset of 100 primer pairs was randomly selected for PCR. Eighty-two primer sets (82%) produced strong PCR products matching expected sizes, and 78% were polymorphic. In an analysis of 30 wild individuals from the Yangtze River with 20 primer sets, the number of alleles per locus ranged from 2–14 and the mean allelic richness was 7.4. No linkage disequilibrium was found between any pair of loci, indicating that the markers were independent. The Hardy-Weinberg equilibrium test showed significant deviation in four of the 20 microsatellite loci after sequential Bonferroni corrections. This method is cost- and time-effective in comparison to traditional approaches for the isolation of microsatellites.
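As a rough illustration of how mono- to hexa-nucleotide SSR motifs can be located in sequence data, here is a minimal Python sketch based on regular expressions. The abstract does not name the software or the repeat thresholds used in the study, so the function names and the `MIN_REPEATS` values below are hypothetical and chosen only for illustration.

```python
import re

# Hypothetical minimum repeat counts per motif length (not from the abstract).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def is_primitive(motif):
    """True if the motif is not itself a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k)
                   for k in range(1, n))

def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs."""
    seq = seq.upper()
    hits = []
    for k, n in min_repeats.items():
        # A k-base motif followed by at least n-1 further exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            if is_primitive(motif):
                hits.append((m.start(), motif, len(m.group(0)) // k))
    return sorted(hits)

toy = "GG" + "AC" * 7 + "TT" + "AGG" * 5 + "C"
print(find_ssrs(toy))
```

The sketch reports only perfect (uninterrupted) repeats; compound or imperfect microsatellites, and the selection of loci with flanking sequence suitable for primer design, are outside its scope.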
A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum.

Science.gov (United States)

Cheng, Jiaowen; Zhao, Zicheng; Li, Bo; Qin, Cheng; Wu, Zhiming; Trejo-Saavedra, Diana L; Luo, Xirong; Cui, Junjie; Rivera-Bustamante, Rafael F; Li, Shuaicheng; Hu, Kailin

2016-01-07

The sequences of the full set of pepper genomes, including nuclear, mitochondrial and chloroplast, are now available for use. However, the overall distribution of simple sequence repeats (SSRs) in these genomes and its practical implications for molecular marker development in Capsicum have not yet been described. Here, an average of 868,047.50, 45.50 and 30.00 SSR loci were identified in the nuclear, mitochondrial and chloroplast genomes of pepper, respectively. Subsequently, systematic comparisons of various species, genome types, motif lengths, repeat numbers and classified types were executed and discussed. In addition, a local database composed of 113,500 in silico unique SSR primer pairs was built using a homemade bioinformatics workflow. As a pilot study, 65 polymorphic markers were validated among a wide collection of 21 Capsicum genotypes, with the allele number and polymorphic information content value per marker ranging from 2 to 6 and 0.05 to 0.64, respectively. Finally, a comparison of the clustering results with those of a previous study indicated the usability of the newly developed SSR markers. In summary, this first report on the comprehensive characterization of SSR motifs in pepper genomes and the very large set of SSR primer pairs will benefit various genetic studies in Capsicum.
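The polymorphic information content (PIC) values quoted above are per-marker summaries of allele frequencies. The abstract does not state which formula was used; the sketch below assumes the commonly used definition, PIC = 1 − Σ p_i² − Σ_{i<j} 2 p_i² p_j², and the function name `pic` is an illustrative choice.

```python
from itertools import combinations

def pic(freqs):
    """Polymorphic information content from a list of allele frequencies.

    Assumed formula (not confirmed by the abstract):
    PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    """
    if abs(sum(freqs) - 1.0) > 1e-6:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in freqs)
    cross_term = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return 1.0 - homozygosity - cross_term

# Two equally frequent alleles: 1 - 0.5 - 0.125 = 0.375
print(pic([0.5, 0.5]))
```

Under this definition a marker with one dominant allele gives a PIC close to 0, while many evenly distributed alleles push the value toward 1.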
[Progress of genome engineering technology via clustered regularly interspaced short palindromic repeats--a review].

Science.gov (United States)

Li, Hao; Qiu, Shaofu; Song, Hongbin

2013-10-04

In survival competition with phages, bacteria and archaea gradually evolved an acquired immune system, clustered regularly interspaced short palindromic repeats (CRISPR), characterized by the use of transcribed crRNA and CRISPR-associated (Cas) proteins to specifically silence or cleave foreign double-stranded DNA. In recent years, strong interest has arisen in this primitive prokaryotic immune system, and much in-depth research is under way. Recently, researchers successfully repurposed CRISPR as an RNA-guided platform for sequence-specific regulation of gene expression, which provides a simple approach for selectively perturbing gene expression on a genome-wide scale. It will undoubtedly bring genome engineering into a more convenient and accurate new era.
The complete chloroplast genome sequence of Taxus chinensis var. mairei (Taxaceae): loss of an inverted repeat region and comparative analysis with related species.

Science.gov (United States)

Zhang, Yanzhen; Ma, Ji; Yang, Bingxian; Li, Ruyi; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Zhang, Lin

2014-05-01

Taxus chinensis var. mairei (Taxaceae) is a domestic yew variety in China. This plant is one of the sources of paclitaxel, which has been a promising antineoplastic chemotherapy drug during the last decade. We have sequenced the complete chloroplast (cp) genome of T. chinensis var. mairei. The T. chinensis var. mairei cp genome is 129,513 bp in length, with 113 single copy genes and two duplicated genes (trnI-CAU, trnQ-UUG). Among the 113 single copy genes, 9 are intron-containing. Compared to other land plant cp genomes, the T. chinensis var. mairei cp genome has lost one of the large inverted repeats (IRs) found in angiosperms, ferns, liverworts, and gymnosperms such as Cycas revoluta and Ginkgo biloba L. Compared to related species, the gene order of T. chinensis var. mairei has a large inversion of ~110 kb including 91 genes (from rps18 to accD) with gene contents unarranged. Repeat analysis identified 48 direct and 2 inverted repeats 30 bp long or longer with a sequence identity greater than 90%. Repeated short segments were found in genes rps18, rps19 and clpP. Analysis also revealed 22 simple sequence repeat (SSR) loci, almost all of which are composed of A or T. Copyright © 2014 Elsevier B.V. All rights reserved.
Determinants of Genomic RNA Encapsidation in the Saccharomyces cerevisiae Long Terminal Repeat Retrotransposons Ty1 and Ty3

Directory of Open Access Journals (Sweden)

Katarzyna Pachulska-Wieczorek

2016-07-01

Full Text Available Long-terminal repeat (LTR) retrotransposons are transposable genetic elements that replicate intracellularly, and can be considered progenitors of retroviruses. Ty1 and Ty3 are the most extensively characterized LTR retrotransposons whose RNA genomes provide the template for both protein translation and genomic RNA that is packaged into virus-like particles (VLPs) and reverse transcribed. Genomic RNAs are not divided into separate pools of translated and packaged RNAs, therefore their trafficking and packaging into VLPs requires an equilibrium between competing events. In this review, we focus on Ty1 and Ty3 genomic RNA trafficking and packaging as essential steps of retrotransposon propagation. We summarize the existing knowledge on genomic RNA sequences and structures essential to these processes, the role of Gag proteins in repression of genomic RNA translation, delivery to VLP assembly sites, and encapsidation.
Analysis of simple sequence repeats in the Gaeumannomyces graminis var. tritici genome and the development of microsatellite markers.

Science.gov (United States)

Li, Wei; Feng, Yanxia; Sun, Haiyan; Deng, Yuanyu; Yu, Hanshou; Chen, Huaigu

2014-11-01

Understanding the genetic structure of Gaeumannomyces graminis var. tritici is essential for the establishment of efficient disease control strategies. It is becoming clear that microsatellites, or simple sequence repeats (SSRs), play an important role in genome organization and phenotypic diversity, and are a large source of genetic markers for population genetics and meiotic maps. In this study, we examined the G. graminis var. tritici genome (1) to analyze its pattern of SSRs, (2) to compare it with other plant pathogenic filamentous fungi, such as Magnaporthe oryzae and M. poae, and (3) to identify new polymorphic SSR markers for genetic diversity. The G. graminis var. tritici genome was rich in SSRs; a total of 13,650 SSRs were identified, with mononucleotides being the most common motifs. In coding regions, the densities of tri- and hexanucleotides were significantly higher than in noncoding regions. The di-, tri-, tetra-, penta-, and hexanucleotide repeats in the G. graminis var. tritici genome were more abundant than the same repeats in M. oryzae and M. poae. Of 115 primers designed, 39 SSRs were polymorphic among G. graminis var. tritici isolates, and 8 primers were randomly selected to analyze 116 isolates from China. The number of alleles varied from 2 to 7 and the expected heterozygosity (He) from 0.499 to 0.837. In conclusion, the SSRs developed in this study were highly polymorphic, and our analysis indicated that G. graminis var. tritici is a species with high genetic diversity. The results provide a pioneering report for several applications, such as the assessment of population structure and genetic diversity of G. graminis var. tritici.
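The expected heterozygosity (He) reported for these markers is conventionally computed from allele frequencies as He = 1 − Σ p_i². The following Python sketch assumes that standard definition, since the abstract does not specify the estimator; the function name and the optional small-sample correction n/(n − 1) are illustrative assumptions.

```python
from collections import Counter

def expected_heterozygosity(alleles, unbiased=False):
    """Gene diversity He = 1 - sum(p_i^2) from a list of observed alleles.

    `alleles` holds one entry per sampled allele copy (e.g. fragment sizes).
    With unbiased=True the value is scaled by n / (n - 1), an assumed
    small-sample correction that the abstract does not mention.
    """
    n = len(alleles)
    if n == 0:
        raise ValueError("no alleles supplied")
    counts = Counter(alleles)
    he = 1.0 - sum((c / n) ** 2 for c in counts.values())
    if unbiased and n > 1:
        he *= n / (n - 1)
    return he

# Two alleles (150 bp and 154 bp) observed twice each: He = 0.5
print(expected_heterozygosity([150, 150, 154, 154]))
```

A locus with a single allele gives He = 0, and the value rises as the number of alleles grows and their frequencies become more even.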
In silico analysis of Simple Sequence Repeats from chloroplast genomes of Solanaceae species

Directory of Open Access Journals (Sweden)

Evandro Vagner Tambarussi

2009-01-01

Full Text Available The availability of chloroplast genome (cpDNA) sequences of Atropa belladonna, Nicotiana sylvestris, N. tabacum, N. tomentosiformis, Solanum bulbocastanum, S. lycopersicum and S. tuberosum, which are Solanaceae species, allowed us to analyze the organization of cpSSRs in their genic and intergenic regions. In general, the number of cpSSRs in cpDNA ranged from 161 in S. tuberosum to 226 in N. tabacum, and the number of intergenic cpSSRs was higher than genic cpSSRs. The mononucleotide repeats were the most frequent in studied species, but we also identified di-, tri-, tetra-, penta- and hexanucleotide repeats. Multiple alignments of all cpSSRs sequences from Solanaceae species made the identification of nucleotide variability possible and the phylogeny was estimated by maximum parsimony. Our study showed that the plastome database can be exploited for phylogenetic analysis and biotechnological approaches.
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa

Directory of Open Access Journals (Sweden)

Shahin Arwa

2012-11-01

Full Text Available Abstract Background Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Results Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Conclusions
Generation and analysis of expressed sequence tags in the extreme large genomes Lilium and Tulipa.

Science.gov (United States)

Shahin, Arwa; van Kaauwen, Martijn; Esselink, Danny; Bargsten, Joachim W; van Tuyl, Jaap M; Visser, Richard G F; Arens, Paul

2012-11-20

Bulbous flowers such as lily and tulip (Liliaceae family) are monocot perennial herbs that are economically very important ornamental plants worldwide. However, there are hardly any genetic studies performed and genomic resources are lacking. To build genomic resources and develop tools to speed up the breeding in both crops, next generation sequencing was implemented. We sequenced and assembled transcriptomes of four lily and five tulip genotypes using 454 pyro-sequencing technology. Successfully, we developed the first set of 81,791 contigs with an average length of 514 bp for tulip, and enriched the very limited number of 3,329 available ESTs (Expressed Sequence Tags) for lily with 52,172 contigs with an average length of 555 bp. The contigs together with singletons covered on average 37% of lily and 39% of tulip estimated transcriptome. Mining lily and tulip sequence data for SSRs (Simple Sequence Repeats) showed that di-nucleotide repeats were twice more abundant in UTRs (UnTranslated Regions) compared to coding regions, while tri-nucleotide repeats were equally spread over coding and UTR regions. Two sets of single nucleotide polymorphism (SNP) markers suitable for high throughput genotyping were developed. In the first set, no SNPs flanking the target SNP (50 bp on either side) were allowed. In the second set, one SNP in the flanking regions was allowed, which resulted in a 2 to 3 fold increase in SNP marker numbers compared with the first set. Orthologous groups between the two flower bulbs: lily and tulip (12,017 groups) and among the three monocot species: lily, tulip, and rice (6,900 groups) were determined using OrthoMCL. Orthologous groups were screened for common SNP markers and EST-SSRs to study synteny between lily and tulip, which resulted in 113 common SNP markers and 292 common EST-SSR. Lily and tulip contigs generated were annotated and described according to Gene Ontology terminology. Two transcriptome sets were built that are valuable
Vast diversity of prokaryotic virus genomes encoding double jelly-roll major capsid proteins uncovered by genomic and metagenomic sequence analysis.

Science.gov (United States)

Yutin, Natalya; Bäckström, Disa; Ettema, Thijs J G; Krupovic, Mart; Koonin, Eugene V

2018-04-10

Analysis of metagenomic sequences has become the principal approach for the study of the diversity of viruses. Many recent, extensive metagenomic studies on several classes of viruses have dramatically expanded the visible part of the virosphere, showing that previously undetected viruses, or those that have been considered rare, actually are important components of the global virome. We investigated the provenance of viruses related to tail-less bacteriophages of the family Tectiviridae by searching genomic and metagenomics sequence databases for distant homologs of the tectivirus-like Double Jelly-Roll major capsid proteins (DJR MCP). These searches resulted in the identification of numerous genomes of virus-like elements that are similar in size to tectiviruses (10-15 kilobases) and have diverse gene compositions. By comparison of the gene repertoires, the DJR MCP-encoding genomes were classified into 6 distinct groups that can be predicted to differ in reproduction strategies and host ranges. Only the DJR MCP gene that is present by design is shared by all these genomes, and most also encode a predicted DNA-packaging ATPase; the rest of the genes are present only in subgroups of this unexpectedly diverse collection of DJR MCP-encoding genomes. Only a minority encode a DNA polymerase which is a hallmark of the family Tectiviridae and the putative family "Autolykiviridae". Notably, one of the identified putative DJR MCP viruses encodes a homolog of Cas1 endonuclease, the integrase involved in CRISPR-Cas adaptation and integration of transposon-like elements called casposons. This is the first detected occurrence of Cas1 in a virus. Many of the identified elements are individual contigs flanked by inverted or direct repeats and appear to represent complete, extrachromosomal viral genomes, whereas others are flanked by bacterial genes and thus can be considered as proviruses. These contigs come from metagenomes of widely different environments, some dominated by
Repeat-aware modeling and correction of short read errors.

Science.gov (United States)

Yang, Xiao; Aluru, Srinivas; Dorman, Karin S

2011-02-15

High-throughput short read sequencing is revolutionizing genomics and systems biology research by enabling cost-effective deep coverage sequencing of genomes and transcriptomes. Error detection and correction are crucial to many short read sequencing applications including de novo genome sequencing, genome resequencing, and digital gene expression analysis. Short read error detection is typically carried out by counting the observed frequencies of kmers in reads and validating those with frequencies exceeding a threshold. In case of genomes with high repeat content, an erroneous kmer may be frequently observed if it has few nucleotide differences with valid kmers with multiple occurrences in the genome. Error detection and correction were mostly applied to genomes with low repeat content and this remains a challenging problem for genomes with high repeat content. We develop a statistical model and a computational method for error detection and correction in the presence of genomic repeats. We propose a method to infer genomic frequencies of kmers from their observed frequencies by analyzing the misread relationships among observed kmers. We also propose a method to estimate the threshold useful for validating kmers whose estimated genomic frequency exceeds the threshold. We demonstrate that superior error detection is achieved using these methods. Furthermore, we break away from the common assumption of uniformly distributed errors within a read, and provide a framework to model position-dependent error occurrence frequencies common to many short read platforms. Lastly, we achieve better error correction in genomes with high repeat content. The software is implemented in C++ and is freely available under GNU GPL3 license and Boost Software V1.0 license at "http://aluru-sun.ece.iastate.edu/doku.php?id=redeem". We introduce a statistical framework to model sequencing errors in next-generation reads, which led to promising results in detecting and correcting errors
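The conventional baseline that this abstract describes, counting kmer frequencies in reads and trusting those above a threshold, can be sketched in a few lines of Python. This illustrates only that simple baseline, not the authors' repeat-aware statistical model; the function names and the `min_count` parameter are hypothetical.

```python
from collections import Counter

def count_kmers(reads, k):
    """Count every length-k substring across all reads."""
    counts = Counter()
    for read in reads:
        for i in range(len(read) - k + 1):
            counts[read[i:i + k]] += 1
    return counts

def trusted_kmers(counts, min_count):
    """Keep kmers observed at least min_count times (hypothetical cutoff)."""
    return {kmer for kmer, c in counts.items() if c >= min_count}

reads = ["ACGTAC", "ACGTAC", "ACGTTC"]  # third read carries one mismatch
counts = count_kmers(reads, k=4)
print(sorted(trusted_kmers(counts, min_count=2)))
```

In this toy input the kmers unique to the third read fall below the cutoff and are treated as likely errors. As the abstract points out, a fixed cutoff of this kind becomes unreliable in repeat-rich genomes, where an erroneous kmer that differs by a few bases from a high-copy kmer can itself be observed many times.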
Genome-wide characterization of centromeric satellites from multiple mammalian genomes.

Science.gov (United States)

Alkan, Can; Cardone, Maria Francesca; Catacchio, Claudia Rita; Antonacci, Francesca; O'Brien, Stephen J; Ryder, Oliver A; Purgato, Stefania; Zoli, Monica; Della Valle, Giuliano; Eichler, Evan E; Ventura, Mario

2011-01-01

Despite its importance in cell biology and evolution, the centromere has remained the final frontier in genome assembly and annotation due to its complex repeat structure. However, isolation and characterization of the centromeric repeats from newly sequenced species are necessary for a complete understanding of genome evolution and function. In recent years, various genomes have been sequenced, but the characterization of the corresponding centromeric DNA has lagged behind. Here, we present a computational method (RepeatNet) to systematically identify higher-order repeat structures from unassembled whole-genome shotgun sequence and test whether these sequence elements correspond to functional centromeric sequences. We analyzed genome datasets from six species of mammals representing the diversity of the mammalian lineage, namely, horse, dog, elephant, armadillo, opossum, and platypus. We define candidate monomer satellite repeats and demonstrate centromeric localization for five of the six genomes. Our analysis revealed the greatest diversity of centromeric sequences in horse and dog in contrast to elephant and armadillo, which showed high-centromeric sequence homogeneity. We could not isolate centromeric sequences within the platypus genome, suggesting that centromeres in platypus are not enriched in satellite DNA. Our method can be applied to the characterization of thousands of other vertebrate genomes anticipated for sequencing in the near future, providing an important tool for annotation of centromeres.

Genome-scale portrait and evolutionary significance of human-specific core promoter tri- and tetranucleotide short tandem repeats.

Science.gov (United States)

Nazaripanah, N; Adelirad, F; Delbari, A; Sahaf, R; Abbasi-Asl, T; Ohadi, M

2018-04-05

While there is an ongoing trend to identify single nucleotide substitutions (SNSs) that are linked to inter/intra-species differences and disease phenotypes, short tandem repeats (STRs)/microsatellites may be of equal (if not more) importance in the above processes. Genes that contain STRs in their promoters have higher expression divergence compared to genes with fixed or no STRs in the gene promoters. In line with the above, recent reports indicate a role of repetitive sequences in the rise of young transcription start sites (TSSs) in human evolution. Following a comparative genomics study of all human protein-coding genes annotated in the GeneCards database, here we provide a genome-scale portrait of human-specific short- and medium-size (≥ 3-repeats) tri- and tetranucleotide STRs and STR motifs in the critical core promoter region between - 120 and + 1 to the TSS and evidence of skewing of this compartment in reference to the STRs that are not human-specific (Levene's test p human-specific transcripts was detected in the tri and tetra human-specific compartments (mid-p genome-scale skewing of STRs at a specific region of the human genome and a link between a number of these STRs and TSS selection/transcript specificity. The STRs and genes listed here may have a role in the evolution and development of characteristics and phenotypes that are unique to the human species.
Evolutionary and biotechnology implications of plastid genome variation in the inverted-repeat-lacking clade of legumes.

Science.gov (United States)

Sabir, Jamal; Schwarz, Erika; Ellison, Nicholas; Zhang, Jin; Baeshen, Nabih A; Mutwakil, Muhammed; Jansen, Robert; Ruhlman, Tracey

2014-08-01

Land plant plastid genomes (plastomes) provide a tractable model for evolutionary study in that they are relatively compact and gene dense. Among the groups that display an appropriate level of variation for structural features, the inverted-repeat-lacking clade (IRLC) of papilionoid legumes presents the potential to advance general understanding of the mechanisms of genomic evolution. Here, six complete plastome sequences are presented from economically important species of the IRLC, a lineage previously represented by only five completed plastomes. A number of characters are compared across the IRLC including gene retention and divergence, synteny, repeat structure and functional gene transfer to the nucleus. The loss of clpP intron 2 was identified in one newly sequenced member of the IRLC, Glycyrrhiza glabra. Deeply sequenced nuclear transcriptomes from two species helped clarify the nature of the functional transfer of accD to the nucleus in Trifolium, which likely occurred in the lineage leading to subgenus Trifolium. Legumes are second only to cereal crops in agricultural importance based on area harvested and total production. Genetic improvement via plastid transformation of IRLC crop species is an appealing proposition. Comparative analyses of intergenic spacer regions emphasize the need for complete genome sequences for developing transformation vectors for plastid genetic engineering of legume crops. © 2014 Society for Experimental Biology, Association of Applied Biologists and John Wiley & Sons Ltd.
Inter-simple sequence repeat (ISSR) loci mapping in the genome of perennial ryegrass

DEFF Research Database (Denmark)

Pivorienė, O; Pašakinskienė, I; Brazauskas, G

2008-01-01

The aim of this study was to identify and characterize new ISSR markers and their loci in the genome of perennial ryegrass. A subsample of the VrnA F2 mapping family of perennial ryegrass comprising 92 individuals was used to develop a linkage map including inter-simple sequence repeat markers...... demonstrated a 70% similarity to the Hordeum vulgare germin gene GerA. Inter-SSR mapping will provide useful information for gene targeting, quantitative trait loci mapping and marker-assisted selection in perennial ryegrass....
A Simple Method to Decode the Complete 18-5.8-28S rRNA Repeated Units of Green Algae by Genome Skimming.

Science.gov (United States)

Lin, Geng-Ming; Lai, Yu-Heng; Audira, Gilbert; Hsiao, Chung-Der

2017-11-06

The green algae Chlorella ellipsoidea, Haematococcus pluvialis and Aegagropila linnaei (Phylum Chlorophyta) were simultaneously decoded by a genome skimming approach within the 18-5.8-28S rRNA region. Whole genomic DNAs were isolated from the green algae and directly subjected to low-coverage genome skimming sequencing. After de novo assembly and mapping, the sizes of the complete 18-5.8-28S rRNA repeated units for the three green algae ranged from 5785 to 6028 bp, which showed high nucleotide diversity (π is around 0.5-0.6) within the ITS1 and ITS2 (Internal Transcribed Spacer) regions. Previously, the evolutionary diversity of algae has been difficult to decode due to the inability to design universal primers that amplify specific marker genes across diverse algal species. In this study, our method provided a rapid and universal approach to decode the 18-5.8-28S rRNA repeat unit in three green algal species. In addition, the completely sequenced 18-5.8-28S rRNA repeated units provided, for the first time, a solid nuclear marker for phylogenetic and evolutionary analysis of green algae.
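The nucleotide diversity (π) quoted above is conventionally the average number of pairwise differences per aligned site. The sketch below shows that calculation on toy sequences; the function name and the input strings are hypothetical, and the record does not say how the authors handled alignment gaps, so gaps are simply treated as ordinary characters here.

```python
from itertools import combinations

def nucleotide_diversity(aligned):
    """Average pairwise differences per site for equal-length aligned sequences.

    Illustrative only: requires at least two sequences and ignores gap handling.
    """
    length = len(aligned[0])
    if any(len(s) != length for s in aligned):
        raise ValueError("sequences must be aligned to equal length")
    pairs = list(combinations(aligned, 2))
    # Count mismatching columns for every pair of sequences.
    total = sum(sum(a != b for a, b in zip(x, y)) for x, y in pairs)
    return total / (len(pairs) * length)

# Toy alignment (not real ITS data): 4 pairwise differences over 3 pairs x 4 sites.
print(nucleotide_diversity(["ACGT", "ACGA", "TCGA"]))
```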
Flanking sequence determination and specific PCR identification of transgenic wheat B102-1-2.

Science.gov (United States)

Cao, Jijuan; Xu, Junyi; Zhao, Tongtong; Cao, Dongmei; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2014-01-01

The exogenous fragment sequence and flanking sequence between the exogenous fragment and recombinant chromosome of transgenic wheat B102-1-2 were successfully acquired using genome walking technology. The newly acquired exogenous fragment encoded the full-length sequence of transformed genes with transformed plasmid and corresponding functional genes including ubi, vector pBANF-bar, vector pUbiGUSPlus, vector HSP, reporter vector pUbiGUSPlus, promoter ubiquitin, and coli DH1. A specific polymerase chain reaction (PCR) identification method for transgenic wheat B102-1-2 was established on the basis of designed primers according to flanking sequence. This established specific PCR strategy was validated by using transgenic wheat, transgenic corn, transgenic soybean, transgenic rice, and non-transgenic wheat. A specifically amplified target band was observed only in transgenic wheat B102-1-2. Therefore, this method is characterized by high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of transgenic wheat B102-1-2.
In situ optical sequencing and structure analysis of a trinucleotide repeat genome region by localization microscopy after specific COMBO-FISH nano-probing

Science.gov (United States)

Stuhlmüller, M.; Schwarz-Finsterle, J.; Fey, E.; Lux, J.; Bach, M.; Cremer, C.; Hinderhofer, K.; Hausmann, M.; Hildenbrand, G.

2015-10-01

Trinucleotide repeat expansions (like (CGG)n) of chromatin in the genome of cell nuclei can cause neurological disorders such as Fragile-X syndrome. Until now the mechanisms by which these expansions develop during cell proliferation have not been clearly understood. Therefore in situ investigations of chromatin structures on the nanoscale are required to better understand supra-molecular mechanisms on the single cell level. By super-resolution localization microscopy (Spectral Position Determination Microscopy; SPDM) in combination with nano-probing using COMBO-FISH (COMBinatorial Oligonucleotide FISH), novel insights into the nano-architecture of the genome will become possible. The native spatial structure of trinucleotide repeat expansion genome regions was analysed and optical sequencing of repetitive units was performed within 3D-conserved nuclei using SPDM after COMBO-FISH. We analysed a (CGG)n-expansion region inside the 5' untranslated region of the FMR1 gene. The number of CGG repeats for a full mutation causing the Fragile-X syndrome was found and also verified by Southern blot. The FMR1 promoter region was condensed similarly to a centromeric region whereas the arrangement of the probes labelling the expansion region seemed to indicate a loop-like nano-structure. These results for the first time demonstrate that in situ chromatin structure measurements on the nanoscale are feasible. With further methodological progress it will become possible to estimate the state of trinucleotide repeat mutations in detail and to determine the associated chromatin strand structural changes on the single cell level. In general, the application of the described approach to any genome region will lead to new insights into genome nano-architecture and open new avenues for understanding mechanisms and their relevance in the development of hereditary diseases.
The Methanosarcina barkeri genome: comparative analysis with Methanosarcina acetivorans and Methanosarcina mazei reveals extensive rearrangement within methanosarcinal genomes

Energy Technology Data Exchange (ETDEWEB)

Maeder, Dennis L.; Anderson, Iain; Brettin, Thomas S.; Bruce, David C.; Gilna, Paul; Han, Cliff S.; Lapidus, Alla; Metcalf, William W.; Saunders, Elizabeth; Tapia, Roxanne; Sowers, Kevin R.

2006-05-19

We report here a comparative analysis of the genome sequence of Methanosarcina barkeri with those of Methanosarcina acetivorans and Methanosarcina mazei. All three genomes share a conserved double origin of replication and many gene clusters. M. barkeri is distinguished by having an organization that is well conserved with respect to the other Methanosarcinae in the region proximal to the origin of replication with interspecies gene similarities as high as 95%. However it is disordered and marked by increased transposase frequency and decreased gene synteny and gene density in the proximal semi-genome. Of the 3680 open reading frames in M. barkeri, 678 had paralogs with better than 80% similarity to both M. acetivorans and M. mazei while 128 nonhypothetical orfs were unique (non-paralogous) amongst these species including a complete formate dehydrogenase operon, two genes required for N-acetylmuramic acid synthesis, a 14 gene gas vesicle cluster and a bacterial P450-specific ferredoxin reductase cluster not previously observed or characterized in this genus. A cryptic 36 kbp plasmid sequence was detected in M. barkeri that contains an orc1 gene flanked by a presumptive origin of replication consisting of 38 tandem repeats of a 143 nt motif. Three-way comparison of these genomes reveals differing mechanisms for the accrual of changes. Elongation of the large M. acetivorans is the result of multiple gene-scale insertions and duplications uniformly distributed in that genome, while M. barkeri is characterized by localized inversions associated with the loss of gene content. In contrast, the relatively short M. mazei most closely approximates the ancestral organizational state.
Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: Evidence for differences and commonalities in size distributions and size restrictions

NARCIS (Netherlands)

M. Schaap (Michiel); R.J.L.F. Lemmers (Richard); R. Maassen (Roel); P.J. van der Vliet (Patrick); L.F. Hoogerheide (Lennart); H.K. van Dijk (Herman); N. Basturk (Nalan); P. de Knijff (Peter); S.M. van der Maarel (Silvère)

2013-01-01

Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and
Genome-wide analysis of macrosatellite repeat copy number variation in worldwide populations: evidence for differences and commonalities in size distributions and size restrictions

NARCIS (Netherlands)

Schaap, M.; Lemmers, R.J.L.F.; Maassen, R.; van der Vliet, P.J.; Hoogerheide, L.F.; van Dijk, H.K.; Basturk, N.; de Knijff, P.; van der Maarel, S.M.

2013-01-01

Background: Macrosatellite repeats (MSRs), usually spanning hundreds of kilobases of genomic DNA, comprise a significant proportion of the human genome. Because of their highly polymorphic nature, MSRs represent an extreme example of copy number variation, but their structure and function is largely
The first insight into the Salvia (Lamiaceae) genome via BAC library construction and high-throughput sequencing of target BAC clones

International Nuclear Information System (INIS)

Hao, D.C.; Vautrin, S.; Berges, H.; Chen, S.L.

2015-01-01

Salvia is a representative genus of Lamiaceae, a eudicot family with significant species diversity and population adaptability. One of the key goals of Salvia genomics research is to identify genes of adaptive significance. This information may help to improve the conservation of adaptive genetic variation and the management of medicinal plants to increase their health and productivity. Large-insert genomic libraries are a fundamental tool for achieving this purpose. We report herein the construction, characterization and screening of a gridded BAC library for Salvia officinalis (sage). The S. officinalis BAC library consists of 17,764 clones and the average insert size is 107 Kb, corresponding to 3 haploid genome equivalents. Seventeen positive clones (average insert size 115 Kb) containing five terpene synthase (TPS) genes were screened out by PCR and 12 of them were subjected to Illumina HiSeq 2000 sequencing, which yielded 28,097,480 90-bp raw reads (2.53 Gb). Scaffolds containing sabinene synthase (Sab), a Sab homolog, TPS3 (kaurene synthase-like 2), copalyl diphosphate synthase 2 and one cytochrome P450 gene were retrieved via de novo assembly and annotation, which also have flanking noncoding sequences, including predicted promoters and repeat sequences. Among 2,638 repeat sequences, there are 330 amplifiable microsatellites. This BAC library provides a new resource for Lamiaceae genomic studies, including microsatellite marker development, physical mapping, comparative genomics and genome sequencing. Characterization of positive clones provided insights into the structure of the Salvia genome. These sequences will be used in the assembly of a future genome sequence for S. officinalis. (author)
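The library and sequencing figures in this abstract can be checked with simple arithmetic. The sketch below uses only the numbers quoted above; the implied haploid genome size is a back-calculation that assumes "3 haploid genome equivalents" is an exact coverage figure, which the abstract does not state, so it should be read as a rough estimate only.

```python
# Figures quoted in the abstract
clones = 17_764
avg_insert_kb = 107
genome_equivalents = 3          # assumed to be exact for this estimate
reads = 28_097_480
read_len_bp = 90

# Total DNA held in the BAC library
library_bp = clones * avg_insert_kb * 1_000

# Haploid genome size implied by the stated coverage (an inference, not a reported value)
implied_genome_mb = library_bp / genome_equivalents / 1e6

# Raw sequencing yield from the Illumina run
raw_bp = reads * read_len_bp

print(f"library: {library_bp / 1e9:.2f} Gb")
print(f"implied haploid genome: about {implied_genome_mb:.0f} Mb")
print(f"raw reads: {raw_bp / 1e9:.2f} Gb")
```

The raw-read total reproduces the 2.53 Gb given in the abstract.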
The LINEs and SINEs of Entamoeba histolytica: comparative analysis and genomic distribution.

Science.gov (United States)

Bakre, Abhijeet A; Rawal, Kamal; Ramaswamy, Ram; Bhattacharya, Alok; Bhattacharya, Sudha

2005-07-01

Autonomous non-long terminal repeat retrotransposons are commonly referred to as long interspersed elements (LINEs). Short non-autonomous elements that borrow the LINE machinery are called SINEs. The Entamoeba histolytica genome contains three classes of LINEs and SINEs. Together the EhLINEs/SINEs account for about 6% of the genome. The recognizable functional domains in all three EhLINEs included reverse transcriptase and endonuclease. A novel feature was the presence of two types of members: some with a single long ORF (less frequent) and some with two ORFs (more frequent) in both EhLINE1 and 2. The two ORFs were generated by conserved changes leading to a stop codon. Computational analysis of the immediate flanking sequences for each element showed that they inserted in AT-rich sequences, with a preponderance of Ts in the upstream site. The elements were very frequently located close to protein-coding genes and other EhLINEs/SINEs. The possible influence of these elements on expression of neighboring genes needs to be determined.
Simple sequence repeats in mycobacterial genomes

Indian Academy of Sciences (India)

2006-12-18

Dec 18, 2006 ... Although prokaryotic genomes derive some plasticity due to microsatellite mutations they have in-built mechanisms to arrest undue expansions of microsatellites and one such mechanism is constituted by post-replicative DNA repair enzymes MutL, MutH and MutS. The mycobacterial genomes lack these ...
Surface waves on the tailward flanks of the Earth's magnetopause

Science.gov (United States)

Seon, J.; Frank, L. A.; Lazarus, A. J.; Lepping, R. P.

1995-01-01

Forty-three examples of ISEE 1 tailward flank side magnetopause crossings are examined and directly compared with upstream solar wind parameters. The crossings are classified into two groups. In the first group, a few sudden magnetopause crossings are observed, whereas repeated magnetopause crossings and oscillatory motions, often with boundary layer signatures, are observed in the second group. These distinctive characteristics of the two groups are interpreted in terms of the surface waves due to the Kelvin-Helmholtz instability. It is found that low solar wind speed tends to favor characteristics of the first group, whereas high solar wind speed yields those of the second group. However, no evident correlations between the groups and the interplanetary magnetic field directions are found.
Evaluation of tetranucleotide repeat locus D7S809 (wg1g9) in the Japanese population.

Science.gov (United States)

Tamaki, K; Huang, X L; Nozawa, H; Yamamoto, T; Uchihi, R; Katsumata, Y; Armour, J A

1996-08-15

The tetrameric short tandem repeat (STR) locus (D7S809) has been evaluated in the Japanese population. In order to detect the alleles, PCR was carried out using primers, one of which was end labelled with 32P, and PCR products were separated by electrophoresis on a denaturing polyacrylamide gel. Using this method, accurate genotypes could be determined from as little as 0.5 ng of genomic DNA. Thirteen different alleles were identified on 256 chromosomes tested. All alleles differed in size by one (4 bp) repeat unit, and no "interalleles" were found. The estimated heterozygosity and the polymorphism information content (PIC) were 0.86 and 0.83, respectively. We observed 42 of the 91 possible different genotypes. The power of discrimination (PD) was 0.96, and no significant deviations from the Hardy-Weinberg equilibrium were found. We retyped all apparently homozygous samples using an alternative pair of flanking primers in order to confirm homozygosity. We also demonstrated a typing result involving sexual assault. D7S809 appears to be a very useful STR locus for forensic practice in the Japanese population.
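The "91 possible different genotypes" follows directly from 13 alleles: n(n+1)/2 = 13 × 14 / 2 = 91. The sketch below shows that count together with the standard textbook formulas for expected heterozygosity (1 − Σp²) and PIC (Botstein et al.); the function names are hypothetical, the allele frequencies in the usage lines are made up for illustration, and the abstract does not say which estimator the authors used.

```python
from itertools import combinations

def possible_genotypes(n_alleles):
    """Homozygotes plus heterozygotes for a diploid locus: n(n+1)/2."""
    return n_alleles * (n_alleles + 1) // 2

def expected_heterozygosity(freqs):
    """1 - sum(p_i^2) for allele frequencies that sum to 1."""
    return 1.0 - sum(p * p for p in freqs)

def pic(freqs):
    """Polymorphism information content: H - sum_{i<j} 2 * p_i^2 * p_j^2."""
    cross = sum(2 * (p * p) * (q * q) for p, q in combinations(freqs, 2))
    return expected_heterozygosity(freqs) - cross

print(possible_genotypes(13))            # 13 alleles, as reported for D7S809

# Illustrative frequencies only (four equally common alleles), not D7S809 data
toy = [0.25, 0.25, 0.25, 0.25]
print(expected_heterozygosity(toy), pic(toy))
```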
Genome wide analysis of acute myeloid leukemia reveal leukemia specific methylome and subtype specific hypomethylation of repeats.

Directory of Open Access Journals (Sweden)

Marwa H Saied

Full Text Available Methylated DNA immunoprecipitation followed by high-throughput sequencing (MeDIP-seq) has the potential to identify changes in DNA methylation important in cancer development. In order to understand the role of epigenetic modulation in the development of acute myeloid leukemia (AML) we have applied MeDIP-seq to the DNA of 12 AML patients and 4 normal bone marrows. This analysis revealed leukemia-associated differentially methylated regions that included gene promoters, gene bodies, CpG islands and CpG island shores. Two genes (SPHKAP and DPP6) with significantly methylated promoters were of interest and further analysis of their expression showed them to be repressed in AML. We also demonstrated considerable cytogenetic subtype specificity in the methylomes affecting different genomic features. Significantly distinct patterns of hypomethylation of certain interspersed repeat elements were associated with cytogenetic subtypes. The methylation patterns of members of the SINE family tightly clustered all leukemic patients with an enrichment of Alu repeats with a high CpG density (P<0.0001). We were able to demonstrate significant inverse correlation between intragenic interspersed repeat sequence methylation and gene expression with SINEs showing the strongest inverse correlation (R² = 0.7). We conclude that the alterations in DNA methylation that accompany the development of AML affect not only the promoters, but also the non-promoter genomic features, with significant demethylation of certain interspersed repeat DNA elements being associated with AML cytogenetic subtypes. MeDIP-seq data were validated using bisulfite pyrosequencing and the Infinium array.
Complex analyses of inverted repeats in mitochondrial genomes revealed their importance and variability.

Science.gov (United States)

Cechová, Jana; Lýsek, Jirí; Bartas, Martin; Brázda, Václav

2018-04-01

The NCBI database contains mitochondrial DNA (mtDNA) genomes from numerous species. We investigated the presence and locations of inverted repeat sequences (IRs) in these mtDNA sequences, which are known to be important for regulating nuclear genomes. IRs were identified in mtDNA in all species. IR lengths and frequencies correlate with evolutionary age and the greatest variability was detected in subgroups of plants and fungi and the lowest variability in mammals. IR presence is non-random and evolutionary favoured. The frequency of IRs generally decreased with IR length, but not for IRs 24 or 30 bp long, which are 1.5 times more abundant. IRs are enriched in sequences from the replication origin, followed by D-loop, stem-loop and miscellaneous sequences, pointing to the importance of IRs in regulatory regions of mitochondrial DNA. Data were produced using Palindrome analyser, freely available on the web at http://bioinformatics.ibp.cz. vaclav@ibp.cz. Supplementary data are available at Bioinformatics online.
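To make the notion of an inverted repeat concrete, the following minimal sketch scans a sequence for a stretch followed, after an optional spacer, by its own reverse complement. It is not the Palindrome analyser algorithm (which is not described in this record); the function names, the default arm length and the example sequence are all hypothetical, and only perfect (mismatch-free) repeats are reported.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]

def find_inverted_repeats(seq, arm_len=6, max_spacer=4):
    """Return (start, arm_len, spacer) for perfect inverted repeats.

    Illustrative brute-force scan; real tools also allow mismatches
    and a range of arm lengths.
    """
    seq = seq.upper()
    hits = []
    for i in range(len(seq) - 2 * arm_len + 1):
        # The right arm must equal the reverse complement of the left arm.
        target = reverse_complement(seq[i:i + arm_len])
        for spacer in range(max_spacer + 1):
            j = i + arm_len + spacer
            if j + arm_len > len(seq):
                break
            if seq[j:j + arm_len] == target:
                hits.append((i, arm_len, spacer))
    return hits

# Toy sequence: arm ACGTGA, 2-bp spacer TT, then the reverse complement TCACGT
print(find_inverted_repeats("CCACGTGATTTCACGTCC"))
```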
A genomic audit of newly-adopted autosomal STRs for forensic identification.

Science.gov (United States)

Phillips, C

2017-07-01

In preparation for the growing use of massively parallel sequencing (MPS) technology to genotype forensic STRs, a comprehensive genomic audit of 73 STRs was made in 2016 [Parson et al., Forensic Sci. Int. Genet. 22, 54-63]. The loci examined included miniSTRs that were not in widespread use, but had been incorporated into MPS kits or were under consideration for this purpose. The current study expands the genomic analysis of autosomal STRs that are not commonly used, to include the full set of developed miniSTRs and an additional 24 STRs, most of which have been recently included in several supplementary forensic multiplex kits for capillary electrophoresis. The genomic audit of these 47 newly-adopted STRs examined the linkage status of new loci on the same chromosome as established forensic STRs; analyzed world-wide population variation of the newly-adopted STRs using published data; assessed their forensic informativeness; and compiled the sequence characteristics, repeat structures and flanking regions of each STR. A further 44 autosomal STRs developed for forensic analyses but not incorporated into commercial kits, are also briefly described. Copyright © 2017 Elsevier B.V. All rights reserved.
Identification of functional SNPs in the 5-prime flanking sequences of human genes

Directory of Open Access Journals (Sweden)

Lenhard Boris

2005-02-01

Full Text Available Abstract Background Over 4 million single nucleotide polymorphisms (SNPs) are currently reported to exist within the human genome. Only a small fraction of these SNPs alter gene function or expression, and therefore might be associated with a cell phenotype. These functional SNPs are consequently important in understanding human health. Information related to functional SNPs in candidate disease genes is critical for cost effective genetic association studies, which attempt to understand the genetics of complex diseases like diabetes, Alzheimer's, etc. Robust methods for the identification of functional SNPs are therefore crucial. We report one such experimental approach. Results Sequences conserved between mouse and human genomes, within 5 kilobases of the 5-prime end of 176 GPCR genes, were screened for SNPs. Sequences flanking these SNPs were scored for transcription factor binding sites. Allelic pairs resulting in a significant score difference were predicted to influence the binding of transcription factors (TFs). Ten such SNPs were selected for mobility shift assays (EMSA), resulting in 7 of them exhibiting a reproducible shift. The full-length promoter regions with 4 of the 7 SNPs were cloned in a Luciferase based plasmid reporter system. Two out of the 4 SNPs exhibited differential promoter activity in several human cell lines. Conclusions We propose a method for effective selection of functional, regulatory SNPs that are located in evolutionarily conserved 5-prime flanking regions (5'-FR) of human genes and influence the activity of the transcriptional regulatory region. Some SNPs behave differently in different cell types.
Journal of Biosciences | Indian Academy of Sciences

Indian Academy of Sciences (India)

Flanking regulatory long terminal repeats (LTRs) in Human endogenous retrovirus (HERV) is a kind of typical DNA repeat that is widespread in the human genome. Currently, many algorithms have been developed to detect the latent periodicity of a wide range of DNA repeats. However, no such attempt was made for HERV ...
Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms.

Science.gov (United States)

Ruhlman, Tracey; Lee, Seung-Bum; Jansen, Robert K; Hostetler, Jessica B; Tallon, Luke J; Town, Christopher D; Daniell, Henry

2006-08-31

Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats ≥ 30 bp with a sequence identity ≥ 90%. Phylogenetic analysis of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of Daucus with Panax in the euasterid II clade. These

Complete plastid genome sequence of Daucus carota: Implications for biotechnology and phylogeny of angiosperms

Directory of Open Access Journals (Sweden)

Ruhlman Tracey

2006-08-01

Full Text Available Abstract Background Carrot (Daucus carota) is a major food crop in the US and worldwide. Its capacity for storage and its lifecycle as a biennial make it an attractive species for the introduction of foreign genes, especially for oral delivery of vaccines and other therapeutic proteins. Until recently efforts to express recombinant proteins in carrot have had limited success in terms of protein accumulation in the edible tap roots. Plastid genetic engineering offers the potential to overcome this limitation, as demonstrated by the accumulation of BADH in chromoplasts of carrot taproots to confer exceedingly high levels of salt resistance. The complete plastid genome of carrot provides essential information required for genetic engineering. Additionally, the sequence data add to the rapidly growing database of plastid genomes for assessing phylogenetic relationships among angiosperms. Results The complete carrot plastid genome is 155,911 bp in length, with 115 unique genes and 21 duplicated genes within the IR. There are four ribosomal RNAs, 30 distinct tRNA genes and 18 intron-containing genes. Repeat analysis reveals 12 direct and 2 inverted repeats ≥ 30 bp with a sequence identity ≥ 90%. Phylogenetic analysis of nucleotide sequences for 61 protein-coding genes using both maximum parsimony (MP) and maximum likelihood (ML) were performed for 29 angiosperms. Phylogenies from both methods provide strong support for the monophyly of several major angiosperm clades, including monocots, eudicots, rosids, asterids, eurosids II, euasterids I, and euasterids II. Conclusion The carrot plastid genome contains a number of dispersed direct and inverted repeats scattered throughout coding and non-coding regions. This is the first sequenced plastid genome of the family Apiaceae and only the second published genome sequence of the species-rich euasterid II clade. Both MP and ML trees provide very strong support (100% bootstrap) for the sister relationship of
GREAM: A Web Server to Short-List Potentially Important Genomic Repeat Elements Based on Over-/Under-Representation in Specific Chromosomal Locations, Such as the Gene Neighborhoods, within or across 17 Mammalian Species.

Directory of Open Access Journals (Sweden)

Darshan Shimoga Chandrashekar

Full Text Available Genome-wide repeat sequences, such as LINEs, SINEs and LTRs share a considerable part of the mammalian nuclear genomes. These repeat elements seem to be important for multiple functions including the regulation of transcription initiation, alternative splicing and DNA methylation. But it is not possible to study all repeats and, hence, it would help to short-list before exploring their potential functional significance via experimental studies and/or detailed in silico analyses. We developed the 'Genomic Repeat Element Analyzer for Mammals' (GREAM) for analysis, screening and selection of potentially important mammalian genomic repeats. This web-server offers many novel utilities. For example, this is the only tool that can reveal a categorized list of specific types of transposons, retro-transposons and other genome-wide repetitive elements that are statistically over-/under-represented in regions around a set of genes, such as those expressed differentially in a disease condition. The output displays the position and frequency of identified elements within the specified regions. In addition, GREAM offers two other types of analyses of genomic repeat sequences: (a) enrichment within chromosomal region(s) of interest, and (b) comparative distribution across the neighborhood of orthologous genes. GREAM successfully short-listed a repeat element (MER20) known to contain functional motifs. In other case studies, we could use GREAM to short-list repetitive elements in the azoospermia factor a (AZFa) region of the human Y chromosome and those around the genes associated with rat liver injury. GREAM could also identify five over-represented repeats around some of the human and mouse transcription factor coding genes that had conserved expression patterns across the two species. GREAM has been developed to provide an impetus to research on the role of repetitive sequences in mammalian genomes by offering easy selection of more interesting repeats in various
Development and characterization of highly polymorphic long TC repeat microsatellite markers for genetic analysis of peanut

Directory of Open Access Journals (Sweden)

Macedo Selma E

2012-02-01

Full Text Available Abstract Background Peanut (Arachis hypogaea L.) is a crop of economic and social importance, mainly in tropical areas, and developing countries. Its molecular breeding has been hindered by a shortage of polymorphic genetic markers due to a very narrow genetic base. Microsatellites (SSRs) are markers of choice in peanut because they are co-dominant, highly transferable between species and easily applicable in the allotetraploid genome. In spite of substantial effort over the last few years by a number of research groups, the number of SSRs that are polymorphic for A. hypogaea is still limiting for routine application, creating the demand for the discovery of more markers polymorphic within cultivated germplasm. Findings A plasmid genomic library enriched for TC/AG repeats was constructed and 1401 clones sequenced. From the sequences obtained 146 primer pairs flanking mostly TC microsatellites were developed. The average number of repeat motifs amplified was 23. These 146 markers were characterized on 22 genotypes of cultivated peanut. In total 78 of the markers were polymorphic within cultivated germplasm. Most of those 78 markers were highly informative with an average of 5.4 alleles per locus being amplified. Average gene diversity index (GD) was 0.6, and 66 markers showed a GD of more than 0.5. Genetic relationship analysis was performed and corroborated the current taxonomical classification of A. hypogaea subspecies and varieties. Conclusions The microsatellite markers described here are a useful resource for genetics and genomics in Arachis. In particular, the 66 markers that are highly polymorphic in cultivated peanut are a significant step towards routine genetic mapping and marker-assisted selection for the crop.
Analysis of simple sequence repeats in rice bean (Vigna umbellata) using an SSR-enriched library

Directory of Open Access Journals (Sweden)

Lixia Wang

2016-02-01

Full Text Available Rice bean (Vigna umbellata Thunb.), a warm-season annual legume, is grown in Asia mainly for dried grain or fodder and plays an important role in human and animal nutrition because the grains are rich in protein and some essential fatty acids and minerals. With the aim of expediting the genetic improvement of rice bean, we initiated a project to develop genomic resources and tools for molecular breeding in this little-known but important crop. Here we report the construction of an SSR-enriched genomic library from DNA extracted from pooled young leaf tissues of 22 rice bean genotypes and the development of SSR markers. In 433,562 reads generated by a Roche 454 GS-FLX sequencer, we identified 261,458 SSRs, of which 48.8% were of compound form. Dinucleotide repeats were predominant with an absolute proportion of 81.6%, followed by trinucleotides (17.8%). Other types together accounted for 0.6%. The motif AC/GT accounted for 77.7% of the total, followed by AAG/CTT (14.3%), and all others accounted for 12.0%. Among the flanking sequences, 2928 matched putative genes or gene models in the protein database of Arabidopsis thaliana, corresponding with 608 non-redundant Gene Ontology terms. Of these sequences, 11.2% were involved in cellular components, 24.2% were involved in molecular functions, and 64.6% were associated with biological processes. Based on homolog analysis, 1595 flanking sequences were similar to mung bean and 500 to common bean genomic sequences. Comparative mapping was conducted using 350 sequences homologous to both mung bean and common bean sequences. Finally, a set of primer pairs were designed, and a validation test showed that 58 of 220 new primers can be used in rice bean and 53 can be transferred to mung bean. However, only 11 were polymorphic when tested on 32 rice bean varieties. We propose that this study lays the groundwork for developing novel SSR markers and will enhance the mapping of qualitative and quantitative traits and marker
The number of genes encoding repeat domain-containing proteins positively correlates with genome size in amoebal giant viruses

Science.gov (United States)

Shukla, Avi; Chatterjee, Anirvan

2018-01-01

Abstract Curiously, in viruses, the virion volume appears to be predominantly driven by genome length rather than the number of proteins it encodes or geometric constraints. With their large genome and giant particle size, amoebal viruses (AVs) are ideally suited to study the relationship between genome and virion size and explore the role of genome plasticity in their evolutionary success. Different genomic regions of AVs exhibit distinct genealogies. Although the vertically transferred core genes and their functions are universally conserved across the nucleocytoplasmic large DNA virus (NCLDV) families and are essential for their replication, the horizontally acquired genes are variable across families and are lineage-specific. When compared with other giant virus families, we observed a near–linear increase in the number of genes encoding repeat domain-containing proteins (RDCPs) with the increase in the genome size of AVs. From what is known about the functions of RDCPs in bacteria and eukaryotes and their prevalence in the AV genomes, we envisage important roles for RDCPs in the life cycle of AVs, their genome expansion, and plasticity. This observation also supports the evolution of AVs from a smaller viral ancestor by the acquisition of diverse gene families from the environment including RDCPs that might have helped in host adaption. PMID:29308275
Comparative genomics of CytR, an unusual member of the LacI family of transcription factors.

Directory of Open Access Journals (Sweden)

Natalia V Sernova

Full Text Available CytR is a transcription regulator from the LacI family, present in some gamma-proteobacteria including Escherichia coli and known not only for its cellular role, control of transport and utilization of nucleosides, but also for a number of unusual structural properties. The present study addressed three related problems: structure of CytR-binding sites and motifs, their evolutionary conservation, and identification of new members of the CytR regulon. While the majority of CytR-binding sites are imperfect inverted repeats situated between binding sites for another transcription factor, CRP, other architectures were observed, in particular, direct repeats. While the similarity between sites for different genes in one genome is rather low, and hence the consensus motif is weak, there is high conservation of orthologous sites in different genomes (mainly in the Enterobacteriales), arguing for the presence of specific CytR-DNA contacts. On larger evolutionary distances candidate CytR sites may migrate but the approximate distance between flanking CRP sites tends to be conserved, which demonstrates that the overall structure of the CRP-CytR-DNA complex is gene-specific. The analysis yielded candidate CytR-binding sites for orthologs of known regulon members in less studied genomes of the Enterobacteriales and Vibrionales and identified a new candidate member of the CytR regulon, encoding a transporter named NupT (YcdZ).
Leucine-Rich repeat receptor kinases are sporadically distributed in eukaryotic genomes

Directory of Open Access Journals (Sweden)

Diévart Anne

2011-12-01

Full Text Available Abstract Background Plant leucine-rich repeat receptor-like kinases (LRR-RLKs) are receptor kinases that contain LRRs in their extracellular domain. In the last 15 years, many research groups have demonstrated major roles played by LRR-RLKs in plants during almost all developmental processes throughout the life of the plant and in defense/resistance against a large range of pathogens. Recently, a breakthrough has been made in this field that challenges the dogma of the specificity of plant LRR-RLKs. Results We analyzed ~1000 complete genomes and show that LRR-RK genes have now been identified in 8 non-plant genomes. We performed an exhaustive phylogenetic analysis of all of these receptors, revealing that all of the LRR-containing receptor subfamilies form lineage-specific clades. Our results suggest that the association of LRRs with RKs appeared independently at least four times in eukaryotic evolutionary history. Moreover, the molecular evolutionary history of the LRR-RKs found in oomycetes is reminiscent of the pattern observed in plants: expansion with amplification/deletion and evolution of the domain organization leading to the functional diversification of members of the gene family. Finally, the expression data suggest that oomycete LRR-RKs may play a role in several stages of the oomycete life cycle. Conclusions In view of the key roles that LRR-RLKs play throughout the entire lifetime of plants and plant-environment interactions, the emergence and expansion of this type of receptor in several phyla along the evolution of eukaryotes, and particularly in oomycete genomes, questions their intrinsic functions in mimicry and/or in the coevolution of receptors between hosts and pathogens.
Analysis of transposons and repeat composition of the sunflower (Helianthus annuus L.) genome.

Science.gov (United States)

Cavallini, Andrea; Natali, Lucia; Zuccolo, Andrea; Giordani, Tommaso; Jurman, Irena; Ferrillo, Veronica; Vitacolonna, Nicola; Sarri, Vania; Cattonaro, Federica; Ceccarelli, Marilena; Cionini, Pier Giorgio; Morgante, Michele

2010-02-01

A sample-sequencing strategy combined with slot-blot hybridization and FISH was used to study the composition of the repetitive component of the sunflower genome. One thousand six hundred thirty-eight sequences for a total of 954,517 bp were analyzed. The fraction of sequences that can be classified as repetitive using computational and hybridization approaches amounts to 62% in total. Almost two thirds remain as yet uncharacterized in nature. Of those characterized, most belong to the gypsy superfamily of LTR-retrotransposons. Unlike in other species, where single families can account for large fractions of the genome, it appears that no transposon family has been amplified to very high levels in sunflower. All other known classes of transposable elements were also found. One family of unknown nature (contig 61) was the most repeated in the sunflower genome. The evolution of the repetitive component in the Helianthus genus and in other Asteraceae was studied by comparative analysis of the hybridization of total genomic DNAs from these species to the sunflower small-insert library and compared to gene-based phylogeny. Very little similarity is observed between Helianthus species and two related Asteraceae species outside of the genus. Most repetitive elements are similar in annual and perennial Helianthus species indicating that sequence amplification largely predates such divergence. Gypsy-like elements are more represented in the annuals than in the perennials, while copia-like elements are similarly represented, attesting a different amplification history of the two superfamilies of LTR-retrotransposons in the Helianthus genus.
Genomic organization of the canine herpesvirus US region.

Science.gov (United States)

Haanes, E J; Tomlinson, C C

1998-02-01

Canine herpesvirus (CHV) is an alpha-herpesvirus of limited pathogenicity in healthy adult dogs and infectivity of the virus appears to be largely limited to cells of canine origin. CHV's low virulence and species specificity make it an attractive candidate for a recombinant vaccine vector to protect dogs against a variety of pathogens. As part of the analysis of the CHV genome, the authors determined the complete nucleotide sequence of the CHV US region as well as portions of the flanking inverted repeats. Seven full open reading frames (ORFs) encoding proteins larger than 100 amino acids were identified within, or partially within the CHV US: cUS2, cUS3, cUS4, cUS6, cUS7, cUS8 and cUS9; which are homologs of the herpes simplex virus type-1 US2; protein kinase; gG, gD, gI, gE; and US9 genes, respectively. An eighth ORF was identified in the inverted repeat region, cIR6, a homolog of the equine herpesvirus type-1 IR6 gene. The authors identified and mapped most of the major transcripts for the predicted CHV US ORFs by Northern analysis.
Right psoas abscess following right flank trauma: a case report ...

African Journals Online (AJOL)

This is a case of a 15-year-old boy who presented with a three-week history of right flank pain, a two-week history of fever and a five-day history of inability to walk well. There was a history of right flank trauma a week before the onset of the right flank pain. He had earlier presented at two different hospitals before he was brought to our ...
Analysis of the genome sequence of the pathogenic Muscovy duck parvovirus strain YY reveals a 14-nucleotide-pair deletion in the inverted terminal repeats.

Science.gov (United States)

Wang, Jianye; Huang, Yu; Zhou, Mingxu; Zhu, Guoqiang

2016-09-01

Genomic information about Muscovy duck parvovirus (MDPV) is still limited. In this study, the genome of the pathogenic MDPV strain YY was sequenced. The full-length genome of YY is 5075 nucleotides (nt) long, 57 nt shorter than that of strain FM. Sequence alignment indicates that the 5' and 3' inverted terminal repeats (ITR) of strain YY contain a 14-nucleotide-pair deletion in the stem of the palindromic hairpin structure in comparison to strain FM and FZ91-30. The deleted region contains one "E-box" site and one repeated motif with the sequence "TTCCGGT" or "ACCGGAA". Phylogenetic trees constructed based on the protein-coding genes concordantly showed that YY, together with nine other MDPV isolates from various places, clustered in a separate branch, distinct from the branch formed by goose parvovirus (GPV) strains. These results demonstrate that, despite the distinctive deletion, the YY strain still belongs to the classical MDPV group. Moreover, the deletion in the ITR may contribute to the genome evolution of MDPV under immunization pressure.
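The two spellings of the repeated motif quoted in this abstract, "TTCCGGT" and "ACCGGAA", are reverse complements of each other, so they describe the same site read on opposite strands. A minimal Python sketch of that check follows; the helper name reverse_complement is illustrative and is not taken from the study.

```python
# Map each DNA base to its Watson-Crick partner.
COMPLEMENT = str.maketrans("ACGT", "TGCA")

def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA string (A, C, G, T only)."""
    return seq.upper().translate(COMPLEMENT)[::-1]

# The motif quoted in the abstract, read on the opposite strand.
motif_on_other_strand = reverse_complement("TTCCGGT")
```

Running the helper on "TTCCGGT" gives "ACCGGAA", which matches the second spelling in the abstract.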
Identification, variation and transcription of pneumococcal repeat sequences

Science.gov (United States)

2011-01-01

Background Small interspersed repeats are commonly found in many bacterial chromosomes. Two families of repeats (BOX and RUP) have previously been identified in the genome of Streptococcus pneumoniae, a nasopharyngeal commensal and respiratory pathogen of humans. However, little is known about the role they play in pneumococcal genetics. Results Analysis of the genome of S. pneumoniae ATCC 700669 revealed the presence of a third repeat family, which we have named SPRITE. All three repeats are present at a reduced density in the genome of the closely related species S. mitis. However, they are almost entirely absent from all other streptococci, although a set of elements related to the pneumococcal BOX repeat was identified in the zoonotic pathogen S. suis. In conjunction with information regarding their distribution within the pneumococcal chromosome, this suggests that it is unlikely that these repeats are specialised sequences performing a particular role for the host, but rather that they constitute parasitic elements. However, comparing insertion sites between pneumococcal sequences indicates that they appear to transpose at a much lower rate than IS elements. Some large BOX elements in S. pneumoniae were found to encode open reading frames on both strands of the genome, whilst another was found to form a composite RNA structure with two T box riboswitches. In multiple cases, such BOX elements were demonstrated as being expressed using directional RNA-seq and RT-PCR. Conclusions BOX, RUP and SPRITE repeats appear to have proliferated extensively throughout the pneumococcal chromosome during the species' past, but novel insertions are currently occurring at a relatively slow rate. Through their extensive secondary structures, they seem likely to affect the expression of genes with which they are co-transcribed. Software for annotation of these repeats is freely available from ftp://ftp.sanger.ac.uk/pub/pathogens/strep_repeats/. PMID:21333003
In silico reversal of repeat-induced point mutation (RIP) identifies the origins of repeat families and uncovers obscured duplicated genes

Directory of Open Access Journals (Sweden)

Hane James K

2010-11-01

Full Text Available Abstract Background Repeat-induced point mutation (RIP) is a fungal genome defence mechanism guarding against transposon invasion. RIP mutates the sequence of repeated DNA and over time renders the affected regions unrecognisable by similarity search tools such as BLAST. Results DeRIP is a new software tool developed to predict the original sequence of a RIP-mutated region prior to the occurrence of RIP. In this study, we apply deRIP to the genome of the wheat pathogen Stagonospora nodorum SN15 and predict the origin of several previously uncharacterised classes of repetitive DNA. Conclusions Five new classes of transposon repeats and four classes of endogenous gene repeats were identified after deRIP. The deRIP process is a new tool for fungal genomics that facilitates the identification and understanding of the role and origin of fungal repetitive DNA. DeRIP is open-source and is available as part of the RIPCAL suite at http://www.sourceforge.net/projects/ripcal.
Linkage disequilibrium in the insulin gene region: Size variation at the 5' flanking polymorphism and bimodality among "Class I" alleles

Energy Technology Data Exchange (ETDEWEB)

McGinnis, R.E.; Spielman, R.S. [Univ. of Pennsylvania School of Medicine, Philadelphia, PA (United States)

1994-09-01

The 5' flanking polymorphism (5'FP), a hypervariable region at the 5' end of the insulin gene, has "class 1" alleles (650-900 bp long) that are in positive linkage disequilibrium with insulin-dependent diabetes mellitus (IDDM). The authors report that precise sizing of the 5'FP yields a bimodal frequency distribution of class 1 allele lengths. Class 1 alleles belonging to the lower component (650-750 bp) of the bimodal distribution were somewhat more highly associated with IDDM than were alleles from the upper component (760-900 bp), but the difference was not statistically significant. They also examined 5'FP length variation in relation to allelic variation at nearby polymorphisms. At biallelic RFLPs on both sides of the 5'FP, they found that one allele exhibits near-total association with the upper component of the 5'FP class 1 distribution. Such associations represent a little-known but potentially wide-spread form of linkage disequilibrium. In this type of disequilibrium, a flanking allele has near-complete association with a single mode of VNTR alleles whose lengths represent consecutive numbers of tandem repeats (CNTR). Such extreme disequilibrium between a CNTR mode and flanking alleles may originate and persist because length mutations at some VNTR loci usually add or delete only one or two repeat units. 22 refs., 5 figs., 6 tabs.
From NGS assembly challenges to instability of fungal mitochondrial genomes: A case study in genome complexity.

Science.gov (United States)

Misas, Elizabeth; Muñoz, José Fernando; Gallo, Juan Esteban; McEwen, Juan Guillermo; Clay, Oliver Keatinge

2016-04-01

The presence of repetitive or non-unique DNA persisting over sizable regions of a eukaryotic genome can hinder the genome's successful de novo assembly from short reads: ambiguities in assigning genome locations to the non-unique subsequences can result in premature termination of contigs and thus overfragmented assemblies. Fungal mitochondrial (mtDNA) genomes are compact (typically less than 100 kb), yet often contain short non-unique sequences that can be shown to impede their successful de novo assembly in silico. Such repeats can also confuse processes in the cell in vivo. A well-studied example is ectopic (out-of-register, illegitimate) recombination associated with repeat pairs, which can lead to deletion of functionally important genes that are located between the repeats. Repeats that remain conserved over micro- or macroevolutionary timescales despite such risks may indicate functionally or structurally (e.g., for replication) important regions. This principle could form the basis of a mining strategy for accelerating discovery of function in genome sequences. We present here our screening of a sample of 11 fully sequenced fungal mitochondrial genomes by observing where exact k-mer repeats occurred several times; initial analyses motivated us to focus on 17-mers occurring more than three times. Based on the diverse repeats we observe, we propose that such screening may serve as an efficient expedient for gaining a rapid but representative first insight into the repeat landscapes of sparsely characterized mitochondrial chromosomes. Our matching of the flagged repeats to previously reported regions of interest supports the idea that systems of persisting, non-trivial repeats in genomes can often highlight features meriting further attention. Copyright © 2016 Elsevier Ltd. All rights reserved.
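The screening step described in this abstract (flagging exact k-mers that recur, with a focus on 17-mers occurring more than three times) can be sketched in a few lines of Python. This is only an illustrative sketch, not the authors' pipeline: the function name repeated_kmers, its parameters, and the toy sequence are assumptions, and the sketch scans a single strand without considering reverse complements.

```python
from collections import defaultdict

def repeated_kmers(seq, k=17, min_count=4):
    """Return {kmer: [0-based start positions]} for exact k-mers that occur
    at least min_count times (min_count=4 means "more than three times")."""
    positions = defaultdict(list)
    seq = seq.upper()
    # Slide a window of width k along the sequence and record every start.
    for i in range(len(seq) - k + 1):
        positions[seq[i:i + k]].append(i)
    return {kmer: pos for kmer, pos in positions.items() if len(pos) >= min_count}

# Toy sequence (hypothetical): a 7-nt unit repeated five times between unique flanks.
toy = "ACGTTGCA" + "GATTACA" * 5 + "TTGACC"
hits = repeated_kmers(toy, k=7, min_count=4)
```

On a real mitochondrial genome one would call repeated_kmers(genome, k=17) and compare the flagged positions with annotated features; memory use grows with the number of distinct k-mers, which is modest for genomes under about 100 kb.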
Algorithms and Complexity Results for Genome Mapping Problems.

Science.gov (United States)

Rajaraman, Ashok; Zanetti, Joao Paulo Pereira; Manuch, Jan; Chauve, Cedric

2017-01-01

Genome mapping algorithms aim at computing an ordering of a set of genomic markers based on local ordering information such as adjacencies and intervals of markers. In most genome mapping models, markers are assumed to occur uniquely in the resulting map. We introduce algorithmic questions that consider repeats, i.e., markers that can have several occurrences in the resulting map. We show that, provided with an upper bound on the copy number of repeated markers and with intervals that span full repeat copies, called repeat spanning intervals, the problem of deciding if a set of adjacencies and repeat spanning intervals admits a genome representation is tractable if the target genome can contain linear and/or circular chromosomal fragments. We also show that extracting a maximum cardinality or weight subset of repeat spanning intervals given a set of adjacencies that admits a genome realization is NP-hard but fixed-parameter tractable in the maximum copy number and the number of adjacent repeats, and tractable if intervals contain a single repeated marker.
A Mitochondrial Genome of Rhyparochromidae (Hemiptera: Heteroptera) and a Comparative Analysis of Related Mitochondrial Genomes.

Science.gov (United States)

Li, Teng; Yang, Jie; Li, Yinwan; Cui, Ying; Xie, Qiang; Bu, Wenjun; Hillis, David M

2016-10-19

The Rhyparochromidae, the largest family of Lygaeoidea, encompasses more than 1,850 described species, but no mitochondrial genome has been sequenced to date. Here we describe the first mitochondrial genome for Rhyparochromidae: a complete mitochondrial genome of Panaorus albomaculatus (Scott, 1874). This mitochondrial genome is comprised of 16,345 bp, and contains the expected 37 genes and control region. The majority of the control region is made up of a large tandem-repeat region, which has a novel pattern not previously observed in other insects. The tandem-repeats region of P. albomaculatus consists of 53 tandem duplications (including one partial repeat), which is the largest number of tandem repeats among all the known insect mitochondrial genomes. Slipped-strand mispairing during replication is likely to have generated this novel pattern of tandem repeats. Comparative analysis of tRNA gene families in sequenced Pentatomomorpha and Lygaeoidea species shows that the pattern of nucleotide conservation is markedly higher on the J-strand. Phylogenetic reconstruction based on mitochondrial genomes suggests that Rhyparochromidae is not the sister group to all the remaining Lygaeoidea, and supports the monophyly of Lygaeoidea.
Genomic organization and developmental fate of adjacent repeated sequences in a foldback DNA clone of Tetrahymena thermophila

International Nuclear Information System (INIS)

Tschunko, A.H.; Loechel, R.H.; McLaren, N.C.; Allen, S.L.

1987-01-01

DNA sequence elimination and rearrangement occurs during the development of somatic cell lineages of eukaryotes and was first discovered over a century ago. However, the significance and mechanism of chromatin elimination are not understood. DNA elimination also occurs during the development of the somatic macronucleus from the germinal micronucleus in unicellular ciliated protozoa such as Tetrahymena thermophila. In this study foldback DNA from the micronucleus was used as a probe to isolate ten clones. All of those tested (4/4) contained sequences that were repetitive in the micronucleus and rearranged in the macronucleus. Inverted repeated sequences were present in one clone. This clone, pTtFBl, was subjected to a detailed analysis of its developmental fate. Subregions were subcloned and used as probes against Southern blots of micronuclear and macronuclear DNA. DNA was labeled with [33P]-labeled dATP. The authors found that all subregions defined repeated sequence families in the micronuclear genome. A minimum of four different families was defined, two of which are retained in the macronucleus and two of which are completely eliminated. The inverted repeat family is retained with little rearrangement. Two of the families, defined by subregions that do not contain parts of the inverted repeat, are totally eliminated during macronuclear development and contain open reading frames. The significance of retained inverted repeats to the process of elimination is discussed.
Comparative and functional characterization of intragenic tandem repeats in 10 Aspergillus genomes.

Science.gov (United States)

Gibbons, John G; Rokas, Antonis

2009-03-01

Intragenic tandem repeats (ITRs) are consecutive repeats of three or more nucleotides found in coding regions. ITRs are the underlying cause of several human genetic diseases and have been associated with phenotypic variation, including pathogenesis, in several clades of the tree of life. We have examined the evolution and functional role of ITRs in 10 genomes spanning the fungal genus Aspergillus, a clade of relevance to medicine, agriculture, and industry. We identified several hundred ITRs in each of the species examined. ITR content varied extensively between species, with an average 79% of ITRs unique to a given species. For the fraction of conserved ITR regions, sequence comparisons within species and between close relatives revealed that they were highly variable. ITR-containing proteins were evolutionarily less conserved, compositionally distinct, and overrepresented for domains associated with cell-surface localization and function relative to the rest of the proteome. Furthermore, ITRs were preferentially found in proteins involved in transcription, cellular communication, and cell-type differentiation but were underrepresented in proteins involved in metabolism and energy. Importantly, although ITRs were evolutionarily labile, their functional associations appeared to be remarkably conserved across eukaryotes. Fungal ITRs likely participate in a variety of developmental processes and cell-surface-associated functions, suggesting that their contribution to fungal lifestyle and evolution may be more general than previously assumed.
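The definition at the start of this abstract (consecutive repeats of a unit of three or more nucleotides) can be illustrated with a small regular-expression scan. This is a rough sketch under stated assumptions, not the method used in the study: the function name find_tandem_repeats, the upper bound on unit length, and the minimum copy number are all illustrative choices, only perfect repeats are found, and no check is made that a hit lies inside a coding region.

```python
import re

def find_tandem_repeats(seq, min_unit=3, max_unit=12, min_copies=3):
    """Return (start, unit, copies) for perfect tandem repeats in a DNA string.

    The lazy quantifier picks the shortest qualifying unit at each start,
    so a homopolymer run is reported as copies of a min_unit-long unit.
    """
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_copies - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits

# Toy coding-like sequence (hypothetical): five CAG codons between unique flanks.
toy_cds = "ATGCCA" + "CAG" * 5 + "TTGA"
itr_hits = find_tandem_repeats(toy_cds)
```

For the toy sequence the scan reports a single hit: unit CAG, five copies, starting at 0-based position 6.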
Deletion of Repeats in the Alpha C Protein Enhances the Pathogenicity of Group B Streptococci in Immune Mice

OpenAIRE

Gravekamp, C.; Rosner, Bernard; Madoff, L. C.

1998-01-01

The alpha C protein is a protective surface-associated antigen of group B streptococci (GBS). The prototype alpha C protein of GBS (strain A909) contains nine identical tandem repeats, each comprising 82 amino acids, flanked by N- and C-terminal domains. Clinical isolates of GBS show variable numbers of repeats with a normal distribution and a median of 9 to 10 repeats. Here, we show that escape mutants of GBS expressing one-repeat alpha C protein were 100-fold more pathogenic than GBS expres...

Understanding Etna flank instability through numerical models

Science.gov (United States)

Apuani, Tiziana; Corazzato, Claudia; Merri, Andrea; Tibaldi, Alessandro

2013-02-01

Like many active volcanoes, Mount Etna shows clear evidence of flank instability, and different mechanisms have been suggested to explain these flank dynamics, based on the recorded deformation pattern and character. Shallow and deep deformations, mainly associated with both eruptive and seismic events, are concentrated along recognised fracture and fault systems, mobilising the eastern and south-eastern flank of the volcano. Several interacting causes have been postulated to control the phenomenon, including gravity, magma ascent along the feeding system, and very complex local and/or regional tectonic activity. Nevertheless, the complexity of these dynamics is still an open subject of research and, because the volcano flanks are heavily urbanised, understanding the gravitational dynamics is a major issue for public safety and civil protection. The present research explores the effects of the main geological features (in particular the role of the subetnean clays, interposed between the Apennine-Maghrebian flysch and the volcanic products) and the role of weakness zones, identified by fracture and fault systems, on the slope instability process. The effects of magma intrusions are also investigated. The problem is addressed by integrating field data, laboratory tests and numerical modelling. Two- and three-dimensional stress-strain analyses were performed with finite difference numerical codes (FLAC and FLAC3D), mainly aimed at evaluating the relationship among geological features, volcano-tectonic structures and magmatic activity in controlling the deformation processes. The analyses are well supported by dedicated structural-mechanical field surveys, which allowed the rock mass strength and deformability parameters to be estimated. To take into account the uncertainties that inevitably arise in such a complicated model, considerable effort was devoted to a sensitivity analysis along a WNW-ESE section crossing the volcano summit and the Valle del Bove depression. This was
Inhibition of colorectal cancer genomic copy number alterations and chromosomal fragile site tumor suppressor FHIT and WWOX deletions by DNA mismatch repair

Science.gov (United States)

Gelincik, Ozkan; Blecua, Pedro; Edelmann, Winfried; Kucherlapati, Raju; Zhou, Kathy; Jasin, Maria; Gümüş, Zeynep H.; Lipkin, Steven M.

2017-01-01

Homologous recombination (HR) enables precise DNA repair after DNA double strand breaks (DSBs) using identical sequence templates, whereas homeologous recombination (HeR) uses only partially homologous sequences. Homeologous recombination introduces mutations through gene conversion and genomic deletions through single-strand annealing (SSA). DNA mismatch repair (MMR) inhibits HeR, but the roles of mammalian MMR MutL homologues (MLH1, PMS2 and MLH3) proteins in HeR suppression are poorly characterized. Here, we demonstrate that mouse embryonic fibroblasts (MEFs) carrying Mlh1, Pms2, and Mlh3 mutations have higher HeR rates, by using 7,863 uniquely mapping paired direct repeat sequences (DRs) in the mouse genome as endogenous gene conversion and SSA reporters. Additionally, when DSBs are induced by gamma-radiation, Mlh1, Pms2 and Mlh3 mutant MEFs have higher DR copy number alterations (CNAs), including DR CNA hotspots previously identified in mouse MMR-deficient colorectal cancer (dMMR CRC). Analysis of The Cancer Genome Atlas CRC data revealed that dMMR CRCs have higher genome-wide DR HeR rates than MMR proficient CRCs, and that dMMR CRCs have deletion hotspots in tumor suppressors FHIT/WWOX at chromosomal fragile sites FRA3B and FRA16D (which have elevated DSB rates) flanked by paired homologous DRs and inverted repeats (IR). Overall, these data provide novel insights into the MMR-dependent HeR inhibition mechanism and its role in tumor suppression. PMID:29069730
Genome-wide Comparative Analyses Reveal the Dynamic Evolution of Nucleotide-Binding Leucine-Rich Repeat Gene Family among Solanaceae Plants

Directory of Open Access Journals (Sweden)

Eunyoung Seo

2016-08-01

Full Text Available Plants have evolved an elaborate innate immune system against invading pathogens. Within this system, intracellular nucleotide-binding leucine-rich repeat (NLR) immune receptors are known to play critical roles in effector-triggered immunity (ETI) plant defense. We performed genome-wide identification and classification of NLR-coding sequences from the genomes of pepper, tomato, and potato using fixed criteria. We then compared genomic duplication and evolution features. We identified 267, 443, and 755 intact NLR-encoding genes in tomato, potato, and pepper genomes, respectively. Phylogenetic analyses and classification of Solanaceae NLRs revealed that the majority of NLR super family members fell into 14 subgroups, including a TIR-NLR (TNL) subgroup and 13 non-TNL subgroups. Specific subgroups have expanded in each genome, with the expansion in pepper showing subgroup-specific physical clusters. Comparative analysis of duplications showed distinct duplication patterns within pepper and among Solanaceae plants suggesting subgroup- or species-specific gene duplication events after speciation, resulting in divergent evolution. Taken together, genome-wide analyses of NLR family members provide insights into their evolutionary history in Solanaceae. These findings also provide important foundational knowledge for understanding NLR evolution and will empower broader characterization of disease resistance genes to be used for crop breeding.
Unilateral flank ovariohysterectomy in guinea pigs (Cavia porcellus).

Science.gov (United States)

Rozanska, D; Rozanski, P; Orzelski, M; Chlebicka, N; Putowska, K

2016-11-01

To describe a simple, minimally invasive method of ovariohysterectomy via a unilateral flank approach in guinea pigs, for use in routine desexing of healthy female guinea pigs or treatment of ovarian cysts. The subjects of this retrospective study were 41 client-owned guinea pigs submitted for routine desexing or treatment of ovarian cysts. They included 16 healthy female guinea pigs aged 8-12 months (Group 1), and 15 females aged from 9 months to 3 years (Group 2), and 10 females aged from 3 to 7 years (Group 3) with different-sized ovarian cysts. Prior to surgery, the animals received clinical examination, blood testing (complete blood count and serum biochemistry profile) and examination of the abdomen using ultrasonography, to assess the condition of the reproductive tract and ensure the guinea pigs were fit for surgery. Ovariohysterectomy was performed via a unilateral flank incision made close to the erector spinae muscle starting approximately 1 cm caudal to the last rib. Both ovaries, uterine horns, and the uterine cervix were localised, ligated, and dissected through this unilateral retroperitoneal incision. Ovariohysterectomy was successfully completed via a single flank incision in 38/41 (93%) guinea pigs. Three guinea pigs with ovarian cysts from Group 3, which were >6 years old died during surgery due to circulatory and respiratory failure under anaesthesia. In the remaining 38 cases, surgery proceeded without complications. A further two guinea pigs from Group 3 were reluctant to move or eat for the first 3 days after surgery but recovered after provision of supportive care. All 38 animals fully recovered and wound healing was normal. This is the first report of ovariohysterectomy via a unilateral flank incision in guinea pigs. This approach is a simple, minimally invasive and safe alternative to the midline or bilateral flank approaches currently used for surgery of the reproductive tract in guinea pigs.
Human-specific HERV-K insertion causes genomic variations in the human genome.

Directory of Open Access Journals (Sweden)

Wonseok Shin

Full Text Available Human endogenous retrovirus (HERV) sequences account for about 8% of the human genome. Through comparative genomics and literature mining, we identified a total of 29 human-specific HERV-K insertions. We characterized them focusing on their structure and flanking sequence. The results showed that four of the human-specific HERV-K insertions deleted human genomic sequences via non-classical insertion mechanisms. Interestingly, two of the human-specific HERV-K insertion loci contained two HERV-K internals and three LTR elements, a pattern which could be explained by LTR-LTR ectopic recombination or template switching. In addition, we conducted a polymorphic test and observed that twelve out of the 29 elements are polymorphic in the human population. In conclusion, human-specific HERV-K elements have inserted into the human genome since the divergence of human and chimpanzee, causing human genomic changes. Thus, we believe that human-specific HERV-K activity has contributed to the genomic divergence between humans and chimpanzees, as well as within the human population.
Short template switch events explain mutation clusters in the human genome.

Science.gov (United States)

Löytynoja, Ari; Goldman, Nick

2017-06-01

Resequencing efforts are uncovering the extent of genetic variation in humans and provide data to study the evolutionary processes shaping our genome. One recurring puzzle in both intra- and inter-species studies is the high frequency of complex mutations comprising multiple nearby base substitutions or insertion-deletions. We devised a generalized mutation model of template switching during replication that extends existing models of genome rearrangement and used this to study the role of template switch events in the origin of short mutation clusters. Applied to the human genome, our model detects thousands of template switch events during the evolution of human and chimp from their common ancestor and hundreds of events between two independently sequenced human genomes. Although many of these are consistent with a template switch mechanism previously proposed for bacteria, our model also identifies new types of mutations that create short inversions, some flanked by paired inverted repeats. The local template switch process can create numerous complex mutation patterns, including hairpin loop structures, and explains multinucleotide mutations and compensatory substitutions without invoking positive selection, speculative mechanisms, or implausible coincidence. Clustered sequence differences are challenging for current mapping and variant calling methods, and we show that many erroneous variant annotations exist in human reference data. Local template switch events may have been neglected as an explanation for complex mutations because of biases in commonly used analyses. Incorporation of our model into reference-based analysis pipelines and comparisons of de novo assembled genomes will lead to improved understanding of genome variation and evolution. © 2017 Löytynoja and Goldman; Published by Cold Spring Harbor Laboratory Press.
Statistical analyses of conserved features of genomic islands in bacteria.

Science.gov (United States)

Guo, F-B; Xia, Z-K; Wei, W; Zhao, H-L

2014-03-17

We performed statistical analyses of five conserved features of genomic islands of bacteria. Analyses were made based on 104 known genomic islands, which were identified by comparative methods. Four of these features include sequence size, abnormal G+C content, flanking tRNA gene, and embedded mobility gene, which are frequently investigated. One relatively new feature, G+C homogeneity, was also investigated. Among the 104 known genomic islands, 88.5% were found to fall in the typical length of 10-200 kb and 80.8% had G+C deviations with absolute values larger than 2%. For the 88 genomic islands whose hosts have been sequenced and annotated, 52.3% of them were found to have flanking tRNA genes and 64.7% had embedded mobility genes. For the homogeneity feature, 85% had an h homogeneity index less than 0.1, indicating that their G+C content is relatively uniform. Taking all the five features into account, 87.5% of 88 genomic islands had three of them. Only one genomic island had only one conserved feature and none of the genomic islands had zero features. These statistical results should help to understand the general structure of known genomic islands. We found that larger genomic islands tend to have relatively small G+C deviations relative to absolute values. For example, the absolute G+C deviations of 9 genomic islands longer than 100,000 bp were all less than 5%. This is a novel but reasonable result given that larger genomic islands should have greater restrictions in their G+C contents, in order to maintain the stable G+C content of the recipient genome.
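The five features in the record above can be illustrated with a short sketch. It is not the authors' code: the function names are hypothetical, the thresholds (10-200 kb, absolute G+C deviation above 2%, h below 0.1) are simply the values quoted in the abstract, and the deviation is assumed to be a difference in G+C fraction between island and host.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the unambiguous bases of a sequence."""
    seq = seq.upper()
    gc = sum(1 for base in seq if base in "GC")
    acgt = sum(1 for base in seq if base in "ACGT")
    return gc / acgt if acgt else 0.0


def gc_deviation(island: str, host: str) -> float:
    """Island G+C fraction minus host G+C fraction (assumed definition)."""
    return gc_fraction(island) - gc_fraction(host)


def count_features(length_bp: int, gc_dev: float, has_flanking_trna: bool,
                   has_mobility_gene: bool, h_index: float) -> int:
    """Count how many of the five features quoted in the abstract are met."""
    checks = [
        10_000 <= length_bp <= 200_000,  # typical length of 10-200 kb
        abs(gc_dev) > 0.02,              # |G+C deviation| larger than 2%
        has_flanking_trna,               # flanking tRNA gene present
        has_mobility_gene,               # embedded mobility gene present
        h_index < 0.1,                   # relatively uniform G+C content
    ]
    return sum(checks)
```

For instance, a hypothetical 50-kb island with a 3% G+C deviation, a flanking tRNA gene, no mobility gene, and h = 0.05 would meet four of the five criteria under these assumptions.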
File list: InP.Emb.50.AllAg.Embryonic_flank [Chip-atlas [Archive]]

Lifescience Database Archive (English)

Full Text Available InP.Emb.50.AllAg.Embryonic_flank mm9 Input control Embryo Embryonic flank SRX804059... http://dbarchive.biosciencedbc.jp/kyushu-u/mm9/assembled/InP.Emb.50.AllAg.Embryonic_flank.bed ...
A genome-wide analysis of lentivector integration sites using targeted sequence capture and next generation sequencing technology.

Science.gov (United States)

Ustek, Duran; Sirma, Sema; Gumus, Ergun; Arikan, Muzaffer; Cakiris, Aris; Abaci, Neslihan; Mathew, Jaicy; Emrence, Zeliha; Azakli, Hulya; Cosan, Fulya; Cakar, Atilla; Parlak, Mahmut; Kursun, Olcay

2012-10-01

One application of next-generation sequencing (NGS) is the targeted resequencing of genes of interest, which has not yet been used in viral integration site analysis for gene therapy applications. Here, we combined a targeted sequence capture array with next-generation sequencing to profile viral integration sites across the whole genome. Human 293T and K562 cells were transduced with an HIV-1-derived vector. A custom-made DNA probe set targeting the pLVTHM vector was used to capture lentiviral vector/human genome junctions. The captured DNA was sequenced using the GS FLX platform. A total of 7,484 human genome sequences flanking the long terminal repeats (LTRs) of pLVTHM fragment sequences matched with an identity of at least 98% over a minimum of 50 bp in both cells. In total, 203 unique integration sites were identified. The integrations in both cell lines were distant from CpG islands and from transcription start sites and were preferentially located in introns. A comparison between the two cell lines showed that the lentiviral-transduced DNA does not have the same preferred regions in the two different cell lines. Copyright © 2012 Elsevier B.V. All rights reserved.
Discovery and analysis of an active long terminal repeat-retrotransposable element in Aspergillus oryzae.

Science.gov (United States)

Jie Jin, Feng; Hara, Seiichi; Sato, Atsushi; Koyama, Yasuji

2014-01-01

Wild-type Aspergillus oryzae RIB40 contains two copies of the AO090005001597 gene. We previously constructed an A. oryzae RIB40 strain, RKuAF8B, with multiple chromosomal deletions, in which the AO090005001597 copy number was found to be increased significantly. Sequence analysis indicated that AO090005001597 is part of a putative 6,000-bp retrotransposable element, flanked by two long terminal repeats (LTRs) of 669 bp, with characteristics of retroviruses and retrotransposons, and thus designated AoLTR (A. oryzae LTR-retrotransposable element). AoLTR comprised putative reverse transcriptase, RNase H, and integrase domains. The deduced amino acid sequence alignment of AoLTR showed 94% overall identity with AFLAV, an A. flavus Tf1/sushi retrotransposon. Quantitative real-time RT-PCR showed that AoLTR gene expression was significantly increased in RKuAF8B, in accordance with the increased copy number. Inverse PCR indicated that the full-length retrotransposable element was randomly integrated into multiple genomic locations. However, no obvious phenotypic changes were associated with the increased AoLTR gene copy number.
Simple sequence repeats and compositional bias in the bipartite Ralstonia solanacearum GMI1000 genome

Directory of Open Access Journals (Sweden)

Vandamme Peter

2003-03-01

Full Text Available Abstract Background: Ralstonia solanacearum is an important plant pathogen. The genome of R. solanacearum GMI1000 is organised into two replicons (a 3.7-Mb chromosome and a 2.1-Mb megaplasmid) and this bipartite genome structure is characteristic for most R. solanacearum strains. To determine whether the megaplasmid was acquired via recent horizontal gene transfer or is part of an ancestral single chromosome, we compared the abundance, distribution and composition of simple sequence repeats (SSRs) between both replicons and also compared the respective compositional biases. Results: Our data show that both replicons are very similar with respect to distribution and composition of SSRs and presence of compositional biases. Minor variations in SSR and compositional biases observed may be attributable to minor differences in gene expression and regulation of gene expression or can be attributed to the small sample numbers observed. Conclusions: The observed similarities indicate that both replicons have shared a similar evolutionary history and thus suggest that the megaplasmid was not recently acquired from other organisms by lateral gene transfer but is a part of an ancestral R. solanacearum chromosome.
The Whole Genome Assembly and Comparative Genomic Research of Thellungiella parvula (Extremophile Crucifer) Mitochondrion

Directory of Open Access Journals (Sweden)

Xuelin Wang

2016-01-01

Full Text Available The complete nucleotide sequence of the mitochondrial (mt) genome of an extremophile species, Thellungiella parvula (T. parvula), has been determined, with a length of 255,773 bp. The T. parvula mt genome is a circular sequence and contains 32 protein-coding genes, 19 tRNA genes, and three ribosomal RNA genes with a 11.5% coding sequence. The base composition of 27.5% A, 27.5% T, 22.7% C, and 22.3% G in descending order shows a slight bias of 55% AT. Fifty-three repeats were identified in the mitochondrial genome of T. parvula, including 24 direct repeats, 28 tandem repeats (TRs), and one palindromic repeat. Furthermore, a total of 199 perfect microsatellites with a high A/T content (83.1%) have been mined through simple sequence repeat (SSR) analysis, and they were distributed unevenly within this mitochondrial genome. We also analyzed other plant mitochondrial genomes’ evolution in general, providing clues for the understanding of the evolution of organelle genomes in plants. Compared with other Brassicaceae species, T. parvula is related to Arabidopsis thaliana, whose characters of low temperature resistance have been well documented. This study will provide important genetic tools for other Brassicaceae species research and improve yields of economically important plants.
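The base-composition figures in the record above can be checked with simple arithmetic. The sketch below is illustrative only: `base_composition` is a hypothetical helper, and the `reported` dictionary merely restates the percentages quoted in the abstract.

```python
from collections import Counter


def base_composition(seq: str) -> dict:
    """Percentage of A, C, G and T among the unambiguous bases of a sequence."""
    counts = Counter(base for base in seq.upper() if base in "ACGT")
    total = sum(counts.values())
    if total == 0:
        return {base: 0.0 for base in "ACGT"}
    return {base: 100.0 * counts[base] / total for base in "ACGT"}


# Percentages quoted in the abstract (not recomputed from the sequence).
reported = {"A": 27.5, "T": 27.5, "C": 22.7, "G": 22.3}

at_percent = reported["A"] + reported["T"]  # the reported AT bias
gc_percent = reported["C"] + reported["G"]  # the remaining G+C share
```

With the quoted values, A plus T gives the 55% AT content stated in the abstract, and the four percentages sum to 100% within rounding.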
Gene disruptions using P transposable elements: an integral component of the Drosophila genome project.

OpenAIRE

Spradling, A C; Stern, D M; Kiss, I; Roote, J; Laverty, T; Rubin, G M

1995-01-01

Biologists require genetic as well as molecular tools to decipher genomic information and ultimately to understand gene function. The Berkeley Drosophila Genome Project is addressing these needs with a massive gene disruption project that uses individual, genetically engineered P transposable elements to target open reading frames throughout the Drosophila genome. DNA flanking the insertions is sequenced, thereby placing an extensive series of genetic markers on the physical genomic map and a...
Isolation of human simple repeat loci by hybridization selection.

Science.gov (United States)

Armour, J A; Neumann, R; Gobert, S; Jeffreys, A J

1994-04-01

We have isolated short tandem repeat arrays from the human genome, using a rapid method involving filter hybridization to enrich for tri- or tetranucleotide tandem repeats. About 30% of clones from the enriched library cross-hybridize with probes containing trimeric or tetrameric tandem arrays, facilitating the rapid isolation of large numbers of clones. In an initial analysis of 54 clones, 46 different tandem arrays were identified. Analysis of these tandem repeat loci by PCR showed that 24 were polymorphic in length; substantially higher levels of polymorphism were displayed by the tetrameric repeat loci isolated than by the trimeric repeats. Primary mapping of these loci by linkage analysis showed that they derive from 17 chromosomes, including the X chromosome. We anticipate the use of this strategy for the efficient isolation of tandem repeats from other sources of genomic DNA, including DNA from flow-sorted chromosomes, and from other species.
Characterization of Genomic Deletion Efficiency Mediated by Clustered Regularly Interspaced Palindromic Repeats (CRISPR)/Cas9 Nuclease System in Mammalian Cells

Science.gov (United States)

Canver, Matthew C.; Bauer, Daniel E.; Dass, Abhishek; Yien, Yvette Y.; Chung, Jacky; Masuda, Takeshi; Maeda, Takahiro; Paw, Barry H.; Orkin, Stuart H.

2014-01-01

The clustered regularly interspaced palindromic repeats (CRISPR)/CRISPR-associated (Cas) 9 nuclease system has provided a powerful tool for genome engineering. Double strand breaks may trigger nonhomologous end joining repair, leading to frameshift mutations, or homology-directed repair using an extrachromosomal template. Alternatively, genomic deletions may be produced by a pair of double strand breaks. The efficiency of CRISPR/Cas9-mediated genomic deletions has not been systematically explored. Here, we present a methodology for the production of deletions in mammalian cells, ranging from 1.3 kb to greater than 1 Mb. We observed a high frequency of intended genomic deletions. Nondeleted alleles are nonetheless often edited with inversions or small insertion/deletions produced at CRISPR recognition sites. Deleted alleles also typically include small insertion/deletions at predicted deletion junctions. We retrieved cells with biallelic deletion at a frequency exceeding that of probabilistic expectation. We demonstrate an inverse relationship between deletion frequency and deletion size. This work suggests that CRISPR/Cas9 is a robust system to produce a spectrum of genomic deletions to allow investigation of genes and genetic elements. PMID:24907273
SSR allelic variation in almond (Prunus dulcis Mill.).

Science.gov (United States)

Xie, Hua; Sui, Yi; Chang, Feng-Qi; Xu, Yong; Ma, Rong-Cai

2006-01-01

Sixteen SSR markers including eight EST-SSR and eight genomic SSRs were used for genetic diversity analysis of 23 Chinese and 15 international almond cultivars. EST- and genomic SSR markers previously reported in species of Prunus, mainly peach, proved to be useful for almond genetic analysis. DNA sequences of 117 alleles of six of the 16 SSR loci were analysed to reveal sequence variation among the 38 almond accessions. For the four SSR loci with AG/CT repeats, no insertions or deletions were observed in the flanking regions of the 98 alleles sequenced. Allelic size variation of these loci resulted exclusively from differences in the structures of repeat motifs, which involved interruptions or occurrences of new motif repeats in addition to varying number of AG/CT repeats. Some alleles had a high number of uninterrupted repeat motifs, indicating that SSR mutational patterns differ among alleles at a given SSR locus within the almond species. Allelic homoplasy was observed in the SSR loci because of base substitutions, interruptions or compound repeat motifs. Substitutions in the repeat regions were found at two SSR loci, suggesting that point mutations operate on SSRs and hinder the further SSR expansion by introducing repeat interruptions to stabilize SSR loci. Furthermore, it was shown that some potential point mutations in the flanking regions are linked with new SSR repeat motif variation in almond and peach.
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species

Directory of Open Access Journals (Sweden)

Odvody Gary N

2008-11-01

Full Text Available Abstract Background: A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Results: Among the 55 primer pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicates that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellite primer combinations, with an average of 4.8 polymorphic markers per SSR across 34
Simple sequence repeat markers useful for sorghum downy mildew (Peronosclerospora sorghi) and related species.

Science.gov (United States)

Perumal, Ramasamy; Nimmakayala, Padmavathi; Erattaimuthu, Saradha R; No, Eun-Gyu; Reddy, Umesh K; Prom, Louis K; Odvody, Gary N; Luster, Douglas G; Magill, Clint W

2008-11-29

A recent outbreak of sorghum downy mildew in Texas has led to the discovery of both metalaxyl resistance and a new pathotype in the causal organism, Peronosclerospora sorghi. These observations and the difficulty in resolving among phylogenetically related downy mildew pathogens dramatically point out the need for simply scored markers in order to differentiate among isolates and species, and to study the population structure within these obligate oomycetes. Here we present the initial results from the use of a biotin capture method to discover, clone and develop PCR primers that permit the use of simple sequence repeats (microsatellites) to detect differences at the DNA level. Among the 55 primer pairs designed from clones from pathotype 3 of P. sorghi, 36 flanked microsatellite loci containing simple repeats, including 28 (55%) with dinucleotide repeats and 6 (11%) with trinucleotide repeats. A total of 22 microsatellites with CA/AC or GT/TG repeats were the most abundant (40%) and GA/AG or CT/TC types contribute 15% in our collection. When used to amplify DNA from 19 isolates from P. sorghi, as well as from 5 related species that cause downy mildew on other hosts, the number of different bands detected for each SSR primer pair using a LI-COR DNA Analyzer ranged from two to eight. Successful cross-amplification for 12 primer pairs studied in detail using DNA from downy mildews that attack maize (P. maydis & P. philippinensis), sugar cane (P. sacchari), pearl millet (Sclerospora graminicola) and rose (Peronospora sparsa) indicates that the flanking regions are conserved in all these species. A total of 15 SSR amplicons unique to P. philippinensis (one of the potential threats to US maize production) were detected, and these have potential for development of diagnostic tests. A total of 260 alleles were obtained using 54 microsatellite primer combinations, with an average of 4.8 polymorphic markers per SSR across 34 Peronosclerospora, Peronospora and Sclerospora
Comparative analysis of complete chloroplast genome sequence and inversion variation in Lasthenia burkei (Madieae, Asteraceae).

Science.gov (United States)

Walker, Joseph F; Zanis, Michael J; Emery, Nancy C

2014-04-01

Complete chloroplast genome studies can help resolve relationships among large, complex plant lineages such as Asteraceae. We present the first whole plastome from the Madieae tribe and compare its sequence variation to other chloroplast genomes in Asteraceae. We used high throughput sequencing to obtain the Lasthenia burkei chloroplast genome. We compared sequence structure and rates of molecular evolution in the small single copy (SSC), large single copy (LSC), and inverted repeat (IR) regions to those for eight Asteraceae accessions and one Solanaceae accession. The chloroplast sequence of L. burkei is 150 746 bp and contains 81 unique protein coding genes and 4 coding ribosomal RNA sequences. We identified three major inversions in the L. burkei chloroplast, all of which have been found in other Asteraceae lineages, and a previously unreported inversion in Lactuca sativa. Regions flanking inversions contained tRNA sequences, but did not have particularly high G + C content. Substitution rates varied among the SSC, LSC, and IR regions, and rates of evolution within each region varied among species. Some observed differences in rates of molecular evolution may be explained by the relative proportion of coding to noncoding sequence within regions. Rates of molecular evolution vary substantially within and among chloroplast genomes, and major inversion events may be promoted by the presence of tRNAs. Collectively, these results provide insight into different mechanisms that may promote intramolecular recombination and the inversion of large genomic regions in the plastome.
Carboniferous geology and uranium potential of the northeast flank of the Parana Basin and southwest flank of the Parnaiba Basin, Brazil

International Nuclear Information System (INIS)

Andrade, S.M. de; Camarco, P.E.N.

1984-01-01

The Carboniferous sequences of the northeast flank of the Parana Basin and those of the southwest flank of the Parnaiba Basin have been the subject of discussion and polemics for quite a long time, especially in terms of their stratigraphic relations and depositional environments. Thus, we reinforce our main objective, which is to furnish data for the definition of the uranium potential in these Carboniferous sediments, by adding recently acquired information that should aid in the clarification of the existing controversies. The Carboniferous along the northeast flank of the Parana Basin is represented by the Aquidauana Formation which has been informally divided into three members: lower, middle and upper members. The middle member, of marine origin, constitutes a prospective target for uranium and phosphate associations, in which sandstones interbedded with shales constitute the host rocks. On the other hand, the Carboniferous of the southwest margin of the Parnaiba Basin, which encompasses the Longa, Poti and Piaui Formations has shown very remote possibilities of uranium occurrences. The regional structural framework, as reflected by the Carboniferous rocks along both basin flanks, is characterized by homoclines cut by gravity faults. The faults along these weakness zones were occasionally intruded by basic rocks of Cretaceous age. Superimposed on the regional structure, open folds appear in the form of anticlines and domes. These folds are discontinuous structures resulting from uplift due to vertical stresses or result from differential subsidence along the limbs of the folds. (Author) [pt]

Genomic repeat abundances contain phylogenetic signal

Czech Academy of Sciences Publication Activity Database

Dodsworth, S.; Chase, M.W.; Kelly, L.J.; Leitch, I.J.; Macas, Jiří; Novák, Petr; Piednoël, M.; Weiß-Schneeweiss, H.; Leitch, A.R.

2015-01-01

Vol. 64, No. 1 (2015), pp. 112-126. ISSN 1063-5157. R&D Projects: GA ČR GBP501/12/G090. Institutional support: RVO:60077344. Keywords: repetitive DNA; continuous characters; genomics; next-generation sequencing; phylogenetics. Subject RIV: EB - Genetics; Molecular Biology. Impact factor: 8.225 (year: 2015).
A DEL phenotype attributed to RHD Exon 9 sequence deletion: slipped-strand mispairing and blood group polymorphisms.

Science.gov (United States)

Lopez, Genghis H; Turner, Robyn M; McGowan, Eunike C; Schoeman, Elizna M; Scott, Stacy A; O'Brien, Helen; Millard, Glenda M; Roulis, Eileen V; Allen, Amanda J; Liew, Yew-Wah; Flower, Robert L; Hyland, Catherine A

2018-03-01

The RhD blood group antigen is extremely polymorphic and the DEL phenotype represents one such class of polymorphisms. The DEL phenotype prevalent in East Asian populations arises from a synonymous substitution defined as RHD*1227A. However, initially, based on genomic and cDNA studies, the genetic basis for a DEL phenotype in Taiwan was attributed to a deletion of RHD Exon 9 that was never verified at the genomic level by any other independent group. Here we investigate the genetic basis for a Caucasian donor with a DEL partial D phenotype and compare the genomic findings to those initial molecular studies. The 3'-region of the RHD gene was amplified by long-range polymerase chain reaction (PCR) for massively parallel sequencing. Primers were designed to encompass a deletion, flanking Exon 9, by standard PCR for Sanger sequencing. Targeted sequencing of exons and flanking introns was also performed. Genomic DNA exhibited a 1012-bp deletion spanning from Intron 8, across Exon 9 into Intron 9. The deletion breakpoints occurred between two 25-bp repeat motifs flanking Exon 9 such that one repeat sequence remained. Deletion mutations bordered by repeat sequences are a hallmark of a slipped-strand mispairing (SSM) event. We propose that this genetic mechanism generated the germline deletion in the Caucasian donor. Extensive studies show that RHD*1227A is the most prevalent DEL allele in East Asian populations and may have confounded the initial molecular studies. Review of the literature revealed that the SSM model explains some of the extreme polymorphisms observed in the clinically significant RhD blood group antigen. © 2017 AABB.
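The outcome described in the record above, a deletion between two direct repeats that leaves a single repeat copy, can be modelled on a toy sequence. This is a minimal sketch with a hypothetical function name and invented example sequences; it models only the end product, not the replication mechanism, and is unrelated to the actual RHD sequence.

```python
from typing import Optional


def delete_between_repeats(seq: str, repeat: str) -> Optional[str]:
    """Remove one repeat copy plus the sequence between two direct repeats.

    Returns the shortened sequence, in which a single copy of the repeat
    remains at the junction, or None if fewer than two non-overlapping
    copies of the repeat are present.
    """
    first = seq.find(repeat)
    if first == -1:
        return None
    second = seq.find(repeat, first + len(repeat))
    if second == -1:
        return None
    # Keep everything before the first copy and everything from the second
    # copy onward, so exactly one copy of the repeat survives.
    return seq[:first] + seq[second:]


# Toy example: left flank + repeat + intervening segment + repeat + right flank.
toy = "AAA" + "GATTACA" + "CCCCC" + "GATTACA" + "TTT"
deleted = delete_between_repeats(toy, "GATTACA")
```

In this model the deleted length equals the intervening segment plus one repeat unit (5 + 7 = 12 bases in the toy example), which is why only one copy of the repeat is left at the breakpoint.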
Structural analysis of the 5' flanking region of the β-globin gene in African sickle cell anemia patients: Further evidence for three origins of the sickle cell mutation in Africa

International Nuclear Information System (INIS)

Chebloune, Y.; Pagnier, J.; Trabuchet, G.; Faure, C.; Verdier, G.; Labie, D.; Nigon, V.

1988-01-01

Haplotype analysis of the β-globin gene cluster shows two regions of DNA characterized by nonrandom association of restriction site polymorphisms. These regions are separated by a variable segment containing the repeated sequences (ATTTT)n and (AT)xTy, which might be involved in recombinational events. Studies of haplotypes linked to the sickle cell gene in Africa provide strong argument for three origins of the mutation: Benin, Senegal, and the Central African Republic. The structure of the variable segment in the three African populations was studied by S1 nuclease mapping of genomic DNA, which allows a comparison of several samples. A 1080-base-pair DNA segment was sequenced for one sample from each population. S1 nuclease mapping confirmed the homogeneity of each population with regard to both (ATTTT)n and (AT)xTy repeats. The authors found three additional structures for (AT)xTy correlating with the geographic origin of the patients. Ten other nucleotide positions, 5' and 3' to the (AT)xTy copies, were found to be variable when compared to homologous sequences from human and monkey DNAs. These results allow us to propose an evolutionary scheme for the polymorphisms in the 5' flanking region of the β-globin gene. The results strongly support the hypothesis of three origins for the sickle mutation in Africa.
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Science.gov (United States)

Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

2015-01-01

The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Directory of Open Access Journals (Sweden)

Jiří Macas

Full Text Available The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

Science.gov (United States)

Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

2015-10-01

Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Multi-stage volcanic island flank collapses with coeval explosive caldera-forming eruptions

OpenAIRE

Hunt, James E.; Cassidy, Michael; Talling, Peter J.

2018-01-01

Volcanic flank collapses and explosive eruptions are among the largest and most destructive processes on Earth. Events at Mount St. Helens in May 1980 demonstrated how a relatively small (<5 km3) flank collapse on a terrestrial volcano could immediately precede a devastating eruption. The lateral collapse of volcanic island flanks, such as in the Canary Islands, can be far larger (>300 km3), but can also occur in complex multiple stages. Here, we show that multistage retrogressive lands...
The first genetic map of a synthesized allohexaploid Brassica with A, B and C genomes based on simple sequence repeat markers.

Science.gov (United States)

Yang, S; Chen, S; Geng, X X; Yan, G; Li, Z Y; Meng, J L; Cowling, W A; Zhou, W J

2016-04-01

We present the first genetic map of an allohexaploid Brassica species, based on segregating microsatellite markers in a doubled haploid mapping population generated from a hybrid between two hexaploid parents. This study reports the first genetic map of trigenomic Brassica. A doubled haploid mapping population consisting of 189 lines was obtained via microspore culture from a hybrid H16-1 derived from a cross between two allohexaploid Brassica lines (7H170-1 and Y54-2). Simple sequence repeat primer pairs specific to the A genome (107), B genome (44) and C genome (109) were used to construct a genetic linkage map of the population. Twenty-seven linkage groups were resolved from 274 polymorphic loci on the A genome (109), B genome (49) and C genome (116) covering a total genetic distance of 3178.8 cM with an average distance between markers of 11.60 cM. This is the first genetic framework map for the artificially synthesized Brassica allohexaploids. The linkage groups represent the expected complement of chromosomes in the A, B and C genomes from the original diploid and tetraploid parents. This framework linkage map will be valuable for QTL analysis and future genetic improvement of a new allohexaploid Brassica species, and in improving our understanding of the genetic control of meiosis in new polyploids.
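The map statistics reported above can be cross-checked with a short calculation. This is a minimal sketch that assumes the average marker distance was obtained by dividing the total map length by the number of polymorphic loci; the abstract does not state the formula, and the variable names are illustrative.

```python
# Polymorphic loci per genome, as reported in the abstract.
loci = {"A": 109, "B": 49, "C": 116}
total_loci = sum(loci.values())

# Total genetic distance covered by the linkage map, in centimorgans.
map_length_cm = 3178.8

# Assumed formula: mean spacing = total map length / number of loci.
mean_spacing_cm = map_length_cm / total_loci

print(f"{total_loci} loci, mean spacing {mean_spacing_cm:.2f} cM")
```

Under that assumption, the per-genome counts sum to the stated 274 loci and the spacing agrees with the reported 11.60 cM.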
The phylogeny of the social wasp subfamily Polistinae: evidence from microsatellite flanking sequences, mitochondrial COI sequence, and morphological characters

Directory of Open Access Journals (Sweden)

Strassmann Joan E

2004-03-01

Full Text Available Abstract Background Social wasps in the subfamily Polistinae (Hymenoptera: Vespidae) have been important in studies of the evolution of sociality, kin selection, and within colony conflicts of interest. These studies have generally been conducted within species, because a resolved phylogeny among species is lacking. We used nuclear DNA microsatellite flanking sequences, mitochondrial COI sequence, and morphological characters to generate a phylogeny for the Polistinae (Hymenoptera) using 69 species. Results Our phylogeny is largely concordant with previous phylogenies at higher levels, and is more resolved at the species level. Our results support the monophyly of the New World subgenera of Polistini, while the Old World subgenera are a paraphyletic group. All genera for which we had more than one exemplar were supported as monophyletic except Polybia, which is not resolved and may be paraphyletic. Conclusion The combination of DNA sequences from flanks of microsatellite repeats with mtCOI sequences and morphological characters proved useful for establishing relationships among the different subgenera and species of the Polistini. This is the first detailed hypothesis for the species of this important group.
Characterization of genomic deletion efficiency mediated by clustered regularly interspaced short palindromic repeats (CRISPR)/Cas9 nuclease system in mammalian cells.

Science.gov (United States)

Canver, Matthew C; Bauer, Daniel E; Dass, Abhishek; Yien, Yvette Y; Chung, Jacky; Masuda, Takeshi; Maeda, Takahiro; Paw, Barry H; Orkin, Stuart H

2014-08-01

The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) 9 nuclease system has provided a powerful tool for genome engineering. Double strand breaks may trigger nonhomologous end joining repair, leading to frameshift mutations, or homology-directed repair using an extrachromosomal template. Alternatively, genomic deletions may be produced by a pair of double strand breaks. The efficiency of CRISPR/Cas9-mediated genomic deletions has not been systematically explored. Here, we present a methodology for the production of deletions in mammalian cells, ranging from 1.3 kb to greater than 1 Mb. We observed a high frequency of intended genomic deletions. Nondeleted alleles are nonetheless often edited with inversions or small insertion/deletions produced at CRISPR recognition sites. Deleted alleles also typically include small insertion/deletions at predicted deletion junctions. We retrieved cells with biallelic deletion at a frequency exceeding that of probabilistic expectation. We demonstrate an inverse relationship between deletion frequency and deletion size. This work suggests that CRISPR/Cas9 is a robust system to produce a spectrum of genomic deletions to allow investigation of genes and genetic elements. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.
Enhancer Identification through Comparative Genomics

Energy Technology Data Exchange (ETDEWEB)

Visel, Axel; Bristow, James; Pennacchio, Len A.

2006-10-01

With the availability of genomic sequence from numerous vertebrates, a paradigm shift has occurred in the identification of distant-acting gene regulatory elements. In contrast to traditional gene-centric studies in which investigators randomly scanned genomic fragments that flank genes of interest in functional assays, the modern approach begins electronically with publicly available comparative sequence datasets that provide investigators with prioritized lists of putative functional sequences based on their evolutionary conservation. However, although a large number of tools and resources are now available, application of comparative genomic approaches remains far from trivial. In particular, it requires users to dynamically consider the species and methods for comparison depending on the specific biological question under investigation. While there is currently no single general rule to this end, it is clear that when applied appropriately, comparative genomic approaches exponentially increase our power in generating biological hypotheses for subsequent experimental testing.
Linkage of congenital isolated adrenocorticotropic hormone deficiency to the corticotropin releasing hormone locus using simple sequence repeat polymorphisms

Energy Technology Data Exchange (ETDEWEB)

Kyllo, J.H.; Collins, M.M.; Vetter, K.L. [Univ. of Iowa College of Medicine, Iowa City, IA (United States)] [and others]

1996-03-29

Genetic screening techniques using simple sequence repeat polymorphisms were applied to investigate the molecular nature of congenital isolated adrenocorticotropic hormone (ACTH) deficiency. We hypothesize that this rare cause of hypocortisolism shared by a brother and sister with two unaffected sibs and unaffected parents is inherited as an autosomal recessive single gene mutation. Genes involved in the hypothalamic-pituitary axis controlling cortisol sufficiency were investigated for a causal role in this disorder. Southern blotting showed no detectable mutations of the gene encoding pro-opiomelanocortin (POMC), the ACTH precursor. Other candidate genes subsequently considered were those encoding neuroendocrine convertase-1, and neuroendocrine convertase-2 (NEC-1, NEC-2), and corticotropin releasing hormone (CRH). Tests for linkage were performed using polymorphic di- and tetranucleotide simple sequence repeat markers flanking the reported map locations for POMC, NEC-1, NEC-2, and CRH. The chromosomal haplotypes determined by the markers flanking the loci for POMC, NEC-1, and NEC-2 were not compatible with linkage. However, 22 individual markers defining the chromosomal haplotypes flanking CRH were compatible with linkage of the disorder to the immediate area of this gene of chromosome 8. Based on these data, we hypothesize that the ACTH deficiency in this family is due to an abnormality of CRH gene structure or expression. These results illustrate the useful application of high density genetic maps constructed with simple sequence repeat markers for inclusion/exclusion studies of candidate genes in even very small nuclear families segregating for unusual phenotypes. 25 refs., 5 figs., 2 tabs.
GRAbB : Selective Assembly of Genomic Regions, a New Niche for Genomic Research

NARCIS (Netherlands)

Brankovics, Balázs; Zhang, Hao; van Diepeningen, Anne D; van der Lee, Theo A J; Waalwijk, Cees; de Hoog, G Sybren

GRAbB (Genomic Region Assembly by Baiting) is a new program that is dedicated to assemble specific genomic regions from NGS data. This approach is especially useful when dealing with multi copy regions, such as mitochondrial genome and the rDNA repeat region, parts of the genome that are often
The complete chloroplast genome sequence of Mahonia bealei (Berberidaceae) reveals a significant expansion of the inverted repeat and phylogenetic relationship with other angiosperms.

Science.gov (United States)

Ma, Ji; Yang, Bingxian; Zhu, Wei; Sun, Lianli; Tian, Jingkui; Wang, Xumin

2013-10-10

Mahonia bealei (Berberidaceae) is a frequently-used traditional Chinese medicinal plant with efficient anti-inflammatory ability. This plant is one of the sources of berberine, a new cholesterol-lowering drug with anti-diabetic activity. We have sequenced the complete nucleotide sequence of the chloroplast (cp) genome of M. bealei. The complete cp genome of M. bealei is 164,792 bp in length, and has a typical structure with large (LSC 73,052 bp) and small (SSC 18,591 bp) single-copy regions separated by a pair of inverted repeats (IRs 36,501 bp) of large size. The Mahonia cp genome contains 111 unique genes and 39 genes are duplicated in the IR regions. The gene order and content of M. bealei are almost unarranged which is consistent with the hypothesis that large IRs stabilize cp genome and reduce gene loss-and-gain probabilities during evolutionary process. A large IR expansion of over 12 kb has occurred in M. bealei; 15 genes (rps19, rpl22, rps3, rpl16, rpl14, rps8, infA, rpl36, rps11, petD, petB, psbH, psbN, psbT and psbB) have expanded to have an additional copy in the IRs. The IR expansion rearrangement occurred via a double-strand DNA break and subsequent repair, which is different from the ordinary gene conversion mechanism. Repeat analysis identified 39 direct/inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Analysis also revealed 75 simple sequence repeat (SSR) loci and almost all are composed of A or T, contributing to a distinct bias in base composition. Comparison of protein-coding sequences with ESTs reveals 9 putative RNA edits and 5 of them resulted in non-synonymous modifications in rpoC1, rps2, rps19 and ycf1. Phylogenetic analysis using maximum parsimony (MP) and maximum likelihood (ML) was performed on a dataset composed of 65 protein-coding genes from 25 taxa, which yields an identical tree topology as previous plastid-based trees, and provides strong support for the sister relationship between Ranunculaceae and Berberidaceae.
Genome-Wide Discovery of Microsatellite Markers from Diploid Progenitor Species, Arachis duranensis and A. ipaensis, and Their Application in Cultivated Peanut (A. hypogaea)

Directory of Open Access Journals (Sweden)

Chuanzhi Zhao

2017-07-01

Full Text Available Despite several efforts in the last decade toward development of simple sequence repeat (SSR) markers in peanut, there is still a need for more markers for conducting different genetic and breeding studies. With the effort of the International Peanut Genome Initiative, the availability of reference genome for both the diploid progenitors of cultivated peanut allowed us to identify 135,529 and 199,957 SSRs from the A (Arachis duranensis) and B (Arachis ipaensis) genomes, respectively. Genome sequence analysis showed uneven distribution of the SSR motifs across genomes with variation in parameters such as SSR type, repeat number, and SSR length. Using the flanking sequences of identified SSRs, primers were designed for 51,354 and 60,893 SSRs with densities of 49 and 45 SSRs per Mb in A. duranensis and A. ipaensis, respectively. In silico PCR analysis of these SSR markers showed high transferability between wild and cultivated Arachis species. Two physical maps were developed for the A genome and the B genome using these SSR markers, and two reported disease resistance quantitative trait loci (QTLs), qF2TSWV5 for tomato spotted wilt virus (TSWV) and qF2LS6 for leaf spot (LS), were mapped in the 8.135 Mb region of chromosome A04 of A. duranensis. From this genomic region, 719 novel SSR markers were developed, which provide the possibility for fine mapping of these QTLs. In addition, this region also harbors 652 genes and 49 of these are defense related genes, including two NB-ARC genes, three LRR receptor-like genes and three WRKY transcription factors. These disease resistance related genes could contribute to resistance to viral (such as TSWV) and fungal (such as LS) diseases in peanut. In summary, this study not only provides a large number of molecular markers for potential use in peanut genetic map development and QTL mapping but also for map-based gene cloning and molecular breeding.
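Genome-wide SSR discovery of the kind described above is commonly done by scanning a sequence for perfect tandem repeats of short motifs. The following is a minimal sketch of that general idea, not the pipeline used in the study: the function name `find_ssrs` and the minimum repeat thresholds are illustrative assumptions, and real tools also handle compound and imperfect repeats.

```python
import re

# Hypothetical thresholds: motif length -> minimum number of tandem copies.
# These values are assumptions for illustration, not taken from the study.
DEFAULT_MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}

def find_ssrs(seq, min_repeats=None):
    """Return (start, motif, copy_number) tuples for perfect SSRs in seq."""
    thresholds = min_repeats or DEFAULT_MIN_REPEATS
    seq = seq.upper()
    hits = []
    for unit_len, min_copies in sorted(thresholds.items()):
        # One motif of unit_len bases followed by at least min_copies - 1 copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit_len, min_copies - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (e.g. "ATAT"),
            # since those are reported under the shorter motif length.
            if any(
                unit_len % k == 0 and motif == motif[:k] * (unit_len // k)
                for k in range(1, unit_len)
            ):
                continue
            hits.append((match.start(), motif, len(match.group(0)) // unit_len))
    return hits

example = "GGC" + "AT" * 7 + "CCT" + "AAG" * 5 + "T"
for start, motif, copies in find_ssrs(example):
    print(start, motif, copies)
```

Each hit gives a 0-based start position, so the flanking sequence needed for primer design can be taken as a slice on either side of the repeat.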
An orphan gyrB in the Mycobacterium smegmatis genome

Indian Academy of Sciences (India)

DNA gyrase is an essential topoisomerase found in all bacteria. It is encoded by gyrB and gyrA genes. These genes are organized differently in different bacteria. Direct comparison of Mycobacterium tuberculosis and Mycobacterium smegmatis genomes reveals presence of an additional gyrB in M. smegmatis flanked by ...
Linear and exponential TAIL-PCR: a method for efficient and quick amplification of flanking sequences adjacent to Tn5 transposon insertion sites.

Science.gov (United States)

Jia, Xianbo; Lin, Xinjian; Chen, Jichen

2017-11-02

Current genome walking methods are very time consuming, and many produce non-specific amplification products. To amplify the flanking sequences that are adjacent to Tn5 transposon insertion sites in Serratia marcescens FZSF02, we developed a genome walking method based on TAIL-PCR. This PCR method added a 20-cycle linear amplification step before the exponential amplification step to increase the concentration of the target sequences. Products of the linear amplification and the exponential amplification were diluted 100-fold to decrease the concentration of the templates that cause non-specific amplification. Fast DNA polymerase with a high extension speed was used in this method, and an amplification program was used to rapidly amplify long specific sequences. With this linear and exponential TAIL-PCR (LETAIL-PCR), we successfully obtained products larger than 2 kb from Tn5 transposon insertion mutant strains within 3 h. This method can be widely used in genome walking studies to amplify unknown sequences that are adjacent to known sequences.
Genomic characterization of large heterochromatic gaps in the human genome assembly.

Directory of Open Access Journals (Sweden)

Nicolas Altemose

2014-05-01

Full Text Available The largest gaps in the human genome assembly correspond to multi-megabase heterochromatic regions composed primarily of two related families of tandem repeats, Human Satellites 2 and 3 (HSat2,3). The abundance of repetitive DNA in these regions challenges standard mapping and assembly algorithms, and as a result, the sequence composition and potential biological functions of these regions remain largely unexplored. Furthermore, existing genomic tools designed to predict consensus-based descriptions of repeat families cannot be readily applied to complex satellite repeats such as HSat2,3, which lack a consistent repeat unit reference sequence. Here we present an alignment-free method to characterize complex satellites using whole-genome shotgun read datasets. Utilizing this approach, we classify HSat2,3 sequences into fourteen subfamilies and predict their chromosomal distributions, resulting in a comprehensive satellite reference database to further enable genomic studies of heterochromatic regions. We also identify 1.3 Mb of non-repetitive sequence interspersed with HSat2,3 across 17 unmapped assembly scaffolds, including eight annotated gene predictions. Finally, we apply our satellite reference database to high-throughput sequence data from 396 males to estimate array size variation of the predominant HSat3 array on the Y chromosome, confirming that satellite array sizes can vary between individuals over an order of magnitude (7 to 98 Mb) and further demonstrating that array sizes are distributed differently within distinct Y haplogroups. In summary, we present a novel framework for generating initial reference databases for unassembled genomic regions enriched with complex satellite DNA, and we further demonstrate the utility of these reference databases for studying patterns of sequence variation within human populations.
The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.

Science.gov (United States)

Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M

2013-05-29

Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in
Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats

NARCIS (Netherlands)

Warmerdam, Daniel O.; van den Berg, Jeroen; Medema, Rene H.

2016-01-01

rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded

The complete chloroplast genome sequences of Lychnis wilfordii and Silene capitata and comparative analyses with other Caryophyllaceae genomes.

Science.gov (United States)

Kang, Jong-Soo; Lee, Byoung Yoon; Kwak, Myounghai

2017-01-01

The complete chloroplast genomes of Lychnis wilfordii and Silene capitata were determined and compared with ten previously reported Caryophyllaceae chloroplast genomes. The chloroplast genome sequences of L. wilfordii and S. capitata contain 152,320 bp and 150,224 bp, respectively. The gene contents and orders among 12 Caryophyllaceae species are consistent, but several microstructural changes have occurred. Expansion of the inverted repeat (IR) regions at the large single copy (LSC)/IRb and small single copy (SSC)/IR boundaries led to partial or entire gene duplications. Additionally, rearrangements of the LSC region were caused by gene inversions and/or transpositions. The 18 kb inversions, which occurred three times in different lineages of tribe Sileneae, were thought to be facilitated by the intermolecular duplicated sequences. Sequence analyses of the L. wilfordii and S. capitata genomes revealed 39 and 43 repeats, respectively, including forward, palindromic, and reverse repeats. In addition, a total of 67 and 56 simple sequence repeats were discovered in the L. wilfordii and S. capitata chloroplast genomes, respectively. Finally, we constructed phylogenetic trees of the 12 Caryophyllaceae species and two Amaranthaceae species based on 73 protein-coding genes using both maximum parsimony and likelihood methods.
Comparative genomic analysis reveals multiple long terminal repeats, lineage-specific amplification, and frequent interelement recombination for Cassandra retrotransposon in pear (Pyrus bretschneideri Rehd.).

Science.gov (United States)

Yin, Hao; Du, Jianchang; Li, Leiting; Jin, Cong; Fan, Lian; Li, Meng; Wu, Jun; Zhang, Shaoling

2014-06-04

Cassandra transposable elements belong to a specific group of terminal-repeat retrotransposons in miniature (TRIM). Although Cassandra TRIM elements have been found in almost all vascular plants, detailed investigations on the nature, abundance, amplification timeframe, and evolution have not been performed in an individual genome. We therefore conducted a comprehensive analysis of Cassandra retrotransposons using the newly sequenced pear genome along with four other Rosaceae species, including apple, peach, mei, and woodland strawberry. Our data reveal several interesting findings for this particular retrotransposon family: 1) A large number of the intact copies contain three, four, or five long terminal repeats (LTRs) (∼20% in pear); 2) intact copies and solo LTRs with or without target site duplications are both common (∼80% vs. 20%) in each genome; 3) the elements exhibit an overall unbiased distribution among the chromosomes; 4) the elements are most successfully amplified in pear (5,032 copies); and 5) the evolutionary relationships of these elements vary among different lineages, species, and evolutionary time. These results indicate that Cassandra retrotransposons contain more complex structures (elements with multiple LTRs) than what we have known previously, and that frequent interelement unequal recombination followed by transposition may play a critical role in shaping and reshaping host genomes. Thus this study provides insights into the property, propensity, and molecular mechanisms governing the formation and amplification of Cassandra retrotransposons, and enhances our understanding of the structural variation, evolutionary history, and transposition process of LTR retrotransposons in plants. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Two low coverage bird genomes and a comparison of reference-guided versus de novo genome assemblies.

Science.gov (United States)

Card, Daren C; Schield, Drew R; Reyes-Velasco, Jacobo; Fujita, Matthew K; Andrew, Audra L; Oyler-McCance, Sara J; Fike, Jennifer A; Tomback, Diana F; Ruggiero, Robert P; Castoe, Todd A

2014-01-01

As a greater number and diversity of high-quality vertebrate reference genomes become available, it is increasingly feasible to use these references to guide new draft assemblies for related species. Reference-guided assembly approaches may substantially increase the contiguity and completeness of a new genome using only low levels of genome coverage that might otherwise be insufficient for de novo genome assembly. We used low-coverage (∼3.5-5.5x) Illumina paired-end sequencing to assemble draft genomes of two bird species (the Gunnison Sage-Grouse, Centrocercus minimus, and the Clark's Nutcracker, Nucifraga columbiana). We used these data to estimate de novo genome assemblies and reference-guided assemblies, and compared the information content and completeness of these assemblies by comparing CEGMA gene set representation, repeat element content, simple sequence repeat content, and GC isochore structure among assemblies. Our results demonstrate that even lower-coverage genome sequencing projects are capable of producing informative and useful genomic resources, particularly through the use of reference-guided assemblies.
Myotonin protein-kinase [AGC]n trinucleotide repeat in seven nonhuman primates

Energy Technology Data Exchange (ETDEWEB)

Novelli, G.; Sineo, L.; Pontieri, E. [Catholic Univ. of Rome (Italy)]|[Univ. of Milan (Italy)]|[Univ. Florence (Italy)] [and others]

1994-09-01

Myotonic dystrophy (DM) is due to a genomic instability of a trinucleotide [AGC]n motif, located at the 3′ UTR region of a protein-kinase gene (myotonin protein kinase, MT-PK). The [AGC] repeat is meiotically and mitotically unstable, and it is directly related to the manifestations of the disorder. Although a gene dosage effect of the MT-PK has been demonstrated in DM muscle, the mechanism(s) by which the intragenic repeat expansion leads to disease is largely unknown. This non-standard mutational event could reflect an evolutionary mechanism widespread among animal genomes. We have isolated and sequenced the complete 3′ UTR region of the MT-PK gene in seven primates (macaque, orangutan, gorilla, chimpanzee, gibbon, owl monkey, saimiri), and examined by comparative sequence nucleotide analysis the [AGC]n intragenic repeat and the surrounding nucleotides. The genomic organization, including the [AGC]n repeat structure, was conserved in all examined species, excluding the gibbon (Hylobates agilis), in which the [AGC]n upstream sequence (GGAA) is replaced by a GA dinucleotide. The number of [AGC]n in the examined species ranged between 7 (gorilla) and 13 repeats (owl monkeys), with a polymorphism informative content (PIC) similar to that observed in humans. These results indicate that the 3′ UTR [AGC] repeat within the MT-PK gene is evolutionarily conserved, supporting that this region has important regulatory functions.
Mitigation of Flanking Noise Transmission in Periodic Structures of Lightweight Elements

DEFF Research Database (Denmark)

Domadiya, Parthkumar Gandalal

through structural junctions and radiates into neighbouring rooms. To diminish the flanking transmission of sound, frames are usually designed with single or double studs or constructed with layers of foam or another viscoelastic material. This thesis is investigating the behaviour of flanking noise...... transmission in periodic structures of lightweight elements by employing various numerical, analytical and experimental methods. At first, three dimensional finite-element (FE) models of a Z-shaped lightweight panel structure based on various frame designs, inclusion of air and structural coupling between...... elements are considered for describing flanking noise transmission through panels. It is assumed that the ribs are fully fixed to the plates in case of various frame designs, and a parametric study is carried out on the centre panel with regard to various spacing between the ribs. Solid finite elements...
Genome Architecture and Its Roles in Human Copy Number Variation

Directory of Open Access Journals (Sweden)

Lu Chen

2014-12-01

Full Text Available Besides single-nucleotide variants in the human genome, large-scale genomic variants, such as copy number variations (CNVs), are being increasingly discovered as a genetic source of human diversity and the pathogenic factors of diseases. Recent experimental findings have shed light on the links between different genome architectures and CNV mutagenesis. In this review, we summarize various genomic features and discuss their contributions to CNV formation. Genomic repeats, including both low-copy and high-copy repeats, play important roles in CNV instability, which was initially known as DNA recombination events. Furthermore, it has been found that human genomic repeats can also induce DNA replication errors and consequently result in CNV mutations. Some recent studies showed that DNA replication timing, which reflects the high-order information of genomic organization, is involved in human CNV mutations. Our review highlights that genome architecture, from DNA sequence to high-order genomic organization, is an important molecular factor in CNV mutagenesis and human genomic instability.
Flank tectonics of Martian volcanoes

International Nuclear Information System (INIS)

Thomas, P.J.; Squyres, S.W.; Carr, M.H.

1990-01-01

On the flanks of Olympus Mons is a series of terraces, concentrically distributed around the caldera. Their morphology and location suggest that they could be thrust faults caused by compressional failure of the cone. In an attempt to understand the mechanism of faulting and the possible influences of the interior structure of Olympus Mons, the authors have constructed a numerical model for elastic stresses within a Martian volcano. In the absence of internal pressurization, the middle slopes of the cone are subjected to compressional stress, appropriate to the formation of thrust faults. These stresses for Olympus Mons are ∼250 MPa. If a vacant magma chamber is contained within the cone, the region of maximum compressional stress is extended toward the base of the cone. If the magma chamber is pressurized, extensional stresses occur at the summit and on the upper slopes of the cone. For a filled but unpressurized magma chamber, the observed positions of the faults agree well with the calculated region of high compressional stress. Three other volcanoes on Mars, Ascraeus Mons, Arsia Mons, and Pavonis Mons, possess similar terraces. Extending the analysis to other Martian volcanoes, they find that only these three and Olympus Mons have flank stresses that exceed the compressional failure strength of basalt, lending support to the view that the terraces on all four are thrust faults
On detection and assessment of statistical significance of Genomic Islands

Directory of Open Access Journals (Sweden)

Chaudhuri Probal

2008-04-01

Full Text Available Background: Many of the available methods for detecting Genomic Islands (GIs) in prokaryotic genomes use markers such as transposons, proximal tRNAs, flanking repeats etc., or they use other supervised techniques requiring training datasets. Most of these methods are primarily based on the biases in GC content or codon and amino acid usage of the islands. However, these methods either do not use any formal statistical test of significance or use statistical tests for which the critical values and the P-values are not adequately justified. We propose a method, which is unsupervised in nature and uses Monte-Carlo statistical tests based on randomly selected segments of a chromosome. Such tests are supported by precise statistical distribution theory, and consequently, the resulting P-values are quite reliable for making the decision. Results: Our algorithm (named Design-Island, an acronym for Detection of Statistically Significant Genomic Island) runs in two phases. Some 'putative GIs' are identified in the first phase, and those are refined into smaller segments containing horizontally acquired genes in the refinement phase. This method is applied to Salmonella typhi CT18 genome leading to the discovery of several new pathogenicity, antibiotic resistance and metabolic islands that were missed by earlier methods. Many of these islands contain mobile genetic elements like phage-mediated genes, transposons, integrase and IS elements confirming their horizontal acquirement. Conclusion: The proposed method is based on statistical tests supported by precise distribution theory and reliable P-values along with a technique for visualizing statistically significant islands. The performance of our method is better than many other well known methods in terms of their sensitivity and accuracy, and in terms of specificity, it is comparable to other methods.
Analysis of expressed sequence tags from Prunus mume flower and fruit and development of simple sequence repeat markers

Directory of Open Access Journals (Sweden)

Gao Zhihong

2010-07-01

Full Text Available Background: Expressed Sequence Tags (ESTs) have been a cost-effective tool in molecular biology and represent an abundant, valuable resource for genome annotation, gene expression, and comparative genomics in plants. Results: In this study, we constructed a cDNA library of Prunus mume flower and fruit, sequenced 10,123 clones of the library, and obtained 8,656 high-quality expressed sequence tag (EST) sequences. The ESTs were assembled into 4,473 unigenes composed of 1,492 contigs and 2,981 singletons, which have been deposited in NCBI (accession IDs: GW868575 - GW873047), among which 1,294 unique ESTs had known or putative functions. Furthermore, we found 1,233 putative simple sequence repeats (SSRs) in the P. mume unigene dataset. We randomly tested 42 pairs of PCR primers flanking potential SSRs, and 14 pairs were identified as true-to-type SSR loci and could amplify polymorphic bands from 20 individual plants of P. mume. We further used the 14 EST-SSR primer pairs to test the transferability on peach and plum. The result showed that nearly 89% of the primer pairs produced target PCR bands in the two species. A high level of marker polymorphism was observed in the plum species (65%) and a low level in the peach (46%), and the clustering analysis of the three species indicated that these SSR markers were useful in the evaluation of genetic relationships and diversity between and within the Prunus species. Conclusions: We have constructed the first cDNA library of P. mume flower and fruit, and our data provide sets of molecular biology resources for P. mume and other Prunus species. These resources will be useful for further study such as genome annotation, new gene discovery, gene functional analysis, molecular breeding, evolution and comparative genomics between Prunus species.
SSRscanner: a program for reporting distribution and exact location of simple sequence repeats.

Science.gov (United States)

Anwar, Tamanna; Khan, Asad U

2006-02-20

Simple sequence repeats (SSRs) have become important molecular markers for a broad range of applications, such as genome mapping and characterization, phenotype mapping, marker assisted selection of crop plants and a range of molecular ecology and diversity studies. These repeated DNA sequences are found in both prokaryotes and eukaryotes. They are distributed almost at random throughout the genome, ranging from mononucleotide to trinucleotide repeats. They also occur as longer tracts (> 6 repeating units). Most of the computer programs that find SSRs do not report their exact positions. A computer program, SSRscanner, was written to find the distribution, frequency and exact location of each SSR in the genome. SSRscanner is user friendly. It can search for repeats of any length and produces output with their exact positions on the chromosome and their frequency of occurrence in the sequence. This program has been written in PERL and is freely available for non-commercial users by request from the authors. Please contact the authors by E-mail: huzzi99@hotmail.com.
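The kind of scan described in this abstract can be illustrated with a short sketch. This is not SSRscanner itself (which is written in PERL and whose internals are not given here); it is a minimal Python illustration, assuming perfect (uninterrupted) repeats, forward-strand search only, and hypothetical parameter names (`min_unit`, `max_unit`, `min_repeats`).

```python
import re

def find_ssrs(seq, min_unit=1, max_unit=3, min_repeats=4):
    """Report perfect tandem repeats as (start, end, motif, copies).

    Positions are 1-based and inclusive. All parameter names and
    defaults are illustrative assumptions, not SSRscanner settings.
    """
    seq = seq.upper()
    hits = []
    for unit in range(min_unit, max_unit + 1):
        # A motif of length `unit` followed by at least min_repeats - 1 copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are just a shorter unit repeated (e.g. "AA" when unit is 2).
            redundant = any(
                motif == motif[:k] * (unit // k)
                for k in range(1, unit)
                if unit % k == 0
            )
            if redundant:
                continue
            copies = (m.end() - m.start()) // unit
            hits.append((m.start() + 1, m.end(), motif, copies))
    return sorted(hits)

example = "GG" + "CA" * 5 + "T" * 6 + "G"
for start, end, motif, copies in find_ssrs(example):
    print(start, end, motif, copies)
```

Reporting the start and end coordinates alongside the motif is the point the abstract emphasizes; frequency per motif can then be derived by counting the returned tuples.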
Development of Highly Informative Genome-Wide Single Sequence Repeat Markers for Breeding Applications in Sesame and Construction of a Web Resource: SisatBase

Directory of Open Access Journals (Sweden)

Komivi Dossa

2017-08-01

Full Text Available The sequencing of the full nuclear genome of sesame (Sesamum indicum L.) provides the platform for functional analyses of genome components and their application in breeding programs. Although the importance of microsatellite markers or simple sequence repeats (SSRs) in crop genotyping, genetics, and breeding applications is well established, little information exists concerning SSRs at the whole genome level in sesame. In addition, SSRs represent a suitable marker type for sesame molecular breeding in developing countries where it is mainly grown. In this study, we identified 138,194 genome-wide SSRs of which 76.5% were physically mapped onto the 13 pseudo-chromosomes. Among these SSRs, up to three primer pairs were supplied for 101,930 SSRs and used to in silico amplify the reference genome together with two newly sequenced sesame accessions. A total of 79,957 SSRs (78%) were polymorphic between the three genomes, thereby suggesting their promising use in different genomics-assisted breeding applications. From these polymorphic SSRs, 23 were selected and validated to have high polymorphic potential in 48 sesame accessions from different growing areas of Africa. Furthermore, we have developed an online user-friendly database, SisatBase (http://www.sesame-bioinfo.org/SisatBase/), which provides free access to SSR data as well as an integrated platform for functional analyses. Altogether, the reference SSR and SisatBase would serve as useful resources for genetic assessment, genomic studies, and breeding advancement in sesame, especially in developing countries.
Concurrent Preoperative Presence of Hydronephrosis and Flank Pain Independently Predicts Worse Outcome of Upper Tract Urothelial Carcinoma.

Science.gov (United States)

Yeh, Hsin-Chih; Jan, Hau-Chern; Wu, Wen-Jeng; Li, Ching-Chia; Li, Wei-Ming; Ke, Hung-Lung; Huang, Shu-Pin; Liu, Chia-Chu; Lee, Yung-Chin; Yang, Sheau-Fang; Liang, Peir-In; Huang, Chun-Nung

2015-01-01

To investigate the impact of preoperative hydronephrosis and flank pain on prognosis of patients with upper tract urothelial carcinoma. In total, 472 patients with upper tract urothelial carcinoma managed by radical nephroureterectomy were included from Kaohsiung Medical University Hospital Healthcare System. Clinicopathological data were collected retrospectively for analysis. The significance of hydronephrosis, especially when combined with flank pain, and other relevant factors on overall and cancer-specific survival were evaluated. Of the 472 patients, 292 (62%) had preoperative hydronephrosis and 121 (26%) presented with flank pain. Preoperative hydronephrosis was significantly associated with age, hematuria, flank pain, tumor location, and pathological tumor stage. Concurrent presence of hydronephrosis and flank pain was a significant predictor of non-organ-confined disease (multivariate-adjusted hazard ratio = 2.10, P = 0.025). Kaplan-Meier analysis showed significantly poorer overall and cancer-specific survival in patients with preoperative hydronephrosis (P = 0.005 and P = 0.026, respectively) and in patients with flank pain (P hydronephrosis and flank pain independently predicted adverse outcome (hazard ratio = 1.98, P = 0.016 for overall survival and hazard ratio = 1.87, P = 0.036 for cancer-specific survival, respectively) in multivariate Cox proportional hazards models. In addition, concurrent presence of hydronephrosis and flank pain was also significantly predictive of worse survival in patients with high grade or muscle-invasive disease. Notably, there was no difference in survival between patients with hydronephrosis but devoid of flank pain and those without hydronephrosis. Concurrent preoperative presence of hydronephrosis and flank pain predicted non-organ-confined status of upper tract urothelial carcinoma. When accompanied with flank pain, hydronephrosis represented an independent predictor for worse outcome in patients with upper tract
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

Science.gov (United States)

Fu, Jianmin; Liu, Huimin; Hu, Jingjing; Liang, Yuqin; Liang, Jinjun; Wuyun, Tana; Tan, Xiaofeng

2016-01-01

Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Five Complete Chloroplast Genome Sequences from Diospyros: Genome Organization and Comparative Analysis.

Directory of Open Access Journals (Sweden)

Jianmin Fu

Full Text Available Diospyros is the largest genus in Ebenaceae, comprising more than 500 species with remarkable economic value, especially Diospyros kaki Thunb., which has traditionally been an important food resource in China, Korea, and Japan. Complete chloroplast (cp) genomes from D. kaki, D. lotus L., D. oleifera Cheng., D. glaucifolia Metc., and Diospyros 'Jinzaoshi' were sequenced using Illumina sequencing technology. This is the first cp genome reported in Ebenaceae. The cp genome sequences of Diospyros ranged from 157,300 to 157,784 bp in length, presenting a typical quadripartite structure with two inverted repeats each separated by one large and one small single-copy region. For each cp genome, 134 genes were annotated, including 80 protein-coding, 31 tRNA, and 4 rRNA unique genes. In all, 179 repeats and 283 single sequence repeats were identified. Four hypervariable regions, namely, intergenic region of trnQ_rps16, trnV_ndhC, and psbD_trnT, and intron of ndhA, were identified in the Diospyros genomes. Phylogenetic analyses based on the whole cp genome, protein-coding, and intergenic and intron sequences indicated that D. oleifera is closely related to D. kaki and could be used as a model plant for future research on D. kaki; to our knowledge, this is proposed for the first time. Further, these analyses together with two large deletions (301 and 140 bp) in the cp genome of D. 'Jinzaoshi', support its placement as a new species in Diospyros. Both maximum parsimony and likelihood analyses for 19 taxa indicated the basal position of Ericales in asterids and suggested that Ebenaceae is monophyletic in Ericales.
Genome-wide comparative analysis of 20 miniature inverted-repeat transposable element families in Brassica rapa and B. oleracea.

Directory of Open Access Journals (Sweden)

Perumal Sampath

Full Text Available Miniature inverted-repeat transposable elements (MITEs) are ubiquitous, non-autonomous class II transposable elements. Here, we conducted genome-wide comparative analysis of 20 MITE families in B. rapa, B. oleracea, and Arabidopsis thaliana. A total of 5894 and 6026 MITE members belonging to the 20 families were found in the whole genome pseudo-chromosome sequences of B. rapa and B. oleracea, respectively. Meanwhile, only four of the 20 families, comprising 573 members, were identified in the Arabidopsis genome, indicating that most of the families were activated in the Brassica genus after divergence from Arabidopsis. Copy numbers varied from 4 to 1459 for each MITE family, and there was up to 6-fold variation between B. rapa and B. oleracea. In particular, analysis of intact members showed that whereas eleven families were present in similar copy numbers in B. rapa and B. oleracea, nine families showed copy number variation ranging from 2- to 16-fold. Four of those families (BraSto-3, BraTo-3, 4, 5) were more abundant in B. rapa, and the other five (BraSto-1, BraSto-4, BraTo-1, 7 and BraHAT-1) were more abundant in B. oleracea. Overall, 54% and 51% of the MITEs resided in or within 2 kb of a gene in the B. rapa and B. oleracea genomes, respectively. Notably, 92 MITEs were found within the CDS of annotated genes, suggesting that MITEs might play roles in diversification of genes in the recently triplicated Brassica genome. MITE insertion polymorphism (MIP) analysis of 289 MITE members showed that 52% and 23% were polymorphic at the inter- and intra-species levels, respectively, indicating that there has been recent MITE activity in the Brassica genome. These recently activated MITE families with abundant MIP will provide useful resources for molecular breeding and identification of novel functional genes arising from MITE insertion.
A versatile palindromic amphipathic repeat coding sequence horizontally distributed among diverse bacterial and eucaryotic microbes

Directory of Open Access Journals (Sweden)

Glass John I

2010-07-01

Full Text Available Background: Intragenic tandem repeats occur throughout all domains of life and impart functional and structural variability to diverse translation products. Repeat proteins confer distinctive surface phenotypes to many unicellular organisms, including those with minimal genomes such as the wall-less bacterial monoderms, Mollicutes. One such repeat pattern in this clade is distributed in a manner suggesting its exchange by horizontal gene transfer (HGT). Expanding genome sequence databases reveal the pattern in a widening range of bacteria, and recently among eucaryotic microbes. We examined the genomic flux and consequences of the motif by determining its distribution, predicted structural features and association with membrane-targeted proteins. Results: Using a refined hidden Markov model, we document a 25-residue protein sequence motif tandemly arrayed in variable-number repeats in ORFs lacking assigned functions. It appears sporadically in unicellular microbes from disparate bacterial and eucaryotic clades, representing diverse lifestyles and ecological niches that include host parasitic, marine and extreme environments. Tracts of the repeats predict a malleable configuration of recurring domains, with conserved hydrophobic residues forming an amphipathic secondary structure in which hydrophilic residues endow extensive sequence variation. Many ORFs with these domains also have membrane-targeting sequences that predict assorted topologies; others may comprise reservoirs of sequence variants. We demonstrate expressed variants among surface lipoproteins that distinguish closely related animal pathogens belonging to a subgroup of the Mollicutes. DNA sequences encoding the tandem domains display dyad symmetry. Moreover, in some taxa the domains occur in ORFs selectively associated with mobile elements. These features, a punctate phylogenetic distribution, and different patterns of dispersal in genomes of related taxa, suggest that the
Two tandemly repeated telomere-associated sequences in Nicotiana plumbaginifolia.

Science.gov (United States)

Chen, C M; Wang, C T; Wang, C J; Ho, C H; Kao, Y Y; Chen, C C

1997-12-01

Two tandemly repeated telomere-associated sequences, NP3R and NP4R, have been isolated from Nicotiana plumbaginifolia. The length of a repeating unit for NP3R and NP4R is 165 and 180 nucleotides respectively. The abundance of NP3R, NP4R and telomeric repeats is, respectively, 8.4 x 10^4, 6 x 10^3 and 1.5 x 10^6 copies per haploid genome of N. plumbaginifolia. Fluorescence in situ hybridization revealed that NP3R is located at the ends and/or in interstitial regions of all 10 chromosomes and NP4R on the terminal regions of three chromosomes in the haploid genome of N. plumbaginifolia. Sequence homology search revealed that not only are NP3R and NP4R homologous to HRS60 and GRS, respectively, two tandem repeats isolated from N. tabacum, but that NP3R and NP4R are also related to each other, suggesting that they originated from a common ancestral sequence. The role of these repeated sequences in chromosome healing is discussed based on the observation that two to three copies of a telomere-similar sequence were present in each repeating unit of NP3R and NP4R.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Directory of Open Access Journals (Sweden)

Charlotte Rehm

Full Text Available In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
Investigation of a Quadruplex-Forming Repeat Sequence Highly Enriched in Xanthomonas and Nostoc sp.

Science.gov (United States)

Rehm, Charlotte; Wurmthaler, Lena A; Li, Yuanhao; Frickey, Tancred; Hartig, Jörg S

2015-01-01

In prokaryotes simple sequence repeats (SSRs) with unit sizes of 1-5 nucleotides (nt) are causative for phase and antigenic variation. Although an increased abundance of heptameric repeats was noticed in bacteria, reports about SSRs of 6-9 nt are rare. In particular G-rich repeat sequences with the propensity to fold into G-quadruplex (G4) structures have received little attention. In silico analysis of prokaryotic genomes show putative G4 forming sequences to be abundant. This report focuses on a surprisingly enriched G-rich repeat of the type GGGNATC in Xanthomonas and cyanobacteria such as Nostoc. We studied in detail the genomes of Xanthomonas campestris pv. campestris ATCC 33913 (Xcc), Xanthomonas axonopodis pv. citri str. 306 (Xac), and Nostoc sp. strain PCC7120 (Ana). In all three organisms repeats are spread all over the genome with an over-representation in non-coding regions. Extensive variation of the number of repetitive units was observed with repeat numbers ranging from two up to 26 units. However a clear preference for four units was detected. The strong bias for four units coincides with the requirement of four consecutive G-tracts for G4 formation. Evidence for G4 formation of the consensus repeat sequences was found in biophysical studies utilizing CD spectroscopy. The G-rich repeats are preferably located between aligned open reading frames (ORFs) and are under-represented in coding regions or between divergent ORFs. The G-rich repeats are preferentially located within a distance of 50 bp upstream of an ORF on the anti-sense strand or within 50 bp from the stop codon on the sense strand. Analysis of whole transcriptome sequence data showed that the majority of repeat sequences are transcribed. The genetic loci in the vicinity of repeat regions show increased genomic stability. In conclusion, we introduce and characterize a special class of highly abundant and wide-spread quadruplex-forming repeat sequences in bacteria.
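As a rough illustration of how tandem tracts of the GGGNATC heptamer could be located and their unit counts tallied, the following Python sketch uses a regular expression. It is not the authors' pipeline; it assumes perfect back-to-back heptamers, searches only the given strand (the reverse complement would have to be scanned separately), and the function name and `min_units` parameter are hypothetical.

```python
import re

def find_gggnatc_tracts(seq, min_units=2):
    """Return (start, end, units) for tandem GGGNATC tracts.

    Positions are 1-based and inclusive; N may differ between units.
    Only the supplied strand is searched (an assumption of this sketch).
    """
    pattern = re.compile(r"(?:GGG[ACGT]ATC){%d,}" % min_units)
    tracts = []
    for m in pattern.finditer(seq.upper()):
        units = (m.end() - m.start()) // 7  # each unit is 7 nt long
        tracts.append((m.start() + 1, m.end(), units))
    return tracts

# A tract of four units, the count the abstract reports as preferred.
example = "TT" + "GGGAATC" + "GGGTATC" + "GGGCATC" + "GGGGATC" + "AA"
print(find_gggnatc_tracts(example))
```

A histogram of the `units` values over a whole genome would be one way to look for the preference for four units described above.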

An RNA secondary structure bias for non-homologous reverse transcriptase-mediated deletions in vivo

DEFF Research Database (Denmark)

Duch, Mogens; Carrasco, Maria L; Jespersen, Thomas

2004-01-01

Murine leukemia viruses harboring an internal ribosome entry site (IRES)-directed translational cassette are able to replicate, but undergo loss of heterologous sequences upon continued passage. While complete loss of heterologous sequences is favored when these are flanked by a direct repeat......, deletion mutants with junction sites within the heterologous cassette may also be retrieved, in particular from vectors without flanking repeats. Such deletion mutants were here used to investigate determinants of reverse transcriptase-mediated non-homologous recombination. Based upon previous structural...... result from template switching during first-strand cDNA synthesis and that the choice of acceptor sites for non-homologous recombination are guided by non-paired regions. Our results may have implications for recombination events taking place within structured regions of retroviral RNA genomes...
DISQUIETUDE ON THE EASTERN FLANK: AWAITING ALLIANCE RESPONSE

Directory of Open Access Journals (Sweden)

Octavian Manea

2010-03-01

Full Text Available The absence of significant and tangible military defensive infrastructure on the Eastern flank generated over time a breach of credibility in the security guarantee provided by NATO under its Article 5 commitment. The main argument of the countries in the New Europe now is that, in order to be credible enough, and not just a paper guarantee, a collective defence commitment must be backed by “boots on the ground” and by tangible military logistics. While assuming this perspective, the present article looks at some of the alarm signals coming from the countries on NATO’s Eastern flank, trying to explain the feeling of insecurity perceived by the states in the region as well as the options available to the Euro-Atlantic community in order to engage in a much-needed process of strategic reassurance.
Economic method for helical gear flank surface characterisation

Science.gov (United States)

Koulin, G.; Reavie, T.; Frazer, R. C.; Shaw, B. A.

2018-03-01

Typically the quality of a gear pair is assessed based on simplified geometric tolerances which do not always correlate with functional performance. In order to identify and quantify functional performance based parameters, further development of the gear measurement approach is required. A methodology for interpolation of the full active helical gear flank surface, from sparse line measurements, is presented. The method seeks to identify the minimum number of line measurements required to sufficiently characterise an active gear flank. In the form ground gear example presented, a single helix and three profile line measurements were considered to be acceptable. The resulting surfaces can be used to simulate the meshing engagement of a gear pair and therefore provide insight into functional performance based parameters. Therefore the assessment of the quality can be based on the predicted performance in the context of an application.
Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats

OpenAIRE

Warmerdam, Daniël O.; van den Berg, Jeroen; Medema, René H.

2016-01-01

rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded breaks in rDNA. Using selective endonucleases, we show that human cells are highly sensitive to breaks in 45S but not the 5S rDNA repeats. We find that homologous recombination inhibits repair of b...
Variable number of tandem repeat markers in the genome sequence of Mycosphaerella fijiensis, the causal agent of black leaf streak disease of banana (Musa spp).

Science.gov (United States)

Garcia, S A L; Van der Lee, T A J; Ferreira, C F; Te Lintel Hekkert, B; Zapater, M-F; Goodwin, S B; Guzmán, M; Kema, G H J; Souza, M T

2010-11-09

We searched the genome of Mycosphaerella fijiensis for molecular markers that would allow population genetics analysis of this plant pathogen. M. fijiensis, the causal agent of banana leaf streak disease, also known as black Sigatoka, is the most devastating pathogen attacking bananas (Musa spp). Recently, the entire genome sequence of M. fijiensis became available. We screened this database for VNTR markers. Forty-two primer pairs were selected for validation, based on repeat type and length and the number of repeat units. Five VNTR markers showing multiple alleles were validated with a reference set of isolates from different parts of the world and a population from a banana plantation in Costa Rica. Polymorphism information content values varied from 0.6414 to 0.7544 for the reference set and from 0.0400 to 0.7373 for the population set. Eighty percent of the polymorphism information content values were above 0.60, indicating that the markers are highly informative. These markers allowed robust scoring of agarose gels and proved to be useful for variability and population genetics studies. In conclusion, the strategy we developed to identify and validate VNTR markers is an efficient means to incorporate markers that can be used for fungicide resistance management and to develop breeding strategies to control banana black leaf streak disease. This is the first report of VNTR-minisatellites from the M. fijiensis genome sequence.
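For readers unfamiliar with polymorphism information content (PIC), the sketch below computes it from allele frequencies. The abstract does not state which formula was used; this assumes the commonly cited definition PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, and the function name is hypothetical.

```python
def polymorphism_information_content(allele_counts):
    """Compute PIC from allele counts or frequencies for one marker.

    Assumes the commonly used definition
        PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2
    which may differ from the formula used in the study above.
    """
    total = float(sum(allele_counts))
    p = [c / total for c in allele_counts]  # normalise to frequencies
    homozygosity = sum(x * x for x in p)
    cross_term = sum(
        2 * p[i] ** 2 * p[j] ** 2
        for i in range(len(p))
        for j in range(i + 1, len(p))
    )
    return 1.0 - homozygosity - cross_term

# Four equally frequent alleles give a PIC of about 0.70.
print(round(polymorphism_information_content([10, 10, 10, 10]), 4))
```

Under this definition a marker with a single allele scores 0, and the score rises as alleles become more numerous and more evenly distributed.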
The First Complete Chloroplast Genome Sequences in Actinidiaceae: Genome Structure and Comparative Analysis.

Science.gov (United States)

Yao, Xiaohong; Tang, Ping; Li, Zuozhou; Li, Dawei; Liu, Yifei; Huang, Hongwen

2015-01-01

Actinidia chinensis is an important economic plant belonging to the basal lineage of the asterids. Availability of a complete Actinidia chloroplast genome sequence is crucial to understanding phylogenetic relationships among major lineages of angiosperms and facilitates kiwifruit genetic improvement. We report here the complete nucleotide sequences of the chloroplast genomes for Actinidia chinensis and A. chinensis var. deliciosa obtained through de novo assembly of Illumina paired-end reads produced by total DNA sequencing. The total genome size ranges from 155,446 to 157,557 bp, with an inverted repeat (IR) of 24,013 to 24,391 bp, a large single copy region (LSC) of 87,984 to 88,337 bp and a small single copy region (SSC) of 20,332 to 20,336 bp. The genome encodes 113 different genes, including 79 unique protein-coding genes, 30 tRNA genes and 4 ribosomal RNA genes, with 16 duplicated in the inverted repeats, and a tRNA gene (trnfM-CAU) duplicated once in the LSC region. Comparisons of IR boundaries among four asterid species showed that IR/LSC borders were extended into the 5' portion of the psbA gene and IR contraction occurred in Actinidia. The clpP gene has been lost from the chloroplast genome in Actinidia, and may have been transferred to the nucleus during chloroplast evolution. Twenty-seven polymorphic simple sequence repeat (SSR) loci were identified in the Actinidia chloroplast genome. Maximum parsimony analyses of a 72-gene, 16 taxa angiosperm dataset strongly support the placement of Actinidiaceae in Ericales within the basal asterids.
Effect of the bases flanking an abasic site on the recognition of nucleobase by amiloride.

Science.gov (United States)

Rajendran, Arivazhagan; Zhao, Chunxia; Rajendar, Burki; Thiagarajan, Viruthachalam; Sato, Yusuke; Nishizawa, Seiichi; Teramae, Norio

2010-06-01

We explain here the various non-covalent interactions which are responsible for the different binding modes of a small ligand with DNA. A combination of experimental and theoretical methods was used. The interaction of amiloride with thymine was found to depend on the bases flanking the AP site, and different binding modes were observed for different flanking bases. Molecular modeling, absorption studies and binding constant measurements support the different binding patterns. The flanking base-dependent recognition of AP site phosphates was investigated by (31)P NMR experiments. The thermodynamics of the ligand-nucleotide interaction was demonstrated by isothermal titration calorimetry. The emission behavior of amiloride was found to depend on the bases flanking the AP site. Amiloride photophysics in the context of AP site-containing DNA was investigated by time-dependent density functional theory. Flanking bases affect the ground and excited electronic states of amiloride when it binds to the AP site, which causes flanking base-dependent fluorescence signaling. The various non-covalent interactions have been well characterized for the determination of nucleic acid structure and dynamics, and protein-DNA interactions. However, they are not clear for DNA-small molecule interactions, and we believe that our studies will bring new insight into such phenomena. Copyright 2010 Elsevier B.V. All rights reserved.
Translocation and gross deletion breakpoints in human inherited disease and cancer II: Potential involvement of repetitive sequence elements in secondary structure formation between DNA ends.

Science.gov (United States)

Chuzhanova, Nadia; Abeysinghe, Shaun S; Krawczak, Michael; Cooper, David N

2003-09-01

Translocations and gross deletions are responsible for a significant proportion of both cancer and inherited disease. Although such gene rearrangements are nonuniformly distributed in the human genome, the underlying mutational mechanisms remain unclear. We have studied the potential involvement of various types of repetitive sequence elements in the formation of secondary structure intermediates between the single-stranded DNA ends that recombine during rearrangements. Complexity analysis was used to assess the potential of these ends to form secondary structures, the maximum decrease in complexity consequent to a gross rearrangement being used as an indicator of the type of repeat and the specific DNA ends involved. A total of 175 pairs of deletion/translocation breakpoint junction sequences available from the Gross Rearrangement Breakpoint Database [GRaBD; www.uwcm.ac.uk/uwcm/mg/grabd/grabd.html] were analyzed. Potential secondary structure was noted between the 5' flanking sequence of the first breakpoint and the 3' flanking sequence of the second breakpoint in 49% of rearrangements and between the 5' flanking sequence of the second breakpoint and the 3' flanking sequence of the first breakpoint in 36% of rearrangements. Inverted repeats, inversions of inverted repeats, and symmetric elements were found in association with gross rearrangements at approximately the same frequency. However, inverted repeats and inversions of inverted repeats accounted for the vast majority (83%) of deletions plus small insertions, symmetric elements for one-half of all antigen receptor-mediated translocations, while direct repeats appear only to be involved in mediating simple deletions. These findings extend our understanding of illegitimate recombination by highlighting the importance of secondary structure formation between single-stranded DNA ends at breakpoint junctions. Copyright 2003 Wiley-Liss, Inc.
Multitrait, Random Regression, or Simple Repeatability Model in High-Throughput Phenotyping Data Improve Genomic Prediction for Wheat Grain Yield.

Science.gov (United States)

Sun, Jin; Rutkoski, Jessica E; Poland, Jesse A; Crossa, José; Jannink, Jean-Luc; Sorrells, Mark E

2017-07-01

High-throughput phenotyping (HTP) platforms can be used to measure traits that are genetically correlated with wheat (Triticum aestivum L.) grain yield across time. Incorporating such secondary traits in the multivariate pedigree and genomic prediction models would be desirable to improve indirect selection for grain yield. In this study, we evaluated three statistical models, simple repeatability (SR), multitrait (MT), and random regression (RR), for the longitudinal data of secondary traits and compared the impact of the proposed models for secondary traits on their predictive abilities for grain yield. Grain yield and secondary traits, canopy temperature (CT) and normalized difference vegetation index (NDVI), were collected in five diverse environments for 557 wheat lines with available pedigree and genomic information. A two-stage analysis was applied for pedigree and genomic selection (GS). First, secondary traits were fitted by SR, MT, or RR models, separately, within each environment. Then, best linear unbiased predictions (BLUPs) of secondary traits from the above models were used in the multivariate prediction models to compare predictive abilities for grain yield. Predictive ability was substantially improved by 70%, on average, from multivariate pedigree and genomic models when including secondary traits in both training and test populations. Additionally, (i) predictive abilities slightly varied for MT, RR, or SR models in this data set, (ii) results indicated that including BLUPs of secondary traits from the MT model was the best in severe drought, and (iii) the RR model was slightly better than SR and MT models under drought environment. Copyright © 2017 Crop Science Society of America.
Genus-specific protein binding to the large clusters of DNA repeats (short regularly spaced repeats) present in Sulfolobus genomes

DEFF Research Database (Denmark)

Peng, Xu; Brügger, Kim; Shen, Biao

2003-01-01

terminally modified and corresponds to SSO454, an open reading frame of previously unassigned function. It binds specifically to DNA fragments carrying double and single repeat sequences, binding on one side of the repeat structure, and producing an opening of the opposite side of the DNA structure. It also...... recognizes both main families of repeat sequences in S. solfataricus. The recombinant protein, expressed in Escherichia coli, showed the same binding properties to the SRSR repeat as the native one. The SSO454 protein exhibits a tripartite internal repeat structure which yields a good sequence match...... with a helix-turn-helix DNA-binding motif. Although this putative motif is shared by other archaeal proteins, orthologs of SSO454 were only detected in species within the Sulfolobus genus and in the closely related Acidianus genus. We infer that the genus-specific protein induces an opening of the structure...
Telomere maintenance through recruitment of internal genomic regions.

Science.gov (United States)

Seo, Beomseok; Kim, Chuna; Hills, Mark; Sung, Sanghyun; Kim, Hyesook; Kim, Eunkyeong; Lim, Daisy S; Oh, Hyun-Seok; Choi, Rachael Mi Jung; Chun, Jongsik; Shim, Jaegal; Lee, Junho

2015-09-18

Cells surviving crisis are often tumorigenic and their telomeres are commonly maintained through the reactivation of telomerase. However, surviving cells occasionally activate a recombination-based mechanism called alternative lengthening of telomeres (ALT). Here we establish stably maintained survivors in telomerase-deleted Caenorhabditis elegans that escape from sterility by activating ALT. ALT survivors trans-duplicate an internal genomic region, which is already cis-duplicated to chromosome ends, across the telomeres of all chromosomes. These 'Template for ALT' (TALT) regions consist of a block of genomic DNA flanked by telomere-like sequences, and differ between two genetic backgrounds. We establish a model in which an ancestral duplication of a donor TALT region to a proximal telomere region forms a genomic reservoir ready to be incorporated into telomeres on ALT activation.
Analysis of the A genome genetic diversity among Brassica napus, B. rapa and B. juncea accessions using specific simple sequence repeat markers

International Nuclear Information System (INIS)

Tian, H.; Yan, J.; Zhang, R.; Guo, Y.; Hu, S.; Channa, S.A.

2017-01-01

This investigation was aimed at evaluating the genetic diversity of 127 accessions among Brassica napus, B. rapa, and B. juncea by using 15 pairs of A genome-specific simple sequence repeat primers. These 127 accessions could be clearly separated into three groups by cluster analysis, principal component analysis, and population structure analysis, and the results of the three methods were very similar. Group I comprised mainly B. napus accessions, most of the B. juncea accessions formed Group II, and Group III included nearly all of the B. rapa accessions. The results showed that 36.86% of the variance was due to significant differences among populations of species, indicating that abundant genetic diversity exists in the A genome of B. napus, B. rapa, and B. juncea accessions. Some elite genes can be used to broaden the genetic base of these species, especially B. napus, in future rapeseed breeding programs. (author)
Assembly of Repeat Content Using Next Generation Sequencing Data

Energy Technology Data Exchange (ETDEWEB)

LaButti, Kurt; Kuo, Alan; Grigoriev, Igor; Copeland, Alex

2014-03-17

Repetitive genomes pose a challenge for short read assembly, and typically only unique regions and repeat regions shorter than the read length can be accurately assembled. Recently, we have been investigating the use of Pacific Biosciences reads for de novo fungal assembly. We will present an assessment of the quality and degree of repeat reconstruction possible in a fungal genome using long read technology. We will also compare differences in assembly of repeat content using short read and long read technology.
The complete genome sequencing of Prevotella intermedia strain OMA14 and a subsequent fine-scale, intra-species genomic comparison reveal an unusual amplification of conjugative and mobile transposons and identify a novel Prevotella-lineage-specific repeat.

Science.gov (United States)

Naito, Mariko; Ogura, Yoshitoshi; Itoh, Takehiko; Shoji, Mikio; Okamoto, Masaaki; Hayashi, Tetsuya; Nakayama, Koji

2016-02-01

Prevotella intermedia is a pathogenic bacterium involved in periodontal diseases. Here, we present the complete genome sequence of a clinical strain, OMA14, of this bacterium along with the results of comparative genome analysis with strain 17 of the same species, whose genome has also been sequenced but not yet fully analysed. The genomes of both strains consist of two circular chromosomes: the larger chromosomes are similar in size and exhibit a high overall linearity of gene organization, whereas the smaller chromosomes show a significant size variation and have undergone remarkable genome rearrangements. Unique features of the Pre. intermedia genomes are the presence of a remarkable number of essential genes on the second chromosomes and the abundance of conjugative and mobilizable transposons (CTns and MTns). The CTns/MTns are particularly abundant in the second chromosomes, are involved in their extensive genome rearrangement, and have introduced a number of strain-specific genes into each strain. We also found a novel 188-bp repeat sequence that has been highly amplified in Pre. intermedia and is specifically distributed among the Pre. intermedia-related species. These findings expand our understanding of the genetic features of Pre. intermedia and the roles of CTns and MTns in the evolution of bacteria. © The Author 2015. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Flank solar wind interaction. Annual report, June 1991-July 1992

International Nuclear Information System (INIS)

Moses, S.L.; Greenstadt, E.W.

1992-08-01

This report summarizes the results of the first 12 months of our program to study the interaction of the Earth's magnetosphere with the solar wind on the far flanks of the bow shock. This study employs data from the ISEE-3 spacecraft during its traversals of the Earth's magnetotail and correlative data from spacecraft monitoring the solar wind upstream. Our main effort to date has involved assembling data sets and developing new plotting programs. Two talks were given at the Spring Meeting of the American Geophysical Union describing our initial results from analyzing data from the far flank foreshock and magnetosheath. The following sections summarize our results.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Directory of Open Access Journals (Sweden)

Huaiyong Luo

Full Text Available The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers was established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Genome-Wide Analysis of Simple Sequence Repeats and Efficient Development of Polymorphic SSR Markers Based on Whole Genome Re-Sequencing of Multiple Isolates of the Wheat Stripe Rust Fungus.

Science.gov (United States)

Luo, Huaiyong; Wang, Xiaojie; Zhan, Gangming; Wei, Guorong; Zhou, Xinli; Zhao, Jing; Huang, Lili; Kang, Zhensheng

2015-01-01

The biotrophic parasitic fungus Puccinia striiformis f. sp. tritici (Pst) causes stripe rust, a devastating disease of wheat, endangering global food security. Because the Pst population is highly dynamic, it is difficult to develop wheat cultivars with durable and highly effective resistance. Simple sequence repeats (SSRs) are widely used as molecular markers in genetic studies to determine population structure in many organisms. However, only a small number of SSR markers have been developed for Pst. In this study, a total of 4,792 SSR loci were identified using the whole genome sequences of six isolates from different regions of the world, with a marker density of one SSR per 22.95 kb. The majority of the SSRs were di- and tri-nucleotide repeats. A database containing 1,113 SSR markers was established. Through in silico comparison, the previously reported SSR markers were found mainly in exons, whereas the SSR markers in the database were mostly in intergenic regions. Furthermore, 105 polymorphic SSR markers were confirmed in silico by their identical positions and nucleotide variations with INDELs identified among the six isolates. When 104 in silico polymorphic SSR markers were used to genotype 21 Pst isolates, 84 produced the target bands, and 82 of them were polymorphic and revealed the genetic relationships among the isolates. The results show that whole genome re-sequencing of multiple isolates provides an ideal resource for developing SSR markers, and the newly developed SSR markers are useful for genetic and population studies of the wheat stripe rust fungus.
Wear evaluation of flank in burins of high speed steel modified with titanium ions

Science.gov (United States)

E Caballero, J.; V-Niño, E. D.

2017-12-01

This report shows the results obtained researching the flank wearing resistance performed by the high-speed steel (HSS) burins without any surface treatment (reference substrate) and others with surface treatment based on Titanium ions. The flank wearing was carried out by means of an industrial process by chip removal with repetitive tests of dry finished turning of AISI/SAE 1045 steel bars. The useful service life of the burins was evaluated according to ISO 3685:1993, and it was found that the burins treated with Titanium ions showed an increase in the flank wearing resistance with respect to the ones used as reference.
Identifying uniformly mutated segments within repeats.

Science.gov (United States)

Sahinalp, S Cenk; Eichler, Evan; Goldberg, Paul; Berenbrink, Petra; Friedetzky, Tom; Ergun, Funda

2004-12-01

Given a long string of characters from a constant size alphabet we present an algorithm to determine whether its characters have been generated by a single i.i.d. random source. More specifically, consider all possible n-coin models for generating a binary string S, where each bit of S is generated via an independent toss of one of the n coins in the model. The choice of which coin to toss is decided by a random walk on the set of coins where the probability of a coin change is much lower than the probability of using the same coin repeatedly. We present a procedure to evaluate the likelihood of an n-coin model for given S, subject to a uniform prior distribution over the parameters of the model (that represent mutation rates and probabilities of copying events). In the absence of detailed prior knowledge of these parameters, the algorithm can be used to determine whether the a posteriori probability for n=1 is higher than for any other n>1. Our algorithm runs in time O(l^4 log l), where l is the length of S, through a dynamic programming approach which exploits the assumed convexity of the a posteriori probability for n. Our test can be used in the analysis of long alignments between pairs of genomic sequences in a number of ways. For example, functional regions in genome sequences exhibit much lower mutation rates than non-functional regions. Because our test provides means for determining variations in the mutation rate, it may be used to distinguish functional regions from non-functional ones. Another application is in determining whether two highly similar, thus evolutionarily related, genome segments are the result of a single copy event or of a complex series of copy events. This is particularly an issue in evolutionary studies of genome regions rich with repeat segments (especially tandemly repeated segments).
Aberrant splicing in transgenes containing introns, exons, and V5 epitopes: lessons from developing an FSHD mouse model expressing a D4Z4 repeat with flanking genomic sequences.

Directory of Open Access Journals (Sweden)

Eugénie Ansseau

Full Text Available The DUX4 gene, encoded within D4Z4 repeats on human chromosome 4q35, has recently emerged as a key factor in the pathogenic mechanisms underlying Facioscapulohumeral muscular dystrophy (FSHD). This recognition prompted development of animal models expressing the DUX4 open reading frame (ORF) alone or embedded within D4Z4 repeats. In the first published model, we used adeno-associated viral vectors (AAV) and strong viral control elements (CMV promoter, SV40 poly A) to demonstrate that the DUX4 cDNA caused dose-dependent toxicity in mouse muscles. As a follow-up, we designed a second generation of DUX4-expressing AAV vectors to more faithfully genocopy the FSHD-permissive D4Z4 repeat region located at 4q35. This new vector (called AAV.D4Z4.V5.pLAM) contained the D4Z4/DUX4 promoter region, a V5 epitope-tagged DUX4 ORF, and the natural 3' untranslated region (pLAM) harboring two small introns, DUX4 exons 2 and 3, and the non-canonical poly A signal required for stabilizing DUX4 mRNA in FSHD. AAV.D4Z4.V5.pLAM failed to recapitulate the robust pathology of our first generation vectors following delivery to mouse muscle. We found that the DUX4.V5 junction sequence created an unexpected splice donor in the pre-mRNA that was preferentially utilized to remove the V5 coding sequence and DUX4 stop codon, yielding non-functional DUX4 protein with 55 additional residues on its carboxyl-terminus. Importantly, we further found that aberrant splicing could occur in any expression construct containing a functional splice acceptor and sequences resembling minimal splice donors. Our findings represent an interesting case study with respect to AAV.D4Z4.V5.pLAM, but more broadly serve as a note of caution for designing constructs containing V5 epitope tags and/or transgenes with downstream introns and exons.

C9orf72 hexanucleotide repeat expansions in Chinese sporadic amyotrophic lateral sclerosis.

Science.gov (United States)

He, Ji; Tang, Lu; Benyamin, Beben; Shah, Sonia; Hemani, Gib; Liu, Rong; Ye, Shan; Liu, Xiaolu; Ma, Yan; Zhang, Huagang; Cremin, Katie; Leo, Paul; Wray, Naomi R; Visscher, Peter M; Xu, Huji; Brown, Matthew A; Bartlett, Perry F; Mangelsdorf, Marie; Fan, Dongsheng

2015-09-01

A hexanucleotide repeat expansion (HRE) in the C9orf72 gene has been identified as the most common mutation in amyotrophic lateral sclerosis (ALS) among Caucasian populations. We sought to comprehensively evaluate genetic and epigenetic variants of C9orf72 and the contribution of the HRE in Chinese ALS cases. We performed fragment-length and repeat-primed polymerase chain reaction to determine GGGGCC copy number and expansion within the C9orf72 gene in 1092 sporadic ALS (sALS) and 1062 controls from China. We performed haplotype analysis of 23 single-nucleotide polymorphisms within and surrounding C9orf72. The C9orf72 HRE was found in 3 sALS patients (0.3%) but not in control subjects (p = 0.25). For 2 of the cases with the HRE, genotypes of 8 single-nucleotide polymorphisms flanking the HRE were inconsistent with the haplotype reported to be strongly associated with ALS in Caucasian populations. For these 2 individuals, we found hypermethylation of the CpG island upstream of the repeat, an observation not detected in other sALS patients (p HRE were highly associated with repeat lengths >8 repeats implying that both haplotypes may confer instability of repeat length. Copyright © 2015 Elsevier Inc. All rights reserved.
Complete chloroplast genome sequence of a major economic species, Ziziphus jujuba (Rhamnaceae).

Science.gov (United States)

Ma, Qiuyue; Li, Shuxian; Bi, Changwei; Hao, Zhaodong; Sun, Congrui; Ye, Ning

2017-02-01

Ziziphus jujuba is an important woody plant with high economic and medicinal value. Here, we analyzed and characterized the complete chloroplast (cp) genome of Z. jujuba, the first member of the Rhamnaceae family for which the chloroplast genome sequence has been reported. We also built a web browser for navigating the cp genome of Z. jujuba (http://bio.njfu.edu.cn/gb2/gbrowse/Ziziphus_jujuba_cp/). Sequence analysis showed that this cp genome is 161,466 bp long and has a typical quadripartite structure of large (LSC, 89,120 bp) and small (SSC, 19,348 bp) single-copy regions separated by a pair of inverted repeats (IRs, 26,499 bp). The sequence contained 112 unique genes, including 78 protein-coding genes, 30 transfer RNAs, and four ribosomal RNAs. The genome structure, gene order, GC content, and codon usage are similar to those of other typical angiosperm cp genomes. A total of 38 tandem repeats, two forward repeats, and three palindromic repeats were detected in the Z. jujuba cp genome. Simple sequence repeat (SSR) analysis revealed that most SSRs were AT-rich. The homopolymer regions in the cp genome of Z. jujuba were verified and manually corrected by Sanger sequencing. One-third of mononucleotide repeats were found to be erroneously sequenced by 454 pyrosequencing, which yielded sequences 1-4 bases shorter than those obtained by Sanger sequencing. Analyzing the cp genome of Z. jujuba revealed that the IR contraction and expansion events resulted in ycf1 and rps19 pseudogenes. A phylogenetic analysis based on 64 protein-coding genes showed that Z. jujuba was closely related to members of the Elaeagnaceae family, which will be helpful for phylogenetic studies of other Rosales species. The complete cp genome sequence of Z. jujuba will facilitate population, phylogenetic, and cp genetic engineering studies of this economic plant.
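The region sizes quoted in this abstract can be checked against the stated genome length, since a quadripartite chloroplast genome consists of one LSC, one SSC and two IR copies. The short sketch below does that arithmetic; the function name `quadripartite_length` is illustrative only, and the numbers are taken directly from the abstract.

```python
def quadripartite_length(lsc, ssc, ir):
    """Total length of a quadripartite plastid genome in bp.

    The inverted repeat is counted twice because the genome carries
    two copies (often labelled IRa and IRb).
    """
    return lsc + ssc + 2 * ir


# Region sizes reported for the Z. jujuba cp genome.
total = quadripartite_length(lsc=89_120, ssc=19_348, ir=26_499)
print(total)  # 161466, matching the reported 161,466 bp
```

The sum agrees with the reported total, which also confirms that the 26,499 bp figure refers to a single IR copy rather than to the pair.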
Lava-flow hazard on the SE flank of Mt. Etna (Southern Italy)

Science.gov (United States)

Crisci, G. M.; Iovine, G.; Di Gregorio, S.; Lupiano, V.

2008-11-01

A method for mapping lava-flow hazard on the SE flank of Mt. Etna (Sicily, Southern Italy) by applying the Cellular Automata model SCIARA-fv is described, together with the techniques of calibration and validation employed, which use a parallel Genetic Algorithm. The study area is partly urbanised; it has repeatedly been affected by lava flows from flank eruptions in historical time, and shows evidence of a dominant SSE-trending fracture system. Moreover, a dormant deep-seated gravitational deformation, associated with a larger volcano-tectonic phenomenon, affects the whole south-eastern flank of the volcano. The Etnean 2001 Mt. Calcarazzi lava-flow event has been selected for model calibration, while validation has been performed by considering the 2002 Linguaglossa and the 1991-93 Valle del Bove events, since suitable data for back analysis are available for these recent eruptions. Quantitative evaluation of the simulations, with respect to the real events, has been performed by means of two fitness functions, which consider either the areas affected by the lava flows, or areas and eruption duration. Sensitivity analyses are in progress for thoroughly evaluating the role of parameters, topographic input data, and mesh geometry on model performance; however, preliminary results have already given encouraging indications of model robustness. In order to evaluate lava-flow hazard in the study area, a regular grid of 340 possible vents, uniformly covering the study area and located at 500 m intervals, has been hypothesised. For each vent, a statistically significant number of simulations has been planned, by adopting combinations of durations, lava volumes, and effusion-rate functions, selected by considering available volcanological data. Performed simulations have been stored in a GIS environment for successive analyses and map elaboration. Probabilities of activation, empirically based on past behaviour of the volcano, can be assigned to each vent of the grid, by
MSDB: A Comprehensive Database of Simple Sequence Repeats.

Science.gov (United States)

Avvaru, Akshay Kumar; Saxena, Saketh; Sowpati, Divya Tej; Mishra, Rakesh Kumar

2017-06-01

Microsatellites, also known as Simple Sequence Repeats (SSRs), are short tandem repeats of 1-6 nt motifs present in all genomes, particularly eukaryotes. Besides their usefulness as genome markers, SSRs have been shown to perform important regulatory functions, and variations in their length at coding regions are linked to several disorders in humans. Microsatellites show a taxon-specific enrichment in eukaryotic genomes, and some may be functional. MSDB (Microsatellite Database) is a collection of >650 million SSRs from 6,893 species including Bacteria, Archaea, Fungi, Plants, and Animals. This database is by far the most exhaustive resource to access and analyze SSR data of multiple species. In addition to exploring data in a customizable tabular format, users can view and compare the data of multiple species simultaneously using our interactive plotting system. MSDB is developed using the Django framework and MySQL. It is freely available at http://tdb.ccmb.res.in/msdb. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
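To make the definition in this abstract concrete (SSRs as short tandem repeats of 1-6 nt motifs), here is a minimal regex-based scanner. It is only an illustrative sketch and not the method used to build MSDB; the function name `find_ssrs` and the `min_repeats` threshold are assumptions chosen for the example.

```python
import re


def find_ssrs(seq, min_repeats=3, max_motif=6):
    """Return (start, motif, repeat_count) for perfect tandem repeats.

    Hypothetical helper: scans for motifs of 1..max_motif nt repeated at
    least min_repeats times in a row. Only A/C/G/T are considered, so
    ambiguous bases such as N never form part of a reported repeat.
    """
    # Lazy motif group so the shortest repeating unit is tried first
    # (e.g. 'AT' rather than 'ATAT'); the backreference then requires
    # at least (min_repeats - 1) further copies of that unit.
    pattern = re.compile(
        r"([ACGT]{1,%d}?)\1{%d,}" % (max_motif, min_repeats - 1)
    )
    hits = []
    for match in pattern.finditer(seq.upper()):
        motif = match.group(1)
        hits.append((match.start(), motif, len(match.group(0)) // len(motif)))
    return hits


# Toy sequence with one dinucleotide repeat: (AT)4 starting at index 2.
print(find_ssrs("GGATATATATCC"))
```

Real SSR pipelines usually apply motif-length-specific thresholds (for example, more repeats required for mononucleotide runs than for hexanucleotides) and may tolerate imperfect repeats; this sketch applies a single threshold to perfect repeats only.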
Human platelet glycoprotein IX: An adhesive prototype of leucine-rich glycoproteins with flank-center-flank structures

International Nuclear Information System (INIS)

Hickey, M.J.; Williams, S.A.; Roth, G.J.

1989-01-01

The glycoprotein (GP) Ib-IX complex on the surface of human platelets functions as the von Willebrand factor receptor and mediates von Willebrand factor-dependent platelet adhesion to blood vessels. GPIX is a relatively small (Mr 17,000) protein that may provide for membrane insertion and orientation of the larger component of the complex, GPIb (Mr 165,000). Using antibody screening, the authors cloned a cDNA encoding GPIX from a human erythroleukemia cell cDNA library constructed in phage λgt11. Lacking a 5' untranslated region and start codon, the cDNA sequence includes 604 nucleotides, beginning with 495 bases at the 5' end coding for 165 amino acids, followed by a stop codon and 106 noncoding bases at the 3' end. By Northern blot analysis, the GPIX cDNA hybridizes with a single 1.0-kilobase species of platelet poly(A)+ RNA. Translation of the cDNA sequence gives a predicted protein sequence beginning with a truncated putative signal sequence of 5 amino acids followed by a sequence of 17 amino acids matching that determined directly by Edman degradation of intact GPIX. GPIX contains a leucine-rich glycoprotein (LRG) sequence of 24 amino acids similar to conserved LRG sequences in GPIb and other proteins from humans, Drosophila, and yeast. The role of the flank-LRG center-flank structure in the evolution and function of the LRG proteins remains to be defined.
The complete chloroplast genome sequence of Podocarpus lambertii: genome structure, evolutionary aspects, gene content and SSR detection.

Directory of Open Access Journals (Sweden)

Leila do Nascimento Vieira

Full Text Available BACKGROUND: Podocarpus lambertii (Podocarpaceae) is a native conifer from the Brazilian Atlantic Forest Biome, which is considered one of the 25 biodiversity hotspots in the world. The advancement of next-generation sequencing technologies has enabled the rapid acquisition of whole chloroplast (cp) genome sequences at low cost. Several studies have proven the potential of cp genomes as tools to understand enigmatic and basal phylogenetic relationships at different taxonomic levels, as well as further probe the structural and functional evolution of plants. In this work, we present the complete cp genome sequence of P. lambertii. METHODOLOGY/PRINCIPAL FINDINGS: The P. lambertii cp genome is 133,734 bp in length, and similar to other sequenced cupressophytes, it lacks one of the large inverted repeat regions (IR). It contains 118 unique genes and one duplicated tRNA (trnN-GUU), which occurs as an inverted repeat sequence. The rps16 gene was not found, as previously reported for the plastid genomes of another Podocarpaceae (Nageia nagi) and an Araucariaceae (Agathis dammara). Structurally, P. lambertii shows 4 inversions of a large DNA fragment ∼20,000 bp compared to the Podocarpus totara cp genome. These unexpected characteristics may be attributed to geographical distance and different adaptive needs. The P. lambertii cp genome presents a total of 28 tandem repeats and 156 SSRs, with homo- and dipolymers being the most common and tri-, tetra-, penta-, and hexapolymers occurring with less frequency. CONCLUSION: The complete cp genome sequence of P. lambertii revealed significant structural changes, even in species from the same genus. These results reinforce the apparent loss of the rps16 gene in Podocarpaceae cp genomes. In addition, several SSRs in the P. lambertii cp genome are likely intraspecific polymorphism sites, which may allow highly sensitive phylogeographic and population structure studies, as well as phylogenetic studies of species of
Method for Friction Force Estimation on the Flank of Cutting Tools

Directory of Open Access Journals (Sweden)

Luis Huerta

2017-01-01

Full Text Available Friction forces are present in any machining process. These forces could play an important role in the dynamics of the system. In the cutting process, friction is mainly present on the rake face and the flank of the tool. Although friction on the rake face has the major influence, friction on the flank can also become important and could affect the stability of the system. In this work, experimental identification of the friction on the flank is presented. The experimental determination was carried out by machining aluminum samples in a CNC lathe. As a result, two friction functions were obtained as a function of the cutting speed and the relative motion of the contact elements. Experiments using a worn and a new insert were carried out. Force and acceleration were recorded simultaneously and, from these results, different friction levels were observed depending on the cutting parameters, such as cutting speed, feed rate, and tool condition. Finally, a friction model for the flank friction is presented.
Metagenome sequencing and 98 microbial genomes from Juan de Fuca Ridge flank subsurface fluids

Science.gov (United States)

Jungbluth, Sean P.; Amend, Jan P.; Rappé, Michael S.

2017-03-01

The global deep subsurface biosphere is one of the largest reservoirs for microbial life on our planet. This study takes advantage of new sampling technologies and couples them with improvements to DNA sequencing and associated informatics tools to reconstruct the genomes of uncultivated Bacteria and Archaea from fluids collected deep within the Juan de Fuca Ridge subseafloor. Here, we generated two metagenomes from borehole observatories located 311 meters apart and, using binning tools, retrieved 98 genomes from metagenomes (GFMs). Of the GFMs, 31 were estimated to be >90% complete, while an additional 17 were >70% complete. Phylogenomic analysis revealed 53 bacterial and 45 archaeal GFMs, of which nearly all were distantly related to known cultivated isolates. In the GFMs, abundant Bacteria included Chloroflexi, Nitrospirae, Acetothermia (OP1), EM3, Aminicenantes (OP8), Gammaproteobacteria, and Deltaproteobacteria, while abundant Archaea included Archaeoglobi, Bathyarchaeota (MCG), and Marine Benthic Group E (MBG-E). These data are the first GFMs reconstructed from the deep basaltic subseafloor biosphere, and provide a dataset available for further interrogation.
Optimization of sequence alignment for simple sequence repeat regions

Directory of Open Access Journals (Sweden)

Ogbonnaya Francis C

2011-07-01

Full Text Available Abstract Background Microsatellites, or simple sequence repeats (SSRs), are tandemly repeated DNA sequences, including tandem copies of specific sequences no longer than six bases, that are distributed in the genome. SSRs have been used as molecular markers because they are easy to detect and are used in a range of applications, including genetic diversity, genome mapping, and marker assisted selection. They are also very mutable because of slipping of the DNA polymerase during DNA replication. This unique mutation increases the insertion/deletion (INDEL) mutation frequency to a high ratio - more than other types of molecular markers such as single nucleotide polymorphisms (SNPs). SNPs are more frequent than INDELs. Therefore, all designed algorithms for sequence alignment fit the vast majority of the genomic sequence without considering microsatellite regions as unique sequences that require special consideration. The old algorithm is limited in its application because there are many overlaps between different repeat units which result in false evolutionary relationships. Findings To overcome the limitation of the aligning algorithm when dealing with SSR loci, a new algorithm was developed using PERL script with a Tk graphical interface. This program is based on aligning sequences after first determining the repeated units and the positions of the last SSR nucleotides. This results in a shifting process according to the inserted repeated unit type. When studying the phylogenetic relations before and after applying the new algorithm, many differences in the trees were obtained by increasing the SSR length and complexity. However, less distance between different lineages was observed after applying the new algorithm. Conclusions The new algorithm produces better estimates for aligning SSR loci because it reflects more reliable evolutionary relations between different lineages. It reduces overlapping during SSR alignment, which results in a more realistic
Genome-wide identification, sequence characterization, and protein-protein interaction properties of DDB1 (damaged DNA binding protein-1)-binding WD40-repeat family members in Solanum lycopersicum.

Science.gov (United States)

Zhu, Yunye; Huang, Shengxiong; Miao, Min; Tang, Xiaofeng; Yue, Junyang; Wang, Wenjie; Liu, Yongsheng

2015-06-01

One hundred DDB1 (damaged DNA binding protein-1)-binding WD40-repeat domain (DWD) family genes were identified in the S. lycopersicum genome. The DWD genes encode proteins presumably functioning as the substrate recognition subunits of the cullin4-ring ubiquitin E3 ligase complex. These findings provide candidate genes and a research platform for further gene functionality and molecular breeding study. A subclass of DDB1 (damaged DNA binding protein-1)-binding WD40-repeat domain (DWD) family proteins has been demonstrated to function as the substrate recognition subunits of the cullin4-ring ubiquitin E3 ligase complex. However, little information is available about the cognate subfamily genes in tomato (S. lycopersicum). In this study, based on the recently released tomato genome sequences, 100 tomato genes encoding DWD proteins that potentially interact with DDB1 were identified and characterized, including analyses of the detailed annotations, chromosome locations and compositions of conserved amino acid domains. In addition, a phylogenetic tree of the subfamily genes, which comprises three main groups, was constructed. The physical interaction between tomato DDB1 and 14 representative DWD proteins was determined by yeast two-hybrid and co-immunoprecipitation assays. The subcellular localization of these 14 representative DWD proteins was determined. Six of them were localized in both nucleus and cytoplasm, seven proteins exclusively in cytoplasm, and one protein either in nucleus and cytoplasm, or exclusively in cytoplasm. Comparative genomic analysis demonstrated that the expansion of these subfamily members in tomato predominantly resulted from two whole-genome triplication events in the evolutionary history.
The hamster flank organ model: Is it relevant to man

International Nuclear Information System (INIS)

Franz, T.J.; Lehman, P.A.; Pochi, P.; Odland, G.F.; Olerud, J.

1989-01-01

The critical role that androgens play in the etiology of acne has led to a search for topically active antiandrogens and the frequent use of the flank organ of the golden Syrian hamster as an animal model. 17-alpha-propyltestosterone (17-PT) has been identified as having potent antiandrogenic activity in the hamster model, and this report describes its clinical evaluation. Two double-blind placebo controlled studies comparing 4% 17-PT in 80% alcohol versus vehicle alone were conducted. One study examined 17-PT sebosuppressive activity in 20 subjects. The second study examined its efficacy in 44 subjects having mild to moderate acne. A third study measured in vitro percutaneous absorption of 17-PT through hamster flank and monkey skin, and human face skin in-vivo, using radioactive drug. 17-PT was found to be ineffective in reducing either the sebum excretion rate or the number of inflammatory acne lesions. Failure of 17-PT to show clinical activity was not a result of poor percutaneous absorption. Total absorption in man was 7.7% of the dose and only 1.0% in the hamster. The sebaceous gland of hamster flank organ is apparently more sensitive to antiandrogens than the human sebaceous gland
Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing.

Science.gov (United States)

Straub, Shannon C K; Fishbein, Mark; Livshultz, Tatyana; Foster, Zachary; Parks, Matthew; Weitemier, Kevin; Cronn, Richard C; Liston, Aaron

2011-05-04

Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and complement these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species and its relatives. This study represents a first
ChloroSSRdb: a repository of perfect and imperfect chloroplastic simple sequence repeats (cpSSRs) of green plants.

Science.gov (United States)

Kapil, Aditi; Rai, Piyush Kant; Shanker, Asheesh

2014-01-01

Simple sequence repeats (SSRs) are regions in a DNA sequence that contain repeating motifs of length 1-6 nucleotides. These repeats are ubiquitous and are found in both coding and non-coding regions of the genome. A total of 534 complete chloroplast genome sequences (as of 18 September 2014) of Viridiplantae are available at the NCBI organelle genome resource. This provides an opportunity to mine these genomes for SSRs and store them in the form of a database. In an attempt to properly manage and retrieve chloroplastic SSRs, we designed ChloroSSRdb, which is a relational database developed using SQL Server 2008 and accessed through ASP.NET. It provides information on all three types (perfect, imperfect and compound) of SSRs. At present, ChloroSSRdb contains 124 430 mined SSRs, with the majority lying in non-coding regions. Out of these, PCR primers were designed for 118 249 SSRs. Tetranucleotide repeats (47 079) were found to be the most frequent repeat type, whereas hexanucleotide repeats (6414) were the least abundant. Additionally, in each species statistical analyses were performed to calculate relative frequency, correlation coefficient and chi-square statistics of perfect and imperfect SSRs. In accordance with the growing interest in SSR studies, ChloroSSRdb will prove to be a useful resource in developing genetic markers, phylogenetic analysis, genetic mapping, etc. Moreover, it will serve as a ready reference for mined SSRs in available chloroplast genomes of green plants. Database URL: www.compubio.in/chlorossrdb/ © The Author(s) 2014. Published by Oxford University Press.
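The definition of a perfect SSR given in this record (a motif of 1-6 nucleotides repeated in tandem) can be illustrated with a short Python sketch. This is not the mining pipeline behind ChloroSSRdb, which the abstract does not describe; the function name `find_perfect_ssrs` and the minimum repeat counts in `MIN_REPEATS` are assumptions chosen only for illustration.

```python
import re

# Minimum number of tandem copies per motif length.
# These thresholds are illustrative assumptions, not ChloroSSRdb's settings.
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 3, 6: 3}


def find_perfect_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return sorted (start, motif, copies) tuples for perfect SSRs (motifs of 1-6 nt)."""
    seq = seq.upper()
    hits = []
    for size, min_n in sorted(min_repeats.items()):
        # A motif of `size` bases followed by at least (min_n - 1) exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves built from a shorter unit
            # (e.g. "AA" or "ATAT"), so each tract is reported only once.
            if any(motif == motif[:k] * (size // k)
                   for k in range(1, size) if size % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // size))
    return sorted(hits)


if __name__ == "__main__":
    example = "GGC" + "AT" * 7 + "CCG" + "A" * 12 + "T"
    for start, motif, copies in find_perfect_ssrs(example):
        print(start, motif, copies)
```

Imperfect and compound SSRs, which the database also stores, would need mismatch-tolerant matching and merging of adjacent tracts; neither is attempted here.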
Repeat-associated plasticity in the Helicobacter pylori RD gene family.

Science.gov (United States)

Shak, Joshua R; Dick, Jonathan J; Meinersmann, Richard J; Perez-Perez, Guillermo I; Blaser, Martin J

2009-11-01

The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3' region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5' region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host.
detectIR: a novel program for detecting perfect and imperfect inverted repeats using complex numbers and vector calculation.

Science.gov (United States)

Ye, Congting; Ji, Guoli; Li, Lei; Liang, Chun

2014-01-01

Inverted repeats are present in abundance in both prokaryotic and eukaryotic genomes and can form DNA secondary structures--hairpins and cruciforms that are involved in many important biological processes. Bioinformatics tools for efficient and accurate detection of inverted repeats are desirable, because existing tools are often less accurate and time consuming, sometimes incapable of dealing with genome-scale input data. Here, we present a MATLAB-based program called detectIR for the perfect and imperfect inverted repeat detection that utilizes complex numbers and vector calculation and allows genome-scale data inputs. A novel algorithm is adopted in detectIR to convert the conventional sequence string comparison in inverted repeat detection into vector calculation of complex numbers, allowing non-complementary pairs (mismatches) in the pairing stem and a non-palindromic spacer (loop or gaps) in the middle of inverted repeats. Compared with existing popular tools, our program performs with significantly higher accuracy and efficiency. Using genome sequence data from HIV-1, Arabidopsis thaliana, Homo sapiens and Zea mays for comparison, detectIR can find lots of inverted repeats missed by existing tools whose outputs often contain many invalid cases. detectIR is open source and its source code is freely available at: https://sourceforge.net/projects/detectir.
A Defective mRNA Cleavage and Polyadenylation Complex Facilitates Expansions of Transcribed (GAA)n Repeats Associated with Friedreich’s Ataxia

Directory of Open Access Journals (Sweden)

Ryan J. McGinty

2017-09-01

Full Text Available Expansions of microsatellite repeats are responsible for numerous hereditary diseases in humans, including myotonic dystrophy and Friedreich’s ataxia. Whereas the length of an expandable repeat is the main factor determining disease inheritance, recent data point to genomic trans modifiers that can impact the likelihood of expansions and disease progression. Detection of these modifiers may lead to understanding and treating repeat expansion diseases. Here, we describe a method for the rapid, genome-wide identification of trans modifiers for repeat expansion in a yeast experimental system. Using this method, we found that missense mutations in the endoribonuclease subunit (Ysh1) of the mRNA cleavage and polyadenylation complex dramatically increase the rate of (GAA)n repeat expansions but only when they are actively transcribed. These expansions correlate with slower transcription elongation caused by the ysh1 mutation. These results reveal an interplay between RNA processing and repeat-mediated genome instability, confirming the validity of our approach.
A Thieno[2,3-b]pyridine-Flanked Diketopyrrolopyrrole Polymer as an n-Type Polymer Semiconductor for All-Polymer Solar Cells and Organic Field-Effect Transistors

KAUST Repository

Chen, Hung-Yang

2017-12-28

A novel fused heterocycle-flanked diketopyrrolopyrrole (DPP) monomer, thieno[2,3-b]pyridine diketopyrrolopyrrole (TPDPP), was designed and synthesized. When copolymerized with 3,4-difluorothiophene using Stille coupling polymerization, the new polymer pTPDPP-TF possesses a highly planar conjugated polymer backbone due to the fused thieno[2,3-b]pyridine flanking unit that effectively alleviates the steric hindrance with both the central DPP core and the 3,4-difluorothiophene repeat unit. This new polymer exhibits a high electron affinity (EA) of −4.1 eV and was successfully utilized as an n-type polymer semiconductor for applications in organic field-effect transistors (OFETs) and all polymer solar cells. A promising n-type charge carrier mobility of 0.1 cm² V⁻¹ s⁻¹ was obtained in bottom-contact, top-gate OFETs, and a power conversion efficiency (PCE) of 2.72% with a high open-circuit voltage (VOC) of 1.04 V was achieved for all polymer solar cells using PTB7-Th as the polymer donor.
A Thieno[2,3-b]pyridine-Flanked Diketopyrrolopyrrole Polymer as an n-Type Polymer Semiconductor for All-Polymer Solar Cells and Organic Field-Effect Transistors

KAUST Repository

Chen, Hung-Yang; Nikolka, Mark; Wadsworth, Andrew; Yue, Wan; Onwubiko, Ada; Xiao, Mingfei; White, Andrew J. P.; Baran, Derya; Sirringhaus, Henning; McCulloch, Iain

2017-01-01

A novel fused heterocycle-flanked diketopyrrolopyrrole (DPP) monomer, thieno[2,3-b]pyridine diketopyrrolopyrrole (TPDPP), was designed and synthesized. When copolymerized with 3,4-difluorothiophene using Stille coupling polymerization, the new polymer pTPDPP-TF possesses a highly planar conjugated polymer backbone due to the fused thieno[2,3-b]pyridine flanking unit that effectively alleviates the steric hindrance with both the central DPP core and the 3,4-difluorothiophene repeat unit. This new polymer exhibits a high electron affinity (EA) of −4.1 eV and was successfully utilized as an n-type polymer semiconductor for applications in organic field-effect transistors (OFETs) and all polymer solar cells. A promising n-type charge carrier mobility of 0.1 cm² V⁻¹ s⁻¹ was obtained in bottom-contact, top-gate OFETs, and a power conversion efficiency (PCE) of 2.72% with a high open-circuit voltage (VOC) of 1.04 V was achieved for all polymer solar cells using PTB7-Th as the polymer donor.
Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats

Directory of Open Access Journals (Sweden)

Daniël O. Warmerdam

2016-03-01

Full Text Available rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded breaks in rDNA. Using selective endonucleases, we show that human cells are highly sensitive to breaks in 45S but not the 5S rDNA repeats. We find that homologous recombination inhibits repair of breaks in 45S rDNA, and this results in repeat loss. We identify the structural maintenance of chromosomes protein 5 (SMC5) as contributing to recombination-mediated repair of rDNA breaks. Together, our data demonstrate that SMC5-mediated recombination can lead to error-prone repair of 45S rDNA repeats, resulting in their loss and thereby reducing cellular viability.
The peculiar landscape of repetitive sequences in the olive (Olea europaea L.) genome.

Science.gov (United States)

Barghini, Elena; Natali, Lucia; Cossu, Rosa Maria; Giordani, Tommaso; Pindo, Massimo; Cattonaro, Federica; Scalabrin, Simone; Velasco, Riccardo; Morgante, Michele; Cavallini, Andrea

2014-04-01

Analyzing genome structure in different species provides insight into the evolution of plant genome size. Olive (Olea europaea L.) has a medium-sized haploid genome of 1.4 Gb, whose structure is largely uncharacterized, despite the growing importance of this tree as an oil crop. Next-generation sequencing technologies and different computational procedures have been used to study the composition of the olive genome and its repetitive fraction. A total of 2.03 and 2.3 genome equivalents of Illumina and 454 reads from genomic DNA, respectively, were assembled following different procedures, which produced more than 200,000 differently redundant contigs, with mean length higher than 1,000 nt. Mapping Illumina reads onto the assembled sequences was used to estimate their redundancy. The genome data set was subdivided into highly and medium redundant and nonredundant contigs. By combining identification and mapping of repeated sequences, it was established that tandem repeats represent a very large portion of the olive genome (∼31% of the whole genome), consisting of six main families of different length, two of which were first discovered in these experiments. The other large redundant class in the olive genome is represented by transposable elements (especially long terminal repeat-retrotransposons). On the whole, the results of our analyses show the peculiar landscape of the olive genome, related to the massive amplification of tandem repeats, more than that reported for any other sequenced plant genome.

Optimization of turning process through the analytic flank wear modelling

Science.gov (United States)

Del Prete, A.; Franchi, R.; De Lorenzis, D.

2018-05-01

In the present work, the approach used for the optimization of the process capabilities for Oil&Gas components machining is described. These components are machined by turning of stainless steel casting workpieces. For this purpose, a proper Design Of Experiments (DOE) plan has been designed and executed: as output of the experimentation, data about tool wear have been collected. The DOE has been designed starting from the cutting speed and feed values recommended by the tool manufacturer; the depth of cut has been kept constant. Wear data have been obtained by observing the tool flank wear under an optical microscope: the data acquisition has been carried out at regular intervals of working time. Through statistical and regression analysis of the data, analytical models of the flank wear and the tool life have been obtained. The optimization approach used is a multi-objective optimization, which minimizes the production time and the number of cutting tools used, under a constraint on a defined flank wear level. The technique used to solve the optimization problem is a Multi Objective Particle Swarm Optimization (MOPS). The optimization results, validated by the execution of a further experimental campaign, highlighted the reliability of the work and confirmed the usability of the optimized process parameters and the potential benefit for the company.
Indel Group in Genomes (IGG) Molecular Genetic Markers

Science.gov (United States)

Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger

2016-01-01

Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831
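A k-mer-based search for size-polymorphic regions can be sketched in a few lines of Python. This is not the IGG pipeline itself, whose heuristics are not given in the abstract; it is a toy version under stated assumptions: k-mers that occur exactly once in each genome serve as anchors, and a candidate indel is reported wherever two neighbouring anchors are spaced differently in the two genomes. The names `unique_kmers` and `candidate_indels` and the parameters `min_diff` and `max_span` are hypothetical.

```python
from collections import Counter


def unique_kmers(seq, k):
    """Map each k-mer that occurs exactly once in seq to its position."""
    kmers = [seq[i:i + k] for i in range(len(seq) - k + 1)]
    counts = Counter(kmers)
    return {km: i for i, km in enumerate(kmers) if counts[km] == 1}


def candidate_indels(seq_a, seq_b, k=8, min_diff=3, max_span=1000):
    """Return (pos_a, pos_b, span_a, span_b) where anchor spacing differs.

    pos_a/pos_b are the positions of the left anchor in each genome;
    span_a/span_b are the distances to the next shared anchor.
    """
    a = unique_kmers(seq_a, k)
    b = unique_kmers(seq_b, k)
    # Shared single-copy k-mers, ordered by their position in genome A.
    shared = sorted((a[km], b[km]) for km in a.keys() & b.keys())
    out = []
    for (a1, b1), (a2, b2) in zip(shared, shared[1:]):
        span_a, span_b = a2 - a1, b2 - b1
        if span_b <= 0 or max(span_a, span_b) > max_span:
            continue  # skip rearranged or too-distant anchor pairs
        if abs(span_a - span_b) >= min_diff:
            out.append((a1, b1, span_a, span_b))
    return out


if __name__ == "__main__":
    left, right = "AACCGGTTAC", "GATCTGAGCA"
    print(candidate_indels(left + right, left + "TTTTT" + right, k=4))
```

In a real marker-design workflow the anchors flanking each reported region would then be handed to a primer-design step subject to the user's amplicon size limits; that step is outside this sketch.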
Reconstruction of the eruptive activity on the NE sector of Stromboli volcano: timing of flank eruptions since 15 ka

NARCIS (Netherlands)

Calvari, S.; Branca, S.; Corsaro, R.A.; De Beni, E.; Miraglia, L.; Norini, G.; Wijbrans, J.R.; Boschi, E.

2011-01-01

A multidisciplinary geological and compositional investigation allowed us to reconstruct the occurrence of flank eruptions on the lower NE flank of Stromboli volcano since 15 ka. The oldest flank eruption recognised is Roisa, which occurred at ~15 ka during the Vancori period, and has transitional
[Architecture of the X chromosome, expression of LIM kinase 1, and recombination in the agnostic mutants of Drosophila: a model of human Williams syndrome].

Science.gov (United States)

Savvateeva-Popova, E V; Peresleni, A I; Sharagina, L M; Medvedeva, A V; Korochkina, S E; Grigor'eva, I V; Diuzhikova, N A; Popov, A V; Baricheva, E M; Karagodin, D; Heisenberg, M

2004-06-01

As the Human Genome and Drosophila Genome Projects were completed, it became clear that functions of human disease-associated genes may be elucidated by studying the phenotypic expression of mutations affecting their structural or functional homologs in Drosophila. Genomic diseases were identified as a new class of human disorders. Their cause is recombination, which takes place at gene-flanking duplicons to generate chromosome aberrations such as deletions, duplications, inversions, and translocations. The resulting imbalance of the dosage of developmentally important genes arises at a frequency of 10⁻³ (higher than the mutation rate of individual genes) and leads to syndromes with multiple manifestations, including cognitive defects. Genomic DNA fragments were cloned from the Drosophila melanogaster agnostic locus, whose mutations impair learning ability and memory. As a result, the locus was exactly localized in X-chromosome region 11A containing the LIM kinase 1 (LIMK1) gene (CG1848), which is conserved among many species. Hemizygosity for the LIMK1 gene, which is caused by recombination at neighboring extended repeats, underlies cognitive disorders in human Williams syndrome. LIMK1 is a component of the integrin signaling cascade, which regulates the functions of the actin cytoskeleton, synaptogenesis, and morphogenesis in the developing brain. Immunofluorescence analysis revealed LIMK1 in all subdomains of the central complex and the visual system of Drosophila melanogaster. As in the human genome, the D. melanogaster region is flanked by numerous repeats, which were detected by molecular genetic methods and analysis of ectopic chromosome pairing. The repeats determined a higher rate of spontaneous and induced recombination, including unequal crossing over, in the agnostic gene region. Hence, the agnostic locus was considered as the first D. melanogaster model suitable for studying the genetic defect associated with Williams syndrome in humans.
Breaks in the 45S rDNA Lead to Recombination-Mediated Loss of Repeats.

Science.gov (United States)

Warmerdam, Daniël O; van den Berg, Jeroen; Medema, René H

2016-03-22

rDNA repeats constitute the most heavily transcribed region in the human genome. Tumors frequently display elevated levels of recombination in rDNA, indicating that the repeats are a liability to the genomic integrity of a cell. However, little is known about how cells deal with DNA double-stranded breaks in rDNA. Using selective endonucleases, we show that human cells are highly sensitive to breaks in 45S but not the 5S rDNA repeats. We find that homologous recombination inhibits repair of breaks in 45S rDNA, and this results in repeat loss. We identify the structural maintenance of chromosomes protein 5 (SMC5) as contributing to recombination-mediated repair of rDNA breaks. Together, our data demonstrate that SMC5-mediated recombination can lead to error-prone repair of 45S rDNA repeats, resulting in their loss and thereby reducing cellular viability. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.
Flank wears Simulation by using back propagation neural network when cutting hardened H-13 steel in CNC End Milling

Science.gov (United States)

Hazza, Muataz Hazza F. Al; Adesta, Erry Y. T.; Riza, Muhammad

2013-12-01

High speed milling has many advantages such as higher removal rate and high productivity. However, higher cutting speed increases the flank wear rate and thus reduces the cutting tool life. Therefore, estimating and predicting the flank wear length in early stages reduces the risk of unacceptable tooling cost. This research presents a neural network model for predicting and simulating the flank wear in the CNC end milling process. A set of sparse experimental data for finish end milling on AISI H13 at a hardness of 48 HRC was collected to measure the flank wear length. The measured data were then used to train the developed neural network model. An artificial neural network (ANN) was applied to predict the flank wear length. The neural network contains twenty hidden layers with a feed-forward back-propagation hierarchy. The neural network has been designed with MATLAB Neural Network Toolbox. The results show a high correlation between the predicted and the observed flank wear, which indicates the validity of the models.
Flank wears Simulation by using back propagation neural network when cutting hardened H-13 steel in CNC End Milling

International Nuclear Information System (INIS)

Al Hazza, Muataz Hazza F; Adesta, Erry Y T; Riza, Muhammad

2013-01-01

High speed milling has many advantages such as higher removal rate and high productivity. However, higher cutting speed increases the flank wear rate and thus reduces the cutting tool life. Therefore, estimating and predicting the flank wear length in early stages reduces the risk of unacceptable tooling cost. This research presents a neural network model for predicting and simulating the flank wear in the CNC end milling process. A set of sparse experimental data for finish end milling on AISI H13 at a hardness of 48 HRC was collected to measure the flank wear length. The measured data were then used to train the developed neural network model. An artificial neural network (ANN) was applied to predict the flank wear length. The neural network contains twenty hidden layers with a feed-forward back-propagation hierarchy. The neural network has been designed with MATLAB Neural Network Toolbox. The results show a high correlation between the predicted and the observed flank wear, which indicates the validity of the models.
Structural analyses of the Ankyrin Repeat Domain of TRPV6 and related TRPV ion channels

OpenAIRE

Phelps, Christopher B.; Huang, Robert J.; Lishko, Polina V.; Wang, Ruiqi R.; Gaudet, Rachelle

2008-01-01

Transient Receptor Potential (TRP) proteins are cation channels composed of a transmembrane domain flanked by large N- and C-terminal cytoplasmic domains. All members of the vanilloid family of TRP channels (TRPV) possess an N-terminal ankyrin repeat domain (ARD). The ARD of mammalian TRPV6, an important regulator of calcium uptake and homeostasis, is essential for channel assembly and regulation. The 1.7 Å crystal structure of the TRPV6-ARD reveals conserved structural elements unique to the...
Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing

Directory of Open Access Journals (Sweden)

Weitemier Kevin

2011-05-01

Full Text Available Background: Milkweeds (Asclepias L.) have been extensively investigated in diverse areas of evolutionary biology and ecology; however, there are few genetic resources available to facilitate and complement these studies. This study explored how low coverage genome sequencing of the common milkweed (Asclepias syriaca L.) could be useful in characterizing the genome of a plant without prior genomic information and for development of genomic resources as a step toward further developing A. syriaca as a model in ecology and evolution. Results: A 0.5× genome of A. syriaca was produced using Illumina sequencing. A virtually complete chloroplast genome of 158,598 bp was assembled, revealing few repeats and loss of three genes: accD, clpP, and ycf1. A nearly complete rDNA cistron (18S-5.8S-26S; 7,541 bp) and 5S rDNA (120 bp) sequence were obtained. Assessment of polymorphism revealed that the rDNA cistron and 5S rDNA had 0.3% and 26.7% polymorphic sites, respectively. A partial mitochondrial genome sequence (130,764 bp), with identical gene content to tobacco, was also assembled. An initial characterization of repeat content indicated that Ty1/copia-like retroelements are the most common repeat type in the milkweed genome. At least one A. syriaca microread hit 88% of Catharanthus roseus (Apocynaceae) unigenes (median coverage of 0.29×) and 66% of single copy orthologs (COSII) in asterids (median coverage of 0.14×). From this partial characterization of the A. syriaca genome, markers for population genetics (microsatellites) and phylogenetics (low-copy nuclear genes) studies were developed. Conclusions: The results highlight the promise of next generation sequencing for development of genomic resources for any organism. Low coverage genome sequencing allows characterization of the high copy fraction of the genome and exploration of the low copy fraction of the genome, which facilitate the development of molecular tools for further study of a target species.
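As a rough illustration of what a 0.5× genome implies, the sketch below assumes reads land uniformly at random along the genome (a Poisson, Lander-Waterman style idealization that the abstract itself does not state). Under that assumption the expected fraction of bases covered by at least one read at coverage c is 1 − e^(−c). The function name is hypothetical.

```python
import math

def expected_fraction_covered(coverage: float) -> float:
    """Expected fraction of bases hit by at least one read.

    Assumes read starts follow a Poisson process (idealized model,
    not a claim made in the abstract above).
    """
    return 1.0 - math.exp(-coverage)

# 0.5x coverage, as reported for the A. syriaca data set
print(f"{expected_fraction_covered(0.5):.3f}")  # 0.393
```

Under this idealization most single-copy sequence is missed at 0.5×, while high-copy fractions (plastome, rDNA, abundant repeats) are sampled many times over, which is consistent with the kinds of targets the abstract reports assembling.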
Plasma Transport at the Magnetospheric Flank Boundary. Final report

International Nuclear Information System (INIS)

Otto, Antonius

2012-01-01

Progress is highlighted in these areas: 1. Model of magnetic reconnection induced by three-dimensional Kelvin Helmholtz (KH) modes at the magnetospheric flank boundary; 2. Quantitative evaluation of mass transport from the magnetosheath onto closed geomagnetic field for northward IMF; 3. Comparison of mass transfer by cusp reconnection and Flank Kelvin Helmholtz modes; 4. Entropy constraint and plasma transport in the magnetotail - a new mechanism for current sheet thinning; 5. Test particle model for mass transport onto closed geomagnetic field for northward IMF; 6. Influence of density asymmetry and magnetic shear on (a) the linear and nonlinear growth of 3D Kelvin Helmholtz (KH) modes, and (b) three-dimensional KH mediated mass transport; 7. Examination of entropy and plasma transport in the magnetotail; 8. Entropy change and plasma transport by KH mediated reconnection - mixing and heating of plasma; 9. Entropy and plasma transport in the magnetotail - tail reconnection; and, 10. Wave coupling at the magnetospheric boundary and generation of kinetic Alfven waves
Force Modelling in Orthogonal Cutting Considering Flank Wear Effect

Science.gov (United States)

Rathod, Kanti Bhikhubhai; Lalwani, Devdas I.

2017-05-01

In the present work, an attempt has been made to provide a predictive cutting force model for orthogonal cutting by combining two different force models: a force model for a perfectly sharp tool, extended to include the effect of edge radius, and a force model for a worn tool. The first force model is based on Oxley's predictive machining theory for orthogonal cutting; because Oxley's model assumes a perfectly sharp tool, the effect of the cutting edge radius (hone radius) is added and an improved model is presented. The second force model, for a worn tool (flank wear), was proposed by Waldorf. Further, the developed combined force model is also used to predict flank wear width using an inverse approach. The performance of the developed combined total force model is compared with previously published results for AISI 1045 and AISI 4142 materials, and reasonably good agreement is found.
Variable presence of the inverted repeat and plastome stability in Erodium.

Science.gov (United States)

Blazier, John C; Jansen, Robert K; Mower, Jeffrey P; Govindu, Madhu; Zhang, Jin; Weng, Mao-Lun; Ruhlman, Tracey A

2016-06-01

Several unrelated lineages such as plastids, viruses and plasmids, have converged on quadripartite genomes of similar size with large and small single copy regions and a large inverted repeat (IR). Except for Erodium (Geraniaceae), saguaro cactus and some legumes, the plastomes of all photosynthetic angiosperms display this structure. The functional significance of the IR is not understood and Erodium provides a system to examine the role of the IR in the long-term stability of these genomes. We compared the degree of genomic rearrangement in plastomes of Erodium that differ in the presence and absence of the IR. We sequenced 17 new Erodium plastomes. Using 454, Illumina, PacBio and Sanger sequences, 16 genomes were assembled and categorized along with one incomplete and two previously published Erodium plastomes. We conducted phylogenetic analyses among these species using a dataset of 19 protein-coding genes and determined if significantly higher evolutionary rates had caused the long branch seen previously in phylogenetic reconstructions within the genus. Bioinformatic comparisons were also performed to evaluate plastome evolution across the genus. Erodium plastomes fell into four types (Type 1-4) that differ in their substitution rates, short dispersed repeat content and degree of genomic rearrangement, gene and intron content and GC content. Type 4 plastomes had significantly higher rates of synonymous substitutions (dS) for all genes and for 14 of the 19 genes non-synonymous substitutions (dN) were significantly accelerated. We evaluated the evidence for a single IR loss in Erodium and in doing so discovered that Type 4 plastomes contain a novel IR. The presence or absence of the IR does not affect plastome stability in Erodium. Rather, the overall repeat content shows a negative correlation with genome stability, a pattern in agreement with other angiosperm groups and recent findings on genome stability in bacterial endosymbionts.
The association between maternal hydronephrosis and acute flank pain during pregnancy: a prospective pilot-study.

Science.gov (United States)

Farr, Alex; Ott, Johannes; Kueronya, Verena; Margreiter, Markus; Javadli, Elchin; Einig, Sabrina; Husslein, Peter W; Bancher-Todesca, Dagmar

2017-10-01

Maternal hydronephrosis may cause flank pain during pregnancy. We aimed to investigate the association between maternal hydronephrosis and flank pain intensity. From 2014 to 2015, all consecutive women with singleton pregnancies, who presented at our tertiary center due to acute flank pain, were prospectively evaluated by renal ultrasonography and pain questionnaires. A visual analogue scale was used to assess pain intensity. The study had 90% power to detect a significant correlation between hydronephrosis and flank pain (Spearman's test). A total of 51 consecutive women with left-sided (13.7%), right-sided (64.7%) or bilateral (21.6%) pain were enrolled. The mean gestational age of these women, who presented due to their pain, was 27.5 ± 6.8 weeks at the time of consultation. The mean VAS score was 7.6 ± 2.2. In 43/51 (84.3%) women, hydronephrosis was found on renal sonograms. No correlation was found between the grade of hydronephrosis and pain intensity (p = 0.466; r= -0.28). Women delivered at a mean gestational age of 38.1 ± 2.4 weeks and their infants had a mean birthweight of 3138 ± 677 g. Hydronephrosis is a common finding among pregnant women with acute flank pain. The grade of hydronephrosis does not affect pain intensity. This study suggests normal pregnancy outcomes in these women.
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Science.gov (United States)

Gill, Navdeep; Buti, Matteo; Kane, Nolan; Bellec, Arnaud; Helmstetter, Nicolas; Berges, Hélène; Rieseberg, Loren H.

2014-01-01

Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence. PMID:24833511
Sequence-Based Analysis of Structural Organization and Composition of the Cultivated Sunflower (Helianthus annuus L.) Genome

Directory of Open Access Journals (Sweden)

Navdeep Gill

2014-04-01

Full Text Available Sunflower is an important oilseed crop, as well as a model system for evolutionary studies, but its 3.6 gigabase genome has proven difficult to assemble, in part because of the high repeat content of its genome. Here we report on the sequencing, assembly, and analyses of 96 randomly chosen BACs from sunflower to provide additional information on the repeat content of the sunflower genome, assess how repetitive elements in the sunflower genome are organized relative to genes, and compare the genomic distribution of these repeats to that found in other food crops and model species. We also examine the expression of transposable element-related transcripts in EST databases for sunflower to determine the representation of repeats in the transcriptome and to measure their transcriptional activity. Our data confirm previous reports in suggesting that the sunflower genome is >78% repetitive. Sunflower repeats share very little similarity to other plant repeats such as those of Arabidopsis, rice, maize and wheat; overall 28% of repeats are “novel” to sunflower. The repetitive sequences appear to be randomly distributed within the sequenced BACs. Assuming the 96 BACs are representative of the genome as a whole, then approximately 5.2% of the sunflower genome comprises non TE-related genic sequence, with an average gene density of 18kbp/gene. Expression levels of these transposable elements indicate tissue specificity and differential expression in vegetative and reproductive tissues, suggesting that expressed TEs might contribute to sunflower development. The assembled BACs will also be useful for assessing the quality of several different draft assemblies of the sunflower genome and for annotating the reference sequence.
Repeat-Associated Plasticity in the Helicobacter pylori RD Gene Family

Science.gov (United States)

Shak, Joshua R.; Dick, Jonathan J.; Meinersmann, Richard J.; Perez-Perez, Guillermo I.; Blaser, Martin J.

2009-01-01

The bacterium Helicobacter pylori is remarkable for its ability to persist in the human stomach for decades without provoking sterilizing immunity. Since repetitive DNA can facilitate adaptive genomic flexibility via increased recombination, insertion, and deletion, we searched the genomes of two H. pylori strains for nucleotide repeats. We discovered a family of genes with extensive repetitive DNA that we have termed the H. pylori RD gene family. Each gene of this family is composed of a conserved 3′ region, a variable mid-region encoding 7 and 11 amino acid repeats, and a 5′ region containing one of two possible alleles. Analysis of five complete genome sequences and PCR genotyping of 42 H. pylori strains revealed extensive variation between strains in the number, location, and arrangement of RD genes. Furthermore, examination of multiple strains isolated from a single subject's stomach revealed intrahost variation in repeat number and composition. Despite prior evidence that the protein products of this gene family are expressed at the bacterial cell surface, enzyme-linked immunosorbent assay and immunoblot studies revealed no consistent seroreactivity to a recombinant RD protein by H. pylori-positive hosts. The pattern of repeats uncovered in the RD gene family appears to reflect slipped-strand mispairing or domain duplication, allowing for redundancy and subsequent diversity in genotype and phenotype. This novel family of hypervariable genes with conserved, repetitive, and allelic domains may represent an important locus for understanding H. pylori persistence in its natural host. PMID:19749042
Hazard Potential of Volcanic Flank Collapses Raised by New Megatsunami Evidence

Science.gov (United States)

Ramalho, R. S.; Winckler, G.; Madeira, J.; Helffrich, G. R.; Hipólito, A.; Quartau, R.; Adena, K.; Schaefer, J. M.

2015-12-01

Large-scale gravitational flank collapses of steep volcanic islands are hypothetically capable of triggering megatsunamis with highly catastrophic effects. Yet evidence for the existence and impact of collapse-triggered megatsunamis and their run-up heights remains scarce and/or is highly contentious. Therefore a considerable debate still exists over the potential magnitude of collapse-triggered tsunamis and their inherent hazard. In particular, doubts still remain whether or not large-scale flank failures typically generate enough volume flux to result in megatsunamis, or alternatively operate by slow-moving or multiple smaller episodic failures with much lower tsunamigenic potential. Here we show that one of the tallest and most active oceanic volcanoes on Earth - Fogo, in the Cape Verde Islands - collapsed catastrophically and triggered a megatsunami with devastating near-field effects ~73,000 years ago. Our deductions are based on the recent discovery and cosmogenic 3He dating of tsunamigenic deposits - comprising fields of stranded megaclasts, chaotic conglomerates, and sand sheets - found on the adjacent Santiago Island, which attest to the impact of this megatsunami and document wave run-up heights exceeding 270 m. The evidence reported here implies that Fogo's flank failure involved at least one sudden and voluminous event that resulted in a megatsunami, in contrast to what has been suggested before. Our work thus provides another line of evidence that large-scale flank failures at steep volcanic islands may indeed happen catastrophically and are capable of triggering tsunamis of enormous height and energy. This new line of evidence therefore reinforces the hazard potential of volcanic island collapses and stands as a warning that such hazard should not be underestimated, particularly in areas where volcanic island edifices are close to other islands or to highly populated continental margins.
Genome editing for crop improvement: Challenges and opportunities.

Science.gov (United States)

Abdallah, Naglaa A; Prakash, Channapatna S; McHughen, Alan G

2015-01-01

Genome or gene editing includes several new techniques that help scientists precisely modify genome sequences. The techniques also enable us to alter the regulation of gene expression patterns in a pre-determined region and facilitate novel insights into the functional genomics of an organism. The emergence of genome editing has brought considerable excitement, especially among agricultural scientists, because of its simplicity, precision and power, as it offers new opportunities to develop improved crop varieties with clear-cut addition of valuable traits or removal of undesirable traits. Research is underway to improve crop varieties with higher yields, strengthened stress tolerance, disease and pest resistance, decreased input costs, and increased nutritional value. Genome editing encompasses a wide variety of tools using either a site-specific recombinase (SSR) or a site-specific nuclease (SSN) system. Both systems require recognition of a known sequence. The SSN system generates single or double strand DNA breaks and activates endogenous DNA repair pathways. SSR technologies, such as the Cre/loxP and Flp/FRT mediated systems, are able to knock down or knock in genes in the genome of eukaryotes, depending on the orientation of the specific sites (loxP, FRT, etc.) flanking the target site. There are 4 main classes of SSN developed to cleave genomic sequences: mega-nucleases (homing endonucleases), zinc finger nucleases (ZFNs), transcriptional activator-like effector nucleases (TALENs), and the CRISPR/Cas nuclease system (clustered regularly interspaced short palindromic repeat/CRISPR-associated protein). Recombinase-mediated genome engineering depends on the recombinase (sub-)family and target site and induces high frequencies of homologous recombination. Improving crops with gene editing provides a range of options: altering only a few nucleotides from the billions found in the genomes of living cells, altering the full allele, or inserting a new gene in a targeted region of
The diversity and evolution of Wolbachia ankyrin repeat domain genes.

Directory of Open Access Journals (Sweden)

Stefanos Siozios

Full Text Available Ankyrin repeat domain-encoding genes are common in the eukaryotic and viral domains of life, but they are rare in bacteria, the exception being a few obligate or facultative intracellular Proteobacteria species. Despite having a reduced genome, the arthropod strains of the alphaproteobacterium Wolbachia contain an unusually high number of ankyrin repeat domain-encoding genes, ranging from 23 in the wMel strain to 60 in the wPip strain. This group of genes has attracted considerable attention for their astonishingly large number as well as for the fact that ankyrin proteins are known to participate in protein-protein interactions, suggesting that they play a critical role in the molecular mechanism that determines host-Wolbachia symbiotic interactions. We present a comparative evolutionary analysis of the wMel-related ankyrin repeat domain-encoding genes present in different Drosophila-Wolbachia associations. Our results show that the ankyrin repeat domain-encoding genes change in size by expansion and contraction mediated by short directly repeated sequences. We provide examples of intra-genic recombination events and show that these genes are likely to be horizontally transferred between strains with the aid of bacteriophages. These results confirm previous findings that the Wolbachia genomes are evolutionary mosaics and illustrate the potential that these bacteria have to generate diversity in proteins potentially involved in the symbiotic interactions.
Mononucleotide repeats are asymmetrically distributed in fungal genes

NARCIS (Netherlands)

Passel, van M.W.J.; Graaff, de L.H.

2008-01-01

ABSTRACT: BACKGROUND: Systematic analyses of sequence features have resulted in a better characterisation of the organisation of the genome. A previous study in prokaryotes on the distribution of sequence repeats, which are notoriously variable and can disrupt the reading frame in genes, showed that

The mitochondrial genomes of the ciliates Euplotes minuta and Euplotes crassus

Directory of Open Access Journals (Sweden)

Huynh Minh

2009-11-01

Full Text Available Background: There are thousands of very diverse ciliate species, of which only a handful of mitochondrial genomes have been studied so far. These genomes are rather similar because the ciliates analysed (Tetrahymena spp. and Paramecium aurelia) are closely related. Here we study the mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus. These ciliates are only distantly related to Tetrahymena spp. and Paramecium aurelia, but more closely related to Nyctotherus ovalis, which possesses a hydrogenosomal (mitochondrial) genome. Results: The linear mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus were sequenced and compared with the mitochondrial genomes of several Tetrahymena species, Paramecium aurelia and the partially sequenced mitochondrial genome of the anaerobic ciliate Nyctotherus ovalis. This study reports new features such as long 5' gene extensions of several mitochondrial genes, extremely long cox1 and cox2 open reading frames and a large repeat in the middle of the linear mitochondrial genome. The repeat separates the open reading frames into two blocks, each having a single direction of transcription, from the repeat towards the ends of the chromosome. Although the Euplotes mitochondrial gene content is almost identical to that of Paramecium and Tetrahymena, the order of the genes is completely different. In contrast, the 33273 bp (excluding the repeat region) piece of the mitochondrial genome that has been sequenced in both Euplotes species exhibits no difference in gene order. Unexpectedly, many of the mitochondrial genes of E. minuta encoding ribosomal proteins possess N-terminal extensions that are similar to mitochondrial targeting signals. Conclusion: The mitochondrial genomes of the hypotrichous ciliates Euplotes minuta and Euplotes crassus are rather different from the previously studied genomes. Many genes are extended in size compared to mitochondrial
Biased distribution of DNA uptake sequences towards genome maintenance genes

DEFF Research Database (Denmark)

Davidsen, T.; Rodland, E.A.; Lagesen, K.

2004-01-01

Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...... in these organisms. Pasteurella multocida also displayed high frequencies of a putative DUS identical to that previously identified in H. influenzae and with a skewed distribution towards genome maintenance genes, indicating that this bacterium might be transformation competent under certain conditions....
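The search for the most frequent 9-10mers described in the abstract above can be sketched as a plain k-mer tally. This is a minimal illustration, not the authors' pipeline: the function names are hypothetical, only one strand is counted, and the toy sequence with its planted 10-mer is invented for the example.

```python
from collections import Counter

def kmer_counts(seq: str, k: int) -> Counter:
    """Count every overlapping k-mer on the given strand."""
    seq = seq.upper()
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))

def most_frequent_kmers(seq: str, k: int, top: int = 3):
    """Return the `top` most common k-mers with their counts."""
    return kmer_counts(seq, k).most_common(top)

# Toy input: one 10-mer planted three times between unrelated spacers
planted = "GCCGTCTGAA"
toy = planted + "TTACG" + planted + "CATGA" + planted
print(most_frequent_kmers(toy, 10, top=1))  # [('GCCGTCTGAA', 3)]
```

A real analysis would also count the reverse complement and compare observed frequencies against a background expectation before calling a k-mer over-represented.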
Billions of basepairs of recently expanded, repetitive sequences are eliminated from the somatic genome during copepod development.

Science.gov (United States)

Sun, Cheng; Wyngaard, Grace; Walton, D Brian; Wichman, Holly A; Mueller, Rachel Lockridge

2014-03-11

Chromatin diminution is the programmed deletion of DNA from presomatic cell or nuclear lineages during development, producing single organisms that contain two different nuclear genomes. Phylogenetically diverse taxa undergo chromatin diminution--some ciliates, nematodes, copepods, and vertebrates. In cyclopoid copepods, chromatin diminution occurs in taxa with massively expanded germline genomes; depending on species, germline genome sizes range from 15 - 75 Gb, 12-74 Gb of which are lost from pre-somatic cell lineages at germline--soma differentiation. This is more than an order of magnitude more sequence than is lost from other taxa. To date, the sequences excised from copepods have not been analyzed using large-scale genomic datasets, and the processes underlying germline genomic gigantism in this clade, as well as the functional significance of chromatin diminution, have remained unknown. Here, we used high-throughput genomic sequencing and qPCR to characterize the germline and somatic genomes of Mesocyclops edax, a freshwater cyclopoid copepod with a germline genome of ~15 Gb and a somatic genome of ~3 Gb. We show that most of the excised DNA consists of repetitive sequences that are either 1) verifiable transposable elements (TEs), or 2) non-simple repeats of likely TE origin. Repeat elements in both genomes are skewed towards younger (i.e. less divergent) elements. Excised DNA is a non-random sample of the germline repeat element landscape; younger elements, and high frequency DNA transposons and LINEs, are disproportionately eliminated from the somatic genome. Our results suggest that germline genome expansion in M. edax reflects explosive repeat element proliferation, and that billions of base pairs of such repeats are deleted from the somatic genome every generation. Thus, we hypothesize that chromatin diminution is a mechanism that controls repeat element load, and that this load can evolve to be divergent between tissue types within single organisms.
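The scale of elimination reported for M. edax follows directly from the two approximate genome sizes quoted in the abstract above (~15 Gb germline, ~3 Gb somatic). A small sketch of that arithmetic, with a hypothetical helper name:

```python
def eliminated(germline_gb: float, somatic_gb: float):
    """Return (Gb removed, fraction of the germline genome removed)."""
    lost = germline_gb - somatic_gb
    return lost, lost / germline_gb

# Approximate values from the abstract: ~15 Gb germline, ~3 Gb somatic
lost_gb, fraction = eliminated(15, 3)
print(lost_gb, f"{fraction:.0%}")  # 12 80%
```

Because both inputs are approximate, the result should be read as roughly 12 Gb, or about four fifths of the germline genome.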
Conservation of Repeats at the Mammalian KCNQ1OT1-CDKN1C Region Suggests a Role in Genomic Imprinting

Directory of Open Access Journals (Sweden)

Marcos De Donato

2017-06-01

Full Text Available KCNQ1OT1 is located in the region with the highest number of genes showing genomic imprinting, but the mechanisms controlling the genes under its influence have not been fully elucidated. Therefore, we conducted a comparative analysis of the KCNQ1/KCNQ1OT1-CDKN1C region to study its conservation across the best assembled eutherian mammalian genomes sequenced to date and analyzed potential elements that may be implicated in the control of genomic imprinting in this region. The genomic features in these regions from human, mouse, cattle, and dog show a higher number of genes and CpG islands (detected using cpgplot from EMBOSS), but a lower number of repetitive elements (including short interspersed nuclear elements and long interspersed nuclear elements), compared with their whole chromosomes (detected by RepeatMasker). The KCNQ1OT1-CDKN1C region contains the highest number of conserved noncoding sequences (CNS) among mammals, where we found 16 regions containing about 38 different highly conserved repetitive elements (using mVista), such as LINE1 elements: L1M4, L1MB7, HAL1, L1M4a, L1Med, and an LTR element: MLT1H. From these elements, we found 74 CNS showing high sequence identity (>70%) between human, cattle, and mouse, from which we identified 13 motifs (using Multiple Em for Motif Elicitation/Motif Alignment and Search Tool) with a significant probability of occurrence, 3 of which were the most frequent and were used to find transcription factor–binding sites. We detected several transcription factors (using JASPAR suite) from the families SOX, FOX, and GATA. A phylogenetic analysis of these CNS from human, marmoset, mouse, rat, cattle, dog, horse, and elephant shows branches with high levels of support and very similar phylogenetic relationships among these groups, confirming previous reports. Our results suggest that functional DNA elements identified by comparative genomics in a region densely populated with imprinted mammalian genes may be
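The abstract above names cpgplot for CpG island detection but gives no parameters. As a hedged sketch of the two quantities such detectors typically score, the function below computes the GC fraction and the observed/expected CpG ratio, (number of CpG × length) / (number of C × number of G), for a single sequence. The function name and test sequences are hypothetical, and no windowing or thresholds are applied.

```python
def cpg_stats(seq: str):
    """Return (GC fraction, observed/expected CpG ratio) for one sequence."""
    seq = seq.upper()
    n = len(seq)
    c, g = seq.count("C"), seq.count("G")
    cpg = seq.count("CG")  # "CG" cannot overlap itself, so count() is exact
    gc_fraction = (c + g) / n if n else 0.0
    obs_exp = (cpg * n) / (c * g) if c and g else 0.0
    return gc_fraction, obs_exp

print(cpg_stats("CCGGAATT"))  # (0.5, 2.0)
```

An island caller would slide a window along the sequence and report stretches where both values stay above chosen cutoffs; those cutoffs are a tool setting and are not stated in the abstract.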
Complete plastid genome sequencing of Trochodendraceae reveals a significant expansion of the inverted repeat and suggests a Paleogene divergence between the two extant species.

Directory of Open Access Journals (Sweden)

Yan-xia Sun

Full Text Available The early-diverging eudicot order Trochodendrales contains only two monospecific genera, Tetracentron and Trochodendron. Although an extensive fossil record indicates that the clade is perhaps 100 million years old and was widespread throughout the Northern Hemisphere during the Paleogene and Neogene, the two extant genera are both narrowly distributed in eastern Asia. Recent phylogenetic analyses strongly support a clade of Trochodendrales, Buxales, and Gunneridae (core eudicots), but complete plastome analyses do not resolve the relationships among these groups with strong support. However, plastid phylogenomic analyses have not included data for Tetracentron. To better resolve basal eudicot relationships and to clarify when the two extant genera of Trochodendrales diverged, we sequenced the complete plastid genome of Tetracentron sinense using Illumina technology. The Tetracentron and Trochodendron plastomes possess the typical gene content and arrangement that characterize most angiosperm plastid genomes, but both genomes have the same unusual ∼4 kb expansion of the inverted repeat region to include five genes (rpl22, rps3, rpl16, rpl14, and rps8) that are normally found in the large single-copy region. Maximum likelihood analyses of an 83-gene, 88 taxon angiosperm data set yield a tree topology identical to that of previous plastid-based trees, and moderately support the sister relationship between Buxaceae and Gunneridae. Molecular dating analyses suggest that Tetracentron and Trochodendron diverged between 44 and 30 million years ago, which is congruent with the fossil record of Trochodendrales and with previous estimates of the divergence time of these two taxa. We also characterize 154 simple sequence repeat loci from the Tetracentron sinense and Trochodendron aralioides plastomes that will be useful in future studies of population genetic structure for these relict species, both of which are of conservation concern.
Trade-off between Transcriptome Plasticity and Genome Evolution in Cephalopods.

Science.gov (United States)

Liscovitch-Brauer, Noa; Alon, Shahar; Porath, Hagit T; Elstein, Boaz; Unger, Ron; Ziv, Tamar; Admon, Arie; Levanon, Erez Y; Rosenthal, Joshua J C; Eisenberg, Eli

2017-04-06

RNA editing, a post-transcriptional process, allows the diversification of proteomes beyond the genomic blueprint; however it is infrequently used among animals for this purpose. Recent reports suggesting increased levels of RNA editing in squids thus raise the question of the nature and effects of these events. We here show that RNA editing is particularly common in behaviorally sophisticated coleoid cephalopods, with tens of thousands of evolutionarily conserved sites. Editing is enriched in the nervous system, affecting molecules pertinent for excitability and neuronal morphology. The genomic sequence flanking editing sites is highly conserved, suggesting that the process confers a selective advantage. Due to the large number of sites, the surrounding conservation greatly reduces the number of mutations and genomic polymorphisms in protein-coding regions. This trade-off between genome evolution and transcriptome plasticity highlights the importance of RNA recoding as a strategy for diversifying proteins, particularly those associated with neural function.
The complete chloroplast genome sequence of Abies nephrolepis (Pinaceae: Abietoideae)

Directory of Open Access Journals (Sweden)

Dong-Keun Yi

2016-06-01

Full Text Available The plant chloroplast (cp) genome has maintained a relatively conserved structure and gene content throughout evolution. Cp genome sequences have been used widely for resolving evolutionary and phylogenetic issues at various taxonomic levels of plants. Here, we report the complete cp genome of Abies nephrolepis. The A. nephrolepis cp genome is 121,336 base pairs (bp) in length, including a pair of short inverted repeat regions (IRa and IRb) of 139 bp each, separated by a small single copy (SSC) region of 54,323 bp and a large single copy (LSC) region of 66,735 bp. It contains 114 genes: 68 protein coding genes, 35 tRNA genes, four rRNA genes, six open reading frames, and one pseudogene. Seventeen repeat units and 64 simple sequence repeats (SSR) have been detected in the A. nephrolepis cp genome. Large IR sequences (1186 bp) are located at the 42-kb inversion points. The A. nephrolepis cp genome is identical to that of Abies koreana, a closely related taxon. Pairwise comparison between the two cp genomes revealed 140 polymorphic sites in each. The complete cp genome sequence of A. nephrolepis has significant potential to provide information on the evolutionary pattern of Abietoideae and valuable data for the development of DNA markers for easy identification and classification.
A Genome-Wide Survey of the Microsatellite Content of the Globe Artichoke Genome and the Development of a Web-Based Database

Science.gov (United States)

Portis, Ezio; Portis, Flavio; Valente, Luisa; Moglia, Andrea; Barchi, Lorenzo; Lanteri, Sergio; Acquadro, Alberto

2016-01-01

The recently acquired genome sequence of globe artichoke (Cynara cardunculus var. scolymus) has been used to catalog the genome’s content of simple sequence repeat (SSR) markers. More than 177,000 perfect SSRs were revealed, equivalent to an overall density across the genome of 244.5 SSRs/Mbp, but some 224,000 imperfect SSRs were also identified. About 21% of these SSRs were complex (two stretches of repeats separated by artichoke accessions, as templates. PMID:27648830
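As a rough illustration of what cataloguing "perfect" SSRs involves, the sketch below scans a sequence for uninterrupted tandem repeats and converts a count into a per-Mbp density. It is not the pipeline used in the study: the function names, the motif-length range (1 to 6 bp) and the minimum copy number (5) are assumptions chosen only for the example.

```python
import re


def find_perfect_ssrs(seq, min_unit=1, max_unit=6, min_repeats=5):
    """Return sorted (start, motif, copies) tuples for perfect tandem repeats.

    Thresholds are illustrative assumptions, not the study's settings.
    """
    seq = seq.upper()
    hits = []
    for unit in range(min_unit, max_unit + 1):
        # A motif of length `unit` followed by at least min_repeats-1 exact copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, min_repeats - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit
            # (e.g. "AA" or "AGAG"); those are reported at the shorter length.
            if any(motif == motif[:k] * (unit // k)
                   for k in range(1, unit) if unit % k == 0):
                continue
            hits.append((m.start(), motif, len(m.group(0)) // unit))
    return sorted(hits)


def ssr_density_per_mbp(n_ssrs, genome_bp):
    """SSR count divided by the sequence length expressed in Mbp."""
    return n_ssrs / (genome_bp / 1e6)


print(find_perfect_ssrs("TTTAGAGAGAGAGAGCCC"))
```

Imperfect and complex SSRs, which the abstract also reports, would need a tolerance for mismatches or interruptions that this sketch does not attempt.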
The complete chloroplast genome sequence of Curcuma flaviflora (Curcuma).

Science.gov (United States)

Zhang, Yan; Deng, Jiabin; Li, Yangyi; Gao, Gang; Ding, Chunbang; Zhang, Li; Zhou, Yonghong; Yang, Ruiwu

2016-09-01

The complete chloroplast (cp) genome of Curcuma flaviflora, a medicinal plant in Southeast Asia, was sequenced. The genome was 160 478 bp in length, with 36.3% GC content. A pair of inverted repeats (IRs) of 26 946 bp each was separated by a large single copy (LSC) region of 88 008 bp and a small single copy (SSC) region of 18 578 bp. The cp genome contained 132 annotated genes, including 79 protein-coding genes, 30 tRNA genes, and four rRNA genes. Nineteen of these genes were duplicated in the inverted repeat regions.
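The region sizes quoted above can be cross-checked with simple arithmetic, since a quadripartite chloroplast genome consists of one LSC, one SSC and two IR copies. The helper below is a minimal sketch; its name is an assumption made for the example, and the numbers are taken directly from the abstract.

```python
def quadripartite_length(lsc, ssc, ir):
    """Total length of a quadripartite cp genome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Region sizes (bp) as reported for Curcuma flaviflora.
curcuma_total = quadripartite_length(lsc=88_008, ssc=18_578, ir=26_946)
print(curcuma_total)  # 160478, matching the reported genome size
```

The same check can be applied to any record in this listing that reports LSC, SSC and IR lengths alongside a total.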
Genome-wide characterization, evolution, and expression analysis of the leucine-rich repeat receptor-like protein kinase (LRR-RLK) gene family in Rosaceae genomes.

Science.gov (United States)

Sun, Jiangmei; Li, Leiting; Wang, Peng; Zhang, Shaoling; Wu, Juyou

2017-10-10

Leucine-rich repeat receptor-like protein kinase (LRR-RLK) is the largest gene family of receptor-like protein kinases (RLKs) and actively participates in regulating the growth, development, signal transduction, immunity, and stress responses of plants. However, the patterns of LRR-RLK gene family evolution in the five main Rosaceae species for which genome sequences are available have not yet been reported. In this study, we performed a comprehensive analysis of LRR-RLK genes for five Rosaceae species: Fragaria vesca (strawberry), Malus domestica (apple), Pyrus bretschneideri (Chinese white pear), Prunus mume (mei), and Prunus persica (peach), which contained 201, 244, 427, 267, and 258 LRR-RLK genes, respectively. All LRR-RLK genes were further grouped into 23 subfamilies based on the hidden Markov models approach. RLK-Pelle_LRR-XII-1, RLK-Pelle_LRR-XI-1, and RLK-Pelle_LRR-III were the three largest subfamilies. Synteny analysis indicated that there were 236 tandem duplicated genes in the five Rosaceae species, among which subfamilies XII-1 (82 genes) and XI-1 (80 genes) comprised 68.6%. Our results indicate that tandem duplication made a large contribution to the expansion of the subfamilies. The gene expression, tissue-specific expression, and subcellular localization data revealed that LRR-RLK genes were differentially expressed in various organs and tissues, and the largest subfamily XI-1 was highly expressed in all five Rosaceae species, suggesting that LRR-RLKs play important roles in each stage of plant growth and development. Taken together, our results provide an overview of the LRR-RLK family in Rosaceae genomes and the basis for further functional studies.
Complete genome sequence and comparative genomics of the probiotic yeast Saccharomyces boulardii.

Science.gov (United States)

Khatri, Indu; Tomar, Rajul; Ganesan, K; Prasad, G S; Subramanian, Srikrishna

2017-03-23

The probiotic yeast Saccharomyces boulardii (Sb) is known to be effective against many gastrointestinal disorders and antibiotic-associated diarrhea. To understand the molecular basis of the probiotic properties ascribed to Sb, we determined the complete genomes of two strains of Sb, i.e. Biocodex and unique28, and the draft genomes of three other Sb strains that are marketed as probiotics in India. We compared these genomes with 145 strains of S. cerevisiae (Sc) to understand genome-level similarities and differences between these yeasts. A feature distinguishing Sb from other Sc strains is the absence of the Ty elements Ty1, Ty3 and Ty4 and their associated LTRs. However, we could identify complete Ty2 and Ty5 elements in Sb. The genes for the hexose transporters HXT11 and HXT9 and for asparagine utilization are absent in all Sb strains. We find differences in repeat periods and copy numbers of repeats in flocculin genes that are likely related to the differential adhesion of Sb as compared to Sc. Core-proteome based taxonomy places Sb strains along with wine strains of Sc. We find the introgression of five genes from Z. bailii into chromosome IV of Sb and wine strains of Sc. Intriguingly, genes involved in conferring known probiotic properties to Sb are conserved in most Sc strains.
Tc7, a Tc1-hitch hiking transposon in Caenorhabditis elegans.

OpenAIRE

Rezsohazy, R; van Luenen, H G; Durbin, R M; Plasterk, R H

1997-01-01

We have found a novel transposon in the genome of Caenorhabditis elegans. Tc7 is a 921 bp element, made up of two 345 bp inverted repeats separated by a unique, internal sequence. Tc7 does not contain an open reading frame. The outer 38 bp of the inverted repeat show 36 matches with the outer 38 bp of Tc1. This region of Tc1 contains the Tc1-transposase binding site. Furthermore, Tc7 is flanked by TA dinucleotides, just like Tc1, which presumably correspond to the target duplication generated...
Autoradiographic localization of tritiated dihydrotestosterone in the flank organ of the albino hamster

International Nuclear Information System (INIS)

Lucky, A.W.; Eisenfeld, A.J.; Visintin, I.

1985-01-01

In the hamster flank organ, the growth of hair and growth of sebaceous glands are androgen-dependent functions. Although dihydrotestosterone (DHT) is known to be a potent stimulator of flank organ growth, there is no information about localization of DHT receptor sites in this organ. The purpose of this study was to use steroid autoradiography to localize DHT receptors in the hamster flank organ. Because steroid hormones are functional when translocated to nuclear receptors, nuclear localization by autoradiography defines receptor sites. In order to be able to visualize autoradiographic grains from radiolabeled androgens around hair follicles, albino hamsters were studied to avoid confusion between the grains and pigment granules which are abundant in the more common Golden Syrian hamster. Mature male hamsters castrated 24 hours earlier were given tritium-labeled dihydrotestosterone ([3H]DHT). Using the technique of thaw-mount steroid autoradiography, 4-micron unfixed frozen sections were mounted in the dark onto emulsion-coated glass slides and allowed to develop for 4-6 months. [3H]DHT was found to be concentrated over sebocyte nuclei. The label was present peripherally as well as in differentiating sebocytes. There was no nuclear localization of [3H]DHT in animals pretreated with excessive quantities of unlabeled DHT. Steroid metabolites of [3H]DHT were assessed by thin-layer chromatography in paired tissue samples. Most of the label remained with DHT. Uptake was inhibited in the flank organ of hamsters pretreated with unlabeled DHT.
Radon measurements in the SE and NE flank of Mt. Etna (Italy)

International Nuclear Information System (INIS)

La Delfa, S.; Imme, G.; Lo Nigro, S.; Morelli, D.; Patane, G.; Vizzini, F.

2007-01-01

Soil radon has been monitored at two fixed sites located on the northeastern and southeastern flanks of Mt. Etna. In this study we compare the in-soil radon concentration trend recorded on the SE flank with that recorded on the NE flank, where an in-soil radon detection system has been operating since 2001. The aim of this work was to extend the investigation area by finding a suitable radon detection site on the south-east flank of Mt. Etna, in order to better understand possible links between radon anomalies and volcano dynamics. Radon data collected at the NE and SE sites were compared with the volcanic tremor, the frequency of occurrence of earthquakes and the seismic strain release recorded at a fixed 3D digital seismic station placed at the NE site. The same general in-soil radon trends and anomalies were found at both sites. These results confirm the suitability of the chosen southeastern site for in-soil radon monitoring at Mt. Etna. The comparison of the recorded radon concentration anomalies with seismicity and volcanic tremor trends also supports a possible link with volcanic activity, as observed in our previously published studies.
Multi-stage volcanic island flank collapses with coeval explosive caldera-forming eruptions.

Science.gov (United States)

Hunt, James E; Cassidy, Michael; Talling, Peter J

2018-01-18

Volcanic flank collapses and explosive eruptions are among the largest and most destructive processes on Earth. Events at Mount St. Helens in May 1980 demonstrated how a relatively small (300 km³), but can also occur in complex multiple stages. Here, we show that multistage retrogressive landslides on Tenerife triggered explosive caldera-forming eruptions, including the Diego Hernandez, Guajara and Ucanca caldera eruptions. Geochemical analyses were performed on volcanic glasses recovered from marine sedimentary deposits, called turbidites, associated with each individual stage of each multistage landslide. These analyses indicate that only the lattermost stages of subaerial flank failure contain materials originating from the respective coeval explosive eruption, suggesting that the initial, more voluminous submarine stages of multi-stage flank collapse induce these explosive eruptions. Furthermore, there are extended time lags between the individual stages of multi-stage collapse, and thus an extended time lag between the initial submarine stages of failure and the onset of the subsequent explosive eruption. This time lag following landslide-generated static decompression has implications for the response of magmatic systems to un-roofing, and is significant for ocean island volcanism and civil emergency planning.
Mathematical description of tooth flank surface of globoidal worm gear with straight axial tooth profile

Science.gov (United States)

Połowniak, Piotr; Sobolak, Mariusz

2017-12-01

In this article, a mathematical description of tooth flank surface of the globoidal worm and worm wheel generated by the hourglass worm hob with straight tooth axial profile is presented. The kinematic system of globoidal worm gear is shown. The equation of globoid helix and tooth axial profile of worm is derived to determine worm tooth surface. Based on the equation of meshing the contact lines are obtained. The mathematical description of globoidal worm wheel tooth flank is performed on the basis of contact lines and generating the tooth side by the extreme cutting edge of worm hob. The presented mathematical model of tooth flank of TA worm and worm wheel can be used e.g. to analyse the contact pattern of the gear.
Exploration of the Drosophila buzzatii transposable element content suggests underestimation of repeats in Drosophila genomes.

Science.gov (United States)

Rius, Nuria; Guillén, Yolanda; Delprat, Alejandra; Kapusta, Aurélie; Feschotte, Cédric; Ruiz, Alfredo

2016-05-10

Many new Drosophila genomes have been sequenced in recent years using new-generation sequencing platforms and assembly methods. Transposable elements (TEs), being repetitive sequences, are often misassembled, especially in the genomes sequenced with short reads. Consequently, the mobile fraction of many of the new genomes has not been analyzed in detail or compared with that of other genomes sequenced with different methods, which could shed light on genome and TE evolution. Here we compare the TE content of three genomes: D. buzzatii st-1, j-19, and D. mojavensis. We have sequenced a new D. buzzatii genome (j-19) that complements the D. buzzatii reference genome (st-1) already published, and compared their TE contents with that of D. mojavensis. We found an underestimation of TE sequences in Drosophila genus NGS-genomes when compared to Sanger-genomes. To be able to compare genomes sequenced with different technologies, we developed a coverage-based method and applied it to the D. buzzatii st-1 and j-19 genomes. Between 10.85 and 11.16 % of the D. buzzatii st-1 genome is made up of TEs, between 7 and 7.5 % of the D. buzzatii j-19 genome, while TEs represent 15.35 % of the D. mojavensis genome. Helitrons are the most abundant order in the three genomes. TEs in D. buzzatii are less abundant than in D. mojavensis, as expected according to the positive correlation between genome size and TE content. However, TEs alone do not explain the genome size difference. TEs accumulate in the dot chromosomes and proximal regions of D. buzzatii and D. mojavensis chromosomes. We also report a significantly higher TE density in D. buzzatii and D. mojavensis X chromosomes, which is not expected under the current models. Our easy-to-use correction method allowed us to identify recently active families in D. buzzatii st-1 belonging to the LTR-retrotransposon superfamily Gypsy.
The mitochondrial and plastid genomes of Volvox carteri: bloated molecules rich in repetitive DNA

Directory of Open Access Journals (Sweden)

Lee Robert W

2009-03-01

Background: The magnitude of noncoding DNA in organelle genomes can vary significantly; it is argued that much of this variation is attributable to the dissemination of selfish DNA. The results of a previous study indicate that the mitochondrial DNA (mtDNA) of the green alga Volvox carteri abounds with palindromic repeats, which appear to be selfish elements. We became interested in the evolution and distribution of these repeats when, during a cursory exploration of the V. carteri nuclear DNA (nucDNA) and plastid DNA (ptDNA) sequences, we found palindromic repeats with similar structural features to those of the mtDNA. Upon this discovery, we decided to investigate the diversity and evolutionary implications of these palindromic elements by sequencing and characterizing large portions of mtDNA and ptDNA and then comparing these data to the V. carteri draft nuclear genome sequence. Results: We sequenced 30 and 420 kilobases (kb) of the mitochondrial and plastid genomes of V. carteri, respectively, resulting in partial assemblies of these genomes. The mitochondrial genome is the most bloated green-algal mtDNA observed to date: ~61% of the sequence is noncoding, most of which is composed of short palindromic repeats spread throughout the intergenic and intronic regions. The plastid genome is the largest (>420 kb) and most expanded (>80% noncoding) ptDNA sequence yet discovered, with a myriad of palindromic repeats in the noncoding regions, which have a similar size and secondary structure to those of the mtDNA. We found that 15 kb (~0.01%) of the nuclear genome are homologous to the palindromic elements of the mtDNA, and 50 kb (~0.05%) are homologous to those of the ptDNA. Conclusion: Selfish elements in the form of short palindromic repeats have propagated in the V. carteri mtDNA and ptDNA, resulting in the distension of these genomes. Copies of these same repeats are also found in a small fraction of the nucDNA, but appear to be inert in this
Predicting Tissue-Specific Enhancers in the Human Genome

Energy Technology Data Exchange (ETDEWEB)

Pennacchio, Len A.; Loots, Gabriela G.; Nobrega, Marcelo A.; Ovcharenko, Ivan

2006-07-01

Determining how transcriptional regulatory signals are encoded in vertebrate genomes is essential for understanding the origins of multi-cellular complexity; yet the genetic code of vertebrate gene regulation remains poorly understood. In an attempt to elucidate this code, we synergistically combined genome-wide gene expression profiling, vertebrate genome comparisons, and transcription factor binding site analysis to define sequence signatures characteristic of candidate tissue-specific enhancers in the human genome. We applied this strategy to microarray-based gene expression profiles from 79 human tissues and identified 7,187 candidate enhancers that defined their flanking gene expression, the majority of which were located outside of known promoters. We cross-validated this method for its ability to de novo predict tissue-specific gene expression and confirmed its reliability in 57 of the 79 available human tissues, with an average precision in enhancer recognition ranging from 32 percent to 63 percent, and a sensitivity of 47 percent. We used the sequence signatures identified by this approach to assign tissue-specific predictions to ~328,000 human-mouse conserved noncoding elements in the human genome. By overlapping these genome-wide predictions with a large in vivo dataset of enhancers validated in transgenic mice, we confirmed our results with a 28 percent sensitivity and 50 percent precision. These results indicate the power of combining complementary genomic datasets as an initial computational foray into the global view of tissue-specific gene regulation in vertebrates.
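For readers unfamiliar with the two figures of merit quoted above, the sketch below shows how precision and sensitivity are conventionally computed from counts of true positives, false positives and false negatives. The counts are hypothetical, chosen only so that the output reproduces the reported 50 percent precision and 28 percent sensitivity; they are not data from the study.

```python
def precision(tp, fp):
    """Fraction of predicted enhancers that are validated (TP / (TP + FP))."""
    return tp / (tp + fp)


def sensitivity(tp, fn):
    """Fraction of validated enhancers that were predicted (TP / (TP + FN))."""
    return tp / (tp + fn)


# Hypothetical counts for illustration only.
tp, fp, fn = 28, 28, 72
print(precision(tp, fp))    # 0.5
print(sensitivity(tp, fn))  # 0.28
```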
Separating metagenomic short reads into genomes via clustering

Directory of Open Access Journals (Sweden)

Tanaseichuk Olga

2012-09-01

Background: The metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS) technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers) to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads). Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels. Results: In this work, we present a two-phase heuristic algorithm for separating short paired-end reads from different genomes in a metagenomic dataset. We use the observation that most of the l-mers belong to unique genomes when l is sufficiently large. The first phase of the algorithm results in clusters of l-mers, each of which belongs to one genome. During the second phase, clusters are merged based on l-mer repeat information. These final clusters are used to assign reads. The algorithm can handle very short reads and sequencing errors. It is initially designed for genomes with similar abundance levels and then

Repetitive part of the banana (Musa acuminata) genome investigated by low-depth 454 sequencing.

Science.gov (United States)

Hribová, Eva; Neumann, Pavel; Matsumoto, Takashi; Roux, Nicolas; Macas, Jirí; Dolezel, Jaroslav

2010-09-16

Bananas and plantains (Musa spp.) are grown in more than a hundred tropical and subtropical countries and provide staple food for hundreds of millions of people. They are seed-sterile crops propagated clonally and this makes them vulnerable to a rapid spread of devastating diseases and at the same time hampers breeding improved cultivars. Although the socio-economic importance of bananas and plantains cannot be overestimated, they remain outside the focus of major research programs. This slows down the study of nuclear genome and the development of molecular tools to facilitate banana improvement. In this work, we report on the first thorough characterization of the repeat component of the banana (M. acuminata cv. 'Calcutta 4') genome. Analysis of almost 100 Mb of sequence data (0.15× genome coverage) permitted partial sequence reconstruction and characterization of repetitive DNA, making up about 30% of the genome. The results showed that the banana repeats are predominantly made of various types of Ty1/copia and Ty3/gypsy retroelements representing 16 and 7% of the genome respectively. On the other hand, DNA transposons were found to be rare. In addition to new families of transposable elements, two new satellite repeats were discovered and found useful as cytogenetic markers. To help in banana sequence annotation, a specific Musa repeat database was created, and its utility was demonstrated by analyzing the repeat composition of 62 genomic BAC clones. A low-depth 454 sequencing of banana nuclear genome provided the largest amount of DNA sequence data available until now for Musa and permitted reconstruction of most of the major types of DNA repeats. The information obtained in this study improves the knowledge of the long-range organization of banana chromosomes, and provides sequence resources needed for repeat masking and annotation during the Musa genome sequencing project. It also provides sequence data for isolation of DNA markers to be used in genetic
Transposable element distribution, abundance and role in genome size variation in the genus Oryza.

Science.gov (United States)

Zuccolo, Andrea; Sebastian, Aswathy; Talag, Jayson; Yu, Yeisoo; Kim, HyeRan; Collura, Kristi; Kudrna, Dave; Wing, Rod A

2007-08-29

The genus Oryza is composed of 10 distinct genome types, 6 diploid and 4 polyploid, and includes the world's most important food crop, rice (Oryza sativa [AA]). Genome size variation in Oryza is more than 3-fold and ranges from 357 Mbp in Oryza glaberrima [AA] to 1283 Mbp in the polyploid Oryza ridleyi [HHJJ]. Because repetitive elements are known to play a significant role in genome size variation, we constructed random sheared small insert genomic libraries from 12 representative Oryza species and conducted a comprehensive study of the repetitive element composition, distribution and phylogeny in this genus. Particular attention was paid to the role played by the most important classes of transposable elements (Long Terminal Repeat Retrotransposons, Long Interspersed Nuclear Elements, helitrons, DNA transposable elements) in shaping these genomes and in contributing to genome size variation. We identified the elements primarily responsible for the most striking genome size variation in Oryza. We demonstrated how Long Terminal Repeat retrotransposons belonging to the same families have proliferated to very different extents in various species. We also showed that the pool of Long Terminal Repeat retrotransposons is substantially conserved and ubiquitous throughout Oryza, so its origin is ancient and its existence predates the speciation events that originated the genus. Finally, we described the peculiar behavior of repeats in the species Oryza coarctata [HHKK], whose placement in the Oryza genus is controversial. Long Terminal Repeat retrotransposons are the major component of the Oryza genomes analyzed and, along with polyploidization, are the most important contributors to the genome size variation across the Oryza genus. Two families of Ty3-gypsy elements (RIRE2 and Atlantys) account for a significant portion of the genome size variation present in the Oryza genus.
Parallel altitudinal clines reveal trends in adaptive evolution of genome size in Zea mays

Science.gov (United States)

Berg, Jeremy J.; Birchler, James A.; Grote, Mark N.; Lorant, Anne; Quezada, Juvenal

2018-01-01

While the vast majority of genome size variation in plants is due to differences in repetitive sequence, we know little about how selection acts on repeat content in natural populations. Here we investigate parallel changes in intraspecific genome size and repeat content of domesticated maize (Zea mays) landraces and their wild relative teosinte across altitudinal gradients in Mesoamerica and South America. We combine genotyping, low coverage whole-genome sequence data, and flow cytometry to test for evidence of selection on genome size and individual repeat abundance. We find that population structure alone cannot explain the observed variation, implying that clinal patterns of genome size are maintained by natural selection. Our modeling additionally provides evidence of selection on individual heterochromatic knob repeats, likely due to their large individual contribution to genome size. To better understand the phenotypes driving selection on genome size, we conducted a growth chamber experiment using a population of highland teosinte exhibiting extensive variation in genome size. We find weak support for a positive correlation between genome size and cell size, but stronger support for a negative correlation between genome size and the rate of cell production. Reanalyzing published data of cell counts in maize shoot apical meristems, we then identify a negative correlation between cell production rate and flowering time. Together, our data suggest a model in which variation in genome size is driven by natural selection on flowering time across altitudinal clines, connecting intraspecific variation in repetitive sequence to important differences in adaptive phenotypes. PMID:29746459
Evolution of linear chromosomes and multipartite genomes in yeast mitochondria

Science.gov (United States)

Valach, Matus; Farkas, Zoltan; Fricova, Dominika; Kovac, Jakub; Brejova, Brona; Vinar, Tomas; Pfeiffer, Ilona; Kucsera, Judit; Tomaska, Lubomir; Lang, B. Franz; Nosek, Jozef

2011-01-01

Mitochondrial genome diversity in closely related species provides an excellent platform for investigation of chromosome architecture and its evolution by means of comparative genomics. In this study, we determined the complete mitochondrial DNA sequences of eight Candida species and analyzed their molecular architectures. Our survey revealed a puzzling variability of genome architecture, including circular- and linear-mapping and multipartite linear forms. We propose that the arrangement of large inverted repeats identified in these genomes plays a crucial role in alterations of their molecular architectures. In specific arrangements, the inverted repeats appear to function as resolution elements, allowing genome conversion among different topologies, eventually leading to genome fragmentation into multiple linear DNA molecules. We suggest that molecular transactions generating linear mitochondrial DNA molecules with defined telomeric structures may parallel the evolutionary emergence of linear chromosomes and multipartite genomes in general and may provide clues for the origin of telomeres and pathways implicated in their maintenance. PMID:21266473
Lactobacillus buchneri genotyping on the basis of clustered regularly interspaced short palindromic repeat (CRISPR) locus diversity.

Science.gov (United States)

Briner, Alexandra E; Barrangou, Rodolphe

2014-02-01

Clustered regularly interspaced short palindromic repeats (CRISPR) in combination with associated sequences (cas) constitute the CRISPR-Cas immune system, which takes up DNA from invasive genetic elements as novel "spacers" that provide a genetic record of immunization events. We investigated the potential of CRISPR-based genotyping of Lactobacillus buchneri, a species relevant for commercial silage, bioethanol, and vegetable fermentations. Upon investigating the occurrence and diversity of CRISPR-Cas systems in Lactobacillus buchneri genomes, we observed a ubiquitous occurrence of CRISPR arrays containing a 36-nucleotide (nt) type II-A CRISPR locus adjacent to four cas genes, including the universal cas1 and cas2 genes and the type II signature gene cas9. Comparative analysis of CRISPR spacer content in 26 L. buchneri pickle fermentation isolates associated with spoilage revealed 10 unique locus genotypes that contained between 9 and 29 variable spacers. We observed a set of conserved spacers at the ancestral end, reflecting a common origin, as well as leader-end polymorphisms, reflecting recent divergence. Some of these spacers showed perfect identity with phage sequences, and many spacers showed homology to Lactobacillus plasmid sequences. Following a comparative analysis of sequences immediately flanking protospacers that matched CRISPR spacers, we identified a novel putative protospacer-adjacent motif (PAM), 5'-AAAA-3'. Overall, these findings suggest that type II-A CRISPR-Cas systems are valuable for genotyping of L. buchneri.
Virtual Genome Walking across the 32 Gb Ambystoma mexicanum genome; assembling gene models and intronic sequence.

Science.gov (United States)

Evans, Teri; Johnson, Andrew D; Loose, Matthew

2018-01-12

Large, repeat-rich genomes present challenges for assembly using short-read technologies. The 32 Gb axolotl genome is estimated to contain ~19 Gb of repetitive DNA, making an assembly from short reads alone effectively impossible. Indeed, this model species has been sequenced to 20× coverage but the reads could not be conventionally assembled. Using an alternative strategy, we have assembled subsets of these reads into scaffolds describing over 19,000 gene models. We call this method Virtual Genome Walking as it locally assembles whole genome reads based on a reference transcriptome, identifying exons and iteratively extending them into surrounding genomic sequence. These assemblies are then linked and refined to generate gene models including upstream and downstream genomic, and intronic, sequence. Our assemblies are validated by comparison with previously published axolotl bacterial artificial chromosome (BAC) sequences. Our analyses of axolotl intron length, intron-exon structure, repeat content and synteny provide novel insights into the genic structure of this model species. This resource will enable new experimental approaches in axolotl, such as ChIP-Seq and CRISPR, and aid in future whole genome sequencing efforts. The assembled sequences and annotations presented here are freely available for download from https://tinyurl.com/y8gydc6n. The software pipeline is available from https://github.com/LooseLab/iterassemble.
Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes.

Science.gov (United States)

Gao, Lei; Yi, Xuan; Yang, Yong-Xia; Su, Ying-Juan; Wang, Ting

2009-06-11

Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The Alsophila cp genome is very similar to that of the
Complete chloroplast genome sequence of a tree fern Alsophila spinulosa: insights into evolutionary changes in fern chloroplast genomes

Directory of Open Access Journals (Sweden)

Yang Yong-Xia

2009-06-01

Full Text Available Abstract Background: Ferns have generally been neglected in studies of chloroplast genomics. Before this study, only one polypod and two basal ferns had their complete chloroplast (cp) genome reported. Tree ferns represent an ancient fern lineage that first occurred in the Late Triassic. In recent phylogenetic analyses, tree ferns were shown to be the sister group of polypods, the most diverse group of living ferns. Availability of cp genome sequence from a tree fern will facilitate interpretation of the evolutionary changes of fern cp genomes. Here we have sequenced the complete cp genome of a scaly tree fern Alsophila spinulosa (Cyatheaceae). Results: The Alsophila cp genome is 156,661 base pairs (bp) in size, and has a typical quadripartite structure with the large (LSC, 86,308 bp) and small single copy (SSC, 21,623 bp) regions separated by two copies of an inverted repeat (IRs, 24,365 bp each). This genome contains 117 different genes encoding 85 proteins, 4 rRNAs and 28 tRNAs. Pseudogenes of ycf66 and trnT-UGU are also detected in this genome. A unique trnR-UCG gene (derived from trnR-CCG) is found between rbcL and accD. The Alsophila cp genome shares some unusual characteristics with the previously sequenced cp genome of the polypod fern Adiantum capillus-veneris, including the absence of 5 tRNA genes that exist in most other cp genomes. The genome shows a high degree of synteny with that of Adiantum, but differs considerably from two basal ferns (Angiopteris evecta and Psilotum nudum). At one endpoint of an ancient inversion we detected a highly repeated 565-bp-region that is absent from the Adiantum cp genome. An additional minor inversion of the trnD-GUC, which is possibly shared by all ferns, was identified by comparison between the fern and other land plant cp genomes. Conclusion: By comparing four fern cp genome sequences it was confirmed that two major rearrangements distinguish higher leptosporangiate ferns from basal fern lineages. The
Single nucleotide polymorphisms in the 5'-flanking region of the ...

African Journals Online (AJOL)

Prolactin (PRL), a polypeptide hormone synthesized and secreted by the animal's anterior pituitary gland, plays an important role in the regulation of mammalian lactation and avian reproduction. Considering the significant association between single nucleotide polymorphisms (SNPs) in the 5'-flanking region of PRL and ...
Experimental study of the interplay between magmatic rift intrusion and flank instability with application to the 2001 Mount Etna eruption

KAUST Repository

Le Corvec, Nicolas

2014-07-01

Mount Etna volcano is subject to transient magmatic intrusions and flank movement. The east flank of the edifice, in particular, is moving eastward and is dissected by the Timpe Fault System. The relationship of this eastward motion with intrusions and tectonic fault motion, however, remains poorly constrained. Here we explore this relationship by using analogue experiments that are designed to simulate magmatic rift intrusion, flank movement, and fault activity before, during, and after a magmatic intrusion episode. Using particle image velocimetry allows for a precise temporal and spatial analysis of the development and activity of fault systems. The results show that the occurrence of rift intrusion episodes has a direct effect on fault activity. In such a situation, fault activity may occur or may be hindered, depending on the interplay of fault displacement and flank acceleration in response to dike intrusion. Our results demonstrate that a complex interplay may exist between an active tectonic fault system and magmatically induced flank instability. Episodes of magmatic intrusion change the intensity pattern of horizontal flank displacements and may hinder or activate associated faults. We further compare our results with the GPS data of the Mount Etna 2001 eruption and intrusion. We find that syneruptive displacement rates at the Timpe Fault System have differed from the preeruptive or posteruptive periods, which shows a good agreement of both the experimental and the GPS data. Therefore, understanding the flank instability and flank stability at Mount Etna requires consideration of both tectonic and magmatic forcing. Key Points Analyzing Mount Etna east flank dynamics during the 2001 eruption Good correlation between analogue models and GPS data Understanding the different behavior of faulting before/during/after an eruption © 2014. American Geophysical Union. All Rights Reserved.
Experimental study of the interplay between magmatic rift intrusion and flank instability with application to the 2001 Mount Etna eruption

KAUST Repository

Le Corvec, Nicolas; Walter, Thomas R.; Ruch, Joel; Bonforte, Alessandro; Puglisi, Giuseppe

2014-01-01

Mount Etna volcano is subject to transient magmatic intrusions and flank movement. The east flank of the edifice, in particular, is moving eastward and is dissected by the Timpe Fault System. The relationship of this eastward motion with intrusions and tectonic fault motion, however, remains poorly constrained. Here we explore this relationship by using analogue experiments that are designed to simulate magmatic rift intrusion, flank movement, and fault activity before, during, and after a magmatic intrusion episode. Using particle image velocimetry allows for a precise temporal and spatial analysis of the development and activity of fault systems. The results show that the occurrence of rift intrusion episodes has a direct effect on fault activity. In such a situation, fault activity may occur or may be hindered, depending on the interplay of fault displacement and flank acceleration in response to dike intrusion. Our results demonstrate that a complex interplay may exist between an active tectonic fault system and magmatically induced flank instability. Episodes of magmatic intrusion change the intensity pattern of horizontal flank displacements and may hinder or activate associated faults. We further compare our results with the GPS data of the Mount Etna 2001 eruption and intrusion. We find that syneruptive displacement rates at the Timpe Fault System have differed from the preeruptive or posteruptive periods, which shows a good agreement of both the experimental and the GPS data. Therefore, understanding the flank instability and flank stability at Mount Etna requires consideration of both tectonic and magmatic forcing. Key Points Analyzing Mount Etna east flank dynamics during the 2001 eruption Good correlation between analogue models and GPS data Understanding the different behavior of faulting before/during/after an eruption © 2014. American Geophysical Union. All Rights Reserved.
Telomeres and viruses: common themes of genome maintenance

Science.gov (United States)

Deng, Zhong; Wang, Zhuo; Lieberman, Paul M.

2012-01-01

Genome maintenance mechanisms actively suppress genetic instability associated with cancer and aging. Some viruses provoke genetic instability by subverting the host’s control of genome maintenance. Viruses have their own specialized strategies for genome maintenance, which can mimic and modify host cell processes. Here, we review some of the common features of genome maintenance utilized by viruses and host chromosomes, with a particular focus on terminal repeat (TR) elements. The TRs of cellular chromosomes, better known as telomeres, have well-established roles in cellular chromosome stability. Cellular telomeres are themselves maintained by viral-like mechanisms, including self-propagation by reverse transcription, recombination, and retrotransposition. Viral TR elements, like cellular telomeres, are essential for viral genome stability and propagation. We review the structure and function of viral repeat elements and discuss how they may share telomere-like structures and genome protection functions. We consider how viral infections modulate telomere regulatory factors for viral repurposing and can alter normal host telomere structure and chromosome stability. Understanding the common strategies of viral and cellular genome maintenance may provide new insights into viral–host interactions and the mechanisms driving genetic instability in cancer. PMID:23293769
Geologic setting of the proposed West Flank Forge Site, California: Suitability for EGS research and development

Science.gov (United States)

Sabin, Andrew; Blake, Kelly; Lazaro, Mike; Blankenship, Douglas; Kennedy, Mack; McCullough, Jess; DeOreo, S.B.; Hickman, Stephen H.; Glen, Jonathan; Kaven, Joern; Williams, Colin F.; Phelps, Geoffrey; Faulds, James E.; Hinz, Nicholas H.; Calvin, Wendy M.; Siler, Drew; Robertson-Tait, Ann

2017-01-01

The proposed West Flank FORGE site is within the China Lake Naval Air Weapons Station (NAWS), China Lake, CA. The West Flank is west of the Coso geothermal field, an area of China Lake NAWS dominated by the Quaternary Coso volcanic field largely comprised of rhyolite domes and their volcaniclastic and epiclastic horizons. The largest dome flow complex, Sugarloaf Mountain, marks the northwestern margin of the geothermal field. The West Flank is situated due west of Sugarloaf. The geologic setting of the West Flank was determined from one deep well (83-11) drilled as a potential production hole in 2009. The bottom-hole temperature (BHT) of well 83-11 approaches 600 °F (315 °C), but flow tests demonstrate very low, non-commercial permeabilities. With the exception of the upper 600 feet of volcaniclastic alluvium, well 83-11 is completed in granitic basement. The West Flank possesses the primary attributes of a FORGE site: non-commercial permeability ([...] geothermal field. The Coso Mountains host the Coso volcanic field and are within a right-releasing stepover between the dextral Airport Lake (ALF) and Little Lake fault zones (LLFZ) and the Wild Horse Mesa and Owens Valley faults. Two distinct fault populations have been identified at Coso: WNW-trending and antithetical, NE-trending strike-slip faults and N- to NNE-trending normal faults. These faults are both high permeability drilling targets at depth within the main (productive) geothermal field and they locally segment the field into distinct hydrothermal regimes. The West Flank may be segmented from the rest of the field by one such northerly trending fault. The overall minimum principal stress orientation in the main geothermal field varies from 103° to 108°; however, the minimum horizontal principal stress in 83-11 is rotated to 081°.
Whole genome resequencing reveals natural target site preferences of transposable elements in Drosophila melanogaster.

Directory of Open Access Journals (Sweden)

Raquel S Linheiro

Full Text Available Transposable elements are mobile DNA sequences that integrate into host genomes using diverse mechanisms with varying degrees of target site specificity. While the target site preferences of some engineered transposable elements are well studied, the natural target preferences of most transposable elements are poorly characterized. Using population genomic resequencing data from 166 strains of Drosophila melanogaster, we identified over 8,000 new insertion sites not present in the reference genome sequence that we used to decode the natural target preferences of 22 families of transposable element in this species. We found that terminal inverted repeat transposon and long terminal repeat retrotransposon families present clade-specific target site duplications and target site sequence motifs. Additionally, we found that the sequence motifs at transposable element target sites are always palindromes that extend beyond the target site duplication. Our results demonstrate the utility of population genomics data for high-throughput inference of transposable element targeting preferences in the wild and establish general rules for terminal inverted repeat transposon and long terminal repeat retrotransposon target site selection in eukaryotic genomes.
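The palindromic target-site motifs mentioned in this abstract can be illustrated with a small check. The following is a minimal sketch, assuming that "palindrome" is read in the strict DNA sense (a sequence equal to its own reverse complement); the function names are hypothetical and this is not the authors' analysis pipeline, which may use a looser, consensus-based definition.

```python
# Hypothetical helper names; strict reverse-complement palindromes only.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A/C/G/T only)."""
    return seq.translate(_COMPLEMENT)[::-1]


def is_palindromic(seq: str) -> bool:
    """True if the sequence reads the same on both strands, 5'->3'."""
    s = seq.upper()
    return s == reverse_complement(s)


if __name__ == "__main__":
    for motif in ("GAATTC", "GATTACA"):
        print(motif, is_palindromic(motif))
```

Under this strict definition an odd-length motif can never qualify, because its central base would have to equal its own complement.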
APE1 incision activity at abasic sites in tandem repeat sequences.

Science.gov (United States)

Li, Mengxia; Völker, Jens; Breslauer, Kenneth J; Wilson, David M

2014-05-29

Repetitive DNA sequences, such as those present in microsatellites and minisatellites, telomeres, and trinucleotide repeats (linked to fragile X syndrome, Huntington disease, etc.), account for nearly 30% of the human genome. These domains exhibit enhanced susceptibility to oxidative attack to yield base modifications, strand breaks, and abasic sites; have a propensity to adopt non-canonical DNA forms modulated by the positions of the lesions; and, when not properly processed, can contribute to genome instability that underlies aging and disease development. Knowledge on the repair efficiencies of DNA damage within such repetitive sequences is therefore crucial for understanding the impact of such domains on genomic integrity. In the present study, using strategically designed oligonucleotide substrates, we determined the ability of human apurinic/apyrimidinic endonuclease 1 (APE1) to cleave at apurinic/apyrimidinic (AP) sites in a collection of tandem DNA repeat landscapes involving telomeric and CAG/CTG repeat sequences. Our studies reveal the differential influence of domain sequence, conformation, and AP site location/relative positioning on the efficiency of APE1 binding and strand incision. Intriguingly, our data demonstrate that APE1 endonuclease efficiency correlates with the thermodynamic stability of the DNA substrate. We discuss how these results have both predictive and mechanistic consequences for understanding the success and failure of repair protein activity associated with such oxidatively sensitive, conformationally plastic/dynamic repetitive DNA domains. Published by Elsevier Ltd.
Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

Science.gov (United States)

Kanhayuwa, Lakkhana; Coutts, Robert H A

2016-01-01

Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to respectively A. fumigatus 5S rRNAs and SINE3-1_AO found in A. oryzae. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.
Stress-induced rearrangement of Fusarium retrotransposon sequences.

Science.gov (United States)

Anaya, N; Roncero, M I

1996-11-27

Rearrangement of the Fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.
The Complete Chloroplast Genome of Ye-Xing-Ba (Scrophularia dentata; Scrophulariaceae), an Alpine Tibetan Herb.

Science.gov (United States)

Ni, Lianghong; Zhao, Zhili; Dorje, Gaawe; Ma, Mi

2016-01-01

Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within Scrophularia (Scrophulariaceae). The gene order and organization of the chloroplast genome of S. dentata are similar to other Lamiales chloroplast genomes. The plastome is 152,553 bp in length and includes a pair of inverted repeats (IRs) of 25,523 bp that separate a large single copy (LSC) region of 84,058 bp and a small single copy (SSC) region of 17,449 bp. It has 38.0% GC content and includes 114 unique genes, of which 80 are protein-coding, 30 are transfer RNA, and 4 are ribosomal RNA. Also, it contains 21 forward repeats, 19 palindrome repeats and 41 simple sequence repeats (SSRs). The repeats and SSRs within S. dentata were compared with those of S. takesimensis and present certain discrepancies. The chloroplast genome of S. dentata was compared with other five publicly available Lamiales genomes from different families. All the coding regions and non-coding regions (introns and intergenic spacers) within the six chloroplast genomes have been extracted and analysed. Furthermore, the genome divergent hotspot regions were identified. Our studies could provide basic data for the alpine medicinal species conservation and molecular phylogenetic researches of Scrophulariaceae and Lamiales.
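The region sizes reported for quadripartite plastomes can be cross-checked with simple arithmetic: the total length should equal LSC + SSC + 2 × IR. The sketch below uses a hypothetical helper name and the figures quoted in this abstract; it is a consistency check only, not part of the published analysis.

```python
def plastome_length(lsc: int, ssc: int, ir: int) -> int:
    """Total length of a quadripartite plastome: LSC + SSC + two IR copies."""
    return lsc + ssc + 2 * ir


# Figures quoted in the S. dentata abstract (bp).
s_dentata = plastome_length(lsc=84_058, ssc=17_449, ir=25_523)
print(s_dentata)  # 152553, matching the reported 152,553 bp
```

The same check applied to the Alsophila spinulosa figures listed earlier (86,308 + 21,623 + 2 × 24,365) gives 156,661 bp, which also matches the reported total.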
The Complete Chloroplast Genome of Ye-Xing-Ba (Scrophularia dentata; Scrophulariaceae), an Alpine Tibetan Herb.

Directory of Open Access Journals (Sweden)

Lianghong Ni

Full Text Available Scrophularia dentata is an important Tibetan medicinal plant and traditionally used for the treatment of exanthema and fever in Traditional Tibetan Medicine (TTM). However, there is little sequence and genomic information available for S. dentata. In this paper, we report the complete chloroplast genome sequence of S. dentata and it is the first sequenced member of the Sect. Tomiophyllum within Scrophularia (Scrophulariaceae). The gene order and organization of the chloroplast genome of S. dentata are similar to other Lamiales chloroplast genomes. The plastome is 152,553 bp in length and includes a pair of inverted repeats (IRs) of 25,523 bp that separate a large single copy (LSC) region of 84,058 bp and a small single copy (SSC) region of 17,449 bp. It has 38.0% GC content and includes 114 unique genes, of which 80 are protein-coding, 30 are transfer RNA, and 4 are ribosomal RNA. Also, it contains 21 forward repeats, 19 palindrome repeats and 41 simple sequence repeats (SSRs). The repeats and SSRs within S. dentata were compared with those of S. takesimensis and present certain discrepancies. The chloroplast genome of S. dentata was compared with other five publicly available Lamiales genomes from different families. All the coding regions and non-coding regions (introns and intergenic spacers) within the six chloroplast genomes have been extracted and analysed. Furthermore, the genome divergent hotspot regions were identified. Our studies could provide basic data for the alpine medicinal species conservation and molecular phylogenetic researches of Scrophulariaceae and Lamiales.
Cloning and characterization of the 5'-flanking region of the Ehox gene

International Nuclear Information System (INIS)

Lee, Woon Kyu; Kim, Yong-Man; Malik, Nasir; Ma Chang; Westphal, Heiner

2006-01-01

The paired-like homeobox-containing gene Ehox plays a role in embryonic stem cell differentiation and is highly expressed in the developing placenta and thymus. To understand the mechanisms of regulation of Ehox gene expression, the 5'-flanking region of the Ehox gene was isolated from a mouse BAC library. 5'-RACE analysis revealed a single transcriptional start site 130 nucleotides upstream of the translation initiation codon. Transient transfection with a luciferase reporter gene under the control of serially deleted 5'-flanking sequences revealed that the nt -84 to -68 region contained a positive cis-acting element for efficient expression of the Ehox gene. Mutational analysis of this region and oligonucleotide competition in the electrophoretic mobility shift assay revealed the presence of a CCAAT box, which is a target for transcription nuclear factor Y (NFY). NFY is essential for positive gene regulation. No tissue-specific enhancer was identified in the 1.9-kb 5'-flanking region of the Ehox gene. Ehox is expressed during the early stages of embryo development, specifically in the brain at 9.5 dpc, as well as during the late stages of embryo development. These results suggest that NFY is an essential regulatory factor for Ehox transcriptional activity, which is important for the post-implantation stage of the developing embryo.

Characterization of new Schistosoma mansoni microsatellite loci in sequences obtained from public DNA databases and microsatellite enriched genomic libraries

Directory of Open Access Journals (Sweden)

Rodrigues NB

2002-01-01

Full Text Available In the last decade microsatellites have become one of the most useful genetic markers used in a large number of organisms due to their abundance and high level of polymorphism. Microsatellites have been used for individual identification, paternity tests, forensic studies and population genetics. Data on microsatellite abundance comes preferentially from microsatellite enriched libraries and DNA sequence databases. We have conducted a search in GenBank of more than 16,000 Schistosoma mansoni ESTs and 42,000 BAC sequences. In addition, we obtained 300 sequences from CA and AT microsatellite enriched genomic libraries. The sequences were searched for simple repeats using the RepeatMasker software. Of 16,022 ESTs, we detected 481 (3%) sequences that contained 622 microsatellites (434 perfect, 164 imperfect and 24 compounds). Of the 481 ESTs, 194 were grouped in 63 clusters containing 2 to 15 ESTs per cluster. Polymorphisms were observed in 16 clusters. The 287 remaining ESTs were orphan sequences. Of the 42,017 BAC end sequences, 1,598 (3.8%) contained microsatellites (2,335 perfect, 287 imperfect and 79 compounds). Of the 1,598 BAC end sequences, 80 were grouped into 17 clusters containing 3 to 17 BAC end sequences per cluster. Microsatellites were present in 67 out of 300 sequences from microsatellite enriched libraries (55 perfect, 38 imperfect and 15 compounds). From all of the observed loci, 55 were selected for having the longest perfect repeats and flanking regions that allowed the design of primers for PCR amplification. Additionally, we describe two new polymorphic microsatellite loci.
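To make the notion of a "perfect" microsatellite concrete, the following is a minimal regular-expression sketch. It is not RepeatMasker and does not reproduce the study's settings: the function name, the unit-length range (1 to 6 bp) and the minimum of five repeats are all assumptions chosen for illustration, and imperfect or compound repeats are not handled.

```python
import re


def find_perfect_microsatellites(seq, min_unit=1, max_unit=6, min_repeats=5):
    """Return (start, unit, repeat_count) for perfect tandem repeats.

    Thresholds are illustrative assumptions, not values from the study.
    """
    # Lazy unit match so the shortest repeating unit is reported
    # (e.g. "CA" rather than "CACA"); the unit must then recur
    # at least (min_repeats - 1) more times.
    pattern = re.compile(
        r"([ACGT]{%d,%d}?)\1{%d,}" % (min_unit, max_unit, min_repeats - 1)
    )
    hits = []
    for m in pattern.finditer(seq.upper()):
        unit = m.group(1)
        hits.append((m.start(), unit, len(m.group(0)) // len(unit)))
    return hits


if __name__ == "__main__":
    example = "TT" + "CA" * 6 + "GG"
    print(find_perfect_microsatellites(example))  # [(2, 'CA', 6)]
```

Start positions are 0-based. Selecting loci for primer design, as described in the abstract, would additionally require checking the flanking sequence on both sides of each hit.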
Flank gland-secreted putative chemosignals pertaining to photoperiod, endocrine states, and sociosexual behavior in golden hamsters

Directory of Open Access Journals (Sweden)

Ying-Juan LIU, Da-Wei WANG, Lixing SUN, Jin-Hua ZHANG, Jian-Xu ZHANG

2010-12-01

Full Text Available Behavioral studies have shown that flank glands are involved in chemical communication in golden hamsters Mesocricetus auratus but little chemical analysis has been conducted on volatiles arising from these glands. Using gas chromatography-mass spectrometry, we detected compounds from the flank glands of males, only eight of which were also produced in females. Based on these chemical data we performed a number of further experiments. By manipulating light we found that males exposed to short-photoperiods developed smaller flank glands than those exposed to long-photoperiods. Six flank gland volatiles reduced in relative abundance, which possibly coded for reproductive status of males of this seasonally breeding hamster species. Through dyadic encounters, we were able to induce the formation of dominant-subordinate relationships and show that two glandular compounds became high in relative abundance and may function as dominance pheromones. Castration eliminated all male-specific compounds resulting from flank glands, but bilateral ovariectomies only affected one compound in females. Once these ovariectomized females were treated with testosterone, their glandular compounds resembled those of males, suggesting these compounds are under the main control of androgen. Two female putative pheromones, tetradecanoic acid and hexadecanoic acid, were used in binary choice tests and were both found to attract males over females. Applying a solution of these pheromone compounds to adult males also suppressed their agonistic behavior [Current Zoology 56 (6): 800–812, 2010].
Human terminal deoxyribonucleotidyltransferase: molecular cloning and structural analysis of the gene and 5' flanking region

International Nuclear Information System (INIS)

Riley, L.K.; Morrow, J.K.; Danton, M.J.; Coleman, M.S.

1988-01-01

Human terminal deoxyribonucleotidyltransferase cDNA contains an open reading frame of 1530 base pairs (bp) corresponding to a protein containing 510 amino acids. The encoded protein is a template-independent DNA polymerase found only in a restricted population of normal and malignant prelymphocytes. To begin to investigate the genetic elements responsible for the tissue-specific expression of terminal deoxyribonucleotidyltransferase, genomic clones, containing the entire human gene were isolated and characterized. Initially, cDNA clones were isolated from a library generated from the human lymphoblastoid cell line, MOLT-4R. A cDNA clone containing the entire coding region of the protein was used to isolate a series of overlapping clones from two human genomic libraries. The gene comprises 11 exons and 10 introns and spans 49.4 kilobases. The 5' flanking region (709 bp) including exon 1 was sequenced. Several putative transcription initiation sites were mapped. Within 500 nucleotides of the translation start site, a series of promoter elements was detected. TATA and CAAT sequences, respectively, were found to start at nucleotides -185 and -204, -328 and -370, and -465 and -505. Start sites were found for a cyclic AMP-dependent promoter analog at nucleotide -121, an eight-base sequence corresponding to the IgG promoter enhancer (cd) at nucleotide -455, and an analog of the IgG promoter (pd) at nucleotide -159. These findings suggest that transcripts coding for terminal deoxyribonucleotidyltransferase may be variable in length and that transcription may be influenced by a variety of genetic elements
Genome analysis of an atypical bovine pestivirus from fetal bovine serum.

Science.gov (United States)

Gao, Shandian; Du, Junzheng; Tian, Zhancheng; Xing, Shanshan; Chang, Huiyun; Liu, Guangyuan; Luo, Jianxun; Yin, Hong

2016-08-01

We report the complete genome sequence of a bovine pestivirus LVRI/cont-1 originated from a commercial batch of fetal bovine serum. Its complete genome consists of 12,282 nucleotides (nt), which contain an open reading frame (ORF) of 11,700 bp flanked by 5' and 3' untranslated regions (383 and 199 bp). The size of the 5'UTR and the individual protein coding region of LVRI/cont-1 are identical to those of the reference virus Th/04_KhonKaen, but it has a deletion of the first 56 nt in the 3'UTR. Alignment of the complete nucleotide sequence and phylogenetic analysis indicate that this viral isolate belongs to atypical pestiviruses.
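The genome layout given in this abstract can be checked with a short calculation: the two untranslated regions plus the ORF should sum to the full genome length, and the ORF length should be divisible by three. This is a sketch with a hypothetical function name, using only the figures quoted above.

```python
def genome_layout(utr5: int, orf: int, utr3: int) -> tuple[int, int]:
    """Return (total genome length in nt, number of codons in the ORF)."""
    if orf % 3 != 0:
        raise ValueError("ORF length is not a multiple of three")
    return utr5 + orf + utr3, orf // 3


total_nt, codons = genome_layout(utr5=383, orf=11_700, utr3=199)
print(total_nt, codons)  # 12282 3900
```

The total agrees with the reported 12,282 nt. The codon count includes the stop codon if the quoted ORF length does, which the abstract does not specify.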
Structural features in the HIV-1 repeat region facilitate strand transfer during reverse transcription

NARCIS (Netherlands)

Berkhout, B.; Vastenhouw, N. L.; Klasens, B. I.; Huthoff, H.

2001-01-01

Two obligatory DNA strand transfers take place during reverse transcription of a retroviral RNA genome. The first strand transfer is facilitated by terminal repeat (R) elements in the viral genome. This strand-transfer reaction depends on base pairing between the cDNA of the 5'R and the 3'R. There
The role of viscous magma mush spreading in volcanic flank motion at Kīlauea Volcano, Hawai'i

NARCIS (Netherlands)

Plattner, C.; Amelung, F.; Baker, S.; Govers, R.; Poland, M.

2013-01-01

Multiple mechanisms have been suggested to explain seaward motion of the south flank of Kīlauea Volcano, Hawai'i. The consistency of flank motion during both waxing and waning magmatic activity at Kīlauea suggests that a continuously acting force, like gravity body force, plays a substantial role.
A family of DNA repeats in Aspergillus nidulans has assimilated degenerated retrotransposons

DEFF Research Database (Denmark)

Nielsen, M.L.; Hermansen, T.D.; Aleksenko, Alexei Y.

2001-01-01

In the course of a chromosomal walk towards the centromere of chromosome IV of Aspergillus nidulans, several cross-hybridizing genomic cosmid clones were isolated. Restriction mapping of two such clones revealed that their restriction patterns were similar in a region of at least 15 kb, indicating the presence of a large repeat. The nature of the repeat was further investigated by sequencing and Southern analysis. The study revealed a family of long dispersed repeats with a high degree of sequence similarity. The number and location of the repeats vary between wild isolates. Two copies of the repeat... ...) phenomenon, first described in Neurospora crassa, may have operated in A. nidulans. The data indicate that this family of repeats has assimilated mobile elements that subsequently degenerated but then underwent further duplications as a part of the host repeats.
Successful flank appraisal with a horizontal well: a Niger Delta example

Energy Technology Data Exchange (ETDEWEB)

Ohanele, C.; Emelumadu, U.

1998-12-31

A case study of a horizontal well successfully drilled in 1994 by Shell Oil in the Niger Delta is described. The well was drilled with the objectives of improving drainage of the major D3.1 reservoir and appraising the poorly defined eastern flank for structure and fluid content of the overlying D3.0 sand. The well was optimized by 3D reservoir and hydrocarbon modeling of these reservoirs. Combining the development and appraisal objectives in one horizontal well proved to be the optimal solution, from both a cost and a production standpoint. The well proved up over 50 MMstb of additional reserves. The structural flank proved to be significantly shallower than previously mapped and had a positive effect not only on the D3.0 reserves, but also on the D3.1. 6 figs.
Complete Chloroplast Genome of Pinus massoniana (Pinaceae): Gene Rearrangements, Loss of ndh Genes, and Short Inverted Repeats Contraction, Expansion.

Science.gov (United States)

Ni, ZhouXian; Ye, YouJu; Bai, Tiandao; Xu, Meng; Xu, Li-An

2017-09-11

The chloroplast genome (CPG) of Pinus massoniana, a member of the genus Pinus (Pinaceae) and a primary source of turpentine, was sequenced and analyzed in terms of gene rearrangements, loss of ndh genes, and the contraction and expansion of short inverted repeats (IRs). The P. massoniana CPG has a typical quadripartite structure that includes a large single copy (LSC) region (65,563 bp), a small single copy (SSC) region (53,230 bp) and two IRs (IRa and IRb, 485 bp). A total of 108 unique genes were identified, including 73 protein-coding genes, 31 tRNAs, and 4 rRNAs. Most of the 81 simple sequence repeats (SSRs) identified in the CPG were mononucleotide motifs of the A/T type and were located in non-coding regions. Comparisons with related species revealed an inversion (21,556 bp) in the LSC region; the P. massoniana CPG lacks all 11 intact ndh genes (four ndh genes lost completely; the five remained truncated as pseudogenes; and the other two ndh genes remain as pseudogenes because of short insertions or deletions). A pair of short IRs was found instead of large IRs, and size variations among pine species were observed, which resulted from short insertions or deletions and non-synchronized variations between "IRa" and "IRb". The results of phylogenetic analyses based on whole CPG sequences of 16 conifers indicated that whole CPG sequences could be used as a powerful tool in phylogenetic analyses.
Genome Modeling System: A Knowledge Management Platform for Genomics.

Directory of Open Access Journals (Sweden)

Malachi Griffith

2015-07-01

Full Text Available In this work, we present the Genome Modeling System (GMS), an analysis information management system capable of executing automated genome analysis pipelines at a massive scale. The GMS framework provides detailed tracking of samples and data coupled with reliable and repeatable analysis pipelines. The GMS also serves as a platform for bioinformatics development, allowing a large team to collaborate on data analysis, or an individual researcher to leverage the work of others effectively within its data management system. Rather than separating ad-hoc analysis from rigorous, reproducible pipelines, the GMS promotes systematic integration between the two. As a demonstration of the GMS, we performed an integrated analysis of whole genome, exome and transcriptome sequencing data from a breast cancer cell line (HCC1395) and matched lymphoblastoid line (HCC1395BL). These data are available for users to test the software, complete tutorials and develop novel GMS pipeline configurations. The GMS is available at https://github.com/genome/gms.
Extrachromosomal circles of satellite repeats and 5S ribosomal DNA in human cells

Directory of Open Access Journals (Sweden)

Cohen Sarit

2010-03-01

Full Text Available Background: Extrachromosomal circular DNA (eccDNA) is ubiquitous in eukaryotic organisms and was detected in every organism tested, including in humans. Two-dimensional gel electrophoresis facilitates the detection of eccDNA in preparations of genomic DNA. Using this technique we have previously demonstrated that most eccDNA consists of exact multiples of chromosomal tandemly repeated DNA, including both coding genes and satellite DNA. Results: Here we report the occurrence of eccDNA in every tested human cell line. It has heterogeneous mass ranging from less than 2 kb to over 20 kb. We describe eccDNA homologous to human alpha satellite and the SstI mega satellite. Moreover, we show, for the first time, circular multimers of the human 5S ribosomal DNA (rDNA), similar to previous findings in Drosophila and plants. We further demonstrate structures that correspond to intermediates of rolling circle replication, which emerge from the circular multimers of 5S rDNA and SstI satellite. Conclusions: These findings, and previous reports, support the general notion that every chromosomal tandem repeat is prone to generate eccDNA in eukaryotic organisms including humans. They suggest the possible involvement of eccDNA in the length variability observed in arrays of tandem repeats. The implications of eccDNA for genome biology may include mechanisms of centromere evolution, concerted evolution and homogenization of tandem repeats and genomic plasticity.
Genome rearrangements detected by SNP microarrays in individuals with intellectual disability referred with possible Williams syndrome.

Directory of Open Access Journals (Sweden)

Ariel M Pani

2010-08-01

Full Text Available Intellectual disability (ID) affects 2-3% of the population and may occur with or without multiple congenital anomalies (MCA) or other medical conditions. Established genetic syndromes and visible chromosome abnormalities account for a substantial percentage of ID diagnoses, although for approximately 50% the molecular etiology is unknown. Individuals with features suggestive of various syndromes but lacking their associated genetic anomalies pose a formidable clinical challenge. With the advent of microarray techniques, submicroscopic genome alterations not associated with known syndromes are emerging as a significant cause of ID and MCA. High-density SNP microarrays were used to determine genome wide copy number in 42 individuals: 7 with confirmed alterations in the WS region but atypical clinical phenotypes, 31 with ID and/or MCA, and 4 controls. One individual from the first group had the most telomeric gene in the WS critical region deleted along with 2 Mb of flanking sequence. A second person had the classic WS deletion and a rearrangement on chromosome 5p within the Cri du Chat syndrome (OMIM:123450) region. Six individuals from the ID/MCA group had large rearrangements (3 deletions, 3 duplications), one of whom had a large inversion associated with a deletion that was not detected by the SNP arrays. Combining SNP microarray analyses and qPCR allowed us to clone and sequence 21 deletion breakpoints in individuals with atypical deletions in the WS region and/or ID or MCA. Comparison of these breakpoints to databases of genomic variation revealed that 52% occurred in regions harboring structural variants in the general population. For two probands the genomic alterations were flanked by segmental duplications, which frequently mediate recurrent genome rearrangements; these may represent new genomic disorders. While SNP arrays and related technologies can identify potentially pathogenic deletions and duplications, obtaining sequence information
Development of EST-derived markers in Dendrobium from EST of related taxa

OpenAIRE

Narisa Juejun; Chataporn Chunwongse; Julapark Chunwongse

2013-01-01

Public databases are useful for molecular marker development. The major aim of this study was to develop expressed sequence tag (EST)-derived markers in Dendrobium from available ESTs of Phalaenopsis and Dendrobium. A total of 6063 sequences were screened for simple sequence repeats (SSRs) and introns. Primers flanking these regions were generated and tested on genomic DNAs of Phalaenopsis and Dendrobium. Twenty-three percent of amplifiable Phalaenopsis EST-derived markers were cross-genera trans...
Pichia stipitis genomics, transcriptomics, and gene clusters

Science.gov (United States)

Thomas W. Jeffries; Jennifer R. Headman Van Vleet

2009-01-01

Genome sequencing and subsequent global gene expression studies have advanced our understanding of the lignocellulose-fermenting yeast Pichia stipitis. These studies have provided an insight into its central carbon metabolism, and analysis of its genome has revealed numerous functional gene clusters and tandem repeats. Specialized physiological traits are often the...
Genomic Characterization for Parasitic Weeds of the Genus Striga by Sample Sequence Analysis

Directory of Open Access Journals (Sweden)

Matt C. Estep

2012-03-01

Full Text Available Generation of ∼2200 Sanger sequence reads or ∼10,000 454 reads for seven Striga Lour. DNA samples (five species) allowed identification of the highly repetitive DNA content in these genomes. The 14 most abundant repeats in these species were identified and partially assembled. Annotation indicated that they represent nine long terminal repeat (LTR) retrotransposon families, three tandem satellite repeats, one long interspersed element (LINE) retroelement, and one DNA transposon. All of these repeats are most closely related to repetitive elements in other closely related plants and are not products of horizontal transfer from their host species. These repeats were differentially abundant in each species, with the LTR retrotransposons and satellite repeats most responsible for variation in genome size. Each species had some repetitive elements that were more abundant and some less abundant than the other species examined, indicating that no single element or any unilateral growth or decrease trend in genome behavior was responsible for variation in genome size and composition. Genome sizes were determined by flow sorting, and the values of 615 Mb [S. asiatica (L.) Kuntze], 1330 Mb [S. gesnerioides (Willd.) Vatke], 1425 Mb [S. hermonthica (Delile) Benth.] and 2460 Mb (S. forbesii Benth.) suggest a ploidy series, a prediction supported by repetitive DNA sequence analysis. Phylogenetic analysis using six chloroplast loci indicated the ancestral relationships of the five most agriculturally important species, with the unexpected result that the one parasite of dicotyledonous plants (S. gesnerioides) was found to be more closely related to some of the grass parasites than many of the grass parasites are to each other.
Pipeline to upgrade the genome annotations

Directory of Open Access Journals (Sweden)

Lijin K. Gopi

2017-12-01

Full Text Available The current era of functional genomics is enriched with good quality draft genomes and annotations for many thousands of species and varieties, supported by advances in next generation sequencing (NGS) technologies. Around 25,250 genomes of organisms from various kingdoms have been submitted to the NCBI genome resource to date. Each of these genomes was annotated using the tools and knowledge-bases that were available at the time of annotation. These annotations can be expected to improve if the same genome is re-annotated using improved tools and knowledge-bases. Here we present a new genome annotation pipeline, strengthened with various tools and knowledge-bases, that is capable of producing better quality annotations from the consensus of the predictions of different tools. This resource also performs various additional annotations, apart from the usual gene predictions and functional annotations, which involve SSRs, novel repeats, paralogs, proteins with transmembrane helices, signal peptides etc. This new annotation resource is trained to evaluate and integrate all the predictions together to resolve the overlaps and ambiguities of the boundaries. One of the important highlights of this resource is its capability to predict the phylogenetic relations of the repeats using evolutionary trace analysis and orthologous gene clusters. We also present a case study of the pipeline, in which we upgrade the genome annotation of Nelumbo nucifera (sacred lotus). It is demonstrated that this resource is capable of producing an improved annotation for a better understanding of the biology of various organisms.
Evidence for magnocellular involvement in the identification of flanked letters

NARCIS (Netherlands)

Omtzigt, D.; Hendriks, A.W.C.J.; Kolk, H.H.J.

2002-01-01

Little is known about the role of the magno system in reading. One important hypothesis is that this system is involved in the allocation of attention. We reasoned that the presentation of a single letter automatically draws attention to this letter, whereas in the case of a flanked letter, an
BioNano genome mapping of individual chromosomes supports physical mapping and sequence assembly in complex plant genomes.

Science.gov (United States)

Staňková, Helena; Hastie, Alex R; Chan, Saki; Vrána, Jan; Tulpová, Zuzana; Kubaláková, Marie; Visendi, Paul; Hayashi, Satomi; Luo, Mingcheng; Batley, Jacqueline; Edwards, David; Doležel, Jaroslav; Šimková, Hana

2016-07-01

The assembly of a reference genome sequence of bread wheat is challenging due to its specific features such as the genome size of 17 Gbp, polyploid nature and prevalence of repetitive sequences. BAC-by-BAC sequencing based on chromosomal physical maps, adopted by the International Wheat Genome Sequencing Consortium as the key strategy, reduces problems caused by the genome complexity and polyploidy, but the repeat content still hampers the sequence assembly. Availability of a high-resolution genomic map to guide sequence scaffolding and validate physical map and sequence assemblies would be highly beneficial to obtaining an accurate and complete genome sequence. Here, we chose the short arm of chromosome 7D (7DS) as a model to demonstrate for the first time that it is possible to couple chromosome flow sorting with genome mapping in nanochannel arrays and create a de novo genome map of a wheat chromosome. We constructed a high-resolution chromosome map composed of 371 contigs with an N50 of 1.3 Mb. Long DNA molecules achieved by our approach facilitated chromosome-scale analysis of repetitive sequences and revealed a ~800-kb array of tandem repeats intractable to current DNA sequencing technologies. Anchoring 7DS sequence assemblies obtained by clone-by-clone sequencing to the 7DS genome map provided a valuable tool to improve the BAC-contig physical map and validate sequence assembly on a chromosome-arm scale. Our results indicate that creating genome maps for the whole wheat genome in a chromosome-by-chromosome manner is feasible and that they will be an affordable tool to support the production of improved pseudomolecules. © 2016 The Authors. Plant Biotechnology Journal published by Society for Experimental Biology and The Association of Applied Biologists and John Wiley & Sons Ltd.
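The N50 statistic quoted in the abstract above (371 contigs with an N50 of 1.3 Mb) can be illustrated with a minimal Python sketch. It assumes the common definition of N50 as the contig length at which the cumulative length of contigs, sorted from longest to shortest, first reaches half of the total; the function name `n50` and the example lengths are hypothetical and are not data from the study.

```python
def n50(lengths):
    """Return the N50 of a collection of contig lengths.

    Contigs are sorted from longest to shortest; N50 is the length of the
    contig at which the running total first reaches half of the summed length.
    """
    if not lengths:
        raise ValueError("need at least one contig length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical contig lengths in bp (illustrative only, not from the study).
example_contigs = [2_000_000, 1_300_000, 900_000, 400_000, 200_000]
print(n50(example_contigs))  # 1300000
```

Because N50 depends only on the sorted length distribution, it summarizes contiguity but says nothing about the correctness of the contigs themselves.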
Phylogenetic analysis of Gossypium L. using restriction fragment length polymorphism of repeated sequences.

Science.gov (United States)

Zhang, Meiping; Rong, Ying; Lee, Mi-Kyung; Zhang, Yang; Stelly, David M; Zhang, Hong-Bin

2015-10-01

Cotton is the world's leading textile fiber crop and is also grown as a bioenergy and food crop. Knowledge of the phylogeny of closely related species and the genome origin and evolution of polyploid species is significant for advanced genomics research and breeding. We have reconstructed the phylogeny of the cotton genus, Gossypium L., and deciphered the genome origin and evolution of its five polyploid species by restriction fragment analysis of repeated sequences. Nuclear DNA of 84 accessions representing 35 species and all eight genomes of the genus were analyzed. The phylogenetic tree of the genus was reconstructed using the parsimony method on 1033 polymorphic repeated sequence restriction fragments. The genome origin of its polyploids was determined by calculating the diploid-polyploid restriction fragment correspondence (RFC). The tree is consistent with the morphological classification, genome designation and geographic distribution of the species at subgenus, section and subsection levels. Gossypium lobatum (D7) was unambiguously shown to have the highest RFC with the D-subgenomes of all five polyploids of the genus, while the common ancestor of Gossypium herbaceum (A1) and Gossypium arboreum (A2) likely contributed to the A-subgenomes of the polyploids. These results provide a comprehensive phylogenetic tree of the cotton genus and new insights into the genome origin and evolution of its polyploid species. The results also further demonstrate a simple, rapid and inexpensive method suitable for phylogenetic analysis of closely related species, especially congeneric species, and the inference of genome origin of polyploids that constitute over 70 % of flowering plants.
Research for genetic instability of human genome

International Nuclear Information System (INIS)

Hori, T.; Takahashi, E.; Tsuji, H.; Yamauchi, M.; Murata, M.

1992-01-01

In the present review paper, the potential relevance of chromosomal fragile sites to carcinogenesis and mutagenesis is discussed based on our own and others' studies. Recent evidence indicates that fragile sites may act as predisposition factors involved in chromosomal instability of the human genome and that the sites may be preferential targets for various DNA damaging agents including ionizing radiation. It is also demonstrated that some critical genomic rearrangements at the fragile sites may contribute towards oncogenesis and that individuals carrying a heritable form of fragile site may be at risk. Although the clinical significance of autosomal fragile sites has been a matter of discussion, a fragile site of the X chromosome is known to be associated with an X-linked genetic disease called fragile X syndrome. Molecular events leading to the fragile X syndrome have recently been elucidated. The fragile X genotype can be characterized by an increased amount of p(CCG)n repeat DNA sequence in the FMR-1 gene, and the repeated sequences are shown to be unstable in both meiosis and mitosis. These repeats might exhibit a higher mutation rate than is generally seen in the human genome. Further studies on the fragile sites in molecular biology and radiation biology will yield data relevant to the molecular mechanisms of genetic instability of the human genome as well as to better assessment of the genetic effects of ionizing radiation. (author)

Vasopressin-dependent flank marking in golden hamsters is suppressed by drugs used in the treatment of obsessive-compulsive disorder

Directory of Open Access Journals (Sweden)

Messenger Tara

2001-08-01

Full Text Available Background: Alterations in arginine vasopressin regulation and secretion have been proposed as one possible biochemical abnormality in patients with obsessive-compulsive disorder. In golden hamsters, arginine vasopressin microinjections into the anterior hypothalamus trigger robust grooming and flank marking, a stereotyped scent-marking behavior. The intensity and repetition of the behaviors induced by arginine vasopressin are somewhat reminiscent of obsessive-compulsive disorder in humans. The present experiments were carried out to test whether pharmacological agents used to alleviate obsessive-compulsive disorder could inhibit arginine vasopressin-induced flank marking and grooming. Results: Male golden hamsters were treated daily for two weeks with either vehicle, fluoxetine, clomipramine, or desipramine (an ineffective drug), before being tested for arginine vasopressin-induced flank marking and grooming. Flank marking was significantly inhibited in animals treated with fluoxetine or clomipramine but unaffected by treatment with desipramine. Grooming behavior was not affected by any treatment. Conclusion: These data suggest that arginine vasopressin-induced flank marking may serve as an animal model for screening drugs used in the control of obsessive-compulsive disorder.
PIPEMicroDB: microsatellite database and primer generation tool for pigeonpea genome.

Science.gov (United States)

Sarika; Arora, Vasu; Iquebal, M A; Rai, Anil; Kumar, Dinesh

2013-01-01

Molecular markers play a significant role in improving desirable crop characteristics, such as high yield and disease resistance, that benefit the crop in the long term. Pigeonpea (Cajanus cajan L.) is a legume recently sequenced by a global consortium led by ICRISAT (Hyderabad, India) and has been analysed for gene prediction, synteny maps, markers, etc. We present the PIgeonPEa Microsatellite DataBase (PIPEMicroDB) with an automated primer designing tool for the pigeonpea genome, based on chromosome-wise as well as location-wise search of primers. A total of 123 387 Short Tandem Repeats (STRs) were extracted from the publicly available pigeonpea genome using the MIcroSAtellite tool (MISA). The database is an online relational database based on a 'three-tier architecture' that catalogues microsatellite information in MySQL, with a user-friendly interface developed using PHP. Search for STRs may be customized by limiting their location on the chromosome as well as the number of markers in that range. This is a novel approach that has not been implemented in any existing marker database. The database has been further coupled with Primer3 for designing primers for selected markers, with left and right flanking regions of up to 500 bp. This will enable researchers to select markers of choice at a desired interval over the chromosome. Furthermore, one can use individual STRs of a targeted region of a chromosome to narrow down the location of a gene of interest or linked Quantitative Trait Loci (QTLs). Although it is an in silico approach, marker search based on the characteristics and location of STRs is expected to be beneficial for researchers. Database URL: http://cabindb.iasri.res.in/pigeonpea/
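The workflow described above (locating STRs and extracting left and right flanks of up to 500 bp as input for primer design) can be sketched in Python. This is a simplified illustration and not the algorithm used by MISA or Primer3: the function `find_ssrs`, the minimum repeat thresholds in `MIN_REPEATS` and the demo sequence are all assumptions made for the example, and only perfect repeats are reported.

```python
import re

# Minimum number of repeat units per motif length (assumed, illustrative values).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 5, 5: 5, 6: 5}


def find_ssrs(seq, min_repeats=MIN_REPEATS, flank=500):
    """Find perfect simple sequence repeats and return them with flanking sequence.

    Simplification: matches are found left to right per motif length, so
    repeats that directly abut or overlap another match can be missed.
    """
    seq = seq.upper()
    hits = []
    for k, n in sorted(min_repeats.items()):
        # One motif of length k followed by at least n - 1 further copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, n - 1))
        for m in pattern.finditer(seq):
            motif = m.group(1)
            # Skip motifs that are themselves repeats of a shorter unit (e.g. "AA").
            if any(motif == motif[:d] * (k // d) for d in range(1, k) if k % d == 0):
                continue
            start, end = m.span()
            hits.append({
                "motif": motif,
                "repeats": (end - start) // k,
                "start": start,  # 0-based, inclusive
                "end": end,      # 0-based, exclusive
                "left_flank": seq[max(0, start - flank):start],
                "right_flank": seq[end:end + flank],
            })
    return sorted(hits, key=lambda h: h["start"])


# Hypothetical demo sequence: an (AG)7 repeat between two short flanks.
demo = "ACGTTGCT" + "AG" * 7 + "TCCGATCG"
hits = find_ssrs(demo, flank=5)
for hit in hits:
    print(hit["motif"], hit["repeats"], hit["left_flank"], hit["right_flank"])
```

In a real pipeline the two flank strings would be passed to a primer design tool; here they are simply returned so that the coordinates can be checked by eye.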
Imaging modalities and therapy options in patients with acute flank pain

International Nuclear Information System (INIS)

Grosse, A.; Grosse, C.

2014-01-01

The objective of this article is the description of imaging techniques for the evaluation of patients with acute flank pain and suspicion of urolithiasis and the impact of these techniques in the therapy management of patients with calculi. (orig.) [de
Development of Chloroplast Genomic Resources in Chinese Yam (Dioscorea polystachya)

Directory of Open Access Journals (Sweden)

Junling Cao

2018-01-01

Full Text Available Chinese yam has been used both as a food and in traditional herbal medicine. Developing more effective genetic markers in this species is necessary to assess its genetic diversity and perform cultivar identification. In this study, new chloroplast genomic resources were developed using whole chloroplast genomes from six genotypes originating from different geographical locations. The Dioscorea polystachya chloroplast genome is a circular molecule consisting of two single-copy regions separated by a pair of inverted repeats. Comparative analyses of six D. polystachya chloroplast genomes revealed 141 single nucleotide polymorphisms (SNPs). Seventy simple sequence repeats (SSRs) were found in the six genotypes, including 24 polymorphic SSRs. Forty-three common indels and five small inversions were detected. Phylogenetic analysis based on the complete chloroplast genome provided the best resolution among the genotypes. Our evaluation of chloroplast genome resources among these genotypes led us to consider the complete chloroplast genome sequence of D. polystachya as a source of reliable and valuable molecular markers for revealing biogeographical structure and the extent of genetic variation in wild populations and for identifying different cultivars.
The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng

Directory of Open Access Journals (Sweden)

Jinhui Chen

2015-06-01

Full Text Available Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around ten species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be of great help for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.
The complete chloroplast genome sequence of the relict woody plant Metasequoia glyptostroboides Hu et Cheng.

Science.gov (United States)

Chen, Jinhui; Hao, Zhaodong; Xu, Haibin; Yang, Liming; Liu, Guangxin; Sheng, Yu; Zheng, Chen; Zheng, Weiwei; Cheng, Tielong; Shi, Jisen

2015-01-01

Metasequoia glyptostroboides Hu et Cheng is the only species in the genus Metasequoia Miki ex Hu et Cheng, which belongs to the Cupressaceae family. There were around 10 species in the Metasequoia genus, which were widely spread across the Northern Hemisphere during the Cretaceous of the Mesozoic and in the Cenozoic. M. glyptostroboides is the only remaining representative of this genus. Here, we report the complete chloroplast (cp) genome sequence and the cp genomic features of M. glyptostroboides. The M. glyptostroboides cp genome is 131,887 bp in length, with a total of 117 genes comprised of 82 protein-coding genes, 31 tRNA genes and four rRNA genes. In this genome, 11 forward repeats, nine palindromic repeats, and 15 tandem repeats were detected. A total of 188 perfect microsatellites were detected through simple sequence repeat (SSR) analysis and these were distributed unevenly within the cp genome. Comparison of the cp genome structure and gene order to those of several other land plants indicated that a copy of the inverted repeat (IR) region, which was found to be IR region A (IRA), was lost in the M. glyptostroboides cp genome. The five most divergent and five most conserved genes were determined and further phylogenetic analysis was performed among plant species, especially for related species in conifers. Finally, phylogenetic analysis demonstrated that M. glyptostroboides is a sister species to Cryptomeria japonica (L. F.) D. Don and to Taiwania cryptomerioides Hayata. The complete cp genome sequence information of M. glyptostroboides will be of great help for further investigations of this endemic relict woody plant and for in-depth understanding of the evolutionary history of the coniferous cp genomes, especially for the position of M. glyptostroboides in plant systematics and evolution.
Stability analysis of Western flank of Cumbre Vieja volcano (La Palma) using numerical modelling

Science.gov (United States)

Bru, Guadalupe; Gonzalez, Pablo J.; Fernandez-Merodo, Jose A.; Fernandez, Jose

2016-04-01

La Palma volcanic island is one of the youngest of the Canary archipelago and is a composite volcano formed by three overlapping volcanic centers. There is clear onshore and offshore evidence of past giant landslides that occurred during its evolution. Currently, the active Cumbre Vieja volcano is in an early development state (Carracedo et al., 2001). The study of flank instability processes aims to assess, among other hazards, catastrophic collapse and potential tsunami generation. Early studies of the potential instability of the western flank of Cumbre Vieja volcano focused on the use of sparse geodetic networks (Moss et al. 1999), surface geological mapping techniques (Day et al. 1999) and offshore bathymetry (Urgeles et al. 1999). Recently, a dense GNSS network and satellite radar interferometry results have indicated ground motion consistent with deep-seated creeping processes (Prieto et al. 2009, Gonzalez et al. 2010). In this work, we present an advanced geomechanical numerical model that captures the ongoing deformation processes at Cumbre Vieja. We choose the Finite Element Method (FEM), which is based on continuum mechanics and is the most widely used method for geotechnical applications. FEM can handle arbitrary geometry, heterogeneities, irregular boundaries and different constitutive models representative of the geotechnical units involved. Our main contribution is the introduction of an inverse approach to constrain the geomechanical parameters using satellite radar interferometry displacements. This is the first application of such an approach to a large volcano flank study. We suggest that the use of surface displacements and inverse methods to rigorously constrain the geomechanical model parameter space is a powerful tool to understand volcano flank instability. A particularly important result of the studied case is the estimation of the displaced rock volume, which is a parameter of critical importance for simulations of Cumbre Vieja tsunamigenic hazard
Genome-wide distribution and organization of microsatellites in plants: an insight into marker development in Brachypodium.

Directory of Open Access Journals (Sweden)

Humira Sonah

Full Text Available Plant genomes are complex and contain large amounts of repetitive DNA including microsatellites that are distributed across entire genomes. Whole genome sequences of several monocot and dicot plants that are available in the public domain provide an opportunity to study the origin, distribution and evolution of microsatellites, and also facilitate the development of new molecular markers. In the present investigation, a genome-wide analysis of microsatellite distribution in monocots (Brachypodium, sorghum and rice) and dicots (Arabidopsis, Medicago and Populus) was performed. A total of 797,863 simple sequence repeats (SSRs) were identified in the whole genome sequences of six plant species. Characterization of these SSRs revealed that mono-nucleotide repeats were the most abundant repeats, and that the frequency of repeats decreased with increase in motif length both in monocots and dicots. However, the frequency of SSRs was higher in dicots than in monocots both for nuclear and chloroplast genomes. Interestingly, GC-rich repeats were the dominant repeats only in monocots, with the majority of them being present in the coding region. These coding GC-rich repeats were found to be involved in different biological processes, predominantly binding activities. In addition, a set of 22,879 SSR markers that were validated by e-PCR were developed and mapped on different chromosomes in Brachypodium for the first time, with a frequency of 101 SSR markers per Mb. Experimental validation of 55 markers showed successful amplification of 80% SSR markers in 16 Brachypodium accessions. An online database 'BraMi' (Brachypodium microsatellite markers) of these genome-wide SSR markers was developed and made available in the public domain. The observed differential patterns of SSR marker distribution would be useful for studying microsatellite evolution in a monocot-dicot system. SSR markers developed in this study would be helpful for genomic studies in Brachypodium
R-loops: targets for nuclease cleavage and repeat instability.

Science.gov (United States)

Freudenreich, Catherine H

2018-01-11

R-loops form when transcribed RNA remains bound to its DNA template to form a stable RNA:DNA hybrid. Stable R-loops form when the RNA is purine-rich, and are further stabilized by DNA secondary structures on the non-template strand. Interestingly, many expandable and disease-causing repeat sequences form stable R-loops, and R-loops can contribute to repeat instability. Repeat expansions are responsible for multiple neurodegenerative diseases, including Huntington's disease, myotonic dystrophy, and several types of ataxias. Recently, it was found that R-loops at an expanded CAG/CTG repeat tract cause DNA breaks as well as repeat instability (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Two factors were identified as causing R-loop-dependent breaks at CAG/CTG tracts: deamination of cytosines and the MutLγ (Mlh1-Mlh3) endonuclease, defining two new mechanisms for how R-loops can generate DNA breaks (Su and Freudenreich, Proc Natl Acad Sci USA 114, E8392-E8401, 2017). Following R-loop-dependent nicking, base excision repair resulted in repeat instability. These results have implications for human repeat expansion diseases and provide a paradigm for how RNA:DNA hybrids can cause genome instability at structure-forming DNA sequences. This perspective summarizes mechanisms of R-loop-induced fragility at G-rich repeats and new links between DNA breaks and repeat instability.
Transposable elements and circular DNAs

KAUST Repository

Mourier, Tobias

2016-09-26

Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.
Transposable elements and circular DNAs

KAUST Repository

Mourier, Tobias

2016-01-01

Circular DNAs are extra-chromosomal fragments that become circularized by genomic recombination events. We have recently shown that yeast LTR elements generate circular DNAs through recombination events between their flanking long terminal repeats (LTRs). Similarly, circular DNAs can be generated by recombination between LTRs residing at different genomic loci, in which case the circular DNA will contain the intervening sequence. In yeast, this can result in gene copy number variations when circles contain genes and origins of replication. Here, I speculate on the potential and implications of circular DNAs generated through recombination between human transposable elements.
Targeted Genome Regulation and Editing in Plants

KAUST Repository

Piatek, Agnieszka

2016-03-01

The ability to precisely regulate gene expression patterns and to modify genome sequence in a site-specific manner holds much promise in determining gene function and linking genotype to phenotype. DNA-binding modules have been harnessed to generate customizable and programmable chimeric proteins capable of binding to site-specific DNA sequences and regulating the genome and epigenome. Modular DNA-binding domains from zinc fingers (ZFs) and transcriptional activator-like effectors (TALEs) are amenable to engineering to bind any DNA target sequence of interest. Deciphering the code of TALE repeat binding to DNA has helped to engineer customizable TALE proteins capable of binding to any sequence of interest. Therefore TALE repeats provide a rich resource for bioengineering applications. However, the TALE system is limited by the requirement to re-engineer one or two proteins for each new target sequence. Recently, the clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system has been used as a versatile genome editing tool. This machinery has also been repurposed for targeted transcriptional regulation. Owing to its facile engineering, simplicity and precision, the CRISPR/Cas9 system is poised to revolutionize functional genomics studies across diverse eukaryotic species. In this dissertation I employed transcription activator-like effectors and CRISPR/Cas9 systems for targeted genome regulation and editing and my achievements include: 1) I deciphered and extended the DNA-binding code of Ralstonia TAL effectors, providing new opportunities for bioengineering of customizable proteins; 2) I repurposed the CRISPR/Cas9 system for site-specific regulation of genes in the plant genome; 3) I harnessed the power of the CRISPR/Cas9 gene editing tool to study the function of the serine/arginine-rich (SR) proteins.
Efficient Algorithms for Analyzing Segmental Duplications, Deletions, and Inversions in Genomes

Science.gov (United States)

Kahn, Crystal L.; Mozes, Shay; Raphael, Benjamin J.

Segmental duplications, or low-copy repeats, are common in mammalian genomes. In the human genome, most segmental duplications are mosaics consisting of pieces of multiple other segmental duplications. This complex genomic organization complicates analysis of the evolutionary history of these sequences. Earlier, we introduced a genomic distance, called duplication distance, that computes the most parsimonious way to build a target string by repeatedly copying substrings of a source string. We also showed how to use this distance to describe the formation of segmental duplications according to a two-step model that has been proposed to explain human segmental duplications. Here we describe polynomial-time exact algorithms for several extensions of duplication distance including models that allow certain types of substring deletions and inversions. These extensions will permit more biologically realistic analyses of segmental duplications in genomes.
Transformation of natural genetic variation into Haemophilus influenzae genomes.

Directory of Open Access Journals (Sweden)

Joshua Chang Mell

2011-07-01

Full Text Available Many bacteria are able to efficiently bind and take up double-stranded DNA fragments, and the resulting natural transformation shapes bacterial genomes, transmits antibiotic resistance, and allows escape from immune surveillance. The genomes of many competent pathogens show evidence of extensive historical recombination between lineages, but the actual recombination events have not been well characterized. We used DNA from a clinical isolate of Haemophilus influenzae to transform competent cells of a laboratory strain. To identify which of the ~40,000 polymorphic differences had recombined into the genomes of four transformed clones, their genomes and their donor and recipient parents were deep sequenced to high coverage. Each clone was found to contain ~1000 donor polymorphisms in 3-6 contiguous runs (8.1±4.5 kb in length) that collectively comprised ~1-3% of each transformed chromosome. Seven donor-specific insertions and deletions were also acquired as parts of larger donor segments, but the presence of other structural variation flanking 12 of 32 recombination breakpoints suggested that these often disrupt the progress of recombination events. This is the first genome-wide analysis of chromosomes directly transformed with DNA from a divergent genotype, connecting experimental studies of transformation with the high levels of natural genetic variation found in isolates of the same species.
Forests of the tropical eastern Andean flank during the middle Pleistocene

NARCIS (Netherlands)

Cárdenas, M.L.; Gosling, W.D.; Pennington, R.T.; Poole, I.; Sherlock, S.C.; Mothes, P.

2014-01-01

Inter-bedded volcanic and organic sediments from Erazo (Ecuador) indicate the presence of four different forest assemblages on the eastern Andean flank during the middle Pleistocene. Radiometric dates (40Ar-39Ar) obtained from the volcanic ash indicate that deposition occurred between 620,000 and
Survey of protein–DNA interactions in Aspergillus oryzae on a genomic scale

Science.gov (United States)

Wang, Chao; Lv, Yangyong; Wang, Bin; Yin, Chao; Lin, Ying; Pan, Li

2015-01-01

The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences. PMID:25883143
The tiger genome and comparative analysis with lion and snow leopard genomes.

Science.gov (United States)

Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-Uk; Luo, Shu-Jin; Johnson, Warren E; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A; Marker, Laurie; Harper, Cindy; Miller, Susan M; Jacobs, Wilhelm; Bertola, Laura D; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O'Brien, Stephen J; Wang, Jun; Bhak, Jong

2013-01-01

Tigers and their close relatives (Panthera) are some of the world's most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats' hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species.
The tiger genome and comparative analysis with lion and snow leopard genomes

Science.gov (United States)

Cho, Yun Sung; Hu, Li; Hou, Haolong; Lee, Hang; Xu, Jiaohui; Kwon, Soowhan; Oh, Sukhun; Kim, Hak-Min; Jho, Sungwoong; Kim, Sangsoo; Shin, Young-Ah; Kim, Byung Chul; Kim, Hyunmin; Kim, Chang-uk; Luo, Shu-Jin; Johnson, Warren E.; Koepfli, Klaus-Peter; Schmidt-Küntzel, Anne; Turner, Jason A.; Marker, Laurie; Harper, Cindy; Miller, Susan M.; Jacobs, Wilhelm; Bertola, Laura D.; Kim, Tae Hyung; Lee, Sunghoon; Zhou, Qian; Jung, Hyun-Ju; Xu, Xiao; Gadhvi, Priyvrat; Xu, Pengwei; Xiong, Yingqi; Luo, Yadan; Pan, Shengkai; Gou, Caiyun; Chu, Xiuhui; Zhang, Jilin; Liu, Sanyang; He, Jing; Chen, Ying; Yang, Linfeng; Yang, Yulan; He, Jiaju; Liu, Sha; Wang, Junyi; Kim, Chul Hong; Kwak, Hwanjong; Kim, Jong-Soo; Hwang, Seungwoo; Ko, Junsu; Kim, Chang-Bae; Kim, Sangtae; Bayarlkhagva, Damdin; Paek, Woon Kee; Kim, Seong-Jin; O’Brien, Stephen J.; Wang, Jun; Bhak, Jong

2013-01-01

Tigers and their close relatives (Panthera) are some of the world’s most endangered species. Here we report the de novo assembly of an Amur tiger whole-genome sequence as well as the genomic sequences of a white Bengal tiger, African lion, white African lion and snow leopard. Through comparative genetic analyses of these genomes, we find genetic signatures that may reflect molecular adaptations consistent with the big cats’ hypercarnivorous diet and muscle strength. We report a snow leopard-specific genetic determinant in EGLN1 (Met39>Lys39), which is likely to be associated with adaptation to high altitude. We also detect a TYR260G>A mutation likely responsible for the white lion coat colour. Tiger and cat genomes show similar repeat composition and an appreciably conserved synteny. Genomic data from the five big cats provide an invaluable resource for resolving easily identifiable phenotypes evident in very close, but distinct, species. PMID:24045858
CRISPR-Cas: Revolutionising genome engineering

African Journals Online (AJOL)

Within the repeating As, Cs, Gs and Ts of the human genome is the .... Medicine, South African Medical Research Council Extramural Unit for Stem Cell Research and Therapy, ... Second, should we allow embryonic/germline engineering, or.
The whole chloroplast genome of wild rice (Oryza australiensis).

Science.gov (United States)

Wu, Zhiqiang; Ge, Song

2016-01-01

The whole chloroplast genome of wild rice (Oryza australiensis) is characterized in this study. The genome size is 135,224 bp, exhibiting a typical circular structure including a pair of 25,776 bp inverted repeats (IRa,b) separated by a large single-copy region (LSC) of 82,212 bp and a small single-copy region (SSC) of 12,470 bp. The overall GC content of the genome is 38.95%. 110 unique genes were annotated, including 76 protein-coding genes, 4 ribosomal RNA genes, and 30 tRNA genes. Among these, 18 are duplicated in the inverted repeat regions, 13 genes contain one intron, and 2 genes (rps12 and ycf3) have two introns.

Complete Chloroplast Genome Sequence of Aquilaria sinensis (Lour.) Gilg and Evolution Analysis within the Malvales Order.

Science.gov (United States)

Wang, Ying; Zhan, Di-Feng; Jia, Xian; Mei, Wen-Li; Dai, Hao-Fu; Chen, Xiong-Ting; Peng, Shi-Qing

2016-01-01

Aquilaria sinensis (Lour.) Gilg is an important medicinal woody plant producing agarwood, which is widely used in traditional Chinese medicine. High-throughput sequencing of chloroplast (cp) genomes has improved the understanding of evolutionary relationships within plant families. In this study, we determined the complete cp genome sequence of A. sinensis. The size of the A. sinensis cp genome was 159,565 bp. This genome included a large single-copy region of 87,482 bp, a small single-copy region of 19,857 bp, and a pair of inverted repeats (IRa and IRb) of 26,113 bp each. The GC content of the genome was 37.11%. The A. sinensis cp genome encoded 113 functional genes, including 82 protein-coding genes, 27 tRNA genes, and 4 rRNA genes. Seven of the protein-coding genes and 11 of the RNA genes were duplicated. A total of 45 polymorphic simple-sequence repeat loci and 60 pairs of large repeats were identified. Most simple-sequence repeats were located in the noncoding sections of the large single-copy/small single-copy region and exhibited high A/T content. Moreover, 33 pairs of large repeat sequences were located in the protein-coding genes, whereas 27 pairs were located in the intergenic regions. Codon usage in the A. sinensis cp genome was biased toward codons ending in A/T. The distribution of codon usage in the A. sinensis cp genome was most similar to that in the Gonystylus bancanus cp genome. Comparative results of 82 protein-coding genes from 29 species of cp genomes demonstrated that A. sinensis was a sister species to G. bancanus within the Malvales order. The A. sinensis cp genome presented the highest sequence similarity, of >90%, with the G. bancanus cp genome when compared using the CGView Comparison Tool. This finding strongly supports the placement of A. sinensis as a sister to G. bancanus within the Malvales order. The complete A. sinensis cp genome information will be highly beneficial for further studies on this traditional medicinal
Outline of a genome navigation system based on the properties of GA-sequences and their flanks.

Directory of Open Access Journals (Sweden)

Guenter Albrecht-Buehler

Full Text Available Introducing a new method to visualize large stretches of genomic DNA (see Appendix S1), the article reports that most GA-sequences [1] shared chains of tetra-GA-motifs and contained upstream poly(A)-segments. Although not integral parts of them, Alu-elements were found immediately upstream of all human and chimpanzee GA-sequences with an upstream poly(A)-segment. The article hypothesizes that genome navigation uses these properties of GA-sequences in the following way. (1) Poly(A) binding proteins interact with the upstream poly(A)-segments and arrange adjacent GA-sequences side-by-side ('GA-ribbon'), while folding the intervening DNA sequences between them into loops ('associated DNA-loops'). (2) Genome navigation uses the GA-ribbon as a search path for specific target genes that is up to 730-fold shorter than the full-length chromosome. (3) As to the specificity of the search, each molecule of a target protein is assumed to catalyze the formation of specific oligomers from a set of transcription factors that recognize tetra-GA-motifs. Their specific combinations of tetra-GA motifs are assumed to be present in the particular GA-sequence whose associated loop contains the gene for the target protein. As long as the target protein is abundant in the cell it produces sufficient numbers of such oligomers which bind to their specific GA-sequences and, thereby, inhibit locally the transcription of the target protein in the associated loop. However, if the amount of target protein drops below a certain threshold, the resultant reduction of specific oligomers leaves the corresponding GA-sequence 'denuded'. In response, the associated DNA-loop releases its nucleosomes and allows transcription of the target protein to proceed. (4) The Alu-transcripts may help control the general background of protein synthesis proportional to the number of transcriptionally active associated loops, especially in stressed cells. (5) The model offers a new mechanism of co-regulation of
Non-radioactive detection of trinucleotide repeat size variability.

Science.gov (United States)

Tomé, Stéphanie; Nicole, Annie; Gomes-Pereira, Mario; Gourdon, Genevieve

2014-03-06

Many human diseases are associated with the abnormal expansion of unstable trinucleotide repeat sequences. The mechanisms of trinucleotide repeat size mutation have not been fully dissected, and their understanding must be grounded on the detailed analysis of repeat size distributions in human tissues and animal models. Small-pool PCR (SP-PCR) is a robust, highly sensitive and efficient PCR-based approach to assess the levels of repeat size variation, providing both quantitative and qualitative data. The method relies on the amplification of a very low number of DNA molecules, through successive dilution of a stock genomic DNA solution. Radioactive Southern blot hybridization is sensitive enough to detect SP-PCR products derived from single template molecules, separated by agarose gel electrophoresis and transferred onto DNA membranes. We describe a variation of the detection method that uses digoxigenin-labelled locked nucleic acid probes. This protocol keeps the sensitivity of the original method, while eliminating the health risks associated with the manipulation of radiolabelled probes, and the burden associated with their regulation, manipulation and waste disposal.
Research for genetic instability of human genome

Energy Technology Data Exchange (ETDEWEB)

Hori, T.; Takahashi, E.; Tsuji, H.; Yamauchi, M. (National Inst. of Radiological Sciences, Chiba (Japan)); Murata, M.

1992-01-01

In the present review paper, the potential relevance of chromosomal fragile sites to carcinogenesis and mutagenesis is discussed based on our own and others' studies. Recent evidence indicates that fragile sites may act as predisposition factors involved in chromosomal instability of the human genome and that the sites may be preferential targets for various DNA damaging agents including ionizing radiation. It is also demonstrated that some critical genomic rearrangements at the fragile sites may contribute towards oncogenesis and that individuals carrying a heritable form of fragile site may be at risk. Although the clinical significance of autosomal fragile sites has been a matter of discussion, a fragile site of the X chromosome is known to be associated with an X-linked genetic disease called fragile X syndrome. Molecular events leading to the fragile X syndrome have recently been elucidated. The fragile X genotype can be characterized by an increased amount of p(CCG)n repeat DNA sequence in the FMR-1 gene and the repeated sequences are shown to be unstable in both meiosis and mitosis. These repeats might exhibit a higher mutation rate than is generally seen in the human genome. Further studies on the fragile sites in molecular biology and radiation biology will yield data relevant to the molecular mechanisms of genetic instability of the human genome as well as to better assessment of the genetic effects of ionizing radiation. (author).
Genomic Islands: an overview of current software tools and future improvements

Directory of Open Access Journals (Sweden)

Soares Siomar de Castro

2016-03-01

Full Text Available Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth’s biomass and new predictions point to the existence of 10^11 to 10^12 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI) are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implement new approaches, such as machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.
Genomic Islands: an overview of current software tools and future improvements.

Science.gov (United States)

Soares, Siomar de Castro; Oliveira, Letícia de Castro; Jaiswal, Arun Kumar; Azevedo, Vasco

2016-03-01

Microbes are highly diverse and widely distributed organisms. They account for ~60% of Earth's biomass and new predictions point to the existence of 10^11 to 10^12 species, which are constantly sharing genes through several different mechanisms. Genomic Islands (GI) are critical in this context, as they are large regions acquired through horizontal gene transfer. Also, they present common features like genomic signature deviation, transposase genes, flanking tRNAs and insertion sequences. GIs carry large numbers of genes related to specific lifestyle and are commonly classified in Pathogenicity, Resistance, Metabolic or Symbiotic Islands. With the advent of the next-generation sequencing technologies and the deluge of genomic data, many software tools have been developed that aim to tackle the problem of GI prediction and they are all based on the prediction of GI common features. However, there is still room for the development of new software tools that implement new approaches, such as machine learning and pangenomics based analyses. Finally, GIs will always hold a potential application in every newly invented genomic approach as they are directly responsible for much of the genomic plasticity of bacteria.
Draft genome of the American Eel (Anguilla rostrata).

Science.gov (United States)

Pavey, Scott A; Laporte, Martin; Normandeau, Eric; Gaudin, Jérémy; Letourneau, Louis; Boisvert, Sébastien; Corbeil, Jacques; Audet, Céline; Bernatchez, Louis

2017-07-01

Freshwater eels (Anguilla sp.) have large economic, cultural, ecological and aesthetic importance worldwide, but they suffered more than 90% decline in global stocks over the past few decades. Proper genetic resources, such as sequenced, assembled and annotated genomes, are essential to help plan sustainable recoveries by identifying physiological, biochemical and genetic mechanisms that caused the declines or that may lead to recoveries. Here, we present the first sequenced genome of the American eel. This genome contained 305 043 contigs (N50 = 7397) and 79 209 scaffolds (N50 = 86 641) for a total size of 1.41 Gb, which is in the middle of the range of previous estimations for this species. In addition, protein-coding regions, including introns and flanking regions, are very well represented in the genome, as 95.2% of the 458 core eukaryotic genes and 98.8% of the 248 ultra-conserved subset were represented in the assembly and a total of 26 564 genes were annotated for future functional genomics studies. We performed a candidate gene analysis to compare three genes among all three freshwater eel species and, congruent with the phylogenetic relationships, Japanese eel (A. japonica) exhibited the most divergence. Overall, the sequenced genome presented in this study is a crucial addition to the presently available genetic tools to help guide future conservation efforts of freshwater eels. © 2016 John Wiley & Sons Ltd.
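The abstract above quotes contig and scaffold N50 values without defining the statistic. Under the usual definition, N50 is the length L such that sequences of length at least L together cover at least half of the total assembly. A minimal Python sketch follows; the function name `n50` and the toy lengths are assumptions for illustration and are not data from the eel assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths."""
    total = sum(lengths)
    running = 0
    # Walk from the longest sequence down, accumulating covered bases.
    for length in sorted(lengths, reverse=True):
        running += length
        # Stop at the first length where at least half the total is covered.
        if 2 * running >= total:
            return length
    raise ValueError("no sequence lengths given")

# Toy contig lengths (hypothetical), total 54: 10 + 9 + 8 = 27 reaches half.
print(n50([2, 3, 4, 5, 6, 7, 8, 9, 10]))
```

Because the statistic depends only on the sorted lengths, a few very long scaffolds can raise N50 substantially even when most sequences are short, which is why contig and scaffold N50 are reported separately.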
Immediate Genetic and Epigenetic Changes in F1 Hybrids Parented by Species with Divergent Genomes in the Rice Genus (Oryza).

Directory of Open Access Journals (Sweden)

Ying Wu

Full Text Available Inter-specific hybridization occurs frequently in higher plants, and represents a driving force of evolution and speciation. Inter-specific hybridization often induces genetic and epigenetic instabilities in the resultant homoploid hybrids or allopolyploids, a phenomenon known as genome shock. Although genetic and epigenetic consequences of hybridizations between rice subspecies (e.g., japonica and indica) and closely related species sharing the same AA genome have been extensively investigated, those of inter-specific hybridizations between more remote species with different genomes in the rice genus, Oryza, remain largely unknown. We investigated the immediate chromosomal and molecular genetic/epigenetic instability of three triploid F1 hybrids produced by inter-specific crossing between species with divergent genomes of Oryza by genomic in situ hybridization (GISH) and molecular marker analysis. Transcriptional and transpositional activity of several transposable elements (TEs) and methylation stability of their flanking regions were also assessed. We made the following principal findings: (i) all three triploid hybrids are stable in both chromosome number and gross structure; (ii) stochastic changes in both DNA sequence and methylation occurred in individual plants of all three triploid hybrids, but in general methylation changes occurred at lower frequencies than genetic changes; (iii) alteration in DNA methylation occurred to a greater extent in genomic loci flanking potentially active TEs than in randomly sampled loci; (iv) transcriptional activation of several TEs commonly occurred in all three hybrids but transpositional events were detected in a genetic context-dependent manner. Artificially constructed inter-specific hybrids of remotely related species with divergent genomes in genus Oryza are chromosomally stable but show immediate and highly stochastic genetic and epigenetic instabilities at the molecular level. These novel hybrids might
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

Directory of Open Access Journals (Sweden)

Gong Xiao

2013-06-01

Full Text Available Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant.
First Insights into the Large Genome of Epimedium sagittatum (Sieb. et Zucc) Maxim, a Chinese Traditional Medicinal Plant

Science.gov (United States)

Liu, Di; Zeng, Shao-Hua; Chen, Jian-Jun; Zhang, Yan-Jun; Xiao, Gong; Zhu, Lin-Yao; Wang, Ying

2013-01-01

Epimedium sagittatum (Sieb. et Zucc) Maxim is a member of the Berberidaceae family of basal eudicot plants, widely distributed and used as a traditional medicinal plant in China for therapeutic effects on many diseases with a long history. Recent data shows that E. sagittatum has a relatively large genome, with a haploid genome size of ~4496 Mbp, divided into a small number of only 12 diploid chromosomes (2n = 2x = 12). However, little is known about Epimedium genome structure and composition. Here we present the analysis of 691 kb of high-quality genomic sequence derived from 672 randomly selected plasmid clones of E. sagittatum genomic DNA, representing ~0.0154% of the genome. The sampled sequences comprised at least 78.41% repetitive DNA elements and 2.51% confirmed annotated gene sequences, with a total GC% content of 39%. Retrotransposons represented the major class of transposable element (TE) repeats identified (65.37% of all TE repeats), particularly LTR (Long Terminal Repeat) retrotransposons (52.27% of all TE repeats). Chromosome analysis and Fluorescence in situ Hybridization of Gypsy-Ty3 retrotransposons were performed to survey the E. sagittatum genome at the cytological level. Our data provide the first insights into the composition and structure of the E. sagittatum genome, and will facilitate the functional genomic analysis of this valuable medicinal plant. PMID:23807511
NATO’s Northeastern Flank: Emerging Opportunities for Engagement

Science.gov (United States)

2017-01-01

escalation concerns. Engagement should also stress the importance of Polish support for and capabilities toward addressing NATO’s southern flank...in their response to the Ukraine crisis.8 Some countries are either cowed by the Russian threat or genuinely less concerned about it than might be...no small part due to Hungary’s dependence on Russian gas exports, which heat the homes of most Hungarians. It is also, however, due to political
Vertebrate Genome Evolution in the Light of Fish Cytogenomics and rDNAomics

Science.gov (United States)

Howell, W. Mike

2018-01-01

To understand the cytogenomic evolution of vertebrates, we must first unravel the complex genomes of fishes, which were the first vertebrates to evolve and were ancestors to all other vertebrates. We must not forget the immense time span during which the fish genomes had to evolve. Fish cytogenomics is endowed with unique features which offer irreplaceable insights into the evolution of the vertebrate genome. Due to the general DNA base compositional homogeneity of fish genomes, fish cytogenomics is largely based on mapping DNA repeats that still represent serious obstacles in genome sequencing and assembling, even in model species. Localization of repeats on chromosomes of hundreds of fish species and populations originating from diversified environments has revealed the biological importance of this genomic fraction. Ribosomal genes (rDNA) belong to the most informative repeats and in fish, they are subject to a more relaxed regulation than in higher vertebrates. This can result in formation of a literal ‘rDNAome’ consisting of more than 20,000 copies, a high proportion of which are employed in extra-coding functions. Because rDNA has high rates of transcription and recombination, it contributes to genome diversification and can form a reproductive barrier. Our overall knowledge of fish cytogenomics is growing rapidly thanks to a continuously increasing number of sequenced fish genomes and the use of novel sequencing methods that improve genome assembly. The recently revealed exceptional compositional heterogeneity in an ancient fish lineage (gars) sheds new light on the compositional genome evolution in vertebrates generally. We highlight the power of synergy of cytogenetics and genomics in fish cytogenomics and its potential to help understand the complexity of genome evolution in vertebrates, which is also linked to clinical applications and the chromosomal backgrounds of speciation. We also summarize the current knowledge on fish cytogenomics and outline its main future avenues. PMID
Human renin 5'-flanking DNA to nucleotide -2750.

Science.gov (United States)

Smith, D L; Jeyapalan, S; Lang, J A; Guo, X H; Sigmund, C D; Morris, B J

1995-01-01

Renin is one of the most important factors in blood pressure and electrolyte regulation in mammals and the renin locus has been implicated in hypertension. To assist studies of promoter control we therefore determined the 5'-flanking sequence of the human gene (REN) to residue -2750 relative to the transcription start site (+1). Sites of homology to consensus sequences for binding of trans-acting factors involved in transcriptional control of other genes were identified, and functionality for two of these (a CRE and a Pit-1 site) has so far been demonstrated.
The sequence and de novo assembly of the giant panda genome

Science.gov (United States)

Li, Ruiqiang; Fan, Wei; Tian, Geng; Zhu, Hongmei; He, Lin; Cai, Jing; Huang, Quanfei; Cai, Qingle; Li, Bo; Bai, Yinqi; Zhang, Zhihe; Zhang, Yaping; Wang, Wen; Li, Jun; Wei, Fuwen; Li, Heng; Jian, Min; Li, Jianwen; Zhang, Zhaolei; Nielsen, Rasmus; Li, Dawei; Gu, Wanjun; Yang, Zhentao; Xuan, Zhaoling; Ryder, Oliver A.; Leung, Frederick Chi-Ching; Zhou, Yan; Cao, Jianjun; Sun, Xiao; Fu, Yonggui; Fang, Xiaodong; Guo, Xiaosen; Wang, Bo; Hou, Rong; Shen, Fujun; Mu, Bo; Ni, Peixiang; Lin, Runmao; Qian, Wubin; Wang, Guodong; Yu, Chang; Nie, Wenhui; Wang, Jinhuan; Wu, Zhigang; Liang, Huiqing; Min, Jiumeng; Wu, Qi; Cheng, Shifeng; Ruan, Jue; Wang, Mingwei; Shi, Zhongbin; Wen, Ming; Liu, Binghang; Ren, Xiaoli; Zheng, Huisong; Dong, Dong; Cook, Kathleen; Shan, Gao; Zhang, Hao; Kosiol, Carolin; Xie, Xueying; Lu, Zuhong; Zheng, Hancheng; Li, Yingrui; Steiner, Cynthia C.; Lam, Tommy Tsan-Yuk; Lin, Siyuan; Zhang, Qinghui; Li, Guoqing; Tian, Jing; Gong, Timing; Liu, Hongde; Zhang, Dejin; Fang, Lin; Ye, Chen; Zhang, Juanbin; Hu, Wenbo; Xu, Anlong; Ren, Yuanyuan; Zhang, Guojie; Bruford, Michael W.; Li, Qibin; Ma, Lijia; Guo, Yiran; An, Na; Hu, Yujie; Zheng, Yang; Shi, Yongyong; Li, Zhiqiang; Liu, Qing; Chen, Yanling; Zhao, Jing; Qu, Ning; Zhao, Shancen; Tian, Feng; Wang, Xiaoling; Wang, Haiyin; Xu, Lizhi; Liu, Xiao; Vinar, Tomas; Wang, Yajun; Lam, Tak-Wah; Yiu, Siu-Ming; Liu, Shiping; Zhang, Hemin; Li, Desheng; Huang, Yan; Wang, Xia; Yang, Guohua; Jiang, Zhi; Wang, Junyi; Qin, Nan; Li, Li; Li, Jingxiang; Bolund, Lars; Kristiansen, Karsten; Wong, Gane Ka-Shu; Olson, Maynard; Zhang, Xiuqing; Li, Songgang; Yang, Huanming; Wang, Jian; Wang, Jun

2013-01-01

Using next-generation sequencing technology alone, we have successfully generated and assembled a draft sequence of the giant panda genome. The assembled contigs (2.25 gigabases (Gb)) cover approximately 94% of the whole genome, and the remaining gaps (0.05 Gb) seem to contain carnivore-specific repeats and tandem repeats. Comparisons with the dog and human showed that the panda genome has a lower divergence rate. The assessment of panda genes potentially underlying some of its unique traits indicated that its bamboo diet might be more dependent on its gut microbiome than its own genetic composition. We also identified more than 2.7 million heterozygous single nucleotide polymorphisms in the diploid genome. Our data and analyses provide a foundation for promoting mammalian genetic research, and demonstrate the feasibility for using next-generation sequencing technologies for accurate, cost-effective and rapid de novo assembly of large eukaryotic genomes. PMID:20010809
Focused seismicity triggered by flank instability on Kīlauea's Southwest Rift Zone

Science.gov (United States)

Judson, Josiah; Thelen, Weston A.; Greenfield, Tim; White, Robert S.

2018-03-01

Swarms of earthquakes at the head of the Southwest Rift Zone on Kīlauea Volcano, Hawaiʻi, reveal an interaction of normal and strike-slip faulting associated with movement of Kīlauea's south flank. A relocated subset of earthquakes between January 2012 and August 2014 is highly focused in space and time at depths that are coincident with the south caldera magma reservoir beneath the southern margin of Kīlauea Caldera. Newly calculated focal mechanisms are dominantly dextral shear with a north-south preferred fault orientation. Two earthquakes within this focused area of seismicity have normal faulting mechanisms, indicating two mechanisms of failure in very close proximity (tens of meters to 100 m). We suggest a model where opening along the Southwest Rift Zone caused by seaward motion of the south flank permits injection of magma and subsequent freezing of a plug, which then fails in a right-lateral strike-slip sense, consistent with the direction of movement of the south flank. The seismicity is concentrated in an area where a constriction occurs between a normal fault and the deeper magma transport system into the Southwest Rift Zone. Although in many ways the Southwest Rift Zone appears analogous to the more active East Rift Zone, the localization of the largest seismicity (>M2.5) within the swarms to a small volume necessitates a different model than has been proposed to explain the lineament outlined by earthquakes along the East Rift Zone.
Hierarchical modeling of genome-wide Short Tandem Repeat (STR) markers infers native American prehistory.

Science.gov (United States)

Lewis, Cecil M

2010-02-01

This study examines a genome-wide dataset of 678 Short Tandem Repeat loci characterized in 444 individuals representing 29 Native American populations as well as the Tundra Netsi and Yakut populations from Siberia. Using these data, the study tests four current hypotheses regarding the hierarchical distribution of neutral genetic variation in native South American populations: (1) the western region of South America harbors more variation than the eastern region of South America, (2) Central American and western South American populations cluster exclusively, (3) populations speaking the Chibchan-Paezan and Equatorial-Tucanoan language stock emerge as a group within an otherwise South American clade, (4) Chibchan-Paezan populations in Central America emerge together at the tips of the Chibchan-Paezan cluster. This study finds that hierarchical models with the best fit place Central American populations, and populations speaking the Chibchan-Paezan language stock, at a basal position or separated from the South American group, which is more consistent with a serial founder effect into South America than that previously described. Western (Andean) South America is found to harbor similar levels of variation as eastern (Equatorial-Tucanoan and Ge-Pano-Carib) South America, which is inconsistent with an initial west coast migration into South America. Moreover, in all relevant models, the estimates of genetic diversity within geographic regions suggest a major bottleneck or founder effect occurring within the North American subcontinent, before the peopling of Central and South America. 2009 Wiley-Liss, Inc.
Magnocellular involvement in flanked-letter identification relates to the allocation of attention

NARCIS (Netherlands)

Omtzigt, D.; Hendriks, A.W.C.J.

2004-01-01

To verify the hypothesis that the magnocellular system is important to flanked-letter identification [Neuropsychologia 40 (2002) 1881] because it subserves attention allocation, we conducted three letter-naming experiments in which we manipulated magnocellular involvement (colour vs. luminance
Varicella-zoster virus (VZV) origin of DNA replication oriS influences origin-dependent DNA replication and flanking gene transcription.

Science.gov (United States)

Khalil, Mohamed I; Sommer, Marvin H; Hay, John; Ruyechan, William T; Arvin, Ann M

2015-07-01

The VZV genome has two origins of DNA replication (oriS), each of which consists of an AT-rich sequence and three origin binding protein (OBP) sites called Box A, C and B. In these experiments, the mutation in the core sequence CGC of the Box A and C not only inhibited DNA replication but also inhibited both ORF62 and ORF63 expression in reporter gene assays. In contrast the Box B mutation did not influence DNA replication or flanking gene transcription. These results suggest that efficient DNA replication enhances ORF62 and ORF63 transcription. Recombinant viruses carrying these mutations in both sites and one with a deletion of the whole oriS were constructed. Surprisingly, the recombinant virus lacking both copies of oriS retained the capacity to replicate in melanoma and HELF cells suggesting that VZV has another origin of DNA replication. Copyright © 2015 Elsevier Inc. All rights reserved.
Identification and characterization of a silencer regulatory element in the 3'-flanking region of the murine CD46 gene.

Science.gov (United States)

Nomura, M; Tsujimura, A; Begum, N A; Matsumoto, M; Wabiko, H; Toyoshima, K; Seya, T

2000-01-01

The murine membrane cofactor protein (CD46) gene is expressed exclusively in testis, in contrast to human CD46, which is expressed ubiquitously. To elucidate the mechanism of differential CD46 gene expression among species, we cloned entire murine CD46 genomic DNA and possible regulatory regions were placed in the flanking region of the luciferase reporter gene. The reporter gene assay revealed a silencing activity not in the promoter, but in the 3'-flanking region of the gene and the silencer-like element was identified within a 0.2-kb region between 0.6 and 0.8 kb downstream of the stop codon. This silencer-like element was highly similar to that of the pig MHC class-I gene. The introduction of a mutation into this putative silencer element of murine CD46 resulted in an abrogation of the silencing effect. Electrophoretic mobility-shift assay indicated the presence of the binding molecule(s) for this silencer sequence in murine cell lines and tissues. A size difference of the protein-silencer-element complex was observed depending upon the solubilizers used for preparation of the nuclear extracts. A mutated silencer sequence failed to interact with the binding molecules. The level of the binding factor was lower in the testicular germ cells compared with other organs. Thus the silencer element and its binding factor may play a role in transcriptional regulation of murine CD46 gene expression. These results imply that the effects of the CD46 silencer element encompass the innate immune and reproductive systems, and in mice may determine the testicular germ-cell-dominant expression of CD46. PMID:11023821
Characterization of polymorphic SSRs among Prunus chloroplast genomes

Science.gov (United States)

An in silico mining process yielded 80, 75, and 78 microsatellites in the chloroplast genome of Prunus persica, P. kansuensis, and P. mume. A and T repeats were predominant in the three genomes, accounting for 67.8% on average and most of them were successful in primer design. For the 80 P. persica ...

Genomic imprinting of IGF2 in marsupials is methylation dependent

Directory of Open Access Journals (Sweden)

Imumorin Ikhide

2008-05-01

Full Text Available Abstract Background: Parent-specific methylation of specific CpG residues is critical to imprinting in eutherian mammals, but its importance to imprinting in marsupials and, thus, the evolutionary origins of the imprinting mechanism have been the subject of controversy. This has been particularly true for the imprinted Insulin-like Growth Factor II (IGF2), a key regulator of embryonic growth in vertebrates and a focal point of the selective forces leading to genomic imprinting. The presence of the essential imprinting effector, DNMT3L, in marsupial genomes and the demonstration of a differentially methylated region (DMR) in the retrotransposon-derived imprinted gene, PEG10, in tammar wallaby argue for a role for methylation in imprinting, but several studies have found no evidence of parent-specific methylation at other imprinted loci in marsupials. Results: We performed the most extensive search to date for allele-specific patterns of CpG methylation within CpG isochores or CpG enriched segments across a 22 kilobase region surrounding the IGF2 gene in the South American opossum Monodelphis domestica. We identified a previously unknown 5'-untranslated exon for opossum IGF2, which is flanked by sequences defining a putative neonatal promoter, a DMR and an active Matrix Attachment Region (MAR). Demethylation of this DMR in opossum neonatal fibroblasts results in aberrant biallelic expression of IGF2. Conclusion: The demonstration of a DMR and an active MAR in the 5' flank of opossum IGF2 mirrors the regulatory features of the 5' flank of Igf2 in mice. However, demethylation-induced activation of the maternal allele of IGF2 in opossum differs from the demethylation-induced repression of the paternal Igf2 allele in mice. While it can now be concluded that parent-specific DNA methylation is an epigenetic mark common to Marsupialia and Eutheria, the molecular mechanisms of transcriptional silencing at imprinted loci have clearly evolved along independent
Genome-wide analyses and functional classification of proline repeat-rich proteins: potential role of eIF5A in eukaryotic evolution.

Directory of Open Access Journals (Sweden)

Ajeet Mandal

Full Text Available The eukaryotic translation factor eIF5A has recently been reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P) does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro)- and/or PPG (Pro-Pro-Gly)-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids), only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.
Biophysical properties of regions flanking the bHLH-Zip motif in the p22 Max protein

International Nuclear Information System (INIS)

Pursglove, Sharon E.; Fladvad, Malin; Bellanda, Massimo; Moshref, Ahmad; Henriksson, Marie; Carey, Jannette; Sunnerhagen, Maria

2004-01-01

The Max protein is the central dimerization partner in the Myc-Max-Mad network of transcriptional regulators, and a founding structural member of the family of basic-helix-loop-helix (bHLH)-leucine zipper (Zip) proteins. Biologically important regions flanking its bHLH-Zip motif have been disordered or absent in crystal structures. The present study shows that these regions are resistant to proteolysis in both the presence and absence of DNA, and that Max dimers containing both flanking regions have significantly higher helix content as measured by circular dichroism than that predicted from the crystal structures. Nuclear magnetic resonance measurements in the absence of DNA also support the inferred structural order. Deletion of both flanking regions is required to achieve maximal DNA affinity as measured by EMSA. Thus, the previously observed functionalities of these Max regions in DNA binding, phosphorylation, and apoptosis are suggested to be linked to structural properties
Genomic breeding value estimation using nonparametric additive regression models

Directory of Open Access Journals (Sweden)

Solberg Trygve

2009-01-01

Full Text Available Abstract Genomic selection refers to the use of genomewide dense markers for breeding value estimation and subsequently for selection. The main challenge of genomic breeding value estimation is the estimation of many effects from a limited number of observations. Bayesian methods have been proposed to successfully cope with these challenges. As an alternative class of models, non- and semiparametric models were recently introduced. The present study investigated the ability of nonparametric additive regression models to predict genomic breeding values. The genotypes were modelled for each marker or pair of flanking markers (i.e. the predictors) separately. The nonparametric functions for the predictors were estimated simultaneously using additive model theory, applying a binomial kernel. The optimal degree of smoothing was determined by bootstrapping. A mutation-drift-balance simulation was carried out. The breeding values of the last generation (genotyped) were predicted using data from the next-to-last generation (genotyped and phenotyped). The results show moderate to high accuracies of the predicted breeding values. Determining a predictor-specific degree of smoothing increased the accuracy.
Recurrent DNA inversion rearrangements in the human genome

DEFF Research Database (Denmark)

Flores, Margarita; Morales, Lucía; Gonzaga-Jauregui, Claudia

2007-01-01

Several lines of evidence suggest that reiterated sequences in the human genome are targets for nonallelic homologous recombination (NAHR), which facilitates genomic rearrangements. We have used a PCR-based approach to identify breakpoint regions of rearranged structures in the human genome. In particular, we have identified intrachromosomal identical repeats that are located in reverse orientation, which may lead to chromosomal inversions. A bioinformatic workflow pathway to select appropriate regions for analysis was developed. Three such regions overlapping with known human genes, located... to human genomic variation is discussed.
New polymorphisms within the variable number tandem repeat (VNTR) 7 locus of Mycobacterium avium subsp. paratuberculosis.

Science.gov (United States)

Fawzy, Ahmad; Zschöck, Michael; Ewers, Christa; Eisenberg, Tobias

2016-06-01

Variable number tandem repeat (VNTR) analysis is a frequently employed typing method for Mycobacterium avium subsp. paratuberculosis (MAP) isolates. Based on whole genome sequencing in a previous study, allelic diversity at some VNTR loci seems to over- or under-estimate the actual phylogenetic variance among isolates. Interestingly, two closely related isolates on one farm showed polymorphism at the VNTR 7 locus, raising concerns about the misleading role that it might play in genotyping. We aimed to investigate the underlying basis of VNTR 7 polymorphism by analyzing sequence data for published genomes and field isolates of MAP and other M. avium complex (MAC) members. In contrast to MAP strains from cattle, strains from sheep displayed an "imperfect" repeat within VNTR 7, which was identical to respective allele types in other MAC genomes. Subspecies- and strain-specific single nucleotide polymorphisms (SNPs) and two novel (16 and 56 bp) repeats were detected. Given the combination of the three existing repeats, there are at least five different patterns for VNTR 7. The present findings highlight a higher polymorphism and probable instability of the VNTR 7 locus that needs to be considered and challenged in future studies. Until then, sequencing of this locus in future studies is important to correctly assign the underlying allele types. Copyright © 2016 Elsevier Ltd. All rights reserved.
In Silico Genome Comparison and Distribution Analysis of Simple Sequences Repeats in Cassava

Directory of Open Access Journals (Sweden)

Andrea Vásquez

2014-01-01

Full Text Available We conducted an SSR density analysis in different cassava genomic regions. The information obtained was useful to establish comparisons between cassava's SSR genomic distribution and those of poplar, flax, and Jatropha. In general, cassava has a low SSR density (~50 SSRs/Mbp) and a high proportion of pentanucleotides (24.2 SSRs/Mbp). It was found that coding sequences have 15.5 SSRs/Mbp, introns have 82.3 SSRs/Mbp, 5′ UTRs have 196.1 SSRs/Mbp, and 3′ UTRs have 50.5 SSRs/Mbp. Motif analysis of cassava's genomic SSRs showed that the most abundant motif was AT/AT, while in intron sequences and UTR regions it was AG/CT. In addition, in coding sequences the motif AAG/CTT was found to occur most frequently; in fact, it is the third most used codon in cassava. Sequences containing SSRs were classified according to their functional annotation in Gene Ontology categories. The SSRs identified here may be a valuable addition for genetic mapping and future studies in phylogenetic analyses and genomic evolution.
The complete chloroplast genome sequence of Pelargonium x hortorum: Organization and evolution of the largest and most highly rearranged chloroplast genome of land plants
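The density figures in the cassava record above (SSRs per Mbp) can be illustrated with a small sketch. It is not the authors' pipeline: the per-motif-length thresholds in `MIN_REPEATS` and the function names `find_ssrs` and `ssr_density` are assumptions chosen for illustration, and only perfect (uninterrupted) repeats are reported.

```python
import re

# Hypothetical thresholds: minimum number of repeat units per motif length (1-6 bp).
MIN_REPEATS = {1: 10, 2: 6, 3: 5, 4: 4, 5: 4, 6: 4}


def is_primitive(motif):
    """Return True if the motif is not itself a repetition of a shorter unit (e.g. 'ATAT')."""
    n = len(motif)
    return not any(n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n))


def find_ssrs(seq):
    """Return sorted (start, motif, repeat_count) tuples for perfect SSRs in seq."""
    seq = seq.upper()
    hits = []
    for size, min_rep in MIN_REPEATS.items():
        # One captured motif of `size` bases followed by at least min_rep - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_rep - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if is_primitive(motif):  # skip e.g. 'ATAT' runs already counted as 'AT'
                hits.append((match.start(), motif, len(match.group(0)) // size))
    return sorted(hits)


def ssr_density(seq):
    """Number of SSRs per megabase pair of sequence."""
    return len(find_ssrs(seq)) / (len(seq) / 1e6)


example = "GGC" + "AT" * 8 + "CGTTGCA" + "AAG" * 5 + "TTGAC"
print(find_ssrs(example))
```

Running the same functions separately on coding, intron, and UTR sequences would give region-specific densities of the kind the abstract compares.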

Energy Technology Data Exchange (ETDEWEB)

Chumley, Timothy W.; Palmer, Jeffrey D.; Mower, Jeffrey P.; Fourcade, H. Matthew; Calie, Patrick J.; Boore, Jeffrey L.; Jansen, Robert K.

2006-01-20

The chloroplast genome of Pelargonium x hortorum has been completely sequenced. It maps as a circular molecule of 217,942 bp, and is both the largest and most rearranged land plant chloroplast genome yet sequenced. It features two copies of a greatly expanded inverted repeat (IR) of 75,741 bp each, and consequently diminished single copy regions of 59,710 bp and 6,750 bp. It also contains two different associations of repeated elements that contribute about 10 percent to the overall size and account for the majority of repeats found in the genome. They represent hotspots for rearrangements and gene duplications and include a large number of pseudogenes. We propose simple models that account for the major rearrangements with a minimum of eight IR boundary changes and 12 inversions in addition to several insertions of duplicated sequence. The major processes at work (duplication, IR expansion, and inversion) have disrupted at least one and possibly two or three transcriptional operons, and the genes involved in these disruptions form the core of the two major repeat associations. Despite the vast increase in size and complexity of the genome, the gene content is similar to that of other angiosperms, with the exceptions of a large number of pseudogenes as part of the repeat associations, the recognition of two open reading frames (ORF56 and ORF42) in the trnA intron with similarities to previously identified mitochondrial products (ACRS and pvs-trnA), the loss of accD and trnT-GGU, and in particular, the lack of a recognizably functional rpoA. One or all of three similar open reading frames may possibly encode the latter, however.
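The region sizes quoted in this record can be checked against the reported genome length. The sketch below assumes the usual quadripartite layout (two IR copies plus two single-copy regions); the constant and function names are illustrative and do not come from the paper.

```python
# Region lengths in bp, as quoted in the abstract above.
IR_COPY = 75_741            # one copy of the inverted repeat
SINGLE_COPY_LARGE = 59_710  # larger single-copy region
SINGLE_COPY_SMALL = 6_750   # smaller single-copy region
REPORTED_TOTAL = 217_942    # reported length of the circular molecule


def quadripartite_total(ir, large_sc, small_sc):
    """Genome length assuming two IR copies plus the two single-copy regions."""
    return 2 * ir + large_sc + small_sc


total = quadripartite_total(IR_COPY, SINGLE_COPY_LARGE, SINGLE_COPY_SMALL)
ir_fraction = 2 * IR_COPY / total  # share of the genome occupied by the two IR copies

print(total == REPORTED_TOTAL, round(ir_fraction, 3))
```

Under that assumption the four regions sum exactly to the reported 217,942 bp, and the two IR copies together make up roughly 70 percent of the molecule.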
Complete mitochondrial genome of the larch hawk moth, Sphinx morio (Lepidoptera: Sphingidae).

Science.gov (United States)

Kim, Min Jee; Choi, Sei-Woong; Kim, Iksoo

2013-12-01

The larch hawk moth, Sphinx morio, belongs to the lepidopteran family Sphingidae, which has long been studied as a family of model insects in diverse fields. In this study, we describe the complete mitochondrial genome (mitogenome) sequence of the species in terms of general genomic features and characteristic short repetitive sequences found in the A + T-rich region. The 15,299-bp-long genome consisted of a typical set of genes (13 protein-coding genes, 2 rRNA genes, and 22 tRNA genes) and one major non-coding A + T-rich region, with the typical arrangement found in Lepidoptera. The 316-bp-long A + T-rich region located between srRNA and tRNA(Met) harbored the conserved sequence blocks that are typically found in lepidopteran insects. Additionally, the A + T-rich region of S. morio contained three characteristic repeat sequences that are rarely found in Lepidoptera: two identical 12-bp repeats, three identical 5-bp-long tandem repeats, and six nearly identical 5-6-bp-long repeat sequences.
The Complete Chloroplast Genome Sequences of Five Epimedium Species: Lights into Phylogenetic and Taxonomic Analyses

Science.gov (United States)

Zhang, Yanjun; Du, Liuwen; Liu, Ao; Chen, Jianjun; Wu, Li; Hu, Weiming; Zhang, Wei; Kim, Kyunghee; Lee, Sang-Choon; Yang, Tae-Jin; Wang, Ying

2016-01-01

Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants. PMID:27014326
The complete chloroplast genome sequences of five Epimedium species: lights into phylogenetic and taxonomic analyses

Directory of Open Access Journals (Sweden)

Yanjun Zhang

2016-03-01

Full Text Available Epimedium L. is a phylogenetically and economically important genus in the family Berberidaceae. We here sequenced the complete chloroplast (cp) genomes of four Epimedium species using Illumina sequencing technology via a combination of de novo and reference-guided assembly, which was also the first comprehensive cp genome analysis on Epimedium combining the cp genome sequence of E. koreanum previously reported. The five Epimedium cp genomes exhibited typical quadripartite and circular structure that was rather conserved in genomic structure and the synteny of gene order. However, these cp genomes presented obvious variations at the boundaries of the four regions because of the expansion and contraction of the inverted repeat (IR) region and the single-copy (SC) boundary regions. The trnQ-UUG duplication occurred in the five Epimedium cp genomes, which was not found in the other basal eudicotyledons. The rapidly evolving cp genome regions were detected among the five cp genomes, as well as the difference of simple sequence repeats (SSR) and repeat sequence were identified. Phylogenetic relationships among the five Epimedium species based on their cp genomes showed accordance with the updated system of the genus on the whole, but reminded that the evolutionary relationships and the divisions of the genus need further investigation applying more evidences. The availability of these cp genomes provided valuable genetic information for accurately identifying species, taxonomy and phylogenetic resolution and evolution of Epimedium, and assist in exploration and utilization of Epimedium plants.
Combined amplification and hybridization techniques for genome scanning in vegetatively propagated crops

International Nuclear Information System (INIS)

Kahl, G.; Ramser, J.; Terauchi, R.; Lopez-Peralta, C.; Asemota, H.N.; Weising, K.

1998-01-01

A combination of PCR- and hybridization-based genome scanning techniques and sequence comparisons between non-coding chloroplast DNA flanking tRNA genes has been employed to screen Dioscorea species for intra- and interspecific genetic diversity. This methodology detected extensive polymorphisms within Dioscorea bulbifera L., and revealed taxonomic and phylogenetic relationships among cultivated Guinea yam varieties and their potential wild progenitors. Finally, screening of yam germplasm grown in Jamaica permitted reliable discrimination between all major cultivars. Genome scanning by microsatellite-primed PCR (MP-PCR) and random amplified polymorphic DNA (RAPD) analysis in combination with the novel random amplified microsatellite polymorphisms (RAMPO) hybridization technique has shown high potential for the genetic analysis of yams, and holds promise for other vegetatively propagated orphan crops. (author)
Genome Sequences of Marine Shrimp Exopalaemon carinicauda Holthuis Provide Insights into Genome Size Evolution of Caridea.

Science.gov (United States)

Yuan, Jianbo; Gao, Yi; Zhang, Xiaojun; Wei, Jiankai; Liu, Chengzhang; Li, Fuhua; Xiang, Jianhai

2017-07-05

Crustacea, particularly Decapoda, contains many economically important species, such as shrimps and crabs. Crustaceans exhibit enormous (nearly 500-fold) variability in genome size. However, limited genome resources are available for investigating these species. Exopalaemon carinicauda Holthuis, an economically important caridean shrimp, is a potentially ideal experimental animal for research on crustaceans. In this study, we performed low-coverage sequencing and de novo assembly of the E. carinicauda genome. The assembly covers more than 95% of coding regions. E. carinicauda possesses a large, complex genome (5.73 Gb), about twice the size of those of many decapod shrimps. As such, comparative genomic analyses were applied to investigate factors affecting genome size evolution of decapods. However, clues associated with genome duplication were not identified, and few horizontally transferred sequences were detected. Ultimately, the burst of transposable elements, especially retrotransposons, was determined as the major factor influencing genome expansion. A total of 2 Gb of repeats were identified, and RTE-BovB, Jockey, Gypsy, and DIRS were the four major retrotransposons that significantly expanded. Retrotransposons of both recent (Jockey and Gypsy) and ancestral (DIRS) origin were responsible for the genome evolution. The E. carinicauda genome also exhibited potential for the genomic and experimental research of shrimps.
How genome complexity can explain the difficulty of aligning reads to genomes.

Science.gov (United States)

Phan, Vinhthuy; Gao, Shanshan; Tran, Quang; Vo, Nam S

2015-01-01

Although it is frequently observed that aligning short reads to genomes becomes harder if they contain complex repeat patterns, there has not been much effort to quantify the relationship between complexity of genomes and difficulty of short-read alignment. Existing measures of sequence complexity seem unsuitable for the understanding and quantification of this relationship. We investigated several measures of complexity and found that length-sensitive measures of complexity had the highest correlation to accuracy of alignment. In particular, the rate of distinct substrings of length k, where k is similar to the read length, correlated very highly to alignment performance in terms of precision and recall. We showed how to compute this measure efficiently in linear time, making it useful in practice to estimate quickly the difficulty of alignment for new genomes without having to align reads to them first. We showed how the length-sensitive measures could provide additional information for choosing aligners that would align consistently accurately on new genomes. We formally established a connection between genome complexity and the accuracy of short-read aligners. The relationship between genome complexity and alignment accuracy provides additional useful information for selecting suitable aligners for new genomes. Further, this work suggests that the complexity of genomes sometimes should be thought of in terms of specific computational problems, such as the alignment of short reads to genomes.
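The length-sensitive measure described in this record, the rate of distinct substrings of length k, can be sketched as follows. This is a simple set-based version for illustration, not the linear-time method the authors describe; the normalisation (distinct k-mers divided by the total number of k-mer positions) and the function name `distinct_kmer_rate` are assumptions.

```python
def distinct_kmer_rate(seq, k):
    """Fraction of the len(seq) - k + 1 windows of length k that are distinct.

    Values near 1.0 mean almost every k-mer is unique (few repeats at that scale);
    lower values mean many windows are repeated.
    """
    positions = len(seq) - k + 1
    if k <= 0 or positions <= 0:
        raise ValueError("k must be positive and no longer than the sequence")
    # Collect every window of length k; the set keeps one copy of each.
    kmers = {seq[i:i + k] for i in range(positions)}
    return len(kmers) / positions


repetitive = "ACGT" * 3   # tandem repeat: only 4 distinct 4-mers in 9 windows
unique = "ACGTTGCA"       # every 4-mer occurs once

print(distinct_kmer_rate(repetitive, 4), distinct_kmer_rate(unique, 4))
```

Choosing k close to the read length, as the abstract suggests, makes the score reflect how often a read of that size could map to more than one place.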
CTCF Binding Sites in the Herpes Simplex Virus 1 Genome Display Site-Specific CTCF Occupation, Protein Recruitment, and Insulator Function.

Science.gov (United States)

Washington, Shannan D; Musarrat, Farhana; Ertel, Monica K; Backes, Gregory L; Neumann, Donna M

2018-04-15

There are seven conserved CTCF binding domains in the herpes simplex virus 1 (HSV-1) genome. These binding sites individually flank the latency-associated transcript (LAT) and the immediate early (IE) gene regions, suggesting that CTCF insulators differentially control transcriptional domains in HSV-1 latency. In this work, we show that two CTCF binding motifs in HSV-1 display enhancer blocking in a cell-type-specific manner. We found that CTCF binding to the latent HSV-1 genome was LAT dependent and that the quantity of bound CTCF was site specific. Following reactivation, CTCF eviction was dynamic, suggesting that each CTCF site was independently regulated. We explored whether CTCF sites recruit the polycomb-repressive complex 2 (PRC2) to establish repressive domains through a CTCF-Suz12 interaction and found that Suz12 colocalized to the CTCF insulators flanking the ICP0 and ICP4 regions and, conversely, was removed at early times postreactivation. Collectively, these data support the idea that CTCF sites in HSV-1 are independently regulated and may contribute to lytic-latent HSV-1 control in a site-specific manner. IMPORTANCE The role of chromatin insulators in DNA viruses is an area of interest. It has been shown in several beta- and gammaherpesviruses that insulators likely control the lytic transcriptional profile through protein recruitment and through the formation of three-dimensional (3D) chromatin loops. The ability of insulators to regulate alphaherpesviruses has been understudied to date. The alphaherpesvirus HSV-1 has seven conserved insulator binding motifs that flank regions of the genome known to contribute to the establishment of latency. Our work presented here contributes to the understanding of how insulators control transcription of HSV-1. Copyright © 2018 American Society for Microbiology.
Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula

Directory of Open Access Journals (Sweden)

Navrátilová Alice

2007-11-01

Full Text Available Abstract Background Extraordinary size variation of higher plant nuclear genomes is in large part caused by differences in accumulation of repetitive DNA. This makes repetitive DNA of great interest for studying the molecular mechanisms shaping architecture and function of complex plant genomes. However, due to methodological constraints of conventional cloning and sequencing, a global description of repeat composition is available for only a very limited number of higher plants. In order to provide further data required for investigating evolutionary patterns of repeated DNA within and between species, we used a novel approach based on massive parallel sequencing which allowed a comprehensive repeat characterization in our model species, garden pea (Pisum sativum). Results Analysis of 33.3 Mb sequence data resulted in quantification and partial sequence reconstruction of major repeat families occurring in the pea genome with at least thousands of copies. Our results showed that the pea genome is dominated by LTR-retrotransposons, estimated at 140,000 copies/1C. Ty3/gypsy elements are less diverse and accumulated to higher copy numbers than Ty1/copia. This is in part due to a large population of Ogre-like retrotransposons which alone make up over 20% of the genome. In addition to numerous types of mobile elements, we have discovered a set of novel satellite repeats and two additional variants of telomeric sequences. Comparative genome analysis revealed that there are only a few repeat sequences conserved between pea and soybean genomes. On the other hand, all major families of pea mobile elements are well represented in M. truncatula. Conclusion We have demonstrated that even in a species with a relatively large genome like pea, where a single 454-sequencing run provided only 0.77% coverage, the generated sequences were sufficient to reconstruct and analyze major repeat families corresponding to a total of 35–48% of the genome. These data
Draft genome sequence of the sexually transmitted pathogen Trichomonas vaginalis

DEFF Research Database (Denmark)

Carlton, Jane M.; Hirt, Robert P.; Silva, Joana C.

2007-01-01

We describe the genome sequence of the protist Trichomonas vaginalis, a sexually transmitted human pathogen. Repeats and transposable elements comprise about two-thirds of the approximately 160-megabase genome, reflecting a recent massive expansion of genetic material. This expansion...... environment. The genome sequence predicts previously unknown functions for the hydrogenosome, which support a common evolutionary origin of this unusual organelle with mitochondria....
Complete sequencing of five araliaceae chloroplast genomes and the phylogenetic implications.

Directory of Open Access Journals (Sweden)

Rong Li

Full Text Available BACKGROUND: The ginseng family (Araliaceae) includes a number of economically important plant species. Previous phylogenetic studies circumscribed three major clades within the core ginseng plant family, yet the internal relationships of each major group have been poorly resolved, perhaps due to rapid radiation of these lineages. Recent studies have shown that phylogenomics based on chloroplast genomes provides a viable way to resolve complex relationships. METHODOLOGY/PRINCIPAL FINDINGS: We report the complete nucleotide sequences of five Araliaceae chloroplast genomes using next-generation sequencing technology. The five chloroplast genomes are 156,333-156,459 bp in length, including a pair of inverted repeats (25,551-26,108 bp) separated by the large single-copy (86,028-86,566 bp) and small single-copy (18,021-19,117 bp) regions. Each chloroplast genome contains the same 114 unique genes consisting of 30 transfer RNA genes, four ribosomal RNA genes, and 80 protein coding genes. Gene size, content, and order, AT content, and IR/SC boundary structure are similar among all Araliaceae chloroplast genomes. A total of 140 repeats were identified in the five chloroplast genomes, with palindromic repeats as the most common type. Phylogenomic analyses using parsimony, likelihood, and Bayesian inference based on the complete chloroplast genomes strongly supported the monophyly of the Asian Palmate group and the Aralia-Panax group. Furthermore, the relationships among the sampled taxa within the Asian Palmate group were well resolved. Twenty-six DNA markers with the percentage of variable sites higher than 5% were identified, which may be useful for phylogenetic studies of Araliaceae. CONCLUSION: The chloroplast genomes of Araliaceae are highly conserved in all aspects of genome features. The large-scale phylogenomic data based on the complete chloroplast DNA sequences are shown to be effective for the phylogenetic reconstruction of Araliaceae.
Identification of the centromeric repeat in the threespine stickleback fish (Gasterosteus aculeatus).

Science.gov (United States)

Cech, Jennifer N; Peichel, Catherine L

2015-12-01

Centromere sequences exist as gaps in many genome assemblies due to their repetitive nature. Here we take an unbiased approach utilizing centromere protein A (CENP-A) chromatin immunoprecipitation followed by high-throughput sequencing to identify the centromeric repeat sequence in the threespine stickleback fish (Gasterosteus aculeatus). A 186-bp, AT-rich repeat was validated as centromeric using both fluorescence in situ hybridization (FISH) and immunofluorescence combined with FISH (IF-FISH) on interphase nuclei and metaphase spreads. This repeat hybridizes strongly to the centromere on all chromosomes, with the exception of weak hybridization to the Y chromosome. Together, our work provides the first validated sequence information for the threespine stickleback centromere.
Changes in Bottom Water Physical Properties Above the Mid-Atlantic Ridge Flank in the Brazil Basin

Science.gov (United States)

Zhao, Jian; Thurnherr, Andreas M.

2018-01-01

Warming of abyssal waters in recent decades has been widely documented around the global ocean. Here repeat hydrographic data collected in 1997 and 2014 near a deep fracture zone canyon in the eastern Brazil Basin are used to quantify the long-term change. Significant changes are found in the Antarctic Bottom Water (AABW) within the canyon. The AABW in 2014 was warmer (0.08 ± 0.06°C), saltier (0.01 ± 0.005), and less dense (0.005 ± 0.004 kg m⁻³) than in 1997. In contrast, the change in the North Atlantic Deep Water has complicated spatial structure and is almost indistinguishable from zero at 95% confidence. The resulting divergence in vertical displacement of the isopycnals modifies the local density stratification. At its peak, the local squared buoyancy frequency (N²) near the canyon is reduced by about 20% from 1997 to 2014. Similar reduction is found in the basinwide averaged profiles over the Mid-Atlantic Ridge flank along 25°W in years 1989, 2005, and 2014. The observed changes in density stratification have important implications for internal tide generation and dissipation.

Biallelic MLH1 SNP cDNA expression or constitutional promoter methylation can hide genomic rearrangements causing Lynch syndrome.

Science.gov (United States)

Morak, Monika; Koehler, Udo; Schackert, Hans Konrad; Steinke, Verena; Royer-Pokora, Brigitte; Schulmann, Karsten; Kloor, Matthias; Höchter, Wilhelm; Weingart, Josef; Keiling, Cortina; Massdorf, Trisari; Holinski-Feder, Elke

2011-08-01

A positive family history, germline mutations in DNA mismatch repair genes, tumours with high microsatellite instability, and loss of mismatch repair protein expression are the hallmarks of hereditary non-polyposis colorectal cancer (Lynch syndrome). However, in ~10-15% of cases of suspected Lynch syndrome, no disease-causing mechanism can be detected. Oligo array analysis was performed to search for genomic imbalances in patients with suspected mutation-negative Lynch syndrome with MLH1 deficiency in their colorectal tumours. A deletion in the LRRFIP2 (leucine-rich repeat flightless-interacting protein 2) gene flanking the MLH1 gene was detected, which turned out to be a paracentric inversion on chromosome 3p22.2 creating two new stable fusion transcripts between MLH1 and LRRFIP2. A single-nucleotide polymorphism in MLH1 exon 8 was expressed from both alleles, initially pointing to appropriate MLH1 function at least in peripheral cells. In a second case, an inherited duplication of the MLH1 gene region resulted in constitutional MLH1 promoter methylation. Constitutional MLH1 promoter methylation may therefore in rare cases be a heritable disease mechanism and should not be overlooked in seemingly sporadic patients.
Dispersed repetitive sequences in eukaryotic genomes and their possible biological significance

International Nuclear Information System (INIS)

Georgiev, G.P.; Kramerov, D.A.; Ryskov, A.P.; Skryabin, K.G.; Lukanidin, E.M.

1983-01-01

This paper describes the properties of a novel mouse mdg-like element, the A2 sequence, which is the most abundant repetitive sequence. We also characterized a ubiquitous B2 sequence that represents, after B1, the dominant family among the short interspersed repeats of the mouse genome. The existence of some putative transposition intermediates was shown for repeats of both A and B types of the mouse genome. These are closed circular DNA of the A type and small polyadenylated B+ RNAs. The fundamental question that arises is whether these sequences are simply selfish DNA capable of transposition or whether they fulfill some useful biological functions within the genome. 66 references, 11 figures, 1 table
Simple sequence repeats in Neurospora crassa: distribution, polymorphism and evolutionary inference

Directory of Open Access Journals (Sweden)

Park Jongsun

2008-01-01

Full Text Available Abstract Background Simple sequence repeats (SSRs) have been successfully used for various genetic and evolutionary studies in eukaryotic systems. The eukaryotic model organism Neurospora crassa is an excellent system to study evolution and biological function of SSRs. Results We identified and characterized 2749 SSRs of 963 SSR types in the genome of N. crassa. The distribution of tri-nucleotide (nt) SSRs, the most common SSRs in N. crassa, was significantly biased in exons. We further characterized the distribution of 19 abundant SSR types (AST), which account for 71% of total SSRs in the N. crassa genome, using a Poisson log-linear model. We also characterized the size variation of SSRs among natural accessions using Polymorphic Index Content (PIC) and ANOVA analyses and found that there are genome-wide, chromosome-dependent and local-specific variations. Using polymorphic SSRs, we have built linkage maps from three line-cross populations. Conclusion Taking our computational, statistical and experimental data together, we conclude that (1) the distributions of the SSRs in the sequenced N. crassa genome differ systematically between chromosomes as well as between SSR types, (2) the size variation of tri-nt SSRs in exons might be an important mechanism in generating functional variation of proteins in N. crassa, (3) there are different levels of evolutionary forces in variation of amino acid repeats, and (4) SSRs are stable molecular markers for genetic studies in N. crassa.
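As a rough illustration of how perfect SSRs like those counted above can be located, the following minimal sketch scans a sequence with a back-referencing regular expression. The function name `find_ssrs` and the thresholds (motifs of 1 to 6 nt, at least 4 tandem copies) are assumptions chosen for the example; the abstract does not state the criteria the authors used.

```python
import re
from typing import List, Tuple


def find_ssrs(seq: str, max_unit: int = 6, min_copies: int = 4) -> List[Tuple[int, str, int]]:
    """Return (start, motif, copies) for each perfect tandem repeat in `seq`.

    Assumed (illustrative) definition: a motif of 1..max_unit nucleotides
    repeated at least min_copies times with no interruptions.
    """
    # The lazy group tries the shortest motif first; \1 demands exact copies.
    pattern = re.compile(r"([ACGT]{1,%d}?)\1{%d,}" % (max_unit, min_copies - 1))
    hits = []
    for m in pattern.finditer(seq.upper()):
        motif = m.group(1)
        copies = len(m.group(0)) // len(motif)
        hits.append((m.start(), motif, copies))
    return hits


# A short test sequence with five tandem copies of CAG starting at index 4.
print(find_ssrs("TTGACAGCAGCAGCAGCAGTTA"))
```

Grouping the returned motifs by length would give counts per SSR class (mono-, di-, tri-nucleotide and so on), which is the kind of tally the abstract summarizes.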
Flank wear analysing of high speed end milling for hardened steel D2 using Taguchi Method

Science.gov (United States)

Hazza Faizi Al-Hazza, Muataz; Ibrahim, Nur Asmawiyah bt; Adesta, Erry T. Y.; Khan, Ahsan Ali; Abdullah Sidek, Atiah Bt.

2017-03-01

One of the main challenges for any manufacturer is how to decrease the machining cost without affecting the final quality of the product. One of the new advanced machining processes in industry is the high speed hard end milling process, which merges three advanced machining processes: high speed milling, hard milling and dry milling. However, one of the most important challenges in this process is to control the flank wear rate. Therefore, the flank wear rate during machining should be analyzed in order to determine the best cutting levels that will not affect the final quality of the product. In this research the Taguchi method has been used to investigate the effect of cutting speed, feed rate and depth of cut and to determine the best levels to minimize the flank wear rate up to a total length of 0.3 mm, based on the ISO standard, to maintain the finishing requirements.
Elements in the transcriptional regulatory region flanking herpes simplex virus type 1 oriS stimulate origin function.

Science.gov (United States)

Wong, S W; Schaffer, P A

1991-05-01

Like other DNA-containing viruses, the three origins of herpes simplex virus type 1 (HSV-1) DNA replication are flanked by sequences containing transcriptional regulatory elements. In a transient plasmid replication assay, deletion of sequences comprising the transcriptional regulatory elements of ICP4 and ICP22/47, which flank oriS, resulted in a greater than 80-fold decrease in origin function compared with a plasmid, pOS-822, which retains these sequences. In an effort to identify specific cis-acting elements responsible for this effect, we conducted systematic deletion analysis of the flanking region with plasmid pOS-822 and tested the resulting mutant plasmids for origin function. Stimulation by cis-acting elements was shown to be both distance and orientation dependent, as changes in either parameter resulted in a decrease in oriS function. Additional evidence for the stimulatory effect of flanking sequences on origin function was demonstrated by replacement of these sequences with the cytomegalovirus immediate-early promoter, resulting in nearly wild-type levels of oriS function. In competition experiments, cotransfection of cells with the test plasmid, pOS-822, and increasing molar concentrations of a competitor plasmid which contained the ICP4 and ICP22/47 transcriptional regulatory regions but lacked core origin sequences resulted in a significant reduction in the replication efficiency of pOS-822, demonstrating that factors which bind specifically to the oriS-flanking sequences are likely involved as auxiliary proteins in oriS function. Together, these studies demonstrate that trans-acting factors and the sites to which they bind play a critical role in the efficiency of HSV-1 DNA replication from oriS in transient-replication assays.
Insights from the complete chloroplast genome into the evolution of Sesamum indicum L.

Directory of Open Access Journals (Sweden)

Haiyang Zhang

Full Text Available Sesame (Sesamum indicum L.) is one of the oldest oilseed crops. In order to investigate the evolutionary characters according to the Sesame Genome Project, apart from sequencing its nuclear genome, we sequenced the complete chloroplast genome of S. indicum cv. Yuzhi 11 (white seeded) using Illumina and 454 sequencing. The chloroplast genomes of S. indicum and 18 other higher plants were then compared. The chloroplast genome of cv. Yuzhi 11 contains 153,338 bp and a total of 114 unique genes (KC569603). The number of chloroplast genes in sesame is the same as that in Nicotiana tabacum, Vitis vinifera and Platanus occidentalis. The variation in the length of the large single-copy (LSC) regions and inverted repeats (IR) in sesame compared to 18 other higher plant species was the main contributor to size variation in the cp genome in these species. The 77 functional chloroplast genes, except for ycf1 and ycf2, were highly conserved. The deletion of the cp ycf1 gene sequence in cp genomes may be due either to its transfer to the nuclear genome, as has occurred in sesame, or direct deletion, as has occurred in Panax ginseng and Cucumis sativus. The sesame ycf2 gene is only 5,721 bp in length and has lost about 1,179 bp. Nucleotides 1-585 of ycf2 when queried in BLAST had hits in the sesame draft genome. Five repeats (R10, R12, R13, R14 and R17) were unique to the sesame chloroplast genome. We also found that IR contraction/expansion in the cp genome alters its rate of evolution. Chloroplast genes and repeats display the signature of convergent evolution in sesame and other species. These findings provide a foundation for further investigation of cp genome evolution in Sesamum and other higher plants.
The sequence of the Helicoverpa armigera single nucleocapsid nucleopolyhedrovirus genome

NARCIS (Netherlands)

Chen, X.; IJkel, W.F.J.; Tarchini, R.; Sun, X.; Sandbrink, H.; Wang, H.; Peters, S.; Zuidema, D.; Klein Lankhorst, R.; Vlak, J.M.; Hu, Z.

2001-01-01

The nucleotide sequence of the Helicoverpa armigera single-nucleocapsid nucleopolyhedrovirus (HaSNPV) DNA genome was determined and analysed. The circular genome encompasses 131 403 bp, has a G+C content of 39.1 mol% and contains five homologous regions with a unique pattern of repeats.
Twisting right to left: A…A mismatch in a CAG trinucleotide repeat overexpansion provokes left-handed Z-DNA conformation.

Directory of Open Access Journals (Sweden)

Noorain Khan

2015-04-01

Full Text Available Conformational polymorphism of DNA is a major causative factor behind several incurable trinucleotide repeat expansion disorders that arise from overexpansion of trinucleotide repeats located in coding/non-coding regions of specific genes. Hairpin DNA structures that are formed due to overexpansion of CAG repeat lead to Huntington's disorder and spinocerebellar ataxias. Nonetheless, DNA hairpin stem structure that generally embraces B-form with canonical base pairs is poorly understood in the context of periodic noncanonical A…A mismatch as found in CAG repeat overexpansion. Molecular dynamics simulations on DNA hairpin stems containing A…A mismatches in a CAG repeat overexpansion show that A…A dictates local Z-form irrespective of starting glycosyl conformation, in sharp contrast to canonical DNA duplex. Transition from B-to-Z is due to the mechanistic effect that originates from its pronounced nonisostericity with flanking canonical base pairs facilitated by base extrusion, backbone and/or base flipping. Based on these structural insights we envisage that such an unusual DNA structure of the CAG hairpin stem may have a role in disease pathogenesis. As this is the first study that delineates the influence of a single A…A mismatch in reversing DNA helicity, it would further have an impact on understanding DNA mismatch repair.
Complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera, and comparative analyses with other grass genomes

Science.gov (United States)

Saski, Christopher; Lee, Seung-Bum; Fjellheim, Siri; Guda, Chittibabu; Jansen, Robert K.; Luo, Hong; Tomkins, Jeffrey; Rognli, Odd Arne; Clarke, Jihong Liu

2009-01-01

Comparisons of complete chloroplast genome sequences of Hordeum vulgare, Sorghum bicolor and Agrostis stolonifera to six published grass chloroplast genomes reveal that gene content and order are similar but two microstructural changes have occurred. First, the expansion of the IR at the SSC/IRa boundary that duplicates a portion of the 5′ end of ndhH is restricted to the three genera of the subfamily Pooideae (Agrostis, Hordeum and Triticum). Second, a 6 bp deletion in ndhK is shared by Agrostis, Hordeum, Oryza and Triticum, and this event supports the sister relationship between the subfamilies Erhartoideae and Pooideae. Repeat analysis identified 19–37 direct and inverted repeats 30 bp or longer with a sequence identity of at least 90%. Seventeen of the 26 shared repeats are found in all the grass chloroplast genomes examined and are located in the same genes or intergenic spacer (IGS) regions. Examination of simple sequence repeats (SSRs) identified 16–21 potential polymorphic SSRs. Five IGS regions have 100% sequence identity among Zea mays, Saccharum officinarum and Sorghum bicolor, whereas no spacer regions were identical among Oryza sativa, Triticum aestivum, H. vulgare and A. stolonifera despite their close phylogenetic relationship. Alignment of EST sequences and DNA coding sequences identified six C–U conversions in both Sorghum bicolor and H. vulgare but only one in A. stolonifera. Phylogenetic trees based on DNA sequences of 61 protein-coding genes of 38 taxa using both maximum parsimony and likelihood methods provide moderate support for a sister relationship between the subfamilies Erhartoideae and Pooideae. PMID:17534593
Molecular cloning, genomic organization, developmental regulation, and a knock-out mutant of a novel leu-rich repeats-containing G protein-coupled receptor (DLGR-2) from Drosophila melanogaster

DEFF Research Database (Denmark)

Eriksen, Kathrine Krageskov; Hauser, Frank; Schiøtt, Morten

2000-01-01

After screening the Berkeley Drosophila Genome Project database with sequences from a recently characterized Leu-rich repeats-containing G protein-coupled receptor (LGR) from Drosophila (DLGR-1), we identified a second gene for a different LGR (DLGR-2) and cloned its cDNA. DLGR-2 is 1360 amino aci...... knock-out mutants, where the DLGR-2 gene is interrupted by a P element insertion, die around the time of hatching. This finding, together with the expression data, strongly suggests that DLGR-2 is exclusively involved in development....
Determination of allele frequencies in nine short tandem repeat loci ...

African Journals Online (AJOL)

SERVER

2008-04-17

Apr 17, 2008 ... out the human genome. These loci are a rich source of highly polymorphic markers that may be detected using the polymerase chain reaction (PCR). PCR is a mimic of the normal cellular process of replication of DNA molecules. Each STR is distinguished by the number of times a sequence is repeated, ...
Pancreatic Tail Cancer with Sole Manifestation of Left Flank Pain: A Very Rare Presentation

Directory of Open Access Journals (Sweden)

Hsing-Lin Lin

2008-06-01

Full Text Available Pancreatic cancer is sometimes called a “silent disease” because it often causes no symptoms in the early stage. The symptoms can be quite vague and various depending on the location of cancer in the pancreas. The anatomic site distribution is 78% in the head of the pancreas, 11% in the body, and 11% in the tail. Pancreatic cancer is rarely detected in the early stage, and it is very uncommon to diagnose pancreatic tail cancer during an emergency department visit. The manifestation of pancreatic tail cancer as left flank pain is very rare and has seldom been identified in the literature. We present a case of pancreatic tail cancer with the sole manifestation of dull left flank pain. Having negative findings on an ultrasound study initially, this female patient was misdiagnosed as having possible acute gastritis, urolithiasis or muscle strain after she received gastroendoscopy and colonofiberscopy. Her symptoms persisted for several months and she visited our emergency department due to an acute exacerbation of a persistent dull pain in the left flank area. Radiographic evaluation with computed tomography was performed, and pancreatic tail tumor with multiple metastases was found unexpectedly. We review the literature and discuss this rare presentation of pancreatic tail cancer.
BRAD, the genetics and genomics database for Brassica plants

Directory of Open Access Journals (Sweden)

Li Pingxia

2011-10-01

Full Text Available Abstract Background Brassica species include both vegetable and oilseed crops, which are very important to daily human life. The Brassica species also represent an excellent system for studying numerous aspects of plant biology, specifically for the analysis of genome evolution following polyploidy, so they are also very important for scientific research. Now that the genome of Brassica rapa has been assembled, it is time for deep mining of the genome data. Description BRAD, the Brassica database, is a web-based resource focusing on genome-scale genetic and genomic data for important Brassica crops. BRAD was built based on the first whole genome sequence and on further data analysis of the Brassica A genome species, Brassica rapa (Chiifu-401-42). It provides datasets, such as the complete genome sequence of B. rapa, which was de novo assembled from Illumina GA II short reads and from BAC clone sequences, predicted genes and associated annotations, non-coding RNAs, transposable elements (TE), B. rapa genes orthologous to those in A. thaliana, as well as genetic markers and linkage maps. BRAD offers useful searching and data mining tools, including search across annotation datasets, search for syntenic or non-syntenic orthologs, and search of the flanking regions of a given target, as well as BLAST and Gbrowse tools. BRAD allows users to enter almost any kind of information, such as a B. rapa or A. thaliana gene ID, physical position or genetic marker. Conclusion BRAD, a new database focusing on the genetics and genomics of Brassica plants, has been developed; it aims to help scientists and breeders use Brassica genome data fully and efficiently. BRAD will be continuously updated and can be accessed through http://brassicadb.org.
An unusual manifestation of acute appendicitis with left flank pain

Directory of Open Access Journals (Sweden)

Roland Talanow, MD, PhD

2008-08-01

Full Text Available The author presents a case with an unusual presentation of early appendicitis. The patient presented initially with left sided flank pain. Workup for nephrolithiasis, including non-contrast CT of the abdomen and pelvis was negative for renal stones or hydronephrosis. After discharge, the patient presented one week later in the ED with right lower quadrant pain. Contrast enhanced CT of the abdomen revealed perforated appendicitis.
PCR artifact in testing for homologous recombination in genomic editing in zebrafish.

Directory of Open Access Journals (Sweden)

Minho Won

Full Text Available We report a PCR-induced artifact in testing for homologous recombination in zebrafish. We attempted to replace the lnx2a gene with a donor cassette, mediated by a TALEN induced double stranded cut. The donor construct was flanked with homology arms of about 1 kb at the 5' and 3' ends. Injected embryos (G0 were raised and outcrossed to wild type fish. A fraction of the progeny appeared to have undergone the desired homologous recombination, as tested by PCR using primer pairs extending from genomic DNA outside the homology region to a site within the donor cassette. However, Southern blots revealed that no recombination had taken place. We conclude that recombination happened during PCR in vitro between the donor integrated elsewhere in the genome and the lnx2a locus. We conclude that PCR alone may be insufficient to verify homologous recombination in genome editing experiments in zebrafish.
Comparative genomic analysis of single-molecule sequencing and hybrid approaches for finishing the Clostridium autoethanogenum JA1-1 strain DSM 10061 genome

Energy Technology Data Exchange (ETDEWEB)

Brown, Steven D [ORNL; Nagaraju, Shilpa [LanzaTech; Utturkar, Sagar M [ORNL; De Tissera, Sashini [LanzaTech; Segovia, Simón [LanzaTech; Mitchell, Wayne [LanzaTech; Land, Miriam L [ORNL; Dassanayake, Asela [LanzaTech; Köpke, Michael [LanzaTech

2014-01-01

Background Clostridium autoethanogenum strain JA1-1 (DSM 10061) is an acetogen capable of fermenting CO, CO2 and H2 (e.g. from syngas or waste gases) into biofuel ethanol and commodity chemicals such as 2,3-butanediol. A draft genome sequence consisting of 100 contigs has been published. Results A closed, high-quality genome sequence for C. autoethanogenum DSM10061 was generated using only the latest single-molecule DNA sequencing technology and without the need for manual finishing. It is assigned to the most complex genome classification based upon genome features such as repeats, prophage, and nine copies of the rRNA gene operons. It has a low G + C content of 31.1%. Illumina, 454, and Illumina/454 hybrid assemblies were generated and then compared to the draft and PacBio assemblies using summary statistics, CGAL, QUAST and REAPR bioinformatics tools and comparative genomic approaches. Assemblies based upon shorter read DNA technologies were confounded by the large number of repeats and their size, which in the case of the rRNA gene operons were ~5 kb. CRISPR (Clustered Regularly Interspaced Short Palindromic Repeats) systems among biotechnologically relevant Clostridia were classified and related to plasmid content and prophages. Potential associations between plasmid content and CRISPR systems may have implications for historical industrial scale Acetone-Butanol-Ethanol (ABE) fermentation failures and future large scale bacterial fermentations. While C. autoethanogenum contains an active CRISPR system, no such system is present in the closely related Clostridium ljungdahlii DSM 13528. A common prophage inserted into the Arg-tRNA shared between the strains suggests a common ancestor. However, C. ljungdahlii contains several additional putative prophages and it has more than double the amount of prophage DNA compared to C. autoethanogenum. Other differences include important metabolic genes for central metabolism (as an additional hydrogenase and the absence of a
Meta-Analysis of DNA Tumor-Viral Integration Site Selection Indicates a Role for Repeats, Gene Expression and Epigenetics

Directory of Open Access Journals (Sweden)

Janet M. Doolittle-Hall

2015-11-01

Full Text Available Oncoviruses cause tremendous global cancer burden. For several DNA tumor viruses, human genome integration is consistently associated with cancer development. However, genomic features associated with tumor viral integration are poorly understood. We sought to define genomic determinants for 1897 loci prone to hosting human papillomavirus (HPV), hepatitis B virus (HBV) or Merkel cell polyomavirus (MCPyV). These were compared to HIV, whose enzyme-mediated integration is well understood. A comprehensive catalog of integration sites was constructed from the literature and experimentally-determined HPV integration sites. Features were scored in eight categories (genes, expression, open chromatin, histone modifications, methylation, protein binding, chromatin segmentation and repeats) and compared to random loci. Random forest models determined loci classification and feature selection. HPV and HBV integrants were not fragile site associated. MCPyV preferred integration near sensory perception genes. Unique signatures of integration-associated predictive genomic features were detected. Importantly, repeats, actively-transcribed regions and histone modifications were common tumor viral integration signatures.
The proviral genome of radiation leukemia virus: Molecular cloning, nucleotide sequence of its long terminal repeat and integration in lymphoma cell DNA

International Nuclear Information System (INIS)

Janowski, M.; Merregaert, J.; Boniver, J.; Maisin, J.R.

1985-01-01

The proviral genome of a thymotropic and leukemogenic C57BL/Ka mouse retrovirus, RadLV/VL₃(T+L+), was cloned as a biologically active PstI insert in the bacterial plasmid pBR322. Its restriction map was compared to those, already known, of two nonthymotropic and nonleukemogenic viruses of the same mouse strain, the ecotropic BL/Ka(B) and the xenotropic constituent of the radiation leukemia virus complex (RadLV). Differences were observed in the pol gene and in the env gene. Moreover, the nucleotide sequence of the RadLV/VL₃(T+L+) long terminal repeat revealed the existence of two copies of a 42 bp long sequence, separated by 11 nucleotides, of which BL/Ka(B) possesses only one copy.
Morphology and Doping Level of Electropolymerized Biselenophene-Flanked 3,4-Ethylenedioxythiophene Polymer: Effect of Solvents and Electrolytes

International Nuclear Information System (INIS)

Agrawal, Vikash; Shahjad; Bhardwaj, Dinesh; Bhargav, Ranoo; Sharma, Gauri Datt; Bhardwaj, Ramil Kumar; Patra, Asit; Chand, Suresh

2016-01-01

Highlights: • Biselenophene-flanked 3,4-ethylenedioxythiophene polymer films were obtained by electrochemical polymerization. • The supporting electrolyte has a significant effect on the doping level, whereas the electropolymerization solvent has a major effect on the morphology of the polymer films. • Optoelectronic properties and morphology of the electropolymerized films were studied. • Density functional theory (DFT) calculations were made for the optoelectronic properties. Abstract: Biselenophene-flanked 3,4-ethylenedioxythiophene (EDOT) based polymer films were obtained by electrochemical polymerization. The effects of polymerization conditions such as supporting electrolytes and solvents on the doping level, optical properties and morphology of the polymer films were systematically studied. Interestingly, we found that the choice of supporting electrolyte (TBAPF₆, TBABF₄ or TBAClO₄) has a significant effect on the doping level of the polymer films, whereas the electropolymerization solvent (acetonitrile or dichloromethane) has no such effect. The polymer films show reversible dedoping and doping behavior upon treatment with hydrazine hydrate and iodine, respectively. The biselenophene-flanked EDOT polymer shows a band gap of about 1.6 eV, which is comparable to that of poly(3,4-ethylenedioxythiophene) (PEDOT) and the parent polyselenophene, whereas fine-tuning of the HOMO and LUMO energy levels was found. In contrast, we observed that the electropolymerization solvent has a major effect on the morphology of the polymer films, while the supporting electrolyte has only very minor effects on the morphology. The surface morphologies of the polymer films were characterized by scanning electron microscopy (SEM) and atomic force microscopy (AFM). We also present an efficient synthesis of bisthiophene-flanked bridged EDOT (ETTE) and biselenophene-flanked bridged EDOT (ESeSeE), together with their electrochemical polymerization, characterization and thorough comparison.
Modeling of Principal Flank Wear: An Empirical Approach Combining the Effect of Tool, Environment and Workpiece Hardness

Science.gov (United States)

Mia, Mozammel; Al Bashir, Mahmood; Dhar, Nikhil Ranjan

2016-10-01

Hard turning is increasingly employed in machining to replace time-consuming conventional turning followed by grinding. Excessive tool wear in hard turning is one of the main hurdles to be overcome. Many researchers have developed tool wear models, but most were developed for a particular work-tool-environment combination. No aggregate model has been developed that can be used to predict the amount of principal flank wear for a specific machining time. An empirical model of principal flank wear (VB) has been developed for different workpiece hardnesses (HRC40, HRC48 and HRC56) while turning with coated carbide inserts of different configurations (SNMM and SNMG) under both dry and high pressure coolant conditions. Unlike other models, this model uses dummy variables along with the base empirical equation to capture the effect of any change in the input conditions on the response. The base empirical equation for principal flank wear is formulated by fitting the Exponential Associate Function to the experimental results. The coefficient of each dummy variable reflects the shift of the response from one set of machining conditions to another and is determined by simple linear regression. The independent cutting parameters (speed, feed rate, depth of cut) are kept constant while formulating and analyzing this model. The developed model is validated with different sets of machining responses in turning hardened medium carbon steel with coated carbide inserts. For any particular set, the model can be used to predict the amount of principal flank wear for a specific machining time. Since the predicted results agree well with the experimental data and the average percentage error is <10 %, this model can be used to predict the principal flank wear for the stated conditions.

Effect of Saw Palmetto Supplements on Androgen-Sensitive LNCaP Human Prostate Cancer Cell Number and Syrian Hamster Flank Organ Growth

Directory of Open Access Journals (Sweden)

Alexander B. Opoku-Acheampong

2016-01-01

Full Text Available Saw palmetto supplements (SPS) are commonly consumed by men with prostate cancer. We investigated whether SPS fatty acids and phytosterols concentrations determine their growth-inhibitory action in androgen-sensitive LNCaP cells and hamster flank organs. High long-chain fatty acids-low phytosterols (HLLP) SPS ≥ 750 nM with testosterone significantly increased and ≥500 nM with dihydrotestosterone significantly decreased LNCaP cell number. High long-chain fatty acids-high phytosterols (HLHP) SPS ≥ 500 nM with dihydrotestosterone and high medium-chain fatty acids-low phytosterols (HMLP) SPS ≥ 750 nM or with androgens significantly decreased LNCaP cell number (n = 3; p < 0.05). Five- to six-week-old, castrated male Syrian hamsters were randomized to control (n = 4), HLLP, HLHP, and HMLP SPS (n = 6) groups. Testosterone or dihydrotestosterone was applied topically daily for 21 days to the right flank organ; the left flank organ was treated with ethanol and served as the control. Thirty minutes later, SPS or ethanol was applied to each flank organ in treatment and control groups, respectively. SPS treatments caused a notable but nonsignificant reduction in the difference between left and right flank organ growth in testosterone-treated SPS groups compared to the control. The same level of inhibition was not seen in dihydrotestosterone-treated SPS groups (p < 0.05). Results may suggest that SPS inhibit 5α-reductase thereby preventing hamster flank organ growth.
Effect of Saw Palmetto Supplements on Androgen-Sensitive LNCaP Human Prostate Cancer Cell Number and Syrian Hamster Flank Organ Growth.

Science.gov (United States)

Opoku-Acheampong, Alexander B; Penugonda, Kavitha; Lindshield, Brian L

2016-01-01

Saw palmetto supplements (SPS) are commonly consumed by men with prostate cancer. We investigated whether SPS fatty acids and phytosterols concentrations determine their growth-inhibitory action in androgen-sensitive LNCaP cells and hamster flank organs. High long-chain fatty acids-low phytosterols (HLLP) SPS ≥ 750 nM with testosterone significantly increased and ≥500 nM with dihydrotestosterone significantly decreased LNCaP cell number. High long-chain fatty acids-high phytosterols (HLHP) SPS ≥ 500 nM with dihydrotestosterone and high medium-chain fatty acids-low phytosterols (HMLP) SPS ≥ 750 nM or with androgens significantly decreased LNCaP cell number (n = 3; p < 0.05). Five- to six-week-old, castrated male Syrian hamsters were randomized to control (n = 4), HLLP, HLHP, and HMLP SPS (n = 6) groups. Testosterone or dihydrotestosterone was applied topically daily for 21 days to the right flank organ; the left flank organ was treated with ethanol and served as the control. Thirty minutes later, SPS or ethanol was applied to each flank organ in treatment and control groups, respectively. SPS treatments caused a notable but nonsignificant reduction in the difference between left and right flank organ growth in testosterone-treated SPS groups compared to the control. The same level of inhibition was not seen in dihydrotestosterone-treated SPS groups (p < 0.05). Results may suggest that SPS inhibit 5α-reductase thereby preventing hamster flank organ growth.
Experimental Evaluation and Optimization of Flank Wear During Turning of AISI 4340 Steel with Coated Carbide Inserts Using Different Cutting Fluids

Science.gov (United States)

Lawal, S. A.; Choudhury, I. A.; Nukman, Y.

2015-01-01

Understanding the performance of cutting fluids in the turning process is very important for improving the efficiency of the process. This efficiency can be determined based on certain process parameters such as flank wear, cutting forces developed, temperature developed at the tool-chip interface, surface roughness of the workpiece, etc. In this study, the objective is to determine the influence of cutting fluids on flank wear during turning of AISI 4340 with coated carbide inserts. The performances of three types of cutting fluids were compared using the Taguchi experimental method. The results show that palm kernel oil based cutting fluids performed better than the other two cutting fluids in reducing flank wear. Mathematical models for cutting parameters such as cutting speed, feed rate, depth of cut and cutting fluids were obtained from regression analysis using MINITAB 14 software to predict flank wear. Experiments were conducted based on the optimized values to validate the regression equations for flank wear, and an error of 5.82 % was obtained. The optimal cutting parameters for flank wear using the S/N ratio were 160 m/min cutting speed (level 1), 0.18 mm/rev feed (level 1), 1.75 mm depth of cut (level 2) and 2.97 mm²/s palm kernel oil based cutting fluid (level 3). ANOVA shows cutting speed (85.36 %) and feed rate (4.81 %) as significant factors.
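The S/N ratio mentioned in this abstract can be illustrated with a minimal sketch. It assumes the standard Taguchi "smaller-the-better" criterion, S/N = -10 log10(mean(y²)), which is the usual choice when the response (here flank wear) is to be minimized; the abstract does not state which criterion the authors used. The function name and the wear values are hypothetical.

```python
import math


def sn_smaller_is_better(values):
    """Taguchi smaller-the-better signal-to-noise ratio in dB.

    S/N = -10 * log10(mean of squared responses).
    A higher S/N means a lower (better) response.
    """
    if not values:
        raise ValueError("need at least one response value")
    mean_square = sum(v * v for v in values) / len(values)
    return -10.0 * math.log10(mean_square)


# Hypothetical flank-wear replicates (mm) for two parameter settings.
setting_a = [0.10, 0.10]
setting_b = [0.20, 0.25]

# The setting with the higher S/N ratio is preferred.
print(sn_smaller_is_better(setting_a) > sn_smaller_is_better(setting_b))
```

In a Taguchi analysis this ratio would be computed for every run of the orthogonal array and then averaged per factor level to pick the preferred level of each factor.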
Chromosome-specific DNA Repeat Probes

Energy Technology Data Exchange (ETDEWEB)

Baumgartner, Adolf; Weier, Jingly Fung; Weier, Heinz-Ulrich G.

2006-03-16

In research as well as in clinical applications, fluorescence in situ hybridization (FISH) has gained increasing popularity as a highly sensitive technique to study cytogenetic changes. Today, hundreds of commercially available DNA probes serve the basic needs of the biomedical research community. Widespread applications, however, are often limited by the lack of appropriately labeled, specific nucleic acid probes. We describe two approaches for an expeditious preparation of chromosome-specific DNAs and the subsequent probe labeling with reporter molecules of choice. The described techniques allow the preparation of highly specific DNA repeat probes suitable for enumeration of chromosomes in interphase cell nuclei or tissue sections. In addition, there is no need for chromosome enrichment by flow cytometry and sorting or molecular cloning. Our PCR-based method uses either bacterial artificial chromosomes or human genomic DNA as templates with α-satellite-specific primers. Here we demonstrate the production of fluorochrome-labeled DNA repeat probes specific for human chromosomes 17 and 18 in just a few days without the need for highly specialized equipment and without the limitation to only a few fluorochrome labels.
Flanking sequence determination and event-specific detection of genetically modified wheat B73-6-1.

Science.gov (United States)

Xu, Junyi; Cao, Jijuan; Cao, Dongmei; Zhao, Tongtong; Huang, Xin; Zhang, Piqiao; Luan, Fengxia

2013-05-01

In order to establish a specific identification method for genetically modified (GM) wheat, exogenous insert DNA and flanking sequence between exogenous fragment and recombinant chromosome of GM wheat B73-6-1 were successfully acquired by means of conventional polymerase chain reaction (PCR) and thermal asymmetric interlaced (TAIL)-PCR strategies. Newly acquired exogenous fragment covered the full-length sequence of transformed genes such as transformed plasmid and corresponding functional genes including marker uidA, herbicide-resistant bar, ubiquitin promoter, and high-molecular-weight gluten subunit. The flanking sequence between insert DNA revealed high similarity with Triticum turgidum A gene (GenBank: AY494981.1). A specific PCR detection method for GM wheat B73-6-1 was established on the basis of primers designed according to the flanking sequence. This specific PCR method was validated by GM wheat, GM corn, GM soybean, GM rice, and non-GM wheat. The specifically amplified target band was observed only in GM wheat B73-6-1. This method is of high specificity, high reproducibility, rapid identification, and excellent accuracy for the identification of GM wheat B73-6-1.
The Amaranth Genome: Genome, Transcriptome, and Physical Map Assembly

Directory of Open Access Journals (Sweden)

J. W. Clouse

2016-03-01

Full Text Available Amaranth (Amaranthus hypochondriacus L.) is an emerging pseudocereal native to the New World that has garnered increased attention in recent years because of its nutritional quality, in particular its seed protein and more specifically its high levels of the essential amino acid lysine. It belongs to the Amaranthaceae family, is an ancient paleopolyploid that shows disomic inheritance (2n = 32), and has an estimated genome size of 466 Mb. Here we present a high-quality draft genome sequence of the grain amaranth. The genome assembly consisted of 377 Mb in 3518 scaffolds with an N50 of 371 kb. Repetitive element analysis predicted that 48% of the genome is comprised of repeat sequences, of which Copia-like elements were the most commonly classified retrotransposon. A de novo transcriptome consisting of 66,370 contigs was assembled from eight different amaranth tissue and abiotic stress libraries. Annotation of the genome identified 23,059 protein-coding genes. Seven grain amaranths (A. hypochondriacus, A. caudatus, and A. cruentus) and their putative progenitor (A. hybridus) were resequenced. A single nucleotide polymorphism (SNP) phylogeny supported the classification of A. hybridus as the progenitor species of the grain amaranths. Lastly, we generated a de novo physical map for A. hypochondriacus using the BioNano Genomics’ Genome Mapping platform. The physical map spanned 340 Mb and a hybrid assembly using the BioNano physical maps nearly doubled the N50 of the assembly to 697 kb. Moreover, we analyzed synteny between amaranth and sugar beet (Beta vulgaris L.) and estimated, using Ks analysis, the age of the most recent polyploidization event in amaranth.
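The scaffold contiguity statistic quoted in this abstract can be sketched as follows. This is a minimal illustration of the commonly used N50 definition (the length of the shortest sequence in the smallest set of longest sequences that together cover at least half of the assembly); the function name and the toy lengths are hypothetical and are not taken from the amaranth assembly.

```python
def n50(lengths):
    """Return the N50 of a collection of scaffold or contig lengths.

    Lengths are sorted from longest to shortest and accumulated until
    the running total reaches at least half of the assembly size; the
    length at which that happens is the N50.
    """
    if not lengths:
        raise ValueError("need at least one length")
    ordered = sorted(lengths, reverse=True)
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            return length


# Hypothetical scaffold lengths (arbitrary units); total is 300.
toy_scaffolds = [80, 70, 50, 40, 30, 20, 10]
print(n50(toy_scaffolds))  # 80 + 70 = 150 reaches half of 300, so N50 is 70
```

Because N50 depends only on the length distribution, joining scaffolds (for example with a physical map) raises it without adding any sequence.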
Genomic applications in forensic medicine

DEFF Research Database (Denmark)

Børsting, Claus; Morling, Niels

2016-01-01

Since the 1980s, advances in DNA technology have revolutionized the scope and practice of forensic medicine. From the days of restriction fragment length polymorphisms (RFLPs) to short tandem repeats (STRs), the current focus is on the next generation genome sequencing. It has been almost a decad...
Fidelity of target site duplication and sequence preference during integration of xenotropic murine leukemia virus-related virus.

Directory of Open Access Journals (Sweden)

Sanggu Kim

Full Text Available Xenotropic murine leukemia virus (MLV)-related virus (XMRV) is a new human retrovirus associated with prostate cancer and chronic fatigue syndrome. The causal relationship of XMRV infection to human disease and the mechanism of pathogenicity have not been established. During retrovirus replication, integration of the cDNA copy of the viral RNA genome into the host cell chromosome is an essential step and involves coordinated joining of the two ends of the linear viral DNA into staggered sites on target DNA. Correct integration produces proviruses that are flanked by a short direct repeat, which varies from 4 to 6 bp among the retroviruses but is invariant for each particular retrovirus. Uncoordinated joining of the two viral DNA ends into target DNA can cause insertions, deletions, or other genomic alterations at the integration site. To determine the fidelity of XMRV integration, cells infected with XMRV were clonally expanded and DNA sequences at the viral-host DNA junctions were determined and analyzed. We found that a majority of the provirus ends were correctly processed and flanked by a 4-bp direct repeat of host DNA. A weak consensus sequence was also detected at the XMRV integration sites. We conclude that integration of XMRV DNA involves a coordinated joining of two viral DNA ends that are spaced 4 bp apart on the target DNA and proceeds with high fidelity.
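The flanking direct repeat described in this abstract can be checked with a few lines of code. The sketch below is only an illustration and not the authors' analysis pipeline: it assumes the host sequences on either side of a provirus have already been extracted, both read 5' to 3' on the same strand, and it tests whether the last k host bases before the provirus are repeated as the first k host bases after it. The function name and the sequences are hypothetical.

```python
def has_target_site_duplication(left_flank, right_flank, k=4):
    """Return True if the provirus is flanked by a k-bp direct repeat.

    left_flank  -- host DNA immediately upstream of the provirus
    right_flank -- host DNA immediately downstream of the provirus
    k           -- expected duplication length (4 bp in the XMRV abstract)
    """
    if k <= 0 or len(left_flank) < k or len(right_flank) < k:
        return False
    # Compare the k bases abutting each end of the provirus.
    return left_flank[-k:].upper() == right_flank[:k].upper()


# Hypothetical junction sequences: "TAGC" appears on both sides.
print(has_target_site_duplication("ACGTTAGC", "TAGCGGAT"))  # True
print(has_target_site_duplication("ACGTTAGC", "TAGAGGAT"))  # False
```

A junction that fails this test would be a candidate for the uncoordinated joining events (insertions or deletions at the integration site) that the abstract contrasts with correct integration.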
Short Interspersed Nuclear Element (SINE) Sequences in the Genome of the Human Pathogenic Fungus Aspergillus fumigatus Af293.

Directory of Open Access Journals (Sweden)

Lakkhana Kanhayuwa

Full Text Available Novel families of short interspersed nuclear element (SINE) sequences in the human pathogenic fungus Aspergillus fumigatus, clinical isolate Af293, were identified and categorised into tRNA-related and 5S rRNA-related SINEs. Eight predicted tRNA-related SINE families originating from different tRNAs, and nominated as AfuSINE2 sequences, contained target site duplications of short direct repeat sequences (4-14 bp) flanking the elements, an extended tRNA-unrelated region and typical features of RNA polymerase III promoter sequences. The elements ranged in size from 140-493 bp and were present in low copy number in the genome, and five out of eight were actively transcribed. One putative tRNAArg-derived sequence, AfuSINE2-1a, possessed a unique feature of repeated trinucleotide ACT residues at its 3'-terminus. This element was similar in sequence to the I-4_AO element found in A. oryzae and an I-1_AF long nuclear interspersed element-like sequence identified in A. fumigatus Af293. Families of 5S rRNA-related SINE sequences, nominated as AfuSINE3, were also identified and their 5'-5S rRNA-related regions show 50-65% and 60-75% similarity to A. fumigatus 5S rRNAs and to SINE3-1_AO found in A. oryzae, respectively. A. fumigatus Af293 contains five copies of AfuSINE3 sequences ranging in size from 259-343 bp, and two out of five AfuSINE3 sequences were actively transcribed. Investigations on AfuSINE distribution in the fungal genome revealed that the elements are enriched in pericentromeric and subtelomeric regions and inserted within gene-rich regions. We also demonstrated that some, but not all, AfuSINE sequences are targeted by host RNA silencing mechanisms. Finally, we demonstrated that infection of the fungus with mycoviruses had no apparent effects on SINE activity.
Repetitive elements may comprise over two-thirds of the human genome.

Directory of Open Access Journals (Sweden)

A P Jason de Koning

2011-12-01

Full Text Available Transposable elements (TEs) are conventionally identified in eukaryotic genomes by alignment to consensus element sequences. Using this approach, about half of the human genome has been previously identified as TEs and low-complexity repeats. We recently developed a highly sensitive alternative de novo strategy, P-clouds, that instead searches for clusters of high-abundance oligonucleotides that are related in sequence space (oligo "clouds"). We show here that P-clouds predicts >840 Mbp of additional repetitive sequences in the human genome, thus suggesting that 66%-69% of the human genome is repetitive or repeat-derived. To investigate this remarkable difference, we conducted detailed analyses of the ability of both P-clouds and a commonly used conventional approach, RepeatMasker (RM), to detect different sized fragments of the highly abundant human Alu and MIR SINEs. RM can have surprisingly low sensitivity for even moderately long fragments, in contrast to P-clouds, which has good sensitivity down to small fragment sizes (∼25 bp). Although short fragments have a high intrinsic probability of being false positives, we performed a probabilistic annotation that reflects this fact. We further developed "element-specific" P-clouds (ESPs) to identify novel Alu and MIR SINE elements, and using it we identified ∼100 Mb of previously unannotated human elements. ESP estimates of new MIR sequences are in good agreement with RM-based predictions of the amount that RM missed. These results highlight the need for combined, probabilistic genome annotation approaches and suggest that the human genome consists of substantially more repetitive sequence than previously believed.
Comparing Mycobacterium tuberculosis genomes using genome topology networks.

Science.gov (United States)

Jiang, Jianping; Gu, Jianlei; Zhang, Liang; Zhang, Chenyi; Deng, Xiao; Dou, Tonghai; Zhao, Guoping; Zhou, Yan

2015-02-14

Over the last decade, emerging research methods, such as comparative genomic analysis and phylogenetic study, have yielded new insights into genotypes and phenotypes of closely related bacterial strains. Several findings have revealed that genomic structural variations (SVs), including gene gain/loss, gene duplication and genome rearrangement, can lead to different phenotypes among strains, and an investigation of genes affected by SVs may extend our knowledge of the relationships between SVs and phenotypes in microbes, especially in pathogenic bacteria. In this work, we introduce a 'Genome Topology Network' (GTN) method based on gene homology and gene locations to analyze genomic SVs and perform phylogenetic analysis. Furthermore, the concept of 'unfixed ortholog' has been proposed, whose members are affected by SVs in genome topology among close species. To improve the precision of 'unfixed ortholog' recognition, a strategy to detect annotation differences and complete gene annotation was applied. To assess the GTN method, a set of thirteen complete M. tuberculosis genomes was analyzed as a case study. GTNs with two different gene homology-assigning methods were built, the Clusters of Orthologous Groups (COG) method and the orthoMCL clustering method, and two phylogenetic trees were constructed accordingly, which may provide additional insights into whole genome-based phylogenetic analysis. We obtained 24 unfixable COG groups, of which most members were related to immunogenicity and drug resistance, such as PPE-repeat proteins (COG5651) and transcriptional regulator TetR gene family members (COG1309). The GTN method has been implemented in PERL and released on our website. The tool can be downloaded from http://homepage.fudan.edu.cn/zhouyan/gtn/ , and allows re-annotating the 'lost' genes among closely related genomes, analyzing genes affected by SVs, and performing phylogenetic analysis. With this tool, many immunogenic-related and drug resistance-related genes
Combined amplification and hybridization techniques for genome scanning in vegetatively propagated crops

Energy Technology Data Exchange (ETDEWEB)

Kahl, G; Ramser, J; Terauchi, R [Biocentre, University of Frankfurt, Frankfurt am Main (Germany); Lopez-Peralta, C [IRGP, Colegio de Postgraduados, Montecillo, Edo. de Mexico, Texcoco (Mexico); Asemota, H N [Biotechnology Centre, University of the West Indies, Mona, Kingston (Jamaica); Weising, K [School of Biological Sciences, University of Auckland, Auckland (New Zealand)

1998-10-01

A combination of PCR- and hybridization-based genome scanning techniques and sequence comparisons between non-coding chloroplast DNA flanking tRNA genes has been employed to screen Dioscorea species for intra- and interspecific genetic diversity. This methodology detected extensive polymorphisms within Dioscorea bulbifera L., and revealed taxonomic and phylogenetic relationships among cultivated Guinea yam varieties and their potential wild progenitors. Finally, screening of yam germplasm grown in Jamaica permitted reliable discrimination between all major cultivars. Genome scanning by microsatellite-primed PCR (MP-PCR) and random amplified polymorphic DNA (RAPD) analysis in combination with the novel random amplified microsatellite polymorphisms (RAMPO) hybridization technique has shown high potential for the genetic analysis of yams, and holds promise for other vegetatively propagated orphan crops. (author) 46 refs, 3 figs, 3 tabs
Direct and inverted repeats elicit genetic instability by both exploiting and eluding DNA double-strand break repair systems in mycobacteria.

Directory of Open Access Journals (Sweden)

Ewelina A Wojcik

Full Text Available Repetitive DNA sequences with the potential to form alternative DNA conformations, such as slipped structures and cruciforms, can induce genetic instability by promoting replication errors and by serving as a substrate for DNA repair proteins, which may lead to DNA double-strand breaks (DSBs). However, the contribution of each of the DSB repair pathways, homologous recombination (HR), non-homologous end-joining (NHEJ) and single-strand annealing (SSA), to this sort of genetic instability is not fully understood. Herein, we assessed the genome-wide distribution of repetitive DNA sequences in the Mycobacterium smegmatis, Mycobacterium tuberculosis and Escherichia coli genomes, and determined the types and frequencies of genetic instability induced by direct and inverted repeats, both in the presence and in the absence of HR, NHEJ, and SSA. All three genomes are strongly enriched in direct repeats and modestly enriched in inverted repeats. When using chromosomally integrated constructs in M. smegmatis, direct repeats induced the perfect deletion of their intervening sequences ~1,000-fold above background. Absence of HR further enhanced these perfect deletions, whereas absence of NHEJ or SSA had no influence, suggesting compromised replication fidelity. In contrast, inverted repeats induced perfect deletions only in the absence of SSA. Both direct and inverted repeats stimulated excision of the constructs from the attB integration sites independently of HR, NHEJ, or SSA. With episomal constructs, direct and inverted repeats triggered DNA instability by activating nucleolytic activity, and absence of the DSB repair pathways (in the order NHEJ>HR>SSA) exacerbated this instability. Thus, direct and inverted repeats may elicit genetic instability in mycobacteria by 1) directly interfering with replication fidelity, 2) stimulating the three main DSB repair pathways, and 3) enticing L5 site-specific recombination.
Direct and inverted repeats elicit genetic instability by both exploiting and eluding DNA double-strand break repair systems in mycobacteria.

Science.gov (United States)

Wojcik, Ewelina A; Brzostek, Anna; Bacolla, Albino; Mackiewicz, Pawel; Vasquez, Karen M; Korycka-Machala, Malgorzata; Jaworski, Adam; Dziadek, Jaroslaw

2012-01-01

Repetitive DNA sequences with the potential to form alternative DNA conformations, such as slipped structures and cruciforms, can induce genetic instability by promoting replication errors and by serving as a substrate for DNA repair proteins, which may lead to DNA double-strand breaks (DSBs). However, the contribution of each of the DSB repair pathways, homologous recombination (HR), non-homologous end-joining (NHEJ) and single-strand annealing (SSA), to this sort of genetic instability is not fully understood. Herein, we assessed the genome-wide distribution of repetitive DNA sequences in the Mycobacterium smegmatis, Mycobacterium tuberculosis and Escherichia coli genomes, and determined the types and frequencies of genetic instability induced by direct and inverted repeats, both in the presence and in the absence of HR, NHEJ, and SSA. All three genomes are strongly enriched in direct repeats and modestly enriched in inverted repeats. When using chromosomally integrated constructs in M. smegmatis, direct repeats induced the perfect deletion of their intervening sequences ~1,000-fold above background. Absence of HR further enhanced these perfect deletions, whereas absence of NHEJ or SSA had no influence, suggesting compromised replication fidelity. In contrast, inverted repeats induced perfect deletions only in the absence of SSA. Both direct and inverted repeats stimulated excision of the constructs from the attB integration sites independently of HR, NHEJ, or SSA. With episomal constructs, direct and inverted repeats triggered DNA instability by activating nucleolytic activity, and absence of the DSB repair pathways (in the order NHEJ>HR>SSA) exacerbated this instability. Thus, direct and inverted repeats may elicit genetic instability in mycobacteria by 1) directly interfering with replication fidelity, 2) stimulating the three main DSB repair pathways, and 3) enticing L5 site-specific recombination.
Study of surface roughness and flank wear in hard turning of AISI 4140 steel with coated ceramic inserts

Energy Technology Data Exchange (ETDEWEB)

Das, Sudhansu Ranjan; Kuma, Amaresh [National Institute of Technology, Jamshedpur (India); Dhupal, Debabrata [Veer Surendra Sai University of Technology, Burla (India)

2015-10-15

This experimental investigation deals with dry hard turning of AISI 4140 steel using PVD-TiN coated Al₂O₃+TiCN mixed ceramic inserts. The combined effect of cutting parameters (cutting speed, feed and depth of cut) on performance characteristics such as surface roughness and flank wear is explored by Full factorial design (FFD) and analysis of variance (ANOVA). The results show that feed is the principal cutting parameter influencing surface roughness, followed by cutting speed. Flank wear, however, is affected by cutting speed and by the interaction of feed and depth of cut; depth of cut on its own was not found to be statistically significant, although flank wear is an increasing function of depth of cut. Observations are made on the machined surface and the worn tool by Scanning electron microscope (SEM) to establish the process. Abrasion was the major wear mechanism found during hard turning within the studied range. The effect of tool wear on surface roughness was also studied. The experimental data were analyzed to predict the optimal range of surface roughness and flank wear. Based on Response surface methodology (RSM), mathematical models were developed for surface roughness (Ra) and flank wear (VB) with 95% confidence level. Finally, under optimum cutting conditions (obtained by response optimization technique), tool life was evaluated to perform cost analysis for justifying the economic viability of coated ceramic inserts in hard turning. The estimated machining cost per part for TiN coated ceramic was found to be lower (Rs. 12.31) because of higher tool life (51 min), which results in the reduction of downtime and increase in savings.
Study of surface roughness and flank wear in hard turning of AISI 4140 steel with coated ceramic inserts

International Nuclear Information System (INIS)

Das, Sudhansu Ranjan; Kuma, Amaresh; Dhupal, Debabrata

2015-01-01

This experimental investigation deals with dry hard turning of AISI 4140 steel using PVD-TiN coated Al₂O₃+TiCN mixed ceramic inserts. The combined effect of cutting parameters (cutting speed, feed and depth of cut) on performance characteristics such as surface roughness and flank wear is explored by Full factorial design (FFD) and analysis of variance (ANOVA). The results show that feed is the principal cutting parameter influencing surface roughness, followed by cutting speed. Flank wear, however, is affected by cutting speed and by the interaction of feed and depth of cut; depth of cut on its own was not found to be statistically significant, although flank wear is an increasing function of depth of cut. Observations are made on the machined surface and the worn tool by Scanning electron microscope (SEM) to establish the process. Abrasion was the major wear mechanism found during hard turning within the studied range. The effect of tool wear on surface roughness was also studied. The experimental data were analyzed to predict the optimal range of surface roughness and flank wear. Based on Response surface methodology (RSM), mathematical models were developed for surface roughness (Ra) and flank wear (VB) with 95% confidence level. Finally, under optimum cutting conditions (obtained by response optimization technique), tool life was evaluated to perform cost analysis for justifying the economic viability of coated ceramic inserts in hard turning. The estimated machining cost per part for TiN coated ceramic was found to be lower (Rs. 12.31) because of higher tool life (51 min), which results in the reduction of downtime and increase in savings.
Scanning mutagenesis of the amino acid sequences flanking phosphorylation site 1 of the mitochondrial pyruvate dehydrogenase complex

Directory of Open Access Journals (Sweden)

Nagib Ahsan

2012-07-01

Full Text Available The mitochondrial pyruvate dehydrogenase complex is regulated by reversible seryl-phosphorylation of the E1α subunit by a dedicated, intrinsic kinase. The phospho-complex is reactivated when dephosphorylated by an intrinsic PP2C-type protein phosphatase. Both the position of the phosphorylated Ser-residue and the sequences of the flanking amino acids are highly conserved. We have used the synthetic peptide-based kinase client assay plus recombinant pyruvate dehydrogenase E1α and E1α-kinase to perform scanning mutagenesis of the residues flanking the site of phosphorylation. Consistent with the results from phylogenetic analysis of the flanking sequences, the direct peptide-based kinase assays tolerated very few changes. Even conservative changes such as Leu, Ile, or Val for Met, or Glu for Asp, gave very marked reductions in phosphorylation. Overall the results indicate that regulation of the mitochondrial pyruvate dehydrogenase complex by reversible phosphorylation is an extreme example of multiple, interdependent instances of co-evolution.
Finite Element Analysis Of Influence Of Flank Wear Evolution On Forces In Orthogonal Cutting Of 42CrMo4 Steel

Directory of Open Access Journals (Sweden)

Madajewski Marek

2017-01-01

Full Text Available This paper presents an analysis of the influence of flank wear on forces in orthogonal turning of 42CrMo4 steel and evaluates the capacity of a finite element model to provide such force values. Data on the magnitude of the feed and cutting forces were obtained from measurements with a force tensiometer in an experimental test as well as from finite element analysis of the chip formation process in ABAQUS/Explicit software. For the studies, an insert with a complex rake face was selected, and flank wear was simulated by a grinding operation on its flank face. The aim of grinding the insert surface was to obtain even, flat wear along the cutting edge, which after measurement could be modeled with a CAD program and applied in FE analysis for a selected range of wear widths. By comparing both sets of force values as a function of flank wear under given cutting conditions, the FEA model was validated, and it was established that it can be applied to analyze other physical aspects of machining. Force analysis found that progression of wear causes an increase in cutting force magnitude and a steep rise in feed force magnitude. Analysis of the Fc/Ff force ratio revealed that flank wear has a significant impact on the resultant force in orthogonal cutting and on the magnitude of its components in the cutting and feed directions. A surge in force values can result in the transfer of substantial loads to the machine-tool interface.
Finite Element Analysis Of Influence Of Flank Wear Evolution On Forces In Orthogonal Cutting Of 42CrMo4 Steel

Science.gov (United States)

Madajewski, Marek; Nowakowski, Zbigniew

2017-01-01

This paper presents an analysis of the influence of flank wear on forces in orthogonal turning of 42CrMo4 steel and evaluates the capacity of a finite element model to provide such force values. Data on the magnitude of the feed and cutting forces were obtained from measurements with a force tensiometer in an experimental test as well as from finite element analysis of the chip formation process in ABAQUS/Explicit software. For the studies, an insert with a complex rake face was selected, and flank wear was simulated by a grinding operation on its flank face. The aim of grinding the insert surface was to obtain even, flat wear along the cutting edge, which after measurement could be modeled with a CAD program and applied in FE analysis for a selected range of wear widths. By comparing both sets of force values as a function of flank wear under given cutting conditions, the FEA model was validated, and it was established that it can be applied to analyze other physical aspects of machining. Force analysis found that progression of wear causes an increase in cutting force magnitude and a steep rise in feed force magnitude. Analysis of the Fc/Ff force ratio revealed that flank wear has a significant impact on the resultant force in orthogonal cutting and on the magnitude of its components in the cutting and feed directions. A surge in force values can result in the transfer of substantial loads to the machine-tool interface.
Mitochondrial genome sequences and comparative genomics of Phytophthora ramorum and P. sojae

Energy Technology Data Exchange (ETDEWEB)

Martin, Frank N.; Douda, Bensasson; Tyler, Brett M.; Boore, Jeffrey L.

2007-01-01

The complete sequences of the mitochondrial genomes of the oomycetes Phytophthora ramorum and P. sojae were determined during the course of their complete nuclear genome sequencing (Tyler, et al. 2006). Both are circular, with sizes of 39,314 bp for P. ramorum and 42,975 bp for P. sojae. Each contains a total of 37 identifiable protein-encoding genes, 25 or 26 tRNAs (P. sojae and P. ramorum, respectively) specifying 19 amino acids, and a variable number of ORFs (7 for P. ramorum and 12 for P. sojae) which are potentially additional functional genes. Non-coding regions comprise approximately 11.5 percent and 18.4 percent of the genomes of P. ramorum and P. sojae, respectively. Relative to P. sojae, there is an inverted repeat of 1,150 bp in P. ramorum that includes an unassigned unique ORF, a tRNA gene, and adjacent non-coding sequences, but otherwise the gene order in both species is identical. Comparisons of these genomes with published sequences of the P. infestans mitochondrial genome reveal a number of similarities, but the gene order in P. infestans differs in two adjacent locations due to inversions. Sequence alignments of the three genomes indicated sequence conservation ranging from 75 to 85 percent and that specific regions were more variable than others.

Complete sequences of organelle genomes from the medicinal plant Rhazya stricta (Apocynaceae) and contrasting patterns of mitochondrial genome evolution across asterids.

Science.gov (United States)

Park, Seongjun; Ruhlman, Tracey A; Sabir, Jamal S M; Mutwakil, Mohammed H Z; Baeshen, Mohammed N; Sabir, Meshaal J; Baeshen, Nabih A; Jansen, Robert K

2014-05-28

Rhazya stricta is native to arid regions in South Asia and the Middle East and is used extensively in folk medicine to treat a wide range of diseases. In addition to generating genomic resources for this medicinally important plant, analyses of the complete plastid and mitochondrial genomes and a nuclear transcriptome from Rhazya provide insights into inter-compartmental transfers between genomes and the patterns of evolution among eight asterid mitochondrial genomes. The 154,841 bp plastid genome is highly conserved with gene content and order identical to the ancestral organization of angiosperms. The 548,608 bp mitochondrial genome exhibits a number of phenomena including the presence of recombinogenic repeats that generate a multipartite organization, transferred DNA from the plastid and nuclear genomes, and bidirectional DNA transfers between the mitochondrion and the nucleus. The mitochondrial genes sdh3 and rps14 have been transferred to the nucleus and have acquired targeting presequences. In the case of rps14, two copies are present in the nucleus; only one has a mitochondrial targeting presequence and may be functional. Phylogenetic analyses of both nuclear and mitochondrial copies of rps14 across angiosperms suggests Rhazya has experienced a single transfer of this gene to the nucleus, followed by a duplication event. Furthermore, the phylogenetic distribution of gene losses and the high level of sequence divergence in targeting presequences suggest multiple, independent transfers of both sdh3 and rps14 across asterids. Comparative analyses of mitochondrial genomes of eight sequenced asterids indicates a complicated evolutionary history in this large angiosperm clade with considerable diversity in genome organization and size, repeat, gene and intron content, and amount of foreign DNA from the plastid and nuclear genomes. Organelle genomes of Rhazya stricta provide valuable information for improving the understanding of mitochondrial genome evolution
Genome-Wide Comparison of Magnaporthe Species Reveals a Host-Specific Pattern of Secretory Proteins and Transposable Elements.

Directory of Open Access Journals (Sweden)

Meghana Deepak Shirke

Full Text Available Blast disease caused by the Magnaporthe species is a major factor affecting the productivity of rice, wheat and millets. This study was aimed at generating genomic information for rice and non-rice Magnaporthe isolates to understand the extent of genetic variation. We have sequenced the whole genome of the Magnaporthe isolates infecting rice (leaf and neck), finger millet (leaf and neck), foxtail millet (leaf) and buffel grass (leaf). Rice and finger millet isolates infecting both leaf and neck tissues were sequenced, since the damage and yield loss caused by neck blast is much higher than that caused by leaf blast. The genome-wide comparison was carried out to study the variability in gene content, candidate effectors, repeat element distribution, genes involved in carbohydrate metabolism and SNPs. The analysis of repeat element footprints revealed some genes such as naringenin, 2-oxoglutarate 3-dioxygenase being targeted by Pot2 and Occan, in isolates from different host species. Some repeat insertions were host-specific while other insertions were randomly shared between isolates. The distributions of repeat elements, secretory proteins, CAZymes and SNPs showed significant variation across host-specific lineages of Magnaporthe, indicating an independent genome evolution orchestrated by multiple genomic factors.
Rapid and highly efficient construction of TALE-based transcriptional regulators and nucleases for genome modification.

Science.gov (United States)

Li, Lixin; Piatek, Marek J; Atef, Ahmed; Piatek, Agnieszka; Wibowo, Anjar; Fang, Xiaoyun; Sabir, J S M; Zhu, Jian-Kang; Mahfouz, Magdy M

2012-03-01

Transcription activator-like effectors (TALEs) can be used as DNA-targeting modules by engineering their repeat domains to dictate user-selected sequence specificity. TALEs have been shown to function as site-specific transcriptional activators in a variety of cell types and organisms. TALE nucleases (TALENs), generated by fusing the FokI cleavage domain to TALE, have been used to create genomic double-strand breaks. The identity of the TALE repeat variable di-residues, their number, and their order dictate the DNA sequence specificity. Because TALE repeats are nearly identical, their assembly by cloning or even by synthesis is challenging and time consuming. Here, we report the development and use of a rapid and straightforward approach for the construction of designer TALE (dTALE) activators and nucleases with user-selected DNA target specificity. Using our plasmid set of 100 repeat modules, researchers can assemble repeat domains for any 14-nucleotide target sequence in one sequential restriction-ligation cloning step and in only 24 h. We generated several custom dTALEs and dTALENs with new target sequence specificities and validated their function by transient expression in tobacco leaves and in vitro DNA cleavage assays, respectively. Moreover, we developed a web tool, called idTALE, to facilitate the design of dTALENs and the identification of their genomic targets and potential off-targets in the genomes of several model species. Our dTALE repeat assembly approach along with the web tool idTALE will expedite genome-engineering applications in a variety of cell types and organisms including plants.
Genome-Wide Analysis of Microsatellite Markers Based on Sequenced Database in Chinese Spring Wheat (Triticum aestivum L.).

Directory of Open Access Journals (Sweden)

Bin Han

Full Text Available Microsatellites or simple sequence repeats (SSRs) are distributed across both prokaryotic and eukaryotic genomes and have been widely used for genetic studies and molecular marker-assisted breeding in crops. Though an ordered draft sequence of hexaploid bread wheat has been announced, a systematic analysis of SSRs in wheat has not been reported so far. In the present study, we identified 364,347 SSRs from among 10,603,760 sequences of the Chinese spring wheat (CSW) genome, which were present at a density of 36.68 SSR/Mb. In total, we detected 488 types of motifs ranging from di- to hexanucleotides, among which dinucleotide repeats dominated, accounting for approximately 42.52% of the genome. The density of tri- to hexanucleotide repeats was 24.97%, 4.62%, 3.25% and 24.65%, respectively. AG/CT, AAG/CTT, AGAT/ATCT, AAAAG/CTTTT and AAAATT/AATTTT were the most frequent repeats among di- to hexanucleotide repeats. Among the 21 chromosomes of CSW, the density of repeats was highest on chromosome 2D and lowest on chromosome 3A. The proportions of di-, tri-, tetra-, penta- and hexanucleotide repeats on each chromosome, and even on the whole genome, were almost identical. In addition, 295,267 SSR markers were successfully developed from the 21 chromosomes of CSW, which cover the entire genome at a density of 29.73 per Mb. All of the SSR markers were validated by reverse electronic-Polymerase Chain Reaction (re-PCR); 70,564 (23.9%) were found to be monomorphic and 224,703 (76.1%) were found to be polymorphic. A total of 45 monomorphic markers were selected randomly for validation purposes; 24 (53.3%) amplified one locus, 8 (17.8%) amplified multiple identical loci, and 13 (28.9%) did not amplify any fragments from the genomic DNA of CSW. Then a dendrogram was generated based on the 24 monomorphic SSR markers among 20 wheat cultivars and three species of its diploid ancestors showing that monomorphic SSR markers represented a promising
Stabilization of the genome of the mismatch repair deficient Mycobacterium tuberculosis by context-dependent codon choice.

Science.gov (United States)

Wanner, Roger M; Güthlein, Carolin; Springer, Burkhard; Böttger, Erik C; Ackermann, Martin

2008-05-28

The rate at which a stretch of DNA mutates is determined by the cellular systems for DNA replication and repair, and by the nucleotide sequence of the stretch itself. One sequence feature with a particularly strong influence on the mutation rate are nucleotide repeats. Some microbial pathogens use nucleotide repeats in their genome to stochastically vary phenotypic traits and thereby evade host defense. However, such unstable sequences also come at a cost, as mutations are often deleterious. Here, we analyzed how these opposing forces shaped genome stability in the human pathogen Mycobacterium tuberculosis. M. tuberculosis lacks a mismatch repair system, and this renders nucleotide repeats particularly unstable. We found that proteins of M. tuberculosis are encoded by using codons in a context-dependent manner that prevents the emergence of nucleotide repeats. This context-dependent codon choice leads to a strong decrease in the estimated frame-shift mutation rate and thus to an increase in genome stability. These results indicate that a context-specific codon choice can partially compensate for the lack of a mismatch repair system, and helps to maintain genome integrity in this pathogen.
Stabilization of the genome of the mismatch repair deficient Mycobacterium tuberculosis by context-dependent codon choice

Directory of Open Access Journals (Sweden)

Ackermann Martin

2008-05-01

Full Text Available Abstract Background The rate at which a stretch of DNA mutates is determined by the cellular systems for DNA replication and repair, and by the nucleotide sequence of the stretch itself. One sequence feature with a particularly strong influence on the mutation rate are nucleotide repeats. Some microbial pathogens use nucleotide repeats in their genome to stochastically vary phenotypic traits and thereby evade host defense. However, such unstable sequences also come at a cost, as mutations are often deleterious. Here, we analyzed how these opposing forces shaped genome stability in the human pathogen Mycobacterium tuberculosis. M. tuberculosis lacks a mismatch repair system, and this renders nucleotide repeats particularly unstable. Results We found that proteins of M. tuberculosis are encoded by using codons in a context-dependent manner that prevents the emergence of nucleotide repeats. This context-dependent codon choice leads to a strong decrease in the estimated frame-shift mutation rate and thus to an increase in genome stability. Conclusion These results indicate that a context-specific codon choice can partially compensate for the lack of a mismatch repair system, and helps to maintain genome integrity in this pathogen.
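The link between codon choice and nucleotide repeats described in this abstract can be illustrated with a small sketch. It is not the authors' analysis: the function names `longest_run` and `least_repetitive_encoding` are hypothetical, the codon table is a deliberately partial excerpt of the standard genetic code, and exhaustive enumeration is only practical for very short peptides.

```python
from itertools import groupby, product

# Synonymous codons from the standard genetic code (only the amino acids used below).
CODONS = {
    "K": ["AAA", "AAG"],                # lysine
    "G": ["GGT", "GGC", "GGA", "GGG"],  # glycine
    "F": ["TTT", "TTC"],                # phenylalanine
}

def longest_run(dna):
    """Length of the longest stretch of one repeated nucleotide."""
    return max((len(list(group)) for _, group in groupby(dna)), default=0)

def least_repetitive_encoding(peptide):
    """Try every synonymous encoding and keep the one with the shortest run."""
    candidates = ("".join(codons) for codons in product(*(CODONS[aa] for aa in peptide)))
    return min(candidates, key=longest_run)

print(longest_run("AAAAAA"))            # Lys-Lys encoded as AAA AAA -> 6
print(least_repetitive_encoding("KK"))  # same dipeptide, other codons -> AAGAAG
```

The best codon for one lysine depends on which codon encodes its neighbour, which is the sense in which the choice is context-dependent.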
Complete Sequence and Analysis of the Mitochondrial Genome of Hemiselmis andersenii CCMP644 (Cryptophyceae)

Directory of Open Access Journals (Sweden)

Bowman Sharen

2008-05-01

Full Text Available Abstract Background Cryptophytes are an enigmatic group of unicellular eukaryotes with plastids derived by secondary (i.e., eukaryote-eukaryote) endosymbiosis. Cryptophytes are unusual in that they possess four genomes: a host cell-derived nuclear and mitochondrial genome and an endosymbiont-derived plastid and 'nucleomorph' genome. The evolutionary origins of the host and endosymbiont components of cryptophyte algae are at present poorly understood. Thus far, a single complete mitochondrial genome sequence has been determined for the cryptophyte Rhodomonas salina. Here, the second complete mitochondrial genome of the cryptophyte alga Hemiselmis andersenii CCMP644 is presented. Results The H. andersenii mtDNA is 60,553 bp in size and encodes 30 structural RNAs and 36 protein-coding genes, all located on the same strand. A prominent feature of the genome is the presence of a ~20 Kbp long intergenic region comprised of numerous tandem and dispersed repeat units of between 22–336 bp. Adjacent to these repeats are 27 copies of palindromic sequences predicted to form stable DNA stem-loop structures. One such stem-loop is located near a GC-rich and GC-poor region and may have a regulatory function in replication or transcription. The H. andersenii mtDNA shares a number of features in common with the genome of the cryptophyte Rhodomonas salina, including general architecture, gene content, and the presence of a large repeat region. However, the H. andersenii mtDNA is devoid of inverted repeats and introns, which are present in R. salina. Comparative analyses of the suite of tRNAs encoded in the two genomes reveal that the H. andersenii mtDNA has lost or converted its original trnK(uuu) gene and possesses a trnS-derived 'trnK(uuu)', which appears unable to produce a functional tRNA. Mitochondrial protein coding gene phylogenies strongly support a variety of previously established eukaryotic groups, but fail to resolve the relationships among higher
A novel multiple locus variable number of tandem repeat (VNTR) analysis (MLVA) method for Propionibacterium acnes.

Science.gov (United States)

Hauck, Yolande; Soler, Charles; Gérôme, Patrick; Vong, Rithy; Macnab, Christine; Appere, Géraldine; Vergnaud, Gilles; Pourcel, Christine

2015-07-01

Propionibacterium acnes plays a central role in the pathogenesis of acne and is responsible for severe opportunistic infections. Numerous typing schemes have been developed that allow the identification of phylotypes, but they are often insufficient to differentiate subtypes. To better understand the genetic diversity of this species and to perform epidemiological analyses, high throughput discriminant genotyping techniques are needed. Here we describe the development of a multiple locus variable number of tandem repeats (VNTR) analysis (MLVA) method. Thirteen VNTRs were identified in the genome of P. acnes and were used to genotype a collection of clinical isolates. In addition, publicly available sequencing data for 102 genomes were analyzed in silico, providing an MLVA genotype. The clustering of MLVA data was in perfect congruence with whole genome based clustering. Analysis of the clustered regularly interspaced short palindromic repeat (CRISPR) element uncovered new spacers, a supplementary source of genotypic information. The present MLVA13 scheme and associated internet database represent a first-line genotyping assay to investigate large numbers of isolates. Particular strains may then be submitted to full genome sequencing in order to better analyze their pathogenic potential. Copyright © 2015 Elsevier B.V. All rights reserved.
Novel SINEs families in Medicago truncatula and Lotus japonicus: bioinformatic analysis.

Science.gov (United States)

Gadzalski, Marek; Sakowicz, Tomasz

2011-07-01

Although short interspersed elements (SINEs) were discovered nearly 30 years ago, studies of these genomic repeats have mostly been limited to animal genomes. Very little is known about SINEs in legumes, one of the most important plant families. Here we report the identification, genomic distribution and molecular features of six novel SINE elements in Lotus japonicus (named LJ_SINE-1, -2, -3) and Medicago truncatula (MT_SINE-1, -2, -3), model legume species. They possess all the structural features commonly found in short interspersed elements, including an RNA polymerase III promoter, a polyA tail and flanking repeats. The SINEs described here are present in low to moderate copy numbers, from 150 to 3000. Using bioinformatic analyses to search public databases, we show that three of the new SINE elements from M. truncatula seem to be characteristic of the Medicago and Trifolium genera. Two SINE families were found in L. japonicus, and one is present in both M. truncatula and L. japonicus. In addition, we discuss potential activities of the described elements. Copyright © 2011 Elsevier B.V. All rights reserved.
Identification and insertion polymorphisms of short interspersed nuclear elements (SINEs) in Brassica genomes

International Nuclear Information System (INIS)

Nouroz, F.; Naveed, M.

2018-01-01

The non-LTR retrotransposons (retroposons) are abundant in plant genomes, including those of members of the Brassicaceae. Of the retroposons in sequenced eukaryotic genomes, long interspersed nuclear elements (LINEs) are the most abundant, followed by short interspersed nuclear elements (SINEs). SINEs are short elements of 100-500 bp, flanked by variable-sized target site duplications and comprising a 5' tRNA region with a polymerase III promoter, an internal tRNA-unrelated region, a 3' LINE-derived region and a polyadenosine tail. Different computational approaches were used for the identification and characterization of SINEs, while PCR was used to detect SINE insertion polymorphisms in various Brassica genotypes. Ten previously unidentified families of SINEs were identified and characterized from Brassica genomes. The structural features of these SINEs were studied in detail and showed typical SINE characteristics: small sizes, target site duplications, head regions, internal regions (body) of variable sizes and a poly(A) tail at the 3' terminus. The elements from the various families ranged from 206-558 bp; the BoSINE2 family contained the smallest element (206 bp), while the largest members belonged to the BoSINE9 family (524-558 bp). The distribution and abundance of SINEs at a particular site/locus in various Brassica species and genotypes (40) were investigated with SINE-based PCR markers. Various SINE insertion polymorphisms were detected in different genotypes, where the higher PCR bands represented SINE insertions and the lower bands represented the pre-insertion sites (flanking regions). The analysis of Brassica SINE copy numbers from the 10 identified families revealed that around 860 and 1712 copies of SINEs were calculated from B. rapa and B. oleracea whole-genome shotgun (WGS) contigs, respectively. Analysis of the insertion sites of Brassica SINEs revealed that members of all 10 SINE families showed an insertion preference for AT-rich regions. The present
Detecting microsatellites within genomes: significant variation among algorithms

Directory of Open Access Journals (Sweden)

Rivals Eric

2007-04-01

Full Text Available Abstract Background Microsatellites are short, tandemly-repeated DNA sequences which are widely distributed among genomes. Their structure, role and evolution can be analyzed based on exhaustive extraction from sequenced genomes. Several dedicated algorithms have been developed for this purpose. Here, we compared the detection efficiency of five of them (TRF, Mreps, Sputnik, STAR, and RepeatMasker). Results Our analysis was first conducted on the human X chromosome, and microsatellite distributions were characterized by microsatellite number, length, and divergence from a pure motif. The algorithms work with user-defined parameters, and we demonstrate that the parameter values chosen can strongly influence microsatellite distributions. The five algorithms were then compared by fixing parameters settings, and the analysis was extended to three other genomes (Saccharomyces cerevisiae, Neurospora crassa and Drosophila melanogaster) spanning a wide range of size and structure. Significant differences for all characteristics of microsatellites were observed among algorithms, but not among genomes, for both perfect and imperfect microsatellites. Striking differences were detected for short microsatellites (below 20 bp), regardless of motif. Conclusion Since the algorithm used strongly influences empirical distributions, studies analyzing microsatellite evolution based on a comparison between empirical and theoretical size distributions should therefore be considered with caution. We also discuss why a typological definition of microsatellites limits our capacity to capture their genomic distributions.
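The point that user-defined parameters shape the detected microsatellite distribution can be seen in a toy detector. The sketch below is a simple regular-expression scan for perfect repeats only; it does not reproduce TRF, Mreps, Sputnik, STAR or RepeatMasker, and the function name `find_perfect_microsatellites` and its `min_length` and `max_unit` parameters are assumptions made for illustration.

```python
import re

def find_perfect_microsatellites(seq, min_length=12, max_unit=6):
    """Return (start, end, motif, copies) for each perfect tandem repeat."""
    hits = []
    for unit in range(1, max_unit + 1):
        # Smallest copy number whose total length reaches min_length (ceiling division).
        min_copies = max(2, -(-min_length // unit))
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (unit, min_copies - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            # Skip motifs that are repeats of a shorter unit (ATAT is AT twice);
            # the pass for the shorter unit reports that array instead.
            if motif in (motif + motif)[1:-1]:
                continue
            copies = (match.end() - match.start()) // unit
            hits.append((match.start(), match.end(), motif, copies))
    return sorted(hits)

# Toy sequence: an (AT)8 array and an (AGAT)3 array separated by spacers.
toy = "GG" + "AT" * 8 + "CC" + "AGAT" * 3 + "TT"
for threshold in (12, 16, 20):
    print(threshold, find_perfect_microsatellites(toy, min_length=threshold))
```

On this toy input the same sequence yields two, one or zero microsatellites as the minimum length rises from 12 to 16 to 20 bp, a small-scale version of the sensitivity that the abstract reports for short repeats.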
Data of 10 SSR markers for genomes of homo sapiens and monkeys.

Science.gov (United States)

Reddy, K K V V V S; Raju, S Viswanadha; Someswara Rao, Chinta

2017-06-01

In this data set, we present 10 simple sequence repeat (SSR) markers, TAGA, TCAT, GAAT, AGAT, AGAA, GATA, TATC, CTTT, TCTG and TCTA, which were extracted from the genomes of Homo sapiens and monkeys using a string matching mechanism [1]. All loci showed 4 base pairs (bp) in allele size, indicating that there are some polymorphisms between individuals correlating to the number of SSR repeats that may be useful for the detection of similarity among the genotypes. Collectively, these data show that SSR extraction is a valuable method to illustrate genetic variation of genomes.
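One way to picture extraction by string matching is to count back-to-back copies of each 4-bp motif. The abstract does not spell out the matching procedure, so the sketch below is only an assumption about what such a step could look like; `longest_tandem_run` is a hypothetical helper and the two sequences are invented toy alleles, not data from the study.

```python
MOTIFS = ["TAGA", "TCAT", "GAAT", "AGAT", "AGAA",
          "GATA", "TATC", "CTTT", "TCTG", "TCTA"]

def longest_tandem_run(seq, motif):
    """Largest number of back-to-back copies of motif found anywhere in seq."""
    best = 0
    start = seq.find(motif)
    while start != -1:
        copies, pos = 0, start
        # Walk forward one motif length at a time while copies keep matching.
        while seq.startswith(motif, pos):
            copies += 1
            pos += len(motif)
        best = max(best, copies)
        start = seq.find(motif, start + 1)
    return best

# Two invented alleles that differ only in the number of TCTA copies.
allele_a = "CC" + "TCTA" * 5 + "GG"
allele_b = "CC" + "TCTA" * 7 + "GG"
copies_a = longest_tandem_run(allele_a, "TCTA")
copies_b = longest_tandem_run(allele_b, "TCTA")
print(copies_a, copies_b, (copies_b - copies_a) * len("TCTA"))
```

Because several of the listed motifs are cyclic rotations of one another (TCTA and TATC, for example), the same tandem array is counted under more than one motif, so per-motif counts are not independent.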
Comparative genomics of multidrug resistance in Acinetobacter baumannii.

Directory of Open Access Journals (Sweden)

Pierre-Edouard Fournier

2006-01-01

Full Text Available Acinetobacter baumannii is a species of nonfermentative gram-negative bacteria commonly found in water and soil. This organism was susceptible to most antibiotics in the 1970s. It has now become a major cause of hospital-acquired infections worldwide due to its remarkable propensity to rapidly acquire resistance determinants to a wide range of antibacterial agents. Here we use a comparative genomic approach to identify the complete repertoire of resistance genes exhibited by the multidrug-resistant A. baumannii strain AYE, which is epidemic in France, as well as to investigate the mechanisms of their acquisition by comparison with the fully susceptible A. baumannii strain SDF, which is associated with human body lice. The assembly of the whole shotgun genome sequences of the strains AYE and SDF gave an estimated size of 3.9 and 3.2 Mb, respectively. A. baumannii strain AYE exhibits an 86-kb genomic region termed a resistance island (the largest identified to date), in which 45 resistance genes are clustered. At the homologous location, the SDF strain exhibits a 20-kb genomic island flanked by transposases but devoid of resistance markers. Such a switching genomic structure might be a hotspot that could explain the rapid acquisition of resistance markers under antimicrobial pressure. Sequence similarity and phylogenetic analyses confirm that most of the resistance genes found in the A. baumannii strain AYE have been recently acquired from bacteria of the genera Pseudomonas, Salmonella, or Escherichia. This study also resulted in the discovery of 19 new putative resistance genes. Whole-genome sequencing appears to be a fast and efficient approach to the exhaustive identification of resistance genes in epidemic infectious agents of clinical significance.
Fluorescence In Situ Hybridization (FISH)-Based Karyotyping Reveals Rapid Evolution of Centromeric and Subtelomeric Repeats in Common Bean (Phaseolus vulgaris) and Relatives

Directory of Open Access Journals (Sweden)

Aiko Iwata-Otsubo

2016-04-01

Full Text Available Fluorescence in situ hybridization (FISH)-based karyotyping is a powerful cytogenetics tool to study chromosome organization, behavior, and chromosome evolution. Here, we developed a FISH-based karyotyping system using a probe mixture comprised of centromeric and subtelomeric satellite repeats, 5S rDNA, and chromosome-specific BAC clones in common bean, which enables one to unambiguously distinguish all 11 chromosome pairs. Furthermore, we applied the karyotyping system to several wild relatives and landraces of common bean from two distinct gene pools, as well as other related Phaseolus species, to investigate repeat evolution in the genus Phaseolus. Comparison of karyotype maps within common bean indicates that chromosomal distribution of the centromeric and subtelomeric satellite repeats is stable, whereas the copy number of the repeats was variable, indicating rapid amplification/reduction of the repeats in specific genomic regions. In Phaseolus species that diverged approximately 2–4 million yr ago, copy numbers of centromeric repeats were largely reduced or diverged, and chromosomal distributions have changed, suggesting rapid evolution of centromeric repeats. We also detected variation in the distribution pattern of subtelomeric repeats in Phaseolus species. The FISH-based karyotyping system revealed that satellite repeats are actively and rapidly evolving, forming genomic features unique to individual common bean accessions and Phaseolus species.
A recurrent deletion syndrome at chromosome bands 2p11.2-2p12 flanked by segmental duplications at the breakpoints and including REEP1.

Science.gov (United States)

Stevens, Servi J C; Blom, Eveline W; Siegelaer, Ingrid T J; Smeets, Eric E J G L

2015-04-01

We identified an identical and recurrent 9.4-Mbp deletion at chromosome bands 2p11.2-2p12, which occurred de novo in two unrelated patients. It is flanked at the distal and proximal breakpoints by two homologous segmental duplications consisting of low copy repeat (LCR) blocks in direct orientation, which have >99% sequence identity. Despite the fact that the deletion was almost 10 Mbp in size, the patients showed a relatively mild clinical phenotype, that is, mild-to-moderate intellectual disability, a happy disposition, speech delay and delayed motor development. Their phenotype matches with that of previously described patients. The 2p11.2-2p12 deletion includes the REEP1 gene that is associated with spastic paraplegia and phenotypic features related to this are apparent in most 2p11.2-2p12 deletion patients, but not in all. Other hemizygous genes that may contribute to the clinical phenotype include LRRTM1 and CTNNA2. We propose a recurrent but rare 2p11.2-2p12 deletion syndrome based on (1) the identical, non-random localisation of the de novo deletion breakpoints in two unrelated patients and a patient from literature, (2) the patients' phenotypic similarity and their phenotypic overlap with other 2p deletions and (3) the presence of highly identical LCR blocks flanking both breakpoints, consistent with a non-allelic homologous recombination (NAHR)-mediated rearrangement.
The 5′ and 3′ Untranslated Regions of the Flaviviral Genome

Directory of Open Access Journals (Sweden)

Wy Ching Ng

2017-06-01

Full Text Available Flaviviruses are enveloped arthropod-borne viruses with a single-stranded, positive-sense RNA genome that can cause serious illness in humans and animals. The 11 kb 5′ capped RNA genome consists of a single open reading frame (ORF) and is flanked by 5′ and 3′ untranslated regions (UTRs). The ORF is a polyprotein that is processed into three structural and seven non-structural proteins. The UTRs have been shown to be important for viral replication and immune modulation. Both of these regions consist of elements that are essential for genome cyclization, resulting in initiation of RNA synthesis. Genome mutation studies have been employed to investigate each component of the essential elements to show the necessity of each component and its role in viral RNA replication and growth. Furthermore, the highly structured 3′UTR is responsible for the generation of subgenomic flavivirus RNA (sfRNA) that helps the virus evade host immune response, thereby affecting viral pathogenesis. In addition, changes within the 3′UTR have been shown to affect transmissibility between vector and host, which can influence the development of vaccines.
The Complete Chloroplast and Mitochondrial Genome Sequences of Boea hygrometrica: Insights into the Evolution of Plant Organellar Genomes

Science.gov (United States)

Wang, Xumin; Deng, Xin; Zhang, Xiaowei; Hu, Songnian; Yu, Jun

2012-01-01

The complete nucleotide sequences of the chloroplast (cp) and mitochondrial (mt) genomes of the resurrection plant Boea hygrometrica (Bh, Gesneriaceae) have been determined with the lengths of 153,493 bp and 510,519 bp, respectively. The smaller chloroplast genome contains more genes (147) with a 72% coding sequence, and the larger mitochondrial genome has fewer genes (65) with a coding fraction of 12%. Similar to other seed plants, the Bh cp genome has a typical quadripartite organization with a conserved gene in each region. The Bh mt genome has three recombinant sequence repeats of 222 bp, 843 bp, and 1474 bp in length, which divide the genome into a single master circle (MC) and four isomeric molecules. Compared to other angiosperms, one remarkable feature of the Bh mt genome is the frequent transfer of genetic material from the cp genome during recent Bh evolution. We also analyzed organellar genome evolution in general regarding genome features as well as compositional dynamics of sequence and gene structure/organization, providing clues for the understanding of the evolution of organellar genomes in plants. The cp-derived sequences including tRNAs found in angiosperm mt genomes support the conclusion that frequent gene transfer events may have begun early in the land plant lineage. PMID:22291979
Preliminary assessment of the state of CO2 soil degassing on the flanks of Gede volcano (West Java, Indonesia)

Science.gov (United States)

Kunrat, S. L.; Schwandner, F. M.

2013-12-01

Gede Volcano (West Java) is part of an andesitic stratovolcano complex consisting of Pangrango in the north-west and Gede in the south-east. The last recorded eruptive activity was a phreatic subvolcanian ash eruption in 1957. Current activity is characterized by episodic swarms at 2-4 km depth, and low-temperature (~160°C) crater degassing in two distinct summit crater fumarolic areas. Hot springs occur in the saddle between the Gede and Pangrango edifices, as well as on the NE flank base. The most recent eruptive events produced pyroclastic material, whose flow deposits are concentrated toward the NE. A collaborative effort between the Center for Volcanology and Geological Hazard Mitigation (CVGHM), Geological Agency and the Earth Observatory of Singapore (EOS) has since 2010 aimed at upgrading the geophysical and geochemical monitoring network at Gede Volcano. To support the monitoring instrumentation upgrades under way, surveys of soil CO2 degassing have been performed on the flanks of Gede, in circular and radial traverses. The goal was to establish a spatial distribution of flank CO2 fluxes, and to allow smart siting for continuous gas monitoring stations. Crater fluxes were not surveyed, as its low-temperature hydrothermal system is likely prone to large hydraulic changes in this tropical environment, resulting in variable permeability effects that might mask signals from deeper reservoir or conduit degassing. The high precipitation intensity in the mountains of tropical Java poses challenges to this method, since soil gas permeability is largely controlled by soil moisture content. Simultaneous soil moisture measurements were undertaken. The soil CO2 surveys were carried out using a LI-8100A campaign flux chamber instrument (LICOR Biosciences, Lincoln, Nebraska). This instrument has a very precise and highly stable sensor and an atmospheric pressure equilibrator, making it highly sensitive to low fluxes. It is the far superior choice for higher precision low
Genetic analysis of environmental strains of the plant pathogen Phytophthora capsici reveals heterogeneous repertoire of effectors and possible effector evolution via genomic island.

Science.gov (United States)

Iribarren, María Josefina; Pascuan, Cecilia; Soto, Gabriela; Ayub, Nicolás Daniel

2015-11-01

Phytophthora capsici is a virulent oomycete pathogen of many vegetable crops. Recently, it has been demonstrated that the recognition of the RXLR effector AVR3a1 of P. capsici (PcAVR3a1) triggers a hypersensitive response and plays a critical role in mediating non-host resistance. Here, we analyzed the occurrence of PcAVR3a1 in 57 isolates of P. capsici derived from globe squash, eggplant, tomato and bell pepper cocultivated in a small geographical area. The occurrence of PcAVR3a1 in environmental strains of P. capsici was confirmed by PCR in only 21 of these pathogen isolates. To understand the presence-absence pattern of PcAVR3a1 in environmental strains, the flanking region of this gene was sequenced. PcAVR3a1 was found within a genetic element that we named PcAVR3a1-GI (PcAVR3a1 genomic island). PcAVR3a1-GI was flanked by a 22-bp direct repeat, which is related to its site-specific recombination site. In addition to the PcAVR3a1 gene, PcAVR3a1-GI also encoded a phage integrase probably associated with the excision and integration of this mobile element. Exposure to plant induced the presence of an episomal circular intermediate of PcAVR3a1-GI, indicating that this mobile element is functional. Collectively, these findings provide evidence of PcAVR3a1 evolution via mobile elements in environmental strains of Phytophthora.

Whole genome PCR scanning reveals the syntenic genome structure of toxigenic Vibrio cholerae strains in the O1/O139 population.

Directory of Open Access Journals (Sweden)

Bo Pang

Full Text Available Vibrio cholerae is commonly found in estuarine water systems. Toxigenic O1 and O139 V. cholerae strains have caused cholera epidemics and pandemics, whereas the nontoxigenic strains within these serogroups only occasionally lead to disease. To understand the differences in the genome and clonality between the toxigenic and nontoxigenic strains of V. cholerae serogroups O1 and O139, we employed a whole genome PCR scanning (WGPScanning) method, an rrn operon-mediated fragment rearrangement analysis and comparative genomic hybridization (CGH) to analyze the genome structure of different strains. WGPScanning in conjunction with CGH revealed that the genomic contents of the toxigenic strains were conserved, except for a few indels located mainly in mobile elements. Minor nucleotide variation in orthologous genes appeared to be the major difference between the toxigenic strains. rrn operon-mediated rearrangements were infrequent in El Tor toxigenic strains tested using I-CeuI digested pulsed-field gel electrophoresis (PFGE) analysis and PCR analysis based on flanking sequence of rrn operons. Using these methods, we found that the genomic structures of toxigenic El Tor and O139 strains were syntenic. The nontoxigenic strains exhibited more extensive sequence variations, but toxin coregulated pilus positive (TCP+) strains had a similar structure. TCP+ nontoxigenic strains could be subdivided into multiple lineages according to the TCP type, suggesting the existence of complex intermediates in the evolution of toxigenic strains. The data indicate that toxigenic O1 El Tor and O139 strains were derived from a single lineage of intermediates from complex clones in the environment. The nontoxigenic strains with non-El Tor type TCP may yet evolve into new epidemic clones after attaining toxigenic attributes.
Modeling the integration of bacterial rRNA fragments into the human cancer genome.

Science.gov (United States)

Sieber, Karsten B; Gajer, Pawel; Dunning Hotopp, Julie C

2016-03-21

Cancer is a disease driven by the accumulation of genomic alterations, including the integration of exogenous DNA into the human somatic genome. We previously identified in silico evidence of DNA fragments from a Pseudomonas-like bacterium integrating into the 5'-UTR of four proto-oncogenes in stomach cancer sequencing data. The functional and biological consequences of these bacterial DNA integrations remain unknown. Modeling of these integrations suggests that the previously identified sequences cover most of the sequence flanking the junction between the bacterial and human DNA. Further examination of these reads reveals that these integrations are rich in guanine nucleotides and the integrated bacterial DNA may have complex transcript secondary structures. The models presented here lay the foundation for future experiments to test if bacterial DNA integrations alter the transcription of the human genes.
Compartmentalization of the Coso East Flank geothermal field imaged by 3-D full-tensor MT inversion

Science.gov (United States)

Lindsey, Nathaniel J.; Kaven, Joern; Davatzes, Nicholas C.; Newman, Gregory A.

2017-01-01

Previous magnetotelluric (MT) studies of the high-temperature Coso geothermal system in California identified a subvertical feature of low resistivity (2–5 Ohm m) and appreciable lateral extent (>1 km) in the producing zone of the East Flank field. However, these models could not reproduce gross 3-D effects in the recorded data. We perform 3-D full-tensor inversion and retrieve a resistivity model that out-performs previous 2-D and 3-D off-diagonal models in terms of its fit to the complete 3-D MT data set as well as the degree of modelling bias. Inclusion of secondary Zxx and Zyy data components leads to a robust east-dip (60°) to the previously identified conductive East Flank reservoir feature, which correlates strongly with recently mapped surface faults, downhole well temperatures, 3-D seismic reflection data, and local microseismicity. We perform synthetic forward modelling to test the best-fit dip of this conductor using the response at a nearby MT station. We interpret the dipping conductor as a fractured and fluidized compartment, which is structurally controlled by an unmapped blind East Flank fault zone.
Refining borders of genome-rearrangements including repetitions

Directory of Open Access Journals (Sweden)

JA Arjona-Medina

2016-10-01

Full Text Available Background: DNA rearrangement events have been widely studied in comparative genomics for many years. The importance of these events lies not only in the study of relatedness among different species, but also in determining the mechanisms behind evolution. Although there are many methods to identify genome rearrangements (GR), the refinement of their borders has become a huge challenge. Until now no accepted method exists to achieve accurate fine-tuning: the notion of breakpoint (BP) is still an open issue, and although repeated regions are vital to understanding evolution, they are not taken into account in most GR detection and refinement methods. Methods and results: We propose a method to refine the borders of GR including repeated regions. Instead of removing these repetitions to facilitate computation, we take advantage of them using a consensus alignment sequence of the repeated region in between two blocks. Using the concept of identity vectors for Synteny Blocks (SB) and repetitions, a Finite State Machine is designed to detect transition points in the difference between such vectors. The method does not force the BP to be a region or a point but depends on the alignment transitions within the SBs and repetitions. Conclusion: The accurate definition of the borders of SB and repeated genomic regions, and consequently the detection of BP, might help to understand the evolutionary model of species. In this manuscript we present a new proposal for such a refinement. Features of the SB borders and BPs are different and fit with what is expected: SBs show more diversity in annotations, whereas BPs are short and richer in DNA replication and stress response annotations, which are strongly linked with rearrangements.
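The idea of a finite state machine running over identity vectors can be illustrated with a minimal sketch. This is not the authors' implementation: the window size, threshold, minimum run length, state names and function names below are all hypothetical choices made for illustration, and the identity vectors are assumed to be per-column 0/1 match indicators from two alignments (synteny block and repeat consensus).

```python
def sliding_identity(vec, window):
    """Mean identity in each sliding window over a 0/1 match vector."""
    running = sum(vec[:window])
    out = [running / window]
    for i in range(window, len(vec)):
        running += vec[i] - vec[i - window]
        out.append(running / window)
    return out


def find_transition(sb_identity, rep_identity, window=5, threshold=0.3, min_run=3):
    """Return the first window index where the repeat alignment takes over.

    Hypothetical two-state machine: it starts IN_BLOCK and moves to IN_REPEAT
    once the smoothed difference (block identity minus repeat identity) has
    stayed below -threshold for min_run consecutive windows.
    Returns None if no transition is detected.
    """
    sb = sliding_identity(sb_identity, window)
    rep = sliding_identity(rep_identity, window)
    state = "IN_BLOCK"
    run = 0
    for i, (a, b) in enumerate(zip(sb, rep)):
        if a - b < -threshold:
            run += 1
            if run >= min_run:
                state = "IN_REPEAT"
                return i - min_run + 1  # start of the run that triggered the switch
        else:
            run = 0
    assert state == "IN_BLOCK"
    return None


# Toy vectors: the block aligns well for 20 columns, then the repeat does.
sb_vec = [1] * 20 + [0] * 20
rep_vec = [0] * 20 + [1] * 20
transition = find_transition(sb_vec, rep_vec)
print(transition)
```

Because the transition is reported as a window index rather than forced onto a single base, the sketch mirrors the abstract's point that a breakpoint need not be predefined as a region or a point.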
The complete chloroplast genome of Capsicum annuum var. glabriusculum using Illumina sequencing.

Science.gov (United States)

Raveendar, Sebastin; Na, Young-Wang; Lee, Jung-Ro; Shim, Donghwan; Ma, Kyung-Ho; Lee, Sok-Young; Chung, Jong-Wook

2015-07-20

Chloroplast (cp) genome sequences provide a valuable source for DNA barcoding. Molecular phylogenetic studies have concentrated on DNA sequencing of conserved gene loci. However, this approach is time consuming and more difficult to implement when gene organization differs among species. Here we report the complete re-sequencing of the cp genome of Capsicum pepper (Capsicum annuum var. glabriusculum) using the Illumina platform. The total length of the cp genome is 156,817 bp with a 37.7% overall GC content. A pair of inverted repeats (IRs) of 50,284 bp were separated by a small single copy (SSC; 18,948 bp) and a large single copy (LSC; 87,446 bp). The number of cp genes in C. annuum var. glabriusculum is the same as that in other Capsicum species. Variations in the lengths of the LSC, SSC and IR regions were the main contributors to the size variation in the cp genome of this species. A total of 125 simple sequence repeats (SSRs) and 48 insertion or deletion variants were found by sequence alignment of Capsicum cp genomes. These findings provide a foundation for further investigation of cp genome evolution in Capsicum and other higher plants.
Association of the polymorphism in the 5' flanking region of the ovine ...

African Journals Online (AJOL)

The insulin-like growth factor 1 (IGF-I) gene has been described in several studies as a candidate gene for growth traits in farm animals. The present preliminary study attempts to establish associations between growth traits and genetic polymorphisms at the 5' flanking region of IGF-I in the Baluchi sheep. The DNA of 102 ...
Amyloidosis of the renal pelvis presenting as flank pain

Directory of Open Access Journals (Sweden)

Rachel Shikhman, D.O.

2018-02-01

Full Text Available Amyloidosis is a rare disease defined by accumulation of extracellular amyloid systemically or within a specific organ. Localized amyloidosis of the genitourinary system is extremely rare, with the predominant location being the bladder. The imaging findings are often nonspecific and mimic urothelial carcinoma. We present a 49-year-old woman with a chief complaint of flank pain. A filling defect was discovered on radiological imaging. The defect was subsequently biopsied and proven to be a primary amyloidosis of the renal pelvis. We then review the radiological findings of amyloidosis of the genitourinary system.
Harnessing CRISPR-Cas systems for bacterial genome editing.

Science.gov (United States)

Selle, Kurt; Barrangou, Rodolphe

2015-04-01

Manipulation of genomic sequences facilitates the identification and characterization of key genetic determinants in the investigation of biological processes. Genome editing via clustered regularly interspaced short palindromic repeats (CRISPR)-CRISPR-associated (Cas) constitutes a next-generation method for programmable and high-throughput functional genomics. CRISPR-Cas systems are readily reprogrammed to induce sequence-specific DNA breaks at target loci, resulting in fixed mutations via host-dependent DNA repair mechanisms. Although bacterial genome editing is a relatively unexplored and underrepresented application of CRISPR-Cas systems, recent studies provide valuable insights for the widespread future implementation of this technology. This review summarizes recent progress in bacterial genome editing and identifies fundamental genetic and phenotypic outcomes of CRISPR targeting in bacteria, in the context of tool development, genome homeostasis, and DNA repair.
Construction of a mutagenesis cartridge for poliovirus genome-linked viral protein: isolation and characterization of viable and nonviable mutants

International Nuclear Information System (INIS)

Kuhn, R.J.; Tada, H.; Ypma-Wong, M.F.; Dunn, J.J.; Semler, B.L.; Wimmer, E.

1988-01-01

By following a strategy of genetic analysis of poliovirus, the authors have constructed a synthetic mutagenesis cartridge spanning the genome-linked viral protein coding region and flanking cleavage sites in an infectious cDNA clone of the type I (Mahoney) genome. The insertion of new restriction sites within the infectious clone has allowed them to replace the wild-type sequences with short complementary pairs of synthetic oligonucleotides containing various mutations. A set of mutations have been made that create methionine codons within the genome-linked viral protein region. The resulting viruses have growth characteristics similar to wild type. Experiments that led to an alteration of the tyrosine residue responsible for the linkage to RNA have resulted in nonviable virus. In one mutant, proteolytic processing assayed in vitro appeared unimpaired by the mutation. They suggest that the position of the tyrosine residue is important for genome-linked viral protein function(s)
Repeated polyploidization of Gossypium genomes and the evolution of spinnable cotton fibres

Science.gov (United States)

Emergent phenotypes are common in polyploids relative to their diploid progenitors, a phenomenon exemplified by spinnable cotton fibers. Following 15-18 fold paleopolyploidy, allopolyploidy 1-2 million years ago reunited divergent Gossypium genomes, imparting new combinatorial complexity that might ...
The karyotype and 5S rRNA genes from Spanish individuals of the bat species Rhinolophus hipposideros (Rhinolophidae; Chiroptera).

Science.gov (United States)

Puerma, Eva; Acosta, Manuel J; Barragán, Maria José L; Martínez, Sergio; Marchal, Juan Alberto; Bullejos, Mónica; Sánchez, Antonio

2008-11-01

The karyotype of individuals of the species Rhinolophus hipposideros from Spain presents a chromosome number of 2n = 54 (NFa = 62). The described karyotype for these specimens is very similar to another previously described in individuals from Bulgaria. However, the presence of one additional pair of autosomal acrocentric chromosomes in the Bulgarian karyotype and the differences in X chromosome morphology indicated that we have described a new karyotype variant in this species. In addition, we have analyzed several clones of 1.4 and 1 kb of a PstI repeated DNA sequence from the genome of R. hipposideros. The repeated sequence included a region with high identity with the 5S rDNA genes and flanking regions, with no homology with GenBank sequences. Search for polymerase III regulatory elements demonstrated the presence of type I promoter elements (A-box, Intermediate Element and C-box) in the 5S rDNA region. In addition, upstream regulatory elements, such as a D-box and Sp1 binding sequences, were present in flanking regions. All data indicated that the cloned repeated sequences are the functional rDNA genes from this species. Finally, FISH demonstrated the presence of rDNA in nine chromosome pairs, which is surprising as most mammals have only one carrier chromosome pair.
Virulence factor rtx in Legionella pneumophila, evidence suggesting it is a modular multifunctional protein

Directory of Open Access Journals (Sweden)

Pelaz Carmen

2008-01-01

Full Text Available Background: The repeats in toxin (Rtx) are an important pathogenicity factor involved in host cell invasion by Legionella pneumophila and other pathogenic bacteria. Its role in escaping the host immune system and its cytotoxic activity are well known. Its repeated motifs and modularity make Rtx a multifunctional factor in pathogenicity. Results: The comparative analysis of the rtx gene among 6 strains of L. pneumophila showed modularity in their structures. Among the compared genomes, the N-terminal region of the protein presents highly dissimilar repeats with functionally similar domains. In contrast, the C-terminal region is maintained in a modular configuration, which supports its proposed role in adhesion and pore formation. Despite the variability of rtx among the considered strains, the flanking genes are maintained in synteny and similarity. Conclusion: In contrast to the extracellular bacterium Vibrio cholerae, in which the rtx gene is highly conserved and flanking genes have lost synteny and similarity, the gene region coding for the Rtx toxin in the intracellular pathogen L. pneumophila shows rapid evolution. Changes in rtx could play a role in pathogenicity. The interplay of the Rtx toxin with host membranes might lead to the evolution of new variants that are able to escape host cell defences.
Application of response surface methodology on investigating flank wear in machining hardened steel using PVD TiN coated mixed ceramic insert

Directory of Open Access Journals (Sweden)

Ashok Kumar Sahoo

2013-10-01

Full Text Available The paper presents the development of a flank wear model in turning hardened EN 24 steel with a PVD TiN coated mixed ceramic insert under dry environment. The paper also investigates the effect of process parameters on flank wear (VBc). The experiments have been conducted using a three-level full factorial design. The machinability model has been developed in terms of cutting speed (v), feed (f) and machining time (t) as input variables using response surface methodology. The adequacy of the model has been checked using correlation coefficients. The determination coefficient R2 (98%) of the developed model is high, indicating that the response model fits the actual data well. In addition, residuals of the normal probability plot lie reasonably close to a straight line, showing that the terms mentioned in the model are statistically significant. The predicted flank wear has been found to lie close to the experimental value. This indicates that the developed model can be effectively used to predict the flank wear in hard turning. Abrasion and diffusion have been found to be the dominant wear mechanisms in machining hardened steel from SEM micrographs at the highest parametric range. Machining time has been found to be the most significant parameter on flank wear, followed by cutting speed and feed, as observed from the main effect plot and ANOVA study.
The Complete Chloroplast Genome Sequences of the Medicinal Plant Forsythia suspensa (Oleaceae)
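As a rough illustration of how a second-order response surface can be fitted to a three-level full factorial design in three factors, the sketch below uses ordinary least squares in plain Python. It is only a sketch under stated assumptions: the coded levels (-1, 0, 1), the coefficient values and the generated responses are made up for demonstration and are not the paper's data, and the paper does not state which software or model terms were used.

```python
from itertools import product


def quadratic_terms(v, f, t):
    """Design-matrix row for a full second-order model in three coded factors."""
    return [1.0, v, f, t, v * f, v * t, f * t, v * v, f * f, t * t]


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [rhs] for row, rhs in zip(A, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(M[r][c] * x[c] for c in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x


def fit_rsm(runs, response):
    """Least-squares coefficients via the normal equations (X'X) b = X'y."""
    X = [quadratic_terms(*run) for run in runs]
    p = len(X[0])
    XtX = [[sum(row[i] * row[j] for row in X) for j in range(p)] for i in range(p)]
    Xty = [sum(row[i] * y for row, y in zip(X, response)) for i in range(p)]
    return solve(XtX, Xty)


def r_squared(runs, response, coef):
    """Coefficient of determination of the fitted model."""
    pred = [sum(c * x for c, x in zip(coef, quadratic_terms(*run))) for run in runs]
    mean = sum(response) / len(response)
    ss_res = sum((y - p) ** 2 for y, p in zip(response, pred))
    ss_tot = sum((y - mean) ** 2 for y in response)
    return 1.0 - ss_res / ss_tot


# 3^3 = 27 runs in coded units; factor order is (speed v, feed f, time t).
runs = list(product([-1, 0, 1], repeat=3))
# Made-up coefficients used only to generate synthetic flank-wear values.
true_coef = [0.20, 0.05, 0.02, 0.08, 0.01, 0.03, 0.0, 0.02, 0.0, 0.01]
response = [sum(c * x for c, x in zip(true_coef, quadratic_terms(*r))) for r in runs]

coef = fit_rsm(runs, response)
print([round(c, 3) for c in coef], round(r_squared(runs, response, coef), 4))
```

With real measurements the responses would carry noise, so R2 would fall below 1 and an ANOVA on the fitted terms would be needed to judge which factors matter most.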

Directory of Open Access Journals (Sweden)

Wenbin Wang

2017-10-01

Full Text Available Forsythia suspensa is an important medicinal plant and traditionally applied for the treatment of inflammation, pyrexia, gonorrhea, diabetes, and so on. However, there is limited sequence and genomic information available for F. suspensa. Here, we produced the complete chloroplast genomes of F. suspensa using Illumina sequencing technology. F. suspensa is the first sequenced member within the genus Forsythia (Oleaceae). The gene order and organization of the chloroplast genome of F. suspensa are similar to other Oleaceae chloroplast genomes. The F. suspensa chloroplast genome is 156,404 bp in length, exhibits a conserved quadripartite structure with a large single-copy (LSC; 87,159 bp) region, and a small single-copy (SSC; 17,811 bp) region interspersed between inverted repeat (IRa/b; 25,717 bp) regions. A total of 114 unique genes were annotated, including 80 protein-coding genes, 30 tRNA, and four rRNA. The low GC content (37.8%) and codon usage bias for A- or T-ending codons may largely affect gene codon usage. Sequence analysis identified a total of 26 forward repeats, 23 palindrome repeats with lengths >30 bp (identity > 90%), and 54 simple sequence repeats (SSRs) with an average rate of 0.35 SSRs/kb. We predicted 52 RNA editing sites in the chloroplast of F. suspensa, all for C-to-U transitions. IR expansion or contraction and the divergent regions were analyzed among several species including the reported F. suspensa in this study. Phylogenetic analysis based on whole-plastome revealed that F. suspensa, as a member of the Oleaceae family, diverged relatively early from Lamiales. This study will contribute to strengthening medicinal resource conservation, molecular phylogenetic, and genetic engineering research investigations of this species.
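The region lengths reported for F. suspensa can be cross-checked with simple arithmetic, since a quadripartite plastome is one LSC, one SSC and two IR copies. The short sketch below uses only the figures quoted in the abstract; the variable names are illustrative.

```python
# Region lengths in bp, as quoted in the abstract.
lsc_bp = 87_159
ssc_bp = 17_811
ir_bp = 25_717      # length of each inverted repeat copy (IRa and IRb)
ssr_count = 54

# Quadripartite structure: LSC + SSC + two IR copies.
total_bp = lsc_bp + ssc_bp + 2 * ir_bp

# SSR density per kilobase of genome.
ssr_per_kb = ssr_count / (total_bp / 1000)

print(total_bp, round(ssr_per_kb, 2))
```

The sum reproduces the reported genome length of 156,404 bp, and the density rounds to the reported 0.35 SSRs/kb.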
Mechanism of Repeat-Associated MicroRNAs in Fragile X Syndrome

Directory of Open Access Journals (Sweden)

Karen Kelley

2012-01-01

Full Text Available The majority of the human genome is comprised of non-coding DNA, which frequently contains redundant microsatellite-like trinucleotide repeats. Many of these trinucleotide repeats are involved in triplet repeat expansion diseases (TREDs) such as fragile X syndrome (FXS). After transcription, the trinucleotide repeats can fold into RNA hairpins and are further processed by Dicer endoribonucleases to form microRNA (miRNA)-like molecules that are capable of triggering targeted gene-silencing effects in the TREDs. However, the function of these repeat-associated miRNAs (ramRNAs) is unclear. To address this question, we identified the first native ramRNA in FXS and successfully developed a transgenic zebrafish model for studying its function. Our studies showed that ramRNA-induced DNA methylation of the FMR1 5′-UTR CGG trinucleotide repeat expansion is responsible for both pathological and neurocognitive characteristics linked to the transcriptional FMR1 gene inactivation and the deficiency of its protein product FMRP. FMRP deficiency often causes synapse deformity in the neurons essential for cognition and memory activities, while FMR1 inactivation augments metabotropic glutamate receptor (mGluR)-activated long-term depression (LTD), leading to abnormal neuronal responses in FXS. Using this novel animal model, we may further dissect the etiological mechanisms of TREDs, with the hope of providing insights into new means for therapeutic intervention.
Initial genomics of the human nucleolus.

Directory of Open Access Journals (Sweden)

Attila Németh

2010-03-01

Full Text Available We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD-localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD-specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture.
Initial Genomics of the Human Nucleolus

Science.gov (United States)

Németh, Attila; Conesa, Ana; Santoyo-Lopez, Javier; Medina, Ignacio; Montaner, David; Péterfia, Bálint; Solovei, Irina; Cremer, Thomas; Dopazo, Joaquin; Längst, Gernot

2010-01-01

We report for the first time the genomics of a nuclear compartment of the eukaryotic cell. 454 sequencing and microarray analysis revealed the pattern of nucleolus-associated chromatin domains (NADs) in the linear human genome and identified different gene families and certain satellite repeats as the major building blocks of NADs, which constitute about 4% of the genome. Bioinformatic evaluation showed that NAD–localized genes take part in specific biological processes, like the response to other organisms, odor perception, and tissue development. 3D FISH and immunofluorescence experiments illustrated the spatial distribution of NAD–specific chromatin within interphase nuclei and its alteration upon transcriptional changes. Altogether, our findings describe the nature of DNA sequences associated with the human nucleolus and provide insights into the function of the nucleolus in genome organization and establishment of nuclear architecture. PMID:20361057
Rapid and highly efficient construction of TALE-based transcriptional regulators and nucleases for genome modification

KAUST Repository

Li, Lixin

2012-01-22

Transcription activator-like effectors (TALEs) can be used as DNA-targeting modules by engineering their repeat domains to dictate user-selected sequence specificity. TALEs have been shown to function as site-specific transcriptional activators in a variety of cell types and organisms. TALE nucleases (TALENs), generated by fusing the FokI cleavage domain to TALE, have been used to create genomic double-strand breaks. The identity of the TALE repeat variable di-residues, their number, and their order dictate the DNA sequence specificity. Because TALE repeats are nearly identical, their assembly by cloning or even by synthesis is challenging and time consuming. Here, we report the development and use of a rapid and straightforward approach for the construction of designer TALE (dTALE) activators and nucleases with user-selected DNA target specificity. Using our plasmid set of 100 repeat modules, researchers can assemble repeat domains for any 14-nucleotide target sequence in one sequential restriction-ligation cloning step and in only 24 h. We generated several custom dTALEs and dTALENs with new target sequence specificities and validated their function by transient expression in tobacco leaves and in vitro DNA cleavage assays, respectively. Moreover, we developed a web tool, called idTALE, to facilitate the design of dTALENs and the identification of their genomic targets and potential off-targets in the genomes of several model species. Our dTALE repeat assembly approach along with the web tool idTALE will expedite genome-engineering applications in a variety of cell types and organisms including plants.
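How the order of repeat variable di-residues (RVDs) encodes a DNA target can be shown with a small lookup sketch. It assumes the commonly cited one-to-one RVD code (NI for A, HD for C, NN for G, NG for T); real designs may use other RVDs, NN is also reported to bind A, and neither the function name nor the example target below comes from the paper or from the idTALE tool.

```python
# Commonly cited RVD code; an assumption for illustration, not the paper's module set.
RVD_FOR_BASE = {"A": "NI", "C": "HD", "G": "NN", "T": "NG"}


def rvds_for_target(target):
    """Return the ordered list of RVDs for a DNA target, one repeat per base."""
    target = target.upper()
    unknown = set(target) - set(RVD_FOR_BASE)
    if unknown:
        raise ValueError(f"unsupported bases: {sorted(unknown)}")
    return [RVD_FOR_BASE[base] for base in target]


# Hypothetical 14-nucleotide target, matching the repeat-domain length in the abstract.
example_target = "ACGTTGCAACGGTC"
print(rvds_for_target(example_target))
```

Each list element corresponds to one repeat module, so a 14-nucleotide target translates into 14 repeats assembled in the same order as the bases.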
Complete Sequence and Analysis of Coconut Palm (Cocos nucifera) Mitochondrial Genome.

Science.gov (United States)

Aljohi, Hasan Awad; Liu, Wanfei; Lin, Qiang; Zhao, Yuhui; Zeng, Jingyao; Alamer, Ali; Alanazi, Ibrahim O; Alawad, Abdullah O; Al-Sadi, Abdullah M; Hu, Songnian; Yu, Jun

2016-01-01

Coconut (Cocos nucifera L.), a member of the palm family (Arecaceae), is one of the most economically important crops in the tropics, serving as an important source of food, drink, fuel, medicine, and construction material. Here we report an assembly of the coconut (C. nucifera, Oman local Tall cultivar) mitochondrial (mt) genome based on next-generation sequencing data. This genome, 678,653 bp in length and 45.5% in GC content, encodes 72 proteins, 9 pseudogenes, 23 tRNAs, and 3 ribosomal RNAs. Within the assembly, we find that the chloroplast (cp) derived regions account for 5.07% of the total assembly length, including 13 proteins, 2 pseudogenes, and 11 tRNAs. The mt genome has a relatively large fraction of repeat content (17.26%), including both forward (tandem) and inverted (palindromic) repeats. Sequence variation analysis shows that the Ti/Tv ratio of the mt genome is lower than that of the nuclear genome and the neutral expectation. By combining public RNA-Seq data for coconut, we identify 734 RNA editing sites supported by at least two datasets. In summary, our data provide the second complete mt genome sequence in the family Arecaceae, essential for further investigations on mitochondrial biology of seed plants.
Multiple-locus variable-number tandem repeat analysis of Neisseria meningitidis yields groupings similar to those obtained by multilocus sequence typing.

NARCIS (Netherlands)

Schouls, Leo M; Ende, Arie van der; Damen, Marjolein; Pol, Ingrid van de

2006-01-01

We identified many variable-number tandem repeat (VNTR) loci in the genomes of Neisseria meningitidis serogroups A, B, and C and utilized a number of these loci to develop a multiple-locus variable-number tandem repeat analysis (MLVA). Eighty-five N. meningitidis serogroup B and C isolates obtained

Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

Science.gov (United States)

Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

2013-01-01

DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298
The Genome of the Chicken DT40 Bursal Lymphoma Cell Line

DEFF Research Database (Denmark)

Molnar, Janos; Poti, Adam; Pipek, Orsolya

2014-01-01

The chicken DT40 cell line is a widely used model system in the study of multiple cellular processes due to the efficiency of homologous gene targeting. The cell line was derived from a bursal lymphoma induced by avian leukosis virus infection. In this study we characterized the genome of the cell...... chicken genomes and the Gallus gallus reference genome, we found no unique mutational processes shaping the DT40 genome except for a mild increase in insertion and deletion events, particularly deletions at tandem repeats. We mapped coding sequence mutations that are unique to the DT40 genome; mutations...
Complete Mitochondrial Genome of the Medicinal Mushroom Ganoderma lucidum

Science.gov (United States)

Chen, Haimei; Chen, Xiangdong; Lan, Jin; Liu, Chang

2013-01-01

Ganoderma lucidum is one of the well-known medicinal basidiomycetes worldwide. The mitochondrion, referred to as the second genome, is an organelle found in most eukaryotic cells and participates in critical cellular functions. Elucidating the structure and function of this genome is important to understand completely the genetic contents of G. lucidum. In this study, we assembled the mitochondrial genome of G. lucidum and analyzed the differential expressions of its encoded genes across three developmental stages. The mitochondrial genome is a typical circular DNA molecule of 60,630 bp with a GC content of 26.67%. Genome annotation identified genes that encode 15 conserved proteins, 27 tRNAs, small and large rRNAs, four homing endonucleases, and two hypothetical proteins. Except for genes encoding trnW and two hypothetical proteins, all genes were located on the positive strand. For the repeat structure analysis, eight forward, two inverted, and three tandem repeats were detected. A pair of fragments with a total length around 5.5 kb was found in both the nuclear and mitochondrial genomes, which suggests the possible transfer of DNA sequences between two genomes. RNA-Seq data for samples derived from three stages, namely, mycelia, primordia, and fruiting bodies, were mapped to the mitochondrial genome and qualified. The protein-coding genes were expressed higher in mycelia or primordial stages compared with those in the fruiting bodies. The rRNA abundances were significantly higher in all three stages. Two regions were transcribed but did not contain any identified protein or tRNA genes. Furthermore, three RNA-editing sites were detected. Genome synteny analysis showed that significant genome rearrangements occurred in the mitochondrial genomes. This study provides valuable information on the gene contents of the mitochondrial genome and their differential expressions at various developmental stages of G. lucidum. The results contribute to the understanding of the
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome

Directory of Open Access Journals (Sweden)

González Leonardo Galindo

2012-11-01

Background: Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Results: Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TE coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. Conclusions: The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include
Identification, characterization and distribution of transposable elements in the flax (Linum usitatissimum L.) genome.

Science.gov (United States)

González, Leonardo Galindo; Deyholos, Michael K

2012-11-21

Flax (Linum usitatissimum L.) is an important crop for the production of bioproducts derived from its seed and stem fiber. Transposable elements (TEs) are widespread in plant genomes and are a key component of their evolution. The availability of a genome assembly of flax (Linum usitatissimum) affords new opportunities to explore the diversity of TEs and their relationship to genes and gene expression. Four de novo repeat identification algorithms (PILER, RepeatScout, LTR_finder and LTR_STRUC) were applied to the flax genome assembly. The resulting library of flax repeats was combined with the RepBase Viridiplantae division and used with RepeatMasker to identify TEs coverage in the genome. LTR retrotransposons were the most abundant TEs (17.2% genome coverage), followed by Long Interspersed Nuclear Element (LINE) retrotransposons (2.10%) and Mutator DNA transposons (1.99%). Comparison of putative flax TEs to flax transcript databases indicated that TEs are not highly expressed in flax. However, the presence of recent insertions, defined by 100% intra-element LTR similarity, provided evidence for recent TE activity. Spatial analysis showed TE-rich regions, gene-rich regions as well as regions with similar genes and TE density. Monte Carlo simulations for the 71 largest scaffolds (≥ 1 Mb each) did not show any regional differences in the frequency of TE overlap with gene coding sequences. However, differences between TE superfamilies were found in their proximity to genes. Genes within TE-rich regions also appeared to have lower transcript expression, based on EST abundance. When LTR elements were compared, Copia showed more diversity, recent insertions and conserved domains than the Gypsy, demonstrating their importance in genome evolution. The calculated 23.06% TE coverage of the flax WGS assembly is at the low end of the range of TE coverages reported in other eudicots, although this estimate does not include TEs likely found in unassembled repetitive regions of
Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

Science.gov (United States)

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. Repeat-related structures are among the more important structures in a DNA sequence. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cerevisiae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale on a PC.
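The abstract does not give the authors' exact definition of a recurrence time, so the following is only a simplified sketch of the general idea: for every k-mer, record the distance to its previous occurrence and tally those distances. Peaks in the resulting histogram would then hint at periodicities such as a tandem repeat unit length. The function name and the choice of k are assumptions.

```python
from collections import Counter


def recurrence_times(seq, k=3):
    """Histogram of distances between successive occurrences of each k-mer.

    Simplified illustration; not the published algorithm.
    """
    last_seen = {}   # k-mer -> start index of its most recent occurrence
    hist = Counter()
    for i in range(len(seq) - k + 1):
        word = seq[i:i + k]
        if word in last_seen:
            hist[i - last_seen[word]] += 1
        last_seen[word] = i
    return hist
```

On a perfect tandem array, every recurrence distance equals the repeat unit length, so the histogram collapses to a single peak at that period.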
Distinguishing friends, foes, and freeloaders in giant genomes.

Science.gov (United States)

Bennetzen, Jeffrey L; Park, Minkyu

2018-04-01

Most annotations of large eukaryotic genomes initially find transposable elements (TEs) and other repeats, then mask them so that subsequent efforts can be concentrated on the annotation and study of non-TE genes. However, TEs often contribute to host biology, and their community biologies are of intrinsic interest. This review discusses the challenges, rationale and technologies for comprehensive TE annotation in the commonly giant genomes of animals and plants. Complete discovery of the TEs in a fully sequenced genome is laborious, but feasible, with current strategies in the hands of a careful researcher. These deep TE studies have begun to provide important perspectives on how genomes evolve and the degree to which genome changes do and do not affect eukaryotic biology. Copyright © 2018 The Authors. Published by Elsevier Ltd. All rights reserved.
Genomic rearrangements of PTEN in prostate cancer

Directory of Open Access Journals (Sweden)

Sopheap Phin

2013-09-01

The phosphatase and tensin homolog gene on chromosome 10q23.3 (PTEN) is a negative regulator of the PI3K/Akt survival pathway and is the most frequently deleted tumor suppressor gene in prostate cancer. Monoallelic loss of PTEN is present in up to 60% of localized prostate cancers, and complete loss of PTEN in prostate cancer is linked to metastasis and androgen-independent progression. Studies on the genomic status of PTEN in prostate cancer initially used a two-color fluorescence in-situ hybridization (FISH) assay for PTEN copy number detection in formalin-fixed paraffin-embedded tissue preparations. More recently, a four-color FISH assay containing two additional control probes flanking the PTEN locus with a lower false-positive rate was reported. Combined with the detection of other critical genomic biomarkers for prostate cancer such as ERG, AR, and MYC, the evaluation of PTEN genomic status has proven to be invaluable for patient stratification and management. Although less frequent than allelic deletions, point mutations in the gene and epigenetic silencing are also known to contribute to loss of PTEN function, and ultimately to prostate cancer initiation. Overall, it is clear that PTEN is a powerful biomarker for prostate cancer. Used as a companion diagnostic for emerging therapeutic drugs, FISH analysis of PTEN holds promise for moving prostate cancer care toward more effective management and therapies.
A complete mitochondrial genome sequence of Asian black bear Sichuan subspecies (Ursus thibetanus mupinensis)

Science.gov (United States)

Hou, Wan-ru; Chen, Yu; Wu, Xia; Hu, Jin-chu; Peng, Zheng-song; Yang, Jung; Tang, Zong-xiang; Zhou, Cai-Quan; Li, Yu-ming; Yang, Shi-kui; Du, Yu-jie; Kong, Ling-lu; Ren, Zheng-long; Zhang, Huai-yu; Shuai, Su-rong

2007-01-01

We obtained the complete mitochondrial genome of U. thibetanus mupinensis by sequencing PCR fragments amplified with 18 primers that we designed. The results indicate that the mtDNA is 16,868 bp in size and encodes 13 protein genes, 22 tRNA genes, and 2 rRNA genes, with an overall H-strand base composition of 31.2% A, 25.4% C, 15.5% G and 27.9% T. The control region (CR), located between tRNA-Pro and tRNA-Phe, is 1,422 bp in size, constitutes 8.43% of the whole genome, has a GC content of 51.9%, and contains a 6 bp tandem repeat and two 10 bp tandem repeats identified using the Tandem Repeats Finder. The U. thibetanus mupinensis mitochondrial genome shares high similarity with those of three other Ursidae: U. americanus (91.46%), U. arctos (89.25%) and U. maritimus (87.66%). PMID:17205108
The complete mitochondrial genome of Gossypium hirsutum and evolutionary analysis of higher plant mitochondrial genomes.

Science.gov (United States)

Liu, Guozheng; Cao, Dandan; Li, Shuangshuang; Su, Aiguo; Geng, Jianing; Grover, Corrinne E; Hu, Songnian; Hua, Jinping

2013-01-01

Mitochondria are the main producers of cellular ATP in eukaryotes. The plant mitochondrial genome contains a large amount of foreign DNA and repeated sequences that undergo frequent intramolecular recombination. Upland cotton (Gossypium hirsutum L.) is one of the main natural fiber crops and also an important oil-producing plant. Sequencing the cotton mitochondrial (mt) genome could aid research on the evolution of plant mt genomes. We sequenced the G. hirsutum mt genome using 454 technology combined with screening of a fosmid library and sequencing of positive clones, and conducted a series of evolutionary analyses on the mt genomes of Cycas taitungensis and 24 angiosperms. After data assembly and contig joining, the complete mitochondrial genome sequence of G. hirsutum was obtained. The completed G. hirsutum mt genome is 621,884 bp in length and contains 68 genes, including 35 protein genes, four rRNA genes and 29 tRNA genes. Five gene clusters are conserved in all plant mt genomes; one and four clusters are specifically conserved in monocots and dicots, respectively. Homologous sequences are distributed along the plant mt genomes, and closely related species share the most homologous sequences. For species with both mt and chloroplast genome sequences available, we checked the locations of cp-like sequences that had migrated into the mt genome and found several fragments closely linked with mitochondrial genes. The G. hirsutum mt genome possesses most of the common characteristics of higher plant mt genomes. The existence of syntenic gene clusters, as well as the conservation of some intergenic sequences and genic content among the plant mt genomes, suggests that the evolution of mt genomes is consistent with plant taxonomy but independent among different species.
Detection in a Japanese population of a length polymorphism in the 5' flanking region of the human β-globin gene with denaturing gradient gel electrophoresis

International Nuclear Information System (INIS)

Takahashi, Noria; Hiyama, Keiko; Kodaira, Mieko; Satoh, Chiyoko

1992-10-01

An analysis of the ATTTT repeat polymorphism located approximately 1,400 base pairs upstream from the β-globin structural gene was carried out by denaturing gradient gel electrophoresis (DGGE) of RNA:DNA duplexes. Genomic or cloned DNAs were digested with restriction enzymes and hybridized with 32P-labeled RNA probes, and resulting RNA:DNA duplexes were examined by DGGE. A difference in the number of repeat units was recognized by differences in duplex mobility on the DGGE gel. In this study of 81 unrelated Japanese from Hiroshima, a sequence heteromorphism was observed at this site. Alleles with 5 and 6 repeats of the ATTTT unit, which had already been reported, were found in polymorphic proportions. In addition, two unreported alleles, one having 7 repeats and the other having an A-to-G nucleotide substitution in the 5th repeat, were detected. Family study data showed that the segregation of these four types of variants is consistent with an autosomal codominant mode of inheritance. This study also demonstrated that DGGE of RNA:DNA duplexes is a sensitive tool for detecting variations in DNA. (author)
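The study typed these alleles by gel mobility, not by computation. Purely to illustrate what distinguishes a 5-, 6- or 7-repeat allele from one carrying a substitution inside a repeat unit, the sketch below counts the units in the longest uninterrupted ATTTT run of a sequence string. The function name is hypothetical and any flanking bases supplied to it are invented.

```python
import re


def longest_run(seq, motif="ATTTT"):
    """Number of units in the longest uninterrupted tandem run of `motif`."""
    runs = re.findall(f"(?:{re.escape(motif)})+", seq.upper())
    return max((len(run) // len(motif) for run in runs), default=0)
```

An allele with six perfect units returns 6, whereas a substitution in one unit splits the array and shortens the longest uninterrupted run.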
Chloroplast genome of Aconitum barbatum var. puberulum (Ranunculaceae) derived from CCS reads using the PacBio RS platform.

Science.gov (United States)

Chen, Xiaochen; Li, Qiushi; Li, Ying; Qian, Jun; Han, Jianping

2015-01-01

The chloroplast genome (cp genome) of Aconitum barbatum var. puberulum was sequenced using the third-generation sequencing platform based on the single-molecule real-time (SMRT) sequencing approach. To our knowledge, this is the first reported complete cp genome of Aconitum, and we anticipate that it will have great value for phylogenetic studies of the Ranunculaceae family. In total, 23,498 CCS reads and 20,685,462 base pairs were generated, the mean read length was 880 bp, and the longest read was 2,261 bp. Genome coverage of 100% was achieved with a mean coverage of 132× and no gaps. The accuracy of the assembled genome is 99.973%; the assembly was validated using Sanger sequencing of six selected genes from the cp genome. The complete cp genome of A. barbatum var. puberulum is 156,749 bp in length, including a large single-copy region of 87,630 bp and a small single-copy region of 16,941 bp separated by two inverted repeats of 26,089 bp. The cp genome contains 130 genes, including 84 protein-coding genes, 34 tRNA genes and eight rRNA genes. Four forward, five inverted and eight tandem repeats were identified. According to the SSR analysis, the longest poly structure is a 20-T repeat. The results presented here will facilitate phylogenetic studies and molecular authentication of Aconitum.
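The figures reported in this abstract can be cross-checked with simple arithmetic: the large single-copy region, the small single-copy region and two copies of the inverted repeat should sum to the genome length, and total bases divided by read count should give the mean read length. The variable and function names below are assumptions for illustration; the numbers are copied from the abstract.

```python
# Values as reported in the abstract (bp).
LSC, SSC, IR = 87_630, 16_941, 26_089
TOTAL_BASES, N_READS = 20_685_462, 23_498


def quadripartite_length(lsc, ssc, ir):
    """Genome length implied by LSC + SSC + two inverted-repeat copies."""
    return lsc + ssc + 2 * ir


genome_length = quadripartite_length(LSC, SSC, IR)
mean_read_length = TOTAL_BASES / N_READS
```

Both checks agree with the reported 156,749 bp genome and a mean read length of about 880 bp.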
Rapid evolutionary change of common bean (Phaseolus vulgaris L) plastome, and the genomic diversification of legume chloroplasts

Directory of Open Access Journals (Sweden)

Dávila Guillermo

2007-07-01

Background: Fabaceae (legumes) is one of the largest families of flowering plants, and some members are important crops. In contrast to what we know about their great diversity or economic importance, our knowledge at the genomic level of chloroplast genomes (cpDNAs or plastomes) for these crops is limited. Results: We sequenced the complete genome of the common bean (Phaseolus vulgaris cv. Negro Jamapa) chloroplast. The plastome of P. vulgaris is a 150,285 bp circular molecule. It has gene content similar to that of other legume plastomes, but contains two pseudogenes, rpl33 and rps16. A distinct inversion occurred at the junction points of trnH-GUG/rpl14 and rps19/rps8, as in adzuki bean 1. These two pseudogenes and the inversion were confirmed in 10 varieties representing the two domestication centers of the bean. Genomic comparative analysis indicated that inversions generally occur in legume plastomes and the magnitude and localization of insertions/deletions (indels) also vary. The analysis of repeat sequences demonstrated that patterns and sequences of tandem repeats had an important impact on sequence diversification between legume plastomes and tandem repeats did not belong to dispersed repeats. Interestingly, P. vulgaris plastome had higher evolutionary rates of change on both genomic and gene levels than G. max, which could be the consequence of pressure from both mutation and natural selection. Conclusion: Legume chloroplast genomes are widely diversified in gene content, gene order, indel structure, abundance and localization of repetitive sequences, intracellular sequence exchange and evolutionary rates. The P. vulgaris plastome is a rapidly evolving genome.
Leakage of active crater lake brine through the north flank at Rincon de la Vieja volcano, northwest Costa Rica, and implications for crater collapse

Science.gov (United States)

Kempter, K.A.; Rowe, G.L.

2000-01-01

The Active Crater at Rincon de la Vieja volcano, Costa Rica, reaches an elevation of 1750 m and contains a warm, hyper-acidic crater lake that probably formed soon after the eruption of the Rio Blanco tephra deposit approximately 3500 years before present. The Active Crater is buttressed by volcanic ridges and older craters on all sides except the north, which dips steeply toward the Caribbean coastal plains. Acidic, above-ambient-temperature streams are found along the Active Crater's north flank at elevations between 800 and 1000 m. A geochemical survey of thermal and non-thermal waters at Rincon de la Vieja was done in 1989 to determine whether hyper-acidic fluids are leaking from the Active Crater through the north flank, affecting the composition of north-flank streams. Results of the water-chemistry survey reveal that three distinct thermal waters are found on the flanks of Rincon de la Vieja volcano: acid chloride-sulfate (ACS), acid sulfate (AS), and neutral chloride (NC) waters. The most extreme ACS water was collected from the crater lake that fills the Active Crater. Chemical analyses of the lake water reveal a hyper-acidic (pH ~ 0) chloride-sulfate brine with elevated concentrations of calcium, magnesium, aluminum, iron, manganese, copper, zinc, fluorine, and boron. The composition of the brine reflects the combined effects of magmatic degassing from a shallow magma body beneath the Active Crater, dissolution of andesitic volcanic rock, and evaporative concentration of dissolved constituents at above-ambient temperatures. Similar cation and anion enrichments are found in the above-ambient-temperature streams draining the north flank of the Active Crater. The pH of north-flank thermal waters ranges from 3.6 to 4.1, and their chloride:sulfate ratios (1.2-1.4) are a factor of two greater than that of the lake brine (0.60). The waters have an ACS composition that is quite different from the AS and NC thermal waters that occur along the southern flank of Rincon
CRISPR/Cas9 for Human Genome Engineering and Disease Research.

Science.gov (United States)

Xiong, Xin; Chen, Meng; Lim, Wendell A; Zhao, Dehua; Qi, Lei S

2016-08-31

The clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated 9 (Cas9) system, a versatile RNA-guided DNA targeting platform, has been revolutionizing our ability to modify, manipulate, and visualize the human genome, which greatly advances both biological research and therapeutics development. Here, we review the current development of CRISPR/Cas9 technologies for gene editing, transcription regulation, genome imaging, and epigenetic modification. We discuss the broad application of this system to the study of functional genomics, especially genome-wide genetic screening, and to therapeutics development, including establishing disease models, correcting defective genetic mutations, and treating diseases.
Proteus genomic island 1 (PGI1), a new resistance genomic island from two Proteus mirabilis French clinical isolates.

Science.gov (United States)

Siebor, Eliane; Neuwirth, Catherine

2014-12-01

To analyse the genetic environment of the antibiotic resistance genes in two clinical Proteus mirabilis isolates resistant to multiple antibiotics. PCR, gene walking and whole-genome sequencing were used to determine the sequence of the resistance regions, the surrounding genetic structure and the flanking chromosomal regions. A genomic island of 81.1 kb named Proteus genomic island 1 (PGI1) located at the 3'-end of trmE (formerly known as thdF) was characterized. The large MDR region of PGI1 (55.4 kb) included a class 1 integron (aadB and aadA2) and regions deriving from several transposons: Tn2 (blaTEM-135), Tn21, Tn6020-like transposon (aphA1b), a hybrid Tn502/Tn5053 transposon, Tn501, a hybrid Tn1696/Tn1721 transposon [tetA(A)] carrying a class 1 integron (aadA1) and Tn5393 (strA and strB). Several ISs were also present (IS4321, IS1R and IS26). The PGI1 backbone (25.7 kb) was identical to that identified in Salmonella Heidelberg SL476 and shared some identity with the Salmonella genomic island 1 (SGI1) backbone. An IS26-mediated recombination event caused the division of the MDR region into two parts separated by a large chromosomal DNA fragment of 197 kb, the right end of PGI1 and this chromosomal sequence being in inverse orientation. PGI1 is a new resistance genomic island from P. mirabilis belonging to the same island family as SGI1. The role of PGI1 in the spread of antimicrobial resistance genes among Enterobacteriaceae of medical importance needs to be evaluated. © The Author 2014. Published by Oxford University Press on behalf of the British Society for Antimicrobial Chemotherapy. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
Genomic hypomethylation in the human germline associates with selective structural mutability in the human genome.

Directory of Open Access Journals (Sweden)

Jian Li

The hotspots of structural polymorphisms and structural mutability in the human genome remain to be explained mechanistically. We examine associations of structural mutability with germline DNA methylation and with non-allelic homologous recombination (NAHR) mediated by low-copy repeats (LCRs). Combined evidence from four human sperm methylome maps, human genome evolution, structural polymorphisms in the human population, and previous genomic and disease studies consistently points to a strong association of germline hypomethylation and genomic instability. Specifically, methylation deserts, the ~1% fraction of the human genome with the lowest methylation in the germline, show a tenfold enrichment for structural rearrangements that occurred in the human genome since the branching of chimpanzee and are highly enriched for fast-evolving loci that regulate tissue-specific gene expression. Analysis of copy number variants (CNVs) from 400 human samples identified using a custom-designed array comparative genomic hybridization (aCGH) chip, combined with publicly available structural variation data, indicates that association of structural mutability with germline hypomethylation is comparable in magnitude to the association of structural mutability with LCR-mediated NAHR. Moreover, rare CNVs occurring in the genomes of individuals diagnosed with schizophrenia, bipolar disorder, and developmental delay and de novo CNVs occurring in those diagnosed with autism are significantly more concentrated within hypomethylated regions. These findings suggest a new connection between the epigenome, selective mutability, evolution, and human disease.
The uncharacterized gene 1700093K21Rik and flanking regions are correlated with reproductive isolation in the house mouse, Mus musculus.

Science.gov (United States)

Kass, David H; Janoušek, Václav; Wang, Liuyang; Tucker, Priscilla K

2014-06-01

Reproductive barriers exist between the house mouse subspecies, Mus musculus musculus and M. m. domesticus, members of the Mus musculus species complex, primarily as a result of hybrid male infertility, and a hybrid zone exists where their ranges intersect in Europe. Using single nucleotide polymorphisms (SNPs) diagnostic for the two taxa, the extent of introgression across the genome was previously compared in these hybrid populations. Sixty-nine of 1316 autosomal SNPs exhibited reduced introgression in two hybrid zone transects suggesting maladaptive interactions among certain loci. One of these markers is within a region on chromosome 11 that, in other studies, has been associated with hybrid male sterility of these subspecies. We assessed sequence variation in a 20 Mb region on chromosome 11 flanking this marker, and observed its inclusion within a roughly 150 kb stretch of DNA showing elevated sequence differentiation between the two subspecies. Four genes are associated with this genomic subregion, with two entirely encompassed. One of the two genes, the uncharacterized 1700093K21Rik gene, displays distinguishing features consistent with a potential role in reproductive isolation between these subspecies. Along with its expression specifically within spermatogenic cells, we present various sequence analyses that demonstrate a high rate of molecular evolution of this gene, as well as identify a subspecies amino acid variant resulting in a structural difference. Taken together, the data suggest a role for this gene in reproductive isolation.
SV40 host-substituted variants: a new look at the monkey DNA inserts and recombinant junctions.

Science.gov (United States)

Singer, Maxine; Winocour, Ernest

2011-04-10

The available monkey genomic data banks were examined in order to determine the chromosomal locations of the host DNA inserts in 8 host-substituted SV40 variant DNAs. Five of the 8 variants contained more than one linked monkey DNA insert per tandem repeat unit and in all cases but one, the 19 monkey DNA inserts in the 8 variants mapped to different locations in the monkey genome. The 50 parental DNAs (32 monkey and 18 SV40 DNA segments) which spanned the crossover and flanking regions that participated in monkey/monkey and monkey/SV40 recombinations were characterized by substantial levels of microhomology of up to 8 nucleotides in length; the parental DNAs also exhibited direct and inverted repeats at or adjacent to the crossover sequences. We discuss how the host-substituted SV40 variants arose and the nature of the recombination mechanisms involved. Copyright © 2011 Elsevier Inc. All rights reserved.
Southward flow on the western flank of the Florida Current

Science.gov (United States)

Soloviev, Alexander V.; Hirons, Amy; Maingot, Christopher; Dean, Cayla W.; Dodge, Richard E.; Yankovsky, Alexander E.; Wood, Jon; Weisberg, Robert H.; Luther, Mark E.; McCreary, Julian P.

2017-07-01

A suite of long-term in situ measurements in the Straits of Florida, including the ADCP bottom moorings at an 11-m isobath and 244-m isobath (Miami Terrace) and several ADCP ship transects, have revealed a remarkable feature of the ocean circulation - southward flow on the western, coastal flank of the Florida Current. We have observed three forms of the southward flow - a seasonally varying coastal countercurrent, an undercurrent jet attached to the Florida shelf, and an intermittent undercurrent on the Miami Terrace. According to a 13-year monthly climatology obtained from the near-shore mooring, the coastal countercurrent is a persistent feature from October through January. The southward flow in the form of an undercurrent jet attached to the continental slope was observed during five ship transects from April through September but was not observed during three transects in February, March, and November. This undercurrent jet is well mixed due to strong shear at its top associated with the northward direction of the surface flow (Florida Current) and friction at the bottom. At the same time, no statistically significant seasonal cycle has been observed in the undercurrent flow on the Miami Terrace. Theoretical considerations suggest that several processes could drive the southward current, including interaction between the Florida Current and the shelf, as well as forcing that is independent of the Florida Current. The exact nature of the southward flow on the western flank of the Florida Current is, however, unknown.

CRISPR-Cpf1: A New Tool for Plant Genome Editing

KAUST Repository

Zaidi, Syed Shan-e-Ali; Mahfouz, Magdy M.; Mansoor, Shahid

2017-01-01

Clustered regularly interspaced palindromic repeats (CRISPR)-CRISPR-associated proteins (CRISPR-Cas), a groundbreaking genome-engineering tool, has facilitated targeted trait improvement in plants. Recently, CRISPR-CRISPR from Prevotella and Francisella 1 (Cpf1) has emerged as a new tool for efficient genome editing, including DNA-free editing in plants, with higher efficiency, specificity, and potentially wider applications than CRISPR-Cas9.
Genomic dark matter: the reliability of short read mapping illustrated by the genome mappability score.

Science.gov (United States)

Lee, Hayan; Schatz, Michael C

2012-08-15

Genome resequencing and short read mapping are two of the primary tools of genomics and are used for many important applications. The current state of the art in mapping uses the quality values and mapping quality scores to evaluate the reliability of the mapping. These attributes, however, are assigned to individual reads and do not directly measure the problematic repeats across the genome. Here, we present the Genome Mappability Score (GMS) as a novel measure of the complexity of resequencing a genome. The GMS is a weighted probability that any read could be unambiguously mapped to a given position and thus measures the overall composition of the genome itself. We have developed the Genome Mappability Analyzer to compute the GMS of every position in a genome. It leverages the parallelism of cloud computing to analyze large genomes, and enabled us to identify the 5-14% of the human, mouse, fly and yeast genomes that are difficult to analyze with short reads. We examined the accuracy of the widely used BWA/SAMtools polymorphism discovery pipeline in the context of the GMS, and found that discovery errors are dominated by false negatives, especially in regions with poor GMS. These errors are fundamental to the mapping process and cannot be overcome by increasing coverage. As such, the GMS should be considered in every resequencing project to pinpoint the 'dark matter' of the genome, including known clinically relevant variations in these regions. The source code and profiles of several model organisms are available at http://gma-bio.sourceforge.net
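As a rough illustration of why repeats limit unambiguous mapping, the sketch below flags each k-mer start position in a toy sequence as unique or repeated. It is not the GMS or the Genome Mappability Analyzer, which weight simulated read alignments by mapping quality; it only counts exact forward-strand k-mer matches. The function names and the toy sequence are assumptions made for illustration.

```python
from collections import Counter


def kmer_uniqueness(genome: str, k: int) -> list[bool]:
    """For each k-mer start position, report whether that k-mer occurs exactly once.

    Simplified proxy only: exact matches, forward strand, no sequencing errors.
    """
    kmers = [genome[i:i + k] for i in range(len(genome) - k + 1)]
    counts = Counter(kmers)
    return [counts[kmer] == 1 for kmer in kmers]


def unique_fraction(genome: str, k: int) -> float:
    """Fraction of k-mer start positions whose k-mer is unique in the sequence."""
    flags = kmer_uniqueness(genome, k)
    return sum(flags) / len(flags) if flags else 0.0


# Hypothetical toy sequence: "ACGT" occurs twice, so two of seven 4-mers are ambiguous.
toy = "ACGTACGTTT"
print(kmer_uniqueness(toy, 4))
print(unique_fraction(toy, 4))
```

Longer k generally raises the unique fraction, which mirrors the abstract's point that the difficulty is a property of the genome and read length rather than of coverage.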
Genome-wide analysis of LTR-retrotransposons in oil palm.

Science.gov (United States)

Beulé, Thierry; Agbessi, Mawussé Dt; Dussert, Stephane; Jaligot, Estelle; Guyot, Romain

2015-10-15

The oil palm (Elaeis guineensis Jacq.) is a major cultivated crop and the world's largest source of edible vegetable oil. The genus Elaeis comprises two species: E. guineensis, the commercial African oil palm, and E. oleifera, which is used in oil palm genetic breeding. The recent publication of both the African oil palm genome assembly and the first draft sequence of its Latin American relative now allows us to tackle the challenge of understanding the genome composition, structure and evolution of these palm genomes through the annotation of their repeated sequences. In this study, we identified, annotated and compared Transposable Elements (TE) from the African and Latin American oil palms. In a first step, Transposable Element databases were built through de novo detection in both genome sequences, and the TE content of both genomes was then estimated. Putative full-length retrotransposons with Long Terminal Repeats (LTRs) were then further identified in the E. guineensis genome for characterization of their structural diversity, copy number and chromosomal distribution. Finally, their relative expression in several tissues was determined through in silico analysis of publicly available transcriptome data. Our results reveal a congruence in the transpositional history of LTR retrotransposons between E. oleifera and E. guineensis, especially the Sto-4 family. Also, we have identified and described 583 full-length LTR-retrotransposons in the Elaeis guineensis genome. Our work shows that these elements are most likely no longer mobile and that no recent insertion event has occurred. Moreover, the analysis of chromosomal distribution suggests a preferential insertion of Copia elements in gene-rich regions, whereas Gypsy elements appear to be evenly distributed throughout the genome. Considering the high proportion of LTR retrotransposons in the oil palm genome, our work will contribute to a greater understanding of their impact on genome organization and evolution.
Targeted Porcine Genome Engineering with TALENs

DEFF Research Database (Denmark)

Luo, Yonglun; Lin, Lin; Golas, Mariola Monika

2015-01-01

Genetically modified pigs are becoming an invaluable animal model for agricultural, pharmaceutical, and biomedical applications. Unlike traditional transgenesis, which is accomplished by randomly inserting an exogenous transgene cassette into the natural chromosomal context, targeted genome editing confers precise editing (e.g., mutations or indels) or insertion of a functional transgenic cassette to user-designed loci. Techniques for targeted genome engineering are growing dramatically and include, e.g., zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and clustered regularly interspaced short palindromic repeats (CRISPR)/CRISPR-associated (Cas) systems. These systems provide enormous potential applications. In this chapter, we review the use of TALENs for targeted genome editing with focus on their application in pigs. In addition, a brief protocol...
Superfamily of ankyrin repeat proteins in tomato.

Science.gov (United States)

Yuan, Xiaowei; Zhang, Shizhong; Qing, Xiaohe; Sun, Meihong; Liu, Shiyang; Su, Hongyan; Shu, Huairui; Li, Xinzheng

2013-07-10

The ankyrin repeat (ANK) protein family plays a crucial role in plant growth and development and in response to biotic and abiotic stresses. However, no detailed information concerning this family is available for tomato (Solanum lycopersicum) due to the limited information on whole genome sequences. In this study, we identified a total of 130 ANK genes in the tomato genome (SlANK), and these genes were distributed across all 12 chromosomes at various densities. Chromosomal localization of the SlANK genes indicated that 25 of them were involved in tandem duplications. Based on their domain composition, all of the SlANK proteins were grouped into 13 subgroups. A combined phylogenetic tree was constructed with the aligned SlANK protein sequences. This tree revealed that the SlANK proteins comprise five major groups. An analysis of the expression profiles of SlANK genes in tomato in different tissues and in response to stresses showed that the SlANK proteins play roles in plant growth, development and stress responses. To our knowledge, this is the first report of a genome-wide analysis of the tomato ANK gene family. This study provides valuable information regarding the classification and putative functions of SlANK genes in tomato. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.
MobilomeFINDER: web-based tools for in silico and experimental discovery of bacterial genomic islands

Science.gov (United States)

Ou, Hong-Yu; He, Xinyi; Harrison, Ewan M.; Kulasekara, Bridget R.; Thani, Ali Bin; Kadioglu, Aras; Lory, Stephen; Hinton, Jay C. D.; Barer, Michael R.; Rajakumar, Kumar

2007-01-01

MobilomeFINDER (http://mml.sjtu.edu.cn/MobilomeFINDER) is an interactive online tool that facilitates bacterial genomic island or ‘mobile genome’ (mobilome) discovery; it integrates the ArrayOme and tRNAcc software packages. ArrayOme utilizes a microarray-derived comparative genomic hybridization input data set to generate ‘inferred contigs’ produced by merging adjacent genes classified as ‘present’. Collectively these ‘fragments’ represent a hypothetical ‘microarray-visualized genome (MVG)’. ArrayOme permits recognition of discordances between physical genome and MVG sizes, thereby enabling identification of strains rich in microarray-elusive novel genes. Individual tRNAcc tools facilitate automated identification of genomic islands by comparative analysis of the contents and contexts of tRNA sites and other integration hotspots in closely related sequenced genomes. Accessory tools facilitate design of hotspot-flanking primers for in silico and/or wet-science-based interrogation of cognate loci in unsequenced strains and analysis of islands for features suggestive of foreign origins; island-specific and genome-contextual features are tabulated and represented in schematic and graphical forms. To date we have used MobilomeFINDER to analyse several Enterobacteriaceae, Pseudomonas aeruginosa and Streptococcus suis genomes. MobilomeFINDER enables high-throughput island identification and characterization through increased exploitation of emerging sequence data and PCR-based profiling of unsequenced test strains; subsequent targeted yeast recombination-based capture permits full-length sequencing and detailed functional studies of novel genomic islands. PMID:17537813
Development of novel simple sequence repeat markers in bitter gourd (Momordica charantia L.) through enriched genomic libraries and their utilization in analysis of genetic diversity and cross-species transferability.

Science.gov (United States)

Saxena, Swati; Singh, Archana; Archak, Sunil; Behera, Tushar K; John, Joseph K; Meshram, Sudhir U; Gaikwad, Ambika B

2015-01-01

Microsatellite or simple sequence repeat (SSR) markers are the preferred markers for genetic analyses of crop plants. The availability of a limited number of such markers in bitter gourd (Momordica charantia L.) necessitates the development and characterization of more SSR markers. These were developed from genomic libraries enriched for three dinucleotide, five trinucleotide, and two tetranucleotide core repeat motifs. Employing the strategy of polymerase chain reaction-based screening, the number of clones to be sequenced was reduced by 81 %, and 93.7 % of the sequenced clones contained microsatellite repeats. Unique primer pairs were designed for 160 microsatellite loci, and amplicons of expected length were obtained for 151 loci (94.4 %). Evaluation of diversity in 54 bitter gourd accessions at 51 loci indicated that 20 % of the loci were polymorphic, with polymorphic information content values ranging from 0.13 to 0.77. Fifteen Indian varieties were clearly distinguished, indicating the usefulness of the developed markers. Markers at 40 loci (78.4 %) were transferable to six species, viz. Momordica cymbalaria, Momordica subangulata subsp. renigera, Momordica balsamina, Momordica dioica, Momordica cochinchinensis, and Momordica sahyadrica. The microsatellite markers reported will be useful in various genetic and molecular genetic studies in bitter gourd, a cucurbit of immense nutritive, medicinal, and economic importance.
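The polymorphic information content (PIC) values quoted above are derived from allele frequencies at each locus. The abstract does not say which formula was used; the sketch below assumes the widely used definition of Botstein et al. (1980), PIC = 1 - Σ p_i² - Σ_{i<j} 2 p_i² p_j². The function name `pic` and the example frequencies are hypothetical.

```python
from itertools import combinations


def pic(allele_freqs: list[float]) -> float:
    """Polymorphic information content for one locus (Botstein et al. 1980 form).

    allele_freqs must sum to 1; this formula choice is an assumption,
    not something stated in the abstract.
    """
    if abs(sum(allele_freqs) - 1.0) > 1e-9:
        raise ValueError("allele frequencies must sum to 1")
    homozygosity = sum(p * p for p in allele_freqs)
    cross_term = sum(2 * (p * q) ** 2 for p, q in combinations(allele_freqs, 2))
    return 1.0 - homozygosity - cross_term


# Hypothetical loci: two equally frequent alleles, and a monomorphic locus.
print(pic([0.5, 0.5]))  # 0.375
print(pic([1.0]))       # 0.0

# The percentages reported in the abstract can be checked the same way.
print(round(100 * 151 / 160, 1), round(100 * 40 / 51, 1))
```

Under this definition a biallelic locus cannot exceed 0.375, so the reported maximum of 0.77 would imply several alleles at the most informative loci.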
Structure and possible function of a G-quadruplex in the long terminal repeat of the proviral HIV-1 genome.

Science.gov (United States)

De Nicola, Beatrice; Lech, Christopher J; Heddi, Brahim; Regmi, Sagar; Frasson, Ilaria; Perrone, Rosalba; Richter, Sara N; Phan, Anh Tuân

2016-07-27

The long terminal repeat (LTR) of the proviral human immunodeficiency virus (HIV)-1 genome is integral to virus transcription and host cell infection. The guanine-rich U3 region within the LTR promoter, previously shown to form G-quadruplex structures, represents an attractive target to inhibit HIV transcription and replication. In this work, we report the structure of a biologically relevant G-quadruplex within the LTR promoter region of HIV-1. The guanine-rich sequence designated LTR-IV forms a well-defined structure in physiological cationic solution. The nuclear magnetic resonance (NMR) structure of this sequence reveals a parallel-stranded G-quadruplex containing a single-nucleotide thymine bulge, which participates in a conserved stacking interaction with a neighboring single-nucleotide adenine loop. Transcription analysis in an HIV-1 replication-competent cell indicates that the LTR-IV region may act as a modulator of G-quadruplex formation in the LTR promoter. Consequently, the LTR-IV G-quadruplex structure presented within this work could represent a valuable target for the design of HIV therapeutics. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
Unenhanced helical CT in the investigation of acute flank pain

International Nuclear Information System (INIS)

Colistro, Robert; Torreggiani, William C.; Lyburn, Iain D.; Harris, Alison C.; Al-Nakshabandi, Nizar A.; Nicolaou, Savvas; Munk, Peter L.

2002-01-01

Unenhanced helical CT has emerged as the imaging technique of choice for the investigation of patients presenting with acute flank pain and suspected nephroureteric stone disease. There are several signs identifiable on unenhanced CT that support a diagnosis of stone disease. However, there are many pitfalls, that may confound a correct diagnosis. Some of the common pitfalls, together with methods to avoid such occurrences, will be discussed. A review of some of the common alternative diagnoses that may mimic the symptoms of nephroureteric stone disease is illustrated. Colistro, R. et al (2002)
Gene conversion homogenizes the CMT1A paralogous repeats

Directory of Open Access Journals (Sweden)

Hurles Matthew E

2001-12-01

Background: Non-allelic homologous recombination between paralogous repeats is increasingly being recognized as a major mechanism causing both pathogenic microdeletions and duplications, and structural polymorphism in the human genome. It has recently been shown empirically that gene conversion can homogenize such repeats, resulting in longer stretches of absolute identity that may increase the rate of non-allelic homologous recombination. Results: Here, a statistical test to detect gene conversion between pairs of non-coding sequences is presented. It is shown that the 24 kb Charcot-Marie-Tooth type 1A paralogous repeats (CMT1A-REPs) exhibit the imprint of gene conversion processes whilst control orthologous sequences do not. In addition, Monte Carlo simulations of the evolutionary divergence of the CMT1A-REPs, incorporating two alternative models for gene conversion, generate repeats that are statistically indistinguishable from the observed repeats. Bounds are placed on the rate of these conversion processes, with central values of 1.3 × 10⁻⁴ and 5.1 × 10⁻⁵ per generation for the alternative models. Conclusions: The evidence presented here suggests that gene conversion may have played an important role in the evolution of the CMT1A-REP paralogous repeats. The rates of these processes are such that it is probable that homogenized CMT1A-REPs are polymorphic within modern populations. Gene conversion processes are similarly likely to play an important role in the evolution of other segmental duplications and may influence the rate of non-allelic homologous recombination between them.
Flanking Magnitudes: Dissociation between Numerosity and Numerical Value in a Selective Attention Task

Science.gov (United States)

Naparstek, Sharon; Safadi, Ziad; Lichtenstein-Vidne, Limor; Henik, Avishai

2015-01-01

The current research examined whether peripherally presented numerical information can affect the speed of number processing. In 2 experiments, participants were presented with a target matrix flanked by a distractor matrix and were asked to perform a comparative judgment (i.e., decide whether the target was larger or smaller than the reference…
Targeted viral-mediated plant genome editing using crispr/cas9

KAUST Repository

Mahfouz, Magdy M.; Ali, Zahir

2015-01-01

The present disclosure provides a viral-mediated genome-editing platform that facilitates multiplexing, obviates stable transformation, and is applicable across plant species. The RNA2 genome of the tobacco rattle virus (TRV) was engineered to carry and systemically deliver guide RNA molecules into plants overexpressing Cas9 endonuclease. High genomic modification frequencies were observed in inoculated as well as systemic leaves, including the plant growing points. This system facilitates multiplexing and can lead to germinal transmission of the genomic modifications in the progeny, thereby obviating the requirements of repeated transformations and tissue culture. The editing platform of the disclosure is useful in plant genome engineering and applicable across plant species amenable to viral infections for agricultural biotechnology applications.
GAViT: Genome Assembly Visualization Tool for Short Read Data

Energy Technology Data Exchange (ETDEWEB)

Syed, Aijazuddin; Shapiro, Harris; Tu, Hank; Pangilinan, Jasmyn; Trong, Stephan

2008-03-14

It is a challenging job for genome analysts to accurately debug, troubleshoot, and validate genome assembly results. Genome analysts rely on visualization tools to help validate and troubleshoot assembly results, including such problems as mis-assemblies, low-quality regions, and repeats. Short read data adds further complexity and makes it extremely challenging for the visualization tools to scale and to view all needed assembly information. As a result, there is a need for a visualization tool that can scale to display assembly data from the new sequencing technologies. We present Genome Assembly Visualization Tool (GAViT), a highly scalable and interactive assembly visualization tool developed at the DOE Joint Genome Institute (JGI).
Next-generation sequencing detects repetitive elements expansion in giant genomes of annual killifish genus Austrolebias (Cyprinodontiformes, Rivulidae).

Science.gov (United States)

García, G; Ríos, N; Gutiérrez, V

2015-06-01

Among Neotropical fish fauna, the South American killifish genus Austrolebias (Cyprinodontiformes: Rivulidae) constitutes an excellent model to study the genomic evolutionary processes underlying speciation events. Recently, unusually large genome size has been described in 16 species of this genus, with an average DNA content of about 5.95 ± 0.45 pg per diploid cell (mean C-value of about 2.98 pg). In the present paper we explore the possible origin of this unparalleled genomic increase by means of comparative analysis of the repetitive components, using NGS (454-Roche) technology, in the smallest and largest Rivulidae genomes. Here, we provide the first annotated composition of Rivulidae repeated sequences and their relative repetitive fraction in both genomes. Remarkably, the genomic proportion of moderately repetitive DNA in the Austrolebias charrua genome (45%) is approximately twice that of the closely related Rivulinae taxon Cynopoecilus melanotaenia (25%). The present work provides evidence of the impact of repeat families that may have proliferated differentially among sublineages within the Rivulidae, explaining the large genome size differences accompanying differentiation and speciation events in this family.
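The relation between the diploid DNA content and the C-value quoted above is a halving, and a mass in picograms can be turned into an approximate base-pair count. The sketch below assumes the commonly used conversion of 1 pg ≈ 0.978 × 10⁹ bp, which is not stated in the abstract; the function and constant names are hypothetical.

```python
# Assumed conversion factor (not given in the abstract): 1 pg of DNA ~ 0.978e9 bp.
BP_PER_PG = 0.978e9


def haploid_c_value(diploid_pg: float) -> float:
    """Haploid (1C) DNA content in pg from the diploid (2C) content."""
    return diploid_pg / 2


def pg_to_bp(pg: float) -> float:
    """Approximate number of base pairs corresponding to a DNA mass in pg."""
    return pg * BP_PER_PG


c_value = haploid_c_value(5.95)          # about 2.98 pg, as reported
print(round(c_value, 3))
print(round(pg_to_bp(c_value) / 1e9, 2), "Gbp (approximate haploid genome size)")
```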
Sequencing of bovine herpesvirus 4 v.test strain reveals important genome features

Directory of Open Access Journals (Sweden)

Gillet Laurent

2011-08-01

Background: Bovine herpesvirus 4 (BoHV-4) is a useful model for the human pathogenic gammaherpesviruses Epstein-Barr virus and Kaposi's Sarcoma-associated Herpesvirus. Although genome manipulations of this virus have been greatly facilitated by the cloning of the BoHV-4 V.test strain as a Bacterial Artificial Chromosome (BAC), the lack of a complete genome sequence for this strain limits its experimental use. Methods: In this study, we have determined the complete sequence of BoHV-4 V.test strain by a pyrosequencing approach. Results: The long unique coding region (LUR) consists of 108,241 bp encoding at least 79 open reading frames and is flanked by several polyrepetitive DNA units (prDNA). As previously suggested, we showed that the prDNA unit located at the left prDNA-LUR junction (prDNA-G) differs from the other prDNA units (prDNA-inner). Namely, the prDNA-G unit lacks the conserved pac-2 cleavage and packaging signal in its right terminal region. Based on the mechanisms of cleavage and packaging of herpesvirus genomes, this feature implies that only genomes bearing left and right end prDNA units are encapsulated into virions. Conclusions: In this study, we have determined the complete genome sequence of the BAC-cloned BoHV-4 V.test strain and identified genome organization features that could be important in other herpesviruses.
Use of the p-SINE1-r2 in inferring evolutionary relationships of Thai rice varieties with AA genome

Directory of Open Access Journals (Sweden)

Preecha Prathepha

2006-01-01

In a previous study we described the prevalence and distribution in Thailand of the retroposon p-SINE1-r2, in intron 10 of the waxy gene in cultivated and wild rice with the AA genome. In this study, additional varieties of rice were collected and sequencing was used to further characterize p-SINE1-r2. The p-SINE1-r2 nucleotide sequences were about 125 bp long and flanked by identical direct repeats of a 14 bp sequence. These sequences were similar to the p-SINE1-r2 sequences found in Nipponbare, a rice strain discussed in a separate study. However, comparison of the 48 DNA sequences identified in this study revealed considerable dissimilarity within the p-SINE1-r2 nucleotide sequences, in the form of base substitution mutations. Phylogenetic relationships were inferred from the nucleotide sequences of these elements in cultivated rice (O. sativa) and wild rice (O. nivara). Rice accessions collected from the same geographical area were placed in the same clade. The phylogenetic tree supports the origin and distribution of these rice strains.
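The observation that the element is flanked by identical direct repeats can be checked mechanically once an insertion's coordinates are known. The sketch below is only an illustration of that check on a made-up sequence: the function name, coordinates, and toy sequence are assumptions, and it is not the procedure used in the study.

```python
from typing import Optional


def flanking_direct_repeat(seq: str, start: int, end: int, length: int) -> Optional[str]:
    """Return the direct repeat flanking the element seq[start:end], or None.

    Compares the `length` bases immediately upstream of `start` with the
    `length` bases immediately downstream of `end` (0-based, end-exclusive).
    """
    if start - length < 0 or end + length > len(seq):
        return None  # not enough flanking sequence to compare
    upstream = seq[start - length:start]
    downstream = seq[end:end + length]
    return upstream if upstream == downstream else None


# Hypothetical toy locus: a 5 bp direct repeat on both sides of an 8 bp element.
repeat, element = "ACGTT", "GGGGCCCC"
locus = "TT" + repeat + element + repeat + "AA"
print(flanking_direct_repeat(locus, 7, 15, 5))
```

For the element described above, the same call would use the insertion coordinates within the intron and a repeat length of 14.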
Analyses of charophyte chloroplast genomes help characterize the ancestral chloroplast genome of land plants.

Science.gov (United States)

Civaň, Peter; Foster, Peter G; Embley, Martin T; Séneca, Ana; Cox, Cymon J

2014-04-01

Despite the significance of the relationships between embryophytes and their charophyte algal ancestors in deciphering the origin and evolutionary success of land plants, few chloroplast genomes of the charophyte algae have been reconstructed to date. Here, we present new data for three chloroplast genomes of the freshwater charophytes Klebsormidium flaccidum (Klebsormidiophyceae), Mesotaenium endlicherianum (Zygnematophyceae), and Roya anglica (Zygnematophyceae). The chloroplast genome of Klebsormidium has a quadripartite organization with exceptionally large inverted repeat (IR) regions and, uniquely among streptophytes, has lost the rrn5 and rrn4.5 genes from the ribosomal RNA (rRNA) gene cluster operon. The chloroplast genome of Roya differs from other zygnematophycean chloroplasts, including the newly sequenced Mesotaenium, by having a quadripartite structure that is typical of other streptophytes. On the basis of the improbability of the novel gain of IR regions, we infer that the quadripartite structure has likely been lost independently in at least three zygnematophycean lineages, although the absence of the usual rRNA operonic synteny in the IR regions of Roya may indicate their de novo origin. Significantly, all zygnematophycean chloroplast genomes have undergone substantial genomic rearrangement, which may be the result of ancient retroelement activity evidenced by the presence of integrase-like and reverse transcriptase-like elements in the Roya chloroplast genome. Our results corroborate the close phylogenetic relationship between Zygnematophyceae and land plants and identify 89 protein-coding genes and 22 introns present in the chloroplast genome at the time of the evolutionary transition of plants to land, all of which can be found in the chloroplast genomes of extant charophytes.
Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis

DEFF Research Database (Denmark)

Nosek, J.; Novotna, M.; Hlavatovicova, Z.

2004-01-01

The complete sequence of the mitochondrial DNA of the opportunistic yeast pathogen Candida parapsilosis was determined. The mitochondrial genome is represented by linear DNA molecules terminating with tandem repeats of a 738-bp unit. The number of repeats varies, thus generating a population...
