genomic dna k-mer: Topics by WorldWideScience.org

Sample records for genomic dna k-mer

Robust k-mer frequency estimation using gapped k-mers.

Science.gov (United States)

Ghandi, Mahmoud; Mohammad-Noori, Morteza; Beer, Michael A

2014-08-01

Oligomers of fixed length, k, commonly known as k-mers, are often used as fundamental elements in the description of DNA sequence features of diverse biological function, or as intermediate elements in the constuction of more complex descriptors of sequence features such as position weight matrices. k-mers are very useful as general sequence features because they constitute a complete and unbiased feature set, and do not require parameterization based on incomplete knowledge of biological mechanisms. However, a fundamental limitation in the use of k-mers as sequence features is that as k is increased, larger spatial correlations in DNA sequence elements can be described, but the frequency of observing any specific k-mer becomes very small, and rapidly approaches a sparse matrix of binary counts. Thus any statistical learning approach using k-mers will be susceptible to noisy estimation of k-mer frequencies once k becomes large. Because all molecular DNA interactions have limited spatial extent, gapped k-mers often carry the relevant biological signal. Here we use gapped k-mer counts to more robustly estimate the ungapped k-mer frequencies, by deriving an equation for the minimum norm estimate of k-mer frequencies given an observed set of gapped k-mer frequencies. We demonstrate that this approach provides a more accurate estimate of the k-mer frequencies in real biological sequences using a sample of CTCF binding sites in the human genome.
K-mer Content, Correlation, and Position Analysis of Genome DNA Sequences for the Identification of Function and Evolutionary Features

Directory of Open Access Journals (Sweden)

Aaron Sievers

2017-04-01

Full Text Available In genome analysis, k-mer-based comparison methods have become standard tools. However, even though they are able to deliver reliable results, other algorithms seem to work better in some cases. To improve k-mer-based DNA sequence analysis and comparison, we successfully checked whether adding positional resolution is beneficial for finding and/or comparing interesting organizational structures. A simple but efficient algorithm for extracting and saving local k-mer spectra (frequency distribution of k-mers was developed and used. The results were analyzed by including positional information based on visualizations as genomic maps and by applying basic vector correlation methods. This analysis was concentrated on small word lengths (1 ≤ k ≤ 4 on relatively small viral genomes of Papillomaviridae and Herpesviridae, while also checking its usability for larger sequences, namely human chromosome 2 and the homologous chromosomes (2A, 2B of a chimpanzee. Using this alignment-free analysis, several regions with specific characteristics in Papillomaviridae and Herpesviridae formerly identified by independent, mostly alignment-based methods, were confirmed. Correlations between the k-mer content and several genes in these genomes have been found, showing similarities between classified and unclassified viruses, which may be potentially useful for further taxonomic research. Furthermore, unknown k-mer correlations in the genomes of Human Herpesviruses (HHVs, which are probably of major biological function, are found and described. Using the chromosomes of a chimpanzee and human that are currently known, identities between the species on every analyzed chromosome were reproduced. This demonstrates the feasibility of our approach for large data sets of complex genomes. Based on these results, we suggest k-mer analysis with positional resolution as a method for closing a gap between the effectiveness of alignment-based methods (like NCBI BLAST and the
KGCAK: a K-mer based database for genome-wide phylogeny and complexity evaluation.

Science.gov (United States)

Wang, Dapeng; Xu, Jiayue; Yu, Jun

2015-09-16

The K-mer approach, treating genomic sequences as simple characters and counting the relative abundance of each string upon a fixed K, has been extensively applied to phylogeny inference for genome assembly, annotation, and comparison. To meet increasing demands for comparing large genome sequences and to promote the use of the K-mer approach, we develop a versatile database, KGCAK ( http://kgcak.big.ac.cn/KGCAK/ ), containing ~8,000 genomes that include genome sequences of diverse life forms (viruses, prokaryotes, protists, animals, and plants) and cellular organelles of eukaryotic lineages. It builds phylogeny based on genomic elements in an alignment-free fashion and provides in-depth data processing enabling users to compare the complexity of genome sequences based on K-mer distribution. We hope that KGCAK becomes a powerful tool for exploring relationship within and among groups of species in a tree of life based on genomic data.
Enhanced regulatory sequence prediction using gapped k-mer features.

Science.gov (United States)

Ghandi, Mahmoud; Lee, Dongwon; Mohammad-Noori, Morteza; Beer, Michael A

2014-07-01

Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.
Enhanced regulatory sequence prediction using gapped k-mer features.

Directory of Open Access Journals (Sweden)

Mahmoud Ghandi

2014-07-01

Full Text Available Oligomers of length k, or k-mers, are convenient and widely used features for modeling the properties and functions of DNA and protein sequences. However, k-mers suffer from the inherent limitation that if the parameter k is increased to resolve longer features, the probability of observing any specific k-mer becomes very small, and k-mer counts approach a binary variable, with most k-mers absent and a few present once. Thus, any statistical learning approach using k-mers as features becomes susceptible to noisy training set k-mer frequencies once k becomes large. To address this problem, we introduce alternative feature sets using gapped k-mers, a new classifier, gkm-SVM, and a general method for robust estimation of k-mer frequencies. To make the method applicable to large-scale genome wide applications, we develop an efficient tree data structure for computing the kernel matrix. We show that compared to our original kmer-SVM and alternative approaches, our gkm-SVM predicts functional genomic regulatory elements and tissue specific enhancers with significantly improved accuracy, increasing the precision by up to a factor of two. We then show that gkm-SVM consistently outperforms kmer-SVM on human ENCODE ChIP-seq datasets, and further demonstrate the general utility of our method using a Naïve-Bayes classifier. Although developed for regulatory sequence analysis, these methods can be applied to any sequence classification problem.
The effects of sampling on the efficiency and accuracy of k-mer indexes: Theoretical and empirical comparisons using the human genome.

Science.gov (United States)

Almutairy, Meznah; Torng, Eric

2017-01-01

One of the most common ways to search a sequence database for sequences that are similar to a query sequence is to use a k-mer index such as BLAST. A big problem with k-mer indexes is the space required to store the lists of all occurrences of all k-mers in the database. One method for reducing the space needed, and also query time, is sampling where only some k-mer occurrences are stored. Most previous work uses hard sampling, in which enough k-mer occurrences are retained so that all similar sequences are guaranteed to be found. In contrast, we study soft sampling, which further reduces the number of stored k-mer occurrences at a cost of decreasing query accuracy. We focus on finding highly similar local alignments (HSLA) over nucleotide sequences, an operation that is fundamental to biological applications such as cDNA sequence mapping. For our comparison, we use the NCBI BLAST tool with the human genome and human ESTs. When identifying HSLAs, we find that soft sampling significantly reduces both index size and query time with relatively small losses in query accuracy. For the human genome and HSLAs of length at least 100 bp, soft sampling reduces index size 4-10 times more than hard sampling and processes queries 2.3-6.8 times faster, while still achieving retention rates of at least 96.6%. When we apply soft sampling to the problem of mapping ESTs against the genome, we map more than 98% of ESTs perfectly while reducing the index size by a factor of 4 and query time by 23.3%. These results demonstrate that soft sampling is a simple but effective strategy for performing efficient searches for HSLAs. We also provide a new model for sampling with BLAST that predicts empirical retention rates with reasonable accuracy by modeling two key problem factors.
Implementation of hierarchical clustering using k-mer sparse matrix to analyze MERS-CoV genetic relationship

Science.gov (United States)

Bustamam, A.; Ulul, E. D.; Hura, H. F. A.; Siswantining, T.

2017-07-01

Hierarchical clustering is one of effective methods in creating a phylogenetic tree based on the distance matrix between DNA (deoxyribonucleic acid) sequences. One of the well-known methods to calculate the distance matrix is k-mer method. Generally, k-mer is more efficient than some distance matrix calculation techniques. The steps of k-mer method are started from creating k-mer sparse matrix, and followed by creating k-mer singular value vectors. The last step is computing the distance amongst vectors. In this paper, we analyze the sequences of MERS-CoV (Middle East Respiratory Syndrome - Coronavirus) DNA by implementing hierarchical clustering using k-mer sparse matrix in order to perform the phylogenetic analysis. Our results show that the ancestor of our MERS-CoV is coming from Egypt. Moreover, we found that the MERS-CoV infection that occurs in one country may not necessarily come from the same country of origin. This suggests that the process of MERS-CoV mutation might not only be influenced by geographical factor.
De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from Total DNA Sequences

Directory of Open Access Journals (Sweden)

Shairul Izan

2017-08-01

Full Text Available Whole Genome Shotgun (WGS sequences of plant species often contain an abundance of reads that are derived from the chloroplast genome. Up to now these reads have generally been identified and assembled into chloroplast genomes based on homology to chloroplasts from related species. This re-sequencing approach may select against structural differences between the genomes especially in non-model species for which no close relatives have been sequenced before. The alternative approach is to de novo assemble the chloroplast genome from total genomic DNA sequences. In this study, we used k-mer frequency tables to identify and extract the chloroplast reads from the WGS reads and assemble these using a highly integrated and automated custom pipeline. Our strategy includes steps aimed at optimizing assemblies and filling gaps which are left due to coverage variation in the WGS dataset. We have successfully de novo assembled three complete chloroplast genomes from plant species with a range of nuclear genome sizes to demonstrate the universality of our approach: Solanum lycopersicum (0.9 Gb, Aegilops tauschii (4 Gb and Paphiopedilum henryanum (25 Gb. We also highlight the need to optimize the choice of k and the amount of data used. This new and cost-effective method for de novo short read assembly will facilitate the study of complete chloroplast genomes with more accurate analyses and inferences, especially in non-model plant genomes.
Chromatin accessibility prediction via convolutional long short-term memory networks with k-mer embedding.

Science.gov (United States)

Min, Xu; Zeng, Wanwen; Chen, Ning; Chen, Ting; Jiang, Rui

2017-07-15

Experimental techniques for measuring chromatin accessibility are expensive and time consuming, appealing for the development of computational approaches to predict open chromatin regions from DNA sequences. Along this direction, existing methods fall into two classes: one based on handcrafted k -mer features and the other based on convolutional neural networks. Although both categories have shown good performance in specific applications thus far, there still lacks a comprehensive framework to integrate useful k -mer co-occurrence information with recent advances in deep learning. We fill this gap by addressing the problem of chromatin accessibility prediction with a convolutional Long Short-Term Memory (LSTM) network with k -mer embedding. We first split DNA sequences into k -mers and pre-train k -mer embedding vectors based on the co-occurrence matrix of k -mers by using an unsupervised representation learning approach. We then construct a supervised deep learning architecture comprised of an embedding layer, three convolutional layers and a Bidirectional LSTM (BLSTM) layer for feature learning and classification. We demonstrate that our method gains high-quality fixed-length features from variable-length sequences and consistently outperforms baseline methods. We show that k -mer embedding can effectively enhance model performance by exploring different embedding strategies. We also prove the efficacy of both the convolution and the BLSTM layers by comparing two variations of the network architecture. We confirm the robustness of our model to hyper-parameters by performing sensitivity analysis. We hope our method can eventually reinforce our understanding of employing deep learning in genomic studies and shed light on research regarding mechanisms of chromatin accessibility. The source code can be downloaded from https://github.com/minxueric/ismb2017_lstm . tingchen@tsinghua.edu.cn or ruijiang@tsinghua.edu.cn. Supplementary materials are available at
Comparing fixed sampling with minimizer sampling when using k-mer indexes to find maximal exact matches.

Science.gov (United States)

Almutairy, Meznah; Torng, Eric

2018-01-01

Bioinformatics applications and pipelines increasingly use k-mer indexes to search for similar sequences. The major problem with k-mer indexes is that they require lots of memory. Sampling is often used to reduce index size and query time. Most applications use one of two major types of sampling: fixed sampling and minimizer sampling. It is well known that fixed sampling will produce a smaller index, typically by roughly a factor of two, whereas it is generally assumed that minimizer sampling will produce faster query times since query k-mers can also be sampled. However, no direct comparison of fixed and minimizer sampling has been performed to verify these assumptions. We systematically compare fixed and minimizer sampling using the human genome as our database. We use the resulting k-mer indexes for fixed sampling and minimizer sampling to find all maximal exact matches between our database, the human genome, and three separate query sets, the mouse genome, the chimp genome, and an NGS data set. We reach the following conclusions. First, using larger k-mers reduces query time for both fixed sampling and minimizer sampling at a cost of requiring more space. If we use the same k-mer size for both methods, fixed sampling requires typically half as much space whereas minimizer sampling processes queries only slightly faster. If we are allowed to use any k-mer size for each method, then we can choose a k-mer size such that fixed sampling both uses less space and processes queries faster than minimizer sampling. The reason is that although minimizer sampling is able to sample query k-mers, the number of shared k-mer occurrences that must be processed is much larger for minimizer sampling than fixed sampling. In conclusion, we argue that for any application where each shared k-mer occurrence must be processed, fixed sampling is the right sampling method.
Comparing fixed sampling with minimizer sampling when using k-mer indexes to find maximal exact matches.

Directory of Open Access Journals (Sweden)

Meznah Almutairy

Full Text Available Bioinformatics applications and pipelines increasingly use k-mer indexes to search for similar sequences. The major problem with k-mer indexes is that they require lots of memory. Sampling is often used to reduce index size and query time. Most applications use one of two major types of sampling: fixed sampling and minimizer sampling. It is well known that fixed sampling will produce a smaller index, typically by roughly a factor of two, whereas it is generally assumed that minimizer sampling will produce faster query times since query k-mers can also be sampled. However, no direct comparison of fixed and minimizer sampling has been performed to verify these assumptions. We systematically compare fixed and minimizer sampling using the human genome as our database. We use the resulting k-mer indexes for fixed sampling and minimizer sampling to find all maximal exact matches between our database, the human genome, and three separate query sets, the mouse genome, the chimp genome, and an NGS data set. We reach the following conclusions. First, using larger k-mers reduces query time for both fixed sampling and minimizer sampling at a cost of requiring more space. If we use the same k-mer size for both methods, fixed sampling requires typically half as much space whereas minimizer sampling processes queries only slightly faster. If we are allowed to use any k-mer size for each method, then we can choose a k-mer size such that fixed sampling both uses less space and processes queries faster than minimizer sampling. The reason is that although minimizer sampling is able to sample query k-mers, the number of shared k-mer occurrences that must be processed is much larger for minimizer sampling than fixed sampling. In conclusion, we argue that for any application where each shared k-mer occurrence must be processed, fixed sampling is the right sampling method.
Comparing fixed sampling with minimizer sampling when using k-mer indexes to find maximal exact matches

Science.gov (United States)

Torng, Eric

2018-01-01

Bioinformatics applications and pipelines increasingly use k-mer indexes to search for similar sequences. The major problem with k-mer indexes is that they require lots of memory. Sampling is often used to reduce index size and query time. Most applications use one of two major types of sampling: fixed sampling and minimizer sampling. It is well known that fixed sampling will produce a smaller index, typically by roughly a factor of two, whereas it is generally assumed that minimizer sampling will produce faster query times since query k-mers can also be sampled. However, no direct comparison of fixed and minimizer sampling has been performed to verify these assumptions. We systematically compare fixed and minimizer sampling using the human genome as our database. We use the resulting k-mer indexes for fixed sampling and minimizer sampling to find all maximal exact matches between our database, the human genome, and three separate query sets, the mouse genome, the chimp genome, and an NGS data set. We reach the following conclusions. First, using larger k-mers reduces query time for both fixed sampling and minimizer sampling at a cost of requiring more space. If we use the same k-mer size for both methods, fixed sampling requires typically half as much space whereas minimizer sampling processes queries only slightly faster. If we are allowed to use any k-mer size for each method, then we can choose a k-mer size such that fixed sampling both uses less space and processes queries faster than minimizer sampling. The reason is that although minimizer sampling is able to sample query k-mers, the number of shared k-mer occurrences that must be processed is much larger for minimizer sampling than fixed sampling. In conclusion, we argue that for any application where each shared k-mer occurrence must be processed, fixed sampling is the right sampling method. PMID:29389989
Kmerind: A Flexible Parallel Library for K-mer Indexing of Biological Sequences on Distributed Memory Systems.

Science.gov (United States)

Pan, Tony; Flick, Patrick; Jain, Chirag; Liu, Yongchao; Aluru, Srinivas

2017-10-09

Counting and indexing fixed length substrings, or k-mers, in biological sequences is a key step in many bioinformatics tasks including genome alignment and mapping, genome assembly, and error correction. While advances in next generation sequencing technologies have dramatically reduced the cost and improved latency and throughput, few bioinformatics tools can efficiently process the datasets at the current generation rate of 1.8 terabases every 3 days. We present Kmerind, a high performance parallel k-mer indexing library for distributed memory environments. The Kmerind library provides a set of simple and consistent APIs with sequential semantics and parallel implementations that are designed to be flexible and extensible. Kmerind's k-mer counter performs similarly or better than the best existing k-mer counting tools even on shared memory systems. In a distributed memory environment, Kmerind counts k-mers in a 120 GB sequence read dataset in less than 13 seconds on 1024 Xeon CPU cores, and fully indexes their positions in approximately 17 seconds. Querying for 1% of the k-mers in these indices can be completed in 0.23 seconds and 28 seconds, respectively. Kmerind is the first k-mer indexing library for distributed memory environments, and the first extensible library for general k-mer indexing and counting. Kmerind is available at https://github.com/ParBLiSS/kmerind.
kWIP: The k-mer weighted inner product, a de novo estimator of genetic similarity.

Science.gov (United States)

Murray, Kevin D; Webers, Christfried; Ong, Cheng Soon; Borevitz, Justin; Warthmann, Norman

2017-09-01

Modern genomics techniques generate overwhelming quantities of data. Extracting population genetic variation demands computationally efficient methods to determine genetic relatedness between individuals (or "samples") in an unbiased manner, preferably de novo. Rapid estimation of genetic relatedness directly from sequencing data has the potential to overcome reference genome bias, and to verify that individuals belong to the correct genetic lineage before conclusions are drawn using mislabelled, or misidentified samples. We present the k-mer Weighted Inner Product (kWIP), an assembly-, and alignment-free estimator of genetic similarity. kWIP combines a probabilistic data structure with a novel metric, the weighted inner product (WIP), to efficiently calculate pairwise similarity between sequencing runs from their k-mer counts. It produces a distance matrix, which can then be further analysed and visualised. Our method does not require prior knowledge of the underlying genomes and applications include establishing sample identity and detecting mix-up, non-obvious genomic variation, and population structure. We show that kWIP can reconstruct the true relatedness between samples from simulated populations. By re-analysing several published datasets we show that our results are consistent with marker-based analyses. kWIP is written in C++, licensed under the GNU GPL, and is available from https://github.com/kdmurray91/kwip.
Recapitulating phylogenies using k-mers: from trees to networks.

Science.gov (United States)

Bernard, Guillaume; Ragan, Mark A; Chan, Cheong Xin

2016-01-01

Ernst Haeckel based his landmark Tree of Life on the supposed ontogenic recapitulation of phylogeny, i.e. that successive embryonic stages during the development of an organism re-trace the morphological forms of its ancestors over the course of evolution. Much of this idea has since been discredited. Today, phylogenies are often based on families of molecular sequences. The standard approach starts with a multiple sequence alignment, in which the sequences are arranged relative to each other in a way that maximises a measure of similarity position-by-position along their entire length. A tree (or sometimes a network) is then inferred. Rigorous multiple sequence alignment is computationally demanding, and evolutionary processes that shape the genomes of many microbes (bacteria, archaea and some morphologically simple eukaryotes) can add further complications. In particular, recombination, genome rearrangement and lateral genetic transfer undermine the assumptions that underlie multiple sequence alignment, and imply that a tree-like structure may be too simplistic. Here, using genome sequences of 143 bacterial and archaeal genomes, we construct a network of phylogenetic relatedness based on the number of shared k -mers (subsequences at fixed length k ). Our findings suggest that the network captures not only key aspects of microbial genome evolution as inferred from a tree, but also features that are not treelike. The method is highly scalable, allowing for investigation of genome evolution across a large number of genomes. Instead of using specific regions or sequences from genome sequences, or indeed Haeckel's idea of ontogeny, we argue that genome phylogenies can be inferred using k -mers from whole-genome sequences. Representing these networks dynamically allows biological questions of interest to be formulated and addressed quickly and in a visually intuitive manner.
A k-mer-based barcode DNA classification methodology based on spectral representation and a neural gas network.

Science.gov (United States)

Fiannaca, Antonino; La Rosa, Massimo; Rizzo, Riccardo; Urso, Alfonso

2015-07-01

In this paper, an alignment-free method for DNA barcode classification that is based on both a spectral representation and a neural gas network for unsupervised clustering is proposed. In the proposed methodology, distinctive words are identified from a spectral representation of DNA sequences. A taxonomic classification of the DNA sequence is then performed using the sequence signature, i.e., the smallest set of k-mers that can assign a DNA sequence to its proper taxonomic category. Experiments were then performed to compare our method with other supervised machine learning classification algorithms, such as support vector machine, random forest, ripper, naïve Bayes, ridor, and classification tree, which also consider short DNA sequence fragments of 200 and 300 base pairs (bp). The experimental tests were conducted over 10 real barcode datasets belonging to different animal species, which were provided by the on-line resource "Barcode of Life Database". The experimental results showed that our k-mer-based approach is directly comparable, in terms of accuracy, recall and precision metrics, with the other classifiers when considering full-length sequences. In addition, we demonstrate the robustness of our method when a classification is performed task with a set of short DNA sequences that were randomly extracted from the original data. For example, the proposed method can reach the accuracy of 64.8% at the species level with 200-bp fragments. Under the same conditions, the best other classifier (random forest) reaches the accuracy of 20.9%. Our results indicate that we obtained a clear improvement over the other classifiers for the study of short DNA barcode sequence fragments. Copyright © 2015 Elsevier B.V. All rights reserved.
Simrank: Rapid and sensitive general-purpose k-mer search tool

Energy Technology Data Exchange (ETDEWEB)

DeSantis, T.Z.; Keller, K.; Karaoz, U.; Alekseyenko, A.V; Singh, N.N.S.; Brodie, E.L; Pei, Z.; Andersen, G.L; Larsen, N.

2011-04-01

Terabyte-scale collections of string-encoded data are expected from consortia efforts such as the Human Microbiome Project (http://nihroadmap.nih.gov/hmp). Intra- and inter-project data similarity searches are enabled by rapid k-mer matching strategies. Software applications for sequence database partitioning, guide tree estimation, molecular classification and alignment acceleration have benefited from embedded k-mer searches as sub-routines. However, a rapid, general-purpose, open-source, flexible, stand-alone k-mer tool has not been available. Here we present a stand-alone utility, Simrank, which allows users to rapidly identify database strings the most similar to query strings. Performance testing of Simrank and related tools against DNA, RNA, protein and human-languages found Simrank 10X to 928X faster depending on the dataset. Simrank provides molecular ecologists with a high-throughput, open source choice for comparing large sequence sets to find similarity.
Norgal: extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data.

Science.gov (United States)

Al-Nakeeb, Kosai; Petersen, Thomas Nordahl; Sicheritz-Pontén, Thomas

2017-11-21

Whole-genome sequencing (WGS) projects provide short read nucleotide sequences from nuclear and possibly organelle DNA depending on the source of origin. Mitochondrial DNA is present in animals and fungi, while plants contain DNA from both mitochondria and chloroplasts. Current techniques for separating organelle reads from nuclear reads in WGS data require full reference or partial seed sequences for assembling. Norgal (de Novo ORGAneLle extractor) avoids this requirement by identifying a high frequency subset of k-mers that are predominantly of mitochondrial origin and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences in the range from 98.5 to 99.5%. We also assembled the chloroplasts of grape vines and cucumbers using Norgal together with seed-based de novo assemblers. Norgal is a pipeline that can extract and assemble full or partial mitochondrial and chloroplast genomes from WGS short reads without prior knowledge. The program is available at: https://bitbucket.org/kosaidtu/norgal .
Statistically Consistent k-mer Methods for Phylogenetic Tree Reconstruction.

Science.gov (United States)

Allman, Elizabeth S; Rhodes, John A; Sullivant, Seth

2017-02-01

Frequencies of k-mers in sequences are sometimes used as a basis for inferring phylogenetic trees without first obtaining a multiple sequence alignment. We show that a standard approach of using the squared Euclidean distance between k-mer vectors to approximate a tree metric can be statistically inconsistent. To remedy this, we derive model-based distance corrections for orthologous sequences without gaps, which lead to consistent tree inference. The identifiability of model parameters from k-mer frequencies is also studied. Finally, we report simulations showing that the corrected distance outperforms many other k-mer methods, even when sequences are generated with an insertion and deletion process. These results have implications for multiple sequence alignment as well since k-mer methods are usually the first step in constructing a guide tree for such algorithms.
The DnaK Chaperone Uses Different Mechanisms To Promote and Inhibit Replication of Vibrio cholerae Chromosome 2

Directory of Open Access Journals (Sweden)

Jyoti K. Jha

2017-04-01

Full Text Available Replication of Vibrio cholerae chromosome 2 (Chr2 depends on molecular chaperone DnaK to facilitate binding of the initiator (RctB to the replication origin. The binding occurs at two kinds of site, 12-mers and 39-mers, which promote and inhibit replication, respectively. Here we show that DnaK employs different mechanisms to enhance the two kinds of binding. We found that mutations in rctB that reduce DnaK binding also reduce 12-mer binding and initiation. The initiation defect is suppressed by second-site mutations that increase 12-mer binding only marginally. Instead, they reduce replication inhibitory mechanisms: RctB dimerization and 39-mer binding. One suppressing change was in a dimerization domain which is folded similarly to the initiator of an iteron plasmid—the presumed progenitor of Chr2. In plasmids, DnaK promotes initiation by reducing dimerization. A different mutation was in the 39-mer binding domain of RctB and inactivated it, indicating an alternative suppression mechanism. Paradoxically, although DnaK increases 39-mer binding, the increase was also achieved by inactivating the DnaK binding site of RctB. This result suggests that the site inhibits the 39-mer binding domain (via autoinhibition when prevented from binding DnaK. Taken together, our results reveal an important feature of the transition from plasmid to chromosome: the Chr2 initiator retains the plasmid-like dimerization domain and its control by chaperones but uses the chaperones in an unprecedented way to control the inhibitory 39-mer binding.

Designing small universal k-mer hitting sets for improved analysis of high-throughput sequencing.

Directory of Open Access Journals (Sweden)

Yaron Orenstein

2017-10-01

Full Text Available With the rapidly increasing volume of deep sequencing data, more efficient algorithms and data structures are needed. Minimizers are a central recent paradigm that has improved various sequence analysis tasks, including hashing for faster read overlap detection, sparse suffix arrays for creating smaller indexes, and Bloom filters for speeding up sequence search. Here, we propose an alternative paradigm that can lead to substantial further improvement in these and other tasks. For integers k and L > k, we say that a set of k-mers is a universal hitting set (UHS if every possible L-long sequence must contain a k-mer from the set. We develop a heuristic called DOCKS to find a compact UHS, which works in two phases: The first phase is solved optimally, and for the second we propose several efficient heuristics, trading set size for speed and memory. The use of heuristics is motivated by showing the NP-hardness of a closely related problem. We show that DOCKS works well in practice and produces UHSs that are very close to a theoretical lower bound. We present results for various values of k and L and by applying them to real genomes show that UHSs indeed improve over minimizers. In particular, DOCKS uses less than 30% of the 10-mers needed to span the human genome compared to minimizers. The software and computed UHSs are freely available at github.com/Shamir-Lab/DOCKS/ and acgt.cs.tau.ac.il/docks/, respectively.
Identification of species based on DNA barcode using k-mer feature vector and Random forest classifier.

Science.gov (United States)

Meher, Prabina Kumar; Sahu, Tanmaya Kumar; Rao, A R

2016-11-05

DNA barcoding is a molecular diagnostic method that allows automated and accurate identification of species based on a short and standardized fragment of DNA. To this end, an attempt has been made in this study to develop a computational approach for identifying the species by comparing its barcode with the barcode sequence of known species present in the reference library. Each barcode sequence was first mapped onto a numeric feature vector based on k-mer frequencies and then Random forest methodology was employed on the transformed dataset for species identification. The proposed approach outperformed similarity-based, tree-based, diagnostic-based approaches and found comparable with existing supervised learning based approaches in terms of species identification success rate, while compared using real and simulated datasets. Based on the proposed approach, an online web interface SPIDBAR has also been developed and made freely available at http://cabgrid.res.in:8080/spidbar/ for species identification by the taxonomists. Copyright © 2016 Elsevier B.V. All rights reserved.
The DnaK Chaperone Uses Different Mechanisms To Promote and Inhibit Replication of Vibrio cholerae Chromosome 2

Energy Technology Data Exchange (ETDEWEB)

Jha, Jyoti K.; Li, Mi; Ghirlando, Rodolfo; Miller Jenkins, Lisa M.; Wlodawer, Alexander; Chattoraj, Dhruba; Dunny, Gary M.

2017-04-18

Replication of Vibrio cholerae chromosome 2 (Chr2) depends on molecular chaperone DnaK to facilitate binding of the initiator (RctB) to the replication origin. The binding occurs at two kinds of site, 12-mers and 39-mers, which promote and inhibit replication, respectively. Here we show that DnaK employs different mechanisms to enhance the two kinds of binding. We found that mutations inrctBthat reduce DnaK binding also reduce 12-mer binding and initiation. The initiation defect is suppressed by second-site mutations that increase 12-mer binding only marginally. Instead, they reduce replication inhibitory mechanisms: RctB dimerization and 39-mer binding. One suppressing change was in a dimerization domain which is folded similarly to the initiator of an iteron plasmid—the presumed progenitor of Chr2. In plasmids, DnaK promotes initiation by reducing dimerization. A different mutation was in the 39-mer binding domain of RctB and inactivated it, indicating an alternative suppression mechanism. Paradoxically, although DnaK increases 39-mer binding, the increase was also achieved by inactivating the DnaK binding site of RctB. This result suggests that the site inhibits the 39-mer binding domain (via autoinhibition) when prevented from binding DnaK. Taken together, our results reveal an important feature of the transition from plasmid to chromosome: the Chr2 initiator retains the plasmid-like dimerization domain and its control by chaperones but uses the chaperones in an unprecedented way to control the inhibitory 39-mer binding. IMPORTANCE The capacity of proteins to undergo remodeling provides opportunities to control their function. However, remodeling remains a poorly understood aspect of the structure-function paradigm due to its dynamic nature. Here we have studied remodeling of the initiator of replication ofVibrio choleraeChr2 by the molecular chaperone, DnaK. We show that DnaK binds to a site on the Chr2 initiator (RctB) that
Transcriptional switching by the MerR protein: Activation and repression mutants implicate distinct DNA and mercury(II) binding domains

International Nuclear Information System (INIS)

Shewchuk, L.M.; Helmann, J.D.; Ross, W.; Park, S.J.; Summers, A.O.; Walsh, C.T.

1989-01-01

Bacterial resistance to mercuric compounds is controlled by the MerR metalloregulatory protein. The MerR protein functions as both a transcriptional repressor and a mercuric ion dependent transcriptional activator. Chemical mutagenesis of the cloned merR structural gene has led to the identification of mutant proteins that are specifically deficient in transcriptional repression, activation, or both. Five mutant proteins have been overproduced, purified to homogeneity, and assayed for ability to dimerize, bind mer operator DNA, and bind mercuric ion. A mutation in the recognition helix of a proposed helix-turn-helix DNA binding motif (E22K) yields protein deficient in both activation and repression in vivo (a - r - ) and deficient in operator binding in vitro. In contrast, mutations in three of the four MerR cysteine residues are repression competent but activation deficient (a - r + ) in vivo. In vitro, the purified cysteine mutant proteins bind to the mer operator site with near wild-type affinity but are variable deficient in binding the in vivo inducer mercury(II) ion. A subset of the isolated proteins also appears compromised in their ability to form dimers at low protein concentrations. These data support a model in which DNA-bound MerR dimer binds one mercuric ion and transmits this occupancy information to a protein region involved in transcriptional activation
Cutoffs and k-mers: implications from a transcriptome study in allopolyploid plants

Directory of Open Access Journals (Sweden)

Gruenheit Nicole

2012-03-01

Full Text Available Abstract Background Transcriptome analysis is increasingly being used to study the evolutionary origins and ecology of non-model plants. One issue for both transcriptome assembly and differential gene expression analyses is the common occurrence in plants of hybridisation and whole genome duplication (WGD and hybridization resulting in allopolyploidy. The divergence of duplicated genes following WGD creates near identical homeologues that can be problematic for de novo assembly and also reference based assembly protocols that use short reads (35 - 100 bp. Results Here we report a successful strategy for the assembly of two transcriptomes made using 75 bp Illumina reads from Pachycladon fastigiatum and Pachycladon cheesemanii. Both are allopolyploid plant species (2n = 20 that originated in the New Zealand Alps about 0.8 million years ago. In a systematic analysis of 19 different coverage cutoffs and 20 different k-mer sizes we showed that i none of the genes could be assembled across all of the parameter space ii assembly of each gene required an optimal set of parameter values and iii these parameter values could be explained in part by different gene expression levels and different degrees of similarity between genes. Conclusions To obtain optimal transcriptome assemblies for allopolyploid plants, k-mer size and k-mer coverage need to be considered simultaneously across a broad parameter space. This is important for assembling a maximum number of full length ESTs and for avoiding chimeric assemblies of homeologous and paralogous gene copies.
Quality assessment of buccal versus blood genomic DNA using the Affymetrix 500 K GeneChip

Directory of Open Access Journals (Sweden)

Martin Lisa J

2007-11-01

Full Text Available Abstract Background With the advent of genome-wide genotyping, the utility of stored buccal brushes for DNA extraction and genotyping has been questioned. We sought to describe the genomic DNA yield and concordance between stored buccal brushes and blood samples from the same individuals in the context of Affymetrix 500 K Human GeneChip genotyping. Results Buccal cytobrushes stored for ~7 years at -80°C prior to extraction yielded sufficient double stranded DNA (dsDNA to be successfully genotyped on the Affymetrix ~262 K NspI chip, with yields between 536 and 1047 ng dsDNA. Using the BRLMM algorithm, genotyping call rates for blood samples averaged 98.4%, and for buccal samples averaged 97.8%. Matched blood samples exhibited 99.2% concordance, while matched blood and buccal samples exhibited 98.8% concordance. Conclusion Buccal cytobrushes stored long-term result in sufficient dsDNA concentrations to achieve high genotyping call rates and concordance with stored blood samples in the context of Affymetrix 500 K SNP genotyping. Thus, given high-quality collection and storage protocols, it is possible to use stored buccal cytobrush samples for genome-wide association studies.
Complete Genome Sequence of Middle East Respiratory Syndrome Coronavirus (MERS-CoV) from the First Imported MERS-CoV Case in China.

Science.gov (United States)

Lu, Roujian; Wang, Yanqun; Wang, Wenling; Nie, Kai; Zhao, Yanjie; Su, Juan; Deng, Yao; Zhou, Weimin; Li, Yang; Wang, Huijuan; Wang, Wen; Ke, Changwen; Ma, Xuejun; Wu, Guizhen; Tan, Wenjie

2015-08-13

On 26 May 2015, an imported Middle East respiratory syndrome coronavirus (MERS-CoV) was identified in Guangdong Province, China, and found to be closely related to the MERS-CoV strain prevalent in South Korea. The full genome of the ChinaGD01 strain was sequenced and analyzed to investigate the epidemiology and evolution of MERS-CoV circulating in South Korea and China. Copyright © 2015 Lu et al.
Biased distribution of DNA uptake sequences towards genome maintenance genes

DEFF Research Database (Denmark)

Davidsen, T.; Rodland, E.A.; Lagesen, K.

2004-01-01

Repeated sequence signatures are characteristic features of all genomic DNA. We have made a rigorous search for repeat genomic sequences in the human pathogens Neisseria meningitidis, Neisseria gonorrhoeae and Haemophilus influenzae and found that by far the most frequent 9-10mers residing within...... in these organisms. Pasteurella multocida also displayed high frequencies of a putative DUS identical to that previously identified in H. influenzae and with a skewed distribution towards genome maintenance genes, indicating that this bacterium might be transformation competent under certain conditions....
A pipeline for the de novo assembly of the Themira biloba (Sepsidae: Diptera) transcriptome using a multiple k-mer length approach.

Science.gov (United States)

Melicher, Dacotah; Torson, Alex S; Dworkin, Ian; Bowsher, Julia H

2014-03-12

The Sepsidae family of flies is a model for investigating how sexual selection shapes courtship and sexual dimorphism in a comparative framework. However, like many non-model systems, there are few molecular resources available. Large-scale sequencing and assembly have not been performed in any sepsid, and the lack of a closely related genome makes investigation of gene expression challenging. Our goal was to develop an automated pipeline for de novo transcriptome assembly, and to use that pipeline to assemble and analyze the transcriptome of the sepsid Themira biloba. Our bioinformatics pipeline uses cloud computing services to assemble and analyze the transcriptome with off-site data management, processing, and backup. It uses a multiple k-mer length approach combined with a second meta-assembly to extend transcripts and recover more bases of transcript sequences than standard single k-mer assembly. We used 454 sequencing to generate 1.48 million reads from cDNA generated from embryo, larva, and pupae of T. biloba and assembled a transcriptome consisting of 24,495 contigs. Annotation identified 16,705 transcripts, including those involved in embryogenesis and limb patterning. We assembled transcriptomes from an additional three non-model organisms to demonstrate that our pipeline assembled a higher-quality transcriptome than single k-mer approaches across multiple species. The pipeline we have developed for assembly and analysis increases contig length, recovers unique transcripts, and assembles more base pairs than other methods through the use of a meta-assembly. The T. biloba transcriptome is a critical resource for performing large-scale RNA-Seq investigations of gene expression patterns, and is the first transcriptome sequenced in this Dipteran family.
Multiple comparative metagenomics using multiset k-mer counting

Directory of Open Access Journals (Sweden)

Gaëtan Benoit

2016-11-01

Full Text Available Background Large scale metagenomic projects aim to extract biodiversity knowledge between different environmental conditions. Current methods for comparing microbial communities face important limitations. Those based on taxonomical or functional assignation rely on a small subset of the sequences that can be associated to known organisms. On the other hand, de novo methods, that compare the whole sets of sequences, either do not scale up on ambitious metagenomic projects or do not provide precise and exhaustive results. Methods These limitations motivated the development of a new de novo metagenomic comparative method, called Simka. This method computes a large collection of standard ecological distances by replacing species counts by k-mer counts. Simka scales-up today’s metagenomic projects thanks to a new parallel k-mer counting strategy on multiple datasets. Results Experiments on public Human Microbiome Project datasets demonstrate that Simka captures the essential underlying biological structure. Simka was able to compute in a few hours both qualitative and quantitative ecological distances on hundreds of metagenomic samples (690 samples, 32 billions of reads. We also demonstrate that analyzing metagenomes at the k-mer level is highly correlated with extremely precise de novo comparison techniques which rely on all-versus-all sequences alignment strategy or which are based on taxonomic profiling.
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Directory of Open Access Journals (Sweden)

Maley Carlo C

2008-10-01

Full Text Available Abstract Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12 genomes. Virtually all possible (> 98% 12 bp oligomers appear in vertebrate genomes while 98% to D. melanogaster (12–17 bp, C. elegans (11–17 bp, A. thaliana (11–17 bp, S. cerevisiae (10–16 bp and E. coli (9–15 bp. Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to detect
Sequence space coverage, entropy of genomes and the potential to detect non-human DNA in human samples

Science.gov (United States)

Liu, Zhandong; Venkatesh, Santosh S; Maley, Carlo C

2008-01-01

Background Genomes store information for building and maintaining organisms. Complete sequencing of many genomes provides the opportunity to study and compare global information properties of those genomes. Results We have analyzed aspects of the information content of Homo sapiens, Mus musculus, Drosophila melanogaster, Caenorhabditis elegans, Arabidopsis thaliana, Saccharomyces cerevisiae, and Escherichia coli (K-12) genomes. Virtually all possible (> 98%) 12 bp oligomers appear in vertebrate genomes while 98% to < 2% of possible oligomers in D. melanogaster (12–17 bp), C. elegans (11–17 bp), A. thaliana (11–17 bp), S. cerevisiae (10–16 bp) and E. coli (9–15 bp). Frequencies of unique oligomers in the genomes follow similar patterns. We identified a set of 2.6 M 15-mers that are more than 1 nucleotide different from all 15-mers in the human genome and so could be used as probes to detect microbes in human samples. In a human sample, these probes would detect 100% of the 433 currently fully sequenced prokaryotes and 75% of the 3065 fully sequenced viruses. The human genome is significantly more compact in sequence space than a random genome. We identified the most frequent 5- to 20-mers in the human genome, which may prove useful as PCR primers. We also identified a bacterium, Anaeromyxobacter dehalogenans, which has an exceptionally low diversity of oligomers given the size of its genome and its GC content. The entropy of coding regions in the human genome is significantly higher than non-coding regions and chromosomes. However chromosomes 1, 2, 9, 12 and 14 have a relatively high proportion of coding DNA without high entropy, and chromosome 20 is the opposite with a low frequency of coding regions but relatively high entropy. Conclusion Measures of the frequency of oligomers are useful for designing PCR assays and for identifying chromosomes and organisms with hidden structure that had not been previously recognized. This information may be used to
Unsupervised binning of environmental genomic fragments based on an error robust selection of l-mers.

Science.gov (United States)

Yang, Bin; Peng, Yu; Leung, Henry Chi-Ming; Yiu, Siu-Ming; Chen, Jing-Chi; Chin, Francis Yuk-Lun

2010-04-16

With the rapid development of genome sequencing techniques, traditional research methods based on the isolation and cultivation of microorganisms are being gradually replaced by metagenomics, which is also known as environmental genomics. The first step, which is still a major bottleneck, of metagenomics is the taxonomic characterization of DNA fragments (reads) resulting from sequencing a sample of mixed species. This step is usually referred as "binning". Existing binning methods are based on supervised or semi-supervised approaches which rely heavily on reference genomes of known microorganisms and phylogenetic marker genes. Due to the limited availability of reference genomes and the bias and instability of marker genes, existing binning methods may not be applicable in many cases. In this paper, we present an unsupervised binning method based on the distribution of a carefully selected set of l-mers (substrings of length l in DNA fragments). From our experiments, we show that our method can accurately bin DNA fragments with various lengths and relative species abundance ratios without using any reference and training datasets. Another feature of our method is its error robustness. The binning accuracy decreases by less than 1% when the sequencing error rate increases from 0% to 5%. Note that the typical sequencing error rate of existing commercial sequencing platforms is less than 2%. We provide a new and effective tool to solve the metagenome binning problem without using any reference datasets or markers information of any known reference genomes (species). The source code of our software tool, the reference genomes of the species for generating the test datasets and the corresponding test datasets are available at http://i.cs.hku.hk/~alse/MetaCluster/.
Update on the use of random 10-mers in mapping and fingerprinting genomes

International Nuclear Information System (INIS)

Sinibaldi, R.M.

2001-01-01

The use of Randomly Amplified Polymorphic DNA (RAPDs) has continued to grow for the last several years. A quick assessment of their use can be estimated by searching PubMed at the National Library of Medicine with the acronym RAPD. Since their first report in 1990, the number of citations with RAPD in them has increased from 12 in 1990, to 45 in 1991, to, 112 in 1993, to, 130 in 1994, to 223 in 1995, to 258 in 1996, to 236 in 1997, to 316 in 1998, to 196 to date (August 31) 1999. The utilization of 10-mers for mapping or fingerprinting has many advantages. These include a relatively low cost, no use of radioactivity, easily adapted to automation, requirement for very small amounts of input DNA, rapid results, existing data bases for many organisms, and low cost equipment requirements. In conjunction with a derived technology such as SCARs (sequence characterized amplified regions), it can provide cost effective and thorough methods for mapping and fingerprinting any genome. Newer methods based on microarray technology may offer powerful but expensive alternative approaches in determining genetic diversity. The costs of arrays should come down with time and improved production methods. In the meantime, RAPDs remain a competent and cost effective method for genome characterizations. (author)
Real-Time Pathogen Detection in the Era of Whole-Genome Sequencing and Big Data: Comparison of k-mer and Site-Based Methods for Inferring the Genetic Distances among Tens of Thousands of Salmonella Samples.

Science.gov (United States)

Pettengill, James B; Pightling, Arthur W; Baugher, Joseph D; Rand, Hugh; Strain, Errol

2016-01-01

The adoption of whole-genome sequencing within the public health realm for molecular characterization of bacterial pathogens has been followed by an increased emphasis on real-time detection of emerging outbreaks (e.g., food-borne Salmonellosis). In turn, large databases of whole-genome sequence data are being populated. These databases currently contain tens of thousands of samples and are expected to grow to hundreds of thousands within a few years. For these databases to be of optimal use one must be able to quickly interrogate them to accurately determine the genetic distances among a set of samples. Being able to do so is challenging due to both biological (evolutionary diverse samples) and computational (petabytes of sequence data) issues. We evaluated seven measures of genetic distance, which were estimated from either k-mer profiles (Jaccard, Euclidean, Manhattan, Mash Jaccard, and Mash distances) or nucleotide sites (NUCmer and an extended multi-locus sequence typing (MLST) scheme). When analyzing empirical data (whole-genome sequence data from 18,997 Salmonella isolates) there are features (e.g., genomic, assembly, and contamination) that cause distances inferred from k-mer profiles, which treat absent data as informative, to fail to accurately capture the distance between samples when compared to distances inferred from differences in nucleotide sites. Thus, site-based distances, like NUCmer and extended MLST, are superior in performance, but accessing the computing resources necessary to perform them may be challenging when analyzing large databases.
Real-Time Pathogen Detection in the Era of Whole-Genome Sequencing and Big Data: Comparison of k-mer and Site-Based Methods for Inferring the Genetic Distances among Tens of Thousands of Salmonella Samples.

Directory of Open Access Journals (Sweden)

James B Pettengill

Full Text Available The adoption of whole-genome sequencing within the public health realm for molecular characterization of bacterial pathogens has been followed by an increased emphasis on real-time detection of emerging outbreaks (e.g., food-borne Salmonellosis. In turn, large databases of whole-genome sequence data are being populated. These databases currently contain tens of thousands of samples and are expected to grow to hundreds of thousands within a few years. For these databases to be of optimal use one must be able to quickly interrogate them to accurately determine the genetic distances among a set of samples. Being able to do so is challenging due to both biological (evolutionary diverse samples and computational (petabytes of sequence data issues. We evaluated seven measures of genetic distance, which were estimated from either k-mer profiles (Jaccard, Euclidean, Manhattan, Mash Jaccard, and Mash distances or nucleotide sites (NUCmer and an extended multi-locus sequence typing (MLST scheme. When analyzing empirical data (whole-genome sequence data from 18,997 Salmonella isolates there are features (e.g., genomic, assembly, and contamination that cause distances inferred from k-mer profiles, which treat absent data as informative, to fail to accurately capture the distance between samples when compared to distances inferred from differences in nucleotide sites. Thus, site-based distances, like NUCmer and extended MLST, are superior in performance, but accessing the computing resources necessary to perform them may be challenging when analyzing large databases.
High-Throughput Block Optical DNA Sequence Identification.

Science.gov (United States)

Sagar, Dodderi Manjunatha; Korshoj, Lee Erik; Hanson, Katrina Bethany; Chowdhury, Partha Pratim; Otoupal, Peter Britton; Chatterjee, Anushree; Nagpal, Prashant

2018-01-01

Optical techniques for molecular diagnostics or DNA sequencing generally rely on small molecule fluorescent labels, which utilize light with a wavelength of several hundred nanometers for detection. Developing a label-free optical DNA sequencing technique will require nanoscale focusing of light, a high-throughput and multiplexed identification method, and a data compression technique to rapidly identify sequences and analyze genomic heterogeneity for big datasets. Such a method should identify characteristic molecular vibrations using optical spectroscopy, especially in the "fingerprinting region" from ≈400-1400 cm -1 . Here, surface-enhanced Raman spectroscopy is used to demonstrate label-free identification of DNA nucleobases with multiplexed 3D plasmonic nanofocusing. While nanometer-scale mode volumes prevent identification of single nucleobases within a DNA sequence, the block optical technique can identify A, T, G, and C content in DNA k-mers. The content of each nucleotide in a DNA block can be a unique and high-throughput method for identifying sequences, genes, and other biomarkers as an alternative to single-letter sequencing. Additionally, coupling two complementary vibrational spectroscopy techniques (infrared and Raman) can improve block characterization. These results pave the way for developing a novel, high-throughput block optical sequencing method with lossy genomic data compression using k-mer identification from multiplexed optical data acquisition. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
Informational and linguistic analysis of large genomic sequence collections via efficient Hadoop cluster algorithms.

Science.gov (United States)

Ferraro Petrillo, Umberto; Roscigno, Gianluca; Cattaneo, Giuseppe; Giancarlo, Raffaele

2018-06-01

Information theoretic and compositional/linguistic analysis of genomes have a central role in bioinformatics, even more so since the associated methodologies are becoming very valuable also for epigenomic and meta-genomic studies. The kernel of those methods is based on the collection of k-mer statistics, i.e. how many times each k-mer in {A,C,G,T}k occurs in a DNA sequence. Although this problem is computationally very simple and efficiently solvable on a conventional computer, the sheer amount of data available now in applications demands to resort to parallel and distributed computing. Indeed, those type of algorithms have been developed to collect k-mer statistics in the realm of genome assembly. However, they are so specialized to this domain that they do not extend easily to the computation of informational and linguistic indices, concurrently on sets of genomes. Following the well-established approach in many disciplines, and with a growing success also in bioinformatics, to resort to MapReduce and Hadoop to deal with 'Big Data' problems, we present KCH, the first set of MapReduce algorithms able to perform concurrently informational and linguistic analysis of large collections of genomic sequences on a Hadoop cluster. The benchmarking of KCH that we provide indicates that it is quite effective and versatile. It is also competitive with respect to the parallel and distributed algorithms highly specialized to k-mer statistics collection for genome assembly problems. In conclusion, KCH is a much needed addition to the growing number of algorithms and tools that use MapReduce for bioinformatics core applications. The software, including instructions for running it over Amazon AWS, as well as the datasets are available at http://www.di-srv.unisa.it/KCH. umberto.ferraro@uniroma1.it. Supplementary data are available at Bioinformatics online.
DNA motif elucidation using belief propagation.

Science.gov (United States)

Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

2013-09-01

Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8∼10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/∼wkc/kmerHMM.
DNA motif elucidation using belief propagation

KAUST Repository

Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

2013-01-01

Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

DNA motif elucidation using belief propagation

KAUST Repository

Wong, Ka-Chun

2013-06-29

Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).
A Computational Approach for Predicting Role of Human MicroRNAs in MERS-CoV Genome

Directory of Open Access Journals (Sweden)

Md Mahmudul Hasan

2014-01-01

Full Text Available The new epidemic Middle East Respiratory Syndrome (MERS is caused by a type of human coronavirus called MERS-CoV which has global fatality rate of about 30%. We are investigating potential antiviral therapeutics against MERS-CoV by using host microRNAs (miRNAs which may downregulate viral gene expression to quell viral replication. We computationally predicted potential 13 cellular miRNAs from 11 potential hairpin sequences of MERS-CoV genome. Our study provided an interesting hypothesis that those miRNAs, that is, hsa-miR-628-5p, hsa-miR-6804-3p, hsa-miR-4289, hsa-miR-208a-3p, hsa-miR-510-3p, hsa-miR-18a-3p, hsa-miR-329-3p, hsa-miR-548ax, hsa-miR-3934-5p, hsa-miR-4474-5p, hsa-miR-7974, hsa-miR-6865-5p, and hsa-miR-342-3p, would be antiviral therapeutics against MERS-CoV infection.
Ornithine decarboxylase antizyme induces hypomethylation of genome DNA and histone H3 lysine 9 dimethylation (H3K9me2 in human oral cancer cell line.

Directory of Open Access Journals (Sweden)

Daisuke Yamamoto

2010-09-01

Full Text Available Methylation of CpG islands of genome DNA and lysine residues of histone H3 and H4 tails regulates gene transcription. Inhibition of polyamine synthesis by ornithine decarboxylase antizyme-1 (OAZ in human oral cancer cell line resulted in accumulation of decarboxylated S-adenosylmethionine (dcSAM, which acts as a competitive inhibitor of methylation reactions. We anticipated that accumulation of dcSAM impaired methylation reactions and resulted in hypomethylation of genome DNA and histone tails.Global methylation state of genome DNA and lysine residues of histone H3 and H4 tails were assayed by Methylation by Isoschizomers (MIAMI method and western blotting, respectively, in the presence or absence of OAZ expression. Ectopic expression of OAZ mediated hypomethylation of CpG islands of genome DNA and histone H3 lysine 9 dimethylation (H3K9me2. Protein level of DNA methyltransferase 3B (DNMT3B and histone H3K9me specific methyltransferase G9a were down-regulated in OAZ transfectant.OAZ induced hypomethylation of CpG islands of global genome DNA and H3K9me2 by down-regulating DNMT3B and G9a protein level. Hypomethylation of CpG islands of genome DNA and histone H3K9me2 is a potent mechanism of induction of the genes related to tumor suppression and DNA double strand break repair.
Norgal: Extraction and de novo assembly of mitochondrial DNA from whole-genome sequencing data

DEFF Research Database (Denmark)

Al-Nakeeb, Kosai Ali Ahmed; Petersen, Thomas Nordahl; Sicheritz-Pontén, Thomas

2017-01-01

and performing a de novo assembly on a subset of reads that contains these k-mers. The method was applied to WGS data from a panda, brown algae seaweed, butterfly and filamentous fungus. We were able to extract full circular mitochondrial genomes and obtained sequence identities to the reference sequences...
Enhancing Targeted Genomic DNA Editing in Chicken Cells Using the CRISPR/Cas9 System

Science.gov (United States)

Wang, Ling; Yang, Likai; Guo, Yijie; Du, Weili; Yin, Yajun; Zhang, Tao; Lu, Hongzhao

2017-01-01

The CRISPR/Cas9 system has enabled highly efficient genome targeted editing for various organisms. However, few studies have focused on CRISPR/Cas9 nuclease-mediated chicken genome editing compared with mammalian genomes. The current study combined CRISPR with yeast Rad52 (yRad52) to enhance targeted genomic DNA editing in chicken DF-1 cells. The efficiency of CRISPR/Cas9 nuclease-induced targeted mutations in the chicken genome was increased to 41.9% via the enrichment of the dual-reporter surrogate system. In addition, the combined effect of CRISPR nuclease and yRad52 dramatically increased the efficiency of the targeted substitution in the myostatin gene using 50-mer oligodeoxynucleotides (ssODN) as the donor DNA, resulting in a 36.7% editing efficiency after puromycin selection. Furthermore, based on the effect of yRad52, the frequency of exogenous gene integration in the chicken genome was more than 3-fold higher than that without yRad52. Collectively, these results suggest that ssODN is an ideal donor DNA for targeted substitution and that CRISPR/Cas9 combined with yRad52 significantly enhances chicken genome editing. These findings could be extensively applied in other organisms. PMID:28068387
Organizational heterogeneity of vertebrate genomes.

Science.gov (United States)

Frenkel, Svetlana; Kirzhner, Valery; Korol, Abraham

2012-01-01

Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS) analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers) in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM) allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
Organizational heterogeneity of vertebrate genomes.

Directory of Open Access Journals (Sweden)

Svetlana Frenkel

Full Text Available Genomes of higher eukaryotes are mosaics of segments with various structural, functional, and evolutionary properties. The availability of whole-genome sequences allows the investigation of their structure as "texts" using different statistical and computational methods. One such method, referred to as Compositional Spectra (CS analysis, is based on scoring the occurrences of fixed-length oligonucleotides (k-mers in the target DNA sequence. CS analysis allows generating species- or region-specific characteristics of the genome, regardless of their length and the presence of coding DNA. In this study, we consider the heterogeneity of vertebrate genomes as a joint effect of regional variation in sequence organization superimposed on the differences in nucleotide composition. We estimated compositional and organizational heterogeneity of genome and chromosome sequences separately and found that both heterogeneity types vary widely among genomes as well as among chromosomes in all investigated taxonomic groups. The high correspondence of heterogeneity scores obtained on three genome fractions, coding, repetitive, and the remaining part of the noncoding DNA (the genome dark matter--GDM allows the assumption that CS-heterogeneity may have functional relevance to genome regulation. Of special interest for such interpretation is the fact that natural GDM sequences display the highest deviation from the corresponding reshuffled sequences.
DNA sequence+shape kernel enables alignment-free modeling of transcription factor binding.

Science.gov (United States)

Ma, Wenxiu; Yang, Lin; Rohs, Remo; Noble, William Stafford

2017-10-01

Transcription factors (TFs) bind to specific DNA sequence motifs. Several lines of evidence suggest that TF-DNA binding is mediated in part by properties of the local DNA shape: the width of the minor groove, the relative orientations of adjacent base pairs, etc. Several methods have been developed to jointly account for DNA sequence and shape properties in predicting TF binding affinity. However, a limitation of these methods is that they typically require a training set of aligned TF binding sites. We describe a sequence + shape kernel that leverages DNA sequence and shape information to better understand protein-DNA binding preference and affinity. This kernel extends an existing class of k-mer based sequence kernels, based on the recently described di-mismatch kernel. Using three in vitro benchmark datasets, derived from universal protein binding microarrays (uPBMs), genomic context PBMs (gcPBMs) and SELEX-seq data, we demonstrate that incorporating DNA shape information improves our ability to predict protein-DNA binding affinity. In particular, we observe that (i) the k-spectrum + shape model performs better than the classical k-spectrum kernel, particularly for small k values; (ii) the di-mismatch kernel performs better than the k-mer kernel, for larger k; and (iii) the di-mismatch + shape kernel performs better than the di-mismatch kernel for intermediate k values. The software is available at https://bitbucket.org/wenxiu/sequence-shape.git. rohs@usc.edu or william-noble@uw.edu. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.
Characterization of two conformational epitopes of the Chlamydia trachomatis serovar L2 DnaK immunogen

DEFF Research Database (Denmark)

Birkelund, Svend; Mygind, P; Holm, A

1996-01-01

this protein. By use of recombinant DNA techniques, we located the epitopes for two MAbs in the C-terminal variable part. Although the antibodies reacted in an immunoblot assay, it was not possible to map the epitopes completely by use of 16-mer synthetic peptides displaced by one amino acid corresponding......Chlamydia trachomatis DnaK is an important immunogen in chlamydial infections. DnaK is composed of a conserved N-terminal ATP-binding domain and a variable C-terminal peptide-binding domain. To locate the immunogenic part of C. trachomatis Dnak, we generated monoclonal antibodies (MAbs) against...... with the two antibodies. The epitopes were found not to overlap. To obtain DnaK fragments recognized by the antibodies with the same affinity as native C. trachomatis DnaK, it was necessary to express, respectively, regions of 127 and 77 amino acids. The MAbs described in this study thus recognized...
Extreme-Scale De Novo Genome Assembly

Energy Technology Data Exchange (ETDEWEB)

Georganas, Evangelos [Intel Corporation, Santa Clara, CA (United States); Hofmeyr, Steven [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Egan, Rob [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Buluc, Aydin [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Oliker, Leonid [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.; Rokhsar, Daniel [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Computational Research Division; Yelick, Katherine [Lawrence Berkeley National Lab. (LBNL), Berkeley, CA (United States). Joint Genome Inst.

2017-09-26

De novo whole genome assembly reconstructs genomic sequence from short, overlapping, and potentially erroneous DNA segments and is one of the most important computations in modern genomics. This work presents HipMER, a high-quality end-to-end de novo assembler designed for extreme scale analysis, via efficient parallelization of the Meraculous code. Genome assembly software has many components, each of which stresses different components of a computer system. This chapter explains the computational challenges involved in each step of the HipMer pipeline, the key distributed data structures, and communication costs in detail. We present performance results of assembling the human genome and the large hexaploid wheat genome on large supercomputers up to tens of thousands of cores.
From NGS assembly challenges to instability of fungal mitochondrial genomes: A case study in genome complexity.

Science.gov (United States)

Misas, Elizabeth; Muñoz, José Fernando; Gallo, Juan Esteban; McEwen, Juan Guillermo; Clay, Oliver Keatinge

2016-04-01

The presence of repetitive or non-unique DNA persisting over sizable regions of a eukaryotic genome can hinder the genome's successful de novo assembly from short reads: ambiguities in assigning genome locations to the non-unique subsequences can result in premature termination of contigs and thus overfragmented assemblies. Fungal mitochondrial (mtDNA) genomes are compact (typically less than 100 kb), yet often contain short non-unique sequences that can be shown to impede their successful de novo assembly in silico. Such repeats can also confuse processes in the cell in vivo. A well-studied example is ectopic (out-of-register, illegitimate) recombination associated with repeat pairs, which can lead to deletion of functionally important genes that are located between the repeats. Repeats that remain conserved over micro- or macroevolutionary timescales despite such risks may indicate functionally or structurally (e.g., for replication) important regions. This principle could form the basis of a mining strategy for accelerating discovery of function in genome sequences. We present here our screening of a sample of 11 fully sequenced fungal mitochondrial genomes by observing where exact k-mer repeats occurred several times; initial analyses motivated us to focus on 17-mers occurring more than three times. Based on the diverse repeats we observe, we propose that such screening may serve as an efficient expedient for gaining a rapid but representative first insight into the repeat landscapes of sparsely characterized mitochondrial chromosomes. Our matching of the flagged repeats to previously reported regions of interest supports the idea that systems of persisting, non-trivial repeats in genomes can often highlight features meriting further attention. Copyright © 2016 Elsevier Ltd. All rights reserved.
Alignment of Escherichia coli K12 DNA sequences to a genomic restriction map.

Science.gov (United States)

Rudd, K E; Miller, W; Ostell, J; Benson, D A

1990-01-25

We use the extensive published information describing the genome of Escherichia coli and new restriction map alignment software to align DNA sequence, genetic, and physical maps. Restriction map alignment software is used which considers restriction maps as strings analogous to DNA or protein sequences except that two values, enzyme name and DNA base address, are associated with each position on the string. The resulting alignments reveal a nearly linear relationship between the physical and genetic maps of the E. coli chromosome. Physical map comparisons with the 1976, 1980, and 1983 genetic maps demonstrate a better fit with the more recent maps. The results of these alignments are genomic kilobase coordinates, orientation and rank of the alignment that best fits the genetic data. A statistical measure based on extreme value distribution is applied to the alignments. Additional computer analyses allow us to estimate the accuracy of the published E. coli genomic restriction map, simulate rearrangements of the bacterial chromosome, and search for repetitive DNA. The procedures we used are general enough to be applicable to other genome mapping projects.
Influence of defects on the effective electrical conductivity of a monolayer produced by random sequential adsorption of linear k-mers onto a square lattice

Science.gov (United States)

Tarasevich, Yuri Yu.; Laptev, Valeri V.; Goltseva, Valeria A.; Lebovka, Nikolai I.

2017-07-01

The effect of defects on the behaviour of electrical conductivity, σ, in a monolayer produced by the random sequential adsorption of linear k-mers (particles occupying k adjacent sites) onto a square lattice is studied by means of a Monte Carlo simulation. The k-mers are deposited on the substrate until a jamming state is reached. The presence of defects in the lattice (impurities) and of defects in the k-mers with concentrations of dl and dk, respectively, is assumed. The defects in the lattice are distributed randomly before deposition and these lattice sites are forbidden for the deposition of k-mers. The defects of the k-mers are distributed randomly on the deposited k-mers. The sites filled with k-mers have high electrical conductivity, σk, whereas the empty sites, and the sites filled by either types of defect have a low electrical conductivity, σl, i.e., a high-contrast, σk /σl ≫ 1, is assumed. We examined isotropic (both the possible x and y orientations of a particle are equiprobable) and anisotropic (all particles are aligned along one given direction, y) deposition. To calculate the effective electrical conductivity, the monolayer was presented as a random resistor network and the Frank-Lobb algorithm was used. The effects of the concentrations of defects dl and dk on the electrical conductivity for the values of k =2n, where n = 1 , 2 , … , 5, were studied. Increase of both the dl and dk parameters values resulted in decreases in the value of σ and the suppression of percolation. Moreover, for anisotropic deposition the electrical conductivity along the y direction was noticeably larger than in the perpendicular direction, x. Phase diagrams in the (dl ,dk)-plane for different values of k were obtained.
Gene organization in rice revealed by full-length cDNA mapping and gene expression analysis through microarray.

Directory of Open Access Journals (Sweden)

Kouji Satoh

Full Text Available Rice (Oryza sativa L. is a model organism for the functional genomics of monocotyledonous plants since the genome size is considerably smaller than those of other monocotyledonous plants. Although highly accurate genome sequences of indica and japonica rice are available, additional resources such as full-length complementary DNA (FL-cDNA sequences are also indispensable for comprehensive analyses of gene structure and function. We cross-referenced 28.5K individual loci in the rice genome defined by mapping of 578K FL-cDNA clones with the 56K loci predicted in the TIGR genome assembly. Based on the annotation status and the presence of corresponding cDNA clones, genes were classified into 23K annotated expressed (AE genes, 33K annotated non-expressed (ANE genes, and 5.5K non-annotated expressed (NAE genes. We developed a 60mer oligo-array for analysis of gene expression from each locus. Analysis of gene structures and expression levels revealed that the general features of gene structure and expression of NAE and ANE genes were considerably different from those of AE genes. The results also suggested that the cloning efficiency of rice FL-cDNA is associated with the transcription activity of the corresponding genetic locus, although other factors may also have an effect. Comparison of the coverage of FL-cDNA among gene families suggested that FL-cDNA from genes encoding rice- or eukaryote-specific domains, and those involved in regulatory functions were difficult to produce in bacterial cells. Collectively, these results indicate that rice genes can be divided into distinct groups based on transcription activity and gene structure, and that the coverage bias of FL-cDNA clones exists due to the incompatibility of certain eukaryotic genes in bacteria.
MicroRNA categorization using sequence motifs and k-mers.

Science.gov (United States)

Yousef, Malik; Khalifa, Waleed; Acar, İlhan Erkin; Allmer, Jens

2017-03-14

Post-transcriptional gene dysregulation can be a hallmark of diseases like cancer and microRNAs (miRNAs) play a key role in the modulation of translation efficiency. Known pre-miRNAs are listed in miRBase, and they have been discovered in a variety of organisms ranging from viruses and microbes to eukaryotic organisms. The computational detection of pre-miRNAs is of great interest, and such approaches usually employ machine learning to discriminate between miRNAs and other sequences. Many features have been proposed describing pre-miRNAs, and we have previously introduced the use of sequence motifs and k-mers as useful ones. There have been reports of xeno-miRNAs detected via next generation sequencing. However, they may be contaminations and to aid that important decision-making process, we aimed to establish a means to differentiate pre-miRNAs from different species. To achieve distinction into species, we used one species' pre-miRNAs as the positive and another species' pre-miRNAs as the negative training and test data for the establishment of machine learned models based on sequence motifs and k-mers as features. This approach resulted in higher accuracy values between distantly related species while species with closer relation produced lower accuracy values. We were able to differentiate among species with increasing success when the evolutionary distance increases. This conclusion is supported by previous reports of fast evolutionary changes in miRNAs since even in relatively closely related species a fairly good discrimination was possible.
Genomic signal processing for DNA sequence clustering.

Science.gov (United States)

Mendizabal-Ruiz, Gerardo; Román-Godínez, Israel; Torres-Ramos, Sulema; Salido-Ruiz, Ricardo A; Vélez-Pérez, Hugo; Morales, J Alejandro

2018-01-01

Genomic signal processing (GSP) methods which convert DNA data to numerical values have recently been proposed, which would offer the opportunity of employing existing digital signal processing methods for genomic data. One of the most used methods for exploring data is cluster analysis which refers to the unsupervised classification of patterns in data. In this paper, we propose a novel approach for performing cluster analysis of DNA sequences that is based on the use of GSP methods and the K-means algorithm. We also propose a visualization method that facilitates the easy inspection and analysis of the results and possible hidden behaviors. Our results support the feasibility of employing the proposed method to find and easily visualize interesting features of sets of DNA data.
Characterization and immunological identification of cDNA clones encoding two human DNA topoisomerase II isozymes

International Nuclear Information System (INIS)

Chung, T.D.Y.; Drake, F.H.; Tan, K.B.; Per, S.R.; Crooke, S.T.; Mirabelli, C.K.

1989-01-01

Several DNA topoisomerase II partial cDNA clones obtained from a human Raji-HN2 cDNA library were sequenced and two classes of nucleotide sequences were found. One member of the first class, SP1, was identical to an internal fragment of human HeLa cell Topo II cDNA described earlier. A member of the second class, SP11, shared extensive nucleotide (75%) and predicted peptide (92%) sequence similarities with the first two-thirds of HeLa Topo II. Each class of cDNAs hybridized to unique, nonoverlapping restriction enzyme fragments of genomic DNA from several human cell lines. Synthetic 24-mer oligonucleotide probes specific for each cDNA class hybridized to 6.5-kilobase mRNAs; furthermore, hybridization of probe specific for one class was not blocked by probe specific for the other. Antibodies raised against a synthetic SP1-encoded dodecapeptide specifically recognized the 170-kDa form of Topo II, while antibodies raised against the corresponding SP11-encoded dodecapeptide, or a second unique SP11-encoded tridecapeptide, selectively recognized the 180-kDa form of Topo II. These data provide genetic and immunochemical evidence for two Topo II isozymes
Barcode server: a visualization-based genome analysis system.

Directory of Open Access Journals (Sweden)

Fenglou Mao

Full Text Available We have previously developed a computational method for representing a genome as a barcode image, which makes various genomic features visually apparent. We have demonstrated that this visual capability has made some challenging genome analysis problems relatively easy to solve. We have applied this capability to a number of challenging problems, including (a identification of horizontally transferred genes, (b identification of genomic islands with special properties and (c binning of metagenomic sequences, and achieved highly encouraging results. These application results inspired us to develop this barcode-based genome analysis server for public service, which supports the following capabilities: (a calculation of the k-mer based barcode image for a provided DNA sequence; (b detection of sequence fragments in a given genome with distinct barcodes from those of the majority of the genome, (c clustering of provided DNA sequences into groups having similar barcodes; and (d homology-based search using Blast against a genome database for any selected genomic regions deemed to have interesting barcodes. The barcode server provides a job management capability, allowing processing of a large number of analysis jobs for barcode-based comparative genome analyses. The barcode server is accessible at http://csbl1.bmb.uga.edu/Barcode.
Study on the relationship between DNA-PKcs and genomic instability and hyper-radiosensitivity

International Nuclear Information System (INIS)

Yang Kang; Zhu Jiayun; Ding Nan; Li Junhong; Hu Wentao; Su Fengtao; He Jinpeng; Li Sha

2010-01-01

To investigate the relationship between DNA-PKcs and genome instability and hyper-radiosensitivity, human glioma cell lines M059K and M059J, as a model expressing wild-type DNA-PKcs and a model defective in DNA-PKcs activity, were exposed to low doses of X-rays. Cells survival fractions were assessed by colony-forming assay and Cytochalasin-B micronucleus assay was employed to detect the genomic instability happening in each single irradiated colony. It has been found that as the post-incubation time increased, M059K cells expressing wild-type DNA-PKcs exhibited low-dose hyper-radiosensitivity and showed a similar genomic instability after 0.2 Gy and 0.6 Gy irradiations, but the M059J cells lacking in DNA-PKcs didn't present low-dose hyper-radiosensitivity and showed a higher genomic instability of 0.6 Gy than that of 0.2 Gy. The results indicate that DNA-PKcs may act as one of the key factors that lead to low-dose hyper-radiosensitivity. (authors)
Probabilistic topic modeling for the analysis and classification of genomic sequences

Science.gov (United States)

2015-01-01

Background Studies on genomic sequences for classification and taxonomic identification have a leading role in the biomedical field and in the analysis of biodiversity. These studies are focusing on the so-called barcode genes, representing a well defined region of the whole genome. Recently, alignment-free techniques are gaining more importance because they are able to overcome the drawbacks of sequence alignment techniques. In this paper a new alignment-free method for DNA sequences clustering and classification is proposed. The method is based on k-mers representation and text mining techniques. Methods The presented method is based on Probabilistic Topic Modeling, a statistical technique originally proposed for text documents. Probabilistic topic models are able to find in a document corpus the topics (recurrent themes) characterizing classes of documents. This technique, applied on DNA sequences representing the documents, exploits the frequency of fixed-length k-mers and builds a generative model for a training group of sequences. This generative model, obtained through the Latent Dirichlet Allocation (LDA) algorithm, is then used to classify a large set of genomic sequences. Results and conclusions We performed classification of over 7000 16S DNA barcode sequences taken from Ribosomal Database Project (RDP) repository, training probabilistic topic models. The proposed method is compared to the RDP tool and Support Vector Machine (SVM) classification algorithm in a extensive set of trials using both complete sequences and short sequence snippets (from 400 bp to 25 bp). Our method reaches very similar results to RDP classifier and SVM for complete sequences. The most interesting results are obtained when short sequence snippets are considered. In these conditions the proposed method outperforms RDP and SVM with ultra short sequences and it exhibits a smooth decrease of performance, at every taxonomic level, when the sequence length is decreased. PMID:25916734

Local chromatin structure of heterochromatin regulates repeated DNA stability, nucleolus structure, and genome integrity

Energy Technology Data Exchange (ETDEWEB)

Peng, Jamy C. [Univ. of California, Berkeley, CA (United States)

2007-01-01

Heterochromatin constitutes a significant portion of the genome in higher eukaryotes; approximately 30% in Drosophila and human. Heterochromatin contains a high repeat DNA content and a low density of protein-encoding genes. In contrast, euchromatin is composed mostly of unique sequences and contains the majority of single-copy genes. Genetic and cytological studies demonstrated that heterochromatin exhibits regulatory roles in chromosome organization, centromere function and telomere protection. As an epigenetically regulated structure, heterochromatin formation is not defined by any DNA sequence consensus. Heterochromatin is characterized by its association with nucleosomes containing methylated-lysine 9 of histone H3 (H3K9me), heterochromatin protein 1 (HP1) that binds H3K9me, and Su(var)3-9, which methylates H3K9 and binds HP1. Heterochromatin formation and functions are influenced by HP1, Su(var)3-9, and the RNA interference (RNAi) pathway. My thesis project investigates how heterochromatin formation and function impact nuclear architecture, repeated DNA organization, and genome stability in Drosophila melanogaster. H3K9me-based chromatin reduces extrachromosomal DNA formation; most likely by restricting the access of repair machineries to repeated DNAs. Reducing extrachromosomal ribosomal DNA stabilizes rDNA repeats and the nucleolus structure. H3K9me-based chromatin also inhibits DNA damage in heterochromatin. Cells with compromised heterochromatin structure, due to Su(var)3-9 or dcr-2 (a component of the RNAi pathway) mutations, display severe DNA damage in heterochromatin compared to wild type. In these mutant cells, accumulated DNA damage leads to chromosomal defects such as translocations, defective DNA repair response, and activation of the G2-M DNA repair and mitotic checkpoints that ensure cellular and animal viability. My thesis research suggests that DNA replication, repair, and recombination mechanisms in heterochromatin differ from those in
When whole-genome alignments just won't work: kSNP v2 software for alignment-free SNP discovery and phylogenetics of hundreds of microbial genomes.

Science.gov (United States)

Gardner, Shea N; Hall, Barry G

2013-01-01

Effective use of rapid and inexpensive whole genome sequencing for microbes requires fast, memory efficient bioinformatics tools for sequence comparison. The kSNP v2 software finds single nucleotide polymorphisms (SNPs) in whole genome data. kSNP v2 has numerous improvements over kSNP v1 including SNP gene annotation; better scaling for draft genomes available as assembled contigs or raw, unassembled reads; a tool to identify the optimal value of k; distribution of packages of executables for Linux and Mac OS X for ease of installation and user-friendly use; and a detailed User Guide. SNP discovery is based on k-mer analysis, and requires no multiple sequence alignment or the selection of a single reference genome. Most target sets with hundreds of genomes complete in minutes to hours. SNP phylogenies are built by maximum likelihood, parsimony, and distance, based on all SNPs, only core SNPs, or SNPs present in some intermediate user-specified fraction of targets. The SNP-based trees that result are consistent with known taxonomy. kSNP v2 can handle many gigabases of sequence in a single run, and if one or more annotated genomes are included in the target set, SNPs are annotated with protein coding and other information (UTRs, etc.) from Genbank file(s). We demonstrate application of kSNP v2 on sets of viral and bacterial genomes, and discuss in detail analysis of a set of 68 finished E. coli and Shigella genomes and a set of the same genomes to which have been added 47 assemblies and four "raw read" genomes of H104:H4 strains from the recent European E. coli outbreak that resulted in both bloody diarrhea and hemolytic uremic syndrome (HUS), and caused at least 50 deaths.
A comparison of alternative 60-mer probe designs in an in-situ synthesized oligonucleotide microarray

Directory of Open Access Journals (Sweden)

Fairbanks Benjamin D

2006-04-01

Full Text Available Abstract Background DNA microarrays have proven powerful for functional genomics studies. Several technologies exist for the generation of whole-genome arrays. It is well documented that 25mer probes directed against different regions of the same gene produce variable signal intensity values. However, the extent to which this is true for probes of greater length (60mers is not well characterized. Moreover, this information has not previously been reported for whole-genome arrays designed against bacteria, whose genomes may differ substantially in characteristics directly affecting microarray performance. Results We report here an analysis of alternative 60mer probe designs for an in-situ synthesized oligonucleotide array for the GC rich, β-proteobacterium Burkholderia cenocepacia. Probes were designed using the ArrayOligoSel3.5 software package and whole-genome microarrays synthesized by Agilent, Inc. using their in-situ, ink-jet technology platform. We first validated the quality of the microarrays as demonstrated by an average signal to noise ratio of >1000. Next, we determined that the variance of replicate probes (1178 total probes examined of identical sequence was 3.8% whereas the variance of alternative probes (558 total alternative probes examined designs was 9.5%. We determined that depending upon the definition, about 2.4% of replicate and 7.8% of alternative probes produced outlier conclusions. Finally, we determined none of the probe design subscores (GC content, internal repeat, binding energy and self annealment produced by ArrayOligoSel3.5 were predictive or probes that produced outlier signals. Conclusion Our analysis demonstrated that the use of multiple probes per target sequence is not essential for in-situ synthesized 60mer oligonucleotide arrays designed against bacteria. Although probes producing outlier signals were identified, the use of ratios results in less than 10% of such outlier conclusions. We also determined that
The Caulobacter crescentus phage phiCbK: genomics of a canonical phage

Directory of Open Access Journals (Sweden)

Gill Jason J

2012-10-01

Full Text Available Abstract Background The bacterium Caulobacter crescentus is a popular model for the study of cell cycle regulation and senescence. The large prolate siphophage phiCbK has been an important tool in C. crescentus biology, and has been studied in its own right as a model for viral morphogenesis. Although a system of some interest, to date little genomic information is available on phiCbK or its relatives. Results Five novel phiCbK-like C. crescentus bacteriophages, CcrMagneto, CcrSwift, CcrKarma, CcrRogue and CcrColossus, were isolated from the environment. The genomes of phage phiCbK and these five environmental phage isolates were obtained by 454 pyrosequencing. The phiCbK-like phage genomes range in size from 205 kb encoding 318 proteins (phiCbK to 280 kb encoding 448 proteins (CcrColossus, and were found to contain nonpermuted terminal redundancies of 10 to 17 kb. A novel method of terminal ligation was developed to map genomic termini, which confirmed termini predicted by coverage analysis. This suggests that sequence coverage discontinuities may be useable as predictors of genomic termini in phage genomes. Genomic modules encoding virion morphogenesis, lysis and DNA replication proteins were identified. The phiCbK-like phages were also found to encode a number of intriguing proteins; all contain a clearly T7-like DNA polymerase, and five of the six encode a possible homolog of the C. crescentus cell cycle regulator GcrA, which may allow the phage to alter the host cell’s replicative state. The structural proteome of phage phiCbK was determined, identifying the portal, major and minor capsid proteins, the tail tape measure and possible tail fiber proteins. All six phage genomes are clearly related; phiCbK, CcrMagneto, CcrSwift, CcrKarma and CcrRogue form a group related at the DNA level, while CcrColossus is more diverged but retains significant similarity at the protein level. Conclusions Due to their lack of any apparent relationship to
Accurate DNA assembly and genome engineering with optimized uracil excision cloning

DEFF Research Database (Denmark)

Cavaleiro, Mafalda; Kim, Se Hyeuk; Seppala, Susanna

2015-01-01

Simple and reliable DNA editing by uracil excision (a.k.a. USER cloning) has been described by several research groups, but the optimal design of cohesive DNA ends for multigene assembly remains elusive. Here, we use two model constructs based on expression of gfp and a four-gene pathway that pro......Simple and reliable DNA editing by uracil excision (a.k.a. USER cloning) has been described by several research groups, but the optimal design of cohesive DNA ends for multigene assembly remains elusive. Here, we use two model constructs based on expression of gfp and a four-gene pathway...... that produces β-carotene to optimize assembly junctions and the uracil excision protocol. By combining uracil excision cloning with a genomic integration technology, we demonstrate that up to six DNA fragments can be assembled in a one-tube reaction for direct genome integration with high accuracy, greatly...... facilitating the advanced engineering of robust cell factories....
DNA Barcoding: Amplification and sequence analysis of rbcl and matK genome regions in three divergent plant species

Directory of Open Access Journals (Sweden)

Javed Iqbal Wattoo

2016-11-01

Full Text Available Background: DNA barcoding is a novel method of species identification based on nucleotide diversity of conserved sequences. The establishment and refining of plant DNA barcoding systems is more challenging due to high genetic diversity among different species. Therefore, targeting the conserved nuclear transcribed regions would be more reliable for plant scientists to reveal genetic diversity, species discrimination and phylogeny. Methods: In this study, we amplified and sequenced the chloroplast DNA regions (matk+rbcl of Solanum nigrum, Euphorbia helioscopia and Dalbergia sissoo to study the functional annotation, homology modeling and sequence analysis to allow a more efficient utilization of these sequences among different plant species. These three species represent three families; Solanaceae, Euphorbiaceae and Fabaceae respectively. Biological sequence homology and divergence of amplified sequences was studied using Basic Local Alignment Tool (BLAST. Results: Both primers (matk+rbcl showed good amplification in three species. The sequenced regions reveled conserved genome information for future identification of different medicinal plants belonging to these species. The amplified conserved barcodes revealed different levels of biological homology after sequence analysis. The results clearly showed that the use of these conserved DNA sequences as barcode primers would be an accurate way for species identification and discrimination. Conclusion: The amplification and sequencing of conserved genome regions identified a novel sequence of matK in native species of Solanum nigrum. The findings of the study would be applicable in medicinal industry to establish DNA based identification of different medicinal plant species to monitor adulteration.
Links between DNA methylation and nucleosome occupancy in the human genome.

Science.gov (United States)

Collings, Clayton K; Anderson, John N

2017-01-01

DNA methylation is an epigenetic modification that is enriched in heterochromatin but depleted at active promoters and enhancers. However, the debate on whether or not DNA methylation is a reliable indicator of high nucleosome occupancy has not been settled. For example, the methylation levels of DNA flanking CTCF sites are higher in linker DNA than in nucleosomal DNA, while other studies have shown that the nucleosome core is the preferred site of methylation. In this study, we make progress toward understanding these conflicting phenomena by implementing a bioinformatics approach that combines MNase-seq and NOMe-seq data and by comprehensively profiling DNA methylation and nucleosome occupancy throughout the human genome. The results demonstrated that increasing methylated CpG density is correlated with nucleosome occupancy in the total genome and within nearly all subgenomic regions. Features with elevated methylated CpG density such as exons, SINE-Alu sequences, H3K36-trimethylated peaks, and methylated CpG islands are among the highest nucleosome occupied elements in the genome, while some of the lowest occupancies are displayed by unmethylated CpG islands and unmethylated transcription factor binding sites. Additionally, outside of CpG islands, the density of CpGs within nucleosomes was shown to be important for the nucleosomal location of DNA methylation with low CpG frequencies favoring linker methylation and high CpG frequencies favoring core particle methylation. Prominent exceptions to the correlations between methylated CpG density and nucleosome occupancy include CpG islands marked by H3K27me3 and CpG-poor heterochromatin marked by H3K9me3, and these modifications, along with DNA methylation, distinguish the major silencing mechanisms of the human epigenome. Thus, the relationship between DNA methylation and nucleosome occupancy is influenced by the density of methylated CpG dinucleotides and by other epigenomic components in chromatin.
TAREAN: a computational tool for identification and characterization of satellite DNA from unassembled short reads.

Science.gov (United States)

Novák, Petr; Ávila Robledillo, Laura; Koblížková, Andrea; Vrbová, Iva; Neumann, Pavel; Macas, Jirí

2017-07-07

Satellite DNA is one of the major classes of repetitive DNA, characterized by tandemly arranged repeat copies that form contiguous arrays up to megabases in length. This type of genomic organization makes satellite DNA difficult to assemble, which hampers characterization of satellite sequences by computational analysis of genomic contigs. Here, we present tandem repeat analyzer (TAREAN), a novel computational pipeline that circumvents this problem by detecting satellite repeats directly from unassembled short reads. The pipeline first employs graph-based sequence clustering to identify groups of reads that represent repetitive elements. Putative satellite repeats are subsequently detected by the presence of circular structures in their cluster graphs. Consensus sequences of repeat monomers are then reconstructed from the most frequent k-mers obtained by decomposing read sequences from corresponding clusters. The pipeline performance was successfully validated by analyzing low-pass genome sequencing data from five plant species where satellite DNA was previously experimentally characterized. Moreover, novel satellite repeats were predicted for the genome of Vicia faba and three of these repeats were verified by detecting their sequences on metaphase chromosomes using fluorescence in situ hybridization. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Metagenome Fragment Classification Using -Mer Frequency Profiles

Directory of Open Access Journals (Sweden)

Gail Rosen

2008-01-01

Full Text Available A vast amount of microbial sequencing data is being generated through large-scale projects in ecology, agriculture, and human health. Efficient high-throughput methods are needed to analyze the mass amounts of metagenomic data, all DNA present in an environmental sample. A major obstacle in metagenomics is the inability to obtain accuracy using technology that yields short reads. We construct the unique -mer frequency profiles of 635 microbial genomes publicly available as of February 2008. These profiles are used to train a naive Bayes classifier (NBC that can be used to identify the genome of any fragment. We show that our method is comparable to BLAST for small 25 bp fragments but does not have the ambiguity of BLAST's tied top scores. We demonstrate that this approach is scalable to identify any fragment from hundreds of genomes. It also performs quite well at the strain, species, and genera levels and achieves strain resolution despite classifying ubiquitous genomic fragments (gene and nongene regions. Cross-validation analysis demonstrates that species-accuracy achieves 90% for highly-represented species containing an average of 8 strains. We demonstrate that such a tool can be used on the Sargasso Sea dataset, and our analysis shows that NBC can be further enhanced.
Identification of Poxvirus Genome Uncoating and DNA Replication Factors with Mutually Redundant Roles.

Science.gov (United States)

Liu, Baoming; Panda, Debasis; Mendez-Rios, Jorge D; Ganesan, Sundar; Wyatt, Linda S; Moss, Bernard

2018-04-01

Genome uncoating is essential for replication of most viruses. For poxviruses, the process is divided into two stages: removal of the envelope, allowing early gene expression, and breaching of the core wall, allowing DNA release, replication, and late gene expression. Subsequent studies showed that the host proteasome and the viral D5 protein, which has an essential role in DNA replication, are required for vaccinia virus (VACV) genome uncoating. In a search for additional VACV uncoating proteins, we noted a report that described a defect in DNA replication and late expression when the gene encoding a 68-kDa ankyrin repeat/F-box protein (68k-ank), associated with the cellular SCF (Skp1, cullin1, F-box-containing complex) ubiquitin ligase complex, was deleted from the attenuated modified vaccinia virus Ankara (MVA). Here we showed that the 68k-ank deletion mutant exhibited diminished genome uncoating, formation of DNA prereplication sites, and degradation of viral cores as well as an additional, independent defect in DNA synthesis. Deletion of the 68k-ank homolog of VACV strain WR, however, was without effect, suggesting the existence of compensating genes. By inserting VACV genes into an MVA 68k-ank deletion mutant, we discovered that M2, a member of the poxvirus immune evasion (PIE) domain superfamily and a regulator of NF-κB, and C5, a member of the BTB/Kelch superfamily associated with cullin-3-based ligase complexes, independently rescued the 68k-ank deletion phenotype. Thus, poxvirus uncoating and DNA replication are intertwined processes involving at least three viral proteins with mutually redundant functions in addition to D5. IMPORTANCE Poxviruses comprise a family of large DNA viruses that infect vertebrates and invertebrates and cause diseases of medical and zoological importance. Poxviruses, unlike most other DNA viruses, replicate in the cytoplasm, and their large genomes usually encode 200 or more proteins with diverse functions. About 90 genes may
Discovery of novel bat coronaviruses in south China that use the same receptor as MERS coronavirus.

Science.gov (United States)

Luo, Chu-Ming; Wang, Ning; Yang, Xing-Lou; Liu, Hai-Zhou; Zhang, Wei; Li, Bei; Hu, Ben; Peng, Cheng; Geng, Qi-Bin; Zhu, Guang-Jian; Li, Fang; Shi, Zheng-Li

2018-04-18

Middle East respiratory syndrome coronavirus (MERS-CoV) has represented a human health threat since 2012. Although several MERS-related CoVs, which belong to the same species as MERS-CoV, have been identified from bats, they do not use the MERS-CoV receptor, dipeptidyl peptidase 4 (DPP4). Here, we screened 1059 bat samples from at least 30 bat species collected in different regions in south China and identified 89 strains of lineage C betacoronaviruses, including Tylonycteris pachypus HKU4 , Pipistrellus pipistrellus HKU5, and MERS-related CoVs. We sequenced the full-length genomes of two positive samples collected from the great evening bat, Ia io , from Guangdong Province. The two genomes were highly similar and exhibited genomic structures identical to those of other lineage C betacoronaviruses. While they exhibited genome-wide nucleotide identities of only 75.3 to 81.2% with other MERS-related CoVs, their gene-coding regions were highly similar to their counterparts, except in the case of the spike proteins. Further protein--protein interaction assays demonstrated that the spike proteins of these MERS-related CoVs bind to the receptor DPP4. Recombination analysis suggested that the newly discovered MERS-related CoVs might have acquired their spike genes from a DPP4-recognizing bat HKU4. Our study provides further evidence that bats represent the evolutionary origins of MERS-CoV. IMPORTANCE Previous studies suggested that the Middle East respiratory syndrome coronavirus (MERS-CoV) may have originated in bats. However, its evolutionary path from bats to humans remains unclear. In this study, we discovered 89 novel lineage C betacoronaviruses (BetaCoVs) in eight bat species. We provide the evidence of a MERS-related CoV derived from the great evening bat that uses the same host receptor as human MERS-CoV. This virus also provides evidence for a natural recombination event between the bat MERS-related CoV and another bat coronavirus HKU4. Our study expands the host
Transcription Restores DNA Repair to Heterochromatin, Determining Regional Mutation Rates in Cancer Genomes

Directory of Open Access Journals (Sweden)

Christina L. Zheng

2014-11-01

Full Text Available Somatic mutations in cancer are more frequent in heterochromatic and late-replicating regions of the genome. We report that regional disparities in mutation density are virtually abolished within transcriptionally silent genomic regions of cutaneous squamous cell carcinomas (cSCCs arising in an XPC−/− background. XPC−/− cells lack global genome nucleotide excision repair (GG-NER, thus establishing differential access of DNA repair machinery within chromatin-rich regions of the genome as the primary cause for the regional disparity. Strikingly, we find that increasing levels of transcription reduce mutation prevalence on both strands of gene bodies embedded within H3K9me3-dense regions, and only to those levels observed in H3K9me3-sparse regions, also in an XPC-dependent manner. Therefore, transcription appears to reduce mutation prevalence specifically by relieving the constraints imposed by chromatin structure on DNA repair. We model this relationship among transcription, chromatin state, and DNA repair, revealing a new, personalized determinant of cancer risk.
DOT1L and H3K79 Methylation in Transcription and Genomic Stability.

Science.gov (United States)

Wood, Katherine; Tellier, Michael; Murphy, Shona

2018-02-27

The organization of eukaryotic genomes into chromatin provides challenges for the cell to accomplish basic cellular functions, such as transcription, DNA replication and repair of DNA damage. Accordingly, a range of proteins modify and/or read chromatin states to regulate access to chromosomal DNA. Yeast Dot1 and the mammalian homologue DOT1L are methyltransferases that can add up to three methyl groups to histone H3 lysine 79 (H3K79). H3K79 methylation is implicated in several processes, including transcription elongation by RNA polymerase II, the DNA damage response and cell cycle checkpoint activation. DOT1L is also an important drug target for treatment of mixed lineage leukemia (MLL)-rearranged leukemia where aberrant transcriptional activation is promoted by DOT1L mislocalisation. This review summarizes what is currently known about the role of Dot1/DOT1L and H3K79 methylation in transcription and genomic stability.
DOT1L and H3K79 Methylation in Transcription and Genomic Stability

Directory of Open Access Journals (Sweden)

Katherine Wood

2018-02-01

Full Text Available The organization of eukaryotic genomes into chromatin provides challenges for the cell to accomplish basic cellular functions, such as transcription, DNA replication and repair of DNA damage. Accordingly, a range of proteins modify and/or read chromatin states to regulate access to chromosomal DNA. Yeast Dot1 and the mammalian homologue DOT1L are methyltransferases that can add up to three methyl groups to histone H3 lysine 79 (H3K79. H3K79 methylation is implicated in several processes, including transcription elongation by RNA polymerase II, the DNA damage response and cell cycle checkpoint activation. DOT1L is also an important drug target for treatment of mixed lineage leukemia (MLL-rearranged leukemia where aberrant transcriptional activation is promoted by DOT1L mislocalisation. This review summarizes what is currently known about the role of Dot1/DOT1L and H3K79 methylation in transcription and genomic stability.
MICA: desktop software for comprehensive searching of DNA databases

Directory of Open Access Journals (Sweden)

Glick Benjamin S

2006-10-01

Full Text Available Abstract Background Molecular biologists work with DNA databases that often include entire genomes. A common requirement is to search a DNA database to find exact matches for a nondegenerate or partially degenerate query. The software programs available for such purposes are normally designed to run on remote servers, but an appealing alternative is to work with DNA databases stored on local computers. We describe a desktop software program termed MICA (K-Mer Indexing with Compact Arrays that allows large DNA databases to be searched efficiently using very little memory. Results MICA rapidly indexes a DNA database. On a Macintosh G5 computer, the complete human genome could be indexed in about 5 minutes. The indexing algorithm recognizes all 15 characters of the DNA alphabet and fully captures the information in any DNA sequence, yet for a typical sequence of length L, the index occupies only about 2L bytes. The index can be searched to return a complete list of exact matches for a nondegenerate or partially degenerate query of any length. A typical search of a long DNA sequence involves reading only a small fraction of the index into memory. As a result, searches are fast even when the available RAM is limited. Conclusion MICA is suitable as a search engine for desktop DNA analysis software.
Functional role of a highly repetitive DNA sequence in anchorage of the mouse genome.

Science.gov (United States)

Neuer-Nitsche, B; Lu, X N; Werner, D

1988-09-12

The major portion of the eukaryotic genome consists of various categories of repetitive DNA sequences which have been studied with respect to their base compositions, organizations, copy numbers, transcription and species specificities; their biological roles, however, are still unclear. A novel quality of a highly repetitive mouse DNA sequence is described which points to a functional role: All copies (approximately 50,000 per haploid genome) of this DNA sequence reside on genomic Alu I DNA fragments each associated with nuclear polypeptides that are not released from DNA by proteinase K, SDS and phenol extraction. By this quality the repetitive DNA sequence is classified as a member of the sub-set of DNA sequences involved in tight DNA-polypeptide complexes which have been previously shown to be components of the subnuclear structure termed 'nuclear matrix'. From these results it has to be concluded that the repetitive DNA sequence characterized in this report represents or comprises a signal for a large number of site specific attachment points of the mouse genome in the nuclear matrix.
Towards accurate de novo assembly for genomes with repeats

NARCIS (Netherlands)

Bucur, Doina

2017-01-01

De novo genome assemblers designed for short k-mer length or using short raw reads are unlikely to recover complex features of the underlying genome, such as repeats hundreds of bases long. We implement a stochastic machine-learning method which obtains accurate assemblies with repeats and
A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction.

Science.gov (United States)

Guo, Yuchun; Tian, Kevin; Zeng, Haoyang; Guo, Xiaoyun; Gifford, David Kenneth

2018-04-13

The representation and discovery of transcription factor (TF) sequence binding specificities is critical for understanding gene regulatory networks and interpreting the impact of disease-associated noncoding genetic variants. We present a novel TF binding motif representation, the k -mer set memory (KSM), which consists of a set of aligned k -mers that are overrepresented at TF binding sites, and a new method called KMAC for de novo discovery of KSMs. We find that KSMs more accurately predict in vivo binding sites than position weight matrix (PWM) models and other more complex motif models across a large set of ChIP-seq experiments. Furthermore, KSMs outperform PWMs and more complex motif models in predicting in vitro binding sites. KMAC also identifies correct motifs in more experiments than five state-of-the-art motif discovery methods. In addition, KSM-derived features outperform both PWM and deep learning model derived sequence features in predicting differential regulatory activities of expression quantitative trait loci (eQTL) alleles. Finally, we have applied KMAC to 1600 ENCODE TF ChIP-seq data sets and created a public resource of KSM and PWM motifs. We expect that the KSM representation and KMAC method will be valuable in characterizing TF binding specificities and in interpreting the effects of noncoding genetic variations. © 2018 Guo et al.; Published by Cold Spring Harbor Laboratory Press.
Informational laws of genome structures

Science.gov (United States)

Bonnici, Vincenzo; Manca, Vincenzo

2016-06-01

In recent years, the analysis of genomes by means of strings of length k occurring in the genomes, called k-mers, has provided important insights into the basic mechanisms and design principles of genome structures. In the present study, we focus on the proper choice of the value of k for applying information theoretic concepts that express intrinsic aspects of genomes. The value k = lg2(n), where n is the genome length, is determined to be the best choice in the definition of some genomic informational indexes that are studied and computed for seventy genomes. These indexes, which are based on information entropies and on suitable comparisons with random genomes, suggest five informational laws, to which all of the considered genomes obey. Moreover, an informational genome complexity measure is proposed, which is a generalized logistic map that balances entropic and anti-entropic components of genomes and is related to their evolutionary dynamics. Finally, applications to computational synthetic biology are briefly outlined.
Comprehensive evaluation of genome-wide 5-hydroxymethylcytosine profiling approaches in human DNA.

Science.gov (United States)

Skvortsova, Ksenia; Zotenko, Elena; Luu, Phuc-Loi; Gould, Cathryn M; Nair, Shalima S; Clark, Susan J; Stirzaker, Clare

2017-01-01

The discovery that 5-methylcytosine (5mC) can be oxidized to 5-hydroxymethylcytosine (5hmC) by the ten-eleven translocation (TET) proteins has prompted wide interest in the potential role of 5hmC in reshaping the mammalian DNA methylation landscape. The gold-standard bisulphite conversion technologies to study DNA methylation do not distinguish between 5mC and 5hmC. However, new approaches to mapping 5hmC genome-wide have advanced rapidly, although it is unclear how the different methods compare in accurately calling 5hmC. In this study, we provide a comparative analysis on brain DNA using three 5hmC genome-wide approaches, namely whole-genome bisulphite/oxidative bisulphite sequencing (WG Bis/OxBis-seq), Infinium HumanMethylation450 BeadChip arrays coupled with oxidative bisulphite (HM450K Bis/OxBis) and antibody-based immunoprecipitation and sequencing of hydroxymethylated DNA (hMeDIP-seq). We also perform loci-specific TET-assisted bisulphite sequencing (TAB-seq) for validation of candidate regions. We show that whole-genome single-base resolution approaches are advantaged in providing precise 5hmC values but require high sequencing depth to accurately measure 5hmC, as this modification is commonly in low abundance in mammalian cells. HM450K arrays coupled with oxidative bisulphite provide a cost-effective representation of 5hmC distribution, at CpG sites with 5hmC levels >~10%. However, 5hmC analysis is restricted to the genomic location of the probes, which is an important consideration as 5hmC modification is commonly enriched at enhancer elements. Finally, we show that the widely used hMeDIP-seq method provides an efficient genome-wide profile of 5hmC and shows high correlation with WG Bis/OxBis-seq 5hmC distribution in brain DNA. However, in cell line DNA with low levels of 5hmC, hMeDIP-seq-enriched regions are not detected by WG Bis/OxBis or HM450K, either suggesting misinterpretation of 5hmC calls by hMeDIP or lack of sensitivity of the latter methods. We

Isolation and Cloning of mercuric reductase gene (merA from mercury-resistant bacteria

Directory of Open Access Journals (Sweden)

Parisa Khoshniyat

2018-03-01

Full Text Available Introduction: Some of the bacteria having merA gene coding mineral mercury reducing enzyme, has genetic potential of Hg removing via reduction of mineral mercury and transformation of that to gas form and finally bioremediation of polluted area. The aim of this study is the isolation of merA gene from resistance bacteria and cloning of that into suitable expression vector and then the environmental bioremediation by the transformation of bacteria with this vector. Materials and methods: A number of bacteria were collected in contaminated areas with mercury in order to isolate merA genes. Polymerase chain reaction had done on the four bacterial genomes including Klebsiella pneumoniae, Pseudomonas aeruginosa, Serratia marcescens and Escherichia coli using the specific primers in order to detect merA gene. For cloning, the primers containing restriction enzyme sites are used, merA gene was isolated and amplified. The amplified fragments were cloned in the expression vector pET21a+ and via heat shock method were transformed into E. coli TOP10 competent cell. For clustering of genes, Mega software version 4 was used and bioanformatic studies were achieved for predicted enzyme. Results: merA gene with 1686 bp in length was isolated from K pneumoniae and E. coli. Recombinant vectors in transgenic bacteria were confirmed by various methods and finally were confirmed by sequencing. The result of clustering these genes with existence genes in NCBI showed high similarity. Discussion and conclusion: The existence of merA gene in bacteria that adapted to Hg pollution area is because of resistance, so with cloning this gene into suitable expression vector and transformation of susceptible bacteria with this vector ability of resistance to Hg in bacteria for bioremediation could be given.
Comparative genome analysis of novel Podoviruses lytic for hypermucoviscous Klebsiella pneumoniae of K1, K2, and K57 capsular types.

Science.gov (United States)

Solovieva, Ekaterina V; Myakinina, Vera P; Kislichkina, Angelina A; Krasilnikova, Valentina M; Verevkin, Vladimir V; Mochalov, Vladimir V; Lev, Anastasia I; Fursova, Nadezhda K; Volozhantsev, Nikolay V

2018-01-02

Hypermucoviscous (HV) strains of capsular types K1, K2 and K57 are the most virulent representatives of the Klebsiella pneumoniae species. Eight novel bacteriophages lytic for HV K. pneumoniae were isolated and characterized. Three bacteriophages, KpV41, KpV475, and KpV71 were found to have a lytic activity against mainly K. pneumoniae of capsular type K1. Two phages, KpV74, and KpV763 were lytic for K2 capsular type K. pneumoniae, and the phage KpV767 was specific to K57-type K. pneumoniae only. Two more phages, KpV766, and KpV48 had no capsular specificity. The phage genomes consist of a linear double-stranded DNA of 40,395-44,623bp including direct terminal repeats of 180-246 bp. The G + C contents are 52.3-54.2 % that is slightly lower than that of genomes of K. pneumoniae strains being used for phage propagation. According to the genome structures, sequence similarity and phylogenetic data, the phages are classified within the genus Kp32virus and Kp34virus of subfamily Autographivirinae, family Podoviridae. In the phage genomes, genes encoding proteins with putative motifs of polysaccharide depolymerase were identified. Depolymerase genes of phages KpV71 and KpV74 lytic for hypermucoviscous K. pneumoniae of K1 and K2 capsular type, respectively, were cloned and expressed in Escherichia coli, and the recombinant gene products were purified. The specificity and polysaccharide-degrading activity of the recombinant depolymerases were demonstrated. Copyright © 2017 Elsevier B.V. All rights reserved.
Genome-wide mapping of DNA strand breaks.

Directory of Open Access Journals (Sweden)

Frédéric Leduc

Full Text Available Determination of cellular DNA damage has so far been limited to global assessment of genome integrity whereas nucleotide-level mapping has been restricted to specific loci by the use of specific primers. Therefore, only limited DNA sequences can be studied and novel regions of genomic instability can hardly be discovered. Using a well-characterized yeast model, we describe a straightforward strategy to map genome-wide DNA strand breaks without compromising nucleotide-level resolution. This technique, termed "damaged DNA immunoprecipitation" (dDIP, uses immunoprecipitation and the terminal deoxynucleotidyl transferase-mediated dUTP-biotin end-labeling (TUNEL to capture DNA at break sites. When used in combination with microarray or next-generation sequencing technologies, dDIP will allow researchers to map genome-wide DNA strand breaks as well as other types of DNA damage and to establish a clear profiling of altered genes and/or intergenic sequences in various experimental conditions. This mapping technique could find several applications for instance in the study of aging, genotoxic drug screening, cancer, meiosis, radiation and oxidative DNA damage.
MerCat: a versatile k-mer counter and diversity estimator for database-independent property analysis obtained from metagenomic and/or metatranscriptomic sequencing data

Energy Technology Data Exchange (ETDEWEB)

White, Richard A.; Panyala, Ajay R.; Glass, Kevin A.; Colby, Sean M.; Glaesemann, Kurt R.; Jansson, Georg C.; Jansson, Janet K.

2017-02-21

MerCat is a parallel, highly scalable and modular property software package for robust analysis of features in next-generation sequencing data. MerCat inputs include assembled contigs and raw sequence reads from any platform resulting in feature abundance counts tables. MerCat allows for direct analysis of data properties without reference sequence database dependency commonly used by search tools such as BLAST and/or DIAMOND for compositional analysis of whole community shotgun sequencing (e.g. metagenomes and metatranscriptomes).
Investigation of genomic instability by assay of DNA fingerprint from the offspring of male mice exposed to chronic low-level γ-radiation

International Nuclear Information System (INIS)

Bezlepkin, V.G.; Vasil'eva, G.V.; Lomaeva, M.G.; Sirota, N.P.; Gaziev, A.I.

2000-01-01

By polymerase chain reaction with arbitrary primer (AP-PCR), the possibility of transmission of genome instability to somatic cells of the offspring (F 1 generation) from male parents of mice exposed to chronic low-dose γ-radiation was studied. Male mice 15 days after exposure to 10-50 cGy were mated with unirradiated females. Biopsies were taken from tale tips of two month-old mice progeny for DNA separation. Primer in the AP-PCR was 20-mer oligonucleotide flanking the micro-satellite locus Atplb2 on chromosome 11 of the mouse. Comparative analysis of individual fingerprints of AP-PCR products on DNA-templates from the offspring of irradiated and unirradiated male mice revealed an increased variability of micro-satellite-associated sequences in the genome of the offspring of males exposed to 25 and 50 cGy. DNA-fingerprints of the offspring of male mice exposed to chronic irradiation doses 10 and 25 cGy. 15 days before fertilization (at the post-meiotic stage of spermatogenesis) showed an increased frequency of non-parent bands. Result of the study point to the possibility of transmission to the offspring somatic cells of changes increasing genome instability from male parents exposed to chronic low-dose radiation prior to fertilization [ru
[Study of genome instability using DNA fingerprinting of the offspring of male mice subjected to chronic low dose gamma irradiation].

Science.gov (United States)

Bezlepkin, V G; Vasil'eva, G V; Lomaeva, M G; Sirota, N P; Gaziev, A I

2000-01-01

By a polymerase chain reaction with an arbitrary primer (AP-PCR), the possibility of transmission of genome instability to somatic cells of the offspring (F1 generation) from male parents of mice exposed to chronic low-level gamma-radiation was studied. Male BALB/c mice 15 days after exposure to 10-50 cGy were mated with unirradiated females. Biopsies were taken from tale tips of two month-old offspring mice and DNA was isolated. The primer in the AP-PCR was a 20-mer oligonucleotide flanking the microsatellite locus Atp1b2 on chromosome 11 of the mouse. A comparative analysis of individual fingerprints of AP-PCR products on DNA-templates from the offspring of irradiated and unirradiated male mice revealed an increased variability of microsatellite-associated sequences in the genome of the offspring of the males exposed to 25 and 50 cGy. The DNA-fingerprints of the offspring of male mice exposed to chronic irradiation with the doses 10 and 25 cGy 15 days before fertilization (at the post-meiotic stage of spermatogenesis) showed an increased frequency of "non-parent bands". The results of the study point to the possibility of transmission to the offspring somatic cells of changes increasing genome instability from male parents exposed to chronic low-level radiation prior to fertilization.
Evaluation of whole genome sequencing for outbreak detection of Salmonella enterica

DEFF Research Database (Denmark)

Leekitcharoenphon, Pimlapas; Nielsen, Eva M.; Kaas, Rolf Sommer

2014-01-01

Salmonella enterica is a common cause of minor and large food borne outbreaks. To achieve successful and nearly ‘real-time’ monitoring and identification of outbreaks, reliable sub-typing is essential. Whole genome sequencing (WGS) shows great promises for using as a routine epidemiological typing....... Enteritidis and 5 S. Derby were also sequenced and used for comparison. A number of different bioinformatics approaches were applied on the data; including pan-genome tree, k-mer tree, nucleotide difference tree and SNP tree. The outcome of each approach was evaluated in relation to the association...... of the isolates to specific outbreaks. The pan-genome tree clustered 65% of the S. Typhimurium isolates according to the pre-defined epidemiology, the k-mer tree 88%, the nucleotide difference tree 100% and the SNP tree 100% of the strains within S. Typhimurium. The resulting outcome of the four phylogenetic...
Genome instabilities arising from ribonucleotides in DNA.

Science.gov (United States)

Klein, Hannah L

2017-08-01

Genomic DNA is transiently contaminated with ribonucleotide residues during the process of DNA replication through misincorporation by the replicative DNA polymerases α, δ and ε, and by the normal replication process on the lagging strand, which uses RNA primers. These ribonucleotides are efficiently removed during replication by RNase H enzymes and the lagging strand synthesis machinery. However, when ribonucleotides remain in DNA they can distort the DNA helix, affect machineries for DNA replication, transcription and repair, and can stimulate genomic instabilities which are manifest as increased mutation, recombination and chromosome alterations. The genomic instabilities associated with embedded ribonucleotides are considered here, along with a discussion of the origin of the lesions that stimulate particular classes of instabilities. Copyright © 2017 Elsevier B.V. All rights reserved.
The fishes of Genome 10K

KAUST Repository

Bernardi, Giacomo

2012-09-01

The Genome 10K project aims to sequence the genomes of 10,000 vertebrates, representing approximately one genome for each vertebrate genus. Since fishes (cartilaginous fishes, ray-finned fishes and lobe-finned fishes) represent more than 50% of extant vertebrates, it is planned to target 4,000 fish genomes. At present, nearly 60 fish genomes are being sequenced at various public funded labs, and under a Genome 10K and BGI pilot project. An additional 100 fishes have been identified for sequencing in the next phase of Genome 10K project. © 2012 Elsevier B.V.
The fishes of Genome 10K

KAUST Repository

Bernardi, Giacomo; Wiley, Edward O.; Mansour, Hicham; Miller, Michael R.; Ortí , Guillermo; Haussler, David H.; O'Brien, Stephen J O; Ryder, Oliver A.; Venkatesh, Byrappa

2012-01-01

The Genome 10K project aims to sequence the genomes of 10,000 vertebrates, representing approximately one genome for each vertebrate genus. Since fishes (cartilaginous fishes, ray-finned fishes and lobe-finned fishes) represent more than 50% of extant vertebrates, it is planned to target 4,000 fish genomes. At present, nearly 60 fish genomes are being sequenced at various public funded labs, and under a Genome 10K and BGI pilot project. An additional 100 fishes have been identified for sequencing in the next phase of Genome 10K project. © 2012 Elsevier B.V.
Predicting DNA Methylation State of CpG Dinucleotide Using Genome Topological Features and Deep Networks.

Science.gov (United States)

Wang, Yiheng; Liu, Tong; Xu, Dong; Shi, Huidong; Zhang, Chaoyang; Mo, Yin-Yuan; Wang, Zheng

2016-01-22

The hypo- or hyper-methylation of the human genome is one of the epigenetic features of leukemia. However, experimental approaches have only determined the methylation state of a small portion of the human genome. We developed deep learning based (stacked denoising autoencoders, or SdAs) software named "DeepMethyl" to predict the methylation state of DNA CpG dinucleotides using features inferred from three-dimensional genome topology (based on Hi-C) and DNA sequence patterns. We used the experimental data from immortalised myelogenous leukemia (K562) and healthy lymphoblastoid (GM12878) cell lines to train the learning models and assess prediction performance. We have tested various SdA architectures with different configurations of hidden layer(s) and amount of pre-training data and compared the performance of deep networks relative to support vector machines (SVMs). Using the methylation states of sequentially neighboring regions as one of the learning features, an SdA achieved a blind test accuracy of 89.7% for GM12878 and 88.6% for K562. When the methylation states of sequentially neighboring regions are unknown, the accuracies are 84.82% for GM12878 and 72.01% for K562. We also analyzed the contribution of genome topological features inferred from Hi-C. DeepMethyl can be accessed at http://dna.cs.usm.edu/deepmethyl/.
MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

Science.gov (United States)

Ozaki, Haruka; Iwasaki, Wataru

2016-08-01

As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.
Characterization of full-length sequenced cDNA inserts (FLIcs) from Atlantic salmon (Salmo salar)

Science.gov (United States)

Andreassen, Rune; Lunner, Sigbjørn; Høyheim, Bjørn

2009-01-01

Background Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs) are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP), the number of sequences where the full-length of the cDNA insert has been determined has been small. Results High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91%) of the transcripts were annotated using Gene Ontology (GO) terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS). The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS). This suggests that the remaining cDNA
Characterization of full-length sequenced cDNA inserts (FLIcs from Atlantic salmon (Salmo salar

Directory of Open Access Journals (Sweden)

Lunner Sigbjørn

2009-10-01

Full Text Available Abstract Background Sequencing of the Atlantic salmon genome is now being planned by an international research consortium. Full-length sequenced inserts from cDNAs (FLIcs are an important tool for correct annotation and clustering of the genomic sequence in any species. The large amount of highly similar duplicate sequences caused by the relatively recent genome duplication in the salmonid ancestor represents a particular challenge for the genome project. FLIcs will therefore be an extremely useful resource for the Atlantic salmon sequencing project. In addition to be helpful in order to distinguish between duplicate genome regions and in determining correct gene structures, FLIcs are an important resource for functional genomic studies and for investigation of regulatory elements controlling gene expression. In contrast to the large number of ESTs available, including the ESTs from 23 developmental and tissue specific cDNA libraries contributed by the Salmon Genome Project (SGP, the number of sequences where the full-length of the cDNA insert has been determined has been small. Results High quality full-length insert sequences from 560 pre-smolt white muscle tissue specific cDNAs were generated, accession numbers [GenBank: BT043497 - BT044056]. Five hundred and ten (91% of the transcripts were annotated using Gene Ontology (GO terms and 440 of the FLIcs are likely to contain a complete coding sequence (cCDS. The sequence information was used to identify putative paralogs, characterize salmon Kozak motifs, polyadenylation signal variation and to identify motifs likely to be involved in the regulation of particular genes. Finally, conserved 7-mers in the 3'UTRs were identified, of which some were identical to miRNA target sequences. Conclusion This paper describes the first Atlantic salmon FLIcs from a tissue and developmental stage specific cDNA library. We have demonstrated that many FLIcs contained a complete coding sequence (cCDS. This
Human Genome Research: Decoding DNA

Science.gov (United States)

dropdown arrow Site Map A-Z Index Menu Synopsis Human Genome Research: Decoding DNA Resources with of the DNA double helix during April 2003. James D. Watson, Francis Crick, and Maurice Wilkins were company Celera announced the completion of a "working draft" reference DNA sequence of the human
Dideoxynucleoside triphosphate-sensitive DNA polymerase from rice is involved in base excision repair and immunologically similar to mammalian DNA pol beta.

Science.gov (United States)

Sarkar, Sailendra Nath; Bakshi, Sankar; Mokkapati, Sanath K; Roy, Sujit; Sengupta, Dibyendu N

2004-07-16

A single polypeptide with ddNTP-sensitive DNA polymerase activity was purified to near homogeneity from the shoot tips of rice seedlings and analysis of the preparations by SDS-PAGE followed by silver staining showed a polypeptide of 67 kDa size. The DNA polymerase activity was found to be inhibitory by ddNTP in both in vitro DNA polymerase activity assay and activity gel analysis. Aphidicolin, an inhibitor of other types of DNA polymerases, had no effect on plant enzyme. The 67 kDa rice DNA polymerase was found to be recognized by the polyclonal antibody (purified IgG) made against rat DNA polymerase beta (pol beta) both in solution and also on Western blot. The recognition was found to be very specific as the activity of Klenow enzyme was unaffected by the antibody. The ability of rice nuclear extract to correct G:U mismatch of oligo-duplex was observed when oligo-duplex with 32P-labeled lower strand containing U (at 22nd position) was used as substrate. Differential appearance of bands at 21-mer, 22-mer, and 51-mer position in presence of dCTP was visible only with G:U mismatch oligo-duplex, but not with G:C oligo-duplex. While ddCTP or polyclonal antibody against rat-DNA pol beta inhibits base excision repair (BER), aphidicolin had no effect. These results for the first time clearly demonstrate the ability of rice nuclear extract to run BER and the involvement of ddNTP-sensitive pol beta type DNA polymerase. Immunological similarity of the ddNTP-sensitive DNA polymerase beta of rice and rat and its involvement in BER revealed the conservation of structure and function of ddNTP-sensitive DNA pol beta in plant and animal.
The Genomic Pattern of tDNA Operon Expression in E. coli.

Directory of Open Access Journals (Sweden)

2005-06-01

Full Text Available In fast-growing microorganisms, a tRNA concentration profile enriched in major isoacceptors selects for the biased usage of cognate codons. This optimizes translational rate for the least mass invested in the translational apparatus. Such translational streamlining is thought to be growth-regulated, but its genetic basis is poorly understood. First, we found in reanalysis of the E. coli tRNA profile that the degree to which it is translationally streamlined is nearly invariant with growth rate. Then, using least squares multiple regression, we partitioned tRNA isoacceptor pools to predicted tDNA operons from the E. coli K12 genome. Co-expression of tDNAs in operons explains the tRNA profile significantly better than tDNA gene dosage alone. Also, operon expression increases significantly with proximity to the origin of replication, oriC, at all growth rates. Genome location explains about 15% of expression variation in a form, at a given growth rate, that is consistent with replication-dependent gene concentration effects. Yet the change in the tRNA profile with growth rate is less than would be expected from such effects. We estimated per-copy expression rates for all tDNA operons that were consistent with independent estimates for rDNA operons. We also found that tDNA operon location, and the location dependence of expression, were significantly different in the leading and lagging strands. The operonic organization and genomic location of tDNA operons are significant factors influencing their expression. Nonrandom patterns of location and strandedness shown by tDNA operons in E. coli suggest that their genomic architecture may be under selection to satisfy physiological demand for tRNA expression at high growth rates.
On DNA codes from a family of chain rings

Directory of Open Access Journals (Sweden)

Elif Segah Oztas

2017-01-01

Full Text Available In this work, we focus on reversible cyclic codes which correspond to reversible DNA codes or reversible-complement DNA codes over a family of finite chain rings, in an effort to extend what was done by Yildiz and Siap in [20]. The ring family that we have considered are of size $2^{2^k}$, $k=1,2, \\cdots$ and we match each ring element with a DNA $2^{k-1}$-mer. We use the so-called $u^2$-adic digit system to solve the reversibility problem and we characterize cyclic codes that correspond to reversible-complement DNA-codes. We then conclude our study with some examples.
A mouse model for MERS coronavirus-induced acute respiratory distress syndrome.

Science.gov (United States)

Cockrell, Adam S; Yount, Boyd L; Scobey, Trevor; Jensen, Kara; Douglas, Madeline; Beall, Anne; Tang, Xian-Chun; Marasco, Wayne A; Heise, Mark T; Baric, Ralph S

2016-11-28

Middle East respiratory syndrome coronavirus (MERS-CoV) is a novel virus that emerged in 2012, causing acute respiratory distress syndrome (ARDS), severe pneumonia-like symptoms and multi-organ failure, with a case fatality rate of ∼36%. Limited clinical studies indicate that humans infected with MERS-CoV exhibit pathology consistent with the late stages of ARDS, which is reminiscent of the disease observed in patients infected with severe acute respiratory syndrome coronavirus. Models of MERS-CoV-induced severe respiratory disease have been difficult to achieve, and small-animal models traditionally used to investigate viral pathogenesis (mouse, hamster, guinea-pig and ferret) are naturally resistant to MERS-CoV. Therefore, we used CRISPR-Cas9 gene editing to modify the mouse genome to encode two amino acids (positions 288 and 330) that match the human sequence in the dipeptidyl peptidase 4 receptor, making mice susceptible to MERS-CoV infection and replication. Serial MERS-CoV passage in these engineered mice was then used to generate a mouse-adapted virus that replicated efficiently within the lungs and evoked symptoms indicative of severe ARDS, including decreased survival, extreme weight loss, decreased pulmonary function, pulmonary haemorrhage and pathological signs indicative of end-stage lung disease. Importantly, therapeutic countermeasures comprising MERS-CoV neutralizing antibody treatment or a MERS-CoV spike protein vaccine protected the engineered mice against MERS-CoV-induced ARDS.
DNA-PKcs, ATM, and ATR Interplay Maintains Genome Integrity during Neurogenesis.

Science.gov (United States)

Enriquez-Rios, Vanessa; Dumitrache, Lavinia C; Downing, Susanna M; Li, Yang; Brown, Eric J; Russell, Helen R; McKinnon, Peter J

2017-01-25

The DNA damage response (DDR) orchestrates a network of cellular processes that integrates cell-cycle control and DNA repair or apoptosis, which serves to maintain genome stability. DNA-PKcs (the catalytic subunit of the DNA-dependent kinase, encoded by PRKDC), ATM (ataxia telangiectasia, mutated), and ATR (ATM and Rad3-related) are related PI3K-like protein kinases and central regulators of the DDR. Defects in these kinases have been linked to neurodegenerative or neurodevelopmental syndromes. In all cases, the key neuroprotective function of these kinases is uncertain. It also remains unclear how interactions between the three DNA damage-responsive kinases coordinate genome stability, particularly in a physiological context. Here, we used a genetic approach to identify the neural function of DNA-PKcs and the interplay between ATM and ATR during neurogenesis. We found that DNA-PKcs loss in the mouse sensitized neuronal progenitors to apoptosis after ionizing radiation because of excessive DNA damage. DNA-PKcs was also required to prevent endogenous DNA damage accumulation throughout the adult brain. In contrast, ATR coordinated the DDR during neurogenesis to direct apoptosis in cycling neural progenitors, whereas ATM regulated apoptosis in both proliferative and noncycling cells. We also found that ATR controls a DNA damage-induced G 2 /M checkpoint in cortical progenitors, independent of ATM and DNA-PKcs. These nonoverlapping roles were further confirmed via sustained murine embryonic or cortical development after all three kinases were simultaneously inactivated. Thus, our results illustrate how DNA-PKcs, ATM, and ATR have unique and essential roles during the DDR, collectively ensuring comprehensive genome maintenance in the nervous system. The DNA damage response (DDR) is essential for prevention of a broad spectrum of different human neurologic diseases. However, a detailed understanding of the DDR at a physiological level is lacking. In contrast to many in

Cooperative heteroassembly of the adenoviral L4-22K and IVa2 proteins onto the viral packaging sequence DNA.

Science.gov (United States)

Yang, Teng-Chieh; Maluf, Nasib Karl

2012-02-21

Human adenovirus (Ad) is an icosahedral, double-stranded DNA virus. Viral DNA packaging refers to the process whereby the viral genome becomes encapsulated by the viral particle. In Ad, activation of the DNA packaging reaction requires at least three viral components: the IVa2 and L4-22K proteins and a section of DNA within the viral genome, called the packaging sequence. Previous studies have shown that the IVa2 and L4-22K proteins specifically bind to conserved elements within the packaging sequence and that these interactions are absolutely required for the observation of DNA packaging. However, the equilibrium mechanism for assembly of IVa2 and L4-22K onto the packaging sequence has not been determined. Here we characterize the assembly of the IVa2 and L4-22K proteins onto truncated packaging sequence DNA by analytical sedimentation velocity and equilibrium methods. At limiting concentrations of L4-22K, we observe a species with two IVa2 monomers and one L4-22K monomer bound to the DNA. In this species, the L4-22K monomer is promoting positive cooperative interactions between the two bound IVa2 monomers. As L4-22K levels are increased, we observe a species with one IVa2 monomer and three L4-22K monomers bound to the DNA. To explain this result, we propose a model in which L4-22K self-assembly on the DNA competes with IVa2 for positive heterocooperative interactions, destabilizing binding of the second IVa2 monomer. Thus, we propose that L4-22K levels control the extent of cooperativity observed between adjacently bound IVa2 monomers. We have also determined the hydrodynamic properties of all observed stoichiometric species; we observe that species with three L4-22K monomers bound have more extended conformations than species with a single L4-22K bound. We suggest this might reflect a molecular switch that controls insertion of the viral DNA into the capsid.
Genome-wide DNA polymorphism analyses using VariScan

Directory of Open Access Journals (Sweden)

Vilella Albert J

2006-09-01

Full Text Available Abstract Background DNA sequence polymorphisms analysis can provide valuable information on the evolutionary forces shaping nucleotide variation, and provides an insight into the functional significance of genomic regions. The recent ongoing genome projects will radically improve our capabilities to detect specific genomic regions shaped by natural selection. Current available methods and software, however, are unsatisfactory for such genome-wide analysis. Results We have developed methods for the analysis of DNA sequence polymorphisms at the genome-wide scale. These methods, which have been tested on a coalescent-simulated and actual data files from mouse and human, have been implemented in the VariScan software package version 2.0. Additionally, we have also incorporated a graphical-user interface. The main features of this software are: i exhaustive population-genetic analyses including those based on the coalescent theory; ii analysis adapted to the shallow data generated by the high-throughput genome projects; iii use of genome annotations to conduct a comprehensive analyses separately for different functional regions; iv identification of relevant genomic regions by the sliding-window and wavelet-multiresolution approaches; v visualization of the results integrated with current genome annotations in commonly available genome browsers. Conclusion VariScan is a powerful and flexible suite of software for the analysis of DNA polymorphisms. The current version implements new algorithms, methods, and capabilities, providing an important tool for an exhaustive exploratory analysis of genome-wide DNA polymorphism data.
MER SPICE Interface

Science.gov (United States)

Sayfi, Elias

2004-01-01

MER SPICE Interface is a software module for use in conjunction with the Mars Exploration Rover (MER) mission and the SPICE software system of the Navigation and Ancillary Information Facility (NAIF) at NASA's Jet Propulsion Laboratory. (SPICE is used to acquire, record, and disseminate engineering, navigational, and other ancillary data describing circumstances under which data were acquired by spaceborne scientific instruments.) Given a Spacecraft Clock value, MER SPICE Interface extracts MER-specific data from SPICE kernels (essentially, raw data files) and calculates values for Planet Day Number, Local Solar Longitude, Local Solar Elevation, Local Solar Azimuth, and Local Solar Time (UTC). MER SPICE Interface was adapted from a subroutine, denoted m98SpiceIF written by Payam Zamani, that was intended to calculate SPICE values for the Mars Polar Lander. The main difference between MER SPICE Interface and m98SpiceIf is that MER SPICE Interface does not explicitly call CHRONOS, a time-conversion program that is part of a library of utility subprograms within SPICE. Instead, MER SPICE Interface mimics some portions of the CHRONOS code, the advantage being that it executes much faster and can efficiently be called from a pipeline of events in a parallel processing environment.
A Portrait of Ribosomal DNA Contacts with Hi-C Reveals 5S and 45S rDNA Anchoring Points in the Folded Human Genome.

Science.gov (United States)

Yu, Shoukai; Lemos, Bernardo

2016-12-31

Ribosomal RNAs (rRNAs) account for >60% of all RNAs in eukaryotic cells and are encoded in the ribosomal DNA (rDNA) arrays. The rRNAs are produced from two sets of loci: the 5S rDNA array resides exclusively on human chromosome 1, whereas the 45S rDNA array resides on the short arm of five human acrocentric chromosomes. The 45S rDNA gives origin to the nucleolus, the nuclear organelle that is the site of ribosome biogenesis. Intriguingly, 5S and 45S rDNA arrays exhibit correlated copy number variation in lymphoblastoid cells (LCLs). Here we examined the genomic architecture and repeat content of the 5S and 45S rDNA arrays in multiple human genome assemblies (including PacBio MHAP assembly) and ascertained contacts between the rDNA arrays and the rest of the genome using Hi-C datasets from two human cell lines (erythroleukemia K562 and lymphoblastoid cells). Our analyses revealed that 5S and 45S arrays each have thousands of contacts in the folded genome, with rDNA-associated regions and genes dispersed across all chromosomes. The rDNA contact map displayed conserved and disparate features between two cell lines, and pointed to specific chromosomes, genomic regions, and genes with evidence of spatial proximity to the rDNA arrays; the data also showed a lack of direct physical interaction between the 5S and 45S rDNA arrays. Finally, the analysis identified an intriguing organization in the 5S array with Alu and 5S elements adjacent to one another and organized in opposite orientation along the array. Portraits of genome folding centered on the ribosomal DNA array could help understand the emergence of concerted variation, the control of 5S and 45S expression, as well as provide insights into an organelle that contributes to the spatial localization of human chromosomes during interphase. © The Author(s) 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Sequencing intractable DNA to close microbial genomes.

Directory of Open Access Journals (Sweden)

Richard A Hurt

Full Text Available Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled "intractable" resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such problematic regions in the "non-contiguous finished" Desulfovibrio desulfuricans ND132 genome (6 intractable gaps and the Desulfovibrio africanus genome (1 intractable gap. The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. The developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Sequencing Intractable DNA to Close Microbial Genomes

Energy Technology Data Exchange (ETDEWEB)

Hurt, Jr., Richard Ashley [ORNL; Brown, Steven D [ORNL; Podar, Mircea [ORNL; Palumbo, Anthony Vito [ORNL; Elias, Dwayne A [ORNL

2012-01-01

Advancement in high throughput DNA sequencing technologies has supported a rapid proliferation of microbial genome sequencing projects, providing the genetic blueprint for for in-depth studies. Oftentimes, difficult to sequence regions in microbial genomes are ruled intractable resulting in a growing number of genomes with sequence gaps deposited in databases. A procedure was developed to sequence such difficult regions in the non-contiguous finished Desulfovibrio desulfuricans ND132 genome (6 intractable gaps) and the Desulfovibrio africanus genome (1 intractable gap). The polynucleotides surrounding each gap formed GC rich secondary structures making the regions refractory to amplification and sequencing. Strand-displacing DNA polymerases used in concert with a novel ramped PCR extension cycle supported amplification and closure of all gap regions in both genomes. These developed procedures support accurate gene annotation, and provide a step-wise method that reduces the effort required for genome finishing.
Human-specific HERV-K insertion causes genomic variations in the human genome.

Directory of Open Access Journals (Sweden)

Wonseok Shin

Full Text Available Human endogenous retroviruses (HERV sequences account for about 8% of the human genome. Through comparative genomics and literature mining, we identified a total of 29 human-specific HERV-K insertions. We characterized them focusing on their structure and flanking sequence. The results showed that four of the human-specific HERV-K insertions deleted human genomic sequences via non-classical insertion mechanisms. Interestingly, two of the human-specific HERV-K insertion loci contained two HERV-K internals and three LTR elements, a pattern which could be explained by LTR-LTR ectopic recombination or template switching. In addition, we conducted a polymorphic test and observed that twelve out of the 29 elements are polymorphic in the human population. In conclusion, human-specific HERV-K elements have inserted into human genome since the divergence of human and chimpanzee, causing human genomic changes. Thus, we believe that human-specific HERV-K activity has contributed to the genomic divergence between humans and chimpanzees, as well as within the human population.
A CTAB Procedure Of Total Genomic DNA Extraction For Medicinal Mushrooms

International Nuclear Information System (INIS)

Azhar Mohamad; Muhammad Hussaini Mohd Mustafa; Muhammad Hanif Azhari Noor; Rosnani Abdul Rashid; Hasan Hamdani Hasan Mutaat; Meswan Meskom; Mat Rasol Awang

2014-01-01

Medicinal mushroom is defined as mushrooms used in medicine or medical research. Isolation of intact, high-molecular-mass genomic DNA is essential for many molecular biology applications including Polymerase Chain Reaction (PCR), endonuclease restriction digestion, Southern blot analysis, and genomic library construction. The most important and prerequisite towards reliable molecular biology work is the total genomic DNA of a sample must be in good quality. Five freshly samples of medicinal mushroom were used in this work known as Auriculariapolytricha, Lentinus edode, Pleurotus sayorcaju, Sczhizopyllum commune and Ganodermalucidum. 5 mg of each sample were used to extraction the DNA, prepared in 3 replications and repeated twice. PCR based technique by using ISSR markers were used in checking the amplification ability of the total genomic extraction. A standard Doyle and Doyle protocol for genomic DNA extraction was modified in optimizing the total genomic DNA from the medicinal mushroom.The modification parameters were percentage of CTAB, incubation period and temperature. The results reveal that each sample required a certain combinations of time and period of incubation. Besides, percentage of CTAB in the buffer was found significant in giving a high yielding of extracted total genomic DNA. The extracted total genomic DNA from the medicinal mushroom yielded from 39.7 ng/ μl to 919.1 ng/ μl. The different yield among the samples found to be corresponded to polysaccharide content in the medicinal mushrooms. The objective of this works is to optimize total genomic DNA extraction of medicinal mushrooms towards a high quality intact genomic DNA for molecular activities. (author)
Somatic DNA recombination yielding circular DNA and deletion of a genomic region in embryonic brain

International Nuclear Information System (INIS)

Maeda, Toyoki; Chijiiwa, Yoshiharu; Tsuji, Hideo; Sakoda, Saburo; Tani, Kenzaburo; Suzuki, Tomokazu

2004-01-01

In this study, a mouse genomic region is identified that undergoes DNA rearrangement and yields circular DNA in brain during embryogenesis. External region-directed inverse polymerase chain reaction on circular DNA extracted from late embryonic brain tissue repeatedly detected DNA of this region containing recombination joints. Wide-range genomic PCR and digestion-circularization PCR analysis showed this region underwent recombination accompanied with deletion of intervening sequences, including the circularized regions. This region was mapped by fluorescence in situ hybridization to C1 on mouse chromosome 16, where no gene and no physiological DNA rearrangement had been identified. DNA sequence in the region has segmental homology to an orthologous region on human chromosome 3q.13. These observations demonstrated somatic DNA recombination yielding genomic deletions in brain during embryogenesis
Differential DNA Methylation Analysis without a Reference Genome

Directory of Open Access Journals (Sweden)

Johanna Klughammer

2015-12-01

Full Text Available Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS, which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish. Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org. The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome.
Defining functional DNA elements in the human genome

Science.gov (United States)

Kellis, Manolis; Wold, Barbara; Snyder, Michael P.; Bernstein, Bradley E.; Kundaje, Anshul; Marinov, Georgi K.; Ward, Lucas D.; Birney, Ewan; Crawford, Gregory E.; Dekker, Job; Dunham, Ian; Elnitski, Laura L.; Farnham, Peggy J.; Feingold, Elise A.; Gerstein, Mark; Giddings, Morgan C.; Gilbert, David M.; Gingeras, Thomas R.; Green, Eric D.; Guigo, Roderic; Hubbard, Tim; Kent, Jim; Lieb, Jason D.; Myers, Richard M.; Pazin, Michael J.; Ren, Bing; Stamatoyannopoulos, John A.; Weng, Zhiping; White, Kevin P.; Hardison, Ross C.

2014-01-01

With the completion of the human genome sequence, attention turned to identifying and annotating its functional DNA elements. As a complement to genetic and comparative genomics approaches, the Encyclopedia of DNA Elements Project was launched to contribute maps of RNA transcripts, transcriptional regulator binding sites, and chromatin states in many cell types. The resulting genome-wide data reveal sites of biochemical activity with high positional resolution and cell type specificity that facilitate studies of gene regulation and interpretation of noncoding variants associated with human disease. However, the biochemically active regions cover a much larger fraction of the genome than do evolutionarily conserved regions, raising the question of whether nonconserved but biochemically active regions are truly functional. Here, we review the strengths and limitations of biochemical, evolutionary, and genetic approaches for defining functional DNA segments, potential sources for the observed differences in estimated genomic coverage, and the biological implications of these discrepancies. We also analyze the relationship between signal intensity, genomic coverage, and evolutionary conservation. Our results reinforce the principle that each approach provides complementary information and that we need to use combinations of all three to elucidate genome function in human biology and disease. PMID:24753594
DNA Repair and Genome Maintenance in Bacillus subtilis

Science.gov (United States)

Lenhart, Justin S.; Schroeder, Jeremy W.; Walsh, Brian W.

2012-01-01

Summary: From microbes to multicellular eukaryotic organisms, all cells contain pathways responsible for genome maintenance. DNA replication allows for the faithful duplication of the genome, whereas DNA repair pathways preserve DNA integrity in response to damage originating from endogenous and exogenous sources. The basic pathways important for DNA replication and repair are often conserved throughout biology. In bacteria, high-fidelity repair is balanced with low-fidelity repair and mutagenesis. Such a balance is important for maintaining viability while providing an opportunity for the advantageous selection of mutations when faced with a changing environment. Over the last decade, studies of DNA repair pathways in bacteria have demonstrated considerable differences between Gram-positive and Gram-negative organisms. Here we review and discuss the DNA repair, genome maintenance, and DNA damage checkpoint pathways of the Gram-positive bacterium Bacillus subtilis. We present their molecular mechanisms and compare the functions and regulation of several pathways with known information on other organisms. We also discuss DNA repair during different growth phases and the developmental program of sporulation. In summary, we present a review of the function, regulation, and molecular mechanisms of DNA repair and mutagenesis in Gram-positive bacteria, with a strong emphasis on B. subtilis. PMID:22933559
Specific and selective target detection of supra-genome 21 Mers Salmonella via silicon nanowires biosensor

Science.gov (United States)

Mustafa, Mohammad Razif Bin; Dhahi, Th S.; Ehfaed, Nuri. A. K. H.; Adam, Tijjani; Hashim, U.; Azizah, N.; Mohammed, Mohammed; Noriman, N. Z.

2017-09-01

The nano structure based on silicon can be surface modified to be used as label-free biosensors that allow real-time measurements. The silicon nanowire surface was functionalized using 3-aminopropyltrimethoxysilane (APTES), which functions as a facilitator to immobilize biomolecules on the silicon nanowire surface. The process is simple, economical; this will pave the way for point-of-care applications. However, the surface modification and subsequent detection mechanism still not clear. Thus, study proposed step by step process of silicon nano surface modification and its possible in specific and selective target detection of Supra-genome 21 Mers Salmonella. The device captured the molecule with precisely; the approach took the advantages of strong binding chemistry created between APTES and biomolecule. The results indicated how modifications of the nanowires provide sensing capability with strong surface chemistries that can lead to specific and selective target detection.
Generation of NSE-MerCreMer transgenic mice with tamoxifen inducible Cre activity in neurons.

Directory of Open Access Journals (Sweden)

Mandy Ka Man Kam

Full Text Available To establish a genetic tool for conditional deletion or expression of gene in neurons in a temporally controlled manner, we generated a transgenic mouse (NSE-MerCreMer, which expressed a tamoxifen inducible type of Cre recombinase specifically in neurons. The tamoxifen inducible Cre recombinase (MerCreMer is a fusion protein containing Cre recombinase with two modified estrogen receptor ligand binding domains at both ends, and is driven by the neural-specific rat neural specific enolase (NSE promoter. A total of two transgenic lines were established, and expression of MerCreMer in neurons of the central and enteric nervous systems was confirmed. Transcript of MerCreMer was detected in several non-neural tissues such as heart, liver, and kidney in these lines. In the background of the Cre reporter mouse strain Rosa26R, Cre recombinase activity was inducible in neurons of adult NSE-MerCreMer mice treated with tamoxifen by intragastric gavage, but not in those fed with corn oil only. We conclude that NSE-MerCreMer lines will be useful for studying gene functions in neurons for the conditions that Cre-mediated recombination resulting in embryonic lethality, which precludes investigation of gene functions in neurons through later stages of development and in adult.
Direct detection of chicken genomic DNA for gender determination by thymine-DNA glycosylase.

Science.gov (United States)

Porat, N; Bogdanov, K; Danielli, A; Arie, A; Samina, I; Hadani, A

2011-02-01

1. Birds, especially nestlings, are generally difficult to sex by morphology and early detection of chick gender in ovo in the hatchery would facilitate removal of unwanted chicks and diminish welfare objections regarding culling after hatch. 2. We describe a method to determine chicken gender without the need for PCR via use of Thymine-DNA Glycosylase (TDG). TDG restores thymine (T)/guanine (G) mismatches to cytosine (C)/G. We show here, that like DNA Polymerase, TDG can recognise, bind and function on a primer hybridised to chicken genomic DNA. 3. The primer contained a T to mismatch a G in a chicken genomic template and the T/G was cleaved with high fidelity by TDG. Thus, the chicken genomic DNA can be identified without PCR amplification via direct and linear detection. Sensitivity was increased using gender specific sequences from the chicken genome. 4. Currently, these are laboratory results, but we anticipate that further development will allow this method to be used in non-laboratory settings, where PCR cannot be employed.
mer-Triammine trifluorido vanadium(III), mer-[VF{sub 3}(NH{sub 3}){sub 3}]. Synthesis and crystal structure

Energy Technology Data Exchange (ETDEWEB)

Woidy, Patrick [Technische Univ. Muenchen, Garching (Germany). Zentrale Technisch-Wissenschaftliche Betriebseinheit Radiochemie Muenchen RCM; Kraus, Florian [Marburg Univ. (Germany). Anorganische und Fluorchemie

2015-07-01

Vanadium trifluoride reacts with dry liquid ammonia under the formation of lilac plate-shaped crystals of mer-triammine trifluorido vanadium(III) (1), mer-[VF{sub 3}(NH{sub 3}){sub 3}]. Single-crystal X-ray analysis was carried out at low temperature to elucidate the structure. The compound crystallizes in the monoclinic space group P2{sup 1}/c with a = 5.7284(4), b = 9.2033(5), c = 10.5271(6) Aa, beta = 91.795(6)°, and V = 554.72(6) Aa{sup 3} at 123 K with Z = 4. The discrete [VF{sub 3}(NH{sub 3}){sub 3}] molecules are interconnected by hydrogen bonds.
Selective DNA-Mediated Assembly of Gold Nanoparticles on Electroded Substrates

Science.gov (United States)

2008-06-01

might use the Watson - Crick base-pairing of DNA as a means for ultrahigh-precision engineering is well- known.5,6 The idea is to use the highly specific...Selective DNA -Mediated Assembly of Gold Nanoparticles on Electroded Substrates K. E. Sapsford,†,‡,∇ D. Park,§ E. R. Goldman,‡ E. E. Foos,| S. A...electrodes via DNA hybridization. Protocols are demonstrated for maximizing selectivity and coverage using 15mers as the active binding agents. Detailed
Genomic DNA extraction from sapwood of Pinus roxburghii for ...

African Journals Online (AJOL)

Ashish

2013-02-22

Feb 22, 2013 ... A method for extraction of genomic DNA from sapwood tissues of mature tall trees of Pinus roxburghii, .... DNA as a template. PCR was performed on a thermal cycler. (Biorad, Mycycler) incorporating 10 ng genomic DNA to a 25 µl reaction mix containing 1X Taq buffer, 3 mM MgCl2, 0.2 mM each of dNTPs ...
DNA Extraction Protocols for Whole-Genome Sequencing in Marine Organisms.

Science.gov (United States)

Panova, Marina; Aronsson, Henrik; Cameron, R Andrew; Dahl, Peter; Godhe, Anna; Lind, Ulrika; Ortega-Martinez, Olga; Pereyra, Ricardo; Tesson, Sylvie V M; Wrange, Anna-Lisa; Blomberg, Anders; Johannesson, Kerstin

2016-01-01

The marine environment harbors a large proportion of the total biodiversity on this planet, including the majority of the earths' different phyla and classes. Studying the genomes of marine organisms can bring interesting insights into genome evolution. Today, almost all marine organismal groups are understudied with respect to their genomes. One potential reason is that extraction of high-quality DNA in sufficient amounts is challenging for many marine species. This is due to high polysaccharide content, polyphenols and other secondary metabolites that will inhibit downstream DNA library preparations. Consequently, protocols developed for vertebrates and plants do not always perform well for invertebrates and algae. In addition, many marine species have large population sizes and, as a consequence, highly variable genomes. Thus, to facilitate the sequence read assembly process during genome sequencing, it is desirable to obtain enough DNA from a single individual, which is a challenge in many species of invertebrates and algae. Here, we present DNA extraction protocols for seven marine species (four invertebrates, two algae, and a marine yeast), optimized to provide sufficient DNA quality and yield for de novo genome sequencing projects.
Rapid DNA extraction of bacterial genome using laundry detergents ...

African Journals Online (AJOL)

Genomic DNA extraction from bacterial cells involves processes normally performed in most biological laboratories. Therefore, various methods have been offered, manually and kit, but these methods may be time consuming and costly. In this paper, genomic DNA extraction of Pseudomonas aeruginosa was investigated ...

Rapid DNA extraction of bacterial genome using laundry detergents ...

African Journals Online (AJOL)

Yomi

2012-01-03

Jan 3, 2012 ... Genomic DNA extraction from bacterial cells involves processes normally performed in most biological laboratories. Therefore, various methods have been offered, manually and kit, but these methods may be time consuming and costly. In this paper, genomic DNA extraction of Pseudomonas aeruginosa ...
De Novo Assembly of Complete Chloroplast Genomes from Non-model Species Based on a K-mer Frequency-Based Selection of Chloroplast Reads from total DNA Sequences.

NARCIS (Netherlands)

Izan, Shairul; Esselink, G.; Visser, R.G.F.; Smulders, M.J.M.; Borm, T.J.A.

2017-01-01

Whole Genome Shotgun (WGS) sequences of plant species often contain an abundance of reads that are derived from the chloroplast genome. Up to now these reads have generally been identified and assembled into chloroplast genomes based on homology to chloroplasts from related species. This
SWAP-Assembler 2: Optimization of De Novo Genome Assembler at Large Scale

Energy Technology Data Exchange (ETDEWEB)

Meng, Jintao; Seo, Sangmin; Balaji, Pavan; Wei, Yanjie; Wang, Bingqiang; Feng, Shengzhong

2016-08-16

In this paper, we analyze and optimize the most time-consuming steps of the SWAP-Assembler, a parallel genome assembler, so that it can scale to a large number of cores for huge genomes with the size of sequencing data ranging from terabyes to petabytes. According to the performance analysis results, the most time-consuming steps are input parallelization, k-mer graph construction, and graph simplification (edge merging). For the input parallelization, the input data is divided into virtual fragments with nearly equal size, and the start position and end position of each fragment are automatically separated at the beginning of the reads. In k-mer graph construction, in order to improve the communication efficiency, the message size is kept constant between any two processes by proportionally increasing the number of nucleotides to the number of processes in the input parallelization step for each round. The memory usage is also decreased because only a small part of the input data is processed in each round. With graph simplification, the communication protocol reduces the number of communication loops from four to two loops and decreases the idle communication time. The optimized assembler is denoted as SWAP-Assembler 2 (SWAP2). In our experiments using a 1000 Genomes project dataset of 4 terabytes (the largest dataset ever used for assembling) on the supercomputer Mira, the results show that SWAP2 scales to 131,072 cores with an efficiency of 40%. We also compared our work with both the HipMER assembler and the SWAP-Assembler. On the Yanhuang dataset of 300 gigabytes, SWAP2 shows a 3X speedup and 4X better scalability compared with the HipMer assembler and is 45 times faster than the SWAP-Assembler. The SWAP2 software is available at https://sourceforge.net/projects/swapassembler.
Sequence analysis of the genome of carnation (Dianthus caryophyllus L.).

Science.gov (United States)

Yagi, Masafumi; Kosugi, Shunichi; Hirakawa, Hideki; Ohmiya, Akemi; Tanase, Koji; Harada, Taro; Kishimoto, Kyutaro; Nakayama, Masayoshi; Ichimura, Kazuo; Onozaki, Takashi; Yamaguchi, Hiroyasu; Sasaki, Nobuhiro; Miyahara, Taira; Nishizaki, Yuzo; Ozeki, Yoshihiro; Nakamura, Noriko; Suzuki, Takamasa; Tanaka, Yoshikazu; Sato, Shusei; Shirasawa, Kenta; Isobe, Sachiko; Miyamura, Yoshinori; Watanabe, Akiko; Nakayama, Shinobu; Kishida, Yoshie; Kohara, Mitsuyo; Tabata, Satoshi

2014-06-01

The whole-genome sequence of carnation (Dianthus caryophyllus L.) cv. 'Francesco' was determined using a combination of different new-generation multiplex sequencing platforms. The total length of the non-redundant sequences was 568,887,315 bp, consisting of 45,088 scaffolds, which covered 91% of the 622 Mb carnation genome estimated by k-mer analysis. The N50 values of contigs and scaffolds were 16,644 bp and 60,737 bp, respectively, and the longest scaffold was 1,287,144 bp. The average GC content of the contig sequences was 36%. A total of 1050, 13, 92 and 143 genes for tRNAs, rRNAs, snoRNA and miRNA, respectively, were identified in the assembled genomic sequences. For protein-encoding genes, 43 266 complete and partial gene structures excluding those in transposable elements were deduced. Gene coverage was ∼ 98%, as deduced from the coverage of the core eukaryotic genes. Intensive characterization of the assigned carnation genes and comparison with those of other plant species revealed characteristic features of the carnation genome. The results of this study will serve as a valuable resource for fundamental and applied research of carnation, especially for breeding new carnation varieties. Further information on the genomic sequences is available at http://carnation.kazusa.or.jp. © The Author 2013. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.
Comparative analysis of mitochondrial genomes between a wheat K-type cytoplasmic male sterility (CMS) line and its maintainer line.

Science.gov (United States)

Liu, Huitao; Cui, Peng; Zhan, Kehui; Lin, Qiang; Zhuo, Guoyin; Guo, Xiaoli; Ding, Feng; Yang, Wenlong; Liu, Dongcheng; Hu, Songnian; Yu, Jun; Zhang, Aimin

2011-03-29

Plant mitochondria, semiautonomous organelles that function as manufacturers of cellular ATP, have their own genome that has a slow rate of evolution and rapid rearrangement. Cytoplasmic male sterility (CMS), a common phenotype in higher plants, is closely associated with rearrangements in mitochondrial DNA (mtDNA), and is widely used to produce F1 hybrid seeds in a variety of valuable crop species. Novel chimeric genes deduced from mtDNA rearrangements causing CMS have been identified in several plants, such as rice, sunflower, pepper, and rapeseed, but there are very few reports about mtDNA rearrangements in wheat. In the present work, we describe the mitochondrial genome of a wheat K-type CMS line and compare it with its maintainer line. The complete mtDNA sequence of a wheat K-type (with cytoplasm of Aegilops kotschyi) CMS line, Ks3, was assembled into a master circle (MC) molecule of 647,559 bp and found to harbor 34 known protein-coding genes, three rRNAs (18 S, 26 S, and 5 S rRNAs), and 16 different tRNAs. Compared to our previously published sequence of a K-type maintainer line, Km3, we detected Ks3-specific mtDNA (> 100 bp, 11.38%) and repeats (> 100 bp, 29 units) as well as genes that are unique to each line: rpl5 was missing in Ks3 and trnH was absent from Km3. We also defined 32 single nucleotide polymorphisms (SNPs) in 13 protein-coding, albeit functionally irrelevant, genes, and predicted 22 unique ORFs in Ks3, representing potential candidates for K-type CMS. All these sequence variations are candidates for involvement in CMS. A comparative analysis of the mtDNA of several angiosperms, including those from Ks3, Km3, rice, maize, Arabidopsis thaliana, and rapeseed, showed that non-coding sequences of higher plants had mostly divergent multiple reorganizations during the mtDNA evolution of higher plants. The complete mitochondrial genome of the wheat K-type CMS line Ks3 is very different from that of its maintainer line Km3, especially in non
Comparative analysis of mitochondrial genomes between a wheat K-type cytoplasmic male sterility (CMS line and its maintainer line

Directory of Open Access Journals (Sweden)

Liu Dongcheng

2011-03-01

Full Text Available Abstract Background Plant mitochondria, semiautonomous organelles that function as manufacturers of cellular ATP, have their own genome that has a slow rate of evolution and rapid rearrangement. Cytoplasmic male sterility (CMS, a common phenotype in higher plants, is closely associated with rearrangements in mitochondrial DNA (mtDNA, and is widely used to produce F1 hybrid seeds in a variety of valuable crop species. Novel chimeric genes deduced from mtDNA rearrangements causing CMS have been identified in several plants, such as rice, sunflower, pepper, and rapeseed, but there are very few reports about mtDNA rearrangements in wheat. In the present work, we describe the mitochondrial genome of a wheat K-type CMS line and compare it with its maintainer line. Results The complete mtDNA sequence of a wheat K-type (with cytoplasm of Aegilops kotschyi CMS line, Ks3, was assembled into a master circle (MC molecule of 647,559 bp and found to harbor 34 known protein-coding genes, three rRNAs (18 S, 26 S, and 5 S rRNAs, and 16 different tRNAs. Compared to our previously published sequence of a K-type maintainer line, Km3, we detected Ks3-specific mtDNA (> 100 bp, 11.38% and repeats (> 100 bp, 29 units as well as genes that are unique to each line: rpl5 was missing in Ks3 and trnH was absent from Km3. We also defined 32 single nucleotide polymorphisms (SNPs in 13 protein-coding, albeit functionally irrelevant, genes, and predicted 22 unique ORFs in Ks3, representing potential candidates for K-type CMS. All these sequence variations are candidates for involvement in CMS. A comparative analysis of the mtDNA of several angiosperms, including those from Ks3, Km3, rice, maize, Arabidopsis thaliana, and rapeseed, showed that non-coding sequences of higher plants had mostly divergent multiple reorganizations during the mtDNA evolution of higher plants. Conclusion The complete mitochondrial genome of the wheat K-type CMS line Ks3 is very different from that of
Cloning of the cDNA for U1 small nuclear ribonucleoprotein particle 70K protein from Arabidopsis thaliana

Science.gov (United States)

Reddy, A. S.; Czernik, A. J.; An, G.; Poovaiah, B. W.

1992-01-01

We cloned and sequenced a plant cDNA that encodes U1 small nuclear ribonucleoprotein (snRNP) 70K protein. The plant U1 snRNP 70K protein cDNA is not full length and lacks the coding region for 68 amino acids in the amino-terminal region as compared to human U1 snRNP 70K protein. Comparison of the deduced amino acid sequence of the plant U1 snRNP 70K protein with the amino acid sequence of animal and yeast U1 snRNP 70K protein showed a high degree of homology. The plant U1 snRNP 70K protein is more closely related to the human counter part than to the yeast 70K protein. The carboxy-terminal half is less well conserved but, like the vertebrate 70K proteins, is rich in charged amino acids. Northern analysis with the RNA isolated from different parts of the plant indicates that the snRNP 70K gene is expressed in all of the parts tested. Southern blotting of genomic DNA using the cDNA indicates that the U1 snRNP 70K protein is coded by a single gene.
The Genome 10K Project: a way forward.

Science.gov (United States)

Koepfli, Klaus-Peter; Paten, Benedict; O'Brien, Stephen J

2015-01-01

The Genome 10K Project was established in 2009 by a consortium of biologists and genome scientists determined to facilitate the sequencing and analysis of the complete genomes of 10,000 vertebrate species. Since then the number of selected and initiated species has risen from ∼26 to 277 sequenced or ongoing with funding, an approximately tenfold increase in five years. Here we summarize the advances and commitments that have occurred by mid-2014 and outline the achievements and present challenges of reaching the 10,000-species goal. We summarize the status of known vertebrate genome projects, recommend standards for pronouncing a genome as sequenced or completed, and provide our present and future vision of the landscape of Genome 10K. The endeavor is ambitious, bold, expensive, and uncertain, but together the Genome 10K Consortium of Scientists and the worldwide genomics community are moving toward their goal of delivering to the coming generation the gift of genome empowerment for many vertebrate species.
Characterization of Camptothecin-induced Genomic Changes in the Camptothecin-resistant T-ALL-derived Cell Line CPT-K5

DEFF Research Database (Denmark)

Kjeldsen, Eigil; Nielsen, Christine J F; Roy, Amit

2018-01-01

-K5 and its parental cell line. We identified copy number alterations affecting genes important for maintaining genome integrity and reducing CPT-induced DNA damage. We show for the first time that short tandem repeats are targets for TOP1 cleavage, that can be differentially stimulated by CPT.......Acquisition of resistance to topoisomerase I (TOP1)-targeting camptothecin (CPT) derivatives is a major clinical problem. Little is known about the underlying chromosomal and genomic mechanisms. We characterized the CPT-K5 cell line expressing mutant CPT-resistant TOP1 and its parental T......-cell derived acute lymphoblastic leukemia CPT-sensitive RPMI-8402 cell line by karyotyping and molecular genetic methods, including subtractive oligo-based array comparative genomic hybridization (soaCGH) analysis. Karyotyping revealed that CPT-K5 cells had acquired additional structural aberrations...
Methyl-Analyzer--whole genome DNA methylation profiling.

Science.gov (United States)

Xin, Yurong; Ge, Yongchao; Haghighi, Fatemeh G

2011-08-15

Methyl-Analyzer is a python package that analyzes genome-wide DNA methylation data produced by the Methyl-MAPS (methylation mapping analysis by paired-end sequencing) method. Methyl-MAPS is an enzymatic-based method that uses both methylation-sensitive and -dependent enzymes covering >80% of CpG dinucleotides within mammalian genomes. It combines enzymatic-based approaches with high-throughput next-generation sequencing technology to provide whole genome DNA methylation profiles. Methyl-Analyzer processes and integrates sequencing reads from methylated and unmethylated compartments and estimates CpG methylation probabilities at single base resolution. Methyl-Analyzer is available at http://github.com/epigenomics/methylmaps. Sample dataset is available for download at http://epigenomicspub.columbia.edu/methylanalyzer_data.html. fgh3@columbia.edu Supplementary data are available at Bioinformatics online.
A compact, in vivo screen of all 6-mers reveals drivers of tissue-specific expression and guides synthetic regulatory element design.

Science.gov (United States)

Smith, Robin P; Riesenfeld, Samantha J; Holloway, Alisha K; Li, Qiang; Murphy, Karl K; Feliciano, Natalie M; Orecchia, Lorenzo; Oksenberg, Nir; Pollard, Katherine S; Ahituv, Nadav

2013-07-18

Large-scale annotation efforts have improved our ability to coarsely predict regulatory elements throughout vertebrate genomes. However, it is unclear how complex spatiotemporal patterns of gene expression driven by these elements emerge from the activity of short, transcription factor binding sequences. We describe a comprehensive promoter extension assay in which the regulatory potential of all 6 base-pair (bp) sequences was tested in the context of a minimal promoter. To enable this large-scale screen, we developed algorithms that use a reverse-complement aware decomposition of the de Bruijn graph to design a library of DNA oligomers incorporating every 6-bp sequence exactly once. Our library multiplexes all 4,096 unique 6-mers into 184 double-stranded 15-bp oligomers, which is sufficiently compact for in vivo testing. We injected each multiplexed construct into zebrafish embryos and scored GFP expression in 15 tissues at two developmental time points. Twenty-seven constructs produced consistent expression patterns, with the majority doing so in only one tissue. Functional sequences are enriched near biologically relevant genes, match motifs for developmental transcription factors, and are required for enhancer activity. By concatenating tissue-specific functional sequences, we generated completely synthetic enhancers for the notochord, epidermis, spinal cord, forebrain and otic lateral line, and show that short regulatory sequences do not always function modularly. This work introduces a unique in vivo catalog of short, functional regulatory sequences and demonstrates several important principles of regulatory element organization. Furthermore, we provide resources for designing compact, reverse-complement aware k-mer libraries.
Draft genome of the lined seahorse, Hippocampus erectus.

Science.gov (United States)

Lin, Qiang; Qiu, Ying; Gu, Ruobo; Xu, Meng; Li, Jia; Bian, Chao; Zhang, Huixian; Qin, Geng; Zhang, Yanhong; Luo, Wei; Chen, Jieming; You, Xinxin; Fan, Mingjun; Sun, Min; Xu, Pao; Venkatesh, Byrappa; Xu, Junming; Fu, Hongtuo; Shi, Qiong

2017-06-01

The lined seahorse, Hippocampus erectus , is an Atlantic species and mainly inhabits shallow sea beds or coral reefs. It has become very popular in China for its wide use in traditional Chinese medicine. In order to improve the aquaculture yield of this valuable fish species, we are trying to develop genomic resources for assistant selection in genetic breeding. Here, we provide whole genome sequencing, assembly, and gene annotation of the lined seahorse, which can enrich genome resource and further application for its molecular breeding. A total of 174.6 Gb (Gigabase) raw DNA sequences were generated by the Illumina Hiseq2500 platform. The final assembly of the lined seahorse genome is around 458 Mb, representing 94% of the estimated genome size (489 Mb by k-mer analysis). The contig N50 and scaffold N50 reached 14.57 kb and 1.97 Mb, respectively. Quality of the assembled genome was assessed by BUSCO with prediction of 85% of the known vertebrate genes and evaluated using the de novo assembled RNA-seq transcripts to prove a high mapping ratio (more than 99% transcripts could be mapped to the assembly). Using homology-based, de novo and transcriptome-based prediction methods, we predicted 20 788 protein-coding genes in the generated assembly, which is less than our previously reported gene number (23 458) of the tiger tail seahorse ( H. comes ). We report a draft genome of the lined seahorse. These generated genomic data are going to enrich genome resource of this economically important fish, and also provide insights into the genetic mechanisms of its iconic morphology and male pregnancy behavior. © The Authors 2017. Published by Oxford University Press.
Whole genome DNA methylation: beyond genes silencing

OpenAIRE

Tirado-Magallanes, Roberto; Rebbani, Khadija; Lim, Ricky; Pradhan, Sriharsa; Benoukraf, Touati

2016-01-01

The combination of DNA bisulfite treatment with high-throughput sequencing technologies has enabled investigation of genome-wide DNA methylation at near base pair level resolution, far beyond that of the kilobase-long canonical CpG islands that initially revealed the biological relevance of this covalent DNA modification. The latest high-resolution studies have revealed a role for very punctual DNA methylation in chromatin plasticity, gene regulation and splicing. Here, we aim to outline the ...
Reciprocal Regulation between DNA-PKcs and Snail1 Conferring Genomic Instability

International Nuclear Information System (INIS)

Seo, Haeng Ran; Lee, Hae June; Jin, Yeung Bae; Bae, Sang Woo; Lee, Yun Sil; Kim, Nam Hee; Kim, Hyun Sil; Nam, Hyung Wook; Yook, Jong In

2010-01-01

Although the roles of DNA-dependent protein kinase catalytic subunit (DNA-PKcs) involving non-homologous end joining (NHEJ) of DNA repair are well recognized, the biological mechanisms and regulators by which DNA-PKcs regulate genomic instability are not clearly defined. We show herein that DNA-PKcs activity resulting from DNA damage caused by ionizing radiation (IR) phosphorylates Snail1 at serine 100, which results in increased Snail1 expression and its function by inhibition of GSK-3-mediated phosphorylation. Furthermore, Snail1 phosphorylated at serine 100 can reciprocally inhibit kinase activity of DNA-PKcs, resulting in an inhibition to recruit DNA-PKcs or Ku70/80 to a DNA double-strand break site, and ultimately inhibition of DNA repair activity. The impairment of repair activity by a direct interaction between Snail1 and DNA-PKcs increases the resistance to DNA damaging agents, such as IR, and genomic instability. Our findings provide a novel cellular mechanism for induction of genomic instability by reciprocal regulation of DNA-PKcs and Snail1
Genome-wide DNA methylation maps in follicular lymphoma cells determined by methylation-enriched bisulfite sequencing.

Directory of Open Access Journals (Sweden)

Jeong-Hyeon Choi

Full Text Available BACKGROUND: Follicular lymphoma (FL is a form of non-Hodgkin's lymphoma (NHL that arises from germinal center (GC B-cells. Despite the significant advances in immunotherapy, FL is still not curable. Beyond transcriptional profiling and genomics datasets, there currently is no epigenome-scale dataset or integrative biology approach that can adequately model this disease and therefore identify novel mechanisms and targets for successful prevention and treatment of FL. METHODOLOGY/PRINCIPAL FINDINGS: We performed methylation-enriched genome-wide bisulfite sequencing of FL cells and normal CD19(+ B-cells using 454 sequencing technology. The methylated DNA fragments were enriched with methyl-binding proteins, treated with bisulfite, and sequenced using the Roche-454 GS FLX sequencer. The total number of bases covered in the human genome was 18.2 and 49.3 million including 726,003 and 1.3 million CpGs in FL and CD19(+ B-cells, respectively. 11,971 and 7,882 methylated regions of interest (MRIs were identified respectively. The genome-wide distribution of these MRIs displayed significant differences between FL and normal B-cells. A reverse trend in the distribution of MRIs between the promoter and the gene body was observed in FL and CD19(+ B-cells. The MRIs identified in FL cells also correlated well with transcriptomic data and ChIP-on-Chip analyses of genome-wide histone modifications such as tri-methyl-H3K27, and tri-methyl-H3K4, indicating a concerted epigenetic alteration in FL cells. CONCLUSIONS/SIGNIFICANCE: This study is the first to provide a large scale and comprehensive analysis of the DNA methylation sequence composition and distribution in the FL epigenome. These integrated approaches have led to the discovery of novel and frequent targets of aberrant epigenetic alterations. The genome-wide bisulfite sequencing approach developed here can be a useful tool for profiling DNA methylation in clinical samples.
A passive physical model for DnaK chaperoning

Science.gov (United States)

Uhl, Lionel; Dumont, Audrey; Dukan, Sam

2018-03-01

Almost all living organisms use protein chaperones with a view to preventing proteins from misfolding or aggregation either spontaneously or during cellular stress. This work uses a reaction-diffusion stochastic model to describe the dynamic localization of the Hsp70 chaperone DnaK in Escherichia coli cells during transient proteotoxic collapse characterized by the accumulation of insoluble proteins. In the model, misfolded (‘abnormal’) proteins are produced during alcoholic stress and have the propensity to aggregate with a polymerization-like kinetics. When aggregates diffuse more slowly they grow larger. According to Michaelis-Menten-type kinetics, DnaK has the propensity to bind with misfolded proteins or aggregates in order to catalyse refolding. To match experimental fluorescence microscopy data showing clusters of DnaK-GFP localized in multiple foci, the model includes spatial zones with local reduced diffusion rates to generate spontaneous assemblies of DnaK called ‘foci’. Numerical simulations of our model succeed in reproducing the kinetics of DnaK localization experimentally observed. DnaK starts from foci, moves to large aggregates during acute stress, resolves those aggregates during recovery and finally returns to its initial punctate localization pattern. Finally, we compare real biological events with hypothetical repartitions of the protein aggregates or DnaK. We then notice that DnaK action is more efficient on protein aggregates than on protein homogeneously distributed.
Selective Gene Delivery for Integrating Exogenous DNA into Plastid and Mitochondrial Genomes Using Peptide-DNA Complexes.

Science.gov (United States)

Yoshizumi, Takeshi; Oikawa, Kazusato; Chuah, Jo-Ann; Kodama, Yutaka; Numata, Keiji

2018-05-14

Selective gene delivery into organellar genomes (mitochondrial and plastid genomes) has been limited because of a lack of appropriate platform technology, even though these organelles are essential for metabolite and energy production. Techniques for selective organellar modification are needed to functionally improve organelles and produce transplastomic/transmitochondrial plants. However, no method for mitochondrial genome modification has yet been established for multicellular organisms including plants. Likewise, modification of plastid genomes has been limited to a few plant species and algae. In the present study, we developed ionic complexes of fusion peptides containing organellar targeting signal and plasmid DNA for selective delivery of exogenous DNA into the plastid and mitochondrial genomes of intact plants. This is the first report of exogenous DNA being integrated into the mitochondrial genomes of not only plants, but also multicellular organisms in general. This fusion peptide-mediated gene delivery system is a breakthrough platform for both plant organellar biotechnology and gene therapy for mitochondrial diseases in animals.
Large-scale chromosome folding versus genomic DNA sequences: A discrete double Fourier transform technique.

Science.gov (United States)

Chechetkin, V R; Lobzin, V V

2017-08-07

Using state-of-the-art techniques combining imaging methods and high-throughput genomic mapping tools leaded to the significant progress in detailing chromosome architecture of various organisms. However, a gap still remains between the rapidly growing structural data on the chromosome folding and the large-scale genome organization. Could a part of information on the chromosome folding be obtained directly from underlying genomic DNA sequences abundantly stored in the databanks? To answer this question, we developed an original discrete double Fourier transform (DDFT). DDFT serves for the detection of large-scale genome regularities associated with domains/units at the different levels of hierarchical chromosome folding. The method is versatile and can be applied to both genomic DNA sequences and corresponding physico-chemical parameters such as base-pairing free energy. The latter characteristic is closely related to the replication and transcription and can also be used for the assessment of temperature or supercoiling effects on the chromosome folding. We tested the method on the genome of E. coli K-12 and found good correspondence with the annotated domains/units established experimentally. As a brief illustration of further abilities of DDFT, the study of large-scale genome organization for bacteriophage PHIX174 and bacterium Caulobacter crescentus was also added. The combined experimental, modeling, and bioinformatic DDFT analysis should yield more complete knowledge on the chromosome architecture and genome organization. Copyright © 2017 Elsevier Ltd. All rights reserved.
Microarray of DNA probes on carboxylate functional beads surface

Institute of Scientific and Technical Information of China (English)

黄承志; 李原芳; 黄新华; 范美坤

2000-01-01

The microarray of DNA probes with 5’ -NH2 and 5’ -Tex/3’ -NH2 modified terminus on 10 um carboxylate functional beads surface in the presence of 1-ethyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC) is characterized in the preseni paper. it was found that the microarray capacity of DNA probes on the beads surface depends on the pH of the aqueous solution, the concentra-tion of DNA probe and the total surface area of the beads. On optimal conditions, the minimum distance of 20 mer single-stranded DNA probe microarrayed on beads surface is about 14 nm, while that of 20 mer double-stranded DNA probes is about 27 nm. If the probe length increases from 20 mer to 35 mer, its microarray density decreases correspondingly. Mechanism study shows that the binding mode of DNA probes on the beads surface is nearly parallel to the beads surface.
Microarray of DNA probes on carboxylate functional beads surface

Institute of Scientific and Technical Information of China (English)

无

2000-01-01

The microarray of DNA probes with 5′-NH2 and 5′-Tex/3′-NH2 modified terminus on 10 m m carboxylate functional beads surface in the presence of 1-ethyl-3-(3-dimethylaminopropyl)- carbodiimide (EDC) is characterized in the present paper. It was found that the microarray capacity of DNA probes on the beads surface depends on the pH of the aqueous solution, the concentration of DNA probe and the total surface area of the beads. On optimal conditions, the minimum distance of 20 mer single-stranded DNA probe microarrayed on beads surface is about 14 nm, while that of 20 mer double-stranded DNA probes is about 27 nm. If the probe length increases from 20 mer to 35 mer, its microarray density decreases correspondingly. Mechanism study shows that the binding mode of DNA probes on the beads surface is nearly parallel to the beads surface.

Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

Science.gov (United States)

Pietrowski, D; Förster, M

2000-01-01

The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).
Radiotaxons and reliability of a genome

International Nuclear Information System (INIS)

Korogodin, V.I.

1982-01-01

Radiosensitivity of cells (D 0 ) is considered with regard to the structural organization of the genome. The following terms are introduced: ''karyotaxon'', organisms with identical structural organization of the genome, and ''specific genome stability'' K=D 0 C, where C is the quantity of DNA in the cell nucleus; K is the amount of energy (eV) the sorption of which in DNA is necessary and sufficient for one elementary damage to occur. It was shown that Ksub(i)=const. within every karyotaxon ''i''. K 1 =100 eV for viruses, and K 4 =61000 eV for the highest level of genome organization (diploid eukaryotes including man). Potential mechanisms of increasing Ksub(i) with increasing level of genome organization and the role of this factor in evolution are discussed [ru
Epigenetic Variation in Monozygotic Twins: A Genome-Wide Analysis of DNA Methylation in Buccal Cells

Directory of Open Access Journals (Sweden)

Jenny van Dongen

2014-05-01

Full Text Available DNA methylation is one of the most extensively studied epigenetic marks in humans. Yet, it is largely unknown what causes variation in DNA methylation between individuals. The comparison of DNA methylation profiles of monozygotic (MZ twins offers a unique experimental design to examine the extent to which such variation is related to individual-specific environmental influences and stochastic events or to familial factors (DNA sequence and shared environment. We measured genome-wide DNA methylation in buccal samples from ten MZ pairs (age 8–19 using the Illumina 450k array and examined twin correlations for methylation level at 420,921 CpGs after QC. After selecting CpGs showing the most variation in the methylation level between subjects, the mean genome-wide correlation (rho was 0.54. The correlation was higher, on average, for CpGs within CpG islands (CGIs, compared to CGI shores, shelves and non-CGI regions, particularly at hypomethylated CpGs. This finding suggests that individual-specific environmental and stochastic influences account for more variation in DNA methylation in CpG-poor regions. Our findings also indicate that it is worthwhile to examine heritable and shared environmental influences on buccal DNA methylation in larger studies that also include dizygotic twins.
LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

Science.gov (United States)

Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

2014-02-17

As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of
Oxidative DNA damage causes mitochondrial genomic instability in Saccharomyces cerevisiae.

Science.gov (United States)

Doudican, Nicole A; Song, Binwei; Shadel, Gerald S; Doetsch, Paul W

2005-06-01

Mitochondria contain their own genome, the integrity of which is required for normal cellular energy metabolism. Reactive oxygen species (ROS) produced by normal mitochondrial respiration can damage cellular macromolecules, including mitochondrial DNA (mtDNA), and have been implicated in degenerative diseases, cancer, and aging. We developed strategies to elevate mitochondrial oxidative stress by exposure to antimycin and H(2)O(2) or utilizing mutants lacking mitochondrial superoxide dismutase (sod2Delta). Experiments were conducted with strains compromised in mitochondrial base excision repair (ntg1Delta) and oxidative damage resistance (pif1Delta) in order to delineate the relationship between these pathways. We observed enhanced ROS production, resulting in a direct increase in oxidative mtDNA damage and mutagenesis. Repair-deficient mutants exposed to oxidative stress conditions exhibited profound genomic instability. Elimination of Ntg1p and Pif1p resulted in a synergistic corruption of respiratory competency upon exposure to antimycin and H(2)O(2). Mitochondrial genomic integrity was substantially compromised in ntg1Delta pif1Delta sod2Delta strains, since these cells exhibit a total loss of mtDNA. A stable respiration-defective strain, possessing a normal complement of mtDNA damage resistance pathways, exhibited a complete loss of mtDNA upon exposure to antimycin and H(2)O(2). This loss was preventable by Sod2p overexpression. These results provide direct evidence that oxidative mtDNA damage can be a major contributor to mitochondrial genomic instability and demonstrate cooperation of Ntg1p and Pif1p to resist the introduction of lesions into the mitochondrial genome.
Engineering MerR for Sequestration and MerA for Reduction of Toxic Metals and Radionuclides

International Nuclear Information System (INIS)

Summers, Anne O.

2008-01-01

The objectives of this project were (1) to alter a metalloregulatory protein (MerR) so that it would bind other toxic metals or radionuclides with similar affinity so that the engineered protein itself and/or bacteria expressing it could be deployed in the environment to specifically sequester such metals and (2) to alter the mercuric reductase, MerA, to reduce radionuclides and render them less mobile. Both projects had a basic science component. In the first case, such information about MerR illuminates how proteins discriminate very similar metals/elements. In the second case, information about MerA reveals the criteria for transmission of reducing equivalents from NADPH to redox-active metals. The work involved genetic engineering of all or parts of both proteins and examination of their resultant properties both in vivo and in vitro, the latter with biochemical and biophysical tools including equilibrium and non-equilibrium dialysis, XAFS, NMR, x-ray crystallography, and titration calorimetry. We defined the basis for metal specificity in MerR, devised a bacterial strain that sequesters Hg while growing, characterized gold reduction by MerA and the role of the metallochaperone domain of MerA, and determined the 3-D structure of MerB, the organomercurial lyase.
EG-13GENOME-WIDE METHYLATION ANALYSIS IDENTIFIES GENOMIC DNA DEMETHYLATION DURING MALIGNANT PROGRESSION OF GLIOMAS

Science.gov (United States)

Saito, Kuniaki; Mukasa, Akitake; Nagae, Genta; Aihara, Koki; Otani, Ryohei; Takayanagi, Shunsaku; Omata, Mayu; Tanaka, Shota; Shibahara, Junji; Takahashi, Miwako; Momose, Toshimitsu; Shimamura, Teppei; Miyano, Satoru; Narita, Yoshitaka; Ueki, Keisuke; Nishikawa, Ryo; Nagane, Motoo; Aburatani, Hiroyuki; Saito, Nobuhito

2014-01-01

Low-grade gliomas often undergo malignant progression, and these transformations are a leading cause of death in patients with low-grade gliomas. However, the molecular mechanisms underlying malignant tumor progression are still not well understood. Recent evidence indicates that epigenetic deregulation is an important cause of gliomagenesis; therefore, we examined the impact of epigenetic changes during malignant progression of low-grade gliomas. Specifically, we used the Illumina Infinium Human Methylation 450K BeadChip to perform genome-wide DNA methylation analysis of 120 gliomas and four normal brains. This study sample included 25 matched-pairs of initial low-grade gliomas and recurrent tumors (temporal heterogeneity) and 20 of the 25 recurring tumors recurred as malignant progressions, and one matched-pair of newly emerging malignant lesions and pre-existing lesions (spatial heterogeneity). Analyses of methylation profiles demonstrated that most low-grade gliomas in our sample (43/51; 84%) had a CpG island methylator phenotype (G-CIMP). Remarkably, approximately 50% of secondary glioblastomas that had progressed from low-grade tumors with the G-CIMP status exhibited a characteristic partial demethylation of genomic DNA during malignant progression, but other recurrent gliomas showed no apparent change in DNA methylation pattern. Interestingly, we found that most loci that were demethylated during malignant progression were located outside of CpG islands. The information of histone modifications patterns in normal human astrocytes and embryonal stem cells also showed that the ratio of active marks at the site corresponding to DNA demethylated loci in G-CIMP-demethylated tumors was significantly lower; this finding indicated that most demethylated loci in G-CIMP-demethylated tumors were likely transcriptionally inactive. A small number of the genes that were upregulated and had demethylated CpG islands were associated with cell cycle-related pathway. In
Next Generation DNA Sequencing and the Future of Genomic Medicine

OpenAIRE

Anderson, Matthew W.; Schrijver, Iris

2010-01-01

In the years since the first complete human genome sequence was reported, there has been a rapid development of technologies to facilitate high-throughput sequence analysis of DNA (termed “next-generation” sequencing). These novel approaches to DNA sequencing offer the promise of complete genomic analysis at a cost feasible for routine clinical diagnostics. However, the ability to more thoroughly interrogate genomic sequence raises a number of important issues with regard to result interpreta...
Transcription facilitated genome-wide recruitment of topoisomerase I and DNA gyrase.

Science.gov (United States)

Ahmed, Wareed; Sala, Claudia; Hegde, Shubhada R; Jha, Rajiv Kumar; Cole, Stewart T; Nagaraja, Valakunja

2017-05-01

Movement of the transcription machinery along a template alters DNA topology resulting in the accumulation of supercoils in DNA. The positive supercoils generated ahead of transcribing RNA polymerase (RNAP) and the negative supercoils accumulating behind impose severe topological constraints impeding transcription process. Previous studies have implied the role of topoisomerases in the removal of torsional stress and the maintenance of template topology but the in vivo interaction of functionally distinct topoisomerases with heterogeneous chromosomal territories is not deciphered. Moreover, how the transcription-induced supercoils influence the genome-wide recruitment of DNA topoisomerases remains to be explored in bacteria. Using ChIP-Seq, we show the genome-wide occupancy profile of both topoisomerase I and DNA gyrase in conjunction with RNAP in Mycobacterium tuberculosis taking advantage of minimal topoisomerase representation in the organism. The study unveils the first in vivo genome-wide interaction of both the topoisomerases with the genomic regions and establishes that transcription-induced supercoils govern their recruitment at genomic sites. Distribution profiles revealed co-localization of RNAP and the two topoisomerases on the active transcriptional units (TUs). At a given locus, topoisomerase I and DNA gyrase were localized behind and ahead of RNAP, respectively, correlating with the twin-supercoiled domains generated. The recruitment of topoisomerases was higher at the genomic loci with higher transcriptional activity and/or at regions under high torsional stress compared to silent genomic loci. Importantly, the occupancy of DNA gyrase, sole type II topoisomerase in Mtb, near the Ter domain of the Mtb chromosome validates its function as a decatenase.
Reliable typing of MERS-CoV variants with a small genome fragment

NARCIS (Netherlands)

S.L. Smits (Saskia); V.S. Raj (Stalin); S.D. Pas (Suzan); C.B.E.M. Reusken (Chantal); K.A. Mohran (Khaled A.); E. Farag (Elmoubasher); H.E. Al Romaihi (Hamad); M.M. AlHajri (Mohd); B.L. Haagmans (Bart); M.P.G. Koopmans D.V.M. (Marion)

2015-01-01

textabstractBackground: Middle East Respiratory Syndrome coronavirus (. MERS-CoV) is an emerging pathogen that causes lower respiratory tract infection in humans. Camels are the likely animal source for zoonotic infection, although exact transmission modes remain to be determined. Human-to-human
ATM signaling and genomic stability in response to DNA damage

International Nuclear Information System (INIS)

Lavin, Martin F.; Birrell, Geoff; Chen, Philip; Kozlov, Sergei; Scott, Shaun; Gueven, Nuri

2005-01-01

DNA double strand breaks represent the most threatening lesion to the integrity of the genome in cells exposed to ionizing radiation and radiomimetic chemicals. Those breaks are recognized, signaled to cell cycle checkpoints and repaired by protein complexes. The product of the gene (ATM) mutated in the human genetic disorder ataxia-telangiectasia (A-T) plays a central role in the recognition and signaling of DNA damage. ATM is one of an ever growing number of proteins which when mutated compromise the stability of the genome and predispose to tumour development. Mechanisms for recognising double strand breaks in DNA, maintaining genome stability and minimizing risk of cancer are discussed
Surveillance of the Middle East respiratory syndrome (MERS) coronavirus (CoV) infection in healthcare workers after contact with confirmed MERS patients: incidence and risk factors of MERS-CoV seropositivity.

Science.gov (United States)

Kim, C-J; Choi, W S; Jung, Y; Kiem, S; Seol, H Y; Woo, H J; Choi, Y H; Son, J S; Kim, K-H; Kim, Y-S; Kim, E S; Park, S H; Yoon, J H; Choi, S-M; Lee, H; Oh, W S; Choi, S-Y; Kim, N-J; Choi, J-P; Park, S Y; Kim, J; Jeong, S J; Lee, K S; Jang, H C; Rhee, J Y; Kim, B-N; Bang, J H; Lee, J H; Park, S; Kim, H Y; Choi, J K; Wi, Y-M; Choi, H J

2016-10-01

Given the mode of transmission of Middle East respiratory syndrome (MERS), healthcare workers (HCWs) in contact with MERS patients are expected to be at risk of MERS infections. We evaluated the prevalence of MERS coronavirus (CoV) immunoglobulin (Ig) G in HCWs exposed to MERS patients and calculated the incidence of MERS-affected cases in HCWs. We enrolled HCWs from hospitals where confirmed MERS patients had visited. Serum was collected 4 to 6 weeks after the last contact with a confirmed MERS patient. We performed an enzyme-linked immunosorbent assay (ELISA) to screen for the presence of MERS-CoV IgG and an indirect immunofluorescence test (IIFT) to confirm MERS-CoV IgG. We used a questionnaire to collect information regarding the exposure. We calculated the incidence of MERS-affected cases by dividing the sum of PCR-confirmed and serology-confirmed cases by the number of exposed HCWs in participating hospitals. In total, 1169 HCWs in 31 hospitals had contact with 114 MERS patients, and among the HCWs, 15 were PCR-confirmed MERS cases in study hospitals. Serologic analysis was performed for 737 participants. ELISA was positive in five participants and borderline for seven. IIFT was positive for two (0.3%) of these 12 participants. Among the participants who did not use appropriate personal protective equipment (PPE), seropositivity was 0.7% (2/294) compared to 0% (0/443) in cases with appropriate PPE use. The incidence of MERS infection in HCWs was 1.5% (17/1169). The seroprevalence of MERS-CoV IgG among HCWs was higher among participants who did not use appropriate PPE. Copyright © 2016. Published by Elsevier Ltd.
Genomic gigantism: DNA loss is slow in mountain grasshoppers.

Science.gov (United States)

Bensasson, D; Petrov, D A; Zhang, D X; Hartl, D L; Hewitt, G M

2001-02-01

Several studies have shown DNA loss to be inversely correlated with genome size in animals. These studies include a comparison between Drosophila and the cricket, Laupala, but there has been no assessment of DNA loss in insects with very large genomes. Podisma pedestris, the brown mountain grasshopper, has a genome over 100 times as large as that of Drosophila and 10 times as large as that of Laupala. We used 58 paralogous nuclear pseudogenes of mitochondrial origin to study the characteristics of insertion, deletion, and point substitution in P. pedestris and Italopodisma. In animals, these pseudogenes are "dead on arrival"; they are abundant in many different eukaryotes, and their mitochondrial origin simplifies the identification of point substitutions accumulated in nuclear pseudogene lineages. There appears to be a mononucleotide repeat within the 643-bp pseudogene sequence studied that acts as a strong hot spot for insertions or deletions (indels). Because the data for other insect species did not contain such an unusual region, hot spots were excluded from species comparisons. The rate of DNA loss relative to point substitution appears to be considerably and significantly lower in the grasshoppers studied than in Drosophila or Laupala. This suggests that the inverse correlation between genome size and the rate of DNA loss can be extended to comparisons between insects with large or gigantic genomes (i.e., Laupala and Podisma). The low rate of DNA loss implies that in grasshoppers, the accumulation of point mutations is a more potent force for obscuring ancient pseudogenes than their loss through indel accumulation, whereas the reverse is true for Drosophila. The main factor contributing to the difference in the rates of DNA loss estimated for grasshoppers, crickets, and Drosophila appears to be deletion size. Large deletions are relatively rare in Podisma and Italopodisma.
Genomic analysis of murine DNA-dependent protein kinase

International Nuclear Information System (INIS)

Fujimori, A.; Abe, M.

2003-01-01

Full text: The gene of catalytic subunit of DNA dependent protein kinase is responsible gene for SCID mice. The molecules play a critical role in non-homologous end joining including the V(D)J recombination. Contribution of the molecules to the difference of radiosensitivity and the susceptibility to cancer has been suggested. Here we show the entire nucleotide sequence of approximately 193 kbp and 84 kbp genomic regions encoding the entire DNA-PKcs gene in the mouse and chicken respectively. Retroposon was found in the intron 51 of mouse genomic DNA-PKcs gene but in human and chicken. Comparative analysis of these two species strongly suggested that only two genes, DNA-PKcs and MCM4, exist in the region of both species. Several conserved sequences and cis elements, however, were predicted. Recently, the orthologous region for the human DNA-PKcs locus was completed. The results of further comparative study will be discussed
Environmental Contamination and Viral Shedding in MERS Patients During MERS-CoV Outbreak in South Korea.

Science.gov (United States)

Bin, Seo Yu; Heo, Jung Yeon; Song, Min-Suk; Lee, Jacob; Kim, Eun-Ha; Park, Su-Jin; Kwon, Hyeok-Il; Kim, Se Mi; Kim, Young-Il; Si, Young-Jae; Lee, In-Won; Baek, Yun Hee; Choi, Won-Suk; Min, Jinsoo; Jeong, Hye Won; Choi, Young Ki

2016-03-15

Although Middle East Respiratory Syndrome coronavirus (MERS-CoV) is characterized by a risk of nosocomial transmission, the detailed mode of transmission and period of virus shedding from infected patients are poorly understood. The aims of this study were to investigate the potential role of environmental contamination by MERS-CoV in healthcare settings and to define the period of viable virus shedding from MERS patients. We investigated environmental contamination from 4 patients in MERS-CoV units of 2 hospitals. MERS-CoV was detected by reverse transcription polymerase chain reaction (PCR) and viable virus was isolated by cultures. Many environmental surfaces of MERS patient rooms, including points frequently touched by patients or healthcare workers, were contaminated by MERS-CoV. Viral RNA was detected up to five days from environmental surfaces following the last positive PCR from patients' respiratory specimens. MERS-CoV RNA was detected in samples from anterooms, medical devices, and air-ventilating equipment. In addition, MERS-CoV was isolated from environmental objects such as bed sheets, bedrails, IV fluid hangers, and X-ray devices. During the late clinical phase of MERS, viable virus could be isolated in 3 of the 4 enrolled patients on day 18 to day 25 after symptom onset. Most of touchable surfaces in MERS units were contaminated by patients and health care workers and the viable virus could shed through respiratory secretion from clinically fully recovered patients. These results emphasize the need for strict environmental surface hygiene practices, and sufficient isolation period based on laboratory results rather than solely on clinical symptoms. © The Author 2015. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
The mitochondrial and plastid genomes of Volvox carteri: bloated molecules rich in repetitive DNA

Directory of Open Access Journals (Sweden)

Lee Robert W

2009-03-01

Full Text Available Abstract Background The magnitude of noncoding DNA in organelle genomes can vary significantly; it is argued that much of this variation is attributable to the dissemination of selfish DNA. The results of a previous study indicate that the mitochondrial DNA (mtDNA of the green alga Volvox carteri abounds with palindromic repeats, which appear to be selfish elements. We became interested in the evolution and distribution of these repeats when, during a cursory exploration of the V. carteri nuclear DNA (nucDNA and plastid DNA (ptDNA sequences, we found palindromic repeats with similar structural features to those of the mtDNA. Upon this discovery, we decided to investigate the diversity and evolutionary implications of these palindromic elements by sequencing and characterizing large portions of mtDNA and ptDNA and then comparing these data to the V. carteri draft nuclear genome sequence. Results We sequenced 30 and 420 kilobases (kb of the mitochondrial and plastid genomes of V. carteri, respectively – resulting in partial assemblies of these genomes. The mitochondrial genome is the most bloated green-algal mtDNA observed to date: ~61% of the sequence is noncoding, most of which is comprised of short palindromic repeats spread throughout the intergenic and intronic regions. The plastid genome is the largest (>420 kb and most expanded (>80% noncoding ptDNA sequence yet discovered, with a myriad of palindromic repeats in the noncoding regions, which have a similar size and secondary structure to those of the mtDNA. We found that 15 kb (~0.01% of the nuclear genome are homologous to the palindromic elements of the mtDNA, and 50 kb (~0.05% are homologous to those of the ptDNA. Conclusion Selfish elements in the form of short palindromic repeats have propagated in the V. carteri mtDNA and ptDNA, resulting in the distension of these genomes. Copies of these same repeats are also found in a small fraction of the nucDNA, but appear to be inert in this
Isolation and analysis of high quality nuclear DNA with reduced organellar DNA for plant genome sequencing and resequencing

Directory of Open Access Journals (Sweden)

Zdepski Anna

2011-05-01

Full Text Available Abstract Background High throughput sequencing (HTS technologies have revolutionized the field of genomics by drastically reducing the cost of sequencing, making it feasible for individual labs to sequence or resequence plant genomes. Obtaining high quality, high molecular weight DNA from plants poses significant challenges due to the high copy number of chloroplast and mitochondrial DNA, as well as high levels of phenolic compounds and polysaccharides. Multiple methods have been used to isolate DNA from plants; the CTAB method is commonly used to isolate total cellular DNA from plants that contain nuclear DNA, as well as chloroplast and mitochondrial DNA. Alternatively, DNA can be isolated from nuclei to minimize chloroplast and mitochondrial DNA contamination. Results We describe optimized protocols for isolation of nuclear DNA from eight different plant species encompassing both monocot and eudicot species. These protocols use nuclei isolation to minimize chloroplast and mitochondrial DNA contamination. We also developed a protocol to determine the number of chloroplast and mitochondrial DNA copies relative to the nuclear DNA using quantitative real time PCR (qPCR. We compared DNA isolated from nuclei to total cellular DNA isolated with the CTAB method. As expected, DNA isolated from nuclei consistently yielded nuclear DNA with fewer chloroplast and mitochondrial DNA copies, as compared to the total cellular DNA prepared with the CTAB method. This protocol will allow for analysis of the quality and quantity of nuclear DNA before starting a plant whole genome sequencing or resequencing experiment. Conclusions Extracting high quality, high molecular weight nuclear DNA in plants has the potential to be a bottleneck in the era of whole genome sequencing and resequencing. The methods that are described here provide a framework for researchers to extract and quantify nuclear DNA in multiple types of plants.
Detection of Non-Amplified Genomic DNA

CERN Document Server

Corradini, Roberto

2012-01-01

This book offers a state-of-the-art overview on non amplified DNA detection methods and provides chemists, biochemists, biotechnologists and material scientists with an introduction to these methods. In fact all these fields have dedicated resources to the problem of nucleic acid detection, each contributing with their own specific methods and concepts. This book will explain the basic principles of the different non amplified DNA detection methods available, highlighting their respective advantages and limitations. The importance of non-amplified DNA sequencing technologies will be also discussed. Non-amplified DNA detection can be achieved by adopting different techniques. Such techniques have allowed the commercialization of innovative platforms for DNA detection that are expected to break into the DNA diagnostics market. The enhanced sensitivity required for the detection of non amplified genomic DNA has prompted new strategies that can achieve ultrasensitivity by combining specific materials with specifi...
TALENs: customizable molecular DNA scissors for genome engineering of plants.

Science.gov (United States)

Chen, Kunling; Gao, Caixia

2013-06-20

Precise genome modification with engineered nucleases is a powerful tool for studying basic biology and applied biotechnology. Transcription activator-like effector nucleases (TALENs), consisting of an engineered specific (TALE) DNA binding domain and a Fok I cleavage domain, are newly developed versatile reagents for genome engineering in different organisms. Because of the simplicity of the DNA recognition code and their modular assembly, TALENs can act as customizable molecular DNA scissors inducing double-strand breaks (DSBs) at given genomic location. Thus, they provide a valuable approach to targeted genome modifications such as mutations, insertions, replacements or chromosome rearrangements. In this article, we review the development of TALENs, and summarize the principles and tools for TALEN-mediated gene targeting in plant cells, as well as current and potential strategies for use in plant research and crop improvement. Copyright © 2013. Published by Elsevier Ltd.
Genome-wide DNA Methylation Profiling of Cell-Free Serum DNA in Esophageal Adenocarcinoma and Barrett Esophagus

Directory of Open Access Journals (Sweden)

Rihong Zhai

2012-01-01

Full Text Available Aberrant DNA methylation (DNAm is a feature of most types of cancers. Genome-wide DNAm profiling has been performed successfully on tumor tissue DNA samples. However, the invasive procedure limits the utility of tumor tissue for epidemiological studies. While recent data indicate that cell-free circulating DNAm (cfDNAm profiles reflect DNAm status in corresponding tumor tissues, no studies have examined the association of cfDNAm with cancer or precursors on a genome-wide scale. The objective of this pilot study was to evaluate the putative significance of genome-wide cfDNAm profiles in esophageal adenocarcinoma (EA and Barrett esophagus (BE, EA precursor. We performed genome-wide DNAm profiling in EA tissue DNA (n = 8 and matched serum DNA (n = 8, in serum DNA of BE (n = 10, and in healthy controls (n = 10 using the Infinium HumanMethylation27 BeadChip that covers 27,578 CpG loci in 14,495 genes. We found that cfDNAm profiles were highly correlated to DNAm profiles in matched tumor tissue DNA (r = 0.92 in patients with EA. We selected the most differentially methylated loci to perform hierarchical clustering analysis. We found that 911 loci can discriminate perfectly between EA and control samples, 554 loci can separate EA from BE samples, and 46 loci can distinguish BE from control samples. These results suggest that genome-wide cfDNAm profiles are highly consistent with DNAm profiles detected in corresponding tumor tissues. Differential cfDNAm profiling may be a useful approach for the noninvasive screening of EA and EA premalignant lesions.

Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

Science.gov (United States)

Cao, Yinhe; Tung, Wen-Wen; Gao, J B

2004-01-01

With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.
Asy2/Mer2: an evolutionarily conserved mediator of meiotic recombination, pairing, and global chromosome compaction.

Science.gov (United States)

Tessé, Sophie; Bourbon, Henri-Marc; Debuchy, Robert; Budin, Karine; Dubois, Emeline; Liangran, Zhang; Antoine, Romain; Piolot, Tristan; Kleckner, Nancy; Zickler, Denise; Espagne, Eric

2017-09-15

Meiosis is the cellular program by which a diploid cell gives rise to haploid gametes for sexual reproduction. Meiotic progression depends on tight physical and functional coupling of recombination steps at the DNA level with specific organizational features of meiotic-prophase chromosomes. The present study reveals that every step of this coupling is mediated by a single molecule: Asy2/Mer2. We show that Mer2, identified so far only in budding and fission yeasts, is in fact evolutionarily conserved from fungi (Mer2/Rec15/Asy2/Bad42) to plants (PRD3/PAIR1) and mammals (IHO1). In yeasts, Mer2 mediates assembly of recombination-initiation complexes and double-strand breaks (DSBs). This role is conserved in the fungus Sordaria However, functional analysis of 13 mer2 mutants and successive localization of Mer2 to axis, synaptonemal complex (SC), and chromatin revealed, in addition, three further important functions. First, after DSB formation, Mer2 is required for pairing by mediating homolog spatial juxtaposition, with implications for crossover (CO) patterning/interference. Second, Mer2 participates in the transfer/maintenance and release of recombination complexes to/from the SC central region. Third, after completion of recombination, potentially dependent on SUMOylation, Mer2 mediates global chromosome compaction and post-recombination chiasma development. Thus, beyond its role as a recombinosome-axis/SC linker molecule, Mer2 has important functions in relation to basic chromosome structure. © 2017 Tessé et al.; Published by Cold Spring Harbor Laboratory Press.
Bat biology, genomes, and the Bat1K project

DEFF Research Database (Denmark)

Teeling, Emma C; Vernes, Sonja C; Dávalos, Liliana M

2018-01-01

and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n∼1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any...
Detecting single DNA copy number variations in complex genomes using one nanogram of starting DNA and BAC-array CGH.

Science.gov (United States)

Guillaud-Bataille, Marine; Valent, Alexander; Soularue, Pascal; Perot, Christine; Inda, Maria Mar; Receveur, Aline; Smaïli, Sadek; Roest Crollius, Hugues; Bénard, Jean; Bernheim, Alain; Gidrol, Xavier; Danglot, Gisèle

2004-07-29

Comparative genomic hybridization to bacterial artificial chromosome (BAC)-arrays (array-CGH) is a highly efficient technique, allowing the simultaneous measurement of genomic DNA copy number at hundreds or thousands of loci, and the reliable detection of local one-copy-level variations. We report a genome-wide amplification method allowing the same measurement sensitivity, using 1 ng of starting genomic DNA, instead of the classical 1 microg usually necessary. Using a discrete series of DNA fragments, we defined the parameters adapted to the most faithful ligation-mediated PCR amplification and the limits of the technique. The optimized protocol allows a 3000-fold DNA amplification, retaining the quantitative characteristics of the initial genome. Validation of the amplification procedure, using DNA from 10 tumour cell lines hybridized to BAC-arrays of 1500 spots, showed almost perfectly superimposed ratios for the non-amplified and amplified DNAs. Correlation coefficients of 0.96 and 0.99 were observed for regions of low-copy-level variations and all regions, respectively (including in vivo amplified oncogenes). Finally, labelling DNA using two nucleotides bearing the same fluorophore led to a significant increase in reproducibility and to the correct detection of one-copy gain or loss in >90% of the analysed data, even for pseudotriploid tumour genomes.
Highly sensitive polymerase chain reaction-free quantum dot-based quantification of forensic genomic DNA

International Nuclear Information System (INIS)

Tak, Yu Kyung; Kim, Won Young; Kim, Min Jung; Han, Eunyoung; Han, Myun Soo; Kim, Jong Jin; Kim, Wook; Lee, Jong Eun; Song, Joon Myong

2012-01-01

Highlights: ► Genomic DNA quantification were performed using a quantum dot-labeled Alu sequence. ► This probe provided PCR-free determination of human genomic DNA. ► Qdot-labeled Alu probe-hybridized genomic DNAs had a 2.5-femtogram detection limit. ► Qdot-labeled Alu sequence was used to assess DNA samples for human identification. - Abstract: Forensic DNA samples can degrade easily due to exposure to light and moisture at the crime scene. In addition, the amount of DNA acquired at a criminal site is inherently limited. This limited amount of human DNA has to be quantified accurately after the process of DNA extraction. The accurately quantified extracted genomic DNA is then used as a DNA template in polymerase chain reaction (PCR) amplification for short tandem repeat (STR) human identification. Accordingly, highly sensitive and human-specific quantification of forensic DNA samples is an essential issue in forensic study. In this work, a quantum dot (Qdot)-labeled Alu sequence was developed as a probe to simultaneously satisfy both the high sensitivity and human genome selectivity for quantification of forensic DNA samples. This probe provided PCR-free determination of human genomic DNA and had a 2.5-femtogram detection limit due to the strong emission and photostability of the Qdot. The Qdot-labeled Alu sequence has been used successfully to assess 18 different forensic DNA samples for STR human identification.
Mutagenic repair of double-stranded DNA breaks in vaccinia virus genomes requires cellular DNA ligase IV activity in the cytosol.

Science.gov (United States)

Luteijn, Rutger David; Drexler, Ingo; Smith, Geoffrey L; Lebbink, Robert Jan; Wiertz, Emmanuel J H J

2018-04-20

Poxviruses comprise a group of large dsDNA viruses that include members relevant to human and animal health, such as variola virus, monkeypox virus, cowpox virus and vaccinia virus (VACV). Poxviruses are remarkable for their unique replication cycle, which is restricted to the cytoplasm of infected cells. The independence from the host nucleus requires poxviruses to encode most of the enzymes involved in DNA replication, transcription and processing. Here, we use the CRISPR/Cas9 genome engineering system to induce DNA damage to VACV (strain Western Reserve) genomes. We show that targeting CRISPR/Cas9 to essential viral genes limits virus replication efficiently. Although VACV is a strictly cytoplasmic pathogen, we observed extensive viral genome editing at the target site; this is reminiscent of a non-homologous end-joining DNA repair mechanism. This pathway was not dependent on the viral DNA ligase, but critically involved the cellular DNA ligase IV. Our data show that DNA ligase IV can act outside of the nucleus to allow repair of dsDNA breaks in poxvirus genomes. This pathway might contribute to the introduction of mutations within the genome of poxviruses and may thereby promote the evolution of these viruses.
Molecular analysis and genomic organization of major DNA satellites in banana (Musa spp.).

Science.gov (United States)

Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

2013-01-01

Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa.
Purification of High Molecular Weight Genomic DNA from Powdery Mildew for Long-Read Sequencing.

Science.gov (United States)

Feehan, Joanna M; Scheibel, Katherine E; Bourras, Salim; Underwood, William; Keller, Beat; Somerville, Shauna C

2017-03-31

The powdery mildew fungi are a group of economically important fungal plant pathogens. Relatively little is known about the molecular biology and genetics of these pathogens, in part due to a lack of well-developed genetic and genomic resources. These organisms have large, repetitive genomes, which have made genome sequencing and assembly prohibitively difficult. Here, we describe methods for the collection, extraction, purification and quality control assessment of high molecular weight genomic DNA from one powdery mildew species, Golovinomyces cichoracearum. The protocol described includes mechanical disruption of spores followed by an optimized phenol/chloroform genomic DNA extraction. A typical yield was 7 µg DNA per 150 mg conidia. The genomic DNA that is isolated using this procedure is suitable for long-read sequencing (i.e., > 48.5 kbp). Quality control measures to ensure the size, yield, and purity of the genomic DNA are also described in this method. Sequencing of the genomic DNA of the quality described here will allow for the assembly and comparison of multiple powdery mildew genomes, which in turn will lead to a better understanding and improved control of this agricultural pathogen.
Genomic and Molecular Landscape of DNA Damage Repair Deficiency across The Cancer Genome Atlas

Directory of Open Access Journals (Sweden)

Theo A. Knijnenburg

2018-04-01

Full Text Available Summary: DNA damage repair (DDR pathways modulate cancer risk, progression, and therapeutic response. We systematically analyzed somatic alterations to provide a comprehensive view of DDR deficiency across 33 cancer types. Mutations with accompanying loss of heterozygosity were observed in over 1/3 of DDR genes, including TP53 and BRCA1/2. Other prevalent alterations included epigenetic silencing of the direct repair genes EXO5, MGMT, and ALKBH3 in ∼20% of samples. Homologous recombination deficiency (HRD was present at varying frequency in many cancer types, most notably ovarian cancer. However, in contrast to ovarian cancer, HRD was associated with worse outcomes in several other cancers. Protein structure-based analyses allowed us to predict functional consequences of rare, recurrent DDR mutations. A new machine-learning-based classifier developed from gene expression data allowed us to identify alterations that phenocopy deleterious TP53 mutations. These frequent DDR gene alterations in many human cancers have functional consequences that may determine cancer progression and guide therapy. : Knijnenburg et al. present The Cancer Genome Atlas (TCGA Pan-Cancer analysis of DNA damage repair (DDR deficiency in cancer. They use integrative genomic and molecular analyses to identify frequent DDR alterations across 33 cancer types, correlate gene- and pathway-level alterations with genome-wide measures of genome instability and impaired function, and demonstrate the prognostic utility of DDR deficiency scores. Keywords: The Cancer Genome Atlas PanCanAtlas project, DNA damage repair, somatic mutations, somatic copy-number alterations, epigenetic silencing, DNA damage footprints, mutational signatures, integrative statistical analysis, protein structure analysis
Optimization of conditions to extract high quality DNA for PCR analysis from whole blood using SDS-proteinase K method

Directory of Open Access Journals (Sweden)

Wajhul Qamar

2017-11-01

Full Text Available In case of studies associated with human genetics, genomics, and pharmacogenetics the genomic DNA is extracted from the buccal cells, whole blood etc. Several methods are exploited by the researchers to extract DNA from the whole blood. One of these methods, which utilizes cell lysis and proteolytic properties of sodium dodecyl sulfate (SDS and proteinase K respectively, might also be called SDS-PK method. It does not include any hazardous chemicals such as phenol or chloroform and is inexpensive. However, several researchers report the same method with different formulas and conditions. During our experiments with whole blood DNA extraction we experienced problems such as protein contamination, DNA purity and yield when followed some SDS-PK protocols reported elsewhere. A260/A280 and A260/A230 ratios along with PCR amplification give a clear idea about the procedure that was followed to extract the DNA. In an effort to increase the DNA purity from human whole blood, we pointed out some steps of the protocol that play a crucial role in determining the extraction of high quality DNA.
Ultrafast spectroscopy on DNA-cleavage by endonuclease in molecular crowding.

Science.gov (United States)

Singh, Priya; Choudhury, Susobhan; Dutta, Shreyasi; Adhikari, Aniruddha; Bhattacharya, Siddhartha; Pal, Debasish; Pal, Samir Kumar

2017-10-01

The jam-packed intracellular environments differ the activity of a biological macromolecule from that in laboratory environments (in vitro) through a number of mechanisms called molecular crowding related to structure, function and dynamics of the macromolecule. Here, we have explored the structure, function and dynamics of a model enzyme protein DNase I in molecular crowing of polyethylene glycol (PEG; MW 3350). We have used steady state and picosecond resolved dynamics of a well-known intercalator ethidium bromide (EB) in a 20-mer double-stranded DNA (dsDNA) to monitor the DNA-cleavage by the enzyme in absence and presence PEG. We have also labelled the enzyme by a well-known fluorescent probe 8-anilino-1-naphthalenesulfonic acid ammonium salt (ANS) to study the molecular mechanism of the protein-DNA association through exited state relaxation of the probe in absence (dictated by polarity) and presence of EB in the DNA (dictated by Förster resonance energy transfer (FRET)). The overall and local structures of the protein in presence of PEG have been followed by circular dichroism and time resolved polarization gated spectroscopy respectively. The enhanced dynamical flexibility of protein in presence of PEG as revealed from excited state lifetime and polarization gated anisotropy of ANS has been correlated with the stronger DNA-binding for the higher nuclease activity. We have also used conventional experimental strategy of agarose gel electrophoresis to monitor DNA-cleavage and found consistent results of enhanced nuclease activities both on synthetic 20-mer oligonucleotide and long genomic DNA from calf thymus. Copyright © 2017 Elsevier B.V. All rights reserved.
An Adenovirus DNA Replication Factor, but Not Incoming Genome Complexes, Targets PML Nuclear Bodies.

Science.gov (United States)

Komatsu, Tetsuro; Nagata, Kyosuke; Wodrich, Harald

2016-02-01

Promyelocytic leukemia protein nuclear bodies (PML-NBs) are subnuclear domains implicated in cellular antiviral responses. Despite the antiviral activity, several nuclear replicating DNA viruses use the domains as deposition sites for the incoming viral genomes and/or as sites for viral DNA replication, suggesting that PML-NBs are functionally relevant during early viral infection to establish productive replication. Although PML-NBs and their components have also been implicated in the adenoviral life cycle, it remains unclear whether incoming adenoviral genome complexes target PML-NBs. Here we show using immunofluorescence and live-cell imaging analyses that incoming adenovirus genome complexes neither localize at nor recruit components of PML-NBs during early phases of infection. We further show that the viral DNA binding protein (DBP), an early expressed viral gene and essential DNA replication factor, independently targets PML-NBs. We show that DBP oligomerization is required to selectively recruit the PML-NB components Sp100 and USP7. Depletion experiments suggest that the absence of one PML-NB component might not affect the recruitment of other components toward DBP oligomers. Thus, our findings suggest a model in which an adenoviral DNA replication factor, but not incoming viral genome complexes, targets and modulates PML-NBs to support a conducive state for viral DNA replication and argue against a generalized concept that PML-NBs target incoming viral genomes. The immediate fate upon nuclear delivery of genomes of incoming DNA viruses is largely unclear. Early reports suggested that incoming genomes of herpesviruses are targeted and repressed by PML-NBs immediately upon nuclear import. Genome localization and/or viral DNA replication has also been observed at PML-NBs for other DNA viruses. Thus, it was suggested that PML-NBs may immediately sense and target nuclear viral genomes and hence serve as sites for deposition of incoming viral genomes and
Whole-genome methylation caller designed for methyl- DNA ...

African Journals Online (AJOL)

etchie

2013-02-20

Feb 20, 2013 ... Key words: Methyl-DNA immunoprecipitation, next-generation sequencing, Hidden ... its response to environmental cues. .... have a great potential to become the most cost-effective ... hg18 reference genome (set to 0 if not present in retrieved reads). ..... DNA methylation patterns and epigenetic memory.
Simplified extraction of good quality genomic DNA from a variety of ...

African Journals Online (AJOL)

Depending on the nature and complexity of plant material, proper method needs to be employed for extraction of genomic DNA, along with its performance evaluation by different molecular techniques. Here, we optimized and employed a simple genomic DNA isolation protocol suitable for a variety of plant materials ...
Screening the budding yeast genome reveals unique factors affecting K2 toxin susceptibility.

Science.gov (United States)

Servienė, Elena; Lukša, Juliana; Orentaitė, Irma; Lafontaine, Denis L J; Urbonavičius, Jaunius

2012-01-01

Understanding how biotoxins kill cells is of prime importance in biomedicine and the food industry. The budding yeast (S. cerevisiae) killers serve as a convenient model to study the activity of biotoxins consistently supplying with significant insights into the basic mechanisms of virus-host cell interactions and toxin entry into eukaryotic target cells. K1 and K2 toxins are active at the cell wall, leading to the disruption of the plasma membrane and subsequent cell death by ion leakage. K28 toxin is active in the cell nucleus, blocking DNA synthesis and cell cycle progression, thereby triggering apoptosis. Genome-wide screens in the budding yeast S. cerevisiae identified several hundred effectors of K1 and K28 toxins. Surprisingly, no such screen had been performed for K2 toxin, the most frequent killer toxin among industrial budding yeasts. We conducted several concurrent genome-wide screens in S. cerevisiae and identified 332 novel K2 toxin effectors. The effectors involved in K2 resistance and hypersensitivity largely map in distinct cellular pathways, including cell wall and plasma membrane structure/biogenesis and mitochondrial function for K2 resistance, and cell wall stress signaling and ion/pH homeostasis for K2 hypersensitivity. 70% of K2 effectors are different from those involved in K1 or K28 susceptibility. Our work demonstrates that despite the fact that K1 and K2 toxins share some aspects of their killing strategies, they largely rely on different sets of effectors. Since the vast majority of the host factors identified here is exclusively active towards K2, we conclude that cells have acquired a specific K2 toxin effectors set. Our work thus indicates that K1 and K2 have elaborated different biological pathways and provides a first step towards the detailed characterization of K2 mode of action.
Evaluation of different sources of DNA for use in genome wide studies and forensic application.

Science.gov (United States)

Al Safar, Habiba S; Abidi, Fatima H; Khazanehdari, Kamal A; Dadour, Ian R; Tay, Guan K

2011-02-01

In the field of epidemiology, Genome-Wide Association Studies (GWAS) are commonly used to identify genetic predispositions of many human diseases. Large repositories housing biological specimens for clinical and genetic investigations have been established to store material and data for these studies. The logistics of specimen collection and sample storage can be onerous, and new strategies have to be explored. This study examines three different DNA sources (namely, degraded genomic DNA, amplified degraded genomic DNA and amplified extracted DNA from FTA card) for GWAS using the Illumina platform. No significant difference in call rate was detected between amplified degraded genomic DNA extracted from whole blood and amplified DNA retrieved from FTA™ cards. However, using unamplified-degraded genomic DNA reduced the call rate to a mean of 42.6% compared to amplified DNA extracted from FTA card (mean of 96.6%). This study establishes the utility of FTA™ cards as a viable storage matrix for cells from which DNA can be extracted to perform GWAS analysis.
A DNA-based pattern classifier with in vitro learning and associative recall for genomic characterization and biosensing without explicit sequence knowledge.

Science.gov (United States)

Lee, Ju Seok; Chen, Junghuei; Deaton, Russell; Kim, Jin-Woo

2014-01-01

Genetic material extracted from in situ microbial communities has high promise as an indicator of biological system status. However, the challenge is to access genomic information from all organisms at the population or community scale to monitor the biosystem's state. Hence, there is a need for a better diagnostic tool that provides a holistic view of a biosystem's genomic status. Here, we introduce an in vitro methodology for genomic pattern classification of biological samples that taps large amounts of genetic information from all genes present and uses that information to detect changes in genomic patterns and classify them. We developed a biosensing protocol, termed Biological Memory, that has in vitro computational capabilities to "learn" and "store" genomic sequence information directly from genomic samples without knowledge of their explicit sequences, and that discovers differences in vitro between previously unknown inputs and learned memory molecules. The Memory protocol was designed and optimized based upon (1) common in vitro recombinant DNA operations using 20-base random probes, including polymerization, nuclease digestion, and magnetic bead separation, to capture a snapshot of the genomic state of a biological sample as a DNA memory and (2) the thermal stability of DNA duplexes between new input and the memory to detect similarities and differences. For efficient read out, a microarray was used as an output method. When the microarray-based Memory protocol was implemented to test its capability and sensitivity using genomic DNA from two model bacterial strains, i.e., Escherichia coli K12 and Bacillus subtilis, results indicate that the Memory protocol can "learn" input DNA, "recall" similar DNA, differentiate between dissimilar DNA, and detect relatively small concentration differences in samples. This study demonstrated not only the in vitro information processing capabilities of DNA, but also its promise as a genomic pattern classifier that could
A protocol for large scale genomic DNA isolation for cacao genetics ...

African Journals Online (AJOL)

Advances in DNA technology, such as marker assisted selection, detection of quantitative trait loci and genomic selection also require the isolation of DNA from a large number of samples and the preservation of tissue samples for future use in cacao genome studies. The present study proposes a method for the ...
Genomic DNA extraction from sapwood of Pinus roxburghii for ...

African Journals Online (AJOL)

A method for extraction of genomic DNA from sapwood tissues of mature tall trees of Pinus roxburghii, where collection of needle tissues is extremely difficult has been standardized. The extracted DNA was comparable to that obtained from the needle tissue in terms of yield and purity. The yield of extracted DNA ranged ...
Isolating silkworm genomic DNA without liquid nitrogen suitable for ...

African Journals Online (AJOL)

Genomic DNA was isolated from posterior silk gland of silkworms, Antheraea assama. Absolute alcohol was used as tissue fixing solution instead of grinding in liquid nitrogen, which yielded high molecular weight DNA (>40 kb). Samples yielded similar amount of DNA when fixed in absolute alcohol (400 μmg/g of silk gland ...

Analysis of the giant genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size.

Science.gov (United States)

Kelly, Laura J; Renny-Byfield, Simon; Pellicer, Jaume; Macas, Jiří; Novák, Petr; Neumann, Pavel; Lysak, Martin A; Day, Peter D; Berger, Madeleine; Fay, Michael F; Nichols, Richard A; Leitch, Andrew R; Leitch, Ilia J

2015-10-01

Plants exhibit an extraordinary range of genome sizes, varying by > 2000-fold between the smallest and largest recorded values. In the absence of polyploidy, changes in the amount of repetitive DNA (transposable elements and tandem repeats) are primarily responsible for genome size differences between species. However, there is ongoing debate regarding the relative importance of amplification of repetitive DNA versus its deletion in governing genome size. Using data from 454 sequencing, we analysed the most repetitive fraction of some of the largest known genomes for diploid plant species, from members of Fritillaria. We revealed that genomic expansion has not resulted from the recent massive amplification of just a handful of repeat families, as shown in species with smaller genomes. Instead, the bulk of these immense genomes is composed of highly heterogeneous, relatively low-abundance repeat-derived DNA, supporting a scenario where amplified repeats continually accumulate due to infrequent DNA removal. Our results indicate that a lack of deletion and low turnover of repetitive DNA are major contributors to the evolution of extremely large genomes and show that their size cannot simply be accounted for by the activity of a small number of high-abundance repeat families. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.
Functional interrogation of non-coding DNA through CRISPR genome editing.

Science.gov (United States)

Canver, Matthew C; Bauer, Daniel E; Orkin, Stuart H

2017-05-15

Methodologies to interrogate non-coding regions have lagged behind coding regions despite comprising the vast majority of the genome. However, the rapid evolution of clustered regularly interspaced short palindromic repeats (CRISPR)-based genome editing has provided a multitude of novel techniques for laboratory investigation including significant contributions to the toolbox for studying non-coding DNA. CRISPR-mediated loss-of-function strategies rely on direct disruption of the underlying sequence or repression of transcription without modifying the targeted DNA sequence. CRISPR-mediated gain-of-function approaches similarly benefit from methods to alter the targeted sequence through integration of customized sequence into the genome as well as methods to activate transcription. Here we review CRISPR-based loss- and gain-of-function techniques for the interrogation of non-coding DNA. Copyright © 2017 Elsevier Inc. All rights reserved.
Quantification of trace-level DNA by real-time whole genome amplification.

Science.gov (United States)

Kang, Min-Jung; Yu, Hannah; Kim, Sook-Kyung; Park, Sang-Ryoul; Yang, Inchul

2011-01-01

Quantification of trace amounts of DNA is a challenge in analytical applications where the concentration of a target DNA is very low or only limited amounts of samples are available for analysis. PCR-based methods including real-time PCR are highly sensitive and widely used for quantification of low-level DNA samples. However, ordinary PCR methods require at least one copy of a specific gene sequence for amplification and may not work for a sub-genomic amount of DNA. We suggest a real-time whole genome amplification method adopting the degenerate oligonucleotide primed PCR (DOP-PCR) for quantification of sub-genomic amounts of DNA. This approach enabled quantification of sub-picogram amounts of DNA independently of their sequences. When the method was applied to the human placental DNA of which amount was accurately determined by inductively coupled plasma-optical emission spectroscopy (ICP-OES), an accurate and stable quantification capability for DNA samples ranging from 80 fg to 8 ng was obtained. In blind tests of laboratory-prepared DNA samples, measurement accuracies of 7.4%, -2.1%, and -13.9% with analytical precisions around 15% were achieved for 400-pg, 4-pg, and 400-fg DNA samples, respectively. A similar quantification capability was also observed for other DNA species from calf, E. coli, and lambda phage. Therefore, when provided with an appropriate standard DNA, the suggested real-time DOP-PCR method can be used as a universal method for quantification of trace amounts of DNA.
Detection of Alicyclobacillus species in fruit juice using a random genomic DNA microarray chip.

Science.gov (United States)

Jang, Jun Hyeong; Kim, Sun-Joong; Yoon, Bo Hyun; Ryu, Jee-Hoon; Gu, Man Bock; Chang, Hyo-Ihl

2011-06-01

This study describes a method using a DNA microarray chip to rapidly and simultaneously detect Alicyclobacillus species in orange juice based on the hybridization of genomic DNA with random probes. Three food spoilage bacteria were used in this study: Alicyclobacillus acidocaldarius, Alicyclobacillus acidoterrestris, and Alicyclobacillus cycloheptanicus. The three Alicyclobacillus species were adjusted to 2 × 10(3) CFU/ml and inoculated into pasteurized 100% pure orange juice. Cy5-dCTP labeling was used for reference signals, and Cy3-dCTP was labeled for target genomic DNA. The molar ratio of 1:1 of Cy3-dCTP and Cy5-dCTP was used. DNA microarray chips were fabricated using randomly fragmented DNA of Alicyclobacillus spp. and were hybridized with genomic DNA extracted from Bacillus spp. Genomic DNA extracted from Alicyclobacillus spp. showed a significantly higher hybridization rate compared with DNA of Bacillus spp., thereby distinguishing Alicyclobacillus spp. from Bacillus spp. The results showed that the microarray DNA chip containing randomly fragmented genomic DNA was specific and clearly identified specific food spoilage bacteria. This microarray system is a good tool for rapid and specific detection of thermophilic spoilage bacteria, mainly Alicyclobacillus spp., and is useful and applicable to the fruit juice industry.
Sur terre comme sur mer: organisations spatiales en mer du Nord et mer de Norvège

Directory of Open Access Journals (Sweden)

Maryvonne LE BERRE

1998-09-01

Full Text Available L'article montre, au moyen de représentations cartographiques et graphiques, les structures spatiales développées en mer pour l'exploitation des hydrocarbures. Malgré les différences de milieu, l'appropriation et l'occupation territoriales du domaine maritime s'effectuent selon les mêmes principes que sur le continent, à ceci près que ce qui s'étale en surface sur le continent se développe à la verticale en mer.
Sequencing of chloroplast genome using whole cellular DNA and Solexa sequencing technology

Directory of Open Access Journals (Sweden)

Jian eWu

2012-11-01

Full Text Available Sequencing of the chloroplast genome using traditional sequencing methods has been difficult because of its size (>120 kb and the complicated procedures required to prepare templates. To explore the feasibility of sequencing the chloroplast genome using DNA extracted from whole cells and Solexa sequencing technology, we sequenced whole cellular DNA isolated from leaves of three Brassica rapa accessions with one lane per accession. In total, 246 Mb, 362Mb, 361 Mb sequence data were generated for the three accessions Chiifu-401-42, Z16 and FT, respectively. Microreads were assembled by reference-guided assembly using the cpDNA sequences of B. rapa, Arabidopsis thaliana, and Nicotiana tabacum. We achieved coverage of more than 99.96% of the cp genome in the three tested accessions using the B. rapa sequence as the reference. When A. thaliana or N. tabacum sequences were used as references, 99.7–99.8% or 95.5–99.7% of the B. rapa chloroplast genome was covered, respectively. These results demonstrated that sequencing of whole cellular DNA isolated from young leaves using the Illumina Genome Analyzer is an efficient method for high-throughput sequencing of chloroplast genome.
Survey of protein–DNA interactions in Aspergillus oryzae on a genomic scale

Science.gov (United States)

Wang, Chao; Lv, Yangyong; Wang, Bin; Yin, Chao; Lin, Ying; Pan, Li

2015-01-01

The genome-scale delineation of in vivo protein–DNA interactions is key to understanding genome function. Only ∼5% of transcription factors (TFs) in the Aspergillus genus have been identified using traditional methods. Although the Aspergillus oryzae genome contains >600 TFs, knowledge of the in vivo genome-wide TF-binding sites (TFBSs) in aspergilli remains limited because of the lack of high-quality antibodies. We investigated the landscape of in vivo protein–DNA interactions across the A. oryzae genome through coupling the DNase I digestion of intact nuclei with massively parallel sequencing and the analysis of cleavage patterns in protein–DNA interactions at single-nucleotide resolution. The resulting map identified overrepresented de novo TF-binding motifs from genomic footprints, and provided the detailed chromatin remodeling patterns and the distribution of digital footprints near transcription start sites. The TFBSs of 19 known Aspergillus TFs were also identified based on DNase I digestion data surrounding potential binding sites in conjunction with TF binding specificity information. We observed that the cleavage patterns of TFBSs were dependent on the orientation of TF motifs and independent of strand orientation, consistent with the DNA shape features of binding motifs with flanking sequences. PMID:25883143
A novel rat genomic simple repeat DNA with RNA-homology shows triplex (H-DNA)-like structure and tissue-specific RNA expression

International Nuclear Information System (INIS)

Dey, Indranil; Rath, Pramod C.

2005-01-01

Mammalian genome contains a wide variety of repetitive DNA sequences of relatively unknown function. We report a novel 227 bp simple repeat DNA (3.3 DNA) with a d {(GA) 7 A (AG) 7 } dinucleotide mirror repeat from the rat (Rattus norvegicus) genome. 3.3 DNA showed 75-85% homology with several eukaryotic mRNAs due to (GA/CU) n dinucleotide repeats by nBlast search and a dispersed distribution in the rat genome by Southern blot hybridization with [ 32 P]3.3 DNA. The d {(GA) 7 A (AG) 7 } mirror repeat formed a triplex (H-DNA)-like structure in vitro. Two large RNAs of 9.1 and 7.5 kb were detected by [ 32 P]3.3 DNA in rat brain by Northern blot hybridization indicating expression of such simple sequence repeats at RNA level in vivo. Further, several cDNAs were isolated from a rat cDNA library by [ 32 P]3.3 DNA probe. Three such cDNAs showed tissue-specific RNA expression in rat. pRT 4.1 cDNA showed strong expression of a 2.39 kb RNA in brain and spleen, pRT 5.5 cDNA showed strong expression of a 2.8 kb RNA in brain and a 3.9 kb RNA in lungs, and pRT 11.4 cDNA showed weak expression of a 2.4 kb RNA in lungs. Thus, genomic simple sequence repeats containing d (GA/CT) n dinucleotides are transcriptionally expressed and regulated in rat tissues. Such d (GA/CT) n dinucleotide repeats may form structural elements (e.g., triplex) which may be sites for functional regulation of genomic coding sequences as well as RNAs. This may be a general function of such transcriptionally active simple sequence repeats widely dispersed in mammalian genome
Dynamic DNA cytosine methylation in the Populus trichocarpa genome: tissue-level variation and relationship to gene expression

Directory of Open Access Journals (Sweden)

Vining Kelly J

2012-01-01

Full Text Available Abstract Background DNA cytosine methylation is an epigenetic modification that has been implicated in many biological processes. However, large-scale epigenomic studies have been applied to very few plant species, and variability in methylation among specialized tissues and its relationship to gene expression is poorly understood. Results We surveyed DNA methylation from seven distinct tissue types (vegetative bud, male inflorescence [catkin], female catkin, leaf, root, xylem, phloem in the reference tree species black cottonwood (Populus trichocarpa. Using 5-methyl-cytosine DNA immunoprecipitation followed by Illumina sequencing (MeDIP-seq, we mapped a total of 129,360,151 36- or 32-mer reads to the P. trichocarpa reference genome. We validated MeDIP-seq results by bisulfite sequencing, and compared methylation and gene expression using published microarray data. Qualitative DNA methylation differences among tissues were obvious on a chromosome scale. Methylated genes had lower expression than unmethylated genes, but genes with methylation in transcribed regions ("gene body methylation" had even lower expression than genes with promoter methylation. Promoter methylation was more frequent than gene body methylation in all tissues except male catkins. Male catkins differed in demethylation of particular transposable element categories, in level of gene body methylation, and in expression range of genes with methylated transcribed regions. Tissue-specific gene expression patterns were correlated with both gene body and promoter methylation. Conclusions We found striking differences among tissues in methylation, which were apparent at the chromosomal scale and when genes and transposable elements were examined. In contrast to other studies in plants, gene body methylation had a more repressive effect on transcription than promoter methylation.
DNA Breaks and End Resection Measured Genome-wide by End Sequencing.

Science.gov (United States)

Canela, Andres; Sridharan, Sriram; Sciascia, Nicholas; Tubbs, Anthony; Meltzer, Paul; Sleckman, Barry P; Nussenzweig, André

2016-09-01

DNA double-strand breaks (DSBs) arise during physiological transcription, DNA replication, and antigen receptor diversification. Mistargeting or misprocessing of DSBs can result in pathological structural variation and mutation. Here we describe a sensitive method (END-seq) to monitor DNA end resection and DSBs genome-wide at base-pair resolution in vivo. We utilized END-seq to determine the frequency and spectrum of restriction-enzyme-, zinc-finger-nuclease-, and RAG-induced DSBs. Beyond sequence preference, chromatin features dictate the repertoire of these genome-modifying enzymes. END-seq can detect at least one DSB per cell among 10,000 cells not harboring DSBs, and we estimate that up to one out of 60 cells contains off-target RAG cleavage. In addition to site-specific cleavage, we detect DSBs distributed over extended regions during immunoglobulin class-switch recombination. Thus, END-seq provides a snapshot of DNA ends genome-wide, which can be utilized for understanding genome-editing specificities and the influence of chromatin on DSB pathway choice. Published by Elsevier Inc.
Extensive Viable Middle East Respiratory Syndrome (MERS) Coronavirus Contamination in Air and Surrounding Environment in MERS Isolation Wards.

Science.gov (United States)

Kim, Sung-Han; Chang, So Young; Sung, Minki; Park, Ji Hoon; Bin Kim, Hong; Lee, Heeyoung; Choi, Jae-Phil; Choi, Won Suk; Min, Ji-Young

2016-08-01

The largest outbreak of Middle East respiratory syndrome coronavirus (MERS-CoV) outside the Middle East occurred in South Korea in 2015 and resulted in 186 laboratory-confirmed infections, including 36 (19%) deaths. Some hospitals were considered epicenters of infection and voluntarily shut down most of their operations after nearly half of all transmissions occurred in hospital settings. However, the ways that MERS-CoV is transmitted in healthcare settings are not well defined. We explored the possible contribution of contaminated hospital air and surfaces to MERS transmission by collecting air and swabbing environmental surfaces in 2 hospitals treating MERS-CoV patients. The samples were tested by viral culture with reverse transcription polymerase chain reaction (RT-PCR) and immunofluorescence assay (IFA) using MERS-CoV Spike antibody, and electron microscopy (EM). The presence of MERS-CoV was confirmed by RT-PCR of viral cultures of 4 of 7 air samples from 2 patients' rooms, 1 patient's restroom, and 1 common corridor. In addition, MERS-CoV was detected in 15 of 68 surface swabs by viral cultures. IFA on the cultures of the air and swab samples revealed the presence of MERS-CoV. EM images also revealed intact particles of MERS-CoV in viral cultures of the air and swab samples. These data provide experimental evidence for extensive viable MERS-CoV contamination of the air and surrounding materials in MERS outbreak units. Thus, our findings call for epidemiologic investigation of the possible scenarios for contact and airborne transmission, and raise concern regarding the adequacy of current infection control procedures. © The Author 2016. Published by Oxford University Press for the Infectious Diseases Society of America. All rights reserved. For permissions, e-mail journals.permissions@oup.com.
The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA

Energy Technology Data Exchange (ETDEWEB)

Smith, David R.; Lee, Robert W.; Cushman, John C.; Magnuson, Jon K.; Tran, Duc; Polle, Juergen E.

2010-05-07

Abstract Background: Dunaliella salina Teodoresco, a unicellular, halophilic green alga belonging to the Chlorophyceae, is among the most industrially important microalgae. This is because D. salina can produce massive amounts of β-carotene, which can be collected for commercial purposes, and because of its potential as a feedstock for biofuels production. Although the biochemistry and physiology of D. salina have been studied in great detail, virtually nothing is known about the genomes it carries, especially those within its mitochondrion and plastid. This study presents the complete mitochondrial and plastid genome sequences of D. salina and compares them with those of the model green algae Chlamydomonas reinhardtii and Volvox carteri. Results: The D. salina organelle genomes are large, circular-mapping molecules with ~60% noncoding DNA, placing them among the most inflated organelle DNAs sampled from the Chlorophyta. In fact, the D. salina plastid genome, at 269 kb, is the largest complete plastid DNA (ptDNA) sequence currently deposited in GenBank, and both the mitochondrial and plastid genomes have unprecedentedly high intron densities for organelle DNA: ~1.5 and ~0.4 introns per gene, respectively. Moreover, what appear to be the relics of genes, introns, and intronic open reading frames are found scattered throughout the intergenic ptDNA regions -- a trait without parallel in other characterized organelle genomes and one that gives insight into the mechanisms and modes of expansion of the D. salina ptDNA. Conclusions: These findings confirm the notion that chlamydomonadalean algae have some of the most extreme organelle genomes of all eukaryotes. They also suggest that the events giving rise to the expanded ptDNA architecture of D. salina and other Chlamydomonadales may have occurred early in the evolution of this lineage. Although interesting from a genome evolution standpoint, the D. salina organelle DNA sequences will aid in the development of a viable
The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA

Directory of Open Access Journals (Sweden)

Tran Duc

2010-05-01

Full Text Available Abstract Background Dunaliella salina Teodoresco, a unicellular, halophilic green alga belonging to the Chlorophyceae, is among the most industrially important microalgae. This is because D. salina can produce massive amounts of β-carotene, which can be collected for commercial purposes, and because of its potential as a feedstock for biofuels production. Although the biochemistry and physiology of D. salina have been studied in great detail, virtually nothing is known about the genomes it carries, especially those within its mitochondrion and plastid. This study presents the complete mitochondrial and plastid genome sequences of D. salina and compares them with those of the model green algae Chlamydomonas reinhardtii and Volvox carteri. Results The D. salina organelle genomes are large, circular-mapping molecules with ~60% noncoding DNA, placing them among the most inflated organelle DNAs sampled from the Chlorophyta. In fact, the D. salina plastid genome, at 269 kb, is the largest complete plastid DNA (ptDNA sequence currently deposited in GenBank, and both the mitochondrial and plastid genomes have unprecedentedly high intron densities for organelle DNA: ~1.5 and ~0.4 introns per gene, respectively. Moreover, what appear to be the relics of genes, introns, and intronic open reading frames are found scattered throughout the intergenic ptDNA regions -- a trait without parallel in other characterized organelle genomes and one that gives insight into the mechanisms and modes of expansion of the D. salina ptDNA. Conclusions These findings confirm the notion that chlamydomonadalean algae have some of the most extreme organelle genomes of all eukaryotes. They also suggest that the events giving rise to the expanded ptDNA architecture of D. salina and other Chlamydomonadales may have occurred early in the evolution of this lineage. Although interesting from a genome evolution standpoint, the D. salina organelle DNA sequences will aid in the
Middle East Respiratory Syndrome (MERS)

Science.gov (United States)

Middle East Respiratory Syndrome Coronavirus; MERS-CoV; Novel coronavirus; nCoV ... for Disease Control and Prevention website. Middle East Respiratory Syndrome (MERS): Frequently asked questions and answers. www. ...
High-throughput sequencing of three Lemnoideae (duckweeds chloroplast genomes from total DNA.

Directory of Open Access Journals (Sweden)

Wenqin Wang

Full Text Available BACKGROUND: Chloroplast genomes provide a wealth of information for evolutionary and population genetic studies. Chloroplasts play a particularly important role in the adaption for aquatic plants because they float on water and their major surface is exposed continuously to sunlight. The subfamily of Lemnoideae represents such a collection of aquatic species that because of photosynthesis represents one of the fastest growing plant species on earth. METHODS: We sequenced the chloroplast genomes from three different genera of Lemnoideae, Spirodela polyrhiza, Wolffiella lingulata and Wolffia australiana by high-throughput DNA sequencing of genomic DNA using the SOLiD platform. Unfractionated total DNA contains high copies of plastid DNA so that sequences from the nucleus and mitochondria can easily be filtered computationally. Remaining sequence reads were assembled into contiguous sequences (contigs using SOLiD software tools. Contigs were mapped to a reference genome of Lemna minor and gaps, selected by PCR, were sequenced on the ABI3730xl platform. CONCLUSIONS: This combinatorial approach yielded whole genomic contiguous sequences in a cost-effective manner. Over 1,000-time coverage of chloroplast from total DNA were reached by the SOLiD platform in a single spot on a quadrant slide without purification. Comparative analysis indicated that the chloroplast genome was conserved in gene number and organization with respect to the reference genome of L. minor. However, higher nucleotide substitution, abundant deletions and insertions occurred in non-coding regions of these genomes, indicating a greater genomic dynamics than expected from the comparison of other related species in the Pooideae. Noticeably, there was no transition bias over transversion in Lemnoideae. The data should have immediate applications in evolutionary biology and plant taxonomy with increased resolution and statistical power.
The clinical and virological features of the first imported case causing MERS-CoV outbreak in South Korea, 2015.

Science.gov (United States)

Lee, Ji Yeon; Kim, You-Jin; Chung, Eun Hee; Kim, Dae-Won; Jeong, Ina; Kim, Yeonjae; Yun, Mi-Ran; Kim, Sung Soon; Kim, Gayeon; Joh, Joon-Sung

2017-07-14

In 2015, the largest outbreak of Middle East respiratory syndrome coronavirus (MERS-CoV) infection outside the Middle East occurred in South Korea. We summarized the epidemiological, clinical, and laboratory findings of the first Korean case of MERS-CoV and analyzed whole-genome sequences of MERS-CoV derived from the patient. A 68-year-old man developed fever and myalgia 7 days after returning to Korea, following a 10-day trip to the Middle East. Before diagnosis, he visited 4 hospitals, potentially resulting in secondary transmission to 28 patients. On admission to the National Medical Center (day 9, post-onset of clinical illness), he presented with drowsiness, hypoxia, and multiple patchy infiltrations on the chest radiograph. He was intubated (day 12) because of progressive acute respiratory distress syndrome (ARDS) and INF-α2a and ribavirin treatment was commenced. The treatment course was prolonged by superimposed ventilator associated pneumonia. MERS-CoV PCR results converted to negative from day 47 and the patient was discharged (day 137), following rehabilitation therapy. The complete genome sequence obtained from a sputum sample (taken on day 11) showed the highest sequence similarity (99.59%) with the virus from an outbreak in Riyadh, Saudi Arabia, in February 2015. The first case of MERS-CoV infection had high transmissibility and was associated with a severe clinical course. The patient made a successful recovery after early treatment with antiviral agents and adequate supportive care. This first case in South Korea became a super-spreader because of improper infection control measures, rather than variations of the virus.
Pol II promoter prediction using characteristic 4-mer motifs: a machine learning approach

Directory of Open Access Journals (Sweden)

Shoyaib Mohammad

2008-10-01

Full Text Available Abstract Background Eukaryotic promoter prediction using computational analysis techniques is one of the most difficult jobs in computational genomics that is essential for constructing and understanding genetic regulatory networks. The increased availability of sequence data for various eukaryotic organisms in recent years has necessitated for better tools and techniques for the prediction and analysis of promoters in eukaryotic sequences. Many promoter prediction methods and tools have been developed to date but they have yet to provide acceptable predictive performance. One obvious criteria to improve on current methods is to devise a better system for selecting appropriate features of promoters that distinguish them from non-promoters. Secondly improved performance can be achieved by enhancing the predictive ability of the machine learning algorithms used. Results In this paper, a novel approach is presented in which 128 4-mer motifs in conjunction with a non-linear machine-learning algorithm utilising a Support Vector Machine (SVM are used to distinguish between promoter and non-promoter DNA sequences. By applying this approach to plant, Drosophila, human, mouse and rat sequences, the classification model has showed 7-fold cross-validation percentage accuracies of 83.81%, 94.82%, 91.25%, 90.77% and 82.35% respectively. The high sensitivity and specificity value of 0.86 and 0.90 for plant; 0.96 and 0.92 for Drosophila; 0.88 and 0.92 for human; 0.78 and 0.84 for mouse and 0.82 and 0.80 for rat demonstrate that this technique is less prone to false positive results and exhibits better performance than many other tools. Moreover, this model successfully identifies location of promoter using TATA weight matrix. Conclusion The high sensitivity and specificity indicate that 4-mer frequencies in conjunction with supervised machine-learning methods can be beneficial in the identification of RNA pol II promoters comparative to other methods. This
Evaluation of FTA ® paper for storage of oral meta-genomic DNA.

Science.gov (United States)

Foitzik, Magdalena; Stumpp, Sascha N; Grischke, Jasmin; Eberhard, Jörg; Stiesch, Meike

2014-10-01

The purpose of the present study was to evaluate the short-term storage of meta-genomic DNA from native oral biofilms on FTA(®) paper. Thirteen volunteers of both sexes received an acrylic splint for intraoral biofilm formation over a period of 48 hours. The biofilms were collected, resuspended in phosphate-buffered saline, and either stored on FTA(®) paper or directly processed by standard laboratory DNA extraction. The nucleic acid extraction efficiencies were evaluated by 16S rDNA targeted SSCP fingerprinting. The acquired banding pattern of FTA-derived meta-genomic DNA was compared to a standard DNA preparation protocol. Sensitivity and positive predictive values were calculated. The volunteers showed inter-individual differences in their bacterial species composition. A total of 200 bands were found for both methods and 85% of the banding patterns were equal, representing a sensitivity of 0.941 and a false-negative predictive value of 0.059. Meta-genomic DNA sampling, extraction, and adhesion using FTA(®) paper is a reliable method for storage of microbial DNA for a short period of time.
Genome-wide association between DNA methylation and alternative splicing in an invertebrate

Directory of Open Access Journals (Sweden)

Flores Kevin

2012-09-01

Full Text Available Abstract Background Gene bodies are the most evolutionarily conserved targets of DNA methylation in eukaryotes. However, the regulatory functions of gene body DNA methylation remain largely unknown. DNA methylation in insects appears to be primarily confined to exons. Two recent studies in Apis mellifera (honeybee and Nasonia vitripennis (jewel wasp analyzed transcription and DNA methylation data for one gene in each species to demonstrate that exon-specific DNA methylation may be associated with alternative splicing events. In this study we investigated the relationship between DNA methylation, alternative splicing, and cross-species gene conservation on a genome-wide scale using genome-wide transcription and DNA methylation data. Results We generated RNA deep sequencing data (RNA-seq to measure genome-wide mRNA expression at the exon- and gene-level. We produced a de novo transcriptome from this RNA-seq data and computationally predicted splice variants for the honeybee genome. We found that exons that are included in transcription are higher methylated than exons that are skipped during transcription. We detected enrichment for alternative splicing among methylated genes compared to unmethylated genes using fisher’s exact test. We performed a statistical analysis to reveal that the presence of DNA methylation or alternative splicing are both factors associated with a longer gene length and a greater number of exons in genes. In concordance with this observation, a conservation analysis using BLAST revealed that each of these factors is also associated with higher cross-species gene conservation. Conclusions This study constitutes the first genome-wide analysis exhibiting a positive relationship between exon-level DNA methylation and mRNA expression in the honeybee. Our finding that methylated genes are enriched for alternative splicing suggests that, in invertebrates, exon-level DNA methylation may play a role in the construction of splice
[Whole Genome Sequencing of Human mtDNA Based on Ion Torrent PGM™ Platform].

Science.gov (United States)

Cao, Y; Zou, K N; Huang, J P; Ma, K; Ping, Y

2017-08-01

To analyze and detect the whole genome sequence of human mitochondrial DNA （mtDNA） by Ion Torrent PGM™ platform and to study the differences of mtDNA sequence in different tissues. Samples were collected from 6 unrelated individuals by forensic postmortem examination, including chest blood, hair, costicartilage, nail, skeletal muscle and oral epithelium. Amplification of whole genome sequence of mtDNA was performed by 4 pairs of primer. Libraries were constructed with Ion Shear™ Plus Reagents kit and Ion Plus Fragment Library kit. Whole genome sequencing of mtDNA was performed using Ion Torrent PGM™ platform. Sanger sequencing was used to determine the heteroplasmy positions and the mutation positions on HVⅠ region. The whole genome sequence of mtDNA from all samples were amplified successfully. Six unrelated individuals belonged to 6 different haplotypes. Different tissues in one individual had heteroplasmy difference. The heteroplasmy positions and the mutation positions on HVⅠ region were verified by Sanger sequencing. After a consistency check by the Kappa method, it was found that the results of mtDNA sequence had a high consistency in different tissues. The testing method used in present study for sequencing the whole genome sequence of human mtDNA can detect the heteroplasmy difference in different tissues, which have good consistency. The results provide guidance for the further applications of mtDNA in forensic science. Copyright© by the Editorial Department of Journal of Forensic Medicine

Initiation of lambda DNA replication. The Escherichia coli small heat shock proteins, DnaJ and GrpE, increase DnaK's affinity for the lambda P protein.

Science.gov (United States)

Osipiuk, J; Georgopoulos, C; Zylicz, M

1993-03-05

It is known that the initiation of bacteriophage lambda replication requires the orderly assembly of the lambda O.lambda P.DnaB helicase protein preprimosomal complex at the ori lambda DNA site. The DnaK, DnaJ, and GrpE heat shock proteins act together to destabilize the lambda P.DnaB complex, thus freeing DnaB and allowing it to unwind lambda DNA near the ori lambda site. The first step of this disassembly reaction is the binding of DnaK to the lambda P protein. In this report, we examined the influence of the DnaJ and GrpE proteins on the stability of the lambda P.DnaK complex. We present evidence for the existence of the following protein-protein complexes: lambda P.DnaK, lambda P.DnaJ, DnaJ.DnaK, DnaK.GrpE, and lambda P.DnaK.GrpE. Our results suggest that the presence of GrpE alone destabilizes the lambda P.DnaK complex, whereas the presence of DnaJ alone stabilizes the lambda P.DnaK complex. Using immunoprecipitation, we show that in the presence of GrpE, DnaK exhibits a higher affinity for the lambda P.DnaJ complex than it does alone. Using cross-linking with glutaraldehyde, we show that oligomeric forms of DnaK exhibit a higher affinity for lambda P than monomeric DnaK. However, in the presence of GrpE, monomeric DnaK can efficiently bind lambda P protein. These findings help explain our previous results, namely that in the GrpE-dependent lambda DNA replication system, the DnaK protein requirement can be reduced up to 10-fold.
Isolation of Retroelement from Plant Genomic DNA

OpenAIRE

sprotocols

2014-01-01

Author: Pat Heslop-Harrison ### Abstract: Retroelements and their derivatives are an ubiquitous and abundant component of plant genomes. From the 1990s, PCR based techniques have been developed to isolate the elements from genomic DNA of different plants, and the methods and primers used are presented here. Major classes of retroelements include the Ty1-copia, the Ty3-gypsy and the LINE (non-LTR) groups. Mixed PCR products representing the full heterogeneous pool of retrotransposo...
Genome-wide identification of specific oligonucleotides using artificial neural network and computational genomic analysis

Directory of Open Access Journals (Sweden)

Chen Jiun-Ching

2007-05-01

Full Text Available Abstract Background Genome-wide identification of specific oligonucleotides (oligos is a computationally-intensive task and is a requirement for designing microarray probes, primers, and siRNAs. An artificial neural network (ANN is a machine learning technique that can effectively process complex and high noise data. Here, ANNs are applied to process the unique subsequence distribution for prediction of specific oligos. Results We present a novel and efficient algorithm, named the integration of ANN and BLAST (IAB algorithm, to identify specific oligos. We establish the unique marker database for human and rat gene index databases using the hash table algorithm. We then create the input vectors, via the unique marker database, to train and test the ANN. The trained ANN predicted the specific oligos with high efficiency, and these oligos were subsequently verified by BLAST. To improve the prediction performance, the ANN over-fitting issue was avoided by early stopping with the best observed error and a k-fold validation was also applied. The performance of the IAB algorithm was about 5.2, 7.1, and 6.7 times faster than the BLAST search without ANN for experimental results of 70-mer, 50-mer, and 25-mer specific oligos, respectively. In addition, the results of polymerase chain reactions showed that the primers predicted by the IAB algorithm could specifically amplify the corresponding genes. The IAB algorithm has been integrated into a previously published comprehensive web server to support microarray analysis and genome-wide iterative enrichment analysis, through which users can identify a group of desired genes and then discover the specific oligos of these genes. Conclusion The IAB algorithm has been developed to construct SpecificDB, a web server that provides a specific and valid oligo database of the probe, siRNA, and primer design for the human genome. We also demonstrate the ability of the IAB algorithm to predict specific oligos through
Comparison of protocols for genomic DNA extraction from 'velame ...

African Journals Online (AJOL)

usuario

2013-07-24

Jul 24, 2013 ... involving C. linearifolius, we compared the efficiency of six protocols for genomic DNA extraction previously ... phytic, with diverse aspect and floristics, average rainfall between ..... The variation observed for DNA concentrations estimated with .... performed with protocol 1 (data not shown), or still, bands.
Studies on the effects of persistent RNA priming on DNA replication and genomic stability

OpenAIRE

Stuckey, Ruth

2014-01-01

[EN]: DNA replication and transcription take place on the same DNA template, and the correct interplay between these processes ensures faithful genome duplication. DNA replication must be highly coordinated with other cell cycle events, such as segregation of fully replicated DNA in order to maintain genomic integrity. Transcription generates RNA:DNA hybrids, transient intermediate structures that are degraded by the ribonuclease H (RNaseH) class of enzymes. RNA:DNA hybrids can form R-loops, ...
Pairagon: a highly accurate, HMM-based cDNA-to-genome aligner

DEFF Research Database (Denmark)

Lu, David V; Brown, Randall H; Arumugam, Manimozhiyan

2009-01-01

MOTIVATION: The most accurate way to determine the intron-exon structures in a genome is to align spliced cDNA sequences to the genome. Thus, cDNA-to-genome alignment programs are a key component of most annotation pipelines. The scoring system used to choose the best alignment is a primary...... determinant of alignment accuracy, while heuristics that prevent consideration of certain alignments are a primary determinant of runtime and memory usage. Both accuracy and speed are important considerations in choosing an alignment algorithm, but scoring systems have received much less attention than...
DNA Oncogenic Virus-Induced Oxidative Stress, Genomic Damage, and Aberrant Epigenetic Alterations

Directory of Open Access Journals (Sweden)

Mankgopo Magdeline Kgatle

2017-01-01

Full Text Available Approximately 20% of human cancers is attributable to DNA oncogenic viruses such as human papillomavirus (HPV, hepatitis B virus (HBV, and Epstein-Barr virus (EBV. Unrepaired DNA damage is the most common and overlapping feature of these DNA oncogenic viruses and a source of genomic instability and tumour development. Sustained DNA damage results from unceasing production of reactive oxygen species and activation of inflammasome cascades that trigger genomic changes and increased propensity of epigenetic alterations. Accumulation of epigenetic alterations may interfere with genome-wide cellular signalling machineries and promote malignant transformation leading to cancer development. Untangling and understanding the underlying mechanisms that promote these detrimental effects remain the major objectives for ongoing research and hope for effective virus-induced cancer therapy. Here, we review current literature with an emphasis on how DNA damage influences HPV, HVB, and EBV replication and epigenetic alterations that are associated with carcinogenesis.
Our love-hate relationship with DNA barcodes, the Y2K problem, and the search for next generation barcodes

Directory of Open Access Journals (Sweden)

Jeffrey M. Marcus

2018-01-01

Full Text Available DNA barcodes are very useful for species identification especially when identification by traditional morphological characters is difficult. However, the short mitochondrial and chloroplast barcodes currently in use often fail to distinguish between closely related species, are prone to lateral transfer, and provide inadequate phylogenetic resolution, particularly at deeper nodes. The deficiencies of short barcode identifiers are similar to the deficiencies of the short year identifiers that caused the Y2K problem in computer science. The resolution of the Y2K problem was to increase the size of the year identifiers. The performance of conventional mitochondrial COI barcodes for phylogenetics was compared with the performance of complete mitochondrial genomes and nuclear ribosomal RNA repeats obtained by genome skimming for a set of caddisfly taxa (Insect Order Trichoptera. The analysis focused on Trichoptera Family Hydropsychidae, the net-spinning caddisflies, which demonstrates many of the frustrating limitations of current barcodes. To conduct phylogenetic comparisons, complete mitochondrial genomes (15 kb each and nuclear ribosomal repeats (9 kb each from six caddisfly species were sequenced, assembled, and are reported for the first time. These sequences were analyzed in comparison with eight previously published trichopteran mitochondrial genomes and two triochopteran rRNA repeats, plus outgroup sequences from sister clade Lepidoptera (butterflies and moths. COI trees were not well-resolved, had low bootstrap support, and differed in topology from prior phylogenetic analyses of the Trichoptera. Phylogenetic trees based on mitochondrial genomes or rRNA repeats were well-resolved with high bootstrap support and were largely congruent with each other. Because they are easily sequenced by genome skimming, provide robust phylogenetic resolution at various phylogenetic depths, can better distinguish between closely related species, and (in the
An automated annotation tool for genomic DNA sequences using

Indian Academy of Sciences (India)

Genomic sequence data are often available well before the annotated sequence is published. We present a method for analysis of genomic DNA to identify coding sequences using the GeneScan algorithm and characterize these resultant sequences by BLAST. The routines are used to develop a system for automated ...
The DNA-encoded nucleosome organization of a eukaryotic genome.

Science.gov (United States)

Kaplan, Noam; Moore, Irene K; Fondufe-Mittendorf, Yvonne; Gossett, Andrea J; Tillo, Desiree; Field, Yair; LeProust, Emily M; Hughes, Timothy R; Lieb, Jason D; Widom, Jonathan; Segal, Eran

2009-03-19

Nucleosome organization is critical for gene regulation. In living cells this organization is determined by multiple factors, including the action of chromatin remodellers, competition with site-specific DNA-binding proteins, and the DNA sequence preferences of the nucleosomes themselves. However, it has been difficult to estimate the relative importance of each of these mechanisms in vivo, because in vivo nucleosome maps reflect the combined action of all influencing factors. Here we determine the importance of nucleosome DNA sequence preferences experimentally by measuring the genome-wide occupancy of nucleosomes assembled on purified yeast genomic DNA. The resulting map, in which nucleosome occupancy is governed only by the intrinsic sequence preferences of nucleosomes, is similar to in vivo nucleosome maps generated in three different growth conditions. In vitro, nucleosome depletion is evident at many transcription factor binding sites and around gene start and end sites, indicating that nucleosome depletion at these sites in vivo is partly encoded in the genome. We confirm these results with a micrococcal nuclease-independent experiment that measures the relative affinity of nucleosomes for approximately 40,000 double-stranded 150-base-pair oligonucleotides. Using our in vitro data, we devise a computational model of nucleosome sequence preferences that is significantly correlated with in vivo nucleosome occupancy in Caenorhabditis elegans. Our results indicate that the intrinsic DNA sequence preferences of nucleosomes have a central role in determining the organization of nucleosomes in vivo.
Relationships between 16S-23S rRNA gene internal transcribed spacer DNA and genomic DNA similarities in the taxonomy of phototrophic bacteria

International Nuclear Information System (INIS)

Okamura, K; Hisada, T; Takata, K; Hiraishi, A

2013-01-01

Rapid and accurate identification of microbial species is essential task in microbiology and biotechnology. In prokaryotic systematics, genomic DNA-DNA hybridization is the ultimate tool to determine genetic relationships among bacterial strains at the species level. However, a practical problem in this assay is that the experimental procedure is laborious and time-consuming. In recent years, information on the 16S-23S rRNA gene internal transcribed spacer (ITS) region has been used to classify bacterial strains at the species and intraspecies levels. It is unclear how much information on the ITS region can reflect the genome that contain it. In this study, therefore, we evaluate the quantitative relationship between ITS DNA and entire genomic DNA similarities. For this, we determined ITS sequences of several species of anoxygenic phototrophic bacteria belonging to the order Rhizobiales, and compared with DNA-DNA relatedness among these species. There was a high correlation between the two genetic markers. Based on the regression analysis of this relationship, 70% DNA-DNA relatedness corresponded to 92% ITS sequence similarity. This suggests the usefulness of the ITS sequence similarity as a criterion for determining the genospecies of the phototrophic bacteria. To avoid the effects of polymorphism bias of ITS on similarities, PCR products from all loci of ITS were used directly as genetic probes for comparison. The results of ITS DNA-DNA hybridization coincided well with those of genomic DNA-DNA relatedness. These collective data indicate that the whole ITS DNA-DNA similarity can be used as an alternative to genomic DNA-DNA similarity.
Resurrection of DNA function in vivo from an extinct genome.

Directory of Open Access Journals (Sweden)

Andrew J Pask

2008-05-01

Full Text Available There is a burgeoning repository of information available from ancient DNA that can be used to understand how genomes have evolved and to determine the genetic features that defined a particular species. To assess the functional consequences of changes to a genome, a variety of methods are needed to examine extinct DNA function. We isolated a transcriptional enhancer element from the genome of an extinct marsupial, the Tasmanian tiger (Thylacinus cynocephalus or thylacine, obtained from 100 year-old ethanol-fixed tissues from museum collections. We then examined the function of the enhancer in vivo. Using a transgenic approach, it was possible to resurrect DNA function in transgenic mice. The results demonstrate that the thylacine Col2A1 enhancer directed chondrocyte-specific expression in this extinct mammalian species in the same way as its orthologue does in mice. While other studies have examined extinct coding DNA function in vitro, this is the first example of the restoration of extinct non-coding DNA and examination of its function in vivo. Our method using transgenesis can be used to explore the function of regulatory and protein-coding sequences obtained from any extinct species in an in vivo model system, providing important insights into gene evolution and diversity.
Discovery of cyanophage genomes which contain mitochondrial DNA polymerase.

Science.gov (United States)

Chan, Yi-Wah; Mohr, Remus; Millard, Andrew D; Holmes, Antony B; Larkum, Anthony W; Whitworth, Anna L; Mann, Nicholas H; Scanlan, David J; Hess, Wolfgang R; Clokie, Martha R J

2011-08-01

DNA polymerase γ is a family A DNA polymerase responsible for the replication of mitochondrial DNA in eukaryotes. The origins of DNA polymerase γ have remained elusive because it is not present in any known bacterium, though it has been hypothesized that mitochondria may have inherited the enzyme by phage-mediated nonorthologous displacement. Here, we present an analysis of two full-length homologues of this gene, which were found in the genomes of two bacteriophages, which infect the chlorophyll-d containing cyanobacterium Acaryochloris marina. Phylogenetic analyses of these phage DNA polymerase γ proteins show that they branch deeply within the DNA polymerase γ clade and therefore share a common origin with their eukaryotic homologues. We also found homologues of these phage polymerases in the environmental Community Cyberinfrastructure for Advanced Microbial Ecology Research and Analysis (CAMERA) database, which fell in the same clade. An analysis of the CAMERA assemblies containing the environmental homologues together with the filter fraction metadata indicated some of these assemblies may be of bacterial origin. We also show that the phage-encoded DNA polymerase γ is highly transcribed as the phage genomes are replicated. These findings provide data that may assist in reconstructing the evolution of mitochondria.
Optical Characterization of Oligonucleotide DNA Influenced by Magnetic Fields

Directory of Open Access Journals (Sweden)

Seyedeh Maryam Banihashemian

2013-09-01

Full Text Available UV-VIS spectroscopic analysis of oligonucleotide DNA exposed to different magnetic fields was performed in order to investigate the relationship between DNA extinction coefficients and optical parameters according to magnetic-field strength. The results with the oligonucleotides adenine-thymine 100 mer (AT-100 DNA and cytosine-guanine 100 mer (CG-100 DNA indicate that the magnetic field influences DNA molar extinction coefficients and refractive indexes. The imaginary parts of the refractive index and molar extinction coefficients of the AT-100 and CG-100 DNA decreased after exposure to a magnetic field of 750 mT due to cleavage of the DNA oligonucleotides into smaller segments.
Suicidal function of DNA methylation in age-related genome disintegration.

Science.gov (United States)

Mazin, Alexander L

2009-10-01

This article is dedicated to the 60th anniversary of 5-methylcytosine discovery in DNA. Cytosine methylation can affect genetic and epigenetic processes, works as a part of the genome-defense system and has mutagenic activity; however, the biological functions of this enzymatic modification are not well understood. This review will put forward the hypothesis that the host-defense role of DNA methylation in silencing and mutational destroying of retroviruses and other intragenomic parasites was extended during evolution to most host genes that have to be inactivated in differentiated somatic cells, where it acquired a new function in age-related self-destruction of the genome. The proposed model considers DNA methylation as the generator of 5mC>T transitions that induce 40-70% of all spontaneous somatic mutations of the multiple classes at CpG and CpNpG sites and flanking nucleotides in the p53, FIX, hprt, gpt human genes and some transgenes. The accumulation of 5mC-dependent mutations explains: global changes in the structure of the vertebrate genome throughout evolution; the loss of most 5mC from the DNA of various species over their lifespan and the Hayflick limit of normal cells; the polymorphism of methylation sites, including asymmetric mCpNpN sites; cyclical changes of methylation and demethylation in genes. The suicidal function of methylation may be a special genetic mechanism for increasing DNA damage and the programmed genome disintegration responsible for cell apoptosis and organism aging and death.
Critical threshold levels of DNA methyltransferase 1 are required to maintain DNA methylation across the genome in human cancer cells.

Science.gov (United States)

Cai, Yi; Tsai, Hsing-Chen; Yen, Ray-Whay Chiu; Zhang, Yang W; Kong, Xiangqian; Wang, Wei; Xia, Limin; Baylin, Stephen B

2017-04-01

Reversing DNA methylation abnormalities and associated gene silencing, through inhibiting DNA methyltransferases (DNMTs) is an important potential cancer therapy paradigm. Maximizing this potential requires defining precisely how these enzymes maintain genome-wide, cancer-specific DNA methylation. To date, there is incomplete understanding of precisely how the three DNMTs, 1, 3A, and 3B, interact for maintaining DNA methylation abnormalities in cancer. By combining genetic and shRNA depletion strategies, we define not only a dominant role for DNA methyltransferase 1 (DNMT1) but also distinct roles of 3A and 3B in genome-wide DNA methylation maintenance. Lowering DNMT1 below a threshold level is required for maximal loss of DNA methylation at all genomic regions, including gene body and enhancer regions, and for maximally reversing abnormal promoter DNA hypermethylation and associated gene silencing to reexpress key genes. It is difficult to reach this threshold with patient-tolerable doses of current DNMT inhibitors (DNMTIs). We show that new approaches, like decreasing the DNMT targeting protein, UHRF1, can augment the DNA demethylation capacities of existing DNA methylation inhibitors for fully realizing their therapeutic potential. © 2017 Cai et al.; Published by Cold Spring Harbor Laboratory Press.
Genomic DNA extraction protocols from ovine hair

Directory of Open Access Journals (Sweden)

Jennifer Nonato da Silva Prate

2013-12-01

Full Text Available Genomic DNA extracted from animal cells can be used for several purposes, for example, to know genetic variability and genetic relationships between individuals, breeds and/or species, paternity tests, to describe the genetic profile for registration of the animal at association of breeders, detect genetic polymorphisms (SNP related to characteristics of commercial interest, disease diagnose, assess resistance or susceptibility to pathogens, etc. For such evaluations, in general, DNA is amplified by PCR (polymerase chain reaction, and then subjected to various techniques as RFLP (restriction fragments length polymorphism, SSCP (single strand conformation polymorphism, and sequencing. The DNA may be obtained from blood, buccal swabs, meat, cartilage or hair bulb. Among all, the last biological material has been preferred by farmers for its ease acquisition. Several methods for extracting DNA from hair bulb were reported without any consensus for its implementation. This study aimed to optimize a protocol for efficient DNA extraction for use in PCR-RFLP analysis of the Prion gene. For this study, were collected hair samples containing hair bulb from 131 Santa Inês sheep belonging to the Institute of Zootechny, Nova Odessa - SP. Two DNA extraction protocols were evaluated. The first, called phenol-chloroform-isoamyl alcohol (PCIA has long been used by Animal Genetic Laboratories, whose procedures are described below: in each microtube (1.5 mL containing 500 µL of TE-Tween solution (Tris-HCl 50 mM, EDTA 1 mM and 0.5% Tween 20 were added to approximately 30 hair bulb per animal which was incubated at 65°C with shaking at 170 rpm for 2 hours. Then was added 15 µL of proteinase K [10 mg mL-1] and incubated at 55°C at 170 rpm for 6-12 hours. At the end of digestion was added 1 volume of solution phenol-chloroform-isoamyl alcohol (25:24:1 followed by vigorous shaking for 10 seconds and centrifuged at 8000 rpm and 4°C for 10 minutes. The upper phase
Evaluating Digital PCR for the Quantification of Human Genomic DNA: Accessible Amplifiable Targets.

Science.gov (United States)

Kline, Margaret C; Romsos, Erica L; Duewer, David L

2016-02-16

Polymerase chain reaction (PCR) multiplexed assays perform best when the input quantity of template DNA is controlled to within about a factor of √2. To help ensure that PCR assays yield consistent results over time and place, results from methods used to determine DNA quantity need to be metrologically traceable to a common reference. Many DNA quantitation systems can be accurately calibrated with solutions of DNA in aqueous buffer. Since they do not require external calibration, end-point limiting dilution technologies, collectively termed "digital PCR (dPCR)", have been proposed as suitable for value assigning such DNA calibrants. The performance characteristics of several commercially available dPCR systems have recently been documented using plasmid, viral, or fragmented genomic DNA; dPCR performance with more complex materials, such as human genomic DNA, has been less studied. With the goal of providing a human genomic reference material traceably certified for mass concentration, we are investigating the measurement characteristics of several dPCR systems. We here report results of measurements from multiple PCR assays, on four human genomic DNAs treated with four endonuclease restriction enzymes using both chamber and droplet dPCR platforms. We conclude that dPCR does not estimate the absolute number of PCR targets in a given volume but rather the number of accessible and amplifiable targets. While enzymatic restriction of human genomic DNA increases accessibility for some assays, in well-optimized PCR assays it can reduce the number of amplifiable targets and increase assay variability relative to uncut sample.
Effect of nickel chloride on Arabidopsis genomic DNA and methylation of 18S rDNA

Directory of Open Access Journals (Sweden)

Zhongai Li

2015-01-01

Conclusions: NiCl2 application caused variation of DNA methylation of the Arabidopsis genomic and offspring's. NiCl2 also resulted in nucleolar injury and deformity of root tip cells. The methylation rate of 18S rDNA also changed by adding NiCl2.
cDNA structure, genomic organization and expression patterns of ...

African Journals Online (AJOL)

Visfatin was a newly identified adipocytokine, which was involved in various physiologic and pathologic processes of organisms. The cDNA structure, genomic organization and expression patterns of silver Prussian carp visfatin were described in this report. The silver Prussian carp visfatin cDNA cloned from the liver was ...

α/sub i/-3 cDNA encodes the α subunit of G/sub k/, the stimulatory G protein of receptor-regulated K+ channels

International Nuclear Information System (INIS)

Codina, J.; Olate, J.; Abramowitz, J.; Mattera, R.; Cook, R.G.; Birnbaumer, L.

1988-01-01

cDNA cloning has identified the presence in the human genome of three genes encoding α subunits of pertussis toxin substrates, generically called G/sub i/. They are named α/sub i/-1, α/sub i/-2 and α/sub i/-3. However, none of these genes has been functionally identified with any of the α subunits of several possible G proteins, including pertussis toxin-sensitive G/sub p/'s, stimulatory to phospholipase C or A 2 , G/sub i/, inhibitory to adenylyl cyclase, or G/sub k/, stimulatory to a type of K + channels. The authors now report the nucleotide sequence and the complete predicted amino acid sequence of human liver α/sub i/-3 and the partial amino acid sequence of proteolytic fragments of the α subunit of human erythrocyte G/sub k/. The amino acid sequence of the proteolytic fragment is uniquely encoded by the cDNA of α/sub i/-3, thus identifying it as α/sub k/. The probable identity of α/sub i/-1 with α/sub p/ and possible roles for α/sub i/-2, as well as additional roles for α/sub i/-1 and α/sub i/-3 (α/sub k/) are discussed
An alternative method for cDNA cloning from surrogate eukaryotic cells transfected with the corresponding genomic DNA.

Science.gov (United States)

Hu, Lin-Yong; Cui, Chen-Chen; Song, Yu-Jie; Wang, Xiang-Guo; Jin, Ya-Ping; Wang, Ai-Hua; Zhang, Yong

2012-07-01

cDNA is widely used in gene function elucidation and/or transgenics research but often suitable tissues or cells from which to isolate mRNA for reverse transcription are unavailable. Here, an alternative method for cDNA cloning is described and tested by cloning the cDNA of human LALBA (human alpha-lactalbumin) from genomic DNA. First, genomic DNA containing all of the coding exons was cloned from human peripheral blood and inserted into a eukaryotic expression vector. Next, by delivering the plasmids into either 293T or fibroblast cells, surrogate cells were constructed. Finally, the total RNA was extracted from the surrogate cells and cDNA was obtained by RT-PCR. The human LALBA cDNA that was obtained was compared with the corresponding mRNA published in GenBank. The comparison showed that the two sequences were identical. The novel method for cDNA cloning from surrogate eukaryotic cells described here uses well-established techniques that are feasible and simple to use. We anticipate that this alternative method will have widespread applications.
GI-SVM: A sensitive method for predicting genomic islands based on unannotated sequence of a single genome.

Science.gov (United States)

Lu, Bingxin; Leong, Hon Wai

2016-02-01

Genomic islands (GIs) are clusters of functionally related genes acquired by lateral genetic transfer (LGT), and they are present in many bacterial genomes. GIs are extremely important for bacterial research, because they not only promote genome evolution but also contain genes that enhance adaption and enable antibiotic resistance. Many methods have been proposed to predict GI. But most of them rely on either annotations or comparisons with other closely related genomes. Hence these methods cannot be easily applied to new genomes. As the number of newly sequenced bacterial genomes rapidly increases, there is a need for methods to detect GI based solely on sequences of a single genome. In this paper, we propose a novel method, GI-SVM, to predict GIs given only the unannotated genome sequence. GI-SVM is based on one-class support vector machine (SVM), utilizing composition bias in terms of k-mer content. From our evaluations on three real genomes, GI-SVM can achieve higher recall compared with current methods, without much loss of precision. Besides, GI-SVM allows flexible parameter tuning to get optimal results for each genome. In short, GI-SVM provides a more sensitive method for researchers interested in a first-pass detection of GI in newly sequenced genomes.
Ole Rømers mange talenter

DEFF Research Database (Denmark)

Pedersen, Kurt Møller

2005-01-01

Artiklen giver en oversigt over Ole Rømers videnskabelige liv som astronom, fysiker og matematiker samt hans betydning som kongelig embedsmand.......Artiklen giver en oversigt over Ole Rømers videnskabelige liv som astronom, fysiker og matematiker samt hans betydning som kongelig embedsmand....
Whole-genome methylation caller designed for methyl- DNA ...

African Journals Online (AJOL)

etchie

2013-02-20

Feb 20, 2013 ... Our method uses a single-CpG-resolution, whole-genome methylation ... Key words: Methyl-DNA immunoprecipitation, next-generation sequencing, ...... methylation is prevalent in embryonic stem cells andmaybe mediated.
Rapid and reliable extraction of genomic DNA from various wild-type and transgenic plants

Directory of Open Access Journals (Sweden)

Yang Moon-Sik

2004-09-01

Full Text Available Abstract Background DNA extraction methods for PCR-quality DNA from calluses and plants are not time efficient, since they require that the tissues be ground in liquid nitrogen, followed by precipitation of the DNA pellet in ethanol, washing and drying the pellet, etc. The need for a rapid and simple procedure is urgent, especially when hundreds of samples need to be analyzed. Here, we describe a simple and efficient method of isolating high-quality genomic DNA for PCR amplification and enzyme digestion from calluses, various wild-type and transgenic plants. Results We developed new rapid and reliable genomic DNA extraction method. With our developed method, plant genomic DNA extraction could be performed within 30 min. The method was as follows. Plant tissue was homogenized with salt DNA extraction buffer using hand-operated homogenizer and extracted by phenol:chloroform:isoamyl alcohol (25:24:1. After centrifugation, the supernatant was directly used for DNA template for PCR, resulting in successful amplification for RAPD from various sources of plants and specific foreign genes from transgenic plants. After precipitating the supernatant, the DNA was completely digested by restriction enzymes. Conclusion This DNA extraction procedure promises simplicity, speed, and efficiency, both in terms of time and the amount of plant sample required. In addition, this method does not require expensive facilities for plant genomic DNA extraction.
The prevalence of Middle East respiratory Syndrome coronavirus (MERS-CoV) infection in livestock and temporal relation to locations and seasons.

Science.gov (United States)

Kasem, Samy; Qasim, Ibrahim; Al-Doweriej, Ali; Hashim, Osman; Alkarar, Ali; Abu-Obeida, Ali; Saleh, Mohamed; Al-Hofufi, Ali; Al-Ghadier, Hussein; Hussien, Raed; Al-Sahaf, Ali; Bayoumi, Faisal; Magouz, Asmaa

2018-01-29

The Middle East respiratory syndrome (MERS) has been reported for the first time infecting a human being since 2012. The WHO was notified of 27 countries have reported cases of MERS, the majority of these cases occur in the Arabian Peninsula, particularly in Saudi Arabia. Dromedary camels are likely to be the main source of Middle East respiratory syndrome virus (MERS-CoV) infection in humans. MERS-CoV infection rates among camels in livestock markets and slaughterhouses were investigated in Saudi Arabia. A total of 698 nasal swabs were collected and examined with Rapid assay and rtRT-PCR. Ten MERS-CoV positive samples were subjected to full genomic sequencing. In addition, the sensitivity and specificity of the Rapid immunochromatographic assay (BioNote, South Korea) was evaluated as a diagnostic tool for MERS-CoV compared to rtRT-PCR. The results showed a high percentage of dromedaries (56.4%) had evidence for nasal MERS-CoV infection. Phylogenetic analysis of the ten MERS-CoV isolates showed that the sequences were closely related to the other MERS-CoV strains recovered from camels and human cases. Moreover, the results showed that 195 samples were positive for MERS-CoV by rapid assay compared to 394 positive samples of rtRT-PCR, which showed low rapid assay sensitivity (49.49%) while, the specificity were found to be 100%. These findings indicate that these sites are a highly-hazardous to zoonotic diseases. Copyright © 2018 The Authors. Published by Elsevier Ltd.. All rights reserved.
Continued colonization of the human genome by mitochondrial DNA.

Directory of Open Access Journals (Sweden)

Miria Ricchetti

2004-09-01

Full Text Available Integration of mitochondrial DNA fragments into nuclear chromosomes (giving rise to nuclear DNA sequences of mitochondrial origin, or NUMTs is an ongoing process that shapes nuclear genomes. In yeast this process depends on double-strand-break repair. Since NUMTs lack amplification and specific integration mechanisms, they represent the prototype of exogenous insertions in the nucleus. From sequence analysis of the genome of Homo sapiens, followed by sampling humans from different ethnic backgrounds, and chimpanzees, we have identified 27 NUMTs that are specific to humans and must have colonized human chromosomes in the last 4-6 million years. Thus, we measured the fixation rate of NUMTs in the human genome. Six such NUMTs show insertion polymorphism and provide a useful set of DNA markers for human population genetics. We also found that during recent human evolution, Chromosomes 18 and Y have been more susceptible to colonization by NUMTs. Surprisingly, 23 out of 27 human-specific NUMTs are inserted in known or predicted genes, mainly in introns. Some individuals carry a NUMT insertion in a tumor-suppressor gene and in a putative angiogenesis inhibitor. Therefore in humans, but not in yeast, NUMT integrations preferentially target coding or regulatory sequences. This is indeed the case for novel insertions associated with human diseases and those driven by environmental insults. We thus propose a mutagenic phenomenon that may be responsible for a variety of genetic diseases in humans and suggest that genetic or environmental factors that increase the frequency of chromosome breaks provide the impetus for the continued colonization of the human genome by mitochondrial DNA.
Genome dynamics of short oligonucleotides: the example of bacterial DNA uptake enhancing sequences.

Directory of Open Access Journals (Sweden)

Mohammed Bakkali

Full Text Available Among the many bacteria naturally competent for transformation by DNA uptake-a phenomenon with significant clinical and financial implications- Pasteurellaceae and Neisseriaceae species preferentially take up DNA containing specific short sequences. The genomic overrepresentation of these DNA uptake enhancing sequences (DUES causes preferential uptake of conspecific DNA, but the function(s behind this overrepresentation and its evolution are still a matter for discovery. Here I analyze DUES genome dynamics and evolution and test the validity of the results to other selectively constrained oligonucleotides. I use statistical methods and computer simulations to examine DUESs accumulation in Haemophilus influenzae and Neisseria gonorrhoeae genomes. I analyze DUESs sequence and nucleotide frequencies, as well as those of all their mismatched forms, and prove the dependence of DUESs genomic overrepresentation on their preferential uptake by quantifying and correlating both characteristics. I then argue that mutation, uptake bias, and weak selection against DUESs in less constrained parts of the genome combined are sufficient enough to cause DUESs accumulation in susceptible parts of the genome with no need for other DUES function. The distribution of overrepresentation values across sequences with different mismatch loads compared to the DUES suggests a gradual yet not linear molecular drive of DNA sequences depending on their similarity to the DUES. Other genomically overrepresented sequences, both pro- and eukaryotic, show similar distribution of frequencies suggesting that the molecular drive reported above applies to other frequent oligonucleotides. Rare oligonucleotides, however, seem to be gradually drawn to genomic underrepresentation, thus, suggesting a molecular drag. To my knowledge this work provides the first clear evidence of the gradual evolution of selectively constrained oligonucleotides, including repeated, palindromic and protein
Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

Science.gov (United States)

Christen, Matthias; Del Medico, Luca; Christen, Heinz; Christen, Beat

2017-01-01

Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.
Genome Partitioner: A web tool for multi-level partitioning of large-scale DNA constructs for synthetic biology applications.

Directory of Open Access Journals (Sweden)

Matthias Christen

Full Text Available Recent advances in lower-cost DNA synthesis techniques have enabled new innovations in the field of synthetic biology. Still, efficient design and higher-order assembly of genome-scale DNA constructs remains a labor-intensive process. Given the complexity, computer assisted design tools that fragment large DNA sequences into fabricable DNA blocks are needed to pave the way towards streamlined assembly of biological systems. Here, we present the Genome Partitioner software implemented as a web-based interface that permits multi-level partitioning of genome-scale DNA designs. Without the need for specialized computing skills, biologists can submit their DNA designs to a fully automated pipeline that generates the optimal retrosynthetic route for higher-order DNA assembly. To test the algorithm, we partitioned a 783 kb Caulobacter crescentus genome design. We validated the partitioning strategy by assembling a 20 kb test segment encompassing a difficult to synthesize DNA sequence. Successful assembly from 1 kb subblocks into the 20 kb segment highlights the effectiveness of the Genome Partitioner for reducing synthesis costs and timelines for higher-order DNA assembly. The Genome Partitioner is broadly applicable to translate DNA designs into ready to order sequences that can be assembled with standardized protocols, thus offering new opportunities to harness the diversity of microbial genomes for synthetic biology applications. The Genome Partitioner web tool can be accessed at https://christenlab.ethz.ch/GenomePartitioner.
Genome-wide alterations of the DNA replication program during tumor progression

Science.gov (United States)

Arneodo, A.; Goldar, A.; Argoul, F.; Hyrien, O.; Audit, B.

2016-08-01

Oncogenic stress is a major driving force in the early stages of cancer development. Recent experimental findings reveal that, in precancerous lesions and cancers, activated oncogenes may induce stalling and dissociation of DNA replication forks resulting in DNA damage. Replication timing is emerging as an important epigenetic feature that recapitulates several genomic, epigenetic and functional specificities of even closely related cell types. There is increasing evidence that chromosome rearrangements, the hallmark of many cancer genomes, are intimately associated with the DNA replication program and that epigenetic replication timing changes often precede chromosomic rearrangements. The recent development of a novel methodology to map replication fork polarity using deep sequencing of Okazaki fragments has provided new and complementary genome-wide replication profiling data. We review the results of a wavelet-based multi-scale analysis of genomic and epigenetic data including replication profiles along human chromosomes. These results provide new insight into the spatio-temporal replication program and its dynamics during differentiation. Here our goal is to bring to cancer research, the experimental protocols and computational methodologies for replication program profiling, and also the modeling of the spatio-temporal replication program. To illustrate our purpose, we report very preliminary results obtained for the chronic myelogeneous leukemia, the archetype model of cancer. Finally, we discuss promising perspectives on using genome-wide DNA replication profiling as a novel efficient tool for cancer diagnosis, prognosis and personalized treatment.
Global DNA Methylation in the Chestnut Blight Fungus Cryphonectria parasitica and Genome-Wide Changes in DNA Methylation Accompanied with Sectorization

Directory of Open Access Journals (Sweden)

Kum-Kang So

2018-02-01

Full Text Available Mutation in CpBck1, an ortholog of the cell wall integrity mitogen-activated protein kinase kinase kinase (MAPKKK of Saccharomyces cerevisiae, in the chestnut blight fungus Cryphonectria parasitica resulted in a sporadic sectorization as culture proceeded. The progeny from the sectored area maintained the characteristics of the sector, showing a massive morphogenetic change, including robust mycelial growth without differentiation. Epigenetic changes were investigated as the genetic mechanism underlying this sectorization. Quantification of DNA methylation and whole-genome bisulfite sequencing revealed genome-wide DNA methylation of the wild-type at each nucleotide level and changes in DNA methylation of the sectored progeny. Compared to the wild-type, the sectored progeny exhibited marked genome-wide DNA hypomethylation but increased methylation sites. Expression analysis of two DNA methyltransferases, including two representative types of DNA methyltransferase (DNMTase, demonstrated that both were significantly down-regulated in the sectored progeny. However, functional analysis using mutant phenotypes of corresponding DNMTases demonstrated that a mutant of CpDmt1, an ortholog of RID of Neurospora crassa, resulted in the sectored phenotype but the CpDmt2 mutant did not, suggesting that the genetic basis of fungal sectorization is more complex. The present study revealed that a mutation in a signaling pathway component resulted in sectorization accompanied with changes in genome-wide DNA methylation, which suggests that this signal transduction pathway is important for epigenetic control of sectorization via regulation of genes involved in DNA methylation.
Adaptive evolution influences the infectious dose of MERS-CoV necessary to achieve severe respiratory disease.

Science.gov (United States)

Douglas, Madeline G; Kocher, Jacob F; Scobey, Trevor; Baric, Ralph S; Cockrell, Adam S

2018-04-01

We recently established a mouse model (288-330 +/+ ) that developed acute respiratory disease resembling human pathology following infection with a high dose (5 × 10 6 PFU) of mouse-adapted MERS-CoV (icMERSma1). Although this high dose conferred fatal respiratory disease in mice, achieving similar pathology at lower viral doses may more closely reflect naturally acquired infections. Through continued adaptive evolution of icMERSma1 we generated a novel mouse-adapted MERS-CoV (maM35c4) capable of achieving severe respiratory disease at doses between 10 3 and 10 5 PFU. Novel mutations were identified in the maM35c4 genome that may be responsible for eliciting etiologies of acute respiratory distress syndrome at 10-1000 fold lower viral doses. Importantly, comparative genetics of the two mouse-adapted MERS strains allowed us to identify specific mutations that remained fixed through an additional 20 cycles of adaptive evolution. Our data indicate that the extent of MERS-CoV adaptation determines the minimal infectious dose required to achieve severe respiratory disease. Copyright © 2017 Elsevier Inc. All rights reserved.
Chromosomal Localization of DNA Amplifications in Neuroblastoma Tumors Using cDNA Microarray Comparative Genomic Hybridization

Directory of Open Access Journals (Sweden)

Ben Beheshti

2003-01-01

Full Text Available Conventional comparative genomic hybridization (CGH profiling of neuroblastomas has identified many genomic aberrations, although the limited resolution has precluded a precise localization of sequences of interest within amplicons. To map high copy number genomic gains in clinically matched stage IV neuroblastomas, CGH analysis using a 19,200-feature cDNA microarray was used. A dedicated (freely available algorithm was developed for rapid in silico determination of chromosomal localizations of microarray cDNA targets, and for generation of an ideogram-type profile of copy number changes. Using these methodologies, novel gene amplifications undetectable by chromosome CGH were identified, and larger MYCN amplicon sizes (in one tumor up to 6 Mb than those previously reported in neuroblastoma were identified. The genes HPCAL1, LPIN1/KIAA0188, NAG, and NSE1/LOC151354 were found to be coamplified with MYCN. To determine whether stage IV primary tumors could be further subclassified based on their genomic copy number profiles, hierarchical clustering was performed. Cluster analysis of microarray CGH data identified three groups: 1 no amplifications evident, 2 a small MYCN amplicon as the only detectable imbalance, and 3 a large MYCN amplicon with additional gene amplifications. Application of CGH to cDNA microarray targets will help to determine both the variation of amplicon size and help better define amplification-dependent and independent pathways of progression in neuroblastoma.
Trigger Factor and DnaK possess overlapping substrate pools and binding specificities.

Science.gov (United States)

Deuerling, Elke; Patzelt, Holger; Vorderwülbecke, Sonja; Rauch, Thomas; Kramer, Günter; Schaffitzel, Elke; Mogk, Axel; Schulze-Specking, Agnes; Langen, Hanno; Bukau, Bernd

2003-03-01

Ribosome-associated Trigger Factor (TF) and the DnaK chaperone system assist the folding of newly synthesized proteins in Escherichia coli. Here, we show that DnaK and TF share a common substrate pool in vivo. In TF-deficient cells, deltatig, depleted for DnaK and DnaJ the amount of aggregated proteins increases with increasing temperature, amounting to 10% of total soluble protein (approximately 340 protein species) at 37 degrees C. A similar population of proteins aggregated in DnaK depleted tig+ cells, albeit to a much lower extent. Ninety-four aggregated proteins isolated from DnaK- and DnaJ-depleted deltatig cells were identified by mass spectrometry and found to include essential cytosolic proteins. Four potential in vivo substrates were screened for chaperone binding sites using peptide libraries. Although TF and DnaK recognize different binding motifs, 77% of TF binding peptides also associated with DnaK. In the case of the nascent polypeptides TF and DnaK competed for binding, however, with competitive advantage for TF. In vivo, the loss of TF is compensated by the induction of the heat shock response and thus enhanced levels of DnaK. In summary, our results demonstrate that the co-operation of the two mechanistically distinct chaperones in protein folding is based on their overlap in substrate specificities.
Pairagon: a highly accurate, HMM-based cDNA-to-genome aligner.

Science.gov (United States)

Lu, David V; Brown, Randall H; Arumugam, Manimozhiyan; Brent, Michael R

2009-07-01

The most accurate way to determine the intron-exon structures in a genome is to align spliced cDNA sequences to the genome. Thus, cDNA-to-genome alignment programs are a key component of most annotation pipelines. The scoring system used to choose the best alignment is a primary determinant of alignment accuracy, while heuristics that prevent consideration of certain alignments are a primary determinant of runtime and memory usage. Both accuracy and speed are important considerations in choosing an alignment algorithm, but scoring systems have received much less attention than heuristics. We present Pairagon, a pair hidden Markov model based cDNA-to-genome alignment program, as the most accurate aligner for sequences with high- and low-identity levels. We conducted a series of experiments testing alignment accuracy with varying sequence identity. We first created 'perfect' simulated cDNA sequences by splicing the sequences of exons in the reference genome sequences of fly and human. The complete reference genome sequences were then mutated to various degrees using a realistic mutation simulator and the perfect cDNAs were aligned to them using Pairagon and 12 other aligners. To validate these results with natural sequences, we performed cross-species alignment using orthologous transcripts from human, mouse and rat. We found that aligner accuracy is heavily dependent on sequence identity. For sequences with 100% identity, Pairagon achieved accuracy levels of >99.6%, with one quarter of the errors of any other aligner. Furthermore, for human/mouse alignments, which are only 85% identical, Pairagon achieved 87% accuracy, higher than any other aligner. Pairagon source and executables are freely available at http://mblab.wustl.edu/software/pairagon/
Epigenetic control of mobile DNA as an interface between experience and genome change

Directory of Open Access Journals (Sweden)

James A. Shapiro

2014-04-01

Full Text Available Mobile DNA in the genome is subject to RNA-targeted epigenetic control. This control regulates the activity of transposons, retrotransposons and genomic proviruses. Many different life history experiences alter the activities of mobile DNA and the expression of genetic loci regulated by nearby insertions. The same experiences induce alterations in epigenetic formatting and lead to trans-generational modifications of genome expression and stability. These observations lead to the hypothesis that epigenetic formatting directed by non-coding RNA provides a molecular interface between life history events and genome alteration.
Molecular identification of Lodoicea maldivica (coco de mer seeds

Directory of Open Access Journals (Sweden)

Mok Chuen-shing

2011-09-01

Full Text Available Abstract Background The edible endosperm of Lodoicea maldivica with the common name of coco de mer is used in Chinese medicine for treating cough. Native to Seychelles, Lodoicea maldivica seeds have commanded high prices for centuries due to its scarcity. This study aims to develop a molecular identification method for the authentication of Lodoicea maldivica seeds. Methods DNA was extracted from the sample. Two polymerase chain reaction (PCR systems were developed to amplify a region of the chloroplast DNA and the nuclear phosphoribulokinase (PRK region specific to Lodoicea maldivica respectively. DNA sequence of a sample was determined and compared with that of the Lodoicea maldivica reference material. Results The PRK gene of Lodoicea maldivica was successfully amplified and sequenced for identification. Conclusion A new molecular method for the identification of Lodoicea maldivica seeds in fresh, frozen or dried forms was developed.
BuD, a helix–loop–helix DNA-binding domain for genome modification

Energy Technology Data Exchange (ETDEWEB)

Stella, Stefano [Spanish National Cancer Research Centre (CNIO), Calle de Melchor Fernández Almagro 3, 28029 Madrid (Spain); University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen (Denmark); Molina, Rafael; López-Méndez, Blanca [Spanish National Cancer Research Centre (CNIO), Calle de Melchor Fernández Almagro 3, 28029 Madrid (Spain); Juillerat, Alexandre; Bertonati, Claudia; Daboussi, Fayza [Cellectis, 8 Rue de la Croix Jarry, 75013 Paris (France); Campos-Olivas, Ramon [Spanish National Cancer Research Centre (CNIO), Calle de Melchor Fernández Almagro 3, 28029 Madrid (Spain); Duchateau, Phillippe [Cellectis, 8 Rue de la Croix Jarry, 75013 Paris (France); Montoya, Guillermo, E-mail: guillermo.montoya@cpr.ku.dk [Spanish National Cancer Research Centre (CNIO), Calle de Melchor Fernández Almagro 3, 28029 Madrid (Spain); University of Copenhagen, Blegdamsvej 3B, 2200 Copenhagen (Denmark)

2014-07-01

Crystal structures of BurrH and the BurrH–DNA complex are reported. DNA editing offers new possibilities in synthetic biology and biomedicine for modulation or modification of cellular functions to organisms. However, inaccuracy in this process may lead to genome damage. To address this important problem, a strategy allowing specific gene modification has been achieved through the addition, removal or exchange of DNA sequences using customized proteins and the endogenous DNA-repair machinery. Therefore, the engineering of specific protein–DNA interactions in protein scaffolds is key to providing ‘toolkits’ for precise genome modification or regulation of gene expression. In a search for putative DNA-binding domains, BurrH, a protein that recognizes a 19 bp DNA target, was identified. Here, its apo and DNA-bound crystal structures are reported, revealing a central region containing 19 repeats of a helix–loop–helix modular domain (BurrH domain; BuD), which identifies the DNA target by a single residue-to-nucleotide code, thus facilitating its redesign for gene targeting. New DNA-binding specificities have been engineered in this template, showing that BuD-derived nucleases (BuDNs) induce high levels of gene targeting in a locus of the human haemoglobin β (HBB) gene close to mutations responsible for sickle-cell anaemia. Hence, the unique combination of high efficiency and specificity of the BuD arrays can push forward diverse genome-modification approaches for cell or organism redesign, opening new avenues for gene editing.

Salt-Dependent DNA-DNA Spacings in Intact Bacteriophage lambda Reflect Relative Importance of DNA Self-Repulsion and Bending Energies

Energy Technology Data Exchange (ETDEWEB)

X Qiu; D Rau; V Parsegian; L Fang; C Knobler; W Gelbart

2011-12-31

Using solution synchrotron x-ray scattering, we measure the variation of DNA-DNA d spacings in bacteriophage {lambda} with mono-, di-, and polyvalent salt concentrations, for wild-type [48.5 x 10{sup 3} base pairs (bp)] and short-genome-mutant (37.8 kbp) strains. From the decrease in d spacings with increasing salt, we deduce the relative contributions of DNA self-repulsion and bending to the energetics of packaged phage genomes. We quantify the DNA-DNA interaction energies within the intact phage by combining the measured d spacings in the capsid with measurements of osmotic pressure in DNA assemblies under the same salt conditions in bulk solution. In the commonly used Tris-Mg buffer, the DNA-DNA interaction energies inside the phage capsids are shown to be about 1 kT/bp, an order of magnitude larger than the bending energies.
DNA replication origin function is promoted by H3K4 di-methylation in Saccharomyces cerevisiae.

Science.gov (United States)

Rizzardi, Lindsay F; Dorn, Elizabeth S; Strahl, Brian D; Cook, Jeanette Gowen

2012-10-01

DNA replication is a highly regulated process that is initiated from replication origins, but the elements of chromatin structure that contribute to origin activity have not been fully elucidated. To identify histone post-translational modifications important for DNA replication, we initiated a genetic screen to identify interactions between genes encoding chromatin-modifying enzymes and those encoding proteins required for origin function in the budding yeast Saccharomyces cerevisiae. We found that enzymes required for histone H3K4 methylation, both the histone methyltransferase Set1 and the E3 ubiquitin ligase Bre1, are required for robust growth of several hypomorphic replication mutants, including cdc6-1. Consistent with a role for these enzymes in DNA replication, we found that both Set1 and Bre1 are required for efficient minichromosome maintenance. These phenotypes are recapitulated in yeast strains bearing mutations in the histone substrates (H3K4 and H2BK123). Set1 functions as part of the COMPASS complex to mono-, di-, and tri-methylate H3K4. By analyzing strains lacking specific COMPASS complex members or containing H2B mutations that differentially affect H3K4 methylation states, we determined that these replication defects were due to loss of H3K4 di-methylation. Furthermore, histone H3K4 di-methylation is enriched at chromosomal origins. These data suggest that H3K4 di-methylation is necessary and sufficient for normal origin function. We propose that histone H3K4 di-methylation functions in concert with other histone post-translational modifications to support robust genome duplication.
Genome-wide DNA methylation patterns and transcription analysis in sheep muscle.

Directory of Open Access Journals (Sweden)

Christine Couldrey

Full Text Available DNA methylation plays a central role in regulating many aspects of growth and development in mammals through regulating gene expression. The development of next generation sequencing technologies have paved the way for genome-wide, high resolution analysis of DNA methylation landscapes using methodology known as reduced representation bisulfite sequencing (RRBS. While RRBS has proven to be effective in understanding DNA methylation landscapes in humans, mice, and rats, to date, few studies have utilised this powerful method for investigating DNA methylation in agricultural animals. Here we describe the utilisation of RRBS to investigate DNA methylation in sheep Longissimus dorsi muscles. RRBS analysis of ∼1% of the genome from Longissimus dorsi muscles provided data of suitably high precision and accuracy for DNA methylation analysis, at all levels of resolution from genome-wide to individual nucleotides. Combining RRBS data with mRNAseq data allowed the sheep Longissimus dorsi muscle methylome to be compared with methylomes from other species. While some species differences were identified, many similarities were observed between DNA methylation patterns in sheep and other more commonly studied species. The RRBS data presented here highlights the complexity of epigenetic regulation of genes. However, the similarities observed across species are promising, in that knowledge gained from epigenetic studies in human and mice may be applied, with caution, to agricultural species. The ability to accurately measure DNA methylation in agricultural animals will contribute an additional layer of information to the genetic analyses currently being used to maximise production gains in these species.
Relationships among cell survival, O6-alkylguanine-DNA alkyltransferase activity, and reactivation of methylated adenovirus 5 and herpes simplex virus type 1 in human melanoma cell lines

Energy Technology Data Exchange (ETDEWEB)

Maynard, K.; Parsons, P.G.; Cerny, T.; Margison, G.P. (Queensland Institute of Medical Research, Herston (Australia))

1989-09-01

O6-Alkylguanine-DNA alkyltransferase (ATase) activity and host cell reactivation (HCR) of 5-(3-methyl-1-triazeno)imidazole-4-carboxamide (MTIC)-methylated viruses were compared in human melanoma cell lines that were sensitive or resistant to killing by the antitumor DNA-methylating agent MTIC. Enhanced HCR of adenovirus 5 (defined as the Mer+ phenotype) generally showed a semiquantitative correlation with the natural or induced resistance of the host cells to the toxic effects of MTIC and to the level of ATase activity. However, one MTIC-resistant cell line was found (MM170) which had a low level of ATase and intermediate HCR of adenovirus. The HCR of herpes simplex virus type 1 (HSV-1) was enhanced in the Mer+ cells that had natural resistance to MTIC compared with Mer- cells. On the other hand, HCR of HSV-1 in Mer+ cells with induced resistance to MTIC was similar to that in Mer- cells. Neither adenovirus 5 nor HSV-1 infection induced ATase activity in Mer- cells. This indicates that resistance to the toxic effects of methylating agents is not invariably associated with high levels of ATase activity in human melanoma cells. Furthermore, while induction of the Mer+ phenotype from Mer- cells was usually accompanied by the recovery of ATase activity, induced Mer+ cells had less proficient repair than natural Mer+ cells, as judged quantitatively by slightly lower cellular resistance and qualitatively by deficient HCR response for HSV-1. These results suggest that the Mer- and induced Mer+ cells lack an ATase-independent DNA repair mechanism. No differences in MTIC-induced DNA repair synthesis or strand breaks were found between the Mer-, natural Mer+, and induced Mer+ phenotypes. However, UV-induced DNA repair synthesis was higher in the natural Mer+ than in the Mer- or induced Mer+ cells, both of which had increased cellular sensitivity to the antimetabolites methotrexate and hydroxyurea.
Relationships among cell survival, O6-alkylguanine-DNA alkyltransferase activity, and reactivation of methylated adenovirus 5 and herpes simplex virus type 1 in human melanoma cell lines

International Nuclear Information System (INIS)

Maynard, K.; Parsons, P.G.; Cerny, T.; Margison, G.P.

1989-01-01

O6-Alkylguanine-DNA alkyltransferase (ATase) activity and host cell reactivation (HCR) of 5-(3-methyl-1-triazeno)imidazole-4-carboxamide (MTIC)-methylated viruses were compared in human melanoma cell lines that were sensitive or resistant to killing by the antitumor DNA-methylating agent MTIC. Enhanced HCR of adenovirus 5 (defined as the Mer+ phenotype) generally showed a semiquantitative correlation with the natural or induced resistance of the host cells to the toxic effects of MTIC and to the level of ATase activity. However, one MTIC-resistant cell line was found (MM170) which had a low level of ATase and intermediate HCR of adenovirus. The HCR of herpes simplex virus type 1 (HSV-1) was enhanced in the Mer+ cells that had natural resistance to MTIC compared with Mer- cells. On the other hand, HCR of HSV-1 in Mer+ cells with induced resistance to MTIC was similar to that in Mer- cells. Neither adenovirus 5 nor HSV-1 infection induced ATase activity in Mer- cells. This indicates that resistance to the toxic effects of methylating agents is not invariably associated with high levels of ATase activity in human melanoma cells. Furthermore, while induction of the Mer+ phenotype from Mer- cells was usually accompanied by the recovery of ATase activity, induced Mer+ cells had less proficient repair than natural Mer+ cells, as judged quantitatively by slightly lower cellular resistance and qualitatively by deficient HCR response for HSV-1. These results suggest that the Mer- and induced Mer+ cells lack an ATase-independent DNA repair mechanism. No differences in MTIC-induced DNA repair synthesis or strand breaks were found between the Mer-, natural Mer+, and induced Mer+ phenotypes. However, UV-induced DNA repair synthesis was higher in the natural Mer+ than in the Mer- or induced Mer+ cells, both of which had increased cellular sensitivity to the antimetabolites methotrexate and hydroxyurea
Whole-genome sequencing of bloodstream Staphylococcus aureus isolates does not distinguish bacteraemia from endocarditis

DEFF Research Database (Denmark)

Lilje, Berit; Rasmussen, Rasmus Vedby; Dahl, Anders

2017-01-01

Most Staphylococcus aureus isolates can cause invasive disease given the right circumstances, but it is unknown if some isolates are more likely to cause severe infections than others. S. aureus bloodstream isolates from 120 patients with definite infective endocarditis and 121 with S. aureus...... bacteraemia without infective endocarditis underwent whole-genome sequencing. Genome-wide association analysis was performed using a variety of bioinformatics approaches including SNP analysis, accessory genome analysis and k-mer based analysis. Core and accessory genome analyses found no association...... with either of the two clinical groups. In this study, the genome sequences of S. aureus bloodstream isolates did not discriminate between bacteraemia and infective endocarditis. Based on our study and the current literature, it is not convincing that a specific S. aureus genotype is clearly associated...
The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes.

Science.gov (United States)

Kazlauskas, Darius; Krupovic, Mart; Venclovas, Česlovas

2016-06-02

Genomic DNA replication is a complex process that involves multiple proteins. Cellular DNA replication systems are broadly classified into only two types, bacterial and archaeo-eukaryotic. In contrast, double-stranded (ds) DNA viruses feature a much broader diversity of DNA replication machineries. Viruses differ greatly in both completeness and composition of their sets of DNA replication proteins. In this study, we explored whether there are common patterns underlying this extreme diversity. We identified and analyzed all major functional groups of DNA replication proteins in all available proteomes of dsDNA viruses. Our results show that some proteins are common to viruses infecting all domains of life and likely represent components of the ancestral core set. These include B-family polymerases, SF3 helicases, archaeo-eukaryotic primases, clamps and clamp loaders of the archaeo-eukaryotic type, RNase H and ATP-dependent DNA ligases. We also discovered a clear correlation between genome size and self-sufficiency of viral DNA replication, the unanticipated dominance of replicative helicases and pervasive functional associations among certain groups of DNA replication proteins. Altogether, our results provide a comprehensive view on the diversity and evolution of replication systems in the DNA virome and uncover fundamental principles underlying the orchestration of viral DNA replication. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.
An essential nonredundant role for mycobacterial DnaK in native protein folding.

Directory of Open Access Journals (Sweden)

Allison Fay

2014-07-01

Full Text Available Protein chaperones are essential in all domains of life to prevent and resolve protein misfolding during translation and proteotoxic stress. HSP70 family chaperones, including E. coli DnaK, function in stress induced protein refolding and degradation, but are dispensable for cellular viability due to redundant chaperone systems that prevent global nascent peptide insolubility. However, the function of HSP70 chaperones in mycobacteria, a genus that includes multiple human pathogens, has not been examined. We find that mycobacterial DnaK is essential for cell growth and required for native protein folding in Mycobacterium smegmatis. Loss of DnaK is accompanied by proteotoxic collapse characterized by the accumulation of insoluble newly synthesized proteins. DnaK is required for solubility of large multimodular lipid synthases, including the essential lipid synthase FASI, and DnaK loss is accompanied by disruption of membrane structure and increased cell permeability. Trigger Factor is nonessential and has a minor role in native protein folding that is only evident in the absence of DnaK. In unstressed cells, DnaK localizes to multiple, dynamic foci, but relocalizes to focal protein aggregates during stationary phase or upon expression of aggregating peptides. Mycobacterial cells restart cell growth after proteotoxic stress by isolating persistent DnaK containing protein aggregates away from daughter cells. These results reveal unanticipated essential nonredunant roles for mycobacterial DnaK in mycobacteria and indicate that DnaK defines a unique susceptibility point in the mycobacterial proteostasis network.
Multiple displacement amplification of whole genomic DNA from urediospores of Puccinia striiformis f. sp. tritici.

Science.gov (United States)

Zhang, R; Ma, Z H; Wu, B M

2015-05-01

Biotrophic fungi, such as Puccinia striiformis f. sp. tritici, because they cannot be cultured on nutrient media, to obtain adequate quantity of DNA for molecular genetic analysis, are usually propagated on living hosts, wheat plants in case of P. striiformis f. sp. tritici. The propagation process is time-, space- and labor-consuming and has been a bottleneck to molecular genetic analysis of this pathogen. In this study we evaluated multiple displacement amplification (MDA) of pathogen genomic DNA from urediospores as an alternative approach to traditional propagation of urediospores followed by DNA extraction. The quantities of pathogen genomic DNA in the products were further determined via real-time PCR with a pair of primers specific for the β-tubulin gene of P. striiformis f. sp. tritici. The amplified fragment length polymorphism (AFLP) fingerprints were also compared between the DNA products. The results demonstrated that adequate genomic DNA at fragment size larger than 23 Kb could be amplified from 20 to 30 urediospores via MDA method. The real-time PCR results suggested that although fresh urediospores collected from diseased leaves were the best, spores picked from diseased leaves stored for a prolonged period could also be used for amplification. AFLP fingerprints exhibited no significant differences between amplified DNA and DNA extracted with CTAB method, suggesting amplified DNA can represent the pathogen's genomic DNA very well. Therefore, MDA could be used to obtain genomic DNA from small precious samples (dozens of spores) for molecular genetic analysis of wheat stripe rust pathogen, and other fungi that are difficult to propagate.
Molecular interaction study of commercial cyclic peptides and MERS-COV papain-like protease as novel drug candidate for MERS-COV

Science.gov (United States)

Nasution, M. A. F.; Azzuhdi, M. G.; Tambunan, U. S. F.

2017-07-01

Middle-east respiratory syndrome coronavirus (MERS-CoV) has become the current outbreak, MERS-CoV infection results in illness at the respiratory system, digestive, and even lead to death with an average mortality caused by MERS-CoV infection reaches 50 %. Until now, there is not any effective vaccine or drug to ward off MERS-CoV infection. Papain-like protease (PLpro) is responsible for cleavage of a nonstructural protein that is essential for viral maturation. Inhibition of PLpro with a ligand will block the cleavage process of nonstructural protein, thus reduce the infection of MERS-CoV. Through of bioinformatics study with molecular docking and binding interaction analysis of commercial cyclic peptides, aldosterone secretion inhibiting factor (1-35) (bovine) was obtained as an inhibitor for PLpro. Thus, aldosterone secretion inhibiting factor (1-35) (bovine) has a potential as a novel candidate drug for treating MERS-CoV.
Optimization of de novo transcriptome assembly from high-throughput short read sequencing data improves functional annotation for non-model organisms

Directory of Open Access Journals (Sweden)

Haznedaroglu Berat Z

2012-07-01

Full Text Available Abstract Background The k-mer hash length is a key factor affecting the output of de novo transcriptome assembly packages using de Bruijn graph algorithms. Assemblies constructed with varying single k-mer choices might result in the loss of unique contiguous sequences (contigs and relevant biological information. A common solution to this problem is the clustering of single k-mer assemblies. Even though annotation is one of the primary goals of a transcriptome assembly, the success of assembly strategies does not consider the impact of k-mer selection on the annotation output. This study provides an in-depth k-mer selection analysis that is focused on the degree of functional annotation achieved for a non-model organism where no reference genome information is available. Individual k-mers and clustered assemblies (CA were considered using three representative software packages. Pair-wise comparison analyses (between individual k-mers and CAs were produced to reveal missing Kyoto Encyclopedia of Genes and Genomes (KEGG ortholog identifiers (KOIs, and to determine a strategy that maximizes the recovery of biological information in a de novo transcriptome assembly. Results Analyses of single k-mer assemblies resulted in the generation of various quantities of contigs and functional annotations within the selection window of k-mers (k-19 to k-63. For each k-mer in this window, generated assemblies contained certain unique contigs and KOIs that were not present in the other k-mer assemblies. Producing a non-redundant CA of k-mers 19 to 63 resulted in a more complete functional annotation than any single k-mer assembly. However, a fraction of unique annotations remained (~0.19 to 0.27% of total KOIs in the assemblies of individual k-mers (k-19 to k-63 that were not present in the non-redundant CA. A workflow to recover these unique annotations is presented. Conclusions This study demonstrated that different k-mer choices result in various quantities
Concentrating and labeling genomic DNA in a nanofluidic array

DEFF Research Database (Denmark)

Marie, Rodolphe; Pedersen, Jonas Nyvold; Mir, Kalim U.

2018-01-01

, however, hinder the polymerase activity. We demonstrate a device and a protocol for the enzymatic labeling of genomic DNA arranged in a dense array of single molecules without attaching the enzyme or the DNA to a surface. DNA molecules accumulate in a dense array of pits embedded within a nanoslit due...... to entropic trapping. We then perform ϕ29 polymerase extension from single-strand nicks created on the trapped molecules to incorporate fluorescent nucleotides into the DNA. The array of entropic traps can be loaded with λ-DNA molecules to more than 90% of capacity at a flow rate of 10 pL min-1. The final...
Instability of plastid DNA in the nuclear genome.

Directory of Open Access Journals (Sweden)

Anna E Sheppard

2009-01-01

Full Text Available Functional gene transfer from the plastid (chloroplast and mitochondrial genomes to the nucleus has been an important driving force in eukaryotic evolution. Non-functional DNA transfer is far more frequent, and the frequency of such transfers from the plastid to the nucleus has been determined experimentally in tobacco using transplastomic lines containing, in their plastid genome, a kanamycin resistance gene (neo readymade for nuclear expression. Contrary to expectations, non-Mendelian segregation of the kanamycin resistance phenotype is seen in progeny of some lines in which neo has been transferred to the nuclear genome. Here, we provide a detailed analysis of the instability of kanamycin resistance in nine of these lines, and we show that it is due to deletion of neo. Four lines showed instability with variation between progeny derived from different areas of the same plant, suggesting a loss of neo during somatic cell division. One line showed a consistent reduction in the proportion of kanamycin-resistant progeny, suggesting a loss of neo during meiosis, and the remaining four lines were relatively stable. To avoid genomic enlargement, the high frequency of plastid DNA integration into the nuclear genome necessitates a counterbalancing removal process. This is the first demonstration of such loss involving a high proportion of recent nuclear integrants. We propose that insertion, deletion, and rearrangement of plastid sequences in the nuclear genome are important evolutionary processes in the generation of novel nuclear genes. This work is also relevant in the context of transgenic plant research and crop production, because similar processes to those described here may be involved in the loss of plant transgenes.
Measuring the Levels of Ribonucleotides Embedded in Genomic DNA.

Science.gov (United States)

Meroni, Alice; Nava, Giulia M; Sertic, Sarah; Plevani, Paolo; Muzi-Falconi, Marco; Lazzaro, Federico

2018-01-01

Ribonucleotides (rNTPs) are incorporated into genomic DNA at a relatively high frequency during replication. They have beneficial effects but, if not removed from the chromosomes, increase genomic instability. Here, we describe a fast method to easily estimate the amounts of embedded ribonucleotides into the genome. The protocol described is performed in Saccharomyces cerevisiae and allows us to quantify altered levels of rNMPs due to different mutations in the replicative polymerase ε. However, this protocol can be easily applied to cells derived from any organism.
The histone H3K9 methylation and RNAi pathways regulate normalnucleolar and repeated DNA organization by inhibiting formation ofextrachromosomal DNAs

Energy Technology Data Exchange (ETDEWEB)

Peng, Jamy C.; Karpen, Gary H.

2006-06-15

In order to identify regulators of nuclear organization, Drosophila mutants in the Su(var)3-9 histone H3K9 methyltransferase, RNAi pathway components, and other regulators of heterochromatin-mediated gene silencing were examined for altered nucleoli and positioning of repeated DNAs. Animals lacking components of the H3K9 methylation and RNAi pathways contained disorganized nucleoli, ribosomal DNA (rDNA) and satellite DNAs. The levels of H3K9 dimethylation (H3K9me2) in chromatin associated with repeated DNAs decreased dramatically in Su(var)3-9 and dcr-2 (dicer-2) mutant tissues compared to wild type. We also observed a substantial increase in extrachromosomal repeated DNAs in mutant tissues. The disorganized nucleolus phenotype depends on the presence of Ligase 4 (Lig4), and ecc DNA formation is not induced by removal of cohesin. We conclude that H3K9 methylation of rDNA and satellites, maintained by Su(var)3-9, HP1, and the RNAi pathway, is necessary for the structural stability of repeated DNAs, which is mediated through suppression of non-homologous end joining (NHEJ). These results suggest a mechanism for how local chromatin structure can regulate genome stability, and the organization of chromosomal elements and nuclear organelles.
Environmental Conditions Constrain the Distribution and Diversity of Archaeal merA in Yellowstone National Park, Wyoming, U.S.A.

Science.gov (United States)

Wang, Y.; Boyd, E.; Crane, S.; Lu-Irving, P.; Krabbenhoft, D.; King, S.; Dighton, J.; Geesey, G.; Barkay, T.

2011-01-01

The distribution and phylogeny of extant protein-encoding genes recovered from geochemically diverse environments can provide insight into the physical and chemical parameters that led to the origin and which constrained the evolution of a functional process. Mercuric reductase (MerA) plays an integral role in mercury (Hg) biogeochemistry by catalyzing the transformation of Hg(II) to Hg(0). Putative merA sequences were amplified from DNA extracts of microbial communities associated with mats and sulfur precipitates from physicochemically diverse Hg-containing springs in Yellowstone National Park, Wyoming, using four PCR primer sets that were designed to capture the known diversity of merA. The recovery of novel and deeply rooted MerA lineages from these habitats supports previous evidence that indicates merA originated in a thermophilic environment. Generalized linear models indicate that the distribution of putative archaeal merA lineages was constrained by a combination of pH, dissolved organic carbon, dissolved total mercury and sulfide. The models failed to identify statistically well supported trends for the distribution of putative bacterial merA lineages as a function of these or other measured environmental variables, suggesting that these lineages were either influenced by environmental parameters not considered in the present study, or the bacterial primer sets were designed to target too broad of a class of genes which may have responded differently to environmental stimuli. The widespread occurrence of merA in the geothermal environments implies a prominent role for Hg detoxification in these environments. Moreover, the differences in the distribution of the merA genes amplified with the four merA primer sets suggests that the organisms putatively engaged in this activity have evolved to occupy different ecological niches within the geothermal gradient. ?? 2011 Springer Science+Business Media, LLC.
Environmental conditions constrain the distribution and diversity of archaeal merA in Yellowstone National Park, Wyoming, U.S.A.

Science.gov (United States)

Wang, Yanping; Boyd, Eric; Crane, Sharron; Lu-Irving, Patricia; Krabbenhoft, David; King, Susan; Dighton, John; Geesey, Gill; Barkay, Tamar

2011-11-01

The distribution and phylogeny of extant protein-encoding genes recovered from geochemically diverse environments can provide insight into the physical and chemical parameters that led to the origin and which constrained the evolution of a functional process. Mercuric reductase (MerA) plays an integral role in mercury (Hg) biogeochemistry by catalyzing the transformation of Hg(II) to Hg(0). Putative merA sequences were amplified from DNA extracts of microbial communities associated with mats and sulfur precipitates from physicochemically diverse Hg-containing springs in Yellowstone National Park, Wyoming, using four PCR primer sets that were designed to capture the known diversity of merA. The recovery of novel and deeply rooted MerA lineages from these habitats supports previous evidence that indicates merA originated in a thermophilic environment. Generalized linear models indicate that the distribution of putative archaeal merA lineages was constrained by a combination of pH, dissolved organic carbon, dissolved total mercury and sulfide. The models failed to identify statistically well supported trends for the distribution of putative bacterial merA lineages as a function of these or other measured environmental variables, suggesting that these lineages were either influenced by environmental parameters not considered in the present study, or the bacterial primer sets were designed to target too broad of a class of genes which may have responded differently to environmental stimuli. The widespread occurrence of merA in the geothermal environments implies a prominent role for Hg detoxification in these environments. Moreover, the differences in the distribution of the merA genes amplified with the four merA primer sets suggests that the organisms putatively engaged in this activity have evolved to occupy different ecological niches within the geothermal gradient.
Assembly of viral genomes from metagenomes

Directory of Open Access Journals (Sweden)

Saskia L Smits

2014-12-01

Full Text Available Viral infections remain a serious global health issue. Metagenomic approaches are increasingly used in the detection of novel viral pathogens but also to generate complete genomes of uncultivated viruses. In silico identification of complete viral genomes from sequence data would allow rapid phylogenetic characterization of these new viruses. Often, however, complete viral genomes are not recovered, but rather several distinct contigs derived from a single entity, some of which have no sequence homology to any known proteins. De novo assembly of single viruses from a metagenome is challenging, not only because of the lack of a reference genome, but also because of intrapopulation variation and uneven or insufficient coverage. Here we explored different assembly algorithms, remote homology searches, genome-specific sequence motifs, k-mer frequency ranking, and coverage profile binning to detect and obtain viral target genomes from metagenomes. All methods were tested on 454-generated sequencing datasets containing three recently described RNA viruses with a relatively large genome which were divergent to previously known viruses from the viral families Rhabdoviridae and Coronaviridae. Depending on specific characteristics of the target virus and the metagenomic community, different assembly and in silico gap closure strategies were successful in obtaining near complete viral genomes.
Detecting DNA double-stranded breaks in mammalian genomes by linear amplification-mediated high-throughput genome-wide translocation sequencing.

Science.gov (United States)

Hu, Jiazhi; Meyers, Robin M; Dong, Junchao; Panchakshari, Rohit A; Alt, Frederick W; Frock, Richard L

2016-05-01

Unbiased, high-throughput assays for detecting and quantifying DNA double-stranded breaks (DSBs) across the genome in mammalian cells will facilitate basic studies of the mechanisms that generate and repair endogenous DSBs. They will also enable more applied studies, such as those to evaluate the on- and off-target activities of engineered nucleases. Here we describe a linear amplification-mediated high-throughput genome-wide sequencing (LAM-HTGTS) method for the detection of genome-wide 'prey' DSBs via their translocation in cultured mammalian cells to a fixed 'bait' DSB. Bait-prey junctions are cloned directly from isolated genomic DNA using LAM-PCR and unidirectionally ligated to bridge adapters; subsequent PCR steps amplify the single-stranded DNA junction library in preparation for Illumina Miseq paired-end sequencing. A custom bioinformatics pipeline identifies prey sequences that contribute to junctions and maps them across the genome. LAM-HTGTS differs from related approaches because it detects a wide range of broken end structures with nucleotide-level resolution. Familiarity with nucleic acid methods and next-generation sequencing analysis is necessary for library generation and data interpretation. LAM-HTGTS assays are sensitive, reproducible, relatively inexpensive, scalable and straightforward to implement with a turnaround time of <1 week.
Application of k-means clustering algorithm in grouping the DNA sequences of hepatitis B virus (HBV)

Science.gov (United States)

Bustamam, A.; Tasman, H.; Yuniarti, N.; Frisca, Mursidah, I.

2017-07-01

Based on WHO data, an estimated of 15 millions people worldwide who are infected with hepatitis B (HBsAg+), which is caused by HBV virus, are also infected by hepatitis D, which is caused by HDV virus. Hepatitis D infection can occur simultaneously with hepatitis B (co infection) or after a person is exposed to chronic hepatitis B (super infection). Since HDV cannot live without HBV, HDV infection is closely related to HBV infection, hence it is very realistic that every effort of prevention against hepatitis B can indirectly prevent hepatitis D. This paper presents clustering of HBV DNA sequences by using k-means clustering algorithm and R programming. Clustering processes are started with collecting HBV DNA sequences from GenBank, then performing extraction HBV DNA sequences using n-mers frequency and furthermore the extraction results are collected as a matrix and normalized using the min-max normalization with interval [0, 1] which will later be used as an input data. The number of clusters is two and the initial centroid selected of the cluster is chosen randomly. In each iteration, the distance of every object to each centroid are calculated using the Euclidean distance and the minimum distance is selected to determine the membership in a cluster until two convergent clusters are created. As the result, the HBV viruses in the first cluster is more virulent than the HBV viruses in the second cluster, so the HBV viruses in the first cluster can potentially evolve with HDV viruses that cause hepatitis D.

Genomic diversity among drug sensitive and multidrug resistant isolates of Mycobacterium tuberculosis with identical DNA fingerprints.

Directory of Open Access Journals (Sweden)

Stefan Niemann

2009-10-01

Full Text Available Mycobacterium tuberculosis complex (MTBC, the causative agent of tuberculosis (TB, is characterized by low sequence diversity making this bacterium one of the classical examples of a genetically monomorphic pathogen. Because of this limited DNA sequence variation, routine genotyping of clinical MTBC isolates for epidemiological purposes relies on highly discriminatory DNA fingerprinting methods based on mobile and repetitive genetic elements. According to the standard view, isolates exhibiting the same fingerprinting pattern are considered direct progeny of the same bacterial clone, and most likely reflect ongoing transmission or disease relapse within individual patients.Here we further investigated this assumption and used massively parallel whole-genome sequencing to compare one drug-susceptible (K-1 and one multidrug resistant (MDR isolate (K-2 of a rapidly spreading M. tuberculosis Beijing genotype clone from a high incidence region (Karakalpakstan, Uzbekistan. Both isolates shared the same IS6110 RFLP pattern and the same allele at 23 out of 24 MIRU-VNTR loci. We generated 23.9 million (K-1 and 33.0 million (K-2 paired 50 bp purity filtered reads corresponding to a mean coverage of 483.5 fold and 656.1 fold respectively. Compared with the laboratory strain H37Rv both Beijing isolates shared 1,209 SNPs. The two Beijing isolates differed by 130 SNPs and one large deletion. The susceptible isolate had 55 specific SNPs, while the MDR variant had 75 specific SNPs, including the five known resistance-conferring mutations.Our results suggest that M. tuberculosis isolates exhibiting identical DNA fingerprinting patterns can harbour substantial genomic diversity. Because this heterogeneity is not captured by traditional genotyping of MTBC, some aspects of the transmission dynamics of tuberculosis could be missed or misinterpreted. Furthermore, a valid differentiation between disease relapse and exogenous reinfection might be impossible using
Genomic diversity among drug sensitive and multidrug resistant isolates of Mycobacterium tuberculosis with identical DNA fingerprints.

Science.gov (United States)

Niemann, Stefan; Köser, Claudio U; Gagneux, Sebastien; Plinke, Claudia; Homolka, Susanne; Bignell, Helen; Carter, Richard J; Cheetham, R Keira; Cox, Anthony; Gormley, Niall A; Kokko-Gonzales, Paula; Murray, Lisa J; Rigatti, Roberto; Smith, Vincent P; Arends, Felix P M; Cox, Helen S; Smith, Geoff; Archer, John A C

2009-10-12

Mycobacterium tuberculosis complex (MTBC), the causative agent of tuberculosis (TB), is characterized by low sequence diversity making this bacterium one of the classical examples of a genetically monomorphic pathogen. Because of this limited DNA sequence variation, routine genotyping of clinical MTBC isolates for epidemiological purposes relies on highly discriminatory DNA fingerprinting methods based on mobile and repetitive genetic elements. According to the standard view, isolates exhibiting the same fingerprinting pattern are considered direct progeny of the same bacterial clone, and most likely reflect ongoing transmission or disease relapse within individual patients. Here we further investigated this assumption and used massively parallel whole-genome sequencing to compare one drug-susceptible (K-1) and one multidrug resistant (MDR) isolate (K-2) of a rapidly spreading M. tuberculosis Beijing genotype clone from a high incidence region (Karakalpakstan, Uzbekistan). Both isolates shared the same IS6110 RFLP pattern and the same allele at 23 out of 24 MIRU-VNTR loci. We generated 23.9 million (K-1) and 33.0 million (K-2) paired 50 bp purity filtered reads corresponding to a mean coverage of 483.5 fold and 656.1 fold respectively. Compared with the laboratory strain H37Rv both Beijing isolates shared 1,209 SNPs. The two Beijing isolates differed by 130 SNPs and one large deletion. The susceptible isolate had 55 specific SNPs, while the MDR variant had 75 specific SNPs, including the five known resistance-conferring mutations. Our results suggest that M. tuberculosis isolates exhibiting identical DNA fingerprinting patterns can harbour substantial genomic diversity. Because this heterogeneity is not captured by traditional genotyping of MTBC, some aspects of the transmission dynamics of tuberculosis could be missed or misinterpreted. Furthermore, a valid differentiation between disease relapse and exogenous reinfection might be impossible using standard
In Vitro Whole Genome DNA Binding Analysis of the Bacterial Replication Initiator and Transcription Factor DnaA.

Directory of Open Access Journals (Sweden)

Janet L Smith

2015-05-01

Full Text Available DnaA, the replication initiation protein in bacteria, is an AAA+ ATPase that binds and hydrolyzes ATP and exists in a heterogeneous population of ATP-DnaA and ADP-DnaA. DnaA binds cooperatively to the origin of replication and several other chromosomal regions, and functions as a transcription factor at some of these regions. We determined the binding properties of Bacillus subtilis DnaA to genomic DNA in vitro at single nucleotide resolution using in vitro DNA affinity purification and deep sequencing (IDAP-Seq. We used these data to identify 269 binding regions, refine the consensus sequence of the DnaA binding site, and compare the relative affinity of binding regions for ATP-DnaA and ADP-DnaA. Most sites had a slightly higher affinity for ATP-DnaA than ADP-DnaA, but a few had a strong preference for binding ATP-DnaA. Of the 269 sites, only the eight strongest binding ones have been observed to bind DnaA in vivo, suggesting that other cellular factors or the amount of available DnaA in vivo restricts DnaA binding to these additional sites. Conversely, we found several chromosomal regions that were bound by DnaA in vivo but not in vitro, and that the nucleoid-associated protein Rok was required for binding in vivo. Our in vitro characterization of the inherent ability of DnaA to bind the genome at single nucleotide resolution provides a backdrop for interpreting data on in vivo binding and regulation of DnaA, and is an approach that should be adaptable to many other DNA binding proteins.
Whole-genome amplified DNA from stored dried blood spots is reliable in high resolution melting curve and sequencing analysis

DEFF Research Database (Denmark)

Winkel, Bo G; Hollegaard, Mads V; Olesen, Morten S

2011-01-01

BACKGROUND: The use of dried blood spots (DBS) samples in genomic workup has been limited by the relative low amounts of genomic DNA (gDNA) they contain. It remains to be proven that whole genome amplified DNA (wgaDNA) from stored DBS samples, constitutes a reliable alternative to gDNA.We wanted...
Reference genome-independent assessment of mutation density using restriction enzyme-phased sequencing

Directory of Open Access Journals (Sweden)

Monson-Miller Jennifer

2012-02-01

Full Text Available Abstract Background The availability of low cost sequencing has spurred its application to discovery and typing of variation, including variation induced by mutagenesis. Mutation discovery is challenging as it requires a substantial amount of sequencing and analysis to detect very rare changes and distinguish them from noise. Also challenging are the cases when the organism of interest has not been sequenced or is highly divergent from the reference. Results We describe the development of a simple method for reduced representation sequencing. Input DNA was digested with a single restriction enzyme and ligated to Y adapters modified to contain a sequence barcode and to provide a compatible overhang for ligation. We demonstrated the efficiency of this method at SNP discovery using rice and arabidopsis. To test its suitability for the discovery of very rare SNP, one control and three mutagenized rice individuals (1, 5 and 10 mM sodium azide were used to prepare genomic libraries for Illumina sequencers by ligating barcoded adapters to NlaIII restriction sites. For genome-dependent discovery 15-30 million of 80 base reads per individual were aligned to the reference sequence achieving individual sequencing coverage from 7 to 15×. We identified high-confidence base changes by comparing sequences across individuals and identified instances consistent with mutations, i.e. changes that were found in a single treated individual and were solely GC to AT transitions. For genome-independent discovery 70-mers were extracted from the sequence of the control individual and single-copy sequence was identified by comparing the 70-mers across samples to evaluate copy number and variation. This de novo "genome" was used to align the reads and identify mutations as above. Covering approximately 1/5 of the 380 Mb genome of rice we detected mutation densities ranging from 0.6 to 4 per Mb of diploid DNA depending on the mutagenic treatment. Conclusions The
Mapping the space of genomic signatures.

Directory of Open Access Journals (Sweden)

Lila Kari

Full Text Available We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR, is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural Dissimilarity Index (DSSIM, implicitly compares the occurrences of oligomers of length up to k (herein k = 9 in DNA sequences. We computed DSSIM distances for more than 5 million pairs of complete mitochondrial genomes, and used Multi-Dimensional Scaling (MDS to obtain Molecular Distance Maps that visually display the sequence relatedness in various subsets, at different taxonomic levels. This general-purpose method does not require DNA sequence alignment and can thus be used to compare similar or vastly different DNA sequences, genomic or computer-generated, of the same or different lengths. We illustrate potential uses of this approach by applying it to several taxonomic subsets: phylum Vertebrata, (superkingdom Protista, classes Amphibia-Insecta-Mammalia, class Amphibia, and order Primates. This analysis of an extensive dataset confirms that the oligomer composition of full mtDNA sequences can be a source of taxonomic information. This method also correctly finds the mtDNA sequences most closely related to that of the anatomically modern human (the Neanderthal
Genome-Wide Prediction of DNA Methylation Using DNA Composition and Sequence Complexity in Human.

Science.gov (United States)

Wu, Chengchao; Yao, Shixin; Li, Xinghao; Chen, Chujia; Hu, Xuehai

2017-02-16

DNA methylation plays a significant role in transcriptional regulation by repressing activity. Change of the DNA methylation level is an important factor affecting the expression of target genes and downstream phenotypes. Because current experimental technologies can only assay a small proportion of CpG sites in the human genome, it is urgent to develop reliable computational models for predicting genome-wide DNA methylation. Here, we proposed a novel algorithm that accurately extracted sequence complexity features (seven features) and developed a support-vector-machine-based prediction model with integration of the reported DNA composition features (trinucleotide frequency and GC content, 65 features) by utilizing the methylation profiles of embryonic stem cells in human. The prediction results from 22 human chromosomes with size-varied windows showed that the 600-bp window achieved the best average accuracy of 94.7%. Moreover, comparisons with two existing methods further showed the superiority of our model, and cross-species predictions on mouse data also demonstrated that our model has certain generalization ability. Finally, a statistical test of the experimental data and the predicted data on functional regions annotated by ChromHMM found that six out of 10 regions were consistent, which implies reliable prediction of unassayed CpG sites. Accordingly, we believe that our novel model will be useful and reliable in predicting DNA methylation.
A direct detection of Escherichia coli genomic DNA using gold nanoprobes

Directory of Open Access Journals (Sweden)

Padmavathy

2012-02-01

Full Text Available Abstract Background In situation like diagnosis of clinical and forensic samples there exists a need for highly sensitive, rapid and specific DNA detection methods. Though conventional DNA amplification using PCR can provide fast results, it is not widely practised in diagnostic laboratories partially because it requires skilled personnel and expensive equipment. To overcome these limitations nanoparticles have been explored as signalling probes for ultrasensitive DNA detection that can be used in field applications. Among the nanomaterials, gold nanoparticles (AuNPs have been extensively used mainly because of its optical property and ability to get functionalized with a variety of biomolecules. Results We report a protocol for the use of gold nanoparticles functionalized with single stranded oligonucleotide (AuNP- oligo probe as visual detection probes for rapid and specific detection of Escherichia coli. The AuNP- oligo probe on hybridization with target DNA containing complementary sequences remains red whereas test samples without complementary DNA sequences to the probe turns purple due to acid induced aggregation of AuNP- oligo probes. The color change of the solution is observed visually by naked eye demonstrating direct and rapid detection of the pathogenic Escherichia coli from its genomic DNA without the need for PCR amplification. The limit of detection was ~54 ng for unamplified genomic DNA. The method requires less than 30 minutes to complete after genomic DNA extraction. However, by using unamplified enzymatic digested genomic DNA, the detection limit of 11.4 ng was attained. Results of UV-Vis spectroscopic measurement and AFM imaging further support the hypothesis of aggregation based visual discrimination. To elucidate its utility in medical diagnostic, the assay was validated on clinical strains of pathogenic Escherichia coli obtained from local hospitals and spiked urine samples. It was found to be 100% sensitive and proves to
RICD: A rice indica cDNA database resource for rice functional genomics

Directory of Open Access Journals (Sweden)

Zhang Qifa

2008-11-01

Full Text Available Abstract Background The Oryza sativa L. indica subspecies is the most widely cultivated rice. During the last few years, we have collected over 20,000 putative full-length cDNAs and over 40,000 ESTs isolated from various cDNA libraries of two indica varieties Guangluai 4 and Minghui 63. A database of the rice indica cDNAs was therefore built to provide a comprehensive web data source for searching and retrieving the indica cDNA clones. Results Rice Indica cDNA Database (RICD is an online MySQL-PHP driven database with a user-friendly web interface. It allows investigators to query the cDNA clones by keyword, genome position, nucleotide or protein sequence, and putative function. It also provides a series of information, including sequences, protein domain annotations, similarity search results, SNPs and InDels information, and hyperlinks to gene annotation in both The Rice Annotation Project Database (RAP-DB and The TIGR Rice Genome Annotation Resource, expression atlas in RiceGE and variation report in Gramene of each cDNA. Conclusion The online rice indica cDNA database provides cDNA resource with comprehensive information to researchers for functional analysis of indica subspecies and for comparative genomics. The RICD database is available through our website http://www.ncgr.ac.cn/ricd.
Impact of Sample Type and DNA Isolation Procedure on Genomic Inference of Microbiome Composition

DEFF Research Database (Denmark)

Knudsen, Berith Elkær; Bergmark, Lasse; Munk, Patrick

2016-01-01

that in standard protocols. Based on this insight, we designed an improved DNA isolation procedure optimized for microbiome genomics that can be used for the three examined specimen types and potentially also for other biological specimens. A standard operating procedure is available from https://dx.doi.org/10......Explorations of complex microbiomes using genomics greatly enhance our understanding about their diversity, biogeography, and function. The isolation of DNA from microbiome specimens is a key prerequisite for such examinations, but challenges remain in obtaining sufficient DNA quantities required...... for certain sequencing approaches, achieving accurate genomic inference of microbiome composition, and facilitating comparability of findings across specimen types and sequencing projects. These aspects are particularly relevant for the genomics-based global surveillance of infectious agents and antimicrobial...
Relevance of DNA repair pathways on ascorbic acid effects on Echerichia Coli K-12 cells

International Nuclear Information System (INIS)

Slyus, M.A. van; Oliveira, R.L.B. da C.; Felzenszwalb, I.; Gomes, R.A.; Menck, C.F.

1985-01-01

Inactivation kinetics were performed with repair proficient and deficient Escherichia coli K-12 cells treated with oxidized solutions of ascorbic acid. The repair pathways controlled by the recA and uvrA gene products are essential for cell survival to the treatment. However, SOS chromotest result indicates that the SOS functions are only induced at high and toxic concentrations of the drug. Moreover, single strand breaks in DNA from treated cells are detected, demonstrating genome damage promoted by oxidized solutions of ascorbate. (M.A.C.) [pt
G-Quadruplexes Involving Both Strands of Genomic DNA Are Highly Abundant and Colocalize with Functional Sites in the Human Genome.

Directory of Open Access Journals (Sweden)

Andrzej S Kudlicki

Full Text Available The G-quadruplex is a non-canonical DNA structure biologically significant in DNA replication, transcription and telomere stability. To date, only G4s with all guanines originating from the same strand of DNA have been considered in the context of the human nuclear genome. Here, I discuss interstrand topological configurations of G-quadruplex DNA, consisting of guanines from both strands of genomic DNA; an algorithm is presented for predicting such structures. I have identified over 550,000 non-overlapping interstrand G-quadruplex forming sequences in the human genome--significantly more than intrastrand configurations. Functional analysis of interstrand G-quadruplex sites shows strong association with transcription initiation, the results are consistent with the XPB and XPD transcriptional helicases binding only to G-quadruplex DNA with interstrand topology. Interstrand quadruplexes are also enriched in origin of replication sites. Several topology classes of interstrand quadruplex-forming sequences are possible, and different topologies are enriched in different types of structural elements. The list of interstrand quadruplex forming sequences, and the computer program used for their prediction are available at the web address http://moment.utmb.edu/allquads.
Studies on the Interaction between Zinc-Hydroxybenzoite Complex and Genomic DNA

Directory of Open Access Journals (Sweden)

Hacali Necefoglu

2006-04-01

Full Text Available Zinc-Hydroxybenzoite ([Zn (H206] (p-HO-C6H4COO22H20 complex which wassynthesized and characterized by instrumental methods and the DNA samples which hadbeen isolated from cattle were allowed to interact at 37 oC for different time periods. Theinteraction of genomic DNA with this complex has been followed by agarose gelelectrophoresis at 50 V for 2 h. When DNA samples were allowed to interact with this metalcomplex, it was found that band intensities changed with the concentrations of the complex.In the result of interaction between this complex and genomic DNA samples, it wasdetermined that the intensities of bands were changed at the different concentrations of thecomplex. The brightness of the bands was increased and mobility of the bands wasdecreased, indicating the occurrence of increased covalent binding of the metal complexwith DNA. In this study it was concluded that the damage effect of ascorbate was reducedby Zinc-Hydroxybenzoite.
THE PERCEPTION OF OLD AND NEW VALUES IN THE TWO STORIES OF ÖMER SEYFETTİN “BAHAR VE KELEBEKLER” AND “AŞK DALGASI” ÖMER SEYFETTİN’İN “BAHAR VE KELEBEKLER” İLE “AŞK DALGASI” ADLI İKİ HİKÂYESİNDE ESKİ VE YENİ DEĞERLER ALGISI

Directory of Open Access Journals (Sweden)

Gülten BULDUKER

2012-01-01

Full Text Available Ömer Seyfettin, a Pioneer within them ovement of Genç Kalemler, is a distinguished writer with his essay sand stories. He has aimed to write his work swith a pure language and national in an emotional way. The attitude, the post Tanzimat philosophers had taken, with the ideas towards the value judgements, more or less have defined the contents of his works. The writer is in search of new values to shape the society. In this article it has been the main goal toindicate Ömer Seyfettin’s ideas about the old and the modern way of life, and some sort of dilemmas he has been through in Bahar ve Kelebekler and Aşk Dalgası. It is seen that the writer, who believes that maro cilised society can be provided only by breaking hard fanaticism, supports equality of men and women, criticises arranged marriage style, is dreaming of a progressand development Ömer Seyfettin, öncüsü olduğu Genç Kalemler hareketi içerisinde hikâye ve makaleleriyle öne çıkan bir yazardır. Eserlerini sade bir dil ve millî bir duyuş-düşünüşle kaleme almayı amaçlamıştır. Tanzimat sonrası fikir adamlarının “Doğu ve Batı”ya ait değer yargıları karşısındaki düşünceleri ile almış oldukları tavır, az çok onun eserlerinin de muhtevasını belirlemiştir. Yazar, eski gelenek ve göreneklerin önemini yitirmesi durumunda, topluma şekil verecek yeni değerlerin arayışı içerisindedir. Bu makalede Ömer Seyfettin’in Bahar ve Kelebekler ile Aşk Dalgası adlı hikâyelerinde eski ve yeni yaşam tarzı konusundaki düşünceleri ile düştüğü bazı çelişkili durumlar sergilenmeye çalışılmıştır. Daha medenî bir toplum yapısının katı taassupların kırılmasıyla sağlanabileceğine inanan yazarın, kadın-erkek eşitliğini savunduğu, görücü usulünü eleştirdiği; ilim ve fen ışığında bir yükselme, ilerleme hayal ettiği görülmektedir.
Indel Group in Genomes (IGG) Molecular Genetic Markers1[OPEN

Science.gov (United States)

Burkart-Waco, Diana; Kuppu, Sundaram; Britt, Anne; Chetelat, Roger

2016-01-01

Genetic markers are essential when developing or working with genetically variable populations. Indel Group in Genomes (IGG) markers are primer pairs that amplify single-locus sequences that differ in size for two or more alleles. They are attractive for their ease of use for rapid genotyping and their codominant nature. Here, we describe a heuristic algorithm that uses a k-mer-based approach to search two or more genome sequences to locate polymorphic regions suitable for designing candidate IGG marker primers. As input to the IGG pipeline software, the user provides genome sequences and the desired amplicon sizes and size differences. Primer sequences flanking polymorphic insertions/deletions are produced as output. IGG marker files for three sets of genomes, Solanum lycopersicum/Solanum pennellii, Arabidopsis (Arabidopsis thaliana) Columbia-0/Landsberg erecta-0 accessions, and S. lycopersicum/S. pennellii/Solanum tuberosum (three-way polymorphic) are included. PMID:27436831
Efficient identification of Y chromosome sequences in the human and Drosophila genomes

Science.gov (United States)

Carvalho, Antonio Bernardo; Clark, Andrew G.

2013-01-01

Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes. PMID:23921660
Efficient identification of Y chromosome sequences in the human and Drosophila genomes.

Science.gov (United States)

Carvalho, Antonio Bernardo; Clark, Andrew G

2013-11-01

Notwithstanding their biological importance, Y chromosomes remain poorly known in most species. A major obstacle to their study is the identification of Y chromosome sequences; due to its high content of repetitive DNA, in most genome projects, the Y chromosome sequence is fragmented into a large number of small, unmapped scaffolds. Identification of Y-linked genes among these fragments has yielded important insights about the origin and evolution of Y chromosomes, but the process is labor intensive, restricting studies to a small number of species. Apart from these fragmentary assemblies, in a few mammalian species, the euchromatic sequence of the Y is essentially complete, owing to painstaking BAC mapping and sequencing. Here we use female short-read sequencing and k-mer comparison to identify Y-linked sequences in two very different genomes, Drosophila virilis and human. Using this method, essentially all D. virilis scaffolds were unambiguously classified as Y-linked or not Y-linked. We found 800 new scaffolds (totaling 8.5 Mbp), and four new genes in the Y chromosome of D. virilis, including JYalpha, a gene involved in hybrid male sterility. Our results also strongly support the preponderance of gene gains over gene losses in the evolution of the Drosophila Y. In the intensively studied human genome, used here as a positive control, we recovered all previously known genes or gene families, plus a small amount (283 kb) of new, unfinished sequence. Hence, this method works in large and complex genomes and can be applied to any species with sex chromosomes.
H4K20me0 marks post-replicative chromatin and recruits the TONSL₋MMS22L DNA repair complex

Energy Technology Data Exchange (ETDEWEB)

Saredi, Giulia; Huang, Hongda; Hammond, Colin M.; Alabert, Constance; Bekker-Jensen, Simon; Forne, Ignasi; Reverón-Gómez, Nazaret; Foster, Benjamin M.; Mlejnkova, Lucie; Bartke, Till; Cejka, Petr; Mailand, Niels; Imhof, Axel; Patel, Dinshaw J.; Groth, Anja [UCopenhagen; (MSKCC); (ICL); (LMU); (Zurich)

2016-06-22

Here, we report that after DNA replication, chromosomal processes including DNA repair and transcription take place in the context of sister chromatids. While cell cycle regulation can guide these processes globally, mechanisms to distinguish pre- and post-replicative states locally remain unknown. In this paper we reveal that new histones incorporated during DNA replication provide a signature of post-replicative chromatin, read by the human TONSL–MMS22L^{1, 2, 3, 4} homologous recombination complex. We identify the TONSL ankyrin repeat domain (ARD) as a reader of histone H4 tails unmethylated at K20 (H4K20me0), which are specific to new histones incorporated during DNA replication and mark post-replicative chromatin until the G2/M phase of the cell cycle. Accordingly, TONSL–MMS22L binds new histones H3–H4 both before and after incorporation into nucleosomes, remaining on replicated chromatin until late G2/M. H4K20me0 recognition is required for TONSL–MMS22L binding to chromatin and accumulation at challenged replication forks and DNA lesions. Consequently, TONSL ARD mutants are toxic, compromising genome stability, cell viability and resistance to replication stress. Finally, together, these data reveal a histone-reader-based mechanism for recognizing the post-replicative state, offering a new angle to understand DNA repair with the potential for targeted cancer therapy.
MERS-coronavirus: From discovery to intervention

NARCIS (Netherlands)

W. Widagdo; N.M.A. Okba (Nisreen); V. Stalin Raj; B.L. Haagmans (Bart)

2017-01-01

textabstractMiddle East respiratory syndrome coronavirus (MERS-CoV) still causes outbreaks despite public awareness and implementation of health care measures, such as rapid viral diagnosis and patient quarantine. Here we describe the current epidemiological picture of MERS-CoV, focusing on humans
Involvement of Disperse Repetitive Sequences in Wheat/Rye Genome Adjustment

Directory of Open Access Journals (Sweden)

Manuela Silva

2012-07-01

Full Text Available The union of different genomes in the same nucleus frequently results in hybrid genotypes with improved genome plasticity related to both genome remodeling events and changes in gene expression. Most modern cereal crops are polyploid species. Triticale, synthesized by the cross between wheat and rye, constitutes an excellent model to study polyploidization functional implications. We intend to attain a deeper knowledge of dispersed repetitive sequence involvement in parental genome reshuffle in triticale and in wheat-rye addition lines that have the entire wheat genome plus each rye chromosome pair. Through Random Amplified Polymorphic DNA (RAPD analysis with OPH20 10-mer primer we unraveled clear alterations corresponding to the loss of specific bands from both parental genomes. Moreover, the sequential nature of those events was revealed by the increased absence of rye-origin bands in wheat-rye addition lines in comparison with triticale. Remodeled band sequencing revealed that both repetitive and coding genome domains are affected in wheat-rye hybrid genotypes. Additionally, the amplification and sequencing of pSc20H internal segments showed that the disappearance of parental bands may result from restricted sequence alterations and unraveled the involvement of wheat/rye related repetitive sequences in genome adjustment needed for hybrid plant stabilization.

Evaluating droplet digital PCR for the quantification of human genomic DNA: converting copies per nanoliter to nanograms nuclear DNA per microliter.

Science.gov (United States)

Duewer, David L; Kline, Margaret C; Romsos, Erica L; Toman, Blaza

2018-05-01

The highly multiplexed polymerase chain reaction (PCR) assays used for forensic human identification perform best when used with an accurately determined quantity of input DNA. To help ensure the reliable performance of these assays, we are developing a certified reference material (CRM) for calibrating human genomic DNA working standards. To enable sharing information over time and place, CRMs must provide accurate and stable values that are metrologically traceable to a common reference. We have shown that droplet digital PCR (ddPCR) limiting dilution end-point measurements of the concentration of DNA copies per volume of sample can be traceably linked to the International System of Units (SI). Unlike values assigned using conventional relationships between ultraviolet absorbance and DNA mass concentration, entity-based ddPCR measurements are expected to be stable over time. However, the forensic community expects DNA quantity to be stated in terms of mass concentration rather than entity concentration. The transformation can be accomplished given SI-traceable values and uncertainties for the number of nucleotide bases per human haploid genome equivalent (HHGE) and the average molar mass of a nucleotide monomer in the DNA polymer. This report presents the considerations required to establish the metrological traceability of ddPCR-based mass concentration estimates of human nuclear DNA. Graphical abstract The roots of metrological traceability for human nuclear DNA mass concentration results. Values for the factors in blue must be established experimentally. Values for the factors in red have been established from authoritative source materials. HHGE stands for "haploid human genome equivalent"; there are two HHGE per diploid human genome.
Comparative Genomics of DNA Recombination and Repair in Cyanobacteria: Biotechnological Implications

Science.gov (United States)

Cassier-Chauvat, Corinne; Veaudor, Théo; Chauvat, Franck

2016-01-01

Cyanobacteria are fascinating photosynthetic prokaryotes that are regarded as the ancestors of the plant chloroplast; the purveyors of oxygen and biomass for the food chain; and promising cell factories for an environmentally friendly production of chemicals. In colonizing most waters and soils of our planet, cyanobacteria are inevitably challenged by environmental stresses that generate DNA damages. Furthermore, many strains engineered for biotechnological purposes can use DNA recombination to stop synthesizing the biotechnological product. Hence, it is important to study DNA recombination and repair in cyanobacteria for both basic and applied research. This review reports what is known in a few widely studied model cyanobacteria and what can be inferred by mining the sequenced genomes of morphologically and physiologically diverse strains. We show that cyanobacteria possess many E. coli-like DNA recombination and repair genes, and possibly other genes not yet identified. E. coli-homolog genes are unevenly distributed in cyanobacteria, in agreement with their wide genome diversity. Many genes are extremely well conserved in cyanobacteria (mutMS, radA, recA, recFO, recG, recN, ruvABC, ssb, and uvrABCD), even in small genomes, suggesting that they encode the core DNA repair process. In addition to these core genes, the marine Prochlorococcus and Synechococcus strains harbor recBCD (DNA recombination), umuCD (mutational DNA replication), as well as the key SOS genes lexA (regulation of the SOS system) and sulA (postponing of cell division until completion of DNA reparation). Hence, these strains could possess an E. coli-type SOS system. In contrast, several cyanobacteria endowed with larger genomes lack typical SOS genes. For examples, the two studied Gloeobacter strains lack alkB, lexA, and sulA; and Synechococcus PCC7942 has neither lexA nor recCD. Furthermore, the Synechocystis PCC6803 lexA product does not regulate DNA repair genes. Collectively, these findings
MERS transmission and risk factors: a systematic review.

Science.gov (United States)

Park, Ji-Eun; Jung, Soyoung; Kim, Aeran; Park, Ji-Eun

2018-05-02

Since Middle East respiratory syndrome (MERS) infection was first reported in 2012, many studies have analysed its transmissibility and severity. However, the methodology and results of these studies have varied, and there has been no systematic review of MERS. This study reviews the characteristics and associated risk factors of MERS. We searched international (PubMed, ScienceDirect, Cochrane) and Korean databases (DBpia, KISS) for English- or Korean-language articles using the terms "MERS" and "Middle East respiratory syndrome". Only human studies with > 20 participants were analysed to exclude studies with low representation. Epidemiologic studies with information on transmissibility and severity of MERS as well as studies containing MERS risk factors were included. A total of 59 studies were included. Most studies from Saudi Arabia reported higher mortality (22-69.2%) than those from South Korea (20.4%). While the R 0 value in Saudi Arabia was < 1 in all but one study, in South Korea, the R 0 value was 2.5-8.09 in the early stage and decreased to < 1 in the later stage. The incubation period was 4.5-5.2 days in Saudi Arabia and 6-7.8 days in South Korea. Duration from onset was 4-10 days to confirmation, 2.9-5.3 days to hospitalization, 11-17 days to death, and 14-20 days to discharge. Older age and concomitant disease were the most common factors related to MERS infection, severity, and mortality. The transmissibility and severity of MERS differed by outbreak region and patient characteristics. Further studies assessing the risk of MERS should consider these factors.
MERS-CoV and H5N1 influenza virus antagonize antigen presentation by altering the epigenetic landscape

Energy Technology Data Exchange (ETDEWEB)

Menachery, Vineet D.; Schafer, Alexandra; Burnum-Johnson, Kristin E.; Mitchell, Hugh D.; Eisfeld-Fenney, Amie J.; Walters, Kevin B.; Nicora, Carrie D.; Purvine, Samuel O.; Casey, Cameron P.; Monroe, Matthew E.; Weitz, Karl K.; Stratton, Kelly G.; Webb-Robertson, Bobbie-Jo M.; Gralinski, Lisa; Metz, Thomas O.; Smith, Richard D.; Waters, Katrina M.; Sims, Amy C.; Kawaoka, Yoshihiro; Baric, Ralph

2018-01-16

Convergent evolution dictates that diverse groups of viruses will target both similar and distinct host pathways in order to manipulate the immune response and improve infection. In this study, we sought to leverage this uneven viral antagonism to identify critical host factors that govern disease outcome. Utilizing a systems based approach, we examined differential regulation of IFNγ dependent genes following infection with highly pathogenic viruses including influenza (H5N1-VN1203, H1N1-CA04) and coronaviruses (SARS-CoV, MERS-CoV). Categorizing by function, we observed down regulation of genes associated with antigen presentation following both H5N1-VN1203 and MERS-CoV infection. Further examination revealed global down regulation of antigen presentation genes and was confirmed by proteomics for both H5N1-VN1203 and MERS-CoV infection. Importantly, epigenetic analysis suggested that DNA methylation rather than histone modification plays a crucial role in MERS-CoV mediated antagonism of antigen presentation genes; in contrast, H5N1-VN1203 likely utilizes a combination of epigenetic mechanisms to target antigen presentation. Together, the results indicate a common approach utilized by H5N1-VN1203 and MERS-CoV to modulate antigen presentation and the host adaptive immune response.
Role of oxidative DNA damage in genome instability and cancer

International Nuclear Information System (INIS)

Bignami, M.; Kunkel, T.

2009-01-01

Inactivation of mismatch repair (MMR) is associated with a dramatic genomic instability that is observed experimentally as a mutator phenotype and micro satellite instability (MSI). It has been implicit that the massive genetic instability in MMR defective cells simply reflects the accumulation of spontaneous DNA polymerase errors during DNA replication. We recently identified oxidation damage, a common threat to DNA integrity to which purines are very susceptible, as an important cofactor in this genetic instability
Marker-assisted introgression of drought tolerance from wild ancestors into popular Indian rice varieties using a 7K Infinium SNP array

Directory of Open Access Journals (Sweden)

Ravindra Donde

2017-10-01

Full Text Available Recent advances in the area of genomics have led to the development of high throughput genotyping platforms that have immensely contributed to molecular breeding programs. Custom-designed single nucleotide polymorphism (SNP arrays provide an efficient, cost effective, high throughput genotyping tool for QTL/gene mapping, variety identification, marker-assisted selection, etc. In the current study, two interspecific libraries of Chromosome Segment Substitution Lines (CSSLs were evaluated under both drought and control conditions to identify lines with superior yield under drought. The CSSL libraries consisted of 48 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. rufipogon, and 32 BC4F3 lines derived from O. sativa cv. Curinga (tropical japonica x O. meridionalis. The phenotypic screening of these 80 CSSLs led to the identification of three lines, MER-20, RUF-16, and RUF-44, that yielded well under drought stress. This line was backcrossed with popular rice variety of India, Swarna-Sub1 to introgress wild chromosome segments responsible for reproductive stage drought tolerance. During backcrossing, tracking of wild introgressions and monitoring of recurrent parent genome recovery was facilitated by the use of the Cornell 6K and 7K Infinium rice SNP arrays. The 6K and 7K SNP arrays assayed 5275 SNPs and 7099 SNPs, respectively, distributed across the 12 chromosomes. In our populations of (MER-20X Swarna sub1 BC2F1 lines, 1775 SNPs were polymorphic using the 6K array. The percentage of recurrent parent genome in these backcrossed lines ranged from 33-92% and the percentage of wild donor genome ranged from 8-67%. Using genotypic selection, 5% of plants were identified for further marker assisted backcrossing, based on the presence of the target donor (wild segment and maximum recovery of recurrent parent background. In the next generation, BC3F1 lines were genotyped using the 7K SNP array, which identified 2521 polymorphic SNPs
Comparative Analysis of the Genomic DNA Isolation Methods on Inula sp. (Asteraceae

Directory of Open Access Journals (Sweden)

Emre SEVİNDİK

2016-12-01

Full Text Available Simple, fast, low-cost and high throughput protocols are required for DNA isolation of plant species. In this study, phenol chloroform isoamyl alcohol and commercial (Sigma DNA isolation kit methods were applied on some Inula species that belong to Asteraceae family. Genomic DNA amounts, A260, A280, A260/A230 and purity degrees (A260/A280 that were obtained through both methods were measured through electrophoresis and spectrophotometer. Additionally, PCR amplification was realized by primer pairs specific to nrDNA ITS, cpDNA ndhF (972F-1603R and trnL-F regions. Results showed that maximum genomic DNA in nanograms obtained by phenol chloroform isoamyl alcohol method. The study also revealed that I. macrocephala had the maximum DNA and I. heterolepis had the minimum DNA amount. A260/A280 purity degrees showed that the highest and lowest purity in gDNAs obtained through phenol-choloform isoamyl alcohol method were in I.aucheriana and I. salicina, respectively. The highest and lowest purity degrees of gDNAs obtained through commercial kit was observed in I. fragilis and I. macrocephala samples, respectively. PCR amplification results showed that while band profiles of each three regions (ITS, trnL-F and ndhF did not yield positive results in PCR amplifications using phenol-choloform isoamyl alcohol method; PCR band profiles obtained through commercial kit yielded positive results. As a result, it is fair to say that the relation of genomic DNA with PCR was found to be more efficient although the maximum amount of genomic DNA was obtained through phenol chloroform isoamyl alcohol method.
Quantification and genome-wide mapping of DNA double-strand breaks.

Science.gov (United States)

Grégoire, Marie-Chantal; Massonneau, Julien; Leduc, Frédéric; Arguin, Mélina; Brazeau, Marc-André; Boissonneault, Guylain

2016-12-01

DNA double-strand breaks (DSBs) represent a major threat to the genetic integrity of the cell. Knowing both their genome-wide distribution and number is important for a better assessment of genotoxicity at a molecular level. Available methods may have underestimated the extent of DSBs as they are based on markers specific to those undergoing active repair or may not be adapted for the large diversity of naturally occurring DNA ends. We have established conditions for an efficient first step of DNA nick and gap repair (NGR) allowing specific determination of DSBs by end labeling with terminal transferase. We used DNA extracted from HeLa cells harboring an I-SceI cassette to induce a targeted nick or DSB and demonstrated by immunocapture of 3'-OH that a prior step of NGR allows specific determination of loci-specific or genome wide DSBs. This method can be applied to the global determination of DSBs using radioactive end labeling and can find several applications aimed at understanding the distribution and kinetics of DSBs formation and repair. Copyright © 2016 Elsevier B.V. All rights reserved.
Separating metagenomic short reads into genomes via clustering

Directory of Open Access Journals (Sweden)

Tanaseichuk Olga

2012-09-01

Full Text Available Abstract Background The metagenomics approach allows the simultaneous sequencing of all genomes in an environmental sample. This results in high complexity datasets, where in addition to repeats and sequencing errors, the number of genomes and their abundance ratios are unknown. Recently developed next-generation sequencing (NGS technologies significantly improve the sequencing efficiency and cost. On the other hand, they result in shorter reads, which makes the separation of reads from different species harder. Among the existing computational tools for metagenomic analysis, there are similarity-based methods that use reference databases to align reads and composition-based methods that use composition patterns (i.e., frequencies of short words or l-mers to cluster reads. Similarity-based methods are unable to classify reads from unknown species without close references (which constitute the majority of reads. Since composition patterns are preserved only in significantly large fragments, composition-based tools cannot be used for very short reads, which becomes a significant limitation with the development of NGS. A recently proposed algorithm, AbundanceBin, introduced another method that bins reads based on predicted abundances of the genomes sequenced. However, it does not separate reads from genomes of similar abundance levels. Results In this work, we present a two-phase heuristic algorithm for separating short paired-end reads from different genomes in a metagenomic dataset. We use the observation that most of the l-mers belong to unique genomes when l is sufficiently large. The first phase of the algorithm results in clusters of l-mers each of which belongs to one genome. During the second phase, clusters are merged based on l-mer repeat information. These final clusters are used to assign reads. The algorithm could handle very short reads and sequencing errors. It is initially designed for genomes with similar abundance levels and then
Ion-channel genosensor for the detection of specific DNA sequences derived from Plum Pox Virus in plant extracts.

Science.gov (United States)

Malecka, Kamila; Michalczuk, Lech; Radecka, Hanna; Radecki, Jerzy

2014-10-09

A DNA biosensor for detection of specific oligonucleotides sequences of Plum Pox Virus (PPV) in plant extracts and buffer is proposed. The working principles of a genosensor are based on the ion-channel mechanism. The NH2-ssDNA probe was deposited onto a glassy carbon electrode surface to form an amide bond between the carboxyl group of oxidized electrode surface and amino group from ssDNA probe. The analytical signals generated as a result of hybridization were registered in Osteryoung square wave voltammetry in the presence of [Fe(CN)6]3-/4- as a redox marker. The 22-mer and 42-mer complementary ssDNA sequences derived from PPV and DNA samples from plants infected with PPV were used as targets. Similar detection limits of 2.4 pM (31.0 pg/mL) and 2.3 pM (29.5 pg/mL) in the concentration range 1-8 pM were observed in the presence of the 22-mer ssDNA and 42-mer complementary ssDNA sequences of PPV, respectively. The genosensor was capable of discriminating between samples consisting of extracts from healthy plants and leaf extracts from infected plants in the concentration range 10-50 pg/mL. The detection limit was 12.8 pg/mL. The genosensor displayed good selectivity and sensitivity. The 20-mer partially complementary DNA sequences with four complementary bases and DNA samples from healthy plants used as negative controls generated low signal.
Genome-wide survey of repetitive DNA elements in the button mushroom Agaricus bisporus

NARCIS (Netherlands)

Foulongne-Oriol, M.; Murat, C.; Castanera, R.; Ramírez, L.; Sonnenberg, A.S.M.

2013-01-01

Repetitive DNA elements are ubiquitous constituents of eukaryotic genomes. The biological roles of these repetitive elements, supposed to impact genome organization and evolution, are not completely elucidated yet. The availability of whole genome sequence offers the opportunity to draw a picture of
Evaluation of plasmid and genomic DNA calibrants used for the quantification of genetically modified organisms.

Science.gov (United States)

Caprioara-Buda, M; Meyer, W; Jeynov, B; Corbisier, P; Trapmann, S; Emons, H

2012-07-01

The reliable quantification of genetically modified organisms (GMOs) by real-time PCR requires, besides thoroughly validated quantitative detection methods, sustainable calibration systems. The latter establishes the anchor points for the measured value and the measurement unit, respectively. In this paper, the suitability of two types of DNA calibrants, i.e. plasmid DNA and genomic DNA extracted from plant leaves, for the certification of the GMO content in reference materials as copy number ratio between two targeted DNA sequences was investigated. The PCR efficiencies and coefficients of determination of the calibration curves as well as the measured copy number ratios for three powder certified reference materials (CRMs), namely ERM-BF415e (NK603 maize), ERM-BF425c (356043 soya), and ERM-BF427c (98140 maize), originally certified for their mass fraction of GMO, were compared for both types of calibrants. In all three systems investigated, the PCR efficiencies of plasmid DNA were slightly closer to the PCR efficiencies observed for the genomic DNA extracted from seed powders rather than those of the genomic DNA extracted from leaves. Although the mean DNA copy number ratios for each CRM overlapped within their uncertainties, the DNA copy number ratios were significantly different using the two types of calibrants. Based on these observations, both plasmid and leaf genomic DNA calibrants would be technically suitable as anchor points for the calibration of the real-time PCR methods applied in this study. However, the most suitable approach to establish a sustainable traceability chain is to fix a reference system based on plasmid DNA.
DNA repair efficiency in germ cells and early mouse embryos and consequences for radiation-induced transgenerational genomic damage

Energy Technology Data Exchange (ETDEWEB)

Marchetti, Francesco; Wyrobek, Andrew J.

2009-01-18

Exposure to ionizing radiation and other environmental agents can affect the genomic integrity of germ cells and induce adverse health effects in the progeny. Efficient DNA repair during gametogenesis and the early embryonic cycles after fertilization is critical for preventing transmission of DNA damage to the progeny and relies on maternal factors stored in the egg before fertilization. The ability of the maternal repair machinery to repair DNA damage in both parental genomes in the fertilizing egg is especially crucial for the fertilizing male genome that has not experienced a DNA repair-competent cellular environment for several weeks prior to fertilization. During the DNA repair-deficient period of spermatogenesis, DNA lesions may accumulate in sperm and be carried into the egg where, if not properly repaired, could result in the formation of heritable chromosomal aberrations or mutations and associated birth defects. Studies with female mice deficient in specific DNA repair genes have shown that: (i) cell cycle checkpoints are activated in the fertilized egg by DNA damage carried by the sperm; and (ii) the maternal genotype plays a major role in determining the efficiency of repairing genomic lesions in the fertilizing sperm and directly affect the risk for abnormal reproductive outcomes. There is also growing evidence that implicates DNA damage carried by the fertilizing gamete as a mediator of postfertilization processes that contribute to genomic instability in subsequent generations. Transgenerational genomic instability most likely involves epigenetic mechanisms or error-prone DNA repair processes in the early embryo. Maternal and embryonic DNA repair processes during the early phases of mammalian embryonic development can have far reaching consequences for the genomic integrity and health of subsequent generations.
Crucial optimization steps in getting premier quality of Aquilaria malaccensis genomic DNA for molecular activities

International Nuclear Information System (INIS)

Muhammad Hanif Azhari; Azhar Mohamad

2013-01-01

Gaharu resin is derived from Aquilaria sp. or Agar wood tree in a tropical ecosystem. In Malaysia the Aquilaria species especially A. malaccensis in danger of extinction in the wild due to illegal logging as its resin is highly used for the production of greatly valued incense throughout Asia. A significant tool in fingerprinting the species is through molecular activities of Polymerase Chain Reaction (PCR) application on which requires total genomic DNA as a starting material. This paper, described optimizations of both fresh and dried samples derived from A. malaccensis for genomic DNA. Three main parameters for the optimization were temperature (60, 65, and 70 degree Celsius), incubation period (30, 60, 90 minutes) and concentration of CTAB (1 %, 3 %, 5 %). The experimental design in these work resulted a total of 46 combinations of the parameters in which 0.5 g samples was used in each combination. Nano-drop spectrometer was used in detecting the quantitative genomic DNA at ng/ μl. In fresh samples, incubation temperatures at 65 degree Celsius for 60 minutes in 3 % were yielded 723.2 ng/ μl genomic DNA. Whereas, for dried samples, incubation temperature at 70 degree Celsius for 90 minutes in 5 % CTAB were yielded 70.2 ng/ μl of genomic DNA. Spectrometer reading at OD280/ 260 was 1.9 for both type of samples. The isolated genomic DNA is useful for the molecular activities to identify specific plants between the same species or among the Aquilaria species. (author)
Tobacco smoking leads to extensive genome-wide changes in DNA methylation.

Directory of Open Access Journals (Sweden)

Sonja Zeilinger

Full Text Available Environmental factors such as tobacco smoking may have long-lasting effects on DNA methylation patterns, which might lead to changes in gene expression and in a broader context to the development or progression of various diseases. We conducted an epigenome-wide association study (EWAs comparing current, former and never smokers from 1793 participants of the population-based KORA F4 panel, with replication in 479 participants from the KORA F3 panel, carried out by the 450K BeadChip with genomic DNA obtained from whole blood. We observed wide-spread differences in the degree of site-specific methylation (with p-values ranging from 9.31E-08 to 2.54E-182 as a function of tobacco smoking in each of the 22 autosomes, with the percent of variance explained by smoking ranging from 1.31 to 41.02. Depending on cessation time and pack-years, methylation levels in former smokers were found to be close to the ones seen in never smokers. In addition, methylation-specific protein binding patterns were observed for cg05575921 within AHRR, which had the highest level of detectable changes in DNA methylation associated with tobacco smoking (-24.40% methylation; p = 2.54E-182, suggesting a regulatory role for gene expression. The results of our study confirm the broad effect of tobacco smoking on the human organism, but also show that quitting tobacco smoking presumably allows regaining the DNA methylation state of never smokers.
Tobacco smoking leads to extensive genome-wide changes in DNA methylation.

Science.gov (United States)

Zeilinger, Sonja; Kühnel, Brigitte; Klopp, Norman; Baurecht, Hansjörg; Kleinschmidt, Anja; Gieger, Christian; Weidinger, Stephan; Lattka, Eva; Adamski, Jerzy; Peters, Annette; Strauch, Konstantin; Waldenberger, Melanie; Illig, Thomas

2013-01-01

Environmental factors such as tobacco smoking may have long-lasting effects on DNA methylation patterns, which might lead to changes in gene expression and in a broader context to the development or progression of various diseases. We conducted an epigenome-wide association study (EWAs) comparing current, former and never smokers from 1793 participants of the population-based KORA F4 panel, with replication in 479 participants from the KORA F3 panel, carried out by the 450K BeadChip with genomic DNA obtained from whole blood. We observed wide-spread differences in the degree of site-specific methylation (with p-values ranging from 9.31E-08 to 2.54E-182) as a function of tobacco smoking in each of the 22 autosomes, with the percent of variance explained by smoking ranging from 1.31 to 41.02. Depending on cessation time and pack-years, methylation levels in former smokers were found to be close to the ones seen in never smokers. In addition, methylation-specific protein binding patterns were observed for cg05575921 within AHRR, which had the highest level of detectable changes in DNA methylation associated with tobacco smoking (-24.40% methylation; p = 2.54E-182), suggesting a regulatory role for gene expression. The results of our study confirm the broad effect of tobacco smoking on the human organism, but also show that quitting tobacco smoking presumably allows regaining the DNA methylation state of never smokers.
Detection of γ-ray-induced DNA damages in malformed dominant lethal embryos of the Japanese medaka (Oryzias latipes) using AP-PCR fingerprinting

International Nuclear Information System (INIS)

Kubota, Yoshiko; Shimada, Atsuko; Shima, Akihiro

1992-01-01

Adult male fish of the medaka HNI strain exposed to 9.5 Gy or 19 Gy (0.95 Gy/min) of γ-rays were mated with non-irradiated female fish of the Hd-rR strain. Genomic DNA was prepared from malformed individual embryos which were expected to be dominant lethal and used for AP-PCR fingerprinting. By the use of a part of the T3 promoter sequence (20 mer), which is not found in the medaka genome as an arbitrary primer, polymorphisms were found in genomic fingerprints which could distinguish the parental strains. On the other hand, fingerprints of F1 hybrids were found to be the sum of those of their parents. Based on these findings, the fingerprints of genomic DNA of each severely malformed embryo were analyzed, because it was expected that radiation-induced genomic damages resulting in severe malformation and eventually in dominant lethals should be detected as changes in paternal fingerprints of F1 hybrids. Indeed, changes were found in genomic DNA as loss of some paternal bands in fingerprints of malformed embryos. One of 10 malformed embryos obtained from 9.5 Gy γ-irradiated males had lost 5 bands. These results indicated a possibility that quantitative as well as qualitative estimation of γ-ray-induced DNA damages can be made by this method which does not require the functional selection based on a specific target gene. (author). 16 refs., 3 figs., 1 tab
G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae.

Directory of Open Access Journals (Sweden)

John A Capra

2010-07-01

Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs. Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.
Ribosomal DNA sequence heterogeneity reflects intraspecies phylogenies and predicts genome structure in two contrasting yeast species.

Science.gov (United States)

West, Claire; James, Stephen A; Davey, Robert P; Dicks, Jo; Roberts, Ian N

2014-07-01

The ribosomal RNA encapsulates a wealth of evolutionary information, including genetic variation that can be used to discriminate between organisms at a wide range of taxonomic levels. For example, the prokaryotic 16S rDNA sequence is very widely used both in phylogenetic studies and as a marker in metagenomic surveys and the internal transcribed spacer region, frequently used in plant phylogenetics, is now recognized as a fungal DNA barcode. However, this widespread use does not escape criticism, principally due to issues such as difficulties in classification of paralogous versus orthologous rDNA units and intragenomic variation, both of which may be significant barriers to accurate phylogenetic inference. We recently analyzed data sets from the Saccharomyces Genome Resequencing Project, characterizing rDNA sequence variation within multiple strains of the baker's yeast Saccharomyces cerevisiae and its nearest wild relative Saccharomyces paradoxus in unprecedented detail. Notably, both species possess single locus rDNA systems. Here, we use these new variation datasets to assess whether a more detailed characterization of the rDNA locus can alleviate the second of these phylogenetic issues, sequence heterogeneity, while controlling for the first. We demonstrate that a strong phylogenetic signal exists within both datasets and illustrate how they can be used, with existing methodology, to estimate intraspecies phylogenies of yeast strains consistent with those derived from whole-genome approaches. We also describe the use of partial Single Nucleotide Polymorphisms, a type of sequence variation found only in repetitive genomic regions, in identifying key evolutionary features such as genome hybridization events and show their consistency with whole-genome Structure analyses. We conclude that our approach can transform rDNA sequence heterogeneity from a problem to a useful source of evolutionary information, enabling the estimation of highly accurate phylogenies of
DnaB gene product-independence of DNA polymerase III-directed repair synthesis in Escherichia coli K-12

International Nuclear Information System (INIS)

Billen, D.; Hellermann, G.R.

1977-01-01

An investigation has been carried out into the role of dnaB gene product in X-ray-induced repair synthesis carried out by DNA polymerase III in toluene-treated Escherichia coli K-12. A polAl polBlOO dnaB mutant deficient in both DNA polymerase I and II activities was used, and it was shown that the level of X-ray-induced, ATP-dependent, non-conservative DNA synthesis was, unlike semi-conservative DNA synthesis, unaffected by a temperature shift from 30 0 to 42 0 C. The dnaB gene product was not therefore necessary for DNA polymerase III-directed repair synthesis, which occurred in the absence of replicative synthesis. (U.K.)

Highly efficient PCR assay to discriminate allelic DNA methylation status using whole genome amplification

Directory of Open Access Journals (Sweden)

Ito Takashi

2011-06-01

Full Text Available Abstract Background We previously developed a simple method termed HpaII-McrBC PCR (HM-PCR to discriminate allelic methylation status of the genomic sites of interest, and successfully applied it to a comprehensive analysis of CpG islands (CGIs on human chromosome 21q. However, HM-PCR requires 200 ng of genomic DNA to examine one target site, thereby precluding its application to such samples that are limited in quantity. Findings We developed HpaII-McrBC whole-genome-amplification PCR (HM-WGA-PCR that uses whole-genome-amplified DNA as the template. HM-WGA-PCR uses only 1/100th the genomic template material required for HM-PCR. Indeed, we successfully analyzed 147 CGIs by HM-WGA-PCR using only ~300 ng of DNA, whereas previous HM-PCR study had required ~30 μg. Furthermore, we confirmed that allelic methylation status revealed by HM-WGA-PCR is identical to that by HM-PCR in every case of the 147 CGIs tested, proving high consistency between the two methods. Conclusions HM-WGA-PCR would serve as a reliable alternative to HM-PCR in the analysis of allelic methylation status when the quantity of DNA available is limited.
Non-Enzymatic Detection of Bacterial Genomic DNA Using the Bio-Barcode Assay

Science.gov (United States)

Hill, Haley D.; Vega, Rafael A.; Mirkin, Chad A.

2011-01-01

The detection of bacterial genomic DNA through a non-enzymatic nanomaterials based amplification method, the bio-barcode assay, is reported. The assay utilizes oligonucleotide functionalized magnetic microparticles to capture the target of interest from the sample. A critical step in the new assay involves the use of blocking oligonucleotides during heat denaturation of the double stranded DNA. These blockers bind to specific regions of the target DNA upon cooling, and prevent the duplex DNA from re-hybridizing, which allows the particle probes to bind. Following target isolation using the magnetic particles, oligonucleotide functionalized gold nanoparticles act as target recognition agents. The oligonucleotides on the nanoparticle (barcodes) act as amplification surrogates. The barcodes are then detected using the Scanometric method. The limit of detection for this assay was determined to be 2.5 femtomolar, and this is the first demonstration of a barcode type assay for the detection of double stranded, genomic DNA. PMID:17927207
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Science.gov (United States)

Macas, Jiří; Novák, Petr; Pellicer, Jaume; Čížková, Jana; Koblížková, Andrea; Neumann, Pavel; Fuková, Iva; Doležel, Jaroslav; Kelly, Laura J; Leitch, Ilia J

2015-01-01

The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57%) of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%). Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
In Depth Characterization of Repetitive DNA in 23 Plant Genomes Reveals Sources of Genome Size Variation in the Legume Tribe Fabeae.

Directory of Open Access Journals (Sweden)

Jiří Macas

Full Text Available The differential accumulation and elimination of repetitive DNA are key drivers of genome size variation in flowering plants, yet there have been few studies which have analysed how different types of repeats in related species contribute to genome size evolution within a phylogenetic context. This question is addressed here by conducting large-scale comparative analysis of repeats in 23 species from four genera of the monophyletic legume tribe Fabeae, representing a 7.6-fold variation in genome size. Phylogenetic analysis and genome size reconstruction revealed that this diversity arose from genome size expansions and contractions in different lineages during the evolution of Fabeae. Employing a combination of low-pass genome sequencing with novel bioinformatic approaches resulted in identification and quantification of repeats making up 55-83% of the investigated genomes. In turn, this enabled an analysis of how each major repeat type contributed to the genome size variation encountered. Differential accumulation of repetitive DNA was found to account for 85% of the genome size differences between the species, and most (57% of this variation was found to be driven by a single lineage of Ty3/gypsy LTR-retrotransposons, the Ogre elements. Although the amounts of several other lineages of LTR-retrotransposons and the total amount of satellite DNA were also positively correlated with genome size, their contributions to genome size variation were much smaller (up to 6%. Repeat analysis within a phylogenetic framework also revealed profound differences in the extent of sequence conservation between different repeat types across Fabeae. In addition to these findings, the study has provided a proof of concept for the approach combining recent developments in sequencing and bioinformatics to perform comparative analyses of repetitive DNAs in a large number of non-model species without the need to assemble their genomes.
Human Centered Design and Development for NASA's MerBoard

Science.gov (United States)

Trimble, Jay

2003-01-01

This viewgraph presentation provides an overview of the design and development process for NASA's MerBoard. These devices are large interactive display screens which can be shown on the user's computer, which will allow scientists in many locations to interpret and evaluate mission data in real-time. These tools are scheduled to be used during the 2003 Mars Exploration Rover (MER) expeditions. Topics covered include: mission overview, Mer Human Centered Computers, FIDO 2001 observations and MerBoard prototypes.
Effects of DNA mass on multiple displacement whole genome amplification and genotyping performance

Directory of Open Access Journals (Sweden)

Haque Kashif A

2005-09-01

Full Text Available Abstract Background Whole genome amplification (WGA promises to eliminate practical molecular genetic analysis limitations associated with genomic DNA (gDNA quantity. We evaluated the performance of multiple displacement amplification (MDA WGA using gDNA extracted from lymphoblastoid cell lines (N = 27 with a range of starting gDNA input of 1–200 ng into the WGA reaction. Yield and composition analysis of whole genome amplified DNA (wgaDNA was performed using three DNA quantification methods (OD, PicoGreen® and RT-PCR. Two panels of N = 15 STR (using the AmpFlSTR® Identifiler® panel and N = 49 SNP (TaqMan® genotyping assays were performed on each gDNA and wgaDNA sample in duplicate. gDNA and wgaDNA masses of 1, 4 and 20 ng were used in the SNP assays to evaluate the effects of DNA mass on SNP genotyping assay performance. A total of N = 6,880 STR and N = 56,448 SNP genotype attempts provided adequate power to detect differences in STR and SNP genotyping performance between gDNA and wgaDNA, and among wgaDNA produced from a range of gDNA templates inputs. Results The proportion of double-stranded wgaDNA and human-specific PCR amplifiable wgaDNA increased with increased gDNA input into the WGA reaction. Increased amounts of gDNA input into the WGA reaction improved wgaDNA genotyping performance. Genotype completion or genotype concordance rates of wgaDNA produced from all gDNA input levels were observed to be reduced compared to gDNA, although the reduction was not always statistically significant. Reduced wgaDNA genotyping performance was primarily due to the increased variance of allelic amplification, resulting in loss of heterozygosity or increased undetermined genotypes. MDA WGA produces wgaDNA from no template control samples; such samples exhibited substantial false-positive genotyping rates. Conclusion The amount of gDNA input into the MDA WGA reaction is a critical determinant of genotyping performance of wgaDNA. At least 10 ng of
Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparisons with Other Methods

International Nuclear Information System (INIS)

Wu, Liyou; Yi, T.Y.; Van Nostrand, Joy; Zhou, Jizhong

2010-01-01

Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site (Hanford Reach of the Columbia River (HRCR), 11 strains), Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.
Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

Energy Technology Data Exchange (ETDEWEB)

Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy; Zhou, Jizhong

2010-05-17

Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.
Genomic signal processing methods for computation of alignment-free distances from DNA sequences.

Science.gov (United States)

Borrayo, Ernesto; Mendizabal-Ruiz, E Gerardo; Vélez-Pérez, Hugo; Romo-Vázquez, Rebeca; Mendizabal, Adriana P; Morales, J Alejandro

2014-01-01

Genomic signal processing (GSP) refers to the use of digital signal processing (DSP) tools for analyzing genomic data such as DNA sequences. A possible application of GSP that has not been fully explored is the computation of the distance between a pair of sequences. In this work we present GAFD, a novel GSP alignment-free distance computation method. We introduce a DNA sequence-to-signal mapping function based on the employment of doublet values, which increases the number of possible amplitude values for the generated signal. Additionally, we explore the use of three DSP distance metrics as descriptors for categorizing DNA signal fragments. Our results indicate the feasibility of employing GAFD for computing sequence distances and the use of descriptors for characterizing DNA fragments.
AID to overcome the limitations of genomic information by introducing somatic DNA alterations.

Science.gov (United States)

Honjo, Tasuku; Muramatsu, Masamichi; Nagaoka, Hitoshi; Kinoshita, Kazuo; Shinkura, Reiko

2006-05-01

The immune system has adopted somatic DNA alterations to overcome the limitations of the genomic information. Activation induced cytidine deaminase (AID) is an essential enzyme to regulate class switch recombination (CSR), somatic hypermutation (SHM) and gene conversion (GC) of the immunoglobulin gene. AID is known to be required for DNA cleavage of S regions in CSR and V regions in SHM. However, its molecular mechanism is a focus of extensive debate. RNA editing hypothesis postulates that AID edits yet unknown mRNA, to generate specific endonucleases for CSR and SHM. By contrast, DNA deamination hypothesis assumes that AID deaminates cytosine in DNA, followed by DNA cleavage by base excision repair enzymes. We summarize the basic knowledge for molecular mechanisms for CSR and SHM and then discuss the importance of AID not only in the immune regulation but also in the genome instability.
A universal, rapid, and inexpensive method for genomic DNA ...

Indian Academy of Sciences (India)

MOHAMMED BAQUR SAHIB A. AL-SHUHAIB

gels, containing 7% glycerol, and 1×TBE buffer. The gels were run under 200 .... Inc. Germany, GeneaidTM DNA Isolation Kit, Geneaid. Biotech., New Taipei City, .... C. L. and Arsenos G. 2015 Comparison of eleven methods for genomic DNA ...
The structures of bovine herpesvirus 1 virion and concatemeric DNA: implications for cleavage and packaging of herpesvirus genomes

International Nuclear Information System (INIS)

Schynts, Frederic; McVoy, Michael A.; Meurens, Francois; Detry, Bruno; Epstein, Alberto L.; Thiry, Etienne

2003-01-01

Herpesvirus genomes are often characterized by the presence of direct and inverted repeats that delineate their grouping into six structural classes. Class D genomes consist of a long (L) segment and a short (S) segment. The latter is flanked by large inverted repeats. DNA replication produces concatemers of head-to-tail linked genomes that are cleaved into unit genomes during the process of packaging DNA into capsids. Packaged class D genomes are an equimolar mixture of two isomers in which S is in either of two orientations, presumably a consequence of homologous recombination between the inverted repeats. The L segment remains predominantly fixed in a prototype (P) orientation; however, low levels of genomes having inverted L (I L ) segments have been reported for some class D herpesviruses. Inefficient formation of class D I L genomes has been attributed to infrequent L segment inversion, but recent detection of frequent inverted L segments in equine herpesvirus 1 concatemers [Virology 229 (1997) 415-420] suggests that the defect may be at the level of cleavage and packaging rather than inversion. In this study, the structures of virion and concatemeric DNA of another class D herpesvirus, bovine herpesvirus 1, were determined. Virion DNA contained low levels of I L genomes, whereas concatemeric DNA contained significant amounts of L segments in both P and I L orientations. However, concatemeric termini exhibited a preponderance of L termini derived from P isomers which was comparable to the preponderance of P genomes found in virion DNA. Thus, the defect in formation of I L genomes appears to lie at the level of concatemer cleavage. These results have important implications for the mechanisms by which herpesvirus DNA cleavage and packaging occur
Facilitating the indirect detection of genomic DNA in an electrochemical DNA biosensor using magnetic nanoparticles and DNA ligase

Directory of Open Access Journals (Sweden)

Roozbeh Hushiarian

2015-12-01

This technique was found to be reliably repeatable. The indirect detection of genomic DNA using this method is significantly improved and showed high efficiency in small amounts of samples with the detection limit of 5.37 × 10−14 M.
Improved reproducibility in genome-wide DNA methylation analysis for PAXgene® fixed samples compared to restored FFPE DNA

DEFF Research Database (Denmark)

Andersen, Gitte Brinch; Hager, Henrik; Hansen, Lise Lotte

2014-01-01

Chip. Quantitative DNA methylation analysis demonstrated that the methylation profile in PAXgene-fixed tissues showed, in comparison with restored FFPE samples, a higher concordance with the profile detected in frozen samples. We demonstrate, for the first time, that DNA from PAXgene conserved tissue performs better......Formalin fixation has been the standard method for conservation of clinical specimens for decades. However, a major drawback is the high degradation of nucleic acids, which complicates its use in genome-wide analyses. Unbiased identification of biomarkers, however, requires genome-wide studies......, precluding the use of the valuable archives of specimens with long-term follow-up data. Therefore, restoration protocols for DNA from formalin-fixed and paraffin-embedded (FFPE) samples have been developed, although they are cost-intensive and time-consuming. An alternative to FFPE and snap...
Gross genomic damage measured by DNA image cytometry independently predicts gastric cancer patient survival

NARCIS (Netherlands)

Belien, J.A.M.; Buffart, T.E.; Gill, A.; Broeckaert, M.A.M.; Quirke, P.; Meijer, G.A.; Grabsch, H.

2009-01-01

BACKGROUND: DNA aneuploidy reflects gross genomic changes. It can be measured by flow cytometry (FCM-DNA) or image cytometry (ICM-DNA). In gastric cancer, the prevalence of DNA aneuploidy has been reported to range from 27 to 100%, with conflicting associations with clinicopathological variables.
Genome-Wide DNA Methylation Profiles of Phlegm-Dampness Constitution

Directory of Open Access Journals (Sweden)

Haiqiang Yao

2018-03-01

Full Text Available Background/Aims: Metabolic diseases are leading health concerns in today’s global society. In traditional Chinese medicine (TCM, one body type studied is the phlegm-dampness constitution (PC, which predisposes individuals to complex metabolic disorders. Genomic studies have revealed the potential metabolic disorders and the molecular features of PC. The role of epigenetics in the regulation of PC, however, is unknown. Methods: We analyzed a genome-wide DNA methylation in 12 volunteers using Illumina Infinium Human Methylation450 BeadChip on peripheral blood mononuclear cells (PBMCs. Eight volunteers had PC and 4 had balanced constitutions. Results: Methylation data indicated a genome-scale hyper-methylation pattern in PC. We located 288 differentially methylated probes (DMPs. A total of 256 genes were mapped, and some of these were metabolic-related. SQSTM1, DLGAP2 and DAB1 indicated diabetes mellitus; HOXC4 and SMPD3, obesity; and GRWD1 and ATP10A, insulin resistance. According to Ingenuity Pathway Analysis (IPA, differentially methylated genes were abundant in multiple metabolic pathways. Conclusion: Our results suggest the potential risk for metabolic disorders in individuals with PC. We also explain the clinical characteristics of PC with DNA methylation features.
Human β satellite DNA: Genomic organization and sequence definition of a class of highly repetitive tandem DNA

International Nuclear Information System (INIS)

Waye, J.S.; Willard, H.F.

1989-01-01

The authors describe a class of human repetitive DNA, called β satellite, that, at a most fundamental level, exists as tandem arrays of diverged ∼68-base-pair monomer repeat units. The monomer units are organized as distinct subsets, each characterized by a multimeric higher-order repeat unit that is tandemly reiterated and represents a recent unit of amplification. They have cloned, characterized, and determined the sequence of two β satellite higher-order repeat units: one located on chromosome 9, the other on the acrocentric chromosomes (13, 14, 15, 21, and 22) and perhaps other sites in the genome. Analysis by pulsed-field gel electrophoresis reveals that these tandem arrays are localized in large domains that are marked by restriction fragment length polymorphisms. In total, β-satellite sequences comprise several million base pairs of DNA in the human genome. Analysis of this DNA family should permit insights into the nature of chromosome-specific and nonspecific modes of satellite DNA evolution and provide useful tools for probing the molecular organization and concerted evolution of the acrocentric chromosomes
A Network of Multi-Tasking Proteins at the DNA Replication Fork Preserves Genome Stability.

Directory of Open Access Journals (Sweden)

2005-12-01

Full Text Available To elucidate the network that maintains high fidelity genome replication, we have introduced two conditional mutant alleles of DNA2, an essential DNA replication gene, into each of the approximately 4,700 viable yeast deletion mutants and determined the fitness of the double mutants. Fifty-six DNA2-interacting genes were identified. Clustering analysis of genomic synthetic lethality profiles of each of 43 of the DNA2-interacting genes defines a network (consisting of 322 genes and 876 interactions whose topology provides clues as to how replication proteins coordinate regulation and repair to protect genome integrity. The results also shed new light on the functions of the query gene DNA2, which, despite many years of study, remain controversial, especially its proposed role in Okazaki fragment processing and the nature of its in vivo substrates. Because of the multifunctional nature of virtually all proteins at the replication fork, the meaning of any single genetic interaction is inherently ambiguous. The multiplexing nature of the current studies, however, combined with follow-up supporting experiments, reveals most if not all of the unique pathways requiring Dna2p. These include not only Okazaki fragment processing and DNA repair but also chromatin dynamics.
Comparative Genomics of Regulation of Fatty Acid and Branched-chain Amino Acid Utilization in Proteobacteria

Energy Technology Data Exchange (ETDEWEB)

Kazakov, Alexey E.; Rodionov, Dmitry A.; Arkin, Adam Paul; Dubchak, Inna; Gelfand, Mikhail S.; Alm, Eric

2008-10-31

Bacteria can use branched-chain amino acids (ILV, i.e. isoleucine, leucine, valine) and fatty acids (FA) as sole carbon and energy sources convering ILV into acetyl-CoA, propanoyl-CoA and propionyl-CoA, respectively. In this work, we used the comparative genomic approach to identify candidate transcriptional factors and DNA motifs that control ILV and FA utilization pathways in proteobacteria. The metabolic regulons were characterized based on the identification and comparison of candidate transcription factor binding sites in groups of phylogenetically related genomes. The reconstructed ILV/FA regulatory network demonstrates considerable variability and involves six transcriptional factors from the MerR, TetR and GntR families binding to eleven distinct DNA motifs. The ILV degradation genes in gamma- and beta-proteobacteria are mainly regulated by anovel regulator from the MerR family (e.g., LiuR in Pseudomonas aeruginosa) (40 species), in addition, the TetR-type regulator LiuQ was identified in some beta-proteobacteria (8 species). Besides the core set of ILV utilization genes, the LiuR regulon in some lineages is expanded to include genes from other metabolic pathways, such as the glyoxylate shunt and glutamate synthase in the Shewanella species. The FA degradation genes are controlled by four regulators including FadR in gamma-proteobacteria (34 species), PsrA in gamma- and beta-proteobacteria (45 species), FadP in beta-proteobacteria (14 species), and LiuR orthologs in alpha-proteobacteria (22 species). The remarkable variability of the regulatory systems associated with the FA degradation pathway is discussed from the functional and evolutionary points of view.
Rapid discrimination and classification of the Lactobacillus plantarum group based on a partial dnaK sequence and DNA fingerprinting techniques.

Science.gov (United States)

Huang, Chien-Hsun; Lee, Fwu-Ling; Liou, Jong-Shian

2010-03-01

The Lactobacillus plantarum group comprises five very closely related species. Some species of this group are considered to be probiotic and widely applied in the food industry. In this study, we compared the use of two different molecular markers, the 16S rRNA and dnaK gene, for discriminating phylogenetic relationships amongst L. plantarum strains using sequencing and DNA fingerprinting. The average sequence similarity for the dnaK gene (89.2%) among five type strains was significantly less than that for the 16S rRNA (99.4%). This result demonstrates that the dnaK gene sequence provided higher resolution than the 16S rRNA and suggests that the dnaK could be used as an additional phylogenetic marker for L. plantarum. Species-specific profiles of the Lactobacillus strains were obtained with RAPD and RFLP methods. Our data indicate that phylogenetic relationships between these strains are easily resolved using sequencing of the dnaK gene or DNA fingerprinting assays.

Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome.

Science.gov (United States)

Rzhetsky, A; Gomez, S M

2001-10-01

Current growth in the field of genomics has provided a number of exciting approaches to the modeling of evolutionary mechanisms within the genome. Separately, dynamical and statistical analyses of networks such as the World Wide Web and the social interactions existing between humans have shown that these networks can exhibit common fractal properties-including the property of being scale-free. This work attempts to bridge these two fields and demonstrate that the fractal properties of molecular networks are linked to the fractal properties of their underlying genomes. We suggest a stochastic model capable of describing the evolutionary growth of metabolic or signal-transduction networks. This model generates networks that share important statistical properties (so-called scale-free behavior) with real molecular networks. In particular, the frequency of vertices connected to exactly k other vertices follows a power-law distribution. The shape of this distribution remains invariant to changes in network scale: a small subgraph has the same distribution as the complete graph from which it is derived. Furthermore, the model correctly predicts that the frequencies of distinct DNA and protein domains also follow a power-law distribution. Finally, the model leads to a simple equation linking the total number of different DNA and protein domains in a genome with both the total number of genes and the overall network topology. MatLab (MathWorks, Inc.) programs described in this manuscript are available on request from the authors. ar345@columbia.edu.
Effect of specific enzyme inhibitors on replication, total genome DNA repair and on gene-specific DNA repair after UV irradiation in CHO cells

Energy Technology Data Exchange (ETDEWEB)

Jones, J.C.; Stevsner, Tinna; Bohr, Vilhelm A. (National Cancer Institute, NIH, Bethesda, MD (USA). Division of Cancer Treatment, Laboratory of Molecular Pharmacology); Mattern, M.R. (Smith Kline Beecham Pharmaceuticals, King of Prussia, PA (USA). Department of Biomolecular Discovery)

1991-09-01

The effects were studied of some specific enzyme inhibitors on DNA repair and replication after UV damage in Chinese hamster ovary cells. The DNA repair was studied at the level of the average, overall genome and also in the active dihydrofolate reductase gene. Replication was measured in the overall genome. The inhibitors were tested of DNA poly-merase {alpha} and {delta} (aphidicolin), of poly(ADPr) polymerase (3-aminobenzamide), of ribonucleotide reductase (hydroxyurea), of topo-isomerase I (camptothecin), and of topoisomerase II (merbarone, VP-16). In addition, the effects were tested of the potential topoisomerase I activator, {beta}-lapachone. All of these compounds inhibited genome replication and all topoisomerase inhibitors affected the overall genome repair; {beta}-lapachone stimulated it. None of these compounds had any effect on the gene-specific repair. (author). 36 refs.; 3 figs.; 2 tabs.
MERS and SARS

Science.gov (United States)

... primary antibody and Goat anti-Rabbit 10 nm gold particles. Middle East Respiratory Syndrome (MERS-CoV) is ... standard mice or hamsters, scientists at NIAID Rocky Mountain Laboratories (RML) took the unusual step of moving ...
Extensive variation in the density and distribution of DNA polymorphism in sorghum genomes.

Directory of Open Access Journals (Sweden)

Joseph Evans

Full Text Available Sorghum genotypes currently used for grain production in the United States were developed from African landraces that were imported starting in the mid-to-late 19(th century. Farmers and plant breeders selected genotypes for grain production with reduced plant height, early flowering, increased grain yield, adaptation to drought, and improved resistance to lodging, diseases and pests. DNA polymorphisms that distinguish three historically important grain sorghum genotypes, BTx623, BTx642 and Tx7000, were characterized by genome sequencing, genotyping by sequencing, genetic mapping, and pedigree-based haplotype analysis. The distribution and density of DNA polymorphisms in the sequenced genomes varied widely, in part because the lines were derived through breeding and selection from diverse Kafir, Durra, and Caudatum race accessions. Genomic DNA spanning dw1 (SBI-09 and dw3 (SBI-07 had identical haplotypes due to selection for reduced height. Lower SNP density in genes located in pericentromeric regions compared with genes located in euchromatic regions is consistent with background selection in these regions of low recombination. SNP density was higher in euchromatic DNA and varied >100-fold in contiguous intervals that spanned up to 300 Kbp. The localized variation in DNA polymorphism density occurred throughout euchromatic regions where recombination is elevated, however, polymorphism density was not correlated with gene density or DNA methylation. Overall, sorghum chromosomes contain distal euchromatic regions characterized by extensive, localized variation in DNA polymorphism density, and large pericentromeric regions of low gene density, diversity, and recombination.
Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility.

Science.gov (United States)

Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna

2016-04-07

DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. Copyright © 2016 Vrljicak et al.
saSNP Approach for Scalable SNP Analyses of Multiple Bacterial or Viral Genomes

Energy Technology Data Exchange (ETDEWEB)

Gardner, Shea [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States); Slezak, Tom [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

2010-07-27

With the flood of whole genome finished and draft microbial sequences, we need faster, more scalable bioinformatics tools for sequence comparison. An algorithm is described to find single nucleotide polymorphisms (SNPs) in whole genome data. It scales to hundreds of bacterial or viral genomes, and can be used for finished and/or draft genomes available as unassembled contigs. The method is fast to compute, finding SNPs and building a SNP phylogeny in seconds to hours. We use it to identify thousands of putative SNPs from all publicly available Filoviridae, Poxviridae, foot-and-mouth disease virus, Bacillus, and Escherichia coli genomes and plasmids. The SNP-based trees that result are consistent with known taxonomy and trees determined in other studies. The approach we describe can handle as input hundreds of gigabases of sequence in a single run. The algorithm is based on k-mer analysis using a suffix array, so we call it saSNP.
A new method to cluster genomes based on cumulative Fourier power spectrum.

Science.gov (United States)

Dong, Rui; Zhu, Ziyue; Yin, Changchuan; He, Rong L; Yau, Stephen S-T

2018-06-20

Analyzing phylogenetic relationships using mathematical methods has always been of importance in bioinformatics. Quantitative research may interpret the raw biological data in a precise way. Multiple Sequence Alignment (MSA) is used frequently to analyze biological evolutions, but is very time-consuming. When the scale of data is large, alignment methods cannot finish calculation in reasonable time. Therefore, we present a new method using moments of cumulative Fourier power spectrum in clustering the DNA sequences. Each sequence is translated into a vector in Euclidean space. Distances between the vectors can reflect the relationships between sequences. The mapping between the spectra and moment vector is one-to-one, which means that no information is lost in the power spectra during the calculation. We cluster and classify several datasets including Influenza A, primates, and human rhinovirus (HRV) datasets to build up the phylogenetic trees. Results show that the new proposed cumulative Fourier power spectrum is much faster and more accurately than MSA and another alignment-free method known as k-mer. The research provides us new insights in the study of phylogeny, evolution, and efficient DNA comparison algorithms for large genomes. The computer programs of the cumulative Fourier power spectrum are available at GitHub (https://github.com/YaulabTsinghua/cumulative-Fourier-power-spectrum). Copyright © 2018. Published by Elsevier B.V.
DNA mismatch repair deficiency accelerates lung neoplasm development in K-rasLA1/+ mice: a brief report

International Nuclear Information System (INIS)

Downey, Charlene M; Jirik, Frank R

2015-01-01

Inherited as well as acquired deficiencies in specific DNA mismatch repair (MMR) components are associated with the development of a wide range of benign and malignant neoplasms. Loss of key members such as MSH2 and MLH1 severely cripples the ability of the cell to recognize and correct such lesions as base:base mismatches and replicative DNA polymerase errors such as slippages at repetitive sequences. Genomic instability resulting from MMR deficiency not only predisposes cells to malignant transformation but may also promote tumor progression. To test the latter, we interbred Msh2 −/− mice with the K-ras LA1/+ transgenic line that spontaneously develops a range of premalignant and malignant lung lesions. Compared to K-ras LA1/+ mice, K-ras LA1/+ ; Msh2 −/− mice developed lung adenomas and adenocarcinomas at an increased frequency and also demonstrated evidence of accelerated adenocarcinoma growth. Since MMR defects have been identified in some human lung cancers, the mutant mice may not only be of preclinical utility but they will also be useful in identifying gene alterations able to act in concert with Kras mutants to promote tumor progression
Quantifying the Number of Independent Organelle DNA Insertions in Genome Evolution and Human Health.

Science.gov (United States)

Hazkani-Covo, Einat; Martin, William F

2017-05-01

Fragments of organelle genomes are often found as insertions in nuclear DNA. These fragments of mitochondrial DNA (numts) and plastid DNA (nupts) are ubiquitous components of eukaryotic genomes. They are, however, often edited out during the genome assembly process, leading to systematic underestimation of their frequency. Numts and nupts, once inserted, can become further fragmented through subsequent insertion of mobile elements or other recombinational events that disrupt the continuity of the inserted sequence relative to the genuine organelle DNA copy. Because numts and nupts are typically identified through sequence comparison tools such as BLAST, disruption of insertions into smaller fragments can lead to systematic overestimation of numt and nupt frequencies. Accurate identification of numts and nupts is important, however, both for better understanding of their role during evolution, and for monitoring their increasingly evident role in human disease. Human populations are polymorphic for 141 numt loci, five numts are causal to genetic disease, and cancer genomic studies are revealing an abundance of numts associated with tumor progression. Here, we report investigation of salient parameters involved in obtaining accurate estimates of numt and nupt numbers in genome sequence data. Numts and nupts from 44 sequenced eukaryotic genomes reveal lineage-specific differences in the number, relative age and frequency of insertional events as well as lineage-specific dynamics of their postinsertional fragmentation. Our findings outline the main technical parameters influencing accurate identification and frequency estimation of numts in genomic studies pertinent to both evolution and human health. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
"Isogaba Maware": quality control of genome DNA by checkpoints.

Science.gov (United States)

Kitazono, A; Matsumoto, T

1998-05-01

Checkpoints maintain the interdependency of cell cycle events by permitting the onset of an event only after the completion of the preceding event. The DNA replication checkpoint induces a cell cycle arrest until the completion of the DNA replication. Similarly, the DNA damage checkpoint arrests cell cycle progression if DNA repair is incomplete. A number of genes that play a role in the two checkpoints have been identified through genetic studies in yeasts, and their homologues have been found in fly, mouse, and human. They form signaling cascades activated by a DNA replication block or DNA damage and subsequently generate the negative constraints on cell cycle regulators. The failure of these signaling cascades results in producing offspring that carry mutations or that lack a portion of the genome. In humans, defects in the checkpoints are often associated with cancer-prone diseases. Focusing mainly on the studies in budding and fission yeasts, we summarize the recent progress.
Tetrahedral gray code for visualization of genome information.

Directory of Open Access Journals (Sweden)

Natsuhiro Ichinose

Full Text Available We propose a tetrahedral Gray code that facilitates visualization of genome information on the surfaces of a tetrahedron, where the relative abundance of each [Formula: see text]-mer in the genomic sequence is represented by a color of the corresponding cell of a triangular lattice. For biological significance, the code is designed such that the [Formula: see text]-mers corresponding to any adjacent pair of cells differ from each other by only one nucleotide. We present a simple procedure to draw such a pattern on the development surfaces of a tetrahedron. The thus constructed tetrahedral Gray code can demonstrate evolutionary conservation and variation of the genome information of many organisms at a glance. We also apply the tetrahedral Gray code to the honey bee (Apis mellifera genome to analyze its methylation structure. The results indicate that the honey bee genome exhibits CpG overrepresentation in spite of its methylation ability and that two conserved motifs, CTCGAG and CGCGCG, in the unmethylated regions are responsible for the overrepresentation of CpG.
Real-time sequence-validated loop-mediated isothermal amplification assays for detection of Middle East respiratory syndrome coronavirus (MERS-CoV.

Directory of Open Access Journals (Sweden)

Sanchita Bhadra

Full Text Available The Middle East respiratory syndrome coronavirus (MERS-CoV, an emerging human coronavirus, causes severe acute respiratory illness with a 35% mortality rate. In light of the recent surge in reported infections we have developed asymmetric five-primer reverse transcription loop-mediated isothermal amplification (RT-LAMP assays for detection of MERS-CoV. Isothermal amplification assays will facilitate the development of portable point-of-care diagnostics that are crucial for management of emerging infections. The RT-LAMP assays are designed to amplify MERS-CoV genomic loci located within the open reading frame (ORF1a and ORF1b genes and upstream of the E gene. Additionally we applied one-step strand displacement probes (OSD for real-time sequence-specific verification of LAMP amplicons. Asymmetric amplification effected by incorporating a single loop primer in each assay accelerated the time-to-result of the OSD-RT-LAMP assays. The resulting assays could detect 0.02 to 0.2 plaque forming units (PFU (5 to 50 PFU/ml of MERS-CoV in infected cell culture supernatants within 30 to 50 min and did not cross-react with common human respiratory pathogens.
Characterization of noncoding regulatory DNA in the human genome.

Science.gov (United States)

Elkon, Ran; Agami, Reuven

2017-08-08

Genetic variants associated with common diseases are usually located in noncoding parts of the human genome. Delineation of the full repertoire of functional noncoding elements, together with efficient methods for probing their biological roles, is therefore of crucial importance. Over the past decade, DNA accessibility and various epigenetic modifications have been associated with regulatory functions. Mapping these features across the genome has enabled researchers to begin to document the full complement of putative regulatory elements. High-throughput reporter assays to probe the functions of regulatory regions have also been developed but these methods separate putative regulatory elements from the chromosome so that any effects of chromatin context and long-range regulatory interactions are lost. Definitive assignment of function(s) to putative cis-regulatory elements requires perturbation of these elements. Genome-editing technologies are now transforming our ability to perturb regulatory elements across entire genomes. Interpretation of high-throughput genetic screens that incorporate genome editors might enable the construction of an unbiased map of functional noncoding elements in the human genome.
Ribosomal RNA Genes Contribute to the Formation of Pseudogenes and Junk DNA in the Human Genome.

Science.gov (United States)

Robicheau, Brent M; Susko, Edward; Harrigan, Amye M; Snyder, Marlene

2017-02-01

Approximately 35% of the human genome can be identified as sequence devoid of a selected-effect function, and not derived from transposable elements or repeated sequences. We provide evidence supporting a known origin for a fraction of this sequence. We show that: 1) highly degraded, but near full length, ribosomal DNA (rDNA) units, including both 45S and Intergenic Spacer (IGS), can be found at multiple sites in the human genome on chromosomes without rDNA arrays, 2) that these rDNA sequences have a propensity for being centromere proximal, and 3) that sequence at all human functional rDNA array ends is divergent from canonical rDNA to the point that it is pseudogenic. We also show that small sequence strings of rDNA (from 45S + IGS) can be found distributed throughout the genome and are identifiable as an "rDNA-like signal", representing 0.26% of the q-arm of HSA21 and ∼2% of the total sequence of other regions tested. The size of sequence strings found in the rDNA-like signal intergrade into the size of sequence strings that make up the full-length degrading rDNA units found scattered throughout the genome. We conclude that the displaced and degrading rDNA sequences are likely of a similar origin but represent different stages in their evolution towards random sequence. Collectively, our data suggests that over vast evolutionary time, rDNA arrays contribute to the production of junk DNA. The concept that the production of rDNA pseudogenes is a by-product of concerted evolution represents a previously under-appreciated process; we demonstrate here its importance. © The Author(s) 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

Directory of Open Access Journals (Sweden)

Can Alkan

2007-09-01

Full Text Available The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.
Organization and evolution of primate centromeric DNA from whole-genome shotgun sequence data.

Science.gov (United States)

Alkan, Can; Ventura, Mario; Archidiacono, Nicoletta; Rocchi, Mariano; Sahinalp, S Cenk; Eichler, Evan E

2007-09-01

The major DNA constituent of primate centromeres is alpha satellite DNA. As much as 2%-5% of sequence generated as part of primate genome sequencing projects consists of this material, which is fragmented or not assembled as part of published genome sequences due to its highly repetitive nature. Here, we develop computational methods to rapidly recover and categorize alpha-satellite sequences from previously uncharacterized whole-genome shotgun sequence data. We present an algorithm to computationally predict potential higher-order array structure based on paired-end sequence data and then experimentally validate its organization and distribution by experimental analyses. Using whole-genome shotgun data from the human, chimpanzee, and macaque genomes, we examine the phylogenetic relationship of these sequences and provide further support for a model for their evolution and mutation over the last 25 million years. Our results confirm fundamental differences in the dispersal and evolution of centromeric satellites in the Old World monkey and ape lineages of evolution.
Amplification of the Kaposi's sarcoma-associated herpesvirus/human herpesvirus 8 lytic origin of DNA replication is dependent upon a cis-acting AT-rich region and an ORF50 response element and the trans-acting factors ORF50 (K-Rta) and K8 (K-bZIP)

International Nuclear Information System (INIS)

AuCoin, David P.; Colletti, Kelly S.; Cei, Sylvia A.; Papouskova, Iva; Tarrant, Margaret; Pari, Gregory S.

2004-01-01

Kaposi's sarcoma-associated herpesvirus (KSHV), also known as human herpesvirus 8 (HHV8), has significant sequence homology to Epstein-Barr virus (EBV). In cell culture, HHV8 is primarily latent, and viral genes associated with lytic replication are not expressed. Two lytic origins of DNA replication (oriLyt) are present within the HHV8 genome and are composed of an AT-rich region adjacent to GC-rich DNA sequences. We have now identified essential cis- and trans-acting elements required for oriLyt-dependent DNA replication. The transient replication assay was used to show that two AT-rich elements, three consensus AP1 transcription factor-binding sites, an ORF50 response element (RE), and a consensus TATA box motif are essential for efficient origin-dependent DNA replication. Transient transfection of luciferase reporter constructs indicated that the downstream region of the HHV8 oriLyt responds to ORF50 and suggests that part of the oriLyt may be an enhancer/promoter. In addition, a transient cotransfection-replication assay elucidated the set of trans-acting factors required for lytic DNA replication. These factors consist of homologues to the core replication proteins: ORF6 (ssDNA binding protein), ORF9 (DNA polymerase), ORF40-41 (primase-associated factor), ORF44 (helicase), ORF56 (primase), and ORF59 (polymerase processivity factor) common to all herpesviruses along with ORF50 (K-Rta) and K8 (K-bZIP)
In vitro analysis of integrated global high-resolution DNA methylation profiling with genomic imbalance and gene expression in osteosarcoma.

Directory of Open Access Journals (Sweden)

Bekim Sadikovic

Full Text Available Genetic and epigenetic changes contribute to deregulation of gene expression and development of human cancer. Changes in DNA methylation are key epigenetic factors regulating gene expression and genomic stability. Recent progress in microarray technologies resulted in developments of high resolution platforms for profiling of genetic, epigenetic and gene expression changes. OS is a pediatric bone tumor with characteristically high level of numerical and structural chromosomal changes. Furthermore, little is known about DNA methylation changes in OS. Our objective was to develop an integrative approach for analysis of high-resolution epigenomic, genomic, and gene expression profiles in order to identify functional epi/genomic differences between OS cell lines and normal human osteoblasts. A combination of Affymetrix Promoter Tilling Arrays for DNA methylation, Agilent array-CGH platform for genomic imbalance and Affymetrix Gene 1.0 platform for gene expression analysis was used. As a result, an integrative high-resolution approach for interrogation of genome-wide tumour-specific changes in DNA methylation was developed. This approach was used to provide the first genomic DNA methylation maps, and to identify and validate genes with aberrant DNA methylation in OS cell lines. This first integrative analysis of global cancer-related changes in DNA methylation, genomic imbalance, and gene expression has provided comprehensive evidence of the cumulative roles of epigenetic and genetic mechanisms in deregulation of gene expression networks.
DNA dynamics is likely to be a factor in the genomic nucleotide repeats expansions related to diseases.

Directory of Open Access Journals (Sweden)

Boian S Alexandrov

Full Text Available Trinucleotide repeats sequences (TRS represent a common type of genomic DNA motif whose expansion is associated with a large number of human diseases. The driving molecular mechanisms of the TRS ongoing dynamic expansion across generations and within tissues and its influence on genomic DNA functions are not well understood. Here we report results for a novel and notable collective breathing behavior of genomic DNA of tandem TRS, leading to propensity for large local DNA transient openings at physiological temperature. Our Langevin molecular dynamics (LMD and Markov Chain Monte Carlo (MCMC simulations demonstrate that the patterns of openings of various TRSs depend specifically on their length. The collective propensity for DNA strand separation of repeated sequences serves as a precursor for outsized intermediate bubble states independently of the G/C-content. We report that repeats have the potential to interfere with the binding of transcription factors to their consensus sequence by altered DNA breathing dynamics in proximity of the binding sites. These observations might influence ongoing attempts to use LMD and MCMC simulations for TRS-related modeling of genomic DNA functionality in elucidating the common denominators of the dynamic TRS expansion mutation with potential therapeutic applications.
Study in mutation of alfalfa genome DNA due to low energy N+ implantation using RAPD

International Nuclear Information System (INIS)

Chen Roulei; Song Daojun; Yu Zengliang; Li Yufeng; Liang Yunzhang

2001-01-01

After implanted by various dosage N + beams, germination rate of alfalfa seeds appears to be saddle line with dosage increasing. The authors have studied in mutation of genome DNA due to low energy N + implantation, and concluded that 30 differential DNA fragments have been amplified by 8 primers (S 41 , S 42 , S 45 , S 46 , S 50 , S 52 , S 56 , S 58 ) in 100 primers, moreover, number of differential DNA fragments between CK and treatments increases with dosage. Consequently, low energy ion implantation can cause mutation of alfalfa genome DNA. The more dosage it is, the more mutation alfalfa will be

Undermethylated DNA as a source of microsatellites from a conifer genome.

Science.gov (United States)

Zhou, Y; Bui, T; Auckland, L D; Williams, C G

2002-02-01

Developing microsatellites from the large, highly duplicated conifer genome requires special tools. To improve the efficiency of developing Pinus taeda L. microsatellites, undermethylated (UM) DNA fragments were used to construct a microsatellite-enriched copy library. A methylation-sensitive restriction enzyme, McrBC, was used to enrich for UM DNA before library construction. Digested DNA fragments larger than 9 kb were then excised and digested with RsaI and used to construct nine dinucleotide and trinucleotide libraries. A total of 1016 microsatellite-positive clones were detected among 11 904 clones and 620 of these were unique. Of 245 primer sets that produced a PCR product, 113 could be developed as UM microsatellite markers and 70 were polymorphic. Inheritance and marker informativeness were tested for a random sample of 36 polymorphic markers using a three-generation outbred pedigree. Thirty-one microsatellites (86%) had single-locus inheritance despite the highly duplicated nature of the P. taeda genome. Nineteen UM microsatellites had highly informative intercross mating type configurations. Allele number and frequency were estimated for eleven UM microsatellites using a population survey. Allele numbers for these UM microsatellites ranged from 3 to 12 with an average of 5.7 alleles/locus. Frequencies for the 63 alleles were mostly in the low-common range; only 14 of the 63 were in the rare allele (q < 0.05) class. Enriching for UM DNA was an efficient method for developing polymorphic microsatellites from a large plant genome.
Archaeal Genome Guardians Give Insights into Eukaryotic DNA Replication and Damage Response Proteins

Directory of Open Access Journals (Sweden)

David S. Shin

2014-01-01

Full Text Available As the third domain of life, archaea, like the eukarya and bacteria, must have robust DNA replication and repair complexes to ensure genome fidelity. Archaea moreover display a breadth of unique habitats and characteristics, and structural biologists increasingly appreciate these features. As archaea include extremophiles that can withstand diverse environmental stresses, they provide fundamental systems for understanding enzymes and pathways critical to genome integrity and stress responses. Such archaeal extremophiles provide critical data on the periodic table for life as well as on the biochemical, geochemical, and physical limitations to adaptive strategies allowing organisms to thrive under environmental stress relevant to determining the boundaries for life as we know it. Specifically, archaeal enzyme structures have informed the architecture and mechanisms of key DNA repair proteins and complexes. With added abilities to temperature-trap flexible complexes and reveal core domains of transient and dynamic complexes, these structures provide insights into mechanisms of maintaining genome integrity despite extreme environmental stress. The DNA damage response protein structures noted in this review therefore inform the basis for genome integrity in the face of environmental stress, with implications for all domains of life as well as for biomanufacturing, astrobiology, and medicine.
Epigenetic changes of Arabidopsis genome associated with altered DNA methyltransferase and demethylase expressions after gamma irradiation

International Nuclear Information System (INIS)

Kim, Ji Eun; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin Hong

2012-01-01

DNA methylation at carbon 5 of cytosines is a hall mark of epigenetic inactivation and heterochromatin in both plants and mammals. In Arabidopsis, DNA methylation has two roles that protect the genome from selfish DNA elements and regulate gene expression. Plant genome has three types of DNA methyltransferase, METHYLTRANSFERASE 1 (MET1), DOMAINREARRANGED METHYLASE (DRM) and CHROMOMETHYLASE 3 (CMT3) that are capable of methylating CG, CHG (where H is A, T, or C) and CHH sites, respectively. MET1 is a maintenance DNA methyltransferase that controls CG methylation. Two members of the DRM family, DRM1 and DRM2, are responsible for de novo methylation of CG, CHG, and CHH sites but show a preference for CHH sites. Finally, CMT3 principally carries out CHG methylation and is involved in both de novo methylation and maintenance. Alternatively, active DNA demethylation may occur through the glycosylase activity by removing the methylcytosines from DNA. It may have essential roles in preventing transcriptional silencing of transgenes and endogenous genes and in activating the expression of imprinted genes. DNA demetylation in Arabidopsis is mediated by the DEMETER (DME) family of bifunctional DNA glycosylase. Three targets of DME are MEA (MEDEA), FWA (FLOWERING WAGENINGEN), and FIS2 (FERTILIZATION INDEPENDENT SEED 2). The DME family contains DEMETER-LIKE 2 (DML2), DML3, and REPRESSOR OF SILENING 1 (ROS1). DNA demetylation by ROS1, DML2, and DML3 protect the hypermethylation of specific genome loci. ROS1 is necessary to suppress the promoter methylation and the silencing of endogenous genes. In contrast, the function of DML2 and DML3 has not been reported. Several recent studies have suggested that epigenetic alterations such as change in DNA methylation and histone modification should be caused in plant genomes upon exposure to ionizing radiation. However, there is a lack of data exploring the underlying mechanisms. Therefore, the present study aims to characterize and
Epigenetic changes of Arabidopsis genome associated with altered DNA methyltransferase and demethylase expressions after gamma irradiation

Energy Technology Data Exchange (ETDEWEB)

Kim, Ji Eun; Cho, Eun Ju; Kim, Ji Hong; Chung, Byung Yeoup; Kim, Jin Hong [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

2012-05-15

DNA methylation at carbon 5 of cytosines is a hall mark of epigenetic inactivation and heterochromatin in both plants and mammals. In Arabidopsis, DNA methylation has two roles that protect the genome from selfish DNA elements and regulate gene expression. Plant genome has three types of DNA methyltransferase, METHYLTRANSFERASE 1 (MET1), DOMAINREARRANGED METHYLASE (DRM) and CHROMOMETHYLASE 3 (CMT3) that are capable of methylating CG, CHG (where H is A, T, or C) and CHH sites, respectively. MET1 is a maintenance DNA methyltransferase that controls CG methylation. Two members of the DRM family, DRM1 and DRM2, are responsible for de novo methylation of CG, CHG, and CHH sites but show a preference for CHH sites. Finally, CMT3 principally carries out CHG methylation and is involved in both de novo methylation and maintenance. Alternatively, active DNA demethylation may occur through the glycosylase activity by removing the methylcytosines from DNA. It may have essential roles in preventing transcriptional silencing of transgenes and endogenous genes and in activating the expression of imprinted genes. DNA demetylation in Arabidopsis is mediated by the DEMETER (DME) family of bifunctional DNA glycosylase. Three targets of DME are MEA (MEDEA), FWA (FLOWERING WAGENINGEN), and FIS2 (FERTILIZATION INDEPENDENT SEED 2). The DME family contains DEMETER-LIKE 2 (DML2), DML3, and REPRESSOR OF SILENING 1 (ROS1). DNA demetylation by ROS1, DML2, and DML3 protect the hypermethylation of specific genome loci. ROS1 is necessary to suppress the promoter methylation and the silencing of endogenous genes. In contrast, the function of DML2 and DML3 has not been reported. Several recent studies have suggested that epigenetic alterations such as change in DNA methylation and histone modification should be caused in plant genomes upon exposure to ionizing radiation. However, there is a lack of data exploring the underlying mechanisms. Therefore, the present study aims to characterize and
Sampling the genomic pool of protein tyrosine kinase genes using the polymerase chain reaction with genomic DNA.

Science.gov (United States)

Oates, A C; Wollberg, P; Achen, M G; Wilks, A F

1998-08-28

The polymerase chain reaction (PCR), with cDNA as template, has been widely used to identify members of protein families from many species. A major limitation of using cDNA in PCR is that detection of a family member is dependent on temporal and spatial patterns of gene expression. To circumvent this restriction, and in order to develop a technique that is broadly applicable we have tested the use of genomic DNA as PCR template to identify members of protein families in an expression-independent manner. This test involved amplification of DNA encoding protein tyrosine kinase (PTK) genes from the genomes of three animal species that are well known development models; namely, the mouse Mus musculus, the fruit fly Drosophila melanogaster, and the nematode worm Caenorhabditis elegans. Ten PTK genes were identified from the mouse, 13 from the fruit fly, and 13 from the nematode worm. Among these kinases were 13 members of the PTK family that had not been reported previously. Selected PTKs from this screen were shown to be expressed during development, demonstrating that the amplified fragments did not arise from pseudogenes. This approach will be useful for the identification of many novel members of gene families in organisms of agricultural, medical, developmental and evolutionary significance and for analysis of gene families from any species, or biological sample whose habitat precludes the isolation of mRNA. Furthermore, as a tool to hasten the discovery of members of gene families that are of particular interest, this method offers an opportunity to sample the genome for new members irrespective of their expression pattern.
Genome stability: recent insights in the topoisomerase reverse gyrase and thermophilic DNA alkyltransferase.

Science.gov (United States)

Vettone, Antonella; Perugino, Giuseppe; Rossi, Mosè; Valenti, Anna; Ciaramella, Maria

2014-09-01

Repair and defence of genome integrity from endogenous and environmental hazard is a primary need for all organisms. Natural selection has driven the evolution of multiple cell pathways to deal with different DNA damaging agents. Failure of such processes can hamper cell functions and induce inheritable mutations, which in humans may cause cancerogenicity or certain genetic syndromes, and ultimately cell death. A special case is that of hyperthermophilic bacteria and archaea, flourishing at temperatures higher than 80 °C, conditions that favor genome instability and thus call for specific, highly efficient or peculiar mechanisms to keep their genome intact and functional. Over the last few years, numerous studies have been performed on the activity, function, regulation, physical and functional interaction of enzymes and proteins from hyperthermophilic microorganisms that are able to bind, repair, bypass damaged DNA, or modify its structure or conformation. The present review is focused on two enzymes that act on DNA catalyzing unique reactions: reverse gyrase and DNA alkyltransferase. Although both enzymes belong to evolutionary highly conserved protein families present in organisms of the three domains (Eucarya, Bacteria and Archaea), recently characterized members from hyperthermophilic archaea show both common and peculiar features.
Recent advances in the genome-wide study of DNA replication origins in yeast

Directory of Open Access Journals (Sweden)

Chong ePeng

2015-02-01

Full Text Available DNA replication, one of the central events in the cell cycle, is the basis of biological inheritance. In order to be duplicated, a DNA double helix must be opened at defined sites, which are called DNA replication origins (ORIs. Unlike in bacteria, where replication initiates from a single replication origin, multiple origins are utilized in the eukaryotic genome. Among them, the ORIs in budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe have been best characterized. In recent years, advances in DNA microarray and next-generation sequencing technologies have increased the number of yeast species involved in ORIs research dramatically. The ORIs in some nonconventional yeast species such as Kluyveromyces lactis and Pichia pastoris have also been genome-widely identified. Relevant databases of replication origins in yeast were constructed, then the comparative genomic analysis can be carried out. Here, we review several experimental approaches that have been used to map replication origins in yeast and some of the available web resources related to yeast ORIs. We also discuss the sequence characteristics and chromosome structures of ORIs in the four yeast species, which can be utilized to improve the replication origins prediction.
Recent advances in the genome-wide study of DNA replication origins in yeast

Science.gov (United States)

Peng, Chong; Luo, Hao; Zhang, Xi; Gao, Feng

2015-01-01

DNA replication, one of the central events in the cell cycle, is the basis of biological inheritance. In order to be duplicated, a DNA double helix must be opened at defined sites, which are called DNA replication origins (ORIs). Unlike in bacteria, where replication initiates from a single replication origin, multiple origins are utilized in the eukaryotic genomes. Among them, the ORIs in budding yeast Saccharomyces cerevisiae and the fission yeast Schizosaccharomyces pombe have been best characterized. In recent years, advances in DNA microarray and next-generation sequencing technologies have increased the number of yeast species involved in ORIs research dramatically. The ORIs in some non-conventional yeast species such as Kluyveromyces lactis and Pichia pastoris have also been genome-widely identified. Relevant databases of replication origins in yeast were constructed, then the comparative genomic analysis can be carried out. Here, we review several experimental approaches that have been used to map replication origins in yeast and some of the available web resources related to yeast ORIs. We also discuss the sequence characteristics and chromosome structures of ORIs in the four yeast species, which can be utilized to improve yeast replication origins prediction. PMID:25745419
Comparison of microbial DNA enrichment tools for metagenomic whole genome sequencing.

Science.gov (United States)

Thoendel, Matthew; Jeraldo, Patricio R; Greenwood-Quaintance, Kerryl E; Yao, Janet Z; Chia, Nicholas; Hanssen, Arlen D; Abdel, Matthew P; Patel, Robin

2016-08-01

Metagenomic whole genome sequencing for detection of pathogens in clinical samples is an exciting new area for discovery and clinical testing. A major barrier to this approach is the overwhelming ratio of human to pathogen DNA in samples with low pathogen abundance, which is typical of most clinical specimens. Microbial DNA enrichment methods offer the potential to relieve this limitation by improving this ratio. Two commercially available enrichment kits, the NEBNext Microbiome DNA Enrichment Kit and the Molzym MolYsis Basic kit, were tested for their ability to enrich for microbial DNA from resected arthroplasty component sonicate fluids from prosthetic joint infections or uninfected sonicate fluids spiked with Staphylococcus aureus. Using spiked uninfected sonicate fluid there was a 6-fold enrichment of bacterial DNA with the NEBNext kit and 76-fold enrichment with the MolYsis kit. Metagenomic whole genome sequencing of sonicate fluid revealed 13- to 85-fold enrichment of bacterial DNA using the NEBNext enrichment kit. The MolYsis approach achieved 481- to 9580-fold enrichment, resulting in 7 to 59% of sequencing reads being from the pathogens known to be present in the samples. These results demonstrate the usefulness of these tools when testing clinical samples with low microbial burden using next generation sequencing. Copyright © 2016 Elsevier B.V. All rights reserved.
Two efficient methods for isolation of high-quality genomic DNA from entomopathogenic fungi.

Science.gov (United States)

Serna-Domínguez, María G; Andrade-Michel, Gilda Y; Arredondo-Bernal, Hugo C; Gallou, Adrien

2018-03-27

Conventional and commercial methods for isolation of nucleic acids are available for fungal samples including entomopathogenic fungi (EPF). However, there is not a unique optimal method for all organisms. The cell wall structure and the wide range of secondary metabolites of EPF can broadly interfere with the efficiency of the DNA extraction protocol. This study compares three commercial protocols: DNeasy® Plant Mini Kit (Qiagen), Wizard® Genomic DNA Purification Kit (Promega), and Axygen™ Multisource Genomic DNA Miniprep Kit (Axygen) and three conventional methods based on different buffers: SDS, CTAB/PVPP, and CTAB/β-mercaptoethanol versus three cell lysis procedures: liquid nitrogen homogenization and two bead-beating materials (i.e., tungsten-carbide and stainless-steel) for four representative species of EPF (i.e., Beauveria bassiana, Hirsutella citriformis, Isaria javanica, and Metarhizium anisopliae). Liquid nitrogen homogenization combined with DNeasy® Plant Mini Kit (i.e., QN) or SDS buffer (i.e., SN) significantly improved the yield with a good purity (~1.8) and high integrity (>20,000 bp) of genomic DNA in contrast with other methods, also, these results were better when compared with the two bead-beating materials. The purified DNA was evaluated by PCR-based techniques: amplification of translation elongation factor 1-α (TEF) and two highly sensitive molecular markers (i.e., ISSR and AFLP) with reliable and reproducible results. Despite a variation in yield, purity, and integrity of extracted DNA across the four species of EPF with the different DNA extraction methods, the SN and QN protocols maintained a high-quality of DNA which is required for downstream molecular applications. Copyright © 2018 Elsevier B.V. All rights reserved.
Solution-based targeted genomic enrichment for precious DNA samples

Directory of Open Access Journals (Sweden)

Shearer Aiden

2012-05-01

Full Text Available Abstract Background Solution-based targeted genomic enrichment (TGE protocols permit selective sequencing of genomic regions of interest on a massively parallel scale. These protocols could be improved by: 1 modifying or eliminating time consuming steps; 2 increasing yield to reduce input DNA and excessive PCR cycling; and 3 enhancing reproducible. Results We developed a solution-based TGE method for downstream Illumina sequencing in a non-automated workflow, adding standard Illumina barcode indexes during the post-hybridization amplification to allow for sample pooling prior to sequencing. The method utilizes Agilent SureSelect baits, primers and hybridization reagents for the capture, off-the-shelf reagents for the library preparation steps, and adaptor oligonucleotides for Illumina paired-end sequencing purchased directly from an oligonucleotide manufacturing company. Conclusions This solution-based TGE method for Illumina sequencing is optimized for small- or medium-sized laboratories and addresses the weaknesses of standard protocols by reducing the amount of input DNA required, increasing capture yield, optimizing efficiency, and improving reproducibility.
Long span DNA paired-end-tag (DNA-PET sequencing strategy for the interrogation of genomic structural mutations and fusion-point-guided reconstruction of amplicons.

Directory of Open Access Journals (Sweden)

Fei Yao

Full Text Available Structural variations (SVs contribute significantly to the variability of the human genome and extensive genomic rearrangements are a hallmark of cancer. While genomic DNA paired-end-tag (DNA-PET sequencing is an attractive approach to identify genomic SVs, the current application of PET sequencing with short insert size DNA can be insufficient for the comprehensive mapping of SVs in low complexity and repeat-rich genomic regions. We employed a recently developed procedure to generate PET sequencing data using large DNA inserts of 10-20 kb and compared their characteristics with short insert (1 kb libraries for their ability to identify SVs. Our results suggest that although short insert libraries bear an advantage in identifying small deletions, they do not provide significantly better breakpoint resolution. In contrast, large inserts are superior to short inserts in providing higher physical genome coverage for the same sequencing cost and achieve greater sensitivity, in practice, for the identification of several classes of SVs, such as copy number neutral and complex events. Furthermore, our results confirm that large insert libraries allow for the identification of SVs within repetitive sequences, which cannot be spanned by short inserts. This provides a key advantage in studying rearrangements in cancer, and we show how it can be used in a fusion-point-guided-concatenation algorithm to study focally amplified regions in cancer.
Universal global imprints of genome growth and evolution--equivalent length and cumulative mutation density.

Directory of Open Access Journals (Sweden)

Hong-Da Chen

Full Text Available BACKGROUND: Segmental duplication is widely held to be an important mode of genome growth and evolution. Yet how this would affect the global structure of genomes has been little discussed. METHODS/PRINCIPAL FINDINGS: Here, we show that equivalent length, or L(e, a quantity determined by the variance of fluctuating part of the distribution of the k-mer frequencies in a genome, characterizes the latter's global structure. We computed the L(es of 865 complete chromosomes and found that they have nearly universal but (k-dependent values. The differences among the L(e of a chromosome and those of its coding and non-coding parts were found to be slight. CONCLUSIONS: We verified that these non-trivial results are natural consequences of a genome growth model characterized by random segmental duplication and random point mutation, but not of any model whose dominant growth mechanism is not segmental duplication. Our study also indicates that genomes have a nearly universal cumulative "point" mutation density of about 0.73 mutations per site that is compatible with the relatively low mutation rates of (1-5 x 10(-3/site/Mya previously determined by sequence comparison for the human and E. coli genomes.
Genome-wide DNA methylation analysis of the porcine hypothalamus-pituitary-ovary axis

DEFF Research Database (Denmark)

Yuan, Xiao Long; Zhang, Zhe; Li, Bin

2017-01-01

Previous studies have suggested that DNA methylation in both CpG and CpH (where H = C, T or A) contexts plays a critical role in biological functions of different tissues. However, the genome-wide DNA methylation patterns of porcine hypothalamus-pituitary-ovary (HPO) tissues remain virtually unex...
Alu Mobile Elements: From Junk DNA to Genomic Gems

Directory of Open Access Journals (Sweden)

Sami Dridi

2012-01-01

Full Text Available Alus, the short interspersed repeated sequences (SINEs, are retrotransposons that litter the human genomes and have long been considered junk DNA. However, recent findings that these mobile elements are transcribed, both as distinct RNA polymerase III transcripts and as a part of RNA polymerase II transcripts, suggest biological functions and refute the notion that Alus are biologically unimportant. Indeed, Alu RNAs have been shown to control mRNA processing at several levels, to have complex regulatory functions such as transcriptional repression and modulating alternative splicing and to cause a host of human genetic diseases. Alu RNAs embedded in Pol II transcripts can promote evolution and proteome diversity, which further indicates that these mobile retroelements are in fact genomic gems rather than genomic junks.
Identification of candidate new cancer susceptibility genes using yeast genomics

International Nuclear Information System (INIS)

Brown, M.; Brown, J.A.; Game, J.C.

2003-01-01

A large proportion of cancer susceptibility syndromes are the result of mutations in genes in DNA repair or in cell-cycle checkpoints in response to DNA damage, such as ataxia telangiectasia (AT), Fanconi's anemia (FA), Bloom's syndrome (BS), Nijmegen breakage syndrome (NBS), and xeroderma pigmentosum (XP). Mutations in these genes often cause gross chromosomal instability leading to an increased mutation rate of all genes including those directly responsible for cancer. We have proposed that because the orthologs of these genes in budding yeast, S. cerevisiae, confer protection against killing by DNA damaging agents it should be possible to identify new cancer susceptibility genes by identifying yeast genes whose deletion causes sensitivity to DNA damage. We therefore screened the recently completed collection of individual gene deletion mutants to identify genes that affect sensitivity to DNA-damaging agents. Screening for sensitivity in this obtained up to now with the F98 glioma model othe fact that each deleted gene is replaced by a cassette containing two molecular 'barcodes', or 20-mers, that uniquely identify the strain when DNA from a pool of strains is hybridized to an oligonucleotide array containing the complementary sequences of the barcodes. We performed the screen with UV, IR, H 2 0 2 and other DNA damaging agents. In addition to identifying genes already known to confer resistance to DNA damaging agents we have identified, and individually confirmed, several genes not previously associated with resistance. Several of these are of unknown function. We have also examined the chromosomal stability of selected strains and found that IR sensitive strains often but not always exhibit genomic instability. We are presently constructing a yeast artificial chromosome to globally interrogate all the genes in the deletion pool for their involvement in genomic stability. This work shows that budding yeast is a valuable eukaryotic model organism to identify
The pathological consequences of impaired genome integrity in humans; disorders of the DNA replication machinery.

Science.gov (United States)

O'Driscoll, Mark

2017-01-01

Accurate and efficient replication of the human genome occurs in the context of an array of constitutional barriers, including regional topological constraints imposed by chromatin architecture and processes such as transcription, catenation of the helical polymer and spontaneously generated DNA lesions, including base modifications and strand breaks. DNA replication is fundamentally important for tissue development and homeostasis; differentiation programmes are intimately linked with stem cell division. Unsurprisingly, impairments of the DNA replication machinery can have catastrophic consequences for genome stability and cell division. Functional impacts on DNA replication and genome stability have long been known to play roles in malignant transformation through a variety of complex mechanisms, and significant further insights have been gained from studying model organisms in this context. Congenital hypomorphic defects in components of the DNA replication machinery have been and continue to be identified in humans. These disorders present with a wide range of clinical features. Indeed, in some instances, different mutations in the same gene underlie different clinical presentations. Understanding the origin and molecular basis of these features opens a window onto the range of developmental impacts of suboptimal DNA replication and genome instability in humans. Here, I will briefly overview the basic steps involved in DNA replication and the key concepts that have emerged from this area of research, before switching emphasis to the pathological consequences of defects within the DNA replication network; the human disorders. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.
Comprehensive analysis of genome-wide DNA methylation across human polycystic ovary syndrome ovary granulosa cell.

Science.gov (United States)

Xu, Jiawei; Bao, Xiao; Peng, Zhaofeng; Wang, Linlin; Du, Linqing; Niu, Wenbin; Sun, Yingpu

2016-05-10

Polycystic ovary syndrome (PCOS) affects approximately 7% of the reproductive-age women. A growing body of evidence indicated that epigenetic mechanisms contributed to the development of PCOS. The role of DNA modification in human PCOS ovary granulosa cell is still unknown in PCOS progression. Global DNA methylation and hydroxymethylation were detected between PCOS' and controls' granulosa cell. Genome-wide DNA methylation was profiled to investigate the putative function of DNA methylaiton. Selected genes expressions were analyzed between PCOS' and controls' granulosa cell. Our results showed that the granulosa cell global DNA methylation of PCOS patients was significant higher than the controls'. The global DNA hydroxymethylation showed low level and no statistical difference between PCOS and control. 6936 differentially methylated CpG sites were identified between control and PCOS-obesity. 12245 differential methylated CpG sites were detected between control and PCOS-nonobesity group. 5202 methylated CpG sites were significantly differential between PCOS-obesity and PCOS-nonobesity group. Our results showed that DNA methylation not hydroxymethylation altered genome-wide in PCOS granulosa cell. The different methylation genes were enriched in development protein, transcription factor activity, alternative splicing, sequence-specific DNA binding and embryonic morphogenesis. YWHAQ, NCF2, DHRS9 and SCNA were up-regulation in PCOS-obesity patients with no significance different between control and PCOS-nonobesity patients, which may be activated by lower DNA methylaiton. Global and genome-wide DNA methylation alteration may contribute to different genes expression and PCOS clinical pathology.
Universal and rapid salt-extraction of high quality genomic DNA for PCR-based techniques.

OpenAIRE

Aljanabi, S M; Martinez, I

1997-01-01

A very simple, fast, universally applicable and reproducible method to extract high quality megabase genomic DNA from different organisms is described. We applied the same method to extract high quality complex genomic DNA from different tissues (wheat, barley, potato, beans, pear and almond leaves as well as fungi, insects and shrimps' fresh tissue) without any modification. The method does not require expensive and environmentally hazardous reagents and equipment. It can be performed even i...
DNA-dependent protein kinase participates in the radiation activation of NF-kB

International Nuclear Information System (INIS)

Rosenzweig, Kenneth E.; Youmell, Matthew B.; Price, Brendan D.

1997-01-01

The NF-kB transcription factor is maintained in an inactive state by binding to the lkBa inhibitory protein. Activation requires phosphorylation and degradation of lkBa, releasing active NF-kB. NF-kB can be activated by cytokines, antigens, free radicals and X-ray irradiation. The protein kinase responsible for phosphorylation of lkBa in vivo has not been fully characterized. Here, we have examined the role of the DNA-dependent protein kinases (DNA-PK) in the radiation-activation of NF-kB. Wortmannin is an inhibitor of DNA-PK and related kinases. Exposure of SW480 cells to wortmannin inhibited the radioactivation of NF-kB DNA-binding. Analysis of lkBa levels by western blotting indicated that wortmannin blocked the radiation induced degradation of lkBa. In in vitro experiments, purified DNA-PK was able to efficiently phosphorylate lkBa, and this phosphorylation was inhibited by wortmannin. In contrast, the induction of NF-kB activity by TNFa was unaffected by wortmannin. The results suggest that DNA-PK may phosphorylate lkBa following irradiation, leading to degradation of lkBa and the release of active NF-kB. The inability of wortmannin to block TNFa activation of NF-kB indicates there may be more than one pathway for the activation of NF-kB

Deterministic and stochastic models for middle east respiratory syndrome (MERS)

Science.gov (United States)

Suryani, Dessy Rizki; Zevika, Mona; Nuraini, Nuning

2018-03-01

World Health Organization (WHO) data stated that since September 2012, there were 1,733 cases of Middle East Respiratory Syndrome (MERS) with 628 death cases that occurred in 27 countries. MERS was first identified in Saudi Arabia in 2012 and the largest cases of MERS outside Saudi Arabia occurred in South Korea in 2015. MERS is a disease that attacks the respiratory system caused by infection of MERS-CoV. MERS-CoV transmission occurs directly through direct contact between infected individual with non-infected individual or indirectly through contaminated object by the free virus. Suspected, MERS can spread quickly because of the free virus in environment. Mathematical modeling is used to illustrate the transmission of MERS disease using deterministic model and stochastic model. Deterministic model is used to investigate the temporal dynamic from the system to analyze the steady state condition. Stochastic model approach using Continuous Time Markov Chain (CTMC) is used to predict the future states by using random variables. From the models that were built, the threshold value for deterministic models and stochastic models obtained in the same form and the probability of disease extinction can be computed by stochastic model. Simulations for both models using several of different parameters are shown, and the probability of disease extinction will be compared with several initial conditions.
Digital Droplet Multiple Displacement Amplification (ddMDA for Whole Genome Sequencing of Limited DNA Samples.

Directory of Open Access Journals (Sweden)

Minsoung Rhee

Full Text Available Multiple displacement amplification (MDA is a widely used technique for amplification of DNA from samples containing limited amounts of DNA (e.g., uncultivable microbes or clinical samples before whole genome sequencing. Despite its advantages of high yield and fidelity, it suffers from high amplification bias and non-specific amplification when amplifying sub-nanogram of template DNA. Here, we present a microfluidic digital droplet MDA (ddMDA technique where partitioning of the template DNA into thousands of sub-nanoliter droplets, each containing a small number of DNA fragments, greatly reduces the competition among DNA fragments for primers and polymerase thereby greatly reducing amplification bias. Consequently, the ddMDA approach enabled a more uniform coverage of amplification over the entire length of the genome, with significantly lower bias and non-specific amplification than conventional MDA. For a sample containing 0.1 pg/μL of E. coli DNA (equivalent of ~3/1000 of an E. coli genome per droplet, ddMDA achieves a 65-fold increase in coverage in de novo assembly, and more than 20-fold increase in specificity (percentage of reads mapping to E. coli compared to the conventional tube MDA. ddMDA offers a powerful method useful for many applications including medical diagnostics, forensics, and environmental microbiology.
Genome-wide DNA methylation profiling in the superior temporal gyrus reveals epigenetic signatures associated with Alzheimer's disease.

Science.gov (United States)

Watson, Corey T; Roussos, Panos; Garg, Paras; Ho, Daniel J; Azam, Nidha; Katsel, Pavel L; Haroutunian, Vahram; Sharp, Andrew J

2016-01-19

Alzheimer's disease affects ~13% of people in the United States 65 years and older, making it the most common neurodegenerative disorder. Recent work has identified roles for environmental, genetic, and epigenetic factors in Alzheimer's disease risk. We performed a genome-wide screen of DNA methylation using the Illumina Infinium HumanMethylation450 platform on bulk tissue samples from the superior temporal gyrus of patients with Alzheimer's disease and non-demented controls. We paired a sliding window approach with multivariate linear regression to characterize Alzheimer's disease-associated differentially methylated regions (DMRs). We identified 479 DMRs exhibiting a strong bias for hypermethylated changes, a subset of which were independently associated with aging. DMR intervals overlapped 475 RefSeq genes enriched for gene ontology categories with relevant roles in neuron function and development, as well as cellular metabolism, and included genes reported in Alzheimer's disease genome-wide and epigenome-wide association studies. DMRs were enriched for brain-specific histone signatures and for binding motifs of transcription factors with roles in the brain and Alzheimer's disease pathology. Notably, hypermethylated DMRs preferentially overlapped poised promoter regions, marked by H3K27me3 and H3K4me3, previously shown to co-localize with aging-associated hypermethylation. Finally, the integration of DMR-associated single nucleotide polymorphisms with Alzheimer's disease genome-wide association study risk loci and brain expression quantitative trait loci highlights multiple potential DMRs of interest for further functional analysis. We have characterized changes in DNA methylation in the superior temporal gyrus of patients with Alzheimer's disease, highlighting novel loci that facilitate better characterization of pathways and mechanisms underlying Alzheimer's disease pathogenesis, and improve our understanding of epigenetic signatures that may contribute to the
Fine de novo sequencing of a fungal genome using only SOLiD short read data: verification on Aspergillus oryzae RIB40.

Directory of Open Access Journals (Sweden)

Myco Umemura

Full Text Available The development of next-generation sequencing (NGS technologies has dramatically increased the throughput, speed, and efficiency of genome sequencing. The short read data generated from NGS platforms, such as SOLiD and Illumina, are quite useful for mapping analysis. However, the SOLiD read data with lengths of <60 bp have been considered to be too short for de novo genome sequencing. Here, to investigate whether de novo sequencing of fungal genomes is possible using only SOLiD short read sequence data, we performed de novo assembly of the Aspergillus oryzae RIB40 genome using only SOLiD read data of 50 bp generated from mate-paired libraries with 2.8- or 1.9-kb insert sizes. The assembled scaffolds showed an N50 value of 1.6 Mb, a 22-fold increase than those obtained using only SOLiD short read in other published reports. In addition, almost 99% of the reference genome was accurately aligned by the assembled scaffold fragments in long lengths. The sequences of secondary metabolite biosynthetic genes and clusters, whose products are of considerable interest in fungal studies due to their potential medicinal, agricultural, and cosmetic properties, were also highly reconstructed in the assembled scaffolds. Based on these findings, we concluded that de novo genome sequencing using only SOLiD short reads is feasible and practical for molecular biological study of fungi. We also investigated the effect of filtering low quality data, library insert size, and k-mer size on the assembly performance, and recommend for the assembly use of mild filtered read data where the N50 was not so degraded and the library has an insert size of ∼2.0 kb, and k-mer size 33.
Rapid detection of MERS coronavirus-like viruses in bats: pote1ntial for tracking MERS coronavirus transmission and animal origin.

Science.gov (United States)

Woo, Patrick C Y; Lau, Susanna K P; Chen, Yixin; Wong, Emily Y M; Chan, Kwok-Hung; Chen, Honglin; Zhang, Libiao; Xia, Ningshao; Yuen, Kwok-Yung

2018-03-07

Recently, we developed a monoclonal antibody-based rapid nucleocapsid protein detection assay for diagnosis of MERS coronavirus (MERS-CoV) in humans and dromedary camels. In this study, we examined the usefulness of this assay to detect other lineage C betacoronaviruses closely related to MERS-CoV in bats. The rapid MERS-CoV nucleocapsid protein detection assay was tested positive in 24 (88.9%) of 27 Tylonycteris bat CoV HKU4 (Ty-BatCoV-HKU4) RNA-positive alimentary samples of Tylonycteris pachypus and 4 (19.0%) of 21 Pipistrellus bat CoV HKU5 (Pi-BatCoV-HKU5) RNA-positive alimentary samples of Pipistrellus abramus. There was significantly more Ty-BatCoV-HKU4 RNA-positive alimentary samples than Pi-BatCoV-HKU5 RNA-positive alimentary samples that were tested positive by the rapid MERS-CoV nucleocapsid protein detection assay (P < 0.001 by Chi-square test). The rapid assay was tested negative in all 51 alimentary samples RNA-positive for alphacoronaviruses (Rhinolophus bat CoV HKU2, Myotis bat CoV HKU6, Miniopterus bat CoV HKU8 and Hipposideros batCoV HKU10) and 32 alimentary samples positive for lineage B (SARS-related Rhinolophus bat CoV HKU3) and lineage D (Rousettus bat CoV HKU9) betacoronaviruses. No significant difference was observed between the viral loads of Ty-BatCoV-HKU4/Pi-BatCoV-HKU5 RNA-positive alimentary samples that were tested positive and negative by the rapid test (Mann-Witney U test). The rapid MERS-CoV nucleocapsid protein detection assay is able to rapidly detect lineage C betacoronaviruses in bats. It detected significantly more Ty-BatCoV-HKU4 than Pi-BatCoV-HKU5 because MERS-CoV is more closely related to Ty-BatCoV-HKU4 than Pi-BatCoV-HKU5. This assay will facilitate rapid on-site mass screening of animal samples for ancestors of MERS-CoV and tracking transmission in the related bat species.
Evaluation of three methods of DNA extraction from paraffin-embedded material for the amplification of genomic DNA by means of the PCR technique

Directory of Open Access Journals (Sweden)

MESQUITA Ricardo Alves

2001-01-01

Full Text Available There are several protocols reported in the literature for the extraction of genomic DNA from formalin-fixed paraffin-embedded samples. Genomic DNA is utilized in molecular analyses, including PCR. This study compares three different methods for the extraction of genomic DNA from formalin-fixed paraffin-embedded (inflammatory fibrous hyperplasia and non-formalin-fixed (normal oral mucosa samples: phenol with enzymatic digestion, and silica with and without enzymatic digestion. The amplification of DNA by means of the PCR technique was carried out with primers for the exon 7 of human keratin type 14. Amplicons were analyzed by means of electrophoresis in an 8% polyacrylamide gel with 5% glycerol, followed by silver-staining visualization. The phenol/enzymatic digestion and the silica/enzymatic digestion methods provided amplicons from both tissue samples. The method described is a potential aid in the establishment of the histopathologic diagnosis and in retrospective studies with archival paraffin-embedded samples.
Amplification of pico-scale DNA mediated by bacterial carrier DNA for small-cell-number transcription factor ChIP-seq

DEFF Research Database (Denmark)

Jakobsen, Janus S; Bagger, Frederik O; Hasemann, Marie S

2015-01-01

BACKGROUND: Chromatin-Immunoprecipitation coupled with deep sequencing (ChIP-seq) is used to map transcription factor occupancy and generate epigenetic profiles genome-wide. The requirement of nano-scale ChIP DNA for generation of sequencing libraries has impeded ChIP-seq on in vivo tissues of low...... transcription factor (CEBPA) and histone mark (H3K4me3) ChIP. We further demonstrate that genomic profiles are highly resilient to changes in carrier DNA to ChIP DNA ratios. CONCLUSIONS: This represents a significant advance compared to existing technologies, which involve either complex steps of pre...... cell numbers. RESULTS: We describe a robust, simple and scalable methodology for ChIP-seq of low-abundant cell populations, verified down to 10,000 cells. By employing non-mammalian genome mapping bacterial carrier DNA during amplification, we reliably amplify down to 50 pg of ChIP DNA from...
Shotgun Bisulfite Sequencing of the Betula platyphylla Genome Reveals the Tree’s DNA Methylation Patterning

Directory of Open Access Journals (Sweden)

Chang Su

2014-12-01

Full Text Available DNA methylation plays a critical role in the regulation of gene expression. Most studies of DNA methylation have been performed in herbaceous plants, and little is known about the methylation patterns in tree genomes. In the present study, we generated a map of methylated cytosines at single base pair resolution for Betula platyphylla (white birch by bisulfite sequencing combined with transcriptomics to analyze DNA methylation and its effects on gene expression. We obtained a detailed view of the function of DNA methylation sequence composition and distribution in the genome of B. platyphylla. There are 34,460 genes in the whole genome of birch, and 31,297 genes are methylated. Conservatively, we estimated that 14.29% of genomic cytosines are methylcytosines in birch. Among the methylation sites, the CHH context accounts for 48.86%, and is the largest proportion. Combined transcriptome and methylation analysis showed that the genes with moderate methylation levels had higher expression levels than genes with high and low methylation. In addition, methylated genes are highly enriched for the GO subcategories of binding activities, catalytic activities, cellular processes, response to stimulus and cell death, suggesting that methylation mediates these pathways in birch trees.
Renewed mer model of integral management

Directory of Open Access Journals (Sweden)

Janko Belak

2015-12-01

Full Text Available Background: The research work on entrepreneurship, enterprise's policy and management, which started in 1992, successfully continued in the following years. Between 1992 and 2011, more than 400 academics and other researchers have participated in research work (MER research program whose main orientation has been the creation of their own model of integral management. Results: In past years, academics (researchers and authors of published papers from Austria, Belgium, Bosnia and Herzegovina, Bulgaria, Byelorussia, Canada, the Czech Republic, Croatia, Estonia, France, Germany, Hungary, Italy, Poland, Romania, Russia, the Slovak Republic, Slovenia, Switzerland, Ukraine, and the US have cooperated in MER programs, coming from more than fifty institutions. Thus, scientific doctrines of different universities influenced the development of the MER model which is based on both horizontal and vertical integration of the enterprises' governance and management processes, instruments and institutions into a consistently operating unit. Conclusions: The presented MER model is based on the multi-layer integration of governance and management with an enterprise and its environment, considering the fundamental desires for the enterprises' existence and, thus, their quantitative as well as qualitative changes. The process, instrumental, and institutional integrity of the governance and management is also the initial condition for the implementation of all other integration factors.
Higher-Density Culture in Human Embryonic Stem Cells Results in DNA Damage and Genome Instability

Directory of Open Access Journals (Sweden)

Kurt Jacobs

2016-03-01

Full Text Available Human embryonic stem cells (hESC show great promise for clinical and research applications, but their well-known proneness to genomic instability hampers the development to their full potential. Here, we demonstrate that medium acidification linked to culture density is the main cause of DNA damage and genomic alterations in hESC grown on feeder layers, and this even in the short time span of a single passage. In line with this, we show that increasing the frequency of the medium refreshments minimizes the levels of DNA damage and genetic instability. Also, we show that cells cultured on laminin-521 do not present this increase in DNA damage when grown at high density, although the (long-term impact on their genomic stability remains to be elucidated. Our results explain the high levels of genome instability observed over the years by many laboratories worldwide, and show that the development of optimal culture conditions is key to solving this problem.
Cloning, sequencing, and expression of dnaK-operon proteins from the thermophilic bacterium Thermus thermophilus.

Science.gov (United States)

Osipiuk, J; Joachimiak, A

1997-09-12

We propose that the dnaK operon of Thermus thermophilus HB8 is composed of three functionally linked genes: dnaK, grpE, and dnaJ. The dnaK and dnaJ gene products are most closely related to their cyanobacterial homologs. The DnaK protein sequence places T. thermophilus in the plastid Hsp70 subfamily. In contrast, the grpE translated sequence is most similar to GrpE from Clostridium acetobutylicum, a Gram-positive anaerobic bacterium. A single promoter region, with homology to the Escherichia coli consensus promoter sequences recognized by the sigma70 and sigma32 transcription factors, precedes the postulated operon. This promoter is heat-shock inducible. The dnaK mRNA level increased more than 30 times upon 10 min of heat shock (from 70 degrees C to 85 degrees C). A strong transcription terminating sequence was found between the dnaK and grpE genes. The individual genes were cloned into pET expression vectors and the thermophilic proteins were overproduced at high levels in E. coli and purified to homogeneity. The recombinant T. thermophilus DnaK protein was shown to have a weak ATP-hydrolytic activity, with an optimum at 90 degrees C. The ATPase was stimulated by the presence of GrpE and DnaJ. Another open reading frame, coding for ClpB heat-shock protein, was found downstream of the dnaK operon.
Retrotransposon silencing by DNA methylation can drive mammalian genomic imprinting.

Directory of Open Access Journals (Sweden)

Shunsuke Suzuki

2007-04-01

Full Text Available Among mammals, only eutherians and marsupials are viviparous and have genomic imprinting that leads to parent-of-origin-specific differential gene expression. We used comparative analysis to investigate the origin of genomic imprinting in mammals. PEG10 (paternally expressed 10 is a retrotransposon-derived imprinted gene that has an essential role for the formation of the placenta of the mouse. Here, we show that an orthologue of PEG10 exists in another therian mammal, the marsupial tammar wallaby (Macropus eugenii, but not in a prototherian mammal, the egg-laying platypus (Ornithorhynchus anatinus, suggesting its close relationship to the origin of placentation in therian mammals. We have discovered a hitherto missing link of the imprinting mechanism between eutherians and marsupials because tammar PEG10 is the first example of a differentially methylated region (DMR associated with genomic imprinting in marsupials. Surprisingly, the marsupial DMR was strictly limited to the 5' region of PEG10, unlike the eutherian DMR, which covers the promoter regions of both PEG10 and the adjacent imprinted gene SGCE. These results not only demonstrate a common origin of the DMR-associated imprinting mechanism in therian mammals but provide the first demonstration that DMR-associated genomic imprinting in eutherians can originate from the repression of exogenous DNA sequences and/or retrotransposons by DNA methylation.
Microsatellite DNA in genomic survey sequences and UniGenes of loblolly pine

Science.gov (United States)

Craig S Echt; Surya Saha; Dennis L Deemer; C Dana Nelson

2011-01-01

Genomic DNA sequence databases are a potential and growing resource for simple sequence repeat (SSR) marker development in loblolly pine (Pinus taeda L.). Loblolly pine also has many expressed sequence tags (ESTs) available for microsatellite (SSR) marker development. We compared loblolly pine SSR densities in genome survey sequences (GSSs) to those in non-redundant...
Spectral entropy criteria for structural segmentation in genomic DNA sequences

International Nuclear Information System (INIS)

Chechetkin, V.R.; Lobzin, V.V.

2004-01-01

The spectral entropy is calculated with Fourier structure factors and characterizes the level of structural ordering in a sequence of symbols. It may efficiently be applied to the assessment and reconstruction of the modular structure in genomic DNA sequences. We present the relevant spectral entropy criteria for the local and non-local structural segmentation in DNA sequences. The results are illustrated with the model examples and analysis of intervening exon-intron segments in the protein-coding regions
Guardians of the mycobacterial genome: A review on DNA repair systems in Mycobacterium tuberculosis.

Science.gov (United States)

Singh, Amandeep

2017-12-01

The genomic integrity of Mycobacterium tuberculosis is continuously threatened by the harsh survival conditions inside host macrophages, due to immune and antibiotic stresses. Faithful genome maintenance and repair must be accomplished under stress for the bacillus to survive in the host, necessitating a robust DNA repair system. The importance of DNA repair systems in pathogenesis is well established. Previous examination of the M. tuberculosis genome revealed homologues of almost all the major DNA repair systems, i.e. nucleotide excision repair (NER), base excision repair (BER), homologous recombination (HR) and non-homologous end joining (NHEJ). However, recent developments in the field have pointed to the presence of novel proteins and pathways in mycobacteria. Homologues of archeal mismatch repair proteins were recently reported in mycobacteria, a pathway previously thought to be absent. RecBCD, the major nuclease-helicase enzymes involved in HR in E. coli, were implicated in the single-strand annealing (SSA) pathway. Novel roles of archeo-eukaryotic primase (AEP) polymerases, previously thought to be exclusive to NHEJ, have been reported in BER. Many new proteins with a probable role in DNA repair have also been discovered. It is now realized that the DNA repair systems in M. tuberculosis are highly evolved and have redundant backup mechanisms to mend the damage. This review is an attempt to summarize our current understanding of the DNA repair systems in M. tuberculosis.
Multi-scale coding of genomic information: From DNA sequence to genome structure and function

International Nuclear Information System (INIS)

Arneodo, Alain; Vaillant, Cedric; Audit, Benjamin; Argoul, Francoise; D'Aubenton-Carafa, Yves; Thermes, Claude

2011-01-01

Understanding how chromatin is spatially and dynamically organized in the nucleus of eukaryotic cells and how this affects genome functions is one of the main challenges of cell biology. Since the different orders of packaging in the hierarchical organization of DNA condition the accessibility of DNA sequence elements to trans-acting factors that control the transcription and replication processes, there is actually a wealth of structural and dynamical information to learn in the primary DNA sequence. In this review, we show that when using concepts, methodologies, numerical and experimental techniques coming from statistical mechanics and nonlinear physics combined with wavelet-based multi-scale signal processing, we are able to decipher the multi-scale sequence encoding of chromatin condensation-decondensation mechanisms that play a fundamental role in regulating many molecular processes involved in nuclear functions.
Meraculous: De Novo Genome Assembly with Short Paired-End Reads

Energy Technology Data Exchange (ETDEWEB)

Chapman, Jarrod A.; Ho, Isaac; Sunkara, Sirisha; Luo, Shujun; Schroth, Gary P.; Rokhsar, Daniel S.; Salzberg, Steven L.

2011-08-18

We describe a new algorithm, meraculous, for whole genome assembly of deep paired-end short reads, and apply it to the assembly of a dataset of paired 75-bp Illumina reads derived from the 15.4 megabase genome of the haploid yeast Pichia stipitis. More than 95% of the genome is recovered, with no errors; half the assembled sequence is in contigs longer than 101 kilobases and in scaffolds longer than 269 kilobases. Incorporating fosmid ends recovers entire chromosomes. Meraculous relies on an efficient and conservative traversal of the subgraph of the k-mer (deBruijn) graph of oligonucleotides with unique high quality extensions in the dataset, avoiding an explicit error correction step as used in other short-read assemblers. A novel memory-efficient hashing scheme is introduced. The resulting contigs are ordered and oriented using paired reads separated by ~280 bp or ~3.2 kbp, and many gaps between contigs can be closed using paired-end placements. Practical issues with the dataset are described, and prospects for assembling larger genomes are discussed.
A protein that binds to the P1 origin core and the oriC 13mer region in a methylation-specific fashion is the product of the host seqA gene.

Science.gov (United States)

Brendler, T; Abeles, A; Austin, S

1995-08-15

The P1 plasmid replication origin P1oriR is controlled by methylation of four GATC adenine methylation sites within heptamer repeats. A comparable (13mer) region is present in the host origin, oriC. The two origins show comparable responses to methylation; negative control by recognition of hemimethylated DNA (sequestration) and a positive requirement for methylation for efficient function. We have isolated a host protein that recognizes the P1 origin region only when it is isolated from a strain proficient for adenine methylation. The substantially purified 22 kDa protein also binds to the 13mer region of oriC in a methylation-specific fashion. It proved to be the product of the seqA gene that acts in the negative control of oriC by sequestration. We conclude that the role of the SeqA protein in sequestration is to recognize the methylation state of P1oriR and oriC by direct DNA binding. Using synthetic substrates we show that SeqA binds exclusively to the hemimethylated forms of these origins forms that are the immediate products of replication in a methylation-proficient strain. We also show that the protein can recognize sequences with multiple GATC sites, irrespective of the surrounding sequence. The basis for origin specificity is primarily the persistence of hemimethylated forms that are over-represented in the natural. DNA preparations relative to controls.
Intratracheal exposure of common marmosets to MERS-CoV Jordan-n3/2012 or MERS-CoV EMC/2012 isolates does not result in lethal disease

Energy Technology Data Exchange (ETDEWEB)

Johnson, Reed F., E-mail: johnsonreed@mail.nih.gov [Emerging Viral Pathogens Section, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Via, Laura E. [Tuberculosis Research Section, Laboratory of Clinical Infectious Diseases, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD (United States); Kumar, Mia R.; Cornish, Joseph P. [Emerging Viral Pathogens Section, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Yellayi, Srikanth; Huzella, Louis; Postnikova, Elena; Oberlander, Nicholas; Bartos, Christopher; Ork, Britini L.; Mazur, Steven; Allan, Cindy; Holbrook, Michael R. [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Solomon, Jeffrey [Center for Infectious Disease Imaging, Radiology and Imaging Sciences, Clinical Center, National Institutes of Health, Bethesda, MD (United States); Johnson, Joshua C. [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Pickel, James [Transgenic Core Facility, National Institute of Mental Health, National Institutes of Health, Bethesda, MD (United States); Hensley, Lisa E. [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Jahrling, Peter B. [Emerging Viral Pathogens Section, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States); Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD (United States)

2015-11-15

Middle East Respiratory Syndrome Coronavirus (MERS-CoV) continues to be a threat to human health in the Middle East. Development of countermeasures is ongoing; however, an animal model that faithfully recapitulates human disease has yet to be defined. A recent study indicated that inoculation of common marmosets resulted in inconsistent lethality. Based on these data we sought to compare two isolates of MERS-CoV. We followed disease progression in common marmosets after intratracheal exposure with: MERS-CoV-EMC/2012, MERS-CoV-Jordan-n3/2012, media, or inactivated virus. Our data suggest that common marmosets developed a mild to moderate non-lethal respiratory disease, which was quantifiable by computed tomography (CT), with limited other clinical signs. Based on CT data, clinical data, and virological data, MERS-CoV inoculation of common marmosets results in mild to moderate clinical signs of disease that are likely due to manipulations of the marmoset rather than as a result of robust viral replication. - Highlights: • Common marmosets infected with MERS-EMC and MERS-JOR did not develop lethal disease. • Infected subjects developed transient signs of clinical disease. • CT indicated few differences between the infected and control groups. • Marmosets do not faithfully replicate human MERS pathogenesis.
Intratracheal exposure of common marmosets to MERS-CoV Jordan-n3/2012 or MERS-CoV EMC/2012 isolates does not result in lethal disease

International Nuclear Information System (INIS)

Johnson, Reed F.; Via, Laura E.; Kumar, Mia R.; Cornish, Joseph P.; Yellayi, Srikanth; Huzella, Louis; Postnikova, Elena; Oberlander, Nicholas; Bartos, Christopher; Ork, Britini L.; Mazur, Steven; Allan, Cindy; Holbrook, Michael R.; Solomon, Jeffrey; Johnson, Joshua C.; Pickel, James; Hensley, Lisa E.; Jahrling, Peter B.

2015-01-01

Middle East Respiratory Syndrome Coronavirus (MERS-CoV) continues to be a threat to human health in the Middle East. Development of countermeasures is ongoing; however, an animal model that faithfully recapitulates human disease has yet to be defined. A recent study indicated that inoculation of common marmosets resulted in inconsistent lethality. Based on these data we sought to compare two isolates of MERS-CoV. We followed disease progression in common marmosets after intratracheal exposure with: MERS-CoV-EMC/2012, MERS-CoV-Jordan-n3/2012, media, or inactivated virus. Our data suggest that common marmosets developed a mild to moderate non-lethal respiratory disease, which was quantifiable by computed tomography (CT), with limited other clinical signs. Based on CT data, clinical data, and virological data, MERS-CoV inoculation of common marmosets results in mild to moderate clinical signs of disease that are likely due to manipulations of the marmoset rather than as a result of robust viral replication. - Highlights: • Common marmosets infected with MERS-EMC and MERS-JOR did not develop lethal disease. • Infected subjects developed transient signs of clinical disease. • CT indicated few differences between the infected and control groups. • Marmosets do not faithfully replicate human MERS pathogenesis.

Factors affecting antimicrobial activity of MUC7 12-mer, a human salivary mucin-derived peptide

Directory of Open Access Journals (Sweden)

Bobek Libuse A

2007-11-01

Full Text Available Abstract Background MUC7 12-mer (RKSYKCLHKRCR, a cationic antimicrobial peptide derived from the human low-molecular-weight salivary mucin MUC7, possesses potent antimicrobial activity in vitro. In order to evaluate the potential therapeutic application of the MUC7 12-mer, we examined the effects of mono- and divalent cations, EDTA, pH, and temperature on its antimicrobial activity. Methods Minimal Inhibitory Concentrations (MICs were determined using a liquid growth inhibition assay in 96-well microtiter plates. MUC7 12-mer was added at concentrations of 1.56–50 μM. MICs were determined at three endpoints: MIC-0, MIC-1, and MIC-2 (the lowest drug concentration showing 10%, 25% and 50% of growth, respectively. To examine the effect of salts or EDTA, a checkerboard microdilution technique was used. Fractional inhibitory concentration index (FICi was calculated on the basis of MIC-0. The viability of microbial cells treated with MUC7 12-mer in the presence of sodium or potassium was also determined by killing assay or flow cytometry. Results The MICs of MUC7 12-mer against organisms tested ranged from 6.25–50 μM. For C. albicans, antagonism (FICi 4.5 was observed for the combination of MUC7 12-mer and calcium; however, there was synergism (FICi 0.22 between MUC7 12-mer and EDTA, and the synergism was retained in the presence of calcium at its physiological concentration (1–2 mM. No antagonism but additivity or indifference (FICi 0.55–2.5 was observed for the combination of MUC7 12-mer and each K+, Na+, Mg2+, or Zn2+. MUC7 12-mer peptide (at 25 μM also exerted killing activity in the presence of NaCl, (up to 25 mM for C. albicans and up to 150 mM for E. coli, a physiological concentration of sodium in the oral cavity and serum, respectively and retained candidacidal activity in the presence of KCl (up to 40 mM. The peptide exhibited higher inhibitory activity against C. albicans at pH 7, 8, and 9 than at pH 5 and 6, and temperature up to
DNA template strand sequencing of single-cells maps genomic rearrangements at high resolution

NARCIS (Netherlands)

Falconer, Ester; Hills, Mark; Naumann, Ulrike; Poon, Steven S. S.; Chavez, Elizabeth A.; Sanders, Ashley D.; Zhao, Yongjun; Hirst, Martin; Lansdorp, Peter M.

DNA rearrangements such as sister chromatid exchanges (SCEs) are sensitive indicators of genomic stress and instability, but they are typically masked by single-cell sequencing techniques. We developed Strand-seq to independently sequence parental DNA template strands from single cells, making it
The genomic sequence of the Chinese hamster ovary (CHO)-K1 cell line

DEFF Research Database (Denmark)

Xu, Xun; Pan, Shengkai; Liu, Xin

2011-01-01

Chinese hamster ovary (CHO)-derived cell lines are the preferred host cells for the production of therapeutic proteins. Here we present a draft genomic sequence of the CHO-K1 ancestral cell line. The assembly comprises 2.45 Gb of genomic sequence, with 24,383 predicted genes. We associate most....... Homologs of most human glycosylation-associated genes are present in the CHO-K1 genome, although 141 of these homologs are not expressed under exponential growth conditions. Many important viral entry genes are also present in the genome but not expressed, which may explain the unusual viral resistance...... property of CHO cell lines. We discuss how the availability of this genome sequence may facilitate genome-scale science for the optimization of biopharmaceutical protein production....
cDNA, genomic cloning and sequence analysis of ribosomal protein ...

African Journals Online (AJOL)

enoh

2012-03-13

Mar 13, 2012 ... cDNA and the genomic sequence of RPS4X were cloned successfully from ... S4 genes plays a role in Turner syndrome; however, this ..... Project of Educational Committee of Sichuan Province ... Molecular biology of the cell.
Evidence of pervasive biologically functional secondary structures within the genomes of eukaryotic single-stranded DNA viruses.

Science.gov (United States)

Muhire, Brejnev Muhizi; Golden, Michael; Murrell, Ben; Lefeuvre, Pierre; Lett, Jean-Michel; Gray, Alistair; Poon, Art Y F; Ngandu, Nobubelo Kwanele; Semegni, Yves; Tanov, Emil Pavlov; Monjane, Adérito Luis; Harkins, Gordon William; Varsani, Arvind; Shepherd, Dionne Natalie; Martin, Darren Patrick

2014-02-01

Single-stranded DNA (ssDNA) viruses have genomes that are potentially capable of forming complex secondary structures through Watson-Crick base pairing between their constituent nucleotides. A few of the structural elements formed by such base pairings are, in fact, known to have important functions during the replication of many ssDNA viruses. Unknown, however, are (i) whether numerous additional ssDNA virus genomic structural elements predicted to exist by computational DNA folding methods actually exist and (ii) whether those structures that do exist have any biological relevance. We therefore computationally inferred lists of the most evolutionarily conserved structures within a diverse selection of animal- and plant-infecting ssDNA viruses drawn from the families Circoviridae, Anelloviridae, Parvoviridae, Nanoviridae, and Geminiviridae and analyzed these for evidence of natural selection favoring the maintenance of these structures. While we find evidence that is consistent with purifying selection being stronger at nucleotide sites that are predicted to be base paired than at sites predicted to be unpaired, we also find strong associations between sites that are predicted to pair with one another and site pairs that are apparently coevolving in a complementary fashion. Collectively, these results indicate that natural selection actively preserves much of the pervasive secondary structure that is evident within eukaryote-infecting ssDNA virus genomes and, therefore, that much of this structure is biologically functional. Lastly, we provide examples of various highly conserved but completely uncharacterized structural elements that likely have important functions within some of the ssDNA virus genomes analyzed here.
Crystal Structures of DNA-Whirly Complexes and Their Role in Arabidopsis Organelle Genome Repair

Energy Technology Data Exchange (ETDEWEB)

Cappadocia, Laurent; Maréchal, Alexandre; Parent, Jean-Sébastien; Lepage, Étienne; Sygusch, Jurgen; Brisson, Normand (Montreal)

2010-09-07

DNA double-strand breaks are highly detrimental to all organisms and need to be quickly and accurately repaired. Although several proteins are known to maintain plastid and mitochondrial genome stability in plants, little is known about the mechanisms of DNA repair in these organelles and the roles of specific proteins. Here, using ciprofloxacin as a DNA damaging agent specific to the organelles, we show that plastids and mitochondria can repair DNA double-strand breaks through an error-prone pathway similar to the microhomology-mediated break-induced replication observed in humans, yeast, and bacteria. This pathway is negatively regulated by the single-stranded DNA (ssDNA) binding proteins from the Whirly family, thus indicating that these proteins could contribute to the accurate repair of plant organelle genomes. To understand the role of Whirly proteins in this process, we solved the crystal structures of several Whirly-DNA complexes. These reveal a nonsequence-specific ssDNA binding mechanism in which DNA is stabilized between domains of adjacent subunits and rendered unavailable for duplex formation and/or protein interactions. Our results suggest a model in which the binding of Whirly proteins to ssDNA would favor accurate repair of DNA double-strand breaks over an error-prone microhomology-mediated break-induced replication repair pathway.
DNA Data Bank of Japan at work on genome sequence data.

Science.gov (United States)

Tateno, Y; Fukami-Kobayashi, K; Miyazaki, S; Sugawara, H; Gojobori, T

1998-01-01

We at the DNA Data Bank of Japan (DDBJ) (http://www.ddbj.nig.ac.jp) have recently begun receiving, processing and releasing EST and genome sequence data submitted by various Japanese genome projects. The data include those for human, Arabidopsis thaliana, rice, nematode, Synechocystis sp. and Escherichia coli. Since the quantity of data is very large, we organized teams to conduct preliminary discussions with project teams about data submission and handling for release to the public. We also developed a mass submission tool to cope with a large quantity of data. In addition, to provide genome data on WWW, we developed a genome information system using Java. This system (http://mol.genes.nig.ac.jp/ecoli/) can in theory be used for any genome sequence data. These activities will facilitate processing of large quantities of EST and genome data.
The Conjugative Relaxase TrwC Promotes Integration of Foreign DNA in the Human Genome.

Science.gov (United States)

González-Prieto, Coral; Gabriel, Richard; Dehio, Christoph; Schmidt, Manfred; Llosa, Matxalen

2017-06-15

Bacterial conjugation is a mechanism of horizontal DNA transfer. The relaxase TrwC of the conjugative plasmid R388 cleaves one strand of the transferred DNA at the oriT gene, covalently attaches to it, and leads the single-stranded DNA (ssDNA) into the recipient cell. In addition, TrwC catalyzes site-specific integration of the transferred DNA into its target sequence present in the genome of the recipient bacterium. Here, we report the analysis of the efficiency and specificity of the integrase activity of TrwC in human cells, using the type IV secretion system of the human pathogen Bartonella henselae to introduce relaxase-DNA complexes. Compared to Mob relaxase from plasmid pBGR1, we found that TrwC mediated a 10-fold increase in the rate of plasmid DNA transfer to human cells and a 100-fold increase in the rate of chromosomal integration of the transferred DNA. We used linear amplification-mediated PCR and plasmid rescue to characterize the integration pattern in the human genome. DNA sequence analysis revealed mostly reconstituted oriT sequences, indicating that TrwC is active and recircularizes transferred DNA in human cells. One TrwC-mediated site-specific integration event was detected, proving that TrwC is capable of mediating site-specific integration in the human genome, albeit with very low efficiency compared to the rate of random integration. Our results suggest that TrwC may stabilize the plasmid DNA molecules in the nucleus of the human cell, probably by recircularization of the transferred DNA strand. This stabilization would increase the opportunities for integration of the DNA by the host machinery. IMPORTANCE Different biotechnological applications, including gene therapy strategies, require permanent modification of target cells. Long-term expression is achieved either by extrachromosomal persistence or by integration of the introduced DNA. Here, we studied the utility of conjugative relaxase TrwC, a bacterial protein with site
Comparative chloroplast genomes of eleven Schima (Theaceae) species: Insights into DNA barcoding and phylogeny.

Science.gov (United States)

Yu, Xiang-Qin; Drew, Bryan T; Yang, Jun-Bo; Gao, Lian-Ming; Li, De-Zhu

2017-01-01

Schima is an ecologically and economically important woody genus in tea family (Theaceae). Unresolved species delimitations and phylogenetic relationships within Schima limit our understanding of the genus and hinder utilization of the genus for economic purposes. In the present study, we conducted comparative analysis among the complete chloroplast (cp) genomes of 11 Schima species. Our results indicate that Schima cp genomes possess a typical quadripartite structure, with conserved genomic structure and gene order. The size of the Schima cp genome is about 157 kilo base pairs (kb). They consistently encode 114 unique genes, including 80 protein-coding genes, 30 tRNAs, and 4 rRNAs, with 17 duplicated in the inverted repeat (IR). These cp genomes are highly conserved and do not show obvious expansion or contraction of the IR region. The percent variability of the 68 coding and 93 noncoding (>150 bp) fragments is consistently less than 3%. The seven most widely touted DNA barcode regions as well as one promising barcode candidate showed low sequence divergence. Eight mutational hotspots were identified from the 11 cp genomes. These hotspots may potentially be useful as specific DNA barcodes for species identification of Schima. The 58 cpSSR loci reported here are complementary to the microsatellite markers identified from the nuclear genome, and will be leveraged for further population-level studies. Phylogenetic relationships among the 11 Schima species were resolved with strong support based on the cp genome data set, which corresponds well with the species distribution pattern. The data presented here will serve as a foundation to facilitate species identification, DNA barcoding and phylogenetic reconstructions for future exploration of Schima.
A DNA minor groove electronegative potential genome map based on photo-chemical probing

DEFF Research Database (Denmark)

Lindemose, Søren; Nielsen, Peter Eigil; Hansen, Morten

2011-01-01

The double-stranded DNA of the genome contains both sequence information directly relating to the protein and RNA coding as well as functional and structural information relating to protein recognition. Only recently is the importance of DNA shape in this recognition process being fully appreciat...
Evaluating genome-wide DNA methylation changes in mice by Methylation Specific Digital Karyotyping

Directory of Open Access Journals (Sweden)

Maruoka Shuichiro

2008-12-01

Full Text Available Abstract Background The study of genome-wide DNA methylation changes has become more accessible with the development of various array-based technologies though when studying species other than human the choice of applications are limited and not always within reach. In this study, we adapted and tested the applicability of Methylation Specific Digital Karyotyping (MSDK, a non-array based method, for the prospective analysis of epigenetic changes after perinatal nutritional modifications in a mouse model of allergic airway disease. MSDK is a sequenced based method that allows a comprehensive and unbiased methylation profiling. The method generates 21 base pairs long sequence tags derived from specific locations in the genome. The resulting tag frequencies determine in a quantitative manner the methylation level of the corresponding loci. Results Genomic DNA from whole lung was isolated and subjected to MSDK analysis using the methylation-sensitive enzyme Not I as the mapping enzyme and Nla III as the fragmenting enzyme. In a pair wise comparison of the generated mouse MSDK libraries we identified 158 loci that are significantly differentially methylated (P-value = 0.05 after perinatal dietary changes in our mouse model. Quantitative methylation specific PCR and sequence analysis of bisulfate modified genomic DNA confirmed changes in methylation at specific loci. Differences in genomic MSDK tag counts for a selected set of genes, correlated well with changes in transcription levels as measured by real-time PCR. Furthermore serial analysis of gene expression profiling demonstrated a dramatic difference in expressed transcripts in mice exposed to perinatal nutritional changes. Conclusion The genome-wide methylation survey applied in this study allowed for an unbiased methylation profiling revealing subtle changes in DNA methylation in mice maternally exposed to dietary changes in methyl-donor content. The MSDK method is applicable for mouse models
Characterization of Actinomyces with genomic DNA fingerprints and rRNA gene probes.

Science.gov (United States)

Bowden, G; Johnson, J; Schachtele, C

1993-08-01

Cellular DNA from 25 Actinomyces naeslundii and Actinomyces viscosus strains belonging to the 7 taxonomic clusters of Fillery et al. (1978) and several unclustered strains was obtained by enzymatic and N-lauroylsarcosine/guanidine isothiocyanate treatment of whole cells, followed by extraction of the nucleic acid. The DNA samples were digested with restriction endonucleases BamHI or PvuII, and agarose gel electrophoresis was used to obtain DNA fingerprints. The DNA fragments were subjected to Southern blot hybridization with a digoxigenin-labeled cDNA probe transcribed from Escherichia coli 16S and 23S rRNA. The patterns of bands from genomic (DNA fingerprints) and rDNA fingerprints (ribotypes) were used for comparison between the taxonomic cluster strains and strains within clusters. Representative strains from each taxonomic cluster provided different BamHI DNA fingerprints and ribotype patterns with 3 to 9 distinct bands. Some strains within a cluster showed identical ribotype patterns with both endonucleases (A. naeslundii B120 and A. naeslundii B102 from cluster 3), while others showed the same pattern with BamHI but a different pattern with PvuII (A. naeslundii ATCC 12104 and 398A from cluster 5). A viscosus ATCC 15987 (cluster 7) and its parent strain T6 yielded identical fingerprint and ribotype patterns. The genomic diversity revealed by DNA fingerprinting and ribotyping demonstrates that these techniques, which do not require phenotypic expression, are suited for study of the oral ecology of the Actinomyces, and for epidemiological tracking of specific Actinomyces strains associated with caries lesions and sites of periodontal destruction.
DHX9 helicase is involved in preventing genomic instability induced by alternatively structured DNA in human cells.

Science.gov (United States)

Jain, Aklank; Bacolla, Albino; Del Mundo, Imee M; Zhao, Junhua; Wang, Guliang; Vasquez, Karen M

2013-12-01

Sequences that have the capacity to adopt alternative (i.e. non-B) DNA structures in the human genome have been implicated in stimulating genomic instability. Previously, we found that a naturally occurring intra-molecular triplex (H-DNA) caused genetic instability in mammals largely in the form of DNA double-strand breaks. Thus, it is of interest to determine the mechanism(s) involved in processing H-DNA. Recently, we demonstrated that human DHX9 helicase preferentially unwinds inter-molecular triplex DNA in vitro. Herein, we used a mutation-reporter system containing H-DNA to examine the relevance of DHX9 activity on naturally occurring H-DNA structures in human cells. We found that H-DNA significantly increased mutagenesis in small-interfering siRNA-treated, DHX9-depleted cells, affecting mostly deletions. Moreover, DHX9 associated with H-DNA in the context of supercoiled plasmids. To further investigate the role of DHX9 in the recognition/processing of H-DNA, we performed binding assays in vitro and chromatin immunoprecipitation assays in U2OS cells. DHX9 recognized H-DNA, as evidenced by its binding to the H-DNA structure and enrichment at the H-DNA region compared with a control region in human cells. These composite data implicate DHX9 in processing H-DNA structures in vivo and support its role in the overall maintenance of genomic stability at sites of alternatively structured DNA.
Automated Processing of 2-D Gel Electrophoretograms of Genomic DNA for Hunting Pathogenic DNA Molecular Changes.

Science.gov (United States)

Takahashi; Nakazawa; Watanabe; Konagaya

1999-01-01

We have developed the automated processing algorithms for 2-dimensional (2-D) electrophoretograms of genomic DNA based on RLGS (Restriction Landmark Genomic Scanning) method, which scans the restriction enzyme recognition sites as the landmark and maps them onto a 2-D electrophoresis gel. Our powerful processing algorithms realize the automated spot recognition from RLGS electrophoretograms and the automated comparison of a huge number of such images. In the final stage of the automated processing, a master spot pattern, on which all the spots in the RLGS images are mapped at once, can be obtained. The spot pattern variations which seemed to be specific to the pathogenic DNA molecular changes can be easily detected by simply looking over the master spot pattern. When we applied our algorithms to the analysis of 33 RLGS images derived from human colon tissues, we successfully detected several colon tumor specific spot pattern changes.
DNA methylation alteration is a major consequence of genome doubling in autotetraploid Brassica rapa

Directory of Open Access Journals (Sweden)

Xu Yanhao

2017-01-01

Full Text Available Polyploids are typically classified as autopolyploids or allopolyploids based on the origin of their chromosome sets. Autopolyploidy is much more common than traditionally believed. Allopolyploidization, accompanied by genomic and transcriptomic changes, has been well investigated. In this study, genetic, DNA methylation and gene expression changes in autotetraploid Brassica rapa were investigated. No genetic alteration was detected using an amplified fragment length polymorphism (AFLP approach. Using a cDNA-AFLP approach, approximately 0.58% of fragments showed changes in gene expression in autotetraploid B. rapa. The methylation-sensitive amplification polymorphism (MSAP analysis showed that approximately 1.7% of the fragments underwent DNA methylation changes upon genome doubling, with hypermethylation and demethylation changes equally affected. Fragments displaying changes in gene expression and methylation status were isolated and then sequenced and characterized, respectively. This study showed that variation in cytosine methylation is a major consequence of genome doubling in autotetraploid Brassica rapa.
Diversification of the celiac disease α-gliadin complex in wheat: a 33-mer peptide with six overlapping epitopes, evolved following polyploidization.

Science.gov (United States)

Ozuna, Carmen V; Iehisa, Julio C M; Giménez, María J; Alvarez, Juan B; Sousa, Carolina; Barro, Francisco

2015-06-01

The gluten proteins from wheat, barley and rye are responsible both for celiac disease (CD) and for non-celiac gluten sensitivity, two pathologies affecting up to 6-8% of the human population worldwide. The wheat α-gliadin proteins contain three major CD immunogenic peptides: p31-43, which induces the innate immune response; the 33-mer, formed by six overlapping copies of three highly stimulatory epitopes; and an additional DQ2.5-glia-α3 epitope which partially overlaps with the 33-mer. Next-generation sequencing (NGS) and Sanger sequencing of α-gliadin genes from diploid and polyploid wheat provided six types of α-gliadins (named 1-6) with strong differences in their frequencies in diploid and polyploid wheat, and in the presence and abundance of these CD immunogenic peptides. Immunogenic variants of the p31-43 peptide were found in most of the α-gliadins. Variants of the DQ2.5-glia-α3 epitope were associated with specific types of α-gliadins. Remarkably, only type 1 α-gliadins contained 33-mer epitopes. Moreover, the full immunodominant 33-mer fragment was only present in hexaploid wheat at low abundance, probably as the result of allohexaploidization events from subtype 1.2 α-gliadins found only in Aegilops tauschii, the D-genome donor of hexaploid wheat. Type 3 α-gliadins seem to be the ancestral type as they are found in most of the α-gliadin-expressing Triticeae species. These findings are important for reducing the incidence of CD by the breeding/selection of wheat varieties with low stimulatory capacity of T cells. Moreover, advanced genome-editing techniques (TALENs, CRISPR) will be easier to implement on the small group of α-gliadins containing only immunogenic peptides. © 2015 Society for Experimental Biology and John Wiley & Sons Ltd.
Rapid methods for the extraction and archiving of molecular grade fungal genomic DNA.

Science.gov (United States)

Borman, Andrew M; Palmer, Michael; Johnson, Elizabeth M

2013-01-01

The rapid and inexpensive extraction of fungal genomic DNA that is of sufficient quality for molecular approaches is central to the molecular identification, epidemiological analysis, taxonomy, and strain typing of pathogenic fungi. Although many commercially available and in-house extraction procedures do eliminate the majority of contaminants that commonly inhibit molecular approaches, the inherent difficulties in breaking fungal cell walls lead to protocols that are labor intensive and that routinely take several hours to complete. Here we describe several methods that we have developed in our laboratory that allow the extremely rapid and inexpensive preparation of fungal genomic DNA.
A truncated receptor-binding domain of MERS-CoV spike protein potently inhibits MERS-CoV infection and induces strong neutralizing antibody responses: implication for developing therapeutics and vaccines.

Directory of Open Access Journals (Sweden)

Lanying Du

Full Text Available An emerging respiratory infectious disease with high mortality, Middle East respiratory syndrome (MERS, is caused by a novel coronavirus (MERS-CoV. It was first reported in 2012 in Saudi Arabia and has now spread to eight countries. Development of effective therapeutics and vaccines is crucial to save lives and halt the spread of MERS-CoV. Here, we show that a recombinant protein containing a 212-amino acid fragment (residues 377-588 in the truncated receptor-binding domain (RBD: residues 367-606 of MERS-CoV spike (S protein fused with human IgG Fc fragment (S377-588-Fc is highly expressed in the culture supernatant of transfected 293T cells. The purified S377-588-Fc protein efficiently binds to dipeptidyl peptidase 4 (DPP4, the receptor of MERS-CoV, and potently inhibited MERS-CoV infection, suggesting its potential to be further developed as a therapeutic modality for treating MERS-CoV infection and saving the patients' lives. The recombinant S377-588-Fc is able to induce in the vaccinated mice strong MERS-CoV S-specific antibodies, which blocks the binding of RBD to DPP4 receptor and effectively neutralizes MERS-CoV infection. These findings indicate that this truncated RBD protein shows promise for further development as an effective and safe vaccine for the prevention of MERS-CoV infection.
Development and analysis of a tick-borne encephalitis virus infectious clone using a novel and rapid strategy.

Science.gov (United States)

Gritsun, T S; Gould, E A

1998-12-01

In less than 1 month we have constructed an infectious clone of attenuated tick-borne encephalitis virus (strain Vasilchenko) from 100 microl of unpurified virus suspension using long high fidelity PCR and a modified bacterial cloning system. Optimization of the 3' antisense primer concentration was essential to achieve PCR synthesis of an 11 kb cDNA copy of RNA from infectious virus. A novel system utilising two antisense primers, a 14-mer for reverse transcription and a 35-mer for long PCR, produced high yields of genomic length cDNA. Use of low copy number Able K cells and an incubation temperature of 28 degrees C increased the genetic stability of cloned cDNA. Clones containing 11 kb cDNA inserts produced colonies of reduced size, thus providing a positive selection system for full length clones. Sequencing of the infectious clone emphasised the improved fidelity of the method compared with conventional PCR and cloning methods. A simple and rapid strategy for genetic manipulation of the infectious clone is also described. These developments represent a significant advance in recombinant technology and should be applicable to positive stranded RNA viruses which cannot easily be purified or genetically manipulated.
Myeloperoxidase-produced Genomic DNA-centered Radicals and Protection by Resveratrol

Science.gov (United States)

Myeloperoxidase (MPO) released by activated neutrophils, production of hypochlorous acid (HOCI) and oxidation of the genomic DNA in epithelial cells is thought to initiate and promote carcinogenesis. In this study we applied the 5,5-dimethyl-l-pyrroline N-oxide (DMPO)-based i;nmu...

Isolation and characterization of cDNA encoding the 80-kDa subunit protein of the human autoantigen Ku (p70/p80) recognized by autoantibodies from patients with scleroderma-polymyositis overlap syndrome

International Nuclear Information System (INIS)

Mimori, Tsuneyo; Ohosone, Yasuo; Hama, Nobuaki; Suwa, Akira; Akizuki, Masashi; Homma, Mitsuo; Griffith, A.J.; Hardin, J.A.

1990-01-01

Anti-Ku (p70/p80) autoantibodies in patients with scleroderma-polymyositis overlap syndrome recognize a 70-kDa/80-kDa protein heterodimer which binds to terminal regions of double-stranded DNA. In the present study, the authors isolated full-length cDNAs that encode the 80-kDa Ku subunit. Initial screening of a human spleen cDNA library with anti-Ku antibodies yielded a cDNA of 1.0 kilobase (kb) (termed K71) encoding a portion of the 80-kDa Ku polypeptide (identification based on immunological criteria). In RNA blots, this cDNA hybridized with two mRNAs of 3.4 and 2.6 kb. In vitro transcription and translation experiments produced an immunoprecipitable polypeptide which comigrated with the 80-kDa Ku subunit. The Ku80-6 cDNA proved to be 3304 nucleotides in length, with an additional poly(A) tail, closely approximating the size of the larger mRNA. It contains a single long open reading frame encoding 732 amino acids. The putative polypeptide has a high content of acidic amino acids and a region with periodic repeat of leucine in every seventh position which may form the leucine zipper structure. In genomic DNA blots, probes derived from the opposite ends of cDNA Ku80-6 hybridized with several nonoverlapping restriction fragments from human leukocyte DNA, indicating that the gene encoding the 80-kDa Ku polypeptide is divided into several exons by intervening sequences
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798)

OpenAIRE

Dimitrova, Daniela; Engelbrecht, Kathleen C.; Putonti, Catherine; Koenig, David W.; Wolfe, Alan J.

2017-01-01

ABSTRACT Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E.?coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496?bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid.
Rhipicephalus microplus dataset of nonredundant raw sequence reads from 454 GS FLX sequencing of Cot-selected (Cot = 660) genomic DNA

Science.gov (United States)

A reassociation kinetics-based approach was used to reduce the complexity of genomic DNA from the Deutsch laboratory strain of the cattle tick, Rhipicephalus microplus, to facilitate genome sequencing. Selected genomic DNA (Cot value = 660) was sequenced using 454 GS FLX technology, resulting in 356...
CAFE: aCcelerated Alignment-FrEe sequence analysis.

Science.gov (United States)

Lu, Yang Young; Tang, Kujin; Ren, Jie; Fuhrman, Jed A; Waterman, Michael S; Sun, Fengzhu

2017-07-03

Alignment-free genome and metagenome comparisons are increasingly important with the development of next generation sequencing (NGS) technologies. Recently developed state-of-the-art k-mer based alignment-free dissimilarity measures including CVTree, $d_2^*$ and $d_2^S$ are more computationally expensive than measures based solely on the k-mer frequencies. Here, we report a standalone software, aCcelerated Alignment-FrEe sequence analysis (CAFE), for efficient calculation of 28 alignment-free dissimilarity measures. CAFE allows for both assembled genome sequences and unassembled NGS shotgun reads as input, and wraps the output in a standard PHYLIP format. In downstream analyses, CAFE can also be used to visualize the pairwise dissimilarity measures, including dendrograms, heatmap, principal coordinate analysis and network display. CAFE serves as a general k-mer based alignment-free analysis platform for studying the relationships among genomes and metagenomes, and is freely available at https://github.com/younglululu/CAFE. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
Coevolution between Nuclear-Encoded DNA Replication, Recombination, and Repair Genes and Plastid Genome Complexity.

Science.gov (United States)

Zhang, Jin; Ruhlman, Tracey A; Sabir, Jamal S M; Blazier, John Chris; Weng, Mao-Lun; Park, Seongjun; Jansen, Robert K

2016-02-17

Disruption of DNA replication, recombination, and repair (DNA-RRR) systems has been hypothesized to cause highly elevated nucleotide substitution rates and genome rearrangements in the plastids of angiosperms, but this theory remains untested. To investigate nuclear-plastid genome (plastome) coevolution in Geraniaceae, four different measures of plastome complexity (rearrangements, repeats, nucleotide insertions/deletions, and substitution rates) were evaluated along with substitution rates of 12 nuclear-encoded, plastid-targeted DNA-RRR genes from 27 Geraniales species. Significant correlations were detected for nonsynonymous (dN) but not synonymous (dS) substitution rates for three DNA-RRR genes (uvrB/C, why1, and gyrA) supporting a role for these genes in accelerated plastid genome evolution in Geraniaceae. Furthermore, correlation between dN of uvrB/C and plastome complexity suggests the presence of nucleotide excision repair system in plastids. Significant correlations were also detected between plastome complexity and 13 of the 90 nuclear-encoded organelle-targeted genes investigated. Comparisons revealed significant acceleration of dN in plastid-targeted genes of Geraniales relative to Brassicales suggesting this correlation may be an artifact of elevated rates in this gene set in Geraniaceae. Correlation between dN of plastid-targeted DNA-RRR genes and plastome complexity supports the hypothesis that the aberrant patterns in angiosperm plastome evolution could be caused by dysfunction in DNA-RRR systems. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Deep Investigation of Arabidopsis thaliana Junk DNA Reveals a Continuum between Repetitive Elements and Genomic Dark Matter

Science.gov (United States)

Maumus, Florian; Quesneville, Hadi

2014-01-01

Eukaryotic genomes contain highly variable amounts of DNA with no apparent function. This so-called junk DNA is composed of two components: repeated and repeat-derived sequences (together referred to as the repeatome), and non-annotated sequences also known as genomic dark matter. Because of their high duplication rates as compared to other genomic features, transposable elements are predominant contributors to the repeatome and the products of their decay is thought to be a major source of genomic dark matter. Determining the origin and composition of junk DNA is thus important to help understanding genome evolution as well as host biology. In this study, we have used a combination of tools enabling to show that the repeatome from the small and reducing A. thaliana genome is significantly larger than previously thought. Furthermore, we present the concepts and results from a series of innovative approaches suggesting that a significant amount of the A. thaliana dark matter is of repetitive origin. As a tentative standard for the community, we propose a deep compendium annotation of the A. thaliana repeatome that may help addressing farther genome evolution as well as transcriptional and epigenetic regulation in this model plant. PMID:24709859
A simple method of genomic DNA extraction suitable for analysis of bulk fungal strains.

Science.gov (United States)

Zhang, Y J; Zhang, S; Liu, X Z; Wen, H A; Wang, M

2010-07-01

A simple and rapid method (designated thermolysis) for extracting genomic DNA from bulk fungal strains was described. In the thermolysis method, a few mycelia or yeast cells were first rinsed with pure water to remove potential PCR inhibitors and then incubated in a lysis buffer at 85 degrees C to break down cell walls and membranes. This method was used to extract genomic DNA from large numbers of fungal strains (more than 92 species, 35 genera of three phyla) isolated from different sections of natural Ophiocordyceps sinensis specimens. Regions of interest from high as well as single-copy number genes were successfully amplified from the extracted DNA samples. The DNA samples obtained by this method can be stored at -20 degrees C for over 1 year. The method was effective, easy and fast and allowed batch DNA extraction from multiple fungal isolates. Use of the thermolysis method will allow researchers to obtain DNA from fungi quickly for use in molecular assays. This method requires only minute quantities of starting material and is suitable for diverse fungal species.
The Role of DNA Methylation Changes in Radiation-Induced Transgenerational Genomic Instability and Bystander Effects in cranial irradiated Mice

Science.gov (United States)

Zhang, Meng; Sun, Yeqing; Gao, Yinglong; Zhang, Baodong

Heavy-ion radiation could lead to genome instability in the germline, and therefore to transgenerational genome and epigenome instability in offspring of exposed males. The exact mechanisms of radiation-induced genome instability in directly exposed and in bystander organ remain obscure, yet accumulating evidence points to the role of DNA methylation changes in genome instability development. The potential of localized body-part exposures to affect the germline and thus induce genome and epigenome changes in the progeny has not been studied. To investigate whether or not the paternal cranial irradiation can exert deleterious changes in the protected germline and the offsprings, we studied the alteration of DNA methylation in the shielded testes tissue. Here we report that the localized paternal cranial irradiation results in a significant altered DNA methylation in sperm cells and leads to a profound epigenetic dysregulation in the unexposed progeny conceived 3 months after paternal exposure. The possible molecular mechanisms and biological consequences of the observed changes are discussed. Keywords: Heavy-ion radiation; Transgenerational effect; Genomic Instability Bystander Effects; DNA methylation.
cDNA, genomic sequence cloning and overexpression of ribosomal ...

African Journals Online (AJOL)

RPS16 of eukaryote is a component of the 40S small ribosomal subunit encoded by RPS16 gene and is also a homolog of prokaryotic RPS9. The cDNA and genomic sequence of RPS16 was cloned successfully for the first time from the Giant Panda (Ailuropoda melanoleuca) using reverse transcription-polymerase chain ...
cDNA, genomic sequence cloning and overexpression of ribosomal ...

African Journals Online (AJOL)

PRECIOUS

2009-11-02

Nov 2, 2009 ... basic machinery of protein synthesis and regulation, but also in various ... The genomic DNA was isolated from Giant Panda muscle tissue according to the ... for 45 s, 72°C for 2 min in the first cycle and the anneal temperature deceased 0.2°C ..... edition, Cold Spring Harbor aboratory Press. Cold Spring ...
Reconstitution of wild type viral DNA in simian cells transfected with early and late SV40 defective genomes.

Science.gov (United States)

O'Neill, F J; Gao, Y; Xu, X

1993-11-01

The DNAs of polyomaviruses ordinarily exist as a single circular molecule of approximately 5000 base pairs. Variants of SV40, BKV and JCV have been described which contain two complementing defective DNA molecules. These defectives, which form a bipartite genome structure, contain either the viral early region or the late region. The defectives have the unique property of being able to tolerate variable sized reiterations of regulatory and terminus region sequences, and portions of the coding region. They can also exchange coding region sequences with other polyomaviruses. It has been suggested that the bipartite genome structure might be a stage in the evolution of polyomaviruses which can uniquely sustain genome and sequence diversity. However, it is not known if the regulatory and terminus region sequences are highly mutable. Also, it is not known if the bipartite genome structure is reversible and what the conditions might be which would favor restoration of the monomolecular genome structure. We addressed the first question by sequencing the reiterated regulatory and terminus regions of E- and L-SV40 DNAs. This revealed a large number of mutations in the regulatory regions of the defective genomes, including deletions, insertions, rearrangements and base substitutions. We also detected insertions and base substitutions in the T-antigen gene. We addressed the second question by introducing into permissive simian cells, E- and L-SV40 genomes which had been engineered to contain only a single regulatory region. Analysis of viral DNA from transfected cells demonstrated recombined genomes containing a wild type monomolecular DNA structure. However, the complete defectives, containing reiterated regulatory regions, could often compete away the wild type genomes. The recombinant monomolecular genomes were isolated, cloned and found to be infectious. All of the DNA alterations identified in one of the regulatory regions of E-SV40 DNA were present in the recombinant
Structural organization of DNA in chlorella viruses.

Directory of Open Access Journals (Sweden)

Timo Wulfmeyer

Full Text Available Chlorella viruses have icosahedral capsids with an internal membrane enclosing their large dsDNA genomes and associated proteins. Their genomes are packaged in the particles with a predicted DNA density of ca. 0.2 bp nm(-3. Occasionally infection of an algal cell by an individual particle fails and the viral DNA is dynamically ejected from the capsid. This shows that the release of the DNA generates a force, which can aid in the transfer of the genome into the host in a successful infection. Imaging of ejected viral DNA indicates that it is intimately associated with proteins in a periodic fashion. The bulk of the protein particles detected by atomic force microscopy have a size of ∼60 kDa and two proteins (A278L and A282L of about this size are among 6 basic putative DNA binding proteins found in a proteomic analysis of DNA binding proteins packaged in the virion. A combination of fluorescence images of ejected DNA and a bioinformatics analysis of the DNA reveal periodic patterns in the viral DNA. The periodic distribution of GC rich regions in the genome provides potential binding sites for basic proteins. This DNA/protein aggregation could be responsible for the periodic concentration of fluorescently labeled DNA observed in ejected viral DNA. Collectively the data indicate that the large chlorella viruses have a DNA packaging strategy that differs from bacteriophages; it involves proteins and share similarities to that of chromatin structure in eukaryotes.
Feasibility of N-Gram Data-Structures for Next-Generation Pathogen Signature Design

Energy Technology Data Exchange (ETDEWEB)

Gardner, S N

2009-01-26

We determined the most appropriate data structure for handling n-gram (also known as k-mer) string comparisons and storage for genomic sequence data that will scale in terms of memory and speed. This is critical to maintain LLNL as the leader in pathogen detection, as it will guide the design of the 'Next Generation' system for computational signature prediction. There are two parts to k-mer analysis for signature prediction that we investigated. First is the enumeration and frequency counting of all observed k-mers in a sequence database (k-mer is a biological term equivalent to the CS term n-gram). Second is the down-selection and pairing of k-mers to generate a signature. We determined that for the first part, suffix arrays are the preferred method to enumerate k-mers, being memory efficient and relatively easy and fast to compute. For the second part, a subset of the k-mers can be stored and manipulated in a hash, that subset determination based on desired frequency characteristics such as most/least frequent from a set, shared among sequence sets, or discriminating across sequence sets.
Development and validation of an rDNA operon based primer walking strategy applicable to de novo bacterial genome finishing.

Directory of Open Access Journals (Sweden)

Alexander William Eastman

2015-01-01

Full Text Available Advances in sequencing technology have drastically increased the depth and feasibility of bacterial genome sequencing. However, little information is available that details the specific techniques and procedures employed during genome sequencing despite the large numbers of published genomes. Shotgun approaches employed by second-generation sequencing platforms has necessitated the development of robust bioinformatics tools for in silico assembly, and complete assembly is limited by the presence of repetitive DNA sequences and multi-copy operons. Typically, re-sequencing with multiple platforms and laborious, targeted Sanger sequencing are employed to finish a draft bacterial genome. Here we describe a novel strategy based on the identification and targeted sequencing of repetitive rDNA operons to expedite bacterial genome assembly and finishing. Our strategy was validated by finishing the genome of Paenibacillus polymyxa strain CR1, a bacterium with potential in sustainable agriculture and bio-based processes. An analysis of the 38 contigs contained in the P. polymyxa strain CR1 draft genome revealed 12 repetitive rDNA operons with varied intragenic and flanking regions of variable length, unanimously located at contig boundaries and within contig gaps. These highly similar but not identical rDNA operons were experimentally verified and sequenced simultaneously with multiple, specially designed primer sets. This approach also identified and corrected significant sequence rearrangement generated during the initial in silico assembly of sequencing reads. Our approach reduces the required effort associated with blind primer walking for contig assembly, increasing both the speed and feasibility of genome finishing. Our study further reinforces the notion that repetitive DNA elements are major limiting factors for genome finishing. Moreover, we provided a step-by-step workflow for genome finishing, which may guide future bacterial genome finishing
Push-pull effect on the geometrical, optical and charge transfer properties of disubstituted derivatives of mer-tris(4-hydroxy-1,5-naphthyridinato aluminum (mer-AlND3

Directory of Open Access Journals (Sweden)

Rao Joshi Laxmikanth

2016-01-01

Full Text Available To design innovative and novel optical materials with high mobility, two kinds of disubstituted derivatives for mer-tris(4-hydroxy-1,5-naphthyridinato aluminum (mer-AlND3 with push (EDG–pull (EWG substituents have been designed. The structures of mer-tris(8-EDG-2-EWG-4-hydroxy-1,5-naphthyridinato aluminum (type I and mer-tris(8-EWG-2-EDG-4-hydroxy-1,5-naphthyridinato aluminum (type II in the ground and first excited states have been optimized at the B3LYP/6-31G(D and CIS/6-31G(D level of theory, respectively. It can be seen from frontier molecular orbitals analysis, in all these complexes, the highest occupied molecular orbital (HOMO is localized on the pyridine-4-ol ring of A-ligand while lowest unoccupied molecular orbital (LUMO is on the pyridyl ring of B-ligand in ground state irrespective of electron donor/acceptor substitution present on the ligands similar to that of mer-tris(8-hydroxyquinoline aluminum (mer-Alq3 and parent mer-AlND3.The absorption and emission wavelengths have been evaluated at the TD-PBE0/6-31G(D level and it can be see that all the type I derivatives show blue shift while most of the type II derivatives show red shift compared to mer-AlND3. All the disubstituted complexes have showed hypsochromic shifts in both the absorption and emission spectra when compared with the calculated absorption and emission spectra respectively of mer-Alq3. It can be seen that the reorganization energies of some of the disubstituted derivatives are comparable with mer-Alq3 and these derivatives might be good candidates for emitting materials in OLED.
Cloning and DNA sequence of the mercuric- and organomercurial-resistance determinants of plasmid pDU1358

International Nuclear Information System (INIS)

Griffin, H.G.; Foster, T.J.; Silver, S.; Misra, T.K.

1987-01-01

The broad-spectrum mercurial-resistance plasmid pDU1358 was analyzed by cloning the resistance determinants and preparing a physical and genetic map of a 45-kilobase (kb) region of the plasmid that contains two separate mercurial-resistance operons that mapped about 20 kb apart. One encoded narrow-spectrum mercurial resistance to Hg 2+ and a few organomercurials; the other specified broad-spectrum resistance to phenylmercury and additional organomercurials. Each determinant governed mercurial transport functions. Southern DNA x DNA hybridization experiments using gene-specific probes from the plasmid R100 mer operon indicated close homology with the R100 deteminant. The 2153 base pairs of the promoter-distal part of the broad-spectrum Hg 2+ -resistance operon of pDU1358 were sequenced. This region included the 3'-terminal part of the merA gene, merD, unidentified reading frame URF1, and a part of URF2 homologous to previously sequenced determinants of plasmid R100. Between the merA and merD genes, an open reading frame encoding a 212 amino acid polypeptide was identified as the merB gene that determines the enzyme organomercurial lyase that cleaves the C-Hg bond of phenylmercury
Membrane-initiated non-genomic signaling by estrogens in the hypothalamus: cross-talk with glucocorticoids with implications for behavior

Directory of Open Access Journals (Sweden)

Jennifer eRainville

2015-02-01

Full Text Available The estrogen receptor (ER and glucocorticoid receptor (GR are members of the nuclear receptor superfamily that can signal using both non-genomic and genomic transcriptional modes. Though genomic modes of signaling have been well characterized and several behaviors attributed to this signaling mechanism, the physiological significance of non-genomic modes of signaling has not been well understood. This has partly been due to the controversy regarding the identity of the membrane ER (mER or membrane GR (mGR that may mediate rapid, non-genomic signaling and the downstream signaling cascades that may result as a consequence of steroid ligands binding the mER or the mGR. Both estrogens and glucocorticoids exert a number of actions on the hypothalamus, including feedback. This review focuses on the various candidates for the mER or mGR in the hypothalamus and the contribution of non-genomic signaling to classical hypothalamically-driven behaviors and changes in neuronal morphology. It also attempts to categorize some of the possible functions of non-genomic signaling at both the cellular level and at the organismal level that are relevant for behavior, including some behaviors that are regulated by both estrogens and glucocorticoids in a potentially synergistic manner. Lastly, it attempts to show that steroid signaling via non-genomic modes may provide the organism with rapid behavioral responses to stimuli.
A new approach for cloning hLIF cDNA from genomic DNA isolated from the oral mucous membrane.

Science.gov (United States)

Cui, Y H; Zhu, G Q; Chen, Q J; Wang, Y F; Yang, M M; Song, Y X; Wang, J G; Cao, B Y

2011-11-25

Complementary DNA (cDNA) is valuable for investigating protein structure and function in the study of life science, but it is difficult to obtain by traditional reverse transcription. We employed a novel strategy to clone human leukemia inhibitory factor (hLIF) gene cDNA from genomic DNA, which was directly isolated from the mucous membrane of mouth. The hLIF sequence, which is 609 bp long and is composed of three exons, can be acquired within a few hours by amplifying each exon and splicing all of them using overlap-PCR. This new approach developed is simple, time- and cost-effective, without RNA preparation or cDNA synthesis, and is not limited to the specific tissues for a particular gene and the expression level of the gene.
Genomics and radical mediated DNA damage: major differences between ionizing radiation and DNA-cleaving enediynes

International Nuclear Information System (INIS)

Cosgrove, J.P.; Begley, T.J.; Samson, L.D.; Dedon, P.C.

2003-01-01

While the evidence is strong for radical-mediated oxidative processes in the pathophysiology of cancer and aging, the mechanisms by which cells respond to oxidative stress have eluded definition. To this end, we have undertaken genomic studies comparing the response of S. cerevisiae to DNA-specific oxidizing agents, the enediynes calicheamicin (CAL), esperamicin (ESP), and neocarzinostatin (NCS), and the non-specific gamma-radiation (RAD). While RAD results in relatively indiscriminate oxidation of cellular molecules, the enediynes are highly specific to DNA and produce damage by a common mechanism involving radical-mediated oxidation of deoxyribose. Transcriptional profiling in response to these agents (80% survival; 15 min exposure; Affymetrix) revealed unexpected differences between RAD and the enediynes and among the three enediynes. Only 2 genes responded in common to all agents, while 9 genes were regulated in common for the 3 enediynes (no DNA repair genes altered in common). The limited common gene expression changes for the 3 enediynes may result from differences in deoxyribose oxidation chemistry, DNA and chromatin targets or the proportions of single- and double-strand DNA lesions. RAD produced a more robust response than the enediynes, altering expression of 195 and 52 genes by more than 2- and 5-fold, respectively, compared to 16-44 and *2 genes, respectively, for the enediynes. This suggests that the transcriptional response varies in intensity according to the number of cellular features affected by the toxin. Genes showing the strongest up-regulation with RAD: ribonucleotide reductase, multidrug resistance, DS break repair/RAD51, GSH transferase; strongly reduced gene expression: TEL1 (damage signaling), NAT2 (acetyltransferase). Genomic phenotyping studies, using a subset of the Research Genetics deletion library, revealed that loss of apn1, the major AP endonuclease, caused resistance to NCS, possibly due to reduced formation of protein-DNA cross
Cdc45-induced loading of human RPA onto single-stranded DNA.

Science.gov (United States)

Szambowska, Anna; Tessmer, Ingrid; Prus, Piotr; Schlott, Bernhard; Pospiech, Helmut; Grosse, Frank

2017-04-07

Cell division cycle protein 45 (Cdc45) is an essential component of the eukaryotic replicative DNA helicase. We found that human Cdc45 forms a complex with the single-stranded DNA (ssDNA) binding protein RPA. Moreover, it actively loads RPA onto nascent ssDNA. Pull-down assays and surface plasmon resonance studies revealed that Cdc45-bound RPA complexed with ssDNA in the 8-10 nucleotide binding mode, but dissociated when RPA covered a 30-mer. Real-time analysis of RPA-ssDNA binding demonstrated that Cdc45 catalytically loaded RPA onto ssDNA. This placement reaction required physical contacts of Cdc45 with the RPA70A subdomain. Our results imply that Cdc45 controlled stabilization of the 8-nt RPA binding mode, the subsequent RPA transition into 30-mer mode and facilitated an ordered binding to ssDNA. We propose that a Cdc45-mediated loading guarantees a seamless deposition of RPA on newly emerging ssDNA at the nascent replication fork. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

PATtyFams: Protein families for the microbial genomes in the PATRIC database

Directory of Open Access Journals (Sweden)

James J Davis

2016-02-01

Full Text Available The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL. This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.
Complete DNA sequence of the linear mitochondrial genome of the pathogenic yeast Candida parapsilosis

DEFF Research Database (Denmark)

Nosek, J.; Novotna, M.; Hlavatovicova, Z.

2004-01-01

The complete sequence of the mitochondrial DNA of the opportunistic yeast pathogen Candida parapsilosis was determined. The mitochondrial genome is represented by linear DNA molecules terminating with tandem repeats of a 738-bp unit. The number of repeats varies, thus generating a population...
Analyzing MER Uplink Reports

Science.gov (United States)

Savin, Stephen C.

2005-01-01

The MER project includes two rovers working simultaneously on opposite sides of Mars each receiving commands only once a day. Creating this uplink is critical, since a failed uplink means a lost day and a waste of money. Examining the process of creating this uplink, I tracked the use of the system developed for requesting observations as well as the development, from stage to stage, in forming an activity plan. I found the system for requesting observations was commonly misused, if used at all. There are half a dozen reports to document the creation of the uplink plan and often there are discrepancies among them. Despite this, the uplink process worked very well and MER has been one of the most successful missions for NASA in recent memory. Still it is clear there is room for improvement.
ReseqChip: Automated integration of multiple local context probe data from the MitoChip array in mitochondrial DNA sequence assembly

Directory of Open Access Journals (Sweden)

Spang Rainer

2009-12-01

Full Text Available Abstract Background The Affymetrix MitoChip v2.0 is an oligonucleotide tiling array for the resequencing of the human mitochondrial (mt genome. For each of 16,569 nucleotide positions of the mt genome it holds two sets of four 25-mer probes each that match the heavy and the light strand of a reference mt genome and vary only at their central position to interrogate all four possible alleles. In addition, the MitoChip v2.0 carries alternative local context probes to account for known mtDNA variants. These probes have been neglected in most studies due to the lack of software for their automated analysis. Results We provide ReseqChip, a free software that automates the process of resequencing mtDNA using multiple local context probes on the MitoChip v2.0. ReseqChip significantly improves base call rate and sequence accuracy. ReseqChip is available at http://code.open-bio.org/svnweb/index.cgi/bioperl/browse/bioperl-live/trunk/Bio/Microarray/Tools/. Conclusions ReseqChip allows for the automated consolidation of base calls from alternative local mt genome context probes. It thereby improves the accuracy of resequencing, while reducing the number of non-called bases.
Comparison of kDNA PCR-hybridization assay with three PCR methods for canines visceral Leishmaniasis diagnosis

Energy Technology Data Exchange (ETDEWEB)

Pilatti, Marcia M.; Andrade, Antero S.R. [Centro de Desenvolvimento da Tecnologia Nuclear (CDTN/CNEN-MG), Belo Horizonte, MG (Brazil)], e-mail: marciapilatti@yahoo.com.br, e-mail: antero@cdtn.br; Ferreira, Sidney A. [Universidade Federal de Minas Gerais (UFMG), Belo Horizonte, MG (Brazil). Dept. de Parasitologia], e-mail: saninoalmeida@gmail.com

2009-07-01

The sensitivity of the kDNA PCR-Hybridization assay, which uses radioactive DNA probes (labeled with {sup 32}P), was compared with three conventional PCR methods used for canine visceral leishmaniasis diagnosis. All PCR methods had two steps: a first amplification followed by hybridization or by a new amplification (nested or semi nested). Two methods (kDNA PCR-Hybridization and kDNA snPCR) used primers addressed to kinetoplast minicircles and the other two methods to the coding (LnPCR) and intergenic noncoding regions (ITS-1 nPCR) of the ribosomal rRNA genes. The comparison was accomplished in two groups of 23 infected dogs using samples collected by the conjunctival swab procedure. In the Group 1 the DNA was extracted from cotton swabs by phenol-chloroform and in Group 2 by boiling. The most efficient PCR methods in the Group 1 were those based on kDNA targets. The kDNA PCR-Hybridization was able to detect parasites in 22/23 dogs (95.6%) and in 40/46 samples (86.9%). The kDNA snPCR was positive for 21/23 dogs (91.3%) and for 40/46 samples (86.9%). The positivities of the kDNA based methods were significantly higher than the positivities verified for the methods based on ribosomal rRNA genes (p<0.05). In the Group 2 the kDNA PCR- Hybridization showed a better performance detecting parasites in 18/23 dogs (78.3%) and in 31/46 samples (67.4%), significantly higher than the other three methods (p<0.05). The higher sensitivity of the minicircle kDNA based assays reported by others was confirmed in this study and kDNA PCR-Hybridization showed the best sensitivity among the assays evaluated. (author)
Comparison of kDNA PCR-hybridization assay with three PCR methods for canines visceral Leishmaniasis diagnosis

International Nuclear Information System (INIS)

Pilatti, Marcia M.; Andrade, Antero S.R.; Ferreira, Sidney A.

2009-01-01

The sensitivity of the kDNA PCR-Hybridization assay, which uses radioactive DNA probes (labeled with 32 P), was compared with three conventional PCR methods used for canine visceral leishmaniasis diagnosis. All PCR methods had two steps: a first amplification followed by hybridization or by a new amplification (nested or semi nested). Two methods (kDNA PCR-Hybridization and kDNA snPCR) used primers addressed to kinetoplast minicircles and the other two methods to the coding (LnPCR) and intergenic noncoding regions (ITS-1 nPCR) of the ribosomal rRNA genes. The comparison was accomplished in two groups of 23 infected dogs using samples collected by the conjunctival swab procedure. In the Group 1 the DNA was extracted from cotton swabs by phenol-chloroform and in Group 2 by boiling. The most efficient PCR methods in the Group 1 were those based on kDNA targets. The kDNA PCR-Hybridization was able to detect parasites in 22/23 dogs (95.6%) and in 40/46 samples (86.9%). The kDNA snPCR was positive for 21/23 dogs (91.3%) and for 40/46 samples (86.9%). The positivities of the kDNA based methods were significantly higher than the positivities verified for the methods based on ribosomal rRNA genes (p<0.05). In the Group 2 the kDNA PCR- Hybridization showed a better performance detecting parasites in 18/23 dogs (78.3%) and in 31/46 samples (67.4%), significantly higher than the other three methods (p<0.05). The higher sensitivity of the minicircle kDNA based assays reported by others was confirmed in this study and kDNA PCR-Hybridization showed the best sensitivity among the assays evaluated. (author)
A compact view of isochores in the draft human genome sequence

Czech Academy of Sciences Publication Activity Database

Pavlíček, Adam; Pačes, Jan; Clay, O.; Bernardi, G.

2002-01-01

Roč. 511, 1-3 (2002), s. 165-169 ISSN 0014-5793 R&D Projects: GA MŠk LN00A079 Keywords : genome organisation * mammalian DNA * human genome Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.912, year: 2002
Genomic DNA sequence and cytosine methylation changes of adult rice leaves after seeds space flight

Science.gov (United States)

Shi, Jinming

In this study, cytosine methylation on CCGG site and genomic DNA sequence changes of adult leaves of rice after seeds space flight were detected by methylation-sensitive amplification polymorphism (MSAP) and Amplified fragment length polymorphism (AFLP) technique respectively. Rice seeds were planted in the trial field after 4 days space flight on the shenzhou-6 Spaceship of China. Adult leaves of space-treated rice including 8 plants chosen randomly and 2 plants with phenotypic mutation were used for AFLP and MSAP analysis. Polymorphism of both DNA sequence and cytosine methylation were detected. For MSAP analysis, the average polymorphic frequency of the on-ground controls, space-treated plants and mutants are 1.3%, 3.1% and 11% respectively. For AFLP analysis, the average polymorphic frequencies are 1.4%, 2.9%and 8%respectively. Total 27 and 22 polymorphic fragments were cloned sequenced from MSAP and AFLP analysis respectively. Nine of the 27 fragments from MSAP analysis show homology to coding sequence. For the 22 polymorphic fragments from AFLP analysis, no one shows homology to mRNA sequence and eight fragments show homology to repeat region or retrotransposon sequence. These results suggest that although both genomic DNA sequence and cytosine methylation status can be effected by space flight, the genomic region homology to the fragments from genome DNA and cytosine methylation analysis were different.
Identification of DNA repair genes in the human genome

International Nuclear Information System (INIS)

Hoeijmakers, J.H.J.; van Duin, M.; Westerveld, A.; Yasui, A.; Bootsma, D.

1986-01-01

To identify human DNA repair genes we have transfected human genomic DNA ligated to a dominant marker to excision repair deficient xeroderma pigmentosum (XP) and CHO cells. This resulted in the cloning of a human gene, ERCC-1, that complements the defect of a UV- and mitomycin-C sensitive CHO mutant 43-3B. The ERCC-1 gene has a size of 15 kb, consists of 10 exons and is located in the region 19q13.2-q13.3. Its primary transcript is processed into two mRNAs by alternative splicing of an internal coding exon. One of these transcripts encodes a polypeptide of 297 aminoacids. A putative DNA binding protein domain and nuclear location signal could be identified. Significant AA-homology is found between ERCC-1 and the yeast excision repair gene RAD10. 58 references, 6 figures, 1 table
Incidence of genome structure, DNA asymmetry, and cell physiology on T-DNA integration in chromosomes of the phytopathogenic fungus Leptosphaeria maculans.

Science.gov (United States)

Bourras, Salim; Meyer, Michel; Grandaubert, Jonathan; Lapalu, Nicolas; Fudal, Isabelle; Linglin, Juliette; Ollivier, Benedicte; Blaise, Françoise; Balesdent, Marie-Hélène; Rouxel, Thierry

2012-08-01

The ever-increasing generation of sequence data is accompanied by unsatisfactory functional annotation, and complex genomes, such as those of plants and filamentous fungi, show a large number of genes with no predicted or known function. For functional annotation of unknown or hypothetical genes, the production of collections of mutants using Agrobacterium tumefaciens-mediated transformation (ATMT) associated with genotyping and phenotyping has gained wide acceptance. ATMT is also widely used to identify pathogenicity determinants in pathogenic fungi. A systematic analysis of T-DNA borders was performed in an ATMT-mutagenized collection of the phytopathogenic fungus Leptosphaeria maculans to evaluate the features of T-DNA integration in its particular transposable element-rich compartmentalized genome. A total of 318 T-DNA tags were recovered and analyzed for biases in chromosome and genic compartments, existence of CG/AT skews at the insertion site, and occurrence of microhomologies between the T-DNA left border (LB) and the target sequence. Functional annotation of targeted genes was done using the Gene Ontology annotation. The T-DNA integration mainly targeted gene-rich, transcriptionally active regions, and it favored biological processes consistent with the physiological status of a germinating spore. T-DNA integration was strongly biased toward regulatory regions, and mainly promoters. Consistent with the T-DNA intranuclear-targeting model, the density of T-DNA insertion correlated with CG skew near the transcription initiation site. The existence of microhomologies between promoter sequences and the T-DNA LB flanking sequence was also consistent with T-DNA integration to host DNA mediated by homologous recombination based on the microhomology-mediated end-joining pathway.
Criminal Genomic Pragmatism: Prisoners' Representations of DNA Technology and Biosecurity

Science.gov (United States)

Machado, Helena; Silva, Susana

2012-01-01

Background. Within the context of the use of DNA technology in crime investigation, biosecurity is perceived by different stakeholders according to their particular rationalities and interests. Very little is known about prisoners' perceptions and assessments of the uses of DNA technology in solving crime. Aim. To propose a conceptual model that serves to analyse and interpret prisoners' representations of DNA technology and biosecurity. Methods. A qualitative study using an interpretative approach based on 31 semi-structured tape-recorded interviews was carried out between May and September 2009, involving male inmates in three prisons located in the north of Portugal. The content analysis focused on the following topics: the meanings attributed to DNA and assessments of the risks and benefits of the uses of DNA technology and databasing in forensic applications. Results. DNA was described as a record of identity, an exceptional material, and a powerful biometric identifier. The interviewees believed that DNA can be planted to incriminate suspects. Convicted offenders argued for the need to extend the criteria for the inclusion of DNA profiles in forensic databases and to restrict the removal of profiles. Conclusions. The conceptual model entitled criminal genomic pragmatism allows for an understanding of the views of prison inmates regarding DNA technology and biosecurity. PMID:22791960
Criminal Genomic Pragmatism: Prisoners' Representations of DNA Technology and Biosecurity

Directory of Open Access Journals (Sweden)

Helena Machado

2012-01-01

Full Text Available Background. Within the context of the use of DNA technology in crime investigation, biosecurity is perceived by different stakeholders according to their particular rationalities and interests. Very little is known about prisoners’ perceptions and assessments of the uses of DNA technology in solving crime. Aim. To propose a conceptual model that serves to analyse and interpret prisoners’ representations of DNA technology and biosecurity. Methods. A qualitative study using an interpretative approach based on 31 semi-structured tape-recorded interviews was carried out between May and September 2009, involving male inmates in three prisons located in the north of Portugal. The content analysis focused on the following topics: the meanings attributed to DNA and assessments of the risks and benefits of the uses of DNA technology and databasing in forensic applications. Results. DNA was described as a record of identity, an exceptional material, and a powerful biometric identifier. The interviewees believed that DNA can be planted to incriminate suspects. Convicted offenders argued for the need to extend the criteria for the inclusion of DNA profiles in forensic databases and to restrict the removal of profiles. Conclusions. The conceptual model entitled criminal genomic pragmatism allows for an understanding of the views of prison inmates regarding DNA technology and biosecurity.
Criminal genomic pragmatism: prisoners' representations of DNA technology and biosecurity.

Science.gov (United States)

Machado, Helena; Silva, Susana

2012-01-01

Within the context of the use of DNA technology in crime investigation, biosecurity is perceived by different stakeholders according to their particular rationalities and interests. Very little is known about prisoners' perceptions and assessments of the uses of DNA technology in solving crime. To propose a conceptual model that serves to analyse and interpret prisoners' representations of DNA technology and biosecurity. A qualitative study using an interpretative approach based on 31 semi-structured tape-recorded interviews was carried out between May and September 2009, involving male inmates in three prisons located in the north of Portugal. The content analysis focused on the following topics: the meanings attributed to DNA and assessments of the risks and benefits of the uses of DNA technology and databasing in forensic applications. DNA was described as a record of identity, an exceptional material, and a powerful biometric identifier. The interviewees believed that DNA can be planted to incriminate suspects. Convicted offenders argued for the need to extend the criteria for the inclusion of DNA profiles in forensic databases and to restrict the removal of profiles. The conceptual model entitled criminal genomic pragmatism allows for an understanding of the views of prison inmates regarding DNA technology and biosecurity.
cDNA cloning of human DNA topoisomerase I. Catalytic activity of a 67.7-kDa carboxyl-terminal fragment

International Nuclear Information System (INIS)

D'Arpa, P.; Machlin, P.S.; Ratrie, H. III; Rothfield, N.F.; Cleveland, D.W.; Earnshaw, W.C.

1988-01-01

cDNA clones encoding human topoisomerase I were isolated from an expression vector library (λgt11) screened with autoimmune anti-topoisomerase I serum. One of these clones has been expressed as a fusion protein comprised of a 32-kDa fragment of the bacterial TrpE protein linked to 67.7 kDa of protein encoded by the cDNA. Three lines of evidence indicate that the cloned cDNA encodes topoisomerase I. (i) Proteolysis maps of the fusion protein and human nuclear topoisomerase I are essentially identical. (ii) The fusion protein relaxes supercoiled DNA, an activity that can be immunoprecipitated by anti-topoisomerase I serum. (iii) Sequence analysis has revealed that the longest cDNA clone (3645 base pairs) encodes a protein of 765 amino acids that shares 42% identity with Saccharomyces cerevisiae topoisomerase I. The sequence data also show that the catalytically active 67.7-kDa fragment is comprised of the carboxyl terminus
Draft Genome Sequence of Escherichia coli K-12 (ATCC 10798).

Science.gov (United States)

Dimitrova, Daniela; Engelbrecht, Kathleen C; Putonti, Catherine; Koenig, David W; Wolfe, Alan J

2017-07-06

Here, we present the draft genome sequence of Escherichia coli ATCC 10798. E. coli ATCC 10798 is a K-12 strain, one of the most well-studied model microorganisms. The size of the genome was 4,685,496 bp, with a G+C content of 50.70%. This assembly consists of 62 contigs and the F plasmid. Copyright © 2017 Dimitrova et al.
Reverse gyrase functions in genome integrity maintenance by protecting DNA breaks in vivo

DEFF Research Database (Denmark)

Han, Wenyuan; Feng, Xu; She, Qunxin

2017-01-01

Reverse gyrase introduces positive supercoils to circular DNA and is implicated in genome stability maintenance in thermophiles. The extremely thermophilic crenarchaeon Sulfolobus encodes two reverse gyrase proteins, TopR1 (topoisomerase reverse gyrase 1) and TopR2, whose functions in thermophilic...... and subsequent DNA degradation. The former occurred immediately after drug treatment, leading to chromosomal DNA degradation that concurred with TopR1 degradation, followed by chromatin protein degradation and DNA-less cell formation. To gain a further insight into TopR1 function, the expression of the enzyme...
TopBP1/Dpb11 binds DNA anaphase bridges to prevent genome instability.

Science.gov (United States)

Germann, Susanne M; Schramke, Vera; Pedersen, Rune Troelsgaard; Gallina, Irene; Eckert-Boulet, Nadine; Oestergaard, Vibe H; Lisby, Michael

2014-01-06

DNA anaphase bridges are a potential source of genome instability that may lead to chromosome breakage or nondisjunction during mitosis. Two classes of anaphase bridges can be distinguished: DAPI-positive chromatin bridges and DAPI-negative ultrafine DNA bridges (UFBs). Here, we establish budding yeast Saccharomyces cerevisiae and the avian DT40 cell line as model systems for studying DNA anaphase bridges and show that TopBP1/Dpb11 plays an evolutionarily conserved role in their metabolism. Together with the single-stranded DNA binding protein RPA, TopBP1/Dpb11 binds to UFBs, and depletion of TopBP1/Dpb11 led to an accumulation of chromatin bridges. Importantly, the NoCut checkpoint that delays progression from anaphase to abscission in yeast was activated by both UFBs and chromatin bridges independently of Dpb11, and disruption of the NoCut checkpoint in Dpb11-depleted cells led to genome instability. In conclusion, we propose that TopBP1/Dpb11 prevents accumulation of anaphase bridges via stimulation of the Mec1/ATR kinase and suppression of homologous recombination.
Universal internucleotide statistics in full genomes: a footprint of the DNA structure and packaging?

Directory of Open Access Journals (Sweden)

Mikhail I Bogachev

Full Text Available Uncovering the fundamental laws that govern the complex DNA structural organization remains challenging and is largely based upon reconstructions from the primary nucleotide sequences. Here we investigate the distributions of the internucleotide intervals and their persistence properties in complete genomes of various organisms from Archaea and Bacteria to H. Sapiens aiming to reveal the manifestation of the universal DNA architecture. We find that in all considered organisms the internucleotide interval distributions exhibit the same [Formula: see text]-exponential form. While in prokaryotes a single [Formula: see text]-exponential function makes the best fit, in eukaryotes the PDF contains additionally a second [Formula: see text]-exponential, which in the human genome makes a perfect approximation over nearly 10 decades. We suggest that this functional form is a footprint of the heterogeneous DNA structure, where the first [Formula: see text]-exponential reflects the universal helical pitch that appears both in pro- and eukaryotic DNA, while the second [Formula: see text]-exponential is a specific marker of the large-scale eukaryotic DNA organization.
3B11-N, a monoclonal antibody against MERS-CoV, reduces lung pathology in rhesus monkeys following intratracheal inoculation of MERS-CoV Jordan-n3/2012

International Nuclear Information System (INIS)

Johnson, Reed F.; Bagci, Ulas; Keith, Lauren; Tang, Xianchun; Mollura, Daniel J.; Zeitlin, Larry; Qin, Jing; Huzella, Louis; Bartos, Christopher J.; Bohorova, Natasha; Bohorov, Ognian; Goodman, Charles; Kim, Do H.; Paulty, Michael H.; Velasco, Jesus; Whaley, Kevin J.; Johnson, Joshua C.; Pettitt, James; Ork, Britini L.; Solomon, Jeffrey

2016-01-01

Middle East Respiratory Syndrome Coronavirus (MERS-CoV) was identified in 2012 as the causative agent of a severe, lethal respiratory disease occurring across several countries in the Middle East. To date there have been over 1600 laboratory confirmed cases of MERS-CoV in 26 countries with a case fatality rate of 36%. Given the endemic region, it is possible that MERS-CoV could spread during the annual Hajj pilgrimage, necessitating countermeasure development. In this report, we describe the clinical and radiographic changes of rhesus monkeys following infection with 5×10"6 PFU MERS-CoV Jordan-n3/2012. Two groups of NHPs were treated with either a human anti-MERS monoclonal antibody 3B11-N or E410-N, an anti-HIV antibody. MERS-CoV Jordan-n3/2012 infection resulted in quantifiable changes by computed tomography, but limited other clinical signs of disease. 3B11-N treated subjects developed significantly reduced lung pathology when compared to infected, untreated subjects, indicating that this antibody may be a suitable MERS-CoV treatment. - Highlights: • MERS-CoV Jordan-n3/2012 challenge of rhesus monkeys results in a mild disease. • CT can be used to monitor disease progression to aid models of human disease. • Treatment with the human monoclonal antibody 3B11-N resulted in decreased disease.
3B11-N, a monoclonal antibody against MERS-CoV, reduces lung pathology in rhesus monkeys following intratracheal inoculation of MERS-CoV Jordan-n3/2012

Energy Technology Data Exchange (ETDEWEB)

Johnson, Reed F., E-mail: johnsonreed@mail.nih.gov [Emerging Viral Pathogens Section National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD 21702 (United States); Bagci, Ulas [Center for Infectious Disease Imaging, National Institutes of Health Clinical Center, Bethesda MD 20892 (United States); Center for Research in Computer Vision (CRCV), Department of Electrics Electronics and Computer Science, University of Central Florida, Orlando, FL 32816, USA. (United States); Keith, Lauren [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD 21702 (United States); Tang, Xianchun [Department of Cancer Immunology & AIDS, Dana-Farber Cancer Institute, Harvard Medical School, Boston, MA 02215 (United States); Mollura, Daniel J. [Center for Infectious Disease Imaging, National Institutes of Health Clinical Center, Bethesda MD 20892 (United States); Zeitlin, Larry [Mapp Biopharmaceutical, Inc., San Diego CA 92121 (United States); Qin, Jing [Biostatistics Research Branch, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Bethesda, MD 20892 (United States); Huzella, Louis; Bartos, Christopher J. [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD 21702 (United States); Bohorova, Natasha; Bohorov, Ognian; Goodman, Charles; Kim, Do H.; Paulty, Michael H.; Velasco, Jesus; Whaley, Kevin J. [Mapp Biopharmaceutical, Inc., San Diego CA 92121 (United States); Johnson, Joshua C.; Pettitt, James; Ork, Britini L. [Integrated Research Facility, National Institute of Allergy and Infectious Diseases, National Institutes of Health, Frederick, MD 21702 (United States); Solomon, Jeffrey [Clinical Research Directorate/Clinical Monitoring Research Program, Leidos Biomedical Research, Inc., Frederick National Laboratory for Cancer Research,Frederick, MD 21702-USA. (United States); and others

2016-03-15

Middle East Respiratory Syndrome Coronavirus (MERS-CoV) was identified in 2012 as the causative agent of a severe, lethal respiratory disease occurring across several countries in the Middle East. To date there have been over 1600 laboratory confirmed cases of MERS-CoV in 26 countries with a case fatality rate of 36%. Given the endemic region, it is possible that MERS-CoV could spread during the annual Hajj pilgrimage, necessitating countermeasure development. In this report, we describe the clinical and radiographic changes of rhesus monkeys following infection with 5×10{sup 6} PFU MERS-CoV Jordan-n3/2012. Two groups of NHPs were treated with either a human anti-MERS monoclonal antibody 3B11-N or E410-N, an anti-HIV antibody. MERS-CoV Jordan-n3/2012 infection resulted in quantifiable changes by computed tomography, but limited other clinical signs of disease. 3B11-N treated subjects developed significantly reduced lung pathology when compared to infected, untreated subjects, indicating that this antibody may be a suitable MERS-CoV treatment. - Highlights: • MERS-CoV Jordan-n3/2012 challenge of rhesus monkeys results in a mild disease. • CT can be used to monitor disease progression to aid models of human disease. • Treatment with the human monoclonal antibody 3B11-N resulted in decreased disease.

Restriction site extension PCR: a novel method for high-throughput characterization of tagged DNA fragments and genome walking.

Directory of Open Access Journals (Sweden)

Jiabing Ji

Full Text Available BACKGROUND: Insertion mutant isolation and characterization are extremely valuable for linking genes to physiological function. Once an insertion mutant phenotype is identified, the challenge is to isolate the responsible gene. Multiple strategies have been employed to isolate unknown genomic DNA that flanks mutagenic insertions, however, all these methods suffer from limitations due to inefficient ligation steps, inclusion of restriction sites within the target DNA, and non-specific product generation. These limitations become close to insurmountable when the goal is to identify insertion sites in a high throughput manner. METHODOLOGY/PRINCIPAL FINDINGS: We designed a novel strategy called Restriction Site Extension PCR (RSE-PCR to efficiently conduct large-scale isolation of unknown genomic DNA fragments linked to DNA insertions. The strategy is a modified adaptor-mediated PCR without ligation. An adapter, with complementarity to the 3' overhang of the endonuclease (KpnI, NsiI, PstI, or SacI restricted DNA fragments, extends the 3' end of the DNA fragments in the first cycle of the primary RSE-PCR. During subsequent PCR cycles and a second semi-nested PCR (secondary RSE-PCR, touchdown and two-step PCR are combined to increase the amplification specificity of target fragments. The efficiency and specificity was demonstrated in our characterization of 37 tex mutants of Arabidopsis. All the steps of RSE-PCR can be executed in a 96 well PCR plate. Finally, RSE-PCR serves as a successful alternative to Genome Walker as demonstrated by gene isolation from maize, a plant with a more complex genome than Arabidopsis. CONCLUSIONS/SIGNIFICANCE: RSE-PCR has high potential application in identifying tagged (T-DNA or transposon sequence or walking from known DNA toward unknown regions in large-genome plants, with likely application in other organisms as well.
Ole Rømer som enevældens administrator

DEFF Research Database (Denmark)

Olden-Jørgensen, Sebastian

2002-01-01

En gennemgang af Ole Rømers liv med vægt på hans rolle som embedsmand og hans deltagelse eller mangel på samme i det politiske spil.......En gennemgang af Ole Rømers liv med vægt på hans rolle som embedsmand og hans deltagelse eller mangel på samme i det politiske spil....
Horizontal gene transfer of a chloroplast DnaJ-Fer protein to Thaumarchaeota and the evolutionary history of the DnaK chaperone system in Archaea.

Science.gov (United States)

Petitjean, Céline; Moreira, David; López-García, Purificación; Brochier-Armanet, Céline

2012-11-26

In 2004, we discovered an atypical protein in metagenomic data from marine thaumarchaeotal species. This protein, referred as DnaJ-Fer, is composed of a J domain fused to a Ferredoxin (Fer) domain. Surprisingly, the same protein was also found in Viridiplantae (green algae and land plants). Because J domain-containing proteins are known to interact with the major chaperone DnaK/Hsp70, this suggested that a DnaK protein was present in Thaumarchaeota. DnaK/Hsp70, its co-chaperone DnaJ and the nucleotide exchange factor GrpE are involved, among others, in heat shocks and heavy metal cellular stress responses. Using phylogenomic approaches we have investigated the evolutionary history of the DnaJ-Fer protein and of interacting proteins DnaK, DnaJ and GrpE in Thaumarchaeota. These proteins have very complex histories, involving several inter-domain horizontal gene transfers (HGTs) to explain the contemporary distribution of these proteins in archaea. These transfers include one from Cyanobacteria to Viridiplantae and one from Viridiplantae to Thaumarchaeota for the DnaJ-Fer protein, as well as independent HGTs from Bacteria to mesophilic archaea for the DnaK/DnaJ/GrpE system, followed by HGTs among mesophilic and thermophilic archaea. We highlight the chimerical origin of the set of proteins DnaK, DnaJ, GrpE and DnaJ-Fer in Thaumarchaeota and suggest that the HGT of these proteins has played an important role in the adaptation of several archaeal groups to mesophilic and thermophilic environments from hyperthermophilic ancestors. Finally, the evolutionary history of DnaJ-Fer provides information useful for the relative dating of the diversification of Archaeplastida and Thaumarchaeota.
Inter-Fork Strand Annealing causes genomic deletions during the termination of DNA replication.

Science.gov (United States)

Morrow, Carl A; Nguyen, Michael O; Fower, Andrew; Wong, Io Nam; Osman, Fekret; Bryer, Claire; Whitby, Matthew C

2017-06-06

Problems that arise during DNA replication can drive genomic alterations that are instrumental in the development of cancers and many human genetic disorders. Replication fork barriers are a commonly encountered problem, which can cause fork collapse and act as hotspots for replication termination. Collapsed forks can be rescued by homologous recombination, which restarts replication. However, replication restart is relatively slow and, therefore, replication termination may frequently occur by an active fork converging on a collapsed fork. We find that this type of non-canonical fork convergence in fission yeast is prone to trigger deletions between repetitive DNA sequences via a mechanism we call Inter-Fork Strand Annealing (IFSA) that depends on the recombination proteins Rad52, Exo1 and Mus81, and is countered by the FANCM-related DNA helicase Fml1. Based on our findings, we propose that IFSA is a potential threat to genomic stability in eukaryotes.
Genome-wide DNA methylation analysis of pseudohypoparathyroidism patients with GNAS imprinting defects.

Science.gov (United States)

Rochtus, Anne; Martin-Trujillo, Alejandro; Izzi, Benedetta; Elli, Francesca; Garin, Intza; Linglart, Agnes; Mantovani, Giovanna; Perez de Nanclares, Guiomar; Thiele, Suzanne; Decallonne, Brigitte; Van Geet, Chris; Monk, David; Freson, Kathleen

2016-01-01

Pseudohypoparathyroidism (PHP) is caused by (epi)genetic defects in the imprinted GNAS cluster. Current classification of PHP patients is hampered by clinical and molecular diagnostic overlaps. The European Consortium for the study of PHP designed a genome-wide methylation study to improve molecular diagnosis. The HumanMethylation 450K BeadChip was used to analyze genome-wide methylation in 24 PHP patients with parathyroid hormone resistance and 20 age- and gender-matched controls. Patients were previously diagnosed with GNAS-specific differentially methylated regions (DMRs) and include 6 patients with known STX16 deletion (PHP(Δstx16)) and 18 without deletion (PHP(neg)). The array demonstrated that PHP patients do not show DNA methylation differences at the whole-genome level. Unsupervised clustering of GNAS-specific DMRs divides PHP(Δstx16) versus PHP(neg) patients. Interestingly, in contrast to the notion that all PHP patients share methylation defects in the A/B DMR while only PHP(Δstx16) patients have normal NESP, GNAS-AS1 and XL methylation, we found a novel DMR (named GNAS-AS2) in the GNAS-AS1 region that is significantly different in both PHP(Δstx16) and PHP(neg), as validated by Sequenom EpiTYPER in a larger PHP cohort. The analysis of 58 DMRs revealed that 8/18 PHP(neg) and 1/6 PHP(Δstx16) patients have multi-locus methylation defects. Validation was performed for FANCC and SVOPL DMRs. This is the first genome-wide methylation study for PHP patients that confirmed that GNAS is the most significant DMR, and the presence of STX16 deletion divides PHP patients in two groups. Moreover, a novel GNAS-AS2 DMR affects all PHP patients, and PHP patients seem sensitive to multi-locus methylation defects.
Delayed Accumulation of H3K27me3 on Nascent DNA Is Essential for Recruitment of Transcription Factors at Early Stages of Stem Cell Differentiation.

Science.gov (United States)

Petruk, Svetlana; Cai, Jingli; Sussman, Robyn; Sun, Guizhi; Kovermann, Sina K; Mariani, Samanta A; Calabretta, Bruno; McMahon, Steven B; Brock, Hugh W; Iacovitti, Lorraine; Mazo, Alexander

2017-04-20

Recruitment of transcription factors (TFs) to repressed genes in euchromatin is essential to activate new transcriptional programs during cell differentiation. However, recruitment of all TFs, including pioneer factors, is impeded by condensed H3K27me3-containing chromatin. Single-cell and gene-specific analyses revealed that, during the first hours of induction of differentiation of mammalian embryonic stem cells (ESCs), accumulation of the repressive histone mark H3K27me3 is delayed after DNA replication, indicative of a decondensed chromatin structure in all regions of the replicating genome. This delay provides a critical "window of opportunity" for recruitment of lineage-specific TFs to DNA. Increasing the levels of post-replicative H3K27me3 or preventing S phase entry inhibited recruitment of new TFs to DNA and significantly blocked cell differentiation. These findings suggest that recruitment of lineage-specifying TFs occurs soon after replication and is facilitated by a decondensed chromatin structure. This insight may explain the developmental plasticity of stem cells and facilitate their exploitation for therapeutic purposes. Copyright © 2017 Elsevier Inc. All rights reserved.
Genomic relations among 31 species of Mammillaria haworth (Cactaceae) using random amplified polymorphic DNA.

Science.gov (United States)

Mattagajasingh, Ilwola; Mukherjee, Arup Kumar; Das, Premananda

2006-01-01

Thirty-one species of Mammillaria were selected to study the molecular phylogeny using random amplified polymorphic DNA (RAPD) markers. High amount of mucilage (gelling polysaccharides) present in Mammillaria was a major obstacle in isolating good quality genomic DNA. The CTAB (cetyl trimethyl ammonium bromide) method was modified to obtain good quality genomic DNA. Twenty-two random decamer primers resulted in 621 bands, all of which were polymorphic. The similarity matrix value varied from 0.109 to 0.622 indicating wide variability among the studied species. The dendrogram obtained from the unweighted pair group method using arithmetic averages (UPGMA) analysis revealed that some of the species did not follow the conventional classification. The present work shows the usefulness of RAPD markers for genetic characterization to establish phylogenetic relations among Mammillaria species.
A Novel Role of MerC in Methylmercury Transport and Phytoremediation of Methylmercury Contamination.

Science.gov (United States)

Sone, Yuka; Uraguchi, Shimpei; Takanezawa, Yasukazu; Nakamura, Ryosuke; Pan-Hou, Hidemitsu; Kiyono, Masako

2017-01-01

MerC, encoded by merC in the transposon Tn21 mer operon, is a heavy metal transporter with potential applications for phytoremediation of heavy metals such as mercuric ion and cadmium. In this study, we demonstrate that MerC also acts as a transporter for methylmercury. When MerC was expressed in Escherichia coli XL1-Blue, cells became hypersensitive to CH 3 Hg(I) and the uptake of CH 3 Hg(I) by these cells was higher than that by cells of the isogenic strain. Moreover, transgenic Arabidopsis plants expressing bacterial MerC or MerC fused to plant soluble N-ethylmaleimide-sensitive factor attachment protein receptors (SNAREs) accumulated CH 3 Hg(I) effectively and their growth was comparable to the wild-type plants. These results demonstrate that when the bacterium-derived merC gene is ectopically introduced in genetically modified plants, MerC expression in the transgenic plants promotes the transport and sequestration of methylmercury. Thus, our results show that the expression of merC in Arabidopsis results in transgenic plants that could be used for the phytoremediation and elimination of toxic methylmercury from the environment.
Modeling DNA affinity landscape through two-round support vector regression with weighted degree kernels

KAUST Repository

Wang, Xiaolei

2014-12-12

Background: A quantitative understanding of interactions between transcription factors (TFs) and their DNA binding sites is key to the rational design of gene regulatory networks. Recent advances in high-throughput technologies have enabled high-resolution measurements of protein-DNA binding affinity. Importantly, such experiments revealed the complex nature of TF-DNA interactions, whereby the effects of nucleotide changes on the binding affinity were observed to be context dependent. A systematic method to give high-quality estimates of such complex affinity landscapes is, thus, essential to the control of gene expression and the advance of synthetic biology. Results: Here, we propose a two-round prediction method that is based on support vector regression (SVR) with weighted degree (WD) kernels. In the first round, a WD kernel with shifts and mismatches is used with SVR to detect the importance of subsequences with different lengths at different positions. The subsequences identified as important in the first round are then fed into a second WD kernel to fit the experimentally measured affinities. To our knowledge, this is the first attempt to increase the accuracy of the affinity prediction by applying two rounds of string kernels and by identifying a small number of crucial k-mers. The proposed method was tested by predicting the binding affinity landscape of Gcn4p in Saccharomyces cerevisiae using datasets from HiTS-FLIP. Our method explicitly identified important subsequences and showed significant performance improvements when compared with other state-of-the-art methods. Based on the identified important subsequences, we discovered two surprisingly stable 10-mers and one sensitive 10-mer which were not reported before. Further test on four other TFs in S. cerevisiae demonstrated the generality of our method. Conclusion: We proposed in this paper a two-round method to quantitatively model the DNA binding affinity landscape. Since the ability to modify
RAMBO-K: Rapid and Sensitive Removal of Background Sequences from Next Generation Sequencing Data.

Directory of Open Access Journals (Sweden)

Simon H Tausch

Full Text Available The assembly of viral or endosymbiont genomes from Next Generation Sequencing (NGS data is often hampered by the predominant abundance of reads originating from the host organism. These reads increase the memory and CPU time usage of the assembler and can lead to misassemblies.We developed RAMBO-K (Read Assignment Method Based On K-mers, a tool which allows rapid and sensitive removal of unwanted host sequences from NGS datasets. Reaching a speed of 10 Megabases/s on 4 CPU cores and a standard hard drive, RAMBO-K is faster than any tool we tested, while showing a consistently high sensitivity and specificity across different datasets.RAMBO-K rapidly and reliably separates reads from different species without data preprocessing. It is suitable as a straightforward standard solution for workflows dealing with mixed datasets. Binaries and source code (java and python are available from http://sourceforge.net/projects/rambok/.
Re-exploration of U's Triangle Brassica Species Based on Chloroplast Genomes and 45S nrDNA Sequences.

Science.gov (United States)

Kim, Chang-Kug; Seol, Young-Joo; Perumal, Sampath; Lee, Jonghoon; Waminal, Nomar Espinosa; Jayakodi, Murukarthick; Lee, Sang-Choon; Jin, Seungwoo; Choi, Beom-Soon; Yu, Yeisoo; Ko, Ho-Cheol; Choi, Ji-Weon; Ryu, Kyoung-Yul; Sohn, Seong-Han; Parkin, Isobel; Yang, Tae-Jin

2018-05-09

The concept of U's triangle, which revealed the importance of polyploidization in plant genome evolution, described natural allopolyploidization events in Brassica using three diploids [B. rapa (A genome), B. nigra (B), and B. oleracea (C)] and derived allotetraploids [B. juncea (AB genome), B. napus (AC), and B. carinata (BC)]. However, comprehensive understanding of Brassica genome evolution has not been fully achieved. Here, we performed low-coverage (2-6×) whole-genome sequencing of 28 accessions of Brassica as well as of Raphanus sativus [R genome] to explore the evolution of six Brassica species based on chloroplast genome and ribosomal DNA variations. Our phylogenomic analyses led to two main conclusions. (1) Intra-species-level chloroplast genome variations are low in the three allotetraploids (2~7 SNPs), but rich and variable in each diploid species (7~193 SNPs). (2) Three allotetraploids maintain two 45SnrDNA types derived from both ancestral species with maternal dominance. Furthermore, this study sheds light on the maternal origin of the AC chloroplast genome. Overall, this study clarifies the genetic relationships of U's triangle species based on a comprehensive genomics approach and provides important genomic resources for correlative and evolutionary studies.
Spectroscopic quantification of 5-hydroxymethylcytosine in genomic DNA.

Science.gov (United States)

Shahal, Tamar; Gilat, Noa; Michaeli, Yael; Redy-Keisar, Orit; Shabat, Doron; Ebenstein, Yuval

2014-08-19

5-Hydroxymethylcytosine (5hmC), a modified form of the DNA base cytosine, is an important epigenetic mark linked to regulation of gene expression in development, and tumorigenesis. We have developed a spectroscopic method for a global quantification of 5hmC in genomic DNA. The assay is performed within a multiwell plate, which allows simultaneous recording of up to 350 samples. Our quantification procedure of 5hmC is direct, simple, and rapid. It relies on a two-step protocol that consists of enzymatic glucosylation of 5hmC with an azide-modified glucose, followed by a "click reaction" with an alkyne-fluorescent tag. The fluorescence intensity recorded from the DNA sample is proportional to its 5hmC content and can be quantified by a simple plate reader measurement. This labeling technique is specific and highly sensitive, allowing detection of 5hmC down to 0.002% of the total nucleotides. Our results reveal significant variations in the 5hmC content obtained from different mouse tissues, in agreement with previously reported data.
Single-tube linear DNA amplification for genome-wide studies using a few thousand cells

NARCIS (Netherlands)

Shankaranarayanan, P.; Mendoza-Parra, M.A.; Gool, van W.; Trindade, L.M.; Gronemeyer, H.

2012-01-01

Linear amplification of DNA (LinDA) by T7 polymerase is a versatile and robust method for generating sufficient amounts of DNA for genome-wide studies with minute amounts of cells. LinDA can be coupled to a great number of global profiling technologies. Indeed, chromatin immunoprecipitation coupled
Inactivating UBE2M impacts the DNA damage response and genome integrity involving multiple cullin ligases.

Directory of Open Access Journals (Sweden)

Scott Cukras

Full Text Available Protein neddylation is involved in a wide variety of cellular processes. Here we show that the DNA damage response is perturbed in cells inactivated with an E2 Nedd8 conjugating enzyme UBE2M, measured by RAD51 foci formation kinetics and cell based DNA repair assays. UBE2M knockdown increases DNA breakages and cellular sensitivity to DNA damaging agents, further suggesting heightened genomic instability and defective DNA repair activity. Investigating the downstream Cullin targets of UBE2M revealed that silencing of Cullin 1, 2, and 4 ligases incurred significant DNA damage. In particular, UBE2M knockdown, or defective neddylation of Cullin 2, leads to a blockade in the G1 to S progression and is associated with delayed S-phase dependent DNA damage response. Cullin 4 inactivation leads to an aberrantly high DNA damage response that is associated with increased DNA breakages and sensitivity of cells to DNA damaging agents, suggesting a DNA repair defect is associated. siRNA interrogation of key Cullin substrates show that CDT1, p21, and Claspin are involved in elevated DNA damage in the UBE2M knockdown cells. Therefore, UBE2M is required to maintain genome integrity by activating multiple Cullin ligases throughout the cell cycle.
Inactivating UBE2M impacts the DNA damage response and genome integrity involving multiple cullin ligases.

Science.gov (United States)

Cukras, Scott; Morffy, Nicholas; Ohn, Takbum; Kee, Younghoon

2014-01-01

Protein neddylation is involved in a wide variety of cellular processes. Here we show that the DNA damage response is perturbed in cells inactivated with an E2 Nedd8 conjugating enzyme UBE2M, measured by RAD51 foci formation kinetics and cell based DNA repair assays. UBE2M knockdown increases DNA breakages and cellular sensitivity to DNA damaging agents, further suggesting heightened genomic instability and defective DNA repair activity. Investigating the downstream Cullin targets of UBE2M revealed that silencing of Cullin 1, 2, and 4 ligases incurred significant DNA damage. In particular, UBE2M knockdown, or defective neddylation of Cullin 2, leads to a blockade in the G1 to S progression and is associated with delayed S-phase dependent DNA damage response. Cullin 4 inactivation leads to an aberrantly high DNA damage response that is associated with increased DNA breakages and sensitivity of cells to DNA damaging agents, suggesting a DNA repair defect is associated. siRNA interrogation of key Cullin substrates show that CDT1, p21, and Claspin are involved in elevated DNA damage in the UBE2M knockdown cells. Therefore, UBE2M is required to maintain genome integrity by activating multiple Cullin ligases throughout the cell cycle.
The architecture of ArgR-DNA complexes at the genome-scale in Escherichia coli

DEFF Research Database (Denmark)

Cho, Suhyung; Cho, Yoo-Bok; Kang, Taek Jin

2015-01-01

DNA-binding motifs that are recognized by transcription factors (TFs) have been well studied; however, challenges remain in determining the in vivo architecture of TF-DNA complexes on a genome-scale. Here, we determined the in vivo architecture of Escherichia coli arginine repressor (ArgR)-DNA co...
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

KAUST Repository

Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

2015-01-01

Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

KAUST Repository

Wong, Ka-Chun

2015-06-11

Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.
Nuclear alpha spectrin: Critical roles in DNA interstrand cross-link repair and genomic stability

OpenAIRE

Lambert, Muriel W

2016-01-01

Non-erythroid alpha spectrin (?IISp) is a structural protein which we have shown is present in the nucleus of human cells. It interacts with a number of nuclear proteins such as actin, lamin, emerin, chromatin remodeling factors, and DNA repair proteins. ?IISp?s interaction with DNA repair proteins has been extensively studied. We have demonstrated that nuclear ?IISp is critical in DNA interstrand cross-link (ICL) repair in S phase, in both genomic (non-telomeric) and telomeric DNA, and in ma...
Mechanistically Distinct Pathways of Divergent Regulatory DNA Creation Contribute to Evolution of Human-Specific Genomic Regulatory Networks Driving Phenotypic Divergence of Homo sapiens.

Science.gov (United States)

Glinsky, Gennadi V

2016-09-19

Thousands of candidate human-specific regulatory sequences (HSRS) have been identified, supporting the hypothesis that unique to human phenotypes result from human-specific alterations of genomic regulatory networks. Collectively, a compendium of multiple diverse families of HSRS that are functionally and structurally divergent from Great Apes could be defined as the backbone of human-specific genomic regulatory networks. Here, the conservation patterns analysis of 18,364 candidate HSRS was carried out requiring that 100% of bases must remap during the alignments of human, chimpanzee, and bonobo sequences. A total of 5,535 candidate HSRS were identified that are: (i) highly conserved in Great Apes; (ii) evolved by the exaptation of highly conserved ancestral DNA; (iii) defined by either the acceleration of mutation rates on the human lineage or the functional divergence from non-human primates. The exaptation of highly conserved ancestral DNA pathway seems mechanistically distinct from the evolution of regulatory DNA segments driven by the species-specific expansion of transposable elements. Genome-wide proximity placement analysis of HSRS revealed that a small fraction of topologically associating domains (TADs) contain more than half of HSRS from four distinct families. TADs that are enriched for HSRS and termed rapidly evolving in humans TADs (revTADs) comprise 0.8-10.3% of 3,127 TADs in the hESC genome. RevTADs manifest distinct correlation patterns between placements of human accelerated regions, human-specific transcription factor-binding sites, and recombination rates. There is a significant enrichment within revTAD boundaries of hESC-enhancers, primate-specific CTCF-binding sites, human-specific RNAPII-binding sites, hCONDELs, and H3K4me3 peaks with human-specific enrichment at TSS in prefrontal cortex neurons (P sapiens is driven by the evolution of human-specific genomic regulatory networks via at least two mechanistically distinct pathways of creation of

Horizontal gene transfer of a chloroplast DnaJ-Fer protein to Thaumarchaeota and the evolutionary history of the DnaK chaperone system in Archaea

Directory of Open Access Journals (Sweden)

Petitjean Céline

2012-11-01

Full Text Available Abstract Background In 2004, we discovered an atypical protein in metagenomic data from marine thaumarchaeotal species. This protein, referred as DnaJ-Fer, is composed of a J domain fused to a Ferredoxin (Fer domain. Surprisingly, the same protein was also found in Viridiplantae (green algae and land plants. Because J domain-containing proteins are known to interact with the major chaperone DnaK/Hsp70, this suggested that a DnaK protein was present in Thaumarchaeota. DnaK/Hsp70, its co-chaperone DnaJ and the nucleotide exchange factor GrpE are involved, among others, in heat shocks and heavy metal cellular stress responses. Results Using phylogenomic approaches we have investigated the evolutionary history of the DnaJ-Fer protein and of interacting proteins DnaK, DnaJ and GrpE in Thaumarchaeota. These proteins have very complex histories, involving several inter-domain horizontal gene transfers (HGTs to explain the contemporary distribution of these proteins in archaea. These transfers include one from Cyanobacteria to Viridiplantae and one from Viridiplantae to Thaumarchaeota for the DnaJ-Fer protein, as well as independent HGTs from Bacteria to mesophilic archaea for the DnaK/DnaJ/GrpE system, followed by HGTs among mesophilic and thermophilic archaea. Conclusions We highlight the chimerical origin of the set of proteins DnaK, DnaJ, GrpE and DnaJ-Fer in Thaumarchaeota and suggest that the HGT of these proteins has played an important role in the adaptation of several archaeal groups to mesophilic and thermophilic environments from hyperthermophilic ancestors. Finally, the evolutionary history of DnaJ-Fer provides information useful for the relative dating of the diversification of Archaeplastida and Thaumarchaeota.
The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

Science.gov (United States)

Arnaiz, Olivier; Mathy, Nathalie; Baudry, Céline; Malinsky, Sophie; Aury, Jean-Marc; Denby Wilkes, Cyril; Garnier, Olivier; Labadie, Karine; Lauderdale, Benjamin E; Le Mouël, Anne; Marmignon, Antoine; Nowacki, Mariusz; Poulain, Julie; Prajer, Malgorzata; Wincker, Patrick; Meyer, Eric; Duharcourt, Sandra; Duret, Laurent; Bétermier, Mireille; Sperling, Linda

2012-01-01

Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES) from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a fraction of the
The Paramecium germline genome provides a niche for intragenic parasitic DNA: evolutionary dynamics of internal eliminated sequences.

Directory of Open Access Journals (Sweden)

Olivier Arnaiz

Full Text Available Insertions of parasitic DNA within coding sequences are usually deleterious and are generally counter-selected during evolution. Thanks to nuclear dimorphism, ciliates provide unique models to study the fate of such insertions. Their germline genome undergoes extensive rearrangements during development of a new somatic macronucleus from the germline micronucleus following sexual events. In Paramecium, these rearrangements include precise excision of unique-copy Internal Eliminated Sequences (IES from the somatic DNA, requiring the activity of a domesticated piggyBac transposase, PiggyMac. We have sequenced Paramecium tetraurelia germline DNA, establishing a genome-wide catalogue of -45,000 IESs, in order to gain insight into their evolutionary origin and excision mechanism. We obtained direct evidence that PiggyMac is required for excision of all IESs. Homology with known P. tetraurelia Tc1/mariner transposons, described here, indicates that at least a fraction of IESs derive from these elements. Most IES insertions occurred before a recent whole-genome duplication that preceded diversification of the P. aurelia species complex, but IES invasion of the Paramecium genome appears to be an ongoing process. Once inserted, IESs decay rapidly by accumulation of deletions and point substitutions. Over 90% of the IESs are shorter than 150 bp and present a remarkable size distribution with a -10 bp periodicity, corresponding to the helical repeat of double-stranded DNA and suggesting DNA loop formation during assembly of a transpososome-like excision complex. IESs are equally frequent within and between coding sequences; however, excision is not 100% efficient and there is selective pressure against IES insertions, in particular within highly expressed genes. We discuss the possibility that ancient domestication of a piggyBac transposase favored subsequent propagation of transposons throughout the germline by allowing insertions in coding sequences, a
Excised radicle tips as a source of genomic DNA for PCR-based ...

Indian Academy of Sciences (India)

2012-12-13

Dec 13, 2012 ... Cotton; cry1Ac; genomic DNA isolation; high-resolution melting curve analysis; radicle tip; seed purity testing .... cooled to 40°C. Fluorescence data for melting curves were ... greatly increased by introducing automation.
Elg1 forms an alternative RFC complex important for DNA replication and genome integrity

NARCIS (Netherlands)

Bellaoui, Mohammed; Chang, Michael; Ou, Jiongwen; Xu, Hong; Boone, Charles; Brown, Grant W

2003-01-01

Genome-wide synthetic genetic interaction screens with mutants in the mus81 and mms4 replication fork-processing genes identified a novel replication factor C (RFC) homolog, Elg1, which forms an alternative RFC complex with Rfc2-5. This complex is distinct from the DNA replication RFC, the DNA
Tissue Distribution of the MERS-Coronavirus Receptor in Bats

NARCIS (Netherlands)

W. Widagdo; L. Begeman (Lineke); D. Schipper (Debby); P.R.W.A. van Run (Peter); Cunningham, A.A. (Andrew A); Kley, N. (Nils); C.B.E.M. Reusken (Chantal); B.L. Haagmans (Bart); J.M.A. van den Brand (Judith)

2017-01-01

textabstractMiddle East respiratory syndrome coronavirus (MERS-CoV) has been shown to infect both humans and dromedary camels using dipeptidyl peptidase-4 (DPP4) as its receptor.The distribution of DPP4 in the respiratory tract tissues of humans and camels reflects MERS-CoV tropism.Apart from
Tissue Distribution of the MERS-Coronavirus Receptor in Bats

NARCIS (Netherlands)

Widagdo, W; Begeman, Lineke; Schipper, Debby; van Run, Peter R; Cunningham, Andrew A; Kley, Nils; Reusken, Chantal B E M; Haagmans, Bart L; van den Brand, Judith M A

2017-01-01

Middle East respiratory syndrome coronavirus (MERS-CoV) has been shown to infect both humans and dromedary camels using dipeptidyl peptidase-4 (DPP4) as its receptor. The distribution of DPP4 in the respiratory tract tissues of humans and camels reflects MERS-CoV tropism. Apart from dromedary
cDNA, genomic sequence cloning and analysis of the ribosomal ...

African Journals Online (AJOL)

Ribosomal protein L37A (RPL37A) is a component of 60S large ribosomal subunit encoded by the RPL37A gene, which belongs to the family of ribosomal L37AE proteins, located in the cytoplasm. The complementary deoxyribonucleic acid (cDNA) and the genomic sequence of RPL37A were cloned successfully from giant ...
Genome-wide profiling of H3K56 acetylation and transcription factor binding sites in human adipocytes.

Directory of Open Access Journals (Sweden)

Kinyui Alice Lo

Full Text Available The growing epidemic of obesity and metabolic diseases calls for a better understanding of adipocyte biology. The regulation of transcription in adipocytes is particularly important, as it is a target for several therapeutic approaches. Transcriptional outcomes are influenced by both histone modifications and transcription factor binding. Although the epigenetic states and binding sites of several important transcription factors have been profiled in the mouse 3T3-L1 cell line, such data are lacking in human adipocytes. In this study, we identified H3K56 acetylation sites in human adipocytes derived from mesenchymal stem cells. H3K56 is acetylated by CBP and p300, and deacetylated by SIRT1, all are proteins with important roles in diabetes and insulin signaling. We found that while almost half of the genome shows signs of H3K56 acetylation, the highest level of H3K56 acetylation is associated with transcription factors and proteins in the adipokine signaling and Type II Diabetes pathways. In order to discover the transcription factors that recruit acetyltransferases and deacetylases to sites of H3K56 acetylation, we analyzed DNA sequences near H3K56 acetylated regions and found that the E2F recognition sequence was enriched. Using chromatin immunoprecipitation followed by high-throughput sequencing, we confirmed that genes bound by E2F4, as well as those by HSF-1 and C/EBPα, have higher than expected levels of H3K56 acetylation, and that the transcription factor binding sites and acetylation sites are often adjacent but rarely overlap. We also discovered a significant difference between bound targets of C/EBPα in 3T3-L1 and human adipocytes, highlighting the need to construct species-specific epigenetic and transcription factor binding site maps. This is the first genome-wide profile of H3K56 acetylation, E2F4, C/EBPα and HSF-1 binding in human adipocytes, and will serve as an important resource for better understanding adipocyte
A novel method of genomic DNA extraction for Cactaceae1

Science.gov (United States)

Fehlberg, Shannon D.; Allen, Jessica M.; Church, Kathleen

2013-01-01

• Premise of the study: Genetic studies of Cactaceae can at times be impeded by difficult sampling logistics and/or high mucilage content in tissues. Simplifying sampling and DNA isolation through the use of cactus spines has not previously been investigated. • Methods and Results: Several protocols for extracting DNA from spines were tested and modified to maximize yield, amplification, and sequencing. Sampling of and extraction from spines resulted in a simplified protocol overall and complete avoidance of mucilage as compared to typical tissue extractions. Sequences from one nuclear and three plastid regions were obtained across eight genera and 20 species of cacti using DNA extracted from spines. • Conclusions: Genomic DNA useful for amplification and sequencing can be obtained from cactus spines. The protocols described here are valuable for any cactus species, but are particularly useful for investigators interested in sampling living collections, extensive field sampling, and/or conservation genetic studies. PMID:25202521
Extensive and biased intergenomic nonreciprocal DNA exchanges shaped a nascent polyploid genome, Gossypium (cotton).

Science.gov (United States)

Guo, Hui; Wang, Xiyin; Gundlach, Heidrun; Mayer, Klaus F X; Peterson, Daniel G; Scheffler, Brian E; Chee, Peng W; Paterson, Andrew H

2014-08-01

Genome duplication is thought to be central to the evolution of morphological complexity, and some polyploids enjoy a variety of capabilities that transgress those of their diploid progenitors. Comparison of genomic sequences from several tetraploid (AtDt) Gossypium species and genotypes with putative diploid A- and D-genome progenitor species revealed that unidirectional DNA exchanges between homeologous chromosomes were the predominant mechanism responsible for allelic differences between the Gossypium tetraploids and their diploid progenitors. Homeologous gene conversion events (HeGCEs) gradually subsided, declining to rates similar to random mutation during radiation of the polyploid into multiple clades and species. Despite occurring in a common nucleus, preservation of HeGCE is asymmetric in the two tetraploid subgenomes. At-to-Dt conversion is far more abundant than the reciprocal, is enriched in heterochromatin, is highly correlated with GC content and transposon distribution, and may silence abundant A-genome-derived retrotransposons. Dt-to-At conversion is abundant in euchromatin and genes, frequently reversing losses of gene function. The long-standing observation that the nonspinnable-fibered D-genome contributes to the superior yield and quality of tetraploid cotton fibers may be explained by accelerated Dt to At conversion during cotton domestication and improvement, increasing dosage of alleles from the spinnable-fibered A-genome. HeGCE may provide an alternative to (rare) reciprocal DNA exchanges between chromosomes in heterochromatin, where genes have approximately five times greater abundance of Dt-to-At conversion than does adjacent intergenic DNA. Spanning exon-to-gene-sized regions, HeGCE is a natural noninvasive means of gene transfer with the precision of transformation, potentially important in genetic improvement of many crop plants. Copyright © 2014 by the Genetics Society of America.
Glutathionylation of the Bacterial Hsp70 Chaperone DnaK Provides a Link between Oxidative Stress and the Heat Shock Response.

Science.gov (United States)

Zhang, Hong; Yang, Jie; Wu, Si; Gong, Weibin; Chen, Chang; Perrett, Sarah

2016-03-25

DnaK is the major bacterial Hsp70, participating in DNA replication, protein folding, and the stress response. DnaK cooperates with the Hsp40 co-chaperone DnaJ and the nucleotide exchange factor GrpE. Under non-stress conditions, DnaK binds to the heat shock transcription factor σ(32)and facilitates its degradation. Oxidative stress results in temporary inactivation of DnaK due to depletion of cellular ATP and thiol modifications such as glutathionylation until normal cellular ATP levels and a reducing environment are restored. However, the biological significance of DnaK glutathionylation remains unknown, and the mechanisms by which glutathionylation may regulate the activity of DnaK are also unclear. We investigated the conditions under which Escherichia coli DnaK undergoesS-glutathionylation. We observed glutathionylation of DnaK in lysates of E. coli cells that had been subjected to oxidative stress. We also obtained homogeneously glutathionylated DnaK using purified DnaK in the apo state. We found that glutathionylation of DnaK reversibly changes the secondary structure and tertiary conformation, leading to reduced nucleotide and peptide binding ability. The chaperone activity of DnaK was reversibly down-regulated by glutathionylation, accompanying the structural changes. We found that interaction of DnaK with DnaJ, GrpE, or σ(32)becomes weaker when DnaK is glutathionylated, and the interaction is restored upon deglutathionylation. This study confirms that glutathionylation down-regulates the functions of DnaK under oxidizing conditions, and this down-regulation may facilitate release of σ(32)from its interaction with DnaK, thus triggering the heat shock response. Such a mechanism provides a link between oxidative stress and the heat shock response in bacteria. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.
Uracil DNA glycosylase counteracts APOBEC3G-induced hypermutation of hepatitis B viral genomes: excision repair of covalently closed circular DNA.

Directory of Open Access Journals (Sweden)

Kouichi Kitamura

Full Text Available The covalently closed circular DNA (cccDNA of the hepatitis B virus (HBV plays an essential role in chronic hepatitis. The cellular repair system is proposed to convert cytoplasmic nucleocapsid (NC DNA (partially double-stranded DNA into cccDNA in the nucleus. Recently, antiviral cytidine deaminases, AID/APOBEC proteins, were shown to generate uracil residues in the NC-DNA through deamination, resulting in cytidine-to-uracil (C-to-U hypermutation of the viral genome. We investigated whether uracil residues in hepadnavirus DNA were excised by uracil-DNA glycosylase (UNG, a host factor for base excision repair (BER. When UNG activity was inhibited by the expression of the UNG inhibitory protein (UGI, hypermutation of NC-DNA induced by either APOBEC3G or interferon treatment was enhanced in a human hepatocyte cell line. To assess the effect of UNG on the cccDNA viral intermediate, we used the duck HBV (DHBV replication model. Sequence analyses of DHBV DNAs showed that cccDNA accumulated G-to-A or C-to-T mutations in APOBEC3G-expressing cells, and this was extensively enhanced by UNG inhibition. The cccDNA hypermutation generated many premature stop codons in the P gene. UNG inhibition also enhanced the APOBEC3G-mediated suppression of viral replication, including reduction of NC-DNA, pre-C mRNA, and secreted viral particle-associated DNA in prolonged culture. Enhancement of APOBEC3G-mediated suppression by UNG inhibition was not observed when the catalytic site of APOBEC3G was mutated. Transfection experiments of recloned cccDNAs revealed that the combination of UNG inhibition and APOBEC3G expression reduced the replication ability of cccDNA. Taken together, these data indicate that UNG excises uracil residues from the viral genome during or after cccDNA formation in the nucleus and imply that BER pathway activities decrease the antiviral effect of APOBEC3-mediated hypermutation.
Genome-wide identification and comparative analysis of cytosine-5 DNA methyltransferases and demethylase families in wild and cultivated peanut

Directory of Open Access Journals (Sweden)

Pengfei eWang

2016-02-01

Full Text Available AbstractDNA methylation plays important roles in genome protection, regulation of gene expression and was associated with plants development. Plant DNA methylation pattern was mediated by cytosine-5 DNA methyltransferases and demethylase. Although the genomes of AA and BB wild peanuts have been fully sequence, these two gene families have not been studied. In this study we report the identification and analysis of putative cytosine-5 DNA methyltransferases (C5-MTases and demethylase in AA and BB wild peanuts. Cytosine-5 DNA methyltransferases in AA and BB wild peanuts could be classified in known MET, CMT and DRM2 groups based on their domain organization. This result was supported by the gene and protein structural characteristics and phylogenetic analysis. We found that some wild peanut DRM2 numbers didn’t contain UBA domain which was different from other plants such as Arabidopsis, maize, soybean. Five DNA demethylase were found in AA genome and five in BB genome. The selective pressure analysis showed that wild peanut C5-MTases gene mainly underwent purifying selection but many positive selection sites can be detected. Conversely, DNA demethylase genes mainly underwent positive selection during evolution. Additionally, the expression dynamic of cytosine-5 DNA methyltransferases and demethylase genes in different cultivated peanut tissues were analyzed. Expression result showed that cold, heat or drought stress could influence the expression level of C5-MTases and DNA demethylase genes in cultivated peanut. These results are useful for better understanding the complexity of these two gene families, and will facilitate epigenetic studies in peanut.
Double-strand breaks in genome-sized DNA caused by mechanical stress under mixing: Quantitative evaluation through single-molecule observation

Science.gov (United States)

Kikuchi, Hayato; Nose, Keiji; Yoshikawa, Yuko; Yoshikawa, Kenichi

2018-06-01

It is becoming increasingly apparent that changes in the higher-order structure of genome-sized DNA molecules of more than several tens kbp play important roles in the self-control of genome activity in living cells. Unfortunately, it has been rather difficult to prepare genome-sized DNA molecules without damage or fragmentation. Here, we evaluated the degree of double-strand breaks (DSBs) caused by mechanical mixing by single-molecule observation with fluorescence microscopy. The results show that DNA breaks are most significant for the first second after the initiation of mechanical agitation. Based on such observation, we propose a novel mixing procedure to significantly decrease DSBs.
Impact of nuclear organization and chromatin structure on DNA repair and genome stability

International Nuclear Information System (INIS)

Batte, Amandine

2016-01-01

The non-random organization of the eukaryotic cell nucleus and the folding of genome in chromatin more or less condensed can influence many functions related to DNA metabolism, including genome stability. Double-strand breaks (DSBs) are the most deleterious DNA damages for the cells. To preserve genome integrity, eukaryotic cells thus developed DSB repair mechanisms conserved from yeast to human, among which homologous recombination (HR) that uses an intact homologous sequence to repair a broken chromosome. HR can be separated in two sub-pathways: Gene Conversion (GC) transfers genetic information from one molecule to its homologous and Break Induced Replication (BIR) establishes a replication fork than can proceed until the chromosome end. My doctorate work was focused on the contribution of the chromatin context and 3D genome organization on DSB repair. In S. cerevisiae, nuclear organization and heterochromatin spreading at sub-telomeres can be modified through the overexpression of the Sir3 or sir3A2Q mutant proteins. We demonstrated that reducing the physical distance between homologous sequences increased GC rates, reinforcing the notion that homology search is a limiting step for recombination. We also showed that hetero-chromatinization of DSB site fine-tunes DSB resection, limiting the loss of the DSB ends required to perform homology search and complete HR. Finally, we noticed that the presence of heterochromatin at the donor locus decreased both GC and BIR efficiencies, probably by affecting strand invasion. This work highlights new regulatory pathways of DNA repair. (author) [fr
Non-Watson–Crick interactions between PNA and DNA inhibit the ATPase activity of bacteriophage T4 Dda helicase

Science.gov (United States)

Tackett, Alan J.; Corey, David R.; Raney, Kevin D.

2002-01-01

Peptide nucleic acid (PNA) is a DNA mimic in which the nucleobases are linked by an N-(2-aminoethyl) glycine backbone. Here we report that PNA can interact with single-stranded DNA (ssDNA) in a non-sequence-specific fashion. We observed that a 15mer PNA inhibited the ssDNA-stimulated ATPase activity of a bacteriophage T4 helicase, Dda. Surprisingly, when a fluorescein-labeled 15mer PNA was used in binding studies no interaction was observed between PNA and Dda. However, fluorescence polarization did reveal non-sequence-specific interactions between PNA and ssDNA. Thus, the inhibition of ATPase activity of Dda appears to result from depletion of the available ssDNA due to non-Watson–Crick binding of PNA to ssDNA. Inhibition of the ssDNA-stimulated ATPase activity was observed for several PNAs of varying length and sequence. To study the basis for this phenomenon, we examined self-aggregation by PNAs. The 15mer PNA readily self-aggregates to the point of precipitation. Since PNAs are hydrophobic, they aggregate more than DNA or RNA, making the study of this phenomenon essential for understanding the properties of PNA. Non-sequence-specific interactions between PNA and ssDNA were observed at moderate concentrations of PNA, suggesting that such interactions should be considered for antisense and antigene applications. PMID:11842106
Cloning and sequence analysis of cDNA coding for rat nucleolar protein C23

International Nuclear Information System (INIS)

Ghaffari, S.H.; Olson, M.O.J.

1986-01-01

Using synthetic oligonucleotides as primers and probes, the authors have isolated and sequenced cDNA clones encoding protein C23, a putative nucleolus organizer protein. Poly(A + ) RNA was isolated from rat Novikoff hepatoma cells and enriched in C23 mRNA by sucrose density gradient ultracentrifugation. Two deoxyoligonuleotides, a 48- and a 27-mer, were synthesized on the basis of amino acid sequence from the C-terminal half of protein C23 and cDNA sequence data from CHO cell protein. The 48-mer was used a primer for synthesis of cDNA which was then inserted into plasmid pUC9. Transformed bacterial colonies were screened by hybridization with 32 P labeled 27-mer. Two clones among 5000 gave a strong positive signal. Plasmid DNAs from these clones were purified and characterized by blotting and nucleotide sequence analysis. The length of C23 mRNA was estimated to be 3200 bases in a northern blot analysis. The sequence of a 267 b.p. insert shows high homology with the CHO cDNA with only 9 nucleotide differences and an identical amino acid sequence. These studies indicate that this region of the protein is highly conserved
A novel method of providing a library of n-mers or biopolymers

DEFF Research Database (Denmark)

2012-01-01

The present invention relates to a method of providing a library of n-mer sequences, wherein the library is composed of an n-mer sequence. Also the invention concerns a method of providing a library of biopolymer sequences having one or more n-mers in common. Further provided are specific primers...
In situ genomic DNA extraction for PCR analysis of regions of interest in four plant species and one filamentous fungi

Directory of Open Access Journals (Sweden)

Luis E. Rojas

2014-07-01

Full Text Available The extraction methods of genomic DNA are usually laborious and hazardous to human health and the environment by the use of organic solvents (chloroform and phenol. In this work a protocol for in situ extraction of genomic DNA by alkaline lysis is validated. It was used in order to amplify regions of DNA in four species of plants and fungi by polymerase chain reaction (PCR. From plant material of Saccharum officinarum L., Carica papaya L. and Digitalis purpurea L. it was possible to extend different regions of the genome through PCR. Furthermore, it was possible to amplify a fragment of avr-4 gene DNA purified from lyophilized mycelium of Mycosphaerella fijiensis. Additionally, it was possible to amplify the region ap24 transgene inserted into the genome of banana cv. `Grande naine' (Musa AAA. Key words: alkaline lysis, Carica papaya L., Digitalis purpurea L., Musa, Saccharum officinarum L.

Sequencing of whole plastid genomes and nuclear ribosomal DNA of Diospyros species (Ebenaceae) endemic to New Caledonia: many species, little divergence.

Science.gov (United States)

Turner, Barbara; Paun, Ovidiu; Munzinger, Jérôme; Chase, Mark W; Samuel, Rosabelle

2016-06-01

Some plant groups, especially on islands, have been shaped by strong ancestral bottlenecks and rapid, recent radiation of phenotypic characters. Single molecular markers are often not informative enough for phylogenetic reconstruction in such plant groups. Whole plastid genomes and nuclear ribosomal DNA (nrDNA) are viewed by many researchers as sources of information for phylogenetic reconstruction of groups in which expected levels of divergence in standard markers are low. Here we evaluate the usefulness of these data types to resolve phylogenetic relationships among closely related Diospyros species. Twenty-two closely related Diospyros species from New Caledonia were investigated using whole plastid genomes and nrDNA data from low-coverage next-generation sequencing (NGS). Phylogenetic trees were inferred using maximum parsimony, maximum likelihood and Bayesian inference on separate plastid and nrDNA and combined matrices. The plastid and nrDNA sequences were, singly and together, unable to provide well supported phylogenetic relationships among the closely related New Caledonian Diospyros species. In the nrDNA, a 6-fold greater percentage of parsimony-informative characters compared with plastid DNA was found, but the total number of informative sites was greater for the much larger plastid DNA genomes. Combining the plastid and nuclear data improved resolution. Plastid results showed a trend towards geographical clustering of accessions rather than following taxonomic species. In plant groups in which multiple plastid markers are not sufficiently informative, an investigation at the level of the entire plastid genome may also not be sufficient for detailed phylogenetic reconstruction. Sequencing of complete plastid genomes and nrDNA repeats seems to clarify some relationships among the New Caledonian Diospyros species, but the higher percentage of parsimony-informative characters in nrDNA compared with plastid DNA did not help to resolve the phylogenetic tree
Evolutionary analyses of entire genomes do not support the association of mtDNA mutations with Ras/MAPK pathway syndromes.

Directory of Open Access Journals (Sweden)

Alberto Gómez-Carballa

Full Text Available BACKGROUND: There are several known autosomal genes responsible for Ras/MAPK pathway syndromes, including Noonan syndrome (NS and related disorders (such as LEOPARD, neurofibromatosis type 1, although mutations of these genes do not explain all cases. Due to the important role played by the mitochondrion in the energetic metabolism of cardiac muscle, it was recently proposed that variation in the mitochondrial DNA (mtDNA genome could be a risk factor in the Noonan phenotype and in hypertrophic cardiomyopathy (HCM, which is a common clinical feature in Ras/MAPK pathway syndromes. In order to test these hypotheses, we sequenced entire mtDNA genomes in the largest series of patients suffering from Ras/MAPK pathway syndromes analyzed to date (n = 45, most of them classified as NS patients (n = 42. METHODS/PRINCIPAL FINDINGS: The results indicate that the observed mtDNA lineages were mostly of European ancestry, reproducing in a nutshell the expected haplogroup (hg patterns of a typical Iberian dataset (including hgs H, T, J, and U. Three new branches of the mtDNA phylogeny (H1j1, U5b1e, and L2a5 are described for the first time, but none of these are likely to be related to NS or Ras/MAPK pathway syndromes when observed under an evolutionary perspective. Patterns of variation in tRNA and protein genes, as well as redundant, private and heteroplasmic variants, in the mtDNA genomes of patients were as expected when compared with the patterns inferred from a worldwide mtDNA phylogeny based on more than 8700 entire genomes. Moreover, most of the mtDNA variants found in patients had already been reported in healthy individuals and constitute common polymorphisms in human population groups. CONCLUSIONS/SIGNIFICANCE: As a whole, the observed mtDNA genome variation in the NS patients was difficult to reconcile with previous findings that indicated a pathogenic role of mtDNA variants in NS.
Evolutionary Analyses of Entire Genomes Do Not Support the Association of mtDNA Mutations with Ras/MAPK Pathway Syndromes

Science.gov (United States)

Cerezo, María; Balboa, Emilia; Heredia, Claudia; Castro-Feijóo, Lidia; Rica, Itxaso; Barreiro, Jesús; Eirís, Jesús; Cabanas, Paloma; Martínez-Soto, Isabel; Fernández-Toral, Joaquín; Castro-Gago, Manuel; Pombo, Manuel; Carracedo, Ángel; Barros, Francisco

2011-01-01

Background There are several known autosomal genes responsible for Ras/MAPK pathway syndromes, including Noonan syndrome (NS) and related disorders (such as LEOPARD, neurofibromatosis type 1), although mutations of these genes do not explain all cases. Due to the important role played by the mitochondrion in the energetic metabolism of cardiac muscle, it was recently proposed that variation in the mitochondrial DNA (mtDNA) genome could be a risk factor in the Noonan phenotype and in hypertrophic cardiomyopathy (HCM), which is a common clinical feature in Ras/MAPK pathway syndromes. In order to test these hypotheses, we sequenced entire mtDNA genomes in the largest series of patients suffering from Ras/MAPK pathway syndromes analyzed to date (n = 45), most of them classified as NS patients (n = 42). Methods/Principal Findings The results indicate that the observed mtDNA lineages were mostly of European ancestry, reproducing in a nutshell the expected haplogroup (hg) patterns of a typical Iberian dataset (including hgs H, T, J, and U). Three new branches of the mtDNA phylogeny (H1j1, U5b1e, and L2a5) are described for the first time, but none of these are likely to be related to NS or Ras/MAPK pathway syndromes when observed under an evolutionary perspective. Patterns of variation in tRNA and protein genes, as well as redundant, private and heteroplasmic variants, in the mtDNA genomes of patients were as expected when compared with the patterns inferred from a worldwide mtDNA phylogeny based on more than 8700 entire genomes. Moreover, most of the mtDNA variants found in patients had already been reported in healthy individuals and constitute common polymorphisms in human population groups. Conclusions/Significance As a whole, the observed mtDNA genome variation in the NS patients was difficult to reconcile with previous findings that indicated a pathogenic role of mtDNA variants in NS. PMID:21526175
Inhibitory effects of vitamin K3 on DNA polymerase and angiogenesis.

Science.gov (United States)

Matsubara, Kiminori; Kayashima, Tomoko; Mori, Masaharu; Yoshida, Hiromi; Mizushina, Yoshiyuki

2008-09-01

Vitamins play essential roles in cellular reactions and maintain human health. Recent studies have revealed that some vitamins including D3, B6 and K2 and their derivatives have an anti-cancer effect. As a mechanism, their inhibitory effect on cancer-related angiogenesis has been demonstrated. Vitamin K2 (menaquinones) has an anti-cancer effect in particular for hepatic cancer and inhibits angiogenesis. In the current study, we demonstrated that sole vitamin K3 (menadione) selectively inhibits the in vitro activity of eukaryotic DNA polymerase gamma, which is a mitochondrial DNA polymerase, and suppresses angiogenesis in a rat aortic ring model. The anti-angiogenic effect of vitamin K3 has been shown in angiogenesis models using human umbilical vein endothelial cells (HUVECs) with regard to HUVEC growth, tube formation on reconstituted basement membrane and chemotaxis. These results suggest that vitamin K3 may be a potential anti-cancer agent like vitamin K2.
Bat Biology, Genomes, and the Bat1K Project: To Generate Chromosome-Level Genomes for All Living Bat Species.

Science.gov (United States)

Teeling, Emma C; Vernes, Sonja C; Dávalos, Liliana M; Ray, David A; Gilbert, M Thomas P; Myers, Eugene

2018-02-15

Bats are unique among mammals, possessing some of the rarest mammalian adaptations, including true self-powered flight, laryngeal echolocation, exceptional longevity, unique immunity, contracted genomes, and vocal learning. They provide key ecosystem services, pollinating tropical plants, dispersing seeds, and controlling insect pest populations, thus driving healthy ecosystems. They account for more than 20% of all living mammalian diversity, and their crown-group evolutionary history dates back to the Eocene. Despite their great numbers and diversity, many species are threatened and endangered. Here we announce Bat1K, an initiative to sequence the genomes of all living bat species (n∼1,300) to chromosome-level assembly. The Bat1K genome consortium unites bat biologists (>148 members as of writing), computational scientists, conservation organizations, genome technologists, and any interested individuals committed to a better understanding of the genetic and evolutionary mechanisms that underlie the unique adaptations of bats. Our aim is to catalog the unique genetic diversity present in all living bats to better understand the molecular basis of their unique adaptations; uncover their evolutionary history; link genotype with phenotype; and ultimately better understand, promote, and conserve bats. Here we review the unique adaptations of bats and highlight how chromosome-level genome assemblies can uncover the molecular basis of these traits. We present a novel sequencing and assembly strategy and review the striking societal and scientific benefits that will result from the Bat1K initiative.
Annotating the genome by DNA methylation.

Science.gov (United States)

Cedar, Howard; Razin, Aharon

2017-01-01

DNA methylation plays a prominent role in setting up and stabilizing the molecular design of gene regulation and by understanding this process one gains profound insight into the underlying biology of mammals. In this article, we trace the discoveries that provided the foundations of this field, starting with the mapping of methyl groups in the genome and the experiments that helped clarify how methylation patterns are maintained through cell division. We then address the basic relationship between methyl groups and gene repression, as well as the molecular rules involved in controlling this process during development in vivo. Finally, we describe ongoing work aimed at defining the role of this modification in disease and deciphering how it may serve as a mechanism for sensing the environment.
Fine organization of genomic regions tagged to the 5S rDNA locus of the bread wheat 5B chromosome.

Science.gov (United States)

Sergeeva, Ekaterina M; Shcherban, Andrey B; Adonina, Irina G; Nesterov, Michail A; Beletsky, Alexey V; Rakitin, Andrey L; Mardanov, Andrey V; Ravin, Nikolai V; Salina, Elena A

2017-11-14

The multigene family encoding the 5S rRNA, one of the most important structurally-functional part of the large ribosomal subunit, is an obligate component of all eukaryotic genomes. 5S rDNA has long been a favored target for cytological and phylogenetic studies due to the inherent peculiarities of its structural organization, such as the tandem arrays of repetitive units and their high interspecific divergence. The complex polyploid nature of the genome of bread wheat, Triticum aestivum, and the technically difficult task of sequencing clusters of tandem repeats mean that the detailed organization of extended genomic regions containing 5S rRNA genes remains unclear. This is despite the recent progress made in wheat genomic sequencing. Using pyrosequencing of BAC clones, in this work we studied the organization of two distinct 5S rDNA-tagged regions of the 5BS chromosome of bread wheat. Three BAC-clones containing 5S rDNA were identified in the 5BS chromosome-specific BAC-library of Triticum aestivum. Using the results of pyrosequencing and assembling, we obtained six 5S rDNA- containing contigs with a total length of 140,417 bp, and two sets (pools) of individual 5S rDNA sequences belonging to separate, but closely located genomic regions on the 5BS chromosome. Both regions are characterized by the presence of approximately 70-80 copies of 5S rDNA, however, they are completely different in their structural organization. The first region contained highly diverged short-type 5S rDNA units that were disrupted by multiple insertions of transposable elements. The second region contained the more conserved long-type 5S rDNA, organized as a single tandem array. FISH using probes specific to both 5S rDNA unit types showed differences in the distribution and intensity of signals on the chromosomes of polyploid wheat species and their diploid progenitors. A detailed structural organization of two closely located 5S rDNA-tagged genomic regions on the 5BS chromosome of bread
Travel-related MERS-CoV cases: An assessment of exposures and risk factors in a group of Dutch travellers returning from the Kingdom of Saudi Arabia, May 2014

NARCIS (Netherlands)

E.B. Fanoy (Ewout B.); M.A.B. van der Sande (Marianne); M. Kraaij-Dirkzwager (Marleen); K. Dirksen (Kees); M. Jonges (Marcel); W. van der Hoek (Wim); M.P.G. Koopmans D.V.M. (Marion); D.V. Werf (Douwe Vander); G.J.B. Sonder (Gerard); C. van der Weijden (Charlie); J. van der Heuvel (Jet); L.B.S. Gelinck (Luc); J.W. Bouwhuis (Jolande); A.B. van Gageldonk-Lafeber (Rianne)

2014-01-01

markdownabstract__Background:__ In May 2014, Middle East respiratory syndrome coronavirus (MERS-CoV) infection, with closely related viral genomes, was diagnosed in two Dutch residents, returning from a pilgrimage to Medina and Mecca, Kingdom of Saudi Arabia (KSA). These patients travelled with a
A 7-mer knowledge-based potential for detecting native protein structures from decoys

DEFF Research Database (Denmark)

Røgen, Peter

for faster sampling methods. Background: The C-alpha atoms define a polygonal curve in 3-space which is smoothened by the method presented in [1] and is illustrated below. The geometry of a 7-mer is described by two numbers that describe how stretched and curved the smoothening of the 7-mer is. These two...... numbers are called length and distance excess, c.f. [2], and give one point in the length - distance excess - plane, LDE-plane. Method: Given a sequence of amino acids, we break it down to all its 7-mers and search a database of known 3d-structures for similar 7-mer sequences. For the query 7-mer we...... define an energy function in the LDE-plane. This energy is given by the 7-mer found and depends linearly on some design parameters. The energy function of the full query sequence, F, is then a sum over all 7-mers. For a protein P and a decoy D we ideally want F(D)-F(P)=constant.RMSD( D , P ), where 0...
DNA end resection by CtIP and exonuclease 1 prevents genomic instability

DEFF Research Database (Denmark)

Eid, Wassim; Steger, Martin; El-Shemerly, Mahmoud

2010-01-01

End resection of DNA-which is essential for the repair of DNA double-strand breaks (DSBs) by homologous recombination-relies first on the partnership between MRE11-RAD50-NBS1 (MRN) and CtIP, followed by a processive step involving helicases and exonucleases such as exonuclease 1 (EXO1). In this s......End resection of DNA-which is essential for the repair of DNA double-strand breaks (DSBs) by homologous recombination-relies first on the partnership between MRE11-RAD50-NBS1 (MRN) and CtIP, followed by a processive step involving helicases and exonucleases such as exonuclease 1 (EXO1...... of DNA-PK-dependent radial chromosome formation. Thus, our study identifies new functions of CtIP and EXO1 in DNA end resection and provides new information on the regulation of DSB repair pathways, which is a key factor in the maintenance of genome integrity....
DNA adenine methylation modulates pathogenicity of Klebsiella pneumoniae genotype K1

Directory of Open Access Journals (Sweden)

Chi-Tai Fang

2017-08-01

Conclusion: Our results support the view that DNA adenine methylation plays an important role in modulating the pathogenicity of K. pneumoniae genotype K1. The incomplete attenuation indicates the existence of other regulatory factors.
Preventing cleavage of Mer promotes efferocytosis and suppresses acute lung injury in bleomycin treated mice

International Nuclear Information System (INIS)

Lee, Ye-Ji; Lee, Seung-Hae; Youn, Young-So; Choi, Ji-Yeon; Song, Keung-Sub; Cho, Min-Sun; Kang, Jihee Lee

2012-01-01

Mer receptor tyrosine kinase (Mer) regulates macrophage activation and promotes apoptotic cell clearance. Mer activation is regulated through proteolytic cleavage of the extracellular domain. To determine if membrane-bound Mer is cleaved during bleomycin-induced lung injury, and, if so, how preventing the cleavage of Mer enhances apoptotic cell uptake and down-regulates pulmonary immune responses. During bleomycin-induced acute lung injury in mice, membrane-bound Mer expression decreased, but production of soluble Mer and activity as well as expression of disintegrin and metalloproteinase 17 (ADAM17) were enhanced . Treatment with the ADAM inhibitor TAPI-0 restored Mer expression and diminished soluble Mer production. Furthermore, TAPI-0 increased Mer activation in alveolar macrophages and lung tissue resulting in enhanced apoptotic cell clearance in vivo and ex vivo by alveolar macrophages. Suppression of bleomycin-induced pro-inflammatory mediators, but enhancement of hepatocyte growth factor induction were seen after TAPI-0 treatment. Additional bleomycin-induced inflammatory responses reduced by TAPI-0 treatment included inflammatory cell recruitment into the lungs, levels of total protein and lactate dehydrogenase activity in bronchoalveolar lavage fluid, as well as caspase-3 and caspase-9 activity and alveolar epithelial cell apoptosis in lung tissue. Importantly, the effects of TAPI-0 on bleomycin-induced inflammation and apoptosis were reversed by coadministration of specific Mer-neutralizing antibodies. These findings suggest that restored membrane-bound Mer expression by TAPI-0 treatment may help resolve lung inflammation and apoptosis after bleomycin treatment. -- Highlights: ►Mer expression is restored by TAPI-0 treatment in bleomycin-stimulated lung. ►Mer signaling is enhanced by TAPI-0 treatment in bleomycin-stimulated lung. ►TAPI-0 enhances efferocytosis and promotes resolution of lung injury.
DNA immunoprecipitation semiconductor sequencing (DIP-SC-seq) as a rapid method to generate genome wide epigenetic signatures

OpenAIRE

Thomson, John P.; Fawkes, Angie; Ottaviano, Raffaele; Hunter, Jennifer M.; Shukla, Ruchi; Mjoseng, Heidi K.; Clark, Richard; Coutts, Audrey; Murphy, Lee; Meehan, Richard R.

2015-01-01

Modification of DNA resulting in 5-methylcytosine (5 mC) or 5-hydroxymethylcytosine (5hmC) has been shown to influence the local chromatin environment and affect transcription. Although recent advances in next generation sequencing technology allow researchers to map epigenetic modifications across the genome, such experiments are often time-consuming and cost prohibitive. Here we present a rapid and cost effective method of generating genome wide DNA modification maps utilising commercially ...
New redox-active layer create via epoxy-amine reaction - The base of genosensor for the detection of specific DNA and RNA sequences of avian influenza virus H5N1.

Science.gov (United States)

Malecka, Kamila; Stachyra, Anna; Góra-Sochacka, Anna; Sirko, Agnieszka; Zagórski-Ostoja, Włodzimierz; Dehaen, Wim; Radecka, Hanna; Radecki, Jerzy

2015-03-15

This paper concerns the development of a redox-active monolayer and its application for the construction of an electrochemical genosensor designed for the detection of specific DNA and RNA oligonucleotide sequences related to the avian influenza virus (AIV) type H5N1. This new redox layer was created on a gold electrode surface step by step. Cyclic Voltammetry, Osteryoung Square-Wave Voltammetry and Differential Pulse Voltammetry were used for its characterization. This new redox-active layer was applied for the construction of the DNA biosensor. The NH2-NC3 probe (20-mer) was covalently attached to the gold electrode surface via a "click" reaction between the amine and an epoxide group. The hybridization process was monitored using the Osteryoung Square-Wave Voltammetry. The 20-mer DNA and ca. 280-mer RNA oligonucleotides were used as the targets. The constructed genosensor was capable to determine complementary oligonucleotide sequences with a detection limit in the pM range. It is able to distinguish the different position of the part RNA complementary to the DNA probe. The genosensor was very selective. The 20-mer DNA as well as the 280-mer RNA oligonucleotides without a complementary sequence generated a weak signal. Copyright © 2014 Elsevier B.V. All rights reserved.
Using a commercially available DNA extraction kit to obtain high quality human genomic DNA suitable for PCR and genotyping from 11-year-old saliva saturated cotton spit wads

Directory of Open Access Journals (Sweden)

Hudziak James J

2008-12-01

Full Text Available Abstract Background We sought to describe the integrity of human genomic DNA extracted from saliva saturated cotton spit wads stored at -20°C for approximately 11 years. 783 spit wad samples were collected from an ADHD sample population (Vermont Family Study during 1996–2000. Human genomic DNA was extracted from the spit wads using a commercially available kit; QIAamp DNA Blood Midi Kit (Qiagen, Inc., Valencia, CA. with a few modifications. Results The resulting DNA yield was more than adequate for genetic analysis and ranged from approximately 1 μg to a total of 80 μg (mean 17.3 μgs ± 11.9 μgs. A260/A280 ratios for the human genomic DNA extracted from the spit wads was consistently within the generally acceptable values of 1.7–2.0, with the lowest purity being 1.70, and a mean value of 1.937 ± 0.226 for the 783 samples. The DNA also was suitable for PCR reactions as evidenced by the amplification of the serotonin-transporter-linked polymorphic region, 5HTTLPR. 5HTTLPR is a functional polymorphism in the promoter region of the serotonin transporter gene (HTT, SLC6A4, or SERT, consisting of two intensively studied alleles. 770 of the 783 samples (98.3% produced fragments after PCR of the expected size with primers specific for 5HTTLPR. Conclusion High quality and abundant genomic DNA can be successfully retrieved from saliva saturated cotton spit wads using the commercially available kit, QIAamp DNA Blood Midi Kit from Qiagen, Inc. Furthermore, the DNA can be extracted in less than 3 hours and multiple samples can be processed simultaneously thus reducing processing time.
DNA rearrangements from γ-irradiated normal human fibroblasts preferentially occur in transcribed regions of the genome

International Nuclear Information System (INIS)

Forrester, H.B.; Radford, I.R.

2003-01-01

Full text: DNA rearrangement events leading to chromosomal aberrations are central to ionizing radiation-induced cell death. Although DNA double-strand breaks are probably the lesion that initiates formation of chromosomal aberrations, little is understood about the molecular mechanisms that generate and modulate DNA rearrangement. Examination of the sequences that flank sites of DNA rearrangement may provide information regarding the processes and enzymes involved in rearrangement events. Accordingly, we developed a method using inverse PCR that allows the detection and sequencing of putative radiation-induced DNA rearrangements in defined regions of the human genome. The method can detect single copies of a rearrangement event that has occurred in a particular region of the genome and, therefore, DNA rearrangement detection does not require survival and continued multiplication of the affected cell. Ionizing radiation-induced DNA rearrangements were detected in several different regions of the genome of human fibroblast cells that were exposed to 30 Gy of γ-irradiation and then incubated for 24 hours at 37 deg C. There was a 3- to 5-fold increase in the number of products amplified from irradiated as compared with control cells in the target regions 5' to the C-MYC, CDKN1A, RB1, and FGFR2 genes. Sequences were examined from 121 DNA rearrangements. Approximately half of the PCR products were derived from possible inter-chromosomal rearrangements and the remainder were from intra-chromosomal events. A high proportion of the sequences that rearranged with target regions were located in genes, suggesting that rearrangements may occur preferentially in transcribed regions. Eighty-four percent of the sequences examined by reverse transcriptase PCR were from transcribed sequences in IMR-90 cells. The distribution of DNA rearrangements within the target regions is non-random and homology occurs between the sequences involved in rearrangements in some cases but is not
Repair of DNA in replicated and unreplicated portions of the human genome

International Nuclear Information System (INIS)

Waters, R.

1979-01-01

Portions of the human genome that have replicated after ultraviolet light irradiation and those that remain unreplicated have both been examined for the distribution of pyrimidine dimers and the extent of repair replication following their removal. The data indicate that the number of unrepaired dimers and the extent of repair replication seen after their excision are equal in the replicated and unreplicated DNA. Furthermore, the daughter strand of replicated DNA is larger than the average interdimer distance found in the parental strand. Hence, DNA replication in normal human fibroblasts is clearly capable of getting past pyrimidine dimers, and a preferential repair of such lesions in DNA that is about to be or has been replicated does not operate to any visible extent in these cells. (author)
Development and Use of Integrated Microarray-Based Genomic Technologies for Assessing Microbial Community Composition and Dynamics

Energy Technology Data Exchange (ETDEWEB)

J. Zhou; S.-K. Rhee; C. Schadt; T. Gentry; Z. He; X. Li; X. Liu; J. Liebich; S.C. Chong; L. Wu

2004-03-17

To effectively monitor microbial populations involved in various important processes, a 50-mer-based oligonucleotide microarray was developed based on known genes and pathways involved in: biodegradation, metal resistance and reduction, denitrification, nitrification, nitrogen fixation, methane oxidation, methanogenesis, carbon polymer decomposition, and sulfate reduction. This array contains approximately 2000 unique and group-specific probes with <85% similarity to their non-target sequences. Based on artificial probes, our results showed that at hybridization conditions of 50 C and 50% formamide, the 50-mer microarray hybridization can differentiate sequences having <88% similarity. Specificity tests with representative pure cultures indicated that the designed probes on the arrays appeared to be specific to their corresponding target genes. Detection limits were about 5-10ng genomic DNA in the absence of background DNA, and 50-100ng ({approx}1.3{sup o} 10{sup 7} cells) in the presence background DNA. Strong linear relationships between signal intensity and target DNA and RNA concentration were observed (r{sup 2} = 0.95-0.99). Application of this microarray to naphthalene-amended enrichments and soil microcosms demonstrated that composition of the microflora varied depending on incubation conditions. While the naphthalene-degrading genes from Rhodococcus-type microorganisms were dominant in enrichments, the genes involved in naphthalene degradation from Gram-negative microorganisms such as Ralstonia, Comamonas, and Burkholderia were most abundant in the soil microcosms (as well as those for polyaromatic hydrocarbon and nitrotoluene degradation). Although naphthalene degradation is widely known and studied in Pseudomonas, Pseudomonas genes were not detected in either system. Real-time PCR analysis of 4 representative genes was consistent with microarray-based quantification (r{sup 2} = 0.95). Currently, we are also applying this microarray to the study of several
DNA-based identification of spices: DNA isolation, whole genome amplification, and polymerase chain reaction.

Science.gov (United States)

Focke, Felix; Haase, Ilka; Fischer, Markus

2011-01-26

Usually spices are identified morphologically using simple methods like magnifying glasses or microscopic instruments. On the other hand, molecular biological methods like the polymerase chain reaction (PCR) enable an accurate and specific detection also in complex matrices. Generally, the origins of spices are plants with diverse genetic backgrounds and relationships. The processing methods used for the production of spices are complex and individual. Consequently, the development of a reliable DNA-based method for spice analysis is a challenging intention. However, once established, this method will be easily adapted to less difficult food matrices. In the current study, several alternative methods for the isolation of DNA from spices have been developed and evaluated in detail with regard to (i) its purity (photometric), (ii) yield (fluorimetric methods), and (iii) its amplifiability (PCR). Whole genome amplification methods were used to preamplify isolates to improve the ratio between amplifiable DNA and inhibiting substances. Specific primer sets were designed, and the PCR conditions were optimized to detect 18 spices selectively. Assays of self-made spice mixtures were performed to proof the applicability of the developed methods.
CC1, a novel crenarchaeal DNA binding protein.

Science.gov (United States)

Luo, Xiao; Schwarz-Linek, Uli; Botting, Catherine H; Hensel, Reinhard; Siebers, Bettina; White, Malcolm F

2007-01-01

The genomes of the related crenarchaea Pyrobaculum aerophilum and Thermoproteus tenax lack any obvious gene encoding a single-stranded DNA binding protein (SSB). SSBs are essential for DNA replication, recombination, and repair and are found in all other genomes across the three domains of life. These two archaeal genomes also have only one identifiable gene encoding a chromatin protein (the Alba protein), while most other archaea have at least two different abundant chromatin proteins. We performed a biochemical screen for novel nucleic acid binding proteins present in cell extracts of T. tenax. An assay for proteins capable of binding to a single-stranded DNA oligonucleotide resulted in identification of three proteins. The first protein, Alba, has been shown previously to bind single-stranded DNA as well as duplex DNA. The two other proteins, which we designated CC1 (for crenarchaeal chromatin protein 1), are very closely related to one another, and homologs are restricted to the P. aerophilum and Aeropyrum pernix genomes. CC1 is a 6-kDa, monomeric, basic protein that is expressed at a high level in T. tenax. This protein binds single- and double-stranded DNAs with similar affinities. These properties are consistent with a role for CC1 as a crenarchaeal chromatin protein.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.