WorldWideScience

Sample records for dna motif analysis

  1. MotifMark: Finding regulatory motifs in DNA sequences.

    Science.gov (United States)

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

    2017-07-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.

  2. LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

    Science.gov (United States)

    Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

    2014-02-17

    As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of

  3. PISMA: A Visual Representation of Motif Distribution in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Rogelio Alcántara-Silva

    2017-03-01

    Full Text Available Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .

  4. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-01-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  5. DNA motif elucidation using belief propagation.

    Science.gov (United States)

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-09-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8∼10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/∼wkc/kmerHMM.

  6. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  7. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Science.gov (United States)

    Grimm, Guido W.; Renner, Susanne S.; Stamatakis, Alexandros; Hemleben, Vera

    2007-01-01

    The multi-copy internal transcribed spacer (ITS) region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML) and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation) instead of the full (partly redundant) original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994) 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly. PMID:19455198

  8. A Nuclear Ribosomal DNA Phylogeny of Acer Inferred with Maximum Likelihood, Splits Graphs, and Motif Analysis of 606 Sequences

    Directory of Open Access Journals (Sweden)

    Guido W. Grimm

    2006-01-01

    Full Text Available The multi-copy internal transcribed spacer (ITS region of nuclear ribosomal DNA is widely used to infer phylogenetic relationships among closely related taxa. Here we use maximum likelihood (ML and splits graph analyses to extract phylogenetic information from ~ 600 mostly cloned ITS sequences, representing 81 species and subspecies of Acer, and both species of its sister Dipteronia. Additional analyses compared sequence motifs in Acer and several hundred Anacardiaceae, Burseraceae, Meliaceae, Rutaceae, and Sapindaceae ITS sequences in GenBank. We also assessed the effects of using smaller data sets of consensus sequences with ambiguity coding (accounting for within-species variation instead of the full (partly redundant original sequences. Neighbor-nets and bipartition networks were used to visualize conflict among character state patterns. Species clusters observed in the trees and networks largely agree with morphology-based classifications; of de Jong’s (1994 16 sections, nine are supported in neighbor-net and bipartition networks, and ten by sequence motifs and the ML tree; of his 19 series, 14 are supported in networks, motifs, and the ML tree. Most nodes had higher bootstrap support with matrices of 105 or 40 consensus sequences than with the original matrix. Within-taxon ITS divergence did not differ between diploid and polyploid Acer, and there was little evidence of differentiated parental ITS haplotypes, suggesting that concerted evolution in Acer acts rapidly.

  9. Global MYCN transcription factor binding analysis in neuroblastoma reveals association with distinct E-box motifs and regions of DNA hypermethylation.

    LENUS (Irish Health Repository)

    Murphy, Derek M

    2009-01-01

    BACKGROUND: Neuroblastoma, a cancer derived from precursor cells of the sympathetic nervous system, is a major cause of childhood cancer related deaths. The single most important prognostic indicator of poor clinical outcome in this disease is genomic amplification of MYCN, a member of a family of oncogenic transcription factors. METHODOLOGY: We applied MYCN chromatin immunoprecipitation to microarrays (ChIP-chip) using MYCN amplified\\/non-amplified cell lines as well as a conditional knockdown cell line to determine the distribution of MYCN binding sites within all annotated promoter regions. CONCLUSION: Assessment of E-box usage within consistently positive MYCN binding sites revealed a predominance for the CATGTG motif (p<0.0016), with significant enrichment of additional motifs CATTTG, CATCTG, CAACTG in the MYCN amplified state. For cell lines over-expressing MYCN, gene ontology analysis revealed enrichment for the binding of MYCN at promoter regions of numerous molecular functional groups including DNA helicases and mRNA transcriptional regulation. In order to evaluate MYCN binding with respect to other genomic features, we determined the methylation status of all annotated CpG islands and promoter sequences using methylated DNA immunoprecipitation (MeDIP). The integration of MYCN ChIP-chip and MeDIP data revealed a highly significant positive correlation between MYCN binding and DNA hypermethylation. This association was also detected in regions of hemizygous loss, indicating that the observed association occurs on the same homologue. In summary, these findings suggest that MYCN binding occurs more commonly at CATGTG as opposed to the classic CACGTG E-box motif, and that disease associated over expression of MYCN leads to aberrant binding to additional weaker affinity E-box motifs in neuroblastoma. The co-localization of MYCN binding and DNA hypermethylation further supports the dual role of MYCN, namely that of a classical transcription factor affecting the

  10. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin

    2015-01-01

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  12. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun

    2015-09-27

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  13. MotifMark: Finding Regulatory Motifs in DNA Sequences

    OpenAIRE

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L.; Wang, May D.

    2017-01-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity be...

  14. Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

    Science.gov (United States)

    Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

    2018-01-10

    Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from a cosmic database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing

  15. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    Science.gov (United States)

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  16. The limits of de novo DNA motif discovery.

    Directory of Open Access Journals (Sweden)

    David Simcha

    Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery-searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of

  17. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    Science.gov (United States)

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to accumulation of information about a huge amount of DNA sequences. There are a lot of web services which are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. An enormous motif diversity makes their finding challenging. In order to avoid the difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of the motif discovery programs dramatically declines with the query set size increase. This leads to the fact that only a fraction of top "peak" ChIP-Seq segments can be analyzed or the area of analysis should be narrowed. Thus, the motif discovery in massive datasets remains a challenging issue. Argo_Compute Unified Device Architecture (CUDA) web service is designed to process the massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on the high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed the motifs which correspond to known transcription factor binding sites.

  18. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  19. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  20. Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

    Science.gov (United States)

    Gade, Chandrasekhar Reddy; Sharma, Nagendra K

    2017-12-15

    This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNAi-motif structure with DNA cytosine pentamer (dC 5 ) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick eSobetzko

    2016-03-01

    Full Text Available Short DNA motifs are involved in a multitude of functions such as for example chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool DistAMo (Distribution Analysis of DNA Motifs. The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i genes beside the replication origin are enriched in GATCs, (ii genome-wide GATC distribution follows a distinct pattern and (iii genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.

  2. MotifNet: a web-server for network motif analysis.

    Science.gov (United States)

    Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

    2017-06-15

    Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  3. DNA mutation motifs in the genes associated with inherited diseases.

    Directory of Open Access Journals (Sweden)

    Michal Růžička

    Full Text Available Mutations in human genes can be responsible for inherited genetic disorders and cancer. Mutations can arise due to environmental factors or spontaneously. It has been shown that certain DNA sequences are more prone to mutate. These sites are termed hotspots and exhibit a higher mutation frequency than expected by chance. In contrast, DNA sequences with lower mutation frequencies than expected by chance are termed coldspots. Mutation hotspots are usually derived from a mutation spectrum, which reflects particular population where an effect of a common ancestor plays a role. To detect coldspots/hotspots unaffected by population bias, we analysed the presence of germline mutations obtained from HGMD database in the 5-nucleotide segments repeatedly occurring in genes associated with common inherited disorders, in particular, the PAH, LDLR, CFTR, F8, and F9 genes. Statistically significant sequences (mutational motifs rarely associated with mutations (coldspots and frequently associated with mutations (hotspots exhibited characteristic sequence patterns, e.g. coldspots contained purine tract while hotspots showed alternating purine-pyrimidine bases, often with the presence of CpG dinucleotide. Using molecular dynamics simulations and free energy calculations, we analysed the global bending properties of two selected coldspots and two hotspots with a G/T mismatch. We observed that the coldspots were inherently more flexible than the hotspots. We assume that this property might be critical for effective mismatch repair as DNA with a mutation recognized by MutSα protein is noticeably bent.

  4. DNA motif alignment by evolving a population of Markov chains.

    Science.gov (United States)

    Bi, Chengpeng

    2009-01-30

    Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

  5. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  6. Discovery of cell-type specific DNA motif grammar in cis-regulatory elements using random Forest.

    Science.gov (United States)

    Wang, Xin; Lin, Peijie; Ho, Joshua W K

    2018-01-19

    It has been observed that many transcription factors (TFs) can bind to different genomic loci depending on the cell type in which a TF is expressed in, even though the individual TF usually binds to the same core motif in different cell types. How a TF can bind to the genome in such a highly cell-type specific manner, is a critical research question. One hypothesis is that a TF requires co-binding of different TFs in different cell types. If this is the case, it may be possible to observe different combinations of TF motifs - a motif grammar - located at the TF binding sites in different cell types. In this study, we develop a bioinformatics method to systematically identify DNA motifs in TF binding sites across multiple cell types based on published ChIP-seq data, and address two questions: (1) can we build a machine learning classifier to predict cell-type specificity based on motif combinations alone, and (2) can we extract meaningful cell-type specific motif grammars from this classifier model. We present a Random Forest (RF) based approach to build a multi-class classifier to predict the cell-type specificity of a TF binding site given its motif content. We applied this RF classifier to two published ChIP-seq datasets of TF (TCF7L2 and MAX) across multiple cell types. Using cross-validation, we show that motif combinations alone are indeed predictive of cell types. Furthermore, we present a rule mining approach to extract the most discriminatory rules in the RF classifier, thus allowing us to discover the underlying cell-type specific motif grammar. Our bioinformatics analysis supports the hypothesis that combinatorial TF motif patterns are cell-type specific.

  7. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    ... machine (SVM) and its application in microarray experiment of Kashin-Beck disease. ... speed and amount of the corresponding mRNA in gene replication process. ... and revealed that some motifs may be related to the immune reactions.

  8. A single thiazole orange molecule forms an exciplex in a DNA i-motif.

    Science.gov (United States)

    Xu, Baochang; Wu, Xiangyang; Yeow, Edwin K L; Shao, Fangwei

    2014-06-18

    A fluorescent exciplex of thiazole orange (TO) is formed in a single-dye conjugated DNA i-motif. The exciplex fluorescence exhibits a large Stokes shift, high quantum yield, robust response to pH oscillation and little structural disturbance to the DNA quadruplex, which can be used to monitor the folding of high-order DNA structures.

  9. I-motif DNA structures are formed in the nuclei of human cells

    Science.gov (United States)

    Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

    2018-06-01

    Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.

  10. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  11. Discovery of a Regulatory Motif for Human Satellite DNA Transcription in Response to BATF2 Overexpression.

    Science.gov (United States)

    Bai, Xuejia; Huang, Wenqiu; Zhang, Chenguang; Niu, Jing; Ding, Wei

    2016-03-01

    One of the basic leucine zipper transcription factors, BATF2, has been found to suppress cancer growth and migration. However, little is known about the genes downstream of BATF2. HeLa cells were stably transfected with BATF2, then chromatin immunoprecipitation-sequencing was employed to identify the DNA motifs responsive to BATF2. Comprehensive bioinformatics analyses indicated that the most significant motif discovered as TTCCATT[CT]GATTCCATTC[AG]AT was primarily distributed among the chromosome centromere regions and mostly within human type II satellite DNA. Such motifs were able to prime the transcription of type II satellite DNA in a directional and asymmetrical manner. Consistently, satellite II transcription was up-regulated in BATF2-overexpressing cells. The present study provides insight into understanding the role of BATF2 in tumours and the importance of satellite DNA in the maintenance of genomic stability. Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  12. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Directory of Open Access Journals (Sweden)

    Jie Zhu

    Full Text Available DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  13. Accurate Quantification of microRNA via Single Strand Displacement Reaction on DNA Origami Motif

    Science.gov (United States)

    Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs. PMID:23990889

  14. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Science.gov (United States)

    Zhu, Jie; Feng, Xiaolu; Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  15. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Science.gov (United States)

    Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui

    2012-01-01

    Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  16. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Directory of Open Access Journals (Sweden)

    Pooya Zandevakili

    Full Text Available Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  17. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.

    2011-11-15

    Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. The Author(s) 2011. Published by Oxford University Press. All rights reserved.

  18. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    Science.gov (United States)

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  19. Molecular dynamics simulations of electrostatics and hydration distributions around RNA and DNA motifs

    Science.gov (United States)

    Marlowe, Ashley E.; Singh, Abhishek; Semichaevsky, Andrey V.; Yingling, Yaroslava G.

    2009-03-01

    Nucleic acid nanoparticles can self-assembly through the formation of complementary loop-loop interactions or stem-stem interactions. Presence and concentration of ions can significantly affect the self-assembly process and the stability of the nanostructure. In this presentation we use explicit molecular dynamics simulations to examine the variations in cationic distributions and hydration environment around DNA and RNA helices and loop-loop interactions. Our simulations show that the potassium and sodium ionic distributions are different around RNA and DNA motifs which could be indicative of ion mediated relative stability of loop-loop complexes. Moreover in RNA loop-loop motifs ions are consistently present and exchanged through a distinct electronegative channel. We will also show how we used the specific RNA loop-loop motif to design a RNA hexagonal nanoparticle.

  20. Rtt107/Esc4 binds silent chromatin and DNA repair proteins using different BRCT motifs

    Directory of Open Access Journals (Sweden)

    Jockusch Rebecca A

    2006-11-01

    Full Text Available Abstract Background By screening a plasmid library for proteins that could cause silencing when targeted to the HMR locus in Saccharomyces cerevisiae, we previously reported the identification of Rtt107/Esc4 based on its ability to establish silent chromatin. In this study we aimed to determine the mechanism of Rtt107/Esc4 targeted silencing and also learn more about its biological functions. Results Targeted silencing by Rtt107/Esc4 was dependent on the SIR genes, which encode obligatory structural and enzymatic components of yeast silent chromatin. Based on its sequence, Rtt107/Esc4 was predicted to contain six BRCT motifs. This motif, originally identified in the human breast tumor suppressor gene BRCA1, is a protein interaction domain. The targeted silencing activity of Rtt107/Esc4 resided within the C-terminal two BRCT motifs, and this region of the protein bound to Sir3 in two-hybrid tests. Deletion of RTT107/ESC4 caused sensitivity to the DNA damaging agent MMS as well as to hydroxyurea. A two-hybrid screen showed that the N-terminal BRCT motifs of Rtt107/Esc4 bound to Slx4, a protein previously shown to be involved in DNA repair and required for viability in a strain lacking the DNA helicase Sgs1. Like SLX genes, RTT107ESC4 interacted genetically with SGS1; esc4Δ sgs1Δ mutants were viable, but exhibited a slow-growth phenotype and also a synergistic DNA repair defect. Conclusion Rtt107/Esc4 binds to the silencing protein Sir3 and the DNA repair protein Slx4 via different BRCT motifs, thus providing a bridge linking silent chromatin to DNA repair enzymes.

  1. STUDYING THE INFLUENCE OF THE PYRENE INTERCALATOR TINA ON THE STABILITY OF DNA i-MOTIFS

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed A.; Pedersen, Erik Bjerregaard; Khaireldin, Nahid A.

    2012-01-01

    Certain cytosine-rich (C-rich) DNA sequences can fold into secondary structures as four-stranded i-motifs with hemiprotonated base pairs. Here we synthesized C-rich TINA-intercalating oligonucleotides by inserting a nonnucleotide pyrene moiety between two C-rich regions. The stability of their i-...

  2. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.; Rangkuti, Farania; Schramm, Michael C.; Jankovic, Boris R.; Kamau, Allan; Chowdhary, Rajesh; Archer, John A.C.; Bajic, Vladimir B.

    2011-01-01

    . These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity

  3. New scoring schema for finding motifs in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Nowzari-Dalini Abbas

    2009-03-01

    Full Text Available Abstract Background Pattern discovery in DNA sequences is one of the most fundamental problems in molecular biology with important applications in finding regulatory signals and transcription factor binding sites. An important task in this problem is to search (or predict known binding sites in a new DNA sequence. For this reason, all subsequences of the given DNA sequence are scored based on an scoring function and the prediction is done by selecting the best score. By assuming no dependency between binding site base positions, most of the available tools for known binding site prediction are designed. Recently Tomovic and Oakeley investigated the statistical basis for either a claim of dependence or independence, to determine whether such a claim is generally true, and they presented a scoring function for binding site prediction based on the dependency between binding site base positions. Our primary objective is to investigate the scoring functions which can be used in known binding site prediction based on the assumption of dependency or independency in binding site base positions. Results We propose a new scoring function based on the dependency between all positions in biding site base positions. This scoring function uses joint information content and mutual information as a measure of dependency between positions in transcription factor binding site. Our method for modeling dependencies is simply an extension of position independency methods. We evaluate our new scoring function on the real data sets extracted from JASPAR and TRANSFAC data bases, and compare the obtained results with two other well known scoring functions. Conclusion The results demonstrate that the new approach improves known binding site discovery and show that the joint information content and mutual information provide a better and more general criterion to investigate the relationships between positions in the TFBS. Our scoring function is formulated by simple

  4. Design of character-based DNA barcode motif for species identification: A computational approach and its validation in fishes.

    Science.gov (United States)

    Chakraborty, Mohua; Dhar, Bishal; Ghosh, Sankar Kumar

    2017-11-01

    The DNA barcodes are generally interpreted using distance-based and character-based methods. The former uses clustering of comparable groups, based on the relative genetic distance, while the latter is based on the presence or absence of discrete nucleotide substitutions. The distance-based approach has a limitation in defining a universal species boundary across the taxa as the rate of mtDNA evolution is not constant throughout the taxa. However, character-based approach more accurately defines this using a unique set of nucleotide characters. The character-based analysis of full-length barcode has some inherent limitations, like sequencing of the full-length barcode, use of a sparse-data matrix and lack of a uniform diagnostic position for each group. A short continuous stretch of a fragment can be used to resolve the limitations. Here, we observe that a 154-bp fragment, from the transversion-rich domain of 1367 COI barcode sequences can successfully delimit species in the three most diverse orders of freshwater fishes. This fragment is used to design species-specific barcode motifs for 109 species by the character-based method, which successfully identifies the correct species using a pattern-matching program. The motifs also correctly identify geographically isolated population of the Cypriniformes species. Further, this region is validated as a species-specific mini-barcode for freshwater fishes by successful PCR amplification and sequencing of the motif (154 bp) using the designed primers. We anticipate that use of such motifs will enhance the diagnostic power of DNA barcode, and the mini-barcode approach will greatly benefit the field-based system of rapid species identification. © 2017 John Wiley & Sons Ltd.

  5. Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.

    Science.gov (United States)

    Zhao, Xiaoyan; Sze, Sing-Hoi

    2011-05-01

    One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2(l)) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif ) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.

  6. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin; Wong, Hau-San

    2015-01-01

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  7. A Comparison Study for DNA Motif Modeling on Protein Binding Microarray

    KAUST Repository

    Wong, Ka-Chun

    2015-06-11

    Transcription Factor Binding Sites (TFBSs) are relatively short (5-15 bp) and degenerate. Identifying them is a computationally challenging task. In particular, Protein Binding Microarray (PBM) is a high-throughput platform that can measure the DNA binding preference of a protein in a comprehensive and unbiased manner; for instance, a typical PBM experiment can measure binding signal intensities of a protein to all possible DNA k-mers (k=810). Since proteins can often bind to DNA with different binding intensities, one of the major challenges is to build motif models which can fully capture the quantitative binding affinity data. To learn DNA motif models from the non-convex objective function landscape, several optimization methods are compared and applied to the PBM motif model building problem. In particular, representative methods from different optimization paradigms have been chosen for modeling performance comparison on hundreds of PBM datasets. The results suggest that the multimodal optimization methods are very effective for capturing the binding preference information from PBM data. In particular, we observe a general performance improvement using di-nucleotide modeling over mono-nucleotide modeling. In addition, the models learned by the best-performing method are applied to two independent applications: PBM probe rotation testing and ChIP-Seq peak sequence prediction, demonstrating its biological applicability.

  8. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.

  9. Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

    Directory of Open Access Journals (Sweden)

    Ahmad A. Malik

    2017-05-01

    Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine. The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid, suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity.

  10. Role of specific cations and water entropy on the stability of branched DNA motif structures.

    Science.gov (United States)

    Pascal, Tod A; Goddard, William A; Maiti, Prabal K; Vaidehi, Nagarajan

    2012-10-11

    DNA three-way junctions (TWJs) are important intermediates in various cellular processes and are the simplest of a family of branched nucleic acids being considered as scaffolds for biomolecular nanotechnology. Branched nucleic acids are stabilized by divalent cations such as Mg(2+), presumably due to condensation and neutralization of the negatively charged DNA backbone. However, electrostatic screening effects point to more complex solvation dynamics and a large role of interfacial waters in thermodynamic stability. Here, we report extensive computer simulations in explicit water and salt on a model TWJ and use free energy calculations to quantify the role of ionic character and strength on stability. We find that enthalpic stabilization of the first and second hydration shells by Mg(2+) accounts for 1/3 and all of the free energy gain in 50% and pure MgCl(2) solutions, respectively. The more distorted DNA molecule is actually destabilized in pure MgCl(2) compared to pure NaCl. Notably, the first shell, interfacial waters have very low translational and rotational entropy (i.e., mobility) compared to the bulk, an entropic loss that is overcompensated by increased enthalpy from additional electrostatic interactions with Mg(2+). In contrast, the second hydration shell has anomalously high entropy as it is trapped between an immobile and bulklike layer. The nonmonotonic entropic signature and long-range perturbations of the hydration shells to Mg(2+) may have implications in the molecular recognition of these motifs. For example, we find that low salt stabilizes the parallel configuration of the three-way junction, whereas at normal salt we find antiparallel configurations deduced from the NMR. We use the 2PT analysis to follow the thermodynamics of this transition and find that the free energy barrier is dominated by entropic effects that result from the decreased surface area of the antiparallel form which has a smaller number of low entropy waters in the first

  11. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo; Jankovic, Boris R.; Bajic, Vladimir B.; Song, Le; Gao, Xin

    2013-01-01

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  12. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  13. Factoring local sequence composition in motif significance analysis.

    Science.gov (United States)

    Ng, Patrick; Keich, Uri

    2008-01-01

    We recently introduced a biologically realistic and reliable significance analysis of the output of a popular class of motif finders. In this paper we further improve our significance analysis by incorporating local base composition information. Relying on realistic biological data simulation, as well as on FDR analysis applied to real data, we show that our method is significantly better than the increasingly popular practice of using the normal approximation to estimate the significance of a finder's output. Finally we turn to leveraging our reliable significance analysis to improve the actual motif finding task. Specifically, endowing a variant of the Gibbs Sampler with our improved significance analysis we demonstrate that de novo finders can perform better than has been perceived. Significantly, our new variant outperforms all the finders reviewed in a recently published comprehensive analysis of the Harbison genome-wide binding location data. Interestingly, many of these finders incorporate additional information such as nucleosome positioning and the significance of binding data.

  14. The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.

    Science.gov (United States)

    Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko

    2013-07-01

    AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.

  15. DndEi Exhibits Helicase Activity Essential for DNA Phosphorothioate Modification and ATPase Activity Strongly Stimulated by DNA Substrate with a GAAC/GTTC Motif.

    Science.gov (United States)

    Zheng, Tao; Jiang, Pan; Cao, Bo; Cheng, Qiuxiang; Kong, Lingxin; Zheng, Xiaoqing; Hu, Qinghai; You, Delin

    2016-01-15

    Phosphorothioate (PT) modification of DNA, in which the non-bridging oxygen of the backbone phosphate group is replaced by sulfur, is governed by the DndA-E proteins in prokaryotes. To better understand the biochemical mechanism of PT modification, functional analysis of the recently found PT-modifying enzyme DndEi, which has an additional domain compared with canonical DndE, from Riemerella anatipestifer is performed in this study. The additional domain is identified as a DNA helicase, and functional deletion of this domain in vivo leads to PT modification deficiency, indicating an essential role of helicase activity in PT modification. Subsequent analysis reveals that the additional domain has an ATPase activity. Intriguingly, the ATPase activity is strongly stimulated by DNA substrate containing a GAAC/GTTC motif (i.e. the motif at which PT modifications occur in R. anatipestifer) when the additional domain and the other domain (homologous to canonical DndE) are co-expressed as a full-length DndEi. These results reveal that PT modification is a biochemical process with DNA strand separation and intense ATP hydrolysis. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Human telomeric DNA: G-quadruplex, i-motif and Watson–Crick double helix

    Science.gov (United States)

    Phan, Anh Tuân; Mergny, Jean-Louis

    2002-01-01

    Human telomeric DNA composed of (TTAGGG/CCCTAA)n repeats may form a classical Watson–Crick double helix. Each individual strand is also prone to quadruplex formation: the G-rich strand may adopt a G-quadruplex conformation involving G-quartets whereas the C-rich strand may fold into an i-motif based on intercalated C·C+ base pairs. Using an equimolar mixture of the telomeric oligonucleotides d[AGGG(TTAGGG)3] and d[(CCCTAA)3CCCT], we defined which structures existed and which would be the predominant species under a variety of experimental conditions. Under near-physiological conditions of pH, temperature and salt concentration, telomeric DNA was predominantly in a double-helix form. However, at lower pH values or higher temperatures, the G-quadruplex and/or the i-motif efficiently competed with the duplex. We also present kinetic and thermodynamic data for duplex association and for G-quadruplex/i-motif unfolding. PMID:12409451

  17. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    OpenAIRE

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with...

  18. DNA methylation requires a DNMT1 ubiquitin interacting motif (UIM) and histone ubiquitination.

    Science.gov (United States)

    Qin, Weihua; Wolf, Patricia; Liu, Nan; Link, Stephanie; Smets, Martha; La Mastra, Federica; Forné, Ignasi; Pichler, Garwin; Hörl, David; Fellinger, Karin; Spada, Fabio; Bonapace, Ian Marc; Imhof, Axel; Harz, Hartmann; Leonhardt, Heinrich

    2015-08-01

    DNMT1 is recruited by PCNA and UHRF1 to maintain DNA methylation after replication. UHRF1 recognizes hemimethylated DNA substrates via the SRA domain, but also repressive H3K9me3 histone marks with its TTD. With systematic mutagenesis and functional assays, we could show that chromatin binding further involved UHRF1 PHD binding to unmodified H3R2. These complementation assays clearly demonstrated that the ubiquitin ligase activity of the UHRF1 RING domain is required for maintenance DNA methylation. Mass spectrometry of UHRF1-deficient cells revealed H3K18 as a novel ubiquitination target of UHRF1 in mammalian cells. With bioinformatics and mutational analyses, we identified a ubiquitin interacting motif (UIM) in the N-terminal regulatory domain of DNMT1 that binds to ubiquitinated H3 tails and is essential for DNA methylation in vivo. H3 ubiquitination and subsequent DNA methylation required UHRF1 PHD binding to H3R2. These results show the manifold regulatory mechanisms controlling DNMT1 activity that require the reading and writing of epigenetic marks by UHRF1 and illustrate the multifaceted interplay between DNA and histone modifications. The identification and functional characterization of the DNMT1 UIM suggests a novel regulatory principle and we speculate that histone H2AK119 ubiquitination might also lead to UIM-dependent recruitment of DNMT1 and DNA methylation beyond classic maintenance.

  19. qPMS7: a fast algorithm for finding (ℓ, d-motifs in DNA and protein sequences.

    Directory of Open Access Journals (Sweden)

    Hieu Dinh

    Full Text Available Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motifs discovery is an important problem in biology. For example, it is useful in the detection of transcription factor binding sites and transcriptional regulatory elements that are very crucial in understanding gene function, human disease, drug design, etc. Many versions of the motif search problem have been proposed in the literature. One such is the (ℓ, d-motif search (or Planted Motif Search (PMS. A generalized version of the PMS problem, namely, Quorum Planted Motif Search (qPMS, is shown to accurately model motifs in real data. However, solving the qPMS problem is an extremely difficult task because a special case of it, the PMS Problem, is already NP-hard, which means that any algorithm solving it can be expected to take exponential time in the worse case scenario. In this paper, we propose a novel algorithm named qPMS7 that tackles the qPMS problem on real data as well as challenging instances. Experimental results show that our Algorithm qPMS7 is on an average 5 times faster than the state-of-art algorithm. The executable program of Algorithm qPMS7 is freely available on the web at http://pms.engr.uconn.edu/downloads/qPMS7.zip. Our online motif discovery tools that use Algorithm qPMS7 are freely available at http://pms.engr.uconn.edu or http://motifsearch.com.

  20. Conserved amino acid motifs from the novel Piv/MooV family of transposases and site-specific recombinases are required for catalysis of DNA inversion by Piv.

    Science.gov (United States)

    Tobiason, D M; Buchner, J M; Thiel, W H; Gernert, K M; Karls, A C

    2001-02-01

    Piv, a site-specific invertase from Moraxella lacunata, exhibits amino acid homology with the transposases of the IS110/IS492 family of insertion elements. The functions of conserved amino acid motifs that define this novel family of both transposases and site-specific recombinases (Piv/MooV family) were examined by mutagenesis of fully conserved amino acids within each motif in Piv. All Piv mutants altered in conserved residues were defective for in vivo inversion of the M. lacunata invertible DNA segment, but competent for in vivo binding to Piv DNA recognition sequences. Although the primary amino acid sequences of the Piv/MooV recombinases do not contain a conserved DDE motif, which defines the retroviral integrase/transposase (IN/Tnps) family, the predicted secondary structural elements of Piv align well with those of the IN/Tnps for which crystal structures have been determined. Molecular modelling of Piv based on these alignments predicts that E59, conserved as either E or D in the Piv/MooV family, forms a catalytic pocket with the conserved D9 and D101 residues. Analysis of Piv E59G confirms a role for E59 in catalysis of inversion. These results suggest that Piv and the related IS110/IS492 transposases mediate DNA recombination by a common mechanism involving a catalytic DED or DDD motif.

  1. The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

    Directory of Open Access Journals (Sweden)

    Hao Ding

    Full Text Available Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI. The Q motif, consisting of nine amino acids (GFXXPXPIQ with an invariant glutamine (Q residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11 gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.

  2. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    Science.gov (United States)

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with tandem CYR motifs, has endo- and exonuclease activities against abasic site and other types of base damage. PALF accumulates rapidly at single-strand breaks in a poly(ADP-ribose) polymerase 1 (PARP1)-dependent manner in human cells. Indeed, PALF interacts directly with PARP1 and is required for its activation and for cellular resistance to methyl-methane sulfonate. PALF also interacts directly with KU86, LIGASEIV and phosphorylated XRCC4 proteins and possesses endo/exonuclease activity at protruding DNA ends. Various treatments that produce double-strand breaks induce formation of PALF foci, which fully coincide with γH2AX foci. Thus, PALF and the CYR motif may play important roles in DNA repair of higher eukaryotes. PMID:17396150

  3. Conserved XPB Core Structure and Motifs for DNA Unwinding:Implications for Pathway Selection of Transcription or ExcisionRepair

    Energy Technology Data Exchange (ETDEWEB)

    Fan, Li; Arval, Andrew S.; Cooper, Priscilla K.; Iwai, Shigenori; Hanaoka, Fumio; Tainer, John A.

    2005-04-01

    The human xeroderma pigmentosum group B (XPB) helicase is essential for transcription, nucleotide excision repair, and TFIIH functional assembly. Here, we determined crystal structures of an Archaeoglobus fulgidus XPB homolog (AfXPB) that characterize two RecA-like XPB helicase domains and discover a DNA damage recognition domain (DRD), a unique RED motif, a flexible thumb motif (ThM), and implied conformational changes within a conserved functional core. RED motif mutations dramatically reduce helicase activity, and the DRD and ThM, which flank the RED motif, appear structurally as well as functionally analogous to the MutS mismatch recognition and DNA polymerase thumb domains. Substrate specificity is altered by DNA damage, such that AfXPB unwinds dsDNA with 3' extensions, but not blunt-ended dsDNA, unless it contains a lesion, as shown for CPD or (6-4) photoproducts. Together, these results provide an unexpected mechanism of DNA unwinding with Implications for XPB damage verification in nucleotide excision repair.

  4. μXRF analysis of decoration motifs on Majolica pottery

    International Nuclear Information System (INIS)

    Padilla Lavarez, Roman; Van Espen, Pierr M.; Janssens, K; Schalm, O.

    2001-01-01

    μXRF analysis of decoration motifs on Majolica pottery in fragments corresponding to several Majolica types was carried out using an spectrometer comprising a low power Mo X-ray tube and a elliptic-shape concentration lens with a 60 um spot. Both surface scanning and spot measurements were carried a out, allowing the qualitative identification of the inorganic pigments used for the surface painting decoration and the quantitative analysis of the main glaze composition. The absence of interference signal arising from the excitation on the underlying paste when analysing thin-lead glazing was evaluated, allowing ensuring the suitable of the analytical procedures. A distinction was found between different types of majolica by the composition of the lead tin glaze enamel and by the presence of other elements in the blue, black and orange decoration

  5. Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

    Science.gov (United States)

    Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

    2013-01-01

    DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298

  6. Characterizing Motif Dynamics of Electric Brain Activity Using Symbolic Analysis

    Directory of Open Access Journals (Sweden)

    Massimiliano Zanin

    2014-10-01

    Full Text Available Motifs are small recurring circuits of interactions which constitute the backbone of networked systems. Characterizing motif dynamics is therefore key to understanding the functioning of such systems. Here we propose a method to define and quantify the temporal variability and time scales of electroencephalogram (EEG motifs of resting brain activity. Given a triplet of EEG sensors, links between them are calculated by means of linear correlation; each pattern of links (i.e., each motif is then associated to a symbol, and its appearance frequency is analyzed by means of Shannon entropy. Our results show that each motif becomes observable with different coupling thresholds and evolves at its own time scale, with fronto-temporal sensors emerging at high thresholds and changing at fast time scales, and parietal ones at low thresholds and changing at slower rates. Finally, while motif dynamics differed across individuals, for each subject, it showed robustness across experimental conditions, indicating that it could represent an individual dynamical signature.

  7. DNA mimic proteins: functions, structures, and bioinformatic analysis.

    Science.gov (United States)

    Wang, Hao-Ching; Ho, Chun-Han; Hsu, Kai-Cheng; Yang, Jinn-Moon; Wang, Andrew H-J

    2014-05-13

    DNA mimic proteins have DNA-like negative surface charge distributions, and they function by occupying the DNA binding sites of DNA binding proteins to prevent these sites from being accessed by DNA. DNA mimic proteins control the activities of a variety of DNA binding proteins and are involved in a wide range of cellular mechanisms such as chromatin assembly, DNA repair, transcription regulation, and gene recombination. However, the sequences and structures of DNA mimic proteins are diverse, making them difficult to predict by bioinformatic search. To date, only a few DNA mimic proteins have been reported. These DNA mimics were not found by searching for functional motifs in their sequences but were revealed only by structural analysis of their charge distribution. This review highlights the biological roles and structures of 16 reported DNA mimic proteins. We also discuss approaches that might be used to discover new DNA mimic proteins.

  8. Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences

    Directory of Open Access Journals (Sweden)

    O. V. Galzitskaya

    2015-01-01

    Full Text Available How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion that is an occurrence of simple motifs which can be imprints of evolution history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing the motifs from the first group (one of the two amino acids occurs once at the terminal position made about 20%; in the case of motifs from the second (one of two amino acids occurs one time within the pattern and third (the two amino acids occur randomly groups, 30% and 50%, respectively. For bacterial proteomes, this relationship was 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62 is higher than between fungi and plants (0.54. Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: the Diplomonadida kingdoms are more close to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are more close to each other than to other kingdoms of Eukaryota.

  9. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

    Science.gov (United States)

    König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

    2013-01-01

    G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141

  10. Genetic analysis of beta1 integrin "activation motifs" in mice

    DEFF Research Database (Denmark)

    Czuchra, Aleksandra; Meyer, Hannelore; Legate, Kyle R

    2006-01-01

    -null phenotype in vivo. Surprisingly, neither the substitution of the tyrosines with phenylalanine nor the aspartic acid with alanine resulted in an obvious defect. These data suggest that the NPXY motifs of the beta1 integrin tail are essential for beta1 integrin function, whereas tyrosine phosphorylation...

  11. Reversible Redox Activity by Ion-pH Dually Modulated Duplex Formation of i-Motif DNA with Complementary G-DNA

    Directory of Open Access Journals (Sweden)

    Soyoung Chang

    2018-04-01

    Full Text Available The unique biological features of supramolecular DNA have led to an increasing interest in biomedical applications such as biosensors. We have developed an i-motif and G-rich DNA conjugated single-walled carbon nanotube hybrid materials, which shows reversible conformational switching upon external stimuli such as pH (5 and 8 and presence of ions (Li+ and K+. We observed reversible electrochemical redox activity upon external stimuli in a quick and robust manner. Given the ease and the robustness of this method, we believe that pH- and ion-driven reversible DNA structure transformations will be utilized for future applications for developing novel biosensors.

  12. Novel and deviant Walker A ATP-binding motifs in bacteriophage large terminase-DNA packaging proteins

    International Nuclear Information System (INIS)

    Mitchell, Michael S.; Rao, Venigalla B.

    2004-01-01

    Bacteriophage terminases constitute a very interesting class of viral-coded multifunctional ATPase 'motors' that apparently drive directional translocation of DNA into an empty viral capsid. A common Walker A motif and other conserved signatures of a critical ATPase catalytic center are identified in the N-terminal half of numerous large terminase proteins. However, several terminases, including the well-characterized λ and SPP1 terminases, seem to lack the classic Walker A in the N-terminus. Using sequence alignment approaches, we discovered the presence of deviant Walker A motifs in these and many other phage terminases. One deviation, the presence of a lysine at the beginning of P-loop, may represent a 3D equivalent of the universally conserved lysine in the Walker A GKT/S signature. This and other novel putative Walker A motifs that first came to light through this study help define the ATPase centers of phage and viral terminases as well as elicit important insights into the molecular functioning of this fundamental motif in biological systems

  13. Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

    Energy Technology Data Exchange (ETDEWEB)

    Chojnowski, Grzegorz, E-mail: gchojnowski@genesilico.pl [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Waleń, Tomasz [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); University of Warsaw, Banacha 2, 02-097 Warsaw (Poland); Piątkowski, Paweł; Potrzebowski, Wojciech [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Bujnicki, Janusz M. [International Institute of Molecular and Cell Biology, Trojdena 4, 02-109 Warsaw (Poland); Adam Mickiewicz University, Umultowska 89, 61-614 Poznan (Poland)

    2015-03-01

    A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx.

  14. Brickworx builds recurrent RNA and DNA structural motifs into medium- and low-resolution electron-density maps

    International Nuclear Information System (INIS)

    Chojnowski, Grzegorz; Waleń, Tomasz; Piątkowski, Paweł; Potrzebowski, Wojciech; Bujnicki, Janusz M.

    2015-01-01

    A computer program that builds crystal structure models of nucleic acid molecules is presented. Brickworx is a computer program that builds crystal structure models of nucleic acid molecules using recurrent motifs including double-stranded helices. In a first step, the program searches for electron-density peaks that may correspond to phosphate groups; it may also take into account phosphate-group positions provided by the user. Subsequently, comparing the three-dimensional patterns of the P atoms with a database of nucleic acid fragments, it finds the matching positions of the double-stranded helical motifs (A-RNA or B-DNA) in the unit cell. If the target structure is RNA, the helical fragments are further extended with recurrent RNA motifs from a fragment library that contains single-stranded segments. Finally, the matched motifs are merged and refined in real space to find the most likely conformations, including a fit of the sequence to the electron-density map. The Brickworx program is available for download and as a web server at http://iimcb.genesilico.pl/brickworx

  15. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    Energy Technology Data Exchange (ETDEWEB)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany); Gronenborn, Bruno [Institut des Sciences du Végétal, CNRS, 91198 Gif-sur-Yvette (France); Jeske, Holger, E-mail: holger.jeske@bio.uni-stuttgart.de [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany)

    2014-08-15

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis.

  16. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    International Nuclear Information System (INIS)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin; Gronenborn, Bruno; Jeske, Holger

    2014-01-01

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis

  17. Sequence-specific DNA binding by MYC/MAX to low-affinity non-E-box motifs.

    Directory of Open Access Journals (Sweden)

    Michael Allevato

    Full Text Available The MYC oncoprotein regulates transcription of a large fraction of the genome as an obligatory heterodimer with the transcription factor MAX. The MYC:MAX heterodimer and MAX:MAX homodimer (hereafter MYC/MAX bind Enhancer box (E-box DNA elements (CANNTG and have the greatest affinity for the canonical MYC E-box (CME CACGTG. However, MYC:MAX also recognizes E-box variants and was reported to bind DNA in a "non-specific" fashion in vitro and in vivo. Here, in order to identify potential additional non-canonical binding sites for MYC/MAX, we employed high throughput in vitro protein-binding microarrays, along with electrophoretic mobility-shift assays and bioinformatic analyses of MYC-bound genomic loci in vivo. We identified all hexameric motifs preferentially bound by MYC/MAX in vitro, which include the low-affinity non-E-box sequence AACGTT, and found that the vast majority (87% of MYC-bound genomic sites in a human B cell line contain at least one of the top 21 motifs bound by MYC:MAX in vitro. We further show that high MYC/MAX concentrations are needed for specific binding to the low-affinity sequence AACGTT in vitro and that elevated MYC levels in vivo more markedly increase the occupancy of AACGTT sites relative to CME sites, especially at distal intergenic and intragenic loci. Hence, MYC binds diverse DNA motifs with a broad range of affinities in a sequence-specific and dose-dependent manner, suggesting that MYC overexpression has more selective effects on the tumor transcriptome than previously thought.

  18. Mutational analysis of the RecJ exonuclease of Escherichia coli: identification of phosphoesterase motifs.

    Science.gov (United States)

    Sutera, V A; Han, E S; Rajman, L A; Lovett, S T

    1999-10-01

    The recJ gene, identified in Escherichia coli, encodes a Mg(+2)-dependent 5'-to-3' exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding proteins similar to RecJ are found in some eubacteria, including Bacillus and Helicobacter, and in the archaea. Among this divergent set of sequences, seven conserved motifs emerge. We demonstrate here that amino acids within six of these motifs are essential for both the biochemical and genetic functions of E. coli RecJ. These motifs may define interactions with Mg(2+) ions or substrate DNA. A large family of proteins more distantly related to RecJ is present in archaea, eubacteria, and eukaryotes, including a hypothetical protein in the MgPa adhesin operon of Mycoplasma, a domain of putative polyA polymerases in Synechocystis and Aquifex, PRUNE of Drosophila, and an exopolyphosphatase (PPX1) of Saccharomyces cereviseae. Because these six RecJ motifs are shared between exonucleases and exopolyphosphatases, they may constitute an ancient phosphoesterase domain now found in all kingdoms of life.

  19. Molecular Detection, Phylogenetic Analysis, and Identification of Transcription Motifs in Feline Leukemia Virus from Naturally Infected Cats in Malaysia

    Directory of Open Access Journals (Sweden)

    Faruku Bande

    2014-01-01

    Full Text Available A nested PCR assay was used to determine the viral RNA and proviral DNA status of naturally infected cats. Selected samples that were FeLV-positive by PCR were subjected to sequencing, phylogenetic analysis, and motifs search. Of the 39 samples that were positive for FeLV p27 antigen, 87.2% (34/39 were confirmed positive with nested PCR. FeLV proviral DNA was detected in 38 (97.3% of p27-antigen negative samples. Malaysian FeLV isolates are found to be highly similar with a homology of 91% to 100%. Phylogenetic analysis revealed that Malaysian FeLV isolates divided into two clusters, with a majority (86.2% sharing similarity with FeLV-K01803 and fewer isolates (13.8% with FeLV-GM1 strain. Different enhancer motifs including NF-GMa, Krox-20/WT1I-del2, BAF1, AP-2, TBP, TFIIF-beta, TRF, and TFIID are found to occur either in single, duplicate, triplicate, or sets of 5 in different positions within the U3-LTR-gag region. The present result confirms the occurrence of FeLV viral RNA and provirus DNA in naturally infected cats. Malaysian FeLV isolates are highly similar, and a majority of them are closely related to a UK isolate. This study provides the first molecular based information on FeLV in Malaysia. Additionally, different enhancer motifs likely associated with FeLV related pathogenesis have been identified.

  20. Improvement of the Immunogenicity of Porcine Circovirus Type 2 DNA Vaccine by Recombinant ORF2 Gene and CpG Motifs.

    Science.gov (United States)

    Li, Jun; Shi, Jian-Li; Wu, Xiao-Yan; Fu, Fang; Yu, Jiang; Yuan, Xiao-Yuan; Peng, Zhe; Cong, Xiao-Yan; Xu, Shao-Jian; Sun, Wen-Bo; Cheng, Kai-Hui; Du, Yi-Jun; Wu, Jia-Qiang; Wang, Jin-Bao; Huang, Bao-Hua

    2015-06-01

    Nowadays, adjuvant is still important for boosting immunity and improving resistance in animals. In order to boost the immunity of porcine circovirus type 2 (PCV2) DNA vaccine, CpG motifs were inserted. In this study, the dose-effect was studied, and the immunity of PCV2 DNA vaccines by recombinant open reading frame 2 (ORF2) gene and CpG motifs was evaluated. Three-week-old Changbai piglets were inoculated intramuscularly with 200 μg, 400 μg, and 800 μg DNA vaccines containing 14 and 18 CpG motifs, respectively. Average gain and rectum temperature were recorded everyday during the experiments. Blood was collected from the piglets after vaccination to detect the changes of specific antibodies, interleukin-2, and immune cells every week. Tissues were collected for histopathology and polymerase chain reaction. The results indicated that compared to those of the control piglets, all concentrations of two DNA vaccines could induce PCV2-specific antibodies. A cellular immunity test showed that PCV2-specific lymphocytes proliferated the number of TH, TC, and CD3+ positive T-cells raised in the blood of DNA vaccine immune groups. There was no distinct pathological damage and viremia occurring in pigs that were inoculated with DNA vaccines, but there was some minor pathological damage in the control group. The results demonstrated that CpG motifs as an adjuvant could boost the humoral and cellular immunity of pigs to PCV2, especially in terms of cellular immunity. Comparing two DNA vaccines that were constructed, the one containing 18 CpG motifs was more effective. This is the first report that CpG motifs as an adjuvant insert to the PCV2 DNA vaccine could boost immunity.

  1. Spectrometric study of the folding process of i-motif-forming DNA sequences upstream of the c-kit transcription initiation site

    International Nuclear Information System (INIS)

    Bucek, Pavel; Gargallo, Raimundo; Kudrev, Andrei

    2010-01-01

    The c-kit oncogene shows a cytosine-rich DNA region upstream of the transcription initiation site which forms an i-motif structure at slightly acidic pH values (Bucek et al. ). In the present study, the pH-induced formation of i-motif - forming sequences 5'-CCC CTC CCT CGC GCC CGC CCG-3' (ckitC1, native), 5'-CCC TTC CCT TGT GCC CGC CCG-3' (ckitC2) and 5'-CCCTT CCC TTTTT CCC T CCC T-3' (ckitC3) was studied by spectroscopic techniques, such as UV molecular absorption and circular dichroism (CD), in tandem with two multivariate data analysis methods, the hard modelling-based matrix method and the soft modelling-based MCR-ALS approach. Use of the hard chemical modelling enabled us to propose the equilibrium model, which describes spectral changes as functions of solution acidity. Additionally, the intrinsic protonation constant, K in , and the cooperativity parameters, ω c , and ω a , were calculated from the fitting procedure of the coupled CD and molecular absorption spectra. In the case of ckitC2 and ckitC3, the hard model correctly reproduced the spectral variations observed experimentally. The results indicated that folding was accompanied by a cooperative process, i.e. the enhancement of protonated structure stability upon protonation. In contrast, unfolding was accompanied by an anticooperative process. Finally, folding of the native sequence, ckitC1, seemed to follow a more complex mechanism.

  2. Counting of oligomers in sequences generated by markov chains for DNA motif discovery.

    Science.gov (United States)

    Shan, Gao; Zheng, Wei-Mou

    2009-02-01

    By means of the technique of the imbedded Markov chain, an efficient algorithm is proposed to exactly calculate first, second moments of word counts and the probability for a word to occur at least once in random texts generated by a Markov chain. A generating function is introduced directly from the imbedded Markov chain to derive asymptotic approximations for the problem. Two Z-scores, one based on the number of sequences with hits and the other on the total number of word hits in a set of sequences, are examined for discovery of motifs on a set of promoter sequences extracted from A. thaliana genome. Source code is available at http://www.itp.ac.cn/zheng/oligo.c.

  3. Sequence-specific DNA binding activity of the cross-brace zinc finger motif of the piggyBac transposase

    Science.gov (United States)

    Morellet, Nelly; Li, Xianghong; Wieninger, Silke A; Taylor, Jennifer L; Bischerour, Julien; Moriau, Séverine; Lescop, Ewen; Bardiaux, Benjamin; Mathy, Nathalie; Assrir, Nadine; Bétermier, Mireille; Nilges, Michael; Hickman, Alison B; Dyda, Fred; Craig, Nancy L; Guittet, Eric

    2018-01-01

    Abstract The piggyBac transposase (PB) is distinguished by its activity and utility in genome engineering, especially in humans where it has highly promising therapeutic potential. Little is known, however, about the structure–function relationships of the different domains of PB. Here, we demonstrate in vitro and in vivo that its C-terminal Cysteine-Rich Domain (CRD) is essential for DNA breakage, joining and transposition and that it binds to specific DNA sequences in the left and right transposon ends, and to an additional unexpectedly internal site at the left end. Using NMR, we show that the CRD adopts the specific fold of the cross-brace zinc finger protein family. We determine the interaction interfaces between the CRD and its target, the 5′-TGCGT-3′/3′-ACGCA-5′ motifs found in the left, left internal and right transposon ends, and use NMR results to propose docking models for the complex, which are consistent with our site-directed mutagenesis data. Our results provide support for a model of the PB/DNA interactions in the context of the transpososome, which will be useful for the rational design of PB mutants with increased activity. PMID:29385532

  4. Biomimetic trapping cocktail to screen reactive metabolites: use of an amino acid and DNA motif mixture as light/heavy isotope pairs differing in mass shift.

    Science.gov (United States)

    Hosaka, Shuto; Honda, Takuto; Lee, Seon Hwa; Oe, Tomoyuki

    2018-06-01

    Candidate drugs that can be metabolically transformed into reactive electrophilic products, such as epoxides, quinones, and nitroso compounds, are of special concern because subsequent covalent binding to bio-macromolecules can cause adverse drug reactions, such as allergic reactions, hepatotoxicity, and genotoxicity. Several strategies have been reported for screening reactive metabolites, such as a covalent binding assay with radioisotope-labeled drugs and a trapping method followed by LC-MS/MS analyses. Of these, a trapping method using glutathione is the most common, especially at the early stage of drug development. However, the cysteine of glutathione is not the only nucleophilic site in vivo; lysine, histidine, arginine, and DNA bases are also nucleophilic. Indeed, the glutathione trapping method tends to overlook several types of reactive metabolites, such as aldehydes, acylglucuronides, and nitroso compounds. Here, we introduce an alternate way for screening reactive metabolites as follows: A mixture of the light and heavy isotopes of simplified amino acid motifs and a DNA motif is used as a biomimetic trapping cocktail. This mixture consists of [ 2 H 0 ]/[ 2 H 3 ]-1-methylguanidine (arginine motif, Δ 3 Da), [ 2 H 0 ]/[ 2 H 4 ]-2-mercaptoethanol (cysteine motif, Δ 4 Da), [ 2 H 0 ]/[ 2 H 5 ]-4-methylimidazole (histidine motif, Δ 5 Da), [ 2 H 0 ]/[ 2 H 9 ]-n-butylamine (lysine motif, Δ 9 Da), and [ 13 C 0 , 15 N 0 ]/[ 13 C 1 , 15 N 2 ]-2'-deoxyguanosine (DNA motif, Δ 3 Da). Mass tag triggered data-dependent acquisition is used to find the characteristic doublet peaks, followed by specific identification of the light isotope peak using MS/MS. Forty-two model drugs were examined using an in vitro microsome experiment to validate the strategy. Graphical abstract Biomimetic trapping cocktail to screen reactive metabolites.

  5. Dipeptide frequency/bias analysis identifies conserved sites of nonrandomness shared by cysteine-rich motifs.

    Science.gov (United States)

    Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N

    2001-08-15

    This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.

  6. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Science.gov (United States)

    Tsai, Zing Tsung-Yeh; Shiu, Shin-Han; Tsai, Huai-Kuang

    2015-08-01

    Transcription factor (TF) binding is determined by the presence of specific sequence motifs (SM) and chromatin accessibility, where the latter is influenced by both chromatin state (CS) and DNA structure (DS) properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy) that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  7. Contribution of Sequence Motif, Chromatin State, and DNA Structure Features to Predictive Models of Transcription Factor Binding in Yeast.

    Directory of Open Access Journals (Sweden)

    Zing Tsung-Yeh Tsai

    2015-08-01

    Full Text Available Transcription factor (TF binding is determined by the presence of specific sequence motifs (SM and chromatin accessibility, where the latter is influenced by both chromatin state (CS and DNA structure (DS properties. Although SM, CS, and DS have been used to predict TF binding sites, a predictive model that jointly considers CS and DS has not been developed to predict either TF-specific binding or general binding properties of TFs. Using budding yeast as model, we found that machine learning classifiers trained with either CS or DS features alone perform better in predicting TF-specific binding compared to SM-based classifiers. In addition, simultaneously considering CS and DS further improves the accuracy of the TF binding predictions, indicating the highly complementary nature of these two properties. The contributions of SM, CS, and DS features to binding site predictions differ greatly between TFs, allowing TF-specific predictions and potentially reflecting different TF binding mechanisms. In addition, a "TF-agnostic" predictive model based on three DNA "intrinsic properties" (in silico predicted nucleosome occupancy, major groove geometry, and dinucleotide free energy that can be calculated from genomic sequences alone has performance that rivals the model incorporating experiment-derived data. This intrinsic property model allows prediction of binding regions not only across TFs, but also across DNA-binding domain families with distinct structural folds. Furthermore, these predicted binding regions can help identify TF binding sites that have a significant impact on target gene expression. Because the intrinsic property model allows prediction of binding regions across DNA-binding domain families, it is TF agnostic and likely describes general binding potential of TFs. Thus, our findings suggest that it is feasible to establish a TF agnostic model for identifying functional regulatory regions in potentially any sequenced genome.

  8. The Chilo iridescent virus DNA polymerase promoter contains an essential AAAAT motif

    NARCIS (Netherlands)

    Nalcacioglu, R.; Ince, I.A.; Vlak, J.M.; Demirbag, Z.; Oers, van M.M.

    2007-01-01

    The delayed-early DNA polymerase promoter of Chilo iridescent virus (CIV), officially known as Invertebrate iridescent virus, was fine mapped by constructing a series of increasing deletions and by introducing point mutations. The effects of these mutations were examined in a luciferase reporter

  9. Crystallization and preliminary X-ray diffraction analysis of motif N from Saccharomyces cerevisiae Dbf4

    International Nuclear Information System (INIS)

    Matthews, Lindsay A.; Duong, Andrew; Prasad, Ajai A.; Duncker, Bernard P.; Guarné, Alba

    2009-01-01

    To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. The Cdc7–Dbf4 complex plays an instrumental role in the initiation of DNA replication and is a target of replication-checkpoint responses in Saccharomyces cerevisiae. Cdc7 is a conserved serine/threonine kinase whose activity depends on association with its regulatory subunit, Dbf4. A conserved sequence near the N-terminus of Dbf4 (motif N) is necessary for the interaction of Cdc7–Dbf4 with the checkpoint kinase Rad53. To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. A complete native data set was collected at 100 K from crystals that diffracted X-rays to 2.75 Å resolution and structure determination is currently under way

  10. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Principal component analysis for predicting transcription-factor binding motifs from array-derived data

    Directory of Open Access Journals (Sweden)

    Vincenti Matthew P

    2005-11-01

    Full Text Available Abstract Background The responses to interleukin 1 (IL-1 in human chondrocytes constitute a complex regulatory mechanism, where multiple transcription factors interact combinatorially to transcription-factor binding motifs (TFBMs. In order to select a critical set of TFBMs from genomic DNA information and an array-derived data, an efficient algorithm to solve a combinatorial optimization problem is required. Although computational approaches based on evolutionary algorithms are commonly employed, an analytical algorithm would be useful to predict TFBMs at nearly no computational cost and evaluate varying modelling conditions. Singular value decomposition (SVD is a powerful method to derive primary components of a given matrix. Applying SVD to a promoter matrix defined from regulatory DNA sequences, we derived a novel method to predict the critical set of TFBMs. Results The promoter matrix was defined to establish a quantitative relationship between the IL-1-driven mRNA alteration and genomic DNA sequences of the IL-1 responsive genes. The matrix was decomposed with SVD, and the effects of 8 potential TFBMs (5'-CAGGC-3', 5'-CGCCC-3', 5'-CCGCC-3', 5'-ATGGG-3', 5'-GGGAA-3', 5'-CGTCC-3', 5'-AAAGG-3', and 5'-ACCCA-3' were predicted from a pool of 512 random DNA sequences. The prediction included matches to the core binding motifs of biologically known TFBMs such as AP2, SP1, EGR1, KROX, GC-BOX, ABI4, ETF, E2F, SRF, STAT, IK-1, PPARγ, STAF, ROAZ, and NFκB, and their significance was evaluated numerically using Monte Carlo simulation and genetic algorithm. Conclusion The described SVD-based prediction is an analytical method to provide a set of potential TFBMs involved in transcriptional regulation. The results would be useful to evaluate analytically a contribution of individual DNA sequences.

  12. i-Motif of cytosine-rich human telomere DNA fragments containing natural base lesions

    Czech Academy of Sciences Publication Activity Database

    Dvořáková, Zuzana; Renčiuk, Daniel; Kejnovská, Iva; Školáková, Petra; Bednářová, Klára; Sagi, J.; Vorlíčková, Michaela

    2018-01-01

    Roč. 46, č. 4 (2018), s. 1624-1634 ISSN 1362-4962 R&D Projects: GA ČR(CZ) GA15-06785S; GA ČR GA17-12075S; GA ČR(CZ) GJ17-19170Y; GA MŠk EF15_003/0000477 Institutional support: RVO:68081707 Keywords : pair opening kinetics * g-quadruplex dna Subject RIV: CE - Biochemistry OBOR OECD: Biochemistry and molecular biology

  13. Evaluation of the Stability of DNA i-Motifs in the Nuclei of Living Mammalian Cells

    Czech Academy of Sciences Publication Activity Database

    Dzatko, S.; Krafčíková, M.; Haensel-Hertsch, R.; Fessl, T.; Fiala, R.; Loja, T.; Krafčík, D.; Mergny, Jean-Louis; Foldynova-Trantirkova, Silvie; Trantírek, L.

    2018-01-01

    Roč. 57, č. 8 (2018), s. 2165-2169 ISSN 1433-7851 R&D Projects: GA MŠk EF15_003/0000477 Institutional support: RVO:68081707 Keywords : g-quadruplex * telomeric dna * base-pairs * molecular switch Subject RIV: CG - Electrochemistry OBOR OECD: Electrochemistry (dry cells, batteries, fuel cells, corrosion metals, electrolysis) Impact factor: 11.994, year: 2016

  14. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  15. Molecular dynamics analysis of stabilities of the telomeric Watson-Crick duplex and the associated i-motif as a function of pH and temperature.

    Science.gov (United States)

    Panczyk, Tomasz; Wolski, Pawel

    2018-06-01

    This work deals with a molecular dynamics analysis of the protonated and deprotonated states of the natural sequence d[(CCCTAA) 3 CCCT] of the telomeric DNA forming the intercalated i-motif or paired with the sequence d[(CCCTAA) 3 CCCT] and forming the Watson-Crick (WC) duplex. By utilizing the amber force field for nucleic acids we built the i-motif and the WC duplex either with native cytosines or using their protonated forms. We studied, by applying molecular dynamics simulations, the role of hydrogen bonds between cytosines or in cytosine-guanine pairs in the stabilization of both structures in the physiological fluid. We found that hydrogen bonds exist in the case of protonated i-motif and in the standard form of the WC duplex. They, however, vanish in the case of the deprotonated i-motif and protonated form of the WC duplex. By determining potentials of mean force in the enforced unwrapping of these structures we found that the protonated i-motif is thermodynamically the most stable. Its deprotonation leads to spontaneous and observed directly in the unbiased calculations unfolding of the i-motif to the hairpin structure at normal temperature. The WC duplex is stable in its standard form and its slight destabilization is observed at the acidic pH. However, the protonated WC duplex unwraps very slowly at 310 K and its decomposition was not observed in the unbiased calculations. At higher temperatures (ca. 400 K or more) the WC duplex unwraps spontaneously. Copyright © 2018. Published by Elsevier B.V.

  16. Complete motif analysis of sequence requirements for translation initiation at non-AUG start codons.

    Science.gov (United States)

    Diaz de Arce, Alexander J; Noderer, William L; Wang, Clifford L

    2018-01-25

    The initiation of mRNA translation from start codons other than AUG was previously believed to be rare and of relatively low impact. More recently, evidence has suggested that as much as half of all translation initiation utilizes non-AUG start codons, codons that deviate from AUG by a single base. Furthermore, non-AUG start codons have been shown to be involved in regulation of expression and disease etiology. Yet the ability to gauge expression based on the sequence of a translation initiation site (start codon and its flanking bases) has been limited. Here we have performed a comprehensive analysis of translation initiation sites that utilize non-AUG start codons. By combining genetic-reporter, cell-sorting, and high-throughput sequencing technologies, we have analyzed the expression associated with all possible variants of the -4 to +4 positions of non-AUG translation initiation site motifs. This complete motif analysis revealed that 1) with the right sequence context, certain non-AUG start codons can generate expression comparable to that of AUG start codons, 2) sequence context affects each non-AUG start codon differently, and 3) initiation at non-AUG start codons is highly sensitive to changes in the flanking sequences. Complete motif analysis has the potential to be a key tool for experimental and diagnostic genomics. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. FBI's DNA analysis program

    Science.gov (United States)

    Brown, John R.

    1994-03-01

    Forensic DNA profiling technology is a significant law enforcement tool due to its superior discriminating power. Applying the principles of population genetics to the DNA profile obtained in violent crime investigations results in low frequency of occurrence estimates for the DNA profile. These estimates often range from a frequency of occurrence of 1 in 50 unrelated individuals to 1 in a million unrelated individuals or even smaller. It is this power to discriminate among individuals in the population that has propelled forensic DNA technology to the forefront of forensic testing in violent crime cases. Not only is the technology extremely powerful in including or excluding a criminal suspect as the perpetrator, but it also gives rise to the potential of identifying criminal suspects in cases where the investigators of unknown suspect cases have exhausted all other available leads.

  18. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  19. 14-3-3 checkpoint regulatory proteins interact specifically with DNA repair protein human exonuclease 1 (hEXO1) via a semi-conserved motif

    DEFF Research Database (Denmark)

    Andersen, Sofie Dabros; Keijzers, Guido; Rampakakis, Emmanouil

    2012-01-01

    Human exonuclease 1 (hEXO1) acts directly in diverse DNA processing events, including replication, mismatch repair (MMR), and double strand break repair (DSBR), and it was also recently described to function as damage sensor and apoptosis inducer following DNA damage. In contrast, 14-3-3 proteins...... are specifically induced by replication inhibition leading to protein ubiquitination and degradation. We demonstrate direct and robust interaction between hEXO1 and six of the seven 14-3-3 isoforms in vitro, suggestive of a novel protein interaction network between DNA repair and cell cycle control. Binding...... and most likely a second unidentified binding motif. 14-3-3 associations do not appear to directly influence hEXO1 in vitro nuclease activity or in vitro DNA replication initiation. Moreover, specific phosphorylation variants, including hEXO1 S746A, are efficiently imported to the nucleus; to associate...

  20. ANALYSIS OF STABILITY OF TRINUCLEOTIDE TTC MOTIFS IN COMMON FLAX PLANTED IN THE CHERNOBYL AREA

    Directory of Open Access Journals (Sweden)

    Veronika Lancíková

    2015-02-01

    Full Text Available Flax (Linum usitatissimum L. is one of the oldest domesticated plants — it was cultivated as early as in ancient Egypt and Samaria 10,000 years ago to serve as a source of fiber and oil, whence it later spread around the world. Compared with other plants, the flax genome consists of a high number of repetitive sequences, middle repetitive sequences and small repetitive sequences of nucleotides. The aim of the study was to analyze the stability of the existing trinucleotides motifs of microsatellite DNA of the flax genome (genotype Kyivskyi, growing in the Chernobyl conditions. The Chernobyl area is the most extensive “natural” laboratory suitable for the study of radiation effects. Over the last 20 years, the researches collected important knowledge about the effects of low and high radiation doses on the DNA isolated from the plant material growing on the remediated fields near Chernobyl and the plant material from fields contaminated by radioactive cesium 137Cs and strontium 90Sr. Using eight pairs of microsatellite primers, we successfully amplified the samples from the remediated fields. For each primer in the control samples and remediated samples, we detected 1 to 3 fragments per locus, each in size up to 120 to 250 base pairs. The applied microsatellite primers confirmed the monomorphic condition of microsatellite loci.

  1. Quantification of Chemical and Mechanical Effects on the Formation of the G-Quadruplex and i-Motif in Duplex DNA.

    Science.gov (United States)

    Selvam, Sangeetha; Mandal, Shankar; Mao, Hanbin

    2017-09-05

    The formation of biologically significant tetraplex DNA species, such as G-quadruplexes and i-motifs, is affected by chemical (ions and pH) and mechanical [superhelicity (σ) and molecular crowding] factors. Because of the extremely challenging experimental conditions, the relative importance of these factors on tetraplex folding is unknown. In this work, we quantitatively evaluated the chemical and mechanical effects on the population dynamics of DNA tetraplexes in the insulin-linked polymorphic region using magneto-optical tweezers. By mechanically unfolding individual tetraplexes, we found that ions and pH have the largest effects on the formation of the G-quadruplex and i-motif, respectively. Interestingly, superhelicity has the second largest effect followed by molecular crowding conditions. While chemical effects are specific to tetraplex species, mechanical factors have generic influences. The predominant effect of chemical factors can be attributed to the fact that they directly change the stability of a specific tetraplex, whereas the mechanical factors, superhelicity in particular, reduce the stability of the competing species by changing the kinetics of the melting and annealing of the duplex DNA template in a nonspecific manner. The substantial dependence of tetraplexes on superhelicity provides strong support that DNA tetraplexes can serve as topological sensors to modulate fundamental cellular processes such as transcription.

  2. Cations form sequence selective motifs within DNA grooves via a combination of cation-pi and ion-dipole/hydrogen bond interactions.

    Science.gov (United States)

    Stewart, Mikaela; Dunlap, Tori; Dourlain, Elizabeth; Grant, Bryce; McFail-Isom, Lori

    2013-01-01

    The fine conformational subtleties of DNA structure modulate many fundamental cellular processes including gene activation/repression, cellular division, and DNA repair. Most of these cellular processes rely on the conformational heterogeneity of specific DNA sequences. Factors including those structural characteristics inherent in the particular base sequence as well as those induced through interaction with solvent components combine to produce fine DNA structural variation including helical flexibility and conformation. Cation-pi interactions between solvent cations or their first hydration shell waters and the faces of DNA bases form sequence selectively and contribute to DNA structural heterogeneity. In this paper, we detect and characterize the binding patterns found in cation-pi interactions between solvent cations and DNA bases in a set of high resolution x-ray crystal structures. Specifically, we found that monovalent cations (Tl⁺) and the polarized first hydration shell waters of divalent cations (Mg²⁺, Ca²⁺) form cation-pi interactions with DNA bases stabilizing unstacked conformations. When these cation-pi interactions are combined with electrostatic interactions a pattern of specific binding motifs is formed within the grooves.

  3. The Arabidopsis GAGA-Binding Factor BASIC PENTACYSTEINE6 Recruits the POLYCOMB-REPRESSIVE COMPLEX1 Component LIKE HETEROCHROMATIN PROTEIN1 to GAGA DNA Motifs.

    Science.gov (United States)

    Hecker, Andreas; Brand, Luise H; Peter, Sébastien; Simoncello, Nathalie; Kilian, Joachim; Harter, Klaus; Gaudin, Valérie; Wanke, Dierk

    2015-07-01

    Polycomb-repressive complexes (PRCs) play key roles in development by repressing a large number of genes involved in various functions. Much, however, remains to be discovered about PRC-silencing mechanisms as well as their targeting to specific genomic regions. Besides other mechanisms, GAGA-binding factors in animals can guide PRC members in a sequence-specific manner to Polycomb-responsive DNA elements. Here, we show that the Arabidopsis (Arabidopsis thaliana) GAGA-motif binding factor protein basic pentacysteine6 (BPC6) interacts with like heterochromatin protein1 (LHP1), a PRC1 component, and associates with vernalization2 (VRN2), a PRC2 component, in vivo. By using a modified DNA-protein interaction enzyme-linked immunosorbant assay, we could show that BPC6 was required and sufficient to recruit LHP1 to GAGA motif-containing DNA probes in vitro. We also found that LHP1 interacts with VRN2 and, therefore, can function as a possible scaffold between BPC6 and VRN2. The lhp1-4 bpc4 bpc6 triple mutant displayed a pleiotropic phenotype, extreme dwarfism and early flowering, which disclosed synergistic functions of LHP1 and group II plant BPC members. Transcriptome analyses supported this synergy and suggested a possible function in the concerted repression of homeotic genes, probably through histone H3 lysine-27 trimethylation. Hence, our findings suggest striking similarities between animal and plant GAGA-binding factors in the recruitment of PRC1 and PRC2 components to Polycomb-responsive DNA element-like GAGA motifs, which must have evolved through convergent evolution. © 2015 American Society of Plant Biologists. All Rights Reserved.

  4. Transduction motif analysis of gastric cancer based on a human signaling network

    Energy Technology Data Exchange (ETDEWEB)

    Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Fuzhou General Hospital of Nanjing Command, Department of Gastroenterology, Fuzhou, China, Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)

    2014-04-04

    To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed by CytoScape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used for the mining of gastric cancer-related motifs in a network with three vertices. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was configured to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.

  5. Novel essential residues of Hda for interaction with DnaA in the regulatory inactivation of DnaA: unique roles for Hda AAA Box VI and VII motifs.

    Science.gov (United States)

    Nakamura, Kenta; Katayama, Tsutomu

    2010-04-01

    Escherichia coli ATP-DnaA initiates chromosomal replication. For preventing extra-initiations, a complex of ADP-Hda and the DNA-loaded replicase clamp promotes DnaA-ATP hydrolysis, yielding inactive ADP-DnaA. However, the Hda-DnaA interaction mode remains unclear except that the Hda Box VII Arg finger (Arg-153) and DnaA sensor II Arg-334 within each AAA(+) domain are crucial for the DnaA-ATP hydrolysis. Here, we demonstrate that direct and functional interaction of ADP-Hda with DnaA requires the Hda residues Ser-152, Phe-118 and Asn-122 as well as Hda Arg-153 and DnaA Arg-334. Structural analyses suggest intermolecular interactions between Hda Ser-152 and DnaA Arg-334 and between Hda Phe-118 and the DnaA Walker B motif region, in addition to an intramolecular interaction between Hda Asn-122 and Arg-153. These interactions likely sustain a specific association of ADP-Hda and DnaA, promoting DnaA-ATP hydrolysis. Consistently, ATP-DnaA and ADP-DnaA interact with the ADP-Hda-DNA-clamp complex with similar affinities. Hda Phe-118 and Asn-122 are contained in the Box VI region, and their hydrophobic and electrostatic features are basically conserved in the corresponding residues of other AAA(+) proteins, suggesting a conserved role for Box VI. These findings indicate novel interaction mechanisms for Hda-DnaA as well as a potentially fundamental mechanism in AAA(+) protein interactions.

  6. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  7. Specific interaction of the nonstructural protein NS1 of minute virus of mice (MVM) with [ACCA](2) motifs in the centre of the right-end MVM DNA palindrome induces hairpin-primed viral DNA replication.

    Science.gov (United States)

    Willwand, Kurt; Moroianu, Adela; Hörlein, Rita; Stremmel, Wolfgang; Rommelaere, Jean

    2002-07-01

    The linear single-stranded DNA genome of minute virus of mice (MVM) is replicated via a double-stranded replicative form (RF) intermediate DNA. Amplification of viral RF DNA requires the structural transition of the right-end palindrome from a linear duplex into a double-hairpin structure, which serves for the repriming of unidirectional DNA synthesis. This conformational transition was found previously to be induced by the MVM nonstructural protein NS1. Elimination of the cognate NS1-binding sites, [ACCA](2), from the central region of the right-end palindrome next to the axis of symmetry was shown to markedly reduce the efficiency of hairpin-primed DNA replication, as measured in a reconstituted in vitro replication system. Thus, [ACCA](2) sequence motifs are essential as NS1-binding elements in the context of the structural transition of the right-end MVM palindrome.

  8. Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

    Science.gov (United States)

    Wang, Ying; Ding, Jun; Daniell, Henry; Hu, Haiyan; Li, Xiaoman

    2012-09-01

    Chloroplasts play critical roles in land plant cells. Despite their importance and the availability of at least 200 sequenced chloroplast genomes, the number of known DNA regulatory sequences in chloroplast genomes are limited. In this paper, we designed computational methods to systematically study putative DNA regulatory sequences in intergenic regions near chloroplast genes in seven plant species and in promoter sequences of nuclear genes in Arabidopsis and rice. We found that -35/-10 elements alone cannot explain the transcriptional regulation of chloroplast genes. We also concluded that there are unlikely motifs shared by intergenic sequences of most of chloroplast genes, indicating that these genes are regulated differently. Finally and surprisingly, we found five conserved motifs, each of which occurs in no more than six chloroplast intergenic sequences, are significantly shared by promoters of nuclear-genes encoding chloroplast proteins. By integrating information from gene function annotation, protein subcellular localization analyses, protein-protein interaction data, and gene expression data, we further showed support of the functionality of these conserved motifs. Our study implies the existence of unknown nuclear-encoded transcription factors that regulate both chloroplast genes and nuclear genes encoding chloroplast protein, which sheds light on the understanding of the transcriptional regulation of chloroplast genes.

  9. Fractals in DNA sequence analysis

    Institute of Scientific and Technical Information of China (English)

    Yu Zu-Guo(喻祖国); Vo Anh; Gong Zhi-Min(龚志民); Long Shun-Chao(龙顺潮)

    2002-01-01

    Fractal methods have been successfully used to study many problems in physics, mathematics, engineering, finance,and even in biology. There has been an increasing interest in unravelling the mysteries of DNA; for example, how can we distinguish coding and noncoding sequences, and the problems of classification and evolution relationship of organisms are key problems in bioinformatics. Although much research has been carried out by taking into consideration the long-range correlations in DNA sequences, and the global fractal dimension has been used in these works by other people, the models and methods are somewhat rough and the results are not satisfactory. In recent years, our group has introduced a time series model (statistical point of view) and a visual representation (geometrical point of view)to DNA sequence analysis. We have also used fractal dimension, correlation dimension, the Hurst exponent and the dimension spectrum (multifractal analysis) to discuss problems in this field. In this paper, we introduce these fractal models and methods and the results of DNA sequence analysis.

  10. oPOSSUM: integrated tools for analysis of regulatory motif over-representation

    Science.gov (United States)

    Ho Sui, Shannan J.; Fulton, Debra L.; Arenillas, David J.; Kwon, Andrew T.; Wasserman, Wyeth W.

    2007-01-01

    The identification of over-represented transcription factor binding sites from sets of co-expressed genes provides insights into the mechanisms of regulation for diverse biological contexts. oPOSSUM, an internet-based system for such studies of regulation, has been improved and expanded in this new release. New features include a worm-specific version for investigating binding sites conserved between Caenorhabditis elegans and C. briggsae, as well as a yeast-specific version for the analysis of co-expressed sets of Saccharomyces cerevisiae genes. The human and mouse applications feature improvements in ortholog mapping, sequence alignments and the delineation of multiple alternative promoters. oPOSSUM2, introduced for the analysis of over-represented combinations of motifs in human and mouse genes, has been integrated with the original oPOSSUM system. Analysis using user-defined background gene sets is now supported. The transcription factor binding site models have been updated to include new profiles from the JASPAR database. oPOSSUM is available at http://www.cisreg.ca/oPOSSUM/ PMID:17576675

  11. Two sequence motifs from HIF-1α bind to the DNA-binding site of p53

    OpenAIRE

    Hansson, Lars O.; Friedler, Assaf; Freund, Stefan; Rüdiger, Stefan; Fersht, Alan R.

    2002-01-01

    There is evidence that hypoxia-inducible factor-1α (HIF-1α) interacts with the tumor suppressor p53. To characterize the putative interaction, we mapped the binding of the core domain of p53 (p53c) to an array of immobilized HIF-1α-derived peptides and found two peptide-sequence motifs that bound to p53c with micromolar affinity in solution. One sequence was adjacent to and the other coincided with the two proline residues of the oxygen-dependent degradation domain (P402 and P564) that act as...

  12. An Analysis of Multi-type Relational Interactions in FMA Using Graph Motifs with Disjointness Constraints

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation. PMID:23304382

  13. An analysis of multi-type relational interactions in FMA using graph motifs with disjointness constraints.

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation.

  14. Pipeline for the Analysis of ChIP-seq Data and New Motif Ranking Procedure

    KAUST Repository

    Ashoor, Haitham

    2011-06-01

    This thesis presents a computational methodology for ab-initio identification of transcription factor binding sites based on ChIP-seq data. This method consists of three main steps, namely ChIP-seq data processing, motif discovery and models selection. A novel method for ranking the models of motifs identified in this process is proposed. This method combines multiple factors in order to rank the provided candidate motifs. It combines the model coverage of the ChIP-seq fragments that contain motifs from which that model is built, the suitable background data made up of shuffled ChIP-seq fragments, and the p-value that resulted from evaluating the model on actual and background data. Two ChIP-seq datasets retrieved from ENCODE project are used to evaluate and demonstrate the ability of the method to predict correct TFBSs with high precision. The first dataset relates to neuron-restrictive silencer factor, NRSF, while the second one corresponds to growth-associated binding protein, GABP. The pipeline system shows high precision prediction for both datasets, as in both cases the top ranked motif closely resembles the known motifs for the respective transcription factors.

  15. Differential DNA Methylation Analysis without a Reference Genome

    Directory of Open Access Journals (Sweden)

    Johanna Klughammer

    2015-12-01

    Full Text Available Genome-wide DNA methylation mapping uncovers epigenetic changes associated with animal development, environmental adaptation, and species evolution. To address the lack of high-throughput methods for DNA methylation analysis in non-model organisms, we developed an integrated approach for studying DNA methylation differences independent of a reference genome. Experimentally, our method relies on an optimized 96-well protocol for reduced representation bisulfite sequencing (RRBS, which we have validated in nine species (human, mouse, rat, cow, dog, chicken, carp, sea bass, and zebrafish. Bioinformatically, we developed the RefFreeDMA software to deduce ad hoc genomes directly from RRBS reads and to pinpoint differentially methylated regions between samples or groups of individuals (http://RefFreeDMA.computational-epigenetics.org. The identified regions are interpreted using motif enrichment analysis and/or cross-mapping to annotated genomes. We validated our method by reference-free analysis of cell-type-specific DNA methylation in the blood of human, cow, and carp. In summary, we present a cost-effective method for epigenome analysis in ecology and evolution, which enables epigenome-wide association studies in natural populations and species without a reference genome.

  16. Structural and functional analysis of VQ motif-containing proteins in Arabidopsis as interacting proteins of WRKY transcription factors.

    Science.gov (United States)

    Cheng, Yuan; Zhou, Yuan; Yang, Yan; Chi, Ying-Jun; Zhou, Jie; Chen, Jian-Ye; Wang, Fei; Fan, Baofang; Shi, Kai; Zhou, Yan-Hong; Yu, Jing-Quan; Chen, Zhixiang

    2012-06-01

    WRKY transcription factors are encoded by a large gene superfamily with a broad range of roles in plants. Recently, several groups have reported that proteins containing a short VQ (FxxxVQxLTG) motif interact with WRKY proteins. We have recently discovered that two VQ proteins from Arabidopsis (Arabidopsis thaliana), SIGMA FACTOR-INTERACTING PROTEIN1 and SIGMA FACTOR-INTERACTING PROTEIN2, act as coactivators of WRKY33 in plant defense by specifically recognizing the C-terminal WRKY domain and stimulating the DNA-binding activity of WRKY33. In this study, we have analyzed the entire family of 34 structurally divergent VQ proteins from Arabidopsis. Yeast (Saccharomyces cerevisiae) two-hybrid assays showed that Arabidopsis VQ proteins interacted specifically with the C-terminal WRKY domains of group I and the sole WRKY domains of group IIc WRKY proteins. Using site-directed mutagenesis, we identified structural features of these two closely related groups of WRKY domains that are critical for interaction with VQ proteins. Quantitative reverse transcription polymerase chain reaction revealed that expression of a majority of Arabidopsis VQ genes was responsive to pathogen infection and salicylic acid treatment. Functional analysis using both knockout mutants and overexpression lines revealed strong phenotypes in growth, development, and susceptibility to pathogen infection. Altered phenotypes were substantially enhanced through cooverexpression of genes encoding interacting VQ and WRKY proteins. These findings indicate that VQ proteins play an important role in plant growth, development, and response to environmental conditions, most likely by acting as cofactors of group I and IIc WRKY transcription factors.

  17. Finishing and Special Motifs: Lessons Learned from CRISPR Analysis Using Next-Generation Draft Sequences (7th Annual SFAF Meeting, 2012)

    Energy Technology Data Exchange (ETDEWEB)

    Campbell, Catherine

    2012-06-01

    Catherine Campbell on "Finishing and Special Motifs: Lessons learned from CRISPR analysis using next-generation draft sequences" at the 2012 Sequencing, Finishing, Analysis in the Future Meeting held June 5-7, 2012 in Santa Fe, New Mexico.

  18. AMP-acetyl CoA synthetase from Leishmania donovani: identification and functional analysis of 'PX4GK' motif.

    Science.gov (United States)

    Soumya, Neelagiri; Kumar, I Sravan; Shivaprasad, S; Gorakh, Landage Nitin; Dinesh, Neeradi; Swamy, Kayala Kambagiri; Singh, Sushma

    2015-04-01

    An adenosine monophosphate forming acetyl CoA synthetase (AceCS) which is the key enzyme involved in the conversion of acetate to acetyl CoA has been identified from Leishmania donovani for the first time. Sequence analysis of L. donovani AceCS (LdAceCS) revealed the presence of a 'PX4GK' motif which is highly conserved throughout organisms with higher sequence identity (96%) to lower sequence identity (38%). A ∼ 77 kDa heterologous protein with C-terminal 6X His-tag was expressed in Escherichia coli. Expression of LdAceCS in promastigotes was confirmed by western blot and RT-PCR analysis. Immunolocalization studies revealed that it is a cytosolic protein. We also report the kinetic characterization of recombinant LdAceCS with acetate, adenosine 5'-triphosphate, coenzyme A and propionate as substrates. Site directed mutagenesis of residues in conserved PX4GK motif of LdAceCS was performed to gain insight into its potential role in substrate binding, catalysis and its role in maintaining structural integrity of the protein. P646A, G651A and K652R exhibited more than 90% loss in activity signifying its indispensible role in the enzyme activity. Substitution of other residues in this motif resulted in altered substrate specificity and catalysis. However, none of them had any role in modulation of the secondary structure of the protein except G651A mutant. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. TFII-I regulates target genes in the PI-3K and TGF-β signaling pathways through a novel DNA binding motif.

    Science.gov (United States)

    Segura-Puimedon, Maria; Borralleras, Cristina; Pérez-Jurado, Luis A; Campuzano, Victoria

    2013-09-25

    General transcription factor (TFII-I) is a multi-functional protein involved in the transcriptional regulation of critical developmental genes, encoded by the GTF2I gene located on chromosome 7q11.23. Haploinsufficiency at GTF2I has been shown to play a major role in the neurodevelopmental features of Williams-Beuren syndrome (WBS). Identification of genes regulated by TFII-I is thus critical to detect molecular determinants of WBS as well as to identify potential new targets for specific pharmacological interventions, which are currently absent. We performed a microarray screening for transcriptional targets of TFII-I in cortex and embryonic cells from Gtf2i mutant and wild-type mice. Candidate genes with altered expression were verified using real-time PCR. A novel motif shared by deregulated genes was found and chromatin immunoprecipitation assays in embryonic fibroblasts were used to document in vitro TFII-I binding to this motif in the promoter regions of deregulated genes. Interestingly, the PI3K and TGFβ signaling pathways were over-represented among TFII-I-modulated genes. In this study we have found a highly conserved DNA element, common to a set of genes regulated by TFII-I, and identified and validated novel in vivo neuronal targets of this protein affecting the PI3K and TGFβ signaling pathways. Overall, our data further contribute to unravel the complexity and variability of the different genetic programs orchestrated by TFII-I. © 2013 Elsevier B.V. All rights reserved.

  20. Theory and Application of DNA Histogram Analysis.

    Science.gov (United States)

    Bagwell, Charles Bruce

    The underlying principles and assumptions associated with DNA histograms are discussed along with the characteristics of fluorescent probes. Information theory was described and used to calculate the information content of a DNA histogram. Two major types of DNA histogram analyses are proposed: parametric and nonparametric analysis. Three levels…

  1. The NS1 polypeptide of the murine parvovirus minute virus of mice binds to DNA sequences containing the motif [ACCA]2-3.

    Science.gov (United States)

    Cotmore, S F; Christensen, J; Nüesch, J P; Tattersall, P

    1995-03-01

    A DNA fragment containing the minute virus of mice 3' replication origin was specifically coprecipitated in immune complexes containing the virally coded NS1, but not the NS2, polypeptide. Antibodies directed against the amino- or carboxy-terminal regions of NS1 precipitated the NS1-origin complexes, but antibodies directed against NS1 amino acids 284 to 459 blocked complex formation. Using affinity-purified histidine-tagged NS1 preparations, we have shown that the specific protein-DNA interaction is of moderate affinity, being stable in 0.1 M salt but rapidly lost at higher salt concentrations. In contrast, generalized (or nonspecific) DNA binding by NS1 could be demonstrated only in low salt. Addition of ATP or gamma S-ATP enhanced specific DNA binding by wild-type NS1 severalfold, but binding was lost under conditions which favored ATP hydrolysis. NS1 molecules with mutations in a critical lysine residue (amino acid 405) in the consensus ATP-binding site bound to the origin, but this binding could not be enhanced by ATP addition. DNase I protection assays carried out with wild-type NS1 in the presence of gamma S-ATP gave footprints which extended over 43 nucleotides on both DNA strands, from the middle of the origin bubble sequence to a position some 14 bp beyond the nick site. The DNA-binding site for NS1 was mapped to a 22-bp fragment from the middle of the 3' replication origin which contains the sequence ACCAACCA. This conforms to a reiterated motif (ACCA)2-3, which occurs, in more or less degenerate form, at many sites throughout the minute virus of mice genome (J. W. Bodner, Virus Genes 2:167-182, 1989). Insertion of a single copy of the sequence (ACCA)3 was shown to be sufficient to confer NS1 binding on an otherwise unrecognized plasmid fragment. The functions of NS1 in the viral life cycle are reevaluated in the light of this result.

  2. N-termini of fungal CSL transcription factors are disordered, enriched in regulatory motifs and inhibit DNA binding in fission yeast.

    Directory of Open Access Journals (Sweden)

    Martin Převorovský

    Full Text Available CSL (CBF1/RBP-Jκ/Suppressor of Hairless/LAG-1 transcription factors are the effector components of the Notch receptor signalling pathway, which is critical for metazoan development. The metazoan CSL proteins (class M can also function in a Notch-independent manner. Recently, two novel classes of CSL proteins, designated F1 and F2, have been identified in fungi. The role of the fungal CSL proteins is unclear, because the Notch pathway is not present in fungi. In fission yeast, the Cbf11 and Cbf12 CSL paralogs play antagonistic roles in cell adhesion and the coordination of cell and nuclear division. Unusually long N-terminal extensions are typical for fungal and invertebrate CSL family members. In this study, we investigate the functional significance of these extended N-termini of CSL proteins.We identify 15 novel CSL family members from 7 fungal species and conduct bioinformatic analyses of a combined dataset containing 34 fungal and 11 metazoan CSL protein sequences. We show that the long, non-conserved N-terminal tails of fungal CSL proteins are likely disordered and enriched in phosphorylation sites and PEST motifs. In a case study of Cbf12 (class F2, we provide experimental evidence that the protein is proteolytically processed and that the N-terminus inhibits the Cbf12-dependent DNA binding activity in an electrophoretic mobility shift assay.This study provides insight into the characteristics of the long N-terminal tails of fungal CSL proteins that may be crucial for controlling DNA-binding and CSL function. We propose that the regulation of DNA binding by Cbf12 via its N-terminal region represents an important means by which fission yeast strikes a balance between the class F1 and class F2 paralog activities. This mode of regulation might be shared with other CSL-positive fungi, some of which are relevant to human disease and biotechnology.

  3. A conserved motif in the linker domain of STAT1 transcription factor is required for both recognition and release from high-affinity DNA-binding sites.

    Science.gov (United States)

    Hüntelmann, Bettina; Staab, Julia; Herrmann-Lingen, Christoph; Meyer, Thomas

    2014-01-01

    Binding to specific palindromic sequences termed gamma-activated sites (GAS) is a hallmark of gene activation by members of the STAT (signal transducer and activator of transcription) family of cytokine-inducible transcription factors. However, the precise molecular mechanisms involved in the signal-dependent finding of target genes by STAT dimers have not yet been very well studied. In this study, we have characterized a sequence motif in the STAT1 linker domain which is highly conserved among the seven human STAT proteins and includes surface-exposed residues in close proximity to the bound DNA. Using site-directed mutagenesis, we have demonstrated that a lysine residue in position 567 of the full-length molecule is required for GAS recognition. The substitution of alanine for this residue completely abolished both binding to high-affinity GAS elements and transcriptional activation of endogenous target genes in cells stimulated with interferon-γ (IFNγ), while the time course of transient nuclear accumulation and tyrosine phosphorylation were virtually unchanged. In contrast, two glutamic acid residues (E559 and E563) on each monomer are important for the dissociation of dimeric STAT1 from DNA and, when mutated to alanine, result in elevated levels of tyrosine-phosphorylated STAT1 as well as prolonged IFNγ-stimulated nuclear accumulation. In conclusion, our data indicate that the kinetics of signal-dependent GAS binding is determined by an array of glutamic acid residues located at the interior surface of the STAT1 dimer. These negatively charged residues appear to align the long axis of the STAT1 dimer in a position perpendicular to the DNA, thereby facilitating the interaction between lysine 567 and the phosphodiester backbone of a bound GAS element, which is a prerequisite for transient gene induction.

  4. Construction of a Holliday Junction in Small Circular DNA Molecules for Stable Motifs and Two-Dimensional Lattices.

    Science.gov (United States)

    Guo, Xin; Wang, Xue-Mei; Wei, Shuai; Xiao, Shou-Jun

    2018-04-12

    Design rules for DNA nanotechnology have been mostly learnt from using linear single-stranded (ss) DNA as the source material. For example, the core structure of a typical DAO (double crossover, antiparallel, odd half-turns) tile for assembling 2D lattices is constructed from only two linear ss-oligonucleotide scaffold strands, similar to two ropes making a square knot. Herein, a new type of coupled DAO (cDAO) tile and 2D lattices of small circular ss-oligonucleotides as scaffold strands and linear ss-oligonucleotides as staple strands are reported. A cDAO tile of cDAO-c64nt (c64nt: circular 64 nucleotides), shaped as a solid parallelogram, is constructed with a Holliday junction (HJ) at the center and two HJs at both poles of a c64nt; similarly, cDAO-c84nt, shaped as a crossed quadrilateral composed of two congruent triangles, is formed with a HJ at the center and four three-way junctions at the corners of a c84nt. Perfect 2D lattices were assembled from cDAO tiles: infinite nanostructures of nanoribbons, nanotubes, and nanorings, and finite nanostructures. The structural relationship between the visible lattices imaged by AFM and the corresponding invisible secondary and tertiary molecular structures of HJs, inclination angle of hydrogen bonds against the double-helix axis, and the chirality of the tile can be interpreted very well. This work could shed new light on DNA nanotechnology with unique circular tiles. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  5. The adeno-associated virus major regulatory protein Rep78-c-Jun-DNA motif complex modulates AP-1 activity

    International Nuclear Information System (INIS)

    Prasad, C. Krishna; Meyers, Craig; Zhan Dejin; You Hong; Chiriva-Internati, Maurizio; Mehta, Jawahar L.; Liu Yong; Hermonat, Paul L.

    2003-01-01

    Multiple epidemiologic studies show that adeno-associated virus (AAV) is negatively associated with cervical cancer (CX CA), a cancer which is positively associated with human papillomavirus (HPV) infection. Mechanisms for this correlation may be by Rep78's (AAV's major regulatory protein) ability to bind the HPV-16 p97 promoter DNA and inhibit transcription, to bind and interfere with the functions of the E7 oncoprotein of HPV-16, and to bind a variety of HPV-important cellular transcription factors such as Sp1 and TBP. c-Jun is another important cellular factor intimately linked to the HPV life cycle, as well as keratinocyte differentiation and skin development. Skin is the natural host tissue for both HPV and AAV. In this article it is demonstrated that Rep78 directly interacts with c-Jun, both in vitro and in vivo, as analyzed by Western blot, yeast two-hybrid cDNA, and electrophoretic mobility shift-supershift assay (EMSA supershift). Addition of anti-Rep78 antibodies inhibited the EMSA supershift. Investigating the biological implications of this interaction, Rep78 inhibited the c-Jun-dependent c-jun promoter in transient and stable chloramphenicol acetyl-transferase (CAT) assays. Rep78 also inhibited c-Jun-augmented c-jun promoter as well as the HPV-16 p97 promoter activity (also c-Jun regulated) in in vitro transcription assays in T47D nuclear extracts. Finally, the Rep78-c-Jun interaction mapped to the amino-half of Rep78. The ability of Rep78 to interact with c-Jun and down-regulate AP-1-dependent transcription suggests one more mechanism by which AAV may modulate the HPV life cycle and the carcinogenesis process

  6. [Personal motif in art].

    Science.gov (United States)

    Gerevich, József

    2015-01-01

    One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.

  7. Analysis of a conserved RGE/RGD motif in HCV E2 in mediating entry

    Directory of Open Access Journals (Sweden)

    Rong Lijun

    2009-01-01

    Full Text Available Abstract Background Hepatitis C virus (HCV encodes two transmembrane glycoproteins E1 and E2 which form a heterodimer. E1 is believed to mediate fusion while E2 has been shown to bind cellular receptors. It is clear that HCV uses a multi-receptor complex to gain entry into susceptible cells, however key elements of this complex remain elusive. In this study, the role of a highly conserved RGE/RGD motif of HCV E2 glycoprotein in viral entry was examined. The effect of each substitution mutation in this motif was tested by challenging susceptible cell lines with mutant HCV E1E2 pseudotyped viruses generated using a lentiviral system (HCVpp. In addition to assaying infectivity, producer cell expression and HCVpp incorporation of HCV E2 proteins, CD81 binding profiles, and conformation of mutants were examined. Results Based on these characteristics, mutants either displayed wt characteristics (high infectivity [≥ 90% of wt HCVpp], CD81 binding, E1E2 expression, and incorporation into viral particles and proper conformation or very low infectivity (≤ 20% of wt HCVpp. Only amino acid substitutions of the 3rd position (D or E resulted in wt characteristics as long as the negative charge was maintained or a neutral alanine was introduced. A change in charge to a positive lysine, disrupted HCVpp infectivity at this position. Conclusion Although most amino acid substitutions within this conserved motif displayed greatly reduced HCVpp infectivity, they retained soluble CD81 binding, proper E2 conformation, and incorporation into HCVpp. Our results suggest that although RGE/D is a well-defined integrin binding motif, in this case the role of these three hyperconserved amino acids does not appear to be integrin binding. As the extent of conservation of this region extends well beyond these three amino acids, we speculate that this region may play an important role in the structure of HCV E2 or in mediating the interaction with other factor(s during

  8. The AT-Hook motif as a versatile minor groove anchor for promoting DNA binding of transcription factor fragments? ?Electronic supplementary information (ESI) available: Peptide synthesis, full experimental procedures and analytical data of the peptides and products obtained. See DOI: 10.1039/c5sc01415h Click here for additional data file.

    OpenAIRE

    Rodr?guez, J?ssica; Mosquera, Jes?s; Couceiro, Jose R.; V?zquez, M. Eugenio; Mascare?as, Jos? L.

    2015-01-01

    We report the development of chimeric DNA binding peptides comprising a DNA binding fragment of natural transcription factors (the basic region of a bZIP protein or a monomeric zinc finger module) and an AT-Hook peptide motif. The resulting peptide conjugates display high DNA affinity and excellent sequence selectivity. Furthermore, the AT-Hook motif also favors the cell internalization of the conjugates.

  9. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...

  10. Computational analysis and prediction of the binding motif and protein interacting partners of the Abl SH3 domain.

    Directory of Open Access Journals (Sweden)

    Tingjun Hou

    2006-01-01

    Full Text Available Protein-protein interactions, particularly weak and transient ones, are often mediated by peptide recognition domains, such as Src Homology 2 and 3 (SH2 and SH3 domains, which bind to specific sequence and structural motifs. It is important but challenging to determine the binding specificity of these domains accurately and to predict their physiological interacting partners. In this study, the interactions between 35 peptide ligands (15 binders and 20 non-binders and the Abl SH3 domain were analyzed using molecular dynamics simulation and the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. The calculated binding free energies correlated well with the rank order of the binding peptides and clearly distinguished binders from non-binders. Free energy component analysis revealed that the van der Waals interactions dictate the binding strength of peptides, whereas the binding specificity is determined by the electrostatic interaction and the polar contribution of desolvation. The binding motif of the Abl SH3 domain was then determined by a virtual mutagenesis method, which mutates the residue at each position of the template peptide relative to all other 19 amino acids and calculates the binding free energy difference between the template and the mutated peptides using the Molecular Mechanics/Poisson-Boltzmann Solvent Area method. A single position mutation free energy profile was thus established and used as a scoring matrix to search peptides recognized by the Abl SH3 domain in the human genome. Our approach successfully picked ten out of 13 experimentally determined binding partners of the Abl SH3 domain among the top 600 candidates from the 218,540 decapeptides with the PXXP motif in the SWISS-PROT database. We expect that this physical-principle based method can be applied to other protein domains as well.

  11. LINKAGE ANALYSIS BY 2-DIMENSIONAL DNA TYPING

    NARCIS (Netherlands)

    MEERMAN, GJT; MULLAART, E; VANDERMEULEN, MA; DENDAAS, JHG; MOROLLI, B; UITTERLINDEN, AG; VIJG, J

    1993-01-01

    In two-dimensional (2-D) DNA typing, genomic DNA fragments are separated, first according to size by electrophoresis in a neutral polyacrylamide gel and second according to sequence by denaturing gradient gel electrophoresis, followed by hybridization analysis using micro- and minisatellite core

  12. Bovine and equine forensic DNA analysis

    NARCIS (Netherlands)

    van de Goor, L.H.P.

    2011-01-01

    Animal forensic DNA analysis is being used for human criminal investigations (e.g traces from cats and dogs), wildlife management, breeding and food safety. The most common DNA markers used for such forensic casework are short tandem repeats (STR). Rules and guidelines concerning quality assurance

  13. BIOPEP-PBIL Tool for the Analysis of the Structure of Biologically Active Motifs Derived from Food Proteins

    Directory of Open Access Journals (Sweden)

    Jerzy Dziuba

    2011-01-01

    Full Text Available This work describes a flexible technique for the analysis of protein sequences as a source of motifs affecting bodily functions. The BIOPEP database, along with the Pôle Bioinformatique Lyonnais (PBIL server, were applied to define which activities of peptides dominated in their protein precursors and which structure of the protein contained the most of the revealed activities. Such an approach could be helpful in finding some structural requirements for peptide(s to be regarded as biologically active (bioactive. It was found that apart from the activities of peptides that commonly occur in the majority of proteins (e.g. ACE inhibitors, all analyzed proteins can be a source of motifs involved in e.g. activation of ubiquitin-mediated proteolysis. This could be important in designing diets for patients who suffer from neural diseases. The structure and bioactivity analyses revealed that if peptides were to be 'bioactive', it is essential that they assume the position of a coil (or combination of coil and a-helix in the sequence of their protein precursors. However, it is recommended to consider the factors such as the length of peptide chains, the number of peptides in the database as well as the repeatability of the occurrence of characteristic amino acids, both in the peptide and in the protein when studying the bioactivity and structure of biomolecules.

  14. Nanopore sensors for DNA analysis

    DEFF Research Database (Denmark)

    Solovyeva, Vita; Venkatesan, B.M.; Shim, Jeong

    2012-01-01

    Solid-state nanopore sensors are promising devices for single DNA molecule detection and sequencing. This paper presents a review of our work on solid-state nanopores performed over the last decade. In particular, here we discuss atomic-layer-deposited (ALD)-based, graphene-based, and functionali......Solid-state nanopore sensors are promising devices for single DNA molecule detection and sequencing. This paper presents a review of our work on solid-state nanopores performed over the last decade. In particular, here we discuss atomic-layer-deposited (ALD)-based, graphene...

  15. Hybrids of the bHLH and bZIP protein motifs display different DNA-binding activities in vivo vs. in vitro.

    Directory of Open Access Journals (Sweden)

    Hiu-Kwan Chow

    Full Text Available Minimalist hybrids comprising the DNA-binding domain of bHLH/PAS (basic-helix-loop-helix/Per-Arnt-Sim protein Arnt fused to the leucine zipper (LZ dimerization domain from bZIP (basic region-leucine zipper protein C/EBP were designed to bind the E-box DNA site, CACGTG, targeted by bHLHZ (basic-helix-loop-helix-zipper proteins Myc and Max, as well as the Arnt homodimer. The bHLHZ-like structure of ArntbHLH-C/EBP comprises the Arnt bHLH domain fused to the C/EBP LZ: i.e. swap of the 330 aa PAS domain for the 29 aa LZ. In the yeast one-hybrid assay (Y1H, transcriptional activation from the E-box was strong by ArntbHLH-C/EBP, and undetectable for the truncated ArntbHLH (PAS removed, as detected via readout from the HIS3 and lacZ reporters. In contrast, fluorescence anisotropy titrations showed affinities for the E-box with ArntbHLH-C/EBP and ArntbHLH comparable to other transcription factors (K(d 148.9 nM and 40.2 nM, respectively, but only under select conditions that maintained folded protein. Although in vivo yeast results and in vitro spectroscopic studies for ArntbHLH-C/EBP targeting the E-box correlate well, the same does not hold for ArntbHLH. As circular dichroism confirms that ArntbHLH-C/EBP is a much more strongly alpha-helical structure than ArntbHLH, we conclude that the nonfunctional ArntbHLH in the Y1H must be due to misfolding, leading to the false negative that this protein is incapable of targeting the E-box. Many experiments, including protein design and selections from large libraries, depend on protein domains remaining well-behaved in the nonnative experimental environment, especially small motifs like the bHLH (60-70 aa. Interestingly, a short helical LZ can serve as a folding- and/or solubility-enhancing tag, an important device given the focus of current research on exploration of vast networks of biomolecular interactions.

  16. Characterization of the CrbS/R Two-Component System in Pseudomonas fluorescens Reveals a New Set of Genes under Its Control and a DNA Motif Required for CrbR-Mediated Transcriptional Activation

    Directory of Open Access Journals (Sweden)

    Edgardo Sepulveda

    2017-11-01

    Full Text Available The CrbS/R system is a two-component signal transduction system that regulates acetate utilization in Vibrio cholerae, P. aeruginosa, and P. entomophila. CrbS is a hybrid histidine kinase that belongs to a recently identified family, in which the signaling domain is fused to an SLC5 solute symporter domain through aSTAC domain. Upon activation by CrbS, CrbR activates transcription of the acs gene, which encodes an acetyl-CoA synthase (ACS, and the actP gene, which encodes an acetate/solute symporter. In this work, we characterized the CrbS/R system in Pseudomonas fluorescens SBW25. Through the quantitative proteome analysis of different mutants, we were able to identify a new set of genes under its control, which play an important role during growth on acetate. These results led us to the identification of a conserved DNA motif in the putative promoter region of acetate-utilization genes in the Gammaproteobacteria that is essential for the CrbR-mediated transcriptional activation of genes under acetate-utilizing conditions. Finally, we took advantage of the existence of a second SLC5-containing two-component signal transduction system in P. fluorescens, CbrA/B, to demonstrate that the activation of the response regulator by the histidine kinase is not dependent on substrate transport through the SLC5 domain.

  17. Analysis of network motifs in cellular regulation: Structural similarities, input-output relations and signal integration.

    Science.gov (United States)

    Straube, Ronny

    2017-12-01

    Much of the complexity of regulatory networks derives from the necessity to integrate multiple signals and to avoid malfunction due to cross-talk or harmful perturbations. Hence, one may expect that the input-output behavior of larger networks is not necessarily more complex than that of smaller network motifs which suggests that both can, under certain conditions, be described by similar equations. In this review, we illustrate this approach by discussing the similarities that exist in the steady state descriptions of a simple bimolecular reaction, covalent modification cycles and bacterial two-component systems. Interestingly, in all three systems fundamental input-output characteristics such as thresholds, ultrasensitivity or concentration robustness are described by structurally similar equations. Depending on the system the meaning of the parameters can differ ranging from protein concentrations and affinity constants to complex parameter combinations which allows for a quantitative understanding of signal integration in these systems. We argue that this approach may also be extended to larger regulatory networks. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Sequence analysis of Leukemia DNA

    Science.gov (United States)

    Nacong, Nasria; Lusiyanti, Desy; Irawan, Muhammad. Isa

    2018-03-01

    Cancer is a very deadly disease, one of which is leukemia disease or better known as blood cancer. The cancer cell can be detected by taking DNA in laboratory test. This study focused on local alignment of leukemia and non leukemia data resulting from NCBI in the form of DNA sequences by using Smith-Waterman algorithm. SmithWaterman algorithm was invented by TF Smith and MS Waterman in 1981. These algorithms try to find as much as possible similarity of a pair of sequences, by giving a negative value to the unequal base pair (mismatch), and positive values on the same base pair (match). So that will obtain the maximum positive value as the end of the alignment, and the minimum value as the initial alignment. This study will use sequences of leukemia and 3 sequences of non leukemia.

  19. Mutational Analysis of the RecJ Exonuclease of Escherichia coli: Identification of Phosphoesterase Motifs

    OpenAIRE

    Sutera, Vincent A.; Han, Eugene S.; Rajman, Luis A.; Lovett, Susan T.

    1999-01-01

    The recJ gene, identified in Escherichia coli, encodes a Mg+2-dependent 5′-to-3′ exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding protei...

  20. The Verrucomicrobia LexA-binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

    Directory of Open Access Journals (Sweden)

    Ivan Erill

    2016-07-01

    Full Text Available The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  1. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  2. G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    John A Capra

    2010-07-01

    Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs. Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.

  3. New technologies for DNA analysis

    DEFF Research Database (Denmark)

    McGinn, Steven; Bauer, David; Brefort, Thomas

    2016-01-01

    The REvolutionary Approaches and Devices for Nucleic Acid analysis (READNA) project received funding from the European Commission for 4 1/2 years. The objectives of the project revolved around technological developments in nucleic acid analysis. The project partners have discovered, created and d...

  4. Entropy analysis in yeast DNA

    International Nuclear Information System (INIS)

    Kim, Jongkwang; Kim, Sowun; Lee, Kunsang; Kwon, Younghun

    2009-01-01

    In this article, we investigate the language structure in yeast 16 chromosomes. In order to find it, we use the entropy analysis for codons (or amino acids) of yeast 16 chromosomes, developed in analysis of natural language by Montemurro et al. From the analysis, we can see that there exists a language structure in codons (or amino acids) of yeast 16 chromosomes. Also we find that the grammar structure of amino acids of yeast 16 chromosomes has a deep relationship with secondary structure of protein.

  5. Quantitative mass spectrometry analysis reveals similar substrate consensus motif for human Mps1 kinase and Plk1.

    Directory of Open Access Journals (Sweden)

    Zhen Dou

    Full Text Available BACKGROUND: Members of the Mps1 kinase family play an essential and evolutionarily conserved role in the spindle assembly checkpoint (SAC, a surveillance mechanism that ensures accurate chromosome segregation during mitosis. Human Mps1 (hMps1 is highly phosphorylated during mitosis and many phosphorylation sites have been identified. However, the upstream kinases responsible for these phosphorylations are not presently known. METHODOLOGY/PRINCIPAL FINDINGS: Here, we identify 29 in vivo phosphorylation sites in hMps1. While in vivo analyses indicate that Aurora B and hMps1 activity are required for mitotic hyper-phosphorylation of hMps1, in vitro kinase assays show that Cdk1, MAPK, Plk1 and hMps1 itself can directly phosphorylate hMps1. Although Aurora B poorly phosphorylates hMps1 in vitro, it positively regulates the localization of Mps1 to kinetochores in vivo. Most importantly, quantitative mass spectrometry analysis demonstrates that at least 12 sites within hMps1 can be attributed to autophosphorylation. Remarkably, these hMps1 autophosphorylation sites closely resemble the consensus motif of Plk1, demonstrating that these two mitotic kinases share a similar substrate consensus. CONCLUSIONS/SIGNIFICANCE: hMps1 kinase is regulated by Aurora B kinase and its autophosphorylation. Analysis on hMps1 autophosphorylation sites demonstrates that hMps1 has a substrate preference similar to Plk1 kinase.

  6. CAGEd-oPOSSUM: motif enrichment analysis from CAGE-derived TSSs.

    Science.gov (United States)

    Arenillas, David J; Forrest, Alistair R R; Kawaji, Hideya; Lassmann, Timo; Wasserman, Wyeth W; Mathelier, Anthony

    2016-09-15

    With the emergence of large-scale Cap Analysis of Gene Expression (CAGE) datasets from individual labs and the FANTOM consortium, one can now analyze the cis-regulatory regions associated with gene transcription at an unprecedented level of refinement. By coupling transcription factor binding site (TFBS) enrichment analysis with CAGE-derived genomic regions, CAGEd-oPOSSUM can identify TFs that act as key regulators of genes involved in specific mammalian cell and tissue types. The webtool allows for the analysis of CAGE-derived transcription start sites (TSSs) either provided by the user or selected from ∼1300 mammalian samples from the FANTOM5 project with pre-computed TFBS predicted with JASPAR TF binding profiles. The tool helps power insights into the regulation of genes through the study of the specific usage of TSSs within specific cell types and/or under specific conditions. The CAGEd-oPOSUM web tool is implemented in Perl, MySQL and Apache and is available at http://cagedop.cmmt.ubc.ca/CAGEd_oPOSSUM CONTACTS: anthony.mathelier@ncmm.uio.no or wyeth@cmmt.ubc.ca Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  7. Structural and Functional Analysis of VQ Motif-Containing Proteins in Arabidopsis as Interacting Proteins of WRKY Transcription Factors1[W][OA

    Science.gov (United States)

    Cheng, Yuan; Zhou, Yuan; Yang, Yan; Chi, Ying-Jun; Zhou, Jie; Chen, Jian-Ye; Wang, Fei; Fan, Baofang; Shi, Kai; Zhou, Yan-Hong; Yu, Jing-Quan; Chen, Zhixiang

    2012-01-01

    WRKY transcription factors are encoded by a large gene superfamily with a broad range of roles in plants. Recently, several groups have reported that proteins containing a short VQ (FxxxVQxLTG) motif interact with WRKY proteins. We have recently discovered that two VQ proteins from Arabidopsis (Arabidopsis thaliana), SIGMA FACTOR-INTERACTING PROTEIN1 and SIGMA FACTOR-INTERACTING PROTEIN2, act as coactivators of WRKY33 in plant defense by specifically recognizing the C-terminal WRKY domain and stimulating the DNA-binding activity of WRKY33. In this study, we have analyzed the entire family of 34 structurally divergent VQ proteins from Arabidopsis. Yeast (Saccharomyces cerevisiae) two-hybrid assays showed that Arabidopsis VQ proteins interacted specifically with the C-terminal WRKY domains of group I and the sole WRKY domains of group IIc WRKY proteins. Using site-directed mutagenesis, we identified structural features of these two closely related groups of WRKY domains that are critical for interaction with VQ proteins. Quantitative reverse transcription polymerase chain reaction revealed that expression of a majority of Arabidopsis VQ genes was responsive to pathogen infection and salicylic acid treatment. Functional analysis using both knockout mutants and overexpression lines revealed strong phenotypes in growth, development, and susceptibility to pathogen infection. Altered phenotypes were substantially enhanced through cooverexpression of genes encoding interacting VQ and WRKY proteins. These findings indicate that VQ proteins play an important role in plant growth, development, and response to environmental conditions, most likely by acting as cofactors of group I and IIc WRKY transcription factors. PMID:22535423

  8. Superimposed Code Theorectic Analysis of DNA Codes and DNA Computing

    Science.gov (United States)

    2010-03-01

    that the hybridization that occurs between a DNA strand and its Watson - Crick complement can be used to perform mathematical computation. This research...ssDNA single stranded DNA WC Watson – Crick A Adenine C Cytosine G Guanine T Thymine ... Watson - Crick (WC) duplex, e.g., TCGCA TCGCA . Note that non-WC duplexes can form and such a formation is called a cross-hybridization. Cross

  9. Molecular DNA Analysis in Forensic Identification.

    Science.gov (United States)

    Dumache, Raluca; Ciocan, Veronica; Muresan, Camelia; Enache, Alexandra

    2016-01-01

    Serological and biochemical identification methods used in forensics have several major disadvantages, such as: long time in processing biological sample and lack of sensitivity and specificity. In the last 30 years, DNA molecular analysis has become an important tool in forensic investigations. DNA profiling is based on the short tandem repeats (STR) and aids in human identification from biological samples. Forensic genetics, can provide information on the events which occurred at the crime scene or to supplement other methods of forensic identification. Currently, the methods used in identification are based on polymerase chain reaction (PCR) analyses. This method analyses the autosomal STRs, the Y-chromosome, and the mitochondrial DNA. Correlation of biological samples present at the crime scene with identification, selection, and the probative value factor is therefore the first aspect to be taken into consideration in the forensic genetic analysis. In the last decade, because of the advances in the field of molecular biology, new biomarkers such as: microRNAs (miR), messenger RNA (mRNA), and DNA methylation have been studied and proposed to be used in the forensic identifications of body fluids.

  10. Two Tetrahymena G-DNA-binding proteins, TGP1 and TGP3, share novel motifs and may play a role in micronuclear division

    OpenAIRE

    Lu, Quan; Henderson, Eric

    2000-01-01

    G-DNA is a four-stranded DNA structure with diverse putative biological roles. We have previously purified and cloned a novel G-DNA-binding protein TGP1 from the ciliate Tetrahymena thermophila. Here we report the molecular cloning of TGP3, an additional G-DNA-binding protein from the same organism. The TGP3 cDNA encodes a 365 amino acid protein that is homologous to TGP1 (34% identity and 44% similarity). The proteins share a sequence pattern that contains two novel repetitive and homologous...

  11. Motif discovery in ranked lists of sequences

    DEFF Research Database (Denmark)

    Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

    2016-01-01

    Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore...... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery....... a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs...

  12. Sequence and structural analysis of the chitinase insertion domain reveals two conserved motifs involved in chitin-binding.

    Directory of Open Access Journals (Sweden)

    Hai Li

    2010-01-01

    Full Text Available Chitinases are prevalent in life and are found in species including archaea, bacteria, fungi, plants, and animals. They break down chitin, which is the second most abundant carbohydrate in nature after cellulose. Hence, they are important for maintaining a balance between carbon and nitrogen trapped as insoluble chitin in biomass. Chitinases are classified into two families, 18 and 19 glycoside hydrolases. In addition to a catalytic domain, which is a triosephosphate isomerase barrel, many family 18 chitinases contain another module, i.e., chitinase insertion domain. While numerous studies focus on the biological role of the catalytic domain in chitinase activity, the function of the chitinase insertion domain is not completely understood. Bioinformatics offers an important avenue in which to facilitate understanding the role of residues within the chitinase insertion domain in chitinase function.Twenty-seven chitinase insertion domain sequences, which include four experimentally determined structures and span five kingdoms, were aligned and analyzed using a modified sequence entropy parameter. Thirty-two positions with conserved residues were identified. The role of these conserved residues was explored by conducting a structural analysis of a number of holo-enzymes. Hydrogen bonding and van der Waals calculations revealed a distinct subset of four conserved residues constituting two sequence motifs that interact with oligosaccharides. The other conserved residues may be key to the structure, folding, and stability of this domain.Sequence and structural studies of the chitinase insertion domains conducted within the framework of evolution identified four conserved residues which clearly interact with the substrates. Furthermore, evolutionary studies propose a link between the appearance of the chitinase insertion domain and the function of family 18 chitinases in the subfamily A.

  13. cDNA cloning of the basement membrane chondroitin sulfate proteoglycan core protein, bamacan: a five domain structure including coiled-coil motifs

    DEFF Research Database (Denmark)

    Wu, R R; Couchman, J R

    1997-01-01

    Basement membranes contain several proteoglycans, and those bearing heparan sulfate glycosaminoglycans such as perlecan and agrin usually predominate. Most mammalian basement membranes also contain chondroitin sulfate, and a core protein, bamacan, has been partially characterized. We have now....... The protein sequence has low overall homology, apart from very small NH2- and COOH-terminal motifs. At the junctions between the distal globular domains and the coiled-coil regions lie glycosylation sites, with up to three N-linked oligosaccharides and probably three chondroitin chains. Three other Ser...

  14. CompariMotif: quick and easy comparisons of sequence motifs.

    Science.gov (United States)

    Edwards, Richard J; Davey, Norman E; Shields, Denis C

    2008-05-15

    CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/

  15. Missense mutations located in structural p53 DNA-binding motifs are associated with extremely poor survival in chronic lymphocytic leukemia.

    Science.gov (United States)

    Trbusek, Martin; Smardova, Jana; Malcikova, Jitka; Sebejova, Ludmila; Dobes, Petr; Svitakova, Miluse; Vranova, Vladimira; Mraz, Marek; Francova, Hana Skuhrova; Doubek, Michael; Brychtova, Yvona; Kuglik, Petr; Pospisilova, Sarka; Mayer, Jiri

    2011-07-01

    There is a distinct connection between TP53 defects and poor prognosis in chronic lymphocytic leukemia (CLL). It remains unclear whether patients harboring TP53 mutations represent a homogenous prognostic group. We evaluated the survival of patients with CLL and p53 defects identified at our institution by p53 yeast functional assay and complementary interphase fluorescence in situ hybridization analysis detecting del(17p) from 2003 to 2010. A defect of the TP53 gene was identified in 100 of 550 patients. p53 mutations were strongly associated with the deletion of 17p and the unmutated IgVH locus (both P DBMs), structurally well-defined parts of the DNA-binding domain, manifested a clearly shorter median survival (12 months) compared with patients having missense mutations outside DBMs (41 months; P = .002) or nonmissense alterations (36 months; P = .005). The difference in survival was similar in the analysis limited to patients harboring mutation accompanied by del(17p) and was also confirmed in a subgroup harboring TP53 defect at diagnosis. The patients with p53 DBMs mutation (at diagnosis) also manifested a short median time to first therapy (TTFT; 1 month). The substantially worse survival and the short TTFT suggest a strong mutated p53 gain-of-function phenotype in patients with CLL with DBMs mutations. The impact of p53 DBMs mutations on prognosis and response to therapy should be analyzed in investigative clinical trials.

  16. Genomic survey, gene expression analysis and structural modeling suggest diverse roles of DNA methyltransferases in legumes.

    Directory of Open Access Journals (Sweden)

    Rohini Garg

    Full Text Available DNA methylation plays a crucial role in development through inheritable gene silencing. Plants possess three types of DNA methyltransferases (MTases, namely Methyltransferase (MET, Chromomethylase (CMT and Domains Rearranged Methyltransferase (DRM, which maintain methylation at CG, CHG and CHH sites. DNA MTases have not been studied in legumes so far. Here, we report the identification and analysis of putative DNA MTases in five legumes, including chickpea, soybean, pigeonpea, Medicago and Lotus. MTases in legumes could be classified in known MET, CMT, DRM and DNA nucleotide methyltransferases (DNMT2 subfamilies based on their domain organization. First three MTases represent DNA MTases, whereas DNMT2 represents a transfer RNA (tRNA MTase. Structural comparison of all the MTases in plants with known MTases in mammalian and plant systems have been reported to assign structural features in context of biological functions of these proteins. The structure analysis clearly specified regions crucial for protein-protein interactions and regions important for nucleosome binding in various domains of CMT and MET proteins. In addition, structural model of DRM suggested that circular permutation of motifs does not have any effect on overall structure of DNA methyltransferase domain. These results provide valuable insights into role of various domains in molecular recognition and should facilitate mechanistic understanding of their function in mediating specific methylation patterns. Further, the comprehensive gene expression analyses of MTases in legumes provided evidence of their role in various developmental processes throughout the plant life cycle and response to various abiotic stresses. Overall, our study will be very helpful in establishing the specific functions of DNA MTases in legumes.

  17. A DNA Structure-Based Bionic Wavelet Transform and Its Application to DNA Sequence Analysis

    Directory of Open Access Journals (Sweden)

    Fei Chen

    2003-01-01

    Full Text Available DNA sequence analysis is of great significance for increasing our understanding of genomic functions. An important task facing us is the exploration of hidden structural information stored in the DNA sequence. This paper introduces a DNA structure-based adaptive wavelet transform (WT – the bionic wavelet transform (BWT – for DNA sequence analysis. The symbolic DNA sequence can be separated into four channels of indicator sequences. An adaptive symbol-to-number mapping, determined from the structural feature of the DNA sequence, was introduced into WT. It can adjust the weight value of each channel to maximise the useful energy distribution of the whole BWT output. The performance of the proposed BWT was examined by analysing synthetic and real DNA sequences. Results show that BWT performs better than traditional WT in presenting greater energy distribution. This new BWT method should be useful for the detection of the latent structural features in future DNA sequence analysis.

  18. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Science.gov (United States)

    Oliveira, Graziele Pereira; Andrade, Ana Cláudia dos Santos Pereira; Rodrigues, Rodrigo Araújo Lima; Arantes, Thalita Souza; Boratto, Paulo Victor Miranda; Silva, Ludmila Karen dos Santos; Dornas, Fábio Pio; Trindade, Giliane de Souza; Drumond, Betânia Paiva; La Scola, Bernard; Kroon, Erna Geessien; Abrahão, Jônatas Santos

    2017-01-01

    For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations. PMID:28117683

  19. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Directory of Open Access Journals (Sweden)

    Graziele Pereira Oliveira

    2017-01-01

    Full Text Available For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV, raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’ that could be evolved gradually by nucleotides’ gain and loss and point mutations.

  20. Structure of the Cpf1 endonuclease R-loop complex after target DNA cleavage

    DEFF Research Database (Denmark)

    Stella, Stefano; Alcón, Pablo; Montoya, Guillermo

    2017-01-01

    involved in DNA unwinding to form a CRISPR RNA (crRNA)-DNA hybrid and a displaced DNA strand. The protospacer adjacent motif (PAM) is recognized by the PAM-interacting domain. The loop-lysine helix-loop motif in this domain contains three conserved lysine residues that are inserted in a dentate manner...... and the crRNA-DNA hybrid, avoiding DNA re-annealing. Mutations in key residues reveal a mechanism linking the PAM and DNA nuclease sites. Analysis of the Cpf1 structures proposes a singular working model of RNA-guided DNA cleavage, suggesting new avenues for redesign of Cpf1....

  1. Glycomic Analysis of Life Stages of the Human Parasite Schistosoma mansoni Reveals Developmental Expression Profiles of Functional and Antigenic Glycan Motifs.

    Science.gov (United States)

    Smit, Cornelis H; van Diepen, Angela; Nguyen, D Linh; Wuhrer, Manfred; Hoffmann, Karl F; Deelder, André M; Hokke, Cornelis H

    2015-07-01

    Glycans present on glycoproteins and glycolipids of the major human parasite Schistosoma mansoni induce innate as well as adaptive immune responses in the host. To be able to study the molecular characteristics of schistosome infections it is therefore required to determine the expression profiles of glycans and antigenic glycan-motifs during a range of critical stages of the complex schistosome lifecycle. We performed a longitudinal profiling study covering schistosome glycosylation throughout worm- and egg-development using a mass spectrometry-based glycomics approach. Our study revealed that during worm development N-glycans with Galβ1-4(Fucα1-3)GlcNAc (LeX) and core-xylose motifs were rapidly lost after cercariae to schistosomula transformation, whereas GalNAcβ1-4GlcNAc (LDN)-motifs gradually became abundant and predominated in adult worms. LeX-motifs were present on glycolipids up to 2 weeks of schistosomula development, whereas glycolipids with mono- and multifucosylated LDN-motifs remained present up to the adult worm stage. In contrast, expression of complex O-glycans diminished to undetectable levels within days after transformation. During egg development, a rich diversity of N-glycans with fucosylated motifs was expressed, but with α3-core fucose and a high degree of multifucosylated antennae only in mature eggs and miracidia. N-glycan antennae were exclusively LDN-based in miracidia. O-glycans in the mature eggs were also diverse and contained LeX- and multifucosylated LDN, but none of these were associated with miracidia in which we detected only the Galβ1-3(Galβ1-6)GalNAc core glycan. Immature eggs also exhibited short O-glycan core structures only, suggesting that complex fucosylated O-glycans of schistosome eggs are derived primarily from glycoproteins produced by the subshell envelope in the developed egg. Lipid glycans with multifucosylated GlcNAc repeats were present throughout egg development, but with the longer highly fucosylated

  2. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  3. Food Fish Identification from DNA Extraction through Sequence Analysis

    Science.gov (United States)

    Hallen-Adams, Heather E.

    2015-01-01

    This experiment exposed 3rd and 4th y undergraduates and graduate students taking a course in advanced food analysis to DNA extraction, polymerase chain reaction (PCR), and DNA sequence analysis. Students provided their own fish sample, purchased from local grocery stores, and the class as a whole extracted DNA, which was then subjected to PCR,…

  4. The Rev1 interacting region (RIR) motif in the scaffold protein XRCC1 mediates a low-affinity interaction with polynucleotide kinase/phosphatase (PNKP) during DNA single-strand break repair.

    Science.gov (United States)

    Breslin, Claire; Mani, Rajam S; Fanta, Mesfin; Hoch, Nicolas; Weinfeld, Michael; Caldecott, Keith W

    2017-09-29

    The scaffold protein X-ray repair cross-complementing 1 (XRCC1) interacts with multiple enzymes involved in DNA base excision repair and single-strand break repair (SSBR) and is important for genetic integrity and normal neurological function. One of the most important interactions of XRCC1 is that with polynucleotide kinase/phosphatase (PNKP), a dual-function DNA kinase/phosphatase that processes damaged DNA termini and that, if mutated, results in ataxia with oculomotor apraxia 4 (AOA4) and microcephaly with early-onset seizures and developmental delay (MCSZ). XRCC1 and PNKP interact via a high-affinity phosphorylation-dependent interaction site in XRCC1 and a forkhead-associated domain in PNKP. Here, we identified using biochemical and biophysical approaches a second PNKP interaction site in XRCC1 that binds PNKP with lower affinity and independently of XRCC1 phosphorylation. However, this interaction nevertheless stimulated PNKP activity and promoted SSBR and cell survival. The low-affinity interaction site required the highly conserved Rev1-interacting region (RIR) motif in XRCC1 and included three critical and evolutionarily invariant phenylalanine residues. We propose a bipartite interaction model in which the previously identified high-affinity interaction acts as a molecular tether, holding XRCC1 and PNKP together and thereby promoting the low-affinity interaction identified here, which then stimulates PNKP directly. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  5. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    Science.gov (United States)

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  6. Aberrant DNA Methylation in Human iPSCs Associates with MYC-Binding Motifs in a Clone-Specific Manner Independent of Genetics.

    Science.gov (United States)

    Panopoulos, Athanasia D; Smith, Erin N; Arias, Angelo D; Shepard, Peter J; Hishida, Yuriko; Modesto, Veronica; Diffenderfer, Kenneth E; Conner, Clay; Biggs, William; Sandoval, Efren; D'Antonio-Chronowska, Agnieszka; Berggren, W Travis; Izpisua Belmonte, Juan Carlos; Frazer, Kelly A

    2017-04-06

    Induced pluripotent stem cells (iPSCs) show variable methylation patterns between lines, some of which reflect aberrant differences relative to embryonic stem cells (ESCs). To examine whether this aberrant methylation results from genetic variation or non-genetic mechanisms, we generated human iPSCs from monozygotic twins to investigate how genetic background, clone, and passage number contribute. We found that aberrantly methylated CpGs are enriched in regulatory regions associated with MYC protein motifs and affect gene expression. We classified differentially methylated CpGs as being associated with genetic and/or non-genetic factors (clone and passage), and we found that aberrant methylation preferentially occurs at CpGs associated with clone-specific effects. We further found that clone-specific effects play a strong role in recurrent aberrant methylation at specific CpG sites across different studies. Our results argue that a non-genetic biological mechanism underlies aberrant methylation in iPSCs and that it is likely based on a probabilistic process involving MYC that takes place during or shortly after reprogramming. Published by Elsevier Inc.

  7. Synthesis of a Hoechst 32258 Analogue Amino Acid Building Block for Direct Incorporation of a Fluorescent High-Affinity DNA Binding Motif into Peptides

    DEFF Research Database (Denmark)

    Harrit, Niels; Behrens, Carsten; Nielsen, P. E.

    2001-01-01

    The synthesis of a new versatile "Hoechst 33258-like" Boc-protected amino acid building block for peptide synthesis is described. It is demonstrated that this new ligand is an effective mimic of Hoechst 33258 in terms of DNA affinity and sequence specificity. Furthermore, this minor groove binder...

  8. Import of desired nucleic acid sequences using addressing motif of mitochondrial ribosomal 5S-rRNA for fluorescent in vivo hybridization of mitochondrial DNA and RNA.

    Science.gov (United States)

    Zelenka, Jaroslav; Alán, Lukáš; Jabůrek, Martin; Ježek, Petr

    2014-04-01

    Based on the matrix-addressing sequence of mitochondrial ribosomal 5S-rRNA (termed MAM), which is naturally imported into mitochondria, we have constructed an import system for in vivo targeting of mitochondrial DNA (mtDNA) or mt-mRNA, in order to provide fluorescence hybridization of the desired sequences. Thus DNA oligonucleotides were constructed, containing the 5'-flanked T7 RNA polymerase promoter. After in vitro transcription and fluorescent labeling with Alexa Fluor(®) 488 or 647 dye, we obtained the fluorescent "L-ND5 probe" containing MAM and exemplar cargo, i.e., annealing sequence to a short portion of ND5 mRNA and to the light-strand mtDNA complementary to the heavy strand nd5 mt gene (5'-end 21 base pair sequence). For mitochondrial in vivo fluorescent hybridization, HepG2 cells were treated with dequalinium micelles, containing the fluorescent probes, bringing the probes proximally to the mitochondrial outer membrane and to the natural import system. A verification of import into the mitochondrial matrix of cultured HepG2 cells was provided by confocal microscopy colocalizations. Transfections using lipofectamine or probes without 5S-rRNA addressing MAM sequence or with MAM only were ineffective. Alternatively, the same DNA oligonucleotides with 5'-CACC overhang (substituting T7 promoter) were transcribed from the tetracycline-inducible pENTRH1/TO vector in human embryonic kidney T-REx®-293 cells, while mitochondrial matrix localization after import of the resulting unlabeled RNA was detected by PCR. The MAM-containing probe was then enriched by three-order of magnitude over the natural ND5 mRNA in the mitochondrial matrix. In conclusion, we present a proof-of-principle for mitochondrial in vivo hybridization and mitochondrial nucleic acid import.

  9. The future of forensic DNA analysis

    Science.gov (United States)

    Butler, John M.

    2015-01-01

    The author's thoughts and opinions on where the field of forensic DNA testing is headed for the next decade are provided in the context of where the field has come over the past 30 years. Similar to the Olympic motto of ‘faster, higher, stronger’, forensic DNA protocols can be expected to become more rapid and sensitive and provide stronger investigative potential. New short tandem repeat (STR) loci have expanded the core set of genetic markers used for human identification in Europe and the USA. Rapid DNA testing is on the verge of enabling new applications. Next-generation sequencing has the potential to provide greater depth of coverage for information on STR alleles. Familial DNA searching has expanded capabilities of DNA databases in parts of the world where it is allowed. Challenges and opportunities that will impact the future of forensic DNA are explored including the need for education and training to improve interpretation of complex DNA profiles. PMID:26101278

  10. [Analysis of DNA-DNA homologies in obligate methylotrophic bacteria].

    Science.gov (United States)

    Doronina, N V; Govorukhina, N I; Lysenko, A M; Trotsenko, Iu A

    1988-01-01

    The genotypic affinity of 19 bacterial strains obligately dependent on methanol or methylamine as carbon and energy sources was studied by techniques of molecular DNA hybridization. The high homology level (35-88%) between motile strain Methylophilus methanolovorus V-1447D and nonmotile strain Methylobacillus sp. VSB-792 as well as other motile strains (Pseudomonas methanolica ATCC 21704, Methylomonas methanolica NRRL 5458, Pseudomonas sp. W6, strain A3) indicates that all of them belong to one genus. Rather high level of homology (62-63%) was found between Methylobacillus glycogenes ATCC 29475 and Pseudomonas insueta ATCC 21276 and strain G-10. The motile strain Methylophilus methylotrophus NCIB 10515 has a low homology (below 20%) to other of the studied obligate methylobacteria. Therefore, at least two genetically different genera of obligate methylobacteria can be distinguished, namely Methylophilus and Methylobacillus, the latter being represented by both motile and nonmotile forms.

  11. A microstructural analysis of isoprenol ether-based polycarboxylates and the impact of structural motifs on the dispersing effectiveness

    International Nuclear Information System (INIS)

    Plank, Johann; Li, Huiqun; Ilg, Manuel; Pickelmann, Julia; Eisenreich, Wolfgang; Yao, Yan; Wang, Ziming

    2016-01-01

    Generally, polycarboxylate superplasticizers (PCEs) are synthesized via aqueous free radical copolymerization. The conditions during copolymerization such as relative reactivity and feeding mode and ratio of monomers can cause different monomer sequences in the final product. In this study, the sequence of monomers in PCE polymers synthesized from acrylic acid and isoprenyloxy polyethylene glycol (IPEG) macromonomer was characterized by 13 C nuclear magnetic resonance (NMR) spectroscopy. Three different triads of monomer sequences (EAE, AAE and AAA; E = ether, A = acid monomer) were detected. It was found that IPEG PCEs predominantly contain the structural motifs of AAE and EAE, and less of AAA. Higher additions of acrylic acid do not incorporate into the structure of PCE, but convert to HMW polyacrylate as by-product instead. A PCE with optimal dispersing effectiveness was achieved at high contents of IPEG macromonomer, a molecular weight (M w ) around 40,000 Da and narrow molecular weight distribution.

  12. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

    2008-01-01

    . Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...

  13. Recurrence plot analysis of DNA sequences

    Energy Technology Data Exchange (ETDEWEB)

    Wu Zuobing [State Key Laboratory of Nonlinear Mechanics, Institute of Mechanics, Chinese Academy of Sciences, Beijing 100080 (China)]. E-mail: wuzb@lnm.imech.ac.cn

    2004-11-15

    Recurrence plot technique of DNA sequences is established on metric representation and employed to analyze correlation structure of nucleotide strings. It is found that, in the transference of nucleotide strings, a human DNA fragment has a major correlation distance, but a yeast chromosome's correlation distance has a constant increasing.

  14. DNA hybridization sensing for cytogenetic analysis

    DEFF Research Database (Denmark)

    Kwasny, Dorota; Dapra, Johannes; Brøgger, Anna Line

    2013-01-01

    are rearrangements between two chromosome arms that results in two derivative chromosomes having a mixed DNA sequence. The current detection method is a Fluorescent In situ Hybridization, which requires a use of expensive, fluorescently labeled probes that target the DNA sequences of two chromosomes involved...... in the translocation (Kwasny et al., 2012). We have developed a new double hybridization assay that allows for sorting of the DNA chromosomal fragments into separate compartment, moreover allowing for detection of the translocation. To detect the translocation it is necessary to determine that the two DNA sequences...... forming a derivative chromosome are connected, which is achieved by two subsequent hybridization steps. The first example of the translocation detection was presented on lab-on-a-disc using fluorescently labeled DNA fragments, representing the derivative chromosome (Brøgger et al., 2012). To allow...

  15. Phylogenetic analysis, based on EPIYA repeats in the cagA gene of Indian Helicobacter pylori, and the implications of sequence variation in tyrosine phosphorylation motifs on determining the clinical outcome

    Directory of Open Access Journals (Sweden)

    Santosh K. Tiwari

    2011-01-01

    Full Text Available The population of India harbors one of the world's most highly diverse gene pools, owing to the influx of successive waves of immigrants over regular periods in time. Several phylogenetic studies involving mitochondrial DNA and Y chromosomal variation have demonstrated Europeans to have been the first settlers in India. Nevertheless, certain controversy exists, due to the support given to the thesis that colonization was by the Austro-Asiatic group, prior to the Europeans. Thus, the aim was to investigate pre-historic colonization of India by anatomically modern humans, using conserved stretches of five amino acid (EPIYA sequences in the cagA gene of Helicobacter pylori. Simultaneously, the existence of a pathogenic relationship of tyrosine phosphorylation motifs (TPMs, in 32 H. pylori strains isolated from subjects with several forms of gastric diseases, was also explored. High resolution sequence analysis of the above described genes was performed. The nucleotide sequences obtained were translated into amino acids using MEGA (version 4.0 software for EPIYA. An MJ-Network was constructed for obtaining TPM haplotypes by using NETWORK (version 4.5 software. The findings of the study suggest that Indian H. pylori strains share a common ancestry with Europeans. No specific association of haplotypes with the outcome of disease was revealed through additional network analysis of TPMs.

  16. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa

    Directory of Open Access Journals (Sweden)

    Wiedmer Stefanie

    2017-01-01

    Full Text Available The genus Eimeria (Apicomplexa, Coccidia provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain.

  17. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa)

    Science.gov (United States)

    Wiedmer, Stefanie; Erdbeer, Alexander; Volke, Beate; Randel, Stephanie; Kapplusch, Franz; Hanig, Sacha; Kurth, Michael

    2017-01-01

    The genus Eimeria (Apicomplexa, Coccidia) provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain. PMID:29210668

  18. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    Full Text Available To know the map between transcription factors (TFs and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%-far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  19. The practical analysis of food: the development of Sakalar quantification table of DNA (SQT-DNA).

    Science.gov (United States)

    Sakalar, Ergün

    2013-11-15

    Practical and highly sensitive Sakalar quantification table of DNA (SQT-DNA) has been developed for the detection% of species-specific DNA amount in food products. Cycle threshold (Ct) data were obtained from multiple curves of real-time qPCR. The statistical analysis was done to estimate the concentration of standard dilutions. Amplicon concentrations versus each Ct value were assessed by the predictions of targets at known concentrations. SQT-DNA was prepared by using the percentage versus each Ct values. The applicability of SQT-DNA to commercial foods was proved by using sausages containing varying ratios of beef, chicken, and soybean. The results showed that SQT-DNA can be used to directly quantify food DNA by a single PCR without the need to construct a standart curve in parallel with the samples every time the experiment is performed, and also quantification by SQT-DNA is as reliable as standard curve quantification for a wide range of DNA concentrations. Copyright © 2013 Elsevier Ltd. All rights reserved.

  20. SOXE transcription factors form selective dimers on non-compact DNA motifs through multifaceted interactions between dimerization and high-mobility group domains.

    Science.gov (United States)

    Huang, Yong-Heng; Jankowski, Aleksander; Cheah, Kathryn S E; Prabhakar, Shyam; Jauch, Ralf

    2015-05-27

    The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM : HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SOXE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins.

  1. SOXE transcription factors form selective dimers on non-compact DNA motifs through multifaceted interactions between dimerization and high-mobility group domains

    Science.gov (United States)

    Huang, Yong-Heng; Jankowski, Aleksander; Cheah, Kathryn S. E.; Prabhakar, Shyam; Jauch, Ralf

    2015-01-01

    The SOXE transcription factors SOX8, SOX9 and SOX10 are master regulators of mammalian development directing sex determination, gliogenesis, pancreas specification and neural crest development. We identified a set of palindromic SOX binding sites specifically enriched in regulatory regions of melanoma cells. SOXE proteins homodimerize on these sequences with high cooperativity. In contrast to other transcription factor dimers, which are typically rigidly spaced, SOXE group proteins can bind cooperatively at a wide range of dimer spacings. Using truncated forms of SOXE proteins, we show that a single dimerization (DIM) domain, that precedes the DNA binding high mobility group (HMG) domain, is sufficient for dimer formation, suggesting that DIM : HMG rather than DIM:DIM interactions mediate the dimerization. All SOXE members can also heterodimerize in this fashion, whereas SOXE heterodimers with SOX2, SOX4, SOX6 and SOX18 are not supported. We propose a structural model where SOXE-specific intramolecular DIM:HMG interactions are allosterically communicated to the HMG of juxtaposed molecules. Collectively, SOXE factors evolved a unique mode to combinatorially regulate their target genes that relies on a multifaceted interplay between the HMG and DIM domains. This property potentially extends further the diversity of target genes and cell-specific functions that are regulated by SOXE proteins. PMID:26013289

  2. A Spatio-Temporal Analysis of Mitochondrial DNA Haplogroup I

    Directory of Open Access Journals (Sweden)

    Revesz Peter Z.

    2016-01-01

    Full Text Available The recent recovery of ancient DNA from a growing number of human samples shows that mitochondrial DNA haplogroup I was introduced to Europe after the end of the Last Glacial Maximum. This paper provides a spatio-temporal analysis of the various subhaplogroups of mitochondrial DNA I. The study suggests that haplogroup I diversified into haplogroups I1, I2’3, I4 and I5 at specific regions in Eurasia and then spread southward to Crete and Egypt.

  3. Rapid DNA analysis for automated processing and interpretation of low DNA content samples.

    Science.gov (United States)

    Turingan, Rosemary S; Vasantgadkar, Sameer; Palombo, Luke; Hogan, Catherine; Jiang, Hua; Tan, Eugene; Selden, Richard F

    2016-01-01

    Short tandem repeat (STR) analysis of casework samples with low DNA content include those resulting from the transfer of epithelial cells from the skin to an object (e.g., cells on a water bottle, or brim of a cap), blood spatter stains, and small bone and tissue fragments. Low DNA content (LDC) samples are important in a wide range of settings, including disaster response teams to assist in victim identification and family reunification, military operations to identify friend or foe, criminal forensics to identify suspects and exonerate the innocent, and medical examiner and coroner offices to identify missing persons. Processing LDC samples requires experienced laboratory personnel, isolated workstations, and sophisticated equipment, requires transport time, and involves complex procedures. We present a rapid DNA analysis system designed specifically to generate STR profiles from LDC samples in field-forward settings by non-technical operators. By performing STR in the field, close to the site of collection, rapid DNA analysis has the potential to increase throughput and to provide actionable information in real time. A Low DNA Content BioChipSet (LDC BCS) was developed and manufactured by injection molding. It was designed to function in the fully integrated Accelerated Nuclear DNA Equipment (ANDE) instrument previously designed for analysis of buccal swab and other high DNA content samples (Investigative Genet. 4(1):1-15, 2013). The LDC BCS performs efficient DNA purification followed by microfluidic ultrafiltration of the purified DNA, maximizing the quantity of DNA available for subsequent amplification and electrophoretic separation and detection of amplified fragments. The system demonstrates accuracy, precision, resolution, signal strength, and peak height ratios appropriate for casework analysis. The LDC rapid DNA analysis system is effective for the generation of STR profiles from a wide range of sample types. The technology broadens the range of sample

  4. Linkage analysis by two-dimensional DNA typing

    NARCIS (Netherlands)

    te Meerman, G J; Mullaart, E; Meulen ,van der Martin; den Daas, J H; Morolli, B; Uitterlinden, A G; Vijg, J

    1993-01-01

    In two-dimensional (2-D) DNA typing, genomic DNA fragments are separated, first according to size by electrophoresis in a neutral polyacrylamide gel and second according to sequence by denaturing gradient gel electrophoresis, followed by hybridization analysis using micro- and minisatellite core

  5. DNA analysis by single molecule stretching in nanofluidic biochips

    DEFF Research Database (Denmark)

    Abad, E.; Juarros, A.; Retolaza, A.

    2011-01-01

    Imprint Lithography (NIL) technology combined with a conventional anodic bonding of the silicon base and Pyrex cover. Using this chip, we have performed single molecule imaging on a bench-top fluorescent microscope system. Lambda phage DNA was used as a model sample to characterize the chip. Single molecules of λ-DNA......Stretching single DNA molecules by confinement in nanofluidic channels has attracted a great interest during the last few years as a DNA analysis tool. We have designed and fabricated a sealed micro/nanofluidic device for DNA stretching applications, based on the use of the high throughput Nano...... stained with the fluorescent dye YOYO-1 were stretched in the nanochannel array and the experimental results were analysed to determine the extension factor of the DNA in the chip and the geometrical average of the nanochannel inner diameter. The determination of the extension ratio of the chip provides...

  6. [The future of forensic DNA analysis for criminal justice].

    Science.gov (United States)

    Laurent, François-Xavier; Vibrac, Geoffrey; Rubio, Aurélien; Thévenot, Marie-Thérèse; Pène, Laurent

    2017-11-01

    In the criminal framework, the analysis of approximately 20 DNA microsatellites enables the establishment of a genetic profile with a high statistical power of discrimination. This technique gives us the possibility to establish or exclude a match between a biological trace detected at a crime scene and a suspect whose DNA was collected via an oral swab. However, conventional techniques do tend to complexify the interpretation of complex DNA samples, such as degraded DNA and mixture DNA. The aim of this review is to highlight the powerness of new forensic DNA methods (including high-throughput sequencing or single-cell sequencing) to facilitate the interpretation of the expert with full compliance with existing french legislation. © 2017 médecine/sciences – Inserm.

  7. Analysis of DNA Hydroxymethylation Using Colorimetric Assay.

    Science.gov (United States)

    Golubov, Andrey; Kovalchuk, Igor

    2017-01-01

    Hydroxymethylcytosine (hmC or 5-hmC) is a nitrogen base occurring as a result of cytosine methylation followed by replacing a methyl group with a hydroxyl group through active oxidation. 5-hmC is considered to be one of the forms of epigenetic modification and is suggested as an intermediate step in a semi-active loss of DNA methylation mark. 5-hmC plays an important role in the epigenetic regulation of gene expression in animals, although its role in plants remains controversial. Here, we present a colorimetric method of quantification of 5-hmC using Brassica rapa DNA.

  8. Promzea: a pipeline for discovery of co-regulatory motifs in maize and other plant species and its application to the anthocyanin and phlobaphene biosynthetic pathways and the Maize Development Atlas.

    Science.gov (United States)

    Liseron-Monfils, Christophe; Lewis, Tim; Ashlock, Daniel; McNicholas, Paul D; Fauteux, François; Strömvik, Martina; Raizada, Manish N

    2013-03-15

    The discovery of genetic networks and cis-acting DNA motifs underlying their regulation is a major objective of transcriptome studies. The recent release of the maize genome (Zea mays L.) has facilitated in silico searches for regulatory motifs. Several algorithms exist to predict cis-acting elements, but none have been adapted for maize. A benchmark data set was used to evaluate the accuracy of three motif discovery programs: BioProspector, Weeder and MEME. Analysis showed that each motif discovery tool had limited accuracy and appeared to retrieve a distinct set of motifs. Therefore, using the benchmark, statistical filters were optimized to reduce the false discovery ratio, and then remaining motifs from all programs were combined to improve motif prediction. These principles were integrated into a user-friendly pipeline for motif discovery in maize called Promzea, available at http://www.promzea.org and on the Discovery Environment of the iPlant Collaborative website. Promzea was subsequently expanded to include rice and Arabidopsis. Within Promzea, a user enters cDNA sequences or gene IDs; corresponding upstream sequences are retrieved from the maize genome. Predicted motifs are filtered, combined and ranked. Promzea searches the chosen plant genome for genes containing each candidate motif, providing the user with the gene list and corresponding gene annotations. Promzea was validated in silico using a benchmark data set: the Promzea pipeline showed a 22% increase in nucleotide sensitivity compared to the best standalone program tool, Weeder, with equivalent nucleotide specificity. Promzea was also validated by its ability to retrieve the experimentally defined binding sites of transcription factors that regulate the maize anthocyanin and phlobaphene biosynthetic pathways. Promzea predicted additional promoter motifs, and genome-wide motif searches by Promzea identified 127 non-anthocyanin/phlobaphene genes that each contained all five predicted promoter

  9. Microfluidic Devices for Forensic DNA Analysis: A Review.

    Science.gov (United States)

    Bruijns, Brigitte; van Asten, Arian; Tiggelaar, Roald; Gardeniers, Han

    2016-08-05

    Microfluidic devices may offer various advantages for forensic DNA analysis, such as reduced risk of contamination, shorter analysis time and direct application at the crime scene. Microfluidic chip technology has already proven to be functional and effective within medical applications, such as for point-of-care use. In the forensic field, one may expect microfluidic technology to become particularly relevant for the analysis of biological traces containing human DNA. This would require a number of consecutive steps, including sample work up, DNA amplification and detection, as well as secure storage of the sample. This article provides an extensive overview of microfluidic devices for cell lysis, DNA extraction and purification, DNA amplification and detection and analysis techniques for DNA. Topics to be discussed are polymerase chain reaction (PCR) on-chip, digital PCR (dPCR), isothermal amplification on-chip, chip materials, integrated devices and commercially available techniques. A critical overview of the opportunities and challenges of the use of chips is discussed, and developments made in forensic DNA analysis over the past 10-20 years with microfluidic systems are described. Areas in which further research is needed are indicated in a future outlook.

  10. Analysis of Low Level DNA Mixtures

    Czech Academy of Sciences Publication Activity Database

    Slovák, Dalibor; Zvárová, Jana

    2013-01-01

    Roč. 1, č. 1 (2013), s. 63-63 ISSN 1805-8698. [EFMI 2013 Special Topic Conference. 17.04.2013-19.04.2013, Prague] Institutional support: RVO:67985807 Keywords : forensic DNA interpretation * low level samples * allele peak heights * dropout probability Subject RIV: IN - Informatics, Computer Science

  11. Gene Expression Analysis Using Agilent DNA Microarrays

    DEFF Research Database (Denmark)

    Stangegaard, Michael

    2009-01-01

    Hybridization of labeled cDNA to microarrays is an intuitively simple and a vastly underestimated process. If it is not performed, optimized, and standardized with the same attention to detail as e.g., RNA amplification, information may be overlooked or even lost. Careful balancing of the amount ...

  12. oPOSSUM-3: advanced analysis of regulatory motif over-representation across genes or ChIP-Seq datasets.

    Science.gov (United States)

    Kwon, Andrew T; Arenillas, David J; Worsley Hunt, Rebecca; Wasserman, Wyeth W

    2012-09-01

    oPOSSUM-3 is a web-accessible software system for identification of over-represented transcription factor binding sites (TFBS) and TFBS families in either DNA sequences of co-expressed genes or sequences generated from high-throughput methods, such as ChIP-Seq. Validation of the system with known sets of co-regulated genes and published ChIP-Seq data demonstrates the capacity for oPOSSUM-3 to identify mediating transcription factors (TF) for co-regulated genes or co-recovered sequences. oPOSSUM-3 is available at http://opossum.cisreg.ca.

  13. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  14. Structural analysis of complementary DNA and amino acid sequences of human and rat androgen receptors

    International Nuclear Information System (INIS)

    Chang, C.; Kokontis, J.; Liao, S.

    1988-01-01

    Structural analysis of cDNAs for human and rat androgen receptors (ARs) indicates that the amino-terminal regions of ARs are rich in oligo- and poly(amino acid) motifs as in some homeotic genes. The human AR has a long stretch of repeated glycines, whereas rat AR has a long stretch of glutamines. There is a considerable sequence similarity among ARs and the receptors for glucocorticoids, progestins, and mineralocorticoids within the steroid-binding domains. The cysteine-rich DNA-binding domains are well conserved. Translation of mRNA transcribed from AR cDNAs yielded 94- and 76-kDa proteins and smaller forms that bind to DNA and have high affinity toward androgens. These rat or human ARs were recognized by human autoantibodies to natural Ars. Molecular hybridization studies, using AR cDNAs as probes, indicated that the ventral prostate and other male accessory organs are rich in AR mRNA and that the production of AR mRNA in the target organs may be autoregulated by androgens

  15. UPDG: Utilities package for data analysis of Pooled DNA GWAS

    Directory of Open Access Journals (Sweden)

    Ho Daniel WH

    2012-01-01

    Full Text Available Abstract Background Despite being a well-established strategy for cost reduction in disease gene mapping, pooled DNA association study is much less popular than the individual DNA approach. This situation is especially true for pooled DNA genomewide association study (GWAS, for which very few computer resources have been developed for its data analysis. This motivates the development of UPDG (Utilities package for data analysis of Pooled DNA GWAS. Results UPDG represents a generalized framework for data analysis of pooled DNA GWAS with the integration of Unix/Linux shell operations, Perl programs and R scripts. With the input of raw intensity data from GWAS, UPDG performs the following tasks in a stepwise manner: raw data manipulation, correction for allelic preferential amplification, normalization, nested analysis of variance for genetic association testing, and summarization of analysis results. Detailed instructions, procedures and commands are provided in the comprehensive user manual describing the whole process from preliminary preparation of software installation to final outcome acquisition. An example dataset (input files and sample output files is also included in the package so that users can easily familiarize themselves with the data file formats, working procedures and expected output. Therefore, UPDG is especially useful for users with some computer knowledge, but without a sophisticated programming background. Conclusions UPDG provides a free, simple and platform-independent one-stop service to scientists working on pooled DNA GWAS data analysis, but with less advanced programming knowledge. It is our vision and mission to reduce the hindrance for performing data analysis of pooled DNA GWAS through our contribution of UPDG. More importantly, we hope to promote the popularity of pooled DNA GWAS, which is a very useful research strategy.

  16. DNA analysis for mysteries buried in history

    Directory of Open Access Journals (Sweden)

    Tanuj Kanchan

    2015-09-01

    Full Text Available Over the years DNA technology has proved to be a path breaking invention and this technological advancement in modern investigations will hopefully solve many more mysteries in the time to come. However, the developing world is lagging far behind owing to financial constraints and has resorted to relatively less reliable methods during investigations. Hopefully, developing nations too will follow suit in utilizing this technology to its potential.

  17. Google matrix analysis of DNA sequences.

    Science.gov (United States)

    Kandiah, Vivek; Shepelyansky, Dima L

    2013-01-01

    For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW). At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  18. Google matrix analysis of DNA sequences.

    Directory of Open Access Journals (Sweden)

    Vivek Kandiah

    Full Text Available For DNA sequences of various species we construct the Google matrix [Formula: see text] of Markov transitions between nearby words composed of several letters. The statistical distribution of matrix elements of this matrix is shown to be described by a power law with the exponent being close to those of outgoing links in such scale-free networks as the World Wide Web (WWW. At the same time the sum of ingoing matrix elements is characterized by the exponent being significantly larger than those typical for WWW networks. This results in a slow algebraic decay of the PageRank probability determined by the distribution of ingoing elements. The spectrum of [Formula: see text] is characterized by a large gap leading to a rapid relaxation process on the DNA sequence networks. We introduce the PageRank proximity correlator between different species which determines their statistical similarity from the view point of Markov chains. The properties of other eigenstates of the Google matrix are also discussed. Our results establish scale-free features of DNA sequence networks showing their similarities and distinctions with the WWW and linguistic networks.

  19. Ancient DNA analysis of dental calculus.

    Science.gov (United States)

    Weyrich, Laura S; Dobney, Keith; Cooper, Alan

    2015-02-01

    Dental calculus (calcified tartar or plaque) is today widespread on modern human teeth around the world. A combination of soft starchy foods, changing acidity of the oral environment, genetic pre-disposition, and the absence of dental hygiene all lead to the build-up of microorganisms and food debris on the tooth crown, which eventually calcifies through a complex process of mineralisation. Millions of oral microbes are trapped and preserved within this mineralised matrix, including pathogens associated with the oral cavity and airways, masticated food debris, and other types of extraneous particles that enter the mouth. As a result, archaeologists and anthropologists are increasingly using ancient human dental calculus to explore broad aspects of past human diet and health. Most recently, high-throughput DNA sequencing of ancient dental calculus has provided valuable insights into the evolution of the oral microbiome and shed new light on the impacts of some of the major biocultural transitions on human health throughout history and prehistory. Here, we provide a brief historical overview of archaeological dental calculus research, and discuss the current approaches to ancient DNA sampling and sequencing. Novel applications of ancient DNA from dental calculus are discussed, highlighting the considerable scope of this new research field for evolutionary biology and modern medicine. Copyright © 2014 Elsevier Ltd. All rights reserved.

  20. Genomic analysis of murine DNA-dependent protein kinase

    International Nuclear Information System (INIS)

    Fujimori, A.; Abe, M.

    2003-01-01

    Full text: The gene of catalytic subunit of DNA dependent protein kinase is responsible gene for SCID mice. The molecules play a critical role in non-homologous end joining including the V(D)J recombination. Contribution of the molecules to the difference of radiosensitivity and the susceptibility to cancer has been suggested. Here we show the entire nucleotide sequence of approximately 193 kbp and 84 kbp genomic regions encoding the entire DNA-PKcs gene in the mouse and chicken respectively. Retroposon was found in the intron 51 of mouse genomic DNA-PKcs gene but in human and chicken. Comparative analysis of these two species strongly suggested that only two genes, DNA-PKcs and MCM4, exist in the region of both species. Several conserved sequences and cis elements, however, were predicted. Recently, the orthologous region for the human DNA-PKcs locus was completed. The results of further comparative study will be discussed

  1. Analysis of DNA interactions using single-molecule force spectroscopy.

    Science.gov (United States)

    Ritzefeld, Markus; Walhorn, Volker; Anselmetti, Dario; Sewald, Norbert

    2013-06-01

    Protein-DNA interactions are involved in many biochemical pathways and determine the fate of the corresponding cell. Qualitative and quantitative investigations on these recognition and binding processes are of key importance for an improved understanding of biochemical processes and also for systems biology. This review article focusses on atomic force microscopy (AFM)-based single-molecule force spectroscopy and its application to the quantification of forces and binding mechanisms that lead to the formation of protein-DNA complexes. AFM and dynamic force spectroscopy are exciting tools that allow for quantitative analysis of biomolecular interactions. Besides an overview on the method and the most important immobilization approaches, the physical basics of the data evaluation is described. Recent applications of AFM-based force spectroscopy to investigate DNA intercalation, complexes involving DNA aptamers and peptide- and protein-DNA interactions are given.

  2. Parole, Sintagmatik, dan Paradigmatik Motif Batik Mega Mendung

    Directory of Open Access Journals (Sweden)

    Rudi - Nababan

    2012-04-01

    Full Text Available ABSTRACT   Discussing traditional batik is related a lot to the organization system of fine arts element ac- companying it, either the pattern of the motif or the technique of the making. In this case, the motif of Mega Mendung Cirebon certainly has patterns and rules which are traditionally different from the other motifs in other areas. Through  semiotics analysis especially with Saussure and Pierce concept, it can be traced that batik with Cirebon motif, in this case Mega Mendung motif, has parole and langue system, as unique fine arts language in batik, and structure of visual syntagmatic and paradigmatic. In the context of batik motif as fine arts language, it is surely related to sign system as symbol and icon.       Keywords: visual semiotic, Cirebon’s batik.

  3. Analisis Unsur Matematika pada Motif Sulam Usus

    Directory of Open Access Journals (Sweden)

    Fredi Ganda Putra

    2017-12-01

    Full Text Available Based on interviews with researchers sources said that the beginning of the intestine embroidery is an art of genuine crafts. Called the intestine embroidery because this technique is a technique of combining a strand of cloth resembling the intestine formed according to the pattern by means of embroidered using a thread. Intestinal embroidery techniques were originally used to create a cover of the women's customary wardrobe of Lampung or often referred to as bebe. But not many people in Lampung, especially people who live in Lampung are still many who do not know and recognize the intestine embroidery because most only know tapis only characteristic of Lampung, besides that there are other cultural results that is embroidered intestine. There are still many who do not know that the intestine motif there is a knowledge of mathematics. The researcher's problem formulation is whether there are mathematical elements contained in the intestine embroidery motif based on the concept of geometry. The purpose of this study is to determine whether there are elements of mathematics contained in the intestine motif based on the concept of geometry. Subjects in this study consisted of 4 people obtained by purposive sampling technique. From the results of data analysis conducted by using descriptive analysis and discussion as follows: (1 Intestinal embroidery motif contains the meaning of mathematics and culture or often called Etnomatematika. On the meaning of culture there is a link between the embroidery intestine with a culture that has been there before as the existence of cultural linkage between Hindu belief Buddhism and there are similarities of motifs and decorative patterns contained in the motif embroidery intestine with ornamental variety in Indonesia. (2 The relationship between the intestine with mathematical motifs there are elements of mathematics such as geometry elements in the form of geometry of dimension one and dimension two, and the

  4. Detection of Adult Green Sturgeon Using Environmental DNA Analysis.

    Directory of Open Access Journals (Sweden)

    Paul S Bergman

    Full Text Available Environmental DNA (eDNA is an emerging sampling method that has been used successfully for detection of rare aquatic species. The Identification of sampling tools that are less stressful for target organisms has become increasingly important for rare and endangered species. A decline in abundance of the Southern Distinct Population Segment (DPS of North American Green Sturgeon located in California's Central Valley has led to its listing as Threatened under the Federal Endangered Species Act in 2006. While visual surveys of spawning Green Sturgeon in the Central Valley are effective at monitoring fish densities in concentrated pool habitats, results do not scale well to the watershed level, providing limited spatial and temporal context. Unlike most traditional survey methods, environmental DNA analysis provides a relatively quick, inexpensive tool that could efficiently monitor the presence and distribution of aquatic species. We positively identified Green Sturgeon DNA at two locations of known presence in the Sacramento River, proving that eDNA can be effective for monitoring the presence of adult sturgeon. While further study is needed to understand uncertainties of the sampling method, our study represents the first documented detection of Green Sturgeon eDNA, indicating that eDNA analysis could provide a new tool for monitoring Green Sturgeon distribution in the Central Valley, complimenting traditional on-going survey methods.

  5. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  6. Isolation and analysis of high quality nuclear DNA with reduced organellar DNA for plant genome sequencing and resequencing

    Directory of Open Access Journals (Sweden)

    Zdepski Anna

    2011-05-01

    Full Text Available Abstract Background High throughput sequencing (HTS technologies have revolutionized the field of genomics by drastically reducing the cost of sequencing, making it feasible for individual labs to sequence or resequence plant genomes. Obtaining high quality, high molecular weight DNA from plants poses significant challenges due to the high copy number of chloroplast and mitochondrial DNA, as well as high levels of phenolic compounds and polysaccharides. Multiple methods have been used to isolate DNA from plants; the CTAB method is commonly used to isolate total cellular DNA from plants that contain nuclear DNA, as well as chloroplast and mitochondrial DNA. Alternatively, DNA can be isolated from nuclei to minimize chloroplast and mitochondrial DNA contamination. Results We describe optimized protocols for isolation of nuclear DNA from eight different plant species encompassing both monocot and eudicot species. These protocols use nuclei isolation to minimize chloroplast and mitochondrial DNA contamination. We also developed a protocol to determine the number of chloroplast and mitochondrial DNA copies relative to the nuclear DNA using quantitative real time PCR (qPCR. We compared DNA isolated from nuclei to total cellular DNA isolated with the CTAB method. As expected, DNA isolated from nuclei consistently yielded nuclear DNA with fewer chloroplast and mitochondrial DNA copies, as compared to the total cellular DNA prepared with the CTAB method. This protocol will allow for analysis of the quality and quantity of nuclear DNA before starting a plant whole genome sequencing or resequencing experiment. Conclusions Extracting high quality, high molecular weight nuclear DNA in plants has the potential to be a bottleneck in the era of whole genome sequencing and resequencing. The methods that are described here provide a framework for researchers to extract and quantify nuclear DNA in multiple types of plants.

  7. Analysis of the mycoplasma genome by recombinant DNA technology

    DEFF Research Database (Denmark)

    Christiansen, C; Frydenberg, Jane; Christiansen, G

    1984-01-01

    A library of DNA fragments from Mycoplasma sp. strain PG50 has been made in the vector pBR325. Analysis in Escherichia coli minicells of randomly picked clones from this library demonstrated that many plasmids can promote synthesis of mycoplasma protein in the E. coli genetic background. Screening....... The DNA sequence of 16S rRNA and the surrounding control regions has been determined....

  8. Laser desorption mass spectrometry for high-throughput DNA analysis and its applications

    Science.gov (United States)

    Chen, C. H. Winston; Golovlev, Valeri V.; Taranenko, N. I.; Allman, S. L.; Isola, Narayana R.; Potter, N. T.; Matteson, K. J.; Chang, Linus Y.

    1999-05-01

    Laser desorption mass spectrometry (LDMS) has been developed for DNA sequencing, disease diagnosis, and DNA fingerprinting for forensic applications. With LDMS, the speed of DNA analysis can be much faster than conventional gel electrophoresis. No dye or radioactive tagging to DNA segments for detection is needed. LDMS is emerging as a new alternative technology for DNA analysis.

  9. Comparative transcriptome analysis of oil palm flowers reveals an EAR-motif-containing R2R3-MYB that modulates phenylpropene biosynthesis.

    Science.gov (United States)

    Li, Ran; Reddy, Vaishnavi Amarr; Jin, Jingjing; Rajan, Chakaravarthy; Wang, Qian; Yue, Genhua; Lim, Chin Huat; Chua, Nam-Hai; Ye, Jian; Sarojam, Rajani

    2017-11-23

    Oil palm is the most productive oil crop and the efficiency of pollination has a direct impact on the yield of oil. Pollination by wind can occur but maximal pollination is mediated by the weevil E. kamerunicus. These weevils complete their life cycle by feeding on male flowers. Attraction of weevils to oil palm flowers is due to the emission of methylchavicol by both male and female flowers. In search for male flowers, the weevils visit female flowers by accident due to methylchavicol fragrance and deposit pollen. Given the importance of methylchavicol emission on pollination, we performed comparative transcriptome analysis of oil palm flowers and leaves to identify candidate genes involved in methylchavicol production in flowers. RNA sequencing (RNA-Seq) of male open flowers, female open flowers and leaves was performed using Illumina HiSeq 2000 platform. Analysis of the transcriptome data revealed that the transcripts of methylchavicol biosynthesis genes were strongly up-regulated whereas transcripts encoding genes involved in lignin production such as, caffeic acid O-methyltransferase (COMT) and Ferulate-5-hydroxylase (F5H) were found to be suppressed in oil palm flowers. Among the transcripts encoding transcription factors, an EAR-motif-containing R2R3-MYB transcription factor (EgMYB4) was found to be enriched in oil palm flowers. We determined that EgMYB4 can suppress the expression of a monolignol pathway gene, EgCOMT, in vivo by binding to the AC elements present in the promoter region. EgMYB4 was further functionally characterized in sweet basil which also produces phenylpropenes like oil palm. Transgenic sweet basil plants showed significant reduction in lignin content but produced more phenylpropenes. Our results suggest that EgMYB4 possibly restrains lignin biosynthesis in oil palm flowers thus allowing enhanced carbon flux into the phenylpropene pathway. This study augments our understanding of the diverse roles that EAR-motif-containing MYBs play to

  10. RADIA: RNA and DNA integrated analysis for somatic mutation detection.

    Directory of Open Access Journals (Sweden)

    Amie J Radenbaugh

    Full Text Available The detection of somatic single nucleotide variants is a crucial component to the characterization of the cancer genome. Mutation calling algorithms thus far have focused on comparing the normal and tumor genomes from the same individual. In recent years, it has become routine for projects like The Cancer Genome Atlas (TCGA to also sequence the tumor RNA. Here we present RADIA (RNA and DNA Integrated Analysis, a novel computational method combining the patient-matched normal and tumor DNA with the tumor RNA to detect somatic mutations. The inclusion of the RNA increases the power to detect somatic mutations, especially at low DNA allelic frequencies. By integrating an individual's DNA and RNA, we are able to detect mutations that would otherwise be missed by traditional algorithms that examine only the DNA. We demonstrate high sensitivity (84% and very high precision (98% and 99% for RADIA in patient data from endometrial carcinoma and lung adenocarcinoma from TCGA. Mutations with both high DNA and RNA read support have the highest validation rate of over 99%. We also introduce a simulation package that spikes in artificial mutations to patient data, rather than simulating sequencing data from a reference genome. We evaluate sensitivity on the simulation data and demonstrate our ability to rescue back mutations at low DNA allelic frequencies by including the RNA. Finally, we highlight mutations in important cancer genes that were rescued due to the incorporation of the RNA.

  11. Genome-Wide Analysis of Transposon and Retroviral Insertions Reveals Preferential Integrations in Regions of DNA Flexibility.

    Science.gov (United States)

    Vrljicak, Pavle; Tao, Shijie; Varshney, Gaurav K; Quach, Helen Ngoc Bao; Joshi, Adita; LaFave, Matthew C; Burgess, Shawn M; Sampath, Karuna

    2016-04-07

    DNA transposons and retroviruses are important transgenic tools for genome engineering. An important consideration affecting the choice of transgenic vector is their insertion site preferences. Previous large-scale analyses of Ds transposon integration sites in plants were done on the basis of reporter gene expression or germ-line transmission, making it difficult to discern vertebrate integration preferences. Here, we compare over 1300 Ds transposon integration sites in zebrafish with Tol2 transposon and retroviral integration sites. Genome-wide analysis shows that Ds integration sites in the presence or absence of marker selection are remarkably similar and distributed throughout the genome. No strict motif was found, but a preference for structural features in the target DNA associated with DNA flexibility (Twist, Tilt, Rise, Roll, Shift, and Slide) was observed. Remarkably, this feature is also found in transposon and retroviral integrations in maize and mouse cells. Our findings show that structural features influence the integration of heterologous DNA in genomes, and have implications for targeted genome engineering. Copyright © 2016 Vrljicak et al.

  12. Norrie disease. Diagnosis of a simplex case by DNA analysis.

    Science.gov (United States)

    Chynn, E W; Walton, D S; Hahn, L B; Dryja, T P

    1996-09-01

    Norrie disease is a rare, X-linked recessive disorder characterized by congenital blindness due to malformed retinas. We describe a simplex patient who had leukokoria and whose clinical diagnosis was confirmed only after molecular genetics analysis. DNA analysis was also used to determine the carrier status of relatives of the proband.

  13. RNA motif search with data-driven element ordering.

    Science.gov (United States)

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  14. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Full Text Available Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs and 'functional' (partial subgraphs. Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.

  15. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir; Alazmi, Meshari; Naser, Rayan Mohammad Mahmoud; Huser, Franceline; Momin, Afaque Ahmad Imtiyaz; Walkiewicz, Katarzyna Wiktoria; Canlas, Christian; Huser, Raphaë l; Ali, Amal J.; Merzaban, Jasmeen; Bajic, Vladimir B.; Gao, Xin; Arold, Stefan T.

    2018-01-01

    and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter

  16. Determination of DNA methylation associated with Acer rubrum (red maple) adaptation to metals: analysis of global DNA modifications and methylation-sensitive amplified polymorphism.

    Science.gov (United States)

    Kim, Nam-Soo; Im, Min-Ji; Nkongolo, Kabwe

    2016-08-01

    Red maple (Acer rubum), a common deciduous tree species in Northern Ontario, has shown resistance to soil metal contamination. Previous reports have indicated that this plant does not accumulate metals in its tissue. However, low level of nickel and copper corresponding to the bioavailable levels in contaminated soils in Northern Ontario causes severe physiological damages. No differentiation between metal-contaminated and uncontaminated populations has been reported based on genetic analyses. The main objective of this study was to assess whether DNA methylation is involved in A. rubrum adaptation to soil metal contamination. Global cytosine and methylation-sensitive amplified polymorphism (MSAP) analyses were carried out in A. rubrum populations from metal-contaminated and uncontaminated sites. The global modified cytosine ratios in genomic DNA revealed a significant decrease in cytosine methylation in genotypes from a metal-contaminated site compared to uncontaminated populations. Other genotypes from a different metal-contaminated site within the same region appear to be recalcitrant to metal-induced DNA alterations even ≥30 years of tree life exposure to nickel and copper. MSAP analysis showed a high level of polymorphisms in both uncontaminated (77%) and metal-contaminated (72%) populations. Overall, 205 CCGG loci were identified in which 127 were methylated in either outer or inner cytosine. No differentiation among populations was established based on several genetic parameters tested. The variations for nonmethylated and methylated loci were compared by analysis of molecular variance (AMOVA). For methylated loci, molecular variance among and within populations was 1.5% and 13.2%, respectively. These values were low (0.6% for among populations and 5.8% for within populations) for unmethylated loci. Metal contamination is seen to affect methylation of cytosine residues in CCGG motifs in the A. rubrum populations that were analyzed.

  17. Systematic analysis of phosphotyrosine antibodies recognizing single phosphorylated EPIYA-motifs in CagA of Western-type Helicobacter pylori strains.

    Directory of Open Access Journals (Sweden)

    Judith Lind

    Full Text Available The clinical outcome of Helicobacter pylori infections is determined by multiple host-pathogen interactions that may develop to chronic gastritis, and sometimes peptic ulcers or gastric cancer. Highly virulent strains encode a type IV secretion system (T4SS that delivers the effector protein CagA into gastric epithelial cells. Translocated CagA undergoes tyrosine phosphorylation at EPIYA-sequence motifs, called A, B and C in Western-type strains, by members of the oncogenic Src and Abl host kinases. Phosphorylated EPIYA-motifs mediate interactions of CagA with host signaling factors--in particular various SH2-domain containing human proteins--thereby hijacking multiple downstream signaling cascades. Observations of tyrosine-phosphorylated CagA are mainly based on the use of commercial phosphotyrosine antibodies, which originally were selected to detect phosphotyrosines in mammalian proteins. Systematic studies of phosphorylated EPIYA-motif detection by the different antibodies would be very useful, but are not yet available. To address this issue, we synthesized phospho- and non-phosphopeptides representing each predominant Western CagA EPIYA-motif, and determined the recognition patterns of seven different phosphotyrosine antibodies in Western blots, and also performed infection studies with diverse representative Western H. pylori strains. Our results show that a total of 9-11 amino acids containing the phosphorylated EPIYA-motifs are necessary and sufficient for specific detection by these antibodies, but revealed great variability in sequence recognition. Three of the antibodies recognized phosphorylated EPIYA-motifs A, B and C similarly well; whereas preferential binding to phosphorylated motif A and motifs A and C was found with two and one antibodies, respectively, and the seventh anti-phosphotyrosine antibody did not recognize any phosphorylated EPIYA-motif. Controls showed that none of the antibodies recognized the corresponding non

  18. Sequencing and Analysis of Neanderthal Genomic DNA

    Energy Technology Data Exchange (ETDEWEB)

    Noonan, James P.; Coop, Graham; Kudaravalli, Sridhar; Smith,Doug; Krause, Johannes; Alessi, Joe; Chen, Feng; Platt, Darren; Paabo,Svante; Pritchard, Jonathan K.; Rubin, Edward M.

    2006-06-13

    Recovery and analysis of multiple Neanderthal autosomalsequences using a metagenomic approach reveals that modern humans andNeanderthals split ~;400,000 years ago, without significant evidence ofsubsequent admixture.

  19. Diagnosis of Lung Cancer by Fractal Analysis of Damaged DNA

    Directory of Open Access Journals (Sweden)

    Hamidreza Namazi

    2015-01-01

    Full Text Available Cancer starts when cells in a part of the body start to grow out of control. In fact cells become cancer cells because of DNA damage. A DNA walk of a genome represents how the frequency of each nucleotide of a pairing nucleotide couple changes locally. In this research in order to study the cancer genes, DNA walk plots of genomes of patients with lung cancer were generated using a program written in MATLAB language. The data so obtained was checked for fractal property by computing the fractal dimension using a program written in MATLAB. Also, the correlation of damaged DNA was studied using the Hurst exponent measure. We have found that the damaged DNA sequences are exhibiting higher degree of fractality and less correlation compared with normal DNA sequences. So we confirmed this method can be used for early detection of lung cancer. The method introduced in this research not only is useful for diagnosis of lung cancer but also can be applied for detection and growth analysis of different types of cancers.

  20. A speedup technique for (l, d-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS, (l, d-motif search (or Planted Motif Search (PMS, and Edit-distance-based Motif Search (EMS. In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm identifies the motifs always and an approximate algorithm may fail to identify some or all of the motifs. The exact version of PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speedup PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very

  1. Pre-steady-state fluorescence analysis of damaged DNA transfer from human DNA glycosylases to AP endonuclease APE1.

    Science.gov (United States)

    Kuznetsova, Alexandra A; Kuznetsov, Nikita A; Ishchenko, Alexander A; Saparbaev, Murat K; Fedorova, Olga S

    2014-10-01

    DNA glycosylases remove the modified, damaged or mismatched bases from the DNA by hydrolyzing the N-glycosidic bonds. Some enzymes can further catalyze the incision of a resulting abasic (apurinic/apyrimidinic, AP) site through β- or β,δ-elimination mechanisms. In most cases, the incision reaction of the AP-site is catalyzed by special enzymes called AP-endonucleases. Here, we report the kinetic analysis of the mechanisms of modified DNA transfer from some DNA glycosylases to the AP endonuclease, APE1. The modified DNA contained the tetrahydrofurane residue (F), the analogue of the AP-site. DNA glycosylases AAG, OGG1, NEIL1, MBD4(cat) and UNG from different structural superfamilies were used. We found that all DNA glycosylases may utilise direct protein-protein interactions in the transient ternary complex for the transfer of the AP-containing DNA strand to APE1. We hypothesize a fast "flip-flop" exchange mechanism of damaged and undamaged DNA strands within this complex for monofunctional DNA glycosylases like MBD4(cat), AAG and UNG. Bifunctional DNA glycosylase NEIL1 creates tightly specific complex with DNA containing F-site thereby efficiently competing with APE1. Whereas APE1 fast displaces other bifunctional DNA glycosylase OGG1 on F-site thereby induces its shifts to undamaged DNA regions. Kinetic analysis of the transfer of DNA between human DNA glycosylases and APE1 allows us to elucidate the critical step in the base excision repair pathway. Copyright © 2014 Elsevier B.V. All rights reserved.

  2. GenePublisher: automated analysis of DNA microarray data

    DEFF Research Database (Denmark)

    Knudsen, Steen; Workman, Christopher; Sicheritz-Ponten, T.

    2003-01-01

    GenePublisher, a system for automatic analysis of data from DNA microarray experiments, has been implemented with a web interface at http://www.cbs.dtu.dk/services/GenePublisher. Raw data are uploaded to the server together with aspecification of the data. The server performs normalization...

  3. Genetic variation and DNA markers in forensic analysis

    African Journals Online (AJOL)

    SAM

    2014-07-30

    Jul 30, 2014 ... Author(s) agree that this article remain permanently open access under the terms of the Creative Commons Attribution License. 4.0 International ... (mtDNA) is today a routine method of analysis of biological ... A promising approach in this context seems to be .... 1985; Armour et al., 1996). ...... management.

  4. Phylogenetic analysis of the genus Hordeum using repetitive DNA sequences

    DEFF Research Database (Denmark)

    Svitashev, S.; Bryngelsson, T.; Vershinin, A.

    1994-01-01

    A set of six cloned barley (Hordeum vulgare) repetitive DNA sequences was used for the analysis of phylogenetic relationships among 31 species (46 taxa) of the genus Hordeum, using molecular hybridization techniques. In situ hybridization experiments showed dispersed organization of the sequences...

  5. Systematic analysis of DEMETER-like DNA glycosylase genes shows lineage-specific Smi-miR7972 involved in SmDML1 regulation in Salvia miltiorrhiza.

    Science.gov (United States)

    Li, Jiang; Li, Caili; Lu, Shanfa

    2018-05-08

    DEMETER-like DNA glycosylases (DMLs) initiate the base excision repair-dependent DNA demethylation to regulate a wide range of biological processes in plants. Six putative SmDML genes, termed SmDML1-SmDML6, were identified from the genome of S. miltiorrhiza, an emerging model plant for Traditional Chinese Medicine (TCM) studies. Integrated analysis of gene structures, sequence features, conserved domains and motifs, phylogenetic analysis and differential expression showed the conservation and divergence of SmDMLs. SmDML1, SmDML2 and SmDML4 were significantly down-regulated by the treatment of 5Aza-dC, a general DNA methylation inhibitor, suggesting involvement of SmDMLs in genome DNA methylation change. SmDML1 was predicted and experimentally validated to be target of Smi-miR7972. Computational analysis of forty whole genome sequences and almost all of RNA-seq data from Lamiids revealed that MIR7972s were only distributed in some plants of the three orders, including Lamiales, Solanales and Boraginales, and the number of MIR7972 genes varied among species. It suggests that MIR7972 genes underwent expansion and loss during the evolution of some Lamiids species. Phylogenetic analysis of MIR7972s showed closer evolutionary relationships between MIR7972s in Boraginales and Solanales in comparison with Lamiales. These results provide a valuable resource for elucidating DNA demethylation mechanism in S. miltiorrhiza.

  6. Identification and DNA fingerprinting of Legionella strains by randomly amplified polymorphic DNA analysis.

    OpenAIRE

    Bansal, N S; McDonell, F

    1997-01-01

    The randomly amplified polymorphic DNA (RAPD) technique was used in the development of a fingerprinting (typing) and identification protocol for Legionella strains. Twenty decamer random oligonucleotide primers were screened for their discriminatory abilities. Two candidate primers were selected. By using a combination of these primers, RAPD analysis allowed for the differentiation between all different species, between the serogroups, and further differentiation between subtypes of the same ...

  7. Design of potent inhibitors of human RAD51 recombinase based on BRC motifs of BRCA2 protein: modeling and experimental validation of a chimera peptide.

    KAUST Repository

    Nomme, Julian; Renodon-Corniè re, Axelle; Asanomi, Yuya; Sakaguchi, Kazuyasu; Stasiak, Alicja Z; Stasiak, Andrzej; Norden, Bengt; Tran, Vinh; Takahashi, Masayuki

    2010-01-01

    We have previously shown that a 28-amino acid peptide derived from the BRC4 motif of BRCA2 tumor suppressor inhibits selectively human RAD51 recombinase (HsRad51). With the aim of designing better inhibitors for cancer treatment, we combined an in silico docking approach with in vitro biochemical testing to construct a highly efficient chimera peptide from eight existing human BRC motifs. We built a molecular model of all BRC motifs complexed with HsRad51 based on the crystal structure of the BRC4 motif-HsRad51 complex, computed the interaction energy of each residue in each BRC motif, and selected the best amino acid residue at each binding position. This analysis enabled us to propose four amino acid substitutions in the BRC4 motif. Three of these increased the inhibitory effect in vitro, and this effect was found to be additive. We thus obtained a peptide that is about 10 times more efficient in inhibiting HsRad51-ssDNA complex formation than the original peptide.

  8. Design of potent inhibitors of human RAD51 recombinase based on BRC motifs of BRCA2 protein: modeling and experimental validation of a chimera peptide.

    KAUST Repository

    Nomme, Julian

    2010-08-01

    We have previously shown that a 28-amino acid peptide derived from the BRC4 motif of BRCA2 tumor suppressor inhibits selectively human RAD51 recombinase (HsRad51). With the aim of designing better inhibitors for cancer treatment, we combined an in silico docking approach with in vitro biochemical testing to construct a highly efficient chimera peptide from eight existing human BRC motifs. We built a molecular model of all BRC motifs complexed with HsRad51 based on the crystal structure of the BRC4 motif-HsRad51 complex, computed the interaction energy of each residue in each BRC motif, and selected the best amino acid residue at each binding position. This analysis enabled us to propose four amino acid substitutions in the BRC4 motif. Three of these increased the inhibitory effect in vitro, and this effect was found to be additive. We thus obtained a peptide that is about 10 times more efficient in inhibiting HsRad51-ssDNA complex formation than the original peptide.

  9. SSTRAP: A computational model for genomic motif discovery ...

    African Journals Online (AJOL)

    Computational methods can potentially provide high-quality prediction of biological molecules such as DNA binding sites and Transcription factors and therefore reduce the time needed for experimental verification and challenges associated with experimental methods. These biological molecules or motifs have significant ...

  10. Metric learning for DNA microarray data analysis

    International Nuclear Information System (INIS)

    Takeuchi, Ichiro; Nakagawa, Masao; Seto, Masao

    2009-01-01

    In many microarray studies, gene set selection is an important preliminary step for subsequent main task such as tumor classification, cancer subtype identification, etc. In this paper, we investigate the possibility of using metric learning as an alternative to gene set selection. We develop a simple metric learning algorithm aiming to use it for microarray data analysis. Exploiting a property of the algorithm, we introduce a novel approach for extending the metric learning to be adaptive. We apply the algorithm to previously studied microarray data on malignant lymphoma subtype identification.

  11. Identification of sequence motifs significantly associated with antisense activity

    Directory of Open Access Journals (Sweden)

    Peek Andrew S

    2007-06-01

    Full Text Available Abstract Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic

  12. Expression analysis of a ''Cucurbita'' cDNA encoding endonuclease

    International Nuclear Information System (INIS)

    Szopa, J.

    1995-01-01

    The nuclear matrices of plant cell nuclei display intrinsic nuclease activity which consists in nicking supercoiled DNA. A cDNA encoding a 32 kDa endonuclease has been cloned and sequenced. The nucleotide and deduced amino-acid sequences show high homology to known 14-3-3-protein sequences from other sources. The amino-acid sequence shows agreement with consensus sequences for potential phosphorylation by protein kinase A and C and for calcium, lipid and membrane-binding sites. The nucleotide-binding site is also present within the conserved part of the sequence. By Northern blot analysis, the differential expression of the corresponding mRNA was detected; it was the strongest in sink tissues. The endonuclease activity found on DNA-polyacrylamide gel electrophoresis coincided with mRNA content and was the highest in tuber. (author). 22 refs, 6 figs

  13. Network clustering coefficient approach to DNA sequence analysis

    Energy Technology Data Exchange (ETDEWEB)

    Gerhardt, Guenther J.L. [Universidade Federal do Rio Grande do Sul-Hospital de Clinicas de Porto Alegre, Rua Ramiro Barcelos 2350/sala 2040/90035-003 Porto Alegre (Brazil); Departamento de Fisica e Quimica da Universidade de Caxias do Sul, Rua Francisco Getulio Vargas 1130, 95001-970 Caxias do Sul (Brazil); Lemke, Ney [Programa Interdisciplinar em Computacao Aplicada, Unisinos, Av. Unisinos, 950, 93022-000 Sao Leopoldo, RS (Brazil); Corso, Gilberto [Departamento de Biofisica e Farmacologia, Centro de Biociencias, Universidade Federal do Rio Grande do Norte, Campus Universitario, 59072 970 Natal, RN (Brazil)]. E-mail: corso@dfte.ufrn.br

    2006-05-15

    In this work we propose an alternative DNA sequence analysis tool based on graph theoretical concepts. The methodology investigates the path topology of an organism genome through a triplet network. In this network, triplets in DNA sequence are vertices and two vertices are connected if they occur juxtaposed on the genome. We characterize this network topology by measuring the clustering coefficient. We test our methodology against two main bias: the guanine-cytosine (GC) content and 3-bp (base pairs) periodicity of DNA sequence. We perform the test constructing random networks with variable GC content and imposed 3-bp periodicity. A test group of some organisms is constructed and we investigate the methodology in the light of the constructed random networks. We conclude that the clustering coefficient is a valuable tool since it gives information that is not trivially contained in 3-bp periodicity neither in the variable GC content.

  14. An efficient identification strategy of clonal tea cultivars using long-core motif SSR markers.

    Science.gov (United States)

    Wang, Rang Jian; Gao, Xiang Feng; Kong, Xiang Rui; Yang, Jun

    2016-01-01

    Microsatellites, or simple sequence repeats (SSRs), especially those with long-core motifs (tri-, tetra-, penta-, and hexa-nucleotide) represent an excellent tool for DNA fingerprinting. SSRs with long-core motifs are preferred since neighbor alleles are more easily separated and identified from each other, which render the interpretation of electropherograms and the true alleles more reliable. In the present work, with the purpose of characterizing a set of core SSR markers with long-core motifs for well fingerprinting clonal cultivars of tea (Camellia sinensis), we analyzed 66 elite clonal tea cultivars in China with 33 initially-chosen long-core motif SSR markers covering all the 15 linkage groups of tea plant genome. A set of 6 SSR markers were conclusively selected as core SSR markers after further selection. The polymorphic information content (PIC) of the core SSR markers was >0.5, with ≤5 alleles in each marker containing 10 or fewer genotypes. Phylogenetic analysis revealed that the core SSR markers were not strongly correlated with the trait 'cultivar processing-property'. The combined probability of identity (PID) between two random cultivars for the whole set of 6 SSR markers was estimated to be 2.22 × 10(-5), which was quite low, confirmed the usefulness of the proposed SSR markers for fingerprinting analyses in Camellia sinensis. Moreover, for the sake of quickly discriminating the clonal tea cultivars, a cultivar identification diagram (CID) was subsequently established using these core markers, which fully reflected the identification process and provided the immediate information about which SSR markers were needed to identify a cultivar chosen among the tested ones. The results suggested that long-core motif SSR markers used in the investigation contributed to the accurate and efficient identification of the clonal tea cultivars and enabled the protection of intellectual property.

  15. Novel Strategy for Discrimination of Transcription Factor Binding Motifs Employing Mathematical Neural Network

    Science.gov (United States)

    Sugimoto, Asuka; Sumi, Takuya; Kang, Jiyoung; Tateno, Masaru

    2017-07-01

    Recognition in biological macromolecular systems, such as DNA-protein recognition, is one of the most crucial problems to solve toward understanding the fundamental mechanisms of various biological processes. Since specific base sequences of genome DNA are discriminated by proteins, such as transcription factors (TFs), finding TF binding motifs (TFBMs) in whole genome DNA sequences is currently a central issue in interdisciplinary biophysical and information sciences. In the present study, a novel strategy to create a discriminant function for discrimination of TFBMs by constituting mathematical neural networks (NNs) is proposed, together with a method to determine the boundary of signals (TFBMs) and noise in the NN-score (output) space. This analysis also leads to the mathematical limitation of discrimination in the recognition of features representing TFBMs, in an information geometrical manifold. Thus, the present strategy enables the identification of the whole space of TFBMs, right up to the noise boundary.

  16. Quantitative analysis of DNA methylation in chronic lymphocytic leukemia patients.

    Science.gov (United States)

    Lyko, Frank; Stach, Dirk; Brenner, Axel; Stilgenbauer, Stephan; Döhner, Hartmut; Wirtz, Michaela; Wiessler, Manfred; Schmitz, Oliver J

    2004-06-01

    Changes in the genomic DNA methylation level have been found to be closely associated with tumorigenesis. In order to analyze the relation of aberrant DNA methylation to clinical and biological risk factors, we have determined the cytosine methylation level of 81 patients diagnosed with chronic lymphocytic leukemia (CLL). The analysis was based on DNA hydrolysis followed by derivatization of the 2'-desoxyribonucleoside-3'-monophosphates with BODIPY FL EDA. Derivatives were separated by micellar electrokinetic chromatography, and laser-induced fluorescence was used for detection. We analyzed potential correlations between DNA methylation levels and numerous patient parameters, including clinical observations and biological data. As a result, we observed a significant correlation with the immunoglobulin variable heavy chain gene (VH) mutation status. This factor has been repeatedly proposed as a reliable prognostic marker for CLL, which suggests that the methylation level might be a valuable factor in determining the prognostic outcome of CLL. We are now in the process of refining our method to broaden its application potential. In this context, we show here that the oxidation of the fluorescence marker in the samples and the evaporation of methanol in the electrolytes can be prevented by a film of paraffin oil. In summary, our results thus establish capillary electrophoresis as a valuable tool for analyzing the DNA methylation status of clinical samples.

  17. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Full Text Available Abstract Background We consider the problem of identifying motifs, recurring or conserved patterns, in the biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1 improves search efficiency compared to existing algorithms, and (2 scales well with the size of alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA we observed reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobin families. Conclusions Our algorithm reduces computational complexity of the current motif finding algorithms and demonstrate strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  18. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.

  19. Recurrence time statistics: versatile tools for genomic DNA sequence analysis.

    Science.gov (United States)

    Cao, Yinhe; Tung, Wen-Wen; Gao, J B

    2004-01-01

    With the completion of the human and a few model organisms' genomes, and the genomes of many other organisms waiting to be sequenced, it has become increasingly important to develop faster computational tools which are capable of easily identifying the structures and extracting features from DNA sequences. One of the more important structures in a DNA sequence is repeat-related. Often they have to be masked before protein coding regions along a DNA sequence are to be identified or redundant expressed sequence tags (ESTs) are to be sequenced. Here we report a novel recurrence time based method for sequence analysis. The method can conveniently study all kinds of periodicity and exhaustively find all repeat-related features from a genomic DNA sequence. An efficient codon index is also derived from the recurrence time statistics, which has the salient features of being largely species-independent and working well on very short sequences. Efficient codon indices are key elements of successful gene finding algorithms, and are particularly useful for determining whether a suspected EST belongs to a coding or non-coding region. We illustrate the power of the method by studying the genomes of E. coli, the yeast S. cervisivae, the nematode worm C. elegans, and the human, Homo sapiens. Computationally, our method is very efficient. It allows us to carry out analysis of genomes on the whole genomic scale by a PC.

  20. Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

    Science.gov (United States)

    Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

    2015-06-01

    Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Quantitative DNA methylation analysis of candidate genes in cervical cancer.

    Directory of Open Access Journals (Sweden)

    Erin M Siegel

    Full Text Available Aberrant DNA methylation has been observed in cervical cancer; however, most studies have used non-quantitative approaches to measure DNA methylation. The objective of this study was to quantify methylation within a select panel of genes previously identified as targets for epigenetic silencing in cervical cancer and to identify genes with elevated methylation that can distinguish cancer from normal cervical tissues. We identified 49 women with invasive squamous cell cancer of the cervix and 22 women with normal cytology specimens. Bisulfite-modified genomic DNA was amplified and quantitative pyrosequencing completed for 10 genes (APC, CCNA, CDH1, CDH13, WIF1, TIMP3, DAPK1, RARB, FHIT, and SLIT2. A Methylation Index was calculated as the mean percent methylation across all CpG sites analyzed per gene (~4-9 CpG site per sequence. A binary cut-point was defined at >15% methylation. Sensitivity, specificity and area under ROC curve (AUC of methylation in individual genes or a panel was examined. The median methylation index was significantly higher in cases compared to controls in 8 genes, whereas there was no difference in median methylation for 2 genes. Compared to HPV and age, the combination of DNA methylation level of DAPK1, SLIT2, WIF1 and RARB with HPV and age significantly improved the AUC from 0.79 to 0.99 (95% CI: 0.97-1.00, p-value = 0.003. Pyrosequencing analysis confirmed that several genes are common targets for aberrant methylation in cervical cancer and DNA methylation level of four genes appears to increase specificity to identify cancer compared to HPV detection alone. Alterations in DNA methylation of specific genes in cervical cancers, such as DAPK1, RARB, WIF1, and SLIT2, may also occur early in cervical carcinogenesis and should be evaluated.

  2. Y-STR analysis on DNA mixture samples--results of a collaborative project of the ENFSI DNA Working Group

    DEFF Research Database (Denmark)

    Parson, Walther; Niederstätter, Harald; Lindinger, Alexandra

    2008-01-01

    The ENFSI (European Network of Forensic Science Institutes) DNA Working Group undertook a collaborative project on Y-STR typing of DNA mixture samples that were centrally prepared and thoroughly tested prior to the shipment. Four commercial Y-STR typing kits (Y-Filer, Applied Biosystems, Foster C...... a laboratory-specific optimization process is indicated to reach a comparable sensitivity for the analysis of minute amounts of DNA....

  3. Dualities in the analysis of phage DNA packaging motors

    Science.gov (United States)

    Serwer, Philip; Jiang, Wen

    2012-01-01

    The DNA packaging motors of double-stranded DNA phages are models for analysis of all multi-molecular motors and for analysis of several fundamental aspects of biology, including early evolution, relationship of in vivo to in vitro biochemistry and targets for anti-virals. Work on phage DNA packaging motors both has produced and is producing dualities in the interpretation of data obtained by use of both traditional techniques and the more recently developed procedures of single-molecule analysis. The dualities include (1) reductive vs. accretive evolution, (2) rotation vs. stasis of sub-assemblies of the motor, (3) thermal ratcheting vs. power stroking in generating force, (4) complete motor vs. spark plug role for the packaging ATPase, (5) use of previously isolated vs. new intermediates for analysis of the intermediate states of the motor and (6) a motor with one cycle vs. a motor with two cycles. We provide background for these dualities, some of which are under-emphasized in the literature. We suggest directions for future research. PMID:23532204

  4. cDNA cloning, genomic organization and expression analysis during somatic embryogenesis of the translationally controlled tumor protein (TCTP) gene from Japanese larch (Larix leptolepis).

    Science.gov (United States)

    Zhang, Li-Feng; Li, Wan-Feng; Han, Su-Ying; Yang, Wen-Hua; Qi, Li-Wang

    2013-10-15

    A full-length cDNA and genomic sequences of a translationally controlled tumor protein (TCTP) gene were isolated from Japanese larch (Larix leptolepis) and designated LaTCTP. The length of the cDNA was 1, 043 bp and contained a 504 bp open reading frame that encodes a predicted protein of 167 amino acids, characterized by two signature sequences of the TCTP protein family. Analysis of the LaTCTP gene structure indicated four introns and five exons, and it is the largest of all currently known TCTP genes in plants. The 5'-flanking promoter region of LaTCTP was cloned using an improved TAIL-PCR technique. In this region we identified many important potential cis-acting elements, such as a Box-W1 (fungal elicitor responsive element), a CAT-box (cis-acting regulatory element related to meristem expression), a CGTCA-motif (cis-acting regulatory element involved in MeJA-responsiveness), a GT1-motif (light responsive element), a Skn-1-motif (cis-acting regulatory element required for endosperm expression) and a TGA-element (auxin-responsive element), suggesting that expression of LaTCTP is highly regulated. Expression analysis demonstrated ubiquitous localization of LaTCTP mRNA in the roots, stems and needles, high mRNA levels in the embryonal-suspensor mass (ESM), browning embryogenic cultures and mature somatic embryos, and low levels of mRNA at day five during somatic embryogenesis. We suggest that LaTCTP might participate in the regulation of somatic embryo development. These results provide a theoretical basis for understanding the molecular regulatory mechanism of LaTCTP and lay the foundation for artificial regulation of somatic embryogenesis. © 2013.

  5. cDNA cloning, sequence analysis, and chromosomal localization of the gene for human carnitine palmitoyltransferase

    International Nuclear Information System (INIS)

    Finocchiaro, G.; Taroni, F.; Martin, A.L.; Colombo, I.; Tarelli, G.T.; DiDonato, S.; Rocchi, M.

    1991-01-01

    The authors have cloned and sequenced a cDNA encoding human liver carnitine palmitoyltransferase an inner mitochondrial membrane enzyme that plays a major role in the fatty acid oxidation pathway. Mixed oligonucleotide primers whose sequences were deduced from one tryptic peptide obtained from purified CPTase were used in a polymerase chain reaction, allowing the amplification of a 0.12-kilobase fragment of human genomic DNA encoding such a peptide. A 60-base-pair (bp) oligonucleotide synthesized on the basis of the sequence from this fragment was used for the screening of a cDNA library from human liver and hybridized to a cDNA insert of 2255 bp. This cDNA contains an open reading frame of 1974 bp that encodes a protein of 658 amino acid residues including 25 residues of an NH 2 -terminal leader peptide. The assignment of this open reading frame to human liver CPTase is confirmed by matches to seven different amino acid sequences of tryptic peptides derived from pure human CPTase and by the 82.2% homology with the amino acid sequence of rat CPTase. The NH 2 -terminal region of CPTase contains a leucine-proline motif that is shared by carnitine acetyl- and octanoyltransferases and by choline acetyltransferase. The gene encoding CPTase was assigned to human chromosome 1, region 1q12-1pter, by hybridization of CPTase cDNA with a DNA panel of 19 human-hanster somatic cell hybrids

  6. Analysis of alkaptonuria (AKU) mutations and polymorphisms reveals that the CCC sequence motif is a mutational hot spot in the homogentisate 1,2 dioxygenase gene (HGO).

    Science.gov (United States)

    Beltrán-Valero de Bernabé, D; Jimenez, F J; Aquaron, R; Rodríguez de Córdoba, S

    1999-01-01

    We recently showed that alkaptonuria (AKU) is caused by loss-of-function mutations in the homogentisate 1,2 dioxygenase gene (HGO). Herein we describe haplotype and mutational analyses of HGO in seven new AKU pedigrees. These analyses identified two novel single-nucleotide polymorphisms (INV4+31A-->G and INV11+18A-->G) and six novel AKU mutations (INV1-1G-->A, W60G, Y62C, A122D, P230T, and D291E), which further illustrates the remarkable allelic heterogeneity found in AKU. Reexamination of all 29 mutations and polymorphisms thus far described in HGO shows that these nucleotide changes are not randomly distributed; the CCC sequence motif and its inverted complement, GGG, are preferentially mutated. These analyses also demonstrated that the nucleotide substitutions in HGO do not involve CpG dinucleotides, which illustrates important differences between HGO and other genes for the occurrence of mutation at specific short-sequence motifs. Because the CCC sequence motifs comprise a significant proportion (34.5%) of all mutated bases that have been observed in HGO, we conclude that the CCC triplet is a mutational hot spot in HGO. PMID:10205262

  7. Nucleotide sequence analysis of regions of adenovirus 5 DNA containing the origins of DNA replication

    International Nuclear Information System (INIS)

    Steenbergh, P.H.

    1979-01-01

    The purpose of the investigations described is the determination of nucleotide sequences at the molecular ends of the linear adenovirus type 5 DNA. Knowledge of the primary structure at the termini of this DNA molecule is of particular interest in the study of the mechanism of replication of adenovirus DNA. The initiation- and termination sites of adenovirus DNA replication are located at the ends of the DNA molecule. (Auth.)

  8. Laser desorption mass spectrometry for fast DNA analysis

    Energy Technology Data Exchange (ETDEWEB)

    Chen, C.H.; Ch`ang, L.Y.; Taranenko, N.I.; Allman, S.L.; Tang, K.; Matteson, K.J.

    1995-09-01

    During the past few years, major effort has been directed toward developing mass spectrometry to measure biopolymers because of the great potential benefit to biomedical research. Hellenkamp and his co-workers were the first to report that large polypeptide molecules can be ionized and detected without significant fragmentation when a greater number of nicotinic acid molecules are used as a matrix. This method is now well known as matrix-assisted laser desorption/ionization (MALDI). Since then, various groups have reported measurements of very large proteins by MALDI. Reliable protein analysis by MALDI is more or less well established. However, the application of MALDI to nucleic acids analysis has been found to be much more difficult. Most research on the measurement of nucleic acid by MALDI were stimulated by the Human Genome Project. Up to now, the only method for reliable routine analysis of nucleic acid is gel electrophoresis. Different sizes of nucleic acids can be separated in gel medium when a high electric field is applied to the gel. However, the time needed to separate different sizes of DNA segments usually takes from several minutes to several hours. If MALDI can be successfully used for nucleic acids analysis, the analysis time can be reduced to less than I millisecond. In addition, no tagging with radioactive materials or chemical dyes is needed. In this work, we will review recent progress related to MALDI for DNA analysis.

  9. Construction and analysis of experimental DNA vaccines against megalocytivirus.

    Science.gov (United States)

    Zhang, Min; Hu, Yong-Hua; Xiao, Zhi-Zhong; Sun, Yun; Sun, Li

    2012-11-01

    Iridoviruses are large double-stranded DNA viruses with icosahedral capsid. The Iridoviridae family contains five genera, one of which is Megalocytivirus. Megalocytivirus has emerged in recent years as an important pathogen to a wide range of marine and freshwater fish. In this study, we aimed at developing effective genetic vaccines against megalocytivirus affecting farmed fish in China. For this purpose, we constructed seven DNA vaccines based on seven genes of rock bream iridovirus isolate 1 from China (RBIV-C1), a megalocytivirus with a host range that includes Japanese flounder (Paralichthys olivaceus) and turbot (Scophthalmus maximus). The protective potentials of these vaccines were examined in a turbot model. The results showed that after vaccination via intramuscular injection, the vaccine plasmids were distributed in spleen, kidney, muscle, and liver, and transcription of the vaccine genes and production of the vaccine proteins were detected in these tissues. Following challenge with a lethal-dose of RBIV-C1, fish vaccinated with four of the seven DNA vaccines exhibited significantly higher levels of survival compared to control fish. Of these four protective DNA vaccines, pCN86, which is a plasmid that expresses an 86-residue viral protein, induced the highest protection. Immunological analysis showed that pCN86 was able to (i) stimulate the respiratory burst of head kidney macrophages at 14 d, 21 d, and 28 d post-vaccination, (ii) upregulate the expression of immune relevant genes involved in innate and adaptive immunity, and (iii) induce production of serum antibodies that, when incubated with RBIV-C1 before infection, significantly reduced viral loads in kidney and spleen following viral infection of turbot. Taken together, these results indicate that pCN86 is an effective DNA vaccine that may be used in the control of megalocytivirus-associated diseases in aquaculture. Copyright © 2012 Elsevier Ltd. All rights reserved.

  10. Complete sequence analysis of 18S rDNA based on genomic DNA extraction from individual Demodex mites (Acari: Demodicidae).

    Science.gov (United States)

    Zhao, Ya-E; Xu, Ji-Ru; Hu, Li; Wu, Li-Ping; Wang, Zheng-Hang

    2012-05-01

    The study for the first time attempted to accomplish 18S ribosomal DNA (rDNA) complete sequence amplification and analysis for three Demodex species (Demodex folliculorum, Demodex brevis and Demodex canis) based on gDNA extraction from individual mites. The mites were treated by DNA Release Additive and Hot Start II DNA Polymerase so as to promote mite disruption and increase PCR specificity. Determination of D. folliculorum gDNA showed that the gDNA yield reached the highest at 1 mite, tending to descend with the increase of mite number. The individual mite gDNA was successfully used for 18S rDNA fragment (about 900 bp) amplification examination. The alignments of 18S rDNA complete sequences of individual mite samples and those of pooled mite samples ( ≥ 1000mites/sample) showed over 97% identities for each species, indicating that the gDNA extracted from a single individual mite was as satisfactory as that from pooled mites for PCR amplification. Further pairwise sequence analyses showed that average divergence, genetic distance, transition/transversion or phylogenetic tree could not effectively identify the three Demodex species, largely due to the differentiation in the D. canis isolates. It can be concluded that the individual Demodex mite gDNA can satisfy the molecular study of Demodex. 18S rDNA complete sequence is suitable for interfamily identification in Cheyletoidea, but whether it is suitable for intrafamily identification cannot be confirmed until the ascertainment of the types of Demodex mites parasitizing in dogs. Copyright © 2012 Elsevier Inc. All rights reserved.

  11. Multilocus DNA fingerprinting in paternity analysis: a Chilean experience

    Directory of Open Access Journals (Sweden)

    Cifuentes O. Lucía

    2000-01-01

    Full Text Available DNA polymorphism is very useful in paternity analysis. The present paper describes paternity studies done using DNA profiles obtained with the (CAC5 probe. All of the subjects studied were involved in nonjudicial cases of paternity. Genomic DNA digested with HaeIII was run on agarose gels and hybridized in the gel with the (CAC5 probe labeled with 32P. The mean number of bands larger than the 4.3 kb per individual was 16.1. The mean proportion of bands shared among unrelated individuals was 0.08 and the mean number of test bands was 7.1. This corresponded to an exclusion probability greater than 0.999999. Paternity was excluded in 34.5% of the cases. The mutation frequency estimated from non-excluded cases was 0.01143 bands per child. In these cases, the paternity was confirmed by a locus-specific analysis of eight independent PCR-based loci. The paternity index was computed in all non-excluded cases. It can be concluded that this method is a powerful and inexpensive alternative to solve paternity doubts.

  12. DNA microarray data and contextual analysis of correlation graphs

    Directory of Open Access Journals (Sweden)

    Hingamp Pascal

    2003-04-01

    Full Text Available Abstract Background DNA microarrays are used to produce large sets of expression measurements from which specific biological information is sought. Their analysis requires efficient and reliable algorithms for dimensional reduction, classification and annotation. Results We study networks of co-expressed genes obtained from DNA microarray experiments. The mathematical concept of curvature on graphs is used to group genes or samples into clusters to which relevant gene or sample annotations are automatically assigned. Application to publicly available yeast and human lymphoma data demonstrates the reliability of the method in spite of its simplicity, especially with respect to the small number of parameters involved. Conclusions We provide a method for automatically determining relevant gene clusters among the many genes monitored with microarrays. The automatic annotations and the graphical interface improve the readability of the data. A C++ implementation, called Trixy, is available from http://tagc.univ-mrs.fr/bioinformatics/trixy.html.

  13. Cohort analysis of a single nucleotide polymorphism on DNA chips.

    Science.gov (United States)

    Schwonbeck, Susanne; Krause-Griep, Andrea; Gajovic-Eichelmann, Nenad; Ehrentreich-Förster, Eva; Meinl, Walter; Glatt, Hansrüdi; Bier, Frank F

    2004-11-15

    A method has been developed to determine SNPs on DNA chips by applying a flow-through bioscanner. As a practical application we demonstrated the fast and simple SNP analysis of 24 genotypes in an array of 96 spots with a single hybridisation and dissociation experiment. The main advantage of this methodical concept is the parallel and fast analysis without any need of enzymatic digestion. Additionally, the DNA chip format used is appropriate for parallel analysis up to 400 spots. The polymorphism in the gene of the human phenol sulfotransferase SULT1A1 was studied as a model SNP. Biotinylated PCR products containing the SNP (The SNP summary web site: ) (mutant) and those containing no mutation (wild-type) were brought onto the chips coated with NeutrAvidin using non-contact spotting. This was followed by an analysis which was carried out in a flow-through biochip scanner while constantly rinsing with buffer. After removing the non-biotinylated strand a fluorescent probe was hybridised, which is complementary to the wild-type sequence. If this probe binds to a mutant sequence, then one single base is not fully matching. Thereby, the mismatched hybrid (mutant) is less stable than the full-matched hybrid (wild-type). The final step after hybridisation on the chip involves rinsing with a buffer to start dissociation of the fluorescent probe from the immobilised DNA strand. The online measurement of the fluorescence intensity by the biochip scanner provides the possibility to follow the kinetics of the hybridisation and dissociation processes. According to the different stability of the full-match and the mismatch, either visual discrimination or kinetic analysis is possible to distinguish SNP-containing sequence from the wild-type sequence.

  14. Diagnostic markers of urothelial cancer based on DNA methylation analysis

    International Nuclear Information System (INIS)

    Chihara, Yoshitomo; Hirao, Yoshihiko; Kanai, Yae; Fujimoto, Hiroyuki; Sugano, Kokichi; Kawashima, Kiyotaka; Liang, Gangning; Jones, Peter A; Fujimoto, Kiyohide; Kuniyasu, Hiroki

    2013-01-01

    Early detection and risk assessment are crucial for treating urothelial cancer (UC), which is characterized by a high recurrence rate, and necessitates frequent and invasive monitoring. We aimed to establish diagnostic markers for UC based on DNA methylation. In this multi-center study, three independent sample sets were prepared. First, DNA methylation levels at CpG loci were measured in the training sets (tumor samples from 91 UC patients, corresponding normal-appearing tissue from these patients, and 12 normal tissues from age-matched bladder cancer-free patients) using the Illumina Golden Gate methylation assay to identify differentially methylated loci. Next, these methylated loci were validated by quantitative DNA methylation by pyrosequencing, using another cohort of tissue samples (Tissue validation set). Lastly, methylation of these markers was analyzed in the independent urine samples (Urine validation set). ROC analysis was performed to evaluate the diagnostic accuracy of these 12 selected markers. Of the 1303 CpG sites, 158 were hyper ethylated and 356 were hypo ethylated in tumor tissues compared to normal tissues. In the panel analysis, 12 loci showed remarkable alterations between tumor and normal samples, with 94.3% sensitivity and 97.8% specificity. Similarly, corresponding normal tissue could be distinguished from normal tissues with 76.0% sensitivity and 100% specificity. Furthermore, the diagnostic accuracy for UC of these markers determined in urine samples was high, with 100% sensitivity and 100% specificity. Based on these preliminary findings, diagnostic markers based on differential DNA methylation at specific loci can be useful for non-invasive and reliable detection of UC and epigenetic field defect

  15. Superimposed Code Theoretic Analysis of Deoxyribonucleic Acid (DNA) Codes and DNA Computing

    Science.gov (United States)

    2010-01-01

    DNA strand and its Watson - Crick complement can be used to perform mathematical computation. This research addresses how the...Acid dsDNA double stranded DNA MOSAIC Mobile Stream Processing Cluster PCR Polymerase Chain Reaction RAM Random Access Memory ssDNA single stranded DNA WC Watson – Crick A Adenine C Cytosine G Guanine T Thymine ...are 5′→3′ and strands with strikethrough are 3′→5′. A dsDNA duplex formed between a strand and its reverse complement is called a

  16. Complete cDNA sequence and amino acid analysis of a bovine ribonuclease K6 gene.

    Science.gov (United States)

    Pietrowski, D; Förster, M

    2000-01-01

    The complete cDNA sequence of a ribonuclease k6 gene of Bos Taurus has been determined. It codes for a protein with 154 amino acids and contains the invariant cysteine, histidine and lysine residues as well as the characteristic motifs specific to ribonuclease active sites. The deduced protein sequence is 27 residues longer than other known ribonucleases k6 and shows amino acids exchanges which could reflect a strain specificity or polymorphism within the bovine genome. Based on sequence similarity we have termed the identified gene bovine ribonuclease k6 b (brk6b).

  17. Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparisons with Other Methods

    International Nuclear Information System (INIS)

    Wu, Liyou; Yi, T.Y.; Van Nostrand, Joy; Zhou, Jizhong

    2010-01-01

    Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site (Hanford Reach of the Columbia River (HRCR), 11 strains), Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.

  18. Phylogenetic Analysis of Shewanella Strains by DNA Relatedness Derived from Whole Genome Microarray DNA-DNA Hybridization and Comparison with Other Methods

    Energy Technology Data Exchange (ETDEWEB)

    Wu, Liyou; Yi, T. Y.; Van Nostrand, Joy; Zhou, Jizhong

    2010-05-17

    Phylogenetic analyses were done for the Shewanella strains isolated from Baltic Sea (38 strains), US DOE Hanford Uranium bioremediation site [Hanford Reach of the Columbia River (HRCR), 11 strains], Pacific Ocean and Hawaiian sediments (8 strains), and strains from other resources (16 strains) with three out group strains, Rhodopseudomonas palustris, Clostridium cellulolyticum, and Thermoanaerobacter ethanolicus X514, using DNA relatedness derived from WCGA-based DNA-DNA hybridizations, sequence similarities of 16S rRNA gene and gyrB gene, and sequence similarities of 6 loci of Shewanella genome selected from a shared gene list of the Shewanella strains with whole genome sequenced based on the average nucleotide identity of them (ANI). The phylogenetic trees based on 16S rRNA and gyrB gene sequences, and DNA relatedness derived from WCGA hybridizations of the tested Shewanella strains share exactly the same sub-clusters with very few exceptions, in which the strains were basically grouped by species. However, the phylogenetic analysis based on DNA relatedness derived from WCGA hybridizations dramatically increased the differentiation resolution at species and strains level within Shewanella genus. When the tree based on DNA relatedness derived from WCGA hybridizations was compared to the tree based on the combined sequences of the selected functional genes (6 loci), we found that the resolutions of both methods are similar, but the clustering of the tree based on DNA relatedness derived from WMGA hybridizations was clearer. These results indicate that WCGA-based DNA-DNA hybridization is an idea alternative of conventional DNA-DNA hybridization methods and it is superior to the phylogenetics methods based on sequence similarities of single genes. Detailed analysis is being performed for the re-classification of the strains examined.

  19. Conformational Analysis of DNA Repair Intermediates by Time-Resolved Fluorescence Spectroscopy

    OpenAIRE

    Lin, Su; Horning, David P.; Szostak, Jack W.; Chaput, John C.

    2009-01-01

    DNA repair enzymes are essential for maintaining the integrity of the DNA sequence. Unfortunately, very little is known about how these enzymes recognize damaged regions along the helix. Structural analysis of cellular repair enzymes bound to DNA reveals that these enzymes are able to recognize DNA in a variety of conformations. However, the prevalence of these deformations in the absence of enzymes remains unclear, as small populations of DNA conformations are often difficult to detect by NM...

  20. POWRS: position-sensitive motif discovery.

    Directory of Open Access Journals (Sweden)

    Ian W Davis

    Full Text Available Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm "POWRS" (POsition-sensitive WoRd Set for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties.BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.

  1. An Optimized DNA Analysis Workflow for the Sampling, Extraction, and Concentration of DNA obtained from Archived Latent Fingerprints.

    Science.gov (United States)

    Solomon, April D; Hytinen, Madison E; McClain, Aryn M; Miller, Marilyn T; Dawson Cruz, Tracey

    2018-01-01

    DNA profiles have been obtained from fingerprints, but there is limited knowledge regarding DNA analysis from archived latent fingerprints-touch DNA "sandwiched" between adhesive and paper. Thus, this study sought to comparatively analyze a variety of collection and analytical methods in an effort to seek an optimized workflow for this specific sample type. Untreated and treated archived latent fingerprints were utilized to compare different biological sampling techniques, swab diluents, DNA extraction systems, DNA concentration practices, and post-amplification purification methods. Archived latent fingerprints disassembled and sampled via direct cutting, followed by DNA extracted using the QIAamp® DNA Investigator Kit, and concentration with Centri-Sep™ columns increased the odds of obtaining an STR profile. Using the recommended DNA workflow, 9 of the 10 samples provided STR profiles, which included 7-100% of the expected STR alleles and two full profiles. Thus, with carefully selected procedures, archived latent fingerprints can be a viable DNA source for criminal investigations including cold/postconviction cases. © 2017 American Academy of Forensic Sciences.

  2. Recovery of DNA for forensic analysis from lip cosmetics.

    Science.gov (United States)

    Webb, L G; Egan, S E; Turbett, G R

    2001-11-01

    To obtain a reference DNA profile from a missing person, we analyzed a variety of personal effects, including two lip cosmetics, both of which gave full DNA profiles. Further investigations were undertaken to explore this previously unreported source of DNA. We have tested a range of brands and types of lip cosmetics. Our studies have revealed that lip cosmetics are an excellent source of DNA, with almost 80% of samples giving a result. However, artifacts are frequently observed in the DNA profiles when Chelex is used for the DNA extraction and additional DNA purification procedures are required to ensure that an accurate DNA profile is obtained.

  3. High-throughput screening of suppression subtractive hybridization cDNA libraries using DNA microarray analysis

    CSIR Research Space (South Africa)

    Van den Berg, N

    2004-11-01

    Full Text Available Efficient construction of cDNA libraries enriched for differentially expressed transcripts is an important first step in many biological investigations. We present a quantitative procedure for screening cDNA libraries constructed by suppression...

  4. DNA analysis in the case of Kaspar Hauser.

    Science.gov (United States)

    Weichhold, G M; Bark, J E; Korte, W; Eisenmenger, W; Sullivan, K M

    1998-01-01

    In 1828 a mysterious young man appeared in Nürnberg, Germany, who was barely able to speak or walk but could write down his name, Kaspar Hauser. He quickly became the centre of social interest but also the victim of intrigue. His appearance, his origin and assassination in 1833 were, and still are, the source of much debate. The most widely accepted theory postulates that Kaspar Hauser was the son of Grand Duke Carl von Baden and his wife Stephanie de Beauharnais, an adopted daughter of Napoleon Bonaparte. To check this theory, DNA analysis was performed on the clothes most likely worn by Kaspar Hauser when he was stabbed on December 14th, 1833. A suitable bloodstain from the underpants was divided and analysed independently by the Institute of Legal Medicine, University of Munich (ILM) and the Forensic Science Service Laboratory, Birmingham (FSS). Mitochondrial DNA (mtDNA) was sequenced from the bloodstain and from blood samples obtained from two living maternal relatives of Stephanie de Beauharnais. The sequence from the bloodstained clothing differed from the sequence found in both reference blood samples at seven confirmed positions. This proves that the bloodstain does not originate from a son of Stephanie de Beauharnais. Thus, it is becoming clear that Kaspar Hauser was not the Prince of Baden.

  5. Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.

    Science.gov (United States)

    Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D

    2003-08-15

    DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.

  6. DNA nanotechnology: On-command molecular Trojans

    Science.gov (United States)

    Niemeyer, Christof M.

    2017-12-01

    Lipid-motif-decorated DNA nanocapsules filled with photoresponsive polymers are capable of delivering signalling molecules into target organisms for biological perturbations at high spatiotemporal resolution.

  7. Analysis of DNA methylation in various swine tissues.

    Directory of Open Access Journals (Sweden)

    Chun Yang

    Full Text Available DNA methylation is known to play an important role in regulating gene expression during biological development and tissue differentiation in eukaryotes. In this study, we used the fluorescence-labeled methylation-sensitive amplified polymorphism (F-MSAP method to assess the extent and pattern of cytosine methylation in muscle, heart, liver, spleen, lung, kidney and stomach from the swine strain Laiwu, and we also examined specific methylation patterns in the seven tissues. In total, 96,371 fragments, each representing a recognition site cleaved by either or both EcoRI + HpaII and EcoRI + MspI, the HpaII and MspI are isoschizomeric enzymes, were amplified using 16 pairs of selective primers. A total of 50,094 sites were found to be methylated at cytosines in seven tissues. The incidence of DNA methylation was approximately 53.99% in muscle, 51.24% in the heart, 50.18% in the liver, 53.31% in the spleen, 51.97% in the lung, 51.15% in the kidney and 53.39% in the stomach, as revealed by the incidence of differential digestion. Additionally, differences in DNA methylation levels imply that such variations may be related to specific gene expression during tissue differentiation, growth and development. Three types of bands were generated in the F-MSAP profile, the total numbers of these three types of bands in the seven tissues were 46,277, 24,801 and 25,293, respectively.In addition, different methylation patterns were observed in seven tissues from pig, and almost all of the methylation patterns detected by F-MSAP could be confirmed by Southern analysis using the isolated amplified fragments as probes. The results clearly demonstrated that the F-MSAP technique can be adapted for use in large-scale DNA methylation detection in the pig genome.

  8. Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

    Science.gov (United States)

    Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

    2008-02-15

    KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database-implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.

  9. DNA microarray analysis of fim mutations in Escherichia coli

    DEFF Research Database (Denmark)

    Schembri, Mark; Ussery, David; Workman, Christopher

    2002-01-01

    Bacterial adhesion is often mediated by complex polymeric surface structures referred to as fimbriae. Type I fimbriae of Escherichia coli represent the archetypical and best characterised fimbrial system. These adhesive organelles mediate binding to D-mannose and are directly associated...... we have used DNA microarray analysis to examine the molecular events involved in response to fimbrial gene expression in E. coli K-12. Observed differential expression levels of the fim genes were in good agreement with our current knowledge of the stoichiometry of type I fimbriae. Changes in fim...

  10. Stochastic filtering of quantitative data from STR DNA analysis

    DEFF Research Database (Denmark)

    Tvedebrink, Torben; Eriksen, Poul Svante; Mogensen, Helle Smidt

    due to the apparatus used for measurements). Pull-up effects (more systematic increase caused by overlap in the spectrum) Stutters (peaks located four basepairs before the true peak). We present filtering techniques for all three technical artifacts based on statistical analysis of data from......The quantitative data observed from analysing STR DNA is a mixture of contributions from various sources. Apart from the true allelic peaks, the observed signal consists of at least three components resulting from the measurement technique and the PCR amplification: Background noise (random noise...... controlled experiments conducted at The Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health Sciences, Universityof Copenhagen, Denmark....

  11. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

    Science.gov (United States)

    Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

    2018-03-01

    Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.

  12. Motif signatures of transcribed enhancers

    KAUST Repository

    Kleftogiannis, Dimitrios

    2017-09-14

    In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancers genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow discriminating successfully a) TrEn from random controls, proxy of non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.

  13. Sequence analysis of mitochondrial DNA hypervariable region III of ...

    African Journals Online (AJOL)

    Aghomotsegin

    2015-07-01

    Jul 1, 2015 ... population genetics research, studies based on mitochondrial DNA (mtDNA) and Y-chromosome DNA are an excellent way of illustrating population structure .... avoid landing investigators into serious situations of medical genetic privacy and ethnics, especially for. mtDNA coding area whose mutation often ...

  14. Distinct repeat motifs at the C-terminal region of CagA of Helicobacter pylori strains isolated from diseased patients and asymptomatic individuals in West Bengal, India

    Directory of Open Access Journals (Sweden)

    Chattopadhyay Santanu

    2012-05-01

    Full Text Available Abstract Background Infection with Helicobacter pylori strains that express CagA is associated with gastritis, peptic ulcer disease, and gastric adenocarcinoma. The biological function of CagA depends on tyrosine phosphorylation by a cellular kinase. The phosphate acceptor tyrosine moiety is present within the EPIYA motif at the C-terminal region of the protein. This region is highly polymorphic due to variations in the number of EPIYA motifs and the polymorphism found in spacer regions among EPIYA motifs. The aim of this study was to analyze the polymorphism at the C-terminal end of CagA and to evaluate its association with the clinical status of the host in West Bengal, India. Results Seventy-seven H. pylori strains isolated from patients with various clinical statuses were used to characterize the C-ternimal polymorphic region of CagA. Our analysis showed that there is no correlation between the previously described CagA types and various disease outcomes in Indian context. Further analyses of different CagA structures revealed that the repeat units in the spacer sequences within the EPIYA motifs are actually more discrete than the previously proposed models of CagA variants. Conclusion Our analyses suggest that EPIYA motifs as well as the spacer sequence units are present as distinct insertions and deletions, which possibly have arisen from extensive recombination events. Moreover, we have identified several new CagA types, which could not be typed by the existing systems and therefore, we have proposed a new typing system. We hypothesize that a cagA gene encoding higher number EPIYA motifs may perhaps have arisen from cagA genes that encode lesser EPIYA motifs by acquisition of DNA segments through recombination events.

  15. Alkaline Extraction of DNA from Pathogenic Fungi for PCR-RFLP Analysis

    OpenAIRE

    Matsumoto, Masaru; Mishima, Shinobu; Matsuyama, Nobuaki; 松元, 賢; 松山, 宣明

    1997-01-01

    For the preparation of DNA samples from fungal mycelia alkaline extraction method was applied and assessed its usefulness for PCR-RFLP analysis. Using alkaline treatment protocols, 18S ribosomal DNAs (rDNA) derived from fungal genomic DNA of Pyricularia oryzae, P. zingiberi, Rhizoctonia solani and R. oryzae were PCR-amplified and digested with Hha I, Msp I and Hae ill. RFLP analysis with HhaI showed the divergent polymorphism between genus Pyricularia and Rhizoctonia. The alkaline DNA extract...

  16. Comparative analysis of the end-joining activity of several DNA ligases.

    Directory of Open Access Journals (Sweden)

    Robert J Bauer

    Full Text Available DNA ligases catalyze the repair of phosphate backbone breaks in DNA, acting with highest activity on breaks in one strand of duplex DNA. Some DNA ligases have also been observed to ligate two DNA fragments with short complementary overhangs or blunt-ended termini. In this study, several wild-type DNA ligases (phage T3, T4, and T7 DNA ligases, Paramecium bursaria chlorella virus 1 (PBCV1 DNA ligase, human DNA ligase 3, and Escherichia coli DNA ligase were tested for their ability to ligate DNA fragments with several difficult to ligate end structures (blunt-ended termini, 3'- and 5'- single base overhangs, and 5'-two base overhangs. This analysis revealed that T4 DNA ligase, the most common enzyme utilized for in vitro ligation, had its greatest activity on blunt- and 2-base overhangs, and poorest on 5'-single base overhangs. Other ligases had different substrate specificity: T3 DNA ligase ligated only blunt ends well; PBCV1 DNA ligase joined 3'-single base overhangs and 2-base overhangs effectively with little blunt or 5'- single base overhang activity; and human ligase 3 had highest activity on blunt ends and 5'-single base overhangs. There is no correlation of activity among ligases on blunt DNA ends with their activity on single base overhangs. In addition, DNA binding domains (Sso7d, hLig3 zinc finger, and T4 DNA ligase N-terminal domain were fused to PBCV1 DNA ligase to explore whether modified binding to DNA would lead to greater activity on these difficult to ligate substrates. These engineered ligases showed both an increased binding affinity for DNA and increased activity, but did not alter the relative substrate preferences of PBCV1 DNA ligase, indicating active site structure plays a role in determining substrate preference.

  17. DNA flow cytometric analysis in variable types of hydropic placentas

    Directory of Open Access Journals (Sweden)

    Fatemeh Atabaki pasdar

    2015-05-01

    Full Text Available Background: Differential diagnosis between complete hydatidiform mole, partial hydatidiform mole and hydropic abortion, known as hydropic placentas is still a challenge for pathologists but it is very important for patient management. Objective: We analyzed the nuclear DNA content of various types of hydropic placentas by flowcytometry. Materials and Methods: DNA ploidy analysis was performed in 20 non-molar (hydropic and non-hydropic spontaneous abortions and 20 molar (complete and partial moles, formalin-fixed, paraffin-embedded tissue samples by flow cytometry. The criteria for selection were based on the histopathologic diagnosis. Results: Of 10 cases histologically diagnosed as complete hydatiform mole, 9 cases yielded diploid histograms, and 1 case was tetraploid. Of 10 partial hydatidiform moles, 8 were triploid and 2 were diploid. All of 20 cases diagnosed as spontaneous abortions (hydropic and non-hydropic yielded diploid histograms. Conclusion: These findings signify the importance of the combined use of conventional histology and ploidy analysis in the differential diagnosis of complete hydatidiform mole, partial hydatidiform mole and hydropic abortion.

  18. DNA Microarray Data Analysis: A Novel Biclustering Algorithm Approach

    Directory of Open Access Journals (Sweden)

    Tewfik Ahmed H

    2006-01-01

    Full Text Available Biclustering algorithms refer to a distinct class of clustering algorithms that perform simultaneous row-column clustering. Biclustering problems arise in DNA microarray data analysis, collaborative filtering, market research, information retrieval, text mining, electoral trends, exchange analysis, and so forth. When dealing with DNA microarray experimental data for example, the goal of biclustering algorithms is to find submatrices, that is, subgroups of genes and subgroups of conditions, where the genes exhibit highly correlated activities for every condition. In this study, we develop novel biclustering algorithms using basic linear algebra and arithmetic tools. The proposed biclustering algorithms can be used to search for all biclusters with constant values, biclusters with constant values on rows, biclusters with constant values on columns, and biclusters with coherent values from a set of data in a timely manner and without solving any optimization problem. We also show how one of the proposed biclustering algorithms can be adapted to identify biclusters with coherent evolution. The algorithms developed in this study discover all valid biclusters of each type, while almost all previous biclustering approaches will miss some.

  19. Cluster analysis for DNA methylation profiles having a detection threshold

    Directory of Open Access Journals (Sweden)

    Siegmund Kimberly D

    2006-07-01

    Full Text Available Abstract Background DNA methylation, a molecular feature used to investigate tumor heterogeneity, can be measured on many genomic regions using the MethyLight technology. Due to the combination of the underlying biology of DNA methylation and the MethyLight technology, the measurements, while being generated on a continuous scale, have a large number of 0 values. This suggests that conventional clustering methodology may not perform well on this data. Results We compare performance of existing methodology (such as k-means with two novel methods that explicitly allow for the preponderance of values at 0. We also consider how the ability to successfully cluster such data depends upon the number of informative genes for which methylation is measured and the correlation structure of the methylation values for those genes. We show that when data is collected for a sufficient number of genes, our models do improve clustering performance compared to methods, such as k-means, that do not explicitly respect the supposed biological realities of the situation. Conclusion The performance of analysis methods depends upon how well the assumptions of those methods reflect the properties of the data being analyzed. Differing technologies will lead to data with differing properties, and should therefore be analyzed differently. Consequently, it is prudent to give thought to what the properties of the data are likely to be, and which analysis method might therefore be likely to best capture those properties.

  20. A multilevel Lab on chip platform for DNA analysis.

    Science.gov (United States)

    Marasso, Simone Luigi; Giuri, Eros; Canavese, Giancarlo; Castagna, Riccardo; Quaglio, Marzia; Ferrante, Ivan; Perrone, Denis; Cocuzza, Matteo

    2011-02-01

    Lab-on-chips (LOCs) are critical systems that have been introduced to speed up and reduce the cost of traditional, laborious and extensive analyses in biological and biomedical fields. These ambitious and challenging issues ask for multi-disciplinary competences that range from engineering to biology. Starting from the aim to integrate microarray technology and microfluidic devices, a complex multilevel analysis platform has been designed, fabricated and tested (All rights reserved-IT Patent number TO2009A000915). This LOC successfully manages to interface microfluidic channels with standard DNA microarray glass slides, in order to implement a complete biological protocol. Typical Micro Electro Mechanical Systems (MEMS) materials and process technologies were employed. A silicon/glass microfluidic chip and a Polydimethylsiloxane (PDMS) reaction chamber were fabricated and interfaced with a standard microarray glass slide. In order to have a high disposable system all micro-elements were passive and an external apparatus provided fluidic driving and thermal control. The major microfluidic and handling problems were investigated and innovative solutions were found. Finally, an entirely automated DNA hybridization protocol was successfully tested with a significant reduction in analysis time and reagent consumption with respect to a conventional protocol.

  1. DNA methylation analysis from saliva samples for epidemiological studies.

    Science.gov (United States)

    Nishitani, Shota; Parets, Sasha E; Haas, Brian W; Smith, Alicia K

    2018-06-18

    Saliva is a non-invasive, easily accessible tissue, which is regularly collected in large epidemiological studies to examine genetic questions. Recently, it is becoming more common to use saliva to assess DNA methylation. However, DNA extracted from saliva is a mixture of both bacterial and human DNA derived from epithelial and immune cells in the mouth. Thus, there are unique challenges to using salivary DNA in methylation studies that can influence data quality. This study assesses: (1) quantification of human DNA after extraction; (2) delineation of human and bacterial DNA; (3) bisulfite conversion (BSC); (4) quantification of BSC DNA; (5) PCR amplification of BSC DNA from saliva and; (6) quantitation of DNA methylation with a targeted assay. The framework proposed will allow saliva samples to be more widely used in targeted epigenetic studies.

  2. Role of Helicobacter pylori cagA EPIYA motif and vacA genotypes for the development of gastrointestinal diseases in Southeast Asian countries: a meta-analysis

    Directory of Open Access Journals (Sweden)

    Sahara Shu

    2012-09-01

    Full Text Available Abstract Background Infection with cagA-positive, cagA EPIYA motif ABD type, and vacA s1, m1, and i1 genotype strains of Helicobacter pylori is associated with an exacerbated inflammatory response and increased risk of gastroduodenal diseases. However, it is unclear whether the prevalence and virulence factor genotypes found in Southeast Asia are similar to those in Western countries. Here, we examined the cagA status and prevalence of cagA EPIYA motifs and vacA genotypes among H. pylori strains found in Southeast Asia and examined their association with gastroduodenal disease. Methods To determine the cagA status, cagA EPIYA motifs, and vacA genotypes of H. pylori, we conducted meta-analyses of 13 previous reports for 1,281 H. pylori strains detected from several Southeast Asian countries. Results The respective frequencies of cagA-positive and vacA s1, m1, and i1 genotypes among examined subjects were 93% (1,056/1,133, 98% (1,010/1,033, 58% (581/1,009, and 96% (248/259, respectively. Stratification showed significant variation in the frequencies of cagA status and vacA genotypes among countries and the individual races residing within each respective country. The frequency of the vacA m-region genotype in patients infected with East Asian-type strains differed significantly between the northern and southern areas of Vietnam (p vacA m1 type or cagA-positive strains was associated with an increased risk of peptic ulcer disease (odds ratio: 1.46, 95%CI: 1.01-2.12, p = 0.046 and 2.83, 1.50-5.34, p = 0.001, respectively in the examined Southeast Asian populations. Conclusions Both Western- and East Asian-type strains of H. pylori are found in Southeast Asia and are predominantly cagA-positive and vacA s1 type. In Southeast Asia, patients infected with vacA m1 type or cagA-positive strains have an increased risk of peptic ulcer disease. Thus, testing for this genotype and the presence of cagA may have clinical usefulness.

  3. Analysis of the DNA-Binding Activities of the Arabidopsis R2R3-MYB Transcription Factor Family by One-Hybrid Experiments in Yeast.

    Directory of Open Access Journals (Sweden)

    Zsolt Kelemen

    Full Text Available The control of growth and development of all living organisms is a complex and dynamic process that requires the harmonious expression of numerous genes. Gene expression is mainly controlled by the activity of sequence-specific DNA binding proteins called transcription factors (TFs. Amongst the various classes of eukaryotic TFs, the MYB superfamily is one of the largest and most diverse, and it has considerably expanded in the plant kingdom. R2R3-MYBs have been extensively studied over the last 15 years. However, DNA-binding specificity has been characterized for only a small subset of these proteins. Therefore, one of the remaining challenges is the exhaustive characterization of the DNA-binding specificity of all R2R3-MYB proteins. In this study, we have developed a library of Arabidopsis thaliana R2R3-MYB open reading frames, whose DNA-binding activities were assayed in vivo (yeast one-hybrid experiments with a pool of selected cis-regulatory elements. Altogether 1904 interactions were assayed leading to the discovery of specific patterns of interactions between the various R2R3-MYB subgroups and their DNA target sequences and to the identification of key features that govern these interactions. The present work provides a comprehensive in vivo analysis of R2R3-MYB binding activities that should help in predicting new DNA motifs and identifying new putative target genes for each member of this very large family of TFs. In a broader perspective, the generated data will help to better understand how TF interact with their target DNA sequences.

  4. DNA Source Selection for Downstream Applications Based on DNA Quality Indicators Analysis

    Science.gov (United States)

    Lucena-Aguilar, Gema; Sánchez-López, Ana María; Barberán-Aceituno, Cristina; Carrillo-Ávila, José Antonio; López-Guerrero, José Antonio

    2016-01-01

    High-quality human DNA samples and associated information of individuals are necessary for biomedical research. Biobanks act as a support infrastructure for the scientific community by providing a large number of high-quality biological samples for specific downstream applications. For this purpose, biobank methods for sample preparation must ensure the usefulness and long-term functionality of the products obtained. Quality indicators are the tool to measure these parameters, the purity and integrity determination being those specifically used for DNA. This study analyzes the quality indicators in DNA samples derived from 118 frozen human tissues in optimal cutting temperature (OCT) reactive, 68 formalin-fixed paraffin-embedded (FFPE) tissues, 119 frozen blood samples, and 26 saliva samples. The results obtained for DNA quality are discussed in association with the usefulness for downstream applications and availability of the DNA source in the target study. In brief, if any material is valid, blood is the most approachable option of prospective collection of samples providing high-quality DNA. However, if diseased tissue is a requisite or samples are available, the recommended source of DNA would be frozen tissue. These conclusions will determine the best source of DNA, according to the planned downstream application. Furthermore our results support the conclusion that a complete procedure of DNA quantification and qualification is necessary to guarantee the appropriate management of the samples, avoiding low confidence results, high costs, and a waste of samples. PMID:27158753

  5. Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

    Directory of Open Access Journals (Sweden)

    Launey Thomas

    2011-06-01

    Full Text Available Abstract Background The interactions between PDZ (PSD-95, Dlg, ZO-1 domains and PDZ-binding motifs play central roles in signal transductions within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C- terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins, at genome-level. Results Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V or type-II (x-x-V-x-I/V PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode. We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.

  6. Selection against spurious promoter motifs correlates withtranslational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the {sigma}{sup 70} subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.

  7. Structure-Based Analysis of Toxoplasma gondii Profilin: A Parasite-Specific Motif Is Required for Recognition by Toll-Like Receptor 11

    Energy Technology Data Exchange (ETDEWEB)

    K Kucera; A Koblansky; L Saunders; K Frederick; E De La Cruz; S Ghosh; Y Modis

    2011-12-31

    Profilins promote actin polymerization by exchanging ADP for ATP on monomeric actin and delivering ATP-actin to growing filament barbed ends. Apicomplexan protozoa such as Toxoplasma gondii invade host cells using an actin-dependent gliding motility. Toll-like receptor (TLR) 11 generates an innate immune response upon sensing T. gondii profilin (TgPRF). The crystal structure of TgPRF reveals a parasite-specific surface motif consisting of an acidic loop, followed by a long {beta}-hairpin. A series of structure-based profilin mutants show that TLR11 recognition of the acidic loop is responsible for most of the interleukin (IL)-12 secretion response to TgPRF in peritoneal macrophages. Deletion of both the acidic loop and the {beta}-hairpin completely abrogates IL-12 secretion. Insertion of the T. gondii acidic loop and {beta}-hairpin into yeast profilin is sufficient to generate TLR11-dependent signaling. Substitution of the acidic loop in TgPRF with the homologous loop from the apicomplexan parasite Cryptosporidium parvum does not affect TLR11-dependent IL-12 secretion, while substitution with the acidic loop from Plasmodium falciparum results in reduced but significant IL-12 secretion. We conclude that the parasite-specific motif in TgPRF is the key molecular pattern recognized by TLR11. Unlike other profilins, TgPRF slows nucleotide exchange on monomeric rabbit actin and binds rabbit actin weakly. The putative TgPRF actin-binding surface includes the {beta}-hairpin and diverges widely from the actin-binding surfaces of vertebrate profilins.

  8. Photosensitized UVA-Induced Cross-Linking between Human DNA Repair and Replication Proteins and DNA Revealed by Proteomic Analysis

    Science.gov (United States)

    2016-01-01

    Long wavelength ultraviolet radiation (UVA, 320–400 nm) interacts with chromophores present in human cells to induce reactive oxygen species (ROS) that damage both DNA and proteins. ROS levels are amplified, and the damaging effects of UVA are exacerbated if the cells are irradiated in the presence of UVA photosensitizers such as 6-thioguanine (6-TG), a strong UVA chromophore that is extensively incorporated into the DNA of dividing cells, or the fluoroquinolone antibiotic ciprofloxacin. Both DNA-embedded 6-TG and ciprofloxacin combine synergistically with UVA to generate high levels of ROS. Importantly, the extensive protein damage induced by these photosensitizer+UVA combinations inhibits DNA repair. DNA is maintained in intimate contact with the proteins that effect its replication, transcription, and repair, and DNA–protein cross-links (DPCs) are a recognized reaction product of ROS. Cross-linking of DNA metabolizing proteins would compromise these processes by introducing physical blocks and by depleting active proteins. We describe a sensitive and statistically rigorous method to analyze DPCs in cultured human cells. Application of this proteomics-based analysis to cells treated with 6-TG+UVA and ciprofloxacin+UVA identified proteins involved in DNA repair, replication, and gene expression among those most vulnerable to cross-linking under oxidative conditions. PMID:27654267

  9. Multi-color fluorescent DNA analysis in an integrated optofluidic lab-on-a-chip

    NARCIS (Netherlands)

    Dongre, C.; van Weerd, J.; van Weeghel, R.; Martinez-Vazquez, R.; Osellame, R.; Cerullo, G.; Besselink, G.A.J.; van den Vlekkert, H.H.; Hoekstra, Hugo; Pollnau, Markus

    Sorting and sizing of DNA molecules within the human genome project has enabled the genetic mapping of various illnesses. By employing tiny lab-on-a-chip devices for such DNA analysis, integrated DNA sequencing and genetic diagnostics have become feasible. However, such diagnostic chips typically

  10. DNA adducts and cancer risk in prospective studies: a pooled analysis and a meta-analysis

    DEFF Research Database (Denmark)

    Veglia, Fabrizio; Loft, Steffen; Matullo, Giuseppe

    2008-01-01

    in which bulky DNA adducts have been measured in blood samples collected from healthy subjects (N = 1947; average follow-up 51-137 months). In addition, we have performed a meta-analysis by identifying all articles on the same subject published up to the end of 2006, including case-control studies......). The association was evident only in current smokers and was absent in former smokers. Also the meta-analysis, which included both lung and bladder cancers, showed a statistically significant association in current smokers, whereas the results in never smokers were equivocal; in former smokers, no association......Bulky DNA adducts are biomarkers of exposure to aromatic compounds and of the ability of the individual to metabolically activate carcinogens and to repair DNA damage. Their ability to predict cancer onset is uncertain. We have performed a pooled analysis of three prospective studies on cancer risk...

  11. Analysis of T-DNA/Host-Plant DNA Junction Sequences in Single-Copy Transgenic Barley Lines

    Directory of Open Access Journals (Sweden)

    Joanne G. Bartlett

    2014-01-01

    Full Text Available Sequencing across the junction between an integrated transfer DNA (T-DNA and a host plant genome provides two important pieces of information. The junctions themselves provide information regarding the proportion of T-DNA which has integrated into the host plant genome, whilst the transgene flanking sequences can be used to study the local genetic environment of the integrated transgene. In addition, this information is important in the safety assessment of GM crops and essential for GM traceability. In this study, a detailed analysis was carried out on the right-border T-DNA junction sequences of single-copy independent transgenic barley lines. T-DNA truncations at the right-border were found to be relatively common and affected 33.3% of the lines. In addition, 14.3% of lines had rearranged construct sequence after the right border break-point. An in depth analysis of the host-plant flanking sequences revealed that a significant proportion of the T-DNAs integrated into or close to known repetitive elements. However, this integration into repetitive DNA did not have a negative effect on transgene expression.

  12. Sequence analysis of mitochondrial DNA hypervariable region III of ...

    African Journals Online (AJOL)

    The aims of this research were to study mitochondrial DNA hypervariable region III and establish the degree of variation characteristic of a fragment. The mitochondrial DNA (mtDNA) is a small circular genome located within the mitochondria in the cytoplasm of the cell and a smaller 1.2 kb pair fragment, called the control ...

  13. Genetic Approaches to Appearance and Ancestry : Improving Forensic DNA Analysis

    NARCIS (Netherlands)

    L.C. Chaitanya (Lakshmi)

    2016-01-01

    textabstractTraditionally, routine forensic casework is based on comparative grounds. DNA profiles obtained from crime-scenes are compared with those of potential suspects or DNA profiles deposited in forensic DNA databases. The principal limitation of such comparative approach is that trace

  14. Coincident In Vitro Analysis of DNA-PK-Dependent and -Independent Nonhomologous End Joining

    Directory of Open Access Journals (Sweden)

    Cynthia L. Hendrickson

    2010-01-01

    Full Text Available In mammalian cells, DNA double-strand breaks (DSBs are primarily repaired by nonhomologous end joining (NHEJ. The current model suggests that the Ku 70/80 heterodimer binds to DSB ends and recruits DNA-PKcs to form the active DNA-dependent protein kinase, DNA-PK. Subsequently, XRCC4, DNA ligase IV, XLF and most likely, other unidentified components participate in the final DSB ligation step. Therefore, DNA-PK plays a key role in NHEJ due to its structural and regulatory functions that mediate DSB end joining. However, recent studies show that additional DNA-PK-independent NHEJ pathways also exist. Unfortunately, the presence of DNA-PKcs appears to inhibit DNA-PK-independent NHEJ, and in vitro analysis of DNA-PK-independent NHEJ in the presence of the DNA-PKcs protein remains problematic. We have developed an in vitro assay that is preferentially active for DNA-PK-independent DSB repair based solely on its reaction conditions, facilitating coincident differential biochemical analysis of the two pathways. The results indicate the biochemically distinct nature of the end-joining mechanisms represented by the DNA-PK-dependent and -independent NHEJ assays as well as functional differences between the two pathways.

  15. Genetic alterations of hepatocellular carcinoma by random amplified polymorphic DNA analysis and cloning sequencing of tumor differential DNA fragment

    Science.gov (United States)

    Xian, Zhi-Hong; Cong, Wen-Ming; Zhang, Shu-Hui; Wu, Meng-Chao

    2005-01-01

    AIM: To study the genetic alterations and their association with clinicopathological characteristics of hepatocellular carcinoma (HCC), and to find the tumor related DNA fragments. METHODS: DNA isolated from tumors and corresponding noncancerous liver tissues of 56 HCC patients was amplified by random amplified polymorphic DNA (RAPD) with 10 random 10-mer arbitrary primers. The RAPD bands showing obvious differences in tumor tissue DNA corresponding to that of normal tissue were separated, purified, cloned and sequenced. DNA sequences were analyzed and compared with GenBank data. RESULTS: A total of 56 cases of HCC were demonstrated to have genetic alterations, which were detected by at least one primer. The detestability of genetic alterations ranged from 20% to 70% in each case, and 17.9% to 50% in each primer. Serum HBV infection, tumor size, histological grade, tumor capsule, as well as tumor intrahepatic metastasis, might be correlated with genetic alterations on certain primers. A band with a higher intensity of 480 bp or so amplified fragments in tumor DNA relative to normal DNA could be seen in 27 of 56 tumor samples using primer 4. Sequence analysis of these fragments showed 91% homology with Homo sapiens double homeobox protein DUX10 gene. CONCLUSION: Genetic alterations are a frequent event in HCC, and tumor related DNA fragments have been found in this study, which may be associated with hepatocarcin-ogenesis. RAPD is an effective method for the identification and analysis of genetic alterations in HCC, and may provide new information for further evaluating the molecular mechanism of hepatocarcinogenesis. PMID:15996039

  16. Cytometric analysis of shape and DNA content in mammalian sperm

    International Nuclear Information System (INIS)

    Gledhill, B.L.

    1983-01-01

    Male germ cells respond dramatically to a variety of insults and are important reproductive dosimeters. Semen analyses are very useful in studies on the effects of drugs, chemicals, and environmental hazards on testicular function, male fertility and heritable germinal mutations. Sperm were analyzed by flow cytometry and slit-scan flow analysis for injury following the exposure of testes to mutagens. The utility of flow cytometry in genotoxin screening and monitoring of occupational exposure was evaluated. The technique proved valuable in separation of X- and Y-chromosome bearing sperm and the potential applicability of this technique in artificial insemination and a solution, of accurately assessing the DNA content of sperm were evaluated-with reference to determination of X- and Y-chromosome bearing sperm

  17. Cytometric analysis of shape and DNA content in mammalian sperm

    Energy Technology Data Exchange (ETDEWEB)

    Gledhill, B.L.

    1983-10-10

    Male germ cells respond dramatically to a variety of insults and are important reproductive dosimeters. Semen analyses are very useful in studies on the effects of drugs, chemicals, and environmental hazards on testicular function, male fertility and heritable germinal mutations. Sperm were analyzed by flow cytometry and slit-scan flow analysis for injury following the exposure of testes to mutagens. The utility of flow cytometry in genotoxin screening and monitoring of occupational exposure was evaluated. The technique proved valuable in separation of X- and Y-chromosome bearing sperm and the potential applicability of this technique in artificial insemination and a solution, of accurately assessing the DNA content of sperm were evaluated-with reference to determination of X- and Y-chromosome bearing sperm.

  18. DNA analysis in three populations of African spinach (Basella spp.)

    International Nuclear Information System (INIS)

    Grasso, G.; Van Duren, M.; Lee, K.S.; Morpurgo, R.

    1997-01-01

    African spinach (Basella spp.) is an important vegetable in West Africa, and was introduced by early colonialists. Its alien origin is supported by its narrow genetic variability. Flowcytometry and RAPD polymorphism were used to investigate genetic variation in three populations of Basella - 'Congo native', 'Cong domesticated', and an introduced cultivar, 'Sri Lanka' from Sri Lanka. Normal spinach (Spinacia oleracea) cv. 'Prince F 1 Hybrid' was used to test sensitivity and to verify detection of genetic variation. Nuclei were isolated from young leaves of Basella, stained with DAPI and ethidium bromide, and ploidy level and total DNA content were determined by using a flowcytometer. The two sexually propagated populations, 'Cong domesticated' and 'Sri Lanka' showed very low amount of genetic variation as revealed by RAPD analysis; the third population 'Congo native' showed a limited amount of polymorphism. (author). 8 refs, 1 fig., 2 tabs

  19. DNA analysis in three populations of African spinach (Basella spp.)

    Energy Technology Data Exchange (ETDEWEB)

    Grasso, G; Van Duren, M; Lee, K S; Morpurgo, R [Agriculture and Biotechnology Lab., International Atomic Energy Agency, Seiberdorf (Austria)

    1997-07-01

    African spinach (Basella spp.) is an important vegetable in West Africa, and was introduced by early colonialists. Its alien origin is supported by its narrow genetic variability. Flowcytometry and RAPD polymorphism were used to investigate genetic variation in three populations of Basella - `Congo native`, `Cong domesticated`, and an introduced cultivar, `Sri Lanka` from Sri Lanka. Normal spinach (Spinacia oleracea) cv. `Prince F{sub 1} Hybrid` was used to test sensitivity and to verify detection of genetic variation. Nuclei were isolated from young leaves of Basella, stained with DAPI and ethidium bromide, and ploidy level and total DNA content were determined by using a flowcytometer. The two sexually propagated populations, `Cong domesticated` and `Sri Lanka` showed very low amount of genetic variation as revealed by RAPD analysis; the third population `Congo native` showed a limited amount of polymorphism. (author). 8 refs, 1 fig., 2 tabs.

  20. RAPD analysis of alfalfa DNA mutation via N+ implantation

    International Nuclear Information System (INIS)

    Li Yufeng; Huang Qunce; Yu Zengliang; Liang Yunzhang

    2003-01-01

    Germination capacity of alfalfa seeds under low energy N + implantation manifests oscillations going down with dose strength. From analyzing alfalfa genome DNA under low energy N + implantation by RAPD (Random Amplified Polymorphous DNA), it is recommended that 30 polymorphic DNA fragments be amplified with 8 primers in total 100 primers, and fluorescence intensity of the identical DNA fragment amplified by RAPD is different between CK and treatments. Number of different polymorphic DNA fragments between treatment and CK via N + implantation manifests going up with dose strength

  1. Polymorphism and mutation analysis of genomic DNA on cancer

    International Nuclear Information System (INIS)

    Ohta, Tsutomu

    2003-01-01

    DNA repair is a universal process in living cells that maintains the structural integrity of chromosomal DNA molecules in face of damage. A deficiency in DNA damage repair is associated with an increased cancer risk by increasing a mutation frequency of cancer-related genes. Variation in DNA repair capacity may be genetically determined. Therefore, we searched single-nucleotide polymorphisms (SNPs) in major DNA repair genes. This led to the finding of 600 SNPs and mutations including many novel SNPs in Japanese population. Case-control studies to explore the contribution of the SNPs in DNA repair genes to the risk of lung cancer revealed that five SNPs are associated with lung carcinogenesis. One of these SNPs is found in RAD54L gene, which is involved in double-strand DNA repair. We analyzed and reported activities of Rad54L protein with SNP and mutations. (authors)

  2. A putative peroxidase cDNA from turnip and analysis of the encoded protein sequence.

    Science.gov (United States)

    Romero-Gómez, S; Duarte-Vázquez, M A; García-Almendárez, B E; Mayorga-Martínez, L; Cervantes-Avilés, O; Regalado, C

    2008-12-01

    A putative peroxidase cDNA was isolated from turnip roots (Brassica napus L. var. purple top white globe) by reverse transcriptase-polymerase chain reaction (RT-PCR) and rapid amplification of cDNA ends (RACE). Total RNA extracted from mature turnip roots was used as a template for RT-PCR, using a degenerated primer designed to amplify the highly conserved distal motif of plant peroxidases. The resulting partial sequence was used to design the rest of the specific primers for 5' and 3' RACE. Two cDNA fragments were purified, sequenced, and aligned with the partial sequence from RT-PCR, and a complete overlapping sequence was obtained and labeled as BbPA (Genbank Accession No. AY423440, named as podC). The full length cDNA is 1167bp long and contains a 1077bp open reading frame (ORF) encoding a 358 deduced amino acid peroxidase polypeptide. The putative peroxidase (BnPA) showed a calculated Mr of 34kDa, and isoelectric point (pI) of 4.5, with no significant identity with other reported turnip peroxidases. Sequence alignment showed that only three peroxidases have a significant identity with BnPA namely AtP29a (84%), and AtPA2 (81%) from Arabidopsis thaliana, and HRPA2 (82%) from horseradish (Armoracia rusticana). Work is in progress to clone this gene into an adequate host to study the specific role and possible biotechnological applications of this alternative peroxidase source.

  3. Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2015-06-01

    Full Text Available ABSTRAK Batik Jember selama ini identik dengan motif daun tembakau. Visualisasi daun tembakau dalam motif Batik Jember cukup lemah, yaitu kurang berkarakter karena motif yang muncul adalah seperti gambar daun pada umumnya. Oleh karena itu perlu diciptakan desain motif batik khas Jember yang sumber inspirasinya digali dari kekayaan alam lainnya dari Jember yang mempunyai bentuk spesifik dan karakteristik sehingga identitas motif bisa didapatkan dengan lebih kuat. Hasil alam khas Jember tersebut adalah kopi dan kakao. Tujuan penciptaan seni ini adalah untuk menghasilkan motif batik  baru yang mempunyai ciri khas Jember. Metode yang digunakan yaitu pengumpulan data, pengamatan mendalam terhadap objek penciptaan, pengkajian sumber inspirasi, pembuatan desain motif, dan perwujudan menjadi batik. Dari penciptaan seni ini berhasil dikreasikan 6 (enam motif batik yaitu: (1 Motif Uwoh Kopi; (2 Motif Godong Kopi;  (3 Motif Ceplok Kakao; (4 Motif Kakao Raja; (5 Motif Kakao Biru; dan (6 Motif Wiji Mukti. Berdasarkan hasil penilaian “Selera Estetika” diketahui bahwa motif yang paling banyak disukai adalah Motif Uwoh Kopi dan Motif Kakao Raja. Kata kunci: Motif Woh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti ABSTRACTBatik Jember is synonymous with tobacco leaf motif. Tobacco leaf shape is quite weak in the visual appearance characterized as that motif emerges like a picture of leaves in general. Therefore, it is necessary to create a distinctive design motif extracted from other natural resources of Jember that have specific shapes and characteristics that can be obtained as the stronger motif identity. The typical natural resources from Jember are coffee and cocoa. The purpose of the creation of this art is to produce the unique, creative and innovative batik and have specific characteristics of Jember. The method used are data collection, observation of the object, reviewing inspiration sources

  4. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  5. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  6. Vertically integrated analysis of human DNA. Final technical report

    Energy Technology Data Exchange (ETDEWEB)

    Olson, M.

    1997-10-01

    This project has been oriented toward improving the vertical integration of the sequential steps associated with the large-scale analysis of human DNA. The central focus has been on an approach to the preparation of {open_quotes}sequence-ready{close_quotes} maps, which is referred to as multiple-complete-digest (MCD) mapping, primarily directed at cosmid clones. MCD mapping relies on simple experimental steps, supported by advanced image-analysis and map-assembly software, to produce extremely accurate restriction-site and clone-overlap maps. We believe that MCD mapping is one of the few high-resolution mapping systems that has the potential for high-level automation. Successful automation of this process would be a landmark event in genome analysis. Once other higher organisms, paving the way for cost-effective sequencing of these genomes. Critically, MCD mapping has the potential to provide built-in quality control for sequencing accuracy and to make possible a highly integrated end product even if there are large numbers of discontinuities in the actual sequence.

  7. Effect of food processing on plant DNA degradation and PCR-based GMO analysis: a review.

    Science.gov (United States)

    Gryson, Nicolas

    2010-03-01

    The applicability of a DNA-based method for GMO detection and quantification depends on the quality and quantity of the DNA. Important food-processing conditions, for example temperature and pH, may lead to degradation of the DNA, rendering PCR analysis impossible or GMO quantification unreliable. This review discusses the effect of several food processes on DNA degradation and subsequent GMO detection and quantification. The data show that, although many of these processes do indeed lead to the fragmentation of DNA, amplification of the DNA may still be possible. Length and composition of the amplicon may, however, affect the result, as also may the method of extraction used. Also, many techniques are used to describe the behaviour of DNA in food processing, which occasionally makes it difficult to compare research results. Further research should be aimed at defining ingredients in terms of their DNA quality and PCR amplification ability, and elaboration of matrix-specific certified reference materials.

  8. Analysis of the role of PCNA-DNA contacts during clamp loading

    Directory of Open Access Journals (Sweden)

    Goedken Eric R

    2010-01-01

    Full Text Available Abstract Background Sliding clamps, such as Proliferating Cell Nuclear Antigen (PCNA in eukaryotes, are ring-shaped protein complexes that encircle DNA and enable highly processive DNA replication by serving as docking sites for DNA polymerases. In an ATP-dependent reaction, clamp loader complexes, such as the Replication Factor-C (RFC complex in eukaryotes, open the clamp and load it around primer-template DNA. Results We built a model of RFC bound to PCNA and DNA based on existing crystal structures of clamp loaders. This model suggests that DNA would enter the clamp at an angle during clamp loading, thereby interacting with positively charged residues in the center of PCNA. We show that simultaneous mutation of Lys 20, Lys 77, Arg 80, and Arg 149, which interact with DNA in the RFC-PCNA-DNA model, compromises the ability of yeast PCNA to stimulate the DNA-dependent ATPase activity of RFC when the DNA is long enough to extend through the clamp. Fluorescence anisotropy binding experiments show that the inability of the mutant clamp proteins to stimulate RFC ATPase activity is likely caused by reduction in the affinity of the RFC-PCNA complex for DNA. We obtained several crystal forms of yeast PCNA-DNA complexes, measuring X-ray diffraction data to 3.0 Å resolution for one such complex. The resulting electron density maps show that DNA is bound in a tilted orientation relative to PCNA, but makes different contacts than those implicated in clamp loading. Because of apparent partial disorder in the DNA, we restricted refinement of the DNA to a rigid body model. This result contrasts with previous analysis of a bacterial clamp bound to DNA, where the DNA was well resolved. Conclusion Mutational analysis of PCNA suggests that positively charged residues in the center of the clamp create a binding surface that makes contact with DNA. Disruption of this positive surface, which had not previously been implicated in clamp loading function, reduces RFC

  9. Systematic analysis of DNA damage induction and DNA repair pathway activation by continuous wave visible light laser micro-irradiation

    Directory of Open Access Journals (Sweden)

    Britta Muster

    2017-02-01

    Full Text Available Laser micro-irradiation can be used to induce DNA damage with high spatial and temporal resolution, representing a powerful tool to analyze DNA repair in vivo in the context of chromatin. However, most lasers induce a mixture of DNA damage leading to the activation of multiple DNA repair pathways and making it impossible to study individual repair processes. Hence, we aimed to establish and validate micro-irradiation conditions together with inhibition of several key proteins to discriminate different types of DNA damage and repair pathways using lasers commonly available in confocal microscopes. Using time-lapse analysis of cells expressing fluorescently tagged repair proteins and also validation of the DNA damage generated by micro-irradiation using several key damage markers, we show that irradiation with a 405 nm continuous wave laser lead to the activation of all repair pathways even in the absence of exogenous sensitization. In contrast, we found that irradiation with 488 nm laser lead to the selective activation of non-processive short-patch base excision and single strand break repair, which were further validated by PARP inhibition and metoxyamine treatment. We conclude that these low energy conditions discriminated against processive long-patch base excision repair, nucleotide excision repair as well as double strand break repair pathways.

  10. Comprehensive analysis of preeclampsia-associated DNA methylation in the placenta.

    Directory of Open Access Journals (Sweden)

    Tianjiao Chu

    Full Text Available A small number of recent reports have suggested that altered placental DNA methylation may be associated with early onset preeclampsia. It is important that further studies be undertaken to confirm and develop these findings. We therefore undertook a systematic analysis of DNA methylation patterns in placental tissue from 24 women with preeclampsia and 24 with uncomplicated pregnancy outcome.We analyzed the DNA methylation status of approximately 27,000 CpG sites in placental tissues in a massively parallel fashion using an oligonucleotide microarray. Follow up analysis of DNA methylation at specific CpG loci was performed using the Epityper MassArray approach and high-throughput bisulfite sequencing.Preeclampsia-specific DNA methylation changes were identified in placental tissue samples irrespective of gestational age of delivery. In addition, we identified a group of CpG sites within specific gene sequences that were only altered in early onset-preeclampsia (EOPET although these DNA methylation changes did not correlate with altered mRNA transcription. We found evidence that fetal gender influences DNA methylation at autosomal loci but could find no clear association between DNA methylation and gestational age.Preeclampsia is associated with altered placental DNA methylation. Fetal gender should be carefully considered during the design of future studies in which placental DNA is analyzed at the level of DNA methylation. Further large-scale analyses of preeclampsia-associated DNA methylation are necessary.

  11. Comparative analysis of protocols for DNA extraction from soybean caterpillars.

    Science.gov (United States)

    Palma, J; Valmorbida, I; da Costa, I F D; Guedes, J V C

    2016-04-07

    Genomic DNA extraction is crucial for molecular research, including diagnostic and genome characterization of different organisms. The aim of this study was to comparatively analyze protocols of DNA extraction based on cell lysis by sarcosyl, cetyltrimethylammonium bromide, and sodium dodecyl sulfate, and to determine the most efficient method applicable to soybean caterpillars. DNA was extracted from specimens of Chrysodeixis includens and Spodoptera eridania using the aforementioned three methods. DNA quantification was performed using spectrophotometry and high molecular weight DNA ladders. The purity of the extracted DNA was determined by calculating the A260/A280 ratio. Cost and time for each DNA extraction method were estimated and analyzed statistically. The amount of DNA extracted by these three methods was sufficient for PCR amplification. The sarcosyl method yielded DNA of higher purity, because it generated a clearer pellet without viscosity, and yielded high quality amplification products of the COI gene I. The sarcosyl method showed lower cost per extraction and did not differ from the other methods with respect to preparation times. Cell lysis by sarcosyl represents the best method for DNA extraction in terms of yield, quality, and cost effectiveness.

  12. A regenerated electrochemical biosensor for label-free detection of glucose and urea based on conformational switch of i-motif oligonucleotide probe

    Energy Technology Data Exchange (ETDEWEB)

    Gao, Zhong Feng; Chen, Dong Mei [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China); Lei, Jing Lei [School of Chemistry and Chemical Engineering, Chongqing University, Chongqing 400044 (China); Luo, Hong Qun, E-mail: luohq@swu.edu.cn [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China); Li, Nian Bing, E-mail: linb@swu.edu.cn [Key Laboratory of Eco-environments in Three Gorges Reservoir Region (Ministry of Education), School of Chemistry and Chemical Engineering, Southwest University, Chongqing 400715 (China)

    2015-10-15

    Improving the reproducibility of electrochemical signal remains a great challenge over the past decades. In this work, i-motif oligonucleotide probe-based electrochemical DNA (E-DNA) sensor is introduced for the first time as a regenerated sensing platform, which enhances the reproducibility of electrochemical signal, for label-free detection of glucose and urea. The addition of glucose or urea is able to activate glucose oxidase-catalyzed or urease-catalyzed reaction, inducing or destroying the formation of i-motif oligonucleotide probe. The conformational switch of oligonucleotide probe can be recorded by electrochemical impedance spectroscopy. Thus, the difference of electron transfer resistance is utilized for the quantitative determination of glucose and urea. We further demonstrate that the E-DNA sensor exhibits high selectivity, excellent stability, and remarkable regenerated ability. The human serum analysis indicates that this simple and regenerated strategy holds promising potential in future biosensing applications. - Highlights: • Conformational switch of i-motif is used for the detection of glucose and urea. • The sensor can be regenerated. • The proposed method is successfully applied in real sample assay. • Our method is label-free and inexpensive.

  13. Verification of the MOTIF code version 3.0

    International Nuclear Information System (INIS)

    Chan, T.; Guvanasen, V.; Nakka, B.W.; Reid, J.A.K.; Scheier, N.W.; Stanchell, F.W.

    1996-12-01

    As part of the Canadian Nuclear Fuel Waste Management Program (CNFWMP), AECL has developed a three-dimensional finite-element code, MOTIF (Model Of Transport In Fractured/ porous media), for detailed modelling of groundwater flow, heat transport and solute transport in a fractured rock mass. The code solves the transient and steady-state equations of groundwater flow, solute (including one-species radionuclide) transport, and heat transport in variably saturated fractured/porous media. The initial development was completed in 1985 (Guvanasen 1985) and version 3.0 was completed in 1986. This version is documented in detail in Guvanasen and Chan (in preparation). This report describes a series of fourteen verification cases which has been used to test the numerical solution techniques and coding of MOTIF, as well as demonstrate some of the MOTIF analysis capabilities. For each case the MOTIF solution has been compared with a corresponding analytical or independently developed alternate numerical solution. Several of the verification cases were included in Level 1 of the International Hydrologic Code Intercomparison Project (HYDROCOIN). The MOTIF results for these cases were also described in the HYDROCOIN Secretariat's compilation and comparison of results submitted by the various project teams (Swedish Nuclear Power Inspectorate 1988). It is evident from the graphical comparisons presented that the MOTIF solutions for the fourteen verification cases are generally in excellent agreement with known analytical or numerical solutions obtained from independent sources. This series of verification studies has established the ability of the MOTIF finite-element code to accurately model the groundwater flow and solute and heat transport phenomena for which it is intended. (author). 20 refs., 14 tabs., 32 figs

  14. Circulating Tumor DNA Analysis for Liver Cancers and Its Usefulness as a Liquid BiopsySummary

    Directory of Open Access Journals (Sweden)

    Atsushi Ono

    2015-09-01

    Full Text Available Background & Aims: Circulating tumor DNA (ctDNA carrying tumor-specific sequence alterations has been found in the cell-free fraction of blood. Liver cancer tumor specimens are difficult to obtain, and noninvasive methods are required to assess cancer progression and characterize underlying genomic features. Methods: We analyzed 46 patients with hepatocellular carcinoma who underwent hepatectomy or liver transplantation and for whom whole-genome sequencing data was available. We designed personalized assays targeting somatic rearrangements of each tumor to quantify serum ctDNA. Exome sequencing was performed using cell-free DNA paired primary tumor tissue DNA from a patient with recurrent liver cancer after transcatheter arterial chemoembolization (TACE. Results: We successfully detected ctDNA from 100 μL of serum samples in 7 of the 46 patients before surgery, increasing with disease progression. The cumulative incidence of recurrence and extrahepatic metastasis in the ctDNA-positive group were statistically significantly worse than in the ctDNA-negative group (P = .0102 and .0386, respectively. Multivariate analysis identified ctDNA (OR 6.10; 95% CI, 1.11–33.33, P = .038 as an independent predictor of microscopic vascular invasion of the portal vein (VP. We identified 45 nonsynonymous somatic mutations in cell-free DNA after TACE and 71 nonsynonymous somatic mutations in primary tumor tissue by exome sequencing. We identified 25 common mutations in both samples, and 83% of mutations identified in the primary tumor could be detected in the cell-free DNA. Conclusions: The presence of ctDNA reflects tumor progression, and detection of ctDNA can predict VP and recurrence, especially extrahepatic metastasis within 2 years. Our study demonstrated the usefulness of ctDNA detection and sequencing analysis of cell-free DNA for personalized treatment of liver cancer. Keywords: Circulating Tumor DNA, Exome Sequencing, Hepatocellular

  15. DNA-magnetic Particle Binding Analysis by Dynamic and Electrophoretic Light Scattering.

    Science.gov (United States)

    Haddad, Yazan; Dostalova, Simona; Kudr, Jiri; Zitka, Ondrej; Heger, Zbynek; Adam, Vojtech

    2017-11-09

    Isolation of DNA using magnetic particles is a field of high importance in biotechnology and molecular biology research. This protocol describes the evaluation of DNA-magnetic particles binding via dynamic light scattering (DLS) and electrophoretic light scattering (ELS). Analysis by DLS provides valuable information on the physicochemical properties of particles including particle size, polydispersity, and zeta potential. The latter describes the surface charge of the particle which plays major role in electrostatic binding of materials such as DNA. Here, a comparative analysis exploits three chemical modifications of nanoparticles and microparticles and their effects on DNA binding and elution. Chemical modifications by branched polyethylenimine, tetraethyl orthosilicate and (3-aminopropyl)triethoxysilane are investigated. Since DNA exhibits a negative charge, it is expected that zeta potential of particle surface will decrease upon binding of DNA. Forming of clusters should also affect particle size. In order to investigate the efficiency of these particles in isolation and elution of DNA, the particles are mixed with DNA in low pH (~6), high ionic strength and dehydration environment. Particles are washed on magnet and then DNA is eluted by Tris-HCl buffer (pH = 8). DNA copy number is estimated using quantitative polymerase chain reaction (PCR). Zeta potential, particle size, polydispersity and quantitative PCR data are evaluated and compared. DLS is an insightful and supporting method of analysis that adds a new perspective to the process of screening of particles for DNA isolation.

  16. Methylated DNA Immunoprecipitation Analysis of Mammalian Endogenous Retroviruses.

    Science.gov (United States)

    Rebollo, Rita; Mager, Dixie L

    2016-01-01

    Endogenous retroviruses are repetitive sequences found abundantly in mammalian genomes which are capable of modulating host gene expression. Nevertheless, most endogenous retrovirus copies are under tight epigenetic control via histone-repressive modifications and DNA methylation. Here we describe a common method used in our laboratory to detect, quantify, and compare mammalian endogenous retrovirus DNA methylation. More specifically we describe methylated DNA immunoprecipitation (MeDIP) followed by quantitative PCR.

  17. Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins

    OpenAIRE

    Meraldi, Patrick; McAinsh, Andrew D; Rheinbay, Esther; Sorger, Peter K

    2006-01-01

    Background: Kinetochores are large multi-protein structures that assemble on centromeric DNA (CEN DNA) and mediate the binding of chromosomes to microtubules. Comprising 125 base-pairs of CEN DNA and 70 or more protein components, Saccharomyces cerevisiae kinetochores are among the best understood. In contrast, most fungal, plant and animal cells assemble kinetochores on CENs that are longer and more complex, raising the question of whether kinetochore architecture has been conserved through ...

  18. Mitochondrial DNA analysis suggests a Chibchan migration into Colombia

    OpenAIRE

    Noguera-Santamaría, Maria Claudia; Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana. Grupo de Genética Humana, Facultad de Medicina, Universidad de La Sabana. Facultad de Ciencias de la Salud. Grupo Gisafaco. Corporación Universitaria Remington; Anderson, Carl Edlund; Department of Foreign Languages & Cultures, Universidad de La Sabana; Uricoechea, Daniel; Grupo de Genética Humana, Facultad de Medicina, Universidad de La Sabana; Durán, Clemencia; Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana.; Briceño-Balcázar, Ignacio; Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana Grupo de Genética Humana, Facultad de Medicina, Universidad de La Sabana; Bernal-Villegas, Jaime; Instituto de Genética Humana, Facultad de Medicina, Pontificia Universidad Javeriana Universidad Tecnológica de Bolívar

    2015-01-01

    The characterization of mitochondrial DNA (mtDNA) allows the establishment of genetic structures and phylogenetic relationships in human populations, tracing lineages far back in time. We analysed samples of mtDNA from twenty (20) Native American populations (700 individuals) dispersed throughout Colombian territory. Samples were collected during 1989-1993 in the context of the program Expedición Humana (“Human Expedition”) and stored in the Biological Repository of the Institute of Human Gen...

  19. Codon based co-occurrence network motifs in human mitochondria

    Directory of Open Access Journals (Sweden)

    Pramod Shinde

    2017-10-01

    Full Text Available The nucleotide polymorphism in human mitochondrial genome (mtDNA tolled by codon position bias plays an indispensable role in human population dispersion and expansion. Herein, we constructed genome-wide nucleotide co-occurrence networks using a massive data consisting of five different geographical regions and around 3000 samples for each region. We developed a powerful network model to describe complex mitochondrial evolutionary patterns between codon and non-codon positions. It was interesting to report a different evolution of Asian genomes than those of the rest which is divulged by network motifs. We found evidence that mtDNA undergoes substantial amounts of adaptive evolution, a finding which was supported by a number of previous studies. The dominance of higher order motifs indicated the importance of long-range nucleotide co-occurrence in genomic diversity. Most notably, codon motifs apparently underpinned the preferences among codon positions for co-evolution which is probably highly biased during the origin of the genetic code. Our analyses manifested that codon position co-evolution is very well conserved across human sub-populations and independently maintained within human sub-populations implying the selective role of evolutionary processes on codon position co-evolution. Ergo, this study provided a framework to investigate cooperative genomic interactions which are critical in underlying complex mitochondrial evolution.

  20. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Full Text Available Abstract Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.

  1. Identifying cis-regulatory modules by combining comparative and compositional analysis of DNA.

    Science.gov (United States)

    Pierstorff, Nora; Bergman, Casey M; Wiehe, Thomas

    2006-12-01

    Predicting cis-regulatory modules (CRMs) in higher eukaryotes is a challenging computational task. Commonly used methods to predict CRMs based on the signal of transcription factor binding sites (TFBS) are limited by prior information about transcription factor specificity. More general methods that bypass the reliance on TFBS models are needed for comprehensive CRM prediction. We have developed a method to predict CRMs called CisPlusFinder that identifies high density regions of perfect local ungapped sequences (PLUSs) based on multiple species conservation. By assuming that PLUSs contain core TFBS motifs that are locally overrepresented, the method attempts to capture the expected features of CRM structure and evolution. Applied to a benchmark dataset of CRMs involved in early Drosophila development, CisPlusFinder predicts more annotated CRMs than all other methods tested. Using the REDfly database, we find that some 'false positive' predictions in the benchmark dataset correspond to recently annotated CRMs. Our work demonstrates that CRM prediction methods that combine comparative genomic data with statistical properties of DNA may achieve reasonable performance when applied genome-wide in the absence of an a priori set of known TFBS motifs. The program CisPlusFinder can be downloaded at http://jakob.genetik.uni-koeln.de/bioinformatik/people/nora/nora.html. All software is licensed under the Lesser GNU Public License (LGPL).

  2. Bacterial identification and subtyping using DNA microarray and DNA sequencing.

    Science.gov (United States)

    Al-Khaldi, Sufian F; Mossoba, Magdi M; Allard, Marc M; Lienau, E Kurt; Brown, Eric D

    2012-01-01

    The era of fast and accurate discovery of biological sequence motifs in prokaryotic and eukaryotic cells is here. The co-evolution of direct genome sequencing and DNA microarray strategies not only will identify, isotype, and serotype pathogenic bacteria, but also it will aid in the discovery of new gene functions by detecting gene expressions in different diseases and environmental conditions. Microarray bacterial identification has made great advances in working with pure and mixed bacterial samples. The technological advances have moved beyond bacterial gene expression to include bacterial identification and isotyping. Application of new tools such as mid-infrared chemical imaging improves detection of hybridization in DNA microarrays. The research in this field is promising and future work will reveal the potential of infrared technology in bacterial identification. On the other hand, DNA sequencing by using 454 pyrosequencing is so cost effective that the promise of $1,000 per bacterial genome sequence is becoming a reality. Pyrosequencing technology is a simple to use technique that can produce accurate and quantitative analysis of DNA sequences with a great speed. The deposition of massive amounts of bacterial genomic information in databanks is creating fingerprint phylogenetic analysis that will ultimately replace several technologies such as Pulsed Field Gel Electrophoresis. In this chapter, we will review (1) the use of DNA microarray using fluorescence and infrared imaging detection for identification of pathogenic bacteria, and (2) use of pyrosequencing in DNA cluster analysis to fingerprint bacterial phylogenetic trees.

  3. Binding properties of SUMO-interacting motifs (SIMs) in yeast.

    Science.gov (United States)

    Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

    2015-03-01

    Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins is known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are yet poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrary, the stability of the parallel binding mode is more dependent on the sequence features of the SIM motif like the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.

  4. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination.

    Science.gov (United States)

    Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David

    2006-01-01

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  5. Disparate requirements for the Walker A and B ATPase motifs ofhuman RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  6. High-resolution DNA content analysis of microbiopsy samples in oral lichen planus.

    Science.gov (United States)

    Pentenero, M; Monticone, M; Marino, R; Aiello, C; Marchitto, G; Malacarne, D; Giaretti, W; Gandolfo, S; Castagnola, P

    2017-04-01

    DNA aneuploidy has been reported to be a predictor of poor prognosis in both premalignant and malignant lesions. In oral lichen planus (OLP), this hypothesis remains to be proved. This study aimed to determine the rate of occurrence of DNA aneuploidy in patients with OLP by high-resolution DNA flow cytometry. Patients with OLP were consecutively enrolled. Tissue samples were subdivided for formalin fixation and routine histological assessment and for immediate storage at -20°C for later DNA ploidy analysis, which was performed by DAPI staining of the extracted nuclei and excitation with a UV lamp. The DNA aneuploid sublines were characterized by the DNA Index. A DNA aneuploid status was observed in two of 77 patients with OLP (2.6%). When considering the clinical aspect of the OLP lesions, both DNA aneuploid cases had a reticular clinical aspect. DNA aneuploidy is an uncommon event in OLP and less frequent compared to other non-dysplastic and non-OLP oral potentially malignant disorders. The extremely low rate of DNA aneuploidy could represent an occasional finding or reflect the low rate of malignant transformation observed in patients with OLP even if the real prognostic value of DNA ploidy analysis in patients with OLP remains to be confirmed. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  7. Universal platform for quantitative analysis of DNA transposition

    Directory of Open Access Journals (Sweden)

    Pajunen Maria I

    2010-11-01

    Full Text Available Abstract Background Completed genome projects have revealed an astonishing diversity of transposable genetic elements, implying the existence of novel element families yet to be discovered from diverse life forms. Concurrently, several better understood transposon systems have been exploited as efficient tools in molecular biology and genomics applications. Characterization of new mobile elements and improvement of the existing transposition technology platforms warrant easy-to-use assays for the quantitative analysis of DNA transposition. Results Here we developed a universal in vivo platform for the analysis of transposition frequency with class II mobile elements, i.e., DNA transposons. For each particular transposon system, cloning of the transposon ends and the cognate transposase gene, in three consecutive steps, generates a multifunctional plasmid, which drives inducible expression of the transposase gene and includes a mobilisable lacZ-containing reporter transposon. The assay scores transposition events as blue microcolonies, papillae, growing within otherwise whitish Escherichia coli colonies on indicator plates. We developed the assay using phage Mu transposition as a test model and validated the platform using various MuA transposase mutants. For further validation and to illustrate universality, we introduced IS903 transposition system components into the assay. The developed assay is adjustable to a desired level of initial transposition via the control of a plasmid-borne E. coli arabinose promoter. In practice, the transposition frequency is modulated by varying the concentration of arabinose or glucose in the growth medium. We show that variable levels of transpositional activity can be analysed, thus enabling straightforward screens for hyper- or hypoactive transposase mutants, regardless of the original wild-type activity level. Conclusions The established universal papillation assay platform should be widely applicable to a

  8. Quantitative Analysis of the Mutagenic Potential of 1-Aminopyrene-DNA Adduct Bypass Catalyzed by Y-Family DNA Polymerases

    Science.gov (United States)

    Sherrer, Shanen M.; Taggart, David J.; Pack, Lindsey R.; Malik, Chanchal K.; Basu, Ashis K.; Suo, Zucai

    2012-01-01

    N- (deoxyguanosin-8-yl)-1-aminopyrene (dGAP) is the predominant nitro polyaromatic hydrocarbon product generated from the air pollutant 1-nitropyrene reacting with DNA. Previous studies have shown that dGAP induces genetic mutations in bacterial and mammalian cells. One potential source of these mutations is the error-prone bypass of dGAP lesions catalyzed by the low-fidelity Y-family DNA polymerases. To provide a comparative analysis of the mutagenic potential of the translesion DNA synthesis (TLS) of dGAP, we employed short oligonucleotide sequencing assays (SOSAs) with the model Y-family DNA polymerase from Sulfolobus solfataricus, DNA Polymerase IV (Dpo4), and the human Y-family DNA polymerases eta (hPolη), kappa (hPolκ), and iota (hPolι). Relative to undamaged DNA, all four enzymes generated far more mutations (base deletions, insertions, and substitutions) with a DNA template containing a site-specifically placed dGAP. Opposite dGAP and at an immediate downstream template position, the most frequent mutations made by the three human enzymes were base deletions and the most frequent base substitutions were dAs for all enzymes. Based on the SOSA data, Dpo4 was the least error-prone Y-family DNA polymerase among the four enzymes during the TLS of dGAP. Among the three human Y-family enzymes, hPolκ made the fewest mutations at all template positions except opposite the lesion site. hPolκ was significantly less error-prone than hPolι and hPolη during the extension of dGAP bypass products. Interestingly, the most frequent mutations created by hPolι at all template positions were base deletions. Although hRev1, the fourth human Y-family enzyme, could not extend dGAP bypass products in our standing start assays, it preferentially incorporated dCTP opposite the bulky lesion. Collectively, these mutagenic profiles suggest that hPolkk and hRev1 are the most suitable human Y-family DNA polymerases to perform TLS of dGAP in humans. PMID:22917544

  9. Analysis of epigenetic modifications of DNA in human cells

    DEFF Research Database (Denmark)

    Kristensen, Lasse Sommer; Treppendahl, Marianne Bach; Grønbæk, Kirsten

    2013-01-01

    Epigenetics, the study of somatically heritable changes in gene expression not related to changes in the DNA sequence, is a rapidly expanding research field that plays important roles in healthy as well as in diseased cells. DNA methylation and hydroxymethylation are epigenetic modifications found...

  10. The application of DNA microarrays in gene expression analysis

    NARCIS (Netherlands)

    Hal, van N.L.W.; Vorst, O.; Houwelingen, van A.M.M.L.; Kok, E.J.; Peijnenburg, A.A.C.M.; Aharoni, A.; Tunen, van A.J.; Keijer, J.

    2000-01-01

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed.

  11. Analysis of Molecular Variance Inferred from Metric Distances among DNA Haplotypes: Application to Human Mitochondrial DNA Restriction Data

    OpenAIRE

    Excoffier, L.; Smouse, P. E.; Quattro, J. M.

    1992-01-01

    We present here a framework for the study of molecular variation within a single species. Information on DNA haplotype divergence is incorporated into an analysis of variance format, derived from a matrix of squared-distances among all pairs of haplotypes. This analysis of molecular variance (AMOVA) produces estimates of variance components and F-statistic analogs, designated here as φ-statistics, reflecting the correlation of haplotypic diversity at different levels of hierarchical subdivisi...

  12. A preliminary analysis of the DNA and diet of the extinct Beothuk: a systematic approach to ancient human DNA

    DEFF Research Database (Denmark)

    Kuch, Melanie; Gröcke, Darren R; Knyf, Martin C

    2007-01-01

    , which fall within haplogroups X and C, consistent with Northeastern Native populations today. In addition we have sexed the male using a novel-sexing assay and confirmed the authenticity of his Y chromosome with the presence of the Native American specific Y-QM3 single nucleotide polymorphism (SNP......). This is the first ancient nuclear SNP typed from a Native population in the Americas. In addition, using the same teeth we conducted a stable isotopes analysis of collagen and dentine to show that both individuals relied on marine sources (fresh and salt water fish, seals) with no hierarchy seen between them......, Nonosabasut) were of admixed (European-Native American) descent. We also analyzed patterns of DNA damage in the clones of authentic mtDNA sequences; there is no tendency for DNA damage to occur preferentially at previously defined mutational hotspots, suggesting that such mutational hotspots...

  13. Improved reproducibility in genome-wide DNA methylation analysis for PAXgene® fixed samples compared to restored FFPE DNA

    DEFF Research Database (Denmark)

    Andersen, Gitte Brinch; Hager, Henrik; Hansen, Lise Lotte

    2014-01-01

    Chip. Quantitative DNA methylation analysis demonstrated that the methylation profile in PAXgene-fixed tissues showed, in comparison with restored FFPE samples, a higher concordance with the profile detected in frozen samples. We demonstrate, for the first time, that DNA from PAXgene conserved tissue performs better......Formalin fixation has been the standard method for conservation of clinical specimens for decades. However, a major drawback is the high degradation of nucleic acids, which complicates its use in genome-wide analyses. Unbiased identification of biomarkers, however, requires genome-wide studies......, precluding the use of the valuable archives of specimens with long-term follow-up data. Therefore, restoration protocols for DNA from formalin-fixed and paraffin-embedded (FFPE) samples have been developed, although they are cost-intensive and time-consuming. An alternative to FFPE and snap...

  14. Fluorescence correlation spectroscopy analysis for accurate determination of proportion of doubly labeled DNA in fluorescent DNA pool for quantitative biochemical assays.

    Science.gov (United States)

    Hou, Sen; Sun, Lili; Wieczorek, Stefan A; Kalwarczyk, Tomasz; Kaminski, Tomasz S; Holyst, Robert

    2014-01-15

    Fluorescent double-stranded DNA (dsDNA) molecules labeled at both ends are commonly produced by annealing of complementary single-stranded DNA (ssDNA) molecules, labeled with fluorescent dyes at the same (3' or 5') end. Because the labeling efficiency of ssDNA is smaller than 100%, the resulting dsDNA have two, one or are without a dye. Existing methods are insufficient to measure the percentage of the doubly-labeled dsDNA component in the fluorescent DNA sample and it is even difficult to distinguish the doubly-labeled DNA component from the singly-labeled component. Accurate measurement of the percentage of such doubly labeled dsDNA component is a critical prerequisite for quantitative biochemical measurements, which has puzzled scientists for decades. We established a fluorescence correlation spectroscopy (FCS) system to measure the percentage of doubly labeled dsDNA (PDL) in the total fluorescent dsDNA pool. The method is based on comparative analysis of the given sample and a reference dsDNA sample prepared by adding certain amount of unlabeled ssDNA into the original ssDNA solution. From FCS autocorrelation functions, we obtain the number of fluorescent dsDNA molecules in the focal volume of the confocal microscope and PDL. We also calculate the labeling efficiency of ssDNA. The method requires minimal amount of material. The samples have the concentration of DNA in the nano-molar/L range and the volume of tens of microliters. We verify our method by using restriction enzyme Hind III to cleave the fluorescent dsDNA. The kinetics of the reaction depends strongly on PDL, a critical parameter for quantitative biochemical measurements. Copyright © 2013 Elsevier B.V. All rights reserved.

  15. Analysis of cellular and extracellular DNA in fingerprints

    Energy Technology Data Exchange (ETDEWEB)

    Button, Julie M. [Lawrence Livermore National Lab. (LLNL), Livermore, CA (United States)

    2014-09-09

    It has been previously shown that DNA can be recovered from latent fingerprints left on various surfaces [R. A. H. van Oorschot and M. K. Jones, Nature 387, 767 (1997)]. However, the source of the DNA, extracellular versus cellular origin, is difficult to determine. If the DNA is cellular, it is believed to belong to skin cells while extracellular DNA is believed to originate from body fluids such as sweat [D. J. Daly et. al, Forensic Sci. Int. Genet. 6, 41-46 (2012); V. V. Vlassov et. al, BioEssays 29, 654-667 (2007)]. The origin of the DNA in fingerprints has implications for processing and interpretation of forensic evidence. The determination of the origin of DNA in fingerprints is further complicated by the fact that the DNA in fingerprints tends to be at a very low quantity [R. A. H. van Oorschot and M. K. Jones, Nature 387, 767 (1997)]. This study examined fingerprints from five volunteers left on sterilized glass slides and plastic pens. Three fingerprints were left on each glass slide (thumb, index, and middle fingers) while the pens were held as if one was writing with them. The DNA was collected from the objects using the wet swabbing technique (TE buffer). Following collection, the cellular and extracellular components of each sample were separated using centrifugation and an acoustofluidics system. Centrifugation is still the primary separation technique utilized in forensics laboratories, while acoustic focusing uses sound waves to focus large particles (cells) into low pressure nodes, separating them from the rest of the sample matrix. After separation, all samples were quantified using real-time quantitative PCR (qPCR). The overall trend is that there is more DNA in the extracellular fractions than cellular fractions for both centrifugation and acoustofluidic processing. Additionally, more DNA was generally collected from the pen samples than the samples left on glass slides.

  16. Quantitative analysis of the flexibility effect of cisplatin on circular DNA

    Science.gov (United States)

    Ji, Chao; Zhang, Lingyun; Wang, Peng-Ye

    2013-10-01

    We study the effects of cisplatin on the circular configuration of DNA using atomic force microscopy (AFM) and observe that the DNA gradually transforms to a complex configuration with an intersection and interwound structures from a circlelike structure. An algorithm is developed to extract the configuration profiles of circular DNA from AFM images and the radius of gyration is used to describe the flexibility of circular DNA. The quantitative analysis of the circular DNA demonstrates that the radius of gyration gradually decreases and two processes on the change of flexibility of circular DNA are found as the cisplatin concentration increases. Furthermore, a model is proposed and discussed to explain the mechanism for understanding the complicated interaction between DNA and cisplatin.

  17. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  18. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....

  19. An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

    Science.gov (United States)

    Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

    2016-08-09

    Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance

  20. Analysis of Cellular DNA Content by Flow Cytometry.

    Science.gov (United States)

    Darzynkiewicz, Zbigniew; Huang, Xuan; Zhao, Hong

    2017-11-01

    Cellular DNA content can be measured by flow cytometry with the aim of : (1) revealing cell distribution within the major phases of the cell cycle, (2) estimating frequency of apoptotic cells with fractional DNA content, and/or (3) disclosing DNA ploidy of the measured cell population. In this unit, simple and universally applicable methods for staining fixed cells are presented, as are methods that utilize detergents and/or proteolytic treatment to permeabilize cells and make DNA accessible to fluorochrome. Additionally, supravital cell staining with Hoechst 33342, which is primarily used for sorting live cells based on DNA-content differences for their subsequent culturing, is described. Also presented are methods for staining cell nuclei isolated from paraffin-embedded tissues. Available algorithms are listed for deconvolution of DNA-content-frequency histograms to estimate percentage of cells in major phases of the cell cycle and frequency of apoptotic cells with fractional DNA content. © 2017 by John Wiley & Sons, Inc. Copyright © 2017 John Wiley and Sons, Inc.

  1. Statistical analysis of post mortem DNA damage-derived miscoding lesions in Neandertal mitochondrial DNA

    DEFF Research Database (Denmark)

    Vives, Sergi; Gilbert, M Thomas; Arenas, Conchita

    2008-01-01

    in the Heavy strand could explain the observed bias, a phenomenon that could be further tested with non-PCR based approaches. The characterization of the HVS1 hotspots will be of use to future Neandertal mtDNA studies, with specific regards to assessing the authenticity of new positions previously unknown...

  2. Physical-chemical property based sequence motifs and methods regarding same

    Science.gov (United States)

    Braun, Werner [Friendswood, TX; Mathura, Venkatarajan S [Sarasota, FL; Schein, Catherine H [Friendswood, TX

    2008-09-09

    A data analysis system, program, and/or method, e.g., a data mining/data exploration method, using physical-chemical property motifs. For example, a sequence database may be searched for identifying segments thereof having physical-chemical properties similar to the physical-chemical property motifs.

  3. DNA conformational analysis in solution by uranyl mediated photocleavage

    DEFF Research Database (Denmark)

    Nielsen, Peter E.; Møllegaard, N E; Jeppesen, C

    1990-01-01

    Uranyl mediated photocleavage of double stranded DNA is proposed as a general probing for DNA helix conformation in terms of minor groove width/electronegative potential. Specifically, it is found that A/T-tracts known to constitute strong distamycin binding sites are preferentially photocleaved ......, uranyl photocleavage of the internal control region (ICR) of the 5S-RNA gene yields a cleavage modulation pattern fully compatible with that obtained by DNase I which also--in a more complex way--senses DNA minor groove width....

  4. The application of DNA microarrays in gene expression analysis.

    Science.gov (United States)

    van Hal, N L; Vorst, O; van Houwelingen, A M; Kok, E J; Peijnenburg, A; Aharoni, A; van Tunen, A J; Keijer, J

    2000-03-31

    DNA microarray technology is a new and powerful technology that will substantially increase the speed of molecular biological research. This paper gives a survey of DNA microarray technology and its use in gene expression studies. The technical aspects and their potential improvements are discussed. These comprise array manufacturing and design, array hybridisation, scanning, and data handling. Furthermore, it is discussed how DNA microarrays can be applied in the working fields of: safety, functionality and health of food and gene discovery and pathway engineering in plants.

  5. Comparison of the electrophoretic method with the sedimentation method for the analysis of DNA strand breaks

    International Nuclear Information System (INIS)

    Yamamoto, Osamu; Ogawa, Masaaki; Hoshi, Masaharu

    1982-01-01

    Application of electrophoresis to the analysis of DNA strand breaks was studied comparing with the sedimentation analysis. A BRL gel electrophoresis system (Type V16) was used for this study. Calf thymus DNA (1 mg/ml) irradiated with 60 Co gamma-rays in SSC solution was applied to both the electrophoretic analysis and the sedimentation analysis. Lamda phage DNA and its fragments were employed as the standard size molecules. In a range from 1 k base pairs to 6 k base pairs in length for double stranded DNA or from 2 k bases to 12 k bases for single stranded DNA, the calculated average molecular weight from the electrophoresis coincided with that from the sedimentation. Number of single strand breaks and double strand breaks were 1.34 x 10 11 breaks/mg/rad (G = 0.215) and 0.48 x 10 5 breaks/mg/rad 2 , respectively. (author)

  6. RDNAnalyzer: A tool for DNA secondary structure prediction and sequence analysis.

    Science.gov (United States)

    Afzal, Muhammad; Shahid, Ahmad Ali; Shehzadi, Abida; Nadeem, Shahid; Husnain, Tayyab

    2012-01-01

    RDNAnalyzer is an innovative computer based tool designed for DNA secondary structure prediction and sequence analysis. It can randomly generate the DNA sequence or user can upload the sequences of their own interest in RAW format. It uses and extends the Nussinov dynamic programming algorithm and has various application for the sequence analysis. It predicts the DNA secondary structure and base pairings. It also provides the tools for routinely performed sequence analysis by the biological scientists such as DNA replication, reverse compliment generation, transcription, translation, sequence specific information as total number of nucleotide bases, ATGC base contents along with their respective percentages and sequence cleaner. RDNAnalyzer is a unique tool developed in Microsoft Visual Studio 2008 using Microsoft Visual C# and Windows Presentation Foundation and provides user friendly environment for sequence analysis. It is freely available. http://www.cemb.edu.pk/sw.html RDNAnalyzer - Random DNA Analyser, GUI - Graphical user interface, XAML - Extensible Application Markup Language.

  7. Cluster analysis of Helicobacter pylori genomic DNA fingerprints suggests gastroduodenal disease-specific associations.

    Science.gov (United States)

    Go, M F; Chan, K Y; Versalovic, J; Koeuth, T; Graham, D Y; Lupski, J R

    1995-07-01

    Helicobacter pylori infection is now accepted as the most common cause of chronic active gastritis and peptic ulcer disease. The etiologies of many infectious diseases have been attributed to specific or clonal strains of bacterial pathogens. Polymerase chain reaction (PCR) amplification of DNA between repetitive DNA sequences, REP elements (REP-PCR), has been utilized to generate DNA fingerprints to examine similarity among strains within a bacterial species. Genomic DNA from H. pylori isolates obtained from 70 individuals (39 duodenal ulcers and 31 simple gastritis) was PCR-amplified using consensus probes to repetitive DNA elements. The H. pylori DNA fingerprints were analyzed for similarity and correlated with disease presentation using the NTSYS-pc computer program. Each H. pylori strain had a distinct DNA fingerprint except for two pairs. Single-colony DNA fingerprints of H. pylori from the same patient were identical, suggesting that each patient harbors a single strain. Computer-assisted cluster analysis of the REP-PCR DNA fingerprints showed two large clusters of isolates, one associated with simple gastritis and the other with duodenal ulcer disease. Cluster analysis of REP-PCR DNA fingerprints of H. pylori strains suggests that duodenal ulcer isolates, as a group, are more similar to one another and different from gastritis isolates. These results suggest that disease-specific strains may exist.

  8. Nanochannel Device with Embedded Nanopore: a New Approach for Single-Molecule DNA Analysis and Manipulation

    Science.gov (United States)

    Zhang, Yuning; Reisner, Walter

    2013-03-01

    Nanopore and nanochannel based devices are robust methods for biomolecular sensing and single DNA manipulation. Nanopore-based DNA sensing has attractive features that make it a leading candidate as a single-molecule DNA sequencing technology. Nanochannel based extension of DNA, combined with enzymatic or denaturation-based barcoding schemes, is already a powerful approach for genome analysis. We believe that there is revolutionary potential in devices that combine nanochannels with embedded pore detectors. In particular, due to the fast translocation of a DNA molecule through a standard nanopore configuration, there is an unfavorable trade-off between signal and sequence resolution. With a combined nanochannel-nanopore device, based on embedding a pore inside a nanochannel, we can in principle gain independent control over both DNA translocation speed and sensing signal, solving the key draw-back of the standard nanopore configuration. We demonstrate that we can optically detect successful translocation of DNA from the nanochannel out through the nanopore, a possible method to 'select' a given barcode for further analysis. In particular, we show that in equilibrium DNA will not escape through an embedded sub-persistence length nanopore, suggesting that the pore could be used as a nanoscale window through which to interrogate a nanochannel extended DNA molecule. Furthermore, electrical measurements through the nanopore are performed, indicating that DNA sensing is feasible using the nanochannel-nanopore device.

  9. DNA nanotechnology

    Science.gov (United States)

    Seeman, Nadrian C.; Sleiman, Hanadi F.

    2018-01-01

    DNA is the molecule that stores and transmits genetic information in biological systems. The field of DNA nanotechnology takes this molecule out of its biological context and uses its information to assemble structural motifs and then to connect them together. This field has had a remarkable impact on nanoscience and nanotechnology, and has been revolutionary in our ability to control molecular self-assembly. In this Review, we summarize the approaches used to assemble DNA nanostructures and examine their emerging applications in areas such as biophysics, diagnostics, nanoparticle and protein assembly, biomolecule structure determination, drug delivery and synthetic biology. The introduction of orthogonal interactions into DNA nanostructures is discussed, and finally, a perspective on the future directions of this field is presented.

  10. Multi-color fluorescent DNA analysis in an integrated optofluidic lab-on-a-chip

    OpenAIRE

    Dongre, C.; van Weerd, J.; van Weeghel, R.; Martinez-Vazquez, R.; Osellame, R.; Cerullo, G.; Besselink, G.A.J.; van den Vlekkert, H.H.; Hoekstra, Hugo; Pollnau, Markus

    2010-01-01

    Sorting and sizing of DNA molecules within the human genome project has enabled the genetic mapping of various illnesses. By employing tiny lab-on-a-chip devices for such DNA analysis, integrated DNA sequencing and genetic diagnostics have become feasible. However, such diagnostic chips typically lack integrated sensing capability. We address this issue by combining microfluidic capillary electrophoresis with laser-induced fluorescence detection resulting in optofluidic integration towards an...

  11. Targeted DNA Methylation Analysis by High Throughput Sequencing in Porcine Peri-attachment Embryos

    OpenAIRE

    MORRILL, Benson H.; COX, Lindsay; WARD, Anika; HEYWOOD, Sierra; PRATHER, Randall S.; ISOM, S. Clay

    2013-01-01

    Abstract The purpose of this experiment was to implement and evaluate the effectiveness of a next-generation sequencing-based method for DNA methylation analysis in porcine embryonic samples. Fourteen discrete genomic regions were amplified by PCR using bisulfite-converted genomic DNA derived from day 14 in vivo-derived (IVV) and parthenogenetic (PA) porcine embryos as template DNA. Resulting PCR products were subjected to high-throughput sequencing using the Illumina Genome Analyzer IIx plat...

  12. A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

    Directory of Open Access Journals (Sweden)

    Asita Elengoe

    2015-01-01

    Full Text Available Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD of heat shock 70 kDa protein (PDB: 1HJO with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD simulation. Human DNA binding domain of p53 motif (SCMGGMNR retrieved from UniProt (UniProtKB: P04637 was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.

  13. X-ray induced degradation of DNA in Aspergillus nidulans cells comparative analysis of UV- and X-ray induced DNA degradation

    International Nuclear Information System (INIS)

    Zinchenko, V.V.; Babykin, M.M.

    1980-01-01

    Irradiating cells of Aspergillus nidulans of the wild type in the logarythmical growth phase with X-rays leads to a certain retention in DNA synthesis. This period is characterized by an insignificant fermentative DNA degradation connected with a process of its repair. There is no direct dependence between the radiation dose and the level of DNA degradation. The investigation of X-ray induced DNA degradation in a number of UVS-mutants permits to show the existence of two branches of DNA degradation - dependent and independent of the exogenic energy source. The dependence of DNA degradation on albumen synthesis prior to irradiation and after it, is demonstrated. It is supposed that the level of X-ray induced DNA degradation is determined by two albumen systems, one of which initiates degradation and the other terminates it. The comparative analysis of UV and X-ray induced DNA degradation is carried out

  14. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2014-02-20

    ChIP-Seq (chromatin immunoprecipitation sequencing) has provided the advantage for finding motifs as ChIP-Seq experiments narrow down the motif finding to binding site locations. Recent motif finding tools facilitate the motif detection by providing user-friendly Web interface. In this work, we reviewed nine motif finding Web tools that are capable for detecting binding site motifs in ChIP-Seq data. We showed each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommended the users to use multiple motif finding Web tools that implement different algorithms for obtaining significant motifs, overlapping resemble motifs, and non-overlapping motifs. Finally, we provided our suggestions for future development of motif finding Web tool that better assists researchers for finding motifs in ChIP-Seq data.

  15. Application of synthetic DNA probes to the analysis of DNA sequence variants in man

    International Nuclear Information System (INIS)

    Wallace, R.B.; Petz, L.D.; Yam, P.Y.

    1986-01-01

    Oligonucleotide probes provide a tool to discriminate between any two alleles on the basis of hybridization. Random sampling of the genome with different oligonucleotide probes should reveal polymorphism in a certain percentage of the cases. In the hope of identifying polymorphic regions more efficiently, we chose to take advantage of the proposed hypermutability of repeated DNA sequences and the specificity of oligonucleotide hybridization. Since, under appropriate conditions, oligonucleotide probes require complete base pairing for hybridization to occur, they will only hybridize to a subset of the members of a repeat family when all members of the family are not identical. The results presented here suggest that oligonucleotide hybridization can be used to extend the genomic sequences that can be tested for the presence of RFLPs. This expands the tools available to human genetics. In addition, the results suggest that repeated DNA sequences are indeed more polymorphic than single-copy sequences. 28 references, 2 figures

  16. A Conserved Metal Binding Motif in the Bacillus subtilis Competence Protein ComFA Enhances Transformation.

    Science.gov (United States)

    Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M

    2017-08-01

    Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.

  17. High resolution melting (HRM) analysis of DNA--its role and potential in food analysis.

    Science.gov (United States)

    Druml, Barbara; Cichna-Markl, Margit

    2014-09-01

    DNA based methods play an increasing role in food safety control and food adulteration detection. Recent papers show that high resolution melting (HRM) analysis is an interesting approach. It involves amplification of the target of interest in the presence of a saturation dye by the polymerase chain reaction (PCR) and subsequent melting of the amplicons by gradually increasing the temperature. Since the melting profile depends on the GC content, length, sequence and strand complementarity of the product, HRM analysis is highly suitable for the detection of single-base variants and small insertions or deletions. The review gives an introduction into HRM analysis, covers important aspects in the development of an HRM analysis method and describes how HRM data are analysed and interpreted. Then we discuss the potential of HRM analysis based methods in food analysis, i.e. for the identification of closely related species and cultivars and the identification of pathogenic microorganisms. Copyright © 2014 Elsevier Ltd. All rights reserved.

  18. Analysis of multiple single nucleotide polymorphisms (SNP) on DNA traces from plasma and dried blood samples

    NARCIS (Netherlands)

    Catsburg, Arnold; van der Zwet, Wil C.; Morre, Servaas A.; Ouburg, Sander; Vandenbroucke-Grauls, Christina M. J. E.; Savelkoul, Paul H. M.

    2007-01-01

    Reliable analysis of single nucleotide polymorphisms (SNPs) in DNA derived from samples containing low numbers of cells or from suboptimal sources can be difficult. A new procedure to characterize multiple SNPs in traces of DNA from plasma and old dried blood samples was developed. Six SNPs in the

  19. AN IMAGE-ANALYSIS TECHNIQUE FOR DETECTION OF RADIATION-INDUCED DNA FRAGMENTATION AFTER CHEF ELECTROPHORESIS

    NARCIS (Netherlands)

    ROSEMANN, M; KANON, B; KONINGS, AWT; KAMPINGA, HH

    CHEF-electrophoresis was used as a technique to detect radiation-induced DNA breakage with special emphasis to biological relevant X-ray doses (0-10 Gy). Fluorescence detection of DNA-fragments using a sensitive image analysis system was directly compared with conventional scintillation counting of

  20. Perinatal hepatitis B virus detection by hepatitis B virus-DNA analysis.

    OpenAIRE

    De Virgiliis, S; Frau, F; Sanna, G; Turco, M P; Figus, A L; Cornacchia, G; Cao, A

    1985-01-01

    Maternal transmission of hepatitis B virus infection in relation to the hepatitis B e antigen/antibody system and serum hepatitis B virus-DNA were evaluated. Results indicate that hepatitis B virus-DNA analysis can identify hepatitis B serum antigen positive mothers who may transmit infection to their offspring.

  1. Photocleavable DNA barcode-antibody conjugates allow sensitive and multiplexed protein analysis in single cells.

    Science.gov (United States)

    Agasti, Sarit S; Liong, Monty; Peterson, Vanessa M; Lee, Hakho; Weissleder, Ralph

    2012-11-14

    DNA barcoding is an attractive technology, as it allows sensitive and multiplexed target analysis. However, DNA barcoding of cellular proteins remains challenging, primarily because barcode amplification and readout techniques are often incompatible with the cellular microenvironment. Here we describe the development and validation of a photocleavable DNA barcode-antibody conjugate method for rapid, quantitative, and multiplexed detection of proteins in single live cells. Following target binding, this method allows DNA barcodes to be photoreleased in solution, enabling easy isolation, amplification, and readout. As a proof of principle, we demonstrate sensitive and multiplexed detection of protein biomarkers in a variety of cancer cells.

  2. Quantitation of DNA repair in brain cell cultures: implications for autoradiographic analysis of mixed cell populations

    International Nuclear Information System (INIS)

    Dambergs, R.; Kidson, C.

    1979-01-01

    Quantitation of DNA repair in the mixed cell population of mouse embryo brain cultures has been assessed by autoradiographic analysis of unscheduled DNA synthesis following UV-irradiation. The proportion of labelled neurons and the grain density over neuronal nuclei were both less than the corresponding values for glial cells. The nuclear geometries of these two classes of cell are very different. Partial correction for the different geometries by relating grain density to nuclear area brought estimates of neuronal and glial DNA repair synthesis more closely in line. These findings have general implications for autoradiographic measurement of DNA repair in mixed cell populations and in differentiated versus dividing cells. (author)

  3. The ARTT motif and a unified structural understanding of substraterecognition in ADP ribosylating bacterial toxins and eukaryotic ADPribosyltransferases

    Energy Technology Data Exchange (ETDEWEB)

    Han, S.; Tainer, J.A.

    2001-08-01

    ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT

  4. The derivative assay--an analysis of two fast components of DNA rejoining kinetics

    International Nuclear Information System (INIS)

    Sandstroem, B.E.

    1989-01-01

    The DNA rejoining kinetics of human U-118 MG cells were studied after gamma-irradiation with 4 Gy. The analysis of the sealing rate of the induced DNA strand breaks was made with a modification of the DNA unwinding technique. The modification meant that rather than just monitoring the number of existing breaks at each time of analysis, the velocity, at which the rejoining process proceeded, was determined. Two apparent first-order components of single-strand break repair could be identified during the 25 min of analysis. The half-times for the two components were 1.9 and 16 min, respectively

  5. Activity of the rat osteocalcin basal promoter in osteoblastic cells is dependent upon homeodomain and CP1 binding motifs.

    Science.gov (United States)

    Towler, D A; Bennett, C D; Rodan, G A

    1994-05-01

    A detailed analysis of the transcriptional machinery responsible for osteoblast-specific gene expression should provide tools useful for understanding osteoblast commitment and differentiation. We have defined three cis-elements important for basal activity of the rat osteocalcin (OC) promoter, located at about -200 to -180, -170 to -138, and -121 to -64 relative to the transcription initiation site. A motif (TCTGATTGTGT) present in the region between -200 and -170 that binds a multisubunit CP1/NFY/CBF-like CAAT factor complex contributes significantly to high level basal activity and presumably functions as the CAAT box for the rat OC promoter. We show that the region -121 to 32 is sufficient to confer osteoblastic cell type specificity in transient transfection assays of cultured cell lines using luciferase as a reporter. The basal promoter is active in rodent osteoblastic cell lines, but not in rodent fibroblastic or muscle cell lines. Although the rat OC box (-100 to -74) contains a CAAT motif, we could not detect CP1-like CAAT factor binding to this region. In fact, we demonstrate that a Msx-1 (Hox 7.1) homeodomain binding motif (ACTAATTG; bottom strand) in the 3'-end of the rat OC box is necessary for high level activity of the rat OC basal promoter in osteoblastic cells. A nuclear factor that recognizes this motif appears to be present in osteoblastic ROS 17/2.8 cells, which produce OC, but not in fibroblastic ROS 25/1 cells, which fail to express OC. This ROS 17/2.8 nuclear factor also recognizes the A/T-rich DNA cognates of the homeodomain-containing POU family of transcription factors. Taken together, these data suggest that a ubiquitous CP1-like CAAT factor and a cell type-restricted homeodomain containing (Msx or POU family) transcription factor interact with the proximal rat OC promoter to direct appropriate basal OC transcription in osteoblastic cells.

  6. Insights into the motif preference of APOBEC3 enzymes.

    Directory of Open Access Journals (Sweden)

    Diako Ebrahimi

    Full Text Available We used a multivariate data analysis approach to identify motifs associated with HIV hypermutation by different APOBEC3 enzymes. The analysis showed that APOBEC3G targets G mainly within GG, TG, TGG, GGG, TGGG and also GGGT. The G nucleotides flanked by a C at the 3' end (in +1 and +2 positions were indicated as disfavoured targets by APOBEC3G. The G nucleotides within GGGG were found to be targeted at a frequency much less than what is expected. We found that the infrequent G-to-A mutation within GGGG is not limited to the inaccessibility, to APOBEC3, of poly Gs in the central and 3'polypurine tracts (PPTs which remain double stranded during the HIV reverse transcription. GGGG motifs outside the PPTs were also disfavoured. The motifs GGAG and GAGG were also found to be disfavoured targets for APOBEC3. The motif-dependent mutation of G within the HIV genome by members of the APOBEC3 family other than APOBEC3G was limited to GA→AA changes. The results did not show evidence of other types of context dependent G-to-A changes in the HIV genome.

  7. Cell-Free DNA in Metastatic Colorectal Cancer: A Systematic Review and Meta-Analysis.

    Science.gov (United States)

    Spindler, Karen-Lise G; Boysen, Anders K; Pallisgård, Niels; Johansen, Julia S; Tabernero, Josep; Sørensen, Morten M; Jensen, Benny V; Hansen, Torben F; Sefrioui, David; Andersen, Rikke F; Brandslund, Ivan; Jakobsen, Anders

    2017-09-01

    Circulating DNA can be detected and quantified in the blood of cancer patients and used for detection of tumor-specific genetic alterations. The clinical utility has been intensively investigated for the past 10 years. The majority of reports focus on analyzing the clinical potential of tumor-specific mutations, whereas the use of total cell-free DNA (cfDNA) quantification is somehow controversial and sparsely described in the literature, but holds important clinical information in itself. The purpose of the present report was to present a systematic review and meta-analysis of the prognostic value of total cfDNA in patients with metastatic colorectal cancer (mCRC) treated with chemotherapy. In addition, we report on the overall performance of cfDNA as source for KRAS mutation detection. A systematic literature search of PubMed and Embase was performed by two independent investigators. Eligibility criteria were (a) total cfDNA analysis, (b) mCRC, and (c) prognostic value during palliative treatment. The preferred reporting items for systematic reviews and meta-analyses (PRISMA) guidelines were followed, and meta-analysis applied on both aggregate data extraction and individual patients' data. Ten eligible cohorts were identified, including a total of 1,076 patients. Seven studies used quantitative polymerase chain reaction methods, two BEAMing [beads, emulsification, amplification, and magnetics] technology, and one study digital droplet polymerase chain reaction. The baseline levels of cfDNA was similar in the presented studies, and all studies reported a clear prognostic value in favor of patients with lowest levels of baseline cfDNA. A meta-analysis revealed a combined estimate of favorable overall survival hazard ratio (HR) in patients with levels below the median cfDNA (HR = 2.39, 95% confidence interval 2.03-2.82, p  meta-analysis. Reliable prognostic markers could help to guide patients and treating physicians regarding the relevance and choice of

  8. High-resolution analysis of cytosine methylation in ancient DNA.

    Directory of Open Access Journals (Sweden)

    Bastien Llamas

    Full Text Available Epigenetic changes to gene expression can result in heritable phenotypic characteristics that are not encoded in the DNA itself, but rather by biochemical modifications to the DNA or associated chromatin proteins. Interposed between genes and environment, these epigenetic modifications can be influenced by environmental factors to affect phenotype for multiple generations. This raises the possibility that epigenetic states provide a substrate for natural selection, with the potential to participate in the rapid adaptation of species to changes in environment. Any direct test of this hypothesis would require the ability to measure epigenetic states over evolutionary timescales. Here we describe the first single-base resolution of cytosine methylation patterns in an ancient mammalian genome, by bisulphite allelic sequencing of loci from late Pleistocene Bison priscus remains. Retrotransposons and the differentially methylated regions of imprinted loci displayed methylation patterns identical to those derived from fresh bovine tissue, indicating that methylation patterns are preserved in the ancient DNA. Our findings establish the biochemical stability of methylated cytosines over extensive time frames, and provide the first direct evidence that cytosine methylation patterns are retained in DNA from ancient specimens. The ability to resolve cytosine methylation in ancient DNA provides a powerful means to study the role of epigenetics in evolution.

  9. Comparative analysis of the full genome sequence of European bat lyssavirus type 1 and type 2 with other lyssaviruses and evidence for a conserved transcription termination and polyadenylation motif in the G-L 3' non-translated region.

    Science.gov (United States)

    Marston, D A; McElhinney, L M; Johnson, N; Müller, T; Conzelmann, K K; Tordo, N; Fooks, A R

    2007-04-01

    We report the first full-length genomic sequences for European bat lyssavirus type-1 (EBLV-1) and type-2 (EBLV-2). The EBLV-1 genomic sequence was derived from a virus isolated from a serotine bat in Hamburg, Germany, in 1968 and the EBLV-2 sequence was derived from a virus isolate from a human case of rabies that occurred in Scotland in 2002. A long-distance PCR strategy was used to amplify the open reading frames (ORFs), followed by standard and modified RACE (rapid amplification of cDNA ends) techniques to amplify the 3' and 5' ends. The lengths of each complete viral genome for EBLV-1 and EBLV-2 were 11 966 and 11 930 base pairs, respectively, and follow the standard rhabdovirus genome organization of five viral proteins. Comparison with other lyssavirus sequences demonstrates variation in degrees of homology, with the genomic termini showing a high degree of complementarity. The nucleoprotein was the most conserved, both intra- and intergenotypically, followed by the polymerase (L), matrix and glyco- proteins, with the phosphoprotein being the most variable. In addition, we have shown that the two EBLVs utilize a conserved transcription termination and polyadenylation (TTP) motif, approximately 50 nt upstream of the L gene start codon. All available lyssavirus sequences to date, with the exception of Pasteur virus (PV) and PV-derived isolates, use the second TTP site. This observation may explain differences in pathogenicity between lyssavirus strains, dependent on the length of the untranslated region, which might affect transcriptional activity and RNA stability.

  10. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2011-01-01

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network

  11. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2018-03-11

    Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core, and reveals the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.

  12. iFORM: Incorporating Find Occurrence of Regulatory Motifs.

    Science.gov (United States)

    Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

    2016-01-01

    Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.

  13. High-Throughput Analysis of T-DNA Location and Structure Using Sequence Capture.

    Directory of Open Access Journals (Sweden)

    Soichi Inagaki

    Full Text Available Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA-genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously, using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. Our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.

  14. High-Throughput Analysis of T-DNA Location and Structure Using Sequence Capture.

    Science.gov (United States)

    Inagaki, Soichi; Henry, Isabelle M; Lieberman, Meric C; Comai, Luca

    2015-01-01

    Agrobacterium-mediated transformation of plants with T-DNA is used both to introduce transgenes and for mutagenesis. Conventional approaches used to identify the genomic location and the structure of the inserted T-DNA are laborious and high-throughput methods using next-generation sequencing are being developed to address these problems. Here, we present a cost-effective approach that uses sequence capture targeted to the T-DNA borders to select genomic DNA fragments containing T-DNA-genome junctions, followed by Illumina sequencing to determine the location and junction structure of T-DNA insertions. Multiple probes can be mixed so that transgenic lines transformed with different T-DNA types can be processed simultaneously, using a simple, index-based pooling approach. We also developed a simple bioinformatic tool to find sequence read pairs that span the junction between the genome and T-DNA or any foreign DNA. We analyzed 29 transgenic lines of Arabidopsis thaliana, each containing inserts from 4 different T-DNA vectors. We determined the location of T-DNA insertions in 22 lines, 4 of which carried multiple insertion sites. Additionally, our analysis uncovered a high frequency of unconventional and complex T-DNA insertions, highlighting the needs for high-throughput methods for T-DNA localization and structural characterization. Transgene insertion events have to be fully characterized prior to use as commercial products. Our method greatly facilitates the first step of this characterization of transgenic plants by providing an efficient screen for the selection of promising lines.

  15. Spectral analysis of naturally occurring methylxanthines (theophylline, theobromine and caffeine) binding with DNA.

    Science.gov (United States)

    Johnson, Irudayam Maria; Prakash, Halan; Prathiba, Jeyaguru; Raghunathan, Raghavachary; Malathi, Raghunathan

    2012-01-01

    Nucleic acids exist in a dynamic equilibrium with a number of molecules that constantly interact with them and regulate the cellular activities. The inherent nature of the structure and conformational integrity of these macromolecules can lead to altered biological activity through proper targeting of nucleic acids binding ligands or drug molecules. We studied the interaction of naturally occurring methylxanthines such as theophylline, theobromine and caffeine with DNA, using UV absorption and Fourier transform infrared (FTIR) spectroscopic methods, and especially monitored their binding affinity in the presence of Mg(2+) and during helix-coil transitions of DNA by temperature (T(m)) or pH melting profiles. The study indicates that all these molecules effectively bind to DNA in a dose dependent manner. The overall binding constants of DNA-theophylline = 3.5×10(3) M(-1), DNA-theobromine = 1.1×10(3) M(-1), and DNA-Caffeine = 3.8×10(3) M(-1). On the other hand T(m)/pH melting profiles showed 24-35% of enhanced binding activity of methylxanthines during helix-coil transitions of DNA rather than to its native double helical structure. The FTIR analysis divulged that theophylline, theobromine and caffeine interact with all the base pairs of DNA (A-T; G-C) and phosphate group through hydrogen bond (H-bond) interaction. In the presence of Mg(2+), methylxanthines altered the structure of DNA from B to A-family. However, the B-family structure of DNA remained unaltered in DNA-methylxanthines complexes or in the absence of Mg(2+). The spectral analyses indicated the order of binding affinity as "caffeine≥theophylline>theobromine" to the native double helical DNA, and "theophylline≥theobromine>caffeine to the denatured form of DNA and in the presence of divalent metal ions.

  16. Spectral analysis of naturally occurring methylxanthines (theophylline, theobromine and caffeine binding with DNA.

    Directory of Open Access Journals (Sweden)

    Irudayam Maria Johnson

    Full Text Available Nucleic acids exist in a dynamic equilibrium with a number of molecules that constantly interact with them and regulate the cellular activities. The inherent nature of the structure and conformational integrity of these macromolecules can lead to altered biological activity through proper targeting of nucleic acids binding ligands or drug molecules. We studied the interaction of naturally occurring methylxanthines such as theophylline, theobromine and caffeine with DNA, using UV absorption and Fourier transform infrared (FTIR spectroscopic methods, and especially monitored their binding affinity in the presence of Mg(2+ and during helix-coil transitions of DNA by temperature (T(m or pH melting profiles. The study indicates that all these molecules effectively bind to DNA in a dose dependent manner. The overall binding constants of DNA-theophylline = 3.5×10(3 M(-1, DNA-theobromine = 1.1×10(3 M(-1, and DNA-Caffeine = 3.8×10(3 M(-1. On the other hand T(m/pH melting profiles showed 24-35% of enhanced binding activity of methylxanthines during helix-coil transitions of DNA rather than to its native double helical structure. The FTIR analysis divulged that theophylline, theobromine and caffeine interact with all the base pairs of DNA (A-T; G-C and phosphate group through hydrogen bond (H-bond interaction. In the presence of Mg(2+, methylxanthines altered the structure of DNA from B to A-family. However, the B-family structure of DNA remained unaltered in DNA-methylxanthines complexes or in the absence of Mg(2+. The spectral analyses indicated the order of binding affinity as "caffeine≥theophylline>theobromine" to the native double helical DNA, and "theophylline≥theobromine>caffeine to the denatured form of DNA and in the presence of divalent metal ions.

  17. Cloning and Expression Analysis of a Giant Gourami Vasa-Like cDNA

    Directory of Open Access Journals (Sweden)

    ALIMUDDIN

    2011-09-01

    Full Text Available Molecular marker is useful in the development of testicular cells transplantation for detecting donor-derived germ cells in the recipient gonad. In this study, a giant gourami (Osphronemus goramy vasa-like gene (GgVLG was cloned and characterized for use as a molecular marker for germ cells in this species. Nucleotide sequence analysis revealed that GgVLG comprises 2,340 bps with an open reading frame of 1,962 bps encoding 653 amino acids. The deduced amino acid sequence contained 17 arginine-glycine or arginine-glycine-glycine motifs and eight conserved motifs belonging to the DEAD-box protein family. The GgVLG sequence showed high similarity to Drosophila vasa, common carp vasa homolog and tilapia vasa homolog for 66.2, 85.9, and 90.7%, respectively. In adult tissues, the GgVLG transcripts were specifically detected in ovary and testis. In situ hybridization analysis showed that GgVLG mRNA was detected in oocytes of the ovary and spermatogonia of the testis. There was no signal detected in the spermatocytes, spermatids and other gonadal somatic cells. Thus, consensus sequences, specific localization of GgVLG mRNA in the germ cells, amino acid sequence similarity and phylogenic analysis all suggest that GgVLG is the giant gourami vasa-like gene. Further, GgVLG can be used as a molecular marker for giant gourami germ cells.

  18. Impedance analysis of DNA and DNA-drug interactions on thin mercury film electrodes

    Czech Academy of Sciences Publication Activity Database

    Hasoň, Stanislav; Dvořák, Jakub; Jelen, František; Vetterl, Vladimír

    2002-01-01

    Roč. 32, č. 2 (2002), s. 167-179 ISSN 1040-8347 R&D Projects: GA AV ČR IAA4004901; GA AV ČR IAA4004002; GA AV ČR IBS5004107 Grant - others:GA FRVŠ(XC) G40583; GA FRVŠ(XC) F40564 Institutional research plan: CEZ:AV0Z5004920 Keywords : electrochemical impedance spectroscopy * intercalators * DNA at electrode surface Subject RIV: BO - Biophysics Impact factor: 2.074, year: 2002

  19. Genome-wide DNA methylation patterns and transcription analysis in sheep muscle.

    Directory of Open Access Journals (Sweden)

    Christine Couldrey

    Full Text Available DNA methylation plays a central role in regulating many aspects of growth and development in mammals through regulating gene expression. The development of next generation sequencing technologies have paved the way for genome-wide, high resolution analysis of DNA methylation landscapes using methodology known as reduced representation bisulfite sequencing (RRBS. While RRBS has proven to be effective in understanding DNA methylation landscapes in humans, mice, and rats, to date, few studies have utilised this powerful method for investigating DNA methylation in agricultural animals. Here we describe the utilisation of RRBS to investigate DNA methylation in sheep Longissimus dorsi muscles. RRBS analysis of ∼1% of the genome from Longissimus dorsi muscles provided data of suitably high precision and accuracy for DNA methylation analysis, at all levels of resolution from genome-wide to individual nucleotides. Combining RRBS data with mRNAseq data allowed the sheep Longissimus dorsi muscle methylome to be compared with methylomes from other species. While some species differences were identified, many similarities were observed between DNA methylation patterns in sheep and other more commonly studied species. The RRBS data presented here highlights the complexity of epigenetic regulation of genes. However, the similarities observed across species are promising, in that knowledge gained from epigenetic studies in human and mice may be applied, with caution, to agricultural species. The ability to accurately measure DNA methylation in agricultural animals will contribute an additional layer of information to the genetic analyses currently being used to maximise production gains in these species.

  20. Product differentiation by analysis of DNA melting curves during the polymerase chain reaction.

    Science.gov (United States)

    Ririe, K M; Rasmussen, R P; Wittwer, C T

    1997-02-15

    A microvolume fluorometer integrated with a thermal cycler was used to acquire DNA melting curves during polymerase chain reaction by fluorescence monitoring of the double-stranded DNA specific dye SYBR Green I. Plotting fluorescence as a function of temperature as the thermal cycler heats through the dissociation temperature of the product gives a DNA melting curve. The shape and position of this DNA melting curve are functions of the GC/AT ratio, length, and sequence and can be used to differentiate amplification products separated by less than 2 degrees C in melting temperature. Desired products can be distinguished from undesirable products, in many cases eliminating the need for gel electrophoresis. Analysis of melting curves can extend the dynamic range of initial template quantification when amplification is monitored with double-stranded DNA specific dyes. Complete amplification and analysis of products can be performed in less than 15 min.

  1. OPTSDNA: Performance evaluation of an efficient distributed bioinformatics system for DNA sequence analysis.

    Science.gov (United States)

    Khan, Mohammad Ibrahim; Sheel, Chotan

    2013-01-01

    Storage of sequence data is a big concern as the amount of data generated is exponential in nature at several locations. Therefore, there is a need to develop techniques to store data using compression algorithm. Here we describe optimal storage algorithm (OPTSDNA) for storing large amount of DNA sequences of varying length. This paper provides performance analysis of optimal storage algorithm (OPTSDNA) of a distributed bioinformatics computing system for analysis of DNA sequences. OPTSDNA algorithm is used for storing various sizes of DNA sequences into database. DNA sequences of different lengths were stored by using this algorithm. These input DNA sequences are varied in size from very small to very large. Storage size is calculated by this algorithm. Response time is also calculated in this work. The efficiency and performance of the algorithm is high (in size calculation with percentage) when compared with other known with sequential approach.

  2. Detection of dopamine in dopaminergic cell using nanoparticles-based barcode DNA analysis.

    Science.gov (United States)

    An, Jeung Hee; Kim, Tae-Hyung; Oh, Byung-Keun; Choi, Jeong Woo

    2012-01-01

    Nanotechnology-based bio-barcode-amplification analysis may be an innovative approach to dopamine detection. In this study, we evaluated the efficacy of this bio-barcode DNA method in detecting dopamine from dopaminergic cells. Herein, a combination DNA barcode and bead-based immunoassay for neurotransmitter detection with PCR-like sensitivity is described. This method relies on magnetic nanoparticles with antibodies and nanoparticles that are encoded with DNA, and antibodies that can sandwich the target protein captured by the nanoparticle-bound antibodies. The aggregate sandwich structures are magnetically separated from solution, and treated in order to remove the conjugated barcode DNA. The DNA barcodes were then identified via PCR analysis. The dopamine concentration in dopaminergic cells can be readily and rapidly detected via the bio-barcode assay method. The bio-barcode assay method is, therefore, a rapid and high-throughput screening tool for the detection of neurotransmitters such as dopamine.

  3. Importance of the efficiency of double-stranded DNA formation in cDNA synthesis for the imprecision of microarray expression analysis.

    Science.gov (United States)

    Thormar, Hans G; Gudmundsson, Bjarki; Eiriksdottir, Freyja; Kil, Siyoen; Gunnarsson, Gudmundur H; Magnusson, Magnus Karl; Hsu, Jason C; Jonsson, Jon J

    2013-04-01

    The causes of imprecision in microarray expression analysis are poorly understood, limiting the use of this technology in molecular diagnostics. Two-dimensional strandness-dependent electrophoresis (2D-SDE) separates nucleic acid molecules on the basis of length and strandness, i.e., double-stranded DNA (dsDNA), single-stranded DNA (ssDNA), and RNA·DNA hybrids. We used 2D-SDE to measure the efficiency of cDNA synthesis and its importance for the imprecision of an in vitro transcription-based microarray expression analysis. The relative amount of double-stranded cDNA formed in replicate experiments that used the same RNA sample template was highly variable, ranging between 0% and 72% of the total DNA. Microarray experiments showed an inverse relationship between the difference between sample pairs in probe variance and the relative amount of dsDNA. Approximately 15% of probes showed between-sample variation (P cDNA synthesized can be an important component of the imprecision in T7 RNA polymerase-based microarray expression analysis. © 2013 American Association for Clinical Chemistry

  4. Efficient sequential and parallel algorithms for planted motif search.

    Science.gov (United States)

    Nicolae, Marius; Rajasekaran, Sanguthevar

    2014-01-31

    Motif searching is an important step in the detection of rare events occurring in a set of DNA or protein sequences. One formulation of the problem is known as (l,d)-motif search or Planted Motif Search (PMS). In PMS we are given two integers l and d and n biological sequences. We want to find all sequences of length l that appear in each of the input sequences with at most d mismatches. The PMS problem is NP-complete. PMS algorithms are typically evaluated on certain instances considered challenging. Despite ample research in the area, a considerable performance gap exists because many state of the art algorithms have large runtimes even for moderately challenging instances. This paper presents a fast exact parallel PMS algorithm called PMS8. PMS8 is the first algorithm to solve the challenging (l,d) instances (25,10) and (26,11). PMS8 is also efficient on instances with larger l and d such as (50,21). We include a comparison of PMS8 with several state of the art algorithms on multiple problem instances. This paper also presents necessary and sufficient conditions for 3 l-mers to have a common d-neighbor. The program is freely available at http://engr.uconn.edu/~man09004/PMS8/. We present PMS8, an efficient exact algorithm for Planted Motif Search. PMS8 introduces novel ideas for generating common neighborhoods. We have also implemented a parallel version for this algorithm. PMS8 can solve instances not solved by any previous algorithms.

  5. Forensic analysis of mitochondrial DNA hypervariable region HVII ...

    African Journals Online (AJOL)

    aghomotsegin

    2015-02-04

    Feb 4, 2015 ... as even species thought to be closely related may in time accumulate ... have attracted the interest of population geneticists (Al-. Zahery et al. ... portion of DNA was amplified in two primers: the first one is HVIII-F. (438-459) ...

  6. Infectivity analysis of two variable DNA B components of Mungbean

    Indian Academy of Sciences (India)

    infecting MYMV, also shared the 3-nt deletion in the first iteron besides having an 18-nt insertion between the third iteron and the conserved nonanucleotide. MYMV was found to be closely related to KA27 DNA B in amino acid sequence identity of ...

  7. DNA methylation and genetic diversity analysis of genus Cycas in ...

    African Journals Online (AJOL)

    mallory

    2012-01-12

    Jan 12, 2012 ... elucidate the role of epigenetics in the genetic diversity of these plants. MATERIALS AND METHODS. Plant materials and DNA extraction. 66 Cycas samples consisting of 10 species and one subspecies were collected from the Nong Nooch Tropical Garden, Chonburi province, Thailand. For each species ...

  8. Analysis of repetitive DNA in chromosomes by flow cytometry

    NARCIS (Netherlands)

    Brind'Amour, Julie; Lansdorp, Peter M.

    We developed a flow cytometry method, chromosome flow fluorescence in situ hybridization (FISH), called CFF, to analyze repetitive DNA in chromosomes using FISH with directly labeled peptide nucleic acid (PNA) probes. We used CFF to measure the abundance of interstitial telomeric sequences in

  9. DNA methylation and genetic diversity analysis of genus Cycas in ...

    African Journals Online (AJOL)

    10 Cycas species as well as one subspecies localized in Thailand were studied using the methylation sensitive amplification polymorphism (MSAP) technique. 11 MSAP primer combinations were used and 720 MSAP bands were generated. The percentages of DNA methylation estimated from MSAP fingerprints were in ...

  10. Influence of DNA treatments on Southern blot hybridization analysis ...

    African Journals Online (AJOL)

    STORAGESEVER

    2008-06-03

    Jun 3, 2008 ... DNA samples obtained by a non-phenol/chloroform isolation method, from three races of Fusarium oxysporum f. sp. lycopersici ... Key words: Fusarium oxysporum, DIG-IGS Probe, Southern hybridization. INTRODUCTION .... Detection of Fusarium spp in plants with monoclonal antibody. Ann. Phytopathol.

  11. DNA sequence and prokaryotic expression analysis of vitellogenin ...

    African Journals Online (AJOL)

    In this study, the DNA sequence of vitellogenin from Antheraea pernyi (Ap-Vg) was identified and its functional domain (30-740 aa, Ap-Vg-1) was expressed in Escherichia coli BL21 (DE3) cells. The recombinant Ap-Vg-1 proteins were purified and used for antibody preparation. The results showed that the intact DNA ...

  12. Flow cytometric DNA ploidy analysis of ovarian granulosa cell tumors

    NARCIS (Netherlands)

    D. Chadha; C.J. Cornelisse; A. Schabert (A.)

    1990-01-01

    textabstractAbstract The nuclear DNA content of 50 ovarian tumors initially diagnosed as granulosa cell tumors was measured by flow cytometry using paraffin-embedded archival material. The follow-up period of the patients ranged from 4 months to 19 years. Thirty-eight tumors were diploid or

  13. Optimization of DNA isolation and PCR protocol for RAPD analysis ...

    African Journals Online (AJOL)

    hope&shola

    The method involves a modified CTAB extraction employing polyvinyl ... The technique is ideal for isolation of DNA from different plant species and .... The tubes were incubated at 65°C in hot air oven or water bath for 60-90 min with intermittent shaking and .... permission to collect germ plasm Financial assistance (to.

  14. [Cover motifs of the Tidsskrift. A 14-year cavalcade].

    Science.gov (United States)

    Nylenna, M

    1998-12-10

    In 1985 the Journal of the Norwegian Medical Association changed its cover policy, moving the table of contents inside the Journal and introducing cover illustrations. This article provides an analysis of all cover illustrations published over this 14-year period, 420 covers in all. There is a great variation in cover motifs and designs and a development towards more general motifs. The initial emphasis on historical and medical aspects is now less pronounced, while the use of works of art and nature motifs has increased, and the cover now more often has a direct bearing on the specific contents of the issue. Professor of medical history Oivind Larsen has photographed two thirds of the covers and contributed 95% of the inside essay-style reflections on the cover motif. Over the years, he has expanded the role of the historian of medicine disseminating knowledge to include that of the raconteur with a personal tone of voice. The Journal's covers are now one of its most characteristic features, emblematic of the Journal's ambition of standing for quality and timelessness vis-à-vis the news media, and of its aim of bridging the gap between medicine and the humanities.

  15. Optimization of DNA extraction for RAPD and ISSR analysis of Arbutus unedo L. Leaves.

    Science.gov (United States)

    Sá, Olga; Pereira, José Alberto; Baptista, Paula

    2011-01-01

    Genetic analysis of plants relies on high yields of pure DNA. For the strawberry tree (Arbutus unedo) this represents a great challenge since leaves can accumulate large amounts of polysaccharides, polyphenols and secondary metabolites, which co-purify with DNA. For this specie, standard protocols do not produce efficient yields of high-quality amplifiable DNA. Here, we present for the first time an improved leaf-tissue protocol, based on the standard cetyl trimethyl ammonium bromide protocol, which yields large amounts of high-quality amplifiable DNA. Key steps in the optimized protocol are the addition of antioxidant compounds-namely polyvinyl pyrrolidone (PVP), 1,4-dithiothreitol (DTT) and 2-mercaptoethanol, in the extraction buffer; the increasing of CTAB (3%, w/v) and sodium chloride (2M) concentration; and an extraction with organic solvents (phenol and chloroform) with the incubation of samples on ice. Increasing the temperature for cell lyses to 70 °C also improved both DNA quality and yield. The yield of DNA extracted was 200.0 ± 78.0 μg/μL and the purity, evaluated by the ratio A(260)/A(280), was 1.80 ± 0.021, indicative of minimal levels of contaminating metabolites. The quality of the DNA isolated was confirmed by random amplification polymorphism DNA and by inter-simple sequence repeat amplification, proving that the DNA can be amplified via PCR.

  16. Optimization of DNA Extraction for RAPD and ISSR Analysis of Arbutus unedo L. Leaves

    Directory of Open Access Journals (Sweden)

    Paula Baptista

    2011-06-01

    Full Text Available Genetic analysis of plants relies on high yields of pure DNA. For the strawberry tree (Arbutus unedo this represents a great challenge since leaves can accumulate large amounts of polysaccharides, polyphenols and secondary metabolites, which co-purify with DNA. For this specie, standard protocols do not produce efficient yields of high-quality amplifiable DNA. Here, we present for the first time an improved leaf-tissue protocol, based on the standard cetyl trimethyl ammonium bromide protocol, which yields large amounts of high-quality amplifiable DNA. Key steps in the optimized protocol are the addition of antioxidant compounds—namely polyvinyl pyrrolidone (PVP, 1,4-dithiothreitol (DTT and 2-mercaptoethanol, in the extraction buffer; the increasing of CTAB (3%, w/v and sodium chloride (2M concentration; and an extraction with organic solvents (phenol and chloroform with the incubation of samples on ice. Increasing the temperature for cell lyses to 70 °C also improved both DNA quality and yield. The yield of DNA extracted was 200.0 ± 78.0 µg/µL and the purity, evaluated by the ratio A260/A280, was 1.80 ± 0.021, indicative of minimal levels of contaminating metabolites. The quality of the DNA isolated was confirmed by random amplification polymorphism DNA and by inter-simple sequence repeat amplification, proving that the DNA can be amplified via PCR.

  17. Quantitative analysis of gene-specific DNA damage in human spermatozoa

    International Nuclear Information System (INIS)

    Sawyer, Dennis E.; Mercer, Belinda G.; Wiklendt, Agnieszka M.; Aitken, R. John

    2003-01-01

    Recent studies have suggested that human spermatozoa are highly susceptible to DNA damage induced by oxidative stress. However, a detailed analysis of the precise nature of this damage and the extent to which it affects the mitochondrial and nuclear genomes has not been reported. To induce DNA damage, human spermatozoa were treated in vitro with hydrogen peroxide (H 2 O 2 ; 0-5 mM) or iron (as Fe(II)SO 4 , 0-500 μM). Quantitative PCR (QPCR) was used to measure DNA damage in individual nuclear genes (hprt, β-pol and β-globin) and mitochondrial DNA. Single strand breaks were also assessed by alkaline gel electrophoresis. H 2 O 2 was found to be genotoxic toward spermatozoa at concentrations as high as 1.25 mM, but DNA damage was not detected in these cells with lower concentrations of H 2 O 2 . The mitochondrial genome of human spermatozoa was significantly (P 2 O 2 -induced DNA damage than the nuclear genome. However, both nDNA and mtDNA in human spermatozoa were significantly (P<0.001) more resistant to damage than DNA from a variety of cell lines of germ cell and myoblastoid origin. Interestingly, significant DNA damage was also not detected in human spermatozoa treated with iron. These studies report, for the first time, quantitative measurements of DNA damage in specific genes of male germ cells, and challenge the commonly held belief that human spermatozoa are particularly vulnerable to DNA damage

  18. Use of FTA® classic cards for epigenetic analysis of sperm DNA.

    Science.gov (United States)

    Serra, Olga; Frazzi, Raffaele; Perotti, Alessio; Barusi, Lorenzo; Buschini, Annamaria

    2018-02-01

    FTA® technologies provide the most reliable method for DNA extraction. Although FTA technologies have been widely used for genetic analysis, there is no literature on their use for epigenetic analysis yet. We present for the first time, a simple method for quantitative methylation assessment based on sperm cells stored on Whatman FTA classic cards. Specifically, elution of seminal DNA from FTA classic cards was successfully tested with an elution buffer and an incubation step in a thermocycler. The eluted DNA was bisulfite converted, amplified by PCR, and a region of interest was pyrosequenced.

  19. Intrinsic Dynamics Analysis of a DNA Octahedron by Elastic Network Model

    Directory of Open Access Journals (Sweden)

    Guang Hu

    2017-01-01

    Full Text Available DNA is a fundamental component of living systems where it plays a crucial role at both functional and structural level. The programmable properties of DNA make it an interesting building block for the construction of nanostructures. However, molecular mechanisms for the arrangement of these well-defined DNA assemblies are not fully understood. In this paper, the intrinsic dynamics of a DNA octahedron has been investigated by using two types of Elastic Network Models (ENMs. The application of ENMs to DNA nanocages include the analysis of the intrinsic flexibilities of DNA double-helices and hinge sites through the calculation of the square fluctuations, as well as the intrinsic collective dynamics in terms of cross-collective map calculation coupled with global motions analysis. The dynamics profiles derived from ENMs have then been evaluated and compared with previous classical molecular dynamics simulation trajectories. The results presented here revealed that ENMs can provide useful insights into the intrinsic dynamics of large DNA nanocages and represent a useful tool in the field of structural DNA nanotechnology.

  20. Modulation of i-motif thermodynamic stability by the introduction of UNA (unlocked nucleic acid) monomers

    DEFF Research Database (Denmark)

    Pasternak, Anna; Wengel, Jesper

    2011-01-01

    The influence of acyclic RNA derivatives, UNA (unlocked nucleic acid) monomers, on i-DNA thermodynamic stability has been investigated. The 22 nt human telomeric fragment was chosen as the model sequence for stability studies. UNA monomers modulate i-motif stability in a position-depending manner...

  1. Microbial expression of proteins containing long repetitive Arg-Gly-Asp cell adhesive motifs created by overlap elongation PCR

    International Nuclear Information System (INIS)

    Kurihara, Hiroyuki; Shinkai, Masashige; Nagamune, Teruyuki

    2004-01-01

    We developed a novel method for creating repetitive DNA libraries using overlap elongation PCR, and prepared a DNA library encoding repetitive Arg-Gly-Asp (RGD) cell adhesive motifs. We obtained various length DNAs encoding repetitive RGD from a short monomer DNA (18 bp) after a thermal cyclic reaction without a DNA template for amplification, and isolated DNAs encoding 2, 21, and 43 repeats of the RGD motif. We cloned these DNAs into a protein expression vector and overexpressed them as thioredoxin fusion proteins: RGD2, RGD21, and RGD43, respectively. The solubility of RGD43 in water was low and it formed a fibrous precipitate in water. Scanning electron microscopy revealed that RGD43 formed a branched 3D-network structure in the solid state. To evaluate the function of the cell adhesive motifs in RGD43, mouse fibroblast cells were cultivated on the RGD43 scaffold. The fibroblast cells adhered to the RGD43 scaffold and extended long filopodia

  2. APOCALYPTIC MOTIFS IN THE CYCLE OF STORIES BY M.A. BULGAKOV «NOTES OF A YOUNG DOCTOR»

    Directory of Open Access Journals (Sweden)

    Evgeniy Igorevich Erokhov

    2015-10-01

    Full Text Available The motif analysis of a cycle of stories by M.A. Bulgakov «Notes of a Young Doctor» from the point of view of their apocalyptic problematics was first performed in this article. To identify apocalyptic motifs the method of motif analysis, developed by B.M. Gasparov, was used which will also help to prove the interpenetration of motifs in the cycle of stories. The result of the research work is the identification of apocalyptic motifs which are manifested in the experiences of the main character and the events taking place around him and passing through the prism of physician’s perception of the world. Our identified motifs show that the stories in the cycle are united not only thematically and with the help of the image of the main character, but with the help of the motifs which reflect interpenetration of apocalyptic motifs in the stories of one cycle. There are the following apocalyptic motifs in the cycle of stories by Bulgakov: diseases, darkness (as part of the landscape, resurrection from the dead and beast. They all belong to the biblical type which is allocated on the basis of the associative bond of these motifs with the biblical texts.

  3. Sequence-specific high mobility group box factors recognize 10-12-base pair minor groove motifs

    DEFF Research Database (Denmark)

    van Beest, M; Dooijes, D; van De Wetering, M

    2000-01-01

    Sequence-specific high mobility group (HMG) box factors bind and bend DNA via interactions in the minor groove. Three-dimensional NMR analyses have provided the structural basis for this interaction. The cognate HMG domain DNA motif is generally believed to span 6-8 bases. However, alignment...

  4. Analysis of UV-induced mutation spectra in Escherichia coli by DNA polymerase {eta} from Arabidopsis thaliana

    Energy Technology Data Exchange (ETDEWEB)

    Santiago, Maria Jesus [Departamento de Genetica, Facultad de Ciencias, Edificio Gregor Mendel, Campus Rabanales, Universidad de Cordoba (Spain); Alejandre-Duran, Encarna [Departamento de Genetica, Facultad de Ciencias, Edificio Gregor Mendel, Campus Rabanales, Universidad de Cordoba (Spain); Ruiz-Rubio, Manuel [Departamento de Genetica, Facultad de Ciencias, Edificio Gregor Mendel, Campus Rabanales, Universidad de Cordoba (Spain)]. E-mail: ge1rurum@uco.es

    2006-10-10

    DNA polymerase {eta} belongs to the Y-family of DNA polymerases, enzymes that are able to synthesize past template lesions that block replication fork progression. This polymerase accurately bypasses UV-associated cis-syn cyclobutane thymine dimers in vitro and therefore may contributes to resistance against sunlight in vivo, both ameliorating survival and decreasing the level of mutagenesis. We cloned and sequenced a cDNA from Arabidopsis thaliana which encodes a protein containing several sequence motifs characteristics of Pol{eta} homologues, including a highly conserved sequence reported to be present in the active site of the Y-family DNA polymerases. The gene, named AtPOLH, contains 14 exons and 13 introns and is expressed in different plant tissues. A strain from Saccharomyces cerevisiae, deficient in Pol{eta} activity, was transformed with a yeast expression plasmid containing the AtPOLH cDNA. The rate of survival to UV irradiation in the transformed mutant increased to similar values of the wild type yeast strain, showing that AtPOLH encodes a functional protein. In addition, when AtPOLH is expressed in Escherichia coli, a change in the mutational spectra is detected when bacteria are irradiated with UV light. This observation might indicate that AtPOLH could compete with DNA polymerase V and then bypass cyclobutane pyrimidine dimers incorporating two adenylates.

  5. Mitochondrial DNA analysis of two southern African elephant populations

    Directory of Open Access Journals (Sweden)

    M.F. Essop

    1996-08-01

    Full Text Available The modern view is that there are at most only two valid forms of the African elephant namely Loxodonta qfricana africana, the bush elephant, and L.a. cyclotis, the forest elephant (Ansell 1974; Meester et al. 1986. The Knysna elephant which was also described as a separate sub-species is now almost extinct. Plans to augment the remnant population by introducing other animals must take into account the taxonomic questions and issue of conserving elephant gene pools (Greig 1982a. Mitochondrial DNA (mtDNA restriction fragment-size comparisons were performed on specimens from the Kruger National Park and the Addo Elephant National Park. If the Addo population's results are extrapolated to the Knysna population, it may be concluded that there is no genetic evidence for the Kruger and Knysna elephant populations to be considered as different sub-species.

  6. Detection and size analysis of proteins with switchable DNA layers.

    Science.gov (United States)

    Rant, Ulrich; Pringsheim, Erika; Kaiser, Wolfgang; Arinaga, Kenji; Knezevic, Jelena; Tornow, Marc; Fujita, Shozo; Yokoyama, Naoki; Abstreiter, Gerhard

    2009-04-01

    We introduce a chip-compatible scheme for the label-free detection of proteins in real-time that is based on the electrically driven conformation switching of DNA oligonucleotides on metal surfaces. The switching behavior is a sensitive indicator for the specific recognition of IgG antibodies and antibody fragments, which can be detected in quantities of less than 10(-18) mol on the sensor surface. Moreover, we show how the dynamics of the induced molecular motion can be monitored by measuring the high-frequency switching response. When proteins bind to the layer, the increase in hydrodynamic drag slows the switching dynamics, which allows us to determine the size of the captured proteins. We demonstrate the identification of different antibody fragments by means of their kinetic fingerprint. The switchDNA method represents a generic approach to simultaneously detect and size target molecules using a single analytical platform.

  7. Diagnosis of becker muscular dystrophy: Results of Re-analysis of DNA samples.

    Science.gov (United States)

    Straathof, Chiara S M; Van Heusden, Dave; Ippel, Pieternella F; Post, Jan G; Voermans, Nicol C; De Visser, Marianne; Brusse, Esther; Van Den Bergen, Janneke C; Van Der Kooi, Anneke J; Verschuuren, Jan J G M; Ginjaar, Hendrika B

    2016-01-01

    The phenotype of Becker muscular dystrophy (BMD) is highly variable, and the disease may be underdiagnosed. We searched for new mutations in the DMD gene in a cohort of previously undiagnosed patients who had been referred in the period 1985-1995. All requests for DNA analysis of the DMD gene in probands with suspected BMD were re-evaluated. If the phenotype was compatible with BMD, and no deletions or duplications were detected, DNA samples were screened for small mutations. In 79 of 185 referrals, no mutation was found. Analysis could be performed on 31 DNA samples. Seven different mutations, including 3 novel ones, were found. Long-term clinical follow-up is described. Refining DNA analysis in previously undiagnosed cases can identify mutations in the DMD gene and provide genetic diagnosis of BMD. A delayed diagnosis can still be valuable for the proband or the relatives of BMD patients. © 2015 Wiley Periodicals, Inc.

  8. Hunting Motifs in Situla Art

    Directory of Open Access Journals (Sweden)

    Andrej Preložnik

    2013-07-01

    Full Text Available Situla art developed as an echo of the toreutic style which had spread from the Near East through the Phoenicians, Greeks and Etruscans as far as the Veneti, Raeti, Histri, and their eastern neighbours in the region of Dolenjska (Lower Carniola. An Early Iron Age phenomenon (c. 600—300 BC, it rep- resents the major and most arresting form of the contemporary visual arts in an area stretching from the foot of the Apennines in the south to the Drava and Sava rivers in the east. Indeed, individual pieces have found their way across the Alpine passes and all the way north to the Danube. In the world and art of the situlae, a prominent role is accorded to ani- mals. They are displayed in numerous representations of human activities on artefacts crafted in the classic situla style – that is, between the late 6th  and early 5th centuries BC – as passive participants (e.g. in pageants or in harness or as an active element of the situla narrative. The most typical example of the latter is the hunting scene. Today we know at least four objects decorat- ed exclusively with hunting themes, and a number of situlae and other larger vessels where hunting scenes are embedded in composite narratives. All this suggests a popularity unparallelled by any other genre. Clearly recognisable are various hunting techniques and weapons, each associated with a particu- lar type of game (Fig. 1. The chase of a stag with javelin, horse and hound is depicted on the long- familiar and repeatedly published fibula of Zagorje (Fig. 2. It displays a hound mauling the stag’s back and a hunter on horseback pursuing a hind, her neck already pierced by the javelin. To judge by the (so far unnoticed shaft end un- der the stag’s muzzle, the hunter would have been brandishing a second jave- lin as well, like the warrior of the Vače fibula or the rider of the Nesactium situla, presumably himself a hunter. Many parallels to his motif are known from Greece, Etruria, and

  9. Compilation and analysis of Escherichia coli promoter DNA sequences.

    OpenAIRE

    Hawley, D K; McClure, W R

    1983-01-01

    The DNA sequence of 168 promoter regions (-50 to +10) for Escherichia coli RNA polymerase were compiled. The complete listing was divided into two groups depending upon whether or not the promoter had been defined by genetic (promoter mutations) or biochemical (5' end determination) criteria. A consensus promoter sequence based on homologies among 112 well-defined promoters was determined that was in substantial agreement with previous compilations. In addition, we have tabulated 98 promoter ...

  10. A Bayesian deconvolution strategy for immunoprecipitation-based DNA methylome analysis

    Science.gov (United States)

    Down, Thomas A.; Rakyan, Vardhman K.; Turner, Daniel J.; Flicek, Paul; Li, Heng; Kulesha, Eugene; Gräf, Stefan; Johnson, Nathan; Herrero, Javier; Tomazou, Eleni M.; Thorne, Natalie P.; Bäckdahl, Liselotte; Herberth, Marlis; Howe, Kevin L.; Jackson, David K.; Miretti, Marcos M.; Marioni, John C.; Birney, Ewan; Hubbard, Tim J. P.; Durbin, Richard; Tavaré, Simon; Beck, Stephan

    2009-01-01

    DNA methylation is an indispensible epigenetic modification of mammalian genomes. Consequently there is great interest in strategies for genome-wide/whole-genome DNA methylation analysis, and immunoprecipitation-based methods have proven to be a powerful option. Such methods are rapidly shifting the bottleneck from data generation to data analysis, necessitating the development of better analytical tools. Until now, a major analytical difficulty associated with immunoprecipitation-based DNA methylation profiling has been the inability to estimate absolute methylation levels. Here we report the development of a novel cross-platform algorithm – Bayesian Tool for Methylation Analysis (Batman) – for analyzing Methylated DNA Immunoprecipitation (MeDIP) profiles generated using arrays (MeDIP-chip) or next-generation sequencing (MeDIP-seq). The latter is an approach we have developed to elucidate the first high-resolution whole-genome DNA methylation profile (DNA methylome) of any mammalian genome. MeDIP-seq/MeDIP-chip combined with Batman represent robust, quantitative, and cost-effective functional genomic strategies for elucidating the function of DNA methylation. PMID:18612301

  11. A DNA fingerprinting procedure for ultra high-throughput genetic analysis of insects.

    Science.gov (United States)

    Schlipalius, D I; Waldron, J; Carroll, B J; Collins, P J; Ebert, P R

    2001-12-01

    Existing procedures for the generation of polymorphic DNA markers are not optimal for insect studies in which the organisms are often tiny and background molecular information is often non-existent. We have used a new high throughput DNA marker generation protocol called randomly amplified DNA fingerprints (RAF) to analyse the genetic variability in three separate strains of the stored grain pest, Rhyzopertha dominica. This protocol is quick, robust and reliable even though it requires minimal sample preparation, minute amounts of DNA and no prior molecular analysis of the organism. Arbitrarily selected oligonucleotide primers routinely produced approximately 50 scoreable polymorphic DNA markers, between individuals of three independent field isolates of R. dominica. Multivariate cluster analysis using forty-nine arbitrarily selected polymorphisms generated from a single primer reliably separated individuals into three clades corresponding to their geographical origin. The resulting clades were quite distinct, with an average genetic difference of 37.5 +/- 6.0% between clades and of 21.0 +/- 7.1% between individuals within clades. As a prelude to future gene mapping efforts, we have also assessed the performance of RAF under conditions commonly used in gene mapping. In this analysis, fingerprints from pooled DNA samples accurately and reproducibly reflected RAF profiles obtained from individual DNA samples that had been combined to create the bulked samples.

  12. DnaA protein DNA-binding domain binds to Hda protein to promote inter-AAA+ domain interaction involved in regulatory inactivation of DnaA.

    Science.gov (United States)

    Keyamura, Kenji; Katayama, Tsutomu

    2011-08-19

    Chromosomal replication is initiated from the replication origin oriC in Escherichia coli by the active ATP-bound form of DnaA protein. The regulatory inactivation of DnaA (RIDA) system, a complex of the ADP-bound Hda and the DNA-loaded replicase clamp, represses extra initiations by facilitating DnaA-bound ATP hydrolysis, yielding the inactive ADP-bound form of DnaA. However, the mechanisms involved in promoting the DnaA-Hda interaction have not been determined except for the involvement of an interaction between the AAA+ domains of the two. This study revealed that DnaA Leu-422 and Pro-423 residues within DnaA domain IV, including a typical DNA-binding HTH motif, are specifically required for RIDA-dependent ATP hydrolysis in vitro and that these residues support efficient interaction with the DNA-loaded clamp·Hda complex and with Hda in vitro. Consistently, substitutions of these residues caused accumulation of ATP-bound DnaA in vivo and oriC-dependent inhibition of cell growth. Leu-422 plays a more important role in these activities than Pro-423. By contrast, neither of these residues is crucial for DNA replication from oriC, although they are highly conserved in DnaA orthologues. Structural analysis of a DnaA·Hda complex model suggested that these residues make contact with residues in the vicinity of the Hda AAA+ sensor I that participates in formation of a nucleotide-interacting surface. Together, the results show that functional DnaA-Hda interactions require a second interaction site within DnaA domain IV in addition to the AAA+ domain and suggest that these interactions are crucial for the formation of RIDA complexes that are active for DnaA-ATP hydrolysis.

  13. DnaA Protein DNA-binding Domain Binds to Hda Protein to Promote Inter-AAA+ Domain Interaction Involved in Regulatory Inactivation of DnaA*

    Science.gov (United States)

    Keyamura, Kenji; Katayama, Tsutomu

    2011-01-01

    Chromosomal replication is initiated from the replication origin oriC in Escherichia coli by the active ATP-bound form of DnaA protein. The regulatory inactivation of DnaA (RIDA) system, a complex of the ADP-bound Hda and the DNA-loaded replicase clamp, represses extra initiations by facilitating DnaA-bound ATP hydrolysis, yielding the inactive ADP-bound form of DnaA. However, the mechanisms involved in promoting the DnaA-Hda interaction have not been determined except for the involvement of an interaction between the AAA+ domains of the two. This study revealed that DnaA Leu-422 and Pro-423 residues within DnaA domain IV, including a typical DNA-binding HTH motif, are specifically required for RIDA-dependent ATP hydrolysis in vitro and that these residues support efficient interaction with the DNA-loaded clamp·Hda complex and with Hda in vitro. Consistently, substitutions of these residues caused accumulation of ATP-bound DnaA in vivo and oriC-dependent inhibition of cell growth. Leu-422 plays a more important role in these activities than Pro-423. By contrast, neither of these residues is crucial for DNA replication from oriC, although they are highly conserved in DnaA orthologues. Structural analysis of a DnaA·Hda complex model suggested that these residues make contact with residues in the vicinity of the Hda AAA+ sensor I that participates in formation of a nucleotide-interacting surface. Together, the results show that functional DnaA-Hda interactions require a second interaction site within DnaA domain IV in addition to the AAA+ domain and suggest that these interactions are crucial for the formation of RIDA complexes that are active for DnaA-ATP hydrolysis. PMID:21708944

  14. Quantitative analysis of TALE-DNA interactions suggests polarity effects.

    Science.gov (United States)

    Meckler, Joshua F; Bhakta, Mital S; Kim, Moon-Soo; Ovadia, Robert; Habrian, Chris H; Zykovich, Artem; Yu, Abigail; Lockwood, Sarah H; Morbitzer, Robert; Elsäesser, Janett; Lahaye, Thomas; Segal, David J; Baldwin, Enoch P

    2013-04-01

    Transcription activator-like effectors (TALEs) have revolutionized the field of genome engineering. We present here a systematic assessment of TALE DNA recognition, using quantitative electrophoretic mobility shift assays and reporter gene activation assays. Within TALE proteins, tandem 34-amino acid repeats recognize one base pair each and direct sequence-specific DNA binding through repeat variable di-residues (RVDs). We found that RVD choice can affect affinity by four orders of magnitude, with the relative RVD contribution in the order NG > HD ≈ NN > NI > NK. The NN repeat preferred the base G over A, whereas the NK repeat bound G with 10(3)-fold lower affinity. We compared AvrBs3, a naturally occurring TALE that recognizes its target using some atypical RVD-base combinations, with a designed TALE that precisely matches 'standard' RVDs with the target bases. This comparison revealed unexpected differences in sensitivity to substitutions of the invariant 5'-T. Another surprising observation was that base mismatches at the 5' end of the target site had more disruptive effects on affinity than those at the 3' end, particularly in designed TALEs. These results provide evidence that TALE-DNA recognition exhibits a hitherto un-described polarity effect, in which the N-terminal repeats contribute more to affinity than C-terminal ones.

  15. Ancient DNA analysis identifies marine mollusc shells as new metagenomic archives of the past

    DEFF Research Database (Denmark)

    Der Sarkissian, Clio; Pichereau, Vianney; Dupont, Catherine

    2017-01-01

    Marine mollusc shells enclose a wealth of information on coastal organisms and their environment. Their life history traits as well as (palaeo-) environmental conditions, including temperature, food availability, salinity and pollution, can be traced through the analysis of their shell (micro...... extraction, high-throughput shotgun DNA sequencing and metagenomic analyses to marine mollusc shells spanning the last ~7,000 years. We report successful DNA extraction from shells, including a variety of ancient specimens, and find that DNA recovery is highly dependent on their biomineral structure......, carbonate layer preservation and disease state. We demonstrate positive taxonomic identification of mollusc species using a combination of mitochondrial DNA genomes, barcodes, genome-scale data and metagenomic approaches. We also find shell biominerals to contain a diversity of microbial DNA from the marine...

  16. Improved i-motif thermal stability by insertion of anthraquinone monomers

    DEFF Research Database (Denmark)

    Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

    2017-01-01

    In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...

  17. Global DNA methylation analysis using methyl-sensitive amplification polymorphism (MSAP).

    Science.gov (United States)

    Yaish, Mahmoud W; Peng, Mingsheng; Rothstein, Steven J

    2014-01-01

    DNA methylation is a crucial epigenetic process which helps control gene transcription activity in eukaryotes. Information regarding the methylation status of a regulatory sequence of a particular gene provides important knowledge of this transcriptional control. DNA methylation can be detected using several methods, including sodium bisulfite sequencing and restriction digestion using methylation-sensitive endonucleases. Methyl-Sensitive Amplification Polymorphism (MSAP) is a technique used to study the global DNA methylation status of an organism and hence to distinguish between two individuals based on the DNA methylation status determined by the differential digestion pattern. Therefore, this technique is a useful method for DNA methylation mapping and positional cloning of differentially methylated genes. In this technique, genomic DNA is first digested with a methylation-sensitive restriction enzyme such as HpaII, and then the DNA fragments are ligated to adaptors in order to facilitate their amplification. Digestion using a methylation-insensitive isoschizomer of HpaII, MspI is used in a parallel digestion reaction as a loading control in the experiment. Subsequently, these fragments are selectively amplified by fluorescently labeled primers. PCR products from different individuals are compared, and once an interesting polymorphic locus is recognized, the desired DNA fragment can be isolated from a denaturing polyacrylamide gel, sequenced and identified based on DNA sequence similarity to other sequences available in the database. We will use analysis of met1, ddm1, and atmbd9 mutants and wild-type plants treated with a cytidine analogue, 5-azaC, or zebularine to demonstrate how to assess the genetic modulation of DNA methylation in Arabidopsis. It should be noted that despite the fact that MSAP is a reliable technique used to fish for polymorphic methylated loci, its power is limited to the restriction recognition sites of the enzymes used in the genomic

  18. Flow cytometric DNA analysis of ducks accumulating 137Cs on a reactor reservoir

    International Nuclear Information System (INIS)

    George, L.S.; Dallas, C.E.; Brisbin, I.L. Jr.; Evans, D.L.

    1991-01-01

    The objective of this study was to detect red blood cell (rbc) DNA abnormalities in male, game-farm mallard ducks as they ranged freely and accumulated 137Cs (radiocesium) from an abandoned nuclear reactor cooling reservoir. Prior to release, the ducks were tamed to enable recapture at will. Flow cytometric measurements conducted at intervals during the first year of exposure yielded cell cycle percentages of DNA (G0/G1, S, G2 + M phases) of rbc, as well as coefficients of variation (CV) in the G0/G1 phase. DNA histograms of exposed ducks were compared with two sets of controls which were maintained 30 and 150 miles from the study site. 137Cs live wholebody burdens were also measured in these animals in a parallel kinetics study, and an approximate steady-state equilibrium was attained after about 8 months. DNA histograms from 2 of the 14 contaminated ducks revealed DNA aneuploid-like patterns after 9 months exposure. These two ducks were removed from the experiment at this time, and when sampled again 1 month later, one continued to exhibit DNA aneuploidy. None of the control DNA histograms demonstrated DNA aneuploid-like patterns. There were no significant differences in cell cycle percentages at any time point between control and exposed animals. A significant increase in CV was observed at 9 months exposure, but after removal of the two ducks with DNA aneuploidy, no significant difference was detected in the group monitored after 12 months exposure. An increased variation in the DNA and DNA aneuploidy could, therefore, be detected in duck rbc using flow cytometric analysis, with the onset of these effects being related to the attainment of maximal levels of 137Cs body burdens in the exposed animals

  19. Comparative analysis on genome-wide DNA methylation in longissimus dorsi muscle between Small Tailed Han and Dorper×Small Tailed Han crossbred sheep

    Directory of Open Access Journals (Sweden)

    Yang Cao

    2017-11-01

    Full Text Available Objective The objective of this study was to compare the DNA methylation profile in the longissimus dorsi muscle between Small Tailed Han and Dorper×Small Tailed Han crossbred sheep which were known to exhibit significant difference in meat-production. Methods Six samples (three in each group were subjected to the methylated DNA immunoprecipitation sequencing (MeDIP-seq and subsequent bioinformatics analyses to detect differentially methylated regions (DMRs between the two groups. Results 23.08 Gb clean data from six samples were generated and 808 DMRs were identified in gene body or their neighboring up/downstream regions. Compared with Small Tailed Han sheep, we observed a tendency toward a global loss of DNA methylation in these DMRs in the crossbred group. Gene ontology enrichment analysis found several gene sets which were hypo-methylated in gene-body region, including nucleoside binding, motor activity, phospholipid binding and cell junction. Numerous genes were found to be differentially methylated between the two groups with several genes significantly differentially methylated, including transforming growth factor beta 3 (TGFB3, acyl-CoA synthetase long chain family member 1 (ACSL1, ryanodine receptor 1 (RYR1, acyl-CoA oxidase 2 (ACOX2, peroxisome proliferator activated receptor-gamma2 (PPARG2, netrin 1 (NTN1, ras and rab interactor 2 (RIN2, microtubule associated protein RP/EB family member 1 (MAPRE1, ADAM metallopeptidase with thrombospondin type 1 motif 2 (ADAMTS2, myomesin 1 (MYOM1, zinc finger, DHHC type containing 13 (ZDHHC13, and SH3 and PX domains 2B (SH3PXD2B. The real-time quantitative polymerase chain reaction validation showed that the 12 genes are differentially expressed between the two groups. Conclusion In the current study, a tendency to a global loss of DNA methylation in these DMRs in the crossbred group was found. Twelve genes, TGFB3, ACSL1, RYR1, ACOX2, PPARG2, NTN1, RIN2, MAPRE1, ADAMTS2, MYOM1, ZDHHC13, and SH3

  20. DNA and RNA analysis of blood and muscle from bodies with variable postmortem intervals

    DEFF Research Database (Denmark)

    Hansen, Jakob; Lesnikova, Iana; Funder, Anette Mariane Daa

    2014-01-01

    The breakdown of DNA and RNA in decomposing human tissue represents a major obstacle for postmortem forensic molecular analysis. This study investigated the feasibility of performing PCR-based molecular analysis of blood and muscle tissue from 45 autopsy cases with defined postmortem intervals...... for postmortem forensic molecular analysis as well as for retrospective research projects based on archived FFPE specimens....

  1. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed; Mansour, Essam; Kalnis, Panos

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern

  2. Deciphering functional glycosaminoglycan motifs in development.

    Science.gov (United States)

    Townley, Robert A; Bülow, Hannes E

    2018-03-23

    Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  3. Bayesian centroid estimation for motif discovery.

    Science.gov (United States)

    Carvalho, Luis

    2013-01-01

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  4. Bayesian centroid estimation for motif discovery.

    Directory of Open Access Journals (Sweden)

    Luis Carvalho

    Full Text Available Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  5. AQME: A forensic mitochondrial DNA analysis tool for next-generation sequencing data.

    Science.gov (United States)

    Sturk-Andreaggi, Kimberly; Peck, Michelle A; Boysen, Cecilie; Dekker, Patrick; McMahon, Timothy P; Marshall, Charla K

    2017-11-01

    The feasibility of generating mitochondrial DNA (mtDNA) data has expanded considerably with the advent of next-generation sequencing (NGS), specifically in the generation of entire mtDNA genome (mitogenome) sequences. However, the analysis of these data has emerged as the greatest challenge to implementation in forensics. To address this need, a custom toolkit for use in the CLC Genomics Workbench (QIAGEN, Hilden, Germany) was developed through a collaborative effort between the Armed Forces Medical Examiner System - Armed Forces DNA Identification Laboratory (AFMES-AFDIL) and QIAGEN Bioinformatics. The AFDIL-QIAGEN mtDNA Expert, or AQME, generates an editable mtDNA profile that employs forensic conventions and includes the interpretation range required for mtDNA data reporting. AQME also integrates an mtDNA haplogroup estimate into the analysis workflow, which provides the analyst with phylogenetic nomenclature guidance and a profile quality check without the use of an external tool. Supplemental AQME outputs such as nucleotide-per-position metrics, configurable export files, and an audit trail are produced to assist the analyst during review. AQME is applied to standard CLC outputs and thus can be incorporated into any mtDNA bioinformatics pipeline within CLC regardless of sample type, library preparation or NGS platform. An evaluation of AQME was performed to demonstrate its functionality and reliability for the analysis of mitogenome NGS data. The study analyzed Illumina mitogenome data from 21 samples (including associated controls) of varying quality and sample preparations with the AQME toolkit. A total of 211 tool edits were automatically applied to 130 of the 698 total variants reported in an effort to adhere to forensic nomenclature. Although additional manual edits were required for three samples, supplemental tools such as mtDNA haplogroup estimation assisted in identifying and guiding these necessary modifications to the AQME-generated profile. Along

  6. Ancient DNA analysis identifies marine mollusc shells as new metagenomic archives of the past.

    Science.gov (United States)

    Der Sarkissian, Clio; Pichereau, Vianney; Dupont, Catherine; Ilsøe, Peter C; Perrigault, Mickael; Butler, Paul; Chauvaud, Laurent; Eiríksson, Jón; Scourse, James; Paillard, Christine; Orlando, Ludovic

    2017-09-01

    Marine mollusc shells enclose a wealth of information on coastal organisms and their environment. Their life history traits as well as (palaeo-) environmental conditions, including temperature, food availability, salinity and pollution, can be traced through the analysis of their shell (micro-) structure and biogeochemical composition. Adding to this list, the DNA entrapped in shell carbonate biominerals potentially offers a novel and complementary proxy both for reconstructing palaeoenvironments and tracking mollusc evolutionary trajectories. Here, we assess this potential by applying DNA extraction, high-throughput shotgun DNA sequencing and metagenomic analyses to marine mollusc shells spanning the last ~7,000 years. We report successful DNA extraction from shells, including a variety of ancient specimens, and find that DNA recovery is highly dependent on their biomineral structure, carbonate layer preservation and disease state. We demonstrate positive taxonomic identification of mollusc species using a combination of mitochondrial DNA genomes, barcodes, genome-scale data and metagenomic approaches. We also find shell biominerals to contain a diversity of microbial DNA from the marine environment. Finally, we reconstruct genomic sequences of organisms closely related to the Vibrio tapetis bacteria from Manila clam shells previously diagnosed with Brown Ring Disease. Our results reveal marine mollusc shells as novel genetic archives of the past, which opens new perspectives in ancient DNA research, with the potential to reconstruct the evolutionary history of molluscs, microbial communities and pathogens in the face of environmental changes. Other future applications include conservation of endangered mollusc species and aquaculture management. © 2017 John Wiley & Sons Ltd.

  7. A Cross-Cancer Genetic Association Analysis of the DNA Repair and DNA Damage Signaling Pathways for Lung, Ovary, Prostate, Breast, and Colorectal Cancer.

    Science.gov (United States)

    Scarbrough, Peter M; Weber, Rachel Palmieri; Iversen, Edwin S; Brhane, Yonathan; Amos, Christopher I; Kraft, Peter; Hung, Rayjean J; Sellers, Thomas A; Witte, John S; Pharoah, Paul; Henderson, Brian E; Gruber, Stephen B; Hunter, David J; Garber, Judy E; Joshi, Amit D; McDonnell, Kevin; Easton, Doug F; Eeles, Ros; Kote-Jarai, Zsofia; Muir, Kenneth; Doherty, Jennifer A; Schildkraut, Joellen M

    2016-01-01

    DNA damage is an established mediator of carcinogenesis, although genome-wide association studies (GWAS) have identified few significant loci. This cross-cancer site, pooled analysis was performed to increase the power to detect common variants of DNA repair genes associated with cancer susceptibility. We conducted a cross-cancer analysis of 60,297 single nucleotide polymorphisms, at 229 DNA repair gene regions, using data from the NCI Genetic Associations and Mechanisms in Oncology (GAME-ON) Network. Our analysis included data from 32 GWAS and 48,734 controls and 51,537 cases across five cancer sites (breast, colon, lung, ovary, and prostate). Because of the unavailability of individual data, data were analyzed at the aggregate level. Meta-analysis was performed using the Association analysis for SubSETs (ASSET) software. To test for genetic associations that might escape individual variant testing due to small effect sizes, pathway analysis of eight DNA repair pathways was performed using hierarchical modeling. We identified three susceptibility DNA repair genes, RAD51B (P cancer risk in the base excision repair, nucleotide excision repair, mismatch repair, and homologous recombination pathways. Only three susceptibility loci were identified, which had all been previously reported. In contrast, hierarchical modeling identified several pleiotropic cancer risk associations in key DNA repair pathways. Results suggest that many common variants in DNA repair genes are likely associated with cancer susceptibility through small effect sizes that do not meet stringent significance testing criteria. ©2015 American Association for Cancer Research.

  8. Tri-allelic SNP markers enable analysis of mixed and degraded DNA samples.

    Science.gov (United States)

    Westen, Antoinette A; Matai, Anuska S; Laros, Jeroen F J; Meiland, Hugo C; Jasper, Mandy; de Leeuw, Wiljo J F; de Knijff, Peter; Sijen, Titia

    2009-09-01

    For the analysis of degraded DNA in disaster victim identification (DVI) and criminal investigations, single nucleotide polymorphisms (SNPs) have been recognized as promising markers mainly because they can be analyzed in short sized amplicons. Most SNPs are bi-allelic and are thereby ineffective to detect mixtures, which may lead to incorrect genotyping. We developed an algorithm to find non-binary (i.e. tri-allelic or tetra-allelic) SNPs in the NCBI dbSNP database. We selected 31 potential tri-allelic SNPs with a minor allele frequency of at least 10%. The tri-allelic nature was confirmed for 15 SNPs residing on 14 different chromosomes. Multiplex SNaPshot assays were developed, and the allele frequencies of 16 SNPs were determined among 153 Dutch and 111 Netherlands Antilles reference samples. Using these multiplex SNP assays, the presence of a mixture of two DNA samples in a ratio up to 1:8 could be recognized reliably. Furthermore, we compared the genotyping efficiency of the tri-allelic SNP markers and short tandem repeat (STR) markers by analyzing artificially degraded DNA and DNA from 30 approximately 500-year-old bone and molar samples. In both types of degraded DNA samples, the larger sized STR amplicons failed to amplify whereas the tri-allelic SNP markers still provided valuable information. In conclusion, tri-allelic SNP markers are suited for the analysis of degraded DNA and enable the detection of a second DNA source in a sample.

  9. Genetic analysis of yeast RPA1 reveals its multiple functions in DNA metabolism

    International Nuclear Information System (INIS)

    Umezu, K.; Sugawara, N.; Chen, C.; Haber, J.E.; Kolodner, R.D.

    1998-01-01

    Replication protein A (RPA) is a single-stranded DNA-binding protein identified as an essential factor for SV40 DNA replication in vitro. To understand the in vivo functions of RPA, we mutagenized the Saccharomyces cerevisiae RFA1 gene and identified 19 ultraviolet light (UV) irradiation- and methyl methane sulfonate (MMS)-sensitive mutants and 5 temperature-sensitive mutants. The UV- and MMS-sensitive mutants showed up to 10 4 to 10 5 times increased sensitivity to these agents. Some of the UV- and MMSsensitive mutants were killed by an HO-induced double-strand break atMAT. Physical analysis of recombination in one UV- and MMS-sensitive rfa1 mutant demonstrated that it was defective for mating type switching and single-strand annealing recombination. Two temperature-sensitive mutants were characterized in detail, and at the restrictive temperature were found to have an arrest phenotype and DNA content indicative of incomplete DNA replication. DNA sequence analysis indicated that most of the mutations altered amino acids that were conserved between yeast, human, and Xenopus RPA1. Taken together, we conclude that RPA1 has multiple roles in vivo and functions in DNA replication, repair, and recombination, like the single-stranded DNA-binding proteins of bacteria and phages. (author)

  10. A combined method for DNA analysis and radiocarbon dating from a single sample.

    Science.gov (United States)

    Korlević, Petra; Talamo, Sahra; Meyer, Matthias

    2018-03-07

    Current protocols for ancient DNA and radiocarbon analysis of ancient bones and teeth call for multiple destructive samplings of a given specimen, thereby increasing the extent of undesirable damage to precious archaeological material. Here we present a method that makes it possible to obtain both ancient DNA sequences and radiocarbon dates from the same sample material. This is achieved by releasing DNA from the bone matrix through incubation with either EDTA or phosphate buffer prior to complete demineralization and collagen extraction utilizing the acid-base-acid-gelatinization and ultrafiltration procedure established in most radiocarbon dating laboratories. Using a set of 12 bones of different ages and preservation conditions we demonstrate that on average 89% of the DNA can be released from sample powder with minimal, or 38% without any, detectable collagen loss. We also detect no skews in radiocarbon dates compared to untreated samples. Given the different material demands for radiocarbon dating (500 mg of bone/dentine) and DNA analysis (10-100 mg), combined DNA and collagen extraction not only streamlines the sampling process but also drastically increases the amount of DNA that can be recovered from limited sample material.

  11. Sequence and transcription analysis of the human cytomegalovirus DNA polymerase gene

    International Nuclear Information System (INIS)

    Kouzarides, T.; Bankier, A.T.; Satchwell, S.C.; Weston, K.; Tomlinson, P.; Barrell, B.G.

    1987-01-01

    DNA sequence analysis has revealed that the gene coding for the human cytomegalovirus (HCMV) DNA polymerase is present within the long unique region of the virus genome. Identification is based on extensive amino acid homology between the predicted HCMV open reading frame HFLF2 and the DNA polymerase of herpes simplex virus type 1. The authors present here a 5280 base-pair DNA sequence containing the HCMV pol gene, along with the analysis of transcripts encoded within this region. Since HCMV pol also shows homology to the predicted Epstein-Barr virus pol, they were able to analyze the extent of homology between the DNA polymerases of three distantly related herpes viruses, HCMV, Epstein-Barr virus, and herpes simplex virus. The comparison shows that these DNA polymerases exhibit considerable amino acid homology and highlights a number of highly conserved regions; two such regions show homology to sequences within the adenovirus type 2 DNA polymerase. The HCMV pol gene is flanked by open reading frames with homology to those of other herpes viruses; upstream, there is a reading frame homologous to the glycoprotein B gene of herpes simplex virus type I and Epstein-Barr virus, and downstream there is a reading frame homologous to BFLF2 of Epstein-Barr virus

  12. Structural motifs of importance for the constitutive activity of the orphan 7TM receptor EBI2: analysis of receptor activation in the absence of an agonist

    DEFF Research Database (Denmark)

    Benned-Jensen, Tau; Rosenkilde, Mette M

    2008-01-01

    were identified by a systematic mutational analysis of 29 residues in EBI2. The cAMP response element-binding protein transcription factor was used as a measure of receptor activity and was correlated to the receptor surface expression. PheVI:13 (Phe257), and the neighboring CysVI:12 (Cys256......, but not to Lys, decreased the constitutive activity more than 7-fold compared with wt EBI2. IleIII:03 (Ile106) is located only 4 A from ArgII:20, and a favorable electrostatic interaction with ArgII:20 was created by introduction of Glu in III:03, given that the activity increased to 4.4-fold of that wt EBI2...

  13. A Conserved EAR Motif Is Required for Avirulence and Stability of the Ralstonia solanacearum Effector PopP2 In Planta

    Directory of Open Access Journals (Sweden)

    Cécile Segonzac

    2017-08-01

    Full Text Available Ralstonia solanacearum is the causal agent of the devastating bacterial wilt disease in many high value Solanaceae crops. R. solanacearum secretes around 70 effectors into host cells in order to promote infection. Plants have, however, evolved specialized immune receptors that recognize corresponding effectors and confer qualitative disease resistance. In the model species Arabidopsis thaliana, the paired immune receptors RRS1 (resistance to Ralstonia solanacearum 1 and RPS4 (resistance to Pseudomonas syringae 4 cooperatively recognize the R. solanacearum effector PopP2 in the nuclei of infected cells. PopP2 is an acetyltransferase that binds to and acetylates the RRS1 WRKY DNA-binding domain resulting in reduced RRS1-DNA association thereby activating plant immunity. Here, we surveyed the naturally occurring variation in PopP2 sequence among the R. solanacearum strains isolated from diseased tomato and pepper fields across the Republic of Korea. Our analysis revealed high conservation of popP2 sequence with only three polymorphic alleles present amongst 17 strains. Only one variation (a premature stop codon caused the loss of RPS4/RRS1-dependent recognition in Arabidopsis. We also found that PopP2 harbors a putative eukaryotic transcriptional repressor motif (ethylene-responsive element binding factor-associated amphiphilic repression or EAR, which is known to be involved in the recruitment of transcriptional co-repressors. Remarkably, mutation of the EAR motif disabled PopP2 avirulence function as measured by the development of hypersensitive response, electrolyte leakage, defense marker gene expression and bacterial growth in Arabidopsis. This lack of recognition was partially but significantly reverted by the C-terminal addition of a synthetic EAR motif. We show that the EAR motif-dependent gain of avirulence correlated with the stability of the PopP2 protein. Furthermore, we demonstrated the requirement of the PopP2 EAR motif for PTI

  14. Computational analysis of a novel mutation in ETFDH gene highlights its long-range effects on the FAD-binding motif

    Directory of Open Access Journals (Sweden)

    Chang Jan-Gowth

    2011-10-01

    Full Text Available Abstract Background Multiple acyl-coenzyme A dehydrogenase deficiency (MADD is an autosomal recessive disease caused by the defects in the mitochondrial electron transfer system and the metabolism of fatty acids. Recently, mutations in electron transfer flavoprotein dehydrogenase (ETFDH gene, encoding electron transfer flavoprotein:ubiquinone oxidoreductase (ETF:QO have been reported to be the major causes of riboflavin-responsive MADD. To date, no studies have been performed to explore the functional impact of these mutations or their mechanism of disrupting enzyme activity. Results High resolution melting (HRM analysis and sequencing of the entire ETFDH gene revealed a novel mutation (p.Phe128Ser and the hotspot mutation (p.Ala84Thr from a patient with MADD. According to the predicted 3D structure of ETF:QO, the two mutations are located within the flavin adenine dinucleotide (FAD binding domain; however, the two residues do not have direct interactions with the FAD ligand. Using molecular dynamics (MD simulations and normal mode analysis (NMA, we found that the p.Ala84Thr and p.Phe128Ser mutations are most likely to alter the protein structure near the FAD binding site as well as disrupt the stability of the FAD binding required for the activation of ETF:QO. Intriguingly, NMA revealed that several reported disease-causing mutations in the ETF:QO protein show highly correlated motions with the FAD-binding site. Conclusions Based on the present findings, we conclude that the changes made to the amino acids in ETF:QO are likely to influence the FAD-binding stability.

  15. In Vitro Whole Genome DNA Binding Analysis of the Bacterial Replication Initiator and Transcription Factor DnaA.

    Directory of Open Access Journals (Sweden)

    Janet L Smith

    2015-05-01

    Full Text Available DnaA, the replication initiation protein in bacteria, is an AAA+ ATPase that binds and hydrolyzes ATP and exists in a heterogeneous population of ATP-DnaA and ADP-DnaA. DnaA binds cooperatively to the origin of replication and several other chromosomal regions, and functions as a transcription factor at some of these regions. We determined the binding properties of Bacillus subtilis DnaA to genomic DNA in vitro at single nucleotide resolution using in vitro DNA affinity purification and deep sequencing (IDAP-Seq. We used these data to identify 269 binding regions, refine the consensus sequence of the DnaA binding site, and compare the relative affinity of binding regions for ATP-DnaA and ADP-DnaA. Most sites had a slightly higher affinity for ATP-DnaA than ADP-DnaA, but a few had a strong preference for binding ATP-DnaA. Of the 269 sites, only the eight strongest binding ones have been observed to bind DnaA in vivo, suggesting that other cellular factors or the amount of available DnaA in vivo restricts DnaA binding to these additional sites. Conversely, we found several chromosomal regions that were bound by DnaA in vivo but not in vitro, and that the nucleoid-associated protein Rok was required for binding in vivo. Our in vitro characterization of the inherent ability of DnaA to bind the genome at single nucleotide resolution provides a backdrop for interpreting data on in vivo binding and regulation of DnaA, and is an approach that should be adaptable to many other DNA binding proteins.

  16. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    Full Text Available The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36, Mystique, Enigma (LMP-1, Enigma homologue (ENH, ZASP (Cypher, Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2. As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM. This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  17. Structural Analysis of DNA Interactions with Magnesium Ion Studied by Raman Spectroscopy

    OpenAIRE

    S. Ponkumar; P. Duraisamy; N. Iyandurai

    2011-01-01

    Problem statement: In the present study, FT Raman spectroscopy had been used to extend our knowledge about Magnesium ion - DNA interactions at various volume ratios (1:50, 1:20, 1:10 and 1:5). Approach: The analysis of FT Raman data supported the existence of structural specificities in the interaction and also the stability of DNA secondary structure. Results: Results from the Raman spectra clearly indicate that the interaction of Magnesium ion with DNA is mainly through the phosphate groups...

  18. The Potential of Cosmetic Applicators as a Source of DNA for Forensic Analysis.

    Science.gov (United States)

    Adamowicz, Michael S; Labonte, Renáe D; Schienman, John E

    2015-07-01

    Personal products, such as toothbrushes, have been used as both known reference and evidentiary samples for forensic DNA analysis. This study examined the viability of a broad selection of cosmetic applicators for use as targets for human DNA extraction and short tandem repeat (STR) analysis using standard polymerase chain reaction (PCR) conditions. Applicator types included eyeliner smudgers, pencils and crayons, eye shadow sponges, mascara wands, concealer wands, face makeup sponges, pads and brushes, lipsticks and balms, and lip gloss wands. The quantity and quality of DNA extracted from each type of applicator were examined by assessing the number of loci successfully amplified and the peak balance of the heterozygous alleles in each full STR profile. While degraded DNA, stochastic amplification, and PCR inhibition were observed for some items, full STR profiles were developed for 14 of 76 applicators. The face makeup sponge applicators yielded the highest proportional number of full STR profiles (4/7). © 2015 American Academy of Forensic Sciences.

  19. Functional and structural analysis of the DNA sequence conferring glucocorticoid inducibility to the mouse mammary tumor virus gene

    International Nuclear Information System (INIS)

    Skroch, P.

    1987-05-01

    In the first part of my thesis I show that the DNA element conferring glucocorticoid inducibility to the Mouse Mammary Tumor Virus (HRE) has enhancer properties. It activates a heterologous promoter - that of the β-globin gene, independently of distance, position and orientation. These properties however have to be regarded in relation to the remaining regulatory elements of the activated gene as the recombinants between HRE and the TK gene have demonstrated. In the second part of my thesis I investigated the biological significance of certain sequence motifs of the HRE, which are remarkable by their interaction with transacting factors or sequence homologies with other regulatory DNA elements. I could confirm the generally postulated modular structure of enhancers for the HRE and bring the relevance of the single subdomains for the function of the element into relationship. (orig.) [de

  20. Optimized mtDNA Control Region Primer Extension Capture Analysis for Forensically Relevant Samples and Highly Compromised mtDNA of Different Age and Origin

    Directory of Open Access Journals (Sweden)

    Mayra Eduardoff

    2017-09-01

    Full Text Available The analysis of mitochondrial DNA (mtDNA has proven useful in forensic genetics and ancient DNA (aDNA studies, where specimens are often highly compromised and DNA quality and quantity are low. In forensic genetics, the mtDNA control region (CR is commonly sequenced using established Sanger-type Sequencing (STS protocols involving fragment sizes down to approximately 150 base pairs (bp. Recent developments include Massively Parallel Sequencing (MPS of (multiplex PCR-generated libraries using the same amplicon sizes. Molecular genetic studies on archaeological remains that harbor more degraded aDNA have pioneered alternative approaches to target mtDNA, such as capture hybridization and primer extension capture (PEC methods followed by MPS. These assays target smaller mtDNA fragment sizes (down to 50 bp or less, and have proven to be substantially more successful in obtaining useful mtDNA sequences from these samples compared to electrophoretic methods. Here, we present the modification and optimization of a PEC method, earlier developed for sequencing the Neanderthal mitochondrial genome, with forensic applications in mind. Our approach was designed for a more sensitive enrichment of the mtDNA CR in a single tube assay and short laboratory turnaround times, thus complying with forensic practices. We characterized the method using sheared, high quantity mtDNA (six samples, and tested challenging forensic samples (n = 2 as well as compromised solid tissue samples (n = 15 up to 8 kyrs of age. The PEC MPS method produced reliable and plausible mtDNA haplotypes that were useful in the forensic context. It yielded plausible data in samples that did not provide results with STS and other MPS techniques. We addressed the issue of contamination by including four generations of negative controls, and discuss the results in the forensic context. We finally offer perspectives for future research to enable the validation and accreditation of the PEC MPS

  1. A Single-Molecule Barcoding System using Nanoslits for DNA Analysis

    Science.gov (United States)

    Jo, Kyubong; Schramm, Timothy M.; Schwartz, David C.

    Single DNA molecule approaches are playing an increasingly central role in the analytical genomic sciences because single molecule techniques intrinsically provide individualized measurements of selected molecules, free from the constraints of bulk techniques, which blindly average noise and mask the presence of minor analyte components. Accordingly, a principal challenge that must be addressed by all single molecule approaches aimed at genome analysis is how to immobilize and manipulate DNA molecules for measurements that foster construction of large, biologically relevant data sets. For meeting this challenge, this chapter discusses an integrated approach for microfabricated and nanofabricated devices for the manipulation of elongated DNA molecules within nanoscale geometries. Ideally, large DNA coils stretch via nanoconfinement when channel dimensions are within tens of nanometers. Importantly, stretched, often immobilized, DNA molecules spanning hundreds of kilobase pairs are required by all analytical platforms working with large genomic substrates because imaging techniques acquire sequence information from molecules that normally exist in free solution as unrevealing random coils resembling floppy balls of yarn. However, nanoscale devices fabricated with sufficiently small dimensions fostering molecular stretching make these devices impractical because of the requirement of exotic fabrication technologies, costly materials, and poor operational efficiencies. In this chapter, such problems are addressed by discussion of a new approach to DNA presentation and analysis that establishes scaleable nanoconfinement conditions through reduction of ionic strength; stiffening DNA molecules thus enabling their arraying for analysis using easily fabricated devices that can also be mass produced. This new approach to DNA nanoconfinement is complemented by the development of a novel labeling scheme for reliable marking of individual molecules with fluorochrome labels

  2. Multi-color fluorescent DNA analysis in an integrated optofluidic lab on a chip

    OpenAIRE

    Dongre, C.

    2010-01-01

    Abstract: Sorting and sizing of DNA molecules within the human genome project has enabled the genetic mapping of various illnesses. Furthermore by employing tiny lab-on-a-chip device, integrated DNA sequencing and genetic diagnostics have become feasible. We present the combination of capillary electrophoresis with laser-induced fluorescence for optofluidic integration toward an on-chip bio-analysis tool. Integrated optical fluorescence excitation allows for a high spatial resolution (12 μm) ...

  3. Cloud-based adaptive exon prediction for DNA analysis.

    Science.gov (United States)

    Putluri, Srinivasareddy; Zia Ur Rahman, Md; Fathima, Shaik Yasmeen

    2018-02-01

    Cloud computing offers significant research and economic benefits to healthcare organisations. Cloud services provide a safe place for storing and managing large amounts of such sensitive data. Under conventional flow of gene information, gene sequence laboratories send out raw and inferred information via Internet to several sequence libraries. DNA sequencing storage costs will be minimised by use of cloud service. In this study, the authors put forward a novel genomic informatics system using Amazon Cloud Services, where genomic sequence information is stored and accessed for processing. True identification of exon regions in a DNA sequence is a key task in bioinformatics, which helps in disease identification and design drugs. Three base periodicity property of exons forms the basis of all exon identification techniques. Adaptive signal processing techniques found to be promising in comparison with several other methods. Several adaptive exon predictors (AEPs) are developed using variable normalised least mean square and its maximum normalised variants to reduce computational complexity. Finally, performance evaluation of various AEPs is done based on measures such as sensitivity, specificity and precision using various standard genomic datasets taken from National Center for Biotechnology Information genomic sequence database.

  4. Molecular analysis and genomic organization of major DNA satellites in banana (Musa spp.).

    Science.gov (United States)

    Čížková, Jana; Hřibová, Eva; Humplíková, Lenka; Christelová, Pavla; Suchánková, Pavla; Doležel, Jaroslav

    2013-01-01

    Satellite DNA sequences consist of tandemly arranged repetitive units up to thousands nucleotides long in head-to-tail orientation. The evolutionary processes by which satellites arise and evolve include unequal crossing over, gene conversion, transposition and extra chromosomal circular DNA formation. Large blocks of satellite DNA are often observed in heterochromatic regions of chromosomes and are a typical component of centromeric and telomeric regions. Satellite-rich loci may show specific banding patterns and facilitate chromosome identification and analysis of structural chromosome changes. Unlike many other genomes, nuclear genomes of banana (Musa spp.) are poor in satellite DNA and the information on this class of DNA remains limited. The banana cultivars are seed sterile clones originating mostly from natural intra-specific crosses within M. acuminata (A genome) and inter-specific crosses between M. acuminata and M. balbisiana (B genome). Previous studies revealed the closely related nature of the A and B genomes, including similarities in repetitive DNA. In this study we focused on two main banana DNA satellites, which were previously identified in silico. Their genomic organization and molecular diversity was analyzed in a set of nineteen Musa accessions, including representatives of A, B and S (M. schizocarpa) genomes and their inter-specific hybrids. The two DNA satellites showed a high level of sequence conservation within, and a high homology between Musa species. FISH with probes for the satellite DNA sequences, rRNA genes and a single-copy BAC clone 2G17 resulted in characteristic chromosome banding patterns in M. acuminata and M. balbisiana which may aid in determining genomic constitution in interspecific hybrids. In addition to improving the knowledge on Musa satellite DNA, our study increases the number of cytogenetic markers and the number of individual chromosomes, which can be identified in Musa.

  5. Genome-wide DNA methylation analysis of transient neonatal diabetes type 1 patients with mutations in ZFP57.

    Science.gov (United States)

    Bak, Mads; Boonen, Susanne E; Dahl, Christina; Hahnemann, Johanne M D; Mackay, Deborah J D G; Tümer, Zeynep; Grønskov, Karen; Temple, I Karen; Guldberg, Per; Tommerup, Niels

    2016-04-14

    Transient neonatal diabetes mellitus 1 (TNDM1) is a rare imprinting disorder characterized by intrautering growth retardation and diabetes mellitus usually presenting within the first six weeks of life and resolves by the age of 18 months. However, patients have an increased risk of developing diabetes mellitus type 2 later in life. Transient neonatal diabetes mellitus 1 is caused by overexpression of the maternally imprinted genes PLAGL1 and HYMAI on chromosome 6q24. One of the mechanisms leading to overexpression of the locus is hypomethylation of the maternal allele of PLAGL1 and HYMAI. A subset of patients with maternal hypomethylation at PLAGL1 have hypomethylation at additional imprinted loci throughout the genome, including GRB10, ZIM2 (PEG3), MEST (PEG1), KCNQ1OT1 and NESPAS (GNAS-AS1). About half of the TNDM1 patients carry mutations in ZFP57, a transcription factor involved in establishment and maintenance of methylation of imprinted loci. Our objective was to investigate whether additional regions are aberrantly methylated in ZFP57 mutation carriers. Genome-wide DNA methylation analysis was performed on four individuals with homozygous or compound heterozygous ZFP57 mutations, three relatives with heterozygous ZFP57 mutations and five controls. Methylation status of selected regions showing aberrant methylation in the patients was verified using bisulfite-sequencing. We found large variability among the patients concerning the number and identity of the differentially methylated regions, but more than 60 regions were aberrantly methylated in two or more patients and a novel region within PPP1R13L was found to be hypomethylated in all the patients. The hypomethylated regions in common between the patients are enriched for the ZFP57 DNA binding motif. We have expanded the epimutational spectrum of TNDM1 associated with ZFP57 mutations and found one novel region within PPP1R13L which is hypomethylated in all TNDM1 patients included in this study. Functional

  6. Droplet-based microscale colorimetric biosensor for multiplexed DNA analysis via a graphene nanoprobe

    International Nuclear Information System (INIS)

    Xiang Xia; Luo Ming; Shi Liyang; Ji Xinghu; He Zhike

    2012-01-01

    Graphical abstract: With a microvalve manipulate technique combined with droplet platform, a microscale fluorescence-based colorimetric sensor for multiplexed DNA analysis is developed via a graphene nanoprobe. Highlights: ► A quantitative detection for multiplexed DNA is first realized on droplet platform. ► The DNA detection is relied on a simple fluorescence-based colorimetric method. ► GO is served as a quencher for two different DNA fluorescent probes. ► This present work provides a rapid, sensitive, visual and convenient detection tool for droplet biosensor. - Abstract: The development of simple and inexpensive DNA detection strategy is very significant for droplet-based microfluidic system. Here, a droplet-based biosensor for multiplexed DNA analysis is developed with a common imaging device by using fluorescence-based colorimetric method and a graphene nanoprobe. With the aid of droplet manipulation technique, droplet size adjustment, droplet fusion and droplet trap are realized accurately and precisely. Due to the high quenching efficiency of graphene oxide (GO), in the absence of target DNAs, the droplet containing two single-stranded DNA probes and GO shows dark color, in which the DNA probes are labeled carboxy fluorescein (FAM) and 6-carboxy-X-rhodamine (ROX), respectively. The droplet changes from dark to bright color when the DNA probes form double helix with the specific target DNAs leading to the dyes far away from GO. This colorimetric droplet biosensor exhibits a quantitative capability for simultaneous detection of two different target DNAs with the detection limits of 9.46 and 9.67 × 10 −8 M, respectively. It is also demonstrated that this biosensor platform can become a promising detection tool in high throughput applications with low consumption of reagents. Moreover, the incorporation of graphene nanoprobe and droplet technique can drive the biosensor field one more step to some extent.

  7. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    Administrator

    2011-10-19

    Oct 19, 2011 ... ... gene expression values of controls and i x i y. 1 i y = 1 i y = −. 1. 2. { , ,..., , } i i i im i g. x x. x y. = 1. 2. 1. 2. , ,..., ,. , ,..., k i i i im. x x x. x x x x x. = =.

  8. Analisis heteroplasmy DNA mitokondria pulpa gigi pada identifikasi personal forensik (Heteroplasmy analysis of dental pulp mitochondrial DNA in forensic personal identification

    Directory of Open Access Journals (Sweden)

    Ardyni Febri K

    2013-09-01

    Full Text Available Background: Mitochondrial DNA (mtDNA sequence analysis of the hypervariable control region has been shown to be an effective tool for personal identification. The high copy and maternal mode of inheritance make mtDNA analysis particularly useful when old samples or degradation of biological samples prohibits the detection of nuclear DNA analysis. Dental pulp is covered with hard tissue such as dentin and enamel. It is highly capable of protecting the DNA and thus is extremely useful. One of the diasadvantages of mitochondrial DNA is heteroplasmy. Heteroplasmy is the presence of a mixture of more than one type of an organellar genome within a cell or individual. It can lead to ambiguity in forensic personal identification. Due to that, the evidence of heteroplasmy in dental pulp is needed. Purpose: The study was aimed to determine the heteroplasmy occurance of mitocondrial DNA in dental pulp. Methods: Blood and teeth samples were taken from 6 persons, each samples was extracted with DNAzol. DNA samples were amplified with PCR and sequencing to analyze the nucleotide sequences polymorphism of the hypervariable region 1 in mtDNA and compared with revised Cambridge Reference Sequence (rCRS. results: The dental pulp and blood nucleotide sequence of hypervariable region 1 mitochondrial DNA showed polymorphism when compared with rCRS and heteroplasmy when compared between dental pulp with blood. Conclusion: The study showed that heteroplasmy was found in mithocondrial DNA from dental pulp.latar belakang: Analisis sekuens DNA mitokondria (mtDNA regio kontrol hypervariable telah terbukti menjadi alat efektif untuk identifikasi personal. Kopi DNA yang banyak dan pewarisan maternal membuat analisis mtDNA sangat berguna ketika sampel lama atau sampel biologis yang terdegradasi menghambat deteksi analisis DNA inti. Pulpa gigi terlindung jaringan keras seperti dentin dan enamel. Hal ini membuat pulpa mampu melindungi DNA dan dengan demikian sangat berguna

  9. Native characterization of nucleic acid motif thermodynamics via non-covalent catalysis

    Science.gov (United States)

    Wang, Chunyan; Bae, Jin H.; Zhang, David Yu

    2016-01-01

    DNA hybridization thermodynamics is critical for accurate design of oligonucleotides for biotechnology and nanotechnology applications, but parameters currently in use are inaccurately extrapolated based on limited quantitative understanding of thermal behaviours. Here, we present a method to measure the ΔG° of DNA motifs at temperatures and buffer conditions of interest, with significantly better accuracy (6- to 14-fold lower s.e.) than prior methods. The equilibrium constant of a reaction with thermodynamics closely approximating that of a desired motif is numerically calculated from directly observed reactant and product equilibrium concentrations; a DNA catalyst is designed to accelerate equilibration. We measured the ΔG° of terminal fluorophores, single-nucleotide dangles and multinucleotide dangles, in temperatures ranging from 10 to 45 °C. PMID:26782977

  10. Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

    Science.gov (United States)

    Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

    2012-01-01

    To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.

  11. Nanopore Analysis of the 5-Guanidinohydantoin to Iminoallantoin Isomerization in Duplex DNA.

    Science.gov (United States)

    Zeng, Tao; Fleming, Aaron M; Ding, Yun; Ren, Hang; White, Henry S; Burrows, Cynthia J

    2018-04-06

    In DNA, guanine oxidation yields diastereomers of 5-guanidinohydantoin (Gh) as one of the major products. In nucleosides and single-stranded DNA, Gh is in a pH-dependent equilibrium with its constitutional isomer iminoallantoin (Ia). Herein, the isomerization reaction between Gh and Ia was monitored in duplex DNA using a protein nanopore by measuring the ionic current when duplex DNA interacts with the pore under an electrophoretic force. Monitoring current levels in this single-molecule method proved to be superior for analysis of population distributions in an equilibrating mixture of four isomers in duplex DNA as a function of pH. The results identified Gh as a major isomer observed when base paired with A, C, or G at pH 6.4-8.4, and Ia was a minor isomer of the reaction mixture that was only observed when the pH was >7.4 in the duplex DNA context. The present results suggest that Gh will be the dominant isomer in duplex DNA under physiological conditions regardless of the base-pairing partner in the duplex.

  12. Fast DNA analysis by laser mass spectrometry for human genome analysis

    International Nuclear Information System (INIS)

    Tang, K.; Taranenko, N. I.; Allman, S. L.; Chang, L. Y.; Chen, C. H.

    1995-01-01

    Fast DNA sequencing by laser mass spectrometry is possible if the following 3 criteria are met: (1) Size of DNA fragment should be greater than 300 nucleotides. (2) Enough sensitivity to detect DNA produce from polymerases chain reactins (PCR). (3) Higher resolution of mass spectr. So far, the firt 2 criteria are met: If the resolution can be significantly improve, fast DNA sequencing by laser mass spectrometry weil be a reality in the near feature

  13. Use of a D17Z1 oligonucleotide probe for human DNA quantitation prior to PCR analysis of polymorphic DNA markers

    Energy Technology Data Exchange (ETDEWEB)

    Walsh, S.; Alavaren, M.; Varlaro, J. [Roche Molecular Systems, Alameda, CA (United States)] [and others

    1994-09-01

    The alpha-satellite DNA locus D17Z1 contains primate-specific sequences which are repeated several hundred times per chromosome 17. A probe that was designed to hybridize to a subset of the D17Z1 sequence can be used for very sensitive and specific quantitation of human DNA. Sample human genomic DNA is immobilized on nylon membrane using a slot blot apparatus, and then hybridized with a biotinylated D17Z1 oligonucleotide probe. The subsequent binding of streptavidin-horseradish peroxidase to the bound probe allows for either calorimetric (TMB) or chemiluminescent (ECL) detection. Signals obtained for sample DNAs are then compared to the signals obtained for a series of human DNA standards. For either detection method, forty samples can be quantitated in less than two hours, with a sensitivity of 150 pg. As little as 20 pg of DNA can be quantitated when using chemiluminescent detection with longer film exposures. PCR analysis of several VNTR and STR markers has indicated that optimal typing results are generally obtained within a relatively narrow range of input DNA quantities. Too much input DNA can lead to PCR artifacts such as preferential amplification of smaller alleles, non-specific amplification products, and exaggeration of the DNA synthesis slippage products that are seen with STR markers. Careful quantitation of human genomic DNA prior to PCR can avoid or minimize these problems and ultimately give cleaner, more unambiguous PCR results.

  14. Cytometric analysis of mammalian sperm for induced morphologic and DNA content errors

    International Nuclear Information System (INIS)

    Pinkel, D.

    1983-01-01

    Some flow-cytometric and image analysis procedures under development for quantitative analysis of sperm morphology are reviewed. The results of flow-cytometric DNA-content measurements on sperm from radiation exposed mice are also summarized, the results related to the available cytological information, and their potential dosimetric sensitivity discussed

  15. Mitochondrial and Y chromosome haplotype motifs as diagnostic markers of Jewish ancestry: a reconsideration.

    Directory of Open Access Journals (Sweden)

    Sergio eTofanelli

    2014-11-01

    Full Text Available Several authors have proposed haplotype motifs based on site variants at the mitochondrial genome (mtDNA and the non-recombining portion of the Y chromosome (NRY to trace the genealogies of Jewish people. Here, we analyzed their main approaches and test the feasibility of adopting motifs as ancestry markers through construction of a large database of mtDNA and NRY haplotypes from public genetic genealogical repositories. We verified the reliability of Jewish ancestry prediction based on the Cohen and Levite Modal Haplotypes in their classical 6 STR marker format or in the extended 12 STR format, as well as four founder mtDNA lineages (HVS-I segments accounting for about 40% of the current population of Ashkenazi Jews. For this purpose we compared haplotype composition in individuals of self-reported Jewish ancestry with the rest of European, African or Middle Eastern samples, to test for non-random association of ethno-geographic groups and haplotypes. Overall, NRY and mtDNA based motifs, previously reported to differentiate between groups, were found to be more represented in Jewish compared to non-Jewish groups. However, this seems to stem from common ancestors of Jewish lineages being rather recent respect to ancestors of non-Jewish lineages with the same haplotype signatures. Moreover, the polyphyly of haplotypes which contain the proposed motifs and the misuse of constant mutation rates heavily affected previous attempts to correctly dating the origin of common ancestries. Accordingly, our results stress the limitations of using the above haplotype motifs as reliable Jewish ancestry predictors and show its inadequacy for forensic or genealogical purposes.

  16. Effect of secondary structure on the thermodynamics and kinetics of PNA hybridization to DNA hairpins

    DEFF Research Database (Denmark)

    Kushon, S A; Jordan, J P; Seifert, J L

    2001-01-01

    The binding of a series of PNA and DNA probes to a group of unusually stable DNA hairpins of the tetraloop motif has been observed using absorbance hypochromicity (ABS), circular dichroism (CD), and a colorimetric assay for PNA/DNA duplex detection. These results indicate that both stable PNA...... structures in both target and probe molecules are shown to depress the melting temperatures and free energies of the probe-target duplexes. Kinetic analysis of hybridization yields reaction rates that are up to 160-fold slower than hybridization between two unstructured strands. The thermodynamic and kinetic...

  17. Dysregulation of C-X-C motif ligand 10 during aging and association with cognitive performance.

    Science.gov (United States)

    Bradburn, Steven; McPhee, Jamie; Bagley, Liam; Carroll, Michael; Slevin, Mark; Al-Shanti, Nasser; Barnouin, Yoann; Hogrel, Jean-Yves; Pääsuke, Mati; Gapeyeva, Helena; Maier, Andrea; Sipilä, Sarianna; Narici, Marco; Robinson, Andrew; Mann, David; Payton, Antony; Pendleton, Neil; Butler-Browne, Gillian; Murgatroyd, Chris

    2018-03-01

    Chronic low-grade inflammation during aging (inflammaging) is associated with cognitive decline and neurodegeneration; however, the mechanisms underlying inflammaging are unclear. We studied a population (n = 361) of healthy young and old adults from the MyoAge cohort. Peripheral levels of C-X-C motif chemokine ligand 10 (CXCL10) was found to be higher in older adults, compared with young, and negatively associated with working memory performance. This coincided with an age-related reduction in blood DNA methylation at specific CpGs within the CXCL10 gene promoter. In vitro analysis supported the role of DNA methylation in regulating CXCL10 transcription. A polymorphism (rs56061981) that altered methylation at one of these CpG sites further associated with working memory performance in 2 independent aging cohorts. Studying prefrontal cortex samples, we found higher CXCL10 protein levels in those with Alzheimer's disease, compared with aged controls. These findings support the association of peripheral inflammation, as demonstrated by CXCL10, in aging and cognitive decline. We reveal age-related epigenetic and genetic factors which contribute to the dysregulation of CXCL10. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  18. Adelie penguin population diet monitoring by analysis of food DNA in scats.

    Directory of Open Access Journals (Sweden)

    Simon N Jarman

    Full Text Available The Adélie penguin is the most important animal currently used for ecosystem monitoring in the Southern Ocean. The diet of this species is generally studied by visual analysis of stomach contents; or ratios of isotopes of carbon and nitrogen incorporated into the penguin from its food. There are significant limitations to the information that can be gained from these methods. We evaluated population diet assessment by analysis of food DNA in scats as an alternative method for ecosystem monitoring with Adélie penguins as an indicator species. Scats were collected at four locations, three phases of the breeding cycle, and in four different years. A novel molecular diet assay and bioinformatics pipeline based on nuclear small subunit ribosomal RNA gene (SSU rDNA sequencing was used to identify prey DNA in 389 scats. Analysis of the twelve population sample sets identified spatial and temporal dietary change in Adélie penguin population diet. Prey diversity was found to be greater than previously thought. Krill, fish, copepods and amphipods were the most important food groups, in general agreement with other Adélie penguin dietary studies based on hard part or stable isotope analysis. However, our DNA analysis estimated that a substantial portion of the diet was gelatinous groups such as jellyfish and comb jellies. A range of other prey not previously identified in the diet of this species were also discovered. The diverse prey identified by this DNA-based scat analysis confirms that the generalist feeding of Adélie penguins makes them a useful indicator species for prey community composition in the coastal zone of the Southern Ocean. Scat collection is a simple and non-invasive field sampling method that allows DNA-based estimation of prey community differences at many temporal and spatial scales and provides significant advantages over alternative diet analysis approaches.

  19. Adélie penguin population diet monitoring by analysis of food DNA in scats.

    Science.gov (United States)

    Jarman, Simon N; McInnes, Julie C; Faux, Cassandra; Polanowski, Andrea M; Marthick, James; Deagle, Bruce E; Southwell, Colin; Emmerson, Louise

    2013-01-01

    The Adélie penguin is the most important animal currently used for ecosystem monitoring in the Southern Ocean. The diet of this species is generally studied by visual analysis of stomach contents; or ratios of isotopes of carbon and nitrogen incorporated into the penguin from its food. There are significant limitations to the information that can be gained from these methods. We evaluated population diet assessment by analysis of food DNA in scats as an alternative method for ecosystem monitoring with Adélie penguins as an indicator species. Scats were collected at four locations, three phases of the breeding cycle, and in four different years. A novel molecular diet assay and bioinformatics pipeline based on nuclear small subunit ribosomal RNA gene (SSU rDNA) sequencing was used to identify prey DNA in 389 scats. Analysis of the twelve population sample sets identified spatial and temporal dietary change in Adélie penguin population diet. Prey diversity was found to be greater than previously thought. Krill, fish, copepods and amphipods were the most important food groups, in general agreement with other Adélie penguin dietary studies based on hard part or stable isotope analysis. However, our DNA analysis estimated that a substantial portion of the diet was gelatinous groups such as jellyfish and comb jellies. A range of other prey not previously identified in the diet of this species were also discovered. The diverse prey identified by this DNA-based scat analysis confirms that the generalist feeding of Adélie penguins makes them a useful indicator species for prey community composition in the coastal zone of the Southern Ocean. Scat collection is a simple and non-invasive field sampling method that allows DNA-based estimation of prey community differences at many temporal and spatial scales and provides significant advantages over alternative diet analysis approaches.

  20. Traumatic stress and accelerated DNA methylation age: A meta-analysis.

    Science.gov (United States)

    Wolf, Erika J; Maniates, Hannah; Nugent, Nicole; Maihofer, Adam X; Armstrong, Don; Ratanatharathorn, Andrew; Ashley-Koch, Allison E; Garrett, Melanie; Kimbrel, Nathan A; Lori, Adriana; Va Mid-Atlantic Mirecc Workgroup; Aiello, Allison E; Baker, Dewleen G; Beckham, Jean C; Boks, Marco P; Galea, Sandro; Geuze, Elbert; Hauser, Michael A; Kessler, Ronald C; Koenen, Karestan C; Miller, Mark W; Ressler, Kerry J; Risbrough, Victoria; Rutten, Bart P F; Stein, Murray B; Ursano, Robert J; Vermetten, Eric; Vinkers, Christiaan H; Uddin, Monica; Smith, Alicia K; Nievergelt, Caroline M; Logue, Mark W

    2018-06-01

    Recent studies examining the association between posttraumatic stress disorder (PTSD) and accelerated aging, as defined by DNA methylation-based estimates of cellular age that exceed chronological age, have yielded mixed results. We conducted a meta-analysis of trauma exposure and PTSD diagnosis and symptom severity in association with accelerated DNA methylation age using data from 9 cohorts contributing to the Psychiatric Genomics Consortium PTSD Epigenetics Workgroup (combined N = 2186). Associations between demographic and cellular variables and accelerated DNA methylation age were also examined, as was the moderating influence of demographic variables. Meta-analysis of regression coefficients from contributing cohorts revealed that childhood trauma exposure (when measured with the Childhood Trauma Questionnaire) and lifetime PTSD severity evidenced significant, albeit small, meta-analytic associations with accelerated DNA methylation age (ps = 0.028 and 0.016, respectively). Sex, CD4T cell proportions, and natural killer cell proportions were also significantly associated with accelerated DNA methylation age (all ps age. There was no evidence of moderation of the trauma or PTSD variables by demographic factors. Results suggest that traumatic stress is associated with advanced epigenetic age and raise the possibility that cells integral to immune system maintenance and responsivity play a role in this. This study highlights the need for additional research into the biological mechanisms linking traumatic stress to accelerated DNA methylation age and the importance of furthering our understanding of the neurobiological and health consequences of PTSD. Published by Elsevier Ltd.

  1. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole

    2010-01-01

    is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...

  2. General method of preparation of uniformly 13C, 15N-labeled DNA fragments for NMR analysis of DNA structures

    International Nuclear Information System (INIS)

    Rene, Brigitte; Masliah, Gregoire; Zargarian, Loussine; Mauffret, Olivier; Fermandjian, Serge

    2006-01-01

    Summary 13 C, 15 N labeling of biomolecules allows easier assignments of NMR resonances and provides a larger number of NMR parameters, which greatly improves the quality of DNA structures. However, there is no general DNA-labeling procedure, like those employed for proteins and RNAs. Here, we describe a general and widely applicable approach designed for preparation of isotopically labeled DNA fragments that can be used for NMR studies. The procedure is based on the PCR amplification of oligonucleotides in the presence of labeled deoxynucleotides triphosphates. It allows great flexibility thanks to insertion of a short DNA sequence (linker) between two repeats of DNA sequence to study. Size and sequence of the linker are designed as to create restriction sites at the junctions with DNA of interest. DNA duplex with desired sequence and size is released upon enzymatic digestion of the PCR product. The suitability of the procedure is validated through the preparation of two biological relevant DNA fragments

  3. Physical manipulation of single-molecule DNA using microbead and its application to analysis of DNA-protein interaction

    International Nuclear Information System (INIS)

    Kurita, Hirofumi; Yasuda, Hachiro; Takashima, Kazunori; Katsura, Shinji; Mizuno, Akira

    2009-01-01

    We carried out an individual DNA manipulation using an optical trapping for a microbead. This manipulation system is based on a fluorescent microscopy equipped with an IR laser. Both ends of linear DNA molecule were labeled with a biotin and a thiol group, respectively. Then the biotinylated end was attached to a microbead, and the other was immobilized on a thiol-linkable glass surface. We controlled the form of an individual DNA molecule by moving the focal point of IR laser, which trapped the microbead. In addition, we applied single-molecule approach to analyze DNA hydrolysis. We also used microchannel for single-molecule observation of DNA hydrolysis. The shortening of DNA in length caused by enzymatic hydrolysis was observed in real-time. The single-molecule DNA manipulation should contribute to elucidate detailed mechanisms of DNA-protein interactions

  4. MiniX-STR multiplex system population study in Japan and application to degraded DNA analysis.

    Science.gov (United States)

    Asamura, H; Sakai, H; Kobayashi, K; Ota, M; Fukushima, H

    2006-05-01

    We sought to evaluate a more effective system for analyzing X-chromosomal short tandem repeats (X-STRs) in highly degraded DNA. To generate smaller amplicon lengths, we designed new polymerase chain reaction (PCR) primers for DXS7423, DXS6789, DXS101, GATA31E08, DXS8378, DXS7133, DXS7424, and GATA165B12 at X-linked short tandem repeat (STR) loci, devising two miniX-multiplex PCR systems. Among 333 Japanese individuals, these X-linked loci were detected in amplification products ranging in length from 76 to 169 bp, and statistical analyses of the eight loci indicated a high usefulness for the Japanese forensic practice. Results of tests on highly degraded DNA indicated the miniX-STR multiplex strategies to be an effective system for analyzing degraded DNA. We conclude that analysis by the current miniX-STR multiplex systems offers high effectiveness for personal identification from degraded DNA samples.

  5. Automatic analysis of flow cytometric DNA histograms from irradiated mouse male germ cells

    International Nuclear Information System (INIS)

    Lampariello, F.; Mauro, F.; Uccelli, R.; Spano, M.

    1989-01-01

    An automatic procedure for recovering the DNA content distribution of mouse irradiated testis cells from flow cytometric histograms is presented. First, a suitable mathematical model is developed, to represent the pattern of DNA content and fluorescence distribution in the sample. Then a parameter estimation procedure, based on the maximum likelihood approach, is constructed by means of an optimization technique. This procedure has been applied to a set of DNA histograms relative to different doses of 0.4-MeV neutrons and to different time intervals after irradiation. In each case, a good agreement between the measured histograms and the corresponding fits has been obtained. The results indicate that the proposed method for the quantitative analysis of germ cell DNA histograms can be usefully applied to the study of the cytotoxic and mutagenic action of agents of toxicological interest such as ionizing radiations.18 references

  6. STR analysis of artificially degraded DNA-results of a collaborative European exercise

    DEFF Research Database (Denmark)

    Schneider, Peter M; Bender, Klaus; Mayr, Wolfgang R

    2004-01-01

    Degradation of human DNA extracted from forensic stains is, in most cases, the result of a natural process due to the exposure of the stain samples to the environment. Experiences with degraded DNA from casework samples show that every sample may exhibit different properties in this respect......, and that it is difficult to systematically assess the performance of routinely used typing systems for the analysis of degraded DNA samples. Using a batch of artificially degraded DNA with an average fragment size of approx. 200 bp a collaborative exercise was carried out among 38 forensic laboratories from 17 European...... countries. The results were assessed according to correct allele detection, peak height and balance as well as the occurrence of artefacts. A number of common problems were identified based on these results such as strong peak imbalance in heterozygous genotypes for the larger short tandem repeat (STR...

  7. Comparison of two commercial DNA extraction kits for the analysis of nasopharyngeal bacterial communities

    Directory of Open Access Journals (Sweden)

    Keith A. Crandall

    2016-04-01

    Full Text Available Characterization of microbial communities via next-generation sequencing (NGS requires an extraction ofmicrobial DNA. Methodological differences in DNA extraction protocols may bias results and complicate inter-study comparisons. Here we compare the effect of two commonly used commercial kits (Norgen and Qiagenfor the extraction of total DNA on estimatingnasopharyngeal microbiome diversity. The nasopharynxis a reservoir for pathogens associated with respiratory illnesses and a key player in understandingairway microbial dynamics. Total DNA from nasal washes corresponding to 30 asthmatic children was extracted using theQiagenQIAamp DNA and NorgenRNA/DNA Purification kits and analyzed via IlluminaMiSeq16S rRNA V4 ampliconsequencing. The Norgen samples included more sequence reads and OTUs per sample than the Qiagen samples, but OTU counts per sample varied proportionallybetween groups (r = 0.732.Microbial profiles varied slightly between sample pairs, but alpha- and beta-diversity indices (PCoAand clustering showed highsimilarity between Norgen and Qiagenmicrobiomes. Moreover, no significant differences in community structure (PERMANOVA and adonis tests and taxa proportions (Kruskal-Wallis test were observed betweenkits. Finally, aProcrustes analysis also showed low dissimilarity (M2 = 0.173; P< 0.001 between the PCoAs of the two DNA extraction kits. Contrary to what has been observed in previous studies comparing DNA extraction methods, our 16S NGS analysis of nasopharyngeal washes did not reveal significant differences in community composition or structure between kits. Our findingssuggest congruence between column-based chromatography kits and supportthe comparison of microbiomeprofilesacross nasopharyngeal metataxonomic studies.

  8. The logic of DNA replication in double-stranded DNA viruses: insights from global analysis of viral genomes.

    Science.gov (United States)

    Kazlauskas, Darius; Krupovic, Mart; Venclovas, Česlovas

    2016-06-02

    Genomic DNA replication is a complex process that involves multiple proteins. Cellular DNA replication systems are broadly classified into only two types, bacterial and archaeo-eukaryotic. In contrast, double-stranded (ds) DNA viruses feature a much broader diversity of DNA replication machineries. Viruses differ greatly in both completeness and composition of their sets of DNA replication proteins. In this study, we explored whether there are common patterns underlying this extreme diversity. We identified and analyzed all major functional groups of DNA replication proteins in all available proteomes of dsDNA viruses. Our results show that some proteins are common to viruses infecting all domains of life and likely represent components of the ancestral core set. These include B-family polymerases, SF3 helicases, archaeo-eukaryotic primases, clamps and clamp loaders of the archaeo-eukaryotic type, RNase H and ATP-dependent DNA ligases. We also discovered a clear correlation between genome size and self-sufficiency of viral DNA replication, the unanticipated dominance of replicative helicases and pervasive functional associations among certain groups of DNA replication proteins. Altogether, our results provide a comprehensive view on the diversity and evolution of replication systems in the DNA virome and uncover fundamental principles underlying the orchestration of viral DNA replication. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Development of an efficient fungal DNA extraction method to be used in random amplified polymorphic DNA-PCR analysis to differentiate cyclopiazonic acid mold producers.

    Science.gov (United States)

    Sánchez, Beatriz; Rodríguez, Mar; Casado, Eva M; Martín, Alberto; Córdoba, Juan J

    2008-12-01

    A variety of previously established mechanical and chemical treatments to achieve fungal cell lysis combined with a semiautomatic system operated by a vacuum pump were tested to obtain DNA extract to be directly used in randomly amplified polymorphic DNA (RAPD)-PCR to differentiate cyclopiazonic acid-producing and -nonproducing mold strains. A DNA extraction method that includes digestion with proteinase K and lyticase prior to using a mortar and pestle grinding and a semiautomatic vacuum system yielded DNA of high quality in all the fungal strains and species tested, at concentrations ranging from 17 to 89 ng/microl in 150 microl of the final DNA extract. Two microliters of DNA extracted with this method was directly used for RAPD-PCR using primer (GACA)4. Reproducible RAPD fingerprints showing high differences between producer and nonproducer strains were observed. These differences in the RAPD patterns did not differentiate all the strains tested in clusters by cyclopiazonic acid production but may be very useful to distinguish cyclopiazonic acid producer strains from nonproducer strains by a simple RAPD analysis. Thus, the DNA extracts obtained could be used directly without previous purification and quantification for RAPD analysis to differentiate cyclopiazonic acid producer from nonproducer mold strains. This combined analysis could be adaptable to other toxigenic fungal species to enable differentiation of toxigenic and non-toxigenic molds, a procedure of great interest in food safety.

  10. Molecular analysis of Toxoplasma gondii Surface Antigen 1 (SAG1) gene cloned from Toxoplasma gondii DNA isolated from Javanese acute toxoplasmosis

    Science.gov (United States)

    Haryati, Sri; Agung Prasetyo, Afiono; Sari, Yulia; Dharmawan, Ruben

    2018-05-01

    Toxoplasma gondii Surface Antigen 1 (SAG1) is often used as a diagnostic tool due to its immunodominant-specific as antigen. However, data of the Toxoplasma gondii SAG1 protein from Indonesian isolate is limited. To study the protein, genomic DNA was isolated from a Javanese acute toxoplasmosis blood samples patient. A complete coding sequence of Toxoplasma gondii SAG1 was cloned and inserted into an Escherichia coli expression plasmid and sequenced. The sequencing results were subjected to bioinformatics analysis. The Toxoplasma gondii SAG1 complete coding sequences were successfully cloned. Physicochemical analysis revealed the 336 aa of SAG1 had 34.7 kDa of weight. The isoelectric point and aliphatic index were 8.4 and 78.4, respectively. The N-terminal methionine half-life in Escherichia coli was more than 10 hours. The antigenicity, secondary structure, and identification of the HLA binding motifs also had been discussed. The results of this study would contribute information about Toxoplasma gondii SAG1 and benefits for further works willing to develop diagnostic and therapeutic strategies against the parasite.

  11. Traditional Mold Analysis Compared to a DNA-based Method of Mold Analysis with Applications in Asthmatics' Homes

    Science.gov (United States)

    Traditional environmental mold analysis is based-on microscopic observations and counting of mold structures collected from the air on a sticky surface or culturing of molds on growth media for identification and quantification. A DNA-based method of mold analysis called mol...

  12. Identification of a putative nuclear export signal motif in human NANOG homeobox domain

    International Nuclear Information System (INIS)

    Park, Sung-Won; Do, Hyun-Jin; Huh, Sun-Hyung; Sung, Boreum; Uhm, Sang-Jun; Song, Hyuk; Kim, Nam-Hyung; Kim, Jae-Hwan

    2012-01-01

    Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif ( 125 MQELSNILNL 134 ) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant with both NLSs deleted and only the putative NES motif contained (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD only containing either one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. Furthermore, CRM1-specific nuclear export inhibitor LMB blocked the hNANOG potent NES-mediated export, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in CRM1-mediated nuclear export pathway.

  13. Phylogenetic reconstruction in the order Nymphaeales: ITS2 secondary structure analysis and in silico testing of maturase k (matK) as a potential marker for DNA bar coding.

    Science.gov (United States)

    Biswal, Devendra Kumar; Debnath, Manish; Kumar, Shakti; Tandon, Pramod

    2012-01-01

    The Nymphaeales (waterlilly and relatives) lineage has diverged as the second branch of basal angiosperms and comprises of two families: Cabombaceae and Nymphaceae. The classification of Nymphaeales and phylogeny within the flowering plants are quite intriguing as several systems (Thorne system, Dahlgren system, Cronquist system, Takhtajan system and APG III system (Angiosperm Phylogeny Group III system) have attempted to redefine the Nymphaeales taxonomy. There have been also fossil records consisting especially of seeds, pollen, stems, leaves and flowers as early as the lower Cretaceous. Here we present an in silico study of the order Nymphaeales taking maturaseK (matK) and internal transcribed spacer (ITS2) as biomarkers for phylogeny reconstruction (using character-based methods and Bayesian approach) and identification of motifs for DNA barcoding. The Maximum Likelihood (ML) and Bayesian approach yielded congruent fully resolved and well-supported trees using a concatenated (ITS2+ matK) supermatrix aligned dataset. The taxon sampling corroborates the monophyly of Cabombaceae. Nuphar emerges as a monophyletic clade in the family Nymphaeaceae while there are slight discrepancies in the monophyletic nature of the genera Nymphaea owing to Victoria-Euryale and Ondinea grouping in the same node of Nymphaeaceae. ITS2 secondary structures alignment corroborate the primary sequence analysis. Hydatellaceae emerged as a sister clade to Nymphaeaceae and had a basal lineage amongst the water lilly clades. Species from Cycas and Ginkgo were taken as outgroups and were rooted in the overall tree topology from various methods. MatK genes are fast evolving highly variant regions of plant chloroplast DNA that can serve as potential biomarkers for DNA barcoding and also in generating primers for angiosperms with identification of unique motif regions. We have reported unique genus specific motif regions in the Order Nymphaeles from matK dataset which can be further validated for

  14. A comparative analysis of DNA barcode microarray feature size

    Directory of Open Access Journals (Sweden)

    Smith Andrew M

    2009-10-01

    Full Text Available Abstract Background Microarrays are an invaluable tool in many modern genomic studies. It is generally perceived that decreasing the size of microarray features leads to arrays with higher resolution (due to greater feature density, but this increase in resolution can compromise sensitivity. Results We demonstrate that barcode microarrays with smaller features are equally capable of detecting variation in DNA barcode intensity when compared to larger feature sizes within a specific microarray platform. The barcodes used in this study are the well-characterized set derived from the Yeast KnockOut (YKO collection used for screens of pooled yeast (Saccharomyces cerevisiae deletion mutants. We treated these pools with the glycosylation inhibitor tunicamycin as a test compound. Three generations of barcode microarrays at 30, 8 and 5 μm features sizes independently identified the primary target of tunicamycin to be ALG7. Conclusion We show that the data obtained with 5 μm feature size is of comparable quality to the 30 μm size and propose that further shrinking of features could yield barcode microarrays with equal or greater resolving power and, more importantly, higher density.

  15. Implementation of DNA mitochondrial analysis in rhinoclemmys nasuta (Testudines: Geoemydidae)

    International Nuclear Information System (INIS)

    Molina Henao, Yherson Franchesco; Barreto, Guillermo; Giraldo, Alan

    2014-01-01

    Rhinoclemmys nasuta (Testudines: geoemydidae) is considered an almost endemic specie to Colombia and the most primitive species of rhynoclemmys. However, it is classified data deficient by iucn because the available information is not enough to make a direct or indirect assessment of its extinction risk. Here, we describe the implementation of the method to analyze the mitochondrial DNA control sequence (mtdna) of R. nasuta in order to generate tools for future studies in systematics and population conservation. Genomic MTDNA was extracted by salting-out from blood samples from Isla Palma and Playa Chucheros (Bahia Malaga Colombian Pacific Coast) and we used a pair of degenerate primers (reported for chrysemys picta, testudines: emydidae) to perform amplification. Fragments of 800pb were obtained and the sequencing reaction was effective. A homology percentage above of 92 % was established between the obtained sequences and MTDNA sequences from Sacalia quadriocellata (Testudines: geoemydidae), and Cuora aurocapitata (Testudines: geoemydidae) reported in the genbank. This result shows that the described method can be a useful tool for the study of R. nasuta populations in the Colombian pacific region, achieving an effective sequencing of the MTDNA control region of this species.

  16. Strand-Specific Analysis of DNA Synthesis and Proteins Association with DNA Replication Forks in Budding Yeast.

    Science.gov (United States)

    Yu, Chuanhe; Gan, Haiyun; Zhang, Zhiguo

    2018-01-01

    DNA replication initiates at DNA replication origins after unwinding of double-strand DNA(dsDNA) by replicative helicase to generate single-stranded DNA (ssDNA) templates for the continuous synthesis of leading-strand and the discontinuous synthesis of lagging-strand. Therefore, methods capable of detecting strand-specific information will likely yield insight into the association of proteins at leading and lagging strand of DNA replication forks and the regulation of leading and lagging strand synthesis during DNA replication. The enrichment and Sequencing of Protein-Associated Nascent DNA (eSPAN), which measure the relative amounts of proteins at nascent leading and lagging strands of DNA replication forks, is a step-wise procedure involving the chromatin immunoprecipitation (ChIP) of a protein of interest followed by the enrichment of protein-associated nascent DNA through BrdU immunoprecipitation. The isolated ssDNA is then subjected to strand-specific sequencing. This method can detect whether a protein is enriched at leading or lagging strand of DNA replication forks. In addition to eSPAN, two other strand-specific methods, (ChIP-ssSeq), which detects potential protein-ssDNA binding and BrdU-IP-ssSeq, which can measure synthesis of both leading and lagging strand, were developed along the way. These methods can provide strand-specific and complementary information about the association of the target protein with DNA replication forks as well as synthesis of leading and lagging strands genome wide. Below, we describe the detailed eSPAN, ChIP-ssSeq, and BrdU-IP-ssSeq protocols.

  17. Analysis of DNA methylation variation in sibling tobacco ( Nicotiana ...

    African Journals Online (AJOL)

    Amplified fragment length polymorphism (AFLP) and methylation-sensitive amplification polymorphism (MSAP) analysis were used to investigate the genome of two sibling tobacco cultivars, Yunyan85 and Yunyan87, their parent K326 and the other tobacco cultivar NC89. AFLP analysis indicated that, the genome primary ...

  18. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Full Text Available Armadillo (ARM repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  19. Direct AUC optimization of regulatory motifs.

    Science.gov (United States)

    Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

    2017-07-15

    The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may also be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequences specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  20. Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

    Science.gov (United States)

    Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

    2012-01-01

    Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235

  1. Memetic algorithms for de novo motif-finding in biomedical sequences.

    Science.gov (United States)

    Bi, Chengpeng

    2012-09-01

    The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-f